blob: ed070e2ac4de2ab634f2d7d4964baf2ce2e1f45c [file] [view]
## spark configure
| Module Name (Service Name) | Parameter Name | Default Value | Description |Used|
| -------- | -------- | ----- |----- | ----- |
|spark|linkis.spark.yarn.cluster.jars|hdfs:///spark/cluster|spark.yarn.cluster.jars|
|spark|linkis.spark.etl.support.hudi|false|spark.etl.support.hudi|
|spark|linkis.bgservice.store.prefix|hdfs:///tmp/bdp-ide/|bgservice.store.prefix|
|spark|linkis.bgservice.store.suffix| |bgservice.store.suffix|
|spark|wds.linkis.dolphin.decimal.precision|32 |dolphin.decimal.precision|
|spark|wds.linkis.dolphin.decimal.scale| 10 | dolphin.decimal.scale|
|spark|wds.linkis.park.extension.max.pool|2 |extension.max.pool|
|spark|wds.linkis.process.threadpool.max| 100|process.threadpool.max |
|spark|wds.linkis.engine.spark.session.hook| | spark.session.hook|
|spark|wds.linkis.engine.spark.spark-loop.init.time| 120s |spark.spark-loop.init.time|
|spark|wds.linkis.engine.spark.language-repl.init.time| 30s| spark.language-repl.init.time |
|spark|wds.linkis.spark.sparksubmit.path| spark-submit|spark.sparksubmit.path|
|spark|wds.linkis.spark.output.line.limit|10| spark.output.line.limit|
|spark|wds.linkis.spark.useHiveContext|true| spark.useHiveContext |
|spark|wds.linkis.enginemanager.core.jar| | enginemanager.core.jar|
|spark|wds.linkis.ecp.spark.default.jar|linkis-engineconn-core-1.2.0.jar|spark.default.jar|
|spark|wds.linkis.dws.ujes.spark.extension.timeout| 3000L |spark.extension.timeout|
|spark|wds.linkis.engine.spark.fraction.length| 30 |spark.fraction.length|
|spark|wds.linkis.show.df.max.res| |show.df.max.res|
|spark|wds.linkis.mdq.application.name| linkis-ps-datasource |mdq.application.name||
|spark|wds.linkis.dolphin.limit.len| 5000|dolphin.limit.len|
|spark|wds.linkis.spark.engine.is.viewfs.env| true | spark.engine.is.viewfs.env|
|spark|wds.linkis.spark.engineconn.fatal.log|error writing class;OutOfMemoryError|spark.engineconn.fatal.log|
|spark|wds.linkis.spark.engine.scala.replace_package_header.enable| true |spark.engine.scala.replace_package_header.enable|
Use spark yarn cluster mode,need to set label "engingeConnRuntimeMode": "yarnCluster",and need to upload the dependence of the spark to 'linkis.spark.yarn.cluster.jar'(the default value is 'hdfs:///spark/cluster')
spark dependencies include jars and configuration filesFor example: '/appcom/Install/linkis/lib/linkis-engineconn-plugins/spark/dist/3.2.1/lib/*.jar','/appcom/Install/linkis/conf/*'
Precautions for using yarnCluster:
Eureka url if 127.0.0.1 should be changed to the real host, such as "127.0.0.1:20303/eureka/" should be changed to "wds001:20303/eureka/"
The spark-excel package may cause class conflicts,need to download separately,put it in spark lib
wget https://repo1.maven.org/maven2/com/crealytics/spark-excel-2.12.17-3.2.2_2.12/3.2.2_0.18.1/spark-excel-2.12.17-3.2.2_2.12-3.2.2_0.18.1.jar
cp spark-excel-2.12.17-3.2.2_2.12-3.2.2_0.18.1.jar {LINKIS_HOME}/lib/linkis-engineconn-plugins/spark/dist/3.2.1/lib
spark3 is not supported by native rocketmq-spark, and the source code needs to be modified, which can be downloaded directly from the link below
https://github.com/ChengJie1053/spark3-rocketmq-connector-jar