layout: docs24-cn title: “Kylin 配置” categories: install permalink: /cn/docs24/install/configuration.html

Kylin 会自动从环境中检测 Hadoop/Hive/HBase 配置,如 “core-site.xml”, “hbase-site.xml” 和其他。除此之外,Kylin 有自己的配置,在 “conf” 文件夹下。

{% highlight Groff markup %} -bash-4.1# ls -l $KYLIN_HOME/conf

kylin_hive_conf.xml kylin_job_conf_inmem.xml kylin_job_conf.xml kylin-kafka-consumer.xml kylin.properties kylin-server-log4j.properties kylin-tools-log4j.properties setenv.sh {% endhighlight %}

kylin_hive_conf.xml

Kylin 从 Hive 中取数据时应用的 Hive 配置。

kylin_job_conf.xml and kylin_job_conf_inmem.xml

Kylin 运行 MapReduce jobs 时的 Hadoop MR 配置。在 Kylin 的 “In-mem cubing” job 的时候,“kylin_job_conf_inmem.xml” 需要更多的 memory 给 mapper。

kylin-kafka-consumer.xml

Kylin 从 Kafka brokers 中取数据时应用的 Kafka 配置。

kylin-server-log4j.properties

Kylin 服务器的日志配置。

kylin-tools-log4j.properties

Kylin 命令行的日志配置。

setenv.sh

设置环境变量的 shell 脚本。它将在 “kylin.sh” 和 “bin” 文件夹中的其它脚本中被调用。通常,您可以在这里调整 Kylin JVM 栈的大小,且可以设置 “KAFKA_HOME” 和其他环境变量。

kylin.properties

Kylin 的主要配置文件。

KeyDefault valueDescriptionOverwritten at Cube
kylin.envDevWhether this env is a Dev, QA, or Prod environmentNo
kylin.env.hdfs-working-dir/kylinWorking directory on HDFSNo
kylin.env.zookeeper-base-path/kylinPath on ZKNo
kylin.env.zookeeper-connect-stringZK connection string; If blank, use HBase's ZKNo
kylin.env.zookeeper-acl-enabledfalseNo
kylin.env.zookeeper.zk-authdigest:ADMIN:KYLINNo
kylin.env.zookeeper.zk-aclworld:anyone:rwcdaNo
kylin.metadata.urlkylin_metadata@hbaseKylin metadata storageNo
kylin.metadata.sync-retries3No
kylin.metadata.sync-error-handlerNo
kylin.metadata.check-copy-on-writefalseNo
kylin.metadata.hbase-client-scanner-timeout-period10000No
kylin.metadata.hbase-rpc-timeout5000No
kylin.metadata.hbase-client-retries-number1No
kylin.dictionary.use-forest-trietrueNo
kylin.dictionary.forest-trie-max-mb500No
kylin.dictionary.max-cache-entry3000No
kylin.dictionary.growing-enabledfalseNo
kylin.dictionary.append-entry-size10000000No
kylin.dictionary.append-max-versions3No
kylin.dictionary.append-version-ttl259200000No
kylin.snapshot.max-cache-entry500No
kylin.snapshot.max-mb300No
kylin.snapshot.ext.shard-mb500No
kylin.snapshot.ext.local.cache.pathlookup_cacheNo
kylin.snapshot.ext.local.cache.max-size-gb200No
kylin.cube.size-estimate-ratio0.25Yes
kylin.cube.size-estimate-memhungry-ratio0.05DeprecatedYes
kylin.cube.size-estimate-countdistinct-ratio0.05Yes
kylin.cube.algorithmautoCubing algorithm for MR engine, other options: layer, inmemYes
kylin.cube.algorithm.layer-or-inmem-threshold7Yes
kylin.cube.algorithm.inmem-split-limit500Yes
kylin.cube.algorithm.inmem-concurrent-threads1Yes
kylin.cube.ignore-signature-inconsistencyfalse
kylin.cube.aggrgroup.max-combination4096Max cuboid numbers in a CubeYes
kylin.cube.aggrgroup.is-mandatory-only-validfalseWhether allow a Cube only has the base cuboid.Yes
kylin.cube.rowkey.max-size63Max columns in Rowkey, and it can not be more than 63No
kylin.metadata.dimension-encoding-max-length256Max length for one dimension's encodingYes
kylin.cube.max-building-segments10Max building segments in one CubeYes
kylin.cube.allow-appear-in-multiple-projectsfalseWhether allow a Cueb appeared in multiple projectsNo
kylin.cube.gtscanrequest-serialization-level1
kylin.cube.is-automerge-enabledtrueWhether enable auto merge.Yes
kylin.job.log-dir/tmp/kylin/logs
kylin.job.allow-empty-segmenttrueWhether tolerant data source is emtpy.Yes
kylin.job.max-concurrent-jobs10Max concurrent running jobsNo
kylin.job.sampling-percentage100Data sampling percentage, to calculate Cube statistics; Default be all.Yes
kylin.job.notification-enabledfalseWhether send email notification on job error/succeed.No
kylin.job.notification-mail-enable-starttlsfalseNo
kylin.job.notification-mail-port25No
kylin.job.notification-mail-hostNo
kylin.job.notification-mail-usernameNo
kylin.job.notification-mail-passwordNo
kylin.job.notification-mail-senderNo
kylin.job.notification-admin-emailsNo
kylin.job.retry0No
kylin.job.scheduler.priority-consideredfalseNo
kylin.job.scheduler.priority-bar-fetch-from-queue20No
kylin.job.scheduler.poll-interval-second30No
kylin.job.error-record-threshold0No
kylin.source.hive.keep-flat-tablefalseWhether keep the intermediate Hive table after job finished.No
kylin.source.hive.database-for-flat-tabledefaultHive database to create the intermediate table.No
kylin.source.hive.flat-table-storage-formatSEQUENCEFILENo
kylin.source.hive.flat-table-field-delimiter\u001FNo
kylin.source.hive.redistribute-flat-tabletrueWhether or not to redistribute the flat table.Yes
kylin.source.hive.clientcliNo
kylin.source.hive.beeline-shellbeelineNo
kylin.source.hive.beeline-paramsNo
kylin.source.hive.enable-sparksql-for-table-opsfalseNo
kylin.source.hive.sparksql-beeline-shellNo
kylin.source.hive.sparksql-beeline-paramsNo
kylin.source.hive.table-dir-create-firstfalseNo
kylin.source.hive.flat-table-cluster-by-dict-column
kylin.source.hive.default-varchar-precision256No
kylin.source.hive.default-char-precision255No
kylin.source.hive.default-decimal-precision19No
kylin.source.hive.default-decimal-scale4No
kylin.source.jdbc.connection-url
kylin.source.jdbc.driver
kylin.source.jdbc.dialectdefault
kylin.source.jdbc.user
kylin.source.jdbc.pass
kylin.source.jdbc.sqoop-home
kylin.source.jdbc.sqoop-mapper-num4
kylin.source.jdbc.field-delimiter|
kylin.storage.default2No
kylin.storage.hbase.table-name-prefixKYLIN_No
kylin.storage.hbase.namespacedefaultNo
kylin.storage.hbase.cluster-fs
kylin.storage.hbase.cluster-hdfs-config-file
kylin.storage.hbase.coprocessor-local-jar
kylin.storage.hbase.min-region-count1
kylin.storage.hbase.max-region-count500
kylin.storage.hbase.hfile-size-gb2.0
kylin.storage.hbase.run-local-coprocessorfalse
kylin.storage.hbase.coprocessor-mem-gb3.0
kylin.storage.partition.aggr-spill-enabledtrue
kylin.storage.partition.max-scan-bytes3221225472
kylin.storage.hbase.coprocessor-timeout-seconds0
kylin.storage.hbase.max-fuzzykey-scan200
kylin.storage.hbase.max-fuzzykey-scan-split1
kylin.storage.hbase.max-visit-scanrange1000000
kylin.storage.hbase.scan-cache-rows1024
kylin.storage.hbase.region-cut-gb5.0
kylin.storage.hbase.max-scan-result-bytes5242880
kylin.storage.hbase.compression-codecnone
kylin.storage.hbase.rowkey-encodingFAST_DIFF
kylin.storage.hbase.block-size-bytes1048576
kylin.storage.hbase.small-family-block-size-bytes65536
kylin.storage.hbase.owner-tag
kylin.storage.hbase.endpoint-compress-resulttrue
kylin.storage.hbase.max-hconnection-threads2048
kylin.storage.hbase.core-hconnection-threads2048
kylin.storage.hbase.hconnection-threads-alive-seconds60
kylin.engine.mr.lib-dir
kylin.engine.mr.reduce-input-mb500
kylin.engine.mr.reduce-count-ratio1.0
kylin.engine.mr.min-reducer-number1
kylin.engine.mr.max-reducer-number500
kylin.engine.mr.mapper-input-rows1000000
kylin.engine.mr.max-cuboid-stats-calculator-number1
kylin.engine.mr.uhc-reducer-count1
kylin.engine.mr.build-uhc-dict-in-additional-stepfalse
kylin.engine.mr.build-dict-in-reducertrue
kylin.engine.mr.yarn-check-interval-seconds10
kylin.env.hadoop-conf-dirHadoop conf directory; If not specified, parse from environment.No
kylin.engine.spark.rdd-partition-cut-mb10.0Spark Cubing RDD partition split size.Yes
kylin.engine.spark.min-partition1Spark Cubing RDD min partition numberYes
kylin.engine.spark.max-partition5000RDD max partition numberYes
kylin.engine.spark.storage-levelMEMORY_AND_DISK_SERRDD persistent level.Yes
kylin.query.skip-empty-segmentstrueWhether directly skip empty segment (metadata shows size be 0) when run SQL query.Yes
kylin.query.force-limit-1
kylin.query.max-scan-bytes0
kylin.query.max-return-rows5000000
kylin.query.large-query-threshold1000000
kylin.query.cache-threshold-duration2000
kylin.query.cache-threshold-scan-count10240
kylin.query.cache-threshold-scan-bytes1048576
kylin.query.security-enabledtrue
kylin.query.cache-enabledtrue
kylin.query.timeout-seconds0
kylin.query.pushdown.runner-class-name
kylin.query.pushdown.update-enabledfalse
kylin.query.pushdown.cache-enabledfalse
kylin.query.pushdown.jdbc.url
kylin.query.pushdown.jdbc.driver
kylin.query.pushdown.jdbc.username
kylin.query.pushdown.jdbc.password
kylin.query.pushdown.jdbc.pool-max-total8
kylin.query.pushdown.jdbc.pool-max-idle8
kylin.query.pushdown.jdbc.pool-min-idle0
kylin.query.security.table-acl-enabledtrueNo
kylin.server.modeallKylin node mode: all|job|query.No
kylin.server.cluster-serverslocalhost:7070No
kylin.server.cluster-nameNo
kylin.server.query-metrics-enabledfalseNo
kylin.server.query-metrics2-enabledfalseNo
kylin.server.auth-user-cache.expire-seconds300No
kylin.server.auth-user-cache.max-entries100No
kylin.server.external-acl-providerNo
kylin.security.ldap.user-search-baseNo
kylin.security.ldap.user-group-search-baseNo
kylin.security.acl.admin-roleNo
kylin.web.timezonePSTNo
kylin.web.cross-domain-enabledtrueNo
kylin.web.export-allow-admintrueNo
kylin.web.export-allow-othertrueNo
kylin.web.dashboard-enabledfalseNo