blob: c64f5cf07b3ab371e6035ee10b4e83f3330c20f6 [file] [log] [blame]
Release Notes - Hive - Version 3.1.0
** Sub-task
* [HIVE-12192] - Hive should carry out timestamp computations in UTC
* [HIVE-17227] - Incremental replication load should create tasks in execution phase rather than semantic phase
* [HIVE-17657] - export/import for MM tables is broken
* [HIVE-18193] - Migrate existing ACID tables to use write id per table rather than global transaction id
* [HIVE-18748] - Rename table impacts the ACID behavior as table names are not updated in meta-tables.
* [HIVE-18840] - CachedStore: Prioritize loading of recently accessed tables during prewarm
* [HIVE-18946] - Fix columnstats merge NPE
* [HIVE-18988] - Support bootstrap replication of ACID tables
* [HIVE-19009] - Retain and use runtime statistics during hs2 lifetime
* [HIVE-19096] - query result cache interferes with explain analyze
* [HIVE-19126] - CachedStore: Use memory estimation to limit cache size during prewarm
* [HIVE-19128] - Update golden files for spark perf tests
* [HIVE-19135] - Need tool to allow admins to create catalogs and move existing dbs to catalog during upgrade
* [HIVE-19141] - TestNegativeCliDriver insert_into_notnull_constraint, insert_into_acid_notnull failing
* [HIVE-19159] - TestMTQueries.testMTQueries1 failure
* [HIVE-19164] - TestMetastoreVersion failures
* [HIVE-19171] - Persist runtime statistics in metastore
* [HIVE-19193] - TestActivePassiveHA fails
* [HIVE-19194] - TestDruidStorageHandler fails
* [HIVE-19195] - Fix flaky tests and cleanup testconfiguration to run llap specific tests in llap only.
* [HIVE-19196] - TestTriggersMoveWorkloadManager is flaky
* [HIVE-19206] - Automatic memory management for open streaming writers
* [HIVE-19209] - Streaming ingest record writers should accept input stream
* [HIVE-19210] - Create separate module for streaming ingest
* [HIVE-19211] - New streaming ingest API and support for dynamic partitioning
* [HIVE-19214] - High throughput ingest ORC format
* [HIVE-19222] - TestNegativeCliDriver tests are failing due to "java.lang.OutOfMemoryError: GC overhead limit exceeded"
* [HIVE-19243] - Upgrade hadoop.version to 3.1.0
* [HIVE-19252] - TestJdbcWithMiniKdcCookie.testCookieNegative is failing consistently
* [HIVE-19274] - Add an OpTreeSignature persistence checker hook
* [HIVE-19332] - Disable compute.query.using.stats for external table
* [HIVE-19336] - Disable SMB/Bucketmap join for external tables
* [HIVE-19340] - Disable timeout of transactions opened by replication task at target cluster
* [HIVE-19347] - TestTriggersWorkloadManager tests are failing consistently
* [HIVE-19348] - org.apache.hadoop.hive.ql.plan.mapping.TestOperatorCmp are failing
* [HIVE-19366] - Vectorization causing TestStreaming.testStreamBucketingMatchesRegularBucketing to fail
* [HIVE-19374] - Parse and process ALTER TABLE SET OWNER command syntax
* [HIVE-19409] - Disable incremental rewriting with outdated materialized views
* [HIVE-19472] - HiveStreamingConnection swallows exception on partition creation
* [HIVE-19494] - Accept shade prefix during reflective instantiation of output format
* [HIVE-19495] - Arrow SerDe itest failure
* [HIVE-19499] - Bootstrap REPL LOAD shall add tasks to create checkpoints for db/tables/partitions.
* [HIVE-19500] - Prevent multiple selectivity estimations for the same variable in conjuctions
* [HIVE-19562] - Flaky test: TestMiniSparkOnYarn FileNotFoundException in spark-submit
* [HIVE-19598] - Add Acid V1 to V2 upgrade module
* [HIVE-19637] - Add slow test report script to testutils
* [HIVE-19688] - Make catalogs updatable
* [HIVE-19727] - Fix Signature matching of table aliases
* [HIVE-19739] - Bootstrap REPL LOAD to use checkpoints to validate and skip the loaded data/metadata.
* [HIVE-19758] - Set hadoop.version=3.1.0 in standalone-metastore
* [HIVE-19768] - Utility to convert tables to conform to Hive strict managed tables mode
* [HIVE-19799] - remove jasper dependency
* [HIVE-19815] - Repl dump should not propagate the checkpoint and repl source properties
* [HIVE-19851] - upgrade jQuery version
* [HIVE-19852] - update jackson to latest
* [HIVE-19868] - Add support for float aggregator
* [HIVE-19892] - Disable query results cache for for HiveServer2 doAs=true
* [HIVE-19923] - Follow up of HIVE-19615, use UnaryFunction instead of prefix
** Bug
* [HIVE-15190] - Field names are not preserved in ORC files written with ACID
* [HIVE-18434] - Type is not determined correctly for comparison between decimal column and string constant
* [HIVE-18816] - CREATE TABLE (ACID) doesn't work with TIMESTAMPLOCALTZ column type
* [HIVE-19016] - Vectorization and Parquet: Disable vectorization for nested complex types
* [HIVE-19054] - Function replication shall use "hive.repl.replica.functions.root.dir" as root
* [HIVE-19108] - Vectorization and Parquet: Turning on vectorization in parquet_ppd_decimal.q causes Wrong Query Results
* [HIVE-19109] - Vectorization: Enabling vectorization causes TestCliDriver delete_orig_table.q to produce Wrong Results
* [HIVE-19110] - Vectorization: Enabling vectorization causes TestContribCliDriver udf_example_arraymapstruct.q to produce Wrong Results
* [HIVE-19118] - Vectorization: Turning on vectorization in escape_crlf produces wrong results
* [HIVE-19120] - catalog not properly set for some tables in SQL upgrade scripts
* [HIVE-19131] - DecimalColumnStatsMergerTest comparison review
* [HIVE-19155] - Day time saving cause Druid inserts to fail with org.apache.hive.druid.io.druid.java.util.common.UOE: Cannot add overlapping segments
* [HIVE-19157] - Assert that Insert into Druid Table fails if the publishing of metadata by HS2 fails
* [HIVE-19167] - Map data type doesn't keep the order of the key/values pairs as read (Part 2, The Sequel or SQL)
* [HIVE-19168] - Ranger changes for llap commands
* [HIVE-19186] - Multi Table INSERT statements query has a flaw for partitioned table when INSERT INTO and INSERT OVERWRITE are used
* [HIVE-19200] - Vectorization: Disable vectorization for LLAP I/O when a non-VECTORIZED_INPUT_FILE_FORMAT mode is needed (i.e. rows) and data type conversion is needed
* [HIVE-19219] - Incremental REPL DUMP should throw error if requested events are cleaned-up.
* [HIVE-19230] - Schema column width inconsistency in Oracle
* [HIVE-19231] - Beeline generates garbled output when using UnsupportedTerminal
* [HIVE-19237] - Only use an operatorId once in a plan
* [HIVE-19247] - StatsOptimizer: Missing stats fast-path for Date
* [HIVE-19248] - REPL LOAD couldn't copy file from source CM path and also doesn't throw error if file copy fails.
* [HIVE-19258] - add originals support to MM tables (and make the conversion a metadata only operation)
* [HIVE-19264] - Vectorization: Reenable vectorization in vector_adaptor_usage_mode.q
* [HIVE-19269] - Vectorization: Turn On by Default
* [HIVE-19275] - Vectorization: Defer Wrong Results / Execution Failures when Vectorization turned on
* [HIVE-19277] - Active/Passive HA web endpoints does not allow cross origin requests
* [HIVE-19312] - MM tables don't work with BucketizedHIF
* [HIVE-19317] - Handle schema evolution from int like types to decimal
* [HIVE-19327] - qroupby_rollup_empty.q fails for insert-only transactional tables
* [HIVE-19331] - Repl load config in "with" clause not pass to Context.getStagingDir
* [HIVE-19350] - Vectorization: Turn off vectorization for explainuser_1.q / spark_explainuser_1
* [HIVE-19352] - Vectorization: Disable vectorization for org.apache.hive.jdbc.TestJdbcDriver2.testResultSetMetaData
* [HIVE-19357] - Vectorization: assert_true HiveException erroneously gets suppressed to NULL
* [HIVE-19358] - CBO decorrelation logic should generate Hive operators
* [HIVE-19365] - Index on COMPLETED_TXN_COMPONENTS in Metastore RDBMS has different names in different scripts
* [HIVE-19370] - Issue: ADD Months function on timestamp datatype fields in hive
* [HIVE-19381] - Function replication in cloud fail when download resource from AWS
* [HIVE-19382] - Acquire locks before generating valid transaction list for some operations
* [HIVE-19384] - Vectorization: IfExprTimestamp* do not handle NULLs correctly
* [HIVE-19385] - Optional hive env variable to redirect bin/hive to use Beeline
* [HIVE-19389] - Schematool: For Hive's Information Schema, use embedded HS2 as default
* [HIVE-19410] - don't create serde reader in LLAP if there's no cache
* [HIVE-19418] - add background stats updater similar to compactor
* [HIVE-19423] - REPL LOAD creates staging directory in source dump directory instead of table data location
* [HIVE-19433] - HiveJoinPushTransitivePredicatesRule hangs
* [HIVE-19435] - Incremental replication cause data loss if a table is dropped followed by create and insert-into with different partition type.
* [HIVE-19454] - Test failure : org.apache.hadoop.hive.ql.TestTxnCommands2.testNonAcidToAcidConversion1 fails with java.lang.AssertionError
* [HIVE-19460] - Improve stats estimations for NOT IN operator
* [HIVE-19463] - TezTask - getting groups may fail (PartialGroupNameException in some tests)
* [HIVE-19467] - Make storage format configurable for temp tables created using LLAP external client
* [HIVE-19474] - Decimal type should be casted as part of the CTAS or INSERT Clause.
* [HIVE-19479] - encoded stream seek is incorrect for 0-length RGs in LLAP IO
* [HIVE-19481] - Tablesample uses incorrect logic to pick files corresponding to buckets.
* [HIVE-19485] - dump directory for non native tables should not be created
* [HIVE-19493] - VectorUDFDateDiffColCol copySelected does not handle nulls correctly
* [HIVE-19496] - Check untar folder
* [HIVE-19498] - Vectorization: CAST expressions produce wrong results
* [HIVE-19504] - Change default value for hive.auto.convert.join.shuffle.max.size property
* [HIVE-19516] - TestNegative merge_negative_5 and mm_concatenate are causing timeouts
* [HIVE-19529] - Vectorization: Date/Timestamp NULL issues
* [HIVE-19557] - stats: filters for dates are not taking advantage of min/max values
* [HIVE-19565] - Vectorization: Fix NULL / Wrong Results issues in STRING Functions
* [HIVE-19567] - Fix flakiness in TestTriggers
* [HIVE-19569] - alter table db1.t1 rename db2.t2 generates MetaStoreEventListener.onDropTable()
* [HIVE-19575] - TestAutoPurgeTables seems flaky
* [HIVE-19577] - CREATE TEMPORARY TABLE LIKE and INSERT generate output format mismatch errors
* [HIVE-19578] - HLL merges tempList on every add
* [HIVE-19588] - Several invocation of file listing when creating VectorizedOrcAcidRowBatchReader
* [HIVE-19589] - Disable TestAutoPurge tests and annotate TestTriggersWorkloadManager with retry
* [HIVE-19590] - mask stats in llap_smb
* [HIVE-19592] - TestWorkloadManager - add retry for now
* [HIVE-19594] - Add custom tmp folders to tests to avoid collisions
* [HIVE-19595] - Regenerate webui port in MiniHS2
* [HIVE-19604] - Incorrect Handling of Boolean in DruidSerde
* [HIVE-19605] - TAB_COL_STATS table has no index on db/table name
* [HIVE-19608] - disable flaky tests 2
* [HIVE-19613] - GenericUDTFGetSplits should handle fetch task with temp table rewrite
* [HIVE-19614] - GenericUDTFGetSplits does not honor ORDER BY
* [HIVE-19615] - Proper handling of is null and not is null predicate when pushed to Druid
* [HIVE-19619] - Allow comparisons between doubles and bigints
* [HIVE-19629] - Enable Decimal64 reader after orc version upgrade
* [HIVE-19631] - reduce epic locking in AbstractService
* [HIVE-19632] - Remove webapps directory from standalone jar
* [HIVE-19639] - a transactional Hive table cannot be imported as an external table
* [HIVE-19643] - MM table conversion doesn't need full ACID structure checks
* [HIVE-19644] - change WM syntax to avoid conflicts with identifiers starting with a number
* [HIVE-19646] - Filesystem closed error in HiveProtoLoggingHook
* [HIVE-19660] - update branch-3 to be version 3.1 and fix storage-api mismatch
* [HIVE-19675] - Cast to timestamps on Druid time column leads to an exception
* [HIVE-19677] - Disable sample6.q
* [HIVE-19680] - Push down limit is not applied for Druid storage handler.
* [HIVE-19684] - Hive stats optimizer wrongly uses stats against non native tables
* [HIVE-19687] - Export table on acid partitioned table is failing
* [HIVE-19690] - multi-insert query with multiple GBY, and distinct in only some branches can produce incorrect results
* [HIVE-19691] - Start SessionState in materialized views registry
* [HIVE-19695] - Year Month Day extraction functions need to add an implicit cast for column that are String types
* [HIVE-19697] - TestReOptimization#testStatCachingMetaStore is flaky
* [HIVE-19698] - TestAMReporter#testMultipleAM is flaky
* [HIVE-19700] - Workaround for JLine issue with UnsupportedTerminal
* [HIVE-19713] - itests/hive-jmh should not reference a concreate storage-api version
* [HIVE-19723] - Arrow serde: "Unsupported data type: Timestamp(NANOSECOND, null)"
* [HIVE-19726] - ORC date PPD is broken
* [HIVE-19728] - beeline with USE_BEELINE_FOR_HIVE_CLI fails when trying to set hive.aux.jars.path
* [HIVE-19734] - Beeline: When beeline-site.xml is present, beeline does not honor -n (username) and -p (password) arguments
* [HIVE-19744] - In Beeline if -u is specified the default connection should not be tried at all
* [HIVE-19750] - Initialize NEXT_WRITE_ID. NWI_NEXT on converting an existing table to full acid
* [HIVE-19753] - Strict managed tables mode in Hive
* [HIVE-19754] - vector_decimal_2 failing on branch-3
* [HIVE-19755] - insertsel_fail.q.out needs to be updated on branch-3
* [HIVE-19762] - Druid Queries containing Joins gives wrong results.
* [HIVE-19771] - allowNullColumnForMissingStats should not be false when column stats are estimated
* [HIVE-19772] - Streaming ingest V2 API can generate invalid orc file if interrupted
* [HIVE-19773] - CBO exception while running queries with tables that are not present in materialized views
* [HIVE-19777] - NPE in TezSessionState
* [HIVE-19789] - reenable orc_llap test
* [HIVE-19793] - disable LLAP IO batch-to-row wrapper for ACID deletes/updates
* [HIVE-19794] - Disable removing order by from subquery in GenericUDTFGetSplits
* [HIVE-19796] - Push Down TRUNC Fn to Druid Storage Handler
* [HIVE-19801] - JDBC: Add some missing classes to jdbc standalone jar and remove hbase classes
* [HIVE-19808] - GenericUDTFGetSplits should support ACID reads in the temp. table read path
* [HIVE-19810] - StorageHandler fail to ship jars in Tez intermittently
* [HIVE-19813] - SessionState.start don't have to be synchronized
* [HIVE-19817] - Hive streaming API + dynamic partitioning + json/regex writer does not work
* [HIVE-19826] - OrcRawRecordMerger doesn't work for more than one file in non vectorized case
* [HIVE-19827] - hiveserver2 startup should provide a way to override TEZ_CONF_DIR
* [HIVE-19833] - reduce LLAP IO min allocation to match ORC variable CB size
* [HIVE-19837] - Setting to have different default location for external tables
* [HIVE-19838] - simplify & fix ColumnizedDeleteEventRegistry load loop
* [HIVE-19853] - Arrow serializer needs to create a TimeStampMicroTZVector instead of TimeStampMicroVector
* [HIVE-19857] - Set 3.1.0 for sys db version
* [HIVE-19859] - Inspect lock components for DBHiveLock while verifying whether transaction list is valid
* [HIVE-19861] - Fix temp table path generation for acid table export
* [HIVE-19862] - Postgres init script has a glitch around UNIQUE_DATABASE
* [HIVE-19864] - Address TestTriggersWorkloadManager flakiness
* [HIVE-19866] - improve LLAP cache purge
* [HIVE-19869] - Remove double formatting bug followup of HIVE-19382
* [HIVE-19872] - hive-schema-3.1.0.hive.sql is missing on master and branch-3
* [HIVE-19873] - Cleanup operation log on query cancellation after some delay
* [HIVE-19875] - increase LLAP IO queue size for perf
* [HIVE-19876] - Multiple fixes for Driver.isValidTxnListState
* [HIVE-19877] - Remove setting hive.execution.engine as mr in HiveStreamingConnection
* [HIVE-19879] - Remove unused calcite sql operator.
* [HIVE-19884] - Invalidation cache may throw NPE when there is no data in table used by materialized view
* [HIVE-19889] - Wrong results due to PPD of non deterministic functions with CBO
* [HIVE-19890] - ACID: Inherit bucket-id from original ROW_ID for delete deltas
* [HIVE-19898] - Disable TransactionalValidationListener when the table is not in the Hive catalog
* [HIVE-19903] - Disable temporary insert-only transactional table
* [HIVE-19904] - Load data rewrite into Tez job fails for ACID
* [HIVE-19908] - Block Insert Overwrite with Union All on full CRUD ACID tables using HIVE_UNION_SUBDIR_
* [HIVE-19912] - Schema evolution checks prints a log line in INFO mode for each vectorized rowbatch, impacts performance
* [HIVE-19917] - Export of full CRUD transactional table fails if table is not in default database
* [HIVE-19920] - Schematool fails in embedded mode when auth is on
* [HIVE-19921] - Fix perf duration and queue name in HiveProtoLoggingHook
* [HIVE-19938] - Upgrade scripts for information schema
* [HIVE-19941] - Row based Filters added via Hive Ranger policies are not pushed to druid
* [HIVE-19946] - VectorizedRowBatchCtx.recordIdColumnVector cannot be shared between different JVMs
* [HIVE-19951] - Vectorization: Need to disable encoded LLAP I/O for ORC when there is data type conversion (Schema Evolution)
* [HIVE-19956] - Include yarn registry classes to jdbc standalone jar
* [HIVE-19964] - Apply resource plan fails if trigger expression has quotes
* [HIVE-19965] - Make HiveEndPoint use IMetaStoreClient.add_partition
* [HIVE-19972] - Followup to HIVE-19928 : Fix the check for managed table
* [HIVE-19973] - Enable materialized view rewriting by default
* [HIVE-19980] - GenericUDTFGetSplits fails when order by query returns 0 rows
* [HIVE-19997] - Batches for TestMiniDruidCliDriver
* [HIVE-20002] - Shipping jdbd-storage-handler dependency jars in LLAP
* [HIVE-20010] - Fix create view over literals
* [HIVE-20076] - ACID: Fix Synthetic ROW__ID generation for vectorized orc readers
** New Feature
* [HIVE-18739] - Add support for Import/Export from Acid table
* [HIVE-19307] - Support ArrowOutputStream in LlapOutputFormatService
** Improvement
* [HIVE-17824] - msck repair table should drop the missing partitions from metastore
* [HIVE-18079] - Statistics: Allow HyperLogLog to be merged to the lowest-common-denominator bit-size
* [HIVE-18394] - Materialized view: "Create Materialized View" should default to rewritable ones
* [HIVE-18410] - [Performance][Avro] Reading flat Avro tables is very expensive in Hive
* [HIVE-18743] - CREATE TABLE on S3 data can be extremely slow. DO_NOT_UPDATE_STATS workaround is buggy.
* [HIVE-18792] - Allow standard compliant syntax for insert on partitioned tables
* [HIVE-18866] - Semijoin and analyze: Implement a Long -> Hash64 vector fast-path
* [HIVE-19027] - Make materializations invalidation cache work with multiple active remote metastores
* [HIVE-19161] - Add authorizations to information schema
* [HIVE-19228] - Remove commons-httpclient 3.x usage
* [HIVE-19259] - Create view on tables having union all fail with "Table not found"
* [HIVE-19344] - Change default value of msck.repair.batch.size
* [HIVE-19390] - Useless error messages logged for dummy table stats
* [HIVE-19415] - Support CORS for all HS2 web endpoints
* [HIVE-19421] - Upgrade version of Jetty to 9.3.20.v20170531
* [HIVE-19440] - Make StorageBasedAuthorizer work with information schema
* [HIVE-19462] - Fix mapping for char_length function to enable pushdown to Druid.
* [HIVE-19464] - Upgrade Parquet to 1.10.0
* [HIVE-19465] - Upgrade ORC to 1.5.0
* [HIVE-19466] - Update constraint violation error message
* [HIVE-19490] - Locking on Insert into for non native and managed tables.
* [HIVE-19534] - Allow implementations to access member variables of AbstractRecordWriter
* [HIVE-19560] - Retry test runner and retry rule for flaky tests
* [HIVE-19572] - Add option to mask stats and data size in q files
* [HIVE-19586] - Optimize Count(distinct X) pushdown based on the storage capabilities
* [HIVE-19669] - Upgrade ORC to 1.5.1
* [HIVE-19682] - Provide option for GenericUDTFGetSplits to return only schema metadata
* [HIVE-19775] - Schematool should use HS2 embedded mode in privileged auth mode
* [HIVE-19776] - HiveServer2.startHiveServer2 retries of start has concurrency issues
* [HIVE-19824] - Improve online datasize estimations for MapJoins
* [HIVE-19885] - Druid Kafka Ingestion - Allow user to set kafka consumer properties via table properties
* [HIVE-20102] - Add a couple of additional tests for query parsing
* [HIVE-20135] - Fix incompatible change in TimestampColumnVector to default to UTC
** Test
* [HIVE-19271] - TestMiniLlapLocalCliDriver default_constraint and check_constraint failing
* [HIVE-19515] - TestRpc.testServerPort is consistently failing
* [HIVE-19555] - Enable TestMiniLlapLocalCliDriver#tez_dynpart_hashjoin_1.q and TestMiniLlapLocalCliDriver#tez_vector_dynpart_hashjoin_1.q
* [HIVE-19573] - Fix flaky TestMiniLlapLocalCliDriver#explainuser_4.q
* [HIVE-19612] - Add option to mask lineage in q files
* [HIVE-19617] - Rename test tables to avoid collisions during execution in batches
* [HIVE-19620] - Change tmp directory used by PigServer in HCat tests
* [HIVE-19626] - Change tmp staging mapred directory for CliDriver
* [HIVE-19654] - Change tmp staging mapred directory for TestBlobstoreCliDriver
* [HIVE-19655] - Mask stats for TestMiniLlapLocalCliDriver#smb_mapjoin_15
* [HIVE-19699] - Re-enable TestReOptimization
* [HIVE-19706] - Disable TestJdbcWithMiniHS2#testHttpRetryOnServerIdleTimeout
* [HIVE-19731] - Change staging tmp directory used by TestHCatLoaderComplexSchema
* [HIVE-20123] - Fix masking tests after HIVE-19617
** Task
* [HIVE-18875] - Enable SMB Join by default in Tez
* [HIVE-19134] - Update copyright NOTICE and fix rat check failures
* [HIVE-19140] - Update metastore upgrade scripts to prepare for 3.1.0 development
* [HIVE-19257] - HIVE-19157 commit references wrong jira
* [HIVE-19306] - Arrow batch serializer
* [HIVE-19308] - Provide an Arrow stream reader for external LLAP clients
* [HIVE-19323] - Create metastore SQL install and upgrade scripts for 3.1
* [HIVE-19488] - Enable CM root based on db parameter, identifying a db as source of replication.
* [HIVE-19509] - Disable tests that are failing continuously
* [HIVE-19512] - If parallel execution is enabled, metastore is throwing out of sequence error.
* [HIVE-19708] - Repl copy retrying with cm path even if the failure is due to network issue
* [HIVE-19725] - Add ability to dump non-native tables in replication metadata dump
* [HIVE-19880] - Repl Load to return recoverable vs non-recoverable error codes
* [HIVE-19881] - Allow metadata-only dump for database which are not source of replication
* [HIVE-19928] - Load Data for managed tables should set the owner of loaded files to a configurable user
* [HIVE-19978] - Backport HIVE-18037 to branch-3
Release Notes - Hive - Version 3.0.0
** Sub-task
* [HIVE-11133] - Support hive.explain.user for Spark
* [HIVE-11418] - Dropping a database in an encryption zone with CASCADE and trash enabled fails
* [HIVE-13567] - Enable auto-gather column stats by default
* [HIVE-13583] - E061-14: Search Conditions
* [HIVE-13673] - LLAP: handle case where no service instance is found on the host specified in the input split
* [HIVE-14412] - Add timestamp with time zone
* [HIVE-14487] - Add REBUILD statement for materialized views
* [HIVE-14495] - Add SHOW MATERIALIZED VIEWS statement
* [HIVE-14498] - Freshness period for query rewriting using materialized views
* [HIVE-14518] - Support 'having' translation for Druid GroupBy queries
* [HIVE-14747] - Remove JAVA paths from profiles by sending them from ptest-client
* [HIVE-14947] - Add support for Acid 2 in Merge
* [HIVE-15016] - Run tests with Hadoop 3.0.0-beta1
* [HIVE-15018] - ALTER rewriting flag in materialized view
* [HIVE-15051] - Test framework integration with findbugs, rat checks etc.
* [HIVE-15173] - Allow dec as an alias for decimal
* [HIVE-15212] - merge branch into master
* [HIVE-15326] - Hive shims report Unrecognized Hadoop major version number: 3.0.0-alpha2-SNAPSHOT
* [HIVE-15436] - Enhancing metastore APIs to retrieve only materialized views
* [HIVE-15490] - REPL LOAD & DUMP support for INSERT events with change management
* [HIVE-15619] - Column pruner should handle DruidQuery
* [HIVE-15642] - Replicate Insert Overwrites, Dynamic Partition Inserts and Loads
* [HIVE-15673] - Allow multiple queries with disjunction
* [HIVE-15705] - Event replication for constraints
* [HIVE-15725] - Make it possible to run checkstyle for a specific module
* [HIVE-15758] - Allow correlated scalar subqueries with aggregates which has non-equi join predicates
* [HIVE-15834] - Add unit tests for org.json usage on master
* [HIVE-15899] - Make CTAS with acid target table and insert into acid_tbl select ... union all ... work
* [HIVE-15939] - Make cast expressions comply more to sql2011
* [HIVE-15982] - Support the width_bucket function
* [HIVE-15986] - Support "is [not] distinct from"
* [HIVE-16171] - Support replication of truncate table
* [HIVE-16186] - REPL DUMP shows last event ID of the database even if we use LIMIT option.
* [HIVE-16197] - Incremental insert into a partitioned table doesn't get replicated.
* [HIVE-16207] - Add support for Complex Types in Fast SerDe
* [HIVE-16228] - Support subqueries in complex expression in SELECT clause
* [HIVE-16256] - Flaky test: TestCliDriver.testCliDriver[comments]
* [HIVE-16266] - Enable function metadata to be written during bootstrap
* [HIVE-16267] - Enable bootstrap function metadata to be loaded in repl load
* [HIVE-16268] - enable incremental repl dump to handle functions metadata
* [HIVE-16269] - enable incremental function dump to be loaded via repl load
* [HIVE-16272] - support for drop function in incremental replication
* [HIVE-16276] - Fix NoSuchMethodError: com.amazonaws.services.s3.transfer.TransferManagerConfiguration.setMultipartUploadThreshold(I)V
* [HIVE-16294] - Support snapshot for truncate table
* [HIVE-16312] - Flaky test: TestHCatClient.testTransportFailure
* [HIVE-16313] - Flaky test: TestBeeLineDriver[drop_with_concurrency]
* [HIVE-16320] - Flaky test: TestBeeLineDriver.testCliDriver[escape_comments]
* [HIVE-16330] - Improve plans for scalar subquery with aggregates
* [HIVE-16344] - Test and support replication of exchange partition
* [HIVE-16372] - Enable DDL statement for non-native tables (add/remove table properties)
* [HIVE-16400] - Fix the MDC reference to use slf4j rather than log4j
* [HIVE-16416] - Service: move constants out from HiveAuthFactory
* [HIVE-16467] - Flaky test: TestCliDriver.testCliDriver[vector_order_null]
* [HIVE-16488] - Support replicating into existing db if the db is empty
* [HIVE-16493] - Skip column stats when colStats is empty
* [HIVE-16504] - Addition of binary licenses broke rat check
* [HIVE-16530] - Add HS2 operation logs and improve logs for REPL commands
* [HIVE-16532] - HIVE on hadoop 3 build failed due to hdfs client/server jar separation
* [HIVE-16535] - Hive fails to build from source code tarball
* [HIVE-16542] - make merge that targets acid 2.0 table fail-fast
* [HIVE-16555] - Add a new thrift API call for get_metastore_uuid
* [HIVE-16556] - Modify schematool scripts to initialize and create METASTORE_DB_PROPERTIES table
* [HIVE-16566] - Set column stats default as true when creating new tables/partitions
* [HIVE-16568] - Support complex types in external LLAP InputFormat
* [HIVE-16579] - CachedStore: improvements to partition col stats caching and cache column stats for unpartitioned table
* [HIVE-16586] - Fix Unit test failures when CachedStore is enabled
* [HIVE-16591] - DR for function Binaries on HDFS
* [HIVE-16600] - Refactor SetSparkReducerParallelism#needSetParallelism to enable parallel order by in multi_insert cases
* [HIVE-16601] - Display Session Id and Query Name / Id in Spark UI
* [HIVE-16617] - Clean up javadoc from errors in module hive-shims
* [HIVE-16618] - Clean up javadoc from errors in module hive-common
* [HIVE-16619] - Clean up javadoc from errors in module hive-serde
* [HIVE-16628] - Fix query25 when it uses a mix of MergeJoin and MapJoin
* [HIVE-16637] - Improve end-of-data checking for LLAP input format
* [HIVE-16642] - New Events created as part of replv2 potentially break replv1
* [HIVE-16644] - Hook Change Manager to Insert Overwrite
* [HIVE-16647] - Improve the validation output to make the output to stderr and stdout more consistent
* [HIVE-16651] - LlapProtocolClientProxy stack trace when using llap input format
* [HIVE-16652] - LlapInputFormat: Seeing "output error" WARN message
* [HIVE-16653] - Mergejoin should give itself a correct tag
* [HIVE-16672] - Parquet vectorization doesn't work for tables with partition info
* [HIVE-16684] - Bootstrap REPL DUMP shouldn't fail when table is dropped after fetching the table names.
* [HIVE-16686] - repl invocations of distcp needs additional handling
* [HIVE-16688] - Make sure Alter Table to set transaction=true acquires X lock
* [HIVE-16691] - Add test for more datatypes for LlapInputFormat
* [HIVE-16697] - Schema table validator should return a sorted list of missing tables
* [HIVE-16702] - Use LazyBinarySerDe for LLAP InputFormat
* [HIVE-16706] - Bootstrap REPL DUMP shouldn't fail when a partition is dropped/renamed when dump in progress.
* [HIVE-16714] - make Task Dependency on Repl Load more intuitive
* [HIVE-16715] - Clean up javadoc from errors in modules llap-client, metastore, spark-client
* [HIVE-16722] - Converting bucketed non-acid table to acid should perform validation
* [HIVE-16727] - REPL DUMP for insert event should't fail if the table is already dropped.
* [HIVE-16729] - Improve location validator to check for blank paths.
* [HIVE-16747] - Remove YETUS*.sh files after a YETUS release
* [HIVE-16748] - Integreate YETUS to Pre-Commit
* [HIVE-16750] - Support change management for rename table/partition.
* [HIVE-16764] - Support numeric as same as decimal
* [HIVE-16765] - ParquetFileReader should be closed to avoid resource leak
* [HIVE-16774] - Support position in ORDER BY when using SELECT *
* [HIVE-16775] - Fix HiveFilterAggregateTransposeRule when filter is always false
* [HIVE-16779] - CachedStore leak PersistenceManager resources
* [HIVE-16782] - Flaky Test: TestMiniLlapLocalCliDriver[subquery_scalar]
* [HIVE-16785] - Ensure replication actions are idempotent if any series of events are applied again.
* [HIVE-16797] - Enhance HiveFilterSetOpTransposeRule to remove union branches
* [HIVE-16813] - Incremental REPL LOAD should load the events in the same sequence as it is dumped.
* [HIVE-16827] - Merge stats task and column stats task into a single task
* [HIVE-16837] - MetadataOnly optimizer conflicts with count distinct rewrite
* [HIVE-16838] - Improve plans for subqueries with non-equi co-related predicates
* [HIVE-16848] - NPE during CachedStore refresh
* [HIVE-16892] - Move creation of _files from ReplCopyTask to analysis phase for boostrap replication
* [HIVE-16893] - move replication dump related work in semantic analysis phase to execution phase using a task
* [HIVE-16895] - Multi-threaded execution of bootstrap dump of partitions
* [HIVE-16896] - move replication load related work in semantic analysis phase to execution phase using a task
* [HIVE-16901] - Distcp optimization - One distcp per ReplCopyTask
* [HIVE-16912] - Improve table validator's performance against Oracle
* [HIVE-16926] - LlapTaskUmbilicalExternalClient should not start new umbilical server for every fragment request
* [HIVE-16974] - Change the sort key for the schema tool validator to be <ID>
* [HIVE-16981] - hive.optimize.bucketingsorting should compare the schema before removing RS
* [HIVE-16990] - REPL LOAD should update last repl ID only after successful copy of data files.
* [HIVE-16992] - LLAP: monitoring and better default lambda for LRFU policy
* [HIVE-16996] - Add HLL as an alternative to FM sketch to compute stats
* [HIVE-16997] - Extend object store to store and use bit vectors
* [HIVE-16998] - Add config to enable HoS DPP only for map-joins
* [HIVE-17005] - Ensure REPL DUMP and REPL LOAD are authorized properly
* [HIVE-17021] - Support replication of concatenate operation.
* [HIVE-17087] - Remove unnecessary HoS DPP trees during map-join conversion
* [HIVE-17091] - "Timed out getting readerEvents" error from external LLAP client
* [HIVE-17100] - Improve HS2 operation logs for REPL commands.
* [HIVE-17112] - Reduce logging in HiveSparkClientFactory and RemoteHiveSparkClient
* [HIVE-17132] - Add InterfaceAudience and InterfaceStability annotations for UDF APIs
* [HIVE-17137] - Fix javolution conflict
* [HIVE-17153] - Flaky test: TestMiniSparkOnYarnCliDriver[spark_dynamic_partition_pruning]
* [HIVE-17157] - Add InterfaceAudience and InterfaceStability annotations for ObjectInspector APIs
* [HIVE-17167] - Create metastore specific configuration tool
* [HIVE-17168] - Create separate module for stand alone metastore
* [HIVE-17170] - Move thrift generated code to stand alone metastore
* [HIVE-17178] - Spark Partition Pruning Sink Operator can't target multiple Works
* [HIVE-17183] - Disable rename operations during bootstrap dump
* [HIVE-17185] - TestHiveMetaStoreStatsMerge.testStatsMerge is failing
* [HIVE-17195] - Long chain of tasks created by REPL LOAD shouldn't cause stack corruption.
* [HIVE-17196] - CM: ReplCopyTask should retain the original file names even if copied from CM path.
* [HIVE-17205] - add functional support for unbucketed tables
* [HIVE-17212] - Dynamic add partition by insert shouldn't generate INSERT event.
* [HIVE-17214] - check/fix conversion of unbucketed non-acid to acid
* [HIVE-17215] - Streaming Ingest API writing unbucketed tables
* [HIVE-17216] - Additional qtests for HoS DPP
* [HIVE-17224] - Move JDO classes to standalone metastore
* [HIVE-17225] - HoS DPP pruning sink ops can target parallel work objects
* [HIVE-17241] - Change metastore classes to not use the shims
* [HIVE-17247] - HoS DPP: UDFs on the partition column side does not evaluate correctly
* [HIVE-17256] - add a notion of a guaranteed task to LLAP
* [HIVE-17289] - EXPORT and IMPORT shouldn't perform distcp with doAs privileged user.
* [HIVE-17292] - Change TestMiniSparkOnYarnCliDriver test configuration to use the configured cores
* [HIVE-17297] - allow AM to use LLAP guaranteed tasks
* [HIVE-17307] - Change the metastore to not use the metrics code in hive/common
* [HIVE-17316] - Use String.startsWith for the hidden configuration variables
* [HIVE-17318] - Make Hikari CP configurable using hive properties in hive-site.xml
* [HIVE-17319] - Make BoneCp configurable using hive properties in hive-site.xml
* [HIVE-17330] - refactor TezSessionPoolManager to separate its multiple functions
* [HIVE-17346] - TestMiniSparkOnYarnCliDriver[spark_dynamic_partition_pruning] is failing every time
* [HIVE-17347] - TestMiniSparkOnYarnCliDriver[spark_dynamic_partition_pruning_mapjoin_only] is failing every time
* [HIVE-17359] - Deal with TypeInfo dependencies in the metastore
* [HIVE-17371] - Move tokenstores to metastore module
* [HIVE-17375] - stddev_samp,var_samp standard compliance
* [HIVE-17380] - refactor LlapProtocolClientProxy to be usable with other protocols
* [HIVE-17381] - When we enable Parquet Writer Version V2, hive throws an exception: Unsupported encoding: DELTA_BYTE_ARRAY.
* [HIVE-17382] - Change startsWith relation introduced in HIVE-17316
* [HIVE-17387] - implement Tez AM registry in Hive
* [HIVE-17405] - HoS DPP ConstantPropagate should use ConstantPropagateOption.SHORTCUT
* [HIVE-17409] - refactor LLAP ZK registry to make the ZK-registry part reusable
* [HIVE-17414] - HoS DPP + Vectorization generates invalid explain plan due to CombineEquivalentWorkResolver
* [HIVE-17428] - REPL LOAD of ALTER_PARTITION event doesn't create import tasks if the partition doesn't exist during analyze phase.
* [HIVE-17455] - External LLAP client: connection to HS2 should be kept open until explicitly closed
* [HIVE-17456] - Set current database for external LLAP interface
* [HIVE-17473] - implement workload management pools
* [HIVE-17482] - External LLAP client: acquire locks for tables queried directly by LLAP
* [HIVE-17488] - Move first set of classes to standalone metastore
* [HIVE-17494] - Bootstrap REPL DUMP throws exception if a partitioned table is dropped while reading partitions.
* [HIVE-17495] - CachedStore: prewarm improvement (avoid multiple sql calls to read partition column stats), refactoring and caching some aggregate stats
* [HIVE-17506] - Fix standalone-metastore pom.xml to not depend on hive's main pom
* [HIVE-17508] - Implement global execution triggers based on counters
* [HIVE-17514] - Use SHA-256 for cookie signer to improve security
* [HIVE-17515] - Use SHA-256 for GenericUDFMaskHash to improve security
* [HIVE-17527] - Support replication for rename/move table across database
* [HIVE-17528] - Add more q-tests for Hive-on-Spark with Parquet vectorized reader
* [HIVE-17534] - Add a config to turn off parquet vectorization
* [HIVE-17537] - Move Warehouse class to standalone metastore
* [HIVE-17541] - Move testing related methods from MetaStoreUtils to some testing related utility
* [HIVE-17566] - Create schema required for workload management.
* [HIVE-17581] - Replace some calcite dependencies with native ones
* [HIVE-17607] - remove ColumnStatsDesc usage from columnstatsupdatetask
* [HIVE-17608] - REPL LOAD should overwrite the data files if exists instead of duplicating it
* [HIVE-17617] - Rollup of an empty resultset should contain the grouping of the empty grouping set
* [HIVE-17629] - CachedStore - wait for prewarm at use time, not init time
* [HIVE-17645] - MM tables patch conflicts with HIVE-17482 (Spark/Acid integration)
* [HIVE-17647] - DDLTask.generateAddMmTasks(Table tbl) and other random code should not start transactions
* [HIVE-17651] - TableScanOperator might miss vectorization on flag
* [HIVE-17652] - retire ANALYZE TABLE ... PARTIALSCAN
* [HIVE-17661] - DBTxnManager.acquireLocks() - MM tables should use shared lock for Insert
* [HIVE-17671] - TableScanDesc.isAcidTable is restricted to FullAcid tables
* [HIVE-17681] - Need to log bootstrap dump progress state property to HS2 logs.
* [HIVE-17692] - Block HCat on Acid tables
* [HIVE-17696] - Vectorized reader does not seem to be pushing down projection columns in certain code paths
* [HIVE-17698] - FileSinkDesk.getMergeInputDirName() uses stmtId=0
* [HIVE-17708] - Upgrade surefire to 2.20.1
* [HIVE-17728] - TestHCatClient should use hive.metastore.transactional.event.listeners as per recommendation.
* [HIVE-17733] - Move RawStore to standalone metastore
* [HIVE-17743] - Add InterfaceAudience and InterfaceStability annotations for Thrift generated APIs
* [HIVE-17748] - ReplCopyTask doesn't support multi-file CopyWork
* [HIVE-17750] - add a flag to automatically create most tables as MM
* [HIVE-17756] - Enable subquery related Qtests for Hive on Spark
* [HIVE-17757] - REPL LOAD need to use customised configurations to execute distcp/remote copy.
* [HIVE-17771] - Implement commands to manage resource plan
* [HIVE-17778] - Add support for custom counters in trigger expression
* [HIVE-17809] - Implement per pool trigger validation and move sessions across pools
* [HIVE-17812] - Move remaining classes that HiveMetaStore depends on
* [HIVE-17835] - HS2 Logs print unnecessary stack trace when HoS query is cancelled
* [HIVE-17837] - Explicitly check if the HoS Remote Driver has been lost in the RemoteSparkJobMonitor
* [HIVE-17841] - implement applying the resource plan
* [HIVE-17842] - Run checkstyle on ptest2 module with proper configuration
* [HIVE-17850] - can VectorizedOrcAcidRowReader be removed once HIVE-17458 is done?
* [HIVE-17856] - MM tables - IOW is not ACID compliant
* [HIVE-17858] - MM - some union cases are broken
* [HIVE-17874] - Parquet vectorization fails on tables with complex columns when there are no projected columns
* [HIVE-17884] - Implement create, alter and drop workload management triggers
* [HIVE-17887] - Incremental REPL LOAD with Drop partition event on timestamp type partition column fails.
* [HIVE-17888] - Display the reason for query cancellation
* [HIVE-17897] - "repl load" in bootstrap phase fails when partitions have whitespace
* [HIVE-17902] - add notions of default pool and start adding unmanaged mapping
* [HIVE-17904] - handle internal Tez AM restart in registry and WM
* [HIVE-17905] - propagate background LLAP cluster changes to WM
* [HIVE-17906] - use kill query mechanics to kill queries in WM
* [HIVE-17907] - enable and apply resource plan commands in HS2
* [HIVE-17913] - Cleanup unused methods in Driver
* [HIVE-17926] - Support triggers for non-pool sessions
* [HIVE-17929] - Use sessionId for HoS Remote Driver Client id
* [HIVE-17931] - Implement Parquet vectorization reader for Array type
* [HIVE-17933] - make antlr output directory to use a top-level sourceset
* [HIVE-17934] - Merging Statistics are promoted to COMPLETE (most of the time)
* [HIVE-17945] - Support column projection for index access when using Parquet Vectorization
* [HIVE-17950] - Implement resource plan fetching from metastore
* [HIVE-17954] - Implement pool, user, group and trigger to pool management API's.
* [HIVE-17961] - NPE during initialization of VectorizedParquetRecordReader when input split is null
* [HIVE-17967] - Move HiveMetaStore class
* [HIVE-17970] - MM LOAD DATA with OVERWRITE doesn't use base_n directory concept
* [HIVE-17972] - Implement Parquet vectorization reader for Map type
* [HIVE-17980] - Move HiveMetaStoreClient plus a few remaining classes.
* [HIVE-17981] - Create a set of builders for Thrift classes
* [HIVE-17982] - Move metastore specific itests
* [HIVE-17983] - Make the standalone metastore generate tarballs etc.
* [HIVE-17990] - Add Thrift and DB storage for Schema Registry objects
* [HIVE-17991] - Remove CommandNeedRetryException
* [HIVE-17995] - Run checkstyle on standalone-metastore module with proper configuration
* [HIVE-17996] - Fix ASF headers
* [HIVE-17997] - Add rat plugin and configuration to standalone metastore pom
* [HIVE-18002] - add group support for pool mappings
* [HIVE-18003] - add explicit jdbc connection string args for mappings
* [HIVE-18004] - investigate deriving app name from JDBC connection for pool mapping
* [HIVE-18005] - Improve size estimation for array() to be not 0
* [HIVE-18025] - Push resource plan changes to tez/unmanaged sessions
* [HIVE-18028] - fix WM based on cluster smoke test; add logging
* [HIVE-18029] - beeline - support proper usernames based on the URL arg
* [HIVE-18031] - Support replication for Alter Database operation.
* [HIVE-18034] - Improving logging with HoS executors spend lots of time in GC
* [HIVE-18036] - Stats: Remove usage of clone() methods
* [HIVE-18053] - Support different table types for MVs
* [HIVE-18056] - CachedStore: Have a whitelist/blacklist config to allow selective caching of tables/partitions and allow read while prewarming
* [HIVE-18057] - remove PostExecute / PreExecute hook support
* [HIVE-18063] - Make CommandProcessorResponse an exception instead of a return class
* [HIVE-18071] - add HS2 jmx information about pools and current resource plan
* [HIVE-18072] - fix various WM bugs based on cluster testing - part 2
* [HIVE-18073] - AM may assert when its guaranteed task count is reduced
* [HIVE-18075] - verify commands on a cluster
* [HIVE-18076] - killquery doesn't actually work for non-trigger WM kills
* [HIVE-18078] - WM getSession needs some retry logic
* [HIVE-18084] - Upgrade checkstyle version to support lambdas
* [HIVE-18085] - Run checkstyle on storage-api module with proper configuration
* [HIVE-18088] - Add WM event traces at query level for debugging
* [HIVE-18092] - Fix exception on tables handled by HBaseHandler if columnsstats are auto-gathered
* [HIVE-18093] - Improve logging when HoS application is killed
* [HIVE-18095] - add a unmanaged flag to triggers (applies to container based sessions)
* [HIVE-18096] - add a user-friendly show plan command
* [HIVE-18125] - Support arbitrary file names in input to Load Data
* [HIVE-18133] - Parametrize TestTxnNoBuckets wrt Vectorization
* [HIVE-18134] - some alter resource plan fixes
* [HIVE-18138] - Fix columnstats problem in case schema evolution
* [HIVE-18141] - Fix StatsUtils.combineRange to combine intervals
* [HIVE-18149] - Stats: rownum estimation from datasize underestimates in most cases
* [HIVE-18153] - refactor reopen and file management in TezTask
* [HIVE-18161] - Remove hive.stats.atomic
* [HIVE-18163] - Stats: create materialized view should also collect stats
* [HIVE-18170] - User mapping not initialized correctly on start
* [HIVE-18179] - Implement validate resource plan (part 1)
* [HIVE-18187] - Add jamon generated-sources as source folder
* [HIVE-18190] - Consider looking at ORC file schema rather than using _metadata_acid file
* [HIVE-18192] - Introduce WriteID per table rather than using global transaction ID
* [HIVE-18193] - Migrate existing ACID tables to use write id per table rather than global transaction id
* [HIVE-18202] - Automatically migrate hbase.table.name to hbase.mapreduce.hfileoutputformat.table.name for hbase-based table
* [HIVE-18203] - change the way WM is enabled and allow dropping the last resource plan
* [HIVE-18209] - Fix API call in VectorizedListColumnReader to get value from BytesColumnVector
* [HIVE-18211] - Support to read multiple level definition for Map type in Parquet file
* [HIVE-18212] - Make sure Yetus check always has a full log
* [HIVE-18214] - Flaky test: TestSparkClient
* [HIVE-18222] - Update checkstyle rules to be less peeky
* [HIVE-18224] - Introduce interface above driver
* [HIVE-18229] - add the unmanaged mapping command
* [HIVE-18230] - create plan like plan, and replace plan commands for easy modification
* [HIVE-18235] - Columnstats gather on mm tables: re-enable disabled test
* [HIVE-18237] - missing results for insert_only table after DP insert
* [HIVE-18238] - Driver execution may not have configuration changing sideeffects
* [HIVE-18245] - clean up acid_vectorization_original.q
* [HIVE-18257] - implement scheduling policy configuration instead of hardcoding fair scheduling
* [HIVE-18273] - add LLAP-level counters for WM
* [HIVE-18274] - add AM level metrics for WM
* [HIVE-18275] - add HS2-level WM metrics
* [HIVE-18286] - java.lang.ClassCastException: org.apache.hadoop.hive.ql.exec.vector.DoubleColumnVector cannot be cast to org.apache.hadoop.hive.ql.exec.vector.LongColumnVector
* [HIVE-18288] - merge/concat not supported on Acid table
* [HIVE-18294] - add switch to make acid table the default
* [HIVE-18315] - update tests use non-acid tables
* [HIVE-18317] - Improve error messages in TransactionalValidationListerner
* [HIVE-18323] - Vectorization: add the support of timestamp in VectorizedPrimitiveColumnReader for parquet
* [HIVE-18366] - Update HBaseSerDe to use hbase.mapreduce.hfileoutputformat.table.name instead of hbase.table.name as the table name property
* [HIVE-18368] - Improve Spark Debug RDD Graph
* [HIVE-18372] - Create testing infra to test different HMS instances
* [HIVE-18389] - Print out Spark Web UI URL to the console log
* [HIVE-18411] - Fix ArrayIndexOutOfBoundsException for VectorizedListColumnReader
* [HIVE-18418] - clean up plugin between DAGs
* [HIVE-18437] - use plan parallelism for the default pool if both are present
* [HIVE-18438] - WM RP: it's impossible to unset things
* [HIVE-18443] - Ensure git gc finished in ptest prep phase before copying repo
* [HIVE-18457] - improve show plan output (triggers, mappings)
* [HIVE-18458] - Workload manager initializes even when interactive queue is not set
* [HIVE-18468] - Create tests to cover alterPartition and renamePartition methods
* [HIVE-18478] - Data files deleted from temp table should not be recycled to CM path
* [HIVE-18479] - Create tests to cover dropPartition methods
* [HIVE-18480] - Create tests for function related methods
* [HIVE-18481] - Create tests for table related methods (get, list, exists)
* [HIVE-18483] - Create tests to cover getPartition(s) methods
* [HIVE-18484] - Create tests to cover listPartition(s) methods
* [HIVE-18486] - Create tests to cover add partition methods
* [HIVE-18489] - Automatically migrate s3n URIs to s3a URIs
* [HIVE-18495] - JUnit rule to enable Driver level testing
* [HIVE-18496] - Create tests to cover add/alter/drop index methods
* [HIVE-18498] - Create tests to cover get and list index methods
* [HIVE-18509] - Create tests for table manipulation related methods (create, alter, drop)
* [HIVE-18511] - Fix generated checkstyle errors
* [HIVE-18536] - IOW + DP is broken for insert-only ACID
* [HIVE-18541] - Secure HS2 web UI with PAM
* [HIVE-18542] - Create tests to cover getTableMeta method
* [HIVE-18544] - Create tests to cover appendPartition methods
* [HIVE-18550] - Keep the hbase table name property as hbase.table.name
* [HIVE-18553] - Support schema evolution in Parquet Vectorization reader
* [HIVE-18566] - Create tests to cover adding partitions from PartitionSpec
* [HIVE-18580] - Create tests to cover exchange partitions
* [HIVE-18596] - Synchronize value of hive.spark.client.connect.timeout across unit tests
* [HIVE-18609] - Results cache invalidation based on ACID table updates
* [HIVE-18633] - Service discovery for Active/Passive HA mode
* [HIVE-18635] - Generalize hook dispatch logics in Driver
* [HIVE-18651] - Expose additional Spark metrics
* [HIVE-18663] - Logged Spark Job Id contains a UUID instead of the actual id
* [HIVE-18672] - Printed state in RemoteSparkJobMonitor is ambiguous
* [HIVE-18673] - ErrorMsg.SPARK_JOB_MONITOR_TIMEOUT isn't formatted correctly
* [HIVE-18677] - SparkClientImpl usage of SessionState.LogHelper doesn't respect isSilent value
* [HIVE-18679] - create/replicate open transaction event
* [HIVE-18703] - Make Operator comparision to be based on some primitive
* [HIVE-18715] - Remove index support from metastore
* [HIVE-18720] - Replicate Commit Txn operation (without writes)
* [HIVE-18745] - Fix MetaStore creation in tests, so multiple MetaStores can be started on the same machine
* [HIVE-18747] - Cleaner for TXN_TO_WRITE_ID table entries using MIN_HISTORY_LEVEL.
* [HIVE-18749] - Need to replace transactionId with writeId in RecordIdentifier and other relevant contexts.
* [HIVE-18750] - Exchange partition should be disabled on ACID/Insert-only tables with per table write ID.
* [HIVE-18751] - ACID table scan through get_splits UDF doesn't receive ValidWriteIdList configuration.
* [HIVE-18753] - Correct methods and variables names which uses writeId instead of transactionId.
* [HIVE-18755] - Modifications to the metastore for catalogs
* [HIVE-18765] - SparkClientImpl swallows exception messages from the RemoteDriver
* [HIVE-18771] - Refactor tests, so only 1 MetaStore instance will be started per test class and test configuration
* [HIVE-18781] - Create/Replicate Open, Commit (without writes) and Abort Txn events
* [HIVE-18805] - Add ConstantPropagate before stats annotation
* [HIVE-18824] - ValidWriteIdList config should be defined on tables which has to collect stats after insert
* [HIVE-18830] - RemoteSparkJobMonitor failures are logged twice
* [HIVE-18832] - Support change management for trashing data files from ACID tables.
* [HIVE-18840] - CachedStore: Prioritize loading of recently accessed tables during prewarm
* [HIVE-18846] - Query results cache: Allow queries to refer to the pending results of a query that has not finished yet
* [HIVE-18855] - Fix unit test TestMiniLlapLocalCliDriver.testCliDriver[results_cache_1]
* [HIVE-18861] - druid-hdfs-storage is pulling in hadoop-aws-2.7.x and aws SDK, creating classpath problems on hadoop 3.x
* [HIVE-18864] - ValidWriteIdList snapshot seems incorrect if obtained after allocating writeId by current transaction.
* [HIVE-18899] - Separate FetchWork required for each query that uses the results cache
* [HIVE-18909] - Metrics for results cache
* [HIVE-18926] - Imporve operator-tree matching
* [HIVE-18946] - Fix columnstats merge NPE
* [HIVE-18961] - Error in results cache when query has identifiers with spaces
* [HIVE-18982] - Provide a CLI option to manually trigger failover
* [HIVE-18988] - Support bootstrap replication of ACID tables
* [HIVE-18994] - Handle client connections on failover
* [HIVE-19009] - Retain and use runtime statistics during hs2 lifetime
* [HIVE-19031] - Mark duplicate configs in HiveConf as deprecated
* [HIVE-19083] - Make partition clause optional for INSERT
* [HIVE-19089] - Create/Replicate Allocate write-id event
* [HIVE-19112] - Support Analyze table for partitioned tables without partition spec
* [HIVE-19126] - CachedStore: Use memory estimation to limit cache size during prewarm
* [HIVE-19127] - Concurrency fixes in QueryResultsCache
* [HIVE-19128] - Update golden files for spark perf tests
* [HIVE-19129] - Support DEFAULT keyword with MERGE
* [HIVE-19135] - Need tool to allow admins to create catalogs and move existing dbs to catalog during upgrade
* [HIVE-19138] - Results cache: allow queries waiting on pending cache entries to check cache again if pending query fails
* [HIVE-19141] - TestNegativeCliDriver insert_into_notnull_constraint, insert_into_acid_notnull failing
* [HIVE-19144] - TestSparkCliDriver:subquery_scalar - golden file needs to be udpated
* [HIVE-19145] - Stabilize statsoptimizer.q test
* [HIVE-19146] - Delete dangling q.out
* [HIVE-19147] - Fix PerfCliDrivers: Tpcds30T missed CAT_NAME change
* [HIVE-19153] - Update golden files for few tests
* [HIVE-19154] - Poll notification events to invalidate the results cache
* [HIVE-19156] - TestMiniLlapLocalCliDriver.vectorized_dynamic_semijoin_reduction.q is broken
* [HIVE-19159] - TestMTQueries.testMTQueries1 failure
* [HIVE-19164] - TestMetastoreVersion failures
* [HIVE-19171] - Persist runtime statistics in metastore
* [HIVE-19175] - TestMiniLlapLocalCliDriver.testCliDriver update_access_time_non_current_db failing
* [HIVE-19178] - TestMiniTezCliDriver.testCliDriver[explainanalyze_5] failure
* [HIVE-19193] - TestActivePassiveHA fails
* [HIVE-19194] - TestDruidStorageHandler fails
* [HIVE-19195] - Fix flaky tests and cleanup testconfiguration to run llap specific tests in llap only.
* [HIVE-19196] - TestTriggersMoveWorkloadManager is flaky
* [HIVE-19197] - TestReplicationScenarios is flaky
* [HIVE-19206] - Automatic memory management for open streaming writers
* [HIVE-19209] - Streaming ingest record writers should accept input stream
* [HIVE-19210] - Create separate module for streaming ingest
* [HIVE-19211] - New streaming ingest API and support for dynamic partitioning
* [HIVE-19214] - High throughput ingest ORC format
* [HIVE-19222] - TestNegativeCliDriver tests are failing due to "java.lang.OutOfMemoryError: GC overhead limit exceeded"
* [HIVE-19232] - results_cache_invalidation2 is failing
* [HIVE-19274] - Add an OpTreeSignature persistence checker hook
* [HIVE-19319] - RuntimeStats fixes
* [HIVE-19322] - broken test: TestNegativeMinimrCliDriver#testCliDriver[minimr_broken_pipe]
* [HIVE-19335] - Disable runtime filtering (semijoin reduction opt with bloomfilter) for external tables
* [HIVE-19346] - TestMiniLlapLocalCliDriver.testCliDriver[materialized_view_create_rewrite_5] failling
* [HIVE-19347] - TestTriggersWorkloadManager tests are failing consistently
* [HIVE-19348] - org.apache.hadoop.hive.ql.plan.mapping.TestOperatorCmp are failing
* [HIVE-19371] - Add table ownerType to HMS thrift API
* [HIVE-19372] - Add table ownerType to JDO/SQL and ObjectStore
* [HIVE-19374] - Parse and process ALTER TABLE SET OWNER command syntax
* [HIVE-19400] - Adjust Hive 1.0 to 2.0 conversion utility to the upgrade
* [HIVE-19471] - bucket_map_join_tez1 and bucket_map_join_tez2 are failing
* [HIVE-19472] - HiveStreamingConnection swallows exception on partition creation
* [HIVE-19494] - Accept shade prefix during reflective instantiation of output format
** Bug
* [HIVE-4577] - hive CLI can't handle hadoop dfs command with space and quotes.
* [HIVE-6348] - Order by/Sort by in subquery
* [HIVE-6590] - Hive does not work properly with boolean partition columns (wrong results and inserts to incorrect HDFS path)
* [HIVE-6990] - Direct SQL fails when the explicit schema setting is different from the default one
* [HIVE-8937] - fix description of hive.security.authorization.sqlstd.confwhitelist.* params
* [HIVE-9815] - Metastore column"SERDE_PARAMS"."PARAM_VALUE" limited to 4000 bytes
* [HIVE-10616] - TypeInfoUtils doesn't handle DECIMAL with just precision specified
* [HIVE-10865] - Beeline needs to support DELIMITER command
* [HIVE-11064] - ALTER TABLE CASCADE ERROR unbalanced calls to openTransaction/commitTransaction
* [HIVE-11266] - count(*) wrong result based on table statistics for external tables
* [HIVE-11297] - Combine op trees for partition info generating tasks
* [HIVE-11609] - Capability to add a filter to hbase scan via composite key doesn't work
* [HIVE-12408] - SQLStdAuthorizer should not require external table creator to be owner of directory, in addition to rw permissions
* [HIVE-12425] - OrcRecordUpdater.close(true) leaves the file open
* [HIVE-12631] - LLAP IO: support ORC ACID tables
* [HIVE-12719] - As a hive user, I am facing issues using permanent UDAF's.
* [HIVE-12734] - Remove redundancy in HiveConfs serialized to UDFContext
* [HIVE-13000] - Hive returns useless parsing error
* [HIVE-13652] - Import table change order of dynamic partitions
* [HIVE-14032] - INSERT OVERWRITE command failed with case sensitive partition key names
* [HIVE-14052] - Cleanup structures when external clients use LLAP
* [HIVE-14077] - add implicit decimal arithmetic q test, fix issues if found
* [HIVE-14455] - upgrade httpclient, httpcore to match updated hadoop dependency
* [HIVE-14560] - Support exchange partition between s3 and hdfs tables
* [HIVE-14564] - Column Pruning generates out of order columns in SelectOperator which cause ArrayIndexOutOfBoundsException.
* [HIVE-14678] - Hive-on-MR deprecation warning is not diplayed when engine is set to capital letter 'MR'
* [HIVE-14731] - Use Tez cartesian product edge in Hive (unpartitioned case only)
* [HIVE-14792] - AvroSerde reads the remote schema-file at least once per mapper, per table reference.
* [HIVE-14813] - Make TransactionBatchImpl.toString() include state of each txn: commit/abort
* [HIVE-14988] - Support INSERT OVERWRITE into a partition on transactional tables
* [HIVE-15077] - Acid LockManager is unfair
* [HIVE-15104] - Hive on Spark generate more shuffle data than hive on mr
* [HIVE-15144] - JSON.org license is now CatX
* [HIVE-15160] - Can't order by an unselected column
* [HIVE-15176] - Small typo in hiveserver2 webui
* [HIVE-15249] - HIve 2.1.0 is throwing InvalidObjectException(message:Invalid column type name is too long
* [HIVE-15267] - Make query length calculation logic more accurate in TxnUtils.needNewQuery()
* [HIVE-15343] - Spelling errors in logging and exceptions for beeline, common, hbase-handler, hcatalog, llap-server, orc, serde and shims
* [HIVE-15344] - Spelling errors in logging and exceptions for metastore and service directories
* [HIVE-15442] - Driver.java has a redundancy code
* [HIVE-15483] - Database and table name is case sensitive when used in show grant
* [HIVE-15504] - ArrayIndexOutOfBoundsException in GenericUDFTrunc::initialize
* [HIVE-15515] - Remove the docs directory
* [HIVE-15552] - Unable to coalesce DATE and TIMESTAMP types
* [HIVE-15630] - add operation handle before operation.run instead of after operation.run
* [HIVE-15632] - Hive/Druid integration: Incorrect result - Limit on timestamp disappears
* [HIVE-15635] - Hive/Druid integration: timeseries query shows all days, even if no data
* [HIVE-15636] - Hive/Druid integration: wrong semantics of topN query limit with granularity
* [HIVE-15637] - Hive/Druid integration: wrong semantics of groupBy query limit with granularity
* [HIVE-15639] - Hive/Druid integration: wrong semantics for ordering within groupBy queries
* [HIVE-15680] - Incorrect results when hive.optimize.index.filter=true and same ORC table is referenced twice in query
* [HIVE-15724] - getPrimaryKeys and getForeignKeys in metastore does not normalize db and table name
* [HIVE-15739] - Incorrect exception message in PartExprEvalUtils
* [HIVE-15761] - ObjectStore.getNextNotification could return an empty NotificationEventResponse causing TProtocolException
* [HIVE-15767] - Hive On Spark is not working on secure clusters from Oozie
* [HIVE-15829] - LLAP text cache: disable memory tracking on the writer
* [HIVE-15883] - HBase mapped table in Hive insert fail for decimal
* [HIVE-15995] - Syncing metastore table with serde schema
* [HIVE-16007] - When the query does not complie the LogRunnable never stops
* [HIVE-16025] - Where IN clause throws exception
* [HIVE-16026] - Generated query will timeout and/or kill the druid cluster.
* [HIVE-16027] - <timestamp> BETWEEN <string> AND <string> must cast to TIMESTMAP
* [HIVE-16044] - LLAP: Shuffle Handler keep-alive connections are closed from the server side
* [HIVE-16053] - Remove newRatio from llap JAVA_OPTS_BASE
* [HIVE-16057] - SchemaTool ignores --passWord argument if hadoop.security.credential.provider.path is configured
* [HIVE-16061] - When hive.async.log.enabled is set to true, some output is not printed to the beeline console
* [HIVE-16077] - UPDATE/DELETE fails with numBuckets > numReducers
* [HIVE-16113] - PartitionPruner::removeNonPartCols needs to handle AND/OR cases
* [HIVE-16117] - SortProjectTransposeRule should check for monotonicity preserving CAST
* [HIVE-16125] - Split work between reducers.
* [HIVE-16130] - Remove jackson classes from hive-jdbc standalone jar
* [HIVE-16147] - Rename a partitioned table should not drop its partition columns stats
* [HIVE-16174] - Update MetricsConstant.WAITING_COMPILE_OPS metric when we acquire lock failed in Driver
* [HIVE-16177] - non Acid to acid conversion doesn't handle _copy_N files
* [HIVE-16188] - beeline should block the connection if given invalid database name.
* [HIVE-16193] - Hive show compactions not reflecting the status of the application
* [HIVE-16213] - ObjectStore can leak Queries when rollbackTransaction throws an exception
* [HIVE-16219] - metastore notification_log contains serialized message with non functional fields
* [HIVE-16222] - add a setting to disable row.serde for specific formats; enable for others
* [HIVE-16225] - Memory leak in webhcat service (FileSystem CACHE entries)
* [HIVE-16233] - llap: Query failed with AllocatorOutOfMemoryException
* [HIVE-16254] - metadata for values temporary tables for INSERTs are getting replicated during bootstrap
* [HIVE-16275] - Vectorization: Add ReduceSink support for TopN (in specialized native classes)
* [HIVE-16282] - Semijoin: Disable slow-start for the bloom filter aggregate task
* [HIVE-16287] - Alter table partition rename with location - moves partition back to hive warehouse
* [HIVE-16290] - Stats: StatsRulesProcFactory::evaluateComparator estimates are wrong when minValue == filterValue
* [HIVE-16291] - Hive fails when unions a parquet table with itself
* [HIVE-16296] - use LLAP executor count to configure reducer auto-parallelism
* [HIVE-16298] - Add config to specify multi-column joins have correlated columns
* [HIVE-16299] - MSCK REPAIR TABLE should enforce partition key order when adding unknown partitions
* [HIVE-16302] - Add junit dependency to hive-shims-common to compile with Hadoop 2.8+
* [HIVE-16305] - Additional Datanucleus ClassLoaderResolverImpl leaks causing HS2 OOM
* [HIVE-16307] - add IO memory usage report to LLAP UI
* [HIVE-16308] - PreExecutePrinter and PostExecutePrinter should log to INFO level instead of ERROR
* [HIVE-16309] - Hive Test Commands failure should be printed in hive.log in addition to stderr
* [HIVE-16315] - Describe table doesn't show num of partitions
* [HIVE-16316] - Prepare master branch for 3.0.0 development.
* [HIVE-16317] - CASE .. NULL in JOIN condition can trigger SemanticException
* [HIVE-16318] - LLAP cache: address some issues in 2.2/2.3
* [HIVE-16319] - LLAP: Better handling of an empty wait queue, should try scheduling checks
* [HIVE-16321] - Possible deadlock in metastore with Acid enabled
* [HIVE-16323] - HS2 JDOPersistenceManagerFactory.pmCache leaks after HIVE-14204
* [HIVE-16324] - Truncate table should not work when EXTERNAL property of table is true
* [HIVE-16325] - Tez session refresh based on a time interval fails
* [HIVE-16328] - HoS: more aggressive mapjoin optimization when hive.spark.use.ts.stats.for.mapjoin is true
* [HIVE-16329] - TopN: use local executor info for LLAP memory checks
* [HIVE-16333] - remove the redundant symbol "\" to appear red in sublime text 3
* [HIVE-16335] - Beeline user HS2 connection file should use /etc/hive/conf instead of /etc/conf/hive
* [HIVE-16336] - Rename hive.spark.use.file.size.for.mapjoin to hive.spark.use.ts.stats.for.mapjoin
* [HIVE-16341] - Tez Task Execution Summary has incorrect input record counts on some operators
* [HIVE-16347] - HiveMetastoreChecker should skip listing partitions which are not valid when hive.msck.path.validation is set to skip or ignore
* [HIVE-16353] - Jetty 9 upgrade breaks hive master LLAP
* [HIVE-16357] - Failed folder creation when creating a new table is reported incorrectly
* [HIVE-16363] - QueryLifeTimeHooks should catch parse exceptions
* [HIVE-16368] - Unexpected java.lang.ArrayIndexOutOfBoundsException from query with LaterView Operation for hive on MR.
* [HIVE-16369] - Vectorization: Support PTF (Part 1: No Custom Window Framing -- Default Only)
* [HIVE-16380] - removing global test dependency of jsonassert
* [HIVE-16384] - Remove jdk7 build from travis
* [HIVE-16385] - StatsNoJobTask could exit early before all partitions have been processed
* [HIVE-16388] - LLAP: Log rotation for daemon, history and gc files
* [HIVE-16389] - Allow HookContext to access SQLOperationDisplay
* [HIVE-16390] - LLAP IO should take job config into account; also LLAP config should load defaults
* [HIVE-16393] - Fix visibility of CodahaleReporter interface
* [HIVE-16394] - HoS does not support queue name change in middle of session
* [HIVE-16396] - Sync storage-api version in pom.xml
* [HIVE-16399] - create an index for tc_txnid in TXN_COMPONENTS
* [HIVE-16402] - Upgrade to Hadoop 2.8.0
* [HIVE-16403] - LLAP UI shows the wrong number of executors
* [HIVE-16404] - Renaming of public classes in Calcite 12 breeaking druid integration
* [HIVE-16406] - Remove unwanted interning when creating PartitionDesc
* [HIVE-16409] - TestEventHandlerFactory has lacked the ASF header
* [HIVE-16413] - Create table as select does not check ownership of the location
* [HIVE-16421] - Runtime filtering breaks user-level explain
* [HIVE-16422] - Should kill running Spark Jobs when a query is cancelled.
* [HIVE-16425] - Vectorization: unload old hashtables before reloadHashTable
* [HIVE-16427] - Fix multi-insert query and write qtests
* [HIVE-16433] - Not nullify variable "rj" to avoid NPE due to race condition in ExecDriver.
* [HIVE-16436] - Response times in "Task Execution Summary" at the end of the job is not correct
* [HIVE-16448] - Vectorization: Vectorized order_null.q fails with deserialize EOF exception below TEZ ReduceRecordSource.processVectorGroup
* [HIVE-16450] - Some metastore operations are not retried even with desired underlining exceptions
* [HIVE-16451] - Race condition between HiveStatement.getQueryLog and HiveStatement.runAsyncOnServer
* [HIVE-16459] - Forward channelInactive to RpcDispatcher
* [HIVE-16461] - DagUtils checks local resource size on the remote fs
* [HIVE-16462] - Vectorization: Enabling hybrid grace disables specialization of all reduce side joins
* [HIVE-16465] - NullPointer Exception when enable vectorization for Parquet file format
* [HIVE-16468] - BeeLineDriver should be able to run tests against an externally created cluster
* [HIVE-16471] - Add metrics for "waiting compilation time"
* [HIVE-16473] - Hive-on-Tez may fail to write to an HBase table
* [HIVE-16482] - Druid Ser/Des need to use dimension output name
* [HIVE-16483] - HoS should populate split related configurations to HiveConf
* [HIVE-16485] - Enable outputName for RS operator in explain formatted
* [HIVE-16487] - Serious Zookeeper exception is logged when a race condition happens
* [HIVE-16491] - CBO cant handle join involving complex types in on condition
* [HIVE-16494] - udaf percentile_approx() may fail on CBO
* [HIVE-16497] - FileUtils. isActionPermittedForFileHierarchy, isOwnerOfFileHierarchy file system operations should be impersonated
* [HIVE-16507] - Hive Explain User-Level may print out "Vertex dependency in root stage" twice
* [HIVE-16510] - Vectorization: Add vectorized PTF tests in preparation for HIVE-16369
* [HIVE-16511] - CBO looses inner casts on constants of complex type
* [HIVE-16513] - width_bucket issues
* [HIVE-16518] - Insert override for druid does not replace all existing segments
* [HIVE-16519] - Fix exception thrown by checkOutputSpecs
* [HIVE-16523] - VectorHashKeyWrapper hash code for strings is not so good
* [HIVE-16524] - Remove the redundant item type in hiveserver2.jsp and QueryProfileTmpl.jamon
* [HIVE-16533] - Vectorization: Avoid evaluating empty groupby keys
* [HIVE-16534] - Add capability to tell aborted transactions apart from open transactions in ValidTxnList
* [HIVE-16538] - TestExecDriver fails if run after TestOperators#testScriptOperator
* [HIVE-16539] - Add PTF tests for blobstores
* [HIVE-16545] - LLAP: bug in arena size determination logic
* [HIVE-16546] - LLAP: Fail map join tasks if hash table memory exceeds threshold
* [HIVE-16547] - LLAP: may not unlock buffers in some cases
* [HIVE-16553] - Change default value for hive.tez.bigtable.minsize.semijoin.reduction
* [HIVE-16554] - ACID: Make HouseKeeperService threads daemon
* [HIVE-16557] - Vectorization: Specialize ReduceSink empty key case
* [HIVE-16559] - Parquet schema evolution for partitioned tables may break if table and partition serdes differ
* [HIVE-16562] - Issues with nullif / fetch task
* [HIVE-16563] - Alter table partition set location should use fully qualified path for non-default FS
* [HIVE-16572] - Rename a partition should not drop its column stats
* [HIVE-16573] - In-place update for HoS can't be disabled
* [HIVE-16576] - Fix encoding of intervals when fetching select query candidates from druid
* [HIVE-16577] - Syntax error in the metastore init scripts for mssql
* [HIVE-16578] - Semijoin Hints should use column name, if provided for partition key check
* [HIVE-16581] - a bug in HIVE-16523
* [HIVE-16584] - Warning messages should use LogHelper.printInfo instead of printing to the infoStream directly
* [HIVE-16588] - Resource leak by druid http client
* [HIVE-16589] - Vectorization: Support Complex Types and GroupBy modes PARTIAL2, FINAL, and COMPLETE for AVG, VARIANCE
* [HIVE-16590] - Make initializing dag names in SparkWork thread safe for parallel compilation (HIVE-13512)
* [HIVE-16592] - Vectorization: Long hashCodes should bit-mix into lower bits
* [HIVE-16593] - SparkClientFactory.stop may prevent JVM from exiting
* [HIVE-16598] - LlapServiceDriver - create directories and warn of errors
* [HIVE-16599] - NPE in runtime filtering cost when handling SMB Joins
* [HIVE-16603] - Enforce foreign keys to refer to primary keys or unique keys
* [HIVE-16607] - ColumnStatsAutoGatherContext regenerates HiveConf.HIVEQUERYID
* [HIVE-16609] - col='__HIVE_DEFAULT_PARTITION__' condition in select statement may produce wrong result
* [HIVE-16610] - Semijoin Hint : Should be able to handle more than one hint per alias
* [HIVE-16613] - SaslClientHandler.sendHello is eating exceptions
* [HIVE-16625] - Extra '\0' characters in the output, when SeparatedValuesOutputFormat is used and the quoting is disabled
* [HIVE-16633] - username for ATS data shall always be the uid who submit the job
* [HIVE-16634] - LLAP Use a pool of connections to a single AM from a daemon
* [HIVE-16640] - The ASF Headers have some errors in some class
* [HIVE-16645] - Commands.java has missed the catch statement and has some code format errors
* [HIVE-16646] - Alias in transform ... as clause shouldn't be case sensitive
* [HIVE-16654] - Optimize a combination of avg(), sum(), count(distinct) etc
* [HIVE-16658] - TestTimestampTZ.java has missed the ASF header
* [HIVE-16659] - Query plan should reflect hive.spark.use.groupby.shuffle
* [HIVE-16660] - Not able to add partition for views in hive when sentry is enabled
* [HIVE-16665] - Race condition in Utilities.GetInputPathsCallable --> createDummyFileForEmptyPartition
* [HIVE-16667] - PostgreSQL metastore handling of CLOB types for COLUMNS_V2.TYPE_NAME and other field is incorrect
* [HIVE-16671] - LLAP IO: BufferUnderflowException may happen in very rare(?) cases due to ORC end-of-CB estimation
* [HIVE-16675] - Fix ConcurrentModificationException in SparkClientImpl#startDriver
* [HIVE-16677] - CTAS with no data fails in Druid
* [HIVE-16678] - Truncate on temporary table fails with "table not found" error.
* [HIVE-16679] - Missing ASF header on properties file in ptest2 project
* [HIVE-16689] - Correlated scalar subquery with comparison to constant in predicate fails
* [HIVE-16692] - LLAP: Keep alive connection in shuffle handler should not be closed until entire data is flushed out
* [HIVE-16693] - beeline "source" command freezes if you have a comment in it?
* [HIVE-16696] - Fix JoinCondDesc explain string
* [HIVE-16698] - HoS should avoid mapjoin optimization in case of union and using table stats
* [HIVE-16703] - Hive may add the same file to the session and vertex in Tez
* [HIVE-16708] - Exception while renewing a Delegation Token
* [HIVE-16721] - Inconsistent behavior in dealing with Timestamp stats
* [HIVE-16724] - increase session timeout for LLAP ZK token manager
* [HIVE-16730] - Vectorization: Schema Evolution for Text Vectorization / Complex Types
* [HIVE-16731] - Vectorization: Make "CASE WHEN (day_name='Sunday') THEN column1 ELSE null end" that involves a column name or expression THEN or ELSE vectorize
* [HIVE-16732] - Transactional tables should block LOAD DATA
* [HIVE-16737] - LLAP: Shuffle handler TCP listen queue overflows
* [HIVE-16738] - Notification ID generation in DBNotification might not be unique across HS2 instances.
* [HIVE-16742] - cap the number of reducers for LLAP at the configured value
* [HIVE-16743] - BitSet set() is incorrectly used in TxnUtils.createValidCompactTxnList()
* [HIVE-16744] - LLAP index update may be broken after ORC switch
* [HIVE-16745] - Syntax error in 041-HIVE-16556.mysql.sql script
* [HIVE-16746] - Reduce number of index lookups for same table in IndexWhereTaskDispatcher
* [HIVE-16751] - Support different types for grouping columns in GroupBy Druid queries
* [HIVE-16755] - LLAP IO: incorrect assert may trigger in tests
* [HIVE-16756] - Vectorization: LongColModuloLongColumn throws "java.lang.ArithmeticException: / by zero"
* [HIVE-16757] - Use of deprecated getRows() instead of new estimateRowCount(RelMetadataQuery..) has serious performance impact
* [HIVE-16761] - LLAP IO: SMB joins fail elevator
* [HIVE-16769] - Possible hive service startup due to the existing file /tmp/stderr
* [HIVE-16776] - Strange cast behavior for table backed by druid
* [HIVE-16777] - LLAP: Use separate tokens and UGI instances when an external client is used
* [HIVE-16778] - LLAP IO: better refcount management
* [HIVE-16780] - Case "multiple sources, single key" in spark_dynamic_pruning.q fails
* [HIVE-16784] - Missing lineage information when hive.blobstore.optimizations.enabled is true
* [HIVE-16788] - ODBC call SQLForeignKeys leads to NPE if you use PK arguments rather than FK arguments
* [HIVE-16793] - Scalar sub-query: sq_count_check not required if gby keys are constant
* [HIVE-16801] - Vectorization: throwExpandError should be an immediate fatal
* [HIVE-16803] - Alter table change column comment should not try to get column stats for update
* [HIVE-16804] - Semijoin hint : Needs support for target table.
* [HIVE-16808] - WebHCat statusdir parameter doesn't properly handle Unicode characters when using relative path
* [HIVE-16820] - TezTask may not shut down correctly before submit
* [HIVE-16821] - Vectorization: support Explain Analyze in vectorized mode
* [HIVE-16824] - PrimaryToReplicaResourceFunctionTest.java has missed the ASF header
* [HIVE-16826] - Improvements for SeparatedValuesOutputFormat
* [HIVE-16828] - With CBO enabled, Query on partitioned views throws IndexOutOfBoundException
* [HIVE-16832] - duplicate ROW__ID possible in multi insert into transactional table
* [HIVE-16835] - Addendum to HIVE-16745
* [HIVE-16844] - Fix Connection leak in ObjectStore when new Conf object is used
* [HIVE-16845] - INSERT OVERWRITE a table with dynamic partitions on S3 fails with NPE
* [HIVE-16846] - TestJdbcWithMiniHS2#testHttpHeaderSize test case is not testing in HTTP mode
* [HIVE-16847] - LLAP queue order issue
* [HIVE-16851] - Scalar subquery with group by missing sq_count_check UDF
* [HIVE-16854] - SparkClientFactory is locked too aggressively
* [HIVE-16864] - add validation to stream position search in LLAP IO
* [HIVE-16869] - Hive returns wrong result when predicates on non-existing columns are pushed down to Parquet reader
* [HIVE-16871] - CachedStore.get_aggr_stats_for has side affect
* [HIVE-16875] - Query against view with partitioned child on HoS fails with privilege exception.
* [HIVE-16876] - HoS: Make Rpc configs immutable at runtime
* [HIVE-16877] - NPE when issue query like alter table ... cascade onto non-partitioned table
* [HIVE-16886] - HMS log notifications may have duplicated event IDs if multiple HMS are running concurrently
* [HIVE-16888] - Upgrade Calcite to 1.13 and Avatica to 1.10
* [HIVE-16898] - Validation of source file after distcp in repl load
* [HIVE-16902] - investigate "failed to remove operation log" errors
* [HIVE-16903] - LLAP: Fix config name issue in SHUFFLE_MANAGE_OS_CACHE
* [HIVE-16908] - Failures in TestHcatClient due to HIVE-16844
* [HIVE-16910] - RpcConfiguration - Improper Cast From Long To Int
* [HIVE-16915] - partition column count is not determined correctly in LLAP IO non-vectorized wrapper
* [HIVE-16918] - Skip ReplCopyTask distcp for _metadata copying. Also enable -pb for distcp
* [HIVE-16920] - remove useless uri.getScheme() from EximUtil
* [HIVE-16922] - Typo in serde.thrift: COLLECTION_DELIM = "colelction.delim"
* [HIVE-16927] - LLAP: Slider takes down all daemons when some daemons fail repeatedly
* [HIVE-16930] - HoS should verify the value of Kerberos principal and keytab file before adding them to spark-submit command parameters
* [HIVE-16935] - Hive should strip comments from input before choosing which CommandProcessor to run.
* [HIVE-16937] - INFORMATION_SCHEMA usability: everything is currently a string
* [HIVE-16938] - INFORMATION_SCHEMA usability: difficult to access # of table records
* [HIVE-16939] - metastore error: 'export: -Dproc_metastore : not a valid identifier'
* [HIVE-16942] - INFORMATION_SCHEMA: schematool for setting it up is not idempotent
* [HIVE-16943] - MoveTask should separate src FileSystem from dest FileSystem
* [HIVE-16947] - Semijoin Reduction : Task cycle created due to multiple semijoins in conjunction with hashjoin
* [HIVE-16948] - Invalid explain when running dynamic partition pruning query in Hive On Spark
* [HIVE-16949] - Leak of threads from Get-Input-Paths and Get-Input-Summary thread pool
* [HIVE-16954] - LLAP IO: better debugging
* [HIVE-16958] - Setting hive.merge.sparkfiles=true will retrun an error when generating parquet databases
* [HIVE-16960] - Hive throws an ugly error exception when HDFS sticky bit is set
* [HIVE-16961] - Hive on Spark leaks spark application in case user cancels query and closes session
* [HIVE-16964] - _orc_acid_version file is missing
* [HIVE-16965] - SMB join may produce incorrect results
* [HIVE-16973] - Fetching of Delegation tokens (Kerberos) for AccumuloStorageHandler fails in HS2
* [HIVE-16975] - Vectorization: Fully vectorize CAST date as TIMESTAMP so VectorUDFAdaptor is now used
* [HIVE-16978] - HoS: add current thread ID to the log redirector for the RemoteDriver
* [HIVE-16982] - WebUI "Show Query" tab prints "UNKNOWN" instead of explaining configuration option
* [HIVE-16985] - LLAP IO: enable SMB join in elevator after the former is fixed
* [HIVE-16991] - HiveMetaStoreClient needs a 2-arg constructor for backwards compatibility
* [HIVE-17002] - decimal (binary) is not working when creating external table for hbase
* [HIVE-17006] - LLAP: Parquet caching v1
* [HIVE-17007] - NPE introduced by HIVE-16871
* [HIVE-17008] - Fix boolean flag switchup in DropTableEvent
* [HIVE-17010] - Fix the overflow problem of Long type in SetSparkReducerParallelism
* [HIVE-17013] - Delete request with a subquery based on select over a view
* [HIVE-17050] - Multiline queries that have comment in middle fail when executed via "beeline -e"
* [HIVE-17052] - Remove logging of predicate filters
* [HIVE-17066] - Query78 filter wrong estimatation is generating bad plan
* [HIVE-17067] - LLAP: Add http endpoint to provide system level configurations
* [HIVE-17069] - Refactor OrcRawRecrodMerger.ReaderPair
* [HIVE-17070] - remove .orig files from src
* [HIVE-17073] - Incorrect result with vectorization and SharedWorkOptimizer
* [HIVE-17076] - typo in itests/src/test/resources/testconfiguration.properties
* [HIVE-17079] - LLAP: Use FQDN by default for work submission
* [HIVE-17083] - DagUtils overwrites any credentials already added
* [HIVE-17085] - ORC file merge/concatenation should do full schema check
* [HIVE-17086] - LLAP: JMX Metric for max file descriptors used so far
* [HIVE-17088] - HS2 WebUI throws a NullPointerException when opened
* [HIVE-17090] - spark.only.query.files are not being run by ptest
* [HIVE-17093] - LLAP ssl configs need to be localized to talk to a wire encrypted hdfs
* [HIVE-17095] - Long chain repl loads do not complete in a timely fashion
* [HIVE-17097] - Fix SemiJoinHint parsing in SemanticAnalyzer
* [HIVE-17098] - Race condition in Hbase tables
* [HIVE-17099] - Update golden files for spark.only.query.files
* [HIVE-17109] - Remove calls to RelMetadataQuery.instance() after Calcite 1.13 upgrade
* [HIVE-17110] - BucketCodec should enforce value ranges
* [HIVE-17111] - Add TestLocalSparkCliDriver
* [HIVE-17113] - Duplicate bucket files can get written to table by runaway task
* [HIVE-17114] - HoS: Possible skew in shuffling when data is not really skewed
* [HIVE-17115] - MetaStoreUtils.getDeserializer doesn't catch the java.lang.ClassNotFoundException
* [HIVE-17116] - Vectorization: Add infrastructure for vectorization of ROW__ID struct
* [HIVE-17117] - Metalisteners are not notified when threadlocal metaconf is cleanup
* [HIVE-17128] - Operation Logging leaks file descriptors as the log4j Appender is never closed
* [HIVE-17144] - export of temporary tables not working and it seems to be using distcp rather than filesystem copy
* [HIVE-17147] - Vectorization: Add code for testing MapJoin operator in isolation and measuring its performance with JMH
* [HIVE-17148] - Incorrect result for Hive join query with COALESCE in WHERE condition
* [HIVE-17149] - Hdfs directory is not cleared if partition creation failed on HMS
* [HIVE-17150] - CREATE INDEX execute HMS out-of-transaction listener calls inside a transaction
* [HIVE-17152] - Improve security of random generator for HS2 cookies
* [HIVE-17155] - findConfFile() in HiveConf.java has some issues with the conf path
* [HIVE-17169] - Avoid extra call to KeyProvider::getMetadata()
* [HIVE-17172] - add ordering checks to DiskRangeList
* [HIVE-17176] - Add ASF header for LlapAllocatorBuffer.java
* [HIVE-17177] - move TestSuite.java to the right position
* [HIVE-17181] - HCatOutputFormat should expose complete output-schema (including partition-keys) for dynamic-partitioning MR jobs
* [HIVE-17184] - Unexpected new line in beeline output when running with -f option
* [HIVE-17188] - ObjectStore runs out of memory for large batches of addPartitions().
* [HIVE-17189] - Fix backwards incompatibility in HiveMetaStoreClient
* [HIVE-17208] - Repl dump should pass in db/table information to authorization API
* [HIVE-17209] - ObjectCacheFactory should return null when tez shared object registry is not setup
* [HIVE-17213] - HoS: file merging doesn't work for union all
* [HIVE-17217] - SMB Join : Assert if paths are different in TezGroupedSplit in KeyValueInputMerger
* [HIVE-17218] - Canonical-ize hostnames for Hive metastore, and HS2 servers.
* [HIVE-17220] - Bloomfilter probing in semijoin reduction is thrashing L1 dcache
* [HIVE-17222] - Llap: Iotrace throws java.lang.UnsupportedOperationException with IncompleteCb
* [HIVE-17228] - Bump tez version to 0.9.0
* [HIVE-17233] - Set "mapred.input.dir.recursive" for HCatInputFormat-based jobs.
* [HIVE-17235] - Add ORC Decimal64 Serialization/Deserialization (Part 1)
* [HIVE-17240] - Function ACOS(2) and ASIN(2) should be null
* [HIVE-17254] - Skip updating AccessTime of recycled files in ReplChangeManager
* [HIVE-17257] - Hive should merge empty files
* [HIVE-17258] - Incorrect log messages in the Hive.java
* [HIVE-17259] - Hive JDBC does not recognize UNIONTYPE columns
* [HIVE-17260] - Typo: exception has been created and lost in the ThriftJDBCBinarySerDe
* [HIVE-17265] - Cache merged column stats from retrieved partitions
* [HIVE-17267] - Make HMS Notification Listeners typesafe
* [HIVE-17268] - WebUI / QueryPlan: query plan is sometimes null when explain output conf is on
* [HIVE-17270] - Qtest results show wrong number of executors
* [HIVE-17272] - when hive.vectorized.execution.enabled is true, query on empty partitioned table fails with NPE
* [HIVE-17274] - RowContainer spills for timestamp column throws exception
* [HIVE-17275] - Auto-merge fails on writes of UNION ALL output to ORC file with dynamic partitioning
* [HIVE-17276] - Check max shuffle size when converting to dynamically partitioned hash join
* [HIVE-17277] - HiveMetastoreClient Log name is wrong
* [HIVE-17280] - Data loss in CONCATENATE ORC created by Spark
* [HIVE-17281] - LLAP external client not properly handling KILLED notification that occurs when a fragment is rejected
* [HIVE-17283] - Enable parallel edges of semijoin along with mapjoins
* [HIVE-17285] - Fixes for bit vector retrievals and merging
* [HIVE-17286] - Avoid expensive String serialization/deserialization for bitvectors
* [HIVE-17290] - Should use equals() rather than == to compare strings
* [HIVE-17298] - export when running distcp for large number of files should not run as privileged user
* [HIVE-17301] - Make JSONMessageFactory.getTObj method thread safe
* [HIVE-17302] - ReduceRecordSource should not add batch string to Exception message
* [HIVE-17303] - Missmatch between roaring bitmap library used by druid and the one coming from tez
* [HIVE-17305] - New insert overwrite dynamic partitions qtest need to have the golden file regenerated
* [HIVE-17309] - alter partition onto a table not in current database throw InvalidOperationException
* [HIVE-17311] - Numeric overflow in the HiveConf
* [HIVE-17313] - Potentially possible 'case fall through' in the ObjectInspectorConverters
* [HIVE-17314] - LazySimpleSerializeWrite.writeString() contains if with an empty body
* [HIVE-17321] - HoS: analyze ORC table doesn't compute raw data size when noscan/partialscan is not specified
* [HIVE-17322] - Serialise BeeLine qtest execution to prevent flakyness
* [HIVE-17327] - LLAP IO: restrict native file ID usage to default FS to avoid hypothetical collisions when HDFS federation is used
* [HIVE-17331] - Path must be used as key type of the pathToAliases
* [HIVE-17333] - Schema changes in HIVE-12274 for Oracle may not work for upgrade
* [HIVE-17336] - Missing class 'org.apache.hadoop.hive.hbase.HiveHBaseTableInputFormat' from Hive on Spark when inserting into hbase based table
* [HIVE-17338] - Utilities.get*Tasks multiple methods duplicate code
* [HIVE-17344] - LocalCache element memory usage is not calculated properly.
* [HIVE-17348] - Remove unnecessary GenSparkUtils.java.orig file
* [HIVE-17351] - use new slider package installation command in run.sh
* [HIVE-17352] - HiveSever2 error with "Illegal Operation state transition from CLOSED to FINISHED"
* [HIVE-17354] - Fix "alter view" for incremental replication
* [HIVE-17356] - Missing ASF headers 3 classes
* [HIVE-17357] - Plugin jars are not properly added for LocalHiveSparkClient
* [HIVE-17360] - Tez session reopen appears to use a wrong conf object
* [HIVE-17364] - Add unit test to "alter view" replication
* [HIVE-17365] - Druid CTAS should support CHAR/VARCHAR type
* [HIVE-17367] - IMPORT table doesn't load from data dump if a metadata-only dump was already imported.
* [HIVE-17368] - DBTokenStore fails to connect in Kerberos enabled remote HMS environment
* [HIVE-17372] - update druid dependency to druid 0.10.1
* [HIVE-17377] - SharedWorkOptimizer might not iterate through TS operators deterministically
* [HIVE-17378] - CBO: HiveReduceExpressionsWithStatsRule can operate on IS_NULL and IS_NOT_NULL
* [HIVE-17385] - Fix incremental repl error for non-native tables
* [HIVE-17389] - Yetus is always failing on rat checks
* [HIVE-17391] - Compaction fails if there is an empty value in tblproperties
* [HIVE-17392] - SharedWorkOptimizer might merge TS operators filtered by not equivalent semijoin operators
* [HIVE-17393] - AMReporter need hearbeat every external 'AM'
* [HIVE-17394] - AvroSerde is regenerating TypeInfo objects for each nullable Avro field for every row
* [HIVE-17401] - Hive session idle timeout doesn't function properly
* [HIVE-17403] - Fail concatenation for unmanaged and transactional tables
* [HIVE-17410] - repl load task during subsequent DAG generation does not start from the last partition processed
* [HIVE-17411] - LLAP IO may incorrectly release a refcount in some rare cases
* [HIVE-17412] - Add "-- SORT_QUERY_RESULTS" for spark_vectorized_dynamic_partition_pruning.q
* [HIVE-17413] - predicate involving CAST affects value returned by the SELECT statement
* [HIVE-17415] - Hit error "SemanticException View xxx is corresponding to LIMIT, rather than a SelectOperator." in Hive queries
* [HIVE-17417] - LazySimple Timestamp is very expensive
* [HIVE-17419] - ANALYZE TABLE...COMPUTE STATISTICS FOR COLUMNS command shows computed stats for masked tables
* [HIVE-17420] - bootstrap - get replid before object dump
* [HIVE-17421] - Clear incorrect stats after replication
* [HIVE-17429] - Hive JDBC doesn't return rows when querying Impala
* [HIVE-17450] - rename TestTxnCommandsBase
* [HIVE-17452] - HPL/SQL function variable block is not initialized
* [HIVE-17453] - Missing ASF headers 2 classes
* [HIVE-17457] - IOW Acid Insert Overwrite when the transaction fails
* [HIVE-17459] - View deletion operation failed to replicate on target cluster
* [HIVE-17460] - `insert overwrite` should support table schema evolution (e.g. add columns)
* [HIVE-17463] - ORC: include orc-shims in hive-exec.jar
* [HIVE-17464] - Fix to be able to disable max shuffle size DHJ config
* [HIVE-17465] - Statistics: Drill-down filters don't reduce row-counts progressively
* [HIVE-17468] - Shade and package appropriate jackson version for druid storage handler
* [HIVE-17471] - Vectorization: Enable hive.vectorized.row.identifier.enabled to true by default
* [HIVE-17472] - Drop-partition for multi-level partition fails, if data does not exist.
* [HIVE-17475] - Disable mapjoin using hint
* [HIVE-17479] - Staging directories do not get cleaned up for update/delete queries
* [HIVE-17483] - HS2 kill command to kill queries using query id
* [HIVE-17485] - Hive-Druid table on indexing for few segments- DruidRecordWriter.pushSegments throws ArrayIndexOutOfBoundsException
* [HIVE-17489] - Separate client-facing and server-side Kerberos principals, to support HA
* [HIVE-17496] - Bootstrap repl is not cleaning up staging dirs
* [HIVE-17504] - Skip ACID table for replication
* [HIVE-17510] - Make comparison of filter predicates in q files deterministic
* [HIVE-17512] - Not use doAs if distcp privileged user same as user running hive
* [HIVE-17522] - cleanup old 'repl dump' dirs
* [HIVE-17523] - Insert into druid table hangs Hive server2 in an infinite loop
* [HIVE-17529] - Bucket Map Join : Sets incorrect edge type causing execution failure
* [HIVE-17530] - ClassCastException when converting uniontype
* [HIVE-17535] - Select 1 EXCEPT Select 1 fails with NPE
* [HIVE-17553] - CBO wrongly type cast decimal literal to int
* [HIVE-17554] - Occurr java.lang.ArithmeticException: / by zero at hplsql component
* [HIVE-17556] - The test udf_mask_hash.q is failing
* [HIVE-17558] - Skip non-native/temporary tables for constraint related scenarios
* [HIVE-17560] - HiveMetastore doesn't start in secure cluster if repl change manager is enabled
* [HIVE-17563] - CodahaleMetrics.JsonFileReporter is not updating hive.service.metrics.file.location
* [HIVE-17568] - HiveJoinPushTransitivePredicatesRule may exchange predicates which are not valid on the other branch
* [HIVE-17571] - update sql standard authorization config whitelist to include distcp options for replication
* [HIVE-17576] - Improve progress-reporting in TezProcessor
* [HIVE-17582] - Followup of HIVE-15708
* [HIVE-17584] - fix mapred.job.queue.name in sql standard authorization config whitelist
* [HIVE-17585] - Improve thread safety when loading dynamic partitions in parallel
* [HIVE-17588] - LlapRowRecordReader doing name-based field lookup for every column of every row
* [HIVE-17594] - Unit format error in Copy.java
* [HIVE-17595] - Correct DAG for updating the last.repl.id for a database during bootstrap load
* [HIVE-17601] - improve error handling in LlapServiceDriver
* [HIVE-17602] - Explain plan not working
* [HIVE-17610] - LLAP IO: an exception in exception handling can hide the original exception
* [HIVE-17613] - remove object pools for short, same-thread allocations
* [HIVE-17615] - Task.executeTask has to be thread safe for parallel execution
* [HIVE-17619] - Exclude avatica-core.jar dependency from avatica shaded jar
* [HIVE-17620] - Use the default MR scratch directory (HDFS) in the only case when hive.blobstore.optimizations.enabled=true AND isFinalJob=true
* [HIVE-17621] - Hive-site settings are ignored during HCatInputFormat split-calculation
* [HIVE-17623] - Fix Select query Fix Double column serde and some refactoring
* [HIVE-17624] - MapredLocakTask running in separate JVM could throw ClassNotFoundException
* [HIVE-17625] - Replication: update hive.repl.partitions.dump.parallelism to 100
* [HIVE-17627] - Use druid scan query instead of the select query.
* [HIVE-17628] - always use fully qualified path for tables/partitions/etc.
* [HIVE-17633] - Make it possible to override the query results directory in TestBeeLineDriver
* [HIVE-17635] - Add unit tests to CompactionTxnHandler and use PreparedStatements for queries
* [HIVE-17639] - don't reuse planner context when re-parsing the query
* [HIVE-17643] - recent WM changes broke reopen due to spurious overloads
* [HIVE-17644] - directSQL errors out on key constraints until the DB is initialized
* [HIVE-17649] - Export/Import: Move export data write to a task
* [HIVE-17653] - Druid storage handler CTAS with boolean type columns fails.
* [HIVE-17659] - get_token thrift call fails for DBTokenStore in remote HMS mode
* [HIVE-17664] - Refactor and add new tests
* [HIVE-17665] - Update netty-all to latest 4.0.x.Final
* [HIVE-17679] - http-generic-click-jacking for WebHcat server
* [HIVE-17682] - Vectorization: IF stmt produces wrong results
* [HIVE-17690] - Add distcp.options.p* in sql standard authorization config whitelist
* [HIVE-17701] - Added restriction to historic queries on web UI
* [HIVE-17702] - incorrect isRepeating handling in decimal reader in ORC
* [HIVE-17706] - Add a possibility to run the BeeLine tests on the default database
* [HIVE-17715] - Exception when pushing postaggregates into Druid
* [HIVE-17720] - Bitvectors are not shown in describe statement on beeline
* [HIVE-17721] - with Postgres rdbms for metastore and dbnotification enabled, hive DDL SQL query fails
* [HIVE-17723] - Update Accumulo drive q.out files
* [HIVE-17725] - Fix misnamed tests which are not run during precommit runs.
* [HIVE-17726] - Using exists may lead to incorrect results
* [HIVE-17731] - add a backward compat option for external users to HIVE-11985
* [HIVE-17735] - ObjectStore.addNotificationEvent is leaking queries
* [HIVE-17746] - Regenerate spark_explainuser_1.q.out
* [HIVE-17749] - Multiple class have missed the ASF header
* [HIVE-17758] - NOTIFICATION_SEQUENCE_LOCK_RETRY_SLEEP_INTERVAL.defaultLongVal is -1
* [HIVE-17761] - Deprecate hive.druid.select.distribute property for Druid
* [HIVE-17762] - Exclude older jackson-annotation.jar from druid-handler shaded jar
* [HIVE-17764] - alter view fails when hive.metastore.disallow.incompatible.col.type.changes set to true
* [HIVE-17765] - expose Hive keywords
* [HIVE-17777] - Add maven coordinates in itests/pom.xml
* [HIVE-17781] - Map MR settings to Tez settings via DeprecatedKeys
* [HIVE-17782] - Inconsistent cast behavior from string to numeric types with regards to leading/trailing spaces
* [HIVE-17785] - Encription tests are not running
* [HIVE-17792] - Enable Bucket Map Join when there are extra keys other than bucketed columns
* [HIVE-17795] - Add distribution management tag in pom
* [HIVE-17798] - When replacing the src table names in BeeLine testing, the table names shouldn't be changed to lower case
* [HIVE-17803] - With Pig multi-query, 2 HCatStorers writing to the same table will trample each other's outputs
* [HIVE-17804] - Vectorization: Bug erroneously causes match for 1st row in batch (SelectStringColLikeStringScalar)
* [HIVE-17806] - Create directory for metrics file if it doesn't exist
* [HIVE-17807] - Execute maven commands in batch mode for ptests
* [HIVE-17813] - hive.exec.move.files.from.source.dir does not work with partitioned tables
* [HIVE-17815] - prevent OOM with Atlas Hive hook
* [HIVE-17817] - Stabilize crossproduct warning message output order
* [HIVE-17822] - Provide an option to skip shading of jars
* [HIVE-17825] - Socket not closed when trying to read files to copy over in replication from metadata
* [HIVE-17826] - Error writing to RandomAccessFile after operation log is closed
* [HIVE-17828] - Metastore: mysql upgrade scripts to 3.0.0 is broken
* [HIVE-17829] - ArrayIndexOutOfBoundsException - HBASE-backed tables with Avro schema in Hive2
* [HIVE-17830] - dbnotification fails to work with rdbms other than postgres
* [HIVE-17831] - HiveSemanticAnalyzerHookContext does not update the HiveOperation after sem.analyze() is called
* [HIVE-17832] - Allow hive.metastore.disallow.incompatible.col.type.changes to be changed in metastore
* [HIVE-17833] - Publish split generation counters
* [HIVE-17834] - Fix flaky triggers test
* [HIVE-17836] - Persisting nulls in bit vector field fails for postgres backed metastore
* [HIVE-17839] - Cannot generate thrift definitions in standalone-metastore
* [HIVE-17843] - UINT32 Parquet columns are handled as signed INT32-s, silently reading incorrect data
* [HIVE-17845] - insert fails if target table columns are not lowercase
* [HIVE-17853] - RetryingMetaStoreClient loses UGI impersonation-context when reconnecting after timeout
* [HIVE-17864] - PTestClient cannot start during Precommit tests
* [HIVE-17867] - Exception in windowing functions with TIMESTAMP WITH LOCAL TIME ZONE type
* [HIVE-17868] - Make queries in spark_local_queries.q have deterministic output
* [HIVE-17872] - Ignoring schema autostart doesn't work (HIVE-14152 used the wrong setting)
* [HIVE-17873] - External LLAP client: allow same handleID to be used more than once
* [HIVE-17882] - Resource plan retrieval looks incorrect
* [HIVE-17891] - HIVE-13076 uses create table if not exists for the postgres script
* [HIVE-17900] - analyze stats on columns triggered by Compactor generates malformed SQL with > 1 partition column
* [HIVE-17908] - LLAP External client not correctly handling killTask for pending requests
* [HIVE-17918] - NPE during semijoin reduction optimization when LLAP caching disabled
* [HIVE-17936] - Dynamic Semijoin Reduction : markSemiJoinForDPP marks unwanted semijoin branches
* [HIVE-17937] - llap_acid_fast test is flaky
* [HIVE-17939] - Bucket map join not being selected when bucketed tables is missing bucket files
* [HIVE-17942] - HiveAlterHandler not using conf from HMS Handler
* [HIVE-17952] - Fix license headers to avoid dangling javadoc warnings
* [HIVE-17953] - Metrics should move to destination atomically
* [HIVE-17963] - Fix for HIVE-17113 can be improved for non-blobstore filesystems
* [HIVE-17966] - org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveArrayInspector - Review
* [HIVE-17973] - Fix small bug in multi_insert_union_src.q
* [HIVE-17976] - HoS: don't set output collector if there's no data to process
* [HIVE-17978] - Shared work optimizer may leave useless operator branches in the plan
* [HIVE-17994] - Vectorization: Serialization bottlenecked on irrelevant hashmap lookup
* [HIVE-18001] - InvalidObjectException while creating Primary Key constraint on partition key column
* [HIVE-18006] - Optimize memory footprint of HLLDenseRegister
* [HIVE-18007] - Address maven warnings
* [HIVE-18012] - fix ct_noperm_loc test
* [HIVE-18016] - org.apache.hadoop.hive.ql.util.ResourceDownloader - Review
* [HIVE-18017] - HS2 materialized view registry init loading all tables from metastore
* [HIVE-18026] - Hive webhcat principal configuration optimization
* [HIVE-18046] - Metastore: default IS_REWRITE_ENABLED=false instead of NULL
* [HIVE-18050] - LlapServiceDriver shoud split HIVE_AUX_JARS_PATH by ':' instead of ','
* [HIVE-18054] - Make Lineage work with concurrent queries on a Session
* [HIVE-18060] - UpdateInputAccessTimeHook fails for non-current database
* [HIVE-18067] - Remove extraneous golden files
* [HIVE-18068] - Upgrade to Calcite 1.15
* [HIVE-18069] - MetaStoreDirectSql to get tables has misplaced comma
* [HIVE-18077] - Vectorization: Add string conversion case for UDFToDouble
* [HIVE-18090] - acid heartbeat fails when metastore is connected via hadoop credential
* [HIVE-18109] - fix identifier usage in parser
* [HIVE-18111] - Fix temp path for Spark DPP sink
* [HIVE-18124] - clean up isAcidTable() API vs isInsertOnlyTable()
* [HIVE-18127] - Do not strip '--' comments from shell commands issued from CliDriver
* [HIVE-18136] - WorkloadManagerMxBean is missing the Apache license header
* [HIVE-18146] - Vectorization: VectorMapJoinOperator Decimal64ColumnVector key/value cast bug
* [HIVE-18147] - Tests can fail with java.net.BindException: Address already in use
* [HIVE-18148] - NPE in SparkDynamicPartitionPruningResolver
* [HIVE-18150] - Upgrade Spark Version to 2.2.0
* [HIVE-18151] - LLAP external client: Better error message propagation during submission failures
* [HIVE-18157] - Vectorization : Insert in bucketed table is broken with vectorization
* [HIVE-18160] - Jar localization during session initialization is slow
* [HIVE-18166] - Result of hive.query.string is encoded.
* [HIVE-18188] - Fix TestSSL failures in master
* [HIVE-18189] - Order by position does not work when cbo is disabled
* [HIVE-18191] - Vectorization: Add validation of TableScanOperator (gather statistics) back
* [HIVE-18194] - Migrate existing ACID tables to use write id per table rather than global transaction id
* [HIVE-18195] - Hive schema broken on postgres
* [HIVE-18196] - Druid Mini Cluster to run Qtests integrations tests.
* [HIVE-18198] - TablePropertyEnrichmentOptimizer.java is missing the Apache license header
* [HIVE-18207] - Fix the test failure for TestCliDriver#vector_complex_join
* [HIVE-18208] - SMB Join : Fix the unit tests to run SMB Joins.
* [HIVE-18210] - create resource plan allows duplicates
* [HIVE-18213] - Tests: YARN Minicluster times out if the disks are >90% full
* [HIVE-18220] - Workload Management tables have broken constraints defined on postgres schema
* [HIVE-18228] - Azure credential properties should be added to the HiveConf hidden list
* [HIVE-18232] - Packaging: add dfs-init script in package target
* [HIVE-18240] - support getClientInfo/setClientInfo in JDBC
* [HIVE-18241] - Query with LEFT SEMI JOIN producing wrong result
* [HIVE-18248] - Clean up parameters
* [HIVE-18250] - CBO gets turned off with duplicates in RR error
* [HIVE-18254] - Use proper AVG Calcite primitive instead of Other_FUNCTION
* [HIVE-18255] - spark-client jar should be prefixed with hive-
* [HIVE-18258] - Vectorization: Reduce-Side GROUP BY MERGEPARTIAL with duplicate columns is broken
* [HIVE-18263] - Ptest execution are multiple times slower sometimes due to dying executor slaves
* [HIVE-18266] - LLAP: /system references wrong file for THP
* [HIVE-18269] - LLAP: Fast llap io with slow processing pipeline can lead to OOM
* [HIVE-18271] - Druid Insert into fails with exception when committing files
* [HIVE-18290] - hbase backed table creation fails where no column comments present
* [HIVE-18293] - Hive is failing to compact tables contained within a folder that is not owned by identity running HiveMetaStore
* [HIVE-18298] - Fix TestReplicationScenarios.testConstraints
* [HIVE-18299] - DbNotificationListener fail on mysql with "select for update"
* [HIVE-18306] - Fix spark smb tests
* [HIVE-18309] - qtests: smb_mapjoin_19.q breaks bucketsortoptimize_insert_2.q
* [HIVE-18310] - Test 'vector_reduce_groupby_duplicate_cols.q' is misspelled in testconfiguration.properties
* [HIVE-18311] - Enable smb_mapjoin_8.q for cli driver
* [HIVE-18314] - qtests: semijoin_hint.q breaks hybridgrace_hashjoin_2.q
* [HIVE-18316] - HiveEndPoint should only work with full acid tables
* [HIVE-18318] - LLAP record reader should check interrupt even when not blocking
* [HIVE-18321] - Support REBUILD for MVs backed by custom storage handlers
* [HIVE-18326] - LLAP Tez scheduler - only preempt tasks if there's a dependency between them
* [HIVE-18330] - Fix TestMsgBusConnection - doesn't test tests the original intention
* [HIVE-18331] - Renew the Kerberos ticket used by Druid Query runner
* [HIVE-18335] - Vectorization : Check bounds of array before the allocation in VectorMapJoinFastBytesHashTable
* [HIVE-18341] - Add repl load support for adding "raw" namespace for TDE with same encryption keys
* [HIVE-18352] - introduce a METADATAONLY option while doing REPL DUMP to allow integrations of other tools
* [HIVE-18353] - CompactorMR should call jobclient.close() to trigger cleanup
* [HIVE-18355] - Add builder for metastore Thrift classes missed in the first pass - FunctionBuilder
* [HIVE-18356] - Fixing license headers in checkstyle
* [HIVE-18359] - Extend grouping set limits from int to long
* [HIVE-18360] - NPE in TezSessionState
* [HIVE-18365] - netty-all jar is not present in the llap tarball
* [HIVE-18367] - Describe Extended output is truncated on a table with an explicit row format containing tabs or newlines.
* [HIVE-18370] - standalone-metastore gen dir contains two annotation/package-info.java which causes IDEA build fail
* [HIVE-18379] - ALTER TABLE authorization_part SET PROPERTIES ("PARTITIONL_LEVEL_PRIVILEGE"="TRUE"); fails when authorization_part is MicroManaged table.
* [HIVE-18380] - ALTER TABLE CONCATENATE is not supported on Micro-managed table
* [HIVE-18383] - Qtests: running all cases from TestNegativeCliDriver results in OOMs
* [HIVE-18384] - ConcurrentModificationException in log4j2.x library
* [HIVE-18385] - mergejoin fails with java.lang.IllegalStateException
* [HIVE-18390] - IndexOutOfBoundsException when query a partitioned view in ColumnPruner
* [HIVE-18393] - Error returned when some other type is read as string from parquet tables
* [HIVE-18413] - Grouping of an empty result set may only contain null values
* [HIVE-18414] - upgrade to tez-0.9.1
* [HIVE-18416] - Initial support for TABLE function
* [HIVE-18417] - better error handling in TezSessionState cleanup
* [HIVE-18419] - CliDriver loads different hive-site.xml into HiveConf and MetastoreConf
* [HIVE-18420] - LLAP IO: InputStream may return 0 bytes
* [HIVE-18421] - Vectorized execution handles overflows in a different manner than non-vectorized execution
* [HIVE-18422] - Vectorized input format should not be used when vectorized input format is excluded and row.serde is enabled
* [HIVE-18426] - Memory leak in RoutingAppender for every hive operation
* [HIVE-18429] - Compaction should handle a case when it produces no output
* [HIVE-18430] - Add new determinism category for runtime constants (current_date, current_timestamp)
* [HIVE-18442] - HoS: No FileSystem for scheme: nullscan
* [HIVE-18445] - qtests: auto_join25.q fails permanently
* [HIVE-18447] - JDBC: Provide a way for JDBC users to pass cookie info via connection string
* [HIVE-18450] - Support TABLE function in CBO
* [HIVE-18452] - work around HADOOP-15171
* [HIVE-18456] - Add some tests for HIVE-18367 to check that the table information contains the query correctly
* [HIVE-18459] - hive-exec.jar leaks contents fb303.jar into classpath
* [HIVE-18465] - Hive metastore schema initialization failing on postgres
* [HIVE-18467] - support whole warehouse dump / load + create/drop database events
* [HIVE-18472] - Beeline gives log4j warnings
* [HIVE-18473] - Infer timezone information correctly in DruidSerde
* [HIVE-18482] - Copy-paste error in the RelOptHiveTable
* [HIVE-18488] - LLAP ORC readers are missing some null checks
* [HIVE-18490] - Query with EXISTS and NOT EXISTS with non-equi predicate can produce wrong result
* [HIVE-18492] - Wrong argument in the WorkloadManager.resetAndQueryKill()
* [HIVE-18494] - Regression: from HIVE-18069, the metastore directsql is getting disabled
* [HIVE-18499] - Amend point lookup tests to check for data
* [HIVE-18500] - annoying exceptions from LLAP Jmx view in the logs
* [HIVE-18501] - Typo in beeline code
* [HIVE-18504] - Hive is throwing InvalidObjectException(message:Invalid column type name is too long.
* [HIVE-18506] - LlapBaseInputFormat - negative array index
* [HIVE-18507] - AccumuloIndexedOutputFormat.AccumuloRecordWriter.close() - typo in the condition
* [HIVE-18513] - Query results caching
* [HIVE-18514] - add service output for ranger to WM DDL operations
* [HIVE-18517] - Vectorization: Fix VectorMapOperator to accept VRBs and check vectorized flag correctly to support LLAP Caching
* [HIVE-18518] - Upgrade druid version to 0.11.0
* [HIVE-18519] - do not create materialized CTEs with ACID/MM
* [HIVE-18521] - Vectorization: query failing in reducer VectorUDAFAvgDecimalPartial2 java.lang.ClassCastException StructTypeInfo --> DecimalTypeInfo
* [HIVE-18523] - Fix summary row in case there are no inputs
* [HIVE-18524] - Vectorization: Execution failure related to non-standard embedding of IfExprConditionalFilter inside VectorUDFAdaptor (Revert HIVE-17139)
* [HIVE-18529] - Vectorization: Add a debug config option to disable scratch column reuse
* [HIVE-18530] - Replication should skip MM table (for now)
* [HIVE-18531] - Vectorization: Vectorized PTF operator should not set the initial type infos
* [HIVE-18546] - Remove unnecessary code introduced in HIVE-14498
* [HIVE-18547] - WM: trigger test may fail
* [HIVE-18548] - Fix log4j import
* [HIVE-18551] - Vectorization: VectorMapOperator tries to write too many vector columns for Hybrid Grace
* [HIVE-18554] - Fix false positive test ql.io.parquet.TestHiveSchemaConverter.testMap
* [HIVE-18557] - q.outs: fix issues caused by q.out_spark files
* [HIVE-18558] - Upgrade orc version to 1.4.2
* [HIVE-18562] - Vectorization: CHAR/VARCHAR conversion in VectorDeserializeRow is broken
* [HIVE-18567] - ObjectStore.getPartitionNamesNoTxn doesn't handle max param properly
* [HIVE-18569] - Hive Druid indexing not dealing with decimals in correct way.
* [HIVE-18571] - stats issues for MM tables; ACID doesn't check state for CTAS
* [HIVE-18573] - Use proper Calcite operator instead of UDFs
* [HIVE-18574] - LLAP: Ship netty3 as part of LLAP install tarball
* [HIVE-18575] - ACID properties usage in jobconf is ambiguous for MM tables
* [HIVE-18577] - SemanticAnalyzer.validate has some pointless metastore calls
* [HIVE-18578] - Some class has missed the ASF header
* [HIVE-18579] - Changes from HIVE-18495 introduced import paths from shaded jars
* [HIVE-18585] - Return type for udfs should be determined using Hive inference rules instead of Calcite
* [HIVE-18587] - insert DML event may attempt to calculate a checksum on directories
* [HIVE-18589] - java.io.IOException: Not enough history available
* [HIVE-18590] - Assertion error on transitive join inference in the presence of NOT NULL constraint
* [HIVE-18595] - UNIX_TIMESTAMP UDF fails when type is Timestamp with local timezone
* [HIVE-18597] - LLAP: Always package the log4j2 API jar for org.apache.log4j
* [HIVE-18599] - Transactions: Fix CTAS on Micromanaged tables
* [HIVE-18600] - Vectorization: Top-Level Vector Expression Scratch Column Deallocation
* [HIVE-18601] - Support Power platform by updating protoc-jar-maven-plugin version
* [HIVE-18606] - CTAS on empty table throws NPE from org.apache.hadoop.hive.ql.exec.MoveTask
* [HIVE-18607] - HBase HFile write does strange things
* [HIVE-18610] - Performance: ListKeyWrapper does not check for hashcode equals, before comparing members
* [HIVE-18611] - Avoid memory allocation of aggregation buffer during stats computation
* [HIVE-18612] - Build subprocesses under Yetus in Ptest use 1.7 jre instead of 1.8
* [HIVE-18613] - Extend JsonSerDe to support BINARY type
* [HIVE-18614] - Fix sys db creation in Hive
* [HIVE-18616] - work around HADOOP-15171 p2
* [HIVE-18617] - Workload management Action parser does not generate the correct pool path.
* [HIVE-18622] - Vectorization: IF Statements, Comparisons, and more do not handle NULLs correctly
* [HIVE-18626] - Repl load "with" clause does not pass config to tasks
* [HIVE-18627] - PPD: Handle FLOAT boxing differently for single/double precision constants
* [HIVE-18628] - Make tez dag status check interval configurable
* [HIVE-18631] - Hive metastore schema initialization failing on mysql
* [HIVE-18637] - WorkloadManagent Event Summary leaving subscribedCounters and currentCounters fields empty
* [HIVE-18638] - Triggers for multi-pool move, failing to initiate the move event
* [HIVE-18641] - Remove MCreationMetadata from MTable class
* [HIVE-18642] - incorrect assertion in TezSessionPool for WM
* [HIVE-18643] - don't check for archived partitions for ACID ops
* [HIVE-18645] - invalid url address in README.txt from module hbase-handler
* [HIVE-18646] - Update errata.txt for HIVE-18617
* [HIVE-18647] - Cannot create table: "message:Exception thrown when executing query : SELECT DISTINCT.."
* [HIVE-18653] - Fix TestOperators test failure in master
* [HIVE-18658] - WM: allow not specifying scheduling policy when creating a pool
* [HIVE-18659] - add acid version marker to acid files/directories
* [HIVE-18660] - PCR doesn't distinguish between partition and virtual columns
* [HIVE-18662] - hive.acid.key.index is missing entries
* [HIVE-18665] - LLAP: Ignore cache-affinity if the LLAP IO elevator is disabled
* [HIVE-18666] - Materialized view: "create materialized enable rewrite" should fail if rewriting is not possible
* [HIVE-18667] - Materialized views: rewrites should be triggered without checks if the time.window=-1
* [HIVE-18671] - lock not released after Hive on Spark query was cancelled
* [HIVE-18674] - update Hive to use ORC 1.4.3
* [HIVE-18675] - make HIVE_LOCKS.HL_TXNID NOT NULL
* [HIVE-18678] - fix exim for MM tables and reinstante the test
* [HIVE-18680] - FieldTrimmer missing opportunity with SortLimit operators
* [HIVE-18686] - Installation on Postgres and Oracle broken
* [HIVE-18688] - Vectorization: Vectorizer Reason shouldn't be part of work-plan
* [HIVE-18693] - Snapshot Isolation does not work for Micromanaged table when a insert transaction is aborted
* [HIVE-18695] - fix TestAccumuloCliDriver.testCliDriver[accumulo_queries]
* [HIVE-18697] - The HiveMetastore.exchange_partitions method throws FileNotFoundException if the given partition doesn't exist in the source table
* [HIVE-18698] - Fix TestMiniLlapLocalCliDriver#testCliDriver[bucket_map_join_tez1]
* [HIVE-18699] - Check for duplicate partitions in HiveMetastore.exchange_partitions
* [HIVE-18708] - Vectorization: Delay out-of-tree fixups till whole work is vectorized
* [HIVE-18713] - Optimize: Transform IN clauses to = when there's only one element
* [HIVE-18717] - Avoid transitive dependency on jetty 6.x
* [HIVE-18733] - Missing break in CommonFastHashTable
* [HIVE-18737] - add an option to disable LLAP IO ACID for non-original files
* [HIVE-18738] - LLAP IO ACID - includes handling is broken
* [HIVE-18742] - Vectorization acid/inputformat check should allow NullRowsInputFormat/OneNullRowInputFormat
* [HIVE-18757] - LLAP IO for text fails for empty files
* [HIVE-18759] - Remove unconnected q.out-s
* [HIVE-18764] - ELAPSED_TIME resource plan setting is not getting honored
* [HIVE-18775] - HIVE-17983 missed deleting metastore/scripts/upgrade/derby/hive-schema-3.0.0.derby.sql
* [HIVE-18776] - MaterializationsInvalidationCache loading causes race condition in the metastore
* [HIVE-18777] - Add Authorization interface to support information_schema integration with external authorization
* [HIVE-18783] - ALTER TABLE post-commit listener does not include the transactional listener responses
* [HIVE-18788] - Clean up inputs in JDBC PreparedStatement
* [HIVE-18789] - Disallow embedded element in UDFXPathUtil
* [HIVE-18791] - Fix TestJdbcWithMiniHS2#testHttpHeaderSize
* [HIVE-18794] - Repl load "with" clause does not pass config to tasks for non-partition tables
* [HIVE-18796] - fix TestSSL
* [HIVE-18813] - Fix qtest mapjoin_hook.q
* [HIVE-18815] - Remove unused feature in HPL/SQL
* [HIVE-18816] - CREATE TABLE (ACID) doesn't work with TIMESTAMPLOCALTZ column type
* [HIVE-18817] - ArrayIndexOutOfBounds exception during read of ACID table.
* [HIVE-18818] - Alter table add constraint unique fails with direct sql set to false
* [HIVE-18820] - Operation doesn't always clean up log4j for operation log
* [HIVE-18826] - fix TestEncryptedHDFSCliDriver.testCliDriver[encryption_move_tbl]
* [HIVE-18828] - improve error handling for codecs in LLAP IO
* [HIVE-18833] - Auto Merge fails when "insert into directory as orcfile"
* [HIVE-18837] - add a flag and disable some object pools in LLAP until further testing
* [HIVE-18858] - System properties in job configuration not resolved when submitting MR job
* [HIVE-18859] - Incorrect handling of thrift metastore exceptions
* [HIVE-18863] - trunc() calls itself trunk() in an error message
* [HIVE-18877] - HiveSchemaTool.validateSchemaTables() should wrap a SQLException when rethrowing
* [HIVE-18879] - Disallow embedded element in UDFXPathUtil needs to work if xercesImpl.jar in classpath
* [HIVE-18886] - ACID: NPE on unexplained mysql exceptions
* [HIVE-18888] - Replace synchronizedMap with ConcurrentHashMap
* [HIVE-18889] - update all parts of Hive to use the same Guava version
* [HIVE-18892] - Fix NPEs in HiveMetastore.exchange_partitions method
* [HIVE-18898] - Fix NPEs in HiveMetastore.dropPartition method
* [HIVE-18907] - Create utility to fix acid key index issue from HIVE-18817
* [HIVE-18918] - Bad error message in CompactorMR.lanuchCompactionJob()
* [HIVE-18919] - remove separate keytab setting for ZK in LLAP
* [HIVE-18925] - Hive doesn't work when JVM is America/Bahia_Banderas time zone
* [HIVE-18933] - disable ORC codec pool for now; remove clone
* [HIVE-18944] - Groupping sets position is set incorrectly during DPP
* [HIVE-18950] - DESCRIBE EXTENDED missing details of default constraint
* [HIVE-18951] - Fix the llapdump usage error in llapdump.sh
* [HIVE-18955] - HoS: Unable to create Channel from class NioServerSocketChannel
* [HIVE-18962] - add WM task state to Tez AM heartbeat
* [HIVE-18963] - JDBC: Provide an option to simplify beeline usage by supporting default and named URL for beeline
* [HIVE-18965] - HIVE-17990 didn't update derby SQL scripts
* [HIVE-18967] - Standalone metastore SQL upgrade scripts do not properly set schema version
* [HIVE-18968] - LLAP: report guaranteed tasks count in AM registry to check for consistency
* [HIVE-18970] - improve AM WM metrics for use in Grafana and such
* [HIVE-18971] - add HS2 WM metrics for use in Grafana and such
* [HIVE-18972] - beeline command suggestion to kill job deprecated
* [HIVE-18975] - NPE when inserting NULL value in structure and array with HBase table
* [HIVE-18976] - Add ability to setup Druid Kafka Ingestion from Hive
* [HIVE-18990] - Hive doesn't close Tez session properly
* [HIVE-18991] - Drop database cascade doesn't work with materialized views
* [HIVE-18992] - enable synthetic file IDs by default in LLAP
* [HIVE-19003] - metastoreconf logs too much on info level
* [HIVE-19007] - Support REPL LOAD from primary using replica connection configurations received through WITH clause.
* [HIVE-19012] - Support builds for ARM and PPC arch
* [HIVE-19014] - utilize YARN-8028 (queue ACL check) in Hive Tez session pool
* [HIVE-19017] - Add util function to determine if 2 ValidWriteIdLists are at the same committed ID
* [HIVE-19018] - beeline -e now requires semicolon even when used with query from command line
* [HIVE-19019] - Vectorization: When vectorized, orc_merge_incompat_schema.q throws HiveException "Not implemented yet" from VectorExpressionWriterMap
* [HIVE-19021] - WM counters are not properly propagated from LLAP to AM
* [HIVE-19024] - Vectorization: Disable complex type constants for VectorUDFAdaptor
* [HIVE-19030] - Update Wiki with new rules for Load Data
* [HIVE-19032] - Vectorization: Disable GROUP BY aggregations with DISTINCT
* [HIVE-19035] - Vectorization: Disable exotic STRUCT field reference form
* [HIVE-19036] - Fix whitespace error in testconfiguration.properties after HIVE-14032
* [HIVE-19037] - Vectorization: Miscellaneous cleanup
* [HIVE-19038] - LLAP: Service loader throws "Provider not found" exception if hive-llap-server is in class path while loading tokens
* [HIVE-19042] - set MALLOC_ARENA_MAX for LLAP
* [HIVE-19043] - Vectorization: LazySimpleDeserializeRead fewer fields handling is broken for Complex Types
* [HIVE-19047] - Only the first init file is interpreted
* [HIVE-19050] - DBNotificationListener does not catch exceptions in the cleaner thread
* [HIVE-19052] - Vectorization: Disable Vector Pass-Thru SMB MapJoin in the presence of old-style MR FilterMaps
* [HIVE-19054] - Function replication shall use "hive.repl.replica.functions.root.dir" as root
* [HIVE-19055] - WM alter may fail if the name is not changed
* [HIVE-19056] - IllegalArgumentException in FixAcidKeyIndex when ORC file has 0 rows
* [HIVE-19057] - Query result caching cannot be disabled by client
* [HIVE-19061] - WM needs to output an event for allocation update
* [HIVE-19062] - Update constraint_partition_columns.q.out
* [HIVE-19065] - Metastore client compatibility check should include syncMetaStoreClient
* [HIVE-19071] - WM: backup resource plans cannot be used without quoted idenitifiers
* [HIVE-19072] - incorrect token handling for LLAP plugin endpoint
* [HIVE-19073] - StatsOptimizer may mangle constant columns
* [HIVE-19074] - Vectorization: Add llap vectorization_div0.q.out Q output file
* [HIVE-19075] - Fix NPE when trying to drop or get DB with null name
* [HIVE-19080] - Fix travis build
* [HIVE-19085] - FastHiveDecimal abs(0) sets sign to +ve
* [HIVE-19099] - HIVE-18755 forgot to update derby install script in metastore
* [HIVE-19100] - investigate TestStreaming failures
* [HIVE-19102] - Vectorization: Suppress known Q file bugs
* [HIVE-19105] - HIVE-18781 broke WarehouseInstance
* [HIVE-19108] - Vectorization and Parquet: Turning on vectorization in parquet_ppd_decimal.q causes Wrong Query Results
* [HIVE-19116] - Vectorization: Vector Map data type doesn't keep the order of the key/values pairs as read
* [HIVE-19119] - Fix the TestAppendPartitions tests which are failing in the pre-commit runs
* [HIVE-19120] - catalog not properly set for some tables in SQL upgrade scripts
* [HIVE-19121] - Fix HiveSchemaTool validation for databases that don't support schema
* [HIVE-19124] - implement a basic major compactor for MM tables
* [HIVE-19130] - NPE is thrown when REPL LOAD applied drop partition event.
* [HIVE-19131] - DecimalColumnStatsMergerTest comparison review
* [HIVE-19137] - orcfiledump doesn't print hive.acid.version value
* [HIVE-19151] - Update expected result for some TestNegativeCliDriver tests
* [HIVE-19155] - Day time saving cause Druid inserts to fail with org.apache.hive.druid.io.druid.java.util.common.UOE: Cannot add overlapping segments
* [HIVE-19157] - Assert that Insert into Druid Table fails if the publishing of metadata by HS2 fails
* [HIVE-19167] - Map data type doesn't keep the order of the key/values pairs as read (Part 2, The Sequel or SQL)
* [HIVE-19168] - Ranger changes for llap commands
* [HIVE-19186] - Multi Table INSERT statements query has a flaw for partitioned table when INSERT INTO and INSERT OVERWRITE are used
* [HIVE-19187] - Update Druid Storage Handler to Druid 0.12.0
* [HIVE-19191] - Assertion error while running materialized view rewriting
* [HIVE-19200] - Vectorization: Disable vectorization for LLAP I/O when a non-VECTORIZED_INPUT_FILE_FORMAT mode is needed (i.e. rows) and data type conversion is needed
* [HIVE-19215] - JavaUtils.AnyIdDirFilter ignores base_n directories
* [HIVE-19219] - Incremental REPL DUMP should throw error if requested events are cleaned-up.
* [HIVE-19224] - incorrect token handling for LLAP plugin endpoint - part 2
* [HIVE-19226] - Extend storage-api to print timestamp values in UTC
* [HIVE-19230] - Schema column width inconsistency in Oracle
* [HIVE-19231] - Beeline generates garbled output when using UnsupportedTerminal
* [HIVE-19233] - Add utility for acid 1.0 to 2.0 migration
* [HIVE-19240] - backport HIVE-17645 to 3.0
* [HIVE-19247] - StatsOptimizer: Missing stats fast-path for Date
* [HIVE-19248] - REPL LOAD couldn't copy file from source CM path and also doesn't throw error if file copy fails.
* [HIVE-19249] - Replication: WITH clause is not passing the configuration to Task correctly in all cases
* [HIVE-19260] - Streaming Ingest API doesn't normalize db.table names
* [HIVE-19264] - Vectorization: Reenable vectorization in vector_adaptor_usage_mode.q
* [HIVE-19269] - Vectorization: Turn On by Default
* [HIVE-19275] - Vectorization: Defer Wrong Results / Execution Failures when Vectorization turned on
* [HIVE-19277] - Active/Passive HA web endpoints does not allow cross origin requests
* [HIVE-19280] - Invalid error messages for UPDATE/DELETE on insert-only transactional tables
* [HIVE-19281] - incorrect protocol name for LLAP AM plugin
* [HIVE-19282] - don't nest delta directories inside LB directories for ACID tables
* [HIVE-19298] - Fix operator tree of CTAS for Druid Storage Handler
* [HIVE-19310] - Metastore: MetaStoreDirectSql.ensureDbInit has some slow DN calls which might need to be run only in test env
* [HIVE-19315] - Test failure org.apache.hadoop.hive.ql.lockmgr.TestDbTxnManager2#testWriteSetTracking3
* [HIVE-19324] - improve YARN queue check error message in Tez pool
* [HIVE-19327] - qroupby_rollup_empty.q fails for insert-only transactional tables
* [HIVE-19330] - multi_insert_partitioned.q fails with "src table does not exist" message.
* [HIVE-19331] - Repl load config in "with" clause not pass to Context.getStagingDir
* [HIVE-19338] - isExplicitAnalyze method may be incorrect in BasicStatsTask
* [HIVE-19339] - Regenerate alltypesorc file with latest ORC
* [HIVE-19350] - Vectorization: Turn off vectorization for explainuser_1.q / spark_explainuser_1
* [HIVE-19352] - Vectorization: Disable vectorization for org.apache.hive.jdbc.TestJdbcDriver2.testResultSetMetaData
* [HIVE-19361] - Backport HIVE-18910 to branch -3
* [HIVE-19362] - enable LLAP cache affinity by default
* [HIVE-19363] - remove cryptic metrics from LLAP IO output
* [HIVE-19365] - Index on COMPLETED_TXN_COMPONENTS in Metastore RDBMS has different names in different scripts
* [HIVE-19367] - Load Data should fail for empty Parquet files.
* [HIVE-19381] - Function replication in cloud fail when download resource from AWS
* [HIVE-19383] - Add ArrayList$SubList kryo serializer
* [HIVE-19384] - Vectorization: IfExprTimestamp* do not handle NULLs correctly
* [HIVE-19386] - Move TABLE_BUCKETING_VERSION to hive_metastore.thrift
* [HIVE-19394] - WM_TRIGGER trigger creation failed with type cast from Integer to Boolean
* [HIVE-19396] - HiveOperation is incorrectly set for analyze statement
* [HIVE-19410] - don't create serde reader in LLAP if there's no cache
* [HIVE-19420] - Support LOAD from SeqFile to ORC table
* [HIVE-19423] - REPL LOAD creates staging directory in source dump directory instead of table data location
* [HIVE-19433] - HiveJoinPushTransitivePredicatesRule hangs
* [HIVE-19435] - Incremental replication cause data loss if a table is dropped followed by create and insert-into with different partition type.
* [HIVE-19446] - QueryCache: Transaction lists needed for pending cache entries
* [HIVE-19474] - Decimal type should be casted as part of the CTAS or INSERT Clause.
* [HIVE-19476] - Fix failures in TestReplicationScenariosAcidTables, TestReplicationOnHDFSEncryptedZones and TestCopyUtils
* [HIVE-19477] - Hiveserver2 in http mode not emitting metric default.General.open_connections
* [HIVE-19479] - encoded stream seek is incorrect for 0-length RGs in LLAP IO
* [HIVE-19483] - Metastore cleaner tasks that run periodically are created more than once
* [HIVE-19506] - Test suites timing out
** New Feature
* [HIVE-1010] - Implement INFORMATION_SCHEMA in Hive
* [HIVE-8838] - Support Parquet through HCatalog
* [HIVE-15229] - 'like any' and 'like all' operators in hive
* [HIVE-15434] - Add UDF to allow interrogation of uniontype values
* [HIVE-15571] - Support Insert into for druid storage handler
* [HIVE-15691] - Create StrictRegexWriter to work with RegexSerializer for Flume Hive Sink
* [HIVE-15996] - Implement multiargument GROUPING function
* [HIVE-16281] - Upgrade master branch to JDK8
* [HIVE-16452] - Database UUID for metastore DB
* [HIVE-16520] - Cache hive metadata in metastore
* [HIVE-16575] - Support for 'UNIQUE' and 'NOT NULL' constraints
* [HIVE-16602] - Implement shared scans with Tez
* [HIVE-16605] - Enforce NOT NULL constraints
* [HIVE-16643] - BeeLine tests output should keep the PREHOOK/POSTHOOK Input/Output orderdering
* [HIVE-16917] - HiveServer2 guard rails - Limit concurrent connections from user
* [HIVE-17089] - make acid 2.0 the default
* [HIVE-17160] - Adding kerberos Authorization to the Druid hive integration
* [HIVE-17361] - Support LOAD DATA for transactional tables
* [HIVE-17366] - Constraint replication in bootstrap
* [HIVE-17432] - Enable join and aggregate materialized view rewriting
* [HIVE-17466] - Metastore API to list unique partition-key-value combinations
* [HIVE-17481] - LLAP workload management
* [HIVE-17626] - Query reoptimization using cached runtime statistics
* [HIVE-17710] - LockManager should only lock Managed tables
* [HIVE-17717] - Enable rule to push post-aggregations into Druid
* [HIVE-18281] - HiveServer2 HA for LLAP and Workload Manager
* [HIVE-18347] - Allow pluggable dynamic lookup of Hive Metastores from HiveServer2
* [HIVE-18361] - Extend shared work optimizer to reuse computation beyond work boundaries
* [HIVE-18373] - Make it easier to search for column name in a table
* [HIVE-18726] - Implement DEFAULT constraint
* [HIVE-18739] - Add support for Import/Export from Acid table
* [HIVE-18814] - Support Add Partition For Acid tables
* [HIVE-18835] - JDBC standalone jar download link in ambari
* [HIVE-18841] - Support authorization of UDF usage in hive
* [HIVE-18953] - Implement CHECK constraint
* [HIVE-19059] - Support DEFAULT keyword with INSERT and UPDATE
** Improvement
* [HIVE-8472] - Add ALTER DATABASE SET LOCATION
* [HIVE-9447] - Metastore: inefficient Oracle query for removing unused column descriptors when add/drop table/partition
* [HIVE-12274] - Increase width of columns used for general configuration in the metastore.
* [HIVE-12299] - Hive Column Data Type definition in schema limited to 4000 characters - too small
* [HIVE-12636] - Ensure that all queries (with DbTxnManager) run in a transaction
* [HIVE-13842] - Expose ability to set number of connections in the pool in TxnHandler
* [HIVE-14069] - update curator version to 2.12.0
* [HIVE-14145] - Too small length of column 'PARAM_VALUE' in table 'SERDE_PARAMS'
* [HIVE-14389] - Beeline should not output query and prompt to stdout
* [HIVE-14786] - Beeline displays binary column data as string instead of byte array
* [HIVE-15053] - Beeline#addlocaldriver - reduce classpath scanning
* [HIVE-15300] - Reuse table information in SemanticAnalyzer::getMetaData to reduce compilation time
* [HIVE-15393] - Update Guava version
* [HIVE-15396] - Basic Stats are not collected when for managed tables with LOCATION specified
* [HIVE-15433] - setting hive.warehouse.subdir.inherit.perms in HIVE won't overwrite it in hive configuration
* [HIVE-15616] - Improve contents of qfile test output
* [HIVE-15631] - Optimize for hive client logs , you can filter the log for each session itself.
* [HIVE-15665] - LLAP: OrcFileMetadata objects in cache can impact heap usage
* [HIVE-15726] - Reenable indentation checks to checkstyle
* [HIVE-15786] - Provide additional information from the llapstatus command
* [HIVE-15795] - Support Accumulo Index Tables in Hive Accumulo Connector
* [HIVE-15880] - Allow insert overwrite and truncate table query to use auto.purge table property
* [HIVE-16049] - upgrade to jetty 9
* [HIVE-16075] - MetaStore needs to reinitialize log4j to allow log specific settings via hiveconf take effect
* [HIVE-16079] - HS2: high memory pressure due to duplicate Properties objects
* [HIVE-16084] - SHOW COMPACTIONS should display CompactionID
* [HIVE-16143] - Improve msck repair batching
* [HIVE-16152] - TestBeeLineDriver logging improvements
* [HIVE-16164] - Provide mechanism for passing HMS notification ID between transactional and non-transactional listeners.
* [HIVE-16206] - Make Codahale metrics reporters pluggable
* [HIVE-16242] - Run BeeLine tests parallel
* [HIVE-16285] - Servlet for dynamically configuring log levels
* [HIVE-16297] - Improving hive logging configuration variables
* [HIVE-16311] - Improve the performance for FastHiveDecimalImpl.fastDivide
* [HIVE-16334] - Query lock contains the query string, which can cause OOM on ZooKeeper
* [HIVE-16340] - Allow Kerberos + SSL connections to HMS
* [HIVE-16343] - LLAP: Publish YARN's ProcFs based memory usage to metrics for monitoring
* [HIVE-16345] - BeeLineDriver should be able to run qtest files which are using default database tables
* [HIVE-16356] - Table#validateColumns should avoid checking exhaustively for matches in a list
* [HIVE-16360] - Improve "No delta files or original files found to compact in" message
* [HIVE-16371] - Add bitmap selection strategy for druid storage handler
* [HIVE-16383] - Switch to HikariCP as default connection pooling
* [HIVE-16386] - Add debug logging to describe why runtime filtering semijoins are removed
* [HIVE-16423] - Add hint to enforce semi join optimization
* [HIVE-16426] - Query cancel: improve the way to handle files
* [HIVE-16429] - Should call invokeFailureHooks in handleInterruption to track failed query execution due to interrupted command.
* [HIVE-16430] - Add log to show the cancelled query id when cancelOperation is called.
* [HIVE-16431] - Support Parquet StatsNoJobTask for Spark & Tez engine
* [HIVE-16441] - De-duplicate semijoin branches in n-way joins
* [HIVE-16449] - BeeLineDriver should handle query result sorting
* [HIVE-16456] - Kill spark job when InterruptedException happens or driverContext.isShutdown is true.
* [HIVE-16460] - In the console output, show vertex list in topological order instead of an alphabetical sort
* [HIVE-16501] - Add rej/orig to .gitignore ; remove *.orig files
* [HIVE-16503] - LLAP: Oversubscribe memory for noconditional task size
* [HIVE-16527] - Support outer and mixed reference aggregates in windowed functions
* [HIVE-16536] - Various improvements in TestPerfCliDriver
* [HIVE-16550] - Semijoin Hints should be able to skip the optimization if needed.
* [HIVE-16552] - Limit the number of tasks a Spark job may contain
* [HIVE-16571] - HiveServer2: Prefer LIFO over round-robin for Tez session reuse
* [HIVE-16582] - HashTableLoader should log info about the input, rows, size etc.
* [HIVE-16594] - Add more tests for BeeLineDriver
* [HIVE-16595] - fix syntax in Hplsql.g4
* [HIVE-16604] - Use [NOT] ENFORCED for column constraint characteristics
* [HIVE-16614] - Support "set local time zone" statement
* [HIVE-16635] - Progressbar: Use different timeouts for running queries
* [HIVE-16639] - LLAP: Derive shuffle thread counts and keep-alive connections from instance count
* [HIVE-16663] - String Caching For Rows
* [HIVE-16700] - Log ZK discovery info (hostname & port) for HTTP mode when connection is established
* [HIVE-16711] - Remove property_id column from metastore_db_properties table
* [HIVE-16712] - StringBuffer v.s. StringBuilder
* [HIVE-16717] - Extend shared scan optimizer to handle partitions
* [HIVE-16723] - Enable configurable MetaStoreSchemaInfo
* [HIVE-16736] - General Improvements to BufferedRows
* [HIVE-16754] - LLAP: Print hive version info on llap daemon startup
* [HIVE-16758] - Better Select Number of Replications
* [HIVE-16759] - Add table type information to HMS log notifications
* [HIVE-16771] - Schematool should use MetastoreSchemaInfo to get the metastore schema version from database
* [HIVE-16799] - Control the max number of task for a stage in a spark job
* [HIVE-16805] - Utilities isEmptyPath Logging Too Chatty and Uses Bad Format
* [HIVE-16809] - Improve filter condition for correlated subqueries
* [HIVE-16811] - Estimate statistics in absence of stats
* [HIVE-16833] - Review org.apache.hive.jdbc.HiveMetaDataResultSet
* [HIVE-16834] - Review org.apache.hadoop.hive.serde2.ByteStream
* [HIVE-16853] - Minor org.apache.hadoop.hive.ql.exec.HashTableSinkOperator Improvement
* [HIVE-16855] - org.apache.hadoop.hive.ql.exec.mr.HashTableLoader Improvements
* [HIVE-16856] - Allow For Customization Of Buffer Size In MapJoinTableContainerSerDe
* [HIVE-16857] - SparkPartitionPruningSinkOperator Buffer Size
* [HIVE-16858] - Accumulo Utils Improvements
* [HIVE-16866] - existing available UDF is used in TestReplicationScenariosAcrossInstances#testDropFunctionIncrementalReplication
* [HIVE-16867] - Extend shared scan optimizer to reuse computation from other operators
* [HIVE-16873] - Remove Thread Cache From Logging
* [HIVE-16880] - Remove ArrayList Instantiation For Empty Arrays
* [HIVE-16881] - Make extractSqlBoolean More Consistent
* [HIVE-16885] - Non-equi Joins: Filter clauses should be pushed into the ON clause
* [HIVE-16890] - org.apache.hadoop.hive.serde2.io.HiveVarcharWritable - Adds Superfluous Wrapper
* [HIVE-16900] - optimization to give distcp a list of input files to copy to a destination target directory during repl load
* [HIVE-16911] - Upgrade groovy version to 2.4.11
* [HIVE-16914] - Change HiveMetaStoreClient to AutoCloseable
* [HIVE-16933] - ORA-00060: deadlock detected while waiting on commit
* [HIVE-16934] - Transform COUNT(x) into COUNT() when x is not nullable
* [HIVE-16945] - Add method to compare Operators
* [HIVE-16955] - General Improvements To org.apache.hadoop.hive.metastore.MetaStoreUtils
* [HIVE-16962] - Better error msg for Hive on Spark in case user cancels query and closes session
* [HIVE-16969] - Improvement performance of MapOperator for Parquet
* [HIVE-16970] - General Improvements To org.apache.hadoop.hive.metastore.cache.CacheUtils
* [HIVE-16989] - Fix some issues identified by lgtm.com
* [HIVE-17000] - Upgrade Hive to PARQUET 1.9.0
* [HIVE-17022] - Add mode in lock debug statements
* [HIVE-17036] - Lineage: Minor CPU/Mem optimization for lineage transform
* [HIVE-17037] - Use 1-to-1 Tez edge to avoid unnecessary input data shuffle
* [HIVE-17048] - Pass HiveOperation info to HiveSemanticAnalyzerHook through HiveSemanticAnalyzerHookContext
* [HIVE-17054] - Expose SQL database constraints to Calcite
* [HIVE-17072] - Make the parallelized timeout configurable in BeeLine tests
* [HIVE-17078] - Add more logs to MapredLocalTask
* [HIVE-17125] - Lineage: Generate lineage information on need basis when atlas hook is enabled
* [HIVE-17139] - Conditional expressions optimization: skip the expression evaluation if the condition is not satisfied for vectorization engine.
* [HIVE-17174] - LLAP: ShuffleHandler: optimize fadvise calls for broadcast edge
* [HIVE-17194] - JDBC: Implement Gzip compression for HTTP mode
* [HIVE-17229] - HiveMetastore HMSHandler locks during initialization, even though its static variable threadPool is not null
* [HIVE-17237] - HMS wastes 26.4% of memory due to dup strings in metastore.api.Partition.parameters
* [HIVE-17251] - Remove usage of org.apache.pig.ResourceStatistics#setmBytes method in HCatLoader
* [HIVE-17253] - Adding SUMMARY statement to HPL/SQL
* [HIVE-17263] - Reduce debug logging for S3 tables
* [HIVE-17288] - LlapOutputFormatService: Increase netty event loop threads
* [HIVE-17308] - Improvement in join cardinality estimation
* [HIVE-17329] - ensure acid side file is not overwritten
* [HIVE-17340] - TxnHandler.checkLock() - reduce number of SQL statements
* [HIVE-17341] - DbTxnManger.startHeartbeat() - randomize initial delay
* [HIVE-17362] - The MAX_PREWARM_TIME should be configurable on HoS
* [HIVE-17376] - Upgrade snappy version to 1.1.4
* [HIVE-17400] - Estimate stats in absence of stats for complex types
* [HIVE-17408] - replication distcp should only be invoked if number of files AND file size cross configured limits
* [HIVE-17422] - Skip non-native/temporary tables for all major table/partition related scenarios
* [HIVE-17426] - Execution framework in hive to run tasks in parallel
* [HIVE-17458] - VectorizedOrcAcidRowBatchReader doesn't handle 'original' files
* [HIVE-17493] - Improve PKFK cardinality estimation in Physical planning
* [HIVE-17513] - Refactor PathUtils to not contain instance fields
* [HIVE-17519] - Transpose column stats display
* [HIVE-17536] - StatsUtil::getBasicStatForTable doesn't distinguish b/w absence of statistics or zero stats
* [HIVE-17538] - Enhance estimation of stats to estimate even if only one column is missing stats
* [HIVE-17542] - Make HoS CombineEquivalentWorkResolver Configurable
* [HIVE-17543] - Enable PerfCliDriver for HoS
* [HIVE-17550] - Remove unreferenced q.out-s
* [HIVE-17569] - Compare filtered output files in BeeLine tests
* [HIVE-17578] - Create a TableRef object for Table/Partition
* [HIVE-17587] - Remove unnecessary filter from getPartitionsFromPartitionIds call
* [HIVE-17604] - Add druid properties to conf white list
* [HIVE-17606] - Improve security for DB notification related APIs
* [HIVE-17609] - Tool to manipulate delegation tokens
* [HIVE-17611] - Add new LazyBinary SerDe for faster writes
* [HIVE-17614] - Notification_sequence initialization using SQL statement which is compatible with Mysql 5.1
* [HIVE-17631] - upgrade orc to 1.4.1
* [HIVE-17669] - Cache to optimize SearchArgument deserialization
* [HIVE-17732] - Minor Improvements - org.apache.hive.hcatalog.data.JsonSerDe.java
* [HIVE-17740] - HiveConf - Use SLF4J Parameterization
* [HIVE-17742] - AccumuloIndexedOutputFormat Use SLF4J
* [HIVE-17747] - HMS DropTableMessage should include the full table object
* [HIVE-17766] - Support non-equi LEFT SEMI JOIN
* [HIVE-17767] - Rewrite correlated EXISTS/IN subqueries into LEFT SEMI JOIN
* [HIVE-17787] - Apply more filters on the BeeLine test output files (follow-up on HIVE-17569)
* [HIVE-17793] - Parameterize Logging Messages
* [HIVE-17799] - Add Ellipsis For Truncated Query In Hive Lock
* [HIVE-17805] - SchemaTool validate locations should not return exit 1
* [HIVE-17824] - msck repair table should drop the missing partitions from metastore
* [HIVE-17847] - Exclude net.hydromatic:aggdesigner-algorithm jar as compile and runtime dependency
* [HIVE-17870] - Update NoDeleteRollingFileAppender to use Log4j2 api
* [HIVE-17871] - Add non nullability flag to druid time column
* [HIVE-17877] - HoS: combine equivalent DPP sink works
* [HIVE-17898] - Explain plan output enhancement
* [HIVE-17901] - org.apache.hadoop.hive.ql.exec.Utilities - Use Logging Parameterization and More
* [HIVE-17911] - org.apache.hadoop.hive.metastore.ObjectStore - Tune Up
* [HIVE-17912] - org.apache.hadoop.hive.metastore.security.DBTokenStore - Parameterize Logging
* [HIVE-17932] - Remove option to control partition level basic stats fetching
* [HIVE-17962] - org.apache.hadoop.hive.metastore.security.MemoryTokenStore - Parameterize Logging
* [HIVE-17964] - HoS: some spark configs doesn't require re-creating a session
* [HIVE-17965] - Remove HIVELIMITTABLESCANPARTITION support
* [HIVE-17969] - Metastore to alter table in batches of partitions when renaming table
* [HIVE-17988] - Replace patch utility usage with git apply in ptest
* [HIVE-18008] - Add optimization rule to remove gby from right side of left semi-join
* [HIVE-18009] - Multiple lateral view query is slow on hive on spark
* [HIVE-18010] - Update hbase version
* [HIVE-18023] - Redact the expression in lineage info
* [HIVE-18043] - Vectorization: Support List type in MapWork
* [HIVE-18048] - Vectorization: Support Struct type with vectorization
* [HIVE-18051] - qfiles: dataset support
* [HIVE-18061] - q.outs: be more selective with masking hdfs paths
* [HIVE-18123] - Explain formatted improve column expression map display
* [HIVE-18158] - Remove OrcRawRecordMerger.ReaderPairAcid.statementId
* [HIVE-18159] - Vectorization: Support Map type in MapWork
* [HIVE-18173] - Improve plans for correlated subqueries with non-equi predicate
* [HIVE-18185] - update insert_values_orig_table_use_metadata.q.out
* [HIVE-18246] - Replace toString with getExprString in AbstractOperatorDesc::getColumnExprMapForExplain
* [HIVE-18251] - Loosen restriction for some checks
* [HIVE-18259] - Automatic cleanup of invalidation cache for materialized views
* [HIVE-18283] - Better error message and error code for HoS exceptions
* [HIVE-18342] - Remove LinkedList from HiveAlterHandler.java
* [HIVE-18343] - Remove LinkedList from ColumnStatsSemanticAnalyzer.java
* [HIVE-18344] - Remove LinkedList from SharedWorkOptimizer.java
* [HIVE-18386] - Create dummy materialized views registry and make it configurable
* [HIVE-18387] - Minimize time that REBUILD locks the materialized view
* [HIVE-18410] - [Performance][Avro] Reading flat Avro tables is very expensive in Hive
* [HIVE-18423] - Support pushing computation from the optimizer for JDBC storage handler tables
* [HIVE-18448] - Drop Support For Indexes From Apache Hive
* [HIVE-18462] - Explain formatted for queries with map join has columnExprMap with unformatted column name
* [HIVE-18510] - Enable running checkstyle on test sources as well
* [HIVE-18540] - remove logic for wide terminal to display in-place updates
* [HIVE-18543] - Add print sessionid in console
* [HIVE-18552] - Split hive.strict.checks.large.query into two configs
* [HIVE-18564] - Add a mapper to make plan transformations more easily understandable
* [HIVE-18586] - Upgrade Derby to 10.14.1.0
* [HIVE-18625] - SessionState Not Checking For Directory Creation Result
* [HIVE-18654] - Add Hiveserver2 specific HADOOP_OPTS environment variable
* [HIVE-18706] - Ensure each Yetus execution has its own separate working dir
* [HIVE-18716] - Delete unnecessary parameters from TaskFactory
* [HIVE-18718] - Integer like types throws error when there is a mismatch
* [HIVE-18727] - Update GenericUDFEnforceNotNullConstraint to throw an ERROR instead of Exception on failure
* [HIVE-18730] - Use LLAP as execution engine for Druid mini Cluster Tests
* [HIVE-18743] - CREATE TABLE on S3 data can be extremely slow. DO_NOT_UPDATE_STATS workaround is buggy.
* [HIVE-18770] - Additional tests and fixes for materialized view rewriting
* [HIVE-18780] - Improve schema discovery For Druid Storage Handler
* [HIVE-18793] - Round udf should support variable as second argument
* [HIVE-18797] - ExprConstNodeDesc's getExprString should put appropriate qualifier with literals
* [HIVE-18808] - Make compaction more robust when stats update fails
* [HIVE-18825] - Define ValidTxnList before starting query optimization
* [HIVE-18839] - Implement incremental rebuild for materialized views (only insert operations in source tables)
* [HIVE-18848] - Improve readability of filter conditions in explain plan when CBO is run
* [HIVE-18857] - Store default value text instead of default value expression in metastore
* [HIVE-18878] - Lower MoveTask Lock Logging to Debug
* [HIVE-18901] - Lower ResourceDownloader Logging to Debug
* [HIVE-18979] - Enable AggregateReduceFunctionsRule from Calcite
* [HIVE-18984] - Make time window configurable per materialized view
* [HIVE-18995] - Vectorization: Add option to suppress "Execution mode: vectorized" for testing purposes
* [HIVE-19001] - ALTER TABLE ADD CONSTRAINT support for CHECK constraint
* [HIVE-19033] - Provide an option to purge LLAP IO cache
* [HIVE-19070] - Add More Test To Druid Mini Cluster queries.
* [HIVE-19092] - Somne improvement in bin shell scripts
* [HIVE-19161] - Add authorizations to information schema
* [HIVE-19288] - Implement protobuf logging hive hook.
* [HIVE-19344] - Change default value of msck.repair.batch.size
* [HIVE-19415] - Support CORS for all HS2 web endpoints
* [HIVE-19466] - Update constraint violation error message
* [HIVE-19534] - Allow implementations to access member variables of AbstractRecordWriter
** Test
* [HIVE-13843] - Re-enable the HoS tests disabled in HIVE-13402
* [HIVE-15538] - Test HIVE-13884 with more complex query predicates
* [HIVE-16288] - Add blobstore tests for ORC and RCFILE file formats
* [HIVE-16359] - Update golden file for subquery_select.q
* [HIVE-16415] - Add tests covering single inserts of zero rows
* [HIVE-16454] - Add blobstore tests for inserting empty into dynamic partition/list bucket tables & inserting cross blobstore tables
* [HIVE-16540] - dynamic_semijoin_user_level is failing on MiniLlap
* [HIVE-16636] - TestPerfCli driver is missing query24
* [HIVE-16664] - Add join related Hive blobstore tests
* [HIVE-16673] - Test for HIVE-16413
* [HIVE-16831] - Add unit tests for NPE fixes in HIVE-12054
* [HIVE-17034] - The spark tar for itests is downloaded every time if md5sum is not installed
* [HIVE-17190] - Schema changes for bitvectors for unpartitioned tables
* [HIVE-17246] - Add having related blobstore query test
* [HIVE-17430] - Add LOAD DATA test for blobstores
* [HIVE-17636] - Add multiple_agg.q test for blobstores
* [HIVE-17729] - Add Database & Explain related blobstore tests
* [HIVE-17789] - Flaky test: TestSessionManagerMetrics.testAbandonedSessionMetrics has timing related problems
* [HIVE-17820] - Add buckets.q test for blobstores
* [HIVE-18041] - Add SORT_QUERY_RESULTS to subquery_multi
* [HIVE-18089] - Update golden files for few tests
* [HIVE-18100] - Some tests time out
* [HIVE-18186] - Fix wrong assertion in TestHiveMetaStoreAlterColumnPar test
* [HIVE-18260] - Add test case scenarios for materialized views invalidation cache and registry
* [HIVE-18327] - Remove the unnecessary HiveConf dependency for MiniHiveKdc
* [HIVE-18485] - Add more unit tests for hive.strict.checks.* properties
* [HIVE-18588] - Add 'checkin' profile that runs slower tests in standalone-metastore
* [HIVE-18867] - create_with_constraints_duplicate_name and default_constraint_invalid_default_value_length failing
* [HIVE-19060] - Fix the TestAppendPartitions.testAppendPartitionNullPartValues
* [HIVE-19123] - TestNegativeCliDriver nopart_insert failing
* [HIVE-19143] - Update golden files for negative tests
* [HIVE-19271] - TestMiniLlapLocalCliDriver default_constraint and check_constraint failing
** Wish
* [HIVE-17540] - remove feature: describe pretty
** Task
* [HIVE-15708] - Upgrade calcite version to 1.12
* [HIVE-16058] - Disable falling back to non-cbo for SemanticException for tests
* [HIVE-16392] - Remove hive.warehouse.subdir.inherit.perms and all permissions inheritance logic
* [HIVE-16395] - ConcurrentModificationException on config object in HoS
* [HIVE-16411] - Revert HIVE-15199
* [HIVE-16474] - Upgrade Druid version to 0.10
* [HIVE-17107] - Upgrade Yetus to 0.5.0
* [HIVE-17234] - Remove HBase metastore from master
* [HIVE-17425] - Change MetastoreConf.ConfVars internal members to be private
* [HIVE-17480] - repl dump sub dir should use UUID instead of timestamp
* [HIVE-17521] - Improve defaults for few runtime configs
* [HIVE-17544] - Provide classname info for function authorization
* [HIVE-17672] - Upgrade Calcite version to 1.14
* [HIVE-17857] - Upgrade to orc 1.4
* [HIVE-18131] - Truncate table for Acid tables
* [HIVE-18272] - Fix check-style violations in subquery code
* [HIVE-18433] - Upgrade version of com.fasterxml.jackson
* [HIVE-18436] - Upgrade to Spark 2.3.0
* [HIVE-18560] - qtests: QTestUtil refactor/split - QOutProcessor
* [HIVE-18598] - Disallow NOT NULL constraints to be ENABLED/ENFORCED with EXTERNAL table
* [HIVE-18754] - REPL STATUS should support 'with' clause
* [HIVE-18917] - Add spark.home to hive.conf.restricted.list
* [HIVE-18957] - Upgrade Calcite version to 1.16.0
* [HIVE-18959] - Avoid creating extra pool of threads within LLAP
* [HIVE-18993] - Use Druid Expressions
* [HIVE-19049] - Add support for Alter table add columns for Druid
* [HIVE-19091] - [Hive 3.0.0 Release] Rat check failure fixes
* [HIVE-19134] - Update copyright NOTICE and fix rat check failures
* [HIVE-19172] - NPE due to null EnvironmentContext in DDLTask
* [HIVE-19173] - Add Storage Handler runtime information as part of DESCRIBE EXTENDED
* [HIVE-19184] - Hive 3.0.0 release branch preparation
* [HIVE-19257] - HIVE-19157 commit references wrong jira
* [HIVE-19309] - Add Arrow dependencies to LlapServiceDriver
* [HIVE-19311] - Partition and bucketing support for “load data” statement
* [HIVE-19451] - Druid Query Execution fails with ClassNotFoundException org.antlr.v4.runtime.CharStream
* [HIVE-19491] - Branch-3 Start using storage-api 2.6.1 once available.
Release Notes - Hive - Version 2.3.0
** Sub-task
* [HIVE-14807] - analyze table compute statistics fails due to presence of Infinity value in double column
* [HIVE-15556] - Replicate views
* [HIVE-16186] - REPL DUMP shows last event ID of the database even if we use LIMIT option.
* [HIVE-16249] - With column stats, mergejoin.q throws NPE
* [HIVE-16293] - Column pruner should continue to work when SEL has more than 1 child
* [HIVE-16387] - Fix failing test org.apache.hive.jdbc.TestJdbcDriver2.testResultSetMetaData
* [HIVE-16440] - Fix failing test columnstats_partlvl_invalid_values when autogather column stats is on
* [HIVE-16504] - Addition of binary licenses broke rat check
* [HIVE-16535] - Hive fails to build from source code tarball
* [HIVE-16537] - Add missing AL files
** Bug
* [HIVE-9815] - Metastore column"SERDE_PARAMS"."PARAM_VALUE" limited to 4000 bytes
* [HIVE-14077] - add implicit decimal arithmetic q test, fix issues if found
* [HIVE-14801] - improve TestPartitionNameWhitelistValidation stability
* [HIVE-15035] - Clean up Hive licenses for binary distribution
* [HIVE-15249] - HIve 2.1.0 is throwing InvalidObjectException(message:Invalid column type name is too long
* [HIVE-15829] - LLAP text cache: disable memory tracking on the writer
* [HIVE-15923] - Hive default partition causes errors in get partitions
* [HIVE-16007] - When the query does not complie the LogRunnable never stops
* [HIVE-16188] - beeline should block the connection if given invalid database name.
* [HIVE-16193] - Hive show compactions not reflecting the status of the application
* [HIVE-16219] - metastore notification_log contains serialized message with non functional fields
* [HIVE-16231] - Parquet timestamp may be stored differently since HIVE-12767
* [HIVE-16274] - Support tuning of NDV of columns using lower/upper bounds
* [HIVE-16287] - Alter table partition rename with location - moves partition back to hive warehouse
* [HIVE-16301] - Preparing for 2.3 development.
* [HIVE-16305] - Additional Datanucleus ClassLoaderResolverImpl leaks causing HS2 OOM
* [HIVE-16308] - PreExecutePrinter and PostExecutePrinter should log to INFO level instead of ERROR
* [HIVE-16310] - Get the output operators of Reducesink when vectorization is on
* [HIVE-16315] - Describe table doesn't show num of partitions
* [HIVE-16318] - LLAP cache: address some issues in 2.2/2.3
* [HIVE-16321] - Possible deadlock in metastore with Acid enabled
* [HIVE-16336] - Rename hive.spark.use.file.size.for.mapjoin to hive.spark.use.ts.stats.for.mapjoin
* [HIVE-16341] - Tez Task Execution Summary has incorrect input record counts on some operators
* [HIVE-16366] - Hive 2.3 release planning
* [HIVE-16380] - removing global test dependency of jsonassert
* [HIVE-16385] - StatsNoJobTask could exit early before all partitions have been processed
* [HIVE-16390] - LLAP IO should take job config into account; also LLAP config should load defaults
* [HIVE-16403] - LLAP UI shows the wrong number of executors
* [HIVE-16459] - Forward channelInactive to RpcDispatcher
* [HIVE-16461] - DagUtils checks local resource size on the remote fs
* [HIVE-16465] - NullPointer Exception when enable vectorization for Parquet file format
* [HIVE-16473] - Hive-on-Tez may fail to write to an HBase table
* [HIVE-16519] - Fix exception thrown by checkOutputSpecs
* [HIVE-16545] - LLAP: bug in arena size determination logic
* [HIVE-16547] - LLAP: may not unlock buffers in some cases
** Improvement
* [HIVE-12274] - Increase width of columns used for general configuration in the metastore.
* [HIVE-12299] - Hive Column Data Type definition in schema limited to 4000 characters - too small
* [HIVE-14145] - Too small length of column 'PARAM_VALUE' in table 'SERDE_PARAMS'
* [HIVE-15880] - Allow insert overwrite and truncate table query to use auto.purge table property
* [HIVE-16115] - Stop printing progress info from operation logs with beeline progress bar
* [HIVE-16164] - Provide mechanism for passing HMS notification ID between transactional and non-transactional listeners.
** New Feature
* [HIVE-15434] - Add UDF to allow interrogation of uniontype values
* [HIVE-15691] - Create StrictRegexWriter to work with RegexSerializer for Flume Hive Sink
** Test
* [HIVE-16288] - Add blobstore tests for ORC and RCFILE file formats
* [HIVE-16415] - Add tests covering single inserts of zero rows
* [HIVE-16454] - Add blobstore tests for inserting empty into dynamic partition/list bucket tables & inserting cross blobstore tables