tree: cb59167070c7ee937dd5bfbc136d2a40c2809147 [path history] [tgz]
  1. rfc-27/
  2. rfc-34/
  3. rfc-37/
  4. rfc-38/
  5. rfc-39/
  6. rfc-40/
  7. rfc-41/
  8. rfc-42/
  9. rfc-44/
  10. rfc-45/
  11. rfc-46/
  12. rfc-47/
  13. rfc-48/
  14. rfc-49/
  15. rfc-50/
  16. rfc-51/
  17. rfc-53/
  18. rfc-55/
  19. rfc-56/
  20. rfc-57/
  21. rfc-60/
  22. rfc-63/
  23. rfc-65/
  24. rfc-66/
  25. rfc-68/
  26. rfc-69/
  27. rfc-73/
  28. rfc-76/
  29. rfc-77/
  30. rfc-78/
  31. rfc-8/
  32. rfc-80/
  33. rfc-82/
  34. rfc-83/
  35. rfc-84/
  36. rfc-85/
  37. rfc-87/
  38. rfc-89/
  39. rfc-91/
  40. rfc-92/
  41. rfc-93/
  42. README.md
  43. template.md
rfc/README.md

RFCs

The RFC process is documented on our site. Please familiarize yourself with it, before working a new RFC.

Status can be one of these values.

StatusMeaning
UNDER REVIEWRFC has been proposed and community is actively debating the design/proposal.
IN PROGRESSThe initial phase of implementation is underway.
ONGOINGSome or most work has landed; community continues to improve or build follow on phases.
ABANDONEDThe proposal was not implemented, due to various reasons.
COMPLETEDAll work is deemed complete.

The list of all RFCs can be found here.

Older RFC content is still here.

RFC NumberTitleStatus
1CSV Source Support for Delta StreamerCOMPLETED
2ORC Storage in HudiCOMPLETED
3Timeline Service with Incremental File System View SyncingCOMPLETED
4Faster Hive incremental pull queriesCOMPLETED
5HUI (Hudi WebUI)ABANDONED
6Add indexing support to the log fileABANDONED
7Point in time Time-Travel queries on Hudi tableCOMPLETED
8Metadata based Record IndexCOMPLETED
9Hudi Dataset Snapshot ExporterCOMPLETED
10Restructuring and auto-generation of docsCOMPLETED
11Refactor of the configuration framework of hudi projectABANDONED
12Efficient Migration of Large Parquet Tables to Apache HudiCOMPLETED
13Integrate Hudi with FlinkCOMPLETED
14JDBC incremental pullerCOMPLETED
15HUDI File Listing ImprovementsCOMPLETED
16Abstraction for HoodieInputFormat and RecordReaderCOMPLETED
17Abstract common meta sync module support multiple meta serviceCOMPLETED
18Insert Overwrite APICOMPLETED
19Clustering data for freshness and query performanceCOMPLETED
20handle failed recordsONGOING
21Allow HoodieRecordKey to be VirtualCOMPLETED
22Snapshot Isolation using Optimistic Concurrency Control for multi-writersCOMPLETED
23Hudi Observability metrics collectionABANDONED
24Hoodie Flink Writer ProposalCOMPLETED
25Spark SQL Extension For HudiCOMPLETED
26Optimization For Hudi Table QueryCOMPLETED
27Data skipping index to improve query performanceCOMPLETED
28Support Z-order curveCOMPLETED
29Hash IndexCOMPLETED
30Batch operationABANDONED
31Hive integration ImprovementONGOING
32Kafka Connect Sink for HudiONGOING
33Hudi supports more comprehensive Schema EvolutionCOMPLETED
34Hudi BigQuery IntegrationCOMPLETED
35Make Flink MOR table writing streaming friendlyCOMPLETED
36HUDI Metastore ServerONGOING
37Hudi Metadata based Bloom IndexCOMPLETED
38Spark Datasource V2 IntegrationCOMPLETED
39Incremental source for DebeziumCOMPLETED
40Connector for TrinoCOMPLETED
41Snowflake Integration, supported via Apache XTable (Incubating)ABANDONED
42Consistent Hashing IndexONGOING
43Table Management ServiceONGOING
44Hudi Connector for PrestoCOMPLETED
45Asynchronous Metadata IndexingCOMPLETED
46Optimizing Record Payload HandlingCOMPLETED
47Add Call Produce Command for Spark SQLCOMPLETED
48LogCompaction for MOR tablesCOMPLETED
49Support sync with DataHubCOMPLETED
50Improve Timeline ServerIN PROGRESS
51Change Data CaptureONGOING
52Introduce Secondary Index to Improve HUDI Query PerformanceABANDONED
53Use Lock-Free Message Queue Improving Hoodie Writing EfficiencyCOMPLETED
54New Table APIs and Streamline Hudi ConfigsUNDER REVIEW
55Improve Hive/Meta sync class design and hierarchiesCOMPLETED
56Early Conflict Detection For Multi-WriterCOMPLETED
57DeltaStreamer Protobuf SupportCOMPLETED
58Integrate column stats index with all query enginesUNDER REVIEW
59Multiple event_time Fields Latest Verification in a Single TableUNDER REVIEW
60Federated Storage LayerUNDER REVIEW
61Snapshot view managementUNDER REVIEW
62Diagnostic ReporterUNDER REVIEW
63Expression IndexesONGOING
64New Hudi Table Spec API for Query IntegrationsUNDER REVIEW
65Partition TTL ManagementUNDER REVIEW
66Non Blocking Concurrency ControlUNDER REVIEW
67Hudi Bundle StandardsUNDER REVIEW
68A More Effective HoodieMergeHandler for COW Table with ParquetUNDER REVIEW
69Hudi 1.xCOMPLETED
70Hudi Reverse StreamerUNDER REVIEW
71Enhance OCC conflict detectionUNDER REVIEW
72Redesign Hudi-Spark IntegrationONGOING
73Multi-Table TransactionsUNDER REVIEW
74HoodieStorage: Hudi Storage Abstraction and APIsONGOING
75Hudi-Native HFile Reader and WriterIN PROGRESS
76Auto Record key generationIN PROGRESS
77Secondary IndexONGOING
781.0 MigrationIN PROGRESS
79Robust handling of spark task retries and failuresIN PROGRESS
80Column FamiliesUNDER REVIEW
81Log Compaction with Merge SortUNDER REVIEW
82Concurrent schema evolution detectionUNDER REVIEW
83Incremental Table ServiceCOMPLETED
84Optimized SerDe of DataStream in Flink operatorsCOMPLETED
85Hudi Issue and Sprint Management in JiraUNDER REVIEW
86DataFrame Implementation of HUDI write pathUNDER REVIEW
87Avro elimination for Flink writerUNDER REVIEW
88New Schema/DataType/Expression AbstractionsUNDER REVIEW
89Dynamic Partition Level Bucket IndexUNDER REVIEW
90Add support for cancellable clustering table service plansUNDER REVIEW
91Storage-based lock provider using conditional writesUNDER REVIEW
92Support Bitmap IndexUNDER REVIEW
93Pluggable Table Formats in HudiIN PROGRESS
94Hudi Timeline User Interface (UI)UNDER REVIEW
95Hudi Flink SourceUNDER REVIEW
96Introduce Unified Bucket IndexUNDER REVIEW