The goal is that the python library will provide a functional, performant subset of the java library. The initial focus has been on reading table metadata as well as providing the capability to both plan and execute a scan.
Operation | Java | Python |
---|---|---|
Get Schema | X | X |
Get Snapshots | X | X |
Plan Scan | X | X |
Plan Scan for Snapshot | X | X |
Update Current Snapshot | X | |
Set Table Properties | X | |
Create Table | X | X |
Drop Table | X | X |
Alter Table | X |
Pyarrow is used for reading parquet files, so read support is limited to what is currently supported in the pyarrow.parquet package.
Data Type | Java | Python |
---|---|---|
BooleanType | X | X |
DateType | X | X |
DecimalType | X | X |
FloatType | X | X |
IntegerType | X | X |
LongType | X | X |
TimeType | X | X |
TimestampType | X | X |
Data Type | Java | Python |
---|---|---|
ListType of primitives | X | X |
MapType of primitives | X | X |
StructType of primitives | X | X |
ListType of Nested Types | X | |
MapType of Nested Types | X |
The python client does not currently support write capability