Test data files for Parquet compatibility and regression testing.

TODO: Document the contents and purpose of each file.
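
Until the per-file notes exist, a quick inventory can help as a starting point. The sketch below is not part of this repository; the function names (looks_like_parquet, inventory) are hypothetical. It uses only the Python standard library and relies on one fact about the format: a Parquet file with a plaintext footer begins and ends with the 4-byte magic "PAR1". It does not validate the file beyond that.

```python
from pathlib import Path

# Parquet files with a plaintext footer start and end with these 4 bytes.
PARQUET_MAGIC = b"PAR1"


def looks_like_parquet(path: Path) -> bool:
    """Return True if the file starts and ends with the Parquet magic bytes."""
    size = path.stat().st_size
    # Minimum plausible size: two 4-byte magics plus a 4-byte footer length.
    if size < 12:
        return False
    with path.open("rb") as f:
        head = f.read(4)
        f.seek(-4, 2)  # whence=2 seeks relative to the end of the file
        tail = f.read(4)
    return head == PARQUET_MAGIC and tail == PARQUET_MAGIC


def inventory(directory: Path) -> list[tuple[str, int, bool]]:
    """List (name, size_in_bytes, looks_like_parquet) per regular file, sorted by name."""
    return [
        (p.name, p.stat().st_size, looks_like_parquet(p))
        for p in sorted(directory.iterdir())
        if p.is_file()
    ]


if __name__ == "__main__":
    # Hypothetical usage: run from the directory that holds the test files.
    for name, size, ok in inventory(Path(".")):
        print(f"{name}\t{size}\t{'parquet' if ok else 'other'}")
```

Run from this directory, the script prints one tab-separated line per file, which can be pasted here and annotated by hand. A file reported as "other" is not necessarily broken: it may be a non-Parquet fixture or an intentionally malformed regression case.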