Drill provides a mock record reader to generate test data in the package: org.apache.drill.exec.store.mock.
Mock data is available for physical plans. Here is a typical example:
{
@id:1,
pop:"mock-scan",
url: "http://apache.org",
entries:[
{records: 1000000, types: [
{name: "blue", type: "INT", mode: "REQUIRED"},
{name: "green", type: "INT", mode: "REQUIRED"}
]}
]
},
The JSON fields are:
@id: Standard required Physical OPerator (pop) ID.pop: Must be "mock-scan"url: Unusedentries: A list of schemas of the files to generate.The scanner supports parallelization based on the number of entries.
The fields for the schema are:
records: The number of records to generate.types: A list of columns (the “types” name is a misnomer.)Field schema fields include:
name: Field nametype: The Drill minor type as defined in MinorType.mode: The cardinality (mode) as defined in DataMode: OPTIONAL, REQUIRED or REPEATED.width: Optional field width (need only for variable-size types such as VARCHAR.)