Hi,
How do I specify a datasource as a directory of JSON files? Is there a CLI to ingest data?
In a regular ingestion spec, I can specify a "local" firehose configuration:
"firehose": {
  "type": "local",
  "baseDir": "examples/indexing/",
  "filter": "wikipedia_index_data*"
}
"ioConfig": {
  "type": "index_parallel",
  "firehose": {
    "type": "local",
    "baseDir": "/Users/mvenkat/Desktop/Deals/data",
    "filter": "*.json",
    "parser": null
  },
  "appendToExisting": false
},
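
To make the question concrete, here is a rough sketch of what I have in mind, not a confirmed recipe. It builds the same kind of "local" firehose ioConfig for a whole directory and posts a task spec to the Overlord's task endpoint (`/druid/indexer/v1/task`). The helper names `build_local_io_config` and `submit_task` are hypothetical, the Overlord address `http://localhost:8090` is an assumption about the deployment, and the rest of the task spec (`dataSchema`, `tuningConfig`) is not shown and would still have to be supplied.

```python
import json
import urllib.request


def build_local_io_config(base_dir, file_filter="*.json"):
    """Build an ioConfig that reads every file in base_dir matching file_filter.

    Hypothetical helper: it only mirrors the JSON fragment from the post.
    """
    return {
        "type": "index_parallel",
        "firehose": {
            "type": "local",
            "baseDir": base_dir,
            "filter": file_filter,
        },
        "appendToExisting": False,
    }


def submit_task(task_spec, overlord_url="http://localhost:8090"):
    """POST a complete task spec to the Overlord and return the parsed reply.

    overlord_url is an assumption; adjust it to wherever the Overlord
    (or Router) is listening in your setup.
    """
    request = urllib.request.Request(
        overlord_url + "/druid/indexer/v1/task",
        data=json.dumps(task_spec).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    io_config = build_local_io_config("/Users/mvenkat/Desktop/Deals/data")
    # Only the ioConfig is printed here; a full task spec also needs a
    # dataSchema before it can be passed to submit_task().
    print(json.dumps(io_config, indent=2))
```

Running the script only prints the ioConfig; calling `submit_task` requires a complete spec and a reachable Overlord.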