TreasureData.py should be run with the following arguments in the order listed below:
- Database name - REQUIRED
- Table name - REQUIRED
- Comma separated list of columns (e.g. 'column1,column2,column3โ) - REQUIRED
- minimum timestamp in unix timestamp - OPTIONAL
- maximum timestamp in unix timestamp - OPTIONAL
- query engine: 'hive' or 'presto' - OPTIONAL
- output format: csv or tabular - OPTIONAL
python TreasureData.py sample_datasets www_access host,path 1412121600 1423232700 presto tabular
python TreasureData.py sample_datasets www_access host,path 1412121600 1423232700 presto csv
Verify Output:
more TD_QueryResults.csv