Hi,
I have a CSV file with about 20 columns, but I'm only interested in 3 of them.
I tried:
fields = bubbles.FieldList(
["Project Number", "string"],
["Description", "string"],
["Country", "string"]
)
p = bubbles.Pipeline(stores=stores)
p.source_object("csv_source", resource=URL,fields=fields, infer_fields=False)
p.pretty_print()
p.run()
+--------------+-----------+----------------------------------------------------------------------------------------------------+
|Project Number|Description|Country |
+--------------+-----------+----------------------------------------------------------------------------------------------------+
|A018823001 |2011-11-10 |National Water Quality and Availability Management Program |
|A019362001 |2012-05-01 |Microfinance Services |
|A020246001 |2013-07-23 |Popular Economy Building |
So..this is close, but it doesn't quite work. And even if it did, it would make more sense to filter out the fields in the target object. But how to do this?