Since
Google Cloud Datastore is a non-relational, NoSQL, highly-scalable database, you must create a script in the supported language of your choice that reads your CSV, converts your data into
Datastore Entities, and then saves those Entities to your Datastore. Once your data is loaded into the Datastore you can then easily export it and import it into other projects' Datastores via the
Managed Import/Export service.
The way to minimize costs in the Datastore is to use the least amount of
indexes as possible. You can think of an index as a sorted table. For every query you use, you need an associated sorted index (aka a copy of your data sorted specifically for that query). By default, an index is automatically created for each field of each
Entity Kind. Therefore to avoid having 200 automatically created indexes, it is recommended to mark all of the properties that you do not plan on querying as
un-indexed to save you money.
It is actually recommended to use
Datastore's automatic key generation when saving Entities, instead of creating custom keys. Datastore will ensure that your data is properly sharded and evenly distributed across Datastore servers in order to avoid hotspots and latency when specific entities are accessed a lot. Therefore, your link to Google Cloud Storage should be saved as an additional Entity property that will be returned when you query for an Entity. In general it is recommended to follow the
Best Practices for Datastore.
- Note that Google Groups is reserved for general product discussions and not for technical support. If you require further technical support for the Datastore, it is recommended to post
your detailed questions to
Stack Exchange using the supported Cloud tags.