In our case we have a data set which increases at the rate of about 52
million records per year, and 150,000 per day (updated twice per
day). If I uploaded the initial data set is it possible to append
additional data? Update and delete would be awesome though (I can
think of other uses).
Additionally is it possible to pivots on data using this query
language? I don't have an account (sad face) and I couldn't work it
out from the docs.
It was so amazing to see that the queries for 60B records are done in
a few seconds.
What kind of approach do you use inside it?
a. on-demand massively parallel query with thousands of nodes
b. pre-aggregation/computation (like building B-Trees for all the
columns) at the import time
c. any other unknown technique
And how long did it take to import the 60B records?
Thanks,
Kaz
On 5月26日, 午前3:52, Michael Sheldon <mshel...@google.com> wrote:
> Hello Alex,
>
> Thank you for the interest in BigQuery. Let me try and answer your
> questions:
>
> Yes, you can append additional records multiple times a day to your BigQuery
> Tables. (We intend to have a "continuous import" feature to make this easier
> by letting you configure your BigQuery Table to automatically import all
> data you push into your Google Storage for Developers bucket. However that
> feature is a bit further out.)
>
> Our query language does not support Pivots. Thanks for asking: we'll take
> that as a feature request.
>
> Regarding accounts:
>
> - You can start by making sure you have a Google Storage for Developers
> account. You can find the sign up link for this service at
> http://code.google.com/apis/storage/.
> - You can find the BigQuery sign up link near the bottom of
> http://code.google.com/apis/bigquery/. We had more interest in our
> service than we anticipated, so it will take a while before we can provision
> accounts for everyone interested. We will get back to everyone that signs
> up. Thanks for your patience!
>
> Cheers,
>
> --Michael Sheldon
>
Will BigQuery be available for "almost" real time reporting?
How long will it take (i.e. is it planned to take) for the data to be
"query-able" in BigQuery after the data has been uploaded to google
storage?
Any ideas on what the pricing model is going to be like? Per query,
per amount of data?
Yes, 60B is what really drew my attention to it :)
Will BigQuery be available for "almost" real time reporting?
How long will it take (i.e. is it planned to take) for the data to be
"query-able" in BigQuery after the data has been uploaded to google
storage?
Any ideas on what the pricing model is going to be like? Per query,
per amount of data?