--
You received this message because you are subscribed to the Google Groups "discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to discuss+u...@measurementlab.net.
To view this discussion visit https://groups.google.com/a/measurementlab.net/d/msgid/discuss/c8f16884-2978-449b-b656-64610472369dn%40measurementlab.net.
While the Unified Views (ndt.unified_uploads and ndt.unified_downloads) are the recommended long-term supported views, they are also significantly more resource-intensive — queries against them typically consume 10–20× more quota than queries against the underlying raw tables.
A good compromise, especially during the exploration phase of a project, is to query the ndt.ndt7 table (for data from 2020 onward) or the ndt5 and web100 tables (for earlier periods). These tables use a schema very similar to the unified views, and on large windows of analysis (aggregations of thousands of samples or more) they produce results that are statistically almost identical.
One important difference is that the raw tables contain both upload and download measurements. These can be separated using the raw field attributes. For example, in ndt7 filtering with
raw.Download.UUID IS NOT NULL selects only download tests
raw.Upload.UUID IS NOT NULL selects only upload tests
This approach allows researchers to conserve their daily quota while still producing high-quality, reproducible results.