Hi there
1. We have about 300 million rows of data every day, about 160-200 bytes per row.
2. Once the data have been written, they won't be modified, but only be deleted.
3. Relative small load of reads, and only read by primary key and filter by time (e.g., batch read to do indexing)
4. We are about to save the data for 6 months to 1 year, and will delete old data out of the date range.
So my questions:
1. Is Scylla good for our use case?
2. if yes, what about the hardware requirements?
Thanks--
You received this message because you are subscribed to the Google Groups "ScyllaDB users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to scylladb-users+unsubscribe@googlegroups.com.
To post to this group, send email to scylladb-users@googlegroups.com.
Visit this group at https://groups.google.com/group/scylladb-users.
To view this discussion on the web visit https://groups.google.com/d/msgid/scylladb-users/caab8ff4-d50f-46c0-b7be-3a380f13e0c6%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
On Thu, Dec 29, 2016 at 7:32 PM, Yatong Zhang <blue...@gmail.com> wrote:Hi there
1. We have about 300 million rows of data every day, about 160-200 bytes per row.
2. Once the data have been written, they won't be modified, but only be deleted.
3. Relative small load of reads, and only read by primary key and filter by time (e.g., batch read to do indexing)
4. We are about to save the data for 6 months to 1 year, and will delete old data out of the date range.
So my questions:
1. Is Scylla good for our use case?I'm biased but it's the best.2. if yes, what about the hardware requirements?You have 300M rows * 200B * 365 days = ~22TB.Assuming you would like to use replication of 3, you'll have 66TB.I'd use 9 good beefy nodes that can store 1/9th of the above data.Use SSD/NVMe and 10GE.
Thanks
--
You received this message because you are subscribed to the Google Groups "ScyllaDB users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to scylladb-user...@googlegroups.com.
To post to this group, send email to scyllad...@googlegroups.com.
HI Dor,
Thanks for the reply. We're going to prepare 10 boxes to test Scylla. Based on our use case:
1.What are the requirements of memory and CPU cores? Are 128G ram and 8 cores (16 hyper-threading) sufficient?
2.What about the compaction strategy? date-tirered or disable compations? Since out data are 'read-only', is disabling compaction better?
To unsubscribe from this group and stop receiving emails from it, send an email to scylladb-users+unsubscribe@googlegroups.com.
To post to this group, send email to scylladb-users@googlegroups.com.
Visit this group at https://groups.google.com/group/scylladb-users.
To view this discussion on the web visit https://groups.google.com/d/msgid/scylladb-users/1cc9a036-5958-4fe9-9174-055416e19e7f%40googlegroups.com.