Hey guys,
I'm working on a project in which a very large dataset (close to 10 gb) in SQLite files has to be queried - a lot.
The data has been precomputed - 0 writes, only a lot of reads.
Most of them are: SELECT some_attr FROM table WHERE some_str_attr LIKE %something%
i.e it requires a lot of partial searching of strings.
I'm really hopeful that CQEngine can offer better performance than SQLite, however I have some questions:
- Which index type should I use? I'm confused between Navigable, Hash, Radix, etc. Which is the best for string searches (contains, startsWith, endsWith, equals)
- Should I use on heap, off heap, or persistance storage?
Thank you.