--
You received this message because you are subscribed to the Google Groups "mongodb-user"
group.
For other MongoDB technical support options, see: http://www.mongodb.org/about/support/.
---
You received this message because you are subscribed to the Google Groups "mongodb-user" group.
To unsubscribe from this group and stop receiving emails from it, send an email to mongodb-user...@googlegroups.com.
To post to this group, send email to mongod...@googlegroups.com.
Visit this group at http://groups.google.com/group/mongodb-user.
To view this discussion on the web visit https://groups.google.com/d/msgid/mongodb-user/CAFbpF8McYs%3DUA16dCsOUDScKEwfGEy4VRe6gVANz-shzykH84w%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.
To view this discussion on the web visit https://groups.google.com/d/msgid/mongodb-user/CAOe6dJAN%2BoKwCXgw5OVAXhQycBTWy6nK8vD0Tq0rkL_QU28-2w%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.
On Mon, Mar 23, 2015 at 11:26 AM, MARK CALLAGHAN <mdca...@gmail.com> wrote:
> Does this mean that more space than expected will be used for wide covering indexes?
I'm not sure how to answer that precisely since I'm not sure how much space you would be expecting it to take.
> The linkbench schema is an example where there is a secondary index that has most of the columns from
> the table. It gets range scans and we want to avoid random IO back to the base table. I want to get more
> than prefix compression for that index.
> https://github.com/facebook/linkbench
I believe that linkbench (adapted for MongoDB) is one of the tests that our performance team uses to benchmark new vs old versions, etc. They also use "Socialite" - which takes advantage of covering compound indexes as well. I would expect index prefix compression to benefit compounded indexes even more than simple indexes.
I just did a quick check on some real world data in a replica set where two new secondaries were recently added (so they both have relatively compact indexes since they fully resync'ed) the secondaries are 3.0.1 mmapv1 and wiredTiger, default settings.
Indexes in two collections I compared range in size since they have single field, two field and three field indexes, as well as a couple of geoSpatial indexes. The compression that WT gives makes the indexes anywhere from half the size compared to Mmap to barely over 10%.
The highest level of compression was for a compound index but not longest (I.e. not most fields in index). As you can guess, selectivity of the leading fields will impact the compression in converse way (lower selectivity, highest compression).
Lowest compression, about 40%, was in one of two _id indexes, I didn't look at contents but I'm guessing it was totally or mostly random.
Majority of indexes were compressed down to about 20-30% of their mmap size, with a few in the 11-18% range.
I'm not sure how much better compression you can get on indexes, and given that you will lose some latency to compressing them on writes it seems unlikely to be a beneficial trade-off.
Asya
P.S. I suppose I can try to test whether the compressed indexes are further compressible but I'm not 100% certain what would be a most realistic test for that.
--
You received this message because you are subscribed to the Google Groups "mongodb-user"
group.
For other MongoDB technical support options, see: http://www.mongodb.org/about/support/.
---
You received this message because you are subscribed to the Google Groups "mongodb-user" group.
To unsubscribe from this group and stop receiving emails from it, send an email to mongodb-user...@googlegroups.com.
To post to this group, send email to mongod...@googlegroups.com.
Visit this group at http://groups.google.com/group/mongodb-user.
To view this discussion on the web visit https://groups.google.com/d/msgid/mongodb-user/CAOe6dJDogmxBqm0ypgmOnK1Hrb%3DRTodNKbdEjrsXoqStBWvbBw%40mail.gmail.com.
--wiredTigerIndexConfigString="block_compressor=snappy"