Tracing all aggregations using a traceID in $match.$comment alters behaviour?

156 views

Skip to first unread message

Daniel R

unread,

Oct 6, 2017, 3:33:52 AM10/6/17

to mongodb-user

Hello,

We're implementing database operations tracing by adding a trace ID in the $comment or comment element of most queries right before sending them off to MongoDB through the Java driver so this ID may appear in the database log in case of slow queries.
For most operations that support this comment element this seems to work just fine but we have some concerns regarding aggregations. The docs in https://docs.mongodb.com/manual/reference/operator/query/comment/#op._S_comment and https://docs.mongodb.com/manual/reference/operator/query/comment/#ex-comment-agg-expression says "You can use the $comment with any expression taking a query predicate" so for aggregations it means the $match stage.

Our tracing code that handles aggregations looks for a $match stage in the pipeline and adds the comment if one is found and lets the operation proceed to the driver. If none is found, it adds a new $match stage with the comment as sole attribute as first element of the pipeline (this is because some of our aggregations don't make use of $match).
So, looking at the system.profile for debugging purposes, an aggregation would look like this:
Before tracing:
--------------
{
    "op" : "command", d
    "ns" : "cloudwife.cw_task_discussion",
    "command" : {
        "aggregate" : "cw_task_discussion",
        "pipeline" : [
            {   "$unwind" : "$comments" },
            { "$group" : { "_id" : "commentsCount", "count" : { "$sum" : NumberInt(1) } } }
        ]
    } , ...
After tracing it looks like this:
--------------------
{
    "op" : "command",
    "ns" : "cloudwife.cw_task_discussion",
    "command" : {
        "aggregate" : "cw_task_discussion",
        "pipeline" : [
            { "$match" : { "$comment" : "[amzn_trace_id:-]" } },
                        {   "$unwind" : "$comments" },
            { "$group" : { "_id" : "commentsCount", "count" : { "$sum" : NumberInt(1) } } }
        ]
    }, ...

- This newly added "dummy" $match stage should not alter the behaviour/performance of the aggregation...? I'm specifically wondering if it might cause the aggregation to use a different index. Any advice?

Thank you and kind regards,
Daniel Rodríguez

Kevin Adistambha

unread,

Oct 16, 2017, 9:40:22 PM10/16/17

to mongodb-user

Hi Daniel

This newly added “dummy” $match stage should not alter the behaviour/performance of the aggregation…? I’m specifically wondering if it might cause the aggregation to use a different index. Any advice?

I would suggest you to use the db.collection.explain() method to determine if the queries are using different indexes. For example, you can use db.collection.explain().aggregate(...) in the mongo shell with and without the dummy $match stage, and examine the explain() output of the query. Look for lines containing stage: IXSCAN and the following keyPattern entry to check the indexes used by both queries. This will give you a definitive answer whether or not the dummy $match is affecting index use or not.

For more details regarding explain(), please see Explain Results.

Best regards,
Kevin

Reply all

Reply to author

Forward

0 new messages