Long-running compact question

873 views

Daniel Doyle

Dec 1, 2017, 4:40:13 AM12/1/17
to mongodb-user
Hi guys,

I have what I believe is a heavily fragmented, very large collection. There have been millions of deletes made to it over the past year, and we're just now looking at doing a compact operation. I let it run for about 3 hours and couldn't find any indication of progress in the logs or currentOp(), so I eventually killed it to restore service.

MongoDB 3.2.18, WiredTiger

A few questions:

1 - is there supposed to be any indication of how it's going? I did observe the system was doing a lot of reads and occasional write bursts, and the mtimes on collection files were changing, so I suspect it was doing something.

2 - Is this something I can do multiple times where it will progress each time, or will I have to take some massive outage to let it finish?

Thanks,
Dan

Daniel Doyle

Dec 1, 2017, 4:48:25 AM12/1/17
to mongodb-user
Also forgot to ask:

3 - I didn't see the files change size, nor are the indexes reported any different than before. Is this something that happens only at the end of the compact operation?

Kevin Adistambha

Dec 3, 2017, 10:52:27 PM12/3/17
to mongodb-user

Hi Daniel

In WiredTiger, the compact command attempts to release space that is unused by WiredTiger back to the operating system. Although WiredTiger was designed to minimize fragmentation in your data files, fragmentation can still accumulate under certain workloads.

The space available for reuse by WiredTiger can be seen in the output of db.collection.stats().wiredTiger['block-manager']['file bytes available for reuse']. Depending on your specific workload, document size, and so on, this space could potentially be released to the operating system using the compact command, but there is no guarantee.
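As a concrete illustration of the check above, the snippet below summarizes reclaimable space from a stats document shaped like the output of db.collection.stats(). It runs in the mongo shell (or standalone); the sample numbers are invented purely for illustration:

```javascript
// Sketch: estimate how much of a collection's file compact could
// reclaim, at best, from WiredTiger block-manager statistics.
function reuseSummary(stats) {
  var bm = stats.wiredTiger['block-manager'];
  var fileBytes = bm['file size in bytes'];
  var reusable = bm['file bytes available for reuse'];
  return {
    fileBytes: fileBytes,
    reusable: reusable,
    // Upper bound on the fraction compact could return to the OS
    reusableFraction: reusable / fileBytes
  };
}

// Hypothetical stats document with made-up numbers:
var sample = {
  wiredTiger: {
    'block-manager': {
      'file size in bytes': 100 * 1024 * 1024,
      'file bytes available for reuse': 25 * 1024 * 1024
    }
  }
};
// reuseSummary(sample).reusableFraction -> 0.25
```

In the shell you would pass db.collection.stats() directly instead of the hypothetical sample document.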

is there supposed to be any indication of how it’s going? I did observe the system was doing a lot of reads and occasional write bursts and the mtimes on collection files were changing, so I suspect it was doing something.

No, I don’t believe there is a progress bar or similar indicator. It could take a while to finish if you have a large dataset.

Is this something I can do multiple times where it will progress each time, or will I have to take some massive outage to let it finish?

It’s best to wait for it to finish. It was not designed to be interruptible.

I didn’t see the files change size, nor are the indexes reported any different than before. Is this something that happens only at the end of the compact operation?

It is also possible that compact was not able to free up any disk space. This can happen when WiredTiger is already filling the deleted space effectively, so compacting the data files would yield minimal disk space gain.

Best regards
Kevin

Daniel Doyle

Dec 4, 2017, 9:43:33 AM12/4/17
to mongodb-user
Hi Kevin,

Thanks, I appreciate the information. To clarify the exact use case: the application has changed and we deleted a massive amount of data. The output of 'file bytes available for reuse' is over 1 TB, and based on a test we did using mongodump/mongorestore we would also gain back a substantial amount of index space, on the order of 10 GB. Note that the dump/restore process took days and was done only to "proof" the concept; it won't work for production.
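For context, a per-database total like the 1 TB figure above can be computed by summing the per-collection statistics. This is a hypothetical sketch: in the mongo shell the stats documents would come from db.getCollectionNames().map(function (n) { return db[n].stats(); }), but here they are stand-in objects with invented numbers:

```javascript
// Sketch: total 'file bytes available for reuse' across a set of
// collection stats documents (same shape as db.collection.stats()).
function totalReusableBytes(statsDocs) {
  return statsDocs.reduce(function (sum, s) {
    return sum + s.wiredTiger['block-manager']['file bytes available for reuse'];
  }, 0);
}

// Stand-in stats documents (numbers invented for illustration):
var docs = [
  { wiredTiger: { 'block-manager': { 'file bytes available for reuse': 600 * 1e9 } } },
  { wiredTiger: { 'block-manager': { 'file bytes available for reuse': 500 * 1e9 } } }
];
// totalReusableBytes(docs) -> 1.1e12 bytes, i.e. over 1 TB
```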

We are not anticipating needing to reuse this space, and it is actually causing some sizing concerns. One thought I had was to run compact for several hours on the secondary, interrupt it to let replication catch back up, and cycle like that until it finishes. This is based on my (perhaps too deep) reading of the following: https://docs.mongodb.com/manual/reference/command/compact/#operation-termination

I do not know exactly how long it would take to finish, but I suspect it may be beyond the oplog window, which I think means we simply cannot do it. If compact finishes outside the oplog window, the resulting dataset is invalid anyway. That is why I was hoping to get some indicator of progress - otherwise we're just guessing. I suppose we can try it and, if it fails, do a manual resync to get back to where we are right now.
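One way to sanity-check the "beyond the oplog window" concern is to compare the oplog window, which rs.printReplicationInfo() reports as the span between the oldest and newest oplog entries, against the expected compact duration. A trivial helper, with invented timestamps:

```javascript
// Sketch: compute the oplog window in hours from the oldest and newest
// oplog entry times (rs.printReplicationInfo() shows these as
// "oplog first event time" / "oplog last event time").
function oplogWindowHours(tFirst, tLast) {
  return (tLast - tFirst) / (1000 * 60 * 60);
}

// Invented example: a 2.5-day window
var first = new Date('2017-12-01T00:00:00Z');
var last = new Date('2017-12-03T12:00:00Z');
// oplogWindowHours(first, last) -> 60
// If the estimated compact runtime exceeds this, the secondary falls
// off the oplog during the operation and would need a full resync.
```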

I am certainly open to other ideas or suggestions as well.

Kevin Adistambha

Dec 4, 2017, 10:20:31 PM12/4/17
to mongodb-user

Hi Daniel

It sounds like you’re overhauling your application and have ended up with what is essentially a new database. Since you’re running WiredTiger, the space will eventually be reused by new data, so unless your disk space is at a critical point I would just leave it to WiredTiger to reuse. Please note that unused space may be returned to the operating system when WiredTiger completes a checkpoint (by default, every 60 seconds or 2 GB of data as of MongoDB 3.4; see Snapshots and Checkpoints). Having said that, space reclamation is not instant and may depend on your workload.

One thought that I had was to run compact for several hours on the secondary, interrupt it to let replication catch back up, and cycle as such until it finishes.

I would advise against this approach, since compact was never designed to be interruptible. There is a significant risk that your database could end up in an undefined state.

That is why I was hoping to get some indicator of progress

There is a feature request for this in SERVER-24618. Please comment on or upvote the ticket to raise awareness of this issue and your use case.

Best regards
Kevin
