When I’ve profiled this in the past, performance had almost nothing to do with the size of the documents (though I didn’t look at huge documents). The bottlenecks had more to do with the parallelism of the replicator code. I made a bunch of improvements at the time, but that was months ago (in the spring?) and there have been a lot of changes since.
There are also some limitations of the CouchDB sync protocol that reduce efficiency. The biggest one is that most of the time the replicator can only pull one revision per HTTP request. I have a partial workaround for that, but it only applies to first-generation revisions with no attachments.
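To get a feel for why that matters, here is a rough Python sketch that counts the HTTP requests a pull would need with and without batching. It is only a model, not the replicator's actual code: the names (Revision, plan_pull, request_count) and the batch size of 50 are assumptions made up for illustration. The one thing it relies on from CouchDB itself is that revision IDs have the form "<generation>-<digest>".

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Revision:
    """Hypothetical record for one revision the puller still needs."""
    doc_id: str
    rev_id: str  # CouchDB revision IDs look like "<generation>-<digest>"
    has_attachments: bool = False


def generation(rev_id: str) -> int:
    """Return the generation number, i.e. the prefix before the first '-'."""
    return int(rev_id.split("-", 1)[0])


def is_batchable(rev: Revision) -> bool:
    """Only first-generation revisions without attachments qualify."""
    return generation(rev.rev_id) == 1 and not rev.has_attachments


def plan_pull(
    revisions: List[Revision], batch_size: int = 50
) -> Tuple[List[List[Revision]], List[Revision]]:
    """Split revisions into bulk batches and one-at-a-time fetches.

    batch_size is an arbitrary illustrative value, not a real setting.
    """
    batchable = [r for r in revisions if is_batchable(r)]
    individual = [r for r in revisions if not is_batchable(r)]
    batches = [
        batchable[i:i + batch_size]
        for i in range(0, len(batchable), batch_size)
    ]
    return batches, individual


def request_count(revisions: List[Revision], batch_size: int = 50) -> int:
    """One request per batch plus one request per remaining revision."""
    batches, individual = plan_pull(revisions, batch_size)
    return len(batches) + len(individual)
```

With 120 brand-new documents, 5 new documents that have attachments, and 10 documents that have already been edited, this model needs 18 requests instead of 135. Anything that has been updated or carries an attachment still costs a full round trip, which is why the workaround is only partial.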
If you want to investigate, use Instruments to do a time profile of a sync. (Make sure to do it on a real device, not the simulator, because they have very different performance characteristics.)
—Jens