What are you connecting to from your application? Mongos
When you say, "our cluster loads an unreal amount of data", what exactly do you mean?
We have a 3 node sharded cluster. Each node has 96GB of Ram. When we first start our data loader, we have seen numbers in the range of 4 - 5 million documents loaded in less than an hour. Then it slows down to something in the range of 1 - 2 once Mongo starts to cache.
Where does the data come from? Is it evenly spread across your cluster?
The data comes from our systems reporting their transaction log files to the cluster. All data comes into the first node?
When you say, "ram fills up, the chunks starts building up" - RAM on which machine?
RAM on the Primary Shard (The 2nd Node). The chunks will begin to become unbalanced and the primary shard will no distribute the chunks.
So in summary:
Node 1: Mongod, Mongos, App which loads the data
Node 2: Mongod, Mongos, Primary Shard
Node 3: Mongod, Mongos, just a Node
Each Node: 96GB Ram, 24 Hyperthreaded Cores, 6 - 300GB SAS Drives 16MB Cache 10K RPM (Not High Performance Drives) Raid-10 EXT-4, 10000 mb Network connections