Hi everyone,
I am currently working on implementing and validating TCP Prague in ns-3. My topology uses a DualQCoupledPiSquare AQM at the bottleneck to simulate an L4S scenario.
Initially, I observed unexpectedly high RTT values (around 68ms) even though my bottleneck link was configured with a 10ms propagation delay and the DualQ statistics showed the L4S queue was almost empty.

After investigating, I discovered that the underlying NetDevice queue (the default DropTailQueue) was accumulating packets. I believe this accumulation was the actual cause of the extra delay, effectively hiding the congestion from the Traffic Control layer where the DualQueue resides.
I managed to fix this by setting the NetDevice queue size to a single packet:

