Sorry for the really late reply. Some messages may be going straight to my spam folder.
The Linux CFS is just one example of higher accuracy in CloudSim Plus. The CloudletSchedulerTimeShared is not realistic at all. It's an oversimplified implementation of a time-shared scheduler. With such an implementation, Cloudlets are never preempted. This way, there is no wait time at all. That implementation considers that all Cloudlets are always executed in parallel. If there aren't enough CPUs for all of them, it just splits the CPU MIPS capacity between all existing Cloudlets. In practice, if we have 2 CPUs with hyper-treading (HT) technology and 4 applications, we can still execute them in parallel since HT gives us 2 additional virtual CPUs. But if we have one more app, that will have to wait for the CPU.
If you want a more realistic scheduler, the CSF in CloudSim Plus provides that.
More details here.
The use of integration tests is an additional way to ensure CloudSim Plus is much more accurate than CloudSim. But the main reason is the large number of bugs that were fixed in CloudSim Plus. Just check the GitHub repo to confirm that.
The code duplication in CloudSim is the source of many issues and makes it really difficult to extend without needing to change the framework code. The amount of duplicated code in CloudSim is just scary. If you have a bug, that same bug probably exists elsewhere, mainly after the container support was implemented by duplicating entire packages of classes.
If you are talking about simulation results, I haven't seen any paper that compared simulation results with a testbed running in a real cloud infrastructure. I remember there is a PhD thesis from
a CloudSim's former member that executed some real testbed, but I don't know if the results between such experiments were compared. Anyway, CloudSim itself has a lot of bugs that may put in check the scientific validity of some experiment results presented in many papers, mainly those ones performed without using multiple simulation runs and computing confidence intervals for presented metrics.
If you have any more questions, feel free to keep in touch.