Hello Lai Yi,
Am I correct in understanding that two things motivate this change?
- The complexity of parsing the multipath traceroute data structure
- The time that it takes to run the MDA in order to collect a multipath traceroute
If so, there are alternatives to abandoning the collection of multipath traceroutes. After all, M-Lab has the largest set of multipath traceroutes (well over one billion!) going back over many years. Anyone who wants to study how multipath routing has evolved over time in the internet would be hard-pressed to find any other comparable dataset.
Regarding the complex data structure, a single-path traceroute could easily be extracted from a multipath traceroute via post-processing. It would not be necessary to conduct the measurement twice.
And regarding the time that it takes, the problem has been solved in Kevin Vermeulen's Diamond-Miner work.
https://www.usenix.org/conference/nsdi20/presentation/vermeulenWhile Kevin's work focuses on rapidly collecting traceroutes towards all of the internet's routable prefixes in a short period of time, the principles behind the speed-up apply equally well to a trace towards a single destination. In a first round of probing, packets to all hop-counts can be sent in parallel, and ten rounds of probing are almost always enough to complete a multipath traceroute, meaning the time required is slightly
less than the time required for a classic traceroute, which has a time requirement that scales with the length of the route.
We've distilled the Diamond-Miner probing engine into free open-source liberally licensed code that we call Caracal.
https://github.com/dioptra-io/caracalWith Caracal, we could easily speed up M-Lab's multipath route tracing while producing both multipath and single path outputs in the existing formats that M-Lab provides.
Incidentally, we are producing daily surveys from a single vantage point of multipath traceroutes to all routable IPv4 prefixes in the internet, and these are available to any researcher upon request through our Iris platform.
https://iris.dioptra.io/#/So if M-Lab continues to collect multipath traceroutes, it will no longer be alone in doing so, which should enhance the value of M-Lab's multipath data for the research community.
Kind regards,
Timur