Paper Title
A Scalable, Commodity Data Center Network Architecture
Authors
Mohammad Al-Fares, Alexander Loukissas, Amin Vahdat
Date
SIGCOMM’08, August 17–22, 2008, Seattle, Washington, USA.
Novel Idea
Large-scale data centers usually have significant bandwidth requirement. However, in the traditional tree-like architecture the inter-node communication bandwidth has become the principle bottleneck. This paper presents an innovative data center network architecture addressing this problem. The novel idea is the fat-tree topology with commodity switches: the high fan-out of this architecture allows outgoing traffic of a single pod to be spread as evenly as possible among the core switches. Specifically, for each pair of hosts there are multiple paths connect them so that the demand of the core switches can be significantly reduced.
Impact
The architecture described in this paper is very attractive to data center design. It brings significant performance enhancement with commodity switches while has no other difficult requirement.
Evidence
The authors describe their experiments with a 4-port fat-tree network based on Click. The result shows that fat-tree architecture has better bandwidth and efficiency, which supports the initial claims.
Prior Work
The flat-tree and similar ideas are proposed in supercomputers and massively parallel processing communities.
Reproducibility
The experiment requires multiple machines, which makes it a little harder to reproduce the result. However, based on the scale of the problem they are trying to solve, I would say it's easier than it should be.
Comments
This paper provides an insight of the technologies in modern data centers, as well as their pros and cons and the problems in the data center design. It's fun to read and really helpful to understand the motivation of this paper (and many others!). The paper has nice figures and the model is well described.
The only complain I have is the experiment doesn't seem to be very convincing. The result would be much stronger if they can test on larger scale environment
Paper Title:
A Scalable, Commodity Data Center Network Architecture
Authors:
Mohammad Al-Fares, Alexander Loukissas, and Amin Vahdat
Date:
August 17-22, 2008.
Novel Idea:
The paper presents a data center communication architecture that leverages commodity Ethernet switches to deliver scalable bandwidth for large-scale clusters.
Basic requirements: (1) uses scalable interconnect bandwidth, (2) considers economy of scale, and (3) is backward compatible.
Main results:
The new architecture permits to deliver scalable bandwidth at significant lower cost than existing techniques (i.e.: extends the work of Massively parallel interconnects organized as fat-trees like the systems from Thinking Machines and SGI, Myrinet switches, and the InfiniBand interconnect).
Experiment Results:
- Two-level table switches achieve approximately 75% of the ideal bisection bandwidth for random communication patterns.
- The flow classifier outperforms both the traditional tree and the two level-table in all cases, with a worst-case bisection bandwidth of approximately 75%. The flow classifier has dynamic flow assignment and does re-allocation.
- The FlowScheduler act on global knowledge and tries to assign large flows to disjoint paths achieving 93% of the ideal bisection bandwidth for random communication mappings, and outperforming all other methods in all the benchmark tests.
- Effective Packaging to reduce wiring overhead eliminate most of the required external wiring and reduces the overall length of required cabling, which in turn simplifies cluster management and reduces costs.
Impact:
Less cabling and infrastructure in general entails a lean packaging solution that consumes less power and requires less heat dissipation
The fat-tree topology is fault-tolerant and the use of a simple failure broadcast protocol allows switches to route around link or switch failures bi-directionally between neighboring switches.
Furthermore, the scheduler marks any link reported to be down as busy or unavailable, thus disqualifying any path that includes it from consideration, in effect routing large flows around the fault.
Routing large flows represent the most important role in figuring out the achievable bisection bandwidth of a network. The researchers in this paper schedule large flows to minimize overlap with one another and a central scheduler makes this choice, with global knowledge of all active large flows in the network (i.e. using edge switches).
Evidence:
The paper shows that by interconnecting commodity switches in a fat-tree architecture it is possible to obtain the full bisection bandwidth of clusters consisting of tens of thousands of nodes. One instance of the prototyped architecture uses 48-port Ethernet switches, which can provide full bandwidth to up to 27648 hosts at lower cost than existing solutions.
Prior work:
Work inspired by the torus topology of IBM’s super computer BlueGene/L, the Cray XT3, and existing routing techniques like OSPF2, and ECMP (Equal Cost Multipath)
Question:
How do we program this new data center network architecture at a higher level of abstraction? It seems we are limited to program it at the switch and/or router level within a fat-free topology; will it work in and/or interact with a non-universal topology [1], which according to [1] seem to be more prevalent?
Furthermore, a centralized fat-tree topology, although scalable, might not be convenient for other applications that require a wider/distributed configuration.
[1] Charles E. Leiserson Fat-trees: universal networks for hardware-efficient supercomputing, IEEE Transactions on Computers, Vol. 34 , no. 10, Oct. 1985, pp. 892-901.
--
You received this message because you are subscribed to the Google Groups "CSCI2950-u Spring 13 - Brown" group.
To unsubscribe from this group and stop receiving emails from it, send an email to csci2950u-sp13-b...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.
Rodrigo
Rodrigo