Choice Between BFD & NSF/NSR...Looking for clarity

Deepak Arora

unread,

Dec 30, 2013, 10:54:01 PM12/30/13

to ccdegro...@googlegroups.com

> How to balance between NSF/NSR and BFD as selection. Where NSF says Route Through vs BFD as Route around. So does that mean go for NSF if single home and BFD if dual home ?

What if I have two Cat 6500 (@ Distribution layer ) for example with Dual SUP each and BFD/NSF-NSR capability, with Dual home connectivity upstream towards Core?

Any detailed document talking about these two and also covering in terms of design choices?

Also I went through couple of Cisco Live videos. But none of those explain how do I pick my BFD mode and timers. They always says its a network design choice but I don't find someone explaining Maths to come out with a equation for real world.

Eric Kalisek

unread,

Jan 28, 2014, 7:56:20 PM1/28/14

to ccdegro...@googlegroups.com

BFD and NSF accomplish two separate goals and are not mutually exclusive. As the name implies, BFD is used to detect a bidirectional forwarding issue when link status cannot be used to detect a failure. In the case of the L3 links from distribution to core - these are likely P2P links, so if one side were to be down, the other side would also be down. BFD would not help in this instance because link status is end to end - detection would be interrupt driven and much faster than BFD. However, if there were a switch in-between your L3 links, link status would not be end to end and you would need to rely on your protocol timers to detect the failure. You would use BFD in this case to detect the failure and notify the protocol that the adjacency should be torn down.

As far as timers, you will likely not find a recommended value as it really is dependent on your environment and use case. Faster timers will detect failure faster but at the expense of overall stability. I have typically used a 1 second hello interval with a multiplier value of 3 or 4. I have not found the need to tune timers any less than that.

Check out BRKDCT-2333 "Data Center Network Failure Detection" is a good resource for BFD and other failure detection mechanisms.

NSF is used to prevent topology change after a supervisor switchover. A control-plane switchover on a non-NSF capable device would require the L3 adjacencies to be re-established which results in topology change and temporary loss of connectivity until the network converges. NSF allows traffic to continue being forwarded along the old path while the adjacencies are gracefully refreshed after switchover. The remote device must be NSF aware for the process to work.

The book Optimal Routing Design provides a good explanation of how NSF works for a number of protocols.

Hope this helps.

Eric

Deepak Arora

unread,

Jan 29, 2014, 12:55:05 AM1/29/14

to ccdegro...@googlegroups.com

Dear Eric Thanks For Writing. But I already understood the BFD and NSF/NSR with SSO concept. Idea was how these two fit together if I have both of tools available. But I solved the puzzle later in terms of figuring out where to position these in deployment with help of following paper:

http://www.cisco.com/en/US/technologies/tk869/tk769/technologies_white_paper0900aecd801dc5e2.html

HTH...

Deepak Arora

Evil CCIE

http://deepakarora1984.blogspot.com

Reply all

Reply to author

Forward