Internet DRAFT - draft-dubois-bgp-pm-reqs
draft-dubois-bgp-pm-reqs
Network Working Group N. Dubois
B. Decraene
B. Fondeviole
France Telecom
Internet Draft Z. Ahmad
Equant
Document: draft-dubois-bgp-pm-reqs-02.txt July 2005
Expiration Date: January 2006
Requirements for planned maintenance of BGP sessions
draft-dubois-bgp-pm-reqs-02.txt
Status of this Memo
By submitting this Internet-Draft, each author represents that any
applicable patent or other IPR claims of which he or she is aware have
been or will be disclosed, and any of which he or she becomes
aware will be disclosed, in accordance with Section 6 of BCP 79.
Internet Drafts are working documents of the Internet Engineering
Task Force (IETF), its Areas, and its Working Groups. Note that other
groups may also distribute working documents as Internet Drafts.
Internet Drafts are draft documents valid for a maximum of six
months. Internet Drafts may be updated, replaced, or obsoleted by
other documents at any time. It is not appropriate to use Internet
Drafts as reference material or to cite them other than as a "working
draft" or "work in progress".
The list of current Internet-Drafts can be accessed at
http://www.ietf.org/ietf/1id-abstracts.txt
The list of Internet-Draft Shadow Directories can be accessed at
http://www.ietf.org/shadow.html.
Abstract
To ease the maintenance of BGP [BGP] sessions and limit the amount
of traffic that is lost during planned maintenance operations on
routers, a solution is required in order to gracefully shutdown a
router or a session.
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
"SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
document are to be interpreted as described in RFC 2119 [1].
Dubois Expires January 2006 [Page 1]
Internet Draft BGP planned maintenance requirements July 2005
Table of Contents
1. Introduction.........................................2
2. Problem Statement....................................2
3. Terminology..........................................3
4. Goals and Requirements...............................3
5. Scope................................................5
6. Example..............................................5
7. Reference Topologies.................................7
8. Security Considerations..............................9
7. Intellectual Property Statement......................9
8. Security Considerations..............................10
9. Intellectual Property Considerations.................10
10.Acknowledgments......................................11
11.References...........................................11
12.Authors' Addresses:..................................11
1. Introduction
The BGP protocol is heavily used in Service Provider networks. For
resiliency purposes, most of the IP network operators deploy
redundant routers and BGP sessions to minimize the risk of BGP
session breakdown towards their customers or peers.
In a context where a Service Provider wants to upgrade or remove a
particular router that maintains one or several BGP sessions, our
requirement is to avoid customer or peer traffic loss as much as
possible. It should be made possible to reroute the customer or peer
traffic before the maintenance operation occurs and BGP session is
torn down.
Currently, the BGP specification does not include any operation to
prevent traffic loss in case of planned maintenance.
A successful approach of such mechanism should indeed minimize the
loss of traffic in most foreseen maintenance situations. It should
be easily deployable and if possible, provide backward
compatibility.
2. Problem Statement
Currently, when one (or many) BGP session needs to be shut down, a
BGP NOTIFICATION message is sent to the peer and the session is then
closed. A protocol convergence is then triggered both in the local
router and in the peer. Alternate routes to the destination are
selected, if available.
Dubois Expires January 2006 [Page 2]
Internet Draft BGP planned maintenance requirements July 2005
This behavior is not satisfactory in a maintenance situation because
customer's (or peerÆs) traffic that was directed towards the removed
next-hops is lost until the end of BGP convergence. As it is a
planned operation, a make before break solution should be made
possible.
As maintenance operations are frequent in large networks, the global
availability of the network is significantly impaired by the BGP
maintenance issues.
3. Terminology
Maintained router: The router undergoing maintenance, closing (a)
BGP session(s) and causing the rerouting.
Peer routers: Routers which have a BGP peering session with the
Maintained router.
Impacted routers: Routers which use the Maintained router as a BGP
Next Hop.
4. Goals and Requirements
When some or all BGP sessions of a Maintained router need to be
administratively shut down, instead of sending a BGP NOTIFICATION
message and/or tearing the TCP session down, our goal is to achieve
the following behavior:
First problem : session stops
Step 1:
A mechanism is implemented on the Maintained router in order to
gracefully reroute packets towards and from the BGP next-hop that is
going to be unavailable.
By doing so, packets are rerouted before the maintenance operation
and no packet is lost for all the destination prefixes for which an
alternate route is available. The proposed solution MAY be designed
in order to avoid transient routing loops.
Step 2:
Once traffic is correctly rerouted BGP sessions are shutdown.
Second problem: session starts
Step 3:
Once maintenance operation has been completed, a mechanism may be
implemented to gracefully restore traffic to the original path
avoiding transient routing loops.
Dubois Expires January 2006 [Page 3]
Internet Draft BGP planned maintenance requirements July 2005
Summary:
As a result, if another router provides an alternate path towards a
set of destination prefixes, the packets are rerouted before the BGP
session termination and no packet is lost during BGP convergence
process, since both the forwarding and the Loc-RIB tables are kept
while the peers are re-computing their forwarding tables.
From the above goals we can derive the following requirements:
a/ A mechanism to advertise the maintenance action to all Impacted
routers is REQUIRED. Such mechanism may be either implicit or
explicit.
Note that Impacted routers can be located in adjacent ASes. The
proposed solution MAY be designed in order to avoid transient
routing loops.
b/ It is REQUIRED that the Maintained router implements a mechanism
to keep the forwarding for the NLRI undergoing maintenance until all
reroutable packets has been rerouted.
c/ A mechanism may be needed to indicate the end of the graceful
maintenance operation. The proposed solution MAY be designed in
order to avoid transient routing loops.
d/ An Internet wide convergence is NOT REQUIRED. However the local
AS and its directly connected peersÆ ASes MUST be able to gracefully
converge before the service interruption.
e/ The proposed solution SHOULD be applicable to all kinds of BGP
sessions (e-BGP/MP-eBGP, i-BGP/MP-iBGP and i-BGP/MP-iBGP route
reflector client) and any address family. Depending on the session
type, there may be some variation in the proposed solution in order
to fit the requirement. If the BGP implementation allows closing a
sub-set of AFIs carried in a MP-BGP session, this mechanism is
applicable to this sub-set of AFI identifiers. However the following
cases should be handled first:
- The maintenance of one particular e-BGP/MP-eBGP session.
- The reload of one AS border router.
- The shutdown of PE <-> CE links (Static & eBGP) in a MPLS-VPN
environment.
f/ The proposed solution SHOULD not change the BGP convergence
behavior for the ASes exterior to the maintenance process. An
incremental deployment on a per AS basis MUST be made possible. It
means that the proposed solution SHOULD be interoperable with the
current BGP implementation and SHOULD improve the maintenance
process even when one of the two ASes does not support graceful
maintenance. In particular, large BGP/MPLS VPN Service Providers may
not be able to upgrade all of the deployed CEs. The solution SHOULD
Dubois Expires January 2006 [Page 4]
Internet Draft BGP planned maintenance requirements July 2005
improve the behavior during planned maintenance even with Vanilla
CEs.
g/ If possible, redistribution of static IP routes into iBGP/MP-iBGP
SHOULD also be covered. Indeed, static routes are often used between
PE and CE in a BGP/MPLS VPN environment.
5. Scope
The purpose of this requirement is neither to solve all the
convergence issues that may arise within the Internet nor to modify
the convergence properties of the BGP protocol.
The Example section illustrates typical and important cases where
this requirement should be applicable and tries to make it more
understandable.
In addition a Reference Topologies section presents some BGP
topologies (both i-BGP and e-BGP) and confronts them to the
requirement. These topologies SHOULD be used to test the proper
behavior of proposed solutions.
6. Example
Purpose of this section is to give one typical example. It should
help the reader to understand how graceful maintenance will enhance
the availability of the inter provider BGP connections.
Let us consider the following example (Figure 1 below) where one
customer router (denoted as "CUST" in the figure) is dual-homed to
two SP routers, denoted as "ASBR1" and "ASBR2". ASBR1 and ASBR2 are
in the same AS and owned by the same service provider.
Dubois Expires January 2006 [Page 5]
Internet Draft BGP planned maintenance requirements July 2005
'
'
'
AS1 ' AS2
'
/-----------ASBR1-----P1----
/ |
/ |
CUST |
\ |
X.Y/16 \ |
\-----------ASBR2-----P2----
'
'
AS1 ' AS2
'
Figure 1: Redundant peering example.
Packets are normally conveyed by the CUST-ASBR1 link. Let's assume
the service provider wants to shutdown ASBR1 for maintenance
purposes.
The behavior as defined in [BGP] is:
1. ASBR1 tears down all of its BGP sessions.
2. As a result, it removes all the BGP routes from its RIB and FIB
tables.
3. Peers routers remove all the routes that were announced by the
shutting down peer and advertise the failure to all their BGP peers.
These peers are likely Impacted routers.
4. Impacted routers, receive BGP update messages, perform a BGP
selection process and update their RIB and FIB accordingly.
During Impacted routers' convergence:
- CUST continues to send packets to ASBR1. ASBR1 drops these
packets because it has no route to destination.
- P1 and possibly P2 continue to send traffic to ASBR1. ASBR1
drops this traffic because it has no route to CUST (X.Y/16).
From the customer's point of view, packets are lost during the BGP
convergence time.
With the required behavior defined in section 4 [Goals and
Requirements]:
- On all of its BGP sessions, ASBR1 signals a maintenance
according to the requirement defined in section 4-a.
Dubois Expires January 2006 [Page 6]
Internet Draft BGP planned maintenance requirements July 2005
- During the BGP convergence of all Impacted routers, ASBR1
keeps forwarding customer traffic in both directions.
- Once traffic has been rerouted, ASBR1 closes its BGP sessions
with its peers. No packet is lost.
7. Reference Topologies
In order to benchmark the proposed solutions, some typical BGP
topologies are detailed. Solutions SHOULD be applicable to all
topologies described below.
A solution draft should study the applicability of its solution for
each of these 9 (3 E-BGP * 3 I-BGP) possible topologies.
Terminology used in this section is inspired from RFC 2547. We use
PE (provider edge router) and CE (customer edge router). However the
scope of applicability is broader and can be transposed to any inter
ûAS BGP peering solution.
7.1. E-BGP/MP-eBGP topologies
Topology 1CE <-> 2PE:
'
AS1 ' AS2
'
/-----------Router21
/ '
/ '
Router11 '
\ '
\ '
\-----------Router22
'
'
AS1 ' AS2
'
In this topology we have an asymmetric protection scheme between AS
1 and AS 2:
- On AS 2 side, two different routers have been used to connect
to AS 1.
- On AS 1 side, one single router with two BGP sessions is
used.
The requirement of section 4 should be applicable to:
- Maintenance of one of the routers of AS2.
- Maintenance of the router of AS1.
- Maintenance of one of the two sessions between AS1 and AS2.
Dubois Expires January 2006 [Page 7]
Internet Draft BGP planned maintenance requirements July 2005
Topology 2CE <-> 2PE:
'
AS1 ' AS2
'
Router11-----------Router21
'
'
'
'
'
Router12-----------Router22
'
AS1 ' AS2
'
In this topology we have a symmetric protection scheme between AS1
and AS2: On both sides, two different routers have been used to
connect AS1 to AS2.
The requirement of section 4 should be applicable to:
- Maintenance of any of the routers (in AS1 or AS2).
- Maintenance of one of the two sessions between AS1 and AS2.
Topology 2CE <-> 2ISP:
'
AS1 ' AS2
'
Router11-----------Router21
| '
| '
'''''|''''''''''
| '
| '
Router31-----------Router22
'
AS3 ' AS2
In this topology the protection scheme between AS1 and AS2 is not as
symmetric as in the two previous topologies. Depending on which
routes are exchanged between the 3 ASes, some protection for some of
the traffic may be possible.
The requirement of section 4 does not translate as easily as in the
two previous topologies because we do not require propagating the
maintenance advertisement in the Internet.
For instance if Router22 requires a maintenance impacting Router31,
then Router31 will be notified. However we do not require for
Router11 to be notified.
Dubois Expires January 2006 [Page 8]
Internet Draft BGP planned maintenance requirements July 2005
7.2. I-BGP/MP-iBGP topologies
We describe here some frequent i-BGP topologies.
Indeed maintenance of an e-BGP session needs to be propagated within
the AS so the solution may depend on the specific i-BGP/MP-iBGP
topology.
Topology "Full-Mesh":
It is a full iBGP mesh topology as represented below.
P1 -------- P2
|\ /|
| \ / |
| \ / | AS1
| / \ |
| / \ |
ASBR1------ASBR2
\ /
\ /
''''''\''''''/''''''''''''
\ /
\ / AS2
CE
When the session between CE and ASBR1 undergoes maintenance, it is
required that all i-BGP peers of ASBR1 reroute traffic to ASBR2
before the session between ASBR1 and CE is shut down.
Topology "RR":
P1 RR----- P2 RR
|\ /|
| \ / |
| \ / | AS1
| / \ |
| / \ |
ASBR1 ASBR2
\ /
\ /
''''''\''''''/''''''''''''
\ /
\ / AS2
CE
Dubois Expires January 2006 [Page 9]
Internet Draft BGP planned maintenance requirements July 2005
In this topology, route reflectors are used to limit the number of
i-BGP sessions.
When the session between CE and ASBR1 undergoes maintenance, it is
required that all BGP routers of AS1 reroute traffic to ASBR2 before
the session between ASBR1 and CE is shut down.
Topology "hierarchical RR":
In this topology, hierarchical route reflectors are used to limit
the number of i-BGP sessions.
P1/hRR -------- P2/hRR
| |
| |
| | AS1
| |
| |
P1/RR -------- P2/RR
| |
| |
| | AS1
| |
| |
ASBR1 ASBR2
\ /
\ /
''''''\'''''''''/''''''''''''
\ /
\ / AS2
CE
8. Security Considerations
Security consideration MUST be addressed by the proposed solutions.
9. Intellectual Property Considerations
The IETF takes no position regarding the validity or scope of any
intellectual property or other rights that might be claimed to per-
tain to the implementation or use of the technology described in this
document or the extent to which any license under such rights might
Dubois Expires January 2006 [Page 10]
Internet Draft BGP planned maintenance requirements July 2005
or might not be available; neither does it represent that it has made
any effort to identify any such rights. Information on the IETF's
procedures with respect to rights in standards-track and standards-
related documentation can be found in BCP-11. Copies of claims of
rights made available for publication and any assurances of licenses
to be made available, or the result of an attempt made to obtain a
general license or permission for the use of such proprietary rights
by implementors or users of this specification can be obtained from
the IETF Secretariat.
The IETF invites any interested party to bring to its attention any
copyrights, patents or patent applications, or other proprietary
rights which may cover technology that may be required to practice
this standard. Please address the information to the IETF Executive
Director.
10. Acknowledgments
Authors would like to thank Christian Jacquenet, Olivier
Bonaventure, Steve Uhlig, Xavier Vinet, Vincent Gillet and Jean-
Louis le Roux for the useful discussions on this subject, their
review and comments.
11. References
[BGP] Y. Rekhter, T. Li,
"A Border Gateway protocol 4 (BGP)", RFC 1771, March 1995.
[MP-BGP] T. Bates, Y. Rekhter, R. Chandra, D. Katz,
"Multiprotocol Extensions for BGP-4", RFC 2858 June 2000.
12. Author's Addresses
Nicolas Dubois
France Telecom
24, rue du G‰n‰ral Bertrand
75007 Paris
France
Email: nicolas.dubois@francetelecom.com
Bruno Decraene
France Telecom
38-40 rue de general Leclerc
92794 Issy Moulineaux cedex 9
France
Email: bruno.decraene@francetelecom.com
Benoit Fondeviole
France Telecom
38-40 rue de general Leclerc
Dubois Expires January 2006 [Page 11]
Internet Draft BGP planned maintenance requirements July 2005
92794 Issy Moulineaux cedex 9
France
Email: benoit.fondeviole@francetelecom.com
Zubair Ahmad
Equant
13775 McLearen Road, Oak Hill VA 20171
USA
Email: zubair.ahmad@equant.com
Full Copyright Statement
Copyright (C) The Internet Society (2005). This document is subject
to the rights, licenses and restrictions contained in BCP 78, and
except as set forth therein, the authors retain all their rights.
This document and the information contained herein are provided on an
"AS IS" basis and THE CONTRIBUTOR, THE ORGANIZATION HE/SHE REPRESENTS
OR IS SPONSORED BY (IF ANY), THE INTERNET SOCIETY AND THE INTERNET
ENGINEERING TASK FORCE DISCLAIM ALL WARRANTIES, EXPRESS OR IMPLIED,
INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF THE
INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED
WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.
Dubois Expires January 2006 [Page 12]