Internet DRAFT - draft-hao-bess-evpn-df-handshaking
draft-hao-bess-evpn-df-handshaking
BESS Weiguo Hao
Lucy Yong
Qiandeng Liang
Internet Draft Huawei
Intended status: Standard Track May 12, 2015
Expires: November 2015
Handshaking mechanism for DF election
draft-hao-bess-evpn-df-handshaking-02.txt
Abstract
In [EVPN], in the DF re-election transient period, due to Ethernet
Segment route transmission timer and timer clock discrepancy on each
PEs, dual DF PEs will co-exist, traffic loop and disruption will be
incurred. In MHN case, the consequences are particularly serious.
Handshaking mechanism for DF election is introduced in this draft to
resolve the problem.
Status of this Memo
This Internet-Draft is submitted to IETF in full conformance with
the provisions of BCP 78 and BCP 79.
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF), its areas, and its working groups. Note that
other groups may also distribute working documents as Internet-
Drafts.
Internet-Drafts are draft documents valid for a maximum of six
months and may be updated, replaced, or obsoleted by other
documents at any time. It is inappropriate to use Internet-Drafts
as reference material or to cite them other than as "work in
progress."
The list of current Internet-Drafts can be accessed at
http://www.ietf.org/1id-abstracts.html
The list of Internet-Draft Shadow Directories can be accessed at
http://www.ietf.org/shadow.html.
Hao & et,al Expires August 12, 2015 [Page 1]
Internet-Draft Handshaking mechanism for DF February 2015
Copyright Notice
Copyright (c) 2015 IETF Trust and the persons identified as the
document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal
Provisions Relating to IETF Documents
(http://trustee.ietf.org/license-info) in effect on the date of
publication of this document. Please review these documents
carefully, as they describe your rights and restrictions with
respect to this document. Code Components extracted from this
document must include Simplified BSD License text as described in
Section 4.e of the Trust Legal Provisions and are provided without
warranty as described in the Simplified BSD License.
Table of Contents
1. Introduction ................................................ 2
2. Problems in MHN scenario..................................... 3
3. Conventions used in this document............................ 6
4. Handshaking mechanism for DF election in EVPN network.........6
5. Handshaking mechanism for DF election in PBB-EVPN network.....8
6. Network Migration Analysis................................... 8
7. DF election extended community............................... 9
8. Security Considerations..................................... 10
9. IANA Considerations ........................................ 10
10. References ................................................ 10
10.1. Normative References.................................. 10
10.2. Informative References................................ 10
11. Acknowledgments ........................................... 10
1. Introduction
[EVPN] is a L2VPN solution using BGP for distributing
customer/client MAC address reachability information over the core
MPLS/IP network. EVPN provides flexible redundancy modes for both
multi-homed device(MHD) and multi-homed network(MHN) scenarios. In
MHD case, a CE node is normally accessed to a set of PE nodes
leveraging multi-chassis Ethernet link aggregation groups(LAGs),
both all-active and single-active redundancy mode can be achieved
for the CE node. In MHN case, an Ethernet network, rather than a
single device, is multi-homed to a group of PEs, only single-active
redundancy mode can be achieved for the Ethernet network.
Hao & et,al Expires August 12, 2015 [Page 2]
Internet-Draft Handshaking mechanism for DF February 2015
No matter it is all-active or single-active case, DF election
mechanism is used to avoid packet duplication to local Ethernet
Segment(ES) from a remote PE and loop among local PEs connecting to
same ESI. The Designated Forwarder (DF) PE in (PBB-)EVPN networks is
the PE that is responsible for sending multicast, broadcast and
unknown unicast traffic to a multi-homed CE, on a given Ethernet Tag
on a particular Ethernet Segment (ES). The DF PE is selected based
on the list of PEs that advertise the Ethernet Segment Identifier
(ESI) to the EVPN network.
The DF election procedure defined in [EVPN] is as follows:
1. When a PE discovers the ESI of the attached Ethernet Segment, it
advertises an Ethernet Segment route with the associated ES-Import
extended community attribute.
2. The PE then starts a timer (default value = 3 seconds) to allow
the reception of Ethernet Segment routes from other PE nodes
connected to the same Ethernet Segment.
3. The receiver PE(s) also starts a timer when the Ethernet Segment
route is received. This timer value should be same across all PEs
connected to the same Ethernet Segment.
4. When the timer expires, each PE starts DF election process
independently using same algorithm. The PE elected as DF for a given
EVPN instance will unblock the multi-destination traffic in the
egress direction towards the Segment immediately, while the non-DF
PEs will block the traffic immediately.
If one CE device or network is accessed to multiple PEs, one PE
failure and recovery will trigger EVPN DF re-election. Because each
PE relies on independent timer expiring to trigger local DF election
process, due to Ethernet Segment route transmission timer and timer
clock discrepancy on each PEs, in the DF re-election transient
period, dual DF PEs will co-exist, traffic loop and disruption will
be incurred. In MHN case, the consequences are particularly serious
and will be described in detail in section 2.
2. Problems in MHN scenario
Hao & et,al Expires August 12, 2015 [Page 3]
Internet-Draft Handshaking mechanism for DF February 2015
----- -----
|PC1| |PC2|
----- -----
| |
------------------------
/ Native Eth Network \
\ /
-------------------------
| |
------ ------
|AGG1 | |AGG2 |
------ ------ DC1 Network
| |---------------
------ ------
|PE1 | |PE2 |
------ ------
| |
------------------------
/ \
| IP/MPLS Network |
\ /
-------------------------
| |
------ ------
|PE3 | |PE4 |
------ ------
| |---------------
------ ------ DC2 Network
|AGG3 | |AGG4 |
------ ------
| |
------------------------
/ Native Eth Network \
\ /
-------------------------
| |
----- -----
|PC3| |PC4|
----- -----
Hao & et,al Expires August 12, 2015 [Page 4]
Internet-Draft Handshaking mechanism for DF February 2015
Figure 1 DC Network interconnecting using EVPN
In figure 1, both DC1 and DC2 networks are native Ethernet network
and are multi-homed to two EVPN PEs in single-active mode
respectively, i.e., DC 1 is connected to PE1 and PE2 while DC 2 is
connected to PE3 and PE4. In regular time, PE1 and PE3 are DF PE,
PE2 and PE4 are non-DF PE. PC1 to PC4 belong to same VLAN.
When PE3 fails, PE4 will take over the duties of DF PE. After some
time, PE3 recovers and starts DF PE re-election process. In the re-
election process, due to Ethernet Segment route transmission timer
and timer clock discrepancy on PE3 and PE4, both PE3 and PE4 will
act as DF PE in short transient time. During the transient time, the
following data plane and control plane process will be performed.
Data plane:
1. PC1 in DC1 sends BUM traffic to PE1.
2. PE1 sends the traffic to PE3 (and PE4). Because both PE3 and PE4
are DF PE at this time, PE3 (and PE4) will forward the BUM traffic
to DC2 local network.
3. PE4 (and PE3) will receive the BUM traffic from DC2 local access
network and learn the PC1's MAC through data plane.
4. PE4 (and PE3) will forward the BUM traffic to PE1.
5. PE1 will forward the BUM traffic to DC1 network. MAC flip-flop of
PC1 will occur on the switches in DC1.
Control plane:
1. When PE4 (and PE3) learns PC1's MAC from DC2 local network, PE4
(and PE3) will announce the MAC of PC1 to PE1 using BGP control
plane.
2. When PE1 receives PC1's MAC from PE4 (and PE3) through BGP, it
will populate the MAC route of PC1 into local MAC-VRF, MAC flip-flop
of PC1 on PE1 will occur.
Hao & et,al Expires August 12, 2015 [Page 5]
Internet-Draft Handshaking mechanism for DF February 2015
When PC1's MAC flip-flop occurs on DC1 local switches and PE1, the
regular traffic to PC1 in DC1 local network will be disrupted.
If there are multiple PCs sending BUM traffic proactively at the
transient time, multiple MAC addresses fluctuation will occur. This
will impose extra control plane burden on all related PEs and induce
regular traffic disruption to these PCs. So in the MHN scenario,
dual DF PEs in transient period will have serious consequences, it
should be resolved.
To resolve the problem of dual DF PEs in transient period, a new
handshaking mechanism for DF election is introduced in this draft.
3. Conventions used in this document
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
"SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
document are to be interpreted as described in [RFC2119].
Ethernet Segment (ES): If a multi-homed device or network is
connected to two or more PEs via a set of Ethernet links, then that
set of links is referred to as an 'Ethernet segment'.
Ethernet Segment Identifier (ESI): A unique non-zero identifier
that identifies an Ethernet Segment is called an 'Ethernet Segment
Identifier'.
4. Handshaking mechanism for DF election in EVPN network
In figure 1, initially PE2 and PE3 boot up and have already finished
DF election process, later PE1 boots up and is elected as DF, the
timing diagram from PE1 boots up is as follows:
------- ------- -------
| PE1 | | PE2 | | PE3 |
------- ------- -------
Discover ESI | | |
and start timer| | |
-------------->| | |
Hao & et,al Expires August 12, 2015 [Page 6]
Internet-Draft Handshaking mechanism for DF February 2015
|ES Route+DF Election EC| |
|<--------------------->| |
| ES Route+DF Election EC |
|<------------------------------------------->|
Timer expires | | |
-------------->|AD Route+DF Election EC| |
|---------------------->|Block traffic |
| AD Route+DF Election EC |
|--------------------------------------------->|Block traffic
|AD Route+DF Election EC| |
|<----------------------| |
| AD Route+DF Election EC |
|<--------------------------------------------|
| Unblock traffic | |
Figure 2 Handshaking DF Election Timing Diagram
1. When PE1 discovers the ESI of the attached Ethernet Segment, it
advertises an Ethernet Segment route with the associated ES-Import
extended community attribute. If the PE supports DF election
handshaking mechanism, the ES route MUST associate with a new DF
election BGP extended community with a DF handshaking capability
Flag to show that the advertising PE itself supports DF election
handshaking mechanism.
2. PE1 then starts a timer (default value = 3 seconds) to allow the
reception of Ethernet Segment routes from other PE(PE2 and PE3)
nodes connecting to the same Ethernet Segment. This timer value
should be same across all PEs connected to the same Ethernet Segment.
3. When the timer expires, each PE knows all member PEs connecting
to the same ESI and perform DF election algorithm independently as
defined in [EVPN]. If all member PEs connecting to same ESI support
DF handshaking mechanism, for the DF PE, at this time it doesn't
unblock multi-destination traffic in the egress direction towards
the Segment directly. For the non-DF PEs, they also should keep
forwarding state unchanged(Because PE2 is old DF PE, so it keep
forwarding multi-destination traffic in the egress direction). The
DF PE of PE1 should shake hands with other non-DF PEs to ensure
unblocking action on DF PE and blocking action on non-DF PE enforced
at the same time. The elected DF PE of PE1 notifies other non-DF PEs
an Ethernet Auto-Discovery (A-D) route associated with a DF election
Hao & et,al Expires August 12, 2015 [Page 7]
Internet-Draft Handshaking mechanism for DF February 2015
BGP extended community with DF Flag to show that the advertising PE1
itself is DF PE.
4. When each non-DF PE(PE2 and PE3) receives the A-D route from DF
PE of PE1, the non-DF PE will block multi-destination traffic in the
egress direction, then it advertises an Ethernet Auto-Discovery (A-D)
route with the DF election extended community to DF PE. The DF
election extended community carries a non-DF Flag to show that the
advertising PE itself is non-DF PE and has blocked multi-destination
traffic in the egress direction towards the Segment.
5. When the DF PE1 receives the A-D route with DF election extended
community from all other non-DF PEs, the DF PE can start unblock
multi-destination traffic in the egress direction towards the
Segment. Because all non-DF PEs have blocked the traffic, so now no
packet duplication and loop among local PEs will occur.
When a DF election happens, there may be other uncompleted DF
election process existed among same PEs connecting to same Ethernet
Segments at the same time. In order to ensure that all PEs negotiate
for the newest DF election, it is necessary to introduce a sequence
number into the DF election extended community attribute. The
sequence number is generated on DF PE corresponding to each DF
election process, non-DF PEs don't generate the number and use same
sequence number generated by DF PE. Each non-DF PE should stop
negotiating old DF election when it receives a A-D route with larger
sequence number.
Through the handshaking mechanism, when a DF PE is elected, it
notifies all non-DF PEs block traffic immediately, only when all
non-DF PEs block traffic successfully, DF PE can unblock traffic, so
it can effectively reduce traffic disruption , avoid potential
issues of packet duplication and loop incurred by all-active access.
5. Handshaking mechanism for DF election in PBB-EVPN network
In PBB-EVPN network, Ethernet Auto-Discovery Routes are not needed.
DF election extended community should be carried with ES routes. The
ES routes are used for DF election handshaking among the PBB-EVPN
PEs connecting to same ESI.
6. Network Migration Analysis
To support handshaking mechanism for DF election, software upgrade
in all member PEs connecting to an ESI is needed.
Hao & et,al Expires August 12, 2015 [Page 8]
Internet-Draft Handshaking mechanism for DF February 2015
In ESI member PE discovery phase, if a member PE doesn't support the
new DF election handshaking mechanism, it advertises ES route
without the new DF election extended capability, in this case the
PEs connecting to the same ESI use old DF election method as defined
in [EVPN]. Only if all member PEs connecting to same ESI support the
new DF election handshaking capability, the new DF election method
can be used for the ESI.
7. DF election extended community
This extended community is a new transitive extended community with
the Type field of 0x06 and the Sub-Type of TBD. It may be advertised
along with ES routes or A-D routes.
The DF election Extended Community is encoded as an 8-octet value as
follows:
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Type=0x06 | Sub-Type=TBD |C|D|N| Reserved=0 |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| sequence |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
DF election extended community
o C. If C flag is one, it indicates that the advertising PE
supports DF election handshaking mechanism. It's used in ESI member
PE discovery phase for capability announcement.
o D. If D flag is one, it indicates that the advertising PE is
the DF, the non-DF PE connecting to same ESI can block egress multi-
destination traffic immediately when it receives ES or AD route with
the DF election extended community from DF PE.
o N. If N flag is one, it indicates that the advertising PE is
non-DF and has blocked multi-destination traffic in the egress
direction towards the Segment.
o Sequence. The sequence number is used to ensure that PEs
connecting to same ESI are negotiating for same DF election. In ES
member PEs discovery phase, the sequence number is 0. In DF election
handshaking phase, the sequence number should be non-zero and is
generated by DF PE.
Hao & et,al Expires August 12, 2015 [Page 9]
Internet-Draft Handshaking mechanism for DF February 2015
8. Security Considerations
NA.
9. IANA Considerations
IANA has allocated the following EVPN Extended Community sub-types
in [RFC7153].
SUB-TYPE VALUE NAME
0x00 MAC Mobility
0x01 ESI Label
0x02 ES-Import Route Target
IANA is requested to create and maintain a new registry for "EVPN DF
Election Extended Community Sub-Types".
SUB-TYPE VALUE NAME
0x03 DF Election
10. References
10.1. Normative References
[1] [RFC2119] Bradner, S., "Key words for use in RFCs to Indicate
Requirement Levels", BCP 14, RFC 2119, March 1997.
10.2. Informative References
[2] [EVPN] Sajassi et al., "BGP MPLS Based Ethernet VPN", draft-ietf-
l2vpn-evpn-11.txt, work in progress, October, 2014
11. Acknowledgments
Authors like to thank Lili Wang, Ziqing Cao, Feng Qian for his
valuable inputs.
Hao & et,al Expires August 12, 2015 [Page 10]
Internet-Draft Handshaking mechanism for DF February 2015
Authors' Addresses
Weiguo Hao
Huawei Technologies
101 Software Avenue,
Nanjing 210012
China
Phone: +86-25-56623144
Email: haoweiguo@huawei.com
Lucy Yong
Huawei Technologies
Phone: +1-918-808-1918
Email: lucy.yong@huawei.com
Qiandeng Liang
Huawei Technologies
101 Software Avenue,
Nanjing 210012
China
Email: liangqiandeng@huawei.com
Hao & et,al Expires August 12, 2015 [Page 11]