Internet DRAFT - draft-yizhou-trill-traceable-oam
draft-yizhou-trill-traceable-oam
TRILL Working Group Y. Li
INTERNET-DRAFT D. Eastlake
Intended Status: Standard Track H. Chen
Huawei Technologies
D. Kumar
Cisco
S. Gupta
IP Infusion
Expires: January 6, 2016 July 6, 2015
TRILL: Traceable OAM
draft-yizhou-trill-traceable-oam-00
Abstract
TRILL fault management [RFC7455] specifies the messages and
operations for OAM in TRILL network. The sender collects the replies
for the OAM-relevant request it sent and uses the replies as the
indication of the network faults. In certain circumstances the sender
needs to collect multiple replies to isolate the fault, e.g.
repetitively sending Path Trace Messages (PTM) with increasing value
of hop count and collecting the replies on them to figure out the
fault point of certain path.
With the increasing deployment of Software Defined Network (SDN), a
centralized management server can be used to help with fault
management. The server then is responsible to collect OAM messages
and analyze them to either isolate the network fault or compile
overall OAM information. It naturally uses SDN structure to alleviate
the effort of the requester node and provide a centralized solution
to produce the operation and management feedback of the network.
This document specifies the extensions of TRILL OAM message and the
operations about the network nodes and the centralized management
server to trace and collect OAM relevant messages for further
analysis.
Status of this Memo
This Internet-Draft is submitted to IETF in full conformance with the
provisions of BCP 78 and BCP 79.
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF), its areas, and its working groups. Note that
other groups may also distribute working documents as
Yizhou, et al [Page 1]
INTERNET DRAFT TRILL Traceable OAM July 2015
Internet-Drafts.
Internet-Drafts are draft documents valid for a maximum of six months
and may be updated, replaced, or obsoleted by other documents at any
time. It is inappropriate to use Internet-Drafts as reference
material or to cite them other than as "work in progress."
The list of current Internet-Drafts can be accessed at
http://www.ietf.org/1id-abstracts.html
The list of Internet-Draft Shadow Directories can be accessed at
http://www.ietf.org/shadow.html
Copyright and License Notice
Copyright (c) 2015 IETF Trust and the persons identified as the
document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal
Provisions Relating to IETF Documents
(http://trustee.ietf.org/license-info) in effect on the date of
publication of this document. Please review these documents
carefully, as they describe your rights and restrictions with respect
to this document. Code Components extracted from this document must
include Simplified BSD License text as described in Section 4.e of
the Trust Legal Provisions and are provided without warranty as
described in the Simplified BSD License.
Table of Contents
1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . 4
2. Terminology Used in This Document . . . . . . . . . . . . . . . 5
3. Traceable Flag . . . . . . . . . . . . . . . . . . . . . . . . 5
4. Operation Theory . . . . . . . . . . . . . . . . . . . . . . . 6
4.1 Path Trace Message (PTM) with Traceable Flag . . . . . . . . 6
4.1.1 Actions by Originator RBridge . . . . . . . . . . . . . 7
4.1.2 Intermediate RBridge . . . . . . . . . . . . . . . . . . 8
4.1.3 Destination RBridge . . . . . . . . . . . . . . . . . . 8
4.1.4 Centralized Management Server . . . . . . . . . . . . . 8
4.2 Multi-Destination Tree Verification Message (MTVM) with
Traceable Flag . . . . . . . . . . . . . . . . . . . . . . . 8
4.2.1 Actions by Originator RBridge . . . . . . . . . . . . . 9
4.2.2 Receiving RBridge . . . . . . . . . . . . . . . . . . . 10
4.2.3 In-Scope RBridges . . . . . . . . . . . . . . . . . . . 10
4.2.4 Centralized Management Server . . . . . . . . . . . . . 11
Yizhou, et al [Page 2]
INTERNET DRAFT TRILL Traceable OAM July 2015
5. Security Considerations . . . . . . . . . . . . . . . . . . . . 11
6. IANA Considerations . . . . . . . . . . . . . . . . . . . . . . 11
7. References . . . . . . . . . . . . . . . . . . . . . . . . . . 11
7.1 Normative References . . . . . . . . . . . . . . . . . . . 11
7.2 Informative References . . . . . . . . . . . . . . . . . . 12
8. Acknowledgments . . . . . . . . . . . . . . . . . . . . . . . . 12
Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . . 12
Yizhou, et al [Page 3]
INTERNET DRAFT TRILL Traceable OAM July 2015
1. Introduction
TRILL fault management [RFC7455] specifies OAM messages, TLVs and
operations for fault detection and isolation in TRILL network.
Requester collects the replies of the OAM requests it sent and tries
to analyze the potential faults from them. For path tracing, the
sending RBridge utilizes a Hop Count Expiry approach [RFC7455] which
is similar as IP trace-route. The sending RBridge sends multiple
requests in sequence with incrementing value of Hop Count field and
collects the returning responses to construct the path and isolate
the fault.
With the deployment of centralized management server in TRILL
network, new requirements and approaches for fault isolation and path
tracing are emerging. Figure 1 shows a TRILL network with a
centralized management server.
+----------+
|management|
************|server |***********
* +----------+ *
* * *
* * *
* * *
* * *
* __*_ _ *
* --.--/ * \./ \._ *
* ./ * \. *
* .-- *********** \__ *
* / * * \ *
+-----+ +---+ +---+ +-----+
| RB1 |----|RB3|-----|RB4|-------| RB2 |
+-----+ +---+ +---+ +-----+
| \._ ./ |
| \.__ Trill Network ._/ |
| \. / |
+-----+ \../ \.--/''-' +-----+
|host1| |host2|
+-----+ +-----+
Figure 1. TRILL network with a centralized management server
The centralized management server is capable to construct the OAM
messages of a specified flow and feed them into an ingress RBridge,
say RB1. When the message is delivered to the egress RBridge RB2, the
intermediate RBridges RB3 and RB4 are able to replicate the messages
Yizhou, et al [Page 4]
INTERNET DRAFT TRILL Traceable OAM July 2015
to the management server. The server is responsible to do all the
analysis to trace the path and isolate the fault. Such approach is
easily deployable in a network with a controller. For instance, if
the management server is an Openflow [Openflow] controller, RBridges
may use Packet-in message to send the packets to the Openflow
controller and the controller may use Packet-out message to feed the
constructed OAM messages into the ingress RB at the beginning.
The document defines the flags and TLVs to help the RBridges to
identify the received OAM messages destined for a centralized
management server and provides the server with sufficient information
for further analysis.
2. Terminology Used in This Document
This document uses the terminology from [RFC6325], [RFC7174] and
[RFC7455]. Some additional terms listed below:
campus: Name for a TRILL network, like "bridged LAN" is a name for a
bridged network. It does not have any academic implication.
Data Label: VLAN or FGL.
ECMP: Equal Cost Multi-Path [RFC6325].
FGL: Fine Grained Label [RFC7172].
RBridge: An alternative name for a TRILL switch.
TRILL switch: A device implementing the TRILL protocol. Sometimes
called an RBridge.
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
"SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
document are to be interpreted as described in RFC-2119 [RFC2119].
3. Traceable Flag
A new flag 'T' is defined in TRILL OAM message header [RFC7455] as an
indicator for traceable message in figure 2. T flag is applicable to
Path Trace Message (PTM) and Multi-Destination Tree Verification
Message (MTVM). Loopback message and Continuity Check Message SHOULD
not set T flag to 1.
Yizhou, et al [Page 5]
INTERNET DRAFT TRILL Traceable OAM July 2015
1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|MD-L | Version | OpCode | Flags |T|FirstTLVOffset |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
. OpCode-Specific Information .
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
. TLVs .
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 2. T Flag in TRILL OAM Message Header
o T (1 bit): Traceable flag. When set, indicates no response
should be sent back to the requester and the entire TRILL frame
should be sent to a centralized management server for tracing.
Basically the traceable flag implies three functions in the trill
campus:
1. To indicate the intermediate RBs to capture the frames and
replicate it to CPU.
2. To guide the intermediate RB to perform certain operations which
may be different from the traditional OAM operations. For example, as
we can use packet-in to send the whole packet to openflow controller,
it is not necessary to add Original Data Payload TLV to the packet.
3. To make sure the sender will not expect any response and turn off
certain mechanisms like time out.
4. Operation Theory
OAM message with Traceable flag is most useful in functionalities
requiring tracing, e.g., trace route like behaviors.
4.1 Path Trace Message (PTM) with Traceable Flag
TRILL fault management [RFC7455] adopts an IP trace-route like
approach which relies on the hop count expiry to send the PTM message
to RBridge for further handling. The sender needs to repetitively
send the requests with increasing value of hop count. With traceable
flag on, the centralized management server may collect the replicated
frame along the path and check the hop count value in TRILL header
Yizhou, et al [Page 6]
INTERNET DRAFT TRILL Traceable OAM July 2015
directly. By sorting the hop count value decreasingly, it is easy to
plot the path taken for a specific flow or figure out the break point
for fault isolation.
As a centralized management server normally has more memory space
than an RBridge, the server may choose to record the flow entropy to
path mapping information. When a fault is suspected between two
RBridges, the sever may optimally choose minimum number of flow
entropies from the records it saved to feed into the ingress RBridge
to spread over the paths.
4.1.1 Actions by Originator RBridge
The originator RBridge takes the following actions:
o Identifies the destination RBridge based on user specification
or based on location of the specified destination MAC address.
o Constructs the Flow Entropy based on user-specified parameters
or implementation-specific default parameters.
o Specifies the Hop Count of the TRILL Data frame to be larger
than the expected number of hops.
o Constructs the TRILL OAM header: set the OpCode to Path Trace
Message type (65). Assign an applicable Session Identification
number for the request. Return Code and Return Sub-code MUST be
set to zero. Set Traceable flags to 1.
o Includes the following OAM TLVs, where applicable:
- Out-of-Band Reply Address TLV: When Traceable flag is set,
Out-of-Band Reply Address TLV needs to be set to the address
that the traced message should be sent. It is normally the IP
address of the centralized management server. This address may
be absent if the default centralized management server address
has been configured on every RBridge.
- Diagnostic Label TLV
- Sender ID TLV
o Dispatches the OAM frame to the TRILL data plane for
transmission.
The originator RBridge SHOULD not expect the replies for the Path
Trace Message with Traceable Flag set it sent.
Yizhou, et al [Page 7]
INTERNET DRAFT TRILL Traceable OAM July 2015
4.1.2 Intermediate RBridge
The intermediate RBridges need to be configured properly as MIP for
VLAN/FGL based MA. The TRILL OAM application layer validates the
received OAM frame by examining the presence of the TRILL Alert flag,
OAM Ethertype at the end of the Flow Entropy, the OpCode being PTM
and Traceable Flag set, the intermediate RBridges take the following
actions:
o Optionally include the following TLVs:
- Previous RBridge Nickname TLV (69)
- Interface Status TLV (4)
- Next-Hop RBridge List TLV (70)
- Sender ID TLV (1)
o Forward the received message including the TRILL header, the
payload and the appended TLVs (if any) to the address specified in
Out-of-Band Reply Address TLV. If Out-of-Band Reply Address TLV is
not present, either forward it to a system default centralized
management server or discard it.
4.1.3 Destination RBridge
Processing is identical to that in Section 4.1.2. The Destination
RBridge should not further forward the message in order to prevent
leaking of the packet out of the TRILL campus
4.1.4 Centralized Management Server
The centralized management server is normally served as an SDN
controller, e.g. an Openflow controller. It is up to the
implementation how to deal with the collected packets of PTM with
traceable flag from RBridges. The common logic is the centralized
management server compares the Session Identification Number and hop
count value in TRILL header to trace the path the packet taken.
4.2 Multi-Destination Tree Verification Message (MTVM) with Traceable
Flag
MVTM can be used by the OAM tools for plotting the entire or VLAN/FGL
pruned distribution tree, reachability verification for set of
Yizhou, et al [Page 8]
INTERNET DRAFT TRILL Traceable OAM July 2015
RBridges on a given tree or trace along a specified tree to a set of
RBridges.
A new TLV is defined as shown in figure 3.
1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Type | Length | Reserved |E|
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 3. Tree Trace Mode TLV
o Type (1 octet): 75 (TBD), Tree Trace Mode TLV
o Length (2 octets): 1
o E (1 bit): Egress Flag. When RBridge Scope TLV is not present
and E flag is 1, trace the receiving RBridge which are egress
RBridges on the tree of the specified VLAN/FGL or multicast group;
otherwise, ignore this flag.
4.2.1 Actions by Originator RBridge
The originator RBridge takes the following actions:
o Identifies the nickname of distribution tree to be traced.
o Constructs the Flow Entropy based on user-specified parameters
or implementation-specific default parameters.
o Specifies the applicable Hop Count value.
o Constructs the TRILL OAM header: set the OpCode to Multicast
Tree Verification Message type (67). Assign an applicable Session
Identification number for the request. Return Code and Return Sub-
code MUST be set to zero. Set Traceable flags to 1.
o Includes the following OAM TLVs, where applicable:
- Out-of-Band Reply Address TLV: When Traceable flag is set,
Out-of-Band Reply Address TLV needs to be set to the address
that the traced message should be sent. It is normally the
address of the centralized management server
- RBridge Scope TLV
Yizhou, et al [Page 9]
INTERNET DRAFT TRILL Traceable OAM July 2015
- Tree Trace Mode TLV: When RBridge Scope TLV is present, E flag
of this TLV SHOULD not be set to 1.
- Diagnostic Label TLV
- Sender ID TLV
o Dispatches the OAM frame to the TRILL data plane for
transmission.
The originator RBridge SHOULD not expect the replies for the
Multicast Tree Verification Message with Traceable Flag it sent.
4.2.2 Receiving RBridge
The TRILL OAM application layer validates the received OAM frame by
examining the presence of the TRILL Alert flag and OAM Ethertype at
the end of the Flow Entropy. If Traceable Flag is set to 1 in MTVM,
the RBridge validates the frame and analyzes if it is an in-scope
RBridge.
If the RBridge Scope TLV is present and the local RBridge nickname is
specified in the scope list, the receiving RBridge proceeds with
further processing as defined in Section 4.1.3.
If the RBridge Scope TLV is absent, the receiving RBridge SHOULD
check the Tree Trace Mode TLV. If E flag is 0, the receiving RBridge
proceeds with further processing as defined in Section 4.1.3. If E
flag is 1 and the receiving RBridge is an egress BRidge for the
specified VLAN/FGL or multicast group, the receiving RBridge proceeds
with further processing as defined in Section 4.1.3.
For other cases, the receiving RBridge is not considered as in-scope
RBridge and should not perform as per section 4.2.3.
4.2.3 In-Scope RBridges
In-Scope RBridges refers to those should tentatively take actions for
MTVM request. They are part of receiving RBridges as described in
last sub-section.
o Optionally include the following TLVs:
- Previous RBridge Nickname TLV (69)
- Interface Status TLV (4)
- Next-Hop RBridge List TLV (70)
Yizhou, et al [Page 10]
INTERNET DRAFT TRILL Traceable OAM July 2015
- Sender ID TLV (1)
- Multicast Receiver Port Count TLV (71)
o Forward the received message including the TRILL header, the
payload and the appended TLVs (if any) to the address specified in
Out-of-Band Reply Address TLV. If Out-of-Band Reply Address TLV is
not present, either forward it to a system default centralized
management server or discard it.
4.2.4 Centralized Management Server
The centralized management server is normally served as an SDN
controller, e.g. an Openflow controller. It is up to the
implementation how to deal with the collected packets of MTVM with
traceable flag from RBridges. The common logic is the centralized
management server compares the Session Identification Number and hop
count value in TRILL header to trace the path the packet taken along
a distribution tree. It can be used to plot the entire tree or pruned
tree or to find out who are the edge RBridges connecting users for a
specified VLAN/FGL.
5. Security Considerations
For general TRILL fault management security considerations, please
refer to [RFC7455].
6. IANA Considerations
One TLV Type is required to be assigned from the "CFM OAM IETF TLV
Types" sub-registry as follows:
Value Assignment
----- ----------
75 Tree Trace Mode TLV
7. References
7.1 Normative References
[RFC6325] Perlman, R., et.al. "RBridge: Base Protocol Specification",
RFC 6325, July 2011.
[RFC6439] Eastlake, D. et.al., "RBridge: Appointed Forwarder", RFC
6439, November 2011.
Yizhou, et al [Page 11]
INTERNET DRAFT TRILL Traceable OAM July 2015
[RFC6905] Senevirathne, T., Bond, D., Aldrin, S., Li, Y., and R.
Watve, "Requirements for Operations, Administration, and
Maintenance (OAM) in Transparent Interconnection of Lots
of Links (TRILL)", RFC 6905, March 2013.
[RFC7172] Eastlake 3rd, D., Zhang, M., Agarwal, P., Perlman, R., and
D. Dutt, "Transparent Interconnection of Lots of Links
(TRILL): Fine-Grained Labeling", RFC 7172, May 2014.
[RFC7174] Salam, S., Senevirathne, T., Aldrin, S., and D. Eastlake
3rd, "Transparent Interconnection of Lots of Links (TRILL)
Operations, Administration, and Maintenance (OAM)
Framework", RFC 7174, May 2014,
[RFC7180] Eastlake 3rd, D., Zhang, M., Ghanwani, A., Manral, V., and
A. Banerjee, "Transparent Interconnection of Lots of Links
(TRILL): Clarifications, Corrections, and Updates", RFC
7180, May 2014.
[RFC7455] Senevirathne, T., Finn, N., Salam, S., Kumar, D., Eastlake
3rd, D., Aldrin, S., and Y. Li, "Transparent
Interconnection of Lots of Links (TRILL): Fault
Management", RFC 7455, March 2015.
7.2 Informative References
[OpenFlow] OpenFlow Switch Specification Version, March 26, 2015.
(https://www.opennetworking.org/images/stories/downloads/
sdn-resources/ onf-specifications/openflow/openflow-
switch-v1.5.1.pdf)
8. Acknowledgments
TBD
Authors' Addresses
Yizhou Li
Huawei Technologies
101 Software Avenue,
Nanjing 210012
China
Phone: +86-25-56624629
Email: liyizhou@huawei.com
Yizhou, et al [Page 12]
INTERNET DRAFT TRILL Traceable OAM July 2015
Donald Eastlake
Huawei R&D USA
155 Beaver Street
Milford, MA 01757 USA
Phone: +1-508-333-2270
Email: d3e3e3@gmail.com
Hao Chen
Huawei Technologies
101 Software Avenue,
Nanjing 210012
China
Email: philips.chenhao@huawei.com
Deepak Kumar
CISCO Systems
510 McCarthy Blvd,
Milpitas, CA 95035, USA
Phone : +1 408-853-9760
Email: dekumar@cisco.com
Sujay Gupta
IP Infusion
RMZ Centennial
Mahadevapura Post
Bangalore - 560048
India
Email: sujay.gupta@ipinfusion.com
Yizhou, et al [Page 13]