Internet DRAFT - draft-giacalone-ospf-te-express-path
draft-giacalone-ospf-te-express-path
Network Working Group S. Giacalone
Internet Draft Thomson Reuters
Intended status: Proposed Standard
Expires: March 2012 D. Ward
Juniper Networks
J. Drake
Juniper Networks
A. Atlas
Juniper Networks
S. Previdi
Cisco Systems
September 21, 2011
OSPF Traffic Engineering (TE) Express Path
draft-giacalone-ospf-te-express-path-02.txt
Abstract
In certain networks, such as, but not limited to, financial
information networks (e.g. stock market data providers), network
performance criteria (e.g. latency) are becoming as critical to data
path selection as other metrics.
This document describes extensions to OSPF TE [RFC3630] such that
network performance information can be distributed and collected in a
scalable fashion. The information distributed using OSPF TE Express
Path can then be used to make path selection decisions based on
network performance.
Note that this document only covers the mechanisms with which network
performance information is distributed. The mechanisms for measuring
network performance or acting on that information, once distributed,
are outside the scope of this document.
Status of this Memo
This Internet-Draft is submitted in full conformance with the
provisions of BCP 78 and BCP 79.
Giacalone Expires March 21, 2012 [Page 1]
Internet-Draft OSPF TE Express Path September 2011
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF), its areas, and its working groups. Note that
other groups may also distribute working documents as Internet-
Drafts.
Internet-Drafts are draft documents valid for a maximum of six months
and may be updated, replaced, or obsoleted by other documents at any
time. It is inappropriate to use Internet-Drafts as reference
material or to cite them other than as "work in progress."
The list of current Internet-Drafts can be accessed at
http://www.ietf.org/ietf/1id-abstracts.txt
The list of Internet-Draft Shadow Directories can be accessed at
http://www.ietf.org/shadow.html
This Internet-Draft will expire on March 21, 2011.
Copyright Notice
Copyright (c) 2011 IETF Trust and the persons identified as the
document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal
Provisions Relating to IETF Documents
(http://trustee.ietf.org/license-info) in effect on the date of
publication of this document. Please review these documents
carefully, as they describe your rights and restrictions with respect
to this document. Code Components extracted from this document must
include Simplified BSD License text as described in Section 4.e of
the Trust Legal Provisions and are provided without warranty as
described in the Simplified BSD License.
Table of Contents
1. Introduction...................................................3
2. Conventions used in this document..............................4
3. Express Path Extensions to OSPF TE.............................4
4. Sub TLV Details................................................6
4.1. Unidirectional Link Delay Sub-TLV.........................6
4.1.1. Type.................................................6
4.1.2. Length...............................................6
Giacalone, et al Expires March 21, 2012 [Page 2]
Internet-Draft OSPF TE Express Path September 2011
4.1.3. A bit................................................7
4.1.4. Reserved.............................................7
4.1.5. Delay Value..........................................7
4.2. Unidirectional Delay Variation Sub-TLV....................7
4.2.1. Type.................................................7
4.2.2. Length...............................................7
4.2.3. Reserved.............................................8
4.2.4. Delay Variation......................................8
4.3. Unidirectional Link Loss Sub-TLV..........................8
4.3.1. Type.................................................8
4.3.2. Length...............................................8
4.3.3. A bit................................................8
4.3.4. Reserved.............................................9
4.3.5. Link Loss............................................9
4.4. Unidirectional Residual Bandwidth Sub-TLV.................9
4.4.1. Type.................................................9
4.4.2. Length..............................................10
4.4.3. Residual Bandwidth..................................10
4.5. Unidirectional Available Bandwidth Sub-TLV...............10
4.4.4. Type................................................10
4.4.5. Length..............................................11
4.4.6. Available Bandwidth.................................11
5. Announcement Thresholds and Filters...........................11
6. Announcement Suppression......................................11
7. Network Stability and Announcement Periodicity................12
8. Compatibility.................................................12
9. Security Considerations.......................................12
10. IANA Considerations..........................................12
11. References...................................................12
11.1. Normative References....................................12
11.2. Informative References..................................13
12. Acknowledgments..............................................13
13. Author's Addresses...........................................14
1. Introduction
In certain networks, such as, but not limited to, financial
information networks (e.g. stock market data providers), network
performance information (e.g. latency) is becoming as critical to
data path selection as other metrics.
In these networks, extremely large amounts of money rest on the
ability to access market data in "real time" and to predictably make
trades faster than the competition. Because of this, using metrics
such as hop count or cost as routing metrics is becoming only
Giacalone, et al Expires March 21, 2012 [Page 3]
Internet-Draft OSPF TE Express Path September 2011
tangentially important. Rather, it would be beneficial to be able to
make path selection decisions based on performance data (such as
latency) in a cost-effective and scalable way.
This document describes extensions to OSPF TE (hereafter called "OSPF
TE Express Path"), that can be used to distribute network performance
information (such as link delay, delay variation, packet loss,
residual bandwidth, and available bandwidth).
The data distributed by OSPF TE OSPF TE Express Path is meant to be
used as part of the operation of the routing protocol (e.g. by
replacing cost with latency or considering bandwidth as well as
cost), by enhancing CSPF, or for other uses such as supplementing the
data used by an Alto server [Alto]. With respect to CSPF, the data
distributed by OSPF TE Express Path can be used to setup, fail over,
and fail back data paths using protocols such as RSVP-TE [RFC3209].
Note that the mechanisms described in this document only disseminate
performance information. The methods for initially gathering that
performance information, such as [Frost], or acting on it once it is
distributed are outside the scope of this document.
2. Conventions used in this document
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
"SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
document are to be interpreted as described in RFC-2119 [RFC2119].
In this document, these words will appear with that interpretation
only when in ALL CAPS. Lower case uses of these words are not to be
interpreted as carrying RFC-2119 significance.
3. Express Path Extensions to OSPF TE
This document proposes new OSPF TE sub-TLVs that can be announced in
OSPF TE LSAs to distribute network performance information. The
extensions in this document build on the ones provided in OSPF TE
[RFC3630] and GMPLS [RFC4203].
OSPF TE LSAs [RFC3630] are opaque LSAs [RFC5250] with area flooding
scope. Each TLV has one or more nested sub-TLVs which permit the TE
LSA to be readily extended. There are two main types of OSPF TE LSA;
the Router Address or Link TE LSA. Like the extensions in GMPLS
Giacalone, et al Expires March 21, 2012 [Page 4]
Internet-Draft OSPF TE Express Path September 2011
(RFC4203), this document proposes several additional sub-TLVs for
the Link TE LSA:
Type Length Value
TBD1 4 Unidirectional Link Delay
TBD2 4 Unidirectional Delay Variation
TBD3 4 Unidirectional Packet Loss
TBD4 4 Unidirectional Residual Bandwidth Sub TLV
TBD5 4 Unidirectional Available Bandwidth Sub TLV
As can be seen in the list above, the sub-TLVs described in this
document carry different types of network performance information.
Many (but not all) of the sub-TLVs include a bit called the Anomalous
(or "A") bit. When the A bit is clear (or when the sub-TLV does not
include an A bit), the sub-TLV describes steady state link
performance. This information could conceivably be used to construct
a steady state performance topology for initial tunnel path
computation, or to verify alternative failover paths.
When network performance violates configurable link-local thresholds
a sub-TLV with the A bit set is advertised. These sub-TLVs could be
used by the receiving node to determine whether to fail traffic to a
backup path, or whether to calculate an entirely new path. From an
MPLS perspective, the intent of the A bit is to permit LSP ingress
nodes to:
A) Determine whether the link referenced in the sub-TLV affects any
of the LSPs for which it is ingress. If there are, then:
B) Determine whether those LSPs still meet end-to-end performance
objectives. If not, then:
C) The node could then conceivably move affected traffic to a pre-
established protection LSP or establish a new LSP and place the
traffic in it.
If link performance then improves beyond a configurable minimum
value (reuse threshold), that sub-TLV can be re-advertised with the
Anomalous bit cleared. In this case, a receiving node can
conceivably do whatever re-optimization (or failback) it wishes to
do (including nothing).
Giacalone, et al Expires March 21, 2012 [Page 5]
Internet-Draft OSPF TE Express Path September 2011
Note that when a sub-TLV does not include the A bit, that sub-TLV
cannot be used for failover purposes. The A bit was intentionally
omitted from some sub-TLVs to help mitigate oscillations. See section
7. 1. for more information.
Consistent with existing OSPF TE specifications (RFC3630), the
bandwidth advertisements defined in this draft MUST be encoded as
IEEE floating point values. The delay and delay variation
advertisements defined in this draft MUST be encoded as integer
values. Delay values MUST be quantified in units of microseconds,
packet loss MUST be quantified as a percentage of packets sent, and
bandwidth MUST be sent as bytes per second. All values (except
residual bandwidth) MUST be calculated as rolling averages where the
averaging period MUST be a configurable period of time. See section
5. for more information.
4. Sub TLV Details
4.1. Unidirectional Link Delay Sub-TLV
This sub-TLV advertises the average link delay between two directly
connected OSPF neighbors. The delay advertised by this sub-TLV MUST
be the delay from the local neighbor to the remote one (i.e. the
forward path latency). The format of this sub-TLV is shown in the
following diagram:
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| TBD1 | 4 |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|A| RESERVED | Delay |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
4.1.1. Type
This sub-TLV has a type of TBD1.
4.1.2. Length
The length is 4.
Giacalone, et al Expires March 21, 2012 [Page 6]
Internet-Draft OSPF TE Express Path September 2011
4.1.3. A bit
This field represents the Anomalous (A) bit. The A bit is set when
the measured value of this parameter exceeds its configured maximum
threshold. The A bit is cleared when the measured value falls below
its configured reuse threshold. If the A bit is clear, the sub-TLV
represents steady state link performance.
4.1.4. Reserved
This field is reserved for future use. It MUST be set to 0 when sent
and MUST be ignored when received.
4.1.5. Delay Value
This 24-bit field carries the average link delay over a configurable
interval in micro-seconds, encoded as an integer value. When set to
0, it has not been measured. When set to the maximum value 16,777,215
(16.777215 sec), then the delay is at least that value and may be
larger.
4.2. Unidirectional Delay Variation Sub-TLV
This sub-TLV advertises the average link delay variation between two
directly connected OSPF neighbors. The delay variation advertised by
this sub-TLV MUST be the delay from the local neighbor to the remote
one (i.e. the forward path latency). The format of this sub-TLV is
shown in the following diagram:
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| TBD2 | 4 |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| RESERVED | Delay Variation |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
4.2.1. Type
This sub-TLV has a type of TBD2.
4.2.2. Length
The length is 4.
Giacalone, et al Expires March 21, 2012 [Page 7]
Internet-Draft OSPF TE Express Path September 2011
4.2.3. Reserved
This field is reserved for future use. It MUST be set to 0 when sent
and MUST be ignored when received.
4.2.4. Delay Variation
This 24-bit field carries the average link delay variation over a
configurable interval in micro-seconds, encoded as an integer value.
When set to 0, it has not been measured. When set to the maximum
value 16,777,215 (16.777215 sec), then the delay is at least that
value and may be larger.
4.3. Unidirectional Link Loss Sub-TLV
This sub-TLV advertises the loss (as a packet percentage) between two
directly connected OSPF neighbors. The link loss advertised by this
sub-TLV MUST be the packet loss from the local neighbor to the remote
one (i.e. the forward path loss). The format of this sub-TLV is shown
in the following diagram:
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| TBD3 | 4 |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|A| RESERVED | Link Loss |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
4.3.1. Type
This sub-TLV has a type of TBD3
4.3.2. Length
The length is 4
4.3.3. A bit
This field represents the Anomalous (A) bit. The A bit is set when
the measured value of this parameter exceeds its configured maximum
threshold. The A bit is cleared when the measured value falls below
Giacalone, et al Expires March 21, 2012 [Page 8]
Internet-Draft OSPF TE Express Path September 2011
its configured reuse threshold. If the A bit is clear, the sub-TLV
represents steady state link performance.
4.3.4. Reserved
This field is reserved for future use. It MUST be set to 0 when sent
and MUST be ignored when received.
4.3.5. Link Loss
This 24-bit field carries link packet loss as a percentage of the
total traffic sent over a configurable interval. The basic unit is
0.000003%, where (2^24 - 2) is 50.331642%. This value is the highest
packet loss percentage that can be expressed (the assumption being
that precision is more important on high speed links than the ability
to advertise loss rates greater than this, and that high speed links
with over 50% loss are unusable). Therefore, measured values that are
larger than the field maximum SHOULD be encoded as the maximum value.
When set to a value of all 1s (2^24 - 1), the link packet loss has
not been measured.
4.4. Unidirectional Residual Bandwidth Sub-TLV
This TLV advertises the residual bandwidth (defined in section 4.4.3.
between two directly connected OSPF neighbors. The residual bandwidth
advertised by this sub-TLV MUST be the residual bandwidth from the
system originating the LSA to its neighbor.
The format of this sub-TLV is shown in the following diagram:
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| TBD4 | 4 |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Residual Bandwidth |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
4.4.1. Type
This sub-TLV has a type of TBD4.
Giacalone, et al Expires March 21, 2012 [Page 9]
Internet-Draft OSPF TE Express Path September 2011
4.4.2. Length
The length is 4.
4.4.3. Residual Bandwidth
This field carries the residual bandwidth on a link, forwarding
adjacency [RFC4206], or bundled link in IEEE floating point format
with units of bytes per second. For a link or forwarding adjacency,
residual bandwidth is defined to be Maximum Bandwidth [RFC3630] minus
the bandwidth currently allocated to RSVP-TE LSPs. For a bundled
link, residual bandwidth is defined to be the sum of the component
link residual bandwidths.
Note that although it may seem possible to calculate Residual
Bandwidth using the existing sub-TLVs in RFC 3630, this is not a
consistently reliable approach and hence the Residual Bandwidth sub-
TLV has been added here. For example, because the Maximum Reservable
Bandwidth [RFC3630] can be larger than the capacity of the link,
using it as part of an algorithm to determine the value of the
Maximum Bandwidth [RFC3630] minus the bandwidth currently allocated
to RSVP-TE LSPs cannot be considered reliably accurate.
4.5. Unidirectional Available Bandwidth Sub-TLV
This TLV advertises the available bandwidth (defined in section
4.4.6. ) between two directly connected OSPF neighbors. The available
bandwidth advertised by this sub-TLV MUST be the available bandwidth
from the system originating the LSA to its neighbor. The format of
this sub-TLV is shown in the following diagram:
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| TBD5 | 4 |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Available Bandwidth |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
4.4.4. Type
This sub-TLV has a type of TBD5.
Giacalone, et al Expires March 21, 2012 [Page 10]
Internet-Draft OSPF TE Express Path September 2011
4.4.5. Length
The length is 4.
4.4.6. Available Bandwidth
This field carries the available bandwidth on a link, forwarding
adjacency, or bundled link in IEEE floating point format with units
of bytes per second. For a link or forwarding adjacency, available
bandwidth is defined to be residual bandwidth (see section 4.4. )
minus the measured bandwidth used for the actual forwarding of non-
RSVP-TE LSP packets. For a bundled link, available bandwidth is
defined to be the sum of the component link available bandwidths.
5. Announcement Thresholds and Filters
The values advertised in all sub-TLVs MUST be controlled using an
exponential filter (i.e. a rolling average) with a configurable
measurement interval and filter coefficient.
Implementations are expected to provide separately configurable
advertisement thresholds. All thresholds MUST be configurable on a
per sub-TLV basis.
The announcement of all sub-TLVs that do not include the A bit SHOULD
be controlled by variation thresholds that govern when they are sent.
Sub-TLV that include the A bit are governed by several thresholds.
Firstly, a threshold SHOULD be implemented to govern the announcement
of sub-TLVs that advertise a change in performance, but not an SLA
violation (i.e. when the A bit is not set). Secondly, implementations
MUST provide configurable thresholds that govern the announcement of
sub-TLVs with the A bit set (for the indication of a performance
violation). Thirdly, implementations SHOULD provide reuse
thresholds. These thresholds govern sub-TLV re-announcement with the
A bit cleared to permit fail back.
6. Announcement Suppression
When link performance average values change, but fall under the
threshold that would cause the announcement of a sub-TLV with the A
bit set, implementations MAY suppress or throttle sub-TLV
Giacalone, et al Expires March 21, 2012 [Page 11]
Internet-Draft OSPF TE Express Path September 2011
announcements. All suppression features and thresholds SHOULD be
configurable.
7. Network Stability and Announcement Periodicity
To mitigate concerns about stability, all values (except residual
bandwidth) MUST be calculated as rolling averages where the averaging
period MUST be a configurable period of time, rather than
instantaneous measurements.
Announcements MUST also be able to be throttled using configurable
inter-update throttle timers. The minimum announcement periodicity is
1 announcement per second.
8. Compatibility
As per (RFC3630), unrecognized TLVs should be silently ignored
9. Security Considerations
This document does not introduce security issues beyond those
discussed in [RFC3630] and [RFC5329].
10. IANA Considerations
IANA maintains the registry for the sub-TLVs. OSPF TE Express Path
will require one new type code per sub-TLV defined in this document.
11. References
11.1. Normative References
[RFC2119]Bradner, S., "Key words for use in RFCs to Indicate
Requirement Levels", BCP 14, RFC 2119, March 1997.
Giacalone, et al Expires March 21, 2012 [Page 12]
Internet-Draft OSPF TE Express Path September 2011
[RFC3630] Katz, D., Kompella, K., Yeung, D., "Traffic
Engineering (TE) Extensions to OSPF Version 2", RFC 3630,
September 2003.
11.2. Informative References
[RFC2328] Moy, J, "OSPF Version 2", RFC 2328, April 1998
[RFC3031] Rosen, E., Viswanathan, A., Callon, R., "Multiprotocol
Label Switching Architecture", January 2001
[RFC3209] Awduche, D., Berger, L., Gan, D., Li, T., Srinivasan,
V., and G. Swallow, "RSVP-TE: Extensions to RSVP for LSP
Tunnels", RFC 3209, December 2001.
[RFC5250] Berger, L., Bryskin I., Zinin, A., Coltun, R., "The OSPF
Opaque LSA Option", RFC 5250, July 2008.
[Frost] D. Frost, S. Bryant"A Packet Loss and Delay Measurement
Profile for MPLS-based Transport Networks"
[Alto] R. Alimi R. Penno Y. Yang, "ALTO Protocol"
12. Acknowledgments
The authors would like to recognize Ayman Soliman for his
contributions.
This document was prepared using 2-Word-v2.0.template.dot.
Giacalone, et al Expires March 21, 2012 [Page 13]
Internet-Draft OSPF TE Express Path September 2011
13. Author's Addresses
Spencer Giacalone
Thomson Reuters
195 Broadway
New York NY 10007, USA
Email: Spencer.giacalone@thomsonreuters.com
Dave Ward
Juniper Networks
1194 N. Mathilda Ave.
Sunnyvale, CA 94089, USA
Email: dward@juniper.net
John Drake
Juniper Networks
1194 N. Mathilda Ave.
Sunnyvale, CA 94089, USA
Email: jdrake@juniper.net
Alia Atlas
Juniper Networks
1194 N. Mathilda Ave.
Sunnyvale, CA 94089, USA
Email: akatlas@juniper.net
Stefano Previdi
Cisco Systems
Via Del Serafico 200
00142 Rome
Italy
Email: sprevidi@cisco.com
Giacalone, et al Expires March 21, 2012 [Page 14]
Internet-Draft OSPF TE Express Path September 2011
Giacalone, et al Expires March 21, 2012 [Page 15]