Internet DRAFT - draft-sudha-teas-rsvp-preemption-avoidance
draft-sudha-teas-rsvp-preemption-avoidance
Network Working Group Sudharsana Venkataraman (Ed)
Internet Draft Juniper Networks
Intended status: Informational
Expires: January 6, 2016 July 6, 2015
Avoiding Repeated Preemption of Low-Priority TE LSPs -
Recommendations
draft-sudha-teas-rsvp-preemption-avoidance-00
Status of this Memo
This Internet-Draft is submitted in full conformance with the
provisions of BCP 78 and BCP 79.
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF), its areas, and its working groups. Note that
other groups may also distribute working documents as Internet-
Drafts.
Internet-Drafts are draft documents valid for a maximum of six
months and may be updated, replaced, or obsoleted by other documents
at any time. It is inappropriate to use Internet-Drafts as
reference material or to cite them other than as "work in progress."
The list of current Internet-Drafts can be accessed at
http://www.ietf.org/ietf/1id-abstracts.txt
The list of Internet-Draft Shadow Directories can be accessed at
http://www.ietf.org/shadow.html
This Internet-Draft will expire on January 6, 2016.
Copyright Notice
Copyright (c) 2015 IETF Trust and the persons identified as the
document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal
Provisions Relating to IETF Documents
(http://trustee.ietf.org/license-info) in effect on the date of
publication of this document. Please review these documents
carefully, as they describe your rights and restrictions with
respect to this document. Code Components extracted from this
document must include Simplified BSD License text as described in
Sudharsana, et al. Expires January 6, 2016 [Page 1]
Internet-Draft Avoiding Preemption July 2015
Section 4.e of the Trust Legal Provisions and are provided without
warranty as described in the Simplified BSD License.
Abstract
When a high priority LSP is being setup, if there is reservation
contention on a link along the path to the destination, any low
priority LSP taking the link could get preempted and eventually
rerouted.
Low priority LSPs could suffer preemption repeatedly when they are
placed in succession on heavily used links that have very less
remaining bandwidth. This document describes a solution to avoid
repeated preemption of low priority LSPs in the network.
Conventions used in this document
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
"SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
document are to be interpreted as described in RFC-2119 [RFC2119].
Table of Contents
1. Introduction...................................................3
1.1. Scenarios that can cause repeated preemption of low priority
LSPs...........................................................4
2. Recommendations................................................5
2.1. Transit based approach....................................5
2.2. Sample procedure at transit...............................5
2.2.1. Inputs to the procedure:.............................5
2.2.2. Periodic sub-procedure...............................6
2.2.3. Advantages:..........................................7
2.3. Ingress based approach:...................................7
2.4. Sample procedure at ingress...............................7
2.4.1. Inputs to the procedure:.............................7
2.4.2. Procedure at ingress.................................8
2.4.3. Disadvantages over transit based approach............9
2.4.4. Advantages:..........................................9
3. Use cases and applicability of approaches......................9
3.1. Pertaining to a particular link...........................9
3.2. Setting up general LSP behavior in the network...........10
4. Security Considerations.......................................11
5. IANA Considerations...........................................11
6. Normative References..........................................11
7. Acknowledgments...............................................11
Sudharsana, et al Expires January 06, 2016 [Page 2]
Internet-Draft Avoiding Preemption July 2015
8. Authors' Addresses............................................11
1. Introduction
When an LSP does not have configured hops as constraints, it
typically gets setup along the shortest path to the destination as
long as the required bandwidth is available. Links along the
shortest path are therefore mostly saturated with very less
remaining bandwidth to reserve. Low priority LSPs along such paths
are very likely to suffer preemption when a higher priority LSP
needs to be setup (Hard preemption of TE LSPs is described in
[RFC3209] and soft-preemption is described in [RFC5712]). When a low
priority LSP reroutes it does not try to avoid links with less
remaining bandwidth. It may therefore again be setup on another
heavily used path, as the goal is have optimal metric, with a high
chance of getting preempted again.
So a low priority LSP may suffer preemption multiple times before it
settles on a longer and less utilized path, finally owing to
bandwidth unavailability on shorter and congested paths, causing lot
of control plane churn in the network during the course.
+-----------+ +----------+
| | |\ | |
| Router A |--------+ \ | Router B |
+---------| |--------+ / | |--------+
| | | |/ | | |
| +-----------+ +----------+ |
+---------+ +---------+
| | | |
| Ingress | | Egress |
| | | |
+---------+ +---------+
| +-----------+ +----------+ |
| | | |\ | | |
+---------| Router C |--------+ \ | Router D |--------+
| |--------+ / | |
| | |/ | |
+-----------+ +----------+
Let's say the paths between Router A and B contributes to lesser cost
when establishing LSPs from Ingress to Egress, compared to the paths
between Router C and D. As a result the links lying along the paths
Sudharsana, et al Expires January 06, 2016 [Page 3]
Internet-Draft Avoiding Preemption July 2015
from A to B are saturated, with very less remaining bandwidth, as
usage increases.
Majority of the LSPs set up along the paths from Router A to B are
high priority LSPs. As there is bandwidth shortage along these
paths, low priority LSPs typically get moved out to the longer paths
which are along Router C to D. This happens only when the low
priority LSPs cannot be accommodated along any of the paths from
Router A to B
As long as the low priority LSPs are able to reserve bandwidth along
any of the paths from Router A to B they are placed in one of the
paths in that bundle from A to B, only to be preempted and moved to
another path within the bundle when a new higher priority LSP is
signaled.
The repeated preemption and rerouting can be avoided if the low
priority LSP can be setup along C to D, when the path along A to B
gets heavily used.
The objective is to avoid hot-links (ones with low remaining
bandwidth) when placing low priority LSPs, so that there is less
probability of them getting preempted repeatedly.
1.1. Scenarios that can cause repeated preemption of low priority LSPs
When a member link is removed from an aggregate bundle, it is
expected to create some congestion on the aggregate link. This
could increase the probability of preemption for any low priority
LSPs taking the aggregate link
When a member link is added to an aggregate bundle that is
currently congested or when a new link or router is added to a
network that is running hot, it could result in immediate spike in
the available bandwidth, attracting LSPs waiting to be setup, which
could include the low priority LSPs. But such a setup may be short
lived. Considering the congestion on the link, they could
subsequently get preempted.
When a particular link or router is to be taken down for
maintenance, it generally reroutes the traffic along other paths
causing congestion along those paths owing to the extra load from
the path under maintenance. This increases chances of preemption
for any low priority LSPs taking the alternate path.
It should be noted that in the scenarios specified above in which
the probability of repeated preemption of low priority LSP is high,
Sudharsana, et al Expires January 06, 2016 [Page 4]
Internet-Draft Avoiding Preemption July 2015
the use of resource affinities like link color may not be suitable
because the operator may want the low priority LSPs to use the
shortest path when the links along the shortest path are not
heavily utilized.
2. Recommendations
The idea is to avoid placing low priority LSPs on hot links, rather
than moving them away (by pre-empting them) when a higher priority
LSP needs to be setup. Hot links are the ones that are running
almost saturated with little unreserved bandwidth. The shortest
path(s) to the destination tend(s) to get hot under scale and heavy
usage. The following outlines two possible options to accomplish
this.
2.1. Transit based approach
A transit router should monitor the remaining bandwidth on all
attached links. When it falls below a threshold for a link, it
RECOMMENDED that the bandwidth subscription percentage, for low
priorities, on that link SHOULD be set to a value (i.e. reduced)
such that it prevents further placement of low priority LSPs on that
link.
This subscription percentage change on the link for low priorities
can be reversed when the remaining bandwidth at priority 0 increases
by a reasonable amount.
2.2. Sample procedure at transit
Here is a sample procedure to set per priority bandwidth
subscription on hot links. It is RECOMMENDED that the transit router
execute the following procedure on each of its attached links.
2.2.1. Inputs to the procedure:
a) Rem_bw_threshold%:
When remaining (unreserved) bandwidth at highest priority, on the
link, falls below this percentage action is to be taken.
b) Input_priority:
Sudharsana, et al Expires January 06, 2016 [Page 5]
Internet-Draft Avoiding Preemption July 2015
For priorities inferior to this, action is to be taken.
c) Subscription_percent%:
Subscription percentage to be used for setting per priority
subscription which is the action.
d) Igp_update_threshold%:
Percentage by which bandwidth utilization on a link should change to
qualify for an IGP TE update to be sent out of the box.
2.2.2. Periodic sub-procedure
a) Find the remaining available bandwidth on the link at priority 0
and see if it is below rem-bw threshold%. Note that the available
bandwidth should the actual value that is known on the router and
may be different from the value advertised in IGP TE.
b) If the remaining bandwidth (at priority 0) is below
rem-bw-threshold% of the total link capacity, the link qualifies
for action.
c) Action: Set the subscription on the link, for priorities in the
range between input_priority and 7, to value given by
subscription_percent
d) If the remaining bandwidth percentage (at priority 0) is above
rem-bw-threshold% of the total link capacity by at least
igp_update_threshold% of the links capacity, and the subscription
is not 100% for lower priorities on that link, it should be set to
100% (or a configured maximum subscription value).
Setting per priority bandwidth subscription will result in TE updates
being advertised by IGP for the link.
It is RECOMMENDED that the value of the subscription percentage
SHOULD NOT cause immediate preemption of any of the low priority LSPs
already taking the link. The link that gets selected for
subscription action, has at least (100 - rem_bw_threshold) % of its
capacity reserved. The subscription percentage that is set should be
more than current real reservation percentage which is (100 -
Sudharsana, et al Expires January 06, 2016 [Page 6]
Internet-Draft Avoiding Preemption July 2015
rem_bw_threshold) % so that none of the low priority LSPs that have
already reserved bandwidth on the link suffer preemption owing to
subscription.
2.2.3. Advantages:
a) This approach doesn't rely on IGP TE update, to identify when a
link qualifies to be hot or ceases to be one. So this procedure is
able to work even when the change in bandwidth usage leading to
toggling of links hotness state, is less than igp_update-
threshold%.
b) When per bandwidth subscription is set, IGP TE update is
triggered, and this enables all nodes to avoid placing low
priority LSPs on the given link.
2.3. Ingress based approach:
The ingress should monitor all the links in its TE database. When
the remaining bandwidth at priority 0, for any link falls below a
given threshold, it is RECOMMENDED that Ingress SHOULD instrument
its view of TE database to reflect a lesser available bandwidth for
lower priorities on that link than actually is available.
This instrumented view can be reversed for a link when the remaining
bandwidth at priority 0 increases by a reasonable amount.
2.4. Sample procedure at ingress
Here is a sample procedure local to ingress to create an
instrumented view of TE database that helps avoid saturated links
when computing path for low priority LSPs.
2.4.1. Inputs to the procedure:
a) Rem_bw_threshold%:
When remaining (unreserved) bandwidth at highest priority, on the
link, falls below this percentage action is to be taken.
b) Input_priority:
For priorities inferior to this, action is to be taken.
Sudharsana, et al Expires January 06, 2016 [Page 7]
Internet-Draft Avoiding Preemption July 2015
c) Subscription_percent%:
Subscription percentage to be used for arriving at the available
bandwidth at low priorities for the link.
d) Igp_update_threshold%:
Percentage by which bandwidth utilization on a link should change
to qualify for an IGP TE update to be sent out of the box.
2.4.2. Procedure at ingress
For each link in TE database, the following procedure is executed.
a) Find the remaining available bandwidth on the link at priority 0,
from the data available in TE database and see if it is below rem-
bw threshold%
b) If the remaining bandwidth (at priority 0) is below rem-bw-
threshold% of the total link capacity, the link qualifies for
action.
c) Action: Set the available bandwidth on the link in TE database,
for priorities in the range between input_priority and 7, to a
value derived by application of subscription_percent, on the total
link capacity, taking into account the existing reservations at
that priority, as can be obtained from the TE database.
d) If the remaining bandwidth percentage (at priority 0) is above
rem-bw-threshold% of the total link capacity by at least
igp_update_threshold% of the links capacity, the instrumented view
is reversed and the actual values received from IGP TE updates can
be used.
Just as the case in transit based approach, it is RECOMMENDED that
the value of the subscription percentage SHOULD NOT cause immediate
preemption of any of the low priority LSPs already taking the link.
Every time a TE update for the link is received, if it ceases to be
hot or becomes one owing to the update, link capacity for low
priorities can be modified based on subscription-configuration.
Sudharsana, et al Expires January 06, 2016 [Page 8]
Internet-Draft Avoiding Preemption July 2015
2.4.3. Disadvantages over transit based approach
The local instrumented view does not get sent in TE updates. Hence
other Ingress routers not following the procedure will still use the
congested link for low priority LSPs.
When a link qualifies as hot owing to placement of an LSP whose
reservation did not cross igp_update_threshold, none of the
computing nodes other than the ingress that initiated the concerned
LSP, get to know about the hotness of the link, even if we assume
all of them follow the same procedure.
This problem can be mitigated by having all routers use an adaptive
igp_update_threshold rather than using a static one. This requires
all the routers to send more frequent updates when link utilization
gets closer to a threshold.
2.4.4. Advantages:
This being an ingress based solution, transit routers do not have to
be configured or upgraded.
If owing to the local instrumented view obtained by applying
subscription, a certain low priority LSP is not coming up after
multiple retries, then the ingress can choose to relax the
subscription and try to bring up the low priority LSP. Such a
situation cannot be detected and fixed in the transit based
approach.
3. Use cases and applicability of approaches
The following sections discuss the applicability of transit and
ingress based approaches in various situations.
3.1. Pertaining to a particular link
When a member link is removed from an aggregate bundle, it is
expected to create some congestion on the aggregate link. Before
removing the member link, the subscription percentage may be set for
low priorities such that the remaining bandwidth is utilized by
higher priority LSPs.
When a member link is added to an aggregate bundle that is currently
congested or when a new link or router is added to a network that is
Sudharsana, et al Expires January 06, 2016 [Page 9]
Internet-Draft Avoiding Preemption July 2015
running hot, it could result in immediate spike in the available
bandwidth, attracting LSPs waiting to be setup, which could include
the low priority LSPs. Setting the subscription for low priorities
accordingly, before adding the member link, could avoid the
situation of low priority LSPs getting placed on the new member link
given that the aggregate link is on a congested path.
It should be noted that these use cases do not require any
monitoring of links and the link(s) requiring action is known in
advance. The transit router based approach is better suited for
these cases. That is, in the above situations the subscription
percentage for lower priorities may be decreased on the link without
checking for the conditions described in Section 3.2, and the
procedure in Section 3.2 may be executed fully after a user-
configured time period.
3.2. Setting up general LSP behavior in the network
In a network, an admission control behavior that differentiates
between the LSPs based on their priority, overriding actual
bandwidth availability at the priority, can be setup in two ways.
One approach is to start with a behavior where there is no
differentiation based on priorities. Then the low priority LSPs are
denied portion of the total capacity when links become hot as is
discussed in earlier sections of this document. This approach
assumes steady state and no congestion in the network as a whole to
start with.
Another could be to deny bandwidth for low priority LSPs on any link
that comes up. Then gradually with the passage of time, the
available bandwidth for low priority LSPs can be increased if the
utilization does not spike. With every preemption the gradual
increase in available bandwidth for low priorities can be backed off
exponentially. This approach assumes congestion and that the entire
network is running hot, when any link is brought up.
Other situations where the affected link is not precisely known in
advance is the maintenance use case. When a particular link or
router is to be taken down for maintenance, it generally reroutes
the traffic along other paths causing congestion along those paths
owing to the extra load from the path under maintenance. The
subscription on the links of the alternate paths can be set to limit
the low priority LSPs on those paths.
For setting up general network behaviors and for handling situations
where affected link is not known in advance, the ingress based
Sudharsana, et al Expires January 06, 2016 [Page 10]
Internet-Draft Avoiding Preemption July 2015
approach is better suited. The ingress could just monitor all
network links or a subset of them identified by policy.
The transit based approach coupled with a management script also can
be used though.
4. Security Considerations
The recommendations do not present any new security concerns.
5. IANA Considerations
This document makes no request of IANA.
6. Normative References
[RFC2119] Bradner, S., "Key words for use in RFCs to Indicate
Requirement Levels", BCP 14, RFC 2119, March 1997.
[RFC3209] Awduche, D., "RSVP-TE: Extensions to RSVP for LSP
Tunnels", RFC 3209, December 2001.
[RFC5712] Meyer, M., Ed. and JP. Vasseur, Ed., "MPLS Traffic
Engineering Soft Preemption", RFC 5712, January 2010.
7. Acknowledgments
We are grateful to Yakov Rekhter for his contributions to the
development of the idea and thorough review of content of the draft.
Thanks to Vishnu Pavan Beeram and Harish Sitaraman for their
comments and inputs.
8. Authors' Addresses
Sudharsana Venkataraman
Juniper Networks
Email: sudharsana@juniper.net
Chandra Ramachandran
Juniper Networks
Email: csekar@juniper.net
Raveendra Torvi
Juniper Networks
Email: rtorvi@juniper.net
Sudharsana, et al Expires January 06, 2016 [Page 11]
Internet-Draft Avoiding Preemption July 2015
Sudharsana, et al Expires January 06, 2016 [Page 12]