Internet DRAFT - draft-mjsraman-panet-intra-as-psp-te-leak
draft-mjsraman-panet-intra-as-psp-te-leak
PANET Working Group Shankar Raman
Internet-Draft Balaji Venkat Venkataswami
Intended Status: Experimental RFC Gaurav Raina
Expires: November 4, 2013 IIT Madras
May 3, 2013
Building power shortest inter-Area TE LSPs using pre-computed paths
draft-mjsraman-panet-intra-as-psp-te-leak-01
Abstract
In this paper, we propose a framework to reduce the aggregate power
consumption of an Autonomous System (AS) using a collaborative
approach between areas within an AS. We identify the low-power paths
within non-backbone areas and then use Traffic Engineering (TE)
techniques to route the packets along the stitched paths from non-
backbone areas / backbone area to other non-backbone areas. Such low-
power paths can be identified by using the power-to-available-
bandwidth (PWR) ratio as an additional constraint in the Constrained
Shortest Path First (CSPF) algorithm. For routing the data traffic
through these low-power paths, the Inter-Area Traffic Engineered
Label Switched Path (TE-LSP) that spans multiple areas can be used.
Extensions to the Interior Gateway Protocols like OSPF and IS-IS that
support TE extensions can be used to disseminate information about
low-power paths in the respective areas (backbone or non-backbone)
that minimize the PWR ratio metric on the links within the areas and
between the areas thereby creating a collaborative approach to reduce
the power consumption.
The feasibility of our approaches is illustrated by applying our
algorithm to an AS with a backbone area and several non-backbone
areas. The techniques proposed in this paper for the Inter-Area power
reduced paths require a few modifications to the existing features of
the IGPs supporting TE extensions. The proposed techniques can be
extended to other levels of Internet hierarchy, such as Inter-AS
paths, through suitable modifications as in [11].
When link state routing protocols like OSPF or ISIS are used to
discover TE topology, there is the limitation that traffic engineered
paths can be set up only when the head and tail end of the label
switched path are in the same area. There are solutions to overcome
this limitation either using offline Path Computation Engine (PCE)
that attach to multiple areas and know the topology of all areas.
This document proposes an alternative approach that does not require
any centralized PCE and uses selective leaking of low-power TE path
information from one area into other areas.
Shankar Raman et.al Expires November 4, 2013 [Page 1]
INTERNET DRAFT Building power shortest inter-Area TE-LSPs May 3, 2013
Status of this Memo
This Internet-Draft is submitted to IETF in full conformance with the
provisions of BCP 78 and BCP 79.
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF), its areas, and its working groups. Note that
other groups may also distribute working documents as
Internet-Drafts.
Internet-Drafts are draft documents valid for a maximum of six months
and may be updated, replaced, or obsoleted by other documents at any
time. It is inappropriate to use Internet-Drafts as reference
material or to cite them other than as "work in progress."
The list of current Internet-Drafts can be accessed at
http://www.ietf.org/1id-abstracts.html
The list of Internet-Draft Shadow Directories can be accessed at
http://www.ietf.org/shadow.html
Copyright and License Notice
Copyright (c) 2013 IETF Trust and the persons identified as the
document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal
Provisions Relating to IETF Documents
(http://trustee.ietf.org/license-info) in effect on the date of
publication of this document. Please review these documents
carefully, as they describe your rights and restrictions with respect
to this document. Code Components extracted from this document must
include Simplified BSD License text as described in Section 4.e of
the Trust Legal Provisions and are provided without warranty as
described in the Simplified BSD License.
Table of Contents
1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.1 Terminology . . . . . . . . . . . . . . . . . . . . . . . . 4
1.1 Low-power routers and switches . . . . . . . . . . . . . . . 4
1.2 Power reduction using routing and traffic engineering . . . 4
2. Methodology of the proposal . . . . . . . . . . . . . . . . . 6
2.1 ABR Operation . . . . . . . . . . . . . . . . . . . . . . . 6
2.1.1 Methodology . . . . . . . . . . . . . . . . . . . . . . 7
Shankar Raman et.al Expires November 4, 2013 [Page 2]
INTERNET DRAFT Building power shortest inter-Area TE-LSPs May 3, 2013
2.1.2 ERRATA . . . . . . . . . . . . . . . . . . . . . . . . . 11
2.1.3 Power Bias . . . . . . . . . . . . . . . . . . . . . . . 11
2.1.4 Advertising Available POWER . . . . . . . . . . . . . . 12
2.1.5 ECMP links . . . . . . . . . . . . . . . . . . . . . . . 12
2.1.6 Dampening the side effects of constant change . . . . . 12
2.1.7 Calculating power shortest paths in an Area . . . . . . 12
2.1.8 Power profiles of Routers and Switches . . . . . . . . . 13
2.1.8.1 Concave and Convex power curves . . . . . . . . . . 15
2.1.8.2 Need to advertise both available power and
consumed power . . . . . . . . . . . . . . . . . . . 17
2.1.9 Power to Available Bandwidth ratio in a TLV . . . . . . 17
2.2 TE Path Head-end Operation . . . . . . . . . . . . . . . . . 20
2.2 Suppression of Frequent updates owing to fluctuation in
power and bandwidth . . . . . . . . . . . . . . . . . . . . 22
2.3 Advantages . . . . . . . . . . . . . . . . . . . . . . . . . 23
3 Security Considerations . . . . . . . . . . . . . . . . . . . . 24
4 IANA Considerations . . . . . . . . . . . . . . . . . . . . . . 24
5 References . . . . . . . . . . . . . . . . . . . . . . . . . . 24
5.1 Normative References . . . . . . . . . . . . . . . . . . . 24
5.2 Informative References . . . . . . . . . . . . . . . . . . 24
Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . . 25
Shankar Raman et.al Expires November 4, 2013 [Page 3]
INTERNET DRAFT Building power shortest inter-Area TE-LSPs May 3, 2013
1 Introduction
Estimates of power consumption for the Internet predict a 300%
increase, as access speeds increase from 10 Mbps to 100 Mbps [3],
[8]. Access speeds are likely to increase as new video, voice and
gaming devices get added to the Internet. Various approaches have
been proposed to reduce the power consumption of the Internet such as
designing low-power routers and switches, and optimizing the network
topology using traffic engineering methods [2].
1.1 Terminology
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
"SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
document are to be interpreted as described in RFC 2119 [RFC2119].
1.1 Low-power routers and switches
Low-power router and switch design aim at reducing the power consumed
by hardware architectural components such as transmission link,
lookup tables and memory. In [4] it is shown that the router's link
power consumption can vary by 20 Watts between idle and traffic
scenarios. Hence the authors suggest having more line cards and
running them to capacity: operating the router at full throughput
will lead to less power per bit, and hence larger packet lengths will
consume lower power. The two important components in routers that
have received attention for high power consumption are buffers and
TCAMs. Buffers are built using dynamic RAM (DRAM) or static RAM
(SRAM). SRAMs are limited in size and consume more power, but have
low access times. Guido [1] states that a 40Gb/s line card would
require more than 300 SRAM chips and consume 2:5kW. DRAM access times
prevent them from being used on high speed line cards. Sometimes the
buffering of packets in DRAM is done at the back end, while SRAM is
used at the front end for fast data access. But these schemes cannot
scale with increasing line speeds. Some variants of TCAMs have been
proposed for increasing line speeds and for reduced power consumption
[7].
1.2 Power reduction using routing and traffic engineering
At the Internet level, creating a topology that allows route
adaptation, capacity scaling and power-aware service rate tuning,
will reduce power consumption. In [8] the author has proposed a
technique to traffic engineer the data packets in such a way that the
link capacity between routers is optimized. Links which are not
utilized are moved to the idle state. Power consumption can be
reduced by trading off performance related measures like latency. For
Shankar Raman et.al Expires November 4, 2013 [Page 4]
INTERNET DRAFT Building power shortest inter-Area TE-LSPs May 3, 2013
example, power savings while switching from 1 Gbps to 100 Mbps is
approximately 4 W and from 100 Mbps to 10 Mbps around 0:1 Watts.
Hence instead of operating at 1 Gbps the link speed could be reduced
to a lower bandwidth under certain conditions for reduced power
consumption.
Multi layer traffic engineering based methods make use of parameters
such as resource usage, bandwidth, throughput and QoS measures, for
power reduction. In [6] an approach for reducing Intra-AS power
consumption for optical networks that uses Djikstra's shortest path
algorithm is proposed. The input to this method assumes the existence
of a network topology using which an auxiliary graph is constructed.
Power optimization is done on the auxiliary graph and traffic is
routed through the low-power links. However, the algorithm expects
the topology to be available for getting the auxiliary graph. This
topology is easy to obtain for Intra-AS scenario, but by using a
centralized PCE (Path Computation Element) as in a hierarchical PCE
approach. Here for each area a PCE is assigned and each such PCE
calculates the path from a head-end router to a tail-end router, both
falling within the same area. When TE paths have to be stitched
across several areas then the hierarchical PCE which may be one level
up from the respective area PCEs is contacted for such a stitching.
In our approach, we propose a collaborative approach by the
respective areas in calculating low-power paths that result in power
reduction within an AS. This document proposes an alternative
approach that does not require any centralized PCE and uses selective
leaking of low-power TE path information from one area into other
areas. The core of most ISP ASes use the Multi-Protocol Label
Switching (MPLS) technology. MPLS label switched paths that traverse
multiple areas carry traffic from a head-end to a tail-end that can
be situated in different areas within the AS. The AS uses the
Interior Gateway Protocol (IGP) for exchanging routing related
information. The topology of one area is not revealed to the other in
OSPF-TE and IS-IS-TE.
The CSPF algorithm as proposed here is run on a specific area with
the available power-to-bandwidth (PWR) ratio as a constraint, to
determine "k" (where k is a suitable number) low-power-paths from the
head-end to the tail-end within the same area. The low-cost power
paths that minimize the PWR ratio can be exchanged among the
collaborating areas using IGP-TE TLVs that we propose in this
document. Explicit routing using RSVP-TE (for signalling) then can be
achieved between the head-end and the tail-end routers traversing
multiple areas through these low-power paths connecting the head-end
and tail-end using the Inter-Area Traffic Engineered Label Switched
Path (TE-LSP) that span multiple areas.
Shankar Raman et.al Expires November 4, 2013 [Page 5]
INTERNET DRAFT Building power shortest inter-Area TE-LSPs May 3, 2013
2. Methodology of the proposal
There are three known solutions to inter-area TE
(a) hop expansion at area boundaries where the head end can only
choose the path to area boundary rather than right to tail end,
(b) centralized PCE is attached to all areas and is aware of entire
topology, and
(c) path stitching by designating ABRs acting as BGP route
reflectors.
It is of course possible to build out low-power paths through the
above techniques but they suffer limitations such as not knowing for
certain whether the path exists a-priori. This document proposes a
technique where a-priori low-power paths are pre-computed in the
various areas and are leaked into other areas so that provisioning
these paths is done much more quicker than is otherwise possible.
Assume {N} as the set of nodes in a network running link state
routing protocol and {N' } be the set of nodes that are known to be
the endpoints of the traffic engineered paths. The topology {N, E}
has been divided into hierarchical areas with backbone area as the
second level that connects first level of all non-backbone areas. We
assume the network runs either OSPF-TE or ISIS-TE for establishing TE
paths. The set of nodes {N'} can be situated in any non-backbone area
or the backbone area. Nodes in {N'} may become aware of being
potential endpoints through offline configuration.
Once the nodes in {N'} become aware of being TE endpoints, they
advertise themselves in a special TLV in TE link state information.
We would term this "TE Endpoint TLV". In OSPF, they would advertise a
newly defined TLV in TE LSA and in ISIS, they would advertise a newly
defined TLV in TE LSP. Apart from nodes in {N'} the area border
routers or ABRs advertise another newly defined TLV that we would
term as "Area Border TLV".
2.1 ABR Operation
Apart from standard OSPF/ISIS ABR functions, each ABR should discover
the TE endpoints in every area attached to it. Assume for an ABR, let
the set discovered be {Ai, Nj}. The ABR should compute k-power-
shortest-paths to every element in {Ai, Nj} based on the constraints
applicable to the network. The constraint applied here is the
minimization of the PWR ratio which is defined as follows.
Shankar Raman et.al Expires November 4, 2013 [Page 6]
INTERNET DRAFT Building power shortest inter-Area TE-LSPs May 3, 2013
For a given router that is an ABR for an area (straddling the
backbone and non-backbone), a set of k-shortest paths that can be
potentially be used as a link towards a TE endpoint are identified.
2.1.1 Methodology
For each router / switch there exist linecards and each linecard has
a set of ports or sometimes just one port of high capacity. This
usually applies on routers and switches that are either single
chassis or multi-chassis in their characterisation. By single chassis
we mean that there exists a single chassis and slots for the Route
Processor Card (one or more of these) typically upto to two of them,
and one or more slots for linecards each having their respective
characteristics such as number of ports (port density), type of such
ports (SONET, ethernet, ATM etc..) usually depending on the link
layer technology they support. Links are connections between ports on
these linecards to other ports on linecards of other single chassis
or multi-chassis system. A multi-chassis system is one that has
multiple such chassis interconnceted amongst each other to form a
single logical view of the system. Both single and multi-chassis have
linecards and respective ports on these linecards. Multi-chassis
typically have a switch fabric chassis which connects each of these
chassis to each other or to chassis of other multi-chassis or single
chassis systems.
Shankar Raman et.al Expires November 4, 2013 [Page 7]
INTERNET DRAFT Building power shortest inter-Area TE-LSPs May 3, 2013
Consider the following topology as one that falls within an area...
Router A Router B Router C
+---+---+ +---+---+ +-------+
| | | | | | | | |
|LC1|LC2| |LC1|LC2| |LC1|LC2|
| | | | | | L11 | | |
| P1| P1| | P1| P1|-------------- P1| P1|---+
| P2| P2|--+ | P2| P2| L12 | P2| P2| |
| P3| P3| | L4 | P3| P3|-------------- P3| P3| |
| P4| P4|--+----------- P4| P4| +---- P4| P4| |
| P5| P5| | +----P5| P5--+ L5 | | P5| P5| |
| | | | | | | | | | | | | | | | |
+-|-+-|-+ |L3 | +---+---+ | | +---+-|-+ | L13
| | | +------------+-------+ | |
| |L2 | L5 | | |
| +----+------------+ | | |
| | | | | |
|L1 | | |L6 | |
| | Router D | | Router E L12| | Router F
| | +---+---+ | | +---+---+ | |+-------+
| | | | | |L2 | | | | | || | |L
| | |LC1|LC2| | | |LC1|LC2| | ||LC1|LC2|1
| | | | | | | | | | | || | |4..
| +-| P1| P1---+ | | P1| P1|------+ || P1| P1|->
| | P2| P2| L7 +--- P2| P2| +--P2| P2|->
| | P3| P3|-------------- P3| P3| L10 | P3| P3|->
+----------| P4| P4| +---- P4| P4|-------------- P4| P4|
| P5| P5| | +-- P5| P5| +----- P5| P5|
| | | | | | | | | | | | |
+-|-+---+ L8 | | +---+---+ L9 | +---+---+
+---------------+ +------------------+
Shankar Raman et.al Expires November 4, 2013 [Page 8]
INTERNET DRAFT Building power shortest inter-Area TE-LSPs May 3, 2013
The table of links between the various routers (which are assumed to
be single chassis systems) is as follows...
+--------+----------+-----------+-----------+-----------+----------+
| Links | Routers | LC <> LC | Port Conn.| Capacity |Available |
| | | | | |Bandwidth |
+--------+----------+-----------+-----------+-----------+----------+
| L1 | A <> D | LC1<>LC1 | P5<>P4 | 10G | 7.5 |
| L2 | A <> D | LC2<>LC2 | P5<>P1 | 10G | 6.0 |
| L3 | A <> D | LC2<>LC1 | P2<>P1 | 10G | 4.0 |
| L4 | A <> B | LC2<>LC1 | P4<>P4 | 10G | 3.0 |
| L5 | B <> C | LC1<>LC1 | P5<>P4 | 10G | 3.5 |
| L6 | B <> E | LC1<>LC1 | P6<>P2 | 10G | 1.0 |
| L7 | D <> E | LC2<>LC1 | P3<>P3 | 10G | 6.0 |
| L8 | D <> E | LC1<>LC1 | P5<>P4 | 10G | 1.5 |
| L9 | E <> F | LC1<>LC2 | P5<>P5 | 100G | 20.0 |
| L10 | E <> F | LC2<>LC1 | P4<>P4 | 10G | 2.5 |
| L11 | B <> C | LC2<>LC1 | P1<>P1 | 10G | 3.0 |
| L12 | E <> C | LC2<>LC2 | P1<>P5 | 10G | 2.0 |
| L13 | C <> F | LC2<>LC1 | P1<>P2 | 10G | 1.0 |
| L14 | F <> OA | LC2<> | P1<> | | |
| | | | | | |
+--------+----------+-----------+-----------+------------+---------+
In the above topology assume all point-to-point links between the
routers. For now we will deal with P2P links alone and not venture
into Broadcast Multi-access links or Non-Broadcast Multi-access links
etc.. It is suffice to show how the scheme works for P2P links and
then move more specifically to other types of networks to demonstrate
this method of calculating the power topology of the network in the
figure.
Each linecard consumes a certain amount of power and it is vendor
dependent as to how the power consumed relates to the Available
Bandwidth on any of the links to which the linecard connects to. It
is possible that the said topology of routers come from one vendor or
from multiple vendors. It is assumed that the algorithm proposed will
have the power consumed by a linecard available as a readable value
in terms of W or kW or whichever measurable metric that is provided
by the vendor.
It is possible that some of the Linecards are more capable than the
others. Consider that Router A is a more capable router with more
powerful linecards with higher port density. This is not shown in the
figure, but assume so. LC1, LC2 on Router A could be consuming more
power than the other Linecards on other routers. The main reason
could be that LC1 and LC2 may have higher port density or higher
Shankar Raman et.al Expires November 4, 2013 [Page 9]
INTERNET DRAFT Building power shortest inter-Area TE-LSPs May 3, 2013
speed ports than the other routers. In order to calculate the power
consumed on a link by a linecard it is important that we normalize
the power as power consumed per port. Here the ports are normalized
to lowest common denominator. If all links in the topology have 10G
port capacity then the power calculated should be in terms power
consumed per 10G port.
Assuming we have done this normalization we go on to calculate the
POWER metric for each of the ports involved in a link which is
derived as follows...
POWER metric = Power consumed per XG (normalized bandwidth) port
for a given -------------------------------------------------
Port on a LC Available Bandwidth on that port
Assume link L1. The ports concerned are both 10G and the ports are P5
on Router A and P4 on Router D. For calculating the POWER metric for
a link which we will call PWRLINK we calculate the POWER metric for
each side of the link and average the two to get PWRLINK.
So PWRLINK for L1 = POWER for P5 on LC1 + Power for P4 on LC1
on Router A on Router D
============================================
2
The above can also be weighted if there is a multi-capacity port on
one side of the link and not on the other. A multi-capacity link is
one which provides multiple bandwidth capabilities such (1G/10G/100G)
for example but auto-negotiates with other end to provide a lesser
than highest capacity service.
The PWRLINK metrices once calculated are flooded in already defined
OSPF-TE-LSA as an adapted TE-metric and is typically flooded as a
link characteristic.
It is important to note that the denominator for POWER metric is
Available Bandwidth instead of Available Bandwidth on that port. The
Available Bandwidth is measured in terms of intervals and not as
discrete quantities. This is in order not to flood PWRLINK metrics
into the OSPF area in LSAs very frequently as Bandwidth may
constantly change. The same applies to POWER metric as well.
Once the LSAs have been flooded the Routers run CSPF on the graph of
the topology with PWRLINKs assigned to the links and calculate the
PWRLINK based paths which consume the least power. The shortest power
paths based on this topology can be used for forwarding high
bandwidth streams and to optimally use power within the area.
Shankar Raman et.al Expires November 4, 2013 [Page 10]
INTERNET DRAFT Building power shortest inter-Area TE-LSPs May 3, 2013
The Available Bandwidth column shows the Available bandwidth of the
link corresponding to the row and column intersection. This figure is
used as the numerator in the POWER metric computation for that port.
2.1.2 ERRATA
ERRATA : Previously the experiments were carried out with Available
Utilization since only 10G and 100G ports were considered. This
baselines the metric to 10G ports and proportionality thereof. But in
reality the actual Available Bandwidth needs to be considered for
real world experiments. Hence this draft has been changed to reflect
the Available Bandwidth to be taken as the denominator of the formula
thereof.
In our previous experiments the 100G link if it showed a utilization
of 0.2 would end up as a high POWER metric and hence would be totally
avoided. In reality this link may have been a more power optimal link
given that if it had a first power profile (Please refer section on
Power Profiles). Dividing the Power consumed or Available Power by
the Available Bandwidth gives a better picture of how much power cost
per Gb is consumed and normalizes the metric amongst links of varying
bandwidth.
An earlier version of this document rev-00 contained a different
algorithm to compute the k-shortest-power-paths. From the
experimental results gathered it was seen that the said algorithm was
prone to errors with respect to direction of traffic and
unnecessarily complex for the solution. Hence it has been set aside
for a more simple yet better one mentioned in this revision.
2.1.3 Power Bias
Assume in the figure that there exist Routers A and D and that there
is a bias on the link L1 in such a way that Router D computes a POWER
metric of 10 and the Router D computes a POWER metric of 2 on the
ports P5 and P4 respectively. Now the PWRLINK would be 6 for that
link L1. Thus even if one side is excessively power guzzling then the
PWRLINK moves up and thus is less preferred in the CSPF algorithm and
path computation based on the Power topology.
If there is no bias and both the sides of the link are optimal in
their power usage then the metric stays low even if more streams are
sent on it. This is the main objective that is set out for router and
switch manufacturers in the single chassis and multi-chassis world,
in that they are incentivised to manufacture linecards that are not
power hungry even if the number of packets flowing through them is
high and thus the Bandwidth Available is also reasonably on the
higher side compared to other routers.
Shankar Raman et.al Expires November 4, 2013 [Page 11]
INTERNET DRAFT Building power shortest inter-Area TE-LSPs May 3, 2013
For those manufacturers who set a high power value for even minimal
traffic, the vendors that dont would win out in the end.
2.1.4 Advertising Available POWER
Please see section 2.1.8 for more information on why Available POWER
plays a crucial role in determining the choice of routers based on
the Power metric.
2.1.5 ECMP links
It is possible that multiple links would have the same PWRLINK metric
after a computation cycle. In such a case load-balancing techniques
can be used to keep the ECMP links in a steady state with respect to
each other. Depending on the Available Bandwidth thereafter it is
possible that the ECMP links may no longer be Equal cost but UCMP or
Unequal Cost Paths.
2.1.6 Dampening the side effects of constant change
It is recommended in this draft that the implementation of the
proposal be adaptive, infrequent in computation to the extent
possible without sacrificing adapting to the dynamism and also reduce
any frequent oscillations. The actual methods to adopt for this
computation are outside the scope of this document.
2.1.7 Calculating power shortest paths in an Area
Assume the following topology where A,B,C etc.. are routers and
corresponding labelled edges with weights are the links. These
weights are the current values of the PWRLINK attribute that has been
flooded in the LSAs through the Area concerned. Assume B is the ABR
for Area 1 and the routers A and C are the Area 0 core routers. The
rest of the routers are assumed to be in Area 1. Once the power
topology of the Area 1 has been calculated as shown below with the
PWRLINK attributes being assigned to the links, Constrained shortest
path can be run from the ABR to any of the other routers say H, E , X
etc.. The CSPF algorithm takes the constraint in terms of the PWRLINK
attributes along with other attributes to construct a power shortest
path from say router B to other routers in Area 1.
Shankar Raman et.al Expires November 4, 2013 [Page 12]
INTERNET DRAFT Building power shortest inter-Area TE-LSPs May 3, 2013
0.5
(C) +----------------+
0.5| / |
| / |
0.05 V/ 0.1 0.03 0.2 V
(A)--->(B)--->(D)--->(G)--->(H)
| | |
| 0.5| | 0.1
| V V
+----------->(E)--->(X)
0.5 0.3
Once the path has been computed it is possible to use RSVP-TE to
construct the power shortest path with the TE-LSP being instantiated
with the labels appropriately placed in the routers on the power
shortest path. In this topology, assume one would want to construct a
path from B to X then the dotted path shows the path constructed and
to be used by a set of flows or streams of packets belonging to
multiple flows as seen fit by the router B. If the PWRLINK metrics
change after due course of time then another power shortest path that
possibly traverses the same path (if the SUM of PWRLINKs doesnt
exceed any other path's metrics' SUM) or some other path would be
constructed. Specifically this method makes use of traffic-
engineering signalling protocols as the method to place the streams
from point X to point Y (where X and Y are routers).
0.5
(C) +----------------+
0.5| / |
| / |
0.05 V/ 0.1 0.03 0.2 V
(A)--->(B)...>(D)...>(G)...>(H)
| | .
| 0.5| . 0.1
| V V
+----------->(E)--->(X)
0.5 0.3
2.1.8 Power profiles of Routers and Switches
It has been experimented and from several sources found that there
exist routers which have different power profiles. The power profile
of a router is the curve of power consumption to available bandwidth.
Mentioned below are a few of these prominent ones that have to be
taken into consideration.
The first profile that we will consider is the flattening curve. The
power consumed to available bandwidth curve takes the shape of a
Shankar Raman et.al Expires November 4, 2013 [Page 13]
INTERNET DRAFT Building power shortest inter-Area TE-LSPs May 3, 2013
steep one initially and then tapers off to a plateau. The point at
which it begins to give a delta-C (delta in Power Consumed) to delta-
B (Available Bandwidth exhausted) is the inflection point that tapers
off to a plateau. Here the delta-C/delta-B begins to slow down or
decrease rapidly. The more the traffic that is added onto the device
the lesser it draws power.
^
|
P | .
o | .
w | .
e | .
r | .
| .
c | .
o | .
n | .
s | .
u.| .
------------------------------------>
| Available Bandwidth exhausted
The second profile that we will consider is the exponential curve.
The power consumed to available bandwidth curve takes the shape of an
ever increasing steep curve as shown below. Here the delta-C/delta-B
begins to increase as more traffic is thrown onto it as the Available
bandwidth exhausted increases. This power curve beyond a point is
intolerable with respect to power guzzling.
^
|
P | .
o | .
w | .
e | .
r | .
| .
c | .
o | .
n | .
s | .
u.| .
------------------------------------>
| Available Bandwidth exhausted
Shankar Raman et.al Expires November 4, 2013 [Page 14]
INTERNET DRAFT Building power shortest inter-Area TE-LSPs May 3, 2013
The third profile that we will consider is a linear curve. In other
words just a straight line. Here delta-C/delta-B is a constant.
^
|
P | .
o | .
w | .
e | .
r | .
| .
c | .
o | .
n | .
s | .
u.| .
------------------------------------>
| Available Bandwidth exhausted
2.1.8.1 Concave and Convex power curves
Given that there are 3 kinds of major profiles in the router power
consumption, what line would we like to pick. This is an important
point when choosing the metric to pick the low power paths.
(a) If the confrontation is between 2 first profile routers the lower
of the 2 would be considered as shown below. The lower curve offers
better power savings for each GB of bandwidth transported.
^
|
P | .
o | .
w | . .
e | . .
r | . .
| . .
c | . .
o | . .
n | . .
s | . .
u.| .
------------------------------------>
| Available Bandwidth exhausted
Shankar Raman et.al Expires November 4, 2013 [Page 15]
INTERNET DRAFT Building power shortest inter-Area TE-LSPs May 3, 2013
(b) If the confrontation is between 2 second profile routers the
upper curve offers more power savings per GB of bandwidth.
^
|
P | . .
o | . .
w | . .
e | . .
r | . .
| . .
c | . .
o | . .
n | . .
s | .
u.| .
------------------------------------>
| Available Bandwidth exhausted
(c) When the confrontation is between a first profile curve and a
second profile curve, it would be optimal to pick (as shown below)
the lower of the curves because it gives us lesser power consumed for
every GB of traffic routed / switched. Here the exponential curve is
the one that offers lesser amount of power consumed per GB of traffic
is chosen. But when it gets to a point that the two curves intersect
it would be more optimal to pick the tapering curve. Thus at the
meeting point of the 2 curves the exponential curve becomes more
costly and the tapering one gives us more GB for the power buck. Thus
this switchover from one curve to the other (in other words from the
exponential curve to the tapering one) does the trick in terms of
finding an optimal solution.
Shankar Raman et.al Expires November 4, 2013 [Page 16]
INTERNET DRAFT Building power shortest inter-Area TE-LSPs May 3, 2013
^ .
| .
P | . .
o | (*)
w | . .
e | . .
r | . .
| . .
c | . .
o | . .
n | . .
s | . .
u.| ..
------------------------------------>
| Available Bandwidth exhausted
(*) Metric switchover point from Consumed Power to Available
Power.
2.1.8.2 Need to advertise both available power and consumed power
Thus the above sections have shown that both the available power and
the consumed power MUST be advertised so that case (c) can be
deciphered and the switchover of the curves be done and the
appropriate router be chosen for the rest of the bandwidth to be
switched over to.
Thus there will exist Consumed-Power to Available Bandwidth ratio and
the Available Power to Available Bandwidth ratio. Both the ratios are
computed and the lower value chosen. The Available Power can be
judged from the calibration process such as the one carried out by
independent test organizations as in [12]. An example of their
calibration is referred to in [12].
Here given below is the formula for calculating the Available Power
to Available Bandwidth ratio also called the Available POWER metric.
Available
POWER metric = Available Power consumed per XG
(normalized bandwidth) port
for a given ----------------------------------
Port on a LC Available Bandwidth on that port
2.1.9 Power to Available Bandwidth ratio in a TLV
As per [RFC3630] the Link TLV can be used to carry this power to
available Bandwidth ratio with an additional sub-TLV of the link TLV.
The sub-type number 11 is recommended to be defined for this purpose.
Shankar Raman et.al Expires November 4, 2013 [Page 17]
INTERNET DRAFT Building power shortest inter-Area TE-LSPs May 3, 2013
[RFC 3630] states in section 2.2.1 and we QUOTE ...
2.1.10 Link TLV
The Link TLV describes a single link. It is constructed of a set of
sub-TLVs. There are no ordering requirements for the sub-TLVs.
Only one Link TLV shall be carried in each LSA, allowing for fine
granularity changes in topology.
The Link TLV is type 2, and the length is variable.
The following sub-TLVs of the Link TLV are defined:
1 - Link type (1 octet)
2 - Link ID (4 octets)
3 - Local interface IP address (4 octets)
4 - Remote interface IP address (4 octets)
5 - Traffic engineering metric (4 octets)
6 - Maximum bandwidth (4 octets)
7 - Maximum reservable bandwidth (4 octets)
8 - Unreserved bandwidth (32 octets)
9 - Administrative group (4 octets)
10 - Power-to-Multicast-replication-capacity (4 octets)
11 - Consumed-Power-to-Available-Bandwidth (4 octets)
12 - Available-Power-to-Available-Bandwidth (4 octets)
This memo defines sub-Types 1 through 9. See the IANA Considerations
in [RFC3630] section for allocation of new sub-Types.
The Link Type and Link ID sub-TLVs are mandatory, i.e., must appear
exactly once. All other sub-TLVs defined here may occur at most
once. These restrictions need not apply to future sub-TLVs.
Unrecognized sub-TLVs are ignored.
Various values below use the (32 bit) IEEE Floating Point format. For
quick reference, this format is as follows:
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|S| Exponent | Fraction |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
S is the sign, Exponent is the exponent base 2 in "excess 127"
notation, and Fraction is the mantissa - 1, with an implied binary
point in front of it. Thus, the above represents the value:
Shankar Raman et.al Expires November 4, 2013 [Page 18]
INTERNET DRAFT Building power shortest inter-Area TE-LSPs May 3, 2013
(-1)**(S) * 2**(Exponent-127) * (1 + Fraction)
It is proposed that we use the Power-to-Available-Bandwidth ratio as
a 32 bit IEEE floating Point format field for the purpose of this
document.
Assume the following topology in a non-backbone area after
calculating the PWR ratio in a given stage of the algorithm.
0.5
(C) +----------------+
0.5| / |
| / |
0.05 V/ 0.1 0.03 0.2 V
(A)--->(B)--->(D)--->(G)--->(H)
| | |
| 0.5| | 0.1
| V V
+----------->(E)--->(X)
0.5 0.3
Here (B) is a Area Border Router and has to ingress links into it
from (C) and (A) which are in the backbone area. Connectivity within
the backbone area are not shown here. Assume (C) and (A) are
connected in some way with other routers in the backbone area.
Routers (D), (G), (E), (H), (X) are routers in the non-backbone area.
Routers (H), (E) and (X) are potential TE endpoints. The PWR metrics
shown here on the edges within the area represent metrics for a
specific TE endpoint. The metrics on edges (C)->(B) and (A)->(B) are
for any traffic ingressing through (B) into the non-backbone area
heading towards any TE endpoint (H), (E) or (X).
The number of constraints is likely to be few and the most widely
used constraints are TE metric, link groups and bandwidth. But no
restriction is assumed on use of other constraints. Thus here we add
the PWR metric of a link as an additional constraint. Once the ABR
computes k-power-shortest-paths to every {Ai, Nj} it has topology
information about, it advertises the k-power-shortest-paths as a
reachability vector in a newly defined "TE Reachability Vector TLV".
Consider an example network show below. TEh is head-end and TEt is
tail-end of a TE path, ABR1 and ABR2 are area border routers.
Shankar Raman et.al Expires November 4, 2013 [Page 19]
INTERNET DRAFT Building power shortest inter-Area TE-LSPs May 3, 2013
TEh2---R2 R4-----TEt2
\ /
\ /
TEh1---R1----ABR1-----Rb1-----Rb2-----ABR2----R3----TEt1
Area 1 Backbone Area2
In this example, ABR1's TE Reachability vector TLV for area 1 and
area 0 are given below.
{ ABR1, [<TEh1, <Path info 1>>, <TEh2, <Path info 2>>]}
{ ABR1, ABR2, [<TEt1, <Path info 3>>, <TEt2, <Path info 4>>]}
Here the vector TLVs are arranged as per increasing PWR metric
associated with each path. That is the summation of all PWR metrics
of the links in the path is done and the vector TLVs are ordered in
increasing order of PWR metric sums. So the lowest-cost-power path is
listed first and so on. If the least cost power path is to be chosen
then the path in the first TLV is chosen.
Similarly ABR2's TE Reachability vector TLV for area 2 and area 0 are
given below.
{ ABR2, [<TEt1, <Path info 3>>, <TEt2, <Path info 4>>]}
{ ABR2, ABR1, [<TEh1, <Path info 1>>, <TEh2, <Path info 2>>]}
The first thing to be noted is that head-ends are also considered as
TE-endpoints. Essentially this means any head-end or tail-end of a
inter-area TE-LSP can be considered as tail-end or head-end
respectively.
Note that the reachability vector advertised by ABR1 also contains
the reachability vector of ABR2. For example, if ABR2 is brought up
first, then it is likely that ABR1 would only have the following as
TE Reachability vector TLV for area 0 before ABR2 computes path to
the TE endpoints in area 2. { ABR1, ABR2 }
Note that <Path info> TLV would only contain the aggregate of link
attributes namely cost, bandwidth etc and most importantly the PWR
metric as well but not the complete path of intermediate nodes. For
example, <Path info 1> may be a set of <2, admin-group-1|admin-group-
2, 1Gbps> (where the 1Gbps could be the minimum bw available along
the path). The above example topology has only one path from ABRs to
TE endpoints. The number of path info "k" may have a default value or
can be configured by the operator on all nodes.
2.2 TE Path Head-end Operation
When any TE application requests TE path to be setup to an endpoint
Shankar Raman et.al Expires November 4, 2013 [Page 20]
INTERNET DRAFT Building power shortest inter-Area TE-LSPs May 3, 2013
that is not present in the same area, the head-end scans the TE
Reachability vector TLVs advertised by ABRs and selects the path
using the <Path info> contained in the vector TLVs.
Here is an example with multiple paths in area 1, backbone and area 2
called Figure 2.0
TEh3----R5---ABR3----Rb3-----Rb4------ABR4----R6--TEt4
\ / ___/ \ ___/
\ / / \/
TEh2---R2---ABR5------Rb5--------ABR6---R4-----TEt2
/ \ \____ /\___
/ \ \ / \
TEh1---R1----ABR1-----Rb1-----Rb2-----ABR2----R3----TEt1
Area 1 Backbone Area2
In this topology in figure 2.0 taking the tail-ends represented in
the diagram, it is noted that TEt4 is reachable via ABR4, ABR6 and
ABR2 as well. The TE reachability TLVs advertised by ABR6 for area 2
would be multiple to each tail-end since there exist multiple paths
to reach at least most of them in area 2 once a packet reaches any of
the ABRs in area 2.
Here again the least cost power shortest path is listed first and so
on.
{ ABR6, [<TEt4, <Path info 1>>, <TEt4, <Path info 2>>, <TEt2, <Path
Info 3>>, etc.. }
For area 0 the TE reachability TLV would be
{ ABR6, ABR1, [<TEh1, <Path info 4>>, <TEh1, <Path info 5>>...]}
{ ABR6, ABR5, [<TEh1, <Path info 6>>, <TEh1, <Path info 7>>...]}
{ ABR6, ABR3, [<TEh1, <Path info 8>>, <TEh1, <Path info 9>>...]}
For the sake of brevity we do not enumerate all path information
possible as it would be quite extensive.
It is possible that there may be already setup LSPs which are being
used for transit traffic on the backbone or in other non-backbone
areas. It is also feasible to advertize already set up LSPs in the
path info; no additional TLV is required for that purpose. The case
where this may be useful would be if such transport LSPs exist in the
backbone area and there is a willingness to provide higher preference
to these LSPs to carry transit LSPs over backbone.
There can be selective suppression of advertisements to other areas
Shankar Raman et.al Expires November 4, 2013 [Page 21]
INTERNET DRAFT Building power shortest inter-Area TE-LSPs May 3, 2013
(backbone or non-backbone) of LSPs if these are existing LSPs setup
along a path which are utilized to a greater degree. If underutilized
with respect to the PWR metric a more favourable metric could be
advertized to other areas.
For example, backbone area transport LSPs will be advertized as
transit LSPs which would provide connectivity to LSP sections lying
in non-backbone areas and would be updated more frequently since they
facilitate inter-Area TE.
Once a path in the TLV has been used for reserving bandwidth for
traffic over that path, then it is withdrawn from the advertisements
so that it becomes unusable. Another path may be computed over the
same path but with possibly a different PWR metric sum since it is
possible that the traffic over that path could have changed the PWR
metrices in the edges along that path.
2.2 Suppression of Frequent updates owing to fluctuation in power and
bandwidth
Using the power consumed and the bandwidth available as discrete
quantities will result in frequent oscillations. Such a step would
result will result in frequent re-computations of the shortest power
paths. For the sake of suppression of such frequent updates, it is
possible to handle the PWR metric as falling within reasonable
intervals of thresholds. If the interval in which PWR metric lies is
moved out of and another interval is reached then the update is sent
out in the IGP-TE mechanism. Otherwise if the interval in which the
PWR metric lies is not moved out of then the updates are not sent.
Suitable thresholds can be arrived at after suitable calibration
through tests.
Routers may have step levels in which they increase power consumption
when they additively are loaded with more large bandwidth consuming
multicast or unicast streams. Calibrating these levels may be useful
for implementing this scheme. It is possible that such calibrated
thresholds can be used for advertising the PWRLINK ratios in the OSPF
LSA advertisements. This would be useful for bringing down the
frequency of updates or advertisements from a line-card about its
PWRLINK ratio. When power consumption meanders within a certain given
interval these ratios need not be re-advertised even if further
unicast and/or multicast streams are added to it. The incentive is to
recognize a linecard that does not drastically change power
consumption even if large bandwidth streams are added onto it for
forwarding and thus give it credit for its power optimal functioning.
If a router tends to consume the highest level of power even when
carrying low amounts of unicast and multicast streams on its line
card, it would automatically have a poor ratio when compared to a
Shankar Raman et.al Expires November 4, 2013 [Page 22]
INTERNET DRAFT Building power shortest inter-Area TE-LSPs May 3, 2013
router that efficiently uses power when considering the Available
Bandwidth being observed. The best case would be a low power
consuming line-card or a router filled with such line cards that does
not leave its power interval no matter how much ever capacity is
sought to be used on it. But that would be an ideal condition but it
is definitely an idealistic scenario towards which the router
manufacturers should look at.
2.3 Advantages
1) The TE Reachability vector TLV contains the aggregate of all link
attributes along with TE constraints and so the head-end of the TE
path can explicitly select the ABR that connects the destination area
even though it does not know the complete topology of the backbone
area.
2) As the TE reachability vector contains only the aggregate
attributes of k-power-shortest-paths, the flooding overhead to
support the mechanism is limited.
3) Centralized path computation element is not required for
supporting inter-area power-shortest-path TE. The additional overhead
of computing k-power-shortest-paths on ABR can be solved by
offloading the computation overhead to additional processor in multi-
core platforms.
Shankar Raman et.al Expires November 4, 2013 [Page 23]
INTERNET DRAFT Building power shortest inter-Area TE-LSPs May 3, 2013
3 Security Considerations
None.
4 IANA Considerations
New TLV types for OSPF and IS-IS for the new TLVs that have been
introduced need to be assigned.
5 References
5.1 Normative References
5.2 Informative References
[1] G. Appenzeller, Sizing router buffers, Doctoral Thesis,
Department of Electrical Engineering, Stanford University,
2005.
[2] A. P. Bianzino, C. Chaudet, D. Rossi and J. L. Rougier, A survey
of green networking research, IEEE Communications and
Surveys Tutorials, preprint.
[3] J. Baliga, K. Hinton and R. S. Tucker, Energy consumption of the
internet, Proc. of joint international conference on
optical internet, June 2007, pp. 1-3.
[4] J. Chabarek, J. Sommers, P. Barford, C. Estan, D. Tsiang and S.
Wright, Power awareness in network design and routing,
Proc. of the IEEE INFOCOM 2008, April 2008, pp. 457-465.
[5] B. Venkat et.al, Constructing disjoint and partially disjoint
InterAS TE-LSPs, USPTO Patent 7751318, Cisco Systems,
2010.
[6] M. Xia et. al., Greening the optical backbone network: A traffic
engineering approach, IEEE ICC Proceedings, May 2010, pp.
1-5.
[7] W. Lu and S. Sahni, Low-power TCAMs for very large forwarding
tables, IEEE/ACM Transactions on Computer Networks, June
2010, vol. 18, no. 3, pp. 948-959.
[8] B. Zhang, Routing Area Open Meeting, Proceedings of the IETF 81,
Quebec, Canada, July 2011.
Shankar Raman et.al Expires November 4, 2013 [Page 24]
INTERNET DRAFT Building power shortest inter-Area TE-LSPs May 3, 2013
[9] M.J.S Raman, V.Balaji Venkat, G.Raina, Reducing Power consumption
using the Border Gateway Protocol, IARIA conferences
ENERGY 2012.
[10] A.Cianfrani et al., An OSPF enhancement for energy saving in IP
Networks, IEEE INFOCOM 2011 Workshop on Green
Communications and Networking
[11] Shankar Raman et al., draft-mjsraman-rtgwg-inter-as-psp-01.txt,
Work in Progress, February 2012.
Authors' Addresses
Shankar Raman
Department of Computer Science and Engineering
IIT Madras
Chennai - 600036
TamilNadu
India
EMail: mjsraman@cse.iitm.ac.in
Balaji Venkat Venkataswami
Department of Electrical Engineering
IIT Madras
Chennai - 600036
TamilNadu
India
EMail: balajivenkat299@gmail.com
Prof.Gaurav Raina
Department of Electrical Engineering
IIT Madras
Chennai - 600036
TamilNadu
India
EMail: gaurav@ee.iitm.ac.in
Shankar Raman et.al Expires November 4, 2013 [Page 25]