Internet DRAFT - draft-mjsraman-pim-ecmp-redirect-power-replic-cap

draft-mjsraman-pim-ecmp-redirect-power-replic-cap



 



Network Working Group                                      Shankar Raman
Internet-draft                                Balaji Venkat Venkataswami
Intended Status: Standards Track                            Gaurav Raina
Expires: September 2012                                    I.I.T Madras.
                                                          March 25, 2012


  PIM ECMP Redirect based on Linecard Replication Capacity and Power 
          draft-mjsraman-pim-ecmp-redirect-power-replic-cap-00


Abstract

   This work derives itself from [1] which proposes a ECMP redirect from
   a PIM upstream neighbor that instructs or advices the PIM downstream
   neighbor to choose another of its own ECMP links between the former
   and the latter. What we propose in this document is a criterion based
   on power consumed in the linecards that comprise the ECMP links
   between the former and the latter. Also the multicast replication
   capacity available within the said linecards which form the ECMP
   links between the two is taken into consideration while making a ECMP
   redirect.

   A PIM router uses RPF procedure to select an upstream interface and
   router to build forwarding state.  When there are equal cost multiple
   paths (ECMP), existing implementations often use hash algorithms to
   select a path.  Such algorithms do not allow the spread of traffic
   among the ECMPs according to administrative metrics.  This usually
   leads to inefficient or ineffective use of network resources.  This
   document introduces the ECMP Redirect, a mechanism to improve the RPF
   procedure over ECMPs.  It allows ECMP path selection to be based on
   administratively selected metrics, such as data transmission delays,
   path preferences and routing metrics.





Status of this Memo

   This Internet-Draft is submitted to IETF in full conformance with the
   provisions of BCP 78 and BCP 79.

   Internet-Drafts are working documents of the Internet Engineering
   Task Force (IETF), its areas, and its working groups.  Note that
   other groups may also distribute working documents as
   Internet-Drafts.

 


Shankar Raman et.al      Expires September 2012                 [Page 1]

INTERNET DRAFT Replication capability based PIM redirect      March 2012


   Internet-Drafts are draft documents valid for a maximum of six months
   and may be updated, replaced, or obsoleted by other documents at any
   time.  It is inappropriate to use Internet-Drafts as reference
   material or to cite them other than as "work in progress."

   The list of current Internet-Drafts can be accessed at
   http://www.ietf.org/1id-abstracts.html

   The list of Internet-Draft Shadow Directories can be accessed at
   http://www.ietf.org/shadow.html


Copyright and License Notice

   Copyright (c) 2012 IETF Trust and the persons identified as the
   document authors. All rights reserved.

   This document is subject to BCP 78 and the IETF Trust's Legal
   Provisions Relating to IETF Documents
   (http://trustee.ietf.org/license-info) in effect on the date of
   publication of this document. Please review these documents
   carefully, as they describe your rights and restrictions with respect
   to this document. Code Components extracted from this document must
   include Simplified BSD License text as described in Section 4.e of
   the Trust Legal Provisions and are provided without warranty as
   described in the Simplified BSD License.



Table of Contents

   1  Introduction  . . . . . . . . . . . . . . . . . . . . . . . . .  3
     1.1  Terminology . . . . . . . . . . . . . . . . . . . . . . . .  3
   2.  Conditions where this mechanism applies. . . . . . . . . . . .  3
   3  Security Considerations . . . . . . . . . . . . . . . . . . . .  6
   4  IANA Considerations . . . . . . . . . . . . . . . . . . . . . .  6
   5  References  . . . . . . . . . . . . . . . . . . . . . . . . . .  6
     5.1  Normative References  . . . . . . . . . . . . . . . . . . .  6
     5.2  Informative References  . . . . . . . . . . . . . . . . . .  6
   Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . .  6








 


Shankar Raman et.al      Expires September 2012                 [Page 2]

INTERNET DRAFT Replication capability based PIM redirect      March 2012


1  Introduction

   This work derives itself from [1] which proposes a ECMP redirect from
   a PIM upstream neighbor that instructs or advices the PIM downstream
   neighbor to choose another of its own ECMP links between the former
   and the latter. What we propose in this document is a criterion based
   on power consumed in the linecards that comprise the ECMP links
   between the former and the latter. Also the multicast replication
   capacity available within the said linecards which form the ECMP
   links between the two is taken into consideration while making a ECMP
   redirect.

   A PIM router uses RPF procedure to select an upstream interface and
   router to build forwarding state.  When there are equal cost multiple
   paths (ECMP), existing implementations often use hash algorithms to
   select a path.  Such algorithms do not allow the spread of traffic
   among the ECMPs according to administrative metrics.  This usually
   leads to inefficient or ineffective use of network resources.  This
   document introduces the ECMP Redirect, a mechanism to improve the RPF
   procedure over ECMPs.  It allows ECMP path selection to be based on
   administratively selected metrics, such as data transmission delays,
   path preferences and routing metrics.

   As mentioned earlier this document also proposes the use of the power
   being consumed by the linecards (if the ECMP links fall on multiple
   linecards with respect to their ECMP link ports) and the available
   replication capacity of the linecard within its ASICs as a measure of
   which ECMP link to which the PIM-Join is to be redirected to. If for
   example there exist multiple replication engines within different
   linecards then the lightly loaded replication engine and its
   corresponding ECMP link on that linecard can be recommended to the
   PIM downstream neighbor which sends the upstream neighbor a PIM join
   for a specific (S,G) or (*,G) group.




1.1  Terminology

   The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
   "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
   document are to be interpreted as described in RFC 2119 [RFC2119].


2.  Conditions where this mechanism applies. 


   Assume there are multiple ECMP links between a PIM upstream and
 


Shankar Raman et.al      Expires September 2012                 [Page 3]

INTERNET DRAFT Replication capability based PIM redirect      March 2012


   downstream router. Lets assume they fall on different linecards on
   the upstream neighbor with respect to the port placement of these
   ECMP links. Assume one linecard A, which is already carrying multiple
   data streams in either the incoming direction on its ports and
   replicating it to multiple other outgoing linecards through the
   switch fabric. Assume another linecard B that is lightly loaded with
   respect to multicast traffic and heavily loaded with respect to the
   unicast traffic. Assume another linecard C which is lightly loaded
   with respect to both. Now also assume that these linecards A,B and C
   are all members of the group of linecards whose ports place
   themselves within the ECMP links between the upstream and downstream
   neighbor.

   It is possible to decipher from this is that linecard C would be a
   better point of placement of the replication for the group for which
   the PIM-Join comes from the downstream neighbor. Assume now that the
   downstream neighbor selects the linecard A. It is now possible as a
   result of using the mechanism dictated to in [1], to send a redirect
   to the downstream neighbor that it would be a better choice to choose
   linecard C. This it does by recommending the neighbor address field
   as the port IP address which falls on linecard C. The PDU format for
   the ECMP redirect specified in [1] and the procedures that go along
   with it follow.

   It is also possible to make this decision in combination with the
   above or solely on a metric which we will call PWR-REPLIC-CAP. This
   metric is derived as follows...

   PWR-REPLIC-CAP = Power Consumed on that linecard
                    -------------------------------
                    Available replication capacity on the linecard.

   In the metric case the lowest PWR-REPLIC-CAP metric is chosen. This
   optimizes on the power being spent on replication to the extent
   possible.

   It is important to note that a linecard may have multiple replication
   engines and ports assigned to each replication engine or to all of
   them. It is possible that the ECMP links and their ports fall on the
   same linecard and in that case it would be possible to choose a
   specific replication engine from among the multiple replication
   engines available. that has better available replication capacity by
   choosing a neighbor address belonging to the port that falls in a set
   that belongs to the superior replication engine (with respect to the
   available replication capacity at that point in time). If there is no
   distinction amongst ports with respect to the multiple replication
   then it actually makes no difference since it would be an internal
   decision as to where the replication for that port actually happens.
 


Shankar Raman et.al      Expires September 2012                 [Page 4]

INTERNET DRAFT Replication capability based PIM redirect      March 2012


   It is also important to note here that the linecards that are
   multicast capable have well advertised replication capacities which
   the vendors advice in the linecard data sheets. This could be placed
   in a variable that can be monitored for shifts within intervals of
   threshold values. The same goes for the power consumed by the
   linecards as well.

   Pseudo code for the steps to be followed to implement this scheme is
   as follows...

   if (multiple ECMP links exist to the PIM neighbor 
       from which PIM-Join was received) then

       Get the list of ECMP ports which are members of the ECMP links;

       Get the list of linecards on which these ECMP ports are placed;

       LCA = Consider the least used linecard with respect 
       to the replication capacity;

       if (port on LCA is the same on which PIM-Join was received)

           do nothing; return;

       else if (choice is to be made on replication capacity)

           LCA = Consider the least used linecard with respect 
                 to replication capacity;

       else if (choice is to be made on PWR-REPLIC-CAP)

           LCA = Consider the linecard with the best 
                 PWR-REPLIC-CAP metric;

       end if

       Send PIM redirect to PIM downstream neighbor 
       recommending LCA ECMP link;

   end if








 


Shankar Raman et.al      Expires September 2012                 [Page 5]

INTERNET DRAFT Replication capability based PIM redirect      March 2012


3  Security Considerations

   <Security considerations text>


4  IANA Considerations

   <IANA considerations text>


5  References

5.1  Normative References

   [KEYWORDS] Bradner, S., "Key words for use in RFCs to Indicate
              Requirement Levels", BCP 14, RFC 2119, March 1997.

   [RFC1776]  Crocker, S., "The Address is the Message", RFC 1776, April
              1 1995.

   [TRUTHS]   Callon, R., "The Twelve Networking Truths", RFC 1925,
              April 1 1996.


5.2  Informative References

              [1] Yiqun, Cai et.al, Protocol Independent Multicast ECMP
              Redirect, "draft-ietf-pim-ecmp-02.txt", Work in Progress,
              October 2011.

              [2] Shankar Raman, et.al, Building power optimal Multicast
              Trees, "draft-mjsraman-rtgwg-pim-power-01.txt", Work in
              Progress, February 2011.

   [EVILBIT]  Bellovin, S., "The Security Flag in the IPv4 Header",
              RFC 3514, April 1 2003.

   [RFC5513]  Farrel, A., "IANA Considerations for Three Letter
              Acronyms", RFC 5513, April 1 2009.

   [RFC5514]  Vyncke, E., "IPv6 over Social Networks", RFC 5514, April 1
              2009.




Authors' Addresses

 


Shankar Raman et.al      Expires September 2012                 [Page 6]

INTERNET DRAFT Replication capability based PIM redirect      March 2012


   Shankar Raman
   Department of Computer Science and Engineering
   I.I.T Madras,
   Chennai - 600036
   TamilNadu,
   India.

   EMail: mjsraman@cse.iitm.ac.in



   Balaji Venkat Venkataswami
   Department of Electrical Engineering,
   I.I.T Madras,
   Chennai - 600036,
   TamilNadu,
   India.

   EMail: balajivenkat299@gmail.com



   Prof.Gaurav Raina
   Department of Electrical Engineering,
   I.I.T Madras,
   Chennai - 600036,
   TamilNadu,
   India.

   EMail: gaurav@ee.iitm.ac.in





















Shankar Raman et.al      Expires September 2012                 [Page 7]