purkayastha-sfc-service-indirection-02.txt

Internet DRAFT - draft-purkayastha-sfc-service-indirection
draft-purkayastha-sfc-service-indirection

Last Version:	draft-purkayastha-sfc-service-indirection-02.txt	Tracker Entry
Date:	`02-Mar-2018`
Disposition:	expired
Previous Versions:	draft-purkayastha-sfc-service-indirection-01.txt (diff) - 28-Oct-2017
	draft-purkayastha-sfc-service-indirection-00.txt (diff) - 02-Jul-2017





Network Working Group                                     D. Purkayastha
Internet-Draft                                                 A. Rahman
Intended status: Informational                                D. Trossen
Expires: September 2, 2018              InterDigital Communications, LLC
                                                           Z. Despotovic
                                                              R. Khalili
                                                                  Huawei
                                                           March 1, 2018


    Alternative Handling of Dynamic Chaining and Service Indirection
              draft-purkayastha-sfc-service-indirection-02

Abstract

   Many stringent requirements are imposed on today's network, such as
   low latency, high availability and reliability in order to support
   several use cases such as IoT, Gaming, Content distribution, Robotics
   etc.  Networks need to be flexible and dynamic in terms of allocation
   of services and resources.  Network Operators should be able to
   reconfigure the composition of a service and steer users towards new
   service end points as user move or resource availability changes.
   SFC allows network operators to easily create and reconfigure service
   function chains dynamically in response to changing network
   requirements.  We discuss a use case where Service Function Chain can
   adapt or self-organize as demanded by the network condition without
   requiring SPI re-classification.  This can be achieved, for example,
   by decoupling the service consumer and service endpoint by a new
   service function proposed in this draft.  We describe few
   requirements for this service function to enable dynamic switching
   between consumer and end point.

Status of This Memo

   This Internet-Draft is submitted in full conformance with the
   provisions of BCP 78 and BCP 79.

   Internet-Drafts are working documents of the Internet Engineering
   Task Force (IETF).  Note that other groups may also distribute
   working documents as Internet-Drafts.  The list of current Internet-
   Drafts is at https://datatracker.ietf.org/drafts/current/.

   Internet-Drafts are draft documents valid for a maximum of six months
   and may be updated, replaced, or obsoleted by other documents at any
   time.  It is inappropriate to use Internet-Drafts as reference
   material or to cite them other than as "work in progress."

   This Internet-Draft will expire on September 2, 2018.



Purkayastha, et al.     Expires September 2, 2018               [Page 1]

Internet-Draft         Alternative Handling of SFC            March 2018


Copyright Notice

   Copyright (c) 2018 IETF Trust and the persons identified as the
   document authors.  All rights reserved.

   This document is subject to BCP 78 and the IETF Trust's Legal
   Provisions Relating to IETF Documents
   (https://trustee.ietf.org/license-info) in effect on the date of
   publication of this document.  Please review these documents
   carefully, as they describe your rights and restrictions with respect
   to this document.  Code Components extracted from this document must
   include Simplified BSD License text as described in Section 4.e of
   the Trust Legal Provisions and are provided without warranty as
   described in the Simplified BSD License.

Table of Contents

   1.  Introduction  . . . . . . . . . . . . . . . . . . . . . . . .   2
   2.  Use Case Description  . . . . . . . . . . . . . . . . . . . .   3
     2.1.  Data Center . . . . . . . . . . . . . . . . . . . . . . .   3
     2.2.  Third party cloud service provider  . . . . . . . . . . .   4
     2.3.  ETSI MEC USE CASE . . . . . . . . . . . . . . . . . . . .   5
     2.4.  3GPP  . . . . . . . . . . . . . . . . . . . . . . . . . .   6
     2.5.  Use Case Analysis . . . . . . . . . . . . . . . . . . . .   6
   3.  NSH and Re-classification . . . . . . . . . . . . . . . . . .   8
     3.1.  Dynamic service chain creation using NSH  . . . . . . . .   9
   4.  Challenges with dynamic indirection . . . . . . . . . . . . .  10
   5.  HTTP as a transport . . . . . . . . . . . . . . . . . . . . .  12
   6.  Service Request Routing (SRR) Service Function  . . . . . . .  14
     6.1.  Overview  . . . . . . . . . . . . . . . . . . . . . . . .  14
     6.2.  Details of SRR Function . . . . . . . . . . . . . . . . .  16
   7.  Protocol Consideration  . . . . . . . . . . . . . . . . . . .  21
   8.  Next Steps  . . . . . . . . . . . . . . . . . . . . . . . . .  21
   9.  IANA Considerations . . . . . . . . . . . . . . . . . . . . .  21
   10. Security Considerations . . . . . . . . . . . . . . . . . . .  22
   11. Informative References  . . . . . . . . . . . . . . . . . . .  22
   Authors' Addresses  . . . . . . . . . . . . . . . . . . . . . . .  23

1.  Introduction

   The requirements on today's networks are very diverse, enabling
   multiple use cases such as IoT, Content Distribution, Gaming, Network
   functions such as Cloud RAN.  Every use case imposes certain
   requirements on the network.  These requirements vary from one
   extreme to other and often they are in a divergent direction.
   Network operator and service providers are pushing many functions
   towards the edge of the network in order to be closer to the users.




Purkayastha, et al.     Expires September 2, 2018               [Page 2]

Internet-Draft         Alternative Handling of SFC            March 2018


   This reduces latency and backhaul traffic, as user request can be
   processed locally.

   It becomes more challenging when network congestion, user mobility as
   well as non-deterministic availability of compute and storage
   resources are considered.  The impact is felt most in the edge of the
   network because as the users move, their point of attachment changes
   frequently, which results in (at least partially) relocating the
   service as well as the service endpoint.  Furthermore, network
   functions are pushed more and more towards the edge, where network,
   compute and storage resources are constrained and availability is
   non-deterministic.  Constrained network resources may lead into
   congestion in the network.  Also, storage resources may need to be
   moved where the user concentration is more in case of content
   delivery applications.

   We describe few use cases in the next section and derive the
   requirement for composing new services and service path in a dynamic
   edge network.  We address this dynamicity by introducing a special
   Service Function, called SRR (service request routing).  We describe
   the problems associated with today's network and Layer 3 based
   approach to handle dynamicity in the network.  We then discuss how
   such new Service Function with certain capabilities can handle the
   dynamicity better than these conventional methods.

2.  Use Case Description

2.1.  Data Center

   The data center use case draft [I-D.ietf-sfc-dc-use-cases] describes
   an East West traffic use case.  This is the predominant traffic in
   data centers today.  Server virtualization has led to the new
   paradigm where virtual machines can migrate from one server to
   another across the data center.  This explosion in east-west traffic
   is leading to newer data center network fabric architectures that
   provide consistent latencies from one point in the fabric to another.

   SFCs applied in an enterprise or service provider data center can be
   broadly categorized into two types:

   o  Access SFCs

   o  Application SFCs

   Access SFCs are focused on servicing traffic entering and leaving the
   data center while Application SFCs are focused on servicing traffic
   destined to applications.  Service providers deploy a single "Access
   SFC" and multiple "Application SFCs" for each tenant.  Enterprise



Purkayastha, et al.     Expires September 2, 2018               [Page 3]

Internet-Draft         Alternative Handling of SFC            March 2018


   data center operators on the other hand may not have a need for
   Access SFCs depending on the size and requirements of the enterprise.

   In carrier networks, operators may deploy multiple data centers
   dispersed geographically.  Each data center may host different types
   of service functions.  For example, latency sensitive or high usage
   service functions are deployed in regional data centers while other
   latency tolerant, low usage service functions are deployed in global
   or central data centers.  In such deployments, SFCs may span multiple
   data centers and enable operators to deploy services in a flexible
   and inexpensive way.

   It is clear that within the data center as well as in inter data
   center scenarios, users are serviced by multiple SFs distributed
   inside as well as outside a location.  In this scenario, it is clear
   that Service function chains should be able to reselect, redirect
   traffic very fast.  The draft identifies that Static service chains
   do not allow for modifying the SFCs as they require the ability to
   add SNs or remove SNs to scale up and down the service capacity.
   Likewise the ability to dynamically pick one among the many SN
   instance is not available.

2.2.  Third party cloud service provider

   This use case is related to an emerging business model, where
   computational resources for edge cloud service are provided by
   alternative facility providers that are non-traditional network
   operators.  This is due to the situation for many specific localized
   use cases, where network operators may not have necessary real estate
   available.  They may even not be willing to spend on CAPEX and OPEX
   for said point-of-presence, because there is no clear path for
   sustainable cost recovery [UKNIC].

   The industry is witnessing the emergence of real estate owners such
   as building asset or management companies, cell tower owners, railway
   companies or other facility owners willing to deploy edge cloud
   resources.  The facility provider, e.g. cell tower owner or building
   management company, deploys edge computing resources throughout their
   installation in the country.  They have their own operation and
   management software, which is capable of resource deployment, scale
   up or scale down resources, deploy edge applications from third party
   service providers . They are capable of offering service to more than
   one network operator at a specific location, thus acting as a
   "neutral host".  The facility provider, which owns cloud resources
   and provides application services, is referred to as "Third party
   Edge Owner (TEO)".





Purkayastha, et al.     Expires September 2, 2018               [Page 4]

Internet-Draft         Alternative Handling of SFC            March 2018


   There is more than one stakeholder in this ecosystem, E.g.  Network
   Service Provider, Real estate owner, Cloud capability (compute and
   storage resource) provider, Application/service provider.  An entity
   can assume more than one role.  From network operators point of view
   there may be "Cloud provider" or "Cloud service provider" depending
   on the roles assumed by external entity.

   "Cloud Providers" provide cloud resources (compute and storage) to
   network operators.  Network operators rent those resources and manage
   MEC host by themselves.  Network operator can set up application
   traffic rules, so that traffic can be processed, by that host.

   "Cloud Service Providers" not only make resources available to
   network operators or service providers, but also provides management
   and hosting service.  They can host edge applications on behalf of
   application service providers and sets up user plane traffic to be
   steered towards the edge application.

   Cloud Service Providers, as well as many organizations that need to
   share and analyze a quickly growing amount of data, such as
   retailers, manufacturers, telcos, financial services firms, and many
   more, are turning to localized Micro Data Centers(MDC) installed on
   the factory floor, in the telco central office, the back of a retail
   outlet, etc.  The solution applies to a broad base of applications
   that require low latency, high bandwidth, or both.

   As Micro Date centers are deployed at the edge of the network, common
   deployment options are:

   o  Micro Data Centers are deployed on L2 in the edge of the network

   o  Instead of single internet Point Of Presence (POP) deployment,
      multiple internet POP deployment is desirable to localize data

   o  Service is composed out of these multiple POP deployment of MDC,
      where data exchange and collaboration is expected among these MDCs

   o  Due to mobility, changes in network condition (e.g. congestion,
      load), service composition may change frequently to support
      promised quality of experience

2.3.  ETSI MEC USE CASE

   Take the following video orchestration service example from ETSI MEC
   Requirements document [ETSI_MEC].  The proposed use case of edge
   video orchestration suggests a scenario where visual content can be
   produced and consumed at the same location close to consumers in a
   densely populated and clearly limited area.  Such a case could be a



Purkayastha, et al.     Expires September 2, 2018               [Page 5]

Internet-Draft         Alternative Handling of SFC            March 2018


   sports event or concert where a remarkable number of consumers are
   using their handheld devices to access user select tailored content.
   The overall video experience is combined from multiple sources, such
   as local recording devices, which may be fixed as well as mobile, and
   master video from central production server.  The user is given an
   opportunity to select tailored views from a set of local video
   sources.

2.4.  3GPP

   3GPP Rel. 15 introduces the notion of the service-based interface
   (SBI) as an alternative to the traditional call pattern invocation of
   network functions.  This introduction targets the support for
   replication, e.g., driven by virtualized functions, as well as
   supporting alternative interactions, e.g., for different vertical
   market specific control planes, by making the discovery as well as
   composition of new interactions more flexible.

   We believe that SFC is a suitable framework for the interconnection
   of such network functions through the new SBI.  One of the
   aforementioned driving forces, namely the replication of functions
   aligns with our thinking in this draft in that indirections to new
   vertical instances need to be dynamic in reacting to the appearance
   of new virtual instances or to changes in policies for the selection
   of specific instances by specific calling entities.

2.5.  Use Case Analysis

   SFC allows network operators as well as service providers to compose
   new services by chaining individual service functions.

   In a dynamic network environment, like the edge of a network, the
   capability to dynamically compose new services from available
   services as well as move a service instance is desirable.  Dynamic
   composition and relocation of services may be attributed to:

   o  Congestion in the network: Due to constrained network resources,
      increase in the network load may create congestion in the network,
      resulting in a congested Service Function Path.  Service functions
      may detect congestion and reconfigure the Service Function Path to
      avoid it.

   o  In response to latency: in a dynamic network environment and with
      the need for ultra-low latency communication, instantiation of new
      service function endpoints might be the only remedy to combat the
      increase of latency caused, e.g., by increased load on a previous
      endpoint or mobility of the user and therefore increasing the
      'distance' to the service function endpoint.  Keeping the service



Purkayastha, et al.     Expires September 2, 2018               [Page 6]

Internet-Draft         Alternative Handling of SFC            March 2018


      function endpoint 'close' to the user allows for reducing latency,
      segregating communication in localized islands of service
      interaction.

   o  In response to user mobility: In a dynamic network environment
      where service functions move frequently because of user movement,
      load balancing or resource modification, service function chains
      and the service end points need to be created and recreated
      frequently

   o  Resource availability.: Availability of compute and storage
      resources varies with network load, number and type of
      applications running etc.  In the edge of the network, due to
      sudden increase of users, compute load may increase.  In this
      situation applications, running on the compute resources may be
      moved to another location where more resources are available.

   In SFC, there is a notion of logical chaining of SFs and chaining of
   actual physical locations, known as Rendered Service Path (RSP).  RSP
   provides a static binding of SFs to their physical location.  In
   order to create a chain in dynamic fashion, late binding of SFs and
   physical location may be desired.  SFC is capable of modifying the
   service chain to certain extent in response to network conditions,
   but not a complete solution has been described

   In order to route the service requests to service end points in a
   dynamic manner, we identify the following desirable features in a
   service function chain:

   o  Capability to trigger service chain reconfiguration based on
      network information such as congestion indication, mobility,
      degradation of user experience etc.  Service Functions should be
      able to process such network information, identify which section
      of the chain needs to be reconfigured and take action

   o  Fast switching from one service instance to another by not relying
      on the DNS for service location resolution.  Instead of DNS, the
      function should be able to identify the path, which will allow to
      reach the service end point.

   o  Direct path mobility, where the path between the requester and the
      responding service can be determined as being optimal (e.g.,
      shortest path or direct path to a selected instance), is needed to
      avoid the use of anchor points and further reduce service-level
      latency






Purkayastha, et al.     Expires September 2, 2018               [Page 7]

Internet-Draft         Alternative Handling of SFC            March 2018


   o  Indirect service requests at the network level, transparent to the
      requesting client and without the involvement of the DNS.  End
      user is not aware of the decision made by the SF.

   o  New methods for forwarding, such as path-based forwarding, direct
      path routing in mobility cases, path pinning for traffic steering
      and simplified service-specific peering towards the Internet.

3.  NSH and Re-classification

   [RFC7498] captures the problems associated with existing service
   deployments that are problematic.  The problems are described below
   at a high level:

   o  Network topology: Network service deployment is tightly coupled
      with network topology thus reducing the flexibility in service
      delivery.  It adds complexity in deploying network service when
      certain traffic types may need some service and other traffic
      types do not need the same service.

   o  Configuration complexity is the direct result of dependency on
      network topology.

   o  Limited availability of services

   o  Altering the order of a deployed chain is complex and cumbersome

   o  Coupling of service functions to topology may require service
      functions to support many transport encapsulations or for a
      transport gateway function to be present.

   o  In a dynamic environment like the Edge of a network service
      delivery, routing changes fast.  It may be difficult to deliver
      service dynamically due to the risk and complexity of VLANs and/or
      routing modifications.

   These factors provide motivation for a simplified and flexible
   service insertion model that addresses many of the current
   shortcomings and provides new, much needed functionality to enable
   service deployments in modern network environments.  Service chaining
   accomplishes this by considering service functions as resources, with
   associated attributes, available for scheduled consumption.
   Selective traffic, subject to policy, may then be "steered" to the
   requisite service resources, along with any "extra" information
   referred to as metadata.  This metadata is used for policy
   enforcement.





Purkayastha, et al.     Expires September 2, 2018               [Page 8]

Internet-Draft         Alternative Handling of SFC            March 2018


   A basic form of service chaining may be realized using existing
   transport encapsulations.  This method of chaining relies upon the
   tunneling of selected data between service functions.  Although this
   form of service chaining achieves some level of abstraction from the
   underlying topology, it does not truly create a service plane.  NSH
   [RFC8300] is a distinct identifiable plane that can be used across
   all transports to create a service chain and exchange metadata along
   the chain.

   Fundamentally, however, the notion of "services" in SFC is tied into
   specific service function endpoints, which lie along a well-defined
   service function path (SFP) where the path is defined through lower
   layer transport encapsulations.  If any such service function
   endpoint changes, the service chain needs to be adjusted; a procedure
   we outline in the following sub-section.

3.1.  Dynamic service chain creation using NSH

   We revisit the dynamic service chain creation capability of NSH.  NSH
   defines a new service plane protocol [RFC8300].  A Network Service
   Header (NSH) contains service path information and optionally
   metadata that are added to a packet or frame and used to create a
   service plane.  A control plane is required in order to exchange NSH
   values with participating nodes, and to provision the same nodes with
   requisite information such as service path ID to overlay mapping.

   The Network Service Header has three parts, Base header, Service Path
   Header and Context Header.  NSH Service Path Header is a 4-byte
   service path header follows the base header and defines two fields
   used to construct a service path:

   o  Service path identifier (SPI)

   o  Service index (SI)

   The following figure depicts the service path header.

     0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2
     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
     | Service Path ID                               | Service Index |
     +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+


                         Figure 1: NSH Path Header

   The service path identifier (SPI) is used to identify the service
   path that interconnects the needed service functions.  It allows
   nodes to utilize the identifier to select the appropriate network



Purkayastha, et al.     Expires September 2, 2018               [Page 9]

Internet-Draft         Alternative Handling of SFC            March 2018


   transport protocol and forwarding techniques.  The service index (SI)
   identifies the location of a packet within a service path.  As
   packets traverse a service path, the SI is decremented post-service.

   SPI represents the service path and altering the path identifier
   results in a change of a service path.  A change in SPI value is a
   result of re-classification.  It means a node in the service path
   determined, based on policy, that the initial classification was
   incorrect or incomplete.  If the updated classification results in
   the necessity of a new service path, the node updates the SPI and SI
   fields accordingly.  The new identifier is then used to select the
   appropriate overlay topology.  This allows service functions to alter
   the path of a packet without having to participate in the network
   topology and its associated control plane(s).  The method to
   determine that an existing classification is incorrect and how to
   determine the new classification is not defined.

4.  Challenges with dynamic indirection

   The emerging trend in today's network is to deploy network functions,
   services and applications at the edge of the network to support
   latency requirements, computational offload, traffic optimization
   etc.  As users are moving, application or services being used by
   users, may need to be moved closer to the user's new location.  This
   implies another instance of the service function may need to be
   instantiated close to the user's new location.  It may result in re-
   establishing service path from the newly instantiated service
   function to other service instances.  It is also possible that the
   newly instantiated service function may be redirected to a new
   service end point (e.g.  Application Server) for various reasons,
   such as incomplete content, proximity to data store, load balancing
   etc.  In another scenario, a single instance of the service function
   may not handle all users due to latency or load constraints.  A
   single service function may be instantiated more than once to balance
   user load.  As the number of instances increase and along with
   mobility, the complexity of service routing increases.  It is
   anticipated that there may be a constant action of function chaining,
   re-chaining occurring in the network.

   The challenge of dynamic indirection may be better described by
   analyzing the working of CDNs, which dynamically (re-)direct user-
   initiated requests towards the most appropriate content instance.
   This task becomes more difficult if granularity of the instance
   placement increases.  For instance, in case of a CDN being realized
   close to end users, specifically in edge of the network, the specific
   content instance might need to be selected dynamically.  After
   initial selection, the instance may change during service execution.




Purkayastha, et al.     Expires September 2, 2018              [Page 10]

Internet-Draft         Alternative Handling of SFC            March 2018


   In a conventional network, an instance of a service is found and
   selected using DNS.  The subsequent service request is then routed
   through the network between the client and the service.  If the user
   is doing a DNS lookup to access content served by a CDN then the DNS
   service will maintain a list of IP addresses that can be returned for
   a given domain name and will try to return an IP address of a node
   geographically close to the client.  Should the service provider want
   to replace an instance of their service with another one at a
   different IP address (and potentially a different physical location
   for various reasons such as load balancing, reliability etc.) then
   the DNS tables must be updated, i.e., the service needs to be
   (re-)registered quickly.  This is done by updating the local
   authoritative DNS server which then propagates the new mapping to DNS
   services across the world.  DNS propagation can take up to 48 hours
   so fast and dynamic switching from one service instance to another is
   not possible in conventional networks; even in more localized
   scenarios, the propagation of DNS updates might still be
   insufficient.  When relying on many surrogate service endpoints to
   exist in the edge network, there is a clear issue of certain
   resources not being available in one surrogate instance while
   existing in another so that changes in redirection might be
   desirable, while also changes in local load drive the need for such
   change in redirection.  With the emergence of container-based
   virtualization platforms, service function endpoints can be
   established in a matter of seconds and we therefore believe that the
   'reachability' of such said service instance, i.e., the possibility
   of route service requests to it from a client that was previously
   served elsewhere, must follow a similar timeline, i.e., a few seconds
   or even less.

   The other issue in conventional network lies with mobility management
   procedure.  These procedures use an anchor point, which terminates a
   session at the network edge.  As user moves around, traffic is
   redirected from the anchor point to the new point of attachment.
   Relying on typical mobility management approaches found in IP
   networks, usually leads to inefficient 'triangular' routing of
   requests through this common 'anchor' point.  This triangular routing
   increases the latency in reaching the new service function or service
   end points as users move.

   Traffic steering is a common procedure in managed networks,
   particularly at the edge, due to desired subscriber-centric traffic
   policies (e.g., related to pricing structures), resource requirements
   (e.g., related to using particular paths in the network) or mobility
   (e.g., users moving in a cellular network).  Today's methods for
   traffic steering include anchor-based mobility management as well as
   traffic classification, for instance, in packet gateways of cellular
   systems (using, e.g., deep packet inspection as well as port and



Purkayastha, et al.     Expires September 2, 2018              [Page 11]

Internet-Draft         Alternative Handling of SFC            March 2018


   address classification).  While the former leads to inefficient
   'triangular' traffic forwarding, the latter often requires additional
   state in the forwarders to differentiate traffic from one user to
   another.

   The analysis of CDN network shows that dynamic indirection is a
   necessary requirement, which needs to be supported by the networks.
   The goal for this indirection is to provide user applications lowest
   possible latency.  But as discussed above, relying on today's
   technique does not help in guaranteeing same latency to user
   applications.  On the other hand, there is a high possibility that
   latency may increase if we rely on Layer 3 based service redirection
   techniques.

   SFC handles indirection through the use of SPI.  A packet needs to be
   reclassified and the intermediate node changes the SPI.  Following
   are the typical steps that happens in order to implement the
   indirection.

   o  A packet arrives at a particular node

   o  The node contacts the policy manager

   o  Identifies the current classification is incorrect

   o  Reclassifies the packet, i.e. change the SPI

   o  Inserts the packet in the pipe, possibly towards the SFF

   The indirection mechanism in SFC involves certain steps to process
   policy information and change the SPI in the packet header, making it
   suitable to handle dynamic indirection requirements.  Our proposed SF
   in this document provides an additional method to handle dynamic
   indirection of service requests, not relying on the reclassification
   mechanism.  Combining these two techniques may provide flexibility
   and improvement over single method.

5.  HTTP as a transport

   With the extensive use of "web technology", "distributed services"
   and availability of heterogeneous network, HTTP has effectively
   transitioned into the common transport for name-based E2E
   communication across the web.  In the context of SFC and SF, HTTP
   requests and response are considered as the "Service Request (SR)".
   This use case describes how these SRs are directed towards correct SF
   in a fast and dynamic way.  The routing and indirection of SRs are
   abstracted at HTTP level, instead of the traditional approach where
   routing decision for a service request is made at Layer 3.



Purkayastha, et al.     Expires September 2, 2018              [Page 12]

Internet-Draft         Alternative Handling of SFC            March 2018


   If we abstract HTTP as a transport, HTTP requests, such as GET, PUT
   and POST can be routed based on the URI associated with the request,
   with the URI being simply the name of a resource or the invocation
   point for a service transaction.  Based on the name of the resource
   requested, the appropriate HTTP request can be routed to the suitable
   service endpoint.  If Service Functions (SF) could be identified
   using URI or name, HTTP requests to an SF would be routed or directed
   using name based routing.  With that, the redirection to the most
   suitable service instance is purely done based on named services with
   HTTP being a specific (application layer) transport service.

   The ongoing EU H2020 efforts like FLAME [H2020FLAME]  are driven by
   city-scale many-POP deployments of compute infrastructure, all SDN-
   connected and OpenStack managed.  Localized media use cases drive the
   need for name-based (HTTP as the main transport protocol here)
   service instances being chained with the relationship between
   specific virtual instances being controlled at the underlying
   routing/switching level.

   The notion of 'HTTP as-a transport', utilizing URLs as addressing
   scheme, can be used to create SFP as shown in Fig 2., i.e.,
   192.168.x.x -> www.example.com -> 192.168.x.x -> www.example2.com ->
   192.168.x.x -> ... -> www.exampleN.com.  It is this 'name-based'
   relationship that we see possibly realized through specific
   replicated instances, where in turn the routing towards those
   specific instances is realized by the SRR.


                                                         +--------+
                                                         |        |
            |-------------------------|------------------+  SRR   +
            |                         |                  |        |
            |                         |                  +---/|\--+
            |                         |                       |
       +---\|/--+   +---------+   +--\|/--+   +------+   +----+---+
       |        |   |         |   |       |   |      |   |        |
       + Client +-->+  SRR    +-->+ Media +-->+ SRR  +-->+ Media  +
       |        |   |         |   |  Fn1  |   |      |   |  Fn2   |
       +--------+   +---------+   +-------+   +------+   +--------+

       SFP:192.168.x.x-->www.example.com-->192.168.x.x
       -->www.example2.com-->192.168.x.x-->www.exampleN.com


            Figure 2: SFP with new HTTP-based Transport option

   In a pure SFC architectural framework, Classifier function may
   interact with SRR to obtain an SE (Service Encapsulation).  E.g. the



Purkayastha, et al.     Expires September 2, 2018              [Page 13]

Internet-Draft         Alternative Handling of SFC            March 2018


   Classifier function may look into the network locator map in Fig 2
   and determine the next SF is www.example.com.  It provides this
   information to SRR to obtain the next hop information.  SRR returns
   the SE for next hop, which can be a "bitfield" information that is
   being used in the overlay routing for this part of the SFP.  The
   Classifier function uses this SE to route the incoming packet
   directly at the transport network level.

6.  Service Request Routing (SRR) Service Function

6.1.  Overview

   The following diagram shows the application of the new proposed SRR
   service function in an example of media clients connecting to media
   servers.  There may be more than one media functions to support CDN
   like architecture, Surrogate servers to handle mobility and load
   balancing.


                                                         +--------+
                                                         |        |
            |-------------------------|------------------+  SRR   +
            |                         |                  |        |
            |                         |                  +---/|\--+
            |                         |                       |
       +---\|/--+   +---------+   +--\|/--+   +------+   +----+---+
       |        |   |         |   |       |   |      |   |        |
       + Client +-->+  IP     +-->+ Media +-->+ SRR  +-->+ Media  +
       |        |   | Routing |   |  Fn1  |   |      |   |  Fn2   |
       +--------+   +---------+   +-------+   +------+   +--------+


    Figure 3: General SFC with SRR Flexible Chaining, initiated via IP
                         Routed Client Connection

   The clients are connected to media functions through frontend routed
   network, e.g., relying on standard IP routing, while media functions
   are chained via the new proposed service request routing (SRR)
   function.  Alternatively, we also envision to utilize the SRR
   function directly between client SF and media function SF, as
   outlined in the figure below










Purkayastha, et al.     Expires September 2, 2018              [Page 14]

Internet-Draft         Alternative Handling of SFC            March 2018


                                                         +--------+
                                                         |        |
            |-------------------------|------------------+  SRR   +
            |                         |                  |        |
            |                         |                  +---/|\--+
            |                         |                       |
       +---\|/--+   +---------+   +--\|/--+   +------+   +----+---+
       |        |   |         |   |       |   |      |   |        |
       + Client +-->+  SRR    +-->+ Media +-->+ SRR  +-->+ Media  +
       |        |   |         |   |  Fn1  |   |      |   |  Fn2   |
       +--------+   +---------+   +-------+   +------+   +--------+


    Figure 4: General SFC with SRR Flexible Chaining, initiated via SRR
                              Chained Client

   For our considerations, we assume that each SF is realized by at
   least one or more service function endpoints (SFEs).  Hence, instead
   of looking at "chaining" as a concept that connects specific SFEs
   along a well-defined SFP, we propose to look at "chaining" at the
   level of "named" service functions rather than their specific
   endpoint instances.  With this in mind, the SRR service function
   lifts the relationship between the connecting SFs to the level of
   "logical" service functions rather than their specific realizing
   endpoints.  Instead of relying on dynamic re-chaining in case of any
   dynamically changing relationship between specific SFEs, the SRR
   provides the selection of suitable SFEs while maintaining the logical
   relationship between the SFs.  In Section 6.3, we will present the
   necessary extensions to the SFP concept to support this higher
   abstraction of "chaining" via "named" logical SFs.  The SRR
   introduces the flexibility in routing service requests from client to
   specific SFEs.  In the edge network, where users are moving and
   service end points may also change, having flexibility to decide and
   steer service requests directly helps in guaranteeing the same
   latency to user applications.  Clearly, that is achieved by reducing
   the switching time from SF to another.  As service end point changes,
   the routing functions makes instantaneous decision to route the
   request to the appropriate media server.

   The SRR introduces the flexibility in routing service requests from
   client to specific SFEs in response to conditions such as congestion
   in the network, user mobility etc.  In the edge network, where users
   are moving and service end points may also change, having flexibility
   to decide and steer service requests directly helps in guaranteeing
   the same latency to user applications.  The edge of the network maybe
   congested due to limited network resources.  The SRR may be able to
   determine network congestion and quickly route service requests to
   other Service End point, which is not experiencing congestion.  In



Purkayastha, et al.     Expires September 2, 2018              [Page 15]

Internet-Draft         Alternative Handling of SFC            March 2018


   addition, application-layer control functions might utilize latency
   measurements to ensure that suitable service instances are being
   created during runtime of the scenario such as to ensure that service
   function endpoints are available 'nearby' (possibly) moving so as to
   keep a desired latency under a desired value.

   Clearly, that is achieved by reducing the switching time from one SF
   endpoint to another.  As the service end point changes, the routing
   functions makes instantaneous decision to route the request to the
   appropriate media server.

   The possible improvements of using SRR within an SFC framework are
   listed below:

   o  Fast (between 10 and 20ms) switching times from one service
      instance to another by not relying on the DNS for service
      discovery and directly routing service requests at the level of
      the transport network.

   o  The capability to indirect service requests at the network level
      will help in reducing latency, when service end points change.
      E.g. when a service request is being sent to one surrogate
      instance but results in a HTTP 404 or 5xx error response, the
      original request is redirected to another alternative surrogate
      with minimal latency, i.e., right at the destination of said
      failed service request.  Nesting these operations effectively
      leads to a net-level 'search' among all available surrogate
      instances until the search is exhausted (with a negative result)
      or the resource is found.

   o  New methods for forwarding, such as path-based forwarding, will
      enable direct path routing in mobility cases, path pinning for
      traffic steering and simplified service-specific peering towards
      the Internet.  Such capability would allow for localizing traffic,
      reduce latency and costs.

6.2.  Details of SRR Function

   Assuming such introduction of an HTTP-level transport notion, the SRR
   function can be decomposed further as shown in Fig 5.











Purkayastha, et al.     Expires September 2, 2018              [Page 16]

Internet-Draft         Alternative Handling of SFC            March 2018


                                                         +--------+
                                                         |        |
            |-------------------------|------------------+  SRR   +
            |                         |                  |        |
            |                         |                  +---/|\--+
            |                         |                       |
       +---\|/--+   +---------+   +--\|/--+   +------+   +----+----+
       |        |   |         |   |       |   |      |   |         |
       + Client +-->+  SRR    +-->+Service+-->+ SRR  +-->+ Service +
       |        |   |         |   |  Fn1  |   |      |   |  Fn2    |
       +--------+   +---------+   +-------+   +------+   +---------+
                  /             \
                /                 \
              /                     \
        +--------------------------------------+
        |   +------------------+               |
        |   |  +-----+  +----+ |     +-----+   |
        |--->  | SFC |  | SR | |     | SR  |----->
        |   |  |Proxy|  |    | |     |     |   |
        |   |  +-----+  +----+ |     +-/|\-+   |
        |   |  Use Proxy if NAP|        |      |
        |   |  is not SFC      |        |      |
        |   |  enabled         |        |      |
        |   +-------/|\--------+        |      |
        |            |                  |      |
        |            |                  |      |
        |            |  +----------+    |      |
        |            |->| tSFF1    |-----      |
        |               +---/|\----+           |
        |                    |                 |
        |                    |                 |
        |     +----------+   |                 |
        |     |          |   |                 |
        |     +   PCE    +----    +-----+      |
        |     |          |--------| RT  |      |
        |     +----------+        +-----+      |
        |                                      |
        +--------------------------------------+

                        Figure 5: SRR decomposition

   Another option for the two functions routing via the SRR could be
   entirely link-local, i.e., there's another simple tSFF2 between
   client and SRR as well as SF1 and SRR that is simply a link-local
   transport.  The following figure describes this alternate option.






Purkayastha, et al.     Expires September 2, 2018              [Page 17]

Internet-Draft         Alternative Handling of SFC            March 2018


                                                         +--------+
                                                         |        |
            |-------------------------|------------------+  SRR   +
            |                         |                  |        |
            |                         |                  +---/|\--+
            |                         |                       |
       +---\|/--+   +---------+   +--\|/--+   +------+   +----+---+
       |        |   |         |   |       |   |      |   |        |
       + Client +-->+  SRR    +-->+Service+-->+ SRR  +-->+Service +
       |        |   |         |   |  Fn1  |   |      |   |  Fn2   |
       +--------+   +---------+   +-------+   +------+   +--------+
                   /              \
                  /                  \
                 /                      \
       +-----+    +---------------------------------+
       |tSFF2|--------->+----+           +-----+    | +--------+
       +-----+    |     | SR |           | SR  |----->| tSFF2  |-->
                  |     |    |           |     |    | +--------+
                  |     +----+           +-/|\-+    |
                  |       |                 |       |
                  |       |                 |       |
                  |       |                 |       |
                  |       |                 |       |
                  |       |     +-------+   |       |
                  |       |---->| tSFF1 |---        |
                  |             +--/|\--+           |
                  |                 |               |
                  |                 |               |
                  |      +-------+  |               |
                  |      |       |  |               |
                  |      + PCE   +---     +----+    |
                  |      |       |--------| RT |    |
                  |      +-------+        +----+    |
                  |                                 |
                  +---------------------------------+





       Figure 6: SRR decomposition using link-local client/function
                               communication

   The SRR function may be composed of the following functions:

   o  Service Router(SR) at the ingress, terminates on the client side
      Layer 3 and above protocols, such as TCP




Purkayastha, et al.     Expires September 2, 2018              [Page 18]

Internet-Draft         Alternative Handling of SFC            March 2018


   o  Service Router(SR) at the egress, terminates any transport
      protocol on the outgoing (server) side

   o  PCE, Path Computation Element function is responsible for
      selecting the correct next SF, also possibly realizing path policy
      enforcement.  The result of the selection is a path identifier
      which is delivered to the ingress SR upon initial path computation
      request (i.e., when sending a request to a specific URL on the SFP
      for the first time).  The path identifier is utilized for any
      future request for a given URL-based SF.  In case of another SF
      instance becoming available, indicated to the PCE through a
      registration procedure, the PCE will instruct all ingress SRs to
      invalidate path identifiers to the specific URL of the SF,
      resulting in an initial path computation request at the next SF
      request forwarding.  Through this, the newly registered SF
      instance might be utilized if the policy-governed path computation
      will select said SF instance.

   o  Reclassification Trigger Handler (RT) : Network measurement
      information, such as latency, packet loss or network congestion,
      etc. could be processed by the handler.  This may trigger
      reconfiguration of the specific service function endpoint chain
      over which the SFC is being executed.  The handler forwards the
      information about the chain reconfiguration to PCE.

   o  Transport-derived SFF (tSFF1): the communication between ingress/
      egress SRs as well as SRs to PCE is realized via a transport-
      derived SFF.  We outline here three possible tSFFs

      *  SDN-based: This option utilizes path-based forwarding through
         SDN-based wildcard matching fields, supported with
         OF1.2+[Reed2016].  It can be embedded into slicing approach of
         underlying transport infrastructure by leaving typical slicing
         fields available (e.g., VLAN tags).  The forwarding utilizes
         the Ethernet frame format at Layer 2, representing the
         topological links of a specific forwarding path in the
         transport network as unique bits in a fixed size bit array.
         For the latter, the approach utilizes the IPv6 source and
         destination fields for storing the bit array information (in a
         simple version for this forwarding, this limits the topology to
         256 links but extensions schemes are possible, which are left
         out of this document at this stage).  AS mentioned, the SDN
         forwarding decision action is a simple wildcard matching,
         supported with OF1.2+, with the wildcard representing the
         unique bit of a switch-specific output port.  With that, the
         switch needs to consider as many forwarding rules as switch
         local output ports - see [Reed2016] for more information.  Fig.
         xx illustrate this forwarding solution, including the ability



Purkayastha, et al.     Expires September 2, 2018              [Page 19]

Internet-Draft         Alternative Handling of SFC            March 2018


         to create ad-hoc multicast relations by simply ORing individual
         bitarrays representing unicast paths.

      *  Another approach is outlined in [I-D.ietf-bier-use-cases] where
         the SFF is suggested to be realized via a BIER overlay, in turn
         realized over a BIER-compliant underlay, such as MPLS.  BIER
         utilizes a similar bit array approach for representing a
         forwarding path in the overlay network but unlike [Reed2016],
         the bit fields indicate the egress BIER-compliant router that
         the packet is supposed to reach.

      *  As yet another alternative, the tSFF may utilize a flow
         aggregation approach, outlined in [Khalili2016], called edge
         switch classification (ESC).  In this approach, a path from an
         ingress to egress SR is described as a so-called edge
         classification vector (ECV), which combines information on the
         aggregated flow (following [Khalili2016]) and the switch-local
         endpoint.  The representation has similar bitarray
         characteristics as the previous two approaches

   o  NOTE: with the ingress and egress SRs terminating SF Layer 3
      connections and the utilization of bitarray-based tSFFs, the
      transmission of packets can effectively take place as an ad-hoc
      Layer multicast while the SFC itself is denoted as an n-times
      unicast SFC.  As an example, consider the chaining of a set of n
      clients to a single video server.  Each sub-SFC from an individual
      client to the video server will semantically result in a unicast
      response from the server back to the client (e.g., carrying the
      video chunk for a MPEG DASH-based video stream).  When combining
      the sub-SFCs to the single SFC with n times unicast relations to
      the server, the SRR will deliver the responses from the server via
      one or more multicast responses to one or more clients.  The size
      of the individual multicast groups will depend on the
      synchronicity of the client requests (and therefore on the
      synchronicity of the server responses).  Note that the multicast
      relations here are ad-hoc created by ORing the bitarrays
      representing the specific clients to which the responses are meant
      to be sent.  This is illustrated in the figure below.  The HTTP
      multicast use case is being presented in the BIER use case draft
      [I-D.ietf-bier-use-cases]albeit without specific a SFC relation.











Purkayastha, et al.     Expires September 2, 2018              [Page 20]

Internet-Draft         Alternative Handling of SFC            March 2018


          +---------+   +---------+
          |         |   |         |                  +--------+
          +IP only  +---+ ICN     +         00000010 | ICN    |
          |receiver |   | SR1     |         |--------| SR3    |
          |UE       |   +----|----+         |        +---||---+
          +---------+        | 10010011     |            ||
                       +-----|----+   +----------+ |-----||-----|
                       |          |   |          |  |   Cloud  |
                       |SDN Switch|---|SDN Switch|   |        |
                       |          |   |          |    |--||--|
                       +----|-----+   +----------+       ||
                            | 10100011                   ||
          +---------+   +---|-----+                 +----||----+
          |         |   |         |                 |          |
          +IP only  +---+ ICN     +                 + IP only  +
          |sender UE|   | SR2     |                 | Server   |
          +---------+   +---------+                 +----------+


       Figure 7: Illustration of Bitfield-based Forwarding using SDN

7.  Protocol Consideration

   For the operations outlined in the previous section, we foresee the
   following protocol changes are required:

   o  SR-to-SR protocol for HTTP: HTTP based message exchange between
      client and server SRs

   o  SR-PCE protocol: Used for path computation, obtaining routing
      information as well as provide path updates

   o  Registration protocol: Used to register FQDN service endpoints

8.  Next Steps

   Feedback from the SFC WG on the validity of this solution and its
   scope within the SFC WG.  If such alternative to the re-
   classification for service indirection is seen beneficial as well as
   fitting with the charter of the WG, the next steps would be to update
   the draft to outline potential protocol solutions required for the
   realization of such SRR SF.

9.  IANA Considerations

   This document requests no IANA actions.





Purkayastha, et al.     Expires September 2, 2018              [Page 21]

Internet-Draft         Alternative Handling of SFC            March 2018


10.  Security Considerations

   TBD.

11.  Informative References

   [ETSI_MEC]
              ETSI, "Mobile Edge Computing (MEC), Technical
              Requirements", GS MEC 002 1.1.1, March 2016,
              <http://www.etsi.org/deliver/etsi_gs/
              MEC/001_099/002/01.01.01_60/gs_MEC002v010101p.pdf>.

   [H2020FLAME]
              EU, "EU H2020 FLAME PROJECT",  , March 2016,
              <https://www.ict-flame.eu/>.

   [I-D.ietf-bier-use-cases]
              Kumar, N., Asati, R., Chen, M., Xu, X., Dolganow, A.,
              Przygienda, T., Gulko, A., Robinson, D., Arya, V., and C.
              Bestler, "BIER Use Cases", draft-ietf-bier-use-cases-06
              (work in progress), January 2018.

   [I-D.ietf-sfc-dc-use-cases]
              Kumar, S., Tufail, M., Majee, S., Captari, C., and S.
              Homma, "Service Function Chaining Use Cases In Data
              Centers", draft-ietf-sfc-dc-use-cases-06 (work in
              progress), February 2017.

   [Khalili2016]
              Khalili, R., Poe, W., Despotovic, Z., and A. Hecker,
              "Reducing State of SDN Switches in Mobile Core Networks by
              Flow Rule Aggregation", ICCCN, August, 2016.

   [Reed2016]
              Reed, M., Al-Naday, M., Thomas, N., Trossen, D., and S.
              Spirou, "Reducing State of SDN Switches in Mobile Core
              Networks by Flow Rule Aggregation", ICC 2016, 2016.

   [RFC7498]  Quinn, P., Ed. and T. Nadeau, Ed., "Problem Statement for
              Service Function Chaining", RFC 7498,
              DOI 10.17487/RFC7498, April 2015,
              <https://www.rfc-editor.org/info/rfc7498>.

   [RFC8300]  Quinn, P., Ed., Elzur, U., Ed., and C. Pignataro, Ed.,
              "Network Service Header (NSH)", RFC 8300,
              DOI 10.17487/RFC8300, January 2018,
              <https://www.rfc-editor.org/info/rfc8300>.




Purkayastha, et al.     Expires September 2, 2018              [Page 22]

Internet-Draft         Alternative Handling of SFC            March 2018


   [UKNIC]    UK NIC, "5G Infrastructure Requirements in the UK", Final
              Report 3.0, December 2016,
              <https://www.gov.uk/government/uploads/system/uploads/
              attachment_data/
              file/577940/5G_Infrastructure_requirements_for_the_UK_-
              _LS_Telcom_report_for_the_NIC.pdf>.

Authors' Addresses

   Debashish Purkayastha
   InterDigital Communications, LLC
   Conshohocken
   USA

   Email: Debashish.Purkayastha@InterDigital.com


   Akbar Rahman
   InterDigital Communications, LLC
   Montreal
   Canada

   Email: Akbar.Rahman@InterDigital.com


   Dirk Trossen
   InterDigital Communications, LLC
   64 Great Eastern Street, 1st Floor
   London  EC2A 3QR
   United Kingdom

   Email: Dirk.Trossen@InterDigital.com
   URI:   http://www.InterDigital.com/


   Zoran Despotovic
   Huawei

   Email: Zoran.Despotovic@huawei.com
   URI:   http://www.huawei.com/


   Ramin Khalili
   Huawei

   Email: Ramin.khalili@huawei.com
   URI:   http://www.huawei.com/




Purkayastha, et al.     Expires September 2, 2018              [Page 23]