Network Working Group | M. Boucadair |
Internet-Draft | C. Jacquenet |
Intended status: Informational | France Telecom |
Expires: December 12, 2013 | June 10, 2013 |
Requirements for Automated (Configuration) Management
draft-boucadair-network-automation-requirements-01
Given the ever-increasing complexity of the configuration tasks required for the dynamic provisioning of IP networks and services, this document aims at listing the requirements to drive the specification of an automated configuration management framework, including the requirements for a protocol to convey configuration information towards the managed entities.
This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79.
Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet-Drafts is at http://datatracker.ietf.org/drafts/current/.
Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress."
This Internet-Draft will expire on December 12, 2013.
Copyright (c) 2013 IETF Trust and the persons identified as the document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (http://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Simplified BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Simplified BSD License.
IP network and service configuration procedures are currently handled by skilled personnel who is often required to acquire a high level of expertise that grows as the variety and the complexity of the services to be delivered over an IP network. This demand for a high level of expertise is further increased by heterogeneous network and service environments where each equipment manufacturer has developed its own proprietary interfaces and configuration schemes. As a consequence, the time to deliver complex yet advanced IP service offerings (such as IPTV, VPN, etc.) is also increasing at the risk of jeopardizing customers' quality of experience.
This document advocates for the need to undertake a standardization effort to define an automated provisioning framework that includes a set of interfaces and protocol(s) for conveying configuration information which should help in facilitating the automation of the network resource allocation and service delivery procedures. Defining standard data and information models [RFC3444] to capture offered network services would help to automate the process of service ordering and activation and therefore accelerating service provisioning. Automation should not be targeted at dynamically enforcing policies only, but also be encouraged to:
This document assumes that service differentiation at the network layer can be enforced by tweaking various parameters which belong to distinct dimensions (e.g, forwarding, routing, traffic access management, traffic classification, etc.). As such, the decision point is likely to interact with several engines (e.g., routing engine, forwarding engine, etc.). In particular, this document considers that an I2RS system can be seen as a subset of an overall framework. I2RS is limited to routing and forwarding actions (see Section 5.2.2). To meet performance requirements (see Section 5.1.2), it is strongly encouraged to design a system which interacts directly with the routing and forwarding system, rather than requiring local proxy functions which are responsible for translating vendor-independent commands and policies into vendor-specific configuration commands and syntax.
In addition to protocol-related considerations, automating network operations heavily relies upon the availability of intelligent policy decision points. Sharing best design practices for policy decision point logics would facilitate the adoption of the proposed approach (see Section 4).
The document enumerates currently encountered issues (see Section 3) and identifies a set of requirements (see Section 5). A service-driven approach is purposed in Section 4.
This document makes use of the following terms:
Service providers and network operators have gained experience in implementing, deploying and manipulating a large set of protocols and associated information. Some data models have also been defined for network management purposes. Thus, several protocols have been standardized, such as SNMP (Simple Network Management Protocol [RFC3410]), COPS (Common Open Policy Service [RFC2748]), (COPS-PR [RFC3084]) or, more recently, NETCONF [RFC6241].
In addition, multiple data models have been defined and used by operators like CIM (Core Information Model), DEN (Directory-Enabled Network), SMI (Structure of Management Information [RFC2578]), SPPI (Structure of Policy Provisioning Information [RFC3159]), and, more recently, YANG [RFC6022].
Despite this standardization effort, most of the service operators still assume manual configuration through proprietary CLI (Command Line Interface) commands possibly combined with in-house developed, vendor-specific scripts to proceed with the configuration of numerous features, such as forwarding and routing capabilities, Quality of Service (sometimes including traffic engineering) capabilities, and security capabilities.
Other non-technological challenges are also to be taken into consideration when discussing network automation (e.g., to what extent an automated system will accommodate both simple and complex business scenarios, how an automated system will evolve to accommodate changes and new procedures, assess the impact on testing methodologies,etc.).
The purpose of this document is to document requirements rather than focusing on the non-technological challenges.
Maintain and operate self-adaptive networks may be seen as a long term objective for IP service providers. To achieve this goal, intermediate objectives should be defined, such as:
Discussing the items above is out of scope. This document only discusses requirements for (automated) configuration procedures and protocol.
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in [RFC2119].
The following sub-sections enumerates a set of issues.
The delivery of IP services relies upon the activation of a set of capabilities located in various devices that include routers, switches, service platforms, etc. In particular, a large set of protocols need to be configured, such as routing protocols, management protocols, security protocols, let alone capabilities that relate to addressing scheme management, policy enforcement, etc.
Such a diversity of features and protocols may increase the risk of inconsistency at the cost of QoS degradation or even service disruption. Therefore, the configuration information which is forwarded to the whole set of participating devices for delivering a given service or a set of services should be consistent, whatever the number of features/services to be activated/deployed in the network.
Network operators should have means to dynamically discover the topology of their network.
Such topological information should be as elaborate as possible, including details like the links that connect network devices, their capacity, such as the total bandwidth, the available bandwidth, the bandwidth that can be reserved, etc.
Numerous, often multi-vendor devices are involved in the delivery of IP services. These devices support various capabilities that need to be combined for the delivery of a given service or a set of services. The availability and status of such capabilities is therefore a critical information for service providers, since it is likely to affect service and network design, let alone operational procedures.
Therefore, service providers and network operators should have means to:
Configuring a set of devices to deliver a service takes time. In addition, depending on the complexity of the service, erroneous configurations may occur at the cost of jeopardizing the overall quality of a service, if not causing service disruption. From this perspective, some performance indicators must be defined and measured to assess:
Tools to qualify (by simulation or emulation) any possible impact of an elementary configuration task before such task is performed should be supported. These tools aims to prevent errors amplification.
As far as scalability is concerned, adequate indicators should be specified in order to assess the ability of configuration techniques and protocols to support a large number of simultaneous processes. The maintenance of these processes should not impact the performance of the configuration system as a whole (i.e., manager and managed entities, amount of configuration task-specific traffic exchanged between manager and managed entities, periodicity of configuration operations, etc.).
Therefore, configuration operations should be qualified with performance indicators in order to check whether the architecture designed for configuration management is scalable in terms of:
Manual configuration is not only a likely source of errors, but it also affects the time it takes to complete a configuration task (or a combination thereof) to deliver a service, as a function of the task complexity and the need for global consistency. Thus, the efficiency of a configuration process is likely to be improved by the introduction of a high level of automation. Automation is defined as follows:
Configuring a network or a service raises several security issues, including (but not limited to):
Current practice consists in configuring elementary functions, i.e., configuration management for a given service offering is decomposed into a set of elementary tasks. Thus, the consistency of configuration operations for the sake of service delivery must be checked by any means appropriate.
A network device should be seen as a means to deploy a service and not just as a component of such service. Thus, service delivery procedures should not assume the configuration of devices one after the other, but rather globally, i.e., at the scale of the network that supports the said service. Such a service-driven configuration management scheme is therefore meant to facilitate and improve the completion of configuration tasks, by means of highly automated, service-wise, global configuration procedures.
This in particular assumes the need for robust configuration mechanisms that include appropriate protocol machinery (e.g., from a reliable transport mode perspective) to convey configuration information between manager and managed entities, as well as reliable consistency check procedures. The latter is not only meant to assess the validity of all the configuration operations service-wise, but also the efficiency of the corresponding yet dynamic policy enforcement and resource allocation schemes.
An implementation example is the case of service providers who could dedicate (logical) centralized entities which are responsible for the provisioning and the management of participating devices. The main function of these centralized entities is to make appropriate decisions and generate the decision-derived configuration data that will be forwarded to the participating devices. In addition, these centralized entities will make sure of the consistency of the decisions that have been made to deliver the service, according to a dynamic configuration policy enforcement scheme. These logical entities will be responsible for assessing whether the enforced policies are compliant with the expected behavior and how efficiently they are enforced.
Service-driven configuration management leads to the following assumptions:
Configuration information MUST be provided to the participating devices by means of a protocol to be used between such devices and a presumably centralized manager entity. The latter can be seen as a decision point where configuration information is stored, maintained and updated whenever required.
Decisions about configuring additional features or devices, enforcing policies and allocating resources are made accordingly, e.g., as a function of the number of Service Level Specification templates that are processed per unit of time combined with traffic forecasts that are updated on a regular basis. Such decisions are converted into configuration information that is forwarded towards the relevant managed entities.
The vendor-independent communication protocol for conveying configuration information SHOULD have the following characteristics:
The protocol for conveying configuration information within a network should be designed so that:
The introduction and the activation of a protocol for conveying configuration information SHOULD allow for smooth migration procedures, so that vendor-specific and vendor-independent configuration procedures may gracefully co-exist on a (hopefully) limited period of time.
Also, in case of any kind of protocol failure, it MUST be possible to rely upon any vendor-specific configuration procedure as some kind of rollback procedure. Such a rollback procedure MUST protect services that are up and running from any risk of disruption.
Configuration tasks are currently performed with vendor-specific solutions that reflect technology-specific information. It is therefore more and more difficult for a service provider to get a unified, homogeneous view of the network resources service-wise (rather than device-wise).
Configuration information SHOULD therefore be provided to the participating devices as unified, vendor-agnostic, service configuration parameters. These parameters MUST reflect a standardized service data model rather than a vendor-specific information model, unlike the current situation. Examples of such service data models include a tunneling service, an intra-domain routing service, or a VPN service.
The need for a unified, homogeneous access to a multi-vendor environment is becoming critical for N-Play, residential and corporate, fixed and mobile service providers so that a high level of automation can be introduced while proceeding with the configuration of the said multi-vendor environment. This unification is clearly conditioned by the availability of two key components: A configuration protocol (the container) and a set of data models (the content).
The standardization of these two components has several yet major benefits:
As a consequence, it becomes possible to add semantic rules to automate detection and correction of erroneous configurations, either at the scale of a single device or at the scale of a whole network. Furthermore, an equipment from vendor X could de replaced by another technology from vendor Y with very little impact (if no impact at all) on the configuration management procedures.
To do so, the data models SHOULD satisfy the following requirements.
Configuration information for identification purposes mostly deals with the naming of any interface supported by a given device. This naming scheme describes the properties of an interface, and must take into account all the parameters that are required to correctly configure an interface. The following information MUST be provided:
IP services are provided with a level of quality that MAY be guaranteed (either qualitatively or quantitatively) by any means appropriate. QoS policies SHOULD be dynamically enforced according to a data model that will accurately reflect all the elementary QoS capabilities that MAY be configured and activated to enforce such policies.
For instance, in the case of the activation of the DiffServ QoS model within a network infrastructure, the participating routers SHOULD be provided with the appropriate PHB (Per Hop Behavior) configuration parameters.
Network devices usually support functions that allow the activation of specific services like HTTP, BOOTP, DHCP, SYSLOG, etc. These devices MUST therefore be provided with the corresponding configuration information.
Routing and forwarding configuration information deals with the decision that should be applied by a participating device to forward an incoming IP datagram, according to the (possibly service-specific) forwarding and routing policies defined by the service provider. From this perspective, the participating devices should be provided with the following configuration information:
Any candidate protocol MUST be compliant with the following requirements:
Traffic Engineering (TE) is an important and often complex task of configuration management: the participating devices should be provided with the configuration information that will help them to select the appropriate routes that lead to a set of destinations, according to specific constraints and requirements that may have been dynamically negotiated with the customer.
These constraints may be expressed in terms of time duration (e.g., the use of a traffic-engineered route on a weekly basis), traffic characterization (e.g., all IP multicast traffic should be forwarded along a specific distribution tree), security concerns (e.g., use IPsec tunnels), and/or QoS considerations (e.g., EF (Expedited Forwarding)-marked traffic [RFC3246] should always use a subset of "EF-compliant" routes).
The identification of a tunnel should be globally unique, especially if the tunnel is to be established and activated across autonomous systems. The tunnel identification schemes (e.g., endpoint numbering) should be left to service providers, assuming that the corresponding formalism is commonly understood, whatever the number of autonomous systems the tunnel may cross.
The tunnel identification information SHOULD at least be composed of the tunnel endpoint identification information. The tunnel identification information MAY also be composed of an informal description of the tunnel, e.g., the purpose of its establishment, customer traffic that may be forwarded into this tunnel, etc.
Any participating device MUST be provided with the configuration information related to the tunneling technique to be used for the establishment and the activation of the tunnel. Such techniques include Generic Routing Encapsulation (GRE, [RFC2784]), IP Secure in tunnel mode (IPsec, [RFC2401]), Layer 2 Tunneling Protocol (L2TP, [RFC2661]), etc.
Mechanisms to monitor and report any fault that affects service operation SHOULD be independent of the configuration protocol.
Errors during a configuration procedure could impact the availability of a given service offering, while consistency checks are mandatory so as to correctly enforce a configuration policy.
The following requirements have been identified:
To be detailed.
It MUST be possible to activate mutual authentication between manager and managed entities. The authentication MUST be checked before exchanging any configuration data, so as to prevent DoS (Denial of Service) attacks.
Two types of integrity MUST be provided. The first one may be done at the network layer, e.g., by using the IPsec protocol suite. The second type should protect configuration data at the application layer (e.g., the entire file configuration is integrity protected).
The participating device SHOULD provide security functions that provide confidentiality. Encryption algorithms MUST be standard and manually or automatically activated.
The configuration system MUST provide a scalable key management scheme. The number of keys to be managed must be at most linearly proportional to the number of the devices.
The participating device MUST log all configuration connections. At least the following information must be provided:
The configuration system MUST allow the definition and the activation of several privilege levels. Each level could be associated to a set of administrative functions. Each configuration administrator could be assigned a specific access level to perform a (possibly limited) set of configuration tasks.
This document does not require any action from IANA.
This document reflects a set of requirements as far as the design and the enforcement of configuration policies are concerned for (automated) service subscription, delivery and maintenance. As such, the document addresses some security concerns that have been depicted in Section 5.4, and that SHOULD be taken into account when considering the specification of a protocol that will convey configuration information, as well as configuration information itself.
Many thanks to M. Achemlal and Y. Adam who contributed to a first version of this text.
Thanks for W. George for the comments.
[RFC3410] | Case, J., Mundy, R., Partain, D. and B. Stewart, "Introduction and Applicability Statements for Internet-Standard Management Framework", RFC 3410, December 2002. |
[I-D.boucadair-connectivity-provisioning-profile] | Boucadair, M., Jacquenet, C. and N. Wang, "IP/MPLS Connectivity Provisioning Profile", Internet-Draft draft-boucadair-connectivity-provisioning-profile-02, September 2012. |
[RFC3444] | Pras, A. and J. Schoenwaelder, "On the Difference between Information Models and Data Models", RFC 3444, January 2003. |
[I-D.boucadair-connectivity-provisioning-protocol] | Boucadair, M. and C. Jacquenet, "Connectivity Provisioning Negotiation Protocol (CPNP)", Internet-Draft draft-boucadair-connectivity-provisioning-protocol-00, May 2013. |