Network Working Group | Z. Kahn |
Internet-Draft | |
Intended status: Informational | J. Brzozowski |
Expires: May 2, 2017 | Comcast |
R. White | |
S. Zandi | |
P. Lumbis | |
Cumulus Networks | |
F. Baker | |
xxxx | |
October 29, 2016 |
Requirements for IPv6 Routers
draft-ali-ipv6rtr-reqs-01
An example.
This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79.
Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet-Drafts is at http://datatracker.ietf.org/drafts/current/.
Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress."
This Internet-Draft will expire on May 2, 2017.
Copyright (c) 2016 IETF Trust and the persons identified as the document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (http://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Simplified BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Simplified BSD License.
This memo defines and discusses requirements for devices that perform forwarding for Internet Protocol version 6 (IPv6). This can include (but is not limited to) the devices described below.
Readers should recognize that while this memo applies to IPv6, devices processing IPv6 packets will often also process IPv4 packets, forward based on MPLS labels, and potentially process many other protocols. This memo will only interact with IPv4, MPLS, and other protocols as they impact the behavior of an IPv6 forwarding device; no attempt is made to specify requirements for protocols other than IPv6.
The reader should, therefore, not count on this document as a "sole source of truth," but rather use this document as a guide.
This document is broken into the following sections: a review of Internet architecture and principles, requirements relating to device management, requirements related to telemetry, requirements related to IPv6 forwarding and addressing, and future considerations. Following these sections, a short conclusion is provided for review.
The Internet relies or interacts with a number of basic concepts and considerations. These concepts are not explicitly called out in any specification, nor do they necessarily impact protocol design or packet forwarding directly. This section provides an overview of these concepts and considerations to help the reader understand the larger context of this document.
Every point where multiple protocols, or multiple implementations of the same protocol, interact, is an interaction surface that can threaten the robustness of the overall system. While it may seem the global Internet has achieved a level of stability that makes it immune to such considerations, the reality is every network is a complex system, and is therefore subject to massive non repeatable unanticipated failures. Postel's Robustness Principle countered this problem with a simple statement: "Be conservative in what you do, and liberal in what you accept from others." [RFC7922]
However, since this time, it has been noted that following this law allows errors in protocols to accumulate over time, with overall negative effects on the system as a whole. RFC1918 [RFC1918] describes several points in conjunction with this principle that bear updating based on further experience with large scale protocol and network deployments within the Internet community, including:
Complexity, as articulated by Mike O'Dell (see RFC3439 [RFC3439]), is "the primary mechanism which impese efficient scaling, and as a result is the primary driver of increases in both capital expenditures (CAPEX) and operational expenditures (OPEX)." At the same time, complexity cannot be "solved," but rather must be "managed." The simplest and most obvious solution to any problem is often easy to design, deploy, and manage. It's also often wrong and/or broken. As much as developers, designers, and operators might like to make things as simple as possible, hard problems require complex solutions. The following observations apply to the management of complexity in both protocol and network design.
Elegance is the ultimate goal. Rather than seeking out simple solutions because they are simple, seek out solutions that will solve the problem in the simplest way possible. Often this will require seeing the problem from different angles, trying to break the problem up in multiple ways, and trying, abandoning, and rebuilding ideas and implementations until a better way is found.
There are always tradeoffs. For any protocol, network, or operational design decision, there will always be a tradeoff between at least two competing goals. If some problem appears to have a single solution without tradeoffs, this doesn't mean the tradeoffs don't exist. Rather, it means the tradeoffs haven't been discovered yet. In the area of protocol and network design, these tradeoffs often take the form of common "choose two or three" situations, such as "quick, cheap, high quality." In networks, however, the tradeoffs are often:
These three make up a "triangle problem." For instance, to increase the optimization of traffic flow through a network generally requires adding more state to the control plane, leading to problems in complexity due to amplification. To reduce amplification, the control plane (or perhaps the various functions the control plane serves) can be broken up into subsystems, or modules. Breaking the control plane up into subsystems, however, introduces interaction surfaces between the components, which is another form of complexity.
The Internet data plane is organized around broad top and bottom layers, and much thinner middle layer. This is illustrated in the figure below.
\ / \ HTTP, FTP, SNMP, ETC. / \ / \ TCP, UDP / \ / \ IPv6 / / (MPLS) \ / \ / \ / Ethernet, Wireless \ / Physical Media \ / \
Figure 1
This layering emulates or mirrors many naturally occurring systems, and is a common strategy for managing complexity. The single protocol in the center, IPv6, serves to separate the complexity of the lower layers from the complexity of the upper layers. This center layer of the Internet ecosystem has traditionally been called the Network Layer, in reference to the Department of Defense (DoD) [DoD] and OSI models. [OSI] The Internet ecosystem includes three different protocols in this central location.
These protocols are often treated as if they exist in strict hierarchical layers with a well defined and followed Application Programming Interface (API), data models, Remote Procedure Calls (RPCs), sockets, etc. The reality, however, is there are often solid reasons for violating these layers, creating interaction surfaces that are often deeper than intended or understood without some experience. Beyond this, such layering mechanisms act as information abstractions. It is well known that all such abstractions leak. Because of these intentional and unintentional leakages of information, the interactions between protocols is often subtle.
A router connects to two or more logical interfaces and at least one physical interface. A router processes packets by:
When consulting the forwarding table, the router searches for the longest prefix containing the destination address, and uses the information in the table to determine the next hop, or rather the next logically connected device to forward the packet to. The next hop will either be another router, which will presumably carry the packet closer to the final destination, or it will be the destination host itself. The following figure provides a conceptual model of a router; not all routers actually have this set of tables and interactions, and some have many more moving parts. This model is simply used as a common reference to promote understanding.
+-------------+ +-------------+ | Candidate | | Startup | | Config |<--+ +-->| Config | +--+----------+ | | +-------+-----+ | | | | v | | v +-----------------+----+-----------------+ | Running Configuration +------>----------+ +---+----------+----------+----------+---+ | | | | | | v | | | | +-------+ | | | | | IS-IS |<-----------------------------------> Adjacent | +---+---+ v | | Routers | | +-------+ | | | | | BGP |<------------------------> Peers | | +---+---+ v | | | | +-------+ | | | | | OSPF |<-------------> Adjacent | | | +---+---+ v Routers | | | | +-------+ | | | | | Other | | | | | +---+---+ | | | | | | +---+----------+----------+----------+---+ | | RIB Manager | | +---+------------------------------------+ | | | +---+------------------------------------+ | | Routing Information Base (RIB) | | +---+------------------------------------+ | | | +---+------------------------------------+ | | Forwarding Information Base (FIB) | | +---+----------+---------------------+---+ | | | | | +---+---+ +---+---+ +---+---+ | | Int 1 | | Int 2 | ... | Int X | <---------------+ +-------+ +-------+ +-------+ ^ | | v Packets In Packets Out
Figure 2
Network engineering began in the era of Command Line Interfaces (CLIs), and has generally stayed with these CLIs even as the Graphical User Interface (GUI) has become the standard way of interacting with almost every other computing device. Direct human interaction with networking devices in large scale and complex environments, however, tends to result in an unacceptably low Mean Time Between Mistakes (MTBM), directly impacting the overall availability of the network. In reaction to this, operators have increased their reliance on automation, specifically targetting machine to machine, socket, and Application Programming Interface (API) solutions in deploying and configuring devices. This section considers the various components of device management.
Configuration primarily relates to the startup, candidate, and running configurations in the router model shown above. In order to deploy networks at any scale, operators rely on automated management of network device configuration. This effort has traditionally focused on Simple Network Management Protocol (SNMP) Management Information Base (MIBs). In the future, operators expect to move towards open source/open standards YANG models.
Network devices should place a priority on supporting machine readable APIs, rather than human interaction, particularly interfaces that understand and accept configuration and other information carried in YANG models.
To support automated network device configuration, IPv6 routers and network devices SHOULD support YANG and SNMP configuration, including (but not limited to):
To operate a network at scale, operators rely on the ability to access a device to troubleshoot and gather state manually and programmatically through a number of different interfaces. These interfaces should provide current device configuration, current device state (such as interface state, packets drops, etc.), and current control plane contents (such as the RIB in the figure above). In other words, manual and programmable interfaces should provide information about the network device (the whole device stack).
To support automated state gathering and troubleshooting, routers supporting IPv6 SHOULD support:
NETCONF [RFC6241] and RESTCONF [I-D.ietf-netconf-restconf] are currently standard mechanisms designed to carry YANG models for device configuration and telemetry. However, gRPC [GRPC] is becoming a common alternative for telemtry. Devices SHOULD support one or both of these access methods to gather telemetry information.
To operate a network at scale, operators rely on protocols and mechanisms that reduce provisioning time to a minimum. The preferred state is zero touch provisioning; plug a new network device in and it just works without any manual configuration. The closer an operator can come to this ideal, the more MTBM and Operational Expenses (OPEX) can be reduced -- an important goals in the real world.
To reach this goal, IPv6 network devices should support several standards, including, but not limited to:
SLAAC SHOULD be able to be disabled by operators who prefer to use some other mechanism for address management and assignment.
Denial of Service (DoS) and Distributed Denial of Service (DDoS) attacks are unfortunately common in the Internet globally; these types of attacks cost network operators a great deal in opportunity and operational costs in prevention and responses. The network control plane, as well as protocols used to access the device. To provide for effective counters to DoS and DDoS attacks directly on network devices:
There are other useful techniques for dealing with DDoS attacks, including: transferring sessions to a new address and abandoning the address under attack, using BGP communities to spread the attack over multiple ingress ports and "eat" it, and requiring mutual authenication before allocating larger resource pools to a connection, these techniques are not "device level," and hence are not considered further here.
Telemetry relates to information devices push to systems used to monitor and track the state of the network. This applies to individual devices as well as the network as a system. Two major challenges face operators in the area of telemetry:
There are three broad categories of telemetry: device state, topology state, and flow state. These three roughly correspond to the management plane, the control plane, and the forwarding plane of the network. Each of the sections below considers one of these three telemetry types.
IPv6 network devices should be able to report errors and status changes in a number of different formats. The supported formats should focus on machine readability, rather than human readability. Specifically, IPv6 network devices SHOULD support:
NETCONF/RESTCONF transporting telemetry formatted according to YANG should be the current focus of development for vendor provided and open source software; ideally, the entire network could be monitored using a single modeling language to ease implementation of telemetry systems and increase the pace at which new software can be deployed in production environments.
IPv6 network devices are part of a system of devices that, combined, make up the entire network. Viewing the network as a system is often crucial for operational purposes, such as understanding flows, tracing changes in the topology, utilization, and other factors over time. To support systemic monitoring of the network topology, IPv6 devices SHOULD support at least:
(More to be added)
(To be added)
There are a number of capabilities that a device should have to be deployed into an IPv6 network, and several forwarding plane considerations operators and vendors need to bear in mind. The sections below explain these considerations.
The IPv6 address is commonly treated as a host identifier; it is not. Rather, it is an interface identifier that describes the topological point where a particular host connects to the Internet. It is generally harmful to embed IPv6 addresses inside upper layer headers to identify a particular host.
Internet Routing Registries may allocate a network operator a wide range of prefix lengths (see RFC6177 [RFC6177] for further information). Within this allocation, network operators will often suballocate address space along nibble boundaries (/48, /52, /56, /60, and /64) for ease of configuration and management. Several common practices are:
Given these common practices, routers designed to run IPv6 SHOULD support the following addressing conventions:
The long history of the Maximum Transmission Unit (MTU) in networks is not a happy one. Specific problems with MTU sizing include:
The final point requires some further elucidation. The time required to serialize various packets at various speeds are:
A 64 byte packet trapped behind a single 1500 byte packet on a 10Mb/s link suffers 1.7ms of serialization delay. Each additional 1500 byte packet added to the queue in front of the 64 byte packet adds and addtional 1.2ms of delay. In contrast, a 64 byte packet trapped behind a single 9000 byte packet on a 10Mb/s link suffers 7.7ms of serialization delay. Each additional 9000 byte packet added to the queue adds an additional 7.2ms of serialization delay. The practical result is that larger MTU sizes on lower speed links can add a significant amount of delay and jitter into a flow. On the other hand, increasing the MTU on higher speed links appears to add megligable additional delay and jitter.
The result is that it costs less in terms of overall systemic performance to use higher MTUs on higher speed links than on lower speed links. Based on this, increasing the MTU across any particular link may not increase overall end-to-end performance, but can greatly enhance the performance of local applications (such as a local BGP peering session, or a large/long standing elephant flow used to transfer data across a local fabric), while also providing room for tunnel encapsulations to be added with less impact on lower MTU end systems.
The general rule of thumb is to assume the largest size MTU should be used on higher speed transit only links in order to support a wide array of available link sizes, default MTUs, and tunnel encapsulations. Routers designed for a network or data center core SHOULD support at least 9000 byte MTUs on all interfaces. MTU detection mechanisms, such as IS-IS hello padding, [RFC7922] SHOULD be enabled to ensure correct point-to-point MTU configuration. Devices SHOULD also support:
Internet Control Message Protocool (ICMP) is described in [RFC0792] and [RFC4443]. ICMP is often used to perform a traceroute through a network (normally by using a TTL expired ICMP message), for Path MTU discovery, and, in IPv6, for autoconfiguration and neighbor discovery. ICMP is often blocked by middle boxes of various kinds and/or ICMP filters configured on the ingress edge of a provider network. Routers implementing IPv6 SHOULD:
In order to support treating the "network as a whole" as a single programmable system, it is important for each network device have the ability to directly program forwarding information. This programmatic interface allows controllers, which are programmed to support specific business logic and applications, to modify and filter traffic flows without interfering with the distributed control plane. While there are several programmatic interfaces available, this document suggests that the I2RS interface to the RIB be supported in all IPv6 network devices. Specifically, these drafts should be supported to enable network programmability:
(To be added)
(To be added)
(To be added)
(To be added)
This document addresses several ways in which devices designed to support IPv6 forwarding can and should be designed and/or configured to support increased security. These recommendations do not, however, modify protocol behavior, and therefore do not create new security vulnerabilities that need to be considered.
(To be added)
[RFC2119] | Bradner, S., "Key words for use in RFCs to Indicate Requirement Levels", BCP 14, RFC 2119, DOI 10.17487/RFC2119, March 1997. |