Internet Engineering Task Force | H. Pfeifer, Ed. |
Internet-Draft | ProtocolLabs |
Intended status: Informational | S. Widmann |
Expires: August 15, 2019 | University for Applied Sciences Munich |
February 11, 2019 |
Dynamic MultiPath Routing Protocol
draft-pfeifer-rtgwg-dmpr-00
Dynamic MultiPath Routing (DMPR) is a loop free path vector routing protocol with built-in support for policy based multipath routing. It has been designed from scratch to work at both low and high bandwidth networks - even with high packet loss. The objective was to keep routing overhead low and ensure a deterministic upper limit. DMPR can be used to manage huge networks with a similar feature set as BGPv4 except for the concept of autonomous systems.
This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79.
Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet-Drafts is at https://datatracker.ietf.org/drafts/current/.
Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress."
This Internet-Draft will expire on August 15, 2019.
Copyright (c) 2019 IETF Trust and the persons identified as the document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (https://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Simplified BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Simplified BSD License.
Todays mobile wireless networks have a diversity of requirements on the wireless links. To meet these requirements, it is possible to attach multiple network access technologies on the router and select, depending on the CoS of the packet, over which wireless link the packet is sent. This is the main idea of policy based multipath routing. The established routing protocols do not support the use of multiple access technologies on a single router. To tackle this issues, DMPR as been developed as a protocol for policy based multipath routing in and between mobile networks, which consist of multiple wireless links with different characteristics. DMPR makes it possible to calculate multiple routing tables and maintain the best paths for multiple policies.
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in RFC 2119.
The following list describes the terminology used in this RFC
Section 2 describes the information every node has and gets and how it should handle this information. Section 3 describes the behavior of the protocol in different scenarios and how it achieves particular features. Section 5 describes the message format in detail. Section 6 describes optional features that can be implemented to improve or extend DMPR but are not required for the basic functionality of propagating routing information.
Appendix A has a few policy examples. Appendix B lists all constants used in this RFC where some of them are configurable. Appendix C shows a few simple examples of how DMPR behaves under specific network conditions.
Every router periodically sends out a multicast message to all its neighbors. This message includes the best path to each node known by this router. A router receiving this message adds itself to all these paths, chooses the best path to each node among all those it got from its neighbors and advertises these paths itself via multicast messages. This is standard procedure in a path-vector routing protocol. In DMPR the paths are further separated by a policy, therefore it is possible to have more than one path to each node. Also included in the message are all attributes of this path so that the receiving node can make informed decisions on whether this path is the best under the current policy. In the end there exist multiple paths through the network and packets can be routed according to their requirements which for example can be the path with the highest bandwidth or with the lowest latency.
Traditionally, routing protocols find the best path using a scalar metric. This metric may be a simple constant stating the preference of the link or may be a computed metric using several factors such as bandwidth, latency or cost. Furthermore, this metric is only known locally or, for example in the case of BGP-4, where external paths can be marked with a local preference for internal peers, to a small subset of the network. In DMPR a policy is a globally defined function that defines a function how to determinate the best path. For this reason, the policy has all link attributes of each path at its disposal and therefore has a higher control over which path it chooses.
To maintain the routing data, all data extracted from the received routing messages is organized in different data structures. These data structures are used for calculating the routes and creating of the routing tables.
Every message received by a node is saved in the Message Information Base. This data structure is the basis for all subsequently emitted routing messages and the routing table. The Message Information Base contains the latest (by sequence number) received message for each neighbor. Each entry is controlled by a hold timer and is purged after the hold timer expires. This timer is reset as soon as a new valid message from this neighbor is received. This data structure contains potentially duplicate or conflicting data since many neighbors send their paths and subnet information for all nodes they know of. This is intended and required because any message may be deleted at almost any time when its hold timer expires.
The Routing Data is computed from the Message Information Base. In contrast to the Message Information Base this is data structure contains no conflicting information and can be seen as the single source of truth for the routing table and routing message generation. It contains the best path to each known node grouped by policy. Furthermore the advertised subnets from each node are combined while following the network retraction rules (see Section 3.9).
The routing table contains the best path to each known subnet grouped by policy. This table is meant to be inserted into the underlying operating system's routing table.
Each node listens to a multicast address. This multicast address SHOULD be configurable. Periodically, after a defined message interval + jitter, a node sends out its routing message, which includes: Section 5.4.
When a node receives a message, it deduces a connection and therefore a path to the sender via the interface that message came in. With this mechanism the path to a node propagates through the network. Every received message SHOULD be assigned to a hold timer. When this hold timer expires, the message MUST be deleted from the Message Information Base. The timeout SHOULD be a multiple of the routing message interval. Asymmetric links are handled with a feature called reflection, which is described in
Whenever a neighbor is not reachable anymore (e.g. due to topology change), no further routing messages will be received from this neighbor. As all the received messages are assigned to a timer, no routing messages of the lost neighbor will be present in the Message Information Base, after the expiration of this timer. This means, that the loss of the neighbor is detected after the timeout of the last received routing message of the concerned neighbor. For this reason, the routing table SHOLD be recalculated, whenever the Message Information Base is purged due to timeouts.
In DMPR all interfaces that are registered with the DMPR daemon are treated completely separate from each other. Routing messages are sent over each one individually. This ensures that all possibly accessible neighbors are reachable. The message information base (seeSection 2.1) is grouped by interface. Therefore nodes can detect links on each interface individually.
Each message received by a neighbor is saved in the Message Information Base (seeSection 2.1). With this message a hold timer is set that purges the messages after the given time. To improve the message size a node can choose to only send partial updates (i.e. differential, only the changes since the last full update). To support this a node has to retain the a version from the last full message so it can apply the new partial message.
To support routing different traffic types over different routes, DMPR supports multiple policies. A policy defines, how the best path to a destination is computed using the available link attributes. For all the defined policies, seperate routes will be calculated to every reachable destination. The best path to all destinations is included in the routing message for every policy. For each policy, a seperate routing table MUST be generated. In large networks, this results in a multi topology routing. When different parts of the network have different attributes (e.g. one path has a low loss rate, another path in contrast has a higher bandwidth), different subsets of the topology will be used to forward packets that require different policies (Class of Service).
DMPR MUST know the link attributes that are required to determinate the best path for all the registered policies on all links (interfaces) to the router. These attributes can either be configured staticly by the network administrator or can be dynamically gathered from the attached modem devices. This RFC does not specify how the link information has to be gained. There is no specification defining which attributes must be given to the protocol (e.g bandwidth, loss rate, latency). The requirement of attributes depends on the policies meant to be used.
The calculation of the best route MUST be defined for each policy separately. For example simple policies only consider a single link metric (such as bandwidth or loss rate) for the calculation of the best route. Combined policies might use a combination of multiple link metrics. The route selection mechanism is part of a policy's definition and therefore can be individually defined.
For every reachable destination, the routing data is calculated for all defined policies using the policy's specific route selection mechanism. As a result, there MUST be a seperate routing table for every defined policy. Whenever new crucial information is received, the routing data MUST be recalculated. Information that causes a recalculation of the routing data can be:
Each network advertised in a message has an optional flag called "retracted". This flag is set to true when a node no longer advertises this network as available. The only node to ever set this flag to true MUST be the originally advertising node. A set retracted flag always supersedes an unset flag. Networks are forwarded with this ruleset:
Network known as not retracted | Network known as retracted | Network set as retracted in a message | Action to take |
---|---|---|---|
false | false | false | Insert network in known networks and forward |
false | false | true | ignore network, do not forward |
false | true | false | forward network as retracted |
false | true | true | forward network as retracted |
true | false | false | just forward network |
true | false | true | set network in known networks as retracted and forward retracted |
true | true | false | illegal |
true | true | true | illegal |
The parameters described in this section SHOULD be configured at the startup of the router. This specification does not define any values for the parameters.
Core Configuration describes the fields which MUST be the same on all routers in the neworks.
This field describes a list with all the policies which should be considered by the routing protocol. The policies are referenced by name. For every listed policy, the routing protocol MUST implement the according route selection mechanism.
This field describes a list with all the routers physical interfaces, which are used as DMPR interfaces. For all interfaces following fields are required:
A DMPR packet consists of a preamble, followed by zero or more extension headers followed by zero or one payload. Each extension header and payload is defined by a type.
Type | Use |
---|---|
0-119 | Extension Header |
120-127 | Extension Header, reserved for private use |
128-247 | Payload |
248-255 | Payload, reserved for private use |
Possible Types are defined in further detail below
The preamble of a DMPR packet is as follows
0 1 2 3 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ |Magic| Reserved| NextType | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
The preamble of a packet
An extension header consists of the type immediately following this header, a length specifier, and the Extension Header data.
0 1 2 3 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | NextType | Length | | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ + | | + Data + | | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
The Payload consists of the data from the end of the preamble or last extension header until the end of the packet. Payloads may be recursive, i.e. contain a valid packet (or parts of it) in themselves. Payload processors therefore MUST have the ability to feed their result back into the message processing chain. This behavior is defined by the payload itself.
Type: 127
This is a keep-alive packet, the payload length is zero. Implementations SHOULD reset the message hold timer for the sending node upon receiving a keep-alive packet
Type: 128
Unkompressed, plain, standard-compliant I-JSON data as described in [RFC7493]. This is the main routing data; its structure is defined in Section 5.2
Type: 129
LZMA-compressed standard-compliant I-JSON data as described in [RFC7493]. This is the main routing data; its structure is defined in Section 5.2
Type: 130
A packet greater than the MTU between two nodes SHOULD be fragmented using the fragmentation payload.
0 1 2 3 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | Identifier |L|Packet offset| | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ + | Payload | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
The Fragmentation Payload Header
When a packet is larger than the MTU of the link between two nodes it SHOULD be fragmented. For this purpose, the sending node computes the maximum effective payload size for packets sent (i.e MTU less preamble, possibly extension headers and the fragmentation header) and splits the original packet into parts with this size. For each of these parts, it sends a packet with the fragmentation header set to a common identifier, a corresponding packet offset and the LAST bit set for the last fragment.
The receiving node keeps track of all received fragments, grouping them by source address and identifier. As soon as all fragments of a packet have been received, the reconstructed packet MUST be fed back into the message processing chain as if it were a new, just received packet. Fragments MUST be regularly purged based on a hold timer.
A JSON payload is an ASCII-7 encoded JSON object. A sending node SHOULD use ascii-encoding for the JSON data. A receiving node MUST be able to decode UTF-8 encoded data. The payload can be zip compressed. A compressed payload has to be announced in the header.
Augmented requirements language for this section:
General Message Structure
{ "id": <NODE_ID>, "seq": <SEQUENCE_NUMBER>, "type": <TYPE>, "partial-base": <SEQUENCE_NUMBER>, "addr-v4": <IPv4_ADDRESS>, "addr-v6": <IPv6_ADDRESS>, "networks": { <IPvX_NETWORK>: {}, <IPvX_NETWORK>: { "retracted": true } }, "routing-data": { <POLICY>: { <NODE_ID>: { "path": <PATH> } } }, "node-data": { <NODE_ID>: { "networks": { <IPvX_NETWORK>: {}, <IPvX_NETWORK>: { "retracted": true } } } }, "link-attributes": { <LINK_ATTRIBUTE_ID>: { <LINK_ATTRIBUTE>: <METRIC> } }, "request-full": union(true, [<NODE_ID>, ...]), "reflect": { <REFLECT_DATA>: <DATA> } "reflected": { <NODE_ID>: { <REFLECT_DATA>: <DATA> } } }
path = node [node-id ">[" link-attribute-id "]>" path] node-id = *ALPHA link-attribute-id = *DIGIT
Key and value description:
Each message has a type. This RFC describes two types, namely full and partial, which are described in further detail here.
A full update SHOULD replace all data from the sending node in the receivers Message Information Base. It MUST NOT require any previous knowledge of the sender by the receiver. The following keys are specified:
A partial update only replaces the changed data in the receivers message information base. It therefore has the additional field "partial-base" which describes the sequence number of the base message, which MUST be a full update message, to which the changes apply. Note: A partial update only describes changes to a previous full update, never to a previous partial update. If the receiving node is unable to apply the partial update, e.g because it lacks the base message, then this node SHOULD use the "request-full" procedure to request a new full update (seeSection 5.3).
The following keys are specified:
When a node is not able to apply a partial update or just joined a network, it SHOULD send out a request for a full update using the request-full key in the message. This key may be an array containing NODE_IDs from which a full message is needed or may be the Boolean value true, to indicate that a full message from every neighbor is required. When a node receives a request-full key in a message that either has the value true or its ID present in the array, it MUST do one of the following: variant1: schedule the next message it sents to be a full message. variant2: send a full update immediately and reset its message interval timer, except when the last message already was a out-of-band full message, in which case it MUST/SHOULD schedule the next message according to the message interval timer to be a full message.
Reflections are a extensible mechanism and allow a node to exchange data with neighboring nodes, with 2-hop neighbors and with itself. When a node includes arbitrary JSON data in the reflect key in its message, each node receiving this message MUST send this data in the reflected key of its messages under the corresponding NODE_ID. A node MUST support reflecting all requests but is not required to actually parse that data. Because the messages are sent to all neighbors, every 2-hop neighbor becomes aware of the reflected data of a node. This fact is not used in this RFC but may be used in extensions of this protocol.
How asymmetric link detection should work/how it can be implemented
When/how to switch the mode
The original specification of xml2rfc format is in RFC 2629.
This template was derived from an initial version written by Pekka Savola and contributed by him to the xml2rfc project.
This memo includes no request to IANA.
All drafts are required to have an IANA considerations section (see Guidelines for Writing an IANA Considerations Section in RFCs for a guide). If the draft does not require IANA to do anything, the section contains an explicit statement that this is the case (as above). If there are no requirements for IANA, the section will be removed during conversion into an RFC by the RFC Editor.
All drafts are required to have a security considerations section. See RFC 3552 for a guide. Due to the Multicast Transmission, the routing message's payload data is unencrypted.
[min_ref] | authSurName, authInitials., "Minimal Reference", 2006. |
[RFC2119] | Bradner, S., "Key words for use in RFCs to Indicate Requirement Levels", BCP 14, RFC 2119, DOI 10.17487/RFC2119, March 1997. |
[RFC7493] | Bray, T., "The I-JSON Message Format", RFC 7493, DOI 10.17487/RFC7493, March 2015. |
[RFC2629] | Rose, M., "Writing I-Ds and RFCs using XML", RFC 2629, DOI 10.17487/RFC2629, June 1999. |
[RFC3552] | Rescorla, E. and B. Korver, "Guidelines for Writing RFC Text on Security Considerations", BCP 72, RFC 3552, DOI 10.17487/RFC3552, July 2003. |
[RFC5226] | Narten, T. and H. Alvestrand, "Guidelines for Writing an IANA Considerations Section in RFCs", RFC 5226, DOI 10.17487/RFC5226, May 2008. |
many exchangeable policies
use path with lowest overall loss. Overall loss is the accumulation of the loss rates of all hops to a destination. Required link attributes:
use path with highest bandwidth In multihop topologies, the overall bandwith is the minimum of the bandwidths between the hops to a destination. Required link attributes:
The different magic values and what they affect, how they could/should be set
Examples