Internet DRAFT - draft-bonica-intarea-gre-mtu
draft-bonica-intarea-gre-mtu
Intarea Working Group R. Bonica
Internet-Draft Juniper Networks
Intended status: Informational C. Pignataro
Expires: February 13, 2015 Cisco Systems
J. Touch
USC/ISI
August 12, 2014
A Widely-Deployed Solution To The Generic Routing Encapsulation (GRE)
Fragmentation Problem
draft-bonica-intarea-gre-mtu-06
Abstract
This memo describes how many vendors have solved the Generic Routing
Encapsulation (GRE) fragmentation problem. The solution described
herein is configurable. It is widely deployed on the Internet in its
default configuration.
Status of This Memo
This Internet-Draft is submitted in full conformance with the
provisions of BCP 78 and BCP 79.
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF). Note that other groups may also distribute
working documents as Internet-Drafts. The list of current Internet-
Drafts is at http://datatracker.ietf.org/drafts/current/.
Internet-Drafts are draft documents valid for a maximum of six months
and may be updated, replaced, or obsoleted by other documents at any
time. It is inappropriate to use Internet-Drafts as reference
material or to cite them other than as "work in progress."
This Internet-Draft will expire on February 13, 2015.
Copyright Notice
Copyright (c) 2014 IETF Trust and the persons identified as the
document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal
Provisions Relating to IETF Documents
(http://trustee.ietf.org/license-info) in effect on the date of
publication of this document. Please review these documents
carefully, as they describe your rights and restrictions with respect
to this document. Code Components extracted from this document must
Bonica, et al. Expires February 13, 2015 [Page 1]
Internet-Draft GRE Fragmentation August 2014
include Simplified BSD License text as described in Section 4.e of
the Trust Legal Provisions and are provided without warranty as
described in the Simplified BSD License.
Table of Contents
1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . 2
1.1. Terminology . . . . . . . . . . . . . . . . . . . . . . . 3
2. Solutions . . . . . . . . . . . . . . . . . . . . . . . . . . 4
2.1. RFC 4459 Solutions . . . . . . . . . . . . . . . . . . . 4
2.2. A Widely-Deployed Solution . . . . . . . . . . . . . . . 4
3. Implementation Details . . . . . . . . . . . . . . . . . . . 5
3.1. General . . . . . . . . . . . . . . . . . . . . . . . . . 5
3.2. GRE MTU (GMTU) Estimation and Discovery . . . . . . . . . 5
3.3. GRE Ingress Node Procedures . . . . . . . . . . . . . . . 6
3.3.1. Procedures Affecting the GRE Payload . . . . . . . . 6
3.3.2. Procedures Affecting The GRE Deliver Header . . . . . 7
3.4. GRE Egress Node Procedures . . . . . . . . . . . . . . . 7
4. IANA Considerations . . . . . . . . . . . . . . . . . . . . . 8
5. Security Considerations . . . . . . . . . . . . . . . . . . . 8
6. Acknowledgements . . . . . . . . . . . . . . . . . . . . . . 8
7. References . . . . . . . . . . . . . . . . . . . . . . . . . 9
7.1. Normative References . . . . . . . . . . . . . . . . . . 9
7.2. Informative References . . . . . . . . . . . . . . . . . 9
Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . 9
1. Introduction
Generic Routing Encapsulation (GRE) [RFC2784] [RFC2890] can be used
to carry any network layer protocol over any network layer protocol.
GRE has been implemented by many vendors and is widely deployed in
the Internet.
The GRE specification does not describe fragmentation procedures.
Lacking guidance from the specification, vendors have developed
implementation-specific fragmentation solutions. A GRE tunnel will
operate correctly only if its ingress and egress nodes support
compatible fragmentation solutions. [RFC4459] describes several
fragmentation solutions and evaluates their relative merits.
This memo reviews the fragmentation solutions presented in [RFC4459].
It also describes how many vendors have solved the GRE fragmentation
problem. The solution described herein is configurable, and has been
widely deployed in its default configuration.
This memo addresses point-to-point unicast GRE tunnels that carry
IPv4, IPv6 or MPLS payloads. All other tunnel types are beyond the
scope of this document.
Bonica, et al. Expires February 13, 2015 [Page 2]
Internet-Draft GRE Fragmentation August 2014
1.1. Terminology
The following terms are specific to GRE and are taken from [RFC2784]:
o GRE delivery header - an IPv4 or IPv6 header whose source address
represents the GRE ingress node and whose destination address
represents the GRE egress node. The GRE delivery header
encapsulates a GRE header.
o GRE header - the GRE protocol header. The GRE header is
encapsulated in the GRE delivery header and encapsulates GRE
payload.
o GRE payload - a network layer packet that is encapsulated by the
GRE header. The GRE payload can be IPv4, IPv6 or MPLS.
Procedures for encapsulating IPv4 and IPv6 in GRE are described in
[RFC2784] and [RFC2890]. Procedures for encapsulating MPLS in GRE
are described in [RFC4023]. While other protocols may be
delivered over GRE, they are beyond the scope of this document.
o GRE delivery packet - A packet containing a GRE delivery header, a
GRE header, and GRE payload.
o GRE payload header - the IPv4, IPv6 or MPLS header of the GRE
payload
o GRE overhead - the combined size of the GRE delivery header and
the GRE header, measured in octets
The following terms are specific MTU discovery:
o link MTU (LMTU) - the maximum transmission unit, i.e., maximum
packet size in octets, that can be conveyed over a link. LMTU is
a unidirectional metric. A bidirectional link may be
characterized by one LMTU in the forward direction and another MTU
in the reverse direction.
o path MTU (PMTU) - the minimum LMTU of all the links in a path
between a source node and a destination node. If the source and
destination node are connected through an equal cost multipath
(ECMP), the PMTU is equal to the minimum LMTU of all links
contributing to the multipath.
o GRE MTU (GMTU) - the maximum transmission unit, i.e., maximum
packet size in octets, that can be conveyed over a GRE tunnel
without fragmentation of any kind. The GMTU is equal to the PMTU
associated with the path between the GRE ingress and the GRE
egress, minus the GRE overhead
Bonica, et al. Expires February 13, 2015 [Page 3]
Internet-Draft GRE Fragmentation August 2014
o Path MTU Discovery (PMTUD) - A procedure for dynamically
discovering the PMTU between two nodes on the Internet. PMTUD
procedures for IPv4 are defined in [RFC1191]. PMTUD procedures
for IPv6 are defined in [RFC1981].
The following terms are introduced by this memo:
o fragmentable packet - An IPv4 packet with DF-bit equal to 0 and
whose payload is larger than 64 bytes
o ICMP Packet Too Big (PTB) message - an ICMPv4 [RFC0792]
Destination Unreachable message with code equal to 4
(fragmentation needed and DF set) or an ICMPv6 [RFC4443] Packet
Too Big message
2. Solutions
2.1. RFC 4459 Solutions
Section 3 of [RFC4459] identifies several tunnel fragmentation
solutions. These solutions define procedures to be invoked when the
tunnel ingress router receives a packet so large that it cannot be
forwarded though the tunnel without fragmentation of any kind. When
applied to GRE, these procedures are:
1. Discard the incoming packet and send an ICMP PTB message to the
incoming packet's source.
2. Fragment the incoming packet and encapsulate each fragment within
a complete GRE header and GRE delivery header.
3. Encapsulate the incoming packet in a single GRE header and GRE
delivery header. Perform source fragmentation on the resulting
GRE delivery packet.
As per RFC 4459, Strategy 2) is applicable only when the incoming
packet is fragmentable. Also as per RFC 4459, each strategy has its
relative merits and costs.
2.2. A Widely-Deployed Solution
Many vendors have implemented a configurable GRE fragmentation
solution. In its default configuration, the solution behaves as
follows:
o When the GRE ingress node receives a fragmentable packet with
length greater than the GMTU, it fragments the incoming packet and
Bonica, et al. Expires February 13, 2015 [Page 4]
Internet-Draft GRE Fragmentation August 2014
encapsulates each fragment within a complete GRE header and GRE
delivery header.
o When the GRE ingress node receives a non-fragmentable packet with
length greater than the GMTU, it discards the packet and send an
ICMP PTB message to the packet's source.
o When the GRE egress node receives a GRE delivery packet fragment,
it silently discards the fragment, without attempting to
reassemble the GRE delivery packet to which the fragment belongs.
In non-default configurations, the GRE ingress node can execute any
of the procedures defined in RFC 4459.
The solution described above is widely-deployed on the Internet in
its default configuration.
3. Implementation Details
This section describes how many vendors have implemented the solution
described in Section 2.2.
3.1. General
The GRE ingress nodes satisfy all of the requirements stated in
[RFC2784].
3.2. GRE MTU (GMTU) Estimation and Discovery
GRE ingress nodes support a configuration option that associates a
GMTU with a GRE tunnel. By default, GMTU is equal to the MTU
associated with next-hop toward the GRE egress node minus the GRE
overhead.
Typically, GRE ingress nodes further refine their GMTU estimate by
executing PMTUD procedures. However, if an implementation supports
PMTUD for GRE tunnels, it also includes a configuration option that
disables PMTUD. This configuration option is required to mitigate
certain denial of service attacks (see Section 5).
The ingress node's GMTU estimate will not always reflect the actual
GMTU. It is only an estimate. When a tunnel's GMTU changes, the
tunnel ingress node will not discover that change immediately.
Likewise, if the ingress node performs PMTUD procedures and tunnel
interior nodes cannot deliver ICMP feedback to the tunnel ingress,
GMTU estimates may be inaccurate.
Bonica, et al. Expires February 13, 2015 [Page 5]
Internet-Draft GRE Fragmentation August 2014
3.3. GRE Ingress Node Procedures
This section defines procedures that GRE ingress nodes execute when
they receive a packet whose size is greater than the relevant GMTU.
3.3.1. Procedures Affecting the GRE Payload
3.3.1.1. IPv4 Payloads
By default, if the payload is fragmentable, the GRE ingress node
fragments the incoming packet and encapsulates each fragment within a
complete GRE header and GRE delivery header. Therefore, the GRE
egress node receives several complete, non-fragmented delivery
packets. Each delivery packet contains a fragment of the GRE
payload. The GRE egress node forwards the payload fragments to their
ultimate destination where they are reassembled.
Also by default, if the payload is not fragmentable, the GRE ingress
node discards the packet and sends an ICMPv4 Destination Unreachable
message to the packet's source. The ICMPv4 Destination Unreachable
message code equals 4 (fragmentation needed and DF set). The ICMPv4
Destination Unreachable message also contains an Next-hop MTU (as
specified by [RFC1191]) and the next-hop MTU is equal to the GMTU
associated with the tunnel.
The GRE ingress node supports a non-default configuration option that
invokes an alternative behavior. If that option is configured, the
GRE ingress node fragments the delivery header. See Section 3.3.2
for details.
3.3.1.2. IPv6 Payloads
By default, the GRE ingress node discards the packet and send an
ICMPv6 [RFC4443] Packet Too Big message to the payload source. The
MTU specified in the Packet Too Big message is equal to the GMTU
associated with the tunnel.
The GRE ingress node supports a non-default configuration option that
invokes an alternative behavior. If that option is configured, the
GRE ingress node fragments the delivery header.See Section 3.3.2 for
details.
3.3.1.3. MPLS Payloads
By default, the GRE ingress node discards the packet. As it is
impossible to reliably identify the payload source, the GRE ingress
node does not attempt to send an ICMP PTB message to the payload
source.
Bonica, et al. Expires February 13, 2015 [Page 6]
Internet-Draft GRE Fragmentation August 2014
The GRE ingress node supports a non-default configuration option that
invokes an alternative behavior. If that option is configured, the
GRE ingress node fragments the delivery header. See Section 3.3.2.
3.3.2. Procedures Affecting The GRE Deliver Header
3.3.2.1. Tunneling GRE Over IPv4
By default, the GRE ingress node does not fragment delivery packets.
However, the GRE ingress node includes a configuration option that
allows delivery packet fragmentation.
By default, the GRE ingress node sets the DF-bit in the delivery
header to 1 (Don't Fragment). However, the GRE ingress node also
supports a configuration option that invokes the following behavior:
o when the GRE payload is IPv6, the DF-bit on the delivery header is
set to 0 (Fragments Allowed)
o when the GRE payload is IPv4, the DF-bit is copied from the
payload header to the delivery header
When the DF-bit on an IPv4 delivery header is set to 0, the GRE
delivery packet can be fragmented by any node between the GRE ingress
and the GRE egress.
If the delivery packet is fragmented, it is reassembled by the GRE
egress.
3.3.2.2. Tunneling GRE Over IPv6
By default, the GRE ingress node does not fragment delivery packets.
However, the GRE ingress node includes a configuration option that
allows this.
If the delivery packet is fragmented, it is reassembled by the GRE
egress.
3.4. GRE Egress Node Procedures
By default, the GRE egress node silently discards GRE delivery packet
fragments, without attempting to reassemble the GRE delivery packets
to which the fragments belongs.
However, the GRE egress node supports a configuration option that
allows it to reassemble GRE delivery packets.
Bonica, et al. Expires February 13, 2015 [Page 7]
Internet-Draft GRE Fragmentation August 2014
4. IANA Considerations
This document makes no request of IANA.
5. Security Considerations
In the GRE fragmentation solution described above, either the GRE
payload or the GRE delivery packet can be fragmented. If the GRE
payload is fragmented, it is typically reassembled at its ultimate
destination. If the GRE delivery packet is fragmented, it is
typically reassembled at the GRE egress node.
The packet reassembly process is resource intensive and vulnerable to
several denial of service attacks. In the simplest attack, the
attacker sends fragmented packets more quickly than the victim can
reassemble them. In a variation on that attack, the first fragment
of each packet is missing, so that no packet can ever be reassembled.
Given that the packet reassembly process is resource intensive and
vulnerable to denial of service attacks, operators should decide
where reassembly process is best performed. Having made that
decision, they should decide whether to fragment the GRE payload or
GRE delivery packet, accordingly.
PMTU Discovery is vulnerable to two denial of service attacks (see
Section 8 of [RFC1191] for details). Both attacks are based upon on
a malicious party sending forged ICMPv4 Destination Unreachable or
ICMPv6 Packet Too Big messages to a host. In the first attack, the
forged message indicates an inordinately small PMTU. In the second
attack, the forged message indicates an inordinately large MTU. In
both cases, throughput is adversely affected. On order to mitigate
such attacks, GRE implementations includes a configuration option to
disable PMTU discovery on GRE tunnels. Also, they can include a
configuration option that conditions the behavior of PMTUD to
establish a minimum PMTU.
6. Acknowledgements
The authors would like to thank Fred Baker, Fred Detienne, Jagadish
Grandhi, Jeff Haas, Vanitha Neelamegam, John Scudder, Mike
Sullenberger and Wen Zhang for their constructive comments. The
authors also express their gratitude to Vanessa Ameen, without whom
this memo could not have been written.
Bonica, et al. Expires February 13, 2015 [Page 8]
Internet-Draft GRE Fragmentation August 2014
7. References
7.1. Normative References
[RFC0792] Postel, J., "Internet Control Message Protocol", STD 5,
RFC 792, September 1981.
[RFC1191] Mogul, J. and S. Deering, "Path MTU discovery", RFC 1191,
November 1990.
[RFC1981] McCann, J., Deering, S., and J. Mogul, "Path MTU Discovery
for IP version 6", RFC 1981, August 1996.
[RFC2784] Farinacci, D., Li, T., Hanks, S., Meyer, D., and P.
Traina, "Generic Routing Encapsulation (GRE)", RFC 2784,
March 2000.
[RFC2890] Dommety, G., "Key and Sequence Number Extensions to GRE",
RFC 2890, September 2000.
[RFC4023] Worster, T., Rekhter, Y., and E. Rosen, "Encapsulating
MPLS in IP or Generic Routing Encapsulation (GRE)", RFC
4023, March 2005.
[RFC4443] Conta, A., Deering, S., and M. Gupta, "Internet Control
Message Protocol (ICMPv6) for the Internet Protocol
Version 6 (IPv6) Specification", RFC 4443, March 2006.
7.2. Informative References
[RFC4459] Savola, P., "MTU and Fragmentation Issues with In-the-
Network Tunneling", RFC 4459, April 2006.
Authors' Addresses
Ron Bonica
Juniper Networks
2251 Corporate Park Drive Herndon
Herndon, Virginia 20170
USA
Email: rbonica@juniper.net
Bonica, et al. Expires February 13, 2015 [Page 9]
Internet-Draft GRE Fragmentation August 2014
Carlos Pignataro
Cisco Systems
7200-12 Kit Creek Road
Research Triangle Park, North Carolina 27709
USA
Email: cpignata@cisco.com
Joe Touch
USC/ISI
4676 Admiralty Way
Marina del Rey, California 90292-6695
USA
Phone: +1 (310) 448-9151
Email: touch@isi.edu
URI: http://www.isi.edu/touch
Bonica, et al. Expires February 13, 2015 [Page 10]