Internet DRAFT - draft-kuehlewind-tcpm-ecn-fallback
draft-kuehlewind-tcpm-ecn-fallback
TCPM Working Group M. Kuehlewind
Internet-Draft University of Stuttgart
Intended status: Experimental B. Trammell
Expires: March 14, 2014 ETH Zurich
September 10, 2013
A Mechanism for ECN Path Probing and Fallback
draft-kuehlewind-tcpm-ecn-fallback-01.txt
Abstract
Explicit Congestion Notification (ECN) is a TCP/IP extension that is
widely implemented but hardly used due to the perceived unusablilty
of ECN on many paths through the Internet caused by ECN-ignorant
routers and middleboxes. This document specifies an ECN probing and
fall-back mechanism in case ECN has be successfully negotiated
between two connection endpoints, but might not be usable on the
path.
Status of This Memo
This Internet-Draft is submitted in full conformance with the
provisions of BCP 78 and BCP 79.
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF). Note that other groups may also distribute
working documents as Internet-Drafts. The list of current Internet-
Drafts is at http://datatracker.ietf.org/drafts/current/.
Internet-Drafts are draft documents valid for a maximum of six months
and may be updated, replaced, or obsoleted by other documents at any
time. It is inappropriate to use Internet-Drafts as reference
material or to cite them other than as "work in progress."
This Internet-Draft will expire on March 14, 2014.
Copyright Notice
Copyright (c) 2013 IETF Trust and the persons identified as the
document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal
Provisions Relating to IETF Documents
(http://trustee.ietf.org/license-info) in effect on the date of
publication of this document. Please review these documents
carefully, as they describe your rights and restrictions with respect
to this document. Code Components extracted from this document must
Kuehlewind & Trammell Expires March 14, 2014 [Page 1]
Internet-Draft ECN Fallback September 2013
include Simplified BSD License text as described in Section 4.e of
the Trust Legal Provisions and are provided without warranty as
described in the Simplified BSD License.
1. Introduction
The deployment of Explicit Congestion Notification (ECN) [RFC3168]
and AQM would arguably improve end-to-end performance in the
Internet, by providing a congestion signal to the transport layer
without relying on queue tail drop and packet loss. However, though
ECN has been standardized since 2000, implementation and deployment
have lagged significantly, in part due to the perceived unusablilty
of ECN on many paths through the Internet caused by ECN-ignorant
routers and middleboxes.
Recent research by the authors [KuNeTr13] has shown accelerating
deployment of ECN-capable servers in the Internet, due to the
deployment of TCP stacks for which ECN is enabled by default. In
addition, ECN is usable end-to-end on the vast majority of paths
measured in this study: that is, a Congestion Experienced mark will
cause a ECN Echo on the associated ACK. However, there still exist a
non-negligible number of paths on which a successfully negotiated
usage of ECN will not result in a connection on which congestion will
be correctly echoed, or worse, leads to the loss of packets with CE
or ECE set.
This document presents an experimental, in-band, runtime method for
determining the usability of ECN by a given traffic flow, based on
the active measurement method described in [KuNeTr13]. If ECN is
successfully negotiated but found by this method to be unusable, it
can be disabled on subsequent packets in the flow in order to avoid
connectivity problems caused by ECN-unusability on the path.
2. ECN Path Capability Probing
A TCP sender can determine whether or not its path to the receiver is
usable for ECN using the procedure detailed below.
1. The sender attempts to negotiate ECN usage as per Section 6.1.1
of [RFC3168]. If ECN is not successfully negotiated, the
procedure ends, and ECN is not used for the duration of the
connection.
2. The sender disables the normal usage of ECN for the duration of
the procedure, as the ECN codepoints are used for path probing.
This means all segments are sent with the Non-ECN-Capable
codepoint during this procedure unless otherwise stated.
Moreover, the sender will only take loss as a congestion signal
Kuehlewind & Trammell Expires March 14, 2014 [Page 2]
Internet-Draft ECN Fallback September 2013
and will not react with window reductions to the ECN-Echo (ECE)
feedback signal from the receiver during this procedure.
3. The sender sets the Non-ECN-Capable codepoint in the IP header
until it has completed sending the first N data segments, where N
is the size of the initial congestion window. Loss is used to
discover congestion for these segments.
4. The next three data segments sent consist of the "CE probe":
these three segments are sent with the Congestion Experienced
codepoint set.
5. If all three of the CE probe segments are lost and must be
retransmitted, the path is deemed not ECN-usable and the sender
falls back as in Section 3.
6. If the ECE flag is not set on the ACK segment(s) sent by the
receiver acknowledging the CE probe segments, the path may or may
not be usable, as that there might be middleboxes/gateways that
clear CE on segments from end hosts, because they assume that
congestion can not have occurred up to this point on the path.
In this case, the sender may continue using ECN, because while it
may not work for detecting congestion, the use of ECN does not
negatively affect connectivity.
7. While the sender does not reduce the congestion window for the
ECE ACKS acknowledging the the CE probe segments, it does set CWR
on the subsequent segment sent.
8. If no fallback has occurred by the time the ACK of the final CE
probe segment is received, the path is deemed ECN usable, and the
sender ends the probing procedure and proceeds to use ECN
normally as in [RFC3168].
The operation of this procedure on an ECN-usable path, with an
initial window size of 3, bulk data transfer initiated by the sender,
where the receiver does not perform probing, is shown in Figure 1.
Kuehlewind & Trammell Expires March 14, 2014 [Page 3]
Internet-Draft ECN Fallback September 2013
Sender Receiver
| |
|----SYN ECE CWR----------->| connection
|<-----------SYN ACK ECE----| establishment
|----ACK------------------->|
| |
|----data 0---------------->| initial window
|<------------ECT0 ACK 1----| transmission
|----data 1---------------->|
|----data 2---------------->|
|<------------ECT0 ACK 3----|
| |
|----data 3 CE------------->| CE probe
|----data 4 CE------------->|
|----data 5 CE------------->|
|----data 6 --------------->|
|<--------ECT0 ECE ACK 7----| CE probe ACK: ECN OK
| |
|----data 7 ECT0 CWR------>| CE probe CWR
|----data 8 ECT0---------->| and transition to
|----data 9 ECT0---------->| normal ECN usage
Figure 1: CE probing on ECN-usable path
Conversely, the operation of the probe and fallback procedure on an
ECN-unusable path with an initial window size of 3, bulk data
transfer initiated by the sender, where the receiver does not perform
probing, is shown in Figure 2.
Kuehlewind & Trammell Expires March 14, 2014 [Page 4]
Internet-Draft ECN Fallback September 2013
Sender Receiver
| |
|----SYN ECE CWR----------->| connection
|<-----------SYN ACK ECE----| establishment
|----ACK------------------->|
| |
|----data 0---------------->| initial window
|<-----------------ACK 1----| transmission
|----data 1---------------->|
|----data 2---------------->|
|<-----------------ACK 3----|
| |
|----data 3 CE----X | CE probe
|----data 4 CE----X | (blocked)
|----data 5 CE----X |
|----data 6 --------------->|
|<-----------------ACK 3----| dupack
|----data 7 --------------->|
|----data 8---------------->|
|<-----------------ACK 3----| dupack
| |
|----data 3---------------->| retransmit without CE
|<-----------------ACK 4----|
|----data 4---------------->|
|<-----------------ACK 5----|
|----data 5---------------->|
|<-----------------ACK 9----| and fallback (no ECN)
|----data 9---------------->|
Figure 2: Fallback on ECN-unusable path
As the probing begins after all the segments in the initial
congestion window have been sent, it requires more than an initial
congestion window plus 6 segments (3 CE probe + 3 duplicated ACKs) of
available data to send. TCP implementations can use socket send
buffer occupancy as a signal as to whether sufficient data is
available to use the probing procedure. If enough data is already in
the buffer by the time the initial congestion window is sent, then
the probe segments SHOULD be sent. Otherwise, the implementation can
use additional heuristics, outside the scope of this document to
define, to determine if significant data is likely to be available.
3. ECN Fallback
If ECN is found to be unusable on a given flow by path capability
probing as in Section 2 above, the sender simply stops setting any
ECN-Capable-Transport codepoint on subsequent packets in the flow.
The receiver MUST, however, still set ECE on any ACK for a packet
Kuehlewind & Trammell Expires March 14, 2014 [Page 5]
Internet-Draft ECN Fallback September 2013
with CE set. Note that this behavior is consistent with section
6.1.1 of [RFC3168].
A sender may keep a cache of paths found to be unusable for ECN and
disable ECN for subsequent connections on a per-destination basis.
In this case, the reciever should periodically (i.e., on the order of
hours or days) expire these cache entries to cause re-probing to
occur in order to account for routing changes in the network.
[EDITOR'S NOTE: what to do on RTO?]
4. Discussion
[EDITOR'S NOTE: the general case of this algorithm is J ECT segments
followed by K CE segments; we chose (J,K) = (0,3). We should justify
this choice.]
[EDITOR'S NOTE: need to think about how this would interact with
conex; an analysis comparing the delay caused by path probing as
opposed to the delay caused by ECN failure would be interesting.]
[EDITOR'S NOTE: initial implementation results go here?.]
5. Security Considerations
[FIXME: we'll have to explore attacks against this mechanism which
could affect network or connection stability, so the following is
wrong...]
This document has no security considerations.
6. IANA Considerations
This document has no IANA considerations.
7. Acknowledgements
Thanks to Michael Welzl and Richard Scheffenegger for the comments
and discussions. This work is partially materially supported by the
European Commission under grant agreement FP7-ICT-318627 mPlane.
8. References
Kuehlewind & Trammell Expires March 14, 2014 [Page 6]
Internet-Draft ECN Fallback September 2013
8.1. Normative References
[RFC3168] Ramakrishnan, K., Floyd, S., and D. Black, "The Addition
of Explicit Congestion Notification (ECN) to IP", RFC
3168, September 2001.
8.2. Informative References
[KuNeTr13]
Kuehlewind, M., Neuner, S., and B. Trammell, "On the state
of ECN and TCP Options on the Internet", Mar 2013.
(In LNCS 7799, Proceedings of PAM 2013, Hong Kong)
Authors' Addresses
Mirja Kuehlewind
University of Stuttgart
Pfaffenwaldring 47
70569 Stuttgart
Germany
Email: mirja.kuehlewind@ikr.uni-stuttgart.de
Brian Trammell
Swiss Federal Institute of Technology Zurich
Gloriastrasse 35
8092 Zurich
Switzerland
Phone: +41 44 632 70 13
Email: trammell@tik.ee.ethz.ch
Kuehlewind & Trammell Expires March 14, 2014 [Page 7]