Internet DRAFT - draft-kerwin-http2-encoded-data
draft-kerwin-http2-encoded-data
Network Working Group M. Kerwin
Internet-Draft October 11, 2016
Intended status: Experimental
Expires: April 14, 2017
HTTP/2 Gzipped Data
draft-kerwin-http2-encoded-data-09
Abstract
This document introduces a new frame type for transporting gzip-
encoded data between peers in the Hypertext Transfer Protocol Version
2 (HTTP/2), and an associated error code for handling invalid
encoding.
Note to Readers
The issues list for this draft can be found at
<https://github.com/phluid61/internet-drafts/labels/
HTTP%2F2%20Gzipped%20Data>
The most recent (often unpublished) draft is at
<http://phluid61.github.io/internet-drafts/http2-encoded-data/>
Status of This Memo
This Internet-Draft is submitted in full conformance with the
provisions of BCP 78 and BCP 79.
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF). Note that other groups may also distribute
working documents as Internet-Drafts. The list of current Internet-
Drafts is at http://datatracker.ietf.org/drafts/current/.
Internet-Drafts are draft documents valid for a maximum of six months
and may be updated, replaced, or obsoleted by other documents at any
time. It is inappropriate to use Internet-Drafts as reference
material or to cite them other than as "work in progress."
This Internet-Draft will expire on April 14, 2017.
Copyright Notice
Copyright (c) 2016 IETF Trust and the persons identified as the
document authors. All rights reserved.
Kerwin Expires April 14, 2017 [Page 1]
Internet-Draft http2-encoded-data October 2016
This document is subject to BCP 78 and the IETF Trust's Legal
Provisions Relating to IETF Documents
(http://trustee.ietf.org/license-info) in effect on the date of
publication of this document. Please review these documents
carefully, as they describe your rights and restrictions with respect
to this document. Code Components extracted from this document must
include Simplified BSD License text as described in Section 4.e of
the Trust Legal Provisions and are provided without warranty as
described in the Simplified BSD License.
Table of Contents
1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . 2
1.1. Notational Conventions . . . . . . . . . . . . . . . . . 2
2. Additions to HTTP/2 . . . . . . . . . . . . . . . . . . . . . 3
2.1. SETTINGS_ACCEPT_GZIPPED_DATA . . . . . . . . . . . . . . 3
2.2. GZIPPED_DATA . . . . . . . . . . . . . . . . . . . . . . 3
2.3. DATA_ENCODING_ERROR . . . . . . . . . . . . . . . . . . . 5
3. Experimental Status . . . . . . . . . . . . . . . . . . . . . 5
4. Security Considerations . . . . . . . . . . . . . . . . . . . 6
5. IANA Considerations . . . . . . . . . . . . . . . . . . . . . 6
5.1. HTTP/2 Frame Type Registry Update . . . . . . . . . . . . 6
5.2. HTTP/2 Settings Registry Update . . . . . . . . . . . . . 7
5.3. HTTP/2 Error Code Registry Update . . . . . . . . . . . . 7
6. Acknowledgements . . . . . . . . . . . . . . . . . . . . . . 7
7. References . . . . . . . . . . . . . . . . . . . . . . . . . 7
7.1. Normative References . . . . . . . . . . . . . . . . . . 7
7.2. Informative References . . . . . . . . . . . . . . . . . 8
Appendix A. Changelog . . . . . . . . . . . . . . . . . . . . . 8
Author's Address . . . . . . . . . . . . . . . . . . . . . . . . 9
1. Introduction
This document introduces a mechanism for applying gzip encoding
[RFC1952] to data transported between two endpoints in the Hypertext
Transfer Protocol Version 2 (HTTP/2) [RFC7540], analogous to
Transfer-Encoding in HTTP/1.1 [RFC7230].
1.1. Notational Conventions
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
"SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
document are to be interpreted as described in [RFC2119].
Kerwin Expires April 14, 2017 [Page 2]
Internet-Draft http2-encoded-data October 2016
2. Additions to HTTP/2
This document introduces a new HTTP/2 frame type ([RFC7540],
Section 11.2), a new HTTP/2 setting ([RFC7540], Section 11.3), and a
new HTTP/2 error code ([RFC7540], Section 7), to allow the
compression of data.
Note that while compressing some or all data in a stream might affect
the total length of the corresponding HTTP message body, the
"content-length" header, if present, should continue to reflect the
total length of the _uncompressed_ data. This is particularly
relevant when detecting malformed messages ([RFC7540],
Section 8.1.2.6).
2.1. SETTINGS_ACCEPT_GZIPPED_DATA
*NOTE*: This is an experimental value; if standardised, a permanent
value will be assigned.
SETTINGS_ACCEPT_GZIPPED_DATA (0xf000) is used to indicate the
sender's ability and willingness to receive GZIPPED_DATA frames. An
endpoint MUST NOT send a GZIPPED_DATA frame unless it receives this
setting with a value of 1.
The initial value is 0, which indicates that GZIPPED_DATA frames are
not supported. Any value other than 0 or 1 MUST be treated as a
connection error ([RFC7540], Section 5.4.1) of type PROTOCOL_ERROR.
An endpoint may advertise support for GZIPPED_DATA frames and later
decide that it no longer supports them. After sending an
ACCEPT_GZIPPED_DATA setting with the value 0, the endpoint SHOULD
continue to accept GZIPPED_DATA frames for a reasonable amount of
time to account for frames that may already be in flight.
2.2. GZIPPED_DATA
*NOTE*: This is an experimental value; if standardised, a permanent
value will be assigned.
GZIPPED_DATA frames (type code=0xf000) are semantically identical to
DATA frames ([RFC7540], Section 6.1), but their payload is encoded
using gzip compression. Significantly: the order of DATA and
GZIPPED_DATA frames is semantically significant; and GZIPPED_DATA
frames are subject to flow control ([RFC7540], Section 5.2). Gzip
compression is an LZ77 coding with a 32 bit CRC that is commonly
produced by the gzip file compression program [RFC1952].
Kerwin Expires April 14, 2017 [Page 3]
Internet-Draft http2-encoded-data October 2016
Any compression or decompression context for a GZIPPED_DATA frame is
unique to that frame. An endpoint MAY interleave DATA and
GZIPPED_DATA frames on a single stream.
+---------------+
|Pad Length? (8)|
+---------------+-----------------------------------------------+
| Data (*) ...
+---------------------------------------------------------------+
| Padding (*) ...
+---------------------------------------------------------------+
GZIPPED_DATA Frame Payload
The GZIPPED_DATA frame contains the following fields:
o Pad Length: An 8-bit field containing the length of the frame
padding in units of octets. This field is optional and is only
present if the PADDED flag is set.
o Data: Encoded application data. The amount of encoded data is the
remainder of the frame payload after subtracting the length of the
other fields that are present.
o Padding: Padding octets that contain no application semantic
value. Padding octets MUST be set to zero when sending and
ignored when receiving.
The GZIPPED_DATA frame defines the following flags:
o "END_STREAM" (0x1): Bit 1 being set indicates that this frame is
the last that the endpoint will send for the identified stream.
Setting this flag causes the stream to enter one of the "half
closed" states or the "closed" state ([RFC7540], Section 5.1).
o "PADDED" (0x8): Bit 4 being set indicates that the Pad Length
field is present.
A GZIPPED_DATA frame MUST NOT be sent if the ACCEPT_GZIPPED_DATA
setting of the peer is set to 0. See Section 3.
An intermediary, on receiving a GZIPPED_DATA frame, MAY decode the
data and forward it to its downstream peer in one or more DATA
frames. If the downstream peer has not advertised support for
GZIPPED_DATA frames (by sending an ACCEPT_GZIPPED_DATA setting with
the value 1) the intermediary MUST decode the data before forwarding
it.
Kerwin Expires April 14, 2017 [Page 4]
Internet-Draft http2-encoded-data October 2016
If an endpoint detects that the payload of a GZIPPED_DATA frame is
not encoded correctly, for example with an incorrect checksum, the
endpoint MUST treat this as a stream error (see [RFC7540],
Section 5.4.2) of type DATA_ENCODING_ERROR (Section 2.3). The
endpoint MAY then choose to immediately send an ACCEPT_GZIPPED_DATA
setting with the value 0.
If an intermediary propagates a GZIPPED_DATA frame from the source
peer to the destination peer without modifying the payload or its
encoding, and receives a DATA_ENCODING_ERROR from the receiving peer,
it SHOULD pass the error on to the source peer.
GZIPPED_DATA frames MUST be associated with a stream. If a
GZIPPED_DATA frame is received whose stream identifier field is 0x0,
the recipient MUST respond with a connection error ([RFC7540],
Section 5.4.1) of type PROTOCOL_ERROR.
GZIPPED_DATA frames are subject to flow control and can only be sent
when a stream is in the "open" or "half closed (remote)" states. The
entire GZIPPED_DATA frame payload is included in flow control,
including the Pad Length and Padding fields if present. If a
GZIPPED_DATA frame is received whose stream is not in "open" or "half
closed (local)" state, the recipient MUST respond with a stream error
([RFC7540], Section 5.4.2) of type STREAM_CLOSED.
GZIPPED_DATA frames can include padding. Padding fields and flags
are identical to those defined for DATA frames ([RFC7540],
Section 6.1).
2.3. DATA_ENCODING_ERROR
The following new error code is defined:
o "DATA_ENCODING_ERROR" (0xf0000000): The endpoint detected that its
peer sent a GZIPPED_DATA frame with an invalid encoding.
*NOTE*: This is an experimental value; if standardised, a
permanent value will be assigned.
3. Experimental Status
This extension is classified as an experiment because it alters the
base semantics of HTTP/2; a change that, if specified insufficiently
or implemented incorrectly, could result in data loss that is hard to
detect or diagnose.
[RFC7540], Section 5.5, mandates that "implementations MUST discard
frames that have unknown or unsupported types"; so if an endpoint or
Kerwin Expires April 14, 2017 [Page 5]
Internet-Draft http2-encoded-data October 2016
intermediary mishandles GZIPPED_DATA frames, for example by
incorrectly emitting an ACCEPT_GZIPPED_DATA setting or propagating
GZIPPED_DATA frames, and those frames are subsequently discarded,
data will be lost. There is no reliable mechanism to detect such a
loss[*].
The experiment therefore is to explore the robustness of the HTTP/2
ecosystem in the presence of such potential failures.
[*] For some unreliable mechanisms (i.e. not guaranteed to be in use
in all cases, and/or requiring inspection of HTTP headers) see:
o Section 8.1.2.6 of [RFC7540], for using the content-length header
to detect malformed messages
o [RFC3230], for HTTP instance digests
4. Security Considerations
Further to the Use of Compression in HTTP/2 ([RFC7540],
Section 10.6), intermediaries MUST NOT apply compression to DATA
frames, or alter the compression of GZIPPED_DATA frames other than
decompressing, unless additional information is available that allows
the intermediary to identify the source of data. In particular,
frames that are not compressed cannot be compressed, and frames that
are separately compressed cannot be merged into a single compressed
frame.
5. IANA Considerations
This document updates the registries for frame types, settings, and
error codes in the "Hypertext Transfer Protocol (HTTP) 2 Parameters"
section.
5.1. HTTP/2 Frame Type Registry Update
This document updates the "HTTP/2 Frame Type" registry ([RFC7540],
Section 11.2). The entries in the following table are registered by
this document.
+--------------+------+-------------+
| Frame Type | Code | Section |
+--------------+------+-------------+
| GZIPPED_DATA | TBD | Section 2.2 |
+--------------+------+-------------+
Kerwin Expires April 14, 2017 [Page 6]
Internet-Draft http2-encoded-data October 2016
5.2. HTTP/2 Settings Registry Update
This document updates the "HTTP/2 Settings" registry ([RFC7540],
Section 11.3). The entries in the following table are registered by
this document.
+---------------------+------+---------------+---------------+
| Frame Type | Code | Initial Value | Specification |
+---------------------+------+---------------+---------------+
| ACCEPT_GZIPPED_DATA | TBD | 0 | Section 2.1 |
+---------------------+------+---------------+---------------+
5.3. HTTP/2 Error Code Registry Update
This document updates the "HTTP/2 Error Code" registry ([RFC7540],
Section 11.4). The entries in the following table are registered by
this document.
+---------------------+------+----------------------+---------------+
| Name | Code | Description | Specification |
+---------------------+------+----------------------+---------------+
| DATA_ENCODING_ERROR | TBD | Invalid encoding | Section 2.3 |
| | | detected | |
+---------------------+------+----------------------+---------------+
6. Acknowledgements
Thanks to Keith Morgan for his advice, input, and editorial
contributions.
7. References
7.1. Normative References
[RFC1952] Deutsch, P., "GZIP file format specification version 4.3",
RFC 1952, DOI 10.17487/RFC1952, May 1996,
<http://www.rfc-editor.org/info/rfc1952>.
[RFC2119] Bradner, S., "Key words for use in RFCs to Indicate
Requirement Levels", BCP 14, RFC 2119,
DOI 10.17487/RFC2119, March 1997,
<http://www.rfc-editor.org/info/rfc2119>.
[RFC7540] Belshe, M., Peon, R., and M. Thomson, Ed., "Hypertext
Transfer Protocol Version 2 (HTTP/2)", RFC 7540,
DOI 10.17487/RFC7540, May 2015,
<http://www.rfc-editor.org/info/rfc7540>.
Kerwin Expires April 14, 2017 [Page 7]
Internet-Draft http2-encoded-data October 2016
7.2. Informative References
[RFC3230] Mogul, J. and A. Van Hoff, "Instance Digests in HTTP",
RFC 3230, DOI 10.17487/RFC3230, January 2002,
<http://www.rfc-editor.org/info/rfc3230>.
[RFC7230] Fielding, R., Ed. and J. Reschke, Ed., "Hypertext Transfer
Protocol (HTTP/1.1): Message Syntax and Routing",
RFC 7230, DOI 10.17487/RFC7230, June 2014,
<http://www.rfc-editor.org/info/rfc7230>.
Appendix A. Changelog
Since -08:
o use experimental values for frame/setting/error codes
Since -07:
o define "reliable" in the 'experimental' section, and provide
pointers to potential workarounds
o remove fragmentation, since the text added no value
Since -06:
o change document title from "Encoded" to "Gzipped"
o improve text under GZIPPED_DATA (Section 2.2)
o clarify that GZIPPED_DATA and DATA can be interleaved
o explain experimental status and risks of broken implementations
Since -05:
o changed ACCEPT_ENCODED_DATA back from a frame to a setting, since
it carries a single scalar value now
Since -04:
o reduced encoding options to only gzip (suggested by Martin
Thomson)
o remove fragmentation and segment stuff, including reference to
'http2-segments' I-D
o updated HTTP/2 reference from I-D to (freshly published) RFC7230
Kerwin Expires April 14, 2017 [Page 8]
Internet-Draft http2-encoded-data October 2016
Since -03:
o added 'identity' encoding; removed 'compress' and 'zlib'
(suggested by PHK)
o added SEGMENT flag, for segments that don't continue
o clarified that ACCEPT is for a connection, and ENCODED_DATA is for
a stream
o copied "padding" text from HTTP/2 draft
Since -02:
o moved all discussion of fragmentation and segments to its own
section
Since -01:
o referenced new draft-kerwin-http2-segments to handle fragmentation
Since -00:
o changed ACCEPT_ENCODED_DATA from a complex setting to a frame
o improved IANA Considerations section (with lots of input from
Keith Morgan)
Author's Address
Matthew Kerwin
Email: matthew@kerwin.net.au
URI: http://matthew.kerwin.net.au/
Kerwin Expires April 14, 2017 [Page 9]