Internet-Draft | LPWAN CoAP compression | October 2020 |
Minaburo, et al. | Expires 23 April 2021 | [Page] |
This draft defines how Static Context Header Compression (SCHC) can be applied to the Constrained Application Protocol (CoAP). SCHC is a header compression mechanism adapted for constrained devices. SCHC uses a static description of the header to reduce the redundancy and size of the header's information. While RFC 8724 describes the SCHC compression and fragmentation framework, and its application for IPv6/UDP headers, this document applies SCHC for CoAP headers. The CoAP header structure differs from IPv6 and UDP since CoAP uses a flexible header with a variable number of options, themselves of variable length. The CoAP protocol messages format is asymmetric: the request messages have a header format different from the one in the response messages. This specification gives guidance on applying SCHC to flexible headers and how to leverage the asymmetry for more efficient compression Rules.¶
This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79.¶
Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet-Drafts is at https://datatracker.ietf.org/drafts/current/.¶
Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress."¶
This Internet-Draft will expire on 23 April 2021.¶
Copyright (c) 2020 IETF Trust and the persons identified as the document authors. All rights reserved.¶
This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (https://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Simplified BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Simplified BSD License.¶
CoAP [rfc7252] is a command/response protocol designed for micro-controllers with a small amount of RAM and ROM and is optimized for REST-based (Representational state transfer) services. Although CoAP was designed for Low-Power Wireless Personal Area Networks (6LoWPAN), a CoAP header's size is still too large for LPWAN (Low Power Wide Area Networks) and some compression of the CoAP header is required either to increase performances or allow CoAP other some LPWAN technologies.¶
The [rfc8724] defines SCHC, a header compression mechanism for the LPWAN network based on a static context. Section 5 of the [rfc8724] explains the architecture where compression and decompression are done. The SCHC compression scheme assumes as a prerequisite that the static context is known to both endpoints before transmission. The way the context is configured, provisioned or exchanged is out of this document's scope.¶
CoAP is an application protocol, so CoAP compression requires installing common rules between the two SCHC instances. SCHC compression may apply at two different levels: one to compress IP and UDP in the LPWAN network and another at the application level for CoAP. These two compressions may be independent. Both follow the same principle described in RFC8724. SCHC rules driving the compression/decompression are different and may be managed by different entities. The [rfc8724] describes how the IP and UDP headers may be compressed. This document specifies how the SCHC compression rules can be applied to CoAP traffic.¶
SCHC compresses and decompresses headers based on shared contexts between devices. Each context consists of multiple Rules. Each Rule can match header fields and specific values or ranges of values. If a Rule matches, the matched header fields are replaced by the RuleID and some residual bits. Thus, different Rules may correspond to divers protocols packets that a device expects to send or receive.¶
A Rule describes the packets's entire header with an ordered list of fields descriptions; see section 7 of [rfc8724]. Thereby each description contains the field ID (FID), its length (FL), and its position (FP), a direction indicator (DI) (upstream, downstream, and bidirectional), and some associated Target Values (TV). The direction indicator is used for compression to give the best TV to the FID when these values differ in the transmission direction. So a field may be described several times depending on the asymmetry of its possible TVs.¶
A Matching Operator (MO) is associated with each header field description. The Rule is selected if all the MOs fit the TVs for all fields of the incoming header. A rule cannot be selected if the message contains a field unknown to the SCHC compressor.¶
In that case, a Compression/Decompression Action (CDA) associated with each field give the method to compress and decompress each field. Compression mainly results in one of 4 actions:¶
After applying the compression, there may be some bits to be sent. These values are called Compression Residues.¶
SCHC is a general mechanism applied to different protocols, the exact Rules to be used depending on the protocol and the application. Section 10 of the This document targets the CoAP header compression using SCHC.[rfc8724] describes the compression scheme for IPv6 and UDP headers.¶
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in BCP 14 [RFC2119][rfc8174] when, and only when, they appear in all capitals, as shown here.¶
The SCHC Compression Rules can be applied to CoAP headers. SCHC Compression of the CoAP header MAY be done in conjunction with the lower layers (IPv6/UDP) or independently. The SCHC adaptation layers, described in Section 5 of [rfc8724], may be used, as shown in Figure 1,Figure 2 and Figure 3¶
In the first example, Figure 1, a Rule compresses the complete header stack from IPv6 to CoAP. In this case, SCHC C/D (Static Context Header Compression Compressor/Decompressor) is performed at the device and the application. The host communicating with the device does not implement SCHC C/D.¶
The SCHC can be viewed as a layer above layer 2. This layer received non-encrypted packets and can apply compression rule to all the headers. On the other end, the NGW receives the SCHC packet and reconstructs the headers from the rule, identified by its ID and the header residues. The result is a regular IPv6 packet that can be forwarded toward the destination. The same process applies in the other direction. A not encrypted packet arrived at the NGW, thanks to IP forwarding based on the IPv6 prefix. The NGW identifies the device and compresses headers using the device's rules.¶
In the second example, Figure 2, the SCHC compression is applied in the CoAP layer, compressing the CoAP header independently of the other layers. The RuleID, the Compression Residue, and CoAP payload are encrypted using a mechanism such as DTLS. Only the other end (App) can decipher the information. If needed, layers below use SCHC to compress the header as defined in [rfc8724] document (represented in dotted lines).¶
This use case needs an end-to-end context initialization between the device and the application and is out-of-scope of this document.¶
In the third example, Figure 3, the Object Security for Constrained RESTful Environments (OSCORE) [rfc8613] is used. In this case, two rulesets are used to compress the CoAP message. A first ruleset focused on the inner header compresses it. The result is encrypted using the OSCORE mechanism. A second ruleset compresses the outer header, including the OSCORE Options.¶
In the case of several SCHC instances, as shown in Figure 3 and Figure 3, the rulesets may come from different provisioning domains.¶
This document focuses on CoAP compression represented in the dashed boxes in the previous figures.¶
The use of SCHC over the CoAP header uses the same description and compression/decompression techniques like the one for IP and UDP explained in the [rfc8724]. For CoAP, SCHC Rules description uses the direction information to optimize the compression by reducing the number of Rules needed to compress headers. The field description MAY define both request/response headers and target values in the same Rule, using the DI (direction indicator) to make the difference.¶
As for other header compression protocols, when the compressor does not find a correct Rule to compress the header, the packet MUST be sent uncompressed using the RuleID dedicated to this purpose. Where the Compression Residue is the complete header of the packet. See section 6 of [rfc8724].¶
CoAP compression differs from IPv6 and UDP compression on the following aspects:¶
The CoAP protocol is asymmetric; the headers are different for a request or a response. For example, the URI-Path option is mandatory in the request, and it may not be present in the response. A request may contain an Accept option, and the response may include a Content-Format option. In comparison, IPv6 and UDP returning path swap the value of some fields in the header. But all the directions have the same fields (e.g., source and destination address fields).¶
The [rfc8724] defines the use of a Direction Indicator (DI) in the Field Descriptor, which allows a single Rule to process a message headers differently depending on the direction.¶
Even when a field is "symmetric" (i.e., found in both directions), the values carried in each direction are different.¶
The compression may use a matching list in the TV to limit the range of expected values in a particular direction and therefore reduce the Compression Residue's size. Through the Direction Indicator (DI), a field description in the Rules splits the possible field value into two parts, one for each direction. For instance, if a client sends only CON requests, the type can be elided by compression, and the answer may use one single bit to carry either the ACK or RST type. The field Code has the same behavior, the 0.0X code format value in the request, and Y.ZZ code format in the response.¶
Headers in IPv6 and UDP have a fixed size. The size is not sent as part of the Compression Residue but is defined in the Rule. Some CoAP header fields have variable lengths, so the length is also specified in the Field Description. For example, the Token size may vary from 0 to 8 bytes. And the CoAP options have a variable length since they use the Type-Length-Value encoding format, as URI-path or URI-query.¶
Section 7.5.2 from [rfc8724] offers the possibility to define a function for the Field length in the Field Description to know the length before compression. When doing SCHC compression of a variable-length field,¶
if the field size is unknown, the Field Length in the Rule is set as variable, and the size is sent with the Compression Residue.¶
This section discusses the compression of the different CoAP header fields. The CoAP compression with SCHC follows the Section 7.1 of [rfc8724].¶
CoAP version is bidirectional and MUST be elided during the SCHC compression since it always contains the same value. In the future, if new versions of CoAP are defined, new Rules will be needed to avoid ambiguities between versions.¶
The CoAP Protocol [rfc7252] has four types of messages: two requests (CON, NON), one response (ACK), and one empty message (RST).¶
The field SHOULD be elided if, for instance, a client is sending only NON or only CON messages. For the RST message, a dedicated Rule may be needed. For other usages, a mapping list can be used.¶
The code field indicates the Request Method used in CoAP, an IANA registry [rfc7252]. The compression of the CoAP code field follows the same principle as that of the CoAP type field. If the device plays a specific role, the set of code values can be split into two parts, the request codes with the 0 class and the response values.¶
If the device only implements a CoAP client, the request code can be reduced to the set of requests the client can to process.¶
A mapping list can be used for known values. The field cannot be compressed for other values, and the value needs to be sent in the Compression Residue.¶
The Message ID field can be compressed with the MSB(x) MO and the Least Significant Bits (LSB) CDA. See section 7.4 of [rfc8724].¶
A Token is defined through two CoAP fields, Token Length in the mandatory header and Token Value directly following the mandatory CoAP header.¶
Token Length is processed as any protocol field. If the value does not change, the size can be stored in the TV and elided during the transmission. Otherwise, it will have to be sent in the Compression Residue.¶
Token Value MUST NOT be sent as a variable-length residue to avoid ambiguity with Token Length. Therefore, the Token Length value MUST be used to define the size of the Compression Residue. A specific function designated as "TKL" MUST be used in the Rule. During the decompression, this function returns the value contained in the Token Length field.¶
CoAP defines options that are placed after the based header in Option Numbers order, see [rfc7252]. Each Option instance in a message uses the format Delta-Type (D-T), Length (L), Value (V). When applying SCHC compression to the Option, the D-T, L, and V format serve to make the Rule description of the Option. The SCHC compression builds the description of the Option by using in the Field ID the Option Number built from D-T; in TV, the Option Value; and the Option Length uses section 7.4 of [rfc8724]. When the Option Length has a wellknown size, it can be stored in the Rule. Therefore, SCHC compression does not send it. Otherwise, SCHC Compression carries the length of the Compression Residue, in addition to the Compression Residue value.¶
CoAP requests and responses do not include the same options. So Compression Rules may reflect this asymmetry by tagging the direction indicator.¶
Note that length coding differs between CoAP options and SCHC variable size Compression Residue.¶
The following sections present how SCHC compresses some specific CoAP options.¶
If a new option is introduced in CoAP, a new Field ID has to be assigned in the Rules to allow its compression. Otherwise, if no Rule describes this Option, the SCHC compression is not possible, and the CoAP header is sent without compression.¶
If the client expects a single value, it can be stored in the TV and elided during the transmission. Otherwise, if the client expects several possible values, a matching-list SHOULD be used to limit the Compression Residue's size. Otherwise, the value has to be sent as a Compression Residue (fixed or variable length).¶
If both ends know the value, the value can be elided.¶
A matching list can be used if some well-known values are defined.¶
Otherwise, these options can be sent as a Compression Residue.¶
Uri-Path and Uri-Query elements are repeatable options. The Field Position (FP) gives the position in the path.¶
A Mapping list can be used to reduce the size of variable Paths or Queries. In that case, to optimize the compression, several elements can be regrouped into a single entry. The Numbering of elements do not change; MO comparison is set with the first element of the matching.¶
In Figure 4, a single bit residue can be used to code one of the 2 paths. If regrouping were not allowed, a 2 bits residue would be needed. The third path element is sent as a variable size residue.¶
When the length is not known at the Rule creation, the Field Length MUST be set to variable, and the unit is set to bytes.¶
The MSB MO can be applied to a Uri-Path or Uri-Query element. Since MSB value is given in bit, the size MUST always be a multiple of 8 bits.¶
The length sent at the beginning of a variable-length residue indicates the size of the LSB in bytes.¶
For instance, for a CORECONF path /c/X6?k="eth0" the Rule can be set to:¶
Figure 5 shows the parsing and the compression of the URI, where c is not sent. The second element is sent with the length (i.e., 0x2 X 6) followed by the query option (i.e. 0x05 "eth0").¶
The number of Uri-Path or Uri-Query elements in a Rule is fixed at the Rule creation time. If the number varies, several Rules SHOULD be created to cover all the possibilities. Another possibility is to define the length of Uri-Path to variable and send a Compression Residue with a length of 0 to indicate that this Uri-Path is empty. This adds 4 bits to the variable Residue size. See section 7.5.2 [rfc8724]¶
If the field value has to be sent, TV is not set, MO is set to "ignore", and CDA is set to "value-sent." A mapping MAY also be used.¶
Otherwise, the TV is set to the value, MO is set to "equal", and CDA is set to "not-sent".¶
These fields' values cannot be stored in a Rule entry. They MUST always be sent with the Compression Residues.¶
Block [rfc7959] allows a fragmentation at the CoAP level. SCHC also includes a fragmentation protocol. They can be both used. If a block option is used, its content MUST be sent as a Compression Residue.¶
The [rfc7641] defines the Observe option. The TV is not set, MO is set to "ignore", and the CDA is set to "value-sent". SCHC does not limit the maximum size for this option (3 bytes). To reduce the transmission size, either the device implementation MAY limit the delta between two consecutive values, or a proxy can modify the increment.¶
Since an RST message may be sent to inform a server that the client does not require Observe response; a Rule SHOULD exist to allow the message's compression with the RST type.¶
The [rfc7967] defines a No-Response option limiting the responses made by a server to a request. If both ends know the value, then TV is set to this value, MO is set to "equal", and CDA is set to "not-sent".¶
Otherwise, if the value is changing over time, TV is not set, MO is set to "ignore", and CDA to "value-sent". A matching list can also be used to reduce the size.¶
OSCORE [rfc8613] defines end-to-end protection for CoAP messages. This section describes how SCHC Rules can be applied to compress OSCORE-protected messages.¶
The encoding of the OSCORE Option Value defined in Section 6.1 of [rfc8613] is repeated in Figure 6.¶
The first byte specifies the content of the OSCORE options using flags. The three most significant bits of this byte are reserved and always set to 0. Bit h, when set, indicates the presence of the kid context field in the option. Bit k, when set, indicates the presence of a kid field. The three least significant bits n indicate the length of the piv (Partial Initialization Vector) field in bytes. When n = 0, no piv is present.¶
The flag byte is followed by the piv field, kid context field, and kid field in this order, and if present, the length of the kid context field is encoded in the first byte denoting by s the length of the kid context in bytes.¶
This specification recommends identifying the OSCORE Option and the fields it contains. Conceptually, it discerns up to 4 distinct pieces of information within the OSCORE option: the flag bits, the piv, the kid context, and the kid. The SCHC Rule splits into four field descriptions the OSCORE option to compress them:¶
Figure 6 shows the OSCORE Option format with those four fields superimposed on it. Note that the CoAP OSCORE_kidctx field includes directly the size octet s.¶
In this first scenario, the LPWAN Compressor at the Network Gateway side receives from an Internet client a POST message, which is immediately acknowledged by the Device. For this simple scenario, the Rules are described in Figure 7.¶
The version and Token Length fields are elided. The 26 method and response codes defined in [rfc7252] has been shrunk to 5 bits using a matching list. Uri-Path contains a single element indicated in the matching operator.¶
SCHC Compression reduces the header sending only the Type, a mapped code and the least significant bits of Message ID (9 bits in the example above).¶
Note that a request sent by a client located in an Application Server to a server located in the device, may not be compressed through this Rule since the MID will not start with 7 bits equal to 0. A CoAP proxy, before the core SCHC C/D can rewrite the message ID to a value matched by the Rule.¶
OSCORE aims to solve the problem of end-to-end encryption for CoAP messages. The goal, therefore, is to hide as much of the message as possible while still enabling proxy operation.¶
Conceptually this is achieved by splitting the CoAP message into an Inner Plaintext and Outer OSCORE Message. The Inner Plaintext contains sensitive information that is not necessary for proxy operation. This, in turn, is the part of the message which can be encrypted until it reaches its end destination. The Outer Message acts as a shell matching the regular CoAP message format and includes all Options and information needed for proxy operation and caching. This decomposition is illustrated in Figure 8.¶
CoAP options are sorted into one of 3 classes, each granted a specific type of protection by the protocol:¶
Additionally, the OSCORE Option is added as an Outer option, signaling that the message is OSCORE protected. This option carries the information necessary to retrieve the Security Context with which the message was encrypted to be correctly decrypted at the other end-point.¶
Figure 8 shows the message format for the OSCORE Message and Plaintext.¶
In the Outer Header, the original message code is hidden and replaced by a default dummy value. As seen in Sections 4.1.3.5 and 4.2 of [rfc8613], the message code is replaced by POST for requests and Changed for responses when Observe is not used. If Observe is used, the message code is replaced by FETCH for requests and Content for responses.¶
The original message code is put into the first byte of the Plaintext. Following the message code, the class E options come, and, if present, the original message Payload is preceded by its payload marker.¶
The Plaintext is now encrypted by an AEAD algorithm which integrity protects Security Context parameters and, eventually, any class I options from the Outer Header. Currently, no CoAP options are marked class I. The resulting Ciphertext becomes the new Payload of the OSCORE message, as illustrated in Figure 9.¶
As defined in [rfc5116], this Ciphertext is the concatenation of the encrypted Plaintext and its authentication tag. Note that Inner Compression only affects the Plaintext before encryption. Thus only the first variable-length of the Ciphertext can be reduced. The authentication tag is fixed in length and is considered part of the cost of protection.¶
The SCHC Compression scheme consists of compressing both the Plaintext before encryption and the resulting OSCORE message after encryption, see Figure 10.¶
This translates into a segmented process where SCHC compression is applied independently in 2 stages, each with its corresponding set of Rules, with the Inner SCHC Rules and the Outer SCHC Rules. This way, compression is applied to all fields of the original CoAP message.¶
Note that since the corresponding end-point can only decrypt the Inner part of the message, this end-point will also have to implement Inner SCHC Compression/Decompression.¶
An example is given with a GET Request and its consequent Content Response from a device-based CoAP client to a cloud-based CoAP server. A possible set of Rules for the Inner and Outer SCHC Compression is shown. A dump of the results and a contrast between SCHC + OSCORE performance with SCHC + COAP performance is also listed. This gives an approximation to the cost of security with SCHC-OSCORE.¶
Our first example CoAP message is the GET Request in Figure 11¶
Its corresponding response is the CONTENT Response in Figure 12.¶
The SCHC Rules for the Inner Compression include all fields already present in a regular CoAP message. The methods described in Section 4 apply to these fields. As an example, see Figure 13.¶
Figure 14 shows the Plaintext obtained for the example GET Request and follows the process of Inner Compression and Encryption until the end up with the Payload to be added in the outer OSCORE Message.¶
In this case, the original message has no payload, and its resulting Plaintext can be compressed up to only 1 byte (size of the RuleID). The AEAD algorithm preserves this length in its first output and yields a fixed-size tag that cannot be compressed and has to be included in the OSCORE message. This translates into an overhead in total message length, limiting the amount of compression that can be achieved and plays into the cost of adding security to the exchange.¶
In Figure 15, the process is repeated for the example CONTENT Response. The residue is 1 bit long. Note that since SCHC adds padding after the payload, this misalignment causes the hexadecimal code from the payload to differ from the original, even though it has not been compressed.¶
On top of this, the overhead from the tag bytes is incurred as before.¶
The Outer SCHC Rules (Figure 18) must process the OSCORE Options fields. The Figure 16 and Figure 17 show a dump of the OSCORE Messages generated from the example messages once they have been provided with the Inner Compressed Ciphertext in the payload. These are the messages that have to be compressed by the Outer SCHC Compression.¶
For the flag bits, some SCHC compression methods are useful, depending on the application. The simplest alternative is to provide a fixed value for the flags, combining MO equal and CDA not- sent. This saves most bits but could prevent flexibility. Otherwise, match-mapping could be used to choose from an interesting number of configurations for the exchange. Otherwise, MSB could be used to mask off the 3 hard-coded most significant bits.¶
Note that fixing a flag bit will limit CoAP Options choice that can be used in the exchange since their values are dependent on certain options.¶
The piv field lends itself to having some bits masked off with MO MSB and CDA LSB. This could be useful in applications where the message frequency is low such as LPWAN technologies. Note that compressing the sequence numbers effectively reduces the maximum number of sequence numbers used in an exchange. Once this amount is exceeded, the OSCORE keys need to be re-established.¶
The size s included in the kid context field MAY be masked off with CDA MSB. The rest of the field could have additional bits masked off or have the whole field be fixed with MO equal and CDA not-sent. The same holds for the kid field.¶
Figure 18 shows a possible set of Outer Rules to compress the Outer Header.¶
These Outer Rules are applied to the example GET Request and CONTENT Response. The resulting messages are shown in Figure 19 and Figure 20.¶
In contrast, comparing these results with what would be obtained by SCHC compressing the original CoAP messages without protecting them with OSCORE is done by compressing the CoAP messages according to the SCHC Rules in Figure 21.¶
This yields the results in Figure 22 for the Request, and Figure 23 for the Response.¶
As can be seen, the difference between applying SCHC + OSCORE as compared to regular SCHC + COAP is about 10 bytes.¶
This document has no request to IANA.¶
When applied to LPWAN, the Security Considerations of SCHC header compression [rfc8724] are valid for SCHC CoAP header compression. When CoAP uses OSCORE, the security considerations defined in [rfc8613] does not change when SCHC header compression is applied.¶
The definition of SCHC over CoAP header fields permits the compression of header information only. The SCHC header compression itself does not increase or reduce the level of security in the communication. When the connection does not use any security protocol as OSCORE, DTLS, or other, it is highly necessary to use a layer two security.¶
DoS attacks are possible if an intruder can introduce a compressed SCHC corrupted packet onto the link and cause a compression efficiency reduction. However, an intruder having the ability to add corrupted packets at the link layer raises additional security issues than those related to the use of header compression.¶
SCHC compression returns variable-length Residues for some CoAP fields. In the compressed header, the length sent is not the original header field length but the length of the Residue. So if a corrupted packet comes to the decompressor with a longer or shorter length than the one in the original header, SCHC decompression will detect an error and drops the packet.¶
OSCORE compression is also based on the same compression method described in [rfc8724]. The size of the Initialisation Vector (IV) residue must be considered carefully. A residue size obtained with LSB CDA over the IV impacts on the compression efficiency and the frequency the device will renew its key. This operation requires several exchanges and is energy-consuming.¶
SCHC header and compression Rules MUST remain tightly coupled. Otherwise, an encrypted residue may be decompressed differently by the receiver. To avoid this situation, if the Rule is modified in one location, the OSCORE keys MUST be re-established.¶
The authors would like to thank (in alphabetic order): Christian Amsuss, Dominique Barthel, Carsten Bormann, Theresa Enghardt, Thomas Fossati, Klaus Hartke, Francesca Palombini, Alexander Pelov and Goran Selander.¶