Network Working Group | V. Birk |
Internet-Draft | H. Marques |
Intended status: Standards Track | pEp Foundation |
Expires: December 28, 2018 | B. Hoeneisen |
Ucom.ch | |
June 26, 2018 |
IANA Registration of Trustword Lists: Guide, Template and IANA Considerations
draft-birk-pep-trustwords-02
This document specifies the IANA Registration Guidelines for Trustwords, describes corresponding registration procedures, and provides a guideline for creating Trustword list specifications.
Trustwords are common words in a natural language (e.g., English) to which the hexadecimal strings are mapped to. This makes verification processes (e.g., comparison of fingerprints), more practical and less prone to misunderstandings.
This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79.
Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet-Drafts is at https://datatracker.ietf.org/drafts/current/.
Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress."
This Internet-Draft will expire on December 28, 2018.
Copyright (c) 2018 IETF Trust and the persons identified as the document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (https://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Simplified BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Simplified BSD License.
In public-key cryptography comparing the public keys’ fingerprints of the communication partners involved is vital to ensure that there is no man-in-the-middle (MITM) attack on the communication channel. Fingerprints normally consist of a chain of hexadecimal chars. However, comparing hexadecimal strings is often impractical for regular human users and prone to misunderstandings.
To mitigate these challenges, several systems offer the comparison of Trustwords as an alternative to hexadecimal strings. Trustwords are common words in a natural language (e.g., English) to which the hexadecimal strings are mapped to. This makes the verification process more natural for human users.
For example, in pEp’s proposition of Privacy by Default [I-D.birk-pep] Trustwords are used to achieve easy contact verification for end-to-end encryption. Trustword comparison is offered after the peers have exchanged public keys opportunistically. Examples for Trustword lists used by current pEp implementations can be found in CSV format, here: https://pep.foundation/dev/repos/pEpEngine/file/tip/db.
In addition to contact verification, Trustwords are also used for other purposes, such as Human-Readable 128-bit Keys [RFC1751], One Time Passwords (OTP) [RFC1760] [RFC2289], SSH host-key verification, VPN Server certificate verification, and to import or synchronize secret key across different devices of the same user [E-D.birk-pep-keysync]. Further ideas include to use Trustwords for contact verification in Extensible Messaging and Presence Protocol (XMPP) [RFC6120], for X.509 [RFC3647] certificate verification in browsers or in block chain applications for crypto currencies.
The key words “MUST”, “MUST NOT”, “REQUIRED”, “SHALL”, “SHALL NOT”, “SHOULD”, “SHOULD NOT”, “RECOMMENDED”, “MAY”, and “OPTIONAL” in this document are to be interpreted as described in [RFC2119].
A fingerprint typically looks like:
Its mapping to English Trustwords could look like:
Or its mapping to German Trustwords could like like:
Instead of the former hexadecimal string, users can compare ten common words of their language.
Note: This examples are for illustration purposes only and do not make use any any published Trustword list.
The basic concept of Trustwords mapping has been already documented in the past, e.g. for use in One-Time Passwords (OTP) [RFC1751] [RFC1760] [RFC2289] or the PGP Word List (“Pretty Good Privacy word list” [PGP.wl], also called a biometric word list, to compare fingerprints.
Regarding today’s needs, previous proposals have the following shortcomings:
Furthermore, there are differences in the basic concept:
If the number of Trustwords is low, a lot of Trustwords need to be compared, which make a comparison somewhat cumbersome for users. This may lead to degraded usability.
To reduce the number of Trustwords to compare, in pEp’s proposition of Privacy by Default [I-D.birk-pep] 16-bit scalars are mapped to natural language words. Therefore, the size (by number of key – value pairs) of any key – value pair structure is 65536. However, the number of unique values to be used in a language may be less than 65536. This can be addressed e.g. by using the same value (Trustword) for more than one key. In these cases, the entropy of the representation is slightly reduced. (Example: A Trustwords list of just 42000 words still allows for an entropy of log_2(42000) ~= 15.36 bits in 16-bit mappings.)
On the other hand, small sized Trustword lists allow for Trustwords with shorter strings, which are easier to use in scenarios where Trustwords have to be typed in e.g. OTP applications.
The specification allows for different dictionary sizes.
Although English is rather widespread around the world, the vast majority of the worlds’ population does not speak English. For an application to be useful for ordinary people, localization is a must. Thus, Trustword lists in different languages can be registered.
For applications where two human establish communication it is very likely that they share a common language. So far no real use case for translations between Trustword lists in different languages has been identified. As translations also drastically increases the complexity for IANA registrations, translations of Trustwords beyond the scope of this document.
Every Trustwords list SHOULD be cleared from swearwords in order to not offense users. This is a task to be carried out by speakers of the respective natural language (i.e., by native language speakers).
Each natural language requires a different set of Trustwords. To allow implementers for identical Trustword lists, a IANA registry is to be established. The IANA registration policy according to [RFC8126] is “Expert Review” and “Specification Required”.
[[ Note: Further details of the IANA registry and requirements for the expert to assess the specification are for further study. A similar approach as used in [RFC6117] is likely followed. ]]
<record> <languagecode> <!-- ISO 639-3 (e.g. eng, deu, ...) --> </languagecode> <bitsize> <!-- How many bits can be mapped with this list (e.g. 8, 16, ...) --> </bitsize> <numberofuniquewords> <!-- number of unique words registered (e.g. 256, 65536, ...) --> </numberofuniquewords> <bijective> <!-- whether or not the list allows for a two-way-mapping (e.g. yes, no) --> </bijective> <version> <!-- version number within language (e.g. b.1.2, n.0.1, ...) --> </version> <registrationdocs> <!-- Change accordingly --> <xref type="rfc" data="rfc2551"/> </registrationdocs> <requesters> <!-- Change accordingly --> <xref type="person" data="John_Doe"/> <xref type="person" data="Jane_Dale"/> </requesters> <additionalinfo> <paragraph> <!-- Text with additional information about the Wordlist to be registered --> </paragraph> <artwork> <!-- There can be artwork sections, too --> </artwork> </additionalinfo> <wordlist> <!-- Change accordingly --> <w0>first</w0> <w1>second</w1> [...] <w65535>last<w65535> </wordlist> </record> <people> <person id="John_Doe"> <name> <!-- Firstname Lastname --> </name> <org> <!-- Organization Name --> </org> <uri> <!-- mailto: or http: URI --> </uri> <updated> <!-- date format YYYY-MM-DD --> </updated> </person> <!-- repeat person section for each person --> </people>
Authors of a Wordlist are encouraged to use these XML chunks as a template to create the IANA Registration Template.
An IANA registration will contain the fallowing elements:
The language code follows the ISO 639-3 specification [ISO693], e.g., eng, deu.
[[ Note: It is for further study, which of the ISO 639 Specifications is most suitable to address the Trustwords’ challenge. ]]
Example usage for German:
e.g. <languagecode>deu</languagecode>
The bit size is the number of bits that can be mapped with the Wordlist. The number of registered words in a word list MUST be 2 ^ (<bitsize>).
Example usage for 16-bit Wordlist:
e.g. <bitsize>16</bitsize>
The number of unique words that are registered.
e.g. <numberofuniquewords>65536</numberofuniquewords>
Whether the registered Wordlist has a one-to-one mapping, meaning the number of unique words registered equals 2 ^ (<bitsize>).
Valid content: ( yes | no )
e.g. <bijective>yes</bijective>
The version of the Wordlist MUST be unique within a language code.
[[ Note: Requirements to a “smart” composition of the version number are for further study ]]
e.g. <version>b.1.2</version>
Reference(s) to the Document(s) containing the Wordlist
e.g. <registrationdocs> <xref type="rfc" data="rfc4979"/> </registrationdocs> e.g. <registrationdocs> <xref type="rfc" data="rfc8888"/> (obsoleted by RFC 9999) <xref type="rfc" data="rfc9999"/> </registrationdocs> e.g. <registrationdocs> [International Telecommunications Union, "Wordlist for Foobar application", ITU-F Recommendation B.193, Release 73, Mar 2009.] </registrationdocs>
The persons requesting the registration of the Wordlist. Usually these are the authors of the Wordlist.
e.g. <requesters> <xref type="person" data="John_Doe"/> </requesters> <people> <person id="John_Doe"> <name>John Doe</name> <org>Example Inc.</org> <uri>mailto:john.doe@example.com</uri> <updated>2018-06-20</updated> </person> </people>
Note: If there is more than one requester, there must be one <xref> element per requester in the <requesters> element, and one <person> chunk per requester in the <people> element.
Any other information the authors deem interesting.
e.g. <additionalinfo> <paragraph>more info goes here</paragraph> </additionalinfo>
Note: If there is no such additional information, then the <additionalinfo> element is omitted.
The full Wordlist to be registered. The number of words MUST be a power of 2 as specified above. The element names serve as key used for enumeration of the Trustwords (starting at 0) and the elements contains the values being individual natural language words in the respective language.
e.g. <wordlist> <w0>first</w0> <w1>second</w1> [...] <w65535>last<w65535> </wordlist> ] ]>
[[ Note: The exact representation of the Wordlist is for further study. ]]
There are no special security considerations.
The authors would like to thank the following people who have provided feedback or significant contributions to the development of this document: Andrew Sullivan, Claudio Luck, Daniel Kahn Gilmore, Michael Richardson, Rich Salz, and Yoav Nir.
This work was initially created by pEp Foundation, and then reviewed and extended with funding by the Internet Society’s Beyond the Net Programme on standardizing pEp. [ISOC.bnet]
[RFC2119] | Bradner, S., "Key words for use in RFCs to Indicate Requirement Levels", BCP 14, RFC 2119, DOI 10.17487/RFC2119, March 1997. |
[RFC4949] | Shirey, R., "Internet Security Glossary, Version 2", FYI 36, RFC 4949, DOI 10.17487/RFC4949, August 2007. |
[RFC8126] | Cotton, M., Leiba, B. and T. Narten, "Guidelines for Writing an IANA Considerations Section in RFCs", BCP 26, RFC 8126, DOI 10.17487/RFC8126, June 2017. |
This section contains a non-normative example of the IANA Registration Template XML chunk.
<record> <languagecode>lat</languagecode> <bitsize>16</bitsize> <numberofuniquewords>57337</numberofuniquewords> <bijective>no</bijective> <version>n.0.1</version> <registrationdocs> <xref type="rfc" data="rfc2551"/> </registrationdocs> <requesters> <xref type="person" data="Julius_Caesar"/> </requesters> <additionalinfo> <paragraph> This Wordlist has been optimized for the Roman Standards Process. </paragraph> </additionalinfo> <wordlist> <w0>errare</w0> <w1>humanum</w1> [...] <w65535>est<w65535> </wordlist> </record> <people> <person id="Julius_Caesar"> <name>Julius Caesar</name> <org>Curia Romana</org> <uri>mailto:julius.cesar@example.com</uri> <updated>1999-12-31</updated> </person> </people>
[[ RFC Editor: This section is to be removed before publication ]]
[[ RFC Editor: This section should be empty and is to be removed before publication ]]