Network Working Group | A.L. Newton |
Internet-Draft | ARIN |
Intended status: Standards Track | K. Ranjbar |
Expires: January 12, 2013 | RIPE NCC |
A.L. Servin | |
LACNIC | |
B.J. Ellacott | |
APNIC | |
S. Hollenbeck | |
Verisign | |
S. Sheng | |
F. Arias | |
ICANN | |
N. Kong | |
CNNIC | |
F. Obispo | |
ISC | |
July 13, 2012 |
Using HTTP for RESTful Whois Services by Internet Registries
draft-designteam-weirds-using-http-01
This document describes the use of HTTP in Whois services using RESTful web methodologies.
This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79.
Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet- Drafts is at http:/⁠/⁠datatracker.ietf.org/⁠drafts/⁠current/⁠.
Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress."
This Internet-Draft will expire on January 12, 2013.
Copyright (c) 2012 IETF Trust and the persons identified as the document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (http:/⁠/⁠trustee.ietf.org/⁠license-⁠info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Simplified BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Simplified BSD License.
This document describes the usage of HTTP for Registration Data Directory Services running on RESTful web servers. The goal of this document is to tie together the usage patterns of HTTP into a common profile applicable to the various types of Directory Services serving Registration Data using RESTful styling. By giving the various Directory Services common behavior, a single client is better able to retrieve data from Directory Services adhering to this behavior.
In designing these common usage patterns, this draft endeavours to satisfy requirements for Registration Data Access Protocols that are documented in [draft-kucherawy-weirds-requirements]. This draft also introduces an additional design consideration to define a simple use of HTTP. Where complexity may reside, it is the goal of this specification to place it upon the server and to keep the client as simple as possible. A client should be possible using common operating system scripting tools.
This is the basic usage pattern for this protocol:
It is important to note that it is not the intent of this document to redefine the meaning and semantics of HTTP. The purpose of this document is to clarify the use of standard HTTP mechanisms for this application.
As is noted in SSAC Report on WHOIS Terminology and Structure [SAC-051], the term "Whois" is overloaded, often referring to a protocol, a service and data. In accordance with [SAC-051], this document describes the base behavior for a Registration Data Access Protocol (RD-AP). At present, there are two known types of RD-AP, a Domain Name Registration Data Access Protocol (DNRD-AP) and a Number Resource Registration Data Access Protocol (NRRD-AP). Both the DNRD-AP and NRRD-AP are to be built upon this base behavior, the RD-AP.
Note that other types of RD-AP may exist in the future.
There are a few design criteria this document attempts to support.
First, each query is meant to return either zero or one result. With the maximum upper bound being set to one, the issuance of redirects is simplified to the known query/respone model used by HTTP [RFC2616]. Should a result contain more than one result, some of which are better served by other servers, the redirection model becomes much more complicated.
Second, multiple response formats are supported by this protocol. This document outlines the base usage of JSON and XML, but server operators may support other formats as they desire if appropriate.
Third, HTTP offers a number of transport protocol mechanisms not described further in this document. Operators are able to make use of these mechanisms according to their local policy, including cache control, authorization, compression, and redirection. HTTP also benefits from widespread investment in scalability, reliability, and performance, and widespread programmer understanding of client behaviours for RESTful web services, reducing the cost to deploy Registration Data Directory Services and clients.
Clients SHOULD put the media type of the format they desire in the Accept header field, and SHOULD use the Accept header parameter "level" to indicate the version of the format acceptable [RFC2616].
Accept: applicaiton/weirds_blah+json;level=0
Figure 1
Servers SHOULD respond with an appropriate media type in the Content-Type header in accordance with the preference rules for the Accept header in HTTP [RFC2616]. Servers SHOULD affix a media type parameter of "level" appropriate to the version of the format being sent.
Content-Type: application/weirds_blah+json;level=0
Figure 2
Clients MAY use a generic media type for the desired data format of the response (e.g. "application/json"), but servers SHOULD respond with the most appropriate media type and corresponding level (e.g. "application/weirds+json;level=0"). In other words, a client may use "application/json" to express that it desires JSON or "application/weirds_blah+json" to express that it desires WEIRDS BLAH in JSON. The server MUST respond with "application/weirds_blah+json;level=0".
Servers SHOULD ignore unknown query parameters. Use of unknown query parameters for cache-busting is described in Appendix Appendix A.
This section describes the various types of responses a server may send to a client. While no standard HTTP response code is forbidden in usage, at a minimum clients should understand the response codes described in this section. It is expected that usage of response codes and types for this application not defined here will be described in subsequent documents.
If a server has the information requested by the client and wishes to respond to the client with the information according to its policies, it should encode the answer in the format most appropriate according to the standard and defined rules for processing the HTTP Accept header, and return that answer in the body of a 200 response.
If a server wishes to inform a client that the answer to a given query can be found elsewhere, it SHOULD return either a 301 or a 307 response code and an HTTP URL in the Redirect header. The client is expected to issue a subsequent query using the given URL without any processing of the URL. In other words, the server is to hand back a complete URL and the client should not have to transform the URL to follow it.
A server should use a 301 response to inform the client of a permanent move and a 307 response otherwise. For this application, such an example of a permanent move might be a TLD operator informing a client the information being sought can be found with another TLD operator (i.e. a query for the domain bar in foo.example is found at http://foo.example/domain/bar).
If a server wishes to respond that it has no information regarding the query, it SHOULD return a 404 response code. Optionally, it may include additional information regarding the lack of information as defined by Section 8.
If a server receives a query which it cannot understand, it SHOULD return a 400 response code. Optionally, it may include additional information about why it does not understand the query as defined by Section 8.
Clients may signal their desire for JSON using the "application/json" media type or a more application specific JSON media type.
Clients processing JSON [RFC4627] responses SHOULD ignore values associated with unrecognized names. Servers MAY insert values signified by names into the JSON responses which are not specified in this document. Insertion of unspecified values into JSON responses SHOULD have names prefixed with a short identifier followed by an underscore followed by a meaningful name.
For example, a JSON object may have "handle" and "remarks" formally documented in a specification. Clients adhering to that specification will have appropriate knowledge of the meaning of "handle" and "remarks".
Consider the following JSON response with JSON names.
{ "handle" : "ABC123", "remarks" : [ "she sells seas shells", "down by the seashore" ] }
Figure 3
If The Registry of the Moon desires to express information not found in the specification, it might select "lunarNic" as its identifying prefix and insert, as an example, the name "lunarNic_beforeOneSmallStep" to signify registrations occuring before the first moon landing and the name "lunarNic_harshMistressNotes" containing other descriptive text.
Consider the following JSON response with JSON names, some of which should be ignored by clients without knowledge of their meaning.
{ "handle" : "ABC123", "lunarNic_beforeOneSmallStep" : "TRUE THAT!", "remarks" : [ "she sells seas shells", "down by the seashore" ], "lunarNic_harshMistressNotes" : [ "In space,", "nobody can hear you scream." ] }
Figure 4
Insertion of unrecognized names ignored by clients may also be used for future revisions to specifications and specifications deriving extensions from a base specification.
JSON names SHOULD only consist of the alphabetic ASCII characters A through Z in both uppercase and lowercase, the numerical digits 0 through 9, underscore characters, and SHOULD NOT begin with an underscore character, numerical digit or the characters "xml". The following describes the produciton of JSON names in ABNF [RFC5234].
ABNF for JSON names
name = ALPHA *( ALPHA / DIGIT / "_" )
Figure 5
This restriction is a union of the Ruby programming language identifier syntax and the XML element name syntax and has two purposes. First, client implementers using modern programming languages such as Ruby or Java may use libraries that automatically promote JSON names to first order object attributes or members (e.g. using the example above, the values may be referenced as network.handle or network.lunarNic_beforeOneSmallStep). Second, a clean mapping between JSON and XML is easy to accomplish using the JSON representation.
Clients processing JSON responses MUST be prepared for values specified in the registry response documents to be absent from a response as no JSON value listed is required to appear in the response. In other words, servers MAY remove values as is needed by the policies of the server operator.
Clients may signal their desire for XML using the "application/xml" media type or a more application specific XML media type.
Well-formed XML may be programmatically produced using the JSON encodings due to the JSON naming rules outlined in Section 6.2 and the following simple rules:
Consider the following JSON response.
{ "startAddress" : "10.0.0.0", "endAddress" : "10.0.0.255", "remarks" : [ "she sells seas shells", "down by the seashore" ], "uris" : [ { "type" : "source", "uri" : "http://whois-rws.net/network/xxxx" }, { "type" : "parent", "uri" : "http://whois-rws.net/network/yyyy" } ] }
Figure 6
The corresponding XML would look like this:
<response> <startAddress>10.0.0.0</startAddress> <endAddress>10.0.0.255</endAddress> <remarks>She sells sea shells</remarks> <remarks>down by the seashore</remarks> <uris> <type>source</type> <uri>http://whois-rws.net/network/xxxx</uri> </uris> <uris> <type>parent</type> <uri>http://whois-rws.net/network/yyyy</uri> </uris> </response>
JSON values converted to XML element content MUST be properly escaped. XML offers various means for escaping data, but such escaping MUST account for the '<', '>', and '&' characters and MUST redact all C0 control characters except tab, carriage return, and new-line. (Redaction of disallowed control characters is a protocol requirement, though in practice most Internet registries do not allow this data in their data stores and therefore do not need to account for this rule.)
The rules for clients processing XML responses are the same as those with JSON: clients SHOULD ignore unrecognized XML elements, and servers MAY insert XML elements with tag names according to the naming rules in Section 6.2. And as with JSON, clients MUST be prepared for XML elements specified in the registry response documents to be absent from a response as no XML element listed is required to appear in the response.
As specified in Section 5, some non-answer responses may return entity bodies with information that could be more descriptive.
The basic structure of that response is a data class containing an error code number (corresponding to the HTTP response code) followed by a string named "title" followed by an array of strings named "description".
This is an example of the JSON version of the common response body.
{ "errorCode": 418 "title": "Your beverage choice is not available", "description": [ "I know coffee has more ummppphhh.", "But I cannot provide." ] }
Figure 7
This is an example of the XML version of the common response body.
<response> <errorCode>418</errorCode> <title>Your beverage choice is not available</title> <description>I know coffee has more ummppphhh.</description> <description>But I cannot provide.</description> </response>
Figure 8
The media type for the JSON structure is "application/rdap_error+json" and the media type for the XML document is "application/rdap_error+xml". Conformance to this specification is considered to be level 0 for both media types.
A client MAY simply use the HTTP response code as the server is not required to include error data in the response body. However, if a client wishes to parse the error data, it SHOULD first check that the Content-Type header contains the appropriate media type.
This section defines two common data structures to be used by DNRD-AP, NRRD-AP, and other RD-AP protocols. As such, the names identifying these data structures are not to be redefined by any registry specific RD-AP specifications. Each of these datatypes MAY appear within any other data object of a response, but the intended purpose is that they will be mostly used in the top-most data object of a response.
The first data structure is named "rdapConformance" and is simply an array of strings, each providing a hint as to the specifications used in the construction of the response.
An example rdapConformance data structure.
"rdapConformance" : [ "nrrdap_level_0" ]
Figure 9
The second data structure is named "notices" and is an array of "notice" objects. Each "notice" object contains a "title" string representing the title of the notice object, an array of strings named "description" for the purposes of conveying any descriptive text about the notice, and a "uri" string holding a URI referencing a service that may provide additional information about the notice.
An exmaple of the notices data structure.
"notices" : [ "notice" : { "title" : "Terms of Use", "description" : [ "This service is subject to The Registry of the Moons", "terms of service." ], "uri" : "http://example.com/our-terms-of-use" } ]
Figure 10
This is an example response with both rdapConformance and notices embedded.
{ "rdapConformance" : [ "nrrdap_level_0" ] "notices" : [ "notice" : { "title" : "Content Redacted", "description" : [ "Without full authorization, content has been redacted.", "Sorry, dude!" ], "uri" : "http://example.com/our-redaction-policies" } ] "startAddress" : "10.0.0.0", "endAddress" : "10.0.0.255", "remarks" : [ "she sells seas shells", "down by the seashore" ], "uris" : [ { "type" : "source", "uri" : "http://whois-rws.net/network/xxxx" }, { "type" : "parent", "uri" : "http://whois-rws.net/network/yyyy" } ] }
Figure 11
This section describes common data types found in Internet registries, the purpose being a common and normalized list of normative references to other specifications to be used by multiple RD-AP applications. Unless otherwise stated by the response specification of an Internet registry using this specification as a basis, the data types can assume to be as follows:
This specification registers the "application/rdap_error+json" media type.
This specification registers the "application/rdap_error+xml" media type.
Clients MAY use IRIs as they see fit, but MUST transform them to URIs [RFC3986] for interaction with RD-AP servers. RD-AP servers MUST use URIs in all responses, and clients MAY transform these URIs to IRIs.
The default text encoding for JSON and XML responses in RD-AP is UTF-8, and all servers and clients MUST support UTF-8. Servers and clients MAY optionally support other character encodings.
To overcome issues with misbehaving HTTP [RFC2616] cache infrastructure, clients may use the adhoc and improbably used query parameter with a random value of their choosing. As Section 4.2 instructs servers to ignore unknown parameters, this is unlikely to have any known side effects.
An example of using an unknown query parameter to bust caches:
http://example.com/ip/192.0.2.0?__fuhgetaboutit=xyz123
Use of an unknown parameter to overcome misbehaving caches is not part of any specification and is offered here for informational purposes.
Things that need to be done to this draft.