Internet DRAFT - draft-thierry-bulk-xml
draft-thierry-bulk-xml
Network Working Group P. Thierry
Internet-Draft Thierry Technologies
Intended status: Experimental August 7, 2013
Expires: February 8, 2014
BULK Simple XML namespace
draft-thierry-bulk-xml-00
Abstract
This specification describes a BULK serialization of XML data.
Status of This Memo
This Internet-Draft is submitted in full conformance with the
provisions of BCP 78 and BCP 79.
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF). Note that other groups may also distribute
working documents as Internet-Drafts. The list of current Internet-
Drafts is at http://datatracker.ietf.org/drafts/current/.
Internet-Drafts are draft documents valid for a maximum of six months
and may be updated, replaced, or obsoleted by other documents at any
time. It is inappropriate to use Internet-Drafts as reference
material or to cite them other than as "work in progress."
This Internet-Draft will expire on February 8, 2014.
Copyright Notice
Copyright (c) 2013 IETF Trust and the persons identified as the
document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal
Provisions Relating to IETF Documents
(http://trustee.ietf.org/license-info) in effect on the date of
publication of this document. Please review these documents
carefully, as they describe your rights and restrictions with respect
to this document. Code Components extracted from this document must
include Simplified BSD License text as described in Section 4.e of
the Trust Legal Provisions and are provided without warranty as
described in the Simplified BSD License.
Thierry Expires February 8, 2014 [Page 1]
Internet-Draft BULK-XML August 2013
Table of Contents
1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.1. Conventions and Terminology . . . . . . . . . . . . . . . . 3
2. BULK XML namespace . . . . . . . . . . . . . . . . . . . . . . 3
2.1. XML 1.0 entity . . . . . . . . . . . . . . . . . . . . . . 4
2.2. XML 1.1 entity . . . . . . . . . . . . . . . . . . . . . . 4
2.3. Processing instruction . . . . . . . . . . . . . . . . . . 4
2.4. Comment . . . . . . . . . . . . . . . . . . . . . . . . . . 4
2.5. Element . . . . . . . . . . . . . . . . . . . . . . . . . . 5
2.6. Attribute . . . . . . . . . . . . . . . . . . . . . . . . . 5
2.7. xml: namespace . . . . . . . . . . . . . . . . . . . . . . 5
2.8. xmlns: namespace . . . . . . . . . . . . . . . . . . . . . 5
2.9. space preservation . . . . . . . . . . . . . . . . . . . . 5
3. References . . . . . . . . . . . . . . . . . . . . . . . . . . 5
3.1. Normative References . . . . . . . . . . . . . . . . . . . 5
3.2. Informative references . . . . . . . . . . . . . . . . . . 5
Appendix A. Shorthand examples . . . . . . . . . . . . . . . . . . 6
Thierry Expires February 8, 2014 [Page 2]
Internet-Draft BULK-XML August 2013
1. Introduction
1.1. Conventions and Terminology
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
"SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
document are to be interpreted as described in RFC 2119 [RFC2119].
Literal numerical values are provided in decimal or hexadecimal as
appropriate. Hexadecimal literals are prefixed with "0x" to
distinguish them from decimal literals.
BULK bytes sequences and expressions are described with the same
conventions than used in the BULK 1.0 specification [BULK1]
2. BULK XML namespace
The Simple XML namespace is an official namespace identified by the
UUID <urn:uuid:8e9fbe9c-2b0c-5307-bb81-edb1a7727563> (BULK,
http://www.w3.org/XML/Core/).
It is a simplistic notation for the XML data available after parsing
an XML document is complete. As such, it doesn't provide a way to
serialize some components of an XML document, such as the contents of
the DTD or unparsed entities. In particular, the encoding
declaration of the XML prolog is not serialized, as BULK sreams
already carry encoding information. As for any other BULK data,
everywhere within an Simple XML form, a "stringenc" form can appear.
An application writing Simple XML data SHOULD use "rdf:prefix" to
denote element and attribute prefixed names and arrays for unprefixed
names. The semantics of the "rdf:prefix" form are application-
dependent; an application could choose to evaluate it without
changing its value beforehand, thus using expanded names (cf.
[XML-NAMES10] and [XML-NAMES11]), or follow the XML Infoset
[XML-INFOSET] principle of keeping the namespace and local part of
the name separate.
Note that Simple XML cannot store the original prefix associated with
a namespace. The goal is to be able to serialize just enough data
from a parsed XML document that an equivalent XML document can be
unserialized. By equivalent, we mean that it would have the same
semantics as the original XML document for applications using this
XML format (but defining this latter equivalence in a remotely decent
way is outside the scope of this specification).
Character data (corresponding to "CDATA" sections and "#PCDATA" in
XML) SHOULD be serialized as arrays.
Thierry Expires February 8, 2014 [Page 3]
Internet-Draft BULK-XML August 2013
Note: authors of BULK XML namespaces are encouraged to define names
in such a way that frequently-used expressions can be serialized as
compactly as possible, using "rdf:prefix", "define", "subst" or any
similar constructs. This is particularly straightforward and useful
for enumerated attributes (but could be used for more complex
structures).
2.1. XML 1.0 entity
name "0x1" (mnemonic: "xml1.0" )
shape "( xml1.0 {content} )"
This form denotes an entity (document entity or external entity)
whose XML version is 1.0.
Type: "Entity"
2.2. XML 1.1 entity
name "0x2" (mnemonic: "xml1.1" )
shape "( xml1.1 {content} )"
This form denotes an entity (document entity or external entity)
whose XML version is 1.1.
Type: "Entity"
2.3. Processing instruction
name "0x3" (mnemonic: "pi" )
shape "( pi {target} {content} )"
Type: "PI"
2.4. Comment
name "0x4" (mnemonic: "comment" )
shape "( comment {content} )"
Type: "Comment"
Thierry Expires February 8, 2014 [Page 4]
Internet-Draft BULK-XML August 2013
2.5. Element
name "0x5" (mnemonic: "element" )
shape "( element {name} {content} )"
Type: "Element"
2.6. Attribute
name "0x6" (mnemonic: "attribute" )
shape "( attribute {name} {value} )"
Type: "Attribute"
2.7. xml: namespace
name "0x7" (mnemonic: "xml:" )
value "( prefix "http://www.w3.org/XML/1998/namespace" )"
2.8. xmlns: namespace
name "0x8" (mnemonic: "xmlns:" )
value "( prefix "http://www.w3.org/2000/xmlns/" )"
2.9. space preservation
name "0x9" (mnemonic: "preserve" )
value "( attribute ( xml: "space" ) "preserve" )"
3. References
3.1. Normative References
[BULK1] Thierry, P., "Binary Uniform Language Kit 1.0",
draft-thierry-bulk-02 (work in progress), August 2013.
[RFC2119] Bradner, S., "Key words for use in RFCs to Indicate
Requirement Levels", BCP 14, RFC 2119, March 1997.
3.2. Informative references
[XML-INFOSET] Cowan, J. and R. Tobin, "XML Information Set (Second
Edition)", February 2004,
Thierry Expires February 8, 2014 [Page 5]
Internet-Draft BULK-XML August 2013
<http://www.w3.org/TR/2004/REC-xml-infoset-20040204>.
[XML-NAMES10] Bray, T., Hollander, D., Layman, A., Tobin, R., and H.
Thompson, "Namespaces in XML 1.0 (Third Edition)",
December 2009, <http://www.w3.org/TR/xml-names/>.
[XML-NAMES11] Bray, T., Hollander, D., Layman, A., and R. Tobin,
"Namespaces in XML 1.1 (Second Edition)", August 2006,
<http://www.w3.org/TR/xml-names11/>.
[XML10] Bray, T., Paoli, J., Sperberg-McQueen, C., Maler, E.,
and F. Yergeau, "Extensible Markup Language (XML) 1.0
(Fifth Edition)", November 2008,
<http://www.w3.org/TR/xml/>.
[XML11] Bray, T., Paoli, J., Sperberg-McQueen, C., Maler, E.,
Yergeau, F., and J. Cowan, "Extensible Markup Language
(XML) 1.1 (Second Edition)", August 2006,
<http://www.w3.org/TR/xml11/>.
Appendix A. Shorthand examples
Here are a couple of examples of the kind of definitions that can be
used to increase compactness in an XML serialization.
First, this specification defines general forms for XML data. But
namespaces can be dedicated to XML formats, with names for each
element, attribute or PI:
o "( define ?rfc ( subst ( pi "rfc" ( rest 0 ) ) ) )"
o "( define t ( subst ( element "t" ( rest 0 ) ) ) )"
Secondly, recurring patterns can be abbreviated:
o "( define thang ( subst ( t ( attribute "hangText" ( arg 0 ) ) (
rest 1 )"
o "( define verb ( subst ( spanx ( attribute "verb" ) ( rest 0 ) ) )
)"
Author's Address
Pierre Thierry
Thierry Technologies
EMail: pierre@nothos.net
Thierry Expires February 8, 2014 [Page 6]