Internet DRAFT - draft-murata-atom-phonetics
draft-murata-atom-phonetics
Network Working Group M. Murata
Internet-Draft Japan Electronic Publishing Association
Intended status: Informational September 18, 2013
Expires: March 22, 2014
Representing phonetics for Japanese names in Atom feeds
draft-murata-atom-phonetics-04
Abstract
This specification introduces an attribute for representing Japanese
phonetics within feeds of the Atom Syndication Format [RFC4287].
Such phonetics MAY be attached to author names and article titles.
This attribute is applicable to OPDS(Open Publication Distribution
System)[OPDS], which is based on Atom.
Status of This Memo
This Internet-Draft is submitted in full conformance with the
provisions of BCP 78 and BCP 79.
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF). Note that other groups may also distribute
working documents as Internet-Drafts. The list of current Internet-
Drafts is at http://datatracker.ietf.org/drafts/current/.
Internet-Drafts are draft documents valid for a maximum of six months
and may be updated, replaced, or obsoleted by other documents at any
time. It is inappropriate to use Internet-Drafts as reference
material or to cite them other than as "work in progress."
This Internet-Draft will expire on March 22, 2014.
Copyright Notice
Copyright (c) 2013 IETF Trust and the persons identified as the
document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal
Provisions Relating to IETF Documents
(http://trustee.ietf.org/license-info) in effect on the date of
publication of this document. Please review these documents
carefully, as they describe your rights and restrictions with respect
to this document. Code Components extracted from this document must
include Simplified BSD License text as described in Section 4.e of
the Trust Legal Provisions and are provided without warranty as
described in the Simplified BSD License.
Murata Expires March 22, 2014 [Page 1]
Internet-Draft PhoneticsInAtom September 2013
1. Introduction
The Japanese language uses kanji (CJK ideographic characters) and
kana(Japanese phonetic characters). A human name or article title is
often represented by two strings: one with kanji and another without
kanji. The latter is used to unambiguously specify the phonetics of
the human name or article title. More about this issue, see
[Miyazawa].
The Atom Syndication Format [RFC4287] does not provide mechanisms for
representing a name in more than one language or script. As a
solution specialized to Japanese phonetics, this specification
introduces an attribute for representing the phonetics for a Japanese
author name or article title. This attribute is applicable to
OPDS(Open Publication Distribution System)[OPDS], which is based on
Atom.
2. The phonetics attribute
_Namespace:_ http://www.jepa.or.jp/xmlns/atomPhonetics/1.0
_Local name:_ phonetics
_Parent element:_ atom:name, atom:title, and atom:subTitle
_Cardinality:_ 0 or 1
_Permissible values:_ Strings in the Japanese language without CJK
ideographic characters. Such strings typically consists of
Katakana or Hiragana characters but may include digits, symbols
and Latin alphabets. [NDL] gives one way to create the phonetics
of a given name and shows commonly used characters for phonetics.
2.1. Example
Murata Expires March 22, 2014 [Page 2]
Internet-Draft PhoneticsInAtom September 2013
<feed
xmlns="http://www.w3.org/2005/Atom"
xmlns:jepa="http://www.jepa.or.jp/xmlns/atomPhonetics/1.0">
<title
jepa:phonetics="XMLニュウモン"
>XML入門
</title>
<author>
<name jepa:phonetics="ムラタ"
>村田</name>
</author>
...
</feed>
"ニュウモン" and
"ムラタ" are Katakana strings, while
"入門" and "村田" are CJK ideographic
character strings.
3. RELAX NG schema
The original schema (atom.rnc) allows any foreign attribute, and is
thus usable for validating feeds containing the attribute for
Japanese phonetics. However, the following schema is provided as an
extension of the original schema for explicitly allowing the
phonetics attribute for name, title, and subtitle elements and no
other elements. Instead of the original schema, this schema MAY be
used for validating atom feeds having phonetics.
namespace atom = "http://www.w3.org/2005/Atom"
namespace jepa = "http://www.jepa.or.jp/xmlns/atomPhonetics/1.0"
namespace local = ""
include "atom.rnc" {
undefinedAttribute =
attribute * - (xml:base | xml:lang | local:* | jepa:*) { text }
atomPersonConstruct =
atomCommonAttributes,
(element atom:name {attribute jepa:phonetics {text }?, text }
& element atom:uri { atomUri }?
& element atom:email { atomEmailAddress }?
& extensionElement*)
atomTitle =
element atom:title
{attribute jepa:phonetics { text }?, atomTextConstruct }
atomSubtitle =
element atom:subtitle
{attribute jepa:phonetics { text }?, atomTextConstruct }
Murata Expires March 22, 2014 [Page 3]
Internet-Draft PhoneticsInAtom September 2013
}
4. Schematron schema
This additional schema is provided for explicitly allowing the
phonetics attribute for name, title, and subtitle elements and no
other elements. This schema MAY be used for validating atom feeds
having phonetics. It MAY be used together with the original schema
(atom.rnc) or schemas for OPDS.
<?xml version="1.0" encoding="UTF-8"?>
<schema xmlns="http://purl.oclc.org/dsdl/schematron"
xmlns:atom="http://www.w3.org/2005/Atom"
xmlns:jepa="http://www.jepa.or.jp/xmlns/atomPhonetics/1.0">
<ns prefix="atom" uri="http://www.w3.org/2005/Atom"/>
<ns prefix="jepa"
uri="http://www.jepa.or.jp/xmlns/atomPhonetics/1.0"/>
<pattern >
<rule context="atom:*[@jepa:phonetics]">
<let name="ln" value="local-name()"/>
<assert test="($ln='name')
or ($ln='title')
or ($ln = 'subTitle') "
>The jepa:phonetics attribute is allowed only
for atom:name, atom:title, and atom:subTitle
elements.</assert>
</rule>
</pattern>
</schema>
5. Security Considerations
There are no new issues beyond those shown in [RFC4287].
6. References
6.1. Normative References
[RFC2119] Bradner, S., "Key words for use in RFCs to Indicate
Requirement Levels", BCP 14, RFC 2119, March 1997.
[RFC4287] Nottingham, M., Ed. and R. Sayre, Ed., "The Atom
Syndication Format", RFC 4287, December 2005.
[Miyazawa]
Murata Expires March 22, 2014 [Page 4]
Internet-Draft PhoneticsInAtom September 2013
Miyazawa, A., "Parallel writing in East Asian languages
and its representation in metadata in light of the DCMI
Abstract Model, Int'l Conf. on Dublin Core and Metadata
Applications", 2007, <http://dcpapers.dublincore.org/
index.php/pubs/article/view/863/859>.
6.2. Informative References
[OPDS] , "Open Publication Distribution System", ,
<http://opds-spec.org/>.
[NDL] , "On phonetics (in Japanese)", ,
<http://www.ndl.go.jp/jp/library/data/yomi.html>.
Appendix A. Acknowldgements
The author is grateful to Hiroshi Hoshi, Ando Kazuhiro, Akira
Miyazawa, Akimoto Ryoji, Masayoshi Takahashi, Hiroshi Takase, and
Toshiyuki Takashima for their feedbacks.
Author's Address
MURATA Makoto (FAMILY Given)
Japan Electronic Publishing Association
Email: eb2m-mrt@asahi-net.or.jp
URI: http://www.asahi-net.or.jp/~eb2m-mrt/
Murata Expires March 22, 2014 [Page 5]