Internet DRAFT - draft-kahn-dsii-id-res-sys
draft-kahn-dsii-id-res-sys
INTERNET-DRAFT R. Kahn
IETF Stream CNRI
Intended Status: Informational A. Maffei
WHOI
Expires: 26 December 2012 24 June 2012
Terminology and Use Cases for Interoperability of
Identifier Resolution Systems
draft-kahn-dsii-id-res-sys-00
Abstract
Identifier Systems have been in existence for many years and are in
widespread use. An opaque identifier conveys essentially nothing
about the information identified and must be converted into useful
intermediate information, known as state information, before the
information being identified can be accessed or used. An identifier
that is can be processed by a system in the Internet to provide
useful and actionable intermediate state information is said to be
resolvable. Although there is ongoing discussion about the value of
opaque versus non-opaque identifiers, we assume the existence of a
minimal syntax structure for the identifier and also that the
resolution mechanisms only knows about the minimal syntax of the
identifier. In this document we introduce the notion of
interoperability of identifier resolution systems as a key component
for enabling interoperability of heterogeneous information systems
more broadly, and discuss the role of unique persistent resolvable
identifiers. Terminology is proposed to facilitate this discussion
and a set of use cases demonstrating the need for interoperability of
resolution systems are included.
Status of this Memo
This Internet-Draft is submitted to IETF in full conformance with the
provisions of BCP 78 and BCP 79.
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF), its areas, and its working groups. Note that
other groups may also distribute working documents as
Internet-Drafts.
Internet-Drafts are draft documents valid for a maximum of six months
and may be updated, replaced, or obsoleted by other documents at any
time. It is inappropriate to use Internet-Drafts as reference
material or to cite them other than as "work in progress."
Kahn and Maffei Expires 26 December 2012 [Page 1]
INTERNET-DRAFT 24 June 2012
The list of current Internet-Drafts can be accessed at
http://www.ietf.org/1id-abstracts.html
The list of Internet-Draft Shadow Directories can be accessed at
http://www.ietf.org/shadow.html
Copyright and License Notice
Copyright (c) 2012 IETF Trust and the persons identified as the
document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal
Provisions Relating to IETF Documents
(http://trustee.ietf.org/license-info) in effect on the date of
publication of this document. Please review these documents
carefully, as they describe your rights and restrictions with respect
to this document. Code Components extracted from this document must
include Simplified BSD License text as described in Section 4.e of
the Trust Legal Provisions and are provided without warranty as
described in the Simplified BSD License.
1. Introduction
Identifier Systems have been in existence for many years and are in
widespread use in society. They range from the simplest systems of
enumeration to more complex means of conveying information about the
things, values, services, or other identified items. Opaque
identifiers, such as strings from cryptographic hash functions,
convey essentially nothing about the information identified and must
be resolved before they can be used in a digital environment. An
identifier that can be processed by a system in the Internet to
provide useful and actionable state information is said to be
resolvable. In this document, we focus on resolvable identifiers and
the resolution systems that support them. Specifically, we introduce
the notion of interoperability of resolution systems and discuss the
notion of unique persistent resolvable identifiers along with a
common terminology for discussing these systems. This capability has
been implemented as part of the Digital Object Architecture [1].
Finally, a set of selected use cases that support this notion is also
provided.
Kahn and Maffei Expires 26 December 2012 [Page 2]
INTERNET-DRAFT 24 June 2012
2. Definitions
Identifier
While some systems of identification have used names and semantics
to identify a variety of things, many other systems have used
combinations of syntax and semantics. In this document we assume
no semantics are available in the identifier and only enough
syntax for the resolution system to understand how to resolve it.
Identifier Generation Rules
These rules specify how to create a well-formed identifier for use
within a resolution system. Normally, the rules will consist of
syntax constraints, but may go beyond that, as in requiring that
some or all of an identifier be generated in a specific manner.
Identifierspace
This refers to the set of identifiers that are generated using the
rules.
Digital Object
A digital object consists of structured data in the form of a bit
sequence or a set of bit sequences that can be interpreted by a
computer or other computational facility. Each sequence consists
of a type-value pair, at least one of which is the associated
unique persistent identifier for the digital object. A digital
object may incorporate separately identifiable digital objects or
only the identifiers for those objects or a combination of the
two. The digital objects need not all be stored in one place, or
stored at all, but if they are stored, they are generally assumed
to be accessible from the one or more known locations in the
Internet. Indeed some digital objects may constitute mobile
programs that given specific tasks to perform in the Internet, and
would interact with other programs, registries, repositories or
information resources structured as digital objects in order to
carry out its assigned tasks.
Resolution System
A system in the Internet that accepts an identifier as input and
produces useful and actionable "state information" such that, for
example, the digital object associated with that identifier may be
accessed and used. This state information is intended to inform
the user or the user's system as to specific actions to take,
restrictions on actions and associated information such as a means
Kahn and Maffei Expires 26 December 2012 [Page 3]
INTERNET-DRAFT 24 June 2012
of authentication, public keys, and whether certain actions will
incur costs or require allocation of significant resources such as
storage. It is assumed that many different resolution systems may
exist in the Internet. Each such system may have its own
identifier rules and produce information in different ways.
Uniqueness
An identifier is said to be unique at a given time with respect to
a given resolution system if there is only one authorized set of
useful and actionable information held by the resolution system
for that identifier, and that information is produced by a
resolution request with that identifier.
Persistence
An identifier is said to be persistent if its associated
resolution system can be relied upon to return useful and
actionable state information about the identified digital object
over a period of time that is at least as long as the digital
object exists and in the face of changes to the digital object,
such as ownership and location, without the identifier itself
changing. The period of time may vary somewhat depending on the
domain of use. For example, the identifier may outlast the
identified digital object, such that a reference to it using the
identifier can be resolved to useful and actionable state
information even when the digital object is no longer available.
The quality of persistence is dependent on a combination of social
and business infrastructure, e.g., an organization assuming
responsibility for maintaining the resolution information and
underlying resolution system and technology.
Immutability
The term immutable refers to a digital object and its associated
identifier that is fixed and does not change. Other than the
digital object identifier, however, other aspects of the digital
object metadata may change over time as, for example, the object
is moved from place to place, as long as the basic structured
information that comprises the digital object does not change over
time.
Mutability
This term refers to a digital object that is not immutable.
Kahn and Maffei Expires 26 December 2012 [Page 4]
INTERNET-DRAFT 24 June 2012
Structured Data and Metadata
Normally, a resolution request leads to one of two basic results.
One result is obtaining structured data in digital form that
represents a service, resource or other information represented in
digital form such as a document, bill of lading, or movie. The
other result is obtaining metadata about the service, resource, or
information. In this document, we focus on metadata that is used
for rapid reaction by the user's system, as in clicking an
identifier to find the location(s) of the item, or even the
location of the systems to provide parts of the metadata. More
general forms of metadata, such as might typically be used in a
search system, would not be provided by the resolution system;
rather, the user could be referred to a more substantive registry
of such information with advanced browsing and searching
capabilities.
3. Use cases for Actionable Identifier Resolver Interoperability
Interoperable Proxying
Let us assume a user of resolution system A needs to resolve an
identifier known only to resolution system B. We assume the user
does not have the software needed to use resolution system B, or
else the user might try to resolve the identifier directly from
resolution system B. Rather, the user asks resolution system A to
do the resolution and system A takes on the responsibility to ask
resolution system B. These two resolution systems are
interoperable if a resolution request sent from resolution system
A to resolution system B will succeed in producing the information
in a form usable by the party who requested it. Typically, this
will require a kind of service level agreement (SLA) between the
two resolution system administrators to share information about
how they store and otherwise manage resolution information in
order to produce a useful response to the user, and also to agree
on what various fields mean in order to do an effective
translation into the terms of the asking resolution system.
This document does not address how such SLAs should be structured
or how any resolution system should operate internally. Most
likely, only a small number of resolution systems will be
desirable for handling the bulk of identifier resolution
requirements. A list of such systems could be made available in
the Internet, if desired.
Each resolution system may have its own means of providing
security, but this document does not address how security may be
achieved for the overall interoperable proxy service.
Kahn and Maffei Expires 26 December 2012 [Page 5]
INTERNET-DRAFT 24 June 2012
Distributed replica management
Retention and caching decisions depend on the status of replicas
of digital objects managed by multiple, distributed services.
Duplicates may, in principle, have different identifiers in
different systems. In order to query or harvest the status of
these digital objects, to make retention or migration decisions
(e.g., deduplication, or migrating a digital object which a
service is about to drop), it might be necessary to resolve
multiple identifiers against multiple resolution services.
Provenance tracking
In a complex distributed workflow, source data and related data
services may be managed separately, perhaps via heterogeneous
systems that rely on different identifier systems. In order to
reason over provenance it may be necessary to access these objects
or their metadata.
Data fusion
In this case a digital object is assembled from component digital
objects, which may not all be accessible via the same service or
within the same identifierspace; as a result, the identifiers may
not all be resolvable in the same identifierspace and the assembly
process would require resolving each identifier separately.
Harvesting and spidering
In this case, which is similar to the data fusion case, an agent
fetches a digital object which contains references to other
digital objects and then needs to be able to resolve those
references recursively.
4. Security Considerations
<This section purposely left blank in this version.>
5. IANA Considerations
No IANA actions are needed.
6. Informational References
[1] Corporation for National Research Initiatives, "A brief overview
of the Digital Object Architecture", 1 June 2010.
<http://cnri.reston.va.us/papers/
OverviewDigitalObjectArchitecture.pdf>
Kahn and Maffei Expires 26 December 2012 [Page 6]
INTERNET-DRAFT 24 June 2012
Authors' Addresses
Robert Kahn
Corporation for National Research Initiatives
1895 Preston White Drive, Suite 100
Reston, VA 20191
USA
EMail: rkahn@cnri.reston.va.us
Andrew Maffei
Woods Hole Oceanographic Institution
Mailstop 44
Woods Hole, MA 02543
USA
EMail: amaffei@whoi.edu
Kahn and Maffei Expires 26 December 2012 [Page 7]