Internet DRAFT - draft-alimi-protocol
draft-alimi-protocol
APPSAWG R. Alimi
Internet-Draft Google
Intended status: Informational A. Rahman
Expires: December 12, 2013 InterDigital Communications, LLC
D. Kutscher
NEC
Y. Yang
Yale University
H. Song
K. Pentikousis
Huawei Technologies
June 10, 2013
DECADE: DECoupled Application Data Enroute
draft-alimi-protocol-01
Abstract
Content distribution applications, such as those those employing
peer-to-peer (P2P) technologies, are widely used on the Internet and
make up a large portion of the traffic in many networks. Often,
however, content distribution applications use network resources in a
counter-productive manner. One way to improve efficiency is to
introduce storage capabilities within the network and enable
cooperation between end-host and in-network content distribution
mechanisms. This is the capability provided by a DECADE-compatible
system, which is introduced in this document. DECADE enables
applications to take advantage of in-network storage when
distributing data objects as opposed to using solely end-to-end
resources. This document presents the underlying principles and key
functionalities of such a system and illustrates operation through a
set of examples.
Requirements Language
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
"SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
document are to be interpreted as described in RFC 2119 [RFC2119].
Status of This Memo
This Internet-Draft is submitted in full conformance with the
provisions of BCP 78 and BCP 79.
Alimi, et al. Expires December 12, 2013 [Page 1]
Internet-Draft DECADE Architecture June 2013
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF). Note that other groups may also distribute
working documents as Internet-Drafts. The list of current Internet-
Drafts is at http://datatracker.ietf.org/drafts/current/.
Internet-Drafts are draft documents valid for a maximum of six months
and may be updated, replaced, or obsoleted by other documents at any
time. It is inappropriate to use Internet-Drafts as reference
material or to cite them other than as "work in progress."
This Internet-Draft will expire on December 12, 2013.
Copyright Notice
Copyright (c) 2013 IETF Trust and the persons identified as the
document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal
Provisions Relating to IETF Documents
(http://trustee.ietf.org/license-info) in effect on the date of
publication of this document. Please review these documents
carefully, as they describe your rights and restrictions with respect
to this document. Code Components extracted from this document must
include Simplified BSD License text as described in Section 4.e of
the Trust Legal Provisions and are provided without warranty as
described in the Simplified BSD License.
Table of Contents
1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . 3
2. Architectural Principles . . . . . . . . . . . . . . . . . . 5
2.1. Data and Control/Metadata Plane Decoupling . . . . . . . 5
2.2. Immutable Data Objects . . . . . . . . . . . . . . . . . 6
2.3. Data Object Identifiers . . . . . . . . . . . . . . . . . 7
2.4. Explicit Control . . . . . . . . . . . . . . . . . . . . 8
2.5. Resource and Data Access Control through Delegation . . . 8
3. System Components . . . . . . . . . . . . . . . . . . . . . . 9
3.1. Content Distribution Application . . . . . . . . . . . . 9
3.2. DECADE Client . . . . . . . . . . . . . . . . . . . . . . 10
3.3. DECADE Server . . . . . . . . . . . . . . . . . . . . . . 11
3.4. Data Sequencing and Naming . . . . . . . . . . . . . . . 12
3.5. Token-based Authorization and Resource Control . . . . . 13
3.6. Discovery . . . . . . . . . . . . . . . . . . . . . . . . 14
4. DECADE Protocol Design . . . . . . . . . . . . . . . . . . . 15
4.1. Naming . . . . . . . . . . . . . . . . . . . . . . . . . 15
4.2. Resource Protocol . . . . . . . . . . . . . . . . . . . . 15
4.3. Data Transfer . . . . . . . . . . . . . . . . . . . . . . 19
4.4. Server-to-Server Protocols . . . . . . . . . . . . . . . 19
Alimi, et al. Expires December 12, 2013 [Page 2]
Internet-Draft DECADE Architecture June 2013
5. In-Network Storage Components Mapping to DECADE . . . . . . . 20
6. Security Considerations . . . . . . . . . . . . . . . . . . . 21
6.1. Threat: System Denial of Service Attacks . . . . . . . . 21
6.2. Threat: Authorization Mechanisms Compromised . . . . . . 22
6.3. Threat: Data Object Spoofing . . . . . . . . . . . . . . 22
7. IANA Considerations . . . . . . . . . . . . . . . . . . . . . 23
8. Acknowledgments . . . . . . . . . . . . . . . . . . . . . . . 23
9. Informative References . . . . . . . . . . . . . . . . . . . 23
Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . 24
1. Introduction
Content distribution applications, such as peer-to-peer (P2P)
applications, are widely used on the Internet to distribute data
objects, and comprise a large portion of the traffic in many
networks. Said applications can often introduce performance
bottlenecks in otherwise well-provisioned networks. In some cases,
operators are forced to invest substantially in infrastructure to
accommodate the use of such applications. For instance, in many
subscriber networks, it can be expensive to upgrade network equipment
in the "last-mile", because it can involve replacing equipment and
upgrading wiring and devices at individual homes, businesses,
DSLAMs(Digital Subscriber Line Access Multiplexers) and CMTSs (Cable
Modem Termination Systems) in remote locations. It may be more
practical and economical to upgrade the core infrastructure, instead
of the edge part of the network, as this involves fewer components
that are shared by many subscribers. See [RFC6646] and [RFC6392] for
a more complete discussion of the problem domain and general
discussions of the capabilities envisioned for a DECADE system.
This document presents mechanisms for providing in-network storage
that can be integrated into content distribution applications. The
primary focus is P2P-based content distribution, but DECADE may be
useful to other applications with similar characteristics and
requirements. The approach we adopt in this document is to define
the core functionalities and protocol functions that are needed to
support a DECADE system. This document provides illustrative
examples so that implementers can understand the main concepts in
DECADE, but it is generally assumed that readers are familiar with
the terms and concepts used in [RFC6646] and [RFC6392].
Figure 1 is a schematic of a simple DECADE system with two DECADE
clients and two DECADE servers. As illustrated, a client uses the
DECADE Resource Protocol (DRP) to convey to a server information
related to access control and resource scheduling policies. DRP can
also be used between servers for exchanging this type of information.
A DECADE system employs standard data transfer (SDT) protocol(s) to
transfer data objects to and from a server, as we will explain later.
Alimi, et al. Expires December 12, 2013 [Page 3]
Internet-Draft DECADE Architecture June 2013
Native Application
.-------------. Protocol(s) .-------------.
| Application | <------------------> | Application |
| End-Point | | End-Point |
| | | |
| .--------. | | .--------. |
| | DECADE | | | | DECADE | |
| | Client | | | | Client | |
| `--------' | | `--------' |
`-------------' `-------------'
| ^ | ^
DECADE | | Standard | |
Resource | | Data DRP | | SDT
Protocol | | Transfer | |
(DRP) | | (SDT) | |
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
v V v V
.=============. DRP .=============.
| DECADE | <------------------> | DECADE |
| Server | <------------------> | Server |
`=============' SDT `============='
Figure 1: DECADE Overview
With Figure 1 at hand, assume that Application End-Point B requests a
data object from Application End-Point A. In this case, End-Point A
will act as the sender and End-Point B as the receiver for said data
object. Let S(A) denote the DECADE storage server to which A has
access. Figure 2 illustrates the four steps involved in the request,
starting with the initial contact between B and A during which the
former requests a data object using their native application protocol
(see Section 3.1). Next, A uses DRP to obtain a token corresponding
to the data object that was requested by B. There may be several ways
for A to obtain such a token, e.g., compute it locally or request one
from its DECADE storage server, S(A); see Section 4.2.1 for more
details. Once obtained, A then provides the token to B (again, using
their native application protocol). Finally, B provides the received
token to S(A) via DRP, and subsequently requests and downloads the
data object via SDT.
.----------.
2. Obtain --------> | S(A) | <------
Token / `----------' \ 4. Request and
Alimi, et al. Expires December 12, 2013 [Page 4]
Internet-Draft DECADE Architecture June 2013
(DRP) / \ Download Data
Locally / \ Object
or From / \ (DRP + SDT)
S(A) v 1. App Request v
.-------------. <--------------------------- .-------------.
| Application | | Application |
| End-Point A | | End-Point B |
`-------------' ---------------------------> `-------------'
3. App Response (token)
Figure 2: Download from Storage Server
2. Architectural Principles
This section presents the key principles followed by any DECADE
system.
2.1. Data and Control/Metadata Plane Decoupling
A DECADE system aims to be application-independent and SHOULD support
multiple content distribution applications. Typically, a complete
content distribution application implements a set of control plane
functions including content search, indexing and collection, access
control, replication, request routing, and QoS scheduling.
Implementers of different content distribution applications may have
unique considerations when designing the control plane functions.
For example, with respect to the metadata management scheme,
traditional file systems provide a standard metadata abstraction: a
recursive structure of directories to offer namespace management
where each file is an opaque byte stream. Content distribution
applications may use different metadata management schemes. For
instance, one application might use a sequence of blocks (e.g., for
file sharing), while another application might use a sequence of
frames (with different sizes) indexed by time.
With respect to resource scheduling algorithms, a major advantage of
many successful P2P systems is their substantial expertise in
achieving efficient utilization of peer resources. For instance,
many streaming P2P systems include optimization algorithms for
constructing overlay topologies that can support low-latency, high-
bandwidth streaming. The research community as well as implementers
of such systems continuously fine-tune existing algorithms and invent
new ones. A DECADE system should be able to accommodate and benefit
from all new developments.
In short, given the diversity of control plane functions, a DECADE
system should allow for as much flexibility as possible to the
Alimi, et al. Expires December 12, 2013 [Page 5]
Internet-Draft DECADE Architecture June 2013
control plane to implement specific policies. This conforms to the
end-to-end systems principle and allows innovation and satisfaction
of specific performance goals. Decoupling the control plane from the
data plane is not new, of course. For example, OpenFlow is an
implementation of this principle for Internet routing, where the
computation of the forwarding table and the application of the
forwarding table are separated. The Google File System
[GoogleFileSystem] applies the same principle to file system design
by utilizing a Master to handle meta-data management and several
Chunk servers to handle data plane functions (i.e., read and write of
chunks of data). Finally, NFSv4.1's pNFS extension [RFC5661] also
adheres to this principle.
2.2. Immutable Data Objects
A common property of bulk content to be broadly distributed is that
it is immutable -- once content is generated, it is typically not
modified. For example, once a movie has been edited and released for
distribution it is very uncommon that the corresponding video frames
and images need to be modified. The same applies to document
distribution, such as RFCs, audio files, such as podcasts, and
program patches. Focusing on immutable data can substantially
simplify data plane design, since consistency requirements can be
relaxed. It also simplifies data reuse and implementation of de-
duplication.
Depending on its specific requirements, an application may store
immutable data objects in DECADE servers such that each data object
is completely self-contained (e.g., a complete, independently
decodable video segment). An application may also divide data into
data objects that require application level assembly. Many content
distribution applications divide bulk content into data objects for
multiple reasons, including (a) fetching different data objects from
different sources in parallel; and (b) faster recovery and
verification as individual data objects might be recovered and
verified. Typically, applications use a data object size larger than
a single packet in order to reduce control overhead.
A DECADE system SHOULD be agnostic to the nature of the data objects
and SHOULD NOT specify a fixed size for them. A protocol
specification based on this architecture MAY prescribe requirements
on minimum and maximum sizes for compliant implementations.
Note that immutable data objects can still be deleted. Applications
can support modification of existing data stored at a DECADE server
through a combination of storing new data objects and deleting
existing data objects. For example, a meta-data management function
of the control plane might associate a name with a sequence of
Alimi, et al. Expires December 12, 2013 [Page 6]
Internet-Draft DECADE Architecture June 2013
immutable data objects. If one of the data objects is modified, the
meta-data management function changes the mapping of the name to a
new sequence of immutable data objects.
Throughout this document, all data objects are assumed to be
immutable.
2.3. Data Object Identifiers
A data object stored in a DECADE server SHALL be accessed by content
consumers via a data object identifier. Each content consumer may be
able to access more than one storage server. A data object that is
replicated across different storage servers managed by a DECADE
Storage Provider MAY be accessed through a single identifier. Since
data objects are immutable, it SHALL be possible to support
persistent identifiers for data objects.
Data object identifiers SHOULD be created by content providers when
uploading the corresponding objects to a DECADE server. The scheme
for the assignment/derivation of the data object identifier to a data
object depends as the data object naming scheme and is out of scope
of this document. One possibility is to name data objects using
hashes as described in [RFC6920]. Note that this document describes
naming schemes on a semantic level only but specific SDTs and DRPs
will use specific representations.
In particular, for some applications it is important that clients and
servers are able to validate the name-object binding, i.e., by
verifying that a received object really corresponds to the name
(identifier) that was used for requesting it (or that was provided by
a sender). Data object identifiers can support name-object binding
validation by providing message digests or so-called self-certifying
naming information -- if a specific application has this requirement.
Different name-object binding validation mechanisms MAY be supported
in a single DECADE system. Content distribution applications can
decide what mechanism to use, or to not provide name-object
validation (e.g., if authenticity and integrity can by ascertained by
alternative means). We expect that applications may be able to
construct unique names (with high probability) without requiring a
registry or other forms of coordination. Names may be self-
describing so that a receiving entity (i.e. the content consumer)
understands, for example, which hash function to use for validating
name-object binding.
Some content distribution applications will derive the name of a data
object from the hash over the data object, which is made possible by
the fact that DECADE objects are immutable. But there may be other
Alimi, et al. Expires December 12, 2013 [Page 7]
Internet-Draft DECADE Architecture June 2013
applications such as live streaming where object names will not based
on hashes but rather on an enumeration scheme. The naming scheme
will also enable those applications to construct unique names.
In order to enable the uniqueness, flexibility and self-describing
properties, the naming scheme used in a DECADE system SHOULD provide
a "type" field that indicates the name-object validation function
type (for example, "sha-256") and the cryptographic data (such as an
object hash) that corresponds to the type information. Moreover, the
naming scheme MAY additionally provide application or publisher
information.
The specific format of the name (e.g., encoding, hash algorithms,
etc.) is out of scope of this document.
2.4. Explicit Control
To support the functions of an application's control plane,
applications SHOULD be able to keep track and coordinate which data
is stored at particular servers. Thus, in contrast with traditional
caches, applications are given explicit control over the placement
(selection of a DECADE server), deletion (or expiration policy), and
access control for stored data objects. Consider deletion/expiration
policy as a simple example. An application might require that a
DECADE server stores data objects for a relatively short period of
time (e.g., for live-streaming data). Another application might need
to store data objects for a longer duration (e.g., for video-on-
demand), and so on.
2.5. Resource and Data Access Control through Delegation
A DECADE system provides a shared infrastructure to be used by
multiple content consumers and content providers spanning multiple
content distribution applications. Thus, it needs to provide both
resource and data access control, as discussed in the following
subsections.
2.5.1. Resource Allocation
There are two primary interacting entities in a DECADE system.
First, in-network storage providers coordinate DECADE server
provisioning, including their total available resource; see
Section 4.2.1. Second, applications coordinate data transfers
amongst available DECADE servers and between servers and clients. A
form of isolation is required to enable concurrently-running
applications to each explicitly manage its own data objects and share
of resources at the available servers. Therefore, a storage provider
should delegate resource management on a DECADE server to content
Alimi, et al. Expires December 12, 2013 [Page 8]
Internet-Draft DECADE Architecture June 2013
providers, enabling them to explicitly and independently manage their
own share of resources on a server.
2.5.2. User Delegation
In-network storage providers will have the ability to explicitly
manage the entities allowed to utilize the resources available on a
DECADE server. This is needed for reasons such as capacity-planning
and legal considerations in certain deployment scenarios. The DECADE
server SHOULD grant a share of the resources to the DECADE client of
a content provider or content consumer. The client can in turn share
the granted resources amongst its (possibly) multiple applications.
The share of resources granted by a server is called a User
Delegation. As a simple example, a DECADE server operated by an ISP
might be configured to grant each ISP subscriber 1.5 Mb/s of network
capacity. The ISP subscriber might in turn divide this share of
resources amongst a video streaming application and file-sharing
application which are running concurrently.
3. System Components
As noted earlier, the primary focus of this document is the
architectural principles and the system components that implement
them. While specific system components might differ between
implementations, this document details the major components and their
overall roles in the architecture. To keep the scope narrow, we only
discuss the primary components related to protocol development.
Particular deployments will require additional components (e.g.,
monitoring and accounting at a server), but they are intentionally
omitted from this document.
3.1. Content Distribution Application
Content distribution applications have many functional components.
For example, many P2P applications have components and algorithms to
manage overlay topology, rate allocation, piece selection, and so on.
In this document, we focus on the components directly engaged in a
DECADE system. Figure 3 illustrates the components discussed in this
section from the perspective of a single Application End-Point.
Native Protocol(s)
(with other Application End-Points)
.--------------------->
|
|
.----------------------------------------------------------------.
| Application End-Point |
| .-------------------. .-------------------. |
Alimi, et al. Expires December 12, 2013 [Page 9]
Internet-Draft DECADE Architecture June 2013
| | Application-Layer | ... | App Data Assembly | |
| | Algorithms | | Sequencing | |
| `-------------------' `-------------------' |
| |
| .==========================================================. |
| | DECADE Client | |
| | .-------------------------. .--------------------------. | |
| | | Resource Controller | | Data Controller | | |
| | | .--------. .----------. | | .------------. .-------. | | |
| | | | Data | | Resource | | | | Data | | Data | | | |
| | | | Access | | Sharing | | | | Scheduling | | Index | | | |
| | | | Policy | | Policy | | | | | | | | | |
| | | '--------' `----------' | | `------------' `-------' | | |
| | `-------------------------' `--------------------------' | |
| | | ^ | |
| `== | ============================== | ====================' |
`----- | ------------------------------ | -----------------------'
| |
| DECADE Resource Protocol (DRP) | Standard Data Transfer
v V
Figure 3: Application and DECADE Client Components
A DECADE system is geared towards supporting applications that can
distribute content using data objects. To accomplish this,
applications can include a component responsible for creating the
individual data objects before distribution and then re-assembling
data objects at the content consumer. We call this component
Application Data Assembly. In producing and assembling data objects,
two important considerations are sequencing and naming. A DECADE
system assumes that applications implement this functionality
themselves. See Section 4.1 for further discussion. In addition to
DECADE DRP/SDT, applications will most likely also support other,
native application protocols (e.g., P2P control and data transfer
protocols).
3.2. DECADE Client
The DECADE client provides the local support to an application, and
can be implemented standalone, embedded into the application, or
integrated in other entities such as network devices themselves. In
general, applications may have different Resource Sharing Policies
and Data Access Policies to control their resource and data in DECADE
servers. These policies may be existing policies of applications or
custom policies. The specific implementation is decided by the
application.
Alimi, et al. Expires December 12, 2013 [Page 10]
Internet-Draft DECADE Architecture June 2013
Recall that DECADE decouples the control and the data transfer of
applications. A Data Scheduling component schedules data transfers
according to network conditions, available servers, and/or available
server resources. The Data Index indicates data available at remote
servers. The Data Index (or a subset of it) can be advertised to
other clients. A common use case for this is to provide the ability
to locate data amongst distributed Application End-Points (i.e., a
data search mechanism such as a Distributed Hash Table).
3.3. DECADE Server
Figure 4 illustrates the primary components of a DECADE server. Note
that the description below does not assume a single-host or
centralized implementation: a DECADE server is not necessarily a
single physical machine but can also be implemented in a distributed
manner on a cluster of machines.
| DECADE Resource | Standard Data Transfer
| Protocol (DRP) |
| |
.= | ================= | ===========================.
| | v DECADE Server |
| | .----------------. |
| |----> | Access Control | <--------. |
| | `----------------' | |
| | ^ | |
| | | | |
| | v | |
| | .---------------------. | |
| `-> | Resource Scheduling | <------| |
| `---------------------' | |
| ^ | |
| | | |
| v .-----------------. |
| .-----------------. | User Delegation | |
| | Data Store | | Management | |
| `-----------------' `-----------------' |
`==================================================='
Figure 4: DECADE Server Components
Provided sufficient authorization, a client SHALL be able to access
its own data or other client's data in a DECADE server. Clients MAY
also authorize other clients to store data. If access is authorized
by a client, the server SHOULD provide access. Applications may
apply resource sharing policies or use a custom policy. DECADE
Servers will then perform resource scheduling according to the
resource sharing policies indicated by the client as well as any
Alimi, et al. Expires December 12, 2013 [Page 11]
Internet-Draft DECADE Architecture June 2013
other previously configured User Delegations. Data from applications
will be stored at a DECADE server. Data may be deleted from storage
either explicitly or automatically (e.g., after a TTL expiration).
3.4. Data Sequencing and Naming
The DECADE naming scheme implies no sequencing or grouping of
objects, even if this is done at the application layer. To
illustrate these properties, this section presents several
illustrative examples of use.
3.4.1. Application with Fixed-Size Chunks
Similar to the example in Section 3.1, consider an application in
which each individual application-layer segment of data is called a
"chunk" and has a name of the form: "CONTENT_ID:SEQUENCE_NUMBER".
Furthermore, assume that the application's native protocol uses
chunks of size 16 KB. Now, assume that this application wishes to
store data in a DECADE server in data objects of size 64 KB. To
accomplish this, it can map a sequence of 4 chunks into a single data
object, as shown in Figure 5.
Application Chunks
.---------.---------.---------.---------.---------.---------.--------
| | | | | | |
| Chunk_0 | Chunk_1 | Chunk_2 | Chunk_3 | Chunk_4 | Chunk_5 | Chunk_6
| | | | | | |
`---------`---------`---------`---------`---------`---------`--------
DECADE Data Objects
.---------------------------------------.----------------------------
| |
| Object_0 | Object_1
| |
`---------------------------------------`----------------------------
Figure 5: Mapping Application Chunks to DECADE Data Objects
In this example, the application maintains a logical mapping that is
able to determine the name of a DECADE data object given the chunks
contained within that data object. The name may be conveyed from
either the original content provider, another End-Point with which
the application is communicating, etc. As long as the data contained
within each sequence of chunks is globally unique, the corresponding
data objects have globally unique names.
3.4.2. Application with Continuous Streaming Data
Alimi, et al. Expires December 12, 2013 [Page 12]
Internet-Draft DECADE Architecture June 2013
Consider an application whose native protocol retrieves a continuous
data stream (e.g., an MPEG2 stream) instead of downloading and
redistributing chunks of data. Such an application could segment the
continuous data stream to produce either fixed-sized or variable-
sized data objects. Figure 6 depicts how a video streaming
application might produce variable-sized data objects such that each
data object contains 10 seconds of video data. Similarly with the
previous example, the application may maintain a mapping that is able
to determine the name of a data object given the time offset of the
video chunk.
Application's Video Stream
.--------------------------------------------------------------------
|
|
|
`--------------------------------------------------------------------
^ ^ ^ ^ ^
| | | | |
0 Seconds 10 Seconds 20 Seconds 30 Seconds 40 Seconds
0 B 400 KB 900 KB 1200 KB 1500 KB
DECADE Data Objects
.--------------.--------------.--------------.--------------.--------
| | | | |
| Object_0 | Object_1 | Object_2 | Object_3 |
| (400 KB) | (500 KB) | (300 KB) | (300 KB) |
`--------------`--------------`--------------`--------------`--------
Figure 6: Mapping a Continuous Data Stream to DECADE Data Objects
3.5. Token-based Authorization and Resource Control
A key feature of a DECADE system is that an application endpoint can
authorize other application endpoints to store or retrieve data
objects from in-network storage. This is accomplished using an OAuth
[RFC6749] based authorization scheme. A separate OAuth flow can be
used for this purpose: a client authenticates with the application
server or the P2P application peer, and requests the trusted by the
client, and the token contains particular self-contained properties
(see Section 4.2.1 for details). The client then uses the token when
sending requests to the DECADE server. Upon receiving a token, the
server validates the signature and the operation being performed.
This is a simple scheme, but has some important advantages over an
alternative approach, for example, in which a client explicitly
manipulates an Access Control List (ACL) associated with each data
object. In particular, it has the following advantages when applied
Alimi, et al. Expires December 12, 2013 [Page 13]
Internet-Draft DECADE Architecture June 2013
to DECADE target applications. First, authorization policies are
implemented within the application, thus it explicitly controls when
tokens are generated and to whom they are distributed and for how
long they will be valid. Second, fine-grained access and resource
control can be applied to data objects; see Section 4.2.1 for the
list of restrictions that can be enforced with a token. Third, there
is no messaging between a client and server to manipulate data object
permissions. This can simplify, in particular, applications which
share data objects with many dynamic peers and need to frequently
adjust access control policies attached to data objects. Finally,
tokens can provide anonymous access, in which a server does not need
to know the identity of each client that accesses it. This enables a
client to send tokens to clients belonging to other storage
providers, and allow them to read or write data objects from the
storage of its own storage provider. In addition to clients applying
access control policies to data objects, the server MAY be configured
to apply additional policies based on user, object properties,
geographic location, etc. A client might thus be denied access even
though it possesses a valid token.
There are existing protocols (e.g., OAuth [RFC6749]) that implement
similar referral mechanisms using tokens. A protocol specification
for a DECADE system SHOULD endeavor to use existing mechanisms
wherever possible.
3.6. Discovery
A DECADE system SHOULD include a discovery mechanism through which
clients locate an appropriate server. A discovery mechanism SHOULD
allow a client to determine an IP address or some other identifier
that can be resolved to locate the server for which the client will
be authorized to generate tokens (via DRP). (The discovery mechanism
might also result in an error if no such servers can be located.)
After discovering one or more servers, a client can distribute load
and requests across them (subject to resource limitations and
policies of the servers themselves) according to the policies of the
Application End-Point in which it is embedded. The discovery
mechanism outlined here does not provide the ability to locate
arbitrary DECADE servers to which a client might obtain tokens from
others. To do so will require application-level knowledge, and it is
assumed that this functionality is implemented in the content
distribution application.
The particular protocol used for discovery is out of scope of this
document, but any specification SHOULD re-use standard protocols
wherever possible.
Alimi, et al. Expires December 12, 2013 [Page 14]
Internet-Draft DECADE Architecture June 2013
4. DECADE Protocol Design
This section presents the DRP and the SDT protocol in terms of
abstract protocol interactions that are intended to be mapped to
specific protocols in an implementation. In general, the DRP/SDT
functionality between a DECADE client-server are very similar to the
DRP/SDT functionality between server-server. Any differences are
highlighted below. DRP is used by a DECADE client to configure the
resources and authorization used to satisfy requests (reading,
writing, and management operations concerning data objects) at a
server. SDT will be used to transport data between a client and a
server, as illustrated in Figure 1.
4.1. Naming
A DECADE system SHOULD use [RFC6920] as the recommended and default
naming scheme. Other naming schemes that meet the guidelines in
Section 2.3 may alternatively be used. In order to provide a simple
and generic interface, the DECADE server will be responsible only for
storing and retrieving individual data objects.
The DECADE naming format SHOULD NOT attempt to replace any naming or
sequencing of data objects already performed by an Application.
Instead, naming is intended to apply only to data objects referenced
by DECADE-specific purposes. An application using a DECADE client
may use a naming and sequencing scheme independent of DECADE names.
The DECADE client SHOULD maintain a mapping from its own data objects
and their names to the DECADE-specific data objects and names.
Furthermore, the DECADE naming scheme implies no sequencing or
grouping of objects, even if this is done at the application layer.
4.2. Resource Protocol
DRP will provide configuration of access control and resource sharing
policies on DECADE servers. A content distribution application,
e.g., a live P2P streaming session, can have permission to manage
data at several servers, for instance, servers belonging to different
storage providers. DRP allows one instance of such an application,
i.e., an Application End-Point, to apply access control and resource
sharing policies on each of them.
On a single DECADE server, the following resources SHOULD be managed:
a) communication resources in terms of bandwidth (upload/download)
and also in terms of number of active clients (simultaneous
connections); and b) storage resources.
4.2.1. Access and Resource Control Token
Alimi, et al. Expires December 12, 2013 [Page 15]
Internet-Draft DECADE Architecture June 2013
As in DECADE system, the resource owner agent is always the same
entity or co-located with the authorization server, so we use a
separate OAuth 2.0 request and response flow for the access and
resource control token.
An OAuth request to access the data objects MUST include the
following fields:
response_type: REQUIRED. Value MUST be set to "token".
client_id: the client_id indicates either the application that is
using the DECADE service or the end user who is using the DECADE
service from a DECADE storage service provider. DECADE storage
service providers MUST provide the ID distribution and management
function, which is out of the scope of this document.
scope: data object names that are requested.
An OAuth response includes the following information:
token_type: "Bearer"?
expires_in: The lifetime in seconds of the access token.
access_token: a token denotes the following information.
service URI: the server address or URI which is providing the
service;
Permitted operations (e.g., read, write) and objects (e.g., names
of data objects that might be read or written);
Priority: optional. If it is presented, value MUST be set to be
either "Urgent", "High", "Normal" or "Low".
Bandwidth: given to requested operation, a weight value used in a
weighted bandwidth sharing scheme, or a integer in number of bps;
Amount: data size in number of bytes that might be read or
written.
token_signature: the signature of the access token.
The tokens SHOULD be generated by an entity trusted by both the
DECADE client and the server at the request of a DECADE client. For
example, this entity could be the client, a server trusted by the
client, or another server managed by a storage provider and trusted
by the client. It is important for a server to trust the entity
Alimi, et al. Expires December 12, 2013 [Page 16]
Internet-Draft DECADE Architecture June 2013
generating the tokens since each token may incur a resource cost on
the server when used. Likewise, it is important for a client to
trust the entity generating the tokens since the tokens grant access
to the data stored at the server.
Upon generating a token, a client can distribute it to another client
(e.g., via their native application protocol). The receiving client
can then connect to the server specified in the token and perform any
operation permitted by the token. The token SHOULD be sent along
with the operation. The server SHOULD validate the token to identify
the client that issued it and whether the requested operation is
permitted by the contents of the token. If the token is successfully
validated, the server SHOULD apply the resource control policies
indicated in the token while performing the operation.
Tokens SHOULD include a unique identifier to allow a server to detect
when a token is used multiple times and reject the additional usage
attempts. Since usage of a token incurs resource costs to a server
(e.g., bandwidth and storage) and a Content Provider may have a
limited budget (see Section 2.5), the Content Provider should be able
to indicate if a token may be used multiple times.
It SHOULD be possible to revoke tokens after they are generated.
This could be accomplished by supplying the server the unique
identifiers of the tokens which are to be revoked.
4.2.2. Status Information
DRP SHOULD provide a status request service that clients can use to
request status information of a server. Access to such status
information SHOULD require client authorization; that is, clients
need to be authorized to access the requested status information.
This authorization is based on the user delegation concept as
described in Section 2.5. The following status information elements
SHOULD be obtained: a) list of associated data objects (with
properties); and b) resources used/available. In addition, the
following information elements MAY be available: c) list of servers
to which data objects have been distributed (in a certain time-
frame); and d) list of clients to which data objects have been
distributed (in a certain time-frame).
For the list of servers/clients to which data objects have been
distributed to, the server SHOULD be able to decide on time bounds
for which this information is stored and specify the corresponding
time frame in the response to such requests. Some of this
information may be used for accounting purposes, e.g., the list of
clients to which data objects have been distributed.
Alimi, et al. Expires December 12, 2013 [Page 17]
Internet-Draft DECADE Architecture June 2013
Access information MAY be provided for accounting purposes, for
example, when content providers are interested in access statistics
for resources and/or to perform accounting per user. Again, access
to such information requires client authorization and SHOULD based on
the delegation concept as described in Section 2.5. The following
type of access information elements MAY be requested: a) what data
objects have been accessed by whom and for how many times; and b)
access tokens that a server as seen for a given data object.
The server SHOULD decide on time bounds for which this information is
stored and specify the corresponding time frame in the response to
such requests.
4.2.3. Data Object Attributes
Data Objects that are stored on a DECADE server SHOULD have
associated attributes (in addition to the object identifier and data
object) that relate to the data storage and its management. These
attributes may be used by the server (and possibly the underlying
storage system) to perform specialized processing or handling for the
data object, or to attach related server or storage-layer properties
to the data object. These attributes have a scope local to a server.
In particular, these attributes SHOULD NOT be applied to a server or
client to which a data object is copied.
Depending on authorization, clients SHOULD be permitted to get or set
such attributes. This authorization is based on the delegation as
per Section 2.5. DECADE does not limit the set of permissible
attributes, but rather specifies a set of baseline attributes that
SHOULD be supported:
Expiration Time: Time at which the data object can be deleted;
Data Object size: In bytes;
Media type Labelling of type as per [RFC6838];
Access statistics: How often the data object has been accessed (and
what tokens have been used).
The data object attributes defined here are distinct from application
metadata (see Section 2.1). Application metadata is custom
information that an application might wish to associate with a data
object to understand its semantic meaning (e.g., whether it is video
and/or audio, its playback length in time, or its index in a stream).
If an application wishes to store such metadata persistently, it can
be stored within data objects themselves.
Alimi, et al. Expires December 12, 2013 [Page 18]
Internet-Draft DECADE Architecture June 2013
4.3. Data Transfer
A DECADE server will provide a data access interface, and SDT will be
used to write data objects to a server and to read (download) data
objects from a server. Semantically, SDT is a client-server
protocol; that is, the server always responds to client requests.
To write a data object, a client first generates the object's name
(see Section 4.1), and then uploads the object to a server and
supplies the generated name. The name can be used to access
(download) the object later; for example, the client can pass the
name as a reference to other clients that can then refer to the
object. Data objects can be self-contained objects such as
multimedia resources, files etc., but also chunks, such as chunks of
a P2P distribution protocol that can be part of a containing object
or a stream. If supported, a server can verify the integrity and
other security properties of uploaded objects.
A client can request named data objects from a server. In a
corresponding request message, a client specifies the object name and
a suitable access and resource control token. The server checks the
validity of the received token and its associated resource usage-
related properties. If the named data object exists on the server
and the token can be validated, the server delivers the requested
object in a response message. If the data object cannot be delivered
the server provides a corresponding status/reason information in a
response message. Specifics regarding error handling, including
additional error conditions (e.g., overload), precedence for returned
errors and its relation with server policy, are deferred to eventual
protocol specification.
4.4. Server-to-Server Protocols
An important feature of a DECADE system is the capability for one
server to directly download data objects from another server. This
capability allows applications to directly replicate data objects
between servers without requiring end-hosts to use uplink capacity to
upload data objects to a different server.
DRP and SDT SHOULD support operations directly between servers.
Servers are not assumed to trust each other nor are configured to do
so. All data operations are performed on behalf of clients via
explicit instruction. However, the objects being processed do not
necessarily have to originate or terminate at the client (i.e., the
data object might be limited to being exchanged between servers even
if the instruction is triggered by the client). Clients thus will be
able to indicate to a server which remote server(s) to access, what
operation is to be performed, the content provider at the remote
Alimi, et al. Expires December 12, 2013 [Page 19]
Internet-Draft DECADE Architecture June 2013
server from which to retrieve the data object, or in which the object
is to be stored, and the credentials indicating access and resource
control to perform the operation at the remote server.
Server-to-server support is focused on reading and writing data
objects between servers. The data object referred to at the remote
server is the same as the original data object requested by the
client. Object attributes (see Section 4.2.3) might also be
specified in the request to the remote server. In this way, a server
acts as a proxy for a client, and a client can instantiate requests
via that proxy. The operations will be performed as if the original
requester had its own client co-located with the server. When a
client sends a request to a server with these additional parameters,
it is giving the server permission to act (proxy) on its behalf.
Thus, it would be prudent for the supplied token to have narrow
privileges (e.g., limited to only the necessary data objects) or
validity time (e.g., a small expiration time).
In the case of a retrieval operation, the server is to retrieve the
data object from the remote server using the specified credentials,
and then optionally return the object to a client. In the case of a
storage operation, the server is to store the object to the remote
server using the specified credentials. The object might optionally
be uploaded from the client or might already exist at the server.
5. In-Network Storage Components Mapping to DECADE
This section evaluates how the basic components of an in-network
storage system (see Section 3 of [RFC6392]) map into a DECADE system.
With respect to Data Access Interface, DECADE clients can read and
write objects of arbitrary size through the client's Data Controller,
making use of standard data transfer (SDT). With respect to Data
Management Operations, clients can move or delete previously stored
objects via the client's Data Controller, making use of SDT. Clients
can enumerate or search contents of servers to find objects matching
desired criteria through services provided by the Content
Distribution Application (e.g., buffer-map exchanges, a DHT, or peer-
exchange). In doing so, Application End-Points might consult their
local Data Index in the client's Data Controller (Data Search
Capability).
With respect to Access Control Authorization, all methods of access
control are supported: public-unrestricted, public-restricted and
private. Access Control Policies are generated by a content
distribution application and provided to the client's Resource
Controller. The server is responsible for implementing the access
control checks. Clients can manage the resources (e.g., bandwidth)
Alimi, et al. Expires December 12, 2013 [Page 20]
Internet-Draft DECADE Architecture June 2013
on the DECADE server that can be used by other Application End-Points
(Resource Control Interface). Resource Sharing Policies are
generated by a content distribution application and provided to the
client's Resource Controller. The server is responsible for
implementing the resource sharing policies.
Although the particular protocol used for discovery is outside the
scope of this document, different options and considerations have
been discussed in Section 3.6. Finally with respect to the storage
mode, DECADE servers provide an object-based storage mode. Immutable
data objects might be stored at a server. Applications might
consider existing blocks as data objects, or they might adjust block
sizes before storing in a server.
6. Security Considerations
In general, the security considerations mentioned in [RFC6646] apply
to this document as well. A DECADE system provides a distributed
storage service for content distribution and similar applications.
The system consists of servers and clients that use these servers to
upload data objects, to request distribution of data objects, and to
download data objects. Such a system is employed in an overall
application context -- for example in a P2P application, and it is
expected that DECADE clients take part in application-specific
communication sessions. The security considerations here focus on
threats related to the DECADE system and its communication services,
i.e., the DRP/SDT protocols that have been described in an abstract
fashion in this document.
6.1. Threat: System Denial of Service Attacks
A DECADE network might be used to distribute data objects from one
client to a set of servers using the server-to-server communication
feature that a client can request when uploading an object; see
Section 4.4. Multiple clients uploading many objects at different
servers at the same time and requesting server-to-server distribution
for them could thus mount massive distributed denial of service
(DDOS) attacks, overloading a network of servers. This threat is
addressed by the server's access control and resource control
framework. Servers can require Application End-Points to be
authorized to store and to download objects, and Application End-
Points can delegate authorization to other Application End-Points
using the token mechanism. Of course the effective security of this
approach depends on the strength of the token mechanism. See below
for a discussion of this and related communication security threats.
Denial of Service Attacks against a single server (directing many
requests to that server) might still lead to considerable load for
Alimi, et al. Expires December 12, 2013 [Page 21]
Internet-Draft DECADE Architecture June 2013
processing requests and invalidating tokens. SDT therefore MUST
provide a redirection mechanism.
6.2. Threat: Authorization Mechanisms Compromised
A DECADE system does not require Application End-Points to
authenticate in order to access a server for downloading objects,
since authorization is not based on End-Point or user identities but
on a delegation-based authorization mechanism. Hence, most protocol
security threats are related to the authorization scheme. The
security of the token mechanism depends on the strength of the token
mechanism and on the secrecy of the tokens. A token can represent
authorization to store a certain amount of data, to download certain
objects, to download a certain amount of data per time etc. If it is
possible for an attacker to guess, construct or simply obtain tokens,
the integrity of the data maintained by the servers is compromised.
This is a general security threat that applies to authorization
delegation schemes. Specifications of existing delegation schemes
such as [RFC6749] discuss these general threats in detail. We can
say that the DRP has to specify appropriate algorithms for token
generation. Moreover, authorization tokens should have a limited
validity period that should be specified by the application. Token
confidentiality should be provided by application protocols that
carry tokens, and the SDT and DRP should provide secure
(confidential) communication modes.
6.3. Threat: Data Object Spoofing
In a DECADE system, an Application End-Point is referring other
Application End-Points to servers to download a specified data
objects. An attacker could "inject" a faked version of the object
into this process, so that the downloading End-Point effectively
receives a different object (compared to what the uploading End-Point
provided). As result, the downloading End-Point believes that is has
received an object that corresponds to the name it was provided
earlier, whereas in fact it is a faked object. Corresponding attacks
could be mounted against the application protocol (that is used for
referring other End-Points to servers), servers themselves (and their
storage sub-systems), and the SDT by which the object is uploaded,
distributed and downloaded.
A DECADE systems fundamental mechanism against object spoofing is
name-object binding validation, i.e., the ability of a receiver to
check whether the name he was provided and that he used to request an
object, actually corresponds to the bits he received. As described
above, this allows for different forms of name-object binding, for
example using hashes of data objects, with different hash functions
Alimi, et al. Expires December 12, 2013 [Page 22]
Internet-Draft DECADE Architecture June 2013
(different algorithms, different digest lengths). For those
application scenarios where hashes of data objects are not applicable
(for example live-streaming) other forms of name-object binding can
be used (see Section 4.1). This flexibility also addresses
cryptographic algorithm evolution: hash functions might get
deprecated, better alternatives might be invented etc., so that
applications can choose appropriate mechanisms meeting their security
requirements.
DECADE servers MAY perform name-object binding validation on stored
objects, but Application End-Points MUST NOT rely on that. In other
words, Application End-Points SHOULD perform name-object binding
validation on received objects.
7. IANA Considerations
This document does not have any IANA considerations.
8. Acknowledgments
We thank the following people for their contributions to and/or
detailed reviews of this or earlier versions of this document:
Carsten Bormann, David Bryan, Dave Crocker, Yingjie Gu, David
Harrington, Hongqiang (Harry) Liu, David McDysan, Borje Ohlman,
Martin Stiemerling, Richard Woundy, and Ning Zong.
9. Informative References
[RFC2119] Bradner, S., "Key words for use in RFCs to Indicate
Requirement Levels", BCP 14, RFC 2119, March 1997.
[RFC5661] Shepler, S., Eisler, M., and D. Noveck, "Network File
System (NFS) Version 4 Minor Version 1 Protocol", RFC
5661, January 2010.
[RFC6392] Alimi, R., Rahman, A., and Y. Yang, "A Survey of In-
Network Storage Systems", RFC 6392, October 2011.
[RFC6646] Song, H., Zong, N., Yang, Y., and R. Alimi, "DECoupled
Application Data Enroute (DECADE) Problem Statement", RFC
6646, July 2012.
[RFC6749] Hardt, D., "The OAuth 2.0 Authorization Framework", RFC
6749, October 2012.
[RFC6838] Freed, N., Klensin, J., and T. Hansen, "Media Type
Specifications and Registration Procedures", BCP 13, RFC
6838, January 2013.
Alimi, et al. Expires December 12, 2013 [Page 23]
Internet-Draft DECADE Architecture June 2013
[RFC6920] Farrell, S., Kutscher, D., Dannewitz, C., Ohlman, B.,
Keranen, A., and P. Hallam-Baker, "Naming Things with
Hashes", RFC 6920, April 2013.
[GoogleFileSystem]
Ghemawat, S., Gobioff, H., and S. Leung, "The Google File
System", SOSP 2003, October 2003.
Authors' Addresses
Richard Alimi
Google
Email: ralimi@google.com
Akbar Rahman
InterDigital Communications, LLC
Email: akbar.rahman@interdigital.com
Dirk Kutscher
NEC
Email: dirk.kutscher@neclab.eu
Y. Richard Yang
Yale University
Email: yry@cs.yale.edu
Haibin Song
Huawei Technologies
Email: haibin.song@huawei.com
Kostas Pentikousis
Huawei Technologies
Email: k.pentikousis@huawei.com
Alimi, et al. Expires December 12, 2013 [Page 24]