Internet DRAFT - draft-xu-avt-dra
draft-xu-avt-dra
AVT Working Group M. Xu
Internet-Draft CS. Zheng
Intended status: Informational T. Jiang
Expires: February 25, 2010 WX. Zhang
JP. Zhang
Digital Wave Co., Ltd
August 24, 2009
RTP Payload Format for DRA Audio
draft-xu-avt-dra-00
Status of this Memo
This Internet-Draft is submitted to IETF in full conformance with the
provisions of BCP 78 and BCP 79.
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF), its areas, and its working groups. Note that
other groups may also distribute working documents as Internet-
Drafts.
Internet-Drafts are draft documents valid for a maximum of six months
and may be updated, replaced, or obsoleted by other documents at any
time. It is inappropriate to use Internet-Drafts as reference
material or to cite them other than as "work in progress."
The list of current Internet-Drafts can be accessed at
http://www.ietf.org/ietf/1id-abstracts.txt.
The list of Internet-Draft Shadow Directories can be accessed at
http://www.ietf.org/shadow.html.
This Internet-Draft will expire on February 25, 2010.
Copyright Notice
Copyright (c) 2009 IETF Trust and the persons identified as the
document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal
Provisions Relating to IETF Documents in effect on the date of
publication of this document (http://trustee.ietf.org/license-info).
Please review these documents carefully, as they describe your rights
and restrictions with respect to this document.
Xu, et al. Expires February 25, 2010 [Page 1]
Internet-Draft DRA-RTP Format August 2009
Abstract
The present document describes a RTP packaging scheme for DRA
compressed audio data transmission, as well as the corresponding RTP
payload format. According to the properties of DRA compressed audio
frame and the Maximum Transmission Unit (MTU) of network, the scheme
provides 3 packaging modes for different coding and transmission
requirements.
Table of Contents
1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . 3
2. Requirements Language . . . . . . . . . . . . . . . . . . . . 3
3. Overview of DRA . . . . . . . . . . . . . . . . . . . . . . . 3
4. RTP Payload Format . . . . . . . . . . . . . . . . . . . . . . 4
4.1. RTP Header Usage . . . . . . . . . . . . . . . . . . . . . 4
4.2. Structure of the RTP DRA Payload Format . . . . . . . . . 5
5. SDP related information for DRA Audio . . . . . . . . . . . . 6
5.1. Media Type Registration . . . . . . . . . . . . . . . . . 7
5.2. SDP Format for DRA audio . . . . . . . . . . . . . . . . . 8
5.2.1. Attributes (a=) . . . . . . . . . . . . . . . . . . . 8
5.2.2. Media Description(m=) . . . . . . . . . . . . . . . . 9
5.2.3. Examples . . . . . . . . . . . . . . . . . . . . . . . 9
6. IANA Considerations . . . . . . . . . . . . . . . . . . . . . 9
7. Security Considerations . . . . . . . . . . . . . . . . . . . 10
8. References . . . . . . . . . . . . . . . . . . . . . . . . . . 10
8.1. Normative References . . . . . . . . . . . . . . . . . . . 10
8.2. Informative References . . . . . . . . . . . . . . . . . . 10
Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . . 10
Xu, et al. Expires February 25, 2010 [Page 2]
Internet-Draft DRA-RTP Format August 2009
1. Introduction
DRA multi-channel digital audio coding technology is capable of
preserving high fidelity sounds over limited storage and limited
transmission bandwidth, suitable for a wide variety of application
areas including digital audio broadcasting, digital TV accompany
sound, home theater, Internet streaming media, and personal media
player. In the fields of IP network transmission and real-time
broadcasting, Real-Time Transport Protocol, as described in RFC 3550
[RFC3550], can be employed for transmission and synchronization of
DRA audio streams.
This document describes an RTP payload format for transmitting audio
data compressed by DRA technology .
2. Requirements Language
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
"SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
document are to be interpreted as described in RFC 2119 [RFC2119].
3. Overview of DRA
DRA digital audio coding technology keeps 24-bit precision (except
deliberating quantization loss)for all channels. Its supported
channel configurations include commonly known stereo, 5.1 , 6.1, and
7.1 surround sound, besides a large space reserved for future audio
development (maximum 64.3 surround sound). DRA can handle standard
sampling frequencies spanning from 8kHz to 192kHz, including 44.1kHz
and 48kHz. There is no explicit restriction on DRA encoding bitrate,
which is determined by the channel bandwidth and audio quality
requirement.
DRA has a fixed input frame length of 1024 samples for encoding, thus
its time duration can be calculated according to sampling rate. Bit
count for encoded frame is controlled by encoding configuration
parameters, which are recorded in the frame header. Figure 1
illustrates the structure of a DRA encoded frame described in
"Specification for multichannel digital audio coding technology"
(China national standard) [GB/T 22726-2008].
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|Sync Word|Header|normal ch data|LFE ch data|error check|ancillary|
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 1: DRA Frame Structure
Xu, et al. Expires February 25, 2010 [Page 3]
Internet-Draft DRA-RTP Format August 2009
Every audio frame starts with the sync word 0x7fff, then frame
header. Figure 2 illustrates parts of the header structure.
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|frame type |audio data frame length| other header information |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 2: DRA Frame Header's Structure
Frame type: 1 bit, '0' for normal frame header, '1' for extension
header.
Audio data frame length: 10 bits (normal frame header) or 13 bits
(extension header), representing the length from the sync word to the
error check word, counting in unit of 32-bit.
4. RTP Payload Format
DRA encoded frame length varies with sampling rate and bitrate, and
possibly overruns the Maximum Transmission Unit (MTU) in high bitrate
modes. In this case, a single audio frame is parted to several
blocks. On the other hand, DRA supports Variable Bit-Rate (VBR)
encoding, allocating bit resources according to perceptible
information in each frame. Therefore, within the limit of MTU,
several short audio frames are packaged into one RTP packet for
higher transmission efficiency.
3 RTP DRA payload formats are employed:
o Single-Frame packet: RTP payload comprising a single complete DRA
compressed frame;
o Multi-Frame Packet: RTP payload comprising multiple complete DRA
compressed frame;
o Frag-Frame Packet: RTP payload comprising a fragment of one DRA
compressed frame.
Subsection 4.2 has detailed description of the above 3 modes.
4.1. RTP Header Usage
The usage of RTP header information specified in this document
observes the RTP header information defined inRFC 3550 [RFC3550].
Figure 3 gives the RTP header format for the reader's convenience.
Xu, et al. Expires February 25, 2010 [Page 4]
Internet-Draft DRA-RTP Format August 2009
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|V=2|P|X| CC |M| PT | sequence number |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| timestamp |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| synchronization source (SSRC) identifier |
+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+=+
| contributing source (CSRC) identifiers |
| .... |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 3: RTP Header according to RFC 3550
Marker bit (M): 1 bit
Set to '1' only if at least one complete audio frame or the last
block of an audio frame contained in the current RTP packet.
Payload type (PT): 7 bits
DRA RTP packet payload type designation is not discussed in the
present document. Its is set by the RTP profile of DRA RTP
payload format, or dynamically set.
TimeStamp (timestamp): 32 bits
The timestamp of RTP is used to record the sampling time of the
first audio frame in the current RTP packet. For DRA RTP
payload, the clock frequency corresponding to the time stamp is
equal to the audio sampling frequency. If the current RTP
packet comprises more than one DRA audio frame, the sampling
time for each audio frame can be derived from the time stamp
plus the accumulated audio frame duration time. In Frag-Frame
Packet mode, all RTP packets belong to the same audio frame
should have the same time stamp.
4.2. Structure of the RTP DRA Payload Format
RTP DRA payload comprises a 16-bit payload header and audio frame
data. Only the latter vary among the 3 RTP packaging modes. Figure
4 illustrates the payload structure of the Multi-Frame Packet mode.
Xu, et al. Expires February 25, 2010 [Page 5]
Internet-Draft DRA-RTP Format August 2009
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Payload Header | DRA frame 1 | DRA frame 2 | ... DRA frame n|
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 4: Multi-Frame Packet RTP payload structure
The payload header structure is given in figure 5.
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
|PM | MBZ | N |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 5: RTP payload header structure
Packaging Mode (PM): 2 bits
Indicating the current RTP packaging modes, as is shown in table
1:
+----------+---------------------+
| PM value | Mode |
+----------+---------------------+
| 0 | Single-Frame Packet |
| 1 | Multi-Frame Packet |
| 2 | Frag-Frame Packet |
| 3 | Forbidden |
+----------+---------------------+
Table 1: Packaging Mode
Must Be Zero (MBZ, reserved): 6 bits
Reserved for future use, all 6 bits must be set to '0'
Number (N, block serial number or frame number): 8 bits
If PM = 1, or Multi-Frame Packet mode, 'Number' is set to the
number of DRA frames in the current RTP packet; if PM = 2,
'Number' is set to the serial number of the current block in a
DRA audio frame; if PM = 0, 'Number' is set to '1'.
5. SDP related information for DRA Audio
Xu, et al. Expires February 25, 2010 [Page 6]
Internet-Draft DRA-RTP Format August 2009
5.1. Media Type Registration
This section uses the template provided in RFC 4288 [RFC4288] and
follows RFC 4855 [RFC4855].
o MIME media type name: audio
o MIME subtype name: vnd.dra
o Required parameters:
rate: The RTP timestamp clock rate is equal to one of the
audio sampling rates. The sampling rates consist of 13
possible values as specified in Table 10 of GB/T 22726-2008
[GB/T 22726-2008]: 8000 Hz, 11025 Hz, 12000 Hz, 16000 Hz,
22050 Hz, 24000 Hz, 32000 Hz, 44100 Hz, 48000 Hz, 88200 Hz,
96000 Hz, 176400 Hz, 192000 Hz.
o Optional parameters: none
o Encoding considerations: Transport_provided
This media type is framed and contains binary data.
o Security considerations:
See section 7 of this document.
o Interoperability considerations: None
o Published specification:
This specification and see GB/T 22726-2008 [GB/T 22726-2008].
o Applications which use this media type:
Audio and audio for video can be multi-channel audio
compressed with this media type.
o Additional information:
Magic number(s): The first 16 bits of an DRA frame are fixed
as synchronization word, which is 0x7FFF in hex expression.
o Person & email address to contact for further information:
Jiang Tian <jiangt@digitalwave.cn>
Xu, et al. Expires February 25, 2010 [Page 7]
Internet-Draft DRA-RTP Format August 2009
o Intended usage: COMMON
o Author/Change controller: XU Mao, et al.
5.2. SDP Format for DRA audio
Only the channel number and the sampling rate information are needed
besides raw DRA bitstream defined in GB/T 22726-2008 [GB/T
22726-2008] for decoding. Both will be present in SDP defined in
RFC 4566 [RFC4566]to assist decoding. But there is no explicit
configuration information embedded.
5.2.1. Attributes (a=)
a. First, a short item.a=maxptime:<maximum packet time>
This is an optional parameter, found by:
<maximum packet time>
= <maximum packet size in byte>*8*1000 /<minimum bitrate in
kbps>.
The maximum packet size is confined by Maximum Transmission Unit
(MTU).
b. a=rtpmap:<payload type> <encoding name> / <clock rate>
[/<encoding parameters>]
This is a required attribute, where
<payload type> = 97, defined in RFC 3551 [RFC3551], dynamically
assigned
<encoding name> = "dra"
<clock rate> = DRA audio sampling rate
<encoding parameters> = channel number
c. a=fmtp:<format> <format specific parameters>
This is a optional attribute used if and only if the < format
specific parameters> is not empty, where
<format> = <payload type> = 97,
<format specific parameters> currently includes only bitrate=#
Xu, et al. Expires February 25, 2010 [Page 8]
Internet-Draft DRA-RTP Format August 2009
where "#" is an integer bitrate in bps.
5.2.2. Media Description(m=)
m=<media> <port>/<number of ports> <proto> <fmt>
This is a required field, where
<media> = "audio"
<port> = 58636 or any available even numbered port
<number of ports>
= number of RTP sessions, only used for multiple sessions
<proto> = "RTP/AVP"
<fmt> = <payload type> = 97.
5.2.3. Examples
This section provides two examples of the SDP data for DRA as
follows:
a. 48000Hz, stereo DRA bitstream:
m=audio 58636 RTP/AVP 97
a=rtpmap:97 dra/48000/2
b. 320kbps, 44100Hz, 5.1 channel DRA bitstream, two sessions:
m=audio 58800/2 RTP/AVP 97
a=rtpmap:97 dra/44100/6
a=fmtp:97 bitrate=320000
6. IANA Considerations
A new media subtype has been assigned for DRA by IANA; see Section
5.1.
Xu, et al. Expires February 25, 2010 [Page 9]
Internet-Draft DRA-RTP Format August 2009
7. Security Considerations
The payload format described in this document is subject to the
security considerations defined in RFC 3550 [RFC3550].
Confidentiality protection would have to be applied, so as to protect
the users' provacy and contents with copyright.
8. References
8.1. Normative References
[GB/T 22726-2008]
You et al., "Specification for multichannel digital audio
coding technology", Dec, 2008.
[RFC2119] Bradner, S., "Key words for use in RFCs to Indicate
Requirement Levels", BCP 14, RFC 2119, March 1997.
[RFC3550] Schulzrinne, H., Casner, S., Frederick, R., and V.
Jacobson, "RTP: A Transport Protocol for Real-Time
Applications", STD 64, RFC 3550, July 2003.
[RFC3551] Schulzrinne, H. and S. Casner, "RTP Profile for Audio and
Video Conferences with Minimal Control", STD 65, RFC 3551,
July 2003.
[RFC4288] Freed, N. and J. Klensin, "Media Type Specifications and
Registration Procedures", BCP 13, RFC 4288, December 2005.
[RFC4566] Handley, M., Jacobson, V., and C. Perkins, "SDP: Session
Description Protocol", RFC 4566, July 2006.
[RFC4855] Casner, S., "Media Type Registration of RTP Payload
Formats", RFC 4855, February 2007.
8.2. Informative References
[RFC2736] Handley, M. and C. Perkins, "Guidelines for Writers of RTP
Payload Format Specifications", BCP 36, RFC 2736,
December 1999.
[RFC4184] Link, B., Hager, T., and J. Flaks, "RTP Payload Format for
AC-3 Audio", RFC 4184, October 2005.
Xu, et al. Expires February 25, 2010 [Page 10]
Internet-Draft DRA-RTP Format August 2009
Authors' Addresses
Mao Xu
Digital Wave Co., Ltd
Room 2002A, Building B, Cyber Tower, No.2, ZhongGuanCun South Avenue
Beijing, 100031
China
Phone: +86-10-5278-6263
Email: xumao@digitalwave.cn
ChuSheng Zheng
Digital Wave Co., Ltd
Room 2002A, Building B, Cyber Tower, No.2, ZhongGuanCun South Avenue
Beijing, 100031
China
Phone: +86-10-5278-6263
Email: zhengcs@digitalwave.cn
Tian Jiang
Digital Wave Co., Ltd
Room 2002A, Building B, Cyber Tower, No.2, ZhongGuanCun South Avenue
Beijing, 100031
China
Phone: +86-10-5278-6263
Email: jiangt@digitalwave.cn
WeiXiong Zhang
Digital Wave Co., Ltd
Room 2002A, Building B, Cyber Tower, No.2, ZhongGuanCun South Avenue
Beijing, 100031
China
Phone: +86-10-5278-6263
Email: zhangwx@digitalwave.cn
Xu, et al. Expires February 25, 2010 [Page 11]
Internet-Draft DRA-RTP Format August 2009
JingPing Zhang
Digital Wave Co., Ltd
Room 2002A, Building B, Cyber Tower, No.2, ZhongGuanCun South Avenue
Beijing, 100031
China
Phone: +86-10-5278-6263
Email: zhangjp@digitalwave.cn
Xu, et al. Expires February 25, 2010 [Page 12]