Internet DRAFT - draft-huang-avtext-feedback-image-control
draft-huang-avtext-feedback-image-control
INTERNET-DRAFT R. Huang
Intended Status: Standard Track R. Even
Expires: January 7, 2016 Huawei
July 6, 2015
Real-time Transport Control Protocol (RTCP) Feedback Message Extension
for Image Control Message
draft-huang-avtext-feedback-image-control-00
Abstract
This document introduces a method to let receivers control the part
of the full video image that they wish to watch. It specifies an
extension to the messages defined in the Audio-Visual Profile with
Feedback (AVPF), and it can be applied to any RTP video
applications.
Status of this Memo
This Internet-Draft is submitted to IETF in full conformance with the
provisions of BCP 78 and BCP 79.
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF), its areas, and its working groups. Note that
other groups may also distribute working documents as
Internet-Drafts.
Internet-Drafts are draft documents valid for a maximum of six months
and may be updated, replaced, or obsoleted by other documents at any
time. It is inappropriate to use Internet-Drafts as reference
material or to cite them other than as "work in progress."
The list of current Internet-Drafts can be accessed at
http://www.ietf.org/1id-abstracts.html
The list of Internet-Draft Shadow Directories can be accessed at
http://www.ietf.org/shadow.html
Copyright and License Notice
Copyright (c) 2015 IETF Trust and the persons identified as the
document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal
<R. Huang, et al> Expires January 7, 2016 [Page 1]
INTERNET DRAFT <RTCP Feedback Image Control> July 6, 2015
Provisions Relating to IETF Documents
(http://trustee.ietf.org/license-info) in effect on the date of
publication of this document. Please review these documents
carefully, as they describe your rights and restrictions with respect
to this document. Code Components extracted from this document must
include Simplified BSD License text as described in Section 4.e of
the Trust Legal Provisions and are provided without warranty as
described in the Simplified BSD License.
Table of Contents
1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . 3
2 Terminology . . . . . . . . . . . . . . . . . . . . . . . . . . 3
3. Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . 3
4. RTCP Receiver Report Extensions . . . . . . . . . . . . . . . . 4
4.1. Payload-Specific Feedback Messages . . . . . . . . . . . . 5
4.1.1. Image Control Request (ICR) . . . . . . . . . . . . . . 5
4.1.1.1 Message Format . . . . . . . . . . . . . . . . . . . 5
4.1.1.2. Semantics . . . . . . . . . . . . . . . . . . . . . 6
4.1.1.3. Timing Rules . . . . . . . . . . . . . . . . . . . 6
4.1.1.4 Handling of Message in Mixers and Translators . . . 8
4.1.1.5 Remarks . . . . . . . . . . . . . . . . . . . . . . 8
4.1.2 Image Control Notification (ICN) . . . . . . . . . . . 7
4.1.2.1 Message Format . . . . . . . . . . . . . . . . . . . 7
4.1.2.2 Semantics . . . . . . . . . . . . . . . . . . . . . 8
4.1.2.3. Timing Rules . . . . . . . . . . . . . . . . . . . 8
4.1.1.4 Handling of Message in Mixers and Translators . . . 8
4.1.1.5 Remarks . . . . . . . . . . . . . . . . . . . . . . 8
5 Security Considerations . . . . . . . . . . . . . . . . . . . . 8
6 IANA Considerations . . . . . . . . . . . . . . . . . . . . . . 8
7 Acknowledgments . . . . . . . . . . . . . . . . . . . . . . . . 8
8 References . . . . . . . . . . . . . . . . . . . . . . . . . . 9
8.1 Normative References . . . . . . . . . . . . . . . . . . . 9
8.2 Informative References . . . . . . . . . . . . . . . . . . 9
Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . . 9
<R. Huang, et al> Expires January 7, 2016 [Page 2]
INTERNET DRAFT <RTCP Feedback Image Control> July 6, 2015
1 Introduction
4K Ultra HD technology is by itself a very new trend in the overall
electronics landscape, and the impact of it is growing month by
month. The new resolution format itself is slowly starting to remake
perceptions of where the entire visual media industry will be going
over the next few years.
However, there are not many 4K display devices out yet, and 99.9% of
all final visual media productions are only 1080p HD. And sometimes
due to the bandwidth limitation, application providers are unable to
provide real 4K services to end users.
This document introduces a method to let receivers control the part
of the full video image that they wish to watch. It specifies an
extension to the messages defined in the Audio-Visual Profile with
Feedback (AVPF), and it can be applied to any RTP video
applications.
2 Terminology
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
"SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
document are to be interpreted as described in RFC 2119 [RFC2119].
3. Motivation
This document considers following use cases:
1. In a real-time monitoring application, the user who watches the
monitor screen may want to get a closer view of some detail of the
video image. He will want an option to ask for a specific part of
the full picture allowing the sender to either zoom in to this
part or send part of the full picture (may be a 4K resolution) to
reduce the used bandwidth.
2. In a video conferencing scenario, one user may prefer a
detailed image of one specific person, e.g., the facial expression
of the person, while another user may be fine with the initial
image.
Changing the image can be also done by delivering the original
encoder image resolution to the receivers and changing it locally, or
by the remote side using camera zoom function. However, that's not
always feasible for all cases. In a UHD environment, the issue of
bandwidth manifests itself in all of the same processes that are in
effect for those files in 1080p resolution. Even after the 4K streams
<R. Huang, et al> Expires January 7, 2016 [Page 3]
INTERNET DRAFT <RTCP Feedback Image Control> July 6, 2015
have been optimized at the source, they'll still require at least two
to three times the bandwidth you'd need today to watch a 1080p HD
feed. And in case 2, it's also not feasible to use camera zoom
because not all the receivers want to enlarge their pictures.
In this document, a new mechanism is introduced which allows
applications to signal to the remote encoder the desired image area
based on the original one. It is useful for cases where higher
definition resolution is available at the source while only streaming
of lower definition resolution is supported due to the issues of
bandwidth or user presenting devices limitation.
The basic idea is to use a new feedback message to signal the area
that the receiver requests to enlarge. The specific area is a
rectangle indicated by the coordinate of the upper left point and the
length and the width. The unit of the coordinate is based on the
original picture sent by the sender. It can be illustrated in the
following figure 1.
2160 @(x,y) is the start point (upper left point)
|
|
| x Length
|.........@****************
| * *
| * *width
| y * *
| *****************
| .
|_________._________________________________3840
Figure 1 The illustration of enlarged image area
The motivation for using the media path instead of signaling path is
similar to [RFC5104]. Media and signaling paths are sometimes
separated in systems employing middle box which has the image control
ability. As the messages are intended for the media part rather than
the signaling part, using media path avoid the transmission across
interfaces and unnecessary control traffic between signaling and
processing. Moreover, the control handled in media path is faster
than signaling path as explained in [RFC5104].
4. RTCP Receiver Report Extensions
The new feedback messages are defined in the following subsections,
following a similar structure to that in sections 6.3 of the AVPF
specification [RFC4585].
<R. Huang, et al> Expires January 7, 2016 [Page 4]
INTERNET DRAFT <RTCP Feedback Image Control> July 6, 2015
4.1. Payload-Specific Feedback Messages
As specified by Section 6.1 of [RFC4585], Payload-Specific Feedback
Messages are identified by the RTCP packet type value PSFB (206).
The following subsections define 2 new FCI formats for the payload-
specific feedback messages to extend those defined in [RFC4585] and
[RFC5104].
4.1.1. Image Control Request (ICR)
The ICR feedback message is identified by RTCP packet type value
PT=PSFB and FMT=ICR.
[Note to RFC Editor: Please replace ICR with the IANA provided
feedback message type for this message.]
The FCI field MUST contain one ICR FCI entries.
4.1.1.1 Message Format
The content of the FCI entry for the Image Control Request is showed
in Figure 2. The length of the feedback message MUST be set to 4.
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Seq nr. | x axis of start point | y axis of start |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| point | length | width |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 2: Syntax of an FCI Entry in the ICN Message
Seq nr.: 8 bits
Request sequence number. The sequence number space is unique for
pairing of the SSRC of request source and the SSRC of the request
target. The sequence number SHALL be increased by 1 modulo 256 for
new command. The initial value is arbitrary. A repetition message
SHOULD NOT increase the sequence number.
x axis of start point: 14 bits
The start point is the upper left point of the rectangle requested
to be enlarged. X axis of start point is the value of the
horizontal axis of this point. A value of 16383 indicates the
point is invalid.
<R. Huang, et al> Expires January 7, 2016 [Page 5]
INTERNET DRAFT <RTCP Feedback Image Control> July 6, 2015
y axis of start point: 14 bits
Y axis of start point is the value of the vertical axis of this
point. A value of 16383 indicates the point is invalid.
length: 14 bits
Length indicates the horizontal side of the rectangle requested to
be enlarged. A value of 16383 indicates the rectangle is invalid.
width: 14 bits
Width indicates the vertical side of the rectangle requested to be
enlarged. A value of 16383 indicates the rectangle is invalid.
4.1.1.2. Semantics
A decoder can request a image control by sending a ICR message to an
encoder. If the encoder has the ability to do such kind of change, it
SHOULD take into account the received ICR messages. If one of the
values in the message is not valid, the encoder SHOULD give up the
attempt of changing and give a notice. The rectangle requested in the
ICR message may not suit the final presenting of the image quite
well. In that case, the decoder is allowed to adjust according to the
final presenting devices.
Within the common packet header for feedback messages (as defined in
section 6.1 of [RFC4585]),the "SSRC of packet sender" field indicates
the source of the request, and the "SSRC of media source" indicates
the media source required to be applied the change to.
4.1.1.3. Timing Rules
The timing follows the rules outlined in section 3 of [RFC4585]. This
request message is not time critical and SHOULD be sent using regular
RTCP timing. Only if it is known that the user interface requires
quick feedback, the message MAY be sent with early or immediate
feedback timing.
4.1.1.4 Handling of Message in Mixers and Translators
A mixer of media translator that encodes content sent to the session
participant issuing the ICR SHALL consider the request to determine
if it can fulfill it. A media translator unable to fulfill the
request MAY forward the request unchanged towards the media sender.
4.1.1.5 Remarks
<R. Huang, et al> Expires January 7, 2016 [Page 6]
INTERNET DRAFT <RTCP Feedback Image Control> July 6, 2015
None.
4.1.2 Image Control Notification (ICN)
The ICN feedback message is identified by RTCP packet type value
PT=PSFB and FMT=ICN.
[Note to RFC Editor: Please replace ICN with the IANA provided
feedback message type for this message.]
The FCI field MUST contain one ICR FCI entries.
4.1.2.1 Message Format
The content of the FCI entry for the Image Control Notification is
showed in Figure 3. The length of the feedback message MUST be set to
3.
0 1 2 3
0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Seq nr. | R | reserved |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
Figure 3: Syntax of an FCI Entry in the ICN Message
Seq nr.: 8 bits
The corresponding request sequence number from the ICR that
resulted in this notification.
Result (R): 2 bits
This field indicates the result that the media sender deals with
the ICR message.
R=0: the media sender applies the image change successfully.
R=1: The values of the ICR message are invalid.
R=2: The media sender does not support the image change.
R=3: Reserved.
Reserved: 22 bits
All bits SHALL be set to 0 by the sender and SHALL be ignored on
<R. Huang, et al> Expires January 7, 2016 [Page 7]
INTERNET DRAFT <RTCP Feedback Image Control> July 6, 2015
reception.
4.1.2.2 Semantics
This feedback message is used to acknowledge the reception of a ICR.
For each ICR received at the media source, a ICN message SHALL be
sent. The notification message SHALL also be sent in response to ICR
repetitions received. If the request receiver has received ICR with
several different sequence numbers from a single requestor, it SHALL
only respond to the request with the highest (modulo 265) sequence
number which may be a smaller integer value due to the wrapping of
the field.
Within the common packet header for feedback messages (as defined in
section 6.1 of [RFC4585]),the "SSRC of packet sender" field indicates
the source of the notification, and the "SSRC of media source"
indicates the media source applying the change to.
4.1.2.3. Timing Rules
The timing follows the rules outlined in section 3 of [RFC4585]. This
request message is not time critical and SHOULD be sent using regular
RTCP timing.
4.1.1.4 Handling of Message in Mixers and Translators
A mixer of media translator that acts upon a ICR SHALL send the
corresponding ICN. In cases where it needs to forward a ICR itself,
the notification message MAY need to be delayed until the ICRhas been
responded to.
4.1.1.5 Remarks
None.
5 Security Considerations
TBD
6 IANA Considerations
TBD
7 Acknowledgments
TBD
<R. Huang, et al> Expires January 7, 2016 [Page 8]
INTERNET DRAFT <RTCP Feedback Image Control> July 6, 2015
8 References
8.1 Normative References
[KEYWORDS] Bradner, S., "Key words for use in RFCs to Indicate
Requirement Levels", BCP 14, RFC 2119, March 1997.
[RFC3550] Schulzrinne, H., Casner, S., Frederick, R., and V.
Jacobson, "RTP: A Transport Protocol for Real-Time
Applications", STD 64, RFC 3550, July 2003.
[RFC4585] Ott, J., Wenger, S., Sato, N., Burmeister, C., and J. Rey,
"Extended RTP Profile for Real-time Transport Control
Protocol (RTCP)-Based Feedback (RTP/AVPF)", RFC 4585, July
2006.
[RFC5104] Wenger, S., Chandra, U., Westerlund, M., and B. Burman,
"Codec Control Messages in the RTP Audio-Visual Profile
with Feedback (AVPF)", RFC 5104, February 2008.
8.2 Informative References
TBD
Authors' Addresses
Rachel Huang
Huawei
101 Software Avenue, Yuhua District
Nanjing 210012
China
Email: rachel.huang@huawei.com
Roni Even
Huawei
14 David Hamelech
Tel Aviv 64953
Israel
Email: roni.even@mail01.huawei.com
<R. Huang, et al> Expires January 7, 2016 [Page 9]