Dispatch | D.R. Worley |
Internet-Draft | Ariadne |
Intended status: Informational | June 25, 2013 |
Expires: December 27, 2013 |
Session Initiation Protocol Service Example -- Music on Hold
draft-worley-service-example-13
The "music on hold" feature is one of the most desired features of telephone systems in the business environment. "Music on hold" is where, when one party to a call has the call "on hold", that party's telephone provides an audio stream (often music) to be heard by the other party. Architectural features of SIP make it difficult to implement music-on-hold in a way that is fully compliant with the standards. The implementation of music-on-hold described in this document is fully effective and standards-compliant, and has a number of advantages over the methods previously documented. In particular, it is less likely to produce peculiar user interface effects and more likely to work in systems which perform authentication than the music-on-hold method described in section 2.3 of RFC 5359.
This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79.
Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet-Drafts is at http://datatracker.ietf.org/drafts/current/.
Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress."
This Internet-Draft will expire on December 27, 2013.
Copyright (c) 2013 IETF Trust and the persons identified as the document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (http://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Simplified BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Simplified BSD License.
Within SIP[sip]-based systems, it is desirable to be able to provide features that are similar to those provided by traditional telephony systems. A frequently requested feature is "music on hold": The music-on-hold feature is where, when one party to a call has the call "on hold", that party's telephone provides an audio stream (often music) to be heard by the other party.
Architectural features of SIP make it difficult to implement music-on-hold in a way that is fully compliant with the standards. The purpose of this document is to describe a method that is reasonably simple yet fully effective and standards-compliant.
The "intended status" of this document is "Informational". The reason that it is not "Best Current Practice" is that this method is not specified as "best", nor is this specification intended to supersede all other methods for implementing music-on-hold.
All current methods of implementing music-on-hold interoperate with each other, in that the two user agents in a call can use different methods for implementing music-on-hold with the same functionality as if either of the methods was used by both user agents. Thus, there is no loss of functionality if different music-on-hold methods are used by different user agents within a telephone system, or if a single user agent uses different methods within different calls, or at different times within one call.
However, this method has significant advantages over other methods now in use, as described in Section 3.
The essence of the technique is that when the executing UA (the user's UA) performs a re-INVITE of the remote UA (the other user's UA) to establish the hold state, it provides no SDP[sdp] offer[offer-answer][offer-answer-bis], thus compelling the remote UA to provide an SDP offer. The executing UA then extracts the offer SDP from the remote UA's 2xx response, and uses that as the offer SDP in a new INVITE to the external media source. The external media source is thus directed to provide media directly to the remote UA. The media source's answer SDP is returned to the remote UA in the ACK to the re-INVITE.
This section shows a message flow which is an example of this technique. The scenario is: Alice establishes a call with Bob. Bob then places the call on hold, with music-on-hold provided from an external source. Bob then takes the call off hold. In this scenario, Bob's user agent is the executing UA, while Alice's UA is the remote UA. Note that this is just one possible message flow that illustrates this technique; numerous variations on these operations are allowed by the applicable standards.
Alice Bob Music Source Alice establishes the call: | | | | INVITE F1 | | |--------------->| | | 180 Ringing F2 | | |<---------------| | | 200 OK F3 | | |<---------------| | | ACK F4 | | |--------------->| | | RTP | | |<==============>| | | | | Bob places Alice on hold, compelling Alice's UA to provide SDP: | | | | INVITE F5 | | | (no SDP) | | |<---------------| | | 200 OK F6 | | | (SDP offer) | | |--------------->| | | | | Bob's UA initiates music-on-hold: | | | | | INVITE F7 | | | (SDP offer, | | | rev. hold) | | |------------->| | | 200 OK F8 | | | (SDP answer, | | | hold) | | |<-------------| | | ACK F9 | | |------------->| | | | Bob's UA provides an SDP answer containing the address/port of the Music Source: | | | | ACK F10 | | | (SDP answer, | | | hold | | |<---------------| | | no RTP | | |<..............>| | | Music-on-hold RTP | |<==============================| | | | The music on hold is active. Bob takes Alice off hold: | | | | INVITE F11 | | | (SDP offer) | | |<---------------| | | 200 OK F12 | | | (SDP answer) | | |--------------->| | | ACK F13 | | |<---------------| | | | BYE F14 | | |------------->| | | 200 F15 | | |<-------------| | RTP | | |<==============>| | | | | The normal media session between Alice and Bob is resumed.
/* Alice calls Bob. */ F1 INVITE Alice -> Bob INVITE sips:bob@biloxi.example.com SIP/2.0 Via: SIP/2.0/TLS atlanta.example.com:5061 ;branch=z9hG4bK74bf9 Max-Forwards: 70 From: Alice <sips:alice@atlanta.example.com>;tag=1234567 To: Bob <sips:bob@biloxi.example.com> Call-ID: 12345600@atlanta.example.com CSeq: 1 INVITE Contact: <sips:a8342043f@atlanta.example.com;gr> Allow: INVITE, ACK, CANCEL, OPTIONS, BYE, REFER, NOTIFY Supported: replaces, gruu Content-Type: application/sdp Content-Length: [omitted] v=0 o=alice 2890844526 2890844526 IN IP4 atlanta.example.com s= c=IN IP4 atlanta.example.com t=0 0 m=audio 49170 RTP/AVP 0 a=rtpmap:0 PCMU/8000 F2 180 Ringing Bob -> Alice SIP/2.0 180 Ringing Via: SIP/2.0/TLS atlanta.example.com:5061 ;branch=z9hG4bK74bf9 ;received=192.0.2.103 From: Alice <sips:alice@atlanta.example.com>;tag=1234567 To: Bob <sips:bob@biloxi.example.com>;tag=23431 Call-ID: 12345600@atlanta.example.com CSeq: 1 INVITE Contact: <sips:bob@biloxi.example.com> Content-Length: 0 F3 200 OK Bob -> Alice SIP/2.0 200 OK Via: SIP/2.0/TLS atlanta.example.com:5061 ;branch=z9hG4bK74bf9 ;received=192.0.2.103 From: Alice <sips:alice@atlanta.example.com>;tag=1234567 To: Bob <sips:bob@biloxi.example.com>;tag=23431 Call-ID: 12345600@atlanta.example.com CSeq: 1 INVITE Contact: <sips:bob@biloxi.example.com> Allow: INVITE, ACK, CANCEL, OPTIONS, BYE, REFER, NOTIFY Supported: replaces Content-Type: application/sdp Content-Length: [omitted] v=0 o=bob 2890844527 2890844527 IN IP4 biloxi.example.com s= c=IN IP4 biloxi.example.com t=0 0 m=audio 3456 RTP/AVP 0 a=rtpmap:0 PCMU/8000 F4 ACK Alice -> Bob ACK sips:bob@biloxi.example.com SIP/2.0 Via: SIP/2.0/TLS atlanta.example.com:5061 ;branch=z9hG4bK74bfd Max-Forwards: 70 From: Alice <sips:alice@atlanta.example.com>;tag=1234567 To: Bob <sips:bob@biloxi.example.com>;tag=23431 Call-ID: 12345600@atlanta.example.com CSeq: 1 ACK Allow: INVITE, ACK, CANCEL, OPTIONS, BYE, REFER, NOTIFY Supported: replaces Content-Length: 0 /* Bob places Alice on hold. */ /* The re-INVITE contains no SDP, thus compelling Alice's UA to provide an offer. */ F5 INVITE Bob -> Alice INVITE sips:a8342043f@atlanta.example.com;gr SIP/2.0 Via: SIP/2.0/TLS biloxi.example.com:5061 ;branch=z9hG4bK874bk To: Alice <sips:alice@atlanta.example.com>;tag=1234567 From: Bob <sips:bob@biloxi.example.com>;tag=23431 Call-ID: 12345600@atlanta.example.com CSeq: 712 INVITE Contact: <sips:bob@biloxi.example.com>;+sip.rendering="no" Allow: INVITE, ACK, CANCEL, OPTIONS, BYE, REFER, NOTIFY Supported: replaces Content-Length: 0 /* Alice's UA provides an SDP offer. Since it does not know that it is being put on hold, the offer is the same as the original offer and describes bidirectional media. */ F6 200 OK Alice -> Bob SIP/2.0 200 OK Via: SIP/2.0/TLS biloxi.example.com:5061 ;branch=z9hG4bK874bk ;received=192.0.2.105 To: Alice <sips:alice@atlanta.example.com>;tag=1234567 From: Bob <sips:bob@biloxi.example.com>;tag=23431 Call-ID: 12345600@atlanta.example.com CSeq: 712 INVITE Contact: <sips:a8342043f@atlanta.example.com;gr> Allow: INVITE, ACK, CANCEL, OPTIONS, BYE, REFER, NOTIFY Supported: replaces, gruu Content-Type: application/sdp Content-Length: [omitted] v=0 o=alice 2890844526 2890844526 IN IP4 atlanta.example.com s= c=IN IP4 atlanta.example.com t=0 0 m=audio 49170 RTP/AVP 0 a=rtpmap:0 PCMU/8000 a=active /* Bob's UA initiates music-on-hold. */ /* This INVITE contains Alice's offer, but with the media direction set to "reverse hold", receive-only. */ F7 INVITE Bob -> Music Source INVITE sips:music@source.example.com SIP/2.0 Via: SIP/2.0/TLS biloxi.example.com:5061 ;branch=z9hG4bKnashds9 Max-Forwards: 70 From: Bob <sips:bob@biloxi.example.com>;tag=02134 To: Music Source <sips:music@source.example.com> Call-ID: 4802029847@biloxi.example.com CSeq: 1 INVITE Contact: <sips:bob@biloxi.example.com> Allow: INVITE, ACK, CANCEL, OPTIONS, BYE, REFER, NOTIFY Supported: replaces, gruu Content-Type: application/sdp Content-Length: [omitted] v=0 o=bob 2890844534 2890844534 IN IP4 atlanta.example.com s= c=IN IP4 atlanta.example.com t=0 0 m=audio 49170 RTP/AVP 0 a=rtpmap:0 PCMU/8000 a=recvonly F8 200 OK Music Source -> Bob SIP/2.0 200 OK Via: SIP/2.0/TLS biloxi.example.com:5061 ;branch=z9hG4bKnashds9 ;received=192.0.2.105 From: Bob <sips:bob@biloxi.example.com>;tag=02134 To: Music Source <sips:music@source.example.com>;tag=56323 Call-ID: 4802029847@biloxi.example.com Contact: <sips:music@source.example.com>;automaton ;+sip.byeless;+sip.rendering="no" CSeq: 1 INVITE Content-Length: [omitted] v=0 o=MusicSource 2890844576 2890844576 IN IP4 source.example.com s= c=IN IP4 source.example.com t=0 0 m=audio 49170 RTP/AVP 0 a=rtpmap:0 PCMU/8000 a=sendonly F9 ACK Bob -> Music Source ACK sips:music@source.example.com SIP/2.0 Via: SIP/2.0/TLS source.example.com:5061 ;branch=z9hG4bK74bT6 From: Bob <sips:bob@biloxi.example.com>;tag=02134 To: Music Source <sips:music@source.example.com>;tag=56323 Max-Forwards: 70 Call-ID: 4802029847@biloxi.example.com CSeq: 1 ACK Content-Length: 0 /* Bob's UA now sends the ACK that completes the re-INVITE to Alice and completes the SDP offer/answer. The ACK contains the SDP received from the Music Source, and thus contains the address/port from which the Music Source will send media, and implies the address/port which the Music Source will use to send/receive RTCP. */ F10 ACK Bob -> Alice ACK sips:a8342043f@atlanta.example.com;gr SIP/2.0 Via: SIP/2.0/TLS biloxi.example.com:5061 ;branch=z9hG4bKq874b To: Alice <sips:alice@atlanta.example.com>;tag=1234567 From: Bob <sips:bob@biloxi.example.com>;tag=23431 Call-ID: 12345600@atlanta.example.com CSeq: 712 ACK Contact: <sips:bob@biloxi.example.com>;+sip.rendering="no" Allow: INVITE, ACK, CANCEL, OPTIONS, BYE, REFER, NOTIFY Supported: replaces Content-Length: [omitted] v=0 o=bob 2890844527 2890844528 IN IP4 biloxi.example.com s= c=IN IP4 source.example.com t=0 0 m=audio 49170 RTP/AVP 0 a=rtpmap:0 PCMU/8000 a=sendonly /* Bob picks up the call by sending a re-INVITE to Alice. */ F11 INVITE Bob -> Alice INVITE sips:a8342043f@atlanta.example.com;gr SIP/2.0 Via: SIP/2.0/TLS biloxi.example.com:5061 ;branch=z9hG4bK874bk To: Alice <sips:alice@atlanta.example.com>;tag=1234567 From: Bob <sips:bob@biloxi.example.com>;tag=23431 Call-ID: 12345600@atlanta.example.com CSeq: 713 INVITE Contact: <sips:bob@biloxi.example.com> Allow: INVITE, ACK, CANCEL, OPTIONS, BYE, REFER, NOTIFY Supported: replaces Content-Type: application/sdp Content-Length: [omitted] v=0 o=bob 2890844527 2890844529 IN IP4 biloxi.example.com s= c=IN IP4 biloxi.example.com t=0 0 m=audio 3456 RTP/AVP 0 a=rtpmap:0 PCMU/8000 F12 200 OK Alice -> Bob SIP/2.0 200 OK Via: SIP/2.0/TLS biloxi.example.com:5061 ;branch=z9hG4bK874bk ;received=192.0.2.105 To: Alice <sips:alice@atlanta.example.com>;tag=1234567 From: Bob <sips:bob@biloxi.example.com>;tag=23431 Call-ID: 12345600@atlanta.example.com CSeq: 713 INVITE Contact: <sips:a8342043f@atlanta.example.com;gr> Allow: INVITE, ACK, CANCEL, OPTIONS, BYE, REFER, NOTIFY Supported: replaces, gruu Content-Type: application/sdp Content-Length: [omitted] v=0 o=alice 2890844526 2890844527 IN IP4 atlanta.example.com s= c=IN IP4 atlanta.example.com t=0 0 m=audio 49170 RTP/AVP 0 a=rtpmap:0 PCMU/8000 F13 ACK Bob -> Alice ACK sips:a8342043f@atlanta.example.com;gr SIP/2.0 Via: SIP/2.0/TLS biloxi.example.com:5061 ;branch=z9hG4bKq874b To: Alice <sips:alice@atlanta.example.com>;tag=1234567 From: Bob <sips:bob@biloxi.example.com>;tag=23431 Call-ID: 12345600@atlanta.example.com CSeq: 713 ACK Contact: <sips:bob@biloxi.example.com> Allow: INVITE, ACK, CANCEL, OPTIONS, BYE, REFER, NOTIFY Supported: replaces Content-Length: 0 F14 BYE Bob -> Music Source BYE sips:music@source.example.com SIP/2.0 Via: SIP/2.0/TLS biloxi.example.com:5061 ;branch=z9hG4bK74rf Max-Forwards: 70 From: Bob <sips:bob@biloxi.example.com>;tag=02134 To: Music Source <sips:music@source.example.com>;tag=56323 Call-ID: 4802029847@biloxi.example.com CSeq: 2 BYE Contact: <sips:bob@biloxi.example.com> Allow: INVITE, ACK, CANCEL, OPTIONS, BYE, REFER, NOTIFY Supported: replaces, gruu Content-Length: [omitted] F15 200 OK Music Source -> Bob SIP/2.0 200 OK Via: SIP/2.0/TLS atlanta.example.com:5061 ;branch=z9hG4bK74rf ;received=192.0.2.103 From: Bob <sips:bob@biloxi.example.com>;tag=02134 To: Music Source <sips:music@source.example.com>;tag=56323 Call-ID: 4802029847@biloxi.example.com Contact: <sips:music@source.example.com>;automaton ;+sip.byeless;+sip.rendering="no" CSeq: 2 BYE Content-Length: 0 /* Normal media session between Alice and Bob is resumed */
While the call is on-hold, the remote UA can send a request to modify the SDP or the feature parameters of its Contact header. This can be done with either an INVITE or UPDATE method, both of which have much the same effect in regard to MOH.
A common reason for a re-INVITE is when the remote UA desires to put the dialog on hold on its end. And because of the need to support this case, an implementation must process INVITEs and UPDATEs during the on-hold state as described below.
The executing UA handles these requests by echoing requests and responses: an incoming request from the remote UA causes the executing UA to send a similar request to the MOH source and an incoming response from the MOH source causes the executing UA to send a similar response to the remote UA. In all cases, SDP offers or answers that are received are added as bodies to the stimulated request or response to the other UA.
The passed-through SDP will usually need its o= line modified. The directionality attributes may need to be restricted by changing "active" to "recvonly" and "sendonly" to "inactive", as the executing UA will not render media from the remote UA. (If all passed-through directionality attributes are "inactive", the optimization described in section Section 2.10 may be applied.) In regard to payload type numbers, since the mapping has already been established within the MOH dialog, a=rtpmap lines need not be added.
The executing UA must be prepared to receive an INVITE request with a Replaces header that specifies the dialog with the remote UA. If the executing UA wants to create this new dialog in the on-hold state, it creates a new dialog with the MOH source to obtain MOH. The executing UA negotiates the SDP within the dialog created by the INVITE-with-Replaces by passing the offer through to the new MOH dialog (if the INVITE contains an offer), or by creating the new MOH dialog with an offerless INVITE (if the INVITE does not contain an offer).
Continuing the example of Section 2.3, the executing UA receives an INVITE-with-Replaces that contains an offer:
Alice Bob Music Source Carol Bob receives INVITE-with-Replaces from Carol: | | | | | | | INVITE/Replaces | | | | From: Carol | | | | To: Bob | | | | (SDP offer) | | |<-------------------------------| | | INVITE | | | | From: Bob | | | | To: Music Source | | | (SDP offer, | | | | rev. hold) | | | |------------->| | | | 200 OK | | | | From: Bob | | | | To: Music Source | | | (SDP answer, | | | | hold) | | | |<-------------| | | | ACK | | | | From: Bob | | | | To: Music Source | | |------------->| | | | | 200 OK | | | | From: Carol | | | | To: Bob | | | | (SDP answer, | | | | hold) | | |------------------------------->| | | | ACK | | | | From: Carol | | | | To: Bob | | |<-------------------------------| | | | Music-on-hold RTP | | |================>| | | | | Bob terminates the previous dialog with Alice: | | | | | BYE | | | | From: Bob | | | | To: Alice | | | |<---------------| | | | 200 OK | | | | From: Bob | | | | To: Alice | | | |--------------->| | | | | | | Bob terminates the MOH dialog for the dialog with Alice: | | | | | | BYE | | | | From: Bob | | | | To: Music Source | | |------------->| | | | 200 OK | | | | From: Music Source | | | To: Bob | | | |<-------------| | | | | | The new session continues on hold, between Bob and Carol.
The executing UA must be prepared to receive a REFER request within the dialog with the remote UA. The SDP within the dialog created by the REFER is negotiated by sending an offerless INVITE (or offerless re-INVITE) to the MOH source to obtain an offer, and then using that offer in the INVITE to the refer target.
Similar processing is used for an out-of-dialog REFER whose Target-Dialog header refers to the dialog with the remote UA.
Continuing the example of Section 2.3, the executing UA receives an INVITE-with-Replaces that contains an offer:
Bob receives REFER from Alice: Alice Bob Music Source Carol | | | | | REFER | | | | From: Bob | | | | To: Alice | | | | Refer-To: Carol| | | |--------------->| | | | | re-INVITE | | | | From: Bob | | | | To: Music Source | | | (no SDP) | | | |------------->| | | | 200 OK | | | | From: Bob | | | | To: Music Source | | | (SDP offer, | | | | hold) | | | |<-------------| | | | | INVITE | | | | From: Bob | | | | To: Carol | | | | (SDP offer, | | | | hold) | | |------------------------------->| | | | 200 OK | | | | From: Bob | | | | To: Carol | | | | (SDP answer, | | | | rev. hold) | | |------------------------------->| | | ACK | | | | From: Bob | | | | To: Music Source | | | (SDP answer, | | | | rev. hold) | | | |------------->| | | | | ACK | | | | From: Bob | | | | To: Carol | | |------------------------------->| | | | Music-on-hold RTP | | |================>| | | | | Bob terminates the previous dialog with Alice: | | | | | BYE | | | | From: Bob | | | | To: Alice | | | |<---------------| | | | 200 OK | | | | From: Bob | | | | To: Alice | | | |--------------->| | | | | | |
It is possible for the MOH source to send a re-INVITE or UPDATE request, and the executing UA can support doing so in similar manner as requests from the remote UA. However, if the MOH source is within the same administrative domain as the executing UA, the executing UA may have knowledge that the MOH source will not (or need not) make such requests, and so can respond to any such request with a failure response, avoiding the need to pass the request through. The 403 (Forbidden) response is suitable for this purpose because [sip] specifies that this response indicates "the request SHOULD NOT be repeated".
However, in an environment in which ICE[ice] is supported, the MOH source may need to send requests as part of ICE negotiation with the remote UA. Hence, in environments that support ICE, the executing UA must be able to pass through requests from the MOH source as well as requests from the remote UA.
Again, as SDP is passed through, its o= line will need to be modified. In some cases, the directionality attributes will need to be restricted.
In this technique, the MOH source generates an SDP answer that the executing UA presents to the remote UA as an answer within the original dialog. In basic functionality, this presents no problem, because [offer-answer] (section 6.1, at the very end) specifies that the payload type numbers used in either direction of RTP are the ones specified in the SDP sent by the recipient of the RTP. Thus, the MOH source will send RTP to the remote UA using the payload type numbers specified in the offer SDP it received (ultimately) from the remote UA.
But strict compliance to [offer-answer] (section 8.3.2) requires that payload type numbers used in SDP may only duplicate the payload type numbers used in any previous SDP sent in the same direction if the payload type numbers represent the same media format (codec) as they did previously. However, the MOH source has no knowledge of the payload type numbers previously used in the original dialog, and it may accidentally specify a different media format for a previously used payload type number in its answer (or in a subsequently generated INVITE or UPDATE). This would cause no problem with media decoding, as it cannot send any format that was not in the remote UA's offer, but it would violate [offer-answer].
Strictly speaking, it is impossible to avoid this problem because the generator of a first answer in its dialog can choose the payload numbers independently of the payload numbers in the offer, and the MOH server believes that its answer is first in the dialog. Thus the only absolute solution is to have the executing UA rewrite the SDP that passes through it to reassign payload type numbers, which would also require it to rewrite the payload type numbers in the RTP packets -- a very undesirable solution.
The difficulty solving this problem (and similar problems in other situations) argues that strict adherence should not be required to the rule that payload type numbers not be reused for different codecs.
The remainder of this section is devoted to describing a technique to eliminate this problem, in case it is of practical significance in some application. We do not expect that user agents would need to implement it in most applications.
We can construct a technique that will strictly adhere to the payload type rule by exploiting a SHOULD-level requirement in [offer-answer] (section 6.1): "In the case of RTP, if a particular codec was referenced with a specific payload type number in the offer, that same payload type number SHOULD be used for that codec in the answer." Or rather, we exploit the "implied requirement" that if a specific payload number in the offer is used for a particular codec, then the answer should not use that payload number for a different codec. If the MOH source obeys this restriction, the executing UA can modify the offer SDP to "reserve" all payload type numbers that have ever been offered by the executing UA to prevent the MOH source from using them for different media formats.
When the executing UA is composing the INVITE to the MOH source, it compiles a list of all the (dynamically-assigned) payload type numbers and associated media formats which have been used by it (or by MOH sources on its behalf) in the original dialog. (The executing UA must be maintaining a list of all previously used payload type numbers anyway, in order to comply with [offer-answer].)
Any payload type number that is present in the offer but has been used previously by the executing UA in the original dialog for a different media format is rewritten to describe a dummy media format. (One dummy media format name can be used for many payload type numbers as multiple payload type numbers can refer to the same media format.) A payload type number is added to describe the deleted media format, the number being either previously unused or previously used by the executing UA for that media format.
Any further payload type numbers which have been used by the executing UA in the original dialog but which are not mapped to a media format in the current offer are then mapped to a dummy media format.
The result is that the modified offer SDP:
These properties are sufficient to force an MOH server that obeys the implied requirement to generate an answer that is a correct answer to the original offer and is also compatible with previous SDP from the executing UA.
Note that any re-INVITEs from the remote UA that the executing UA passes through to the MOH server require similar modification, as payload type numbers that the MOH server receives in past offers are not absolutely reserved against its use (as they have not been sent in SDP by the MOH server) nor is there a SHOULD-level proscription against using them in the current answer (as they do not appear in the current offer).
This should provide an adequate solution to the problems with payload type numbers, as it will fail only if (1) the remote UA is particular that other UAs follow the rule about not redefining payload type numbers, and (2) the MOH server does not follow the implied requirement of [offer-answer] section 6.1.
Let us show how this process works by modifying the example of Section 2.3 with this specific assignment of supported codecs: Section 2.3 are:
In this case, the SDP exchanges are:
The messages that are changed from
F1 INVITE Alice -> Bob INVITE sips:bob@biloxi.example.com SIP/2.0 Via: SIP/2.0/TLS atlanta.example.com:5061 ;branch=z9hG4bK74bf9 Max-Forwards: 70 From: Alice <sips:alice@atlanta.example.com>;tag=1234567 To: Bob <sips:bob@biloxi.example.com> Call-ID: 12345600@atlanta.example.com CSeq: 1 INVITE Contact: <sips:a8342043f@atlanta.example.com;gr> Allow: INVITE, ACK, CANCEL, OPTIONS, BYE, REFER, NOTIFY Supported: replaces, gruu Content-Type: application/sdp Content-Length: [omitted] v=0 o=alice 2890844526 2890844526 IN IP4 atlanta.example.com s= c=IN IP4 atlanta.example.com t=0 0 m=audio 49170 RTP/AVP 90 91 a=rtpmap:90 X/8000 a=rtpmap:91 Y/8000 F3 200 OK Bob -> Alice SIP/2.0 200 OK Via: SIP/2.0/TLS atlanta.example.com:5061 ;branch=z9hG4bK74bf9 ;received=192.0.2.103 From: Alice <sips:alice@atlanta.example.com>;tag=1234567 To: Bob <sips:bob@biloxi.example.com>;tag=23431 Call-ID: 12345600@atlanta.example.com CSeq: 1 INVITE Contact: <sips:bob@biloxi.example.com> Allow: INVITE, ACK, CANCEL, OPTIONS, BYE, REFER, NOTIFY Supported: replaces Content-Type: application/sdp Content-Length: [omitted] v=0 o=bob 2890844527 2890844527 IN IP4 biloxi.example.com s= c=IN IP4 biloxi.example.com t=0 0 m=audio 3456 RTP/AVP 90 92 a=rtpmap:90 X/8000 a=rtpmap:92 Z/8000 F6 200 OK Alice -> Bob SIP/2.0 200 OK Via: SIP/2.0/TLS biloxi.example.com:5061 ;branch=z9hG4bK874bk ;received=192.0.2.105 To: Alice <sips:alice@atlanta.example.com>;tag=1234567 From: Bob <sips:bob@biloxi.example.com>;tag=23431 Call-ID: 12345600@atlanta.example.com CSeq: 712 INVITE Contact: <sips:a8342043f@atlanta.example.com;gr> Allow: INVITE, ACK, CANCEL, OPTIONS, BYE, REFER, NOTIFY Supported: replaces, gruu Content-Type: application/sdp Content-Length: [omitted] v=0 o=alice 2890844526 2890844526 IN IP4 atlanta.example.com s= c=IN IP4 atlanta.example.com t=0 0 m=audio 49170 RTP/AVP 90 91 a=rtpmap:90 X/8000 a=rtpmap:91 Y/8000 a=active F7 INVITE Bob -> Music Source INVITE sips:music@source.example.com SIP/2.0 Via: SIP/2.0/TLS biloxi.example.com:5061 ;branch=z9hG4bKnashds9 Max-Forwards: 70 From: Bob <sips:bob@biloxi.example.com>;tag=02134 To: Music Source <sips:music@source.example.com> Call-ID: 4802029847@biloxi.example.com CSeq: 1 INVITE Contact: <sips:bob@biloxi.example.com> Allow: INVITE, ACK, CANCEL, OPTIONS, BYE, REFER, NOTIFY Supported: replaces, gruu Content-Type: application/sdp Content-Length: [omitted] v=0 o=bob 2890844534 2890844534 IN IP4 atlanta.example.com s= c=IN IP4 atlanta.example.com t=0 0 m=audio 49170 RTP/AVP 90 91 92 a=rtpmap:90 X/8000 a=rtpmap:91 Y/8000 a=rtpmap:92 x-reserved/8000 a=recvonly F8 200 OK Music Source -> Bob SIP/2.0 200 OK Via: SIP/2.0/TLS biloxi.example.com:5061 ;branch=z9hG4bKnashds9 ;received=192.0.2.105 From: Bob <sips:bob@biloxi.example.com>;tag=02134 To: Music Source <sips:music@source.example.com>;tag=56323 Call-ID: 4802029847@biloxi.example.com Contact: <sips:music@source.example.com>;automaton ;+sip.byeless;+sip.rendering="no" CSeq: 1 INVITE Content-Length: [omitted] v=0 o=MusicSource 2890844576 2890844576 IN IP4 source.example.com s= c=IN IP4 source.example.com t=0 0 m=audio 49170 RTP/AVP 91 a=rtpmap:91 Y/8000 a=sendonly
The executing UA may discover that either the remote UA or the MOH source wishes to use dialog/session liveness timers.[timers] Since the timers verify the liveness of dialogs, not sessions (despite the terminology of [timers]), the executing UA can support the timers on each dialog (to the remote UA and to the MOH source) independently. (If the executing UA becomes obliged to initiate a refresh transaction, it must send an offerless UPDATE or re-INVITE, as if it sends an offer, the remote element has the opportunity to provide an answer which is different from its previous SDP, which could not easily be conveyed to the other remote element.)
The directionality of the media stream in the SDP offer in an INVITE or re-INVITE to the music source can be "inactive" if the SDP offer from the remote UA was "sendonly" or "inactive". Generally, this happens when the remote UA also has put the call on hold and provided a directionality of "sendonly". In this situation, the executing UA can omit establishing the dialog with the music source (or can terminate the existing dialog with the music source).
If the executing UA uses this optimization, it creates the SDP answer itself, with directionality "inactive" and using its own RTP/RTCP ports, and returns that answer to the remote UA.
The executing UA must be prepared for the remote UA to send a re-INVITE with directionality "active" or "recvonly", in which case the executing UA must initiate a dialog with the music source, as described above.
There may be multiple media streams (multiple m= lines) in any of the the SDPs involved in the dialogs. As the SDPs are manipulated, each media description (each starting with an m= line) is manipulated as described above for a single media stream, largely independently of the manipulation of the other media streams. But there are some elaborations that the executing UA may implement to achieve specific effects.
If the executing UA desires to present only certain media types as on-hold media, when passing the offer SDP through, it can reject any particular media streams by setting the port number in the m= line to zero.[offer-answer] This ensures that the answer SDP will also have a rejection for that m= line.
If the executing UA wishes to provide its own on-hold media for a particular m= line, it can do so by providing the answer information for that m= line. The executing UA may decide to do this when the offer SDP is received (by modifying the m= line to rejected state when sending it to the music source), or upon receiving the answer from the music source and discovering that the m= line has been rejected.
The executing UA may not want to pass a rejected m= line from the music source to the remote UA (when the remote UA provided a non-rejected m= line), and instead provide an answer with directionality "inactive" (and specifying its own RTP/RTCP ports).
This technique for providing music-on-hold has advantages over other methods now in use:
Unnecessary failures can happen if SDP offerers do not always offer all media formats that they support. Doing so is considered best practice ([offer-answer-bis] sections 5.1 and 5.3), but some SIP elements offer only formats that have already been in use in the dialog.
An example of how omitting media formats in an offer can lead to failure is as follows: Suppose that the UAs in Section 2.3 each support the following media formats:
In this case, the SDP exchanges are:
Note that in exchange 2, if Alice assumes that because only format X is currently in use that she should offer only X, the exchange fails. In exchange 3, Bob offers formats X and Z, even though neither is in use at the time (because Bob is not involved in the media streams).
Many UAs provide MOH in the interval during which it is processing a blind transfer, between receiving the REFER and receiving the final response to the stimulated INVITE. This process involves switching the user's interface between three media sources: (1) the session of the original dialog, (2) the session with the MOH server, and (3) the session of the new dialog; it involves a number of race conditions that must be handled correctly. If the UA is a B2BUA whose "other side" is maintaining a single dialog with another UA, each switching of media sources potentially causes a re-INVITE transaction within the other-side dialog. Since re-INVITEs take time and must be sequenced correctly ([sip] section 14), such a B2BUA must allow the events on each side to be non-synchronous and must coordinate them correctly. Failing to do so will lead to "glare" errors (491 or 500), leaving the other-side UA not rendering the correct session.
Some UAs filter incoming media based on the address of origin as a media security measure. The technique described in this document ensures that any UA that should render MOH media will be informed of the source address of the media via the SDP that it receives. This should allow such UAs to filter without interfering with MOH operation.
The original version of this proposal was derived from [service-examples-11] section 2.3 and the similar implementation of MOH in the Snom UA. Significant improvements to the sequence of operations, allowing improvements to the SDP handling, were suggested by Venkatesh[venkatesh].
John Elwell[elwell] pointed out the need for the executing UA to pass through re-INVITEs/UPDATEs in order to allow ICE negotiation, suggested mentioning the role of RTCP listening ports, suggested the possibility of omitting the dialog to the music source if the directionality would be "inactive", and pointed that if there are multiple media streams, the executing UA may want to select which streams receive MOH.
Paul Kyzivat[kyzivat-1][kyzivat-2] pointed out the difficulties regarding reuse of payload type numbers and considerations that could be used to avoid those difficulties, leading to the writing of Section 2.8.
Paul Kyzivat suggested adding Section 4.1 showing why offerers should always include all supported formats.
M. Ranganathan pointed out the difficulties experienced by a B2BUA (Section 4.2) due to the multiple changes of media source.
Section 4.1 was significantly clarified based on advice from Attila Sipos[sipos].
The need to discuss dialog/session timers[Section 2.9] was pointed out by Rifaat Shekh-Yusef[shekh-yusef].
Robert Sparks clarified the purpose of the "Best Common Practice" status, leading to revising the intended status of this document to "Informational".[Section 1.1]
Numerous improvements to the text were due to reviewers, including Rifaat Shekh-Yusef and Richard Barnes.
[Note to RFC Editor: Please remove this entire section upon publication as an RFC.]
Removed the original "Example Message Flow" and promoted the "Alternative Example Message Flow" to replace it because of a number of flaws that were found during the discussion of -00 on the SIPPING mailing list.
Described the use of the sip.rendering feature parameter to indicate on-hold status.
Added discussion of passing though re-INVITEs and UPDATEs.
Added discussion of payload type numbers.
Added Acknowledgments section.
Added Section 4.1 showing the importance of the offerer always including all supported media formats.
Updated references.
Revised handling of payload type numbers when passing offer to MOH server Section 2.8, based on observations by Paul Kyzivat.
Added Section 4.2 discussing handling of re-INVITEs by B2BUAs when using this method.
Added "avoidance of out-of-dialog REFER" as an advantage.Section 3
Added "automaton", "sip.rendering", and "sip.byeless" feature tags to the Contact URI of the Music Server in the examples.[dialog-event][ua-capabilities]
Added initial discussion of dialog/session timer support.Section 2.9
Revised handling of payload type numbers based on further observations by Paul Kyzivat[kyzivat-2].
Changed references to "SPIT" to refer to "media security", per suggestion by Scott Lawrence.
Removed reference to the idea of having the executing UA not maintain session timers itself, but rather, passing through session timer negotiation and updates. Examination showed this idea to be much more complex to implement than having the executing UA terminate session timers itself for both dialogs. (Suggested by Rifaat Shekh-Yusef.)
On advice from Robert Sparks, changed the "intended status" from "BCP" to "Informational", and added a section to explain the change.
Noted that the rule on not reusing payload type numbers is undesirable because it complicates third-party operations (as noted by Paul Kyzivat[kyzivat-3]).
Updated author's contact information.
On suggestion from John Elwell, added mention that the Music Source's SDP address/port implies its RTCP address/port, which will be used to receive RTCP.
Updated references to [service-examples] and [service-examples-11] to specify the sections of documents, which are the ones that discuss music on hold.
Update reference to [offer-answer-bis] to refer to the -18 version.
Sections Section 2.10 and Section 2.11 added at the suggestion of John Elwell.
Update reference to [offer-answer-bis] to refer to RFC 6337.
Renew Internet-Draft, no changes.
Renew Internet-Draft, no changes.
Numerous improvements resulting from Rifaat Shekh-Yusef's review, includng:
Adding examples for how the executing UA processes INVITE-with-Replaces and REFER directed toward the dialog with the remote UA.
Recommending a 403 response to a re-INVITE/UPDATE request from the MOH server if the UA knows that it need not be acted upon.
Improvements resulting from Richard Barnes' review.
Update the Acknowledgments to credit the reviewers.
[offer-answer] | Rosenberg, J. and H. Schulzrinne, "An Offer/Answer Model with the Session Description Protocol (SDP)", RFC 3264, June 2002. |
[sdp] | Handley, M., Jacobson, V. and C. Perkins, "SDP: Session Description Protocol", RFC 4566, July 2006. |
[sip] | Rosenberg, J., Schulzrinne, H., Camarillo, G., Johnston, A., Peterson, J., Sparks, R., Handley, M. and E. Schooler, "SIP: Session Initiation Protocol", RFC 3261, June 2002. |
[timers] | Donovan, S. and J. Rosenberg, "Session Timers in the Session Initiation Protocol (SIP)", RFC 4028, April 2005. |