Internet DRAFT - draft-hwy-opsawg-ifl-framework
draft-hwy-opsawg-ifl-framework
OPSAWG Working Group L. Han
Internet-Draft M. Wang
Intended status: Informational China Mobile
Expires: 28 January 2024 X. Wang
T. Zhou
Huawei
27 July 2023
Inband Flow Learning Framework
draft-hwy-opsawg-ifl-framework-04
Abstract
On-path telemetry techniques can provide high-precision inband flow
insight and real-time network performance monitoring by embedding
instructions or metadata into user packets. They are benificial but
still has problems of deployability and flexibility in large scale
deployment scenario. This document proposes a reference framework
called Inband Flow Learning (IFL), which outlines the architecture
and functional modules for automatic deployment and adjustment of
flow-oriented monitoring using on-path telemetry techniques, trying
to provide a solution for reference to solve the problems. This
document also provides different deployment approaches and
considerations in practical network deployment.
Requirements Language
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
"SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and
"OPTIONAL" in this document are to be interpreted as described in BCP
14 [RFC2119] [RFC8174] when, and only when, they appear in all
capitals, as shown here.
Status of This Memo
This Internet-Draft is submitted in full conformance with the
provisions of BCP 78 and BCP 79.
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF). Note that other groups may also distribute
working documents as Internet-Drafts. The list of current Internet-
Drafts is at https://datatracker.ietf.org/drafts/current/.
Internet-Drafts are draft documents valid for a maximum of six months
and may be updated, replaced, or obsoleted by other documents at any
time. It is inappropriate to use Internet-Drafts as reference
material or to cite them other than as "work in progress."
Han, et al. Expires 28 January 2024 [Page 1]
Internet-Draft Inband Flow Learning Framework July 2023
This Internet-Draft will expire on 28 January 2024.
Copyright Notice
Copyright (c) 2023 IETF Trust and the persons identified as the
document authors. All rights reserved.
This document is subject to BCP 78 and the IETF Trust's Legal
Provisions Relating to IETF Documents (https://trustee.ietf.org/
license-info) in effect on the date of publication of this document.
Please review these documents carefully, as they describe your rights
and restrictions with respect to this document. Code Components
extracted from this document must include Revised BSD License text as
described in Section 4.e of the Trust Legal Provisions and are
provided without warranty as described in the Revised BSD License.
Table of Contents
1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . 2
2. Terminology and Conventions . . . . . . . . . . . . . . . . . 3
2.1. Requirement Language . . . . . . . . . . . . . . . . . . 3
2.2. Terminology . . . . . . . . . . . . . . . . . . . . . . . 3
3. Framework of Inband Flow Learning . . . . . . . . . . . . . . 3
3.1. Service Discovery . . . . . . . . . . . . . . . . . . . . 4
3.2. Inband Flow Information Telemetry Deployment . . . . . . 5
3.2.1. Telemetry Mode . . . . . . . . . . . . . . . . . . . 5
3.2.2. Telemetry Policy . . . . . . . . . . . . . . . . . . 6
3.2.3. Telemetry Instance . . . . . . . . . . . . . . . . . 6
4. Inband Flow Information Telemetry Adjustment . . . . . . . . 7
5. IANA Considerations . . . . . . . . . . . . . . . . . . . . . 8
6. Security Considerations . . . . . . . . . . . . . . . . . . . 8
7. References . . . . . . . . . . . . . . . . . . . . . . . . . 8
7.1. Normative References . . . . . . . . . . . . . . . . . . 8
7.2. Informative References . . . . . . . . . . . . . . . . . 8
Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . 9
1. Introduction
On-path telemetry techniques described in
[I-D.song-opsawg-ifit-framework] such as IOAM [RFC9197] and
Alternate-Marking [RFC9341] can provide high-precision inband flow
insight and real-time network performance monitoring (e.g., jitter,
latency, packet loss) by embedding instructions or metadata into user
packets. They are benificial for network operation to monitor live
traffic running in the network, based on inband flow information
telemetry on the entire forwarding path.
Han, et al. Expires 28 January 2024 [Page 2]
Internet-Draft Inband Flow Learning Framework July 2023
However, when deploying flow-oriented monitoring using on-path
telemetry techniques on live traffic, problems like changes of flow
characteristics or paths may occur whitch make the traditional static
configuration mode no longer applicable.
[I-D.hwyh-ippm-ps-inband-flow-learning] states problems of flow
identification applying on-path telemetry techniques in real network
scenarios, and describes the requirements for inband flow learning
mechanism whitch intends to address the problems of deployability and
flexibility. This document proposes a reference framework called
Inband Flow Learning (IFL), which outlines the architecture and
functional modules for automatic deployment and adjustment of flow-
oriented monitoring using on-path telemetry techniques. This
document also provides different deployment approaches and
considerations in practical network deployment. Note that this
document focuses on the generation of inband flow telemetry object,
and inband flow performance measurement methods are out of the scope
of this document.
2. Terminology and Conventions
2.1. Requirement Language
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
"SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and
"OPTIONAL" in this document are to be interpreted as described in BCP
14 [RFC2119] [RFC8174] when, and only when, they appear in all
capitals, as shown here.
2.2. Terminology
IFL: Inband Flow Learning
IFITI: Inband Flow Information Telemetry Instance
3. Framework of Inband Flow Learning
The domain of inband flow information telemetry consists of ingress
nodes, transit nodes and egress nodes. The ingress nodes are
responsible for enabling monitoring functions and the egress nodes
are responsible for terminating them. All the nodes in the domain
may participate in the inband flow learning by excecuting
corresponding functions in the framework of Inband Flow Learning
(IFL). The framework of IFL includes three components of Service
Discovery, Inband Flow Information Telemetry Deployment and Inband
Flow Information Telemetry Adjustment shown in Figure 1. Among these
different components, inband flow learning can be embodied in
Han, et al. Expires 28 January 2024 [Page 3]
Internet-Draft Inband Flow Learning Framework July 2023
automatic service discovery, automatic flow telemetry deployment, and
automatic flow telemetry adjustment.
+---------+-------------------+------------------+------------------+
|Component| Service | Inband Flow | Inband Flow |
| | Discovery | Information | Information |
| | | Telemetry | Telemetry |
| | | Deployment | Adjustment |
+---------+-------------------+------------------+------------------+
|Functions| Sampling polic | Telemetry policy | |
| |-------------------+------------------+ Aging |
| |Flow characteristic|Telemetry instance| |
| | acquisition | | |
+---------+-------------------+------------------+------------------+
Figure 1 Framework of Inband Flow Learning
3.1. Service Discovery
Before starting the telemetry on service flows, the service should be
discovered in order to further determine which flow should be
monitored. The target of service discovery function is to obtain the
flow characteristics, whitch are represented in terms of IP source
address, IP destination address, TCP/UDP port number, VRF, incoming/
outgoing interface etc.
Automatic service discovery is implemented based on the sampling
policy delivered by the control plane and flow characteristic
acquisition on the forwarding plane, whitch is usually performed on
the ingress node. Sampling policy is a set of rules that instruct
the forwarding plane to identify service flow characteristics based
on a specific scope. Flow characteristic acquisition is a process in
which the forwarding plane identifies, extracts, and reports service
flow characteristic on the live traffic based on the sampling policy.
For example, if the service traffic to be monitored has a particular
port number, to automatically discover all flows of the service
identified by 5-tuple, a sampling policy can be configured to match
the live traffic with the particular port number and generate flow
information at the 5-tuple granularity. When live traffic passes
through the ingress node, the forwarding plane can filters traffic
based on the specified sampling policy, identifies all flows with the
particular port number, and reports the flows with 5-tuple
information. The automatically discovered service flow information
can be stored distributedly on the ingress node, or reported to the
newwork controller for centralized management.
Han, et al. Expires 28 January 2024 [Page 4]
Internet-Draft Inband Flow Learning Framework July 2023
3.2. Inband Flow Information Telemetry Deployment
After acquiring the flow characteristics by service discovery,
telemetry based on the inband flow information can be deployed
automatically. Automatic flow telemetry deployment is implemented by
creating telemetry instances based on telemetry policy, and executed
on different types of network nodes in the domain according to the
telemetry mode.
3.2.1. Telemetry Mode
There are two modes to deploy inband flow information telemetry: End-
to-End (E2E) and Hop-by-Hop (HbH). For majority of the services, E2E
telemetry of service flows can meet the requirements of network
operators by providing the entire performance insight of the service.
In E2E mode shown in Figure 2, ingress node discovers the
characteristics of service flows and proceed on-path telemetry on the
flows to be monitored. Egress node need to deploy the same
monitoring flows and complete the telemetry. If the telemetry data
is not carried in the data packet but is reported at each node, flow
identifier is required to associate the data on data consumer.
Documents like [RFC9326] [RFC9343]
[I-D.ietf-mpls-inband-pm-encapsulation] provide the encapsulation
format of flow identifier.
+-------------+
|Data Consumer| compute E2E flow info
+-------------+
| |
___flow info__| |____flow info____
| telemetry telemetry |
| |
+---------+ +---------+ +---------+ +---------+
| Ingress |---| Transit | ...| Transit |---| Egress |
| Node | | Node | | Node | | Node |
+---------+ +---------+ +---------+ +---------+
Figure 2 End-to-End Telemetry Mode
The distinction of HbH mode to E2E mode is that transit node also
participates the inband flow information learning and telemetry. In
HbH mode shown in Figure 3, telemetry covers the flow information on
every node of the forwarding path the flow packet is transmitted,
which provides detailed flow information on each hop. Hop-by-Hop
telemetry usually works in the need of an on-demand fault diagnose.
Han, et al. Expires 28 January 2024 [Page 5]
Internet-Draft Inband Flow Learning Framework July 2023
+-------------+
|Data Consumer| compute HbH flow info
+-------------+
| | | | flow info telemetry
______________| | | |_________________
| ___| |___ |
| | | |
+---------+ +---------+ +---------+ +---------+
| Ingress |---| Transit | ...| Transit |---| Egress |
| Node | | Node | | Node | | Node |
+---------+ +---------+ +---------+ +---------+
Figure 3 Hop-by-Hop Telemetry Mode
3.2.2. Telemetry Policy
Telemetry policy is used to determine which flow should be monitored.
By configuring telemetry policy, it can increase the priority of
learning and telemetry to critical flow and reduce or filter the
learning and telemetry of unimportant flows. It is crucial to
network deployment for two reasons, one is the number of flows can be
huge, another is the limitation of processing capability either on
the controller or the network node. There might be millions of flows
in a large scale network, for example 5G mobile backhaul network. It
is important to wisely choose the granularity of inband flow
information telemetry.
Regarding IP traffics, the telemetry policy can be based on either
one of or combination of flow characteristics, such as IP source/
destination address, TCP/UDP port number, VRFs, or network device
interfaces etc. An IP address with a flexible wildcard mask can also
be used as means to provide telemetry policy to an aggregation of
flows.
3.2.3. Telemetry Instance
Inband Flow Information Telemetry Instance(IFITI), in short called
telemetry instance, is the management object of the monitored flow
for the deployment of flow-oriented on-path telemetry techniques
under the framework of IFL. During its life cycle, IFITI is
responsible for providing performance telemetry data on the nodes
that the flow it monitors traverses.
On ingress nodes IFITIs can be automatically generated in either
distributed or centralized way by implementing telemetry policies for
automatically discovered service flows. The transit nodes and egress
nodes can also automatically generate IFITIs by learning some special
information of the monitored flows whitch is embedded by the ingress
Han, et al. Expires 28 January 2024 [Page 6]
Internet-Draft Inband Flow Learning Framework July 2023
nodes without configuring flow characteristics. Flow identifier is
such special information whitch may be a unique value within a domain
encapsulated in the service packets to setup the relationship between
the characteristic information, telemetry instance and the service
flow. It can not only correlate the telemetry data of flows on each
node, as mentioned in the previous section, but also serve as the key
marker for the forwarding plane to identify the monitored flow. For
the forwarding plane, it is much easier to identify a piece of data
in a service packet than to identify various types of flow
characteristics.
The following uses flow identifier as an example to describe the flow
learning process on transit and egress node. Once the telemetry
instance is created, ingress node can start the telemetry of flow
information based on the method of on-path telemetry techniques. At
the same time, ingress node encodes inband monitoring information in
the service packets, including the identifier. When a service flow
packet passes through the transit node or egress node, if the node
detects that the packet contains a flow identifier, it considers that
the packet is a service flow packet to be monitored, and
automatically creates a telemetry instance using the identifier as
the key.
The automatic creation of telemetry instance on network node can
greatly facilitate the dynamic and incremental deployment. On all
types of nodes, network operators do not need to statically configure
characteristics of monitored flows, which saves a lot of workload and
reduces error probability in a large-scale deployment scenario. When
the path of the monitored flow changes, the monitored flow can be
automatically detected on the new path node and the corresponding
telemetry instance can be automatically deployed.
4. Inband Flow Information Telemetry Adjustment
When route convergence happens to the network, service flow may
switch to other forwarding nodes. When the traffic changes,
telemetry instance varies as well. Regarding the telemetry instance
running on the fault path, the aging of IFITI should be supported in
order to recycle the network resources. IFITI should be deleted once
it becomes stale. To monitor the same flow information, new
telemetry instance is required to add on the new transit or egress
node. Note that aging and adjustment of IFITI can be initiated by
controller or network node. When a specific timer used for flow
information telemetry timeout, the IFITI would be deleted to stop the
telemetry of the flow.
Han, et al. Expires 28 January 2024 [Page 7]
Internet-Draft Inband Flow Learning Framework July 2023
5. IANA Considerations
This document has no request to IANA
6. Security Considerations
TBD
7. References
7.1. Normative References
[RFC2119] Bradner, S., "Key words for use in RFCs to Indicate
Requirement Levels", BCP 14, RFC 2119,
DOI 10.17487/RFC2119, March 1997,
<https://www.rfc-editor.org/info/rfc2119>.
[RFC8174] Leiba, B., "Ambiguity of Uppercase vs Lowercase in RFC
2119 Key Words", BCP 14, RFC 8174, DOI 10.17487/RFC8174,
May 2017, <https://www.rfc-editor.org/info/rfc8174>.
7.2. Informative References
[I-D.hwyh-ippm-ps-inband-flow-learning]
Han, L., Wang, M., Wang, X., and J. Huang, "Problem
Statement and Requirement for Inband Flow Learning", Work
in Progress, Internet-Draft, draft-hwyh-ippm-ps-inband-
flow-learning-03, 27 July 2023,
<https://datatracker.ietf.org/api/v1/doc/document/draft-
hwyh-ippm-ps-inband-flow-learning/>.
[I-D.ietf-mpls-inband-pm-encapsulation]
Cheng, W., Min, X., Zhou, T., Dai, J., and Y. Peleg,
"Encapsulation For MPLS Performance Measurement with
Alternate Marking Method", Work in Progress, Internet-
Draft, draft-ietf-mpls-inband-pm-encapsulation-06, 14 June
2023, <https://datatracker.ietf.org/doc/html/draft-ietf-
mpls-inband-pm-encapsulation-06>.
[I-D.song-opsawg-ifit-framework]
Song, H., Qin, F., Chen, H., Jin, J., and J. Shin,
"Framework for In-situ Flow Information Telemetry", Work
in Progress, Internet-Draft, draft-song-opsawg-ifit-
framework-20, 24 April 2023,
<https://datatracker.ietf.org/doc/html/draft-song-opsawg-
ifit-framework-20>.
Han, et al. Expires 28 January 2024 [Page 8]
Internet-Draft Inband Flow Learning Framework July 2023
[RFC9197] Brockners, F., Ed., Bhandari, S., Ed., and T. Mizrahi,
Ed., "Data Fields for In Situ Operations, Administration,
and Maintenance (IOAM)", RFC 9197, DOI 10.17487/RFC9197,
May 2022, <https://www.rfc-editor.org/info/rfc9197>.
[RFC9326] Song, H., Gafni, B., Brockners, F., Bhandari, S., and T.
Mizrahi, "In Situ Operations, Administration, and
Maintenance (IOAM) Direct Exporting", RFC 9326,
DOI 10.17487/RFC9326, November 2022,
<https://www.rfc-editor.org/info/rfc9326>.
[RFC9341] Fioccola, G., Ed., Cociglio, M., Mirsky, G., Mizrahi, T.,
and T. Zhou, "Alternate-Marking Method", RFC 9341,
DOI 10.17487/RFC9341, December 2022,
<https://www.rfc-editor.org/info/rfc9341>.
[RFC9343] Fioccola, G., Zhou, T., Cociglio, M., Qin, F., and R.
Pang, "IPv6 Application of the Alternate-Marking Method",
RFC 9343, DOI 10.17487/RFC9343, December 2022,
<https://www.rfc-editor.org/info/rfc9343>.
Authors' Addresses
Liuyan Han
China Mobile
Beijing
China
Email: hanliuyan@chinamobile.com
Minxue Wang
China Mobile
Beijing
China
Email: wangminxue@chinamobile.com
Xuanxuan Wang
Huawei
Nanjing
China
Email: wxxuan@huawei.com
Tianran Zhou
Huawei
Beijing
China
Han, et al. Expires 28 January 2024 [Page 9]
Internet-Draft Inband Flow Learning Framework July 2023
Email: zhoutianran@huawei.com
Han, et al. Expires 28 January 2024 [Page 10]