janz-nmrg-performance-digital-twin-00.txt

Internet DRAFT - draft-janz-nmrg-performance-digital-twin
draft-janz-nmrg-performance-digital-twin

Last Version:	draft-janz-nmrg-performance-digital-twin-00.txt	Tracker Entry
Date:	`14-Mar-2023`
Disposition:	expired





Network Management Research Group                                C. Janz
Internet-Draft                                                    Y. You
Intended status: Informational                             Huawei Canada
Expires: 15 September 2023                                        A. Guo
                                                  Futurewei Technologies
                                                           14 March 2023


    Functional Design Aspects of Performance-Oriented Digital Twins
              draft-janz-nmrg-performance-digital-twin-00

Abstract

   Performance-Oriented Digital Twins (PODTs) provide "what-if" analysis
   - predictions of performance or, perhaps, of other behaviours in
   hypothetical situations of network, services, traffic, etc.  Key
   functional and design aspects of PODTs in support of multiple,
   concurrently-operating use case Management Plane (MP) applications
   are discussed.  Data and model management in concurrent session
   handling, inter-working with variable composition (from data and
   functional model perspectives) networks, performance and scalability
   are considered.

Status of This Memo

   This Internet-Draft is submitted in full conformance with the
   provisions of BCP 78 and BCP 79.

   Internet-Drafts are working documents of the Internet Engineering
   Task Force (IETF).  Note that other groups may also distribute
   working documents as Internet-Drafts.  The list of current Internet-
   Drafts is at https://datatracker.ietf.org/drafts/current/.

   Internet-Drafts are draft documents valid for a maximum of six months
   and may be updated, replaced, or obsoleted by other documents at any
   time.  It is inappropriate to use Internet-Drafts as reference
   material or to cite them other than as "work in progress."

   This Internet-Draft will expire on 15 September 2023.

Copyright Notice

   Copyright (c) 2023 IETF Trust and the persons identified as the
   document authors.  All rights reserved.

   This document is subject to BCP 78 and the IETF Trust's Legal
   Provisions Relating to IETF Documents (https://trustee.ietf.org/
   license-info) in effect on the date of publication of this document.



Janz, et al.            Expires 15 September 2023               [Page 1]

Internet-Draft     Performance-Oriented Digital Twins         March 2023


   Please review these documents carefully, as they describe your rights
   and restrictions with respect to this document.  Code Components
   extracted from this document must include Revised BSD License text as
   described in Section 4.e of the Trust Legal Provisions and are
   provided without warranty as described in the Revised BSD License.

Table of Contents

   1.  Introduction  . . . . . . . . . . . . . . . . . . . . . . . .   2
   2.  Terminology . . . . . . . . . . . . . . . . . . . . . . . . .   3
   3.  Data and Model Management Supporting Concurrent Scenario-Based
           Computational Sessions  . . . . . . . . . . . . . . . . .   4
   4.  Data and Model Management Supporting Differences and Evolutions
           in MP Applications and Physical Networks  . . . . . . . .   6
   5.  Performance-Related Functional Design Aspects . . . . . . . .   8
   6.  Functional Design Implications of NDT Functional Extension
           Beyond Analysis . . . . . . . . . . . . . . . . . . . . .   9
   7.  Conclusion  . . . . . . . . . . . . . . . . . . . . . . . . .  10
   8.  Manageability Considerations  . . . . . . . . . . . . . . . .  10
   9.  Security Considerations . . . . . . . . . . . . . . . . . . .  10
   10. IANA Considerations . . . . . . . . . . . . . . . . . . . . .  10
   11. Informative References  . . . . . . . . . . . . . . . . . . .  10
   Acknowledgments . . . . . . . . . . . . . . . . . . . . . . . . .  11
   Authors' Addresses  . . . . . . . . . . . . . . . . . . . . . . .  11

1.  Introduction

   A Network Digital Twin (NDT) - also referred to as a Digital Network
   Twin (DNT) - is a virtual replica of a real network (often referred
   to as a 'physical' network) that faithfully mimics the behaviours of
   the real network [I-D.draft-zhou-nmrg-digitaltwin-network-concepts].
   Its role may be limited to synthesizing behavioural information, such
   as performance predictions, for consumption by other MP functions;
   such analytical Performance-Oriented Digital Twins (PODTs) are
   considered in [I-D.draft-paillisse-nmrg-performance-digital-twin].
   Alternatively, the NDT may be held to encompass further management
   functions, such as closed loop components beyond data analysis, and
   thus to exercise active control over the physical network, services,
   etc.  In such cases, the functional equivalent of a PODT lies at the
   heart of the NDT.  Therefore, this draft focuses discussion on PODTs;
   considerations related to functional extension of the NDT role,
   beyond an analytical one, are considered in Section 6.

   In [I-D.draft-paillisse-nmrg-performance-digital-twin], PODT
   implementation challenges and certain implementation options -
   especially concerning key functional models - are discussed, as are
   required interfaces and related standards (Sections 4 and 7 of the
   reference, respectively).  This draft looks not at details of models



Janz, et al.            Expires 15 September 2023               [Page 2]

Internet-Draft     Performance-Oriented Digital Twins         March 2023


   or interfaces but rather at general aspects of functional design.
   Several functional design principles are considered that may apply
   generically to PODTs.

   One such functional design principle is based on the fact that the
   behavioural or performance prediction information, synthesized from
   data by a particular PODT, may be used by multiple use case-based MP
   applications, and these applications may - usefully - operate
   concurrently.  What is different among such applications - or even
   among successive predictions sought by a single application - is not
   the nature of the behavioural or performance information yielded.
   The difference lies, rather, in the details of the scenarios for
   which prediction information is sought.  A PODT supporting multiple
   concurrent MP functions and applications requires careful handling of
   data and modeling to accommodate differences in scenario-driven data
   used in (what amount to) parallel - or at least, different -
   computational sessions.  This aspect is considered in Section 3.

   Another important functional design principle is related to the fact
   that, in practice, there are likely to be - among real network
   instances for which use of the PODT is desired, or with respect to
   evolutions of a given real network instance over time - considerable
   variations in network details, including network size, equipment
   types used, available instrumentation or other data sources, etc.
   Additionally, differences among MP applications, in terms of the
   nature and degree of hypotheticals involved in the scenarios they
   generate for performance prediction, may also drive variations in
   data types, availability, precision, etc.  As a practical matter, a
   PODT should be able to accommodate such variations, while
   consistently providing the best possible scenario-based performance
   predictions in support of all MP applications.  This, too, has
   implications for data and model management.  This aspect is
   considered in Section 4.

   Finally, supporting multiple, concurrently operating MP applications
   - many of which (e.g.  optimization-related applications) may require
   performance prediction on large numbers of different, detailed
   scenarios - requires high-performance PODT implementations.  Models
   may be more or less complicated and computationally intensive; data
   consumed may be considerable and run to large archived volumes over
   time.  Performance and scalability considerations are discussed in
   Section 5.

2.  Terminology

   *  Network Digital Twin (NDT): a virtual replica of a real network
      (often referred to as a 'physical' network) that faithfully mimics
      the behaviours of the real network.



Janz, et al.            Expires 15 September 2023               [Page 3]

Internet-Draft     Performance-Oriented Digital Twins         March 2023


   *  Digital Network Twin (DNT): an alternative term for NDT.

   *  Performance-Oriented Digital Twin (PODT): An analytical NDT
      generates performance-oriented information based on "what-if"
      scenarios specified by other Management Plane applications.

   *  Optical Performance Digital Twin (OPDT): A PODT that generates
      optical transmission performance predictions.

3.  Data and Model Management Supporting Concurrent Scenario-Based
    Computational Sessions

   That multiple use case-based MP applications may rely on the same
   PODT-based analytical function and information that it synthesizes,
   is illustrated in
   [I-D.draft-paillisse-nmrg-performance-digital-twin]: for greater
   simplicity and coherence, what follows refers specifically to the use
   cases associated with the Optical Performance Digital Twin (OPDT)
   that are described in that reference (Section 6).  However, as
   discussed in the reference, several of these use cases have generic
   counter-parts (i.e. are applicable to networks other than optical
   transmission networks) and, additionally, further use case categories
   are considered.

   With respect to OPDTs,
   [I-D.draft-paillisse-nmrg-performance-digital-twin] describes the
   following use cases:

   (1) Optical service (re-)provisioning

   (2.a) Optical service/network risk planning
   (2.b) Optical service/network dynamic restoration planning

   (3) Optical service planning

   (4) Optical network planning (incremental/brown field to green field)

   (5) Optical services/network optimization

   Note that: (1) describes an active operation undertaken on deployed
   and operating optical networks, potentially in an automation or semi-
   automation context; (2.a) and (2.b) describe potentially coupled
   analysis & planning operations that may be undertaken periodically or
   on an event-driven basis on deployed and operating optical networks,
   potentially in an automation or semi-automation context; (3)
   describes a planning operation, that may be undertaken periodically
   or on an event-driven basis on deployed and operating optical
   networks, potentially in an automation or semi-automation context;



Janz, et al.            Expires 15 September 2023               [Page 4]

Internet-Draft     Performance-Oriented Digital Twins         March 2023


   (4) describes a planning operation, relevant to either deployed and
   operating optical networks or to hypothetical future optical
   networks.  In either context, optical service planning (3) may be an
   'embedded' function; (5) describes an 'embedded' function used in
   support of (2) through (4).

   All of these use cases may be handled by MP applications that rely on
   the same OPDT - or at least the same type and style of OPDT -
   generating the same type and format of transmission performance
   information (predictions) from the same fundamental data, using
   fundamentally the same models.  The differences consist in the
   detailed sourcing of data - in particular, between real network and
   service data and postulated, hypothetical data - and in the detailed
   usage of models.  For example, the OPDT-based modeling implicated in
   cases (1) and (3) above involves performance, status and other data
   strictly from the network as-deployed and operating, but also
   involves at least partly hypothetical postulated optical service
   states.  On the other hand, the OPDT-based modeling implicated in
   cases (2) and (4) involves performance, equipment specification and
   status, and other network data that is at least partly (and may be
   wholly - e.g. green field planning) hypothetical, as well as at least
   partly (and potentially wholly) hypothetical optical service states.
   All of these applications, and especially the potentially embedded
   application case (5), furthermore involve some degree of sequential
   or parallel presentation of multiple, partly differing scenarios for
   performance prediction.

   Accommodating these realities requires (ideally) concurrent operation
   of multiple, scenario-based computational 'sessions', requiring
   careful management of the different data sets and models involved in
   each, as well as of their differing life cycles.  Note that whether a
   given implementation is better described as operating concurrent
   sessions of the same OPDT, or as concurrently operating multiple OPDT
   instances of the same type, is largely a distinction without a
   meaningful difference for purposes of this discussion.

   OPDT (or, more broadly, PODT) computational sessions - in the sense
   just described - thus make use of network, service and other data
   that generally only partly map to current (or historical) data from
   the real twin.  However, that 'real' data must remain accessible to
   every session and uncorrupted by the partial substitution of real
   data with hypothetical data occurring differentially in each session,
   while being continuously updated and kept current with new data from
   the real network, management and control systems, etc.  There are
   different implementation methods available to achieve this, but
   careful design and implementation is critical.





Janz, et al.            Expires 15 September 2023               [Page 5]

Internet-Draft     Performance-Oriented Digital Twins         March 2023


   Different PODT computational sessions also generally involve
   different detailed functional model compositions.  First, scenarios
   may involve additions to, deletions from, or other modifications to
   the set of deployed equipment or other elements, changes in
   connection configurations, etc.  Where network functional models are
   generated through composition from atomic equipment or element
   instances and their related models, differential network model
   composition by scenario-based session is clearly required.  Second,
   details of functional models invoked may differ depending on whether
   the counter-part to the digital instance is real - i.e. deployed and
   potentially operating - or hypothetical.  For example, in the case of
   OPDTs, the important erbium-doped fibre amplifier (EDFA) network
   element is a critical focal point for functional modeling.  According
   to [EDFA1], such functional models may be - in increasing order of
   accuracy and precision - generic, type-specific but instance-generic,
   or both type-and instance-specific.  Where the scenario-based network
   is fully deployed and operational, the latter models may be available
   and will yield more accurate performance predictions than the
   alternatives; whereas if the scenario-based network is entirely
   hypothetical (e.g. green field planning) then only type-specific
   models may be available (and would be preferred to generic models).
   This issue will be re-visited in Section 4.

   Finally, scenario-based computational session life cycles must be
   managed.  Such sessions may be short-lived, e.g. seeking one-time
   "what-if" scenario-based predictions; or, they may be long-lived,
   e.g. seeking continuous generation of prediction data as real twin-
   sourced data evolves.  MP application consumers of PODT-generated
   information control these life cycles, but PODT implementations must
   be equipped to manage them.

4.  Data and Model Management Supporting Differences and Evolutions in
    MP Applications and Physical Networks

   In practical application, PODTs are likely to be required to operate
   in conjunction with real networks that vary in a number of ways,
   including network size, specific equipment types involved, available
   instrumentation or other data sources, types and quality, etc.
   Additionally, as discussed in Section 3, differences among MP
   applications - in terms of the nature and degree of hypotheticals
   involved in the scenarios they generate for performance prediction -
   may also drive variations in data types, availability, precision,
   etc.  PODTs should be able to accommodate these variations, while
   always providing the best possible scenario-based performance
   predictions.  This has implications for both session-based functional
   model management and data management.





Janz, et al.            Expires 15 September 2023               [Page 6]

Internet-Draft     Performance-Oriented Digital Twins         March 2023


   As a rule, the network functional models 'composed' for each MP
   application-driven PODT computational session, should make use of the
   most accurate atomic functional models available and usable under the
   circumstances.  For example, as discussed in Section 3, better
   functional models may be available when a given network element is
   deployed and operating - its type and potentially instance-specific
   information thus being known - than when it is a hypothetical
   element, in which case, only generic or type-specific models may be
   available.  Further, a real network may include equipment types for
   which type- and instance-specific models are not available; in such
   cases, generic models must be used, or a selection made from type-
   specific models thought to best represent the behaviours of the real
   equipment in question.

   Available data to 'feed' functional models may also vary according to
   circumstances of both real network and MP application-driven
   scenarios.  Functional models and available data should be used in
   flexible combination to deliver the best possible performance
   predictions under all circumstances.  This idea may be illustrated
   with an example based on optical transmission networks and OPDTs.
   Channel input powers to the various optical amplifiers in the network
   represent important input data to amplifier functional models.  If a
   given modeled amplifier is deployed and operating, then channel input
   powers - for channels currently operating - may possibly be measured
   on the real network directly at amplifier inputs.  If such
   measurements are not available, or the optical service channel in
   question is only a hypothetical, prospective channel, or if the
   scenario for modeling includes other hypothetical changes to the
   network or service configuration or state; then alternatives must be
   sought.  For example, if optical fibre link losses are accurately
   known at proximate channel wavelengths, they may be used in
   conjunction with measured or modeled channel powers at upstream
   points to estimate amplifier ingress powers.  Note that fibre link
   losses may have been obtained by direct or indirect measurement -
   meaning, in the latter case, inferred from available data (e.g. fibre
   link input and output powers on proximate operating channel
   wavelengths).  Finally, if fibre link losses are not accurately known
   - for example, in brown or green field planning scenarios, such data
   may not be available in respect of prospective incremental or new
   fibre plant - default loss coefficients and hypothesized fibre link
   lengths may be used to generate fibre link loss estimations.

   Again: functional models and available data should be used in
   flexible combination to deliver the best possible performance
   predictions under all circumstances.  Increased 'instrumentation' of
   the real network - scope and quality of relevant data generation -
   reduces the scope and degree of required modeling and leads to
   improved performance prediction results, but modeling may generally



Janz, et al.            Expires 15 September 2023               [Page 7]

Internet-Draft     Performance-Oriented Digital Twins         March 2023


   be used in combination with available data to close 'gaps' in such
   instrumentation.  In respect of any needed input data to elements of
   session-based composed network functional models, a 'preference
   hierarchy' of data, and sources of data, may be established a priori
   and used in operation.  For example, considering the case of the
   prior paragraph: if directly measured channel input power is
   available, use it; if not, and if measured proximate channel fibre
   link losses have been directly measured, use them; if not, use
   default loss coefficient for the posited fibre type and length.  Data
   'tables' may be dynamic and built up through increasing measurement
   experience, and even PODT-based functional modeling experience, with
   respect to a given real or (partly) hypothetical network.  Increasing
   effective data quality may thus yield improvements in performance
   prediction quality over time - over and above the impacts of any
   experience-driven improvement to functional models.

   It should also be noted that flexible conjoint use of data and models
   may yield ways to verify the accuracy of data and models, through
   'prediction of measurables' - i.e. using modeling to generate data
   which is in fact directly measurable or available on the real
   network.  For example, fibre link losses or channel power
   measurements may be used to 'predict' - i.e., to use functional
   models to calculate - the optical signal to noise ratio of an
   operating channel at each point in an amplified transmission span.
   If such ratios are in fact directly measured by available network
   instrumentation, then the measured and 'modeled' values, in respect
   of operating optical service channels, may be compared.  Differences
   may be due to either data/measurement inaccuracies, functional model
   accuracy limitations, or both.  Where functional models have been
   established to be accurate, discrepancies may indicate variations in
   data - assumed or measured vs. 'actual' - that may prove useful in
   e.g. 'soft fault' detection, classification or localization.

5.  Performance-Related Functional Design Aspects

   The support of multiple, concurrently operating MP applications -
   many of which (e.g. optimization-related applications) may require
   performance prediction on large numbers of different, detailed
   scenarios - requires high-performance PODT implementations.
   Functional models themselves may be more or less complicated and
   computationally-intensive, and data consumed may be considerable and
   run to large archive volumes over time.High-performance data
   collection methodologies are considered in
   [I-D.draft-zcz-nmrg-digitaltwin-data-collection].  Data comes from
   other MP applications e.g. network or service configuration, from
   managed entities - the real network - or from the real network
   through MP applications.




Janz, et al.            Expires 15 September 2023               [Page 8]

Internet-Draft     Performance-Oriented Digital Twins         March 2023


   Various techniques can be used to support high-performance PODT to
   support functional design and implementation.  The common idea is to
   split functional modeling into smaller tasks.  Such tasks can be of
   two types: having dependencies on other tasks, or independent of
   other tasks.  Independent tasks can be assigned to different compute
   resources (workers), and executed in parallel:

   *  In an embedded system, a multiple worker-thread strategy can be
      used to support high performance;

   *  On a server-based platform, a PODT may run under a microservices
      architecture, with workers located in difference processors or on
      different servers depending on the size and complexity of the
      emulated network.  These distributed computing systems can greatly
      improve performance and support complex functional modeling of
      large-scale networks and concurrent sessions supporting multiple
      MP applications.

   *  Processors on real network elements can serve as distributed
      worker resources as well, performing functional modeling tasks
      directly related to them, and easy computational requirements on
      server-based platforms.

   Dependent tasks need orchestration to schedule, synchronize and
   dispatch compute requests to tasks.  Stream processing framework -
   like Kafka - are effective frameworks to support this.

6.  Functional Design Implications of NDT Functional Extension Beyond
    Analysis

   As described in Section 1 (and in e.g.
   [I-D.draft-zhou-nmrg-digitaltwin-network-concepts]), some functional
   conceptions of NDTs extend beyond the analytical function of PODTs to
   encompass further management functions, such as closed loop
   components extending to decision and action.  Such NDTs may thus
   exercise active control over the real network, service
   configurations, etc.
   However, in such cases, the functional equivalent of a PODT lies at
   the heart of the NDT - i.e.  representing the data and analysis
   closed loop components.  Therefore, the PODT-specific functional
   design aspects considered in this document may be considered to apply
   to NDTs broadly, with the qualification that what amount to the MP
   applications driving and using the functional equivalent of PODTs -
   in the manner described in this document - are partly or wholly
   encompassed within the functional perimeter of the NDT.






Janz, et al.            Expires 15 September 2023               [Page 9]

Internet-Draft     Performance-Oriented Digital Twins         March 2023


7.  Conclusion

   PODTs represent an important class of NDTs in their own right, and
   their functional equivalent lies at the heart of NDTs with active
   network-driving capabilities.  The 'what-if' scenarios analyzed by
   PODTs are generated by various use case-oriented MP applications.
   Information and model management among concurrent computational
   sessions, driven by such MP applications, is thus an important
   functional design concern.  Also important is information and model
   management to accommodate variations and evolutions among real
   network co-operating with PODTs.  Functional design for high and
   scalable operational performance is a further important functional
   design criterion.

8.  Manageability Considerations

   TBD

9.  Security Considerations

   TBD

10.  IANA Considerations

   This document requires no IANA actions.

11.  Informative References

   [EDFA1]    You, Y., Jiang, Z., and C. Janz, "Machine Learning-Based
              EDFA Gain Model",
              Web https://doi.org/10.1109/ECOC.2018.8535397, September
              2018.

   [I-D.draft-paillisse-nmrg-performance-digital-twin]
              Paillisse, J., Almasan, P., Ferriol, M., Barlet, P.,
              Cabellos, A., Xiao, S., Shi, X., Cheng, X., Janz, C., Guo,
              A., Perino, D., Lopez, D., and A. Pastor, "Performance-
              Oriented Digital Twins for Packet and Optical Networks",
              Work in Progress, Internet-Draft, draft-paillisse-nmrg-
              performance-digital-twin-01, 24 October 2022,
              <https://datatracker.ietf.org/doc/html/draft-paillisse-
              nmrg-performance-digital-twin-01>.









Janz, et al.            Expires 15 September 2023              [Page 10]

Internet-Draft     Performance-Oriented Digital Twins         March 2023


   [I-D.draft-zcz-nmrg-digitaltwin-data-collection]
              Zhou, C., Chen, D., Martinez-Julia, P., and Q. Ma, "Data
              Collection Requirements and Technologies for Digital Twin
              Network", Work in Progress, Internet-Draft, draft-zcz-
              nmrg-digitaltwin-data-collection-02, 12 March 2023,
              <https://datatracker.ietf.org/doc/html/draft-zcz-nmrg-
              digitaltwin-data-collection-02>.

   [I-D.draft-zhou-nmrg-digitaltwin-network-concepts]
              Zhou, C., Yang, H., Duan, X., Lopez, D., Pastor, A., Wu,
              Q., Boucadair, M., and C. Jacquenet, "Digital Twin
              Network: Concepts and Reference Architecture", Work in
              Progress, Internet-Draft, draft-zhou-nmrg-digitaltwin-
              network-concepts-07, 5 March 2022,
              <https://datatracker.ietf.org/doc/html/draft-zhou-nmrg-
              digitaltwin-network-concepts-07>.

Acknowledgments

   TBD

Authors' Addresses

   Christopher Janz
   Huawei Canada
   Email: christopher.janz@huawei.com


   Yuren You
   Huawei Canada
   Email: yuren.you@huawei.com


   Aihua Guo
   Futurewei Technologies
   Email: aihuaguo.ietf@gmail.com















Janz, et al.            Expires 15 September 2023              [Page 11]