Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
PERFORMANCE EVALUATION FOR ARTIFICIAL INTELLIGENCE/MACHINE LEARNING INFERENCE
Document Type and Number:
WIPO Patent Application WO/2024/091970
Kind Code:
A1
Abstract:
This disclosure describes systems, methods, and devices related to performance evaluation. A device may receive a request from a service consumer for obtaining an inference output from an artificial intelligence (AI)/machine learning (ML) inference function. The device may determine the AI/ML inference function requested by the service consumer, the AI/ML inference function being selected from a group consisting of network data analytics function (NWDAF), management data analytics function (MDAF), radio access network (RAN) intelligence mobility robustness optimization (MRO) function, RAN intelligence mobility load balancing (MLB) function, and RAN intelligence energy saving (ES) function. The device may configure the selected AI/ML inference function based on the request from the service consumer. The device may execute the configured AI/ML inference function to generate the inference output. The device may send the inference output to the service consumer.

Inventors:
YAO YIZHI (US)
CHOU JOEY (US)
Application Number:
PCT/US2023/077679
Publication Date:
May 02, 2024
Filing Date:
October 24, 2023
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
INTEL CORP (US)
International Classes:
H04W24/02; G06N20/00; H04L41/0894; H04L41/14; H04L41/16
Domestic Patent References:
WO2022022334A12022-02-03
Foreign References:
US11373119B12022-06-28
US20210345138A12021-11-04
US20200195495A12020-06-18
US20220287104A12022-09-08
Attorney, Agent or Firm:
ZOGAIB, Nash M. et al. (US)
Download PDF:
Claims:
CLAIMS

What is claimed is:

1. An apparatus for a management service producer comprising: a processing circuitry configured to: receive a request from a service consumer for obtaining an inference output from an artificial intelligence (AI)/machine learning (ML) inference function; determine the AI/ML inference function requested by the service consumer, the AI/ML inference function being selected from a group consisting of network data analytics function (NWDAF), management data analytics function (MDAF), radio access network (RAN) intelligence mobility robustness optimization (MRO) function, RAN intelligence mobility load balancing (MLB) function, and RAN intelligence energy saving (ES) function; configure the selected AI/ML inference function based on the request from the service consumer; execute the configured AI/ML inference function to generate the inference output; and send the inference output to the service consumer; and a memory to store the inference output.

2. The apparatus of claim 1, wherein the processing circuitry' is further configured to report the inference output to the consumer through at least one of notification, file, or data streaming.

3. The apparatus of claim 1, wherein the configured AI/ML inference function is dynamically adjusted in real-time based on changing network conditions.

4. The apparatus of claim 1, wherein the processing circuitry is further configured to receive feedback related to the inference output.

5. The apparatus of claim 4, wherein the feedback is represented by a Managed Object Instance (MOI).

6. The apparatus of claim 1, wherein the processing circuitry is further configured to execute actions within a 5G system (5GS) based on the inference output and subsequently report the actions performed to the consumer.

7. The apparatus of claim 6, wherein the actions taken within the 5G system are reported to the consumer through notifications.

8. The apparatus of claim 1, wherein the processing circuitry is further configured to monitor network performance related to the configured AI/ML inference function and report performance data associated with the AI/ML inference function to the service consumer.

9. The apparatus of claim 8, wherein the processing circuitry is further configured to receive performance data requests from the consumer, respond to the requests, and provide results.

10. The apparatus of any one of claims 1-9, wherein the processing circuitry is further configured to: receive a request from a sendee consumer to manage the AI/ML inference function via a managed objected instance (MOI); configure the AI/ML inference function; and respond to the consumer to indicate a result of the AI/ML inference function management.

11. The apparatus of claim 10, wherein the MOI contains at least one of a policy for the inference function, a target for the inference function, conditions for triggering the inference, or activation and deactivation status.

12. A computer-readable medium storing computer-executable instructions which when executed by one or more processors result in performing operations comprising: receiving a request from a service consumer for obtaining an inference output from an artificial intelligence (AI)/machine learning (ML) inference function; determining the AI/ML inference function requested by the service consumer, the AI/ML inference function being selected from a group consisting of network data analytics function (NWDAF), management data analytics function (MDAF), radio access network (RAN) intelligence mobility robustness optimization (MRO) function, RAN intelligence mobility load balancing (MLB) function, and RAN intelligence energy7 saving (ES) function; configuring the selected AI/ML inference function based on the request from the service consumer; executing the configured AI/ML inference function to generate the inference output; and sending the inference output to the service consumer.

13. The computer-readable medium of claim 12, wherein the operations further comprise reporting the inference output to the consumer through at least one of notification, file, or data streaming.

14. The computer-readable medium of claim 12, wherein the configured AI/ML inference function is dynamically adjusted in real-time based on changing network conditions.

15. The computer-readable medium of claim 12, wherein the operations further comprise receiving feedback related to the inference output.

16. The computer-readable medium of claim 15, wherein the feedback is represented by a Managed Object Instance (MOI).

17. The computer-readable medium of claim 12, wherein the operations further comprise executing actions within a 5G system (5GS) based on the inference output and subsequently report the actions performed to the consumer.

18. The computer-readable medium of claim 17, wherein the actions taken within the 5G system are reported to the consumer through notifications.

19. The computer-readable medium of any one of claims 12-18, wherein the operations further comprise monitoring network performance related to the configured AI/ML inference function and report performance data associated with the AI/ML inference function to the service consumer.

20. A method comprising: receiving, by one or more processors, a request from a service consumer for obtaining an inference output from an artificial intelligence (Al)Zmachine learning (ML) inference function; determining the AI/ML inference function requested by the service consumer, the AI/ML inference function being selected from a group consisting of network data analytics function (NWDAF), management data analytics function (MDAF), radio access network (RAN) intelligence mobility7 robustness optimization (MRO) function, RAN intelligence mobility load balancing (MLB) function, and RAN intelligence energy saving (ES) function; configuring the selected AI/ML inference function based on the request from the service consumer; executing the configured AI/ML inference function to generate the inference output; and sending the inference output to the service consumer.

Description:
PERFORMANCE EVALUATION FOR ARTIFICIAL INTELLIGENCE/MACHINE LEARNING INFERENCE

CROSS-REFERENCE TO RELATED PATENT APPLICATION(S)

This application claims the benefit of U.S. Provisional Application No. 63/419,248, filed October 25, 2022, and U.S. Provisional Application No. 63/421,018, filed October 31, 2022, the disclosures of which are incorporated herein by reference as if set forth in full.

TECHNICAL FIELD

This disclosure generally relates to systems and methods for wireless communications and, more particularly, to performance evaluation for artificial intelligence/machine learning (AI/ML) inference.

BACKGROUND

Artificial intelligence and machine learning (AI/ML) play a pivotal role in various aspects of fifth generation (5G) networks and/or later releases, encompassing 5G Core (5GC), Next-Generation Radio Access Network (NG-RAN), and network management systems.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 depicts an illustrative schematic diagram for performance evaluation, in accordance with one or more example embodiments of the present disclosure.

FIG. 2 depicts an illustrative schematic diagram for performance evaluation, in accordance with one or more example embodiments of the present disclosure.

FIG. 3 illustrates a flow diagram of illustrative process for an illustrative performance evaluation system, in accordance with one or more example embodiments of the present disclosure.

FIG. 4 illustrates an example network architecture, in accordance with one or more example embodiments of the present disclosure.

FIG. 5 schematically illustrates a wireless network, in accordance with one or more example embodiments of the present disclosure.

FIG. 6 illustrates components of a computing device, in accordance with one or more example embodiments of the present disclosure.

FIG. 7 illustrates a network 700 in accordance with various embodiments. FIG. 8 illustrates a simplified block diagram of artificial (Al)-assisted communication between a user equipment (UE) and a radio access network (RAN), in accordance with various embodiments.

DETAILED DESCRIPTION

The following description and the drawings sufficiently illustrate specific embodiments to enable those skilled in the art to practice them. Other embodiments may incorporate structural, logical, electrical, process, algorithm, and other changes. Portions and features of some embodiments may be included in, or substituted for, those of other embodiments. Embodiments set forth in the claims encompass all available equivalents of those claims.

Artificial intelligence/machine learning (AI/ML) techniques are widely used in 5GS (including 5GC, NG-RAN and management system). Note that in this disclosure the term “AI/ML’" can be used interchangeably with “ML”. In some embodiments, the ML entity is either an ML model or an entity contains an ML model and its related metadata.

Example embodiments of the present disclosure relate to systems, methods, and devices for performance evaluation for artificial intelligence/machine learning (AI/ML) inference.

Among other things, embodiments of this disclosure are directed to performance evaluation for AI/ML inference. The names of the information object class (IOC), attribute, information element and measurement name are not significant, and they may be named differently in alternate embodiments.

In one or more embodiments, a performance evaluation system may provide solutions for configuration of AI/ML inference functions and ML entities. The name of the IOC, attribute or information element is not significant, and they may be named differently in alternate embodiments.

For the inference phase, the AI/ML inference function (e.g., network data analytics function (NWDAF), management data analytics function (MDAF), NG-RAN intelligence energy saving (ES) function) uses the ML entity for inference. The AI/ML inference function needs to be configured (e.g., with policies, target, conditions where applicable) in order to conduct inference in the 5G system aligning with operator's expectation. To enable the AI/ML inference function to perform inference using the preferred ML entity, the ML entities need to be able to be activated and deactivated.

In some embodiments, the ML entity is either an ML model or an entity contains an ML model and its related metadata. Among other things, embodiments of the present disclosure provide solutions for configuration of AI/ML inference functions and ML entities.

The above descriptions are for purposes of illustration and are not meant to be limiting. Numerous other examples, configurations, processes, algorithms, etc., may exist, some of which are described in greater detail below. Example embodiments will now be described with reference to the accompanying figures.

FIG. 1 depicts an illustrative schematic diagram for performance evaluation, in accordance with one or more example embodiments of the present disclosure.

Referring to FIG. 1, there is shown an AI/ML operational workflow.

Artificial intelligence/machine learning (AI/ML) techniques are widely used in 5GS (including 5GC, NG-RAN and management system), and the generic workflow of the operational steps in the lifecycle of an AI/ML entity, is depicted in FIG. 1.

In this example, the workflow involves 3 main phases; the training phase, deployment phase, and inference phase. In the AI/ML inference phase, the performance of the inference function and ML model needs to be evaluated against consumer's performance expectations, and to find the problem, figure out what the problem is, and fix it in time (for example to trigger the ML model re-training, testing, and deployment).

Note that in this disclosure the term “AI/ML” can be used interchangeably with “ML”. In some embodiments, the ML entity is either an ML model or an entity 7 contains an ML model and its related metadata.

In one or more embodiments, a performance evaluation system may facilitate AI/ML performance evaluation in inference phase. In the inference phase, the inference function (including MDAF, NWDAF and RAN intelligence functions) uses one or more ML entities for inference and generate the inference output. The consumer (e.g., a network function or management function) may take some actions according to the inference output provided by the inference function. If the actions are taken accordingly, the network performance is expected to be optimized. Each inference function has its specific focus and will impact the network performance from different perspectives.

The consumer may choose to not take any actions by various reasons, for instance lacking confidence on the inference output, or no actions are needed or recommended according to the inference output. Among other things, for evaluating the performance of the AI/ML inference function and ML model, the operator needs to be able to: get the inference output generated by each inference function;

- provide the feedback about the quality of the inference output;

- be informed if actions are taken according to the inference output; monitor the network performance related to each inference function.

Depending on the performance, the operator may request the ML training function to re-train the ML model(s).

In one or more embodiments, a performance evaluation system may facilitate requirements for the management service (MnS).

REQ-AI/ML PERF-INF-l: the MnS producer for AI/ML inference performance management should have a capability to allow the authorized consumer to get the inference output provided by an inference function (MDAF, NWDAF or RAN intelligence function).

REQ-AI/ML PERF-INF-2: the MnS producer for AI/ML inference performance management should have a capability to allow the authorized consumer to provide the feedback about the quality of an inference ouptut.

REQ-AI/ML PERF-INF-3: the MnS producer for AI/ML inference performance management should have a capability to allow the authorized consumer to be informed about the actions taken that were triggered by the inference output provided by an inference function (MDAF, NWDAF or RAN intelligence function).

REQ-AI/ML PERF-INF-4: the MnS producer for AI/ML inference performance management should have a capability to allow the authorized consumer to collect the performance data (see Note) related to an inference function (MDAF, NWDAF or RAN intelligence function).

In one or more embodiments, a performance evaluation system may facilitate solutions for AI/ML performance evaluation in inference phase. Some embodiments may address one or more of the following aspects:

1) For getting the inference output, the MDA MnS already supports MDA reporting by notifications, file and data streaming. The same approach can be applied to reporting other kinds of inference output (NWDAF analytics report, RAN intelligence output). A common data format may be defined for all kinds of inference outputs, and the format will be decided in normative phase.

2) For providing the feedback about the inference output, the IOC representing the feedback, for example named as InferenceFeedback, can be used to allow the MnS consumer to create an instance on the producer. This IOC contains the following attributes: inference report id; indication of whether there are actions to be taken triggered by the inference report;

- quality feedback of the inference report, e.g., lack of confidence or accuracy for a specific output information element. 3) For being informed about the actions taken trigged by the inference output, the NRM notification representing the already taken actions triggered by the inference is used. For example defining a new IOC named as ActionsTriggeredBylnferenceOutput, or enhancing the existing notifications for the NRMs. The notification contains the following information: inference report id that triggers the action; actions taken (this information is already supported when enhancing the existing notifications).

4) For monitoring the network performance related to each inference function, performance measurements related to each inference function need to be defined to allow the MnS consumer to collect:

For the performance measurements related to MDAF, the performance measurements listed in the analytics enabling data for each management data analytics (MDA) capability can be used for performance evaluation of MDAF;

For the performance measurements related to NWDAF;

- For the performance data related to RAN intelligence functions, including RAN intelligence ES function, RAN intelligence mobility robustness optimization (MRO) function, RAN intelligence MLB function, the MDT data and following performance measurements for MRO, Energy Efficiency and mobility' load balancing (MLB) respectively can be reused: for RAN intelligence ES function, the measurements related to distributed energy saving for NG-RAN can be reused; for RAN intelligence MRO function, the measurements related to D-MRO can be reused; for RAN intelligence MLB function, the measurements related to D-MLB can be reused.

In one or more embodiments, a performance evaluation system may operate as a sendee producer, configured to perform various functions. It may begin by receiving a consumer's request to obtain the inference output for a specific inference function, then proceed to generate the inference output accordingly. Subsequently, the system may send the resulting inference output to the consumer. Additionally, the inference function could encompass various possibilities, such as network data analytics function (NWDAF), management data analytics function (MDAF), radio access network (RAN) intelligence mobility' robustness optimization (MRO) function, RAN intelligence mobility load balancing (MLB) function, or RAN intelligence energy saving (ES) function. Furthermore, in the context of this method, the inference output may be conveyed to the consumer through different means, including notification, file transmission, or data streaming, as per the requirements of the particular embodiment.

In a related aspect, the performance evaluation system may also receive feedback concerning an inference output. This feedback, represented by an MOI (Managed Object Instance), could serve as valuable input for further actions.

Additionally, the system is configured to take specific actions within the 5G system based on the inference output and subsequently report these actions to the consumer. Notably, these actions could be conveyed via notifications, enhancing the overall communication process.

Moreover, the performance evaluation system plays a crucial role in monitoring network performance related to the inference function. Subsequently, it reports the performance data associated with this function to the consumer. In certain scenarios, the system may also accommodate requests from the consumer to collect performance data related to the inference function, promptly responding with the requested results.

Lastly, the performance data may encompass various types, including MDA data and performance measurements. These performance measurements could further branch into different categories, such as those pertaining to MRO, MLB, or Energy' Saving, depending on the specific requirements of the embodiment in question.

FIG. 2 depicts an illustrative schematic diagram for performance evaluation, in accordance with one or more example embodiments of the present disclosure.

Referring to FIG. 2, there is shown an example of AI/ML inference configuration related NRMs. It should be noted that the xyzFunction represents the inference function, and the exact name is not defined here.

The NG-RAN AI/ML inference configuration may be initiated by the MnS consumer. The MnS consumer monitors network performance and determine whether to trigger the AI/ML inference configuration. For example, for NG-RAN intelligence ES function, the MnS consumer collects the information of the capacity booster cells and coverage cells inside the RAN domain area, then makes the decision for configuring the policy and target, activating/deactivating the ES function, and activating/deactivating the ML entity for the ES function. In this case, the consumer can initiate the AI/ML inference function configuration so as to get better network performance.

The NG-RAN AI/ML inference configuration may be initiated by the MnS producer. The MnS producer can determine whether to trigger AI/ML inference configuration based on network performance and service requirements. For example, NG-RAN intelligence ES function, after receiving an RAN ES target, the AI/ML MnS producer may decide to allow or disallow the ES with a particular neighbour cell, or decide to use another ML entity for inference by activating and deactivating the corresponding ML entities. In this case, the MnS producer may initiate the configuration and inform an authorized consumer about the configurations.

In one or more embodiments, a performance evaluation system may facilitate partial activation of AI/ML inference capabilities for an ML entity.

It is described that an AI/ML entity may provide the AI/ML inference capabilities for a scope (e.g., a list of new radio (NR) cells) of the system. For a given AI/ML inference function, first, it is very difficult to accurately "predict" the benefits and to quantify such benefits of using an AI/ML entity in a given context of operational system, before using it. Secondly, testing the AI/ML entity using test data does not give a “full picture” on how the AI/ML model will impact the system when it is activated in operational environment. For example, the testing may provide the insights on the accuracy of the ML entity that can be expected once the ML entity is deployed and activated.

Furthermore, it is also necessary to ensure that AI/ML inference capabilities of an ML entity that are being activated in operational system will bring the benefits and will not further downgrade the existing network performance. Moreover, it is important to provide means to check which particular AI/ML inference capabilities of an ML entity are beneficial to be activated in a given context of operational network. Correspondingly, the MnS producer for AI/ML inference management may provide different steps through which the capabilities of the ML entities may be activated. These abstraction of the scope of the ML entities may be called Abstract activation steps. For example, for such Abstract activation steps, the producer may support that only a sub-scope is activated e.g. to only a part of the geography of its cells and not the whole city or only a certain period (say between 18:00 and 6:00) and not the entire time.

So, it is possible that the AI/ML inference function is configured to start using a newly deployed ML entity for one part (e.g., one NR cell of the gNB) of the function but the existing ML entity for the rest parts, and then gradually switch to use the new AI/ML entities for the larger or full scope, by activating/deactivating the AI/ML inference capabilities in the corresponding scope for the AI/ML entities.

Together, these imply that it is important to ensure that the MnS consumer has a finer control on activation and de-activation of AI/ML inference capabilities for an ML entity. Potential requirements:

REQ-AIML INF CFG-l : the MnS producer for AI/ML inference management should have a capability to allow the authorized consumer to configure the inference function.

REQ-AIML_INF_CFG-2: the AI/ML MnS producer responsible for AI/ML inference management should have a capability to configure inference function and inform an authorized consumer about the configurations of the AI/ML inference function.

REQ-AIML INF ACT-l : the MnS producer for AI/ML inference management should have a capability to allow the authorized consumer to activate an NG-RAN AI/ML inference function for one or more NR cells.

REQ-AIML_INF_ACT-2: the MnS producer for AI/ML inference management should have a capability to allow the authorized consumer to deactivate an NG-RAN AI/ML AI/ML inference function for one or more NR cells.

REQ-AIML_INF_ACT-3: the MnS producer for AI/ML inference management should have a capability to inform an authorized MnS consumer about the activation and deactivation of an NG-RAN AI/ML AI/ML inference function for one or more NR cells.

REQ-AI/ML_ENTITY_ACT-1 : the MnS producer for AI/ML inference management should have a capability to allow an authorized MnS consumer to activate the AI/ML inference capabilities for the full scope or a sub-scope (see Note 1) of an ML entity

REQ-AI/ML_ENTITY_ACT-2: the MnS producer for AI/ML inference management should have a capability to allow an authorized MnS consumer to deactivate the AI/ML inference capabilities for the full scope or a sub-scope (see Note 1) of an ML entity 7 .

REQ-AI/ML_ENTITY_ACT-3: the MnS producer for AI/ML inference management should have a capability to inform an authorized MnS consumer about the activation and deactivation of the AI/ML inference capabilities for the full scope or a sub-scope (see Note 1) of an ML entity.

Note 1 : the scope could be the NR cell(s) or applicable expected runtime context for the ML entity.

In one or more embodiments, a solution uses the instances of following IOCS for interaction between MnS producer and consumer to support the AI/ML inference configuration:

1 ) The IOC representing the inference function. Each inference function is represented by a specific IOC. For the example the NG-RAN intelligence ES function is presented by an IOC, which may be named as DESIntelligenceFunction. This IOC inherits from ManagedF unction IOC and contains the respective attributes that are applicable to the inference function it represents:

- For MDAF, the IOCS are defined in TS 28.104, v. 17.1.1 , 2022-09-27.

For NWDAF, the configuration is studied in TR 28.864;

For RAN intelligence functions, including RAN intelligence ES function, RAN intelligence MRO function, RAN intelligence MLB function, IOC may contain the following attributes that are applicable to the specific inference function: policy for the inference function; target for the inference function; conditions (e.g., threshold) for triggering the inference;

- activation and deactivation scope (e.g., NR cell).

2) The IOC representing the ML entity deployed to the inference function, for example the existing IOC MLEntity may be reused.

This IOC is contained by the IOC which represents the inference function as described in 1), and contains the following attributes:

- activation and deactivation scope (e.g., NR cell, subset of expected runtime context).

The examples of IOCs and their relations between the IOCs are depicted in the figure below.

In one or more embodiments, a service producer, without specifying the number of processors involved, is configured to carry out the following tasks:

Firstly, it may receive a request from a service consumer, aimed at configuring an AI/ML inference function, achieved by managing a managed object instance (MOI). Subsequently, the system configures the AI/ML inference function in accordance with the consumer's request and finally responds to the consumer, conveying the outcome of the AI/ML inference function configuration.

Furthermore, within this method, the AI/ML inference function can encompass a variety of possibilities, including NWDAF, MDAF, or a RAN intelligence function, which in turn could be further specified as RAN intelligence Energy Saving function, RAN intelligence Mobility Robustness Optimization (MRO) function, or RAN intelligence Mobility Load Balancing (MLB) function.

Additionally, the MOI mentioned in the previous steps is essentially the instance of the IOC (Information Object Class) representing the AI/ML inference function. Moreover, the MOI may encompass various attributes, potentially including the policy for the inference function, the target for the inference function, conditions (e.g., threshold) for triggering the inference, and the scope of activation and deactivation (e.g., NR cell).

In a related aspect, the service producer, following a request from the consumer, can also configure an ML entity using a similar MOI management process. It then responds to the consumer, indicating the result of the ML entity configuration.

This MOI, in this case, represents the ML entity, e.g., MLEntity, within the context of the method. Similar to the AI/ML inference function MOL it may contain attributes such as activation and deactivation scope (e.g., NR cell, subset of expected runtime context).

Furthermore, it's worth noting that the MOI representing the ML entity can be contained by the MOI representing the AI/ML inference function, establishing a hierarchical relationship.

Additionally, the management of the MOI can encompass tasks such as the creation of the MOI, modification of the MOI, or deletion of the MOI, depending on the specific needs of the operation.

Lastly, responding to the consumer can involve either returning the output parameters for the MOI management operation to the consumer or sending a notification to the consumer, depending on the nature of the MOI management operation itself.

In some embodiments, the electronic device(s), network(s), system(s), chip(s) or component(s), or portions or implementations thereof, of FIGs. 4-6, or some other figure herein, may be configured to perform one or more processes, techniques, or methods as described herein, or portions thereof. One such process is depicted in FIG. 3.

For example, the process may include, at 302, receiving a request from a service consumer for obtaining an inference output from an artificial intelligence (AI)/machine learning (ML) inference function.

The process further includes, at 304, determining the AI/ML inference function requested by the service consumer, the AI/ML inference function being selected from a group consisting of network data analytics function (NWDAF), management data analytics function (MDAF), radio access network (RAN) intelligence mobility robustness optimization (MRO) function, RAN intelligence mobility load balancing (MLB) function, and RAN intelligence energy saving (ES) function.

The process further includes, at 306, configuring the selected AI/ML inference function based on the request from the service consumer.

The process further includes, at 308, executing the configured AI/ML inference function to generate the inference output. The process further includes, at 310, sending the inference output to the service consumer.

The device may involve processing circuitry that may report the inference output to the consumer through at least one of notification, file, or data streaming. The device may also consider the possibility of dynamically adjusting the configured AI/ML inference function in real-time, depending on changing network conditions. Furthermore, the device may have processing circuitry that might be additionally configured to receive feedback related to the inference output, with the feedback potentially being represented by a Managed Object Instance (MOI). Additionally, the device may encompass processing circuitry that could execute actions within a 5G system (5GS) based on the inference output and subsequently report the actions performed to the consumer, where these actions taken within the 5G system may be reported to the consumer through notifications. Moreover, the device may involve processing circuitry that might be further configured to monitor network performance related to the configured AI/ML inference function and report performance data associated with the AI/ML inference function to the service consumer. This processing circuitry could potentially also receive performance data requests from the consumer, respond to the requests, and provide results. Lastly, the device may comprise processing circuitry that could receive a request from a service consumer to manage the AI/ML inference function via a managed object instance (MOI), configure the AI/ML inference function, and respond to the consumer to indicate a result of the AI/ML inference function management. The MOI may contain at least one of a policy for the inference function, a target for the inference function, conditions for triggering the inference, or activation and deactivation status.

For one or more embodiments, at least one of the components set forth in one or more of the preceding figures may be configured to perform one or more operations, techniques, processes, and/or methods as set forth in the example section below. For example, the baseband circuitry as described above in connection with one or more of the preceding figures may be configured to operate in accordance with one or more of the examples set forth below. For another example, circuitry associated with a UE, base station, network element, etc. as described above in connection with one or more of the preceding figures may be configured to operate in accordance with one or more of the examples set forth below in the example section.

It is understood that the above descriptions are for purposes of illustration and are not meant to be limiting. FIGs. Error! Reference source not found.-Error! Reference source not found, illustrate various systems, devices, and components that may implement aspects of disclosed embodiments.

FIG. 4 illustrates an example network architecture 400 according to various embodiments. The network 400 may operate in a manner consistent with 3GPP technical specifications for LTE or 5G/NR systems. However, the example embodiments are not limited in this regard and the described embodiments may apply to other networks that benefit from the principles described herein, such as future 3GPP systems, or the like.

The network 400 includes a UE 402, which is any mobile or non-mobile computing device designed to communicate with a RAN 404 via an over-the-air connection. The UE 402 is communicatively coupled with the RAN 404 by a Uu interface, which may be applicable to both LTE and NR systems. Examples of the UE 402 include, but are not limited to, a smartphone, tablet computer, wearable computer, desktop computer, laptop computer, in- vehicle infotainment system, in-car entertainment system, instrument cluster, head-up display (HUD) device, onboard diagnostic device, dashtop mobile equipment, mobile data terminal, electronic engine management system, electron! c/engine control unit, electron! c/engine control module, embedded system, sensor, microcontroller, control module, engine management system, networked appliance, machine-type communication device, machine-to-machine (M2M), device-to-device (D2D), machine-type communication (MTC) device, Internet of Things (loT) device, and/or the like. The network 400 may include a plurality of UEs 402 coupled directly with one another via aD2D, ProSe, PC5, and/or sidelink (SL) interface. These UEs 402 may be M2M/D2D/MTC/IoT devices and/or vehicular systems that communicate using physical sidelink channels such as, but not limited to, PSBCH, PSDCH, PSSCH, PSCCH, PSFCH, etc. The UE 402 may perform blind decoding attempts of SL channels/links according to the various embodiments herein.

In some embodiments, the UE 402 may additionally communicate with an AP 406 via an over-the-air (OTA) connection. The AP 406 manages a WLAN connection, which may serve to offload some/all network traffic from the RAN 404. The connection between the UE 402 and the AP 406 may be consistent with any IEEE 802. 11 protocol. Additionally, the UE 402, RAN 404, and AP 406 may utilize cellular- WLAN aggregation/integration (e.g., LWA/LWIP). Cellular- WLAN aggregation may involve the UE 402 being configured by the RAN 404 to utilize both cellular radio resources and WLAN resources.

The RAN 404 includes one or more access network nodes (ANs) 408. The ANs 408 terminate air-interface(s) for the UE 402 by providing access stratum protocols including RRC, PDCP, RLC, MAC, and PHY/L1 protocols. In this manner, the AN 408 enables data/voice connectivity between CN 420 and the UE 402. The ANs 408 may be a macrocell base station or a low power base station for providing femtocells, picocells or other like cells having smaller coverage areas, smaller user capacity, or higher bandwidth compared to macrocells; or some combination thereof. In these implementations, an AN 408 be referred to as a BS, gNB, RAN node. eNB. ng-eNB, NodeB. RSU, TRxP, etc.

One example implementation is a "CU/DU split” architecture where the ANs 408 are embodied as a gNB-Central Unit (CU) that is communicatively coupled with one or more gNB- Distributed Units (DUs), where each DU may be communicatively coupled with one or more Radio Units (RUs) (also referred to as RRHs, RRUs, or the like) (see e g., 3GPP TS 38.401 V16.1.0 (2020-03)). In some implementations, the one or more RUs may be individual RSUs. In some implementations, the CU/DU split may include an ng-eNB-CU and one or more ng- eNB-DUs instead of, or in addition to, the gNB-CU and gNB-DUs, respectively. The ANs 408 employed as the CU may be implemented in a discrete device or as one or more software entities running on server computers as part of, for example, a virtual network including a virtual Base Band Unit (BBU) or BBU pool, cloud RAN (CRAN), Radio Equipment Controller (REC), Radio Cloud Center (RCC), centralized RAN (C-RAN), virtualized RAN (vRAN), and/or the like (although these terms may refer to different implementation concepts). Any other type of architectures, arrangements, and/or configurations can be used.

The plurality of ANs may be coupled with one another via an X2 interface (if the RAN 404 is an LTE RAN or Evolved Universal Terrestrial Radio Access Network (E-UTRAN) 410) or an Xn interface (if the RAN 404 is a NG-RAN 414). The X2/Xn interfaces, which may be separated into control/user plane interfaces in some embodiments, may allow the ANs to communicate information related to handovers, data/context transfers, mobility, load management, interference coordination, etc.

The ANs of the RAN 404 may each manage one or more cells, cell groups, component carriers, etc. to provide the UE 402 with an air interface for network access. The UE 402 may be simultaneously connected with a plurality of cells provided by the same or different ANs 408 of the RAN 404. For example, the UE 402 and RAN 404 may use carrier aggregation to allow the UE 402 to connect with a plurality of component carriers, each corresponding to a Pcell or Scell. In dual connectivity scenarios, a first AN 408 may be a master node that provides an MCG and a second AN 408 may be secondary node that provides an SCG. The first/second ANs 408 may be any combination of eNB, gNB, ng-eNB, etc. The RAN 404 may provide the air interface over a licensed spectrum or an unlicensed spectrum. To operate in the unlicensed spectrum, the nodes may use LAA, eLAA. and/or feLAA mechanisms based on CA technology with PCells/Scells. Prior to accessing the unlicensed spectrum, the nodes may perform medium/carrier-sensing operations based on, for example, a listen-before-talk (LBT) protocol.

In V2X scenarios the UE 402 or AN 408 may be or act as a roadside unit (RSU), which may refer to any transportation infrastructure entity used for V2X communications. An RSU may be implemented in or by a suitable AN or a stationary (or relatively stationary) UE. An RSU implemented in or by: a UE may be referred to as a ”UE-type RSU”; an eNB may be referred to as an “eNB-type RSU”; a gNB may be referred to as a “gNB-type RSU”; and the like. In one example, an RSU is a computing device coupled with radio frequency circuitry located on a roadside that provides connectivity support to passing vehicle UEs. The RSU may also include internal data storage circuitry to store intersection map geometry, traffic statistics, media, as well as applications/software to sense and control ongoing vehicular and pedestrian traffic. The RSU may provide very low latency communications required for high speed events, such as crash avoidance, traffic warnings, and the like. Additionally or alternatively, the RSU may provide other cellular/WLAN communications services. The components of the RSU may be packaged in a weatherproof enclosure suitable for outdoor installation, and may include a network interface controller to provide a wired connection (e.g.. Ethernet) to a traffic signal controller or a backhaul network.

In some embodiments, the RAN 404 may be an E-UTRAN 410 with one or more eNBs 412. The an E-UTRAN 410 provides an LTE air interface (Uu) with the following characteristics: SCS of 15 kHz; CP-OFDM waveform for DL and SC-FDMA waveform for UL; turbo codes for data and TBCC for control; etc. The LTE air interface may rely on CSI- RS for CSI acquisition and beam management; PDSCH/PDCCH DMRS for PDSCH/PDCCH demodulation; and CRS for cell search and initial acquisition, channel quality measurements, and channel estimation for coherent demodulation/detection at the UE. The LTE air interface may operating on sub-6 GHz bands.

In some embodiments, the RAN 404 may be an next generation (NG)-RAN 414 with one or more gNB 416 and/or on or more ng-eNB 418. The gNB 416 connects with 5G-enabled UEs 402 using a 5G NR interface. The gNB 416 connects with a 5GC 440 through an NG interface, which includes an N2 interface or an N3 interface. The ng-eNB 418 also connects with the 5GC 440 through an NG interface, but may connect with a UE 402 via the Uu interface. The gNB 416 and the ng-eNB 418 may connect with each other over an Xn interface. In some embodiments, the NG interface may be split into two parts, an NG user plane (NG-U) interface, which carries traffic data between the nodes of the NG-RAN 414 and a UPF 448 (e.g., N3 interface), and an NG control plane (NG-C) interface, which is a signaling interface between the nodes of the NG-RAN 414 and an AMF 444 (e.g., N2 interface).

The NG-RAN 414 may provide a 5G-NR air interface (which may also be referred to as a Uu interface) with the following characteristics: variable SCS; CP-OFDM for DL, CP- OFDM and DFT-s-OFDM for UL; polar, repetition, simplex, and Reed-Muller codes for control and LDPC for data. The 5G-NR air interface may rely on CSI-RS, PDSCH/PDCCH DMRS similar to the LTE air interface. The 5G-NR air interface may not use a CRS, but may use PBCH DMRS for PBCH demodulation; PTRS for phase tracking for PDSCH: and tracking reference signal for time tracking. The 5G-NR air interface may operating on FR1 bands that include sub-6 GHz bands or FR2 bands that include bands from 24.25 GHz to 52.6 GHz. The 5G-NR air interface may include an SSB that is an area of a downlink resource grid that includes PSS/SSS/PBCH.

The 5G-NR air interface may utilize BWPs for various purposes. For example, BWP can be used for dynamic adaptation of the SCS. For example, the UE 402 can be configured with multiple BWPs where each BWP configuration has a different SCS. When a BWP change is indicated to the UE 402, the SCS of the transmission is changed as well. Another use case example of BWP is related to power saving. In particular, multiple BWPs can be configured for the UE 402 with different amount of frequency resources (e.g., PRBs) to support data transmission under different traffic loading scenarios. A BWP containing a smaller number of PRBs can be used for data transmission with small traffic load while allowing power saving at the UE 402 and in some cases at the gNB 41 . A BWP containing a larger number of PRBs can be used for scenarios with higher traffic load.

The RAN 404 is communicatively coupled to CN 420 that includes network elements and/or network functions (NFs) to provide various functions to support data and telecommunications services to customers/subscribers (e.g., UE 402). The components of the CN 420 may be implemented in one physical node or separate physical nodes. In some embodiments, NFV may be utilized to virtualize any or all of the functions provided by the network elements of the CN 420 onto physical compute/storage resources in servers, switches, etc. A logical instantiation of the CN 420 may be referred to as a netw ork slice, and a logical instantiation of a portion of the CN 420 may be referred to as a network sub-slice.

The CN 420 may be an LTE CN 422 (also referred to as an Evolved Packet Core (EPC) 422). The EPC 422 may include MME 424, SGW 426, SGSN 428, HSS 430, PGW 432, and PCRF 434 coupled with one another over interfaces (or ‘‘reference points'’) as shown. The NFs in the EPC 422 are briefly introduced as follows.

The MME 424 implements mobility management functions to track a current location of the UE 402 to facilitate paging, bearer activation/deactivation, handovers, gateway selection, authentication, etc.

The SGW 426 terminates an SI interface toward the RAN 410 and routes data packets between the RAN 410 and the EPC 422. The SGW 426 may be a local mobility anchor point for inter-RAN node handovers and also may provide an anchor for inter-3 GPP mobility. Other responsibilities may include lawful intercept, charging, and some policy enforcement.

The SGSN 428 tracks a location of the UE 402 and performs security functions and access control. The SGSN 428 also performs inter-EPC node signaling for mobility between different RAT networks; PDN and S-GW selection as specified by MME 424; MME 424 selection for handovers; etc. The S3 reference point between the MME 424 and the SGSN 428 enable user and bearer information exchange for inter-3GPP access network mobility in idle/active states.

The HSS 430 includes a database for network users, including subscription-related information to support the network entities’ handling of communication sessions. The HSS 430 can provide support for routing/roaming, authentication, authorization, naming/addressing resolution, location dependencies, etc. An S6a reference point between the HSS 430 and the MME 424 may enable transfer of subscription and authentication data for authenticating/authorizing user access to the EPC 420.

The PGW 432 may terminate an SGi interface toward a data network (DN) 436 that may include an application (app)Zcontent server 438. The PGW 432 routes data packets between the EPC 422 and the data network 436. The PGW 432 is communicatively coupled with the SGW 426 by an S5 reference point to facilitate user plane tunneling and tunnel management. The PGW 432 may further include a node for policy enforcement and charging data collection (e.g., PCEF). Additionally, the SGi reference point may communicatively couple the PGW 432 with the same or different data network 436. The PGW 432 may be communicatively coupled with a PCRF 434 via a Gx reference point.

The PCRF 434 is the policy and charging control element of the EPC 422. The PCRF 434 is communicatively coupled to the app/content server 438 to determine appropriate QoS and charging parameters for service flows. The PCRF 432 also provisions associated rules into a PCEF (via Gx reference point) with appropriate TFT and QCI. The CN 420 may be a 5GC 440 including an AUSF 442, AMF 444, SMF 446, UPF 448, NSSF 450, NEF 452, NRF 454, PCF 456, UDM 458. and AF 460 coupled with one another over various interfaces as shown. The NFs in the 5GC 440 are briefly introduced as follows.

The AUSF 442 stores data for authentication of UE 402 and handle authentication- related functionality. The AUSF 442 may facilitate a common authentication framework for various access types..

The AMF 444 allows other functions of the 5GC 440 to communicate with the UE 402 and the RAN 404 and to subscribe to notifications about mobility events with respect to the UE 402. The AMF 444 is also responsible for registration management (e.g., for registering UE 402). connection management, reachability management, mobility management, lawful interception of AMF-related events, and access authentication and authorization. The AMF 444 provides transport for SM messages between the UE 402 and the SMF 446, and acts as a transparent proxy for routing SM messages. AMF 444 also provides transport for SMS messages between UE 402 and an SMSF. AMF 444 interacts with the AUSF 442 and the UE 402 to perform various security anchor and context management functions. Furthermore, AMF 444 is a termination point of a RAN-CP interface, which includes the N2 reference point between the RAN 404 and the AMF 444. The AMF 444 is also a termination point of NAS (Nl) signaling, and performs NAS ciphering and integrity protection.

AMF 444 also supports NAS signaling with the UE 402 over an N3IWF interface. The N3IWF provides access to untrusted entities. N3IWF may be a termination point for the N2 interface between the (R)AN 404 and the AMF 444 for the control plane, and may be a termination point for the N3 reference point between the (R)AN 414 and the 448 for the user plane. As such, the AMF 444 handles N2 signalling from the SMF 446 and the AMF 444 for PDU sessions and QoS, encapsulate/de-encapsulate packets for IPSec and N3 tunnelling, marks N3 user-plane packets in the uplink, and enforces QoS corresponding to N3 packet marking taking into account QoS requirements associated with such marking received overN2. N3IWF may also relay UL and DL control-plane NAS signalling between the UE 402 and AMF 444 via an N l reference point between the UE 402and the AMF 444, and relay uplink and downlink user-plane packets between the UE 402 and UPF 448. The N3IWF also provides mechanisms for IPsec tunnel establishment with the UE 402. The AMF 444 may exhibit an Narnf sendeebased interface, and may be a termination point for an N14 reference point betw een two AMFs 444 and an N17 reference point between the AMF 444 and a 5G-EIR (not shown by FIG. 4). The SMF 446 is responsible for SM (e.g., session establishment, tunnel management between UPF 448 and AN 408); UE IP address allocation and management (including optional authorization); selection and control of UP function; configuring traffic steering at UPF 448 to route traffic to proper destination; termination of interfaces toward policy control functions; controlling part of policy enforcement, charging, and QoS; lawful intercept (for SM events and interface to LI system); termination of SM parts of NAS messages; downlink data notification; initiating AN specific SM information, sent via AMF 444 over N2 to AN 408: and determining SSC mode of a session. SM refers to management of a PDU session, and a PDU session or “session” refers to a PDU connectivity service that provides or enables the exchange of PDUs between the UE 402 and the DN 436.

The UPF 448 acts as an anchor point for intra-RAT and inter-RAT mobility, an external PDU session point of interconnect to data network 436, and a branching point to support multihomed PDU session. The UPF 448 also performs packet routing and forwarding, packet inspection, enforces user plane part of policy rules, lawfully intercept packets (UP collection), performs traffic usage reporting, perform QoS handling for a user plane (e.g., packet filtering, gating, UL/DL rate enforcement), performs uplink traffic verification (e.g., SDF-to-QoS flow mapping), transport level packet marking in the uplink and downlink, and performs downlink packet buffering and downlink data notification triggering. UPF 448 may include an uplink classifier to support routing traffic flows to a data network.

The NSSF 450 selects a set of network slice instances serving the UE 402. The NSSF 450 also determines allowed NSSAI and the mapping to the subscribed S-NSSAIs, if needed. The NSSF 450 also determines an AMF set to be used to serve the UE 402, or a list of candidate AMFs 444 based on a suitable configuration and possibly by querying the NRF 454. The selection of a set of network slice instances for the UE 402 may be triggered by the AMF 444 with which the UE 402 is registered by interacting with the NSSF 450; this may lead to a change of AMF 444. The NSSF 450 interacts with the AMF 444 via an N22 reference point; and may communicate with another NSSF in a visited network via an N31 reference point (not shown).

The NEF 452 securely exposes services and capabilities provided by 3GPP NFs for third party, internal exposure/re-exposure, AFs 460, edge computing or fog computing systems (e.g., edge compute node, etc. In such embodiments, the NEF 452 may authenticate, authorize, or throttle the AFs. NEF 452 may also translate information exchanged with the AF 460 and information exchanged with internal network functions. For example, the NEF 452 may translate between an AF-Service-Identifier and an internal 5GC information. NEF 452 may also receive information from other NFs based on exposed capabilities of other NFs. This information may be stored at the NEF 452 as structured data, or at a data storage NF using standardized interfaces. The stored information can then be re-exposed by the NEF 452 to other NFs and AFs, or used for other purposes such as analytics.

The NRF 454 supports service discovery' functions, receives NF discovery' requests from NF instances, and provides information of the discovered NF instances to the requesting NF instances. NRF 454 also maintains information of available NF instances and their supported services. The NRF 454 also supports service discovery^ functions, wherein the NRF 454 receives NF Discovery Request from NF instance or an SCP (not shown), and provides information of the discovered NF instances to the NF instance or SCP.

The PCF 456 provides policy rules to control plane functions to enforce them, and may also support unified policy framework to govern network behavior. The PCF 456 may also implement a front end to access subscription information relevant for policy decisions in a UDR of the UDM 458. In addition to communicating with functions over reference points as shown, the PCF 456 exhibit an Npcf service-based interface.

The UDM 458 handles subscription-related information to support the network entities’ handling of communication sessions, and stores subscription data of UE 402. For example, subscription data may be communicated via an N8 reference point between the UDM 458 and the AMF 444. The UDM 458 may include two parts, an application front end and a UDR. The UDR may store subscription data and policy data for the UDM 458 and the PCF 456. and/or structured data for exposure and application data (including PFDs for application detection, application request information for multiple UEs 402) for the NEF 452. The Nudr servicebased interface may be exhibited by the UDR 221 to allow the UDM 458, PCF 456, and NEF 452 to access a particular set of the stored data, as well as to read, update (e.g., add. modify), delete, and subscribe to notification of relevant data changes in the UDR. The UDM may include a UDM-FE, which is in charge of processing credentials, location management, subscription management and so on. Several different front ends may serve the same user in different transactions. The UDM-FE accesses subscription information stored in the UDR and performs authentication credential processing, user identification handling, access authorization, registration/mobility management, and subscription management. In addition to communicating with other NFs over reference points as shown, the UDM 458 may exhibit the Nudm service-based interface.

AF 460 provides application influence on traffic routing, provide access to NEF 452, and interact with the policy framework for policy control. The AF 460 may influence UPF 448 (re)selection and traffic routing. Based on operator deployment, when AF 460 is considered to be a trusted entity, the network operator may permit AF 460 to interact directly with relevant NFs. Additionally, the AF 460 may be used for edge computing implementations,

The 5GC 440 may enable edge computing by selecting operator/3rd party services to be geographically close to a point that the UE 402 is attached to the network. This may reduce latency and load on the network. In edge computing implementations, the 5GC 440 may select a UPF 448 close to the UE 402 and execute traffic steering from the UPF 448 to DN 436 via the N6 interface. This may be based on the UE subscription data, UE location, and information provided by the AF 460, which allows the AF 460 to influence UPF (re)selection and traffic routing.

The data network (DN) 436 may represent various network operator services, Internet access, or third party sendees that may be provided by one or more servers including, for example, application (app)/content server 438. The DN 436 may be an operator external public, a private PDN, or an intra-operator packet data network, for example, for provision of IMS services. In this embodiment, the app server 438 can be coupled to an IMS via an S-CSCF or the I-CSCF. In some implementations, the DN 436 may represent one or more local area DNs (LADNs), which are DNs 436 (or DN names (DNNs)) that is/are accessible by a UE 402 in one or more specific areas. Outside of these specific areas, the UE 402 is not able to access the LADN/DN 436.

Additionally or alternatively, the DN 436 may be an Edge DN 436, which is a (local) Data Network that supports the architecture for enabling edge applications. In these embodiments, the app server 438 may represent the physical hardware systems/devices providing app server functionality and/or the application software resident in the cloud or at an edge compute node that performs server function(s). In some embodiments, the app/content server 438 provides an edge hosting environment that provides support required for Edge Application Server's execution.

In some embodiments, the 5GS can use one or more edge compute nodes to provide an interface and offload processing of wireless communication traffic. In these embodiments, the edge compute nodes may be included in, or co-located with one or more RAN410, 414. For example, the edge compute nodes can provide a connection between the RAN 414 and UPF 448 in the 5GC 440. The edge compute nodes can use one or more NFV instances instantiated on virtualization infrastructure within the edge compute nodes to process wireless connections to and from the RAN 414 and UPF 448. The interfaces of the 5GC 440 include reference points and service-based itnterfaces. The reference points include: N1 (between the UE 402 and the AMF 444), N2 (between RAN 414 and AMF 444), N3 (between RAN 414 and UPF 448), N4 (between the SMF 446 and UPF 448), N5 (between PCF 456 and AF 460), N6 (between UPF 448 and DN 436), N7 (between SMF 446 and PCF 456), N8 (between UDM 458 and AMF 444), N9 (between two UPFs 448), N10 (between the UDM 458 and the SMF 446), Ni l (between the AMF 444 and the SMF 446). N12 (between AUSF 442 and AMF 444), N13 (between AUSF 442 and UDM 458), N14 (between two AMFs 444; not shown), N15 (between PCF 456 and AMF 444 in case of a nonroaming scenario, or between the PCF 456 in a visited network and AMF 444 in case of a roaming scenario), N16 (between two SMFs 446; not shown), and N22 (between AMF 444 and NSSF 450). Other reference point representations not shown in FIG. 4 can also be used. The service-based representation of FIG. 4 represents NFs within the control plane that enable other authorized NFs to access their services. The service-based interfaces (SBIs) include: Namf (SBI exhibited by AMF 444), Nsrnf (SBI exhibited by SMF 446), Nnef (SBI exhibited by NEF 452), Npcf (SBI exhibited by PCF 456), Nudm (SBI exhibited by the UDM 458), Naf (SBI exhibited by AF 460), Nnrf (SBI exhibited by NRF 454), Nnssf (SBI exhibited by NSSF 450), Nausf (SBI exhibited by AUSF 442). Other service-based interfaces (e.g., Nudr, N5g-eir, and Nudsf) not shown in FIG. 4 can also be used. In some embodiments, the NEF 452 can provide an interface to edge compute nodes 436x, which can be used to process wireless connections with the RAN 414. In some implementations, the system 400 may include an SMSF, which is responsible for SMS subscription checking and verification, and relaying SM messages to/from the UE 402 to/from other entities, such as an SMS-GMSC/IWMSC/SMS- router. The SMS may also interact with AMF 444 and UDM 458 for a notification procedure that the UE 402 is available for SMS transfer (e.g., set a UE not reachable flag, and notifying UDM 458 when UE 402 is available for SMS).

The 5GS may also include an SCP (or individual instances of the SCP) that supports indirect communication (see e.g., 3GPP TS 23.501 section 7.1.1); delegated discovery (see e.g., 3GPP TS 23.501 section 7.1.1); message forwarding and routing to destination NF/NF service(s), communication security (e.g., authorization of the NF Service Consumer to access the NF Service Producer API) (see e.g., 3GPP TS 33.501), load balancing, monitoring, overload control, etc.; and discovery and selection functionality for UDM(s), AUSF(s), UDR(s), PCF(s) with access to subscription data stored in the UDR based on UE's SUPI, SUCI or GPSI (see e.g., 3GPP TS 23.501 section 6.3). Load balancing, monitoring, overload control functionality provided by the SCP may be implementation specific. The SCP may be deployed in a distributed manner. More than one SCP can be present in the communication path between various NF Services. The SCP, although not an NF instance, can also be deployed distributed, redundant, and scalable.

FIG. 5 schematically illustrates a wireless network 500 in accordance with various embodiments. The wireless network 500 may include a UE 502 in wireless communication with an AN 504. The UE 502 and AN 504 may be similar to, and substantially interchangeable with, like-named components described with respect to FIG. 4.

The UE 502 may be communicatively coupled with the AN 504 via connection 506. The connection 506 is illustrated as an air interface to enable communicative coupling, and can be consistent with cellular communications protocols such as an LTE protocol or a 5G NR protocol operating at mmWave or sub-6GHz frequencies.

The UE 502 may include a host platform 508 coupled with a modem platform 510. The host platform 508 may include application processing circuitry 512, which may be coupled with protocol processing circuitry 7 514 of the modem platform 510. The application processing circuitry 512 may run various applications for the UE 502 that source/sink application data. The application processing circuitry 512 may further implement one or more layer operations to transmit/receive application data to/from a data network. These layer operations may include transport (for example UDP) and Internet (for example, IP) operations

The protocol processing circuitry 514 may implement one or more of layer operations to facilitate transmission or reception of data over the connection 506. The layer operations implemented by the protocol processing circuitry 514 may include, for example, MAC, RLC, PDCP, RRC and NAS operations.

The modem platform 510 may further include digital baseband circuitry 516 that may implement one or more layer operations that are “below” layer operations performed by the protocol processing circuitry 7 514 in a network protocol stack. These operations may include, for example, PHY operations including one or more of HARQ acknowledgement (ACK) functions, scrambling/descrambling, encoding/decoding, layer mapping/de-mapping, modulation symbol mapping, received symbol/bit metric determination, multi-antenna port precoding/decoding, which may include one or more of space-time, space-frequency or spatial coding, reference signal generation/detection, preamble sequence generation and/or decoding, synchronization sequence generation/detection, control channel signal blind decoding, and other related functions.

The modem platform 510 may further include transmit circuitry 518. receive circuitry 520, RF circuitry 7 522, and RF front end (RFFE) 524, which may include or connect to one or more antenna panels 526. Briefly, the transmit circuitry' 518 may include a digital-to-analog converter, mixer, intermediate frequency (IF) components, etc.; the receive circuitry 520 may include an analog-to-digital converter, mixer, IF components, etc.; the RF circuitry 522 may include a low-noise amplifier, a power amplifier, power tracking components, etc. ; RFFE 524 may include filters (for example, surface/bulk acoustic wave filters), sw itches, antenna tuners, beamforming components (for example, phase-array antenna components), etc. The selection and arrangement of the components of the transmit circuitry 518, receive circuitry 520, RF circuitry 522, RFFE 524, and antenna panels 526 (referred generically as “transmit/receive components”) may be specific to details of a specific implementation such as, for example, whether communication is TDM or FDM, in mmWave or sub-6 gHz frequencies, etc. In some embodiments, the transmit/receive components may be arranged in multiple parallel transmit/receive chains, may be disposed in the same or different chips/modules, etc.

In some embodiments, the protocol processing circuitry' 514 may include one or more instances of control circuitry (not shown) to provide control functions for the transmit/receive components.

A UE 502 reception may be established by and via the antenna panels 526, RFFE 524, RF circuitry 522, receive circuitry 520, digital baseband circuitry 516, and protocol processing circuitry' 514. In some embodiments, the antenna panels 526 may receive a transmission from the AN 504 by receive-beamforming signals received by a plurality of antennas/antenna elements of the one or more antenna panels 526.

A UE 502 transmission may be established by and via the protocol processing circuitry 514, digital baseband circuitry' 516, transmit circuitry' 518, RF circuitry' 522, RFFE 524, and antenna panels 526. In some embodiments, the transmit components of the UE 504 may apply a spatial filter to the data to be transmitted to form a transmit beam emitted by the antenna elements of the antenna panels 526.

Similar to the UE 502, the AN 504 may include a host platform 528 coupled with a modem platform 530. The host platform 528 may include application processing circuitry' 532 coupled with protocol processing circuitry 534 of the modem platform 530. The modem platform may further include digital baseband circuitry 536, transmit circuitry 538, receive circuitry 540, RF circuitry 542, RFFE circuitry 544, and antenna panels 546. The components of the AN 504 may be similar to and substantially interchangeable with like-named components of the UE 502. In addition to performing data transmission/reception as described above, the components of the AN 508 may perform various logical functions that include, for example, RNC functions such as radio bearer management, uplink and downlink dynamic radio resource management, and data packet scheduling.

FIG. 6 illustrates components of a computing device 600 according to some example embodiments, able to read instructions from a machine-readable or computer-readable medium (e.g., a non-transitory machine-readable storage medium) and perform any one or more of the methodologies discussed herein. Specifically, FIG. 6 shows a diagrammatic representation of hardware resources 601 including one or more processors (or processor cores) 610, one or more memory /storage devices 620, and one or more communication resources 630, each of which may be communicatively coupled via a bus 640 or other interface circuitry. For embodiments where node virtualization (e.g., NFV) is utilized, a hypervisor 602 may be executed to provide an execution environment for one or more network slices/sub-slices to utilize the hardware resources 601.

The processors 610 include, for example, processor 612 and processor 614. The processors 610 include circuitry such as, but not limited to one or more processor cores and one or more of cache memory. low drop-out voltage regulators (LDOs), interrupt controllers, serial interfaces such as SPI, I2C or universal programmable serial interface circuit, real time clock (RTC), timer-counters including interval and watchdog timers, general purpose I/O, memory card controllers such as secure digital/multi-media card (SD/MMC) or similar, interfaces, mobile industry processor interface (MIPI) interfaces and Joint Test Access Group (JTAG) test access ports. The processors 610 may be, for example, a central processing unit (CPU), reduced instruction set computing (RISC) processors, Acom RISC Machine (ARM) processors, complex instruction set computing (CISC) processors, graphics processing units (GPUs), one or more Digital Signal Processors (DSPs) such as a baseband processor, Application-Specific Integrated Circuits (ASICs), an Field-Programmable Gate Array (FPGA), a radio-frequency integrated circuit (RFIC), one or more microprocessors or controllers, another processor (including those discussed herein), or any suitable combination thereof. In some implementations, the processor circuitry 610 may include one or more hardware accelerators, which may be microprocessors, programmable processing devices (e.g., FPGA, complex programmable logic devices (CPLDs), etc.), or the like.

The memory /storage devices 620 may include main memory, disk storage, or any suitable combination thereof. The memory /storage devices 620 may include, but are not limited to, any type of volatile, non-volatile, or semi-volatile memory such as random access memory (RAM), dynamic RAM (DRAM), static RAM (SRAM), synchronous DRAM (SDRAM), erasable programmable read-only memory (EPROM), electrically erasable programmable read-only memory (EEPROM), Flash memory, solid-state storage, phase change RAM (PRAM), resistive memory such as magnetoresistive random access memory (MRAM), etc., and may incorporate three-dimensional (3D) cross-point (XPOINT) memories from Intel® and Micron®. The memory/storage devices 620 may also comprise persistent storage devices, which may be temporal and/or persistent storage of any type, including, but not limited to, nonvolatile memory, optical, magnetic, and/or solid state mass storage, and so forth.

The communication resources 630 may include interconnection or network interface controllers, components, or other suitable devices to communicate with one or more peripheral devices 604 or one or more databases 606 or other network elements via a network 608. For example, the communication resources 630 may include wired communication components (e.g., for coupling via USB, Ethernet, Ethernet, Ethernet over GRE Tunnels, Ethernet over Multiprotocol Label Switching (MPLS), Ethernet over USB, Controller Area Network (CAN), Local Interconnect Network (LIN), DeviceNet. ControlNet, Data Highway-i-, PROFIBUS, or PROFINET, among many others), cellular communication components, NFC components, Bluetooth® (or Bluetooth® Low Energy ) components, WiFi® components, and other communication components. Network connectivity may be provided to/from the computing device 600 via the communication resources 630 using a physical connection, which may be electrical (e.g.. a “copper interconnect”) or optical. The physical connection also includes suitable input connectors (e g., ports, receptacles, sockets, etc.) and output connectors (e.g., plugs, pins, etc.). The communication resources 630 may include one or more dedicated processors and/or FPGAs to communicate using one or more of the aforementioned network interface protocols.

Instructions 650 may comprise software, a program, an application, an applet, an app, or other executable code for causing at least any of the processors 610 to perform any one or more of the methodologies discussed herein. The instructions 650 may reside, completely or partially, within at least one of the processors 610 (e.g., within the processor’s cache memory), the memory/storage devices 620, or any suitable combination thereof. Furthermore, any portion of the instructions 650 may be transferred to the hardware resources 601 from any combination of the peripheral devices 604 or the databases 606. Accordingly, the memory' of processors 610, the memory/storage devices 620, the peripheral devices 604, and the databases 606 are examples of computer-readable and machine-readable media.

Figure 7 illustrates a network 700 in accordance with various embodiments. The network 700 may operate in a matter consistent with 3GPP technical specifications or technical reports for 6G systems. In some embodiments, the network 700 may operate concurrently with network 400. For example, in some embodiments, the network 700 may share one or more frequency or bandwidth resources with network 400. As one specific example, a UE (e.g., UE 702) may be configured to operate in both network 700 and network 400. Such configuration may be based on a UE including circuitry configured for communication with frequency and bandwidth resources of both networks 400 and 700. In general, several elements of network 700 may share one or more characteristics with elements of network 400. For the sake of brevity 7 and clarity 7 , such elements may not be repeated in the description of network 700.

The network 700 may include a UE 702, which may include any mobile or non-mobile computing device designed to communicate with a RAN 708 via an over-the-air connection. The UE 702 may be similar to, for example, UE 402. The UE 702 may be, but is not limited to, a smartphone, tablet computer, wearable computer device, desktop computer, laptop computer, in-vehicle infotainment, in-car entertainment device, instrument cluster, head-up display device, onboard diagnostic device, dashtop mobile equipment, mobile data terminal, electronic engine management system, electronic/engine control unit, electron! c/engine control module, embedded system, sensor, microcontroller, control module, engine management system, networked appliance, machine-ty pe communication device, M2M or D2D device, loT device, etc.

Although not specifically shown in Figure 7, in some embodiments the network 700 may include a plurality of UEs coupled directly with one another via a sidelink interface. The UEs may be M2M/D2D devices that communicate using physical sidelink channels such as, but not limited to, PSBCH, PSDCH, PSSCH, PSCCH, PSFCH, etc. Similarly, although not specifically shown in Figure 7, the UE 702 may be communicatively coupled with an AP such as AP 406 as described with respect to Figure 4. Additionally, although not specifically shown in Figure 7, in some embodiments the RAN 708 may include one or more ANss such as AN 408 as described with respect to Figure 4. The RAN 708 and/or the AN of the RAN 708 may be referred to as a base station (BS), a RAN node, or using some other term or name.

The UE 702 and the RAN 708 may be configured to communicate via an air interface that may be referred to as a sixth generation (6G) air interface. The 6G air interface may include one or more features such as communication in a terahertz (THz) or sub-THz bandwidth, or joint communication and sensing. As used herein, the term “joint communication and sensing” may refer to a system that allows for wireless communication as well as radar-based sensing via various types of multiplexing. As used herein, THz or sub-THz bandwidths may refer to communication in the 80 GHz and above frequency ranges. Such frequency ranges may additionally or alternatively be referred to as “millimeter wave” or “mmWave” frequency ranges. The RAN 708 may allow for communication between the UE 702 and a 6G core network (CN) 710. Specifically, the RAN 708 may facilitate the transmission and reception of data between the UE 702 and the 6G CN 710. The 6G CN 710 may include various functions such as NSSF 450, NEF 452, NRF 454, PCF 456, UDM 458, AF 460, SMF 446, and AUSF 442. The 6G CN 710 may additional include UPF 448 and DN 436 as shown in Figure 7.

Additionally, the RAN 708 may include various additional functions that are in addition to, or alternative to, functions of a legacy cellular network such as a 4G or 5G network. Two such functions may include a Compute Control Function (Comp CF) 724 and a Compute Service Function (Comp SF) 736. The Comp CF 724 and the Comp SF 736 may be parts or functions of the Computing Service Plane. Comp CF 724 may be a control plane function that provides functionalities such as management of the Comp SF 736. computing task context generation and management (e.g., create, read, modify, delete), interaction with the underlaying computing infrastructure for computing resource management, etc.. Comp SF 736 may be a user plane function that sen es as the gateway to interface computing sen ice users (such as UE 702) and computing nodes behind a Comp SF instance. Some functionalities of the Comp SF 736 may include: parse computing service data received from users to compute tasks executable by computing nodes; hold service mesh ingress gateway or service API gateway; service and charging policies enforcement; performance monitoring and telemetry 7 collection, etc. In some embodiments, a Comp SF 736 instance may serve as the user plane gateway for a cluster of computing nodes. A Comp CF 724 instance may control one or more Comp SF 736 instances.

Two other such functions may include a Communication Control Function (Comm CF) 728 and a Communication Service Function (Comm SF) 738, which may be parts of the Communication Service Plane. The Comm CF 728 may be the control plane function for managing the Comm SF 738, communication sessions creation/configuration/releasing, and managing communication session context. The Comm SF 738 may be a user plane function for data transport. Comm CF 728 and Comm SF 738 may be considered as upgrades of SMF 446 and UPF 448, which were described with respect to a 5G system in Figure 4. The upgrades provided by the Comm CF 728 and the Comm SF 738 may enable service-aware transport. For legacy (e g., 4G or 5G) data transport, SMF 446 and UPF 448 may still be used.

Tw o other such functions may include a Data Control Function (Data CF) 722 and Data Service Function (Data SF) 732 may be parts of the Data Service Plane. Data CF 722 may be a control plane function and provides functionalities such as Data SF 732 management. Data service creation/configuration/releasing, Data service context management, etc. Data SF 732 may be a user plane function and serve as the gateway between data service users (such as UE 702 and the various functions of the 6G CN 710) and data service endpoints behind the gateway. Specific functionalities may include include: parse data service user data and forward to corresponding data service endpoints, generate charging data, report data service status.

Another such function may be the Sendee Orchestration and Chaining Function (SOCF) 720. which may discover, orchestrate and chain up communication/computing/data services provided by functions in the network. Upon receiving service requests from users, SOCF 720 may interact with one or more of Comp CF 724, Comm CF 728, and Data CF 722 to identify Comp SF 736, Comm SF 738, and Data SF 732 instances, configure sendee resources, and generate the service chain, which could contain multiple Comp SF 736, Comm SF 738, and Data SF 732 instances and their associated computing endpoints. Workload processing and data movement may then be conducted within the generated service chain. The SOCF 720 may also responsible for maintaining, updating, and releasing a created service chain.

Another such function may be the service registration function (SRF) 714, which may act as a registry for system services provided in the user plane such as services provided by service endpoints behind Comp SF 736 and Data SF 732 gateways and sendees provided by the UE 702. The SRF 714 may be considered a counterpart of NRF 454, which may act as the registry for network functions.

Other such functions may include an evolved service communication proxy (eSCP) and service infrastructure control function (SICF) 726, which may provide service communication infrastructure for control plane services and user plane sendees. The eSCP may be related to the service communication proxy (SCP) of 5G with user plane sendee communication proxy capabilities being added. The eSCP is therefore expressed in two parts: eCSP-C 712 and eSCP- U 734, for control plane sendee communication proxy and user plane service communication proxy, respectively. The SICF 726 may control and configure eCSP instances in terms of service traffic routing policies, access rules, load balancing configurations, performance monitoring, etc.

Another such function is the AMF 744. The AMF 744 may be similar to 444, but with additional functionality. Specifically, the AMF 744 may include potential functional repartition, such as move the message forwarding functionality from the AMF 744 to the RAN 708. Another such function is the sen-ice orchestration exposure function (SOEF) 718. The SOEF may be configured to expose service orchestration and chaining services to external users such as applications.

The UE 702 may include an additional function that is referred to as a computing client service function (comp CSF) 704. The comp CSF 704 may have both the control plane functionalities and user plane functionalities, and may interact with corresponding network side functions such as SOCF 720, Comp CF 724, Comp SF 736. Data CF 722, and/or Data SF 732 for sendee discovery, request/response, compute task workload exchange, etc. The Comp CSF 704 may also work with network side functions to decide on whether a computing task should be run on the UE 702, the RAN 708, and/or an element of the 6G CN 710.

The UE 702 and/or the Comp CSF 704 may include a service mesh proxy 706. The service mesh proxy 706 may act as a proxy for service-to-service communication in the user plane. Capabilities of the sendee mesh proxy 706 may include one or more of addressing, security, load balancing, etc.

Figure 8 illustrates a simplified block diagram of artificial (Al)-assisted communication between a UE 805 and a RAN 810, in accordance with various embodiments. More specifically, as described in further detail below, Al/machine learning (ML) models may be used or leveraged to facilitate over-the-air communication between UE 805 and RAN 810.

One or both of the UE 805 and the RAN 810 may operate in a matter consistent with 3GPP technical specifications or technical reports for 6G systems. In some embodiments, the wireless cellular communication between the UE 805 and the RAN 810 may be part of, or operate concurrently with, networks 700, 400, and/or some other network described herein.

The UE 805 may be similar to, and share one or more features with, UE 702, UE 402, and/or some other UE described herein. The UE 805 may be. but is not limited to, a smartphone, tablet computer, wearable computer device, desktop computer, laptop computer, in-vehicle infotainment, in-car entertainment device, instrument cluster, head-up display device, onboard diagnostic device, dashtop mobile equipment, mobile data terminal, electronic engine management system, electronic/engine control unit, electronic/engine control module, embedded system, sensor, microcontroller, control module, engine management system, networked appliance, machine-type communication device, M2M or D2D device, loT device, etc. The RAN 810 may be similar to, and share one or more features with, RAN 414, RAN 708, and/or some other RAN described herein.

As may be seen in Figure 8, the Al-related elements of UE 805 may be similar to the Al -related elements of RAN 810. For the sake of discussion herein, description of the various elements will be provided from the point of view of the UE 805. however it will be understood that such discussion or description will apply to equally named/numbered elements of RAN 810, unless explicitly stated otherwise.

As previously noted, the UE 805 may include various elements or functions that are related to AI/ML. Such elements may be implemented as hardware, software, firmware, and/or some combination thereof. In embodiments, one or more of the elements may be implemented as part of the same hardware (e.g., chip or multi-processor chip), software (e.g., a computing program), or firmware as another element.

One such element may be a data repository 815. The data repository 815 may be responsible for data collection and storage. Specifically, the data repository 815 may collect and store RAN configuration parameters, measurement data, performance key performance indicators (KPIs), model performance metrics, etc., for model training, update, and inference. More generally, collected data is stored into the repository. Stored data can be discovered and extracted by other elements from the data repository 815. For example, as may be seen, the inference data selection/filter element 850 may retrieve data from the data repository 815. In various embodiments, the UE 805 may be configured to discover and request data from the data repository 810 in the RAN, and vice versa. More generally, the data repository 815 of the UE 805 may be communicatively coupled with the data repository' 815 of the RAN 810 such that the respective data repositories of the UE and the RAN may share collected data with one another.

Another such element may be a training data selection/filtering functional block 820. The training data selection/filter functional block 820 may be configured to generate training, validation, and testing datasets for model training. Training data may be extracted from the data repository 815. Data may be selected/filtered based on the specific AI/ME model to be trained. Data may optionally be transformed/augmented/pre-processed (e.g., normalized) before being loaded into datasets. The training data selection/filter functional block 820 may label data in datasets for supervised learning. The produced datasets may then be fed into model training the model training functional block 825.

As noted above, another such element may be the model training functional block 825. This functional block may be responsible fortraining and updating(re-training) AI/ML models. The selected model may be trained using the fed-in datasets (including training, validation, testing) from the training data selection/filtering functional block. The model training functional block 825 may produce trained and tested AI/ML models which are ready for deployment. The produced trained and tested models can be stored in a model repository 835. The model repository 835 may be responsible for AI/ML models’ (both trained and untrained) storage and exposure. Trained/updated model(s) may be stored into the model repository 835. Model and model parameters may be discovered and requested by other functional blocks (e.g., the training data selection/filter functional block 820 and/or the model training functional block 825). In some embodiments, the UE 805 may discover and request AI/ML models from the model repository 835 of the RAN 810. Similarly, the RAN 810 may be able to discover and/or request AI/ML models from the model repository 835 of the UE 805. In some embodiments, the RAN 810 may configure models and/or model parameters in the model repository 835 of the UE 805.

Another such element may be a model management functional block 840. The model management functional block 840 may be responsible for management of the AI/ML model produced by the model training functional block 825. Such management functions may include deployment of a trained model, monitoring model performance, etc. In model deployment, the model management functional block 840 may allocate and schedule hardware and/or software resources for inference, based on received trained and tested models. As used herein, "inference” refers to the process of using trained AI/ML model(s) to generate data analytics, actions, policies, etc. based on input inference data. In performance monitoring, based on wireless performance KPIs and model performance metrics, the model management functional block 840 may decide to terminate the running model, start model re-training, select another model, etc. In embodiments, the model management functional block 840 of the RAN 810 may be able to configure model management policies in the UE 805 as shown.

Another such element may be an inference data selection/filtering functional block 850. The inference data selection/fdter functional block 850 may be responsible for generating datasets for model inference at the inference functional block 845, as described below. Specifically, inference data may be extracted from the data repository 815. The inference data selection/filter functional block 850 may select and/or filter the data based on the deployed AI/ML model. Data may be transformed/augmented/pre-processed following the same transformation/augmentation/pre-processing as those in training data selection/filtering as described with respect to functional block 820. The produced inference dataset may be fed into the inference functional block 845.

Another such element may be the inference functional block 845. The inference functional block 845 may be responsible for executing inference as described above. Specifically, the inference functional block 845 may consume the inference dataset provided by the inference data selection/filtering functional block 850, and generate one or more outcomes. Such outcomes may be or include data analytics, actions, policies, etc. The outcome(s) may be provided to the performance measurement functional block 830.

The performance measurement functional block 830 may be configured to measure model performance metrics (e.g., accuracy, model bias, run-time latency, etc.) of deployed and executing models based on the inference outcome(s) for monitoring purpose. Model performance data may be stored in the data repository 815.

For one or more embodiments, at least one of the components set forth in one or more of the preceding figures may be configured to perform one or more operations, techniques, processes, and/or methods as set forth in the example section below. For example, the baseband circuitry as described above in connection with one or more of the preceding figures may be configured to operate in accordance with one or more of the examples set forth below. For another example, circuitry associated with a UE, base station, network element, etc. as described above in connection with one or more of the preceding figures may be configured to operate in accordance with one or more of the examples set forth below in the example section.

Additional examples of the presently described embodiments include the following, non-limiting implementations. Each of the following non-limiting examples may stand on its own or may be combined in any permutation or combination with any one or more of the other examples provided below or throughout the present disclosure.

For one or more embodiments, at least one of the components set forth in one or more of the preceding figures may be configured to perform one or more operations, techniques, processes, and/or methods as set forth in the example section below. For example, the baseband circuitry as described above in connection with one or more of the preceding figures may be configured to operate in accordance with one or more of the examples set forth below. For another example, circuitry associated with a UE, base station, network element, etc. as described above in connection with one or more of the preceding figures may be configured to operate in accordance with one or more of the examples set forth below.

The following examples pertain to further embodiments.

Example 1 may include an apparatus comprising a processing circuitry’ configured to receive a request from a service consumer for obtaining an inference output from an artificial intelligence (AI)/machine learning (ML) inference function; determine the AI/ML inference function requested by the service consumer, the AI/ML inference function being selected from a group consisting of network data analytics function (NWDAF), management data analytics function (MDAF), radio access network (RAN) intelligence mobility robustness optimization (MRO) function, RAN intelligence mobility load balancing (MLB) function, and RAN intelligence energy saving (ES) function; configure the selected AI/ML inference function based on the request from the service consumer; execute the configured AI/ML inference function to generate the inference output; and send the inference output to the sendee consumer.

Example 2 may include the apparatus of example 1 and/or some other example herein, wherein the processing circuitry may be further configured to report the inference output to the consumer through at least one of notification, file, or data streaming.

Example 3 may include the apparatus of example 1 and/or some other example herein, wherein the configured AI/ML inference function may be dynamically adjusted in real-time based on changing network conditions.

Example 4 may include the apparatus of example 1 and/or some other example herein, wherein the processing circuitry may be further configured to receive feedback related to the inference output.

Example 5 may include the apparatus of example 4 and/or some other example herein, wherein the feedback may be represented by a Managed Object Instance (MOI).

Example 6 may include the apparatus of example 1 and/or some other example herein, wherein the processing circuitry may be further configured to execute actions within a 5G system (5GS) based on the inference output and subsequently report the actions performed to the consumer.

Example 7 may include the apparatus of example 6 and/or some other example herein, wherein the actions taken within the 5G system are reported to the consumer through notifications.

Example 8 may include the apparatus of example 1 and/or some other example herein, wherein the processing circuitry may be further configured to monitor network performance related to the configured AI/ML inference function and report performance data associated with the AI/ML inference function to the service consumer.

Example 9 may include the apparatus of example 8 and/or some other example herein, wherein the processing circuitry may be further configured to receive performance data requests from the consumer, respond to the requests, and provide results.

Example 10 may include the apparatus of example 1 and/or some other example herein, wherein the processing circuitry may be further configured to: receive a request from a sendee consumer to manage the AI/ML inference function via a managed objected instance (MOI); configure the AI/ML inference function; and respond to the consumer to indicate a result of the AI/ML inference function management. Example 11 may include the apparatus of example 10 and/or some other example herein, wherein the MOI contains at least one of a policy for the inference function, a target for the inference function, conditions for triggering the inference, or activation and deactivation status.

Example 12 may include a computer-readable medium storing computer-executable instructions which when executed by one or more processors result in performing operations comprising: receiving a request from a service consumer for obtaining an inference output from an artificial intelligence (AI)/machine learning (ML) inference function; determining the AI/ML inference function requested by the service consumer, the AI/ML inference function being selected from a group consisting of network data analytics function (NWDAF), management data analytics function (MDAF), radio access network (RAN) intelligence mobility robustness optimization (MRO) function, RAN intelligence mobility load balancing (MLB) function, and RAN intelligence energy saving (ES) function; configuring the selected AI/ML inference function based on the request from the service consumer; executing the configured AI/ML inference function to generate the inference output; and sending the inference output to the service consumer.

Example 13 may include the computer-readable medium of example 12 and/or some other example herein, wherein the operations further comprise reporting the inference output to the consumer through at least one of notification, file, or data streaming.

Example 14 may include the computer-readable medium of example 12 and/or some other example herein, wherein the configured AI/ML inference function may be dynamically adjusted in real-time based on changing network conditions.

Example 15 may include the computer-readable medium of example 12 and/or some other example herein, wherein the operations further comprise receiving feedback related to the inference output.

Example 16 may include the computer-readable medium of example 15 and/or some other example herein, wherein the feedback may be represented by a Managed Obj ect Instance (MOI).

Example 17 may include the computer-readable medium of example 12 and/or some other example herein, wherein the operations further comprise executing actions within a 5G system (5GS) based on the inference output and subsequently report the actions performed to the consumer. Example 18 may include the computer-readable medium of example 17 and/or some other example herein, wherein the actions taken within the 5G system are reported to the consumer through notifications.

Example 19 may include the computer-readable medium of example 12 and/or some other example herein, wherein the operations further comprise monitoring network performance related to the configured AI/ML inference function and report performance data associated with the AI/ML inference function to the service consumer.

Example 20 may include the computer-readable medium of example 19 and/or some other example herein, wherein the operations further comprise receiving performance data requests from the consumer, responding to the requests, and providing results.

Example 21 may include the computer-readable medium of example 12 and/or some other example herein, wherein the operations further comprise: receiving a request from a service consumer to manage the AI/ML inference function via a managed objected instance (MOI); configuring the AI/ML inference function; and responding to the consumer to indicate a result of the AI/ML inference function management.

Example 22 may include the computer-readable medium of example 21 and/or some other example herein, wherein the MOI contains at least one of a policy for the inference function, a target for the inference function, conditions for triggering the inference, or activation and deactivation status.

Example 23 may include a method comprising: receiving, by one or more processors, a request from a service consumer for obtaining an inference output from an artificial intelligence (AI)/machine learning (ML) inference function; determining the AI/ML inference function requested by the service consumer, the AI/ML inference function being selected from a group consisting of network data analytics function (NWDAF), management data analytics function (MDAF), radio access network (RAN) intelligence mobility robustness optimization (MRO) function, RAN intelligence mobility load balancing (MLB) function, and RAN intelligence energy saving (ES) function; configuring the selected AI/ML inference function based on the request from the service consumer; executing the configured AI/ML inference function to generate the inference output; and sending the inference output to the sendee consumer.

Example 24 may include the method of example 23 and/or some other example herein, further comprising reporting the inference output to the consumer through at least one of notification, file, or data streaming. Example 25 may include the method of example 23 and/or some other example herein, wherein the configured AI/ML inference function may be dynamically adjusted in real-time based on changing network conditions.

Example 26 may include the method of example 23 and/or some other example herein, further comprising receiving feedback related to the inference output.

Example 27 may include the method of example 26 and/or some other example herein, wherein the feedback may be represented by a Managed Object Instance (MOI).

Example 28 may include the method of example 23 and/or some other example herein, further comprising executing actions within a 5G system (5GS) based on the inference output and subsequently report the actions performed to the consumer.

Example 29 may include the method of example 28 and/or some other example herein, wherein the actions taken within the 5G system are reported to the consumer through notifications.

Example 30 may include the method of example 23 and/or some other example herein, further comprising monitoring network performance related to the configured AI/ML inference function and report performance data associated with the AI/ML inference function to the service consumer.

Example 31 may include the method of example 30 and/or some other example herein, further comprising receiving performance data requests from the consumer, responding to the requests, and providing results.

Example 32 may include the method of example 23 and/or some other example herein, further comprising: receiving a request from a service consumer to manage the AI/ML inference function via a managed objected instance (MOI); configuring the AI/ML inference function; and responding to the consumer to indicate a result of the AI/ML inference function management.

Example 33 may include the method of example 32 and/or some other example herein, wherein the MOI contains at least one of a policy for the inference function, a target for the inference function, conditions for triggering the inference, or activation and deactivation status.

Example 34 may include an apparatus comprising means for: receiving a request from a service consumer for obtaining an inference output from an artificial intelligence (AI)/machine learning (ML) inference function; determining the AI/ML inference function requested by the service consumer, the AI/ML inference function being selected from a group consisting of network data analytics function (NWDAF), management data analytics function (MDAF), radio access network (RAN) intelligence mobility robustness optimization (MRO) function, RAN intelligence mobility' load balancing (MLB) function, and RAN intelligence energy saving (ES) function; configuring the selected AI/ML inference function based on the request from the service consumer; executing the configured AI/ML inference function to generate the inference output; and sending the inference output to the service consumer.

Example 35 may include the apparatus of example 34 and/or some other example herein, further comprising reporting the inference output to the consumer through at least one of notification, file, or data streaming.

Example 36 may include the apparatus of example 34 and/or some other example herein, wherein the configured AI/ML inference function may be dynamically adjusted in realtime based on changing network conditions.

Example 37 may’ include the apparatus of example 34 and/or some other example herein, further comprising receiving feedback related to the inference output.

Example 38 may include the apparatus of example 37 and/or some other example herein, wherein the feedback may be represented by a Managed Object Instance (MOI).

Example 39 may include the apparatus of example 34 and/or some other example herein, further comprising executing actions within a 5G system (5GS) based on the inference output and subsequently report the actions performed to the consumer.

Example 40 may include the apparatus of example 39 and/or some other example herein, wherein the actions taken within the 5G system are reported to the consumer through notifications.

Example 41 may include the apparatus of example 34 and/or some other example herein, further comprising monitoring network performance related to the configured AI/ML inference function and report performance data associated with the AI/ML inference function to the service consumer.

Example 42 may include the apparatus of example 41 and/or some other example herein, further comprising receiving performance data requests from the consumer, responding to the requests, and providing results.

Example 43 may include the apparatus of example 34 and/or some other example herein, further comprising: receiving a request from a service consumer to manage the AI/ML inference function via a managed objected instance (MOI); configuring the AI/ML inference function; and responding to the consumer to indicate a result of the AI/ML inference function management.

Example 44 may’ include the apparatus of example 43 and/or some other example herein, wherein the MOI contains at least one of a policy for the inference function, a target for the inference function, conditions for triggering the inference, or activation and deactivation status.

Example 45 may include an apparatus comprising means for performing any of the methods of examples 1-44.

Example 46 may include a network node comprising a communication interface and processing circuitry connected thereto and configured to perform the methods of examples 1 - 44.

Example 47 may include an apparatus comprising means to perform one or more elements of a method described in or related to any of examples 1-44, or any other method or process described herein.

Example 48 may include one or more non-transitory computer-readable media comprising instructions to cause an electronic device, upon execution of the instructions by one or more processors of the electronic device, to perform one or more elements of a method described in or related to any of examples 1-44, or any other method or process described herein.

Example 49 may include an apparatus comprising logic, modules, or circuitry to perform one or more elements of a method described in or related to any of examples 1-44, or any other method or process described herein.

Example 50 may include a method, technique, or process as described in or related to any of examples 1-44, or portions or parts thereof.

Example 51 may include an apparatus comprising: one or more processors and one or more computer-readable media comprising instructions that, when executed by the one or more processors, cause the one or more processors to perform the method, techniques, or process as described in or related to any of examples 1-44, or portions thereof.

Example 52 may include a signal as described in or related to any of examples 1-44, or portions or parts thereof.

Example 53 may include a datagram, packet, frame, segment, protocol data unit (PDU), or message as described in or related to any of examples 1-44, or portions or parts thereof, or otherwise described in the present disclosure.

Example 54 may include a signal encoded w ith data as described in or related to any of examples 1-44, or portions or parts thereof, or otherwise described in the present disclosure.

Example 55 may include a signal encoded with a datagram, packet, frame, segment, protocol data unit (PDU), or message as described in or related to any of examples 1-44, or portions or parts thereof, or otherwise described in the present disclosure. Example 56 may include an electromagnetic signal carrying computer-readable instructions, wherein execution of the computer-readable instructions by one or more processors is to cause the one or more processors to perform the method, techniques, or process as described in or related to any of examples 1-44, or portions thereof.

Example 57 may include a computer program comprising instructions, wherein execution of the program by a processing element is to cause the processing element to carry out the method, techniques, or process as described in or related to any of examples 1-44, or portions thereof.

Example 58 may include a signal in a wireless network as shown and described herein.

Example 59 may include a method of communicating in a wireless network as shown and described herein.

Example 60 may include a system for providing wireless communication as shown and described herein.

Example 61 may include a device for providing wireless communication as shown and described herein.

An example implementation is an edge computing system, including respective edge processing devices and nodes to invoke or perform the operations of the examples above, or other subject matter described herein. Another example implementation is a client endpoint node, operable to invoke or perform the operations of the examples above, or other subject matter described herein. Another example implementation is an aggregation node, network hub node, gateway node, or core data processing node, within or coupled to an edge computing system, operable to invoke or perform the operations of the examples above, or other subject matter described herein. Another example implementation is an access point, base station, road-side unit, street-side unit, or on-premise unit, within or coupled to an edge computing system, operable to invoke or perform the operations of the examples above, or other subject matter described herein. Another example implementation is an edge provisioning node, service orchestration node, application orchestration node, or multi-tenant management node, within or coupled to an edge computing system, operable to invoke or perform the operations of the examples above, or other subject matter described herein. Another example implementation is an edge node operating an edge provisioning service, application or service orchestration service, virtual machine deployment, container deployment, function deployment, and compute management, within or coupled to an edge computing system, operable to invoke or perform the operations of the examples above, or other subject matter described herein. Another example implementation is an edge computing system operable as an edge mesh, as an edge mesh with side car loading, or with mesh-to-mesh communications, operable to invoke or perform the operations of the examples above, or other subject matter described herein. Another example implementation is an edge computing system including aspects of network functions, acceleration functions, acceleration hardware, storage hardware, or computation hardware resources, operable to invoke or perform the use cases discussed herein, with use of the examples above, or other subject matter described herein. Another example implementation is an edge computing system adapted for supporting client mobility, vehicle-to-vehicle (V2V), vehicle-to-everything (V2X), or vehicle-to-infrastructure (V2I) scenarios, and optionally operating according to ETSI MEC specifications, operable to invoke or perform the use cases discussed herein, with use of the examples above, or other subject matter described herein. Another example implementation is an edge computing system adapted for mobile wireless communications, including configurations according to an 3GPP 4G/LTE or 5G network capabilities, operable to invoke or perform the use cases discussed herein, with use of the examples above, or other subject matter described herein. Another example implementation is a computing system adapted for network communications, including configurations according to an O-RAN capabilities, operable to invoke or perform the use cases discussed herein, with use of the examples above, or other subject matter described herein.

Any of the above-described examples may be combined with any other example (or combination of examples), unless explicitly stated otherwise. The foregoing description of one or more implementations provides illustration and description, but is not intended to be exhaustive or to limit the scope of embodiments to the precise form disclosed. Modifications and variations are possible in light of the above teachings or may be acquired from practice of various embodiments.

TERMINOLOGY

The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the disclosure. As used herein, the singular forms “a,” “an” and “the” are intended to include plural forms as well, unless the context clearly indicates otherwise. It will be further understood that the terms “comprises” and/or “comprising,” when used in this specification, specific the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operation, elements, components, and/or groups thereof.

For the purposes of the present disclosure, the phrase "A and/or B” means (A), (B), or (A and B). For the purposes of the present disclosure, the phrase "‘A, B, and/or C” means (A), (B), (C), (A and B), (A and C), (B and C), or (A, B and C). The description may use the phrases “in an embodiment,” or “In some embodiments,” which may each refer to one or more of the same or different embodiments. Furthermore, the terms “comprising,” “including,” “having,” and the like, as used with respect to embodiments of the present disclosure, are synonymous.

The terms “coupled,” “communicatively coupled,” along with derivatives thereof are used herein. The term “coupled” may mean two or more elements are in direct physical or electrical contact with one another, may mean that two or more elements indirectly contact each other but still cooperate or interact with each other, and/or may mean that one or more other elements are coupled or connected between the elements that are said to be coupled with each other. The term “directly coupled” may mean that two or more elements are in direct contact with one another. The term “communicatively coupled” may mean that two or more elements may be in contact with one another by a means of communication including through a wire or other interconnect connection, through a wireless communication channel or ink, and/or the like.

The term “circuitry” as used herein refers to, is part of, or includes hardware components such as an electronic circuit, a logic circuit, a processor (shared, dedicated, or group) and/or memory (shared, dedicated, or group), an Application Specific Integrated Circuit (ASIC), a field-programmable device (FPD) (e.g., a field-programmable gate array (FPGA), a programmable logic device (PLD), a complex PLD (CPLD), a high-capacity PLD (HCPLD), a structured ASIC, or a programmable SoC). digital signal processors (DSPs), etc., that are configured to provide the described functionality. In some embodiments, the circuitry may execute one or more software or firmware programs to provide at least some of the described functionality. The term “circuitry” may also refer to a combination of one or more hardware elements (or a combination of circuits used in an electrical or electronic system) with the program code used to carry out the functionality of that program code. In these embodiments, the combination of hardware elements and program code may be referred to as a particular type of circuitry.

The term “processor circuitry ” as used herein refers to, is part of, or includes circuitry capable of sequentially and automatically carrying out a sequence of arithmetic or logical operations, or recording, storing, and/or transferring digital data. Processing circuitry may include one or more processing cores to execute instructions and one or more memory structures to store program and data information. The term “processor circuitry” may refer to one or more application processors, one or more baseband processors, a physical central processing unit (CPU), a single-core processor, a dual-core processor, a triple-core processor, a quad-core processor, and/or any other device capable of executing or otherwise operating computer-executable instructions, such as program code, software modules, and/or functional processes. Processing circuitry may include more hardware accelerators, which may be microprocessors, programmable processing devices, or the like. The one or more hardware accelerators may include, for example, computer vision (CV) and/or deep learning (DL) accelerators. The terms “application circuitry” and/or “baseband circuitry " may be considered synonymous to, and may be referred to as. “processor circuitry.”

The term “memory” and/or “memory circuitry” as used herein refers to one or more hardware devices for storing data, including RAM, MRAM, PRAM, DRAM, and/or SDRAM, core memory, ROM, magnetic disk storage mediums, optical storage mediums, flash memory devices or other machine readable mediums for storing data. The term “computer-readable medium” may include, but is not limited to, memory', portable or fixed storage devices, optical storage devices, and various other mediums capable of storing, containing or carry ing instructions or data.

The term “interface circuitry” as used herein refers to, is part of, or includes circuitry that enables the exchange of information between two or more components or devices. The term “interface circuitry” may refer to one or more hardware interfaces, for example, buses, I/O interfaces, peripheral component interfaces, network interface cards, and/or the like.

The term “user equipment” or “UE” as used herein refers to a device with radio communication capabilities and may describe a remote user of network resources in a communications network. The term “user equipment” or “UE” may be considered synonymous to, and may be referred to as, client, mobile, mobile device, mobile terminal, user terminal, mobile unit, mobile station, mobile user, subscriber, user, remote station, access agent, user agent, receiver, radio equipment, reconfigurable radio equipment, reconfigurable mobile device, etc. Furthermore, the term “user equipment” or “UE” may include any type of wireless/wired device or any computing device including a wireless communications interface.

The term “network element” as used herein refers to physical or virtualized equipment and/or infrastructure used to provide wired or wireless communication network services. The term “network element” may be considered synonymous to and/or referred to as a networked computer, networking hardware, network equipment, network node, router, switch, hub, bridge, radio network controller, RAN device, RAN node, gateway, server, virtualized VNF, NFVI, and/or the like.

The term '‘computer system” as used herein refers to any type interconnected electronic devices, computer devices, or components thereof. Additionally, the term “computer system” and/or “system” may refer to various components of a computer that are communicatively coupled with one another. Furthermore, the term “computer system” and/or “system” may refer to multiple computer devices and/or multiple computing systems that are communicatively coupled with one another and configured to share computing and/or networking resources.

The term “appliance,” “computer appliance,” or the like, as used herein refers to a computer device or computer system with program code (e.g., software or firmware) that is specifically designed to provide a specific computing resource. A “virtual appliance” is a virtual machine image to be implemented by a hypervisor-equipped device that virtualizes or emulates a computer appliance or otherwise is dedicated to provide a specific computing resource. The term “element” refers to a unit that is indivisible at a given level of abstraction and has a clearly defined boundary, wherein an element may be any Npe of entity including, for example, one or more devices, systems, controllers, network elements, modules, etc., or combinations thereof. The term “device” refers to a physical entity embedded inside, or attached to, another physical entity in its vicinity, with capabilities to convey digital information from or to that physical entity. The term “entity” refers to a distinct component of an architecture or device, or information transferred as a payload. The term “controller” refers to an element or entity that has the capability to affect a physical entity, such as by changing its state or causing the physical entity 7 to move.

The term “cloud computing” or “cloud” refers to a paradigm for enabling network access to a scalable and elastic pool of shareable computing resources with self-service provisioning and administration on-demand and without active management by users. Cloud computing provides cloud computing services (or cloud services), which are one or more capabilities offered via cloud computing that are invoked using a defined interface (e.g.. an API or the like). The term “computing resource” or simply “resource” refers to any physical or virtual component, or usage of such components, of limited availability within a computer system or network. Examples of computing resources include usage/access to, for a period of time, servers, processor(s), storage equipment, memory devices, memory areas, networks, electrical power, input/output (peripheral) devices, mechanical devices, network connections (e.g., channels/links, ports, network sockets, etc.), operating systems, virtual machines (VMs), software/applications, computer files, and/or the like. A ‘‘hardware resource” may refer to compute, storage, and/or network resources provided by physical hardware element(s). A ■‘virtualized resource” may refer to compute, storage, and/or network resources provided by virtualization infrastructure to an application, device, system, etc. The term '‘network resource” or “communication resource” may refer to resources that are accessible by computer devices/systems via a communications network. The term “system resources” may refer to any kind of shared entities to provide services, and may include computing and/or network resources. System resources may be considered as a set of coherent functions, network data objects or services, accessible through a server where such system resources reside on a single host or multiple hosts and are clearly identifiable. As used herein, the term “cloud service provider” (or CSP) indicates an organization which operates typically large-scale “cloud” resources comprised of centralized, regional, and edge data centers (e.g., as used in the context of the public cloud). In other examples, a CSP may also be referred to as a Cloud Service Operator (CSO). References to “cloud computing” generally refer to computing resources and services offered by a CSP or a CSO, at remote locations with at least some increased latency, distance, or constraints relative to edge computing.

As used herein, the term “data center” refers to a purpose-designed structure that is intended to house multiple high-performance compute and data storage nodes such that a large amount of compute, data storage and network resources are present at a single location. This often entails specialized rack and enclosure systems, suitable heating, cooling, ventilation, security, fire suppression, and power delivery systems. The term may also refer to a compute and data storage node in some contexts. A data center may vary' in scale between a centralized or cloud data center (e.g., largest), regional data center, and edge data center (e.g., smallest).

As used herein, the term “edge computing” refers to the implementation, coordination, and use of computing and resources at locations closer to the “edge” or collection of “edges” of a network. Deploying computing resources at the network’s edge may reduce application and network latency, reduce network backhaul traffic and associated energy' consumption, improve service capabilities, improve compliance with security’ or data privacy requirements (especially as compared to conventional cloud computing), and improve total cost of ownership). As used herein, the term “edge compute node” refers to a real-world, logical, or virtualized implementation of a compute-capable element in the form of a device, gateway, bridge, system or subsystem, component, whether operating in a server, client, endpoint, or peer mode, and whether located at an “edge” of an network or at a connected location further within the network. References to a “node” used herein are generally interchangeable with a “device”, “component”, and “sub-system”; however, references to an “edge computing system” or “edge computing network” generally refer to a distributed architecture, organization, or collection of multiple nodes and devices, and which is organized to accomplish or offer some aspect of services or resources in an edge computing setting.

Additionally or alternatively, the term “Edge Computing” refers to a concept, as described in [6], that enables operator and 3rd party services to be hosted close to the UE's access point of attachment, to achieve an efficient service delivers- through the reduced end-to- end latency and load on the transport network. As used herein, the term “Edge Computing Service Provider” refers to a mobile network operator or a 3rd part) 7 service provider offering Edge Computing service. As used herein, the term “Edge Data Network” refers to a local Data Network (DN) that supports the architecture for enabling edge applications. As used herein, the term “Edge Hosting Environment” refers to an environment providing support required for Edge Application Server's execution. As used herein, the term “Application Server” refers to application software resident in the cloud performing the server function.

The term “Internet of Things” or “loT” refers to a system of interrelated computing devices, mechanical and digital machines capable of transferring data with little or no human interaction, and may involve technologies such as real-time analytics, machine learning and/or Al, embedded systems, wireless sensor networks, control systems, automation (e.g., smarthome, smart building and/or smart city technologies), and the like. loT devices are usually low-power devices without heavy compute or storage capabilities. “Edge loT devices” may be any kind of loT devices deployed at a network’s edge.

As used herein, the term “cluster” refers to a set or grouping of entities as part of an edge computing system (or systems), in the form of physical entities (e.g., different computing systems, networks or network groups), logical entities (e.g., applications, functions, security constructs, containers), and the like. In some locations, a “cluster” is also referred to as a “group” or a “domain”. The membership of cluster may be modified or affected based on conditions or functions, including from dynamic or property -based membership, from network or system management scenarios, or from various example techniques discussed below which may add, modify, or remove an entity in a cluster. Clusters may also include or be associated with multiple layers, levels, or properties, including variations in security features and results based on such layers, levels, or properties.

The term “application” may refer to a complete and deploy able package, environment to achieve a certain function in an operational environment. The term "AI/ML application” or the like may be an application that contains some AI/ML models and application-level descriptions. The term “machine learning"’ or “ML"’ refers to the use of computer systems implementing algorithms and/or statistical models to perform specific task(s) without using explicit instructions, but instead relying on patterns and inferences. ML algorithms build or estimate mathematical model(s) (referred to as “ML models” or the like) based on sample data (referred to as “training data,” “model training information,” or the like) in order to make predictions or decisions without being explicitly programmed to perform such tasks. Generally, an ML algorithm is a computer program that leams from experience with respect to some task and some performance measure, and an ML model may be any object or data structure created after an ML algorithm is trained with one or more training datasets. After training, an ML model may be used to make predictions on new datasets. Although the term “ML algorithm” refers to different concepts than the term “ML model,” these terms as discussed herein may be used interchangeably for the purposes of the present disclosure.

The term “machine learning model,” “ML model,” or the like may also refer to ML methods and concepts used by an ML-assisted solution. An “ML-assisted solution” is a solution that addresses a specific use case using ML algorithms during operation. ML models include supervised learning (e.g., linear regression, k-nearest neighbor (KNN), decision tree algorithms, support machine vectors, Bayesian algorithm, ensemble algorithms, etc.) unsupervised learning (e.g., K-means clustering, principle component analysis (PCA), etc.), reinforcement learning (e.g., Q-leaming, multi-armed bandit learning, deep RL, etc.), neural networks, and the like. Depending on the implementation a specific ML model could have many sub-models as components and the ML model may train all sub-models together. Separately trained ML models can also be chained together in an ML pipeline during inference. An “ML pipeline” is a set of functionalities, functions, or functional entities specific for an ML-assisted solution; an ML pipeline may include one or several data sources in a data pipeline, a model training pipeline, a model evaluation pipeline, and an actor. The “actor” is an entity that hosts an ML assisted solution using the output of the ML model inference). The term “ML training host” refers to an entity, such as a network function, that hosts the training of the model. The term “ML inference host” refers to an entity, such as a network function, that hosts model during inference mode (which includes both the model execution as well as any online learning if applicable). The ML-host informs the actor about the output of the ML algorithm, and the actor takes a decision for an action (an “action” is performed by an actor as a result of the output of an ML assisted solution). The term “model inference information” refers to information used as an input to the ML model for determining inference(s); the data used to train an ML model and the data used to determine inferences may overlap, however, “training data” and “inference data” refer to different concepts.

The terms “instantiate,” “instantiation,” and the like as used herein refers to the creation of an instance. An “instance” also refers to a concrete occurrence of an object, which may occur, for example, during execution of program code. The term “information element” refers to a structural element containing one or more fields. The term “field” refers to individual contents of an information element, or a data element that contains content. As used herein, a “database object”, “data structure”, or the like may refer to any representation of information that is in the form of an object, attribute-value pair (AVP), key -value pair (KVP), tuple, etc., and may include variables, data structures, functions, methods, classes, database records, database fields, database entities, associations between data and/or database entities (also referred to as a “relation”), blocks and links between blocks in block chain implementations, and/or the like.

An “information object,” as used herein, refers to a collection of structured data and/or any representation of information, and may include, for example electronic documents (or “documents”), database objects, data structures, files, audio data, video data, raw data, archive files, application packages, and/or any other like representation of information. The terms “electronic document” or “document,” may refer to a data structure, computer file, or resource used to record data, and includes various file types and/or data formats such as word processing documents, spreadsheets, slide presentations, multimedia items, webpage and/or source code documents, and/or the like. As examples, the information objects may include markup and/or source code documents such as HTML, XML, JSON, Apex®, CSS, JSP, MessagePack™, Apache® Thrift™, ASN. l, Google® Protocol Buffers (protobuf), or some other document(s)/format(s) such as those discussed herein. An information object may have both a logical and a physical structure. Physically, an information object comprises one or more units called entities. An entity is a unit of storage that contains content and is identified by a name. An entity may refer to other entities to cause their inclusion in the information object. An information object begins in a document entity, which is also referred to as a root element (or "root"). Logically, an information object comprises one or more declarations, elements, comments, character references, and processing instructions, all of w hich are indicated in the information object (e.g., using markup).

The term “data item” as used herein refers to an atomic state of a particular object with at least one specific property at a certain point in time. Such an object is usually identified by an object name or object identifier, and properties of such an object are usually defined as database objects (e.g., fields, records, etc.), object instances, or data elements (e.g., mark-up language elements/tags, etc.). Additionally or alternatively, the term “data item” as used herein may refer to data elements and/or content items, although these terms may refer to difference concepts. The term “data element” or “element” as used herein refers to a unit that is indivisible at a given level of abstraction and has a clearly defined boundary'. A data element is a logical component of an information object (e.g., electronic document) that may begin with a start tag (e.g., “<element>”) and end with amatching end tag (e.g., “</element>”), or only has an empty element tag (e g., “<element />”). Any characters between the start tag and end tag, if any, are the element’s content (referred to herein as “content items” or the like).

The content of an entity may include one or more content items, each of which has an associated datatype representation. A content item may include, for example, attribute values, character values, URIs, qualified names (qnames), parameters, and the like. A qname is a fully qualified name of an element, attribute, or identifier in an information object. A qname associates a URI of a namespace with a local name of an element, attribute, or identifier in that namespace. To make this association, the qname assigns a prefix to the local name that corresponds to its namespace. The qname comprises a URI of the namespace, the prefix, and the local name. Namespaces are used to provide uniquely 7 named elements and attributes in information objects. Content items may include text content (e.g., “<element>content item</element>”), attributes (e.g., “<element attribute- 'attributeValue">”), and other elements referred to as “child elements” (e.g., “<elementl><element2>content item</element2></elementl >”). An “attribute” may refer to a markup construct including a name-value pair that exists within a start tag or empty' element tag. Attributes contain data related to its element and/or control the element’s behavior.

The term “resource” as used herein refers to a physical or virtual device, a physical or virtual component within a computing environment, and/or a physical or virtual component within a particular device, such as computer devices, mechanical devices, memory space, processor/CPU time, processor/CPU usage, processor and accelerator loads, hardware time or usage, electrical power, input/output operations, ports or network sockets, channel/link allocation, throughput, memory usage, storage, network, database and applications, workload units, and/or the like. A “hardware resource” may refer to compute, storage, and/or network resources provided by physical hardware element(s). A “virtualized resource” may refer to compute, storage, and/or network resources provided by virtualization infrastructure to an application, device, system, etc. The term “network resource” or “communication resource” may refer to resources that are accessible by computer devices/systems via a communications network. The term “system resources” may refer to any kind of shared entities to provide services, and may include computing and/or network resources. System resources may be considered as a set of coherent functions, network data objects or services, accessible through a server where such system resources reside on a single host or multiple hosts and are clearly identifiable. The term “channel” as used herein refers to any transmission medium, either tangible or intangible, which is used to communicate data or a data stream. The term “channel” may be synonymous with and/or equivalent to “communications channel,” “data communications channel,” “transmission channel,” “data transmission channel,” “access channel,” “data access channel,” “link,” “data link,” “carrier,” “radiofrequency carrier,” and/or any other like term denoting a pathway or medium through which data is communicated. Additionally, the term “link” as used herein refers to a connection between two devices through a RAT for the purpose of transmitting and receiving information. As used herein, the term “radio technology” refers to technology for wireless transmission and/or reception of electromagnetic radiation for information transfer. The term “radio access technology” or “RAT” refers to the technology used for the underlying physical connection to a radio based communication network. As used herein, the term “communication protocol” (either wired or wireless) refers to a set of standardized rules or instructions implemented by a communication device and/or system to communicate with other devices and/or systems, including instructions for packetizing/depacketizing data, modulating/demodulating signals, implementation of protocols stacks, and/or the like.

As used herein, the term “radio technology” refers to technology for wireless transmission and/or reception of electromagnetic radiation for information transfer. The term “radio access technology” or “RAT” refers to the technology used for the underlying physical connection to a radio based communication network. As used herein, the term “communication protocol” (either wired or wireless) refers to a set of standardized rules or instructions implemented by a communication device and/or system to communicate with other devices and/or systems, including instructions for packetizing/depacketizing data, modulating/demodulating signals, implementation of protocols stacks, and/or the like. Examples of wireless communications protocols may be used in various embodiments include a Global System for Mobile Communications (GSM) radio communication technology, a General Packet Radio Service (GPRS) radio communication technology, an Enhanced Data Rates for GSM Evolution (EDGE) radio communication technology, and/or a Third Generation Partnership Project (3GPP) radio communication technology including, for example, 3GPP Fifth Generation (5G) or New Radio (NR), Universal Mobile Telecommunications System (UMTS), Freedom of Multimedia Access (FOMA), Long Term Evolution (LTE), LTE- Advanced (LTE Advanced). LTE Extra, LTE-A Pro, cdmaOne (2G). Code Division Multiple Access 2000 (CDMA 2000), Cellular Digital Packet Data (CDPD), Mobitex, Circuit Switched Data (CSD), High-Speed CSD (HSCSD), Universal Mobile Telecommunications System (UMTS), Wideband Code Division Multiple Access (W-CDM), High Speed Packet Access (HSPA), HSPA Plus (HSPA+), Time Division-Code Division Multiple Access (TD-CDMA), Time Division-Synchronous Code Division Multiple Access (TD-SCDMA), LTE LAA, MuLTEfire, UMTS Terrestrial Radio Access (UTRA), Evolved UTRA (E-UTRA), Evolution- Data Optimized or Evolution-Data Only (EV-DO), Advanced Mobile Phone System (AMPS), Digital AMPS (D-AMPS), Total Access Communication System/Extended Total Access Communication System (TACS/ETACS), Push-to-talk (PTT), Mobile Telephone System (MTS), Improved Mobile Telephone System (IMTS), Advanced Mobile Telephone System (AMTS), Cellular Digital Packet Data (CDPD), DataTAC, Integrated Digital Enhanced Network (iDEN), Personal Digital Cellular (PDC), Personal Handy-phone System (PHS), Wideband Integrated Digital Enhanced Network (WiDEN), iBurst, Unlicensed Mobile Access (UMA), also referred to as also referred to as 3GPP Generic Access Network, or GAN standard), Bluetooth®, Bluetooth Low Energy (BLE), IEEE 802.15.4 based protocols (e.g., IPv6 over Low power Wireless Personal Area Networks (6L0WPAN), WirelessHART, MiWi, Thread, 802.11a, etc.) WiFi-direct, ANT/ANT+, ZigBee, Z-Wave, 3GPP device-to-device (D2D) or Proximity Services (ProSe), Universal Plug and Play (UPnP), Low-Power Wide- Area-Network (LPWAN), Long Range Wide Area Network (LoRA) or LoRaWAN™ developed by Semtech and the LoRa Alliance, Sigfox, Wireless Gigabit Alliance (WiGig) standard, Worldwide Interoperability for Micro wave Access (WiMAX), mmWave standards in general (e.g., wireless systems operating at 10-300 GHz and above such as WiGig, IEEE 802.1 lad, IEEE 802. Hay, etc.), V2X communication technologies (including 3GPP C-V2X), Dedicated Short Range Communications (DSRC) communication systems such as Intelligent- Transport-Systems (ITS) including the European ITS-G5, ITS-G5B, ITS-G5C, etc. In addition to the standards listed above, any number of satellite uplink technologies may be used for purposes of the present disclosure including, for example, radios compliant with standards issued by the International Telecommunication Union (ITU), or the European Telecommunications Standards Institute (ETSI), among others. The examples provided herein are thus understood as being applicable to various other communication technologies, both existing and not yet formulated. The term "access network” refers to any network, using any combination of radio technologies. RATs, and/or communication protocols, used to connect user devices and service providers. In the context of WLANs, an "access network” is an IEEE 802 local area network (LAN) or metropolitan area network (MAN) between terminals and access routers connecting to provider services. The term “access router” refers to router that terminates a medium access control (MAC) service from terminals and forwards user traffic to information servers according to Internet Protocol (IP) addresses.

The term “SMTC” refers to an SSB-based measurement timing configuration configured by SSB-MeasurementTimingConfiguration. The term “SSB” refers to a synchronization signal/Physical Broadcast Channel (SS/PBCH) block, which includes a Primary Syncrhonization Signal (PSS), a Secondary Syncrhonization Signal (SSS), and a PBCH. The term “a “Primary Cell” refers to the MCG cell, operating on the primary frequency, in which the UE either performs the initial connection establishment procedure or initiates the connection re-establishment procedure. The term “Primary SCG Cell” refers to the SCG cell in which the UE performs random access when performing the Reconfiguration with Sync procedure for DC operation. The term “Secondary Cell” refers to a cell providing additional radio resources on top of a Special Cell for a UE configured with CA. The term “Secondary Cell Group” refers to the subset of serving cells comprising the PSCell and zero or more secondary cells for a UE configured with DC. The term “Serving Cell” refers to the primary cell for a UE in RRC CONNECTED not configured with CA/DC there is only one serving cell comprising of the primary cell. The term “serving cell” or “serving cells” refers to the set of cells comprising the Special Cell(s) and all secondary cells for a UE in RRC CONNECTED configured with CA. The term “Special Cell” refers to the PCell of the MCG or the PSCell of the SCG for DC operation; otherwise, the term “Special Cell” refers to the Pcell.

The term “Al policy” refers to a type of declarative policies expressed using formal statements that enable the non-RT RIC function in the SMO to guide the near-RT RIC function, and hence the RAN, tow ards better fulfilment of the RAN intent.

The term “Al Enrichment information” refers to information utilized by near-RT RIC that is collected or derived at SMO/non-RT RIC either from non-network data sources or from network functions themselves.

The term “Al -Policy Based Traffic Steering Process Mode” refers to an operational mode in which the Near-RT RIC is configured through Al Policy to use Traffic Steering Actions to ensure a more specific notion of network performance (for example, applying to smaller groups of E2 Nodes and UEs in the RAN) than that which it ensures in the Background Traffic Steering.

The term “Background Traffic Steering Processing Mode” refers to an operational mode in which the Near-RT RIC is configured through 01 to use Traffic Steering Actions to ensure a general background network performance which applies broadly across E2 Nodes and UEs in the RAN.

The term “Baseline RAN Behavior” refers to the default RAN behavior as configured at the E2 Nodes by SMO

The term “E2” refers to an interface connecting the Near-RT RIC and one or more O- CU-CPs, one or more O-CU-UPs, one or more O-DUs, and one or more O-eNBs.

The term “E2 Node” refers to a logical node terminating E2 interface. In this version of the specification, ORAN nodes terminating E2 interface are: for NR access: O-CU-CP, O- CU-UP, 0-DU or any combination; and for E-UTRA access: 0-eNB.

The term “Intents”, in the context of 0-RAN systems/implementations, refers to declarative policy to steer or guide the behavior of RAN functions, allowing the RAN function to calculate the optimal result to achieve stated objective.

The term “0-RAN non-real-time RAN Intelligent Controller” or “non-RT RIC” refers to a logical function that enables non-real-time control and optimization of RAN elements and resources, AI/ML workflow including model training and updates, and policy-based guidance of applications/features in Near-RT RIC.

The term “Near-RT RIC” or “0-RAN near-real-time RAN Intelligent Controller” refers to a logical function that enables near-real-time control and optimization of RAN elements and resources via fine-grained (e.g., UE basis, Cell basis) data collection and actions over E2 interface.

The term “0-RAN Central Unit” or “O-CU” refers to a logical node hosting RRC, SDAP and PDCP protocols.

The term “0-RAN Central Unit - Control Plane” or “O-CU-CP” refers to a logical node hosting the RRC and the control plane part of the PDCP protocol.

The term “0-RAN Central Unit - User Plane” or “O-CU-UP” refers to a logical node hosting the user plane part of the PDCP protocol and the SDAP protocol

The term “0-RAN Distributed Unit” or “0-DU” refers to a logical node hosting RLC/MAC/High-PHY layers based on a lower layer functional split.

The term “0-RAN eNB” or “0-eNB” refers to an eNB or ng-eNB that supports E2 interface. The term “O-RAN Radio Unit” or “O-RU” refers to a logical node hosting Low-PHY layer and RF processing based on a lower layer functional split. This is similar to 3GPP's “TRP” or “RRH” but more specific in including the Low-PHY layer (FFT/iFFT, PRACH extraction).

The term “01” refers to an interface between orchestration & management entities (Orchestration/NMS) and O-RAN managed elements, for operation and management, by which FCAPS management, Software management, File management and other similar functions shall be achieved.

The term “RAN UE Group” refers to an aggregations of UEs whose grouping is set in the E2 nodes through E2 procedures also based on the scope of Al policies. These groups can then be the target of E2 CONTROL or POLICY messages.

The term “Traffic Steering Action” refers to the use of a mechanism to alter RAN behavior. Such actions include E2 procedures such as CONTROL and POLICY.

The term “Traffic Steering Inner Loop” refers to the part of the Traffic Steering processing, triggered by the arrival of periodic TS related KPM (Key Performance Measurement) from E2 Node, which includes UE grouping, setting additional data collection from the RAN, as well as selection and execution of one or more optimization actions to enforce Traffic Steering policies.

The term “Traffic Steering Outer Loop” refers to the part of the Traffic Steering processing, triggered by the near-RT RIC setting up or updating Traffic Steering aware resource optimization procedure based on information from Al Policy setup or update, Al Enrichment Information (El) and/or outcome of Near-RT RIC evaluation, which includes the initial configuration (preconditions) and injection of related Al policies, Triggering conditions for TS changes.

The term “Traffic Steering Processing Mode” refers to an operational mode in which either the RAN or the Near-RT RIC is configured to ensure a particular network performance. This performance includes such aspects as cell load and throughput, and can apply differently to different E2 nodes and UEs. Throughout this process, Traffic Steering Actions are used to fulfill the requirements of this configuration.

The term “Traffic Steering Target” refers to the intended performance result that is desired from the network, which is configured to Near-RT RIC over 01.

Furthermore, any of the disclosed embodiments and example implementations can be embodied in the form of various types of hardware, software, firmware, middleware, or combinations thereof, including in the form of control logic, and using such hardware or software in a modular or integrated manner. Additionally, any of the software components or functions described herein can be implemented as software, program code, script, instructions, etc., operable to be executed by processor circuitry. These components, fun chons, programs, etc., can be developed using any suitable computer language such as, for example. Python, PyTorch, NurnPy, Ruby, Ruby on Rails, Scala, Smalltalk, Java™, C++, C#, “C”, Kotlin, Swift, Rust, Go (or “Golang”), EMCAScript, JavaScript, TypeScript. Jscript, ActionScript, Server- Side JavaScript (SSJS), PHP, Pearl, Lua, Torch/Lua with Just-In Time compiler (LuaJIT), Accelerated Mobile Pages Script (AMPscript), VBScript, JavaServer Pages (JSP), Active Server Pages (ASP), Node.js, ASP.NET, JAMscript, Hypertext Markup Language (HTML), extensible HTML (XHTML), Extensible Markup Language (XML). XML User Interface Language (XUL). Scalable Vector Graphics (SVG). RESTful API Modeling Language (RAML), wiki markup or Wikitext, Wireless Markup Language (WML), Java Script Object Notion (JSON), Apache® MessagePack™, Cascading Stylesheets (CSS), extensible sty lesheet language (XSL), Mustache template language, Handlebars template language, Guide Template Language (GTL), Apache® Thrift, Abstract Syntax Notation One (ASN. 1). Google® Protocol Buffers (protobuf), Bitcoin Script, EVM® bytecode. Solidity™, Vyper (Python derived). Bamboo, Lisp Like Language (LLL), Simplicity’ provided by Blockstream™, Rholang, Michelson, Counterfactual, Plasma, Plutus, Sophia, Salesforce® Apex®, and/or any other programming language or development tools including proprietary programming languages and/or development tools. The software code can be stored as a computer- or processorexecutable instructions or commands on a physical non-transitory computer-readable medium. Examples of suitable media include RAM, ROM, magnetic media such as a hard-drive or a floppy disk, or an optical medium such as a compact disk (CD) or DVD (digital versatile disk), flash memory, and the like, or any combination of such storage or transmission devices.

ABBREVIATIONS

Unless used differently herein, terms, definitions, and abbreviations may be consistent with terms, definitions, and abbreviations defined in 3GPP TR 21.905 V16.0.0 (2019-06). For the purposes of the present document, the following abbreviations may apply to the examples and embodiments discussed herein.

Table 1 Abbreviations:

The foregoing description provides illustration and description of various example embodiments, but is not intended to be exhaustive or to limit the scope of embodiments to the precise forms disclosed. Modifications and variations are possible in light of the above teachings or may be acquired from practice of various embodiments. Where specific details are set forth in order to describe example embodiments of the disclosure, it should be apparent to one skilled in the art that the disclosure can be practiced without, or with variation of, these specific details. It should be understood, however, that there is no intent to limit the concepts of the present disclosure to the particular forms disclosed, but on the contrary, the intention is to cover all modifications, equivalents, and alternatives consistent with the present disclosure and the appended claims.