Towards addressing training data scarcity challenge in emerging radio access networks: a survey and framework

Qureshi, H. N. et al. (2023) Towards addressing training data scarcity challenge in emerging radio access networks: a survey and framework. IEEE Communications Surveys and Tutorials, 25(3), pp. 1954-1990. (doi: 10.1109/COMST.2023.3271419)

[img] Text
305062.pdf - Published Version
Available under License Creative Commons Attribution.

16MB

Abstract

The future of cellular networks is contingent on artificial intelligence (AI) based automation, particularly for radio access network (RAN) operation, optimization, and troubleshooting. To achieve such zero-touch automation, a myriad of AI-based solutions are being proposed in literature to leverage AI for modeling and optimizing network behavior to achieve the zero-touch automation goal. However, to work reliably, AI based automation, requires a deluge of training data. Consequently, the success of the proposed AI solutions is limited by a fundamental challenge faced by cellular network research community: scarcity of the training data. In this paper, we present an extensive review of classic and emerging techniques to address this challenge. We first identify the common data types in RAN and their known use-cases. We then present a taxonomized survey of techniques used in literature to address training data scarcity for various data types. This is followed by a framework to address the training data scarcity. The proposed framework builds on available information and combination of techniques including interpolation, domain-knowledge based, generative adversarial neural networks, transfer learning, autoencoders, fewshot learning, simulators and testbeds. Potential new techniques to enrich scarce data in cellular networks are also proposed, such as by matrix completion theory, and domain knowledge-based techniques leveraging different types of network geometries and network parameters. In addition, an overview of state-of-the art simulators and testbeds is also presented to make readers aware of current and emerging platforms to access real data in order to overcome the data scarcity challenge. The extensive survey of training data scarcity addressing techniques combined with proposed framework to select a suitable technique for given type of data, can assist researchers and network operators in choosing the appropriate methods to overcome the data scarcity challenge in leveraging AI to radio access network automation.

Item Type:Articles
Additional Information:This work was supported in part by the National Science Foundation under Grant 1923669, 1730650, the Qatar National Research Fund (QNRF) under Grant NPRP12-S 0311-190302 and in part by an unrestricted award from Ericsson Research, CA, USA.
Status:Published
Refereed:Yes
Glasgow Author(s) Enlighten ID:Imran, Professor Ali
Authors: Qureshi, H. N., Masood, U., Manalastas, M., Zaidi, S. M. A., Farooq, H., Forgeat, J., Bouton, M., Bothe, S., Karlsson, P., Rizwan, A., and Imran, A.
College/School:College of Science and Engineering > School of Engineering > Autonomous Systems and Connectivity
Journal Name:IEEE Communications Surveys and Tutorials
Publisher:IEEE
ISSN:1553-877X
ISSN (Online):1553-877X
Copyright Holders:Copyright © The Author(s) 2023
First Published:First published in IEEE Communications Surveys and Tutorials 25(3):1954 - 1990
Publisher Policy:Reproduced under a Creative Commons license

University Staff: Request a correction | Enlighten Editors: Update this record