Abstract
Non-orthogonal Multiple Access (NOMA) techniques offer potential enhancements in spectral efficiency for 5G and 6G wireless networks, facilitating broader network access. Central to realizing optimal system performance are factors like joint power control, user grouping, and decoding order. This study investigates power control and user grouping to optimize spectral efficiency in NOMA uplink systems, aiming to reduce computational difficulty. While previous research on this integrated optimization has identified several near-optimal solutions, they often come with considerable system and computational overheads. To address this, this study employed an improved Grey Wolf Optimizer (GWO), a nature-inspired metaheuristic optimization method. Although GWO is effective, it can sometimes converge prematurely and might lack diversity. To enhance its performance, this study introduces a new version of GWO, integrating Competitive Learning, Q-learning, and Greedy Selection. Competitive learning adopts agent competition, balancing exploration and exploitation and preserving diversity. Q-learning guides the search based on past experiences, enhancing adaptability and preventing redundant exploration of sub-optimal regions. Greedy selection ensures the retention of the best solutions after each iteration. The synergistic integration of these three components substantially enhances the performance of the standard GWO. This algorithm was used to manage power and user-grouping in NOMA systems, aiming to strengthen system performance while restricting computational demands. The effectiveness of the proposed algorithm was validated through numerical evaluations. Simulated outcomes revealed that when applied to the joint challenge in NOMA uplink systems, it surpasses the spectral efficiency of conventional orthogonal multiple access. Moreover, the proposed approach demonstrated superior performance compared to the standard GWO and other state-of-the-art algorithms, achieving reduced system complexity under identical constraints.
Keywords: Competitive learning, Grey Wolf Optimizer, Non-orthogonal multiple access (NOMA), Q-learning, Spectral efficiency, User-grouping
Subject terms: Engineering, Mathematics and computing
Introduction
The evolution of wireless communication technologies has been nothing short of transformative. From the early days of 1G to the current 5G era, each generation has brought forth significant advancements, shaping the way we communicate, work, and live. As we transition into the 5G era, the promise of ultra-reliable, low-latency communication, massive device connectivity, and enhanced mobile broadband beckons1. However, with these advancements come new challenges, especially in the realm of uplink communication. As we stand on the cusp of a fully interconnected world, the importance of optimizing uplink communication in 5G networks cannot be overstated2. Uplink communication refers to the transmission of data from a user device to a network and plays a pivotal role in ensuring a seamless user experience, especially in applications that require real-time data transmission. Uplink communication, the process through which data is sent from user devices to the network, is a critical component of any wireless system. In the dense, high-demand environment of 5G networks, ensuring efficient and reliable uplink communication becomes paramount. This necessitates innovative approaches to manage the myriad of devices and the vast amounts of data they transmit3.
Power regulation is one of the primary challenges in uplink communication. It is essential to make sure every device transmits at the best possible power level. While too little electricity can lead to lost or delayed data, too much power can interfere with other devices. Additionally, when the number of 5G devices increases as expected, individual management deteriorates. This introduces the idea of user grouping, a device clustering technique for more effective and economical resource distribution4. However, the integration of power control and user grouping, while essential, is not straightforward. It requires sophisticated optimization techniques to achieve the desired balance of efficiency and performance. The realm of 5G uplink communication presents a multifaceted environment with dynamic variables, making it a challenging domain to optimize5. Due to the following reasons, sophisticated optimization techniques are essential for this application. (i) Unlike static systems, 5G networks are characterized by their dynamic nature. Factors such as user mobility, fluctuating data traffic, and varying environmental conditions mean that the system's parameters are constantly changing. Simple optimization techniques that work for static or slowly changing systems may not be suitable for such a dynamic environment; (ii) The problem space in 5G uplink communication is massive. With numerous devices, each with its own set of parameters like power levels, transmission rates, and channel conditions, the optimization problem becomes high-dimensional. Sophisticated techniques are required to navigate this vast problem space efficiently; (iii) Many applications in 5G demand real-time or near-real-time responses. This necessitates optimization techniques that can provide solutions quickly, even in complex scenarios; and (iv) Achieving efficiency and performance often involves trade-offs6,7. For instance, maximizing data throughput might conflict with minimizing power consumption. Sophisticated optimization techniques can help in finding a balance between such conflicting objectives8.
Several optimization algorithms have been applied to the challenges of 5G uplink communication. For instance, a Genetic Algorithm (GA), inspired by the process of natural selection, is used for searching for optimal solutions by evolving a population of individual solutions9, Particle Swarm Optimization (PSO) is stimulated by the social conduct of birds flocking, PSO adjusts the trajectory of each individual solution based on its own experience and the experience of its neighbours10, Grey Wolf Optimizer (GWO) mimics the leadership hierarchy and hunting behaviour of grey wolves11. It has been applied to various optimization problems, including those in wireless communication and an area of machine learning where algorithms learn by interacting with an environment and receiving feedback in the form of rewards or penalties. Apart from the algorithms mentioned above, there are several other algorithms, namely artificial bee colony12, ant colony optimization13, differential evolution algorithm14, cuckoo search algorithm15, gravitational search algorithm16, firefly algorithm17, salp swarm optimizer18, whale optimizer19, marine predator algorithm20, equilibrium optimizer21, Slime Mould Algorithm (SMA)22, gradient-based optimizer23, Golden Jackal Algorithm (GJA)24, mountain gazelle optimizer25, thermal exchange optimizer26, red deer algorithm27, Aquila optimizer28, reptile search algorithm29, arithmetic optimization algorithm30, etc. are available in various literature. Many metaheuristics algorithms can sometimes converge to a solution too quickly before exploring the entire solution space, leading to sub-optimal solutions. Some algorithms tend to get stuck in local optima, meaning they find solutions that are optimal for a nearby region but not necessarily for the entire problem space31,32. Algorithms, especially deep reinforcement learning ones, can be computationally intensive, making them unsuitable for real-time applications. Many algorithms have parameters that need to be fine-tuned. The performance of the algorithm can be highly sensitive to these parameters, making it challenging to find the right set for a given problem. Some algorithms work well for small-scale problems but struggle to scale up to the size and complexity of real-world 5G networks.
Given these challenges and limitations, there is a pressing need for enhanced or new optimization techniques that can address the unique challenges posed by 5G uplink communication. In this context, drawing inspiration from the social hierarchy and hunting behaviour of grey wolves, GWO is a nature-inspired optimization method known for its efficacy in complex problem-solving11. The proposed Multi-Learning GWO (MLGWO) introduces several novel components specifically designed to address the limitations of the standard GWO, namely premature convergence and lack of diversity. These enhancements are achieved through the integration of Competitive Learning, Q-learning, and Greedy Selection mechanisms, each contributing uniquely to the overall performance of the optimizer. Competitive Learning is incorporated to balance exploration and exploitation during the optimization process. In this approach, multiple agents (grey wolves) compete to find the best solutions. This competition fosters a more diverse search process by preventing the optimizer from settling into local optima prematurely. The agents continuously adapt and improve their search strategies based on their performance relative to others, thereby maintaining a higher level of diversity in the population. Q-learning, a reinforcement learning technique, is utilized to guide the search process based on past experiences. By learning from previous iterations, the optimizer can better navigate the search space, avoiding redundant exploration of sub-optimal regions. This learning mechanism enhances the adaptability of the GWO, allowing it to adjust its search strategy dynamically and improve its chances of finding global optima. The Q-learning component helps the optimizer to remember and leverage beneficial moves while discouraging those that lead to premature convergence. The greedy selection mechanism ensures that the best solutions are retained after each iteration. This approach guarantees that the optimizer consistently builds upon the highest-quality solutions discovered, thus improving convergence speed without sacrificing solution quality. By focusing on retaining superior solutions, Greedy selection minimizes the risk of the optimizer getting stuck in sub-optimal regions, further mitigating premature convergence. This paper delves deep into the synergistic integration of power control, user grouping, and the enhanced GWO algorithm. Through rigorous research and testing, the study aims to present a comprehensive solution to the challenges of 5G uplink communication, setting the stage for a more connected and efficient future. The main contributions of this study are as follows.
Introduction of MLGWO that integrates a multi-learning strategy which enhances the balance between exploration and exploitation, ensuring a more thorough and efficient search of the solution space.
Model the objective function and constraints specifically tailored for 5G uplink communications. This involves accurately capturing the unique requirements and challenges of power allocation and user grouping in NOMA systems, enabling more effective optimization.
Investigate user pairing and grouping in uplink NOMA transmission by examining specific predetermined power control approaches combined with near-perfect successive interference cancellation to provide insights into optimizing user allocation strategies to enhance spectral efficiency and overall system performance.
Conduction of a detailed performance comparison of the proposed MLGWO with other state-of-the-art algorithms. Despite potentially higher complexity per iteration, the proposed approach demonstrates faster convergence and superior performance, as evidenced by the simulation results.
The paper is structured as follows. “Related works” section covers the related work and literature survey. “System modelling” section discusses in detail the system model and the objective function formulation. “Proposed algorithm” section discusses the formulation of the proposed mult-learning GWO algorithm along with the basic concepts of the GWO algorithm. “Results and discussions” section comprehensively discusses the obtained results, and “Conclusions” section concludes the paper.
Related works
When examining spectral-power efficiency, NOMA methods have been confirmed to be more effective than Orthogonal Multiple Access (OMA) methods, particularly in fluctuating signal conditions. While OMA methods are simpler to implement, they are hindered by reduced spectral efficiency and scarce radio resources, making them less suitable for the extensive connectivity anticipated in upcoming network systems. The edge of NOMA over OMA arises from its ability to allow multiple users to share the same frequency channel within a single cell33,34. Among the various NOMA strategies, power-domain NOMA stands out, where users share the power domain. In such networks, Superpose Coding (SC) facilitates the blending of signals from different users on the same sub-carrier but with varied power intensities at the transmission end. Given that NOMA permits multiple users to utilize identical time, frequency, and code resources, it is inevitable that users encounter co-channel disruptions35,36. This necessitates intricate sub-band allocation methods and scheduling. Despite its advantages, NOMA systems demand significant computational resources, especially for Successive Interference Cancellation (SIC) and power distribution algorithms on the receiving end. The computational demands of SIC become even more pronounced when there is a need for high-speed data transfer and minimal delay37,38.
Recent research has explored the potential of integrating NOMA into standard protocols for IoT infrastructures and vehicle-to-everything communications. One of the hallmarks of NOMA systems is the use of the SIC method at the receiver side to enable the detection and decoding of multiple users. The application of the SIC method varies depending on whether it's for a downlink or an uplink, each having its unique operational sequence. In scenarios where multiple users are part of the NOMA cluster, there's a significant demand for computational resources39. As a result, SIC is typically applied in downlink NOMA scenarios. Conventionally, downlink NOMA configurations consist of pairs of users. Given the superior processing and computational abilities of the Base Station (BS), it is deemed feasible to employ SIC there for uplink NOMA systems. A notable benefit of uplink NOMA is its capacity to support concurrent transmissions from multiple users. However, to optimize the performance of NOMA systems while managing the computational demands of the SIC method, it is essential to develop innovative user pairing/grouping and power management tactics. The efficacy of both uplink and downlink NOMA configurations is closely tied to the chosen user pairing/grouping and power regulation methods40,41.
Cognitive radio technology enhances spectrum efficiency by allowing unlicensed users to access underutilized licensed spectrum bands opportunistically. Cognitive radios employ dynamic spectrum access techniques to identify and utilize spectrum holes, thereby improving overall spectrum utilization. However, NOMA offers several advantages over cognitive radio. NOMA can simultaneously serve multiple users in the same frequency band by exploiting power domain multiplexing, resulting in higher spectral efficiency without the need for extensive spectrum sensing and dynamic access protocols required in cognitive radio networks. Furthermore, NOMA's inherent capability to handle heterogeneous user demands and its compatibility with existing cellular infrastructure make it a more versatile and practical solution for enhancing spectral efficiency in modern wireless networks42. Integrated Sensing and Communication (ISAC) aim to integrate communication and sensing functionalities within the same wireless framework, thereby improving spectrum efficiency and enabling new applications such as vehicular networks and IoT. ISAC systems leverage joint resource allocation and cooperative strategies to optimize the dual functions of sensing and communication. While ISAC technologies present promising advancements, NOMA provides distinct advantages in terms of simultaneous user support and power domain multiplexing. NOMA's ability to dynamically allocate power levels to users based on their channel conditions and quality of service requirements allows for more flexible and efficient spectrum utilization. Additionally, NOMA's robustness in managing interference and its scalability to support massive connectivity make it a compelling choice for future wireless networks43,44. UAV-enabled integrated systems combine sensing, computing, and communication capabilities to enhance spectrum efficiency and enable advanced applications such as IoT and 6G communications. These systems involve joint resource allocation and trajectory design to optimize the overall performance. However, NOMA offers unique advantages by supporting multiple users simultaneously through power domain multiplexing, which can be further enhanced when integrated with UAV systems. NOMA's flexibility in power allocation and user grouping complements the dynamic nature of UAV deployments, leading to improved spectral efficiency and reduced latency45,46.
Numerous studies have delved into crafting effective user pairing/grouping strategies in NOMA systems with goals such as optimizing the overall rate, reducing transmission power, or ensuring equitable distribution. The researchers in one study proposed an innovative framework for devising suitable user pairing/grouping techniques to utilize the same resource blocks. Meanwhile, another study showcased a dual-user grouping approach rooted in distinct channel gains, with both types of research aiming to optimize the overall rate47. NOMA approaches surpass OMA systems in terms of spectral efficiency. Yet, to handle user signals as distinct units, a power control strategy at the transmission end is imperative. Additionally, various power distribution techniques are introduced for the standard single-cell uplink NOMA transmission framework. Power allocation for a single cell is noticeably simpler than that in a multi-cell scenario due to the interference between the channels. A viable solution to mitigate interference in multi-cell contexts is the adoption of a power-distributed downlink allocation strategy38. The authors of48 investigate the facilitation of Ultra-Reliable Low-Latency Communication (URLLC) systems through unmanned aerial vehicles (UAV)-enabled relaying for multi-access edge computing (MEC) systems in 6G networks. This study provides a forward-looking perspective on the integration of UAVs and MEC systems, emphasizing the role of URLLC in future networks. Their findings on resource allocation and relay positioning are particularly relevant to our research, as they address similar challenges in optimizing network performance and efficiency.
Research on resource allocation for NOMA is detailed in49, with a predominant emphasis on optimizing the combined rate while adhering to total power and proportional rate limitations. Additionally, multi-cell NOMA is explored in50. The authors of51 have suggested a solution nearing optimality for power distribution with the assumption of flawless Channel State Information (CSI) at the base station. Conversely, the authors of50 introduced a proficient power allocation method tailored for varying Quality-of-Service (QoS) needs, especially when CSI is not perfect. The primary objective of these studies was to curtail the overall transmission power. Moreover, the combined efforts of power distribution and subcarrier assignment for NOMA are discussed in52–54. The authors of53 focused on investigating the best power allocation to boost the weighted combined rate, given QoS restrictions. Specifically, the authors of52 showcased a near-best combined power and subcarrier distribution aimed at enhancing the weighted system output. Meanwhile, the authors of54 provided theoretical perspectives and a strategy for maximizing the combined rate. Yet, these methodologies primarily target either system output enhancement or total rate optimization, often side-lining user fairness, which is a pivotal aspect in shaping NOMA networks. In another relevant study, the authors of55 address the quasi-optimization of resource allocation and positioning for solar-powered UAVs. This research highlights the importance of resource allocation and efficient positioning strategies for enhancing the performance of UAV networks. The methodologies and optimization techniques discussed in this paper complement our approach to power control and user grouping, offering additional insights into resource management in next-generation networks.
In a study referenced in56, the authors explored the best power allocation for three unique uplink NOMA setups using the Hungarian method, adjusting the cost function related to user pairing techniques. They also developed various user pairing/grouping strategies based on matching games. Recent research delved into user-pairing and grouping methods for different NOMA systems. The authors of33 highlighted the significance of user grouping and power control for attaining high sum-rate capability, providing a lookup table derived from comprehensive calculations. The authors of57 introduced a resource allocation method for uplink scenarios based on the Cumulative Distributive Function (CDF), selecting the top two users based on the CDF's highest value. Meanwhile, research in58 presented a broad strategy for user categorization in overlapping NOMA networks, emphasizing machine learning to determine optimal user grouping. This research also tackled user grouping, affiliation, and power allocation, aiming to enhance the uplink network's capacity and overall system efficiency. The authors of59 explore power-efficient URLLC in UAV networks under jittering conditions. This study highlights the significance of power efficiency and reliable communication, which are critical factors for the effective deployment of UAV networks. Their work aligns with the focus on optimizing spectral efficiency and resource allocation in NOMA systems, providing a valuable perspective on power management in wireless communication networks.
In an early exploration of NOMA, uplink transmissions were examined in60, utilizing a power-control technique at the transmission end and employing a least mean squared error-based SIC process at the receiving end. The study in34 delved into the combined challenges of sub-carrier allocation and power distribution, ultimately deriving a near-optimal method to boost the spectral efficiency of NOMA participants. Diverging from prior research, the authors of61 introduced a resilient user-scheduling technique for NOMA uplink transmissions, capitalizing on the unique channel gains of network users and employing single-carrier frequency division multiple access to facilitate effective user grouping. For a two-user NOMA uplink transmission, the authors of62 formulated closed-loop solutions for sum outage and throughput probability under fixed power conditions. Additionally, if target sum rates are not properly chosen for NOMA downlink broadcasts, a comparison with time division multiple access-based OMA showed that a NOMA user could regularly experience an outage.
Metaheuristic algorithms are versatile optimization tools rooted in the principle of natural selection, making them adept at addressing diverse engineering challenges. They utilize a specific parameter to denote the intricacy of power control problems, formulating the function as a network's sum rate utility function57,63,64. As a result, all systems uniformly apply these power control parameters. Research has delved into NOMA-based mobile edge computing systems to enhance energy efficiency through task offloading. A coalition-based strategy has been employed to tackle power equilibrium and resource distribution33. Various meta-heuristic techniques were introduced to handle the localization challenges in wireless sensor networks, including the bat65, whale optimization66, firework67, and cuckoo search algorithms68. Additionally, a routing method leveraging a hybrid optimization algorithm has been developed for wireless sensor networks. Recently, swarm intelligence algorithms have gained traction as effective optimization tools in wireless communication69,70, standing alongside game theory71 and convex optimization72. Utilizing swarm intelligence algorithms can potentially resolve wireless network challenges, including power management, spectrum distribution, and security concerns. For optimal sum rate in NOMA downlink systems, GWO and PSO have been the focus. Furthermore, the research introduces a discrete version of the distributed GWO73 for the assignment of interdependent tasks to virtual machines. Within the framework, scheduling is perceived as a challenge with the primary objective of minimizing data movement and processing expenses. Recent advancements in global optimization have spotlighted the improved Grey Wolf Optimizer, addressing the limitations inherent in the original GWO algorithm74. The island-based cuckoo search represents a parallel adaptation of the traditional Cuckoo Search (CS) algorithm, incorporating the notably disruptive island CS polynomial mutation75. An algorithm with opposition-based learning was introduced for task scheduling in cloud computing environments, primarily focusing on data transfer costs and computational aspects76. Moreover, numerous iterative algorithms have been introduced to address energy efficiency optimization in NOMA networks, such as within NOMA HetNets77, a single-cell NOMA system78, and massive MIMO networks79. While the iterative method has been adapted to diverse contexts, the network configuration examined in this study is distinct, rendering the current solutions unsuitable. Specifically, if rigorous fairness guidelines are enforced to ensure equity among all users, the strategies presented in77–79 become inapplicable.
Th authors of80 propose a joint optimization framework for user grouping and power allocation in NOMA downlink systems to maximize spectral efficiency. By developing an efficient algorithm to solve this complex problem, the study demonstrates significant improvements in system performance through comprehensive simulations, addressing critical challenges in NOMA optimization. The authors of81 introduce a hybrid heuristic-based approach, combining genetic algorithm and particle swarm optimization, to optimize network parameters in NOMA systems, focusing on enhancing spectral and energy efficiency while ensuring QoS maximization. The integration of these heuristic algorithms effectively balances these objectives, with simulations showcasing superior performance compared to traditional methods. The study presented in82 proposed an improved Bat Algorithm to optimize throughput in NOMA cognitive relay networks incorporating RF energy harvesting. The novel system model and tailored optimization algorithm result in significant enhancements in throughput and energy efficiency, validated through detailed simulation results. The authors of83 propose the nonlinear marine predator algorithm for fair power allocation in NOMA Visible Light Communication (VLC) systems for B5G networks. By applying this bio-inspired algorithm, the study demonstrates improved performance and fairness in power allocation, offering a cost-effective solution for emerging VLC-based networks. The study presented in84 addresses resource allocation in downlink multi-channel NOMA systems, considering complex constraints such as interference and user mobility. The proposed efficient allocation scheme enhances system performance and robustness, as evidenced by comprehensive simulation results, making it relevant for practical deployment in NOMA networks. The study presented in85 explores the application of various meta-heuristic algorithms, including particle swarm optimization, differential evolution, satin bowerbird optimizer, symbiotic organism search, and artificial bee colony, for optimizing power allocation in indoor MIMO-NOMA VLC systems. By comparing the performance of these algorithms, the authors identify effective strategies for improving system capacity and energy efficiency, contributing to advancements in indoor communication technologies. The authors of86 develop a low-complexity heuristic algorithm to optimize power allocation and access mode selection in machine-to-machine networks. The algorithm balances performance with computational efficiency, showing enhanced network efficiency and reduced complexity through simulation results, which is crucial for IoT applications. The study presented in87 examines the optimization of resource allocation in NOMA-based mobile edge computing surveillance networks using genetic algorithm and particle swarm optimization. The dual optimization approach significantly enhances system performance, validated through simulations, and highlights the practical relevance for surveillance applications. The study presented in88 investigates power allocation in indoor NOMA VLC systems using meta-heuristic optimization algorithms, including genetic algorithm, particle swarm optimization, and differential evolution. By applying and comparing multiple algorithms, the research demonstrates substantial improvements in spectral efficiency and system performance, advancing the field of indoor VLC technologies. The authors of88 explore the use of various metaheuristic algorithms, such as genetic algorithm, particle swarm optimization, and ant colony optimization, for optimizing resource allocation in NOMA networks. Through a comparative analysis, the study identifies the most effective strategies, demonstrating significant performance enhancements in simulations, thus providing valuable insights for effective NOMA optimization.
System modelling
This section details the basics of the uplink NOMA transmission system, the modelling of the NOMA uplink transmission system, and the objective function formulation. In addition, this section also discusses the solution to the selected problem.
Uplink NOMA transmission
This segment delves into the idea of a two-user uplink NOMA transmission involving a singular BS. As depicted in Fig. 1, the uplink NOMA transmission's power-domain multiplexing is executed distinctively compared to the downlink NOMA transmission. This is because the BS utilizes superposition coding to achieve power province multiplexing in the downlink. In contrast, during the uplink, the broadcasting power of users is primarily constrained by their battery life. As a result, users can transmit at their maximum amount of power. The concurrent transmission of both users is represented by and to the appropriate base station33,62. The signals obtained from both users at the BS can be expressed as follows:
1 |
where signifies the Gaussian noise, transmission power for with , and the signal of the users . The signal is closer to the BS and is considered more robust, benefiting from a higher channel gain. In contrast, signal is weaker and situated further from the BS, resulting in a reduced channel gain. In uplink NOMA transmission, the SIC is utilized to decode a user's signal at the base station. As a result, the BS decodes signal first, treating signal as interference.
Fig. 1.
Illustration of uplink NOMA transmission for two users.
To decode signal , the already decoded is subtracted, and then is decoded. In this uplink scenario, faces interference from . However, transmission of is free from interference since is decoded before . In the context of downlink NOMA transmission, the decoding process differs. The encounters interference from , while transmission of is interference-free because is decoded before . For the optimal functioning of SIC at the base station, the achievable data rates are denoted as , with being either 1 or 2, for a specified bandwidth of 1 Hz62.
2 |
3 |
For an efficient uplink NOMA system, users must have sufficiently distinct channel gains. If the channel gains of both users are too similar, the BS might struggle to distinguish their signals in the power domain, necessitating a robust power control strategy. This means users should transmit with varying power levels. Another crucial point from the earlier discussion is the reversed order of SIC in the uplink NOMA. In this setup, the signal of the user closer to the BS is decoded first, while in downlink NOMA, the past user's signal takes preference. Moreover, the BS typically has the capacity for enhanced optimization in terms of power and energy. As a result, uplink NOMA transmission can support more users than its downlink counterpart. The base station can also employ advanced decoding algorithms to reduce interference effects on users. Thus, for extensive machine-type communication, uplink NOMA is often more advantageous than downlink NOMA.
Uplink NOMA transmission model
Consider an uplink transmission based on NOMA within a single cell, denoted as , with being equal to 1. Within this cell, the BS is centrally located and facilitates communication with users, as illustrated in Fig. 2. It is assumed that both the base station and all users are equipped with one antenna. Additionally, the base station is fully informed of the CSI and caters to the users inside its service range. The total number of Physical Resource Blocks (PRBs), whether in terms of frequency, time, or code, is represented by , with being less than . Therefore, the distribution of is organized within , ensuring each in the network meets the data rate condition. In the NOMA framework, users within the same PRB/group can send their PRB. In contrast, those in separate groups can transmit different PRBs in OMA62. Consequently, the BS can receive various signals, denoted as , which can be mathematically expressed as follows:
4 |
5 |
6 |
Fig. 2.
Illustration of the uplink NOMA transmission.
The overall transmission power and power control coefficient for all in the network are represented by and , respectively. The signal emanating from is denoted as , with an expected value of being 1. The channel response is Gaussian, characterizing the distribution between and the BS, and is symbolized by . Meanwhile, the white Gaussian noise, having mean power, is represented by and within the th group33,34,62. The allocation of to the th group is symbolized by and it is represented as follows.
7 |
8 |
For the decoding process, the BS utilizes SIC on every PRB to retrieve the intended information from users within a similar group. For user , the sequence of decoding within the th group or PRB is denoted by . Assuming indicates that for every , the decoding is the ath one in the th group. Consequently, the spectral efficiency for each can be mathematically articulated as follows.
9 |
In this context, the ratio of transmission power to noise is symbolized by , where equals . Consider two users, and , within a cell regarding their decoding sequence. The decoding of user precedes that of user in a group, depicted as . It is presumed that the base station possesses superior processing capabilities and is completely informed, meaning it has complete CSI to decode signals or information efficiently. Furthermore, to handle user grouping and power control, every user in the network should transmit values and 62. Thus, for each user that is part of the th group, the spectral efficiency can be mathematically expressed as follows.
10 |
Equation (10) demonstrates that while each group is unaffected by the decoding order, individual user spectral performance inside a network is.
Problem formulation
This study considers a comprehensive approach to address several interconnected challenges. Specifically, this study integrated the formulation of the power control issue with the task of grouping users and determining the sequence in which their signals are decoded. This integration was done while ensuring that all operations met the minimum requirements set for each component. To investigate deeper into the details: (i) Power control pertains to the management and allocation of power resources to ensure efficient communication. Proper power control is crucial to prevent interference, save energy, and maintain the quality of service; (ii) User grouping involves categorizing users into specific groups based on certain criteria. Effective user grouping can optimize resource utilization and enhance overall system performance; and (iii) In systems like NOMA, the sequence in which user signals are decoded can impact the overall efficiency and quality of the received signals. Therefore, determining the optimal order is crucial. All these components were addressed while ensuring that they adhered to their respective minimum requirements. For instance, every user, represented as in the network, has a baseline rate requirement for their data transmission. This rate is symbolized by . Given these details, the objective is to enhance the spectral performance of the system. From the references34,89, this study formulated a problem statement that seeks to maximize this spectral performance.
11 |
Subjected to:
12 |
The component signifies the sequence in which the signals from different users are decoded at the base station. The order can influence the efficiency and clarity of the received signals. The variable encompasses all potential sequences or combinations in which users' signals might be decoded. This means that for a set of users, there are multiple ways their signals can be processed, and 'h' captures all these possibilities. The constraint pertains to the power with which signals are transmitted. It takes into account the maximum power limit or the upper bound for every user, denoted as , within the network. Ensuring that transmissions stay within this power limit is crucial to prevent interference, conserve energy, and maintain signal quality. The constraint embodies the minimum data transmission rate that should be attainable for the system to function effectively. This rate is a benchmark to ensure that data is transmitted and received at a speed that meets the system's operational requirements. The constraints and play a role in user allocation. They act as markers or indicators to confirm that a user, represented as , is correctly assigned to a specific group within the network. Proper user allocation is essential for efficient resource utilization and to ensure that each user's requirements are met within the system's framework. At the core, these components and constraints provide a structured framework to understand and optimize the system's operations, ensuring that signals are decoded efficiently, power is used carefully, data transmission rates are maintained, and users are appropriately grouped.
Maximization of spectral efficiency for the uplink NOMA transmission systems
The problem dimensions, such as , and are intricately interconnected, which complicates the search for an optimal solution to the problem outlined in Eq. (11). Adding to the complexity, the variables associated with user grouping, are of a combinatorial-integer nature. To address this, one should first tackle the combinatorial aspect of the power/control problem and subsequently determine the decoding sequence of the user. Once these are set, an effective strategy for user-grouping can be implemented62. When a fixed user-grouping scheme is in place, the value of remains independent across all unique groups, irrespective of the decoding order and power control considerations.
13 |
14 |
where denotes the possible combinations of the th PRB.
Method of decoding order
To execute the SIC process, the receiver decodes all signals of all users in a sequence determined by their channel conditions, starting from the strongest to the weakest. For instance, in the context of NOMA uplink, users with superior channel conditions are decoded at the base station earlier than others. Conversely, those with subpar channel conditions are decoded later. As a consequence, the durable user encounters intrusion from other network users, while the weak user experiences no interference. For the most efficient decoding sequence, as mentioned in89, the users, denoted as , within a similar PRB or group are organized based on . The sequence in which each user's signal is decoded is influenced by the power control strategy, leading to various potential feasible areas, which can be described as follows:
15 |
16 |
From now on, Eqs. (15), (16) denotes that any user is decoded first as per the largest .
Uplink power control
To handle the challenge of power control, this study turns to optimization techniques, specifically leveraging the principles of linear programming. Linear programming is a mathematical method used to find the best possible outcome in a given mathematical model for some list of requirements represented as linear relationships. In this study, the linear programming approach allows us to systematically adjust power levels to achieve optimal performance while adhering to certain constraints. The power control optimization is crucial to ensure efficient communication, minimize interference, and optimize energy consumption. By framing this as a linear programming problem, it is possible to determine the optimal power levels for each user or device in the network, ensuring that the overall system operates at its peak efficiency62. The mathematical representation of this power control optimization problem is detailed as follows:
17 |
Subjected to:
18 |
In this context, represents the power control coefficient, with belonging to the set Both Eqs. (16–18) exhibit linearity and are articulated using SNR (Signal-to-Noise Ratio) formulations. Consequently, the constraint 2 can be detailed as follows.
19 |
Equation (19) is represented in matrix form as follows.
20 |
21 |
Then, Eq. (20) can be articulated as follows.
22 |
In conclusion, the problem can be reformulated as follows.
23 |
Subjected to:
24 |
where, .
The optimization challenge, labelled as , regarding power control, can be depicted in the conventional format used for linear programming. As a result, can be articulated subsequently:
25 |
Equation (26) is the standard format of Eq. (25).
26 |
where and , denotes the slack parameters and denotes the excess parameter, therefore . Lastly, the usual form of Eq. (24) is as follows.
27 |
28 |
29 |
From now on, Eqs. (28) and (29) can be reformulated as follows.
30 |
31 |
Lastly, the optimization problem can be reformulated as follows.
32 |
Subjected to:
33 |
where , , and .
User grouping/pairing
Developing an efficient algorithm that requires minimal computational time for user pairing or grouping is pivotal for a high-performing NOMA uplink system. Addressing this, a new variant of the GWO algorithm is introduced to tackle the inherent complexity. It has been explored to derive both optimal and near-optimal solutions for the user pairing/grouping challenge, with the ultimate goal of enhancing overall system performance. The essence of this problem lies in leveraging the differences in channel gain among various users within a network. The primary motivation is to amplify the spectral efficiency of the system. A specific strategy to resolve the user pairing/grouping dilemma involves employing a search methodology. When a user pairing/grouping scheme is fixed, the optimal solution can be identified. The next step involves ranking all users in descending order based on the metric . The algorithm in this study for the user pairing/grouping issue is detailed in “Proposed algorithm”. The first step is to delineate the viable solutions for user grouping, considering both the exhaustive and the proposed algorithm. An exhaustive search meticulously examines every data point within the designated search region, ensuring the most accurate match is identified. However, this method demands significant computational resources. This becomes especially challenging for discrete problems where no straightforward solution is apparent. In such cases, it might be necessary to sequentially scrutinize every potential scenario to identify the most suitable solution. Relying solely on the exhaustive search algorithm becomes increasingly challenging as the number of comparisons intensifies quickly.
Proposed algorithm
This section details the formulation of the proposed Multi-Learning Grey Wolf Optimizer (MLGWO), in addition to the basic concepts of the original GWO algorithm.
Original GWO algorithm
The Grey Wolf Optimizer (GWO) is an optimization algorithm inspired by the social hierarchy and hunting behaviour of grey wolves in nature11. In the wild, grey wolves are known for their strategic hunting techniques, which involve tracking, encircling, and pursuing their prey. This behaviour is mimicked in the GWO algorithm to search for optimal solutions in a given problem space. Grey wolves can be categorized into four main classes based on their social dominance: (i) Alpha (): It is the leader of the pack and makes decisions about hunting, sleeping places, and the time of hunting; (ii) Beta (): These wolves assist the alpha in decision-making and take the lead in their absence; (iii) Delta (): These wolves obey the alpha and beta but dominate the omegas; (iv) Omega (): The lowest ranking wolves.
The mathematical modelling of the GWO algorithm is discussed as follows. Start by initializing a population of grey wolves, where each wolf represents a potential solution in the problem space. Each wolf has a position defined in a multi-dimensional space.
34 |
where is the position of the wolf and is the dimensionality of the problem. The hunting behaviour of the wolves is modelled using three main steps: encircling the prey, hunting, and attacking. The wolves try to encircle the prey, which is mathematically modelled as follows.
35 |
36 |
where are coefficient vectors, are the positions of the alpha, beta, and delta wolves, respectively, and are random vectors in , where decreases linearly from 2 to 0 over iterations. The position of each wolf is updated based on the positions of the alpha, beta, and delta wolves:
37 |
The wolves approach the prey based on a random approach as follows.
38 |
where is the best position found so far, rand denotes the random number between 0 and 1, and is the current position of the wolf. Based on fitness, the wolves are sorted after every iteration. The alpha wolf is the best, the beta wolf is the second best, and the delta wolf is the third best. Up until a stopping condition is satisfied, the algorithm iterates, adjusting the wolves' position in accordance with their hunting behaviour. This could be any other problem-specific constraint, such as a minimal error limit or a maximum number of iterations. The alpha wolf's position indicates the best way to solve the issue once the algorithm has converged. The GWO algorithm looks for the best solution in a given problem area by drawing on the social hierarchy and hunting habits of grey wolves. The algorithm converges when a stopping requirement is satisfied, and the wolf locations are updated in accordance with the best solutions discovered thus far (alpha, beta, and delta).
Multi-learning Grey Wolf Optimizer
The GWO is a bio-inspired optimization algorithm that simulates the hunting behaviour of grey wolves. While GWO has shown promise in various optimization tasks, it has certain limitations that have motivated researchers to propose improved versions. The primary reasons for improving the performance of the GWO and the problems associated with the original version are as follows. (i) One of the main issues with the original GWO is its tendency to converge prematurely to a local optimum, especially in complex, multi-modal optimization landscapes. Therefore, the algorithm might find a solution quickly, but it might not be the best possible solution; (ii) Over iterations, the wolf agents in GWO can cluster around specific regions of the search space, leading to a lack of diversity. This clustering can further exacerbate the problem of premature convergence as the algorithm becomes less explorative and more exploitative; (iii) Like many optimization algorithms, GWO faces the challenge of balancing exploration and exploitation. The original GWO can sometimes lean towards exploitation, missing out on potentially better solutions in unexplored regions; (iv) For high-dimensional problems, the original GWO might struggle to find optimal solutions within a reasonable computational time; (v) The original GWO uses fixed mathematical equations for updating wolf positions. This lack of adaptability means that the algorithm might not be well-suited for problems that require dynamic adjustments based on the search landscape90,91.
To address these challenges, various enhancements, such as integrating competitive learning92, Q-learning93, and greedy selection94, have been proposed in this study. These enhancements aim to increase the diversity of the search agents, guide the search process based on learned experiences, and ensure consistent improvement over iterations, making the algorithm more robust, adaptable, and efficient. Competitive learning can maintain diversity in the population, preventing agents from clustering around specific regions. By making agents compete, competitive learning can ensure a balance between exploring new regions and exploiting known good regions. Q-learning introduces a memory component, allowing the algorithm to learn from past actions and their outcomes. This memory-guided exploration can prevent the algorithm from repeatedly exploring sub-optimal regions. Q-learning allows GWO to adapt its behaviour based on the learned Q-values, making the search process more intelligent. Greedy selection ensures that the algorithm always retains the best solutions, leading to consistent improvement over iterations, and it helps in avoiding sub-optimal decisions made in any single iteration, ensuring that the algorithm is always moving in the right direction.
Competitive learning
Competitive learning is a type of neural network learning algorithm where neurons in a layer compete among themselves to be activated. It is often associated with self-organizing maps and vector quantization92,95. The fundamental idea is that, given an input, only one neuron is activated, and this neuron is the one whose weight vector is closest to the input vector. In optimization algorithms, competitive learning can be thought of as agents or solutions competing against each other based on their performance or fitness. The competition ensures that only the best agents get selected or updated, leading to a more refined search process. In the proposed MLGWO, competitive learning is introduced as a mechanism where wolves (agents) compete against each other during the optimization process. The following competitive learning strategies are included in the MLGWO. Equation (39) introduces the first competitive learning strategy.
39 |
Equation (40) introduces the second competitive learning strategy.
40 |
Equation (4) introduces the third competitive learning strategy.
41 |
where denotes the position of the alpha wolf, denotes the uniform random number between 0 and 1, denotes the problem dimension, and are randomly selected solutions from the population. Finally, the update based on competitive learning can be modelled as follows.
42 |
where is a scaling factor, are random coefficients in the range [0,1], determining the influence of each competitor, and denotes the wolf's position. A set of competitor wolves with positions , where and is the number of competitors. Equation (42) represents the wolf's position being updated based on the differences between its position and the positions of its competitors. The random coefficients ensure variability in the updates, promoting exploration. For each wolf in the population, a subset of other wolves is randomly selected as competitors. This subset does not include the current wolf. The current wolf's position is updated based on its interaction with the selected competitors. This update can be based on various strategies, such as (i) using the difference between the positions of two randomly selected competitors and (ii) combining the differences between multiple pairs of competitors. As wolves compete, they are forced to explore different regions of the search space, ensuring diversity. This is crucial in preventing premature convergence to local optima. Competitive learning strikes a balance between exploration and exploitation. While the competition ensures that wolves are constantly trying to outdo each other, the best-performing wolves guide the search towards promising regions. By making wolves compete against each other, competitive learning ensures that the search space is explored more thoroughly. In standard GWO, wolves can sometimes cluster around specific regions, leading to stagnation. Competitive learning disrupts this clustering by constantly pushing wolves to outperform their competitors. The competitive mechanism allows the algorithm to adapt its behaviour based on the current state of the search. If a wolf finds itself in a less promising region, the competition will likely push it towards a better region. Competitive learning makes the algorithm more robust against getting trapped in local optima. The constant competition ensures that even if some wolves converge to a local optimum, others continue to explore and potentially find better solutions. In summary, competitive learning introduces a dynamic, adaptive competition mechanism into GWO, ensuring a more thorough exploration of the search space, maintaining diversity, and enhancing the algorithm's robustness and adaptability. The features of combining GWO and competitive learning are as follows. (i) The variability in updates ensures that different regions of the search space are explored; (ii) By being influenced by multiple competitors, wolves are less likely to cluster around specific regions, preventing stagnation; and (iii) While the competition promotes exploration, the influence of the best-performing competitors ensures that the search is also guided towards promising regions. The pseudocode for competitive learning is provided in Algorithm 1.
Algorithm 1.
Pseudocode for the competitive learning.
Q-learning integration
Q-learning is a model-free reinforcement learning algorithm used to find the optimal action-selection policy for a given finite Markov decision process96,97. It works by learning an action-value function and using it to choose the most valuable action at each state. Each agent maintains a Q-table, allowing it to make informed decisions on its movements in the search space. This memory-guided exploration ensures that the algorithm learns from past actions, preventing it from repeatedly exploring sub-optimal regions and enhancing its adaptability. In the proposed MLGWO, Q-learning is integrated to guide the search of the agents in the optimization landscape based on the learned Q-values. The Q-values represent the expected future rewards for taking an action in a particular state. The Q-learning update rule is as follows.
43 |
where is the current state, is the action taken, denotes the reward received after taking action in state . is the learning rate, and is the discount factor. In this study, states are the current positions of the agents, and actions are the decisions made by the agents to move in the search space. Q-learning introduces a memory component, allowing the algorithm to learn from past actions and their outcomes. This memory-guided exploration can prevent the algorithm from repeatedly exploring sub-optimal regions. Q-learning allows GWO to adapt its behaviour based on the learned Q-values, making the search process more intelligent. The following steps are followed in this study.
Initialize the Q-table with zeros. The Q-table table has states as rows and actions as columns. In this study, states can be the positions of the wolves, and actions are the decisions made by the wolves to move in the search space.
At each iteration, for each agent, select an action based on the Q-values, and it can be done using an -greedy strategy where with probability , a random action is chosen, and with probability , the action with the highest Q-value for the current state is chosen.
After taking action and observing the new state and reward, update the Q-values using the Q-learning update rule.
After several iterations, the Q-values converge, and the policy (optimal action for each state) can be extracted from the Q-table.
The pseudocode for the Q-learning is provided in Algorithm 2.
Algorithm 2.
Pseudocode for the Q-learning.
Q-learning allows the GWO to build a knowledge base from past iterations. Each agent (grey wolf) records the outcomes of its actions in a Q-table, where the state-action pairs are associated with a value representing the quality (Q-value) of taking a specific action in a given state. The Q-value is updated iteratively based on the rewards received from the environment (i.e., the improvement in solution quality). By referencing the Q-table, agents can make informed decisions about which actions are likely to yield the best results based on historical performance. This guidance helps in avoiding previously explored suboptimal regions and directs the search towards more promising areas of the solution space. Consequently, this enhances the efficiency of the search process, ensuring that computational resources are utilized effectively. Q-learning inherently balances exploration and exploitation through its action-selection strategy, often implemented using an epsilon-greedy approach. Initially, agents are encouraged to explore various actions to gather comprehensive knowledge about the solution space. As the optimization progresses and the Q-table becomes more accurate, the agents increasingly exploit known high-quality actions to refine their solutions. The Q-values are continuously updated based on new experiences, allowing the optimizer to adapt dynamically to the evolving search landscape. This adaptability ensures that the search process remains relevant and effective even as the optimizer discovers new high-potential regions. By learning from past actions and adjusting future actions accordingly, Q-learning enhances the overall robustness and reliability of the optimization process.
Greedy selection
Greedy selection is about making the locally optimal choice at each stage with the hope of finding a global optimum. In this study, it is about selecting the best solutions between the standard GWO and the MLGWO at each iteration. Greedy selection ensures that the algorithm always retains the best solutions found so far, leading to consistent improvement over iterations. By always choosing the best available solutions, the algorithm avoids getting stuck with sub-optimal decisions made in any single iteration. By consistently keeping the best solutions, the algorithm can converge faster to the optimal or near-optimal solution.
At every decision point, choose the option that seems best at the moment. This means that the algorithm makes a series of locally optimal choices. Greedy algorithms do not consider the broader implications of a choice; they pick the best option available right then. This can be both an advantage (due to its simplicity and speed) and a disadvantage (as it might not lead to the global optimum). Greedy algorithms are used in a wide range of applications, from network routing task scheduling to some classic optimization problems like the Knapsack problem, Huffman coding, and the coin change problem. After each iteration or update, evaluate the fitness of all solutions or agents. Compare the new solutions with the previous ones. Retaining the solution that is better between the new and the old one is the "greedy" step, as the algorithm immediately accepts any improvement without considering any long-term implications. By always choosing the best solution available at each step, the algorithm can quickly move towards better solutions. Greedy selection is straightforward to implement and understand. It will not require complex decision-making processes. By continuously opting for better solutions, the algorithm can avoid getting stuck or stagnating at sub-optimal solutions for extended periods.
The primary limitation of greedy selection is that it can lead the algorithm to converge to local optima. Since it doesn't consider the broader landscape of the solution space, it might miss better solutions that would require initially moving to a worse position. Greedy algorithms can be overly exploitative, focusing too much on refining the current best solutions and not enough on exploring new regions of the solution space. In the proposed MLGWO, greedy selection ensures that after each iteration, the best solutions are retained. This helps in consistently moving towards optimal or near-optimal solutions and provides a mechanism to ensure that the algorithm doesn't regress in its search for the best solution.
Finally, various enhancements, such as integrating competitive learning, Q-learning, and greedy selection, have been proposed in this study. These enhancements aim to increase the diversity of the search agents, guide the search process based on learned experiences, and ensure consistent improvement over iterations, making the algorithm more robust, adaptable, and efficient. By combining these mechanisms, the GWO can achieve a more balanced and efficient search, potentially leading to faster convergence, better solution quality, and a higher likelihood of escaping local optima. The pseudocode of the proposed MLGWO is presented in Algorithm 3. The flowchart of the MLGWO is shown in Fig. 3.
Algorithm 3.
Pseudocode of the proposed MLGWO algorithm.
Fig. 3.
Flowchart of the proposed MLGWO.
System complexity
The MLGWO integrates Competitive Learning, Q-learning, and Greedy Selection, which streamline the optimization process. This integration reduces redundant calculations by focusing on high-potential solutions and dynamically adapting search strategies. In contrast, the standard GWO lacks these adaptive mechanisms, leading to higher iterations and increased complexity. Golden Jackal Algorithm (GJA) involves complex adaptive mechanisms for balancing exploration and exploitation, contributing to higher system complexity. PSO requires extensive parameter tuning and velocity updates, adding to the computational burden. The Slime Mould Algorithm (SMA) is known for its adaptive mechanisms based on slime mould behaviour and also suffers from increased complexity due to its iterative nature and complex biological inspirations.
The position matrix stores the positions of all search agents. Its size is , so its space complexity is . The Q-values for Q-learning are stored in a 3D matrix of size . Its space complexity is . The overall space complexity is dominated by the ‘Positions’ matrix and the Q-table, making it .
Initializing the positions and evaluating their fitness values takes . The main loop runs for ‘’ iterations. Within each iteration: (i) Updating the positions of the agents based on the alpha, beta, and delta positions involves operations proportional to ; (ii) The competitive learning-based update also involves operations proportional to ; (iii) The Q-learning update, including action selection and Q-value update, involves operations proportional to . Given that these operations are sequential and not nested, the time complexity within the main loop is . Since the main loop runs for the maximum number of iterations, the overall time complexity is .
The proposed MLGWO offers significant advantages over standard GWO, GJA, PSO, and SMA despite sharing a similar theoretical complexity. MLGWO's integration of Competitive Learning, Q-learning, and Greedy Selection ensures faster convergence and reduced computational time by requiring fewer iterations to achieve high-quality solutions. This efficiency contrasts with the standard GWO's higher iteration count, GJA's complexity from adaptive mechanisms, PSO's increased demands in high-dimensional problems, and SMA's computational overhead from intricate biological-inspired updates.
Results and discussions
Case study descriptions
The section evaluates the performance of a NOMA uplink scenario. Precisely, it looks at how well a proposed user-grouping algorithm and power control approach works. MATLAB, a high-level language and interactive environment used for numerical computation, visualization, and programming, is used for the simulation process. The parameters used for the simulation are taken from existing literature and are listed in Table 1. The channel and position of the user are dispersed at random. This means that in the simulation, users can be anywhere and, on any channel, representing real-world unpredictability. The distance (or range) between the BS and the user is uniformly distributed, and it means that any distance between the user and the BS is equally likely. The channel response follows a Gaussian distribution. In simpler terms, most users have a channel response close to the average, with fewer users having exceptionally good or bad channel responses. This section aims to compare the effectiveness of the proposed algorithm established for (i) power control, (ii) user grouping, and (ii) decoding order in NOMA uplink systems. The proposed algorithm's performance is compared with four other algorithms, such as original GWO, SMA, PSO, and GJA. All algorithms, including the proposed MLGWO, are nature-inspired optimization techniques. They mimic certain behaviours observed in nature to find optimal or near-optimal solutions to problems. Each has its unique characteristics and mechanisms, making them suitable for different types of optimization challenges. The summary of all algorithms, except GWO and the proposed MLGWO, is presented as follows.
Table 1.
Simulation parameters of the uplink NOMA transmission.
Parameter | Value |
---|---|
Number of PRB/Group | 3 |
Number of users | 10 |
Number of cells | 1 |
Transmission power to noise ratio | 30 dB |
Minimum transmission rate requirement | 1.1 bits/s/Hz |
SMA: A population-based optimization technique that mimics the behaviour of slime mould is called the SMA. The SMA replicates the positive and negative feedback of the slime mould propagation wave using a mathematical model. The SMA mimics the ways in which slime mould moves and forages. To guarantee that they receive the highest possible concentration of substances, slime moulds can form a stronger path where food concentration is higher. It is used in network design problems, pathfinding, and other optimization tasks.
PSO: The PSO iteratively refines a potential solution based on a predetermined quality criterion and is modelled after the cooperative behaviour of real animals, such as fish or birds, that move towards a shared objective. In 1995, Russell Eberhart and James Kennedy originally introduced PSO. It is used for a variety of optimization and search problems. PSO moves a group of people known as particles through an area in stages, evaluates the objective function of each particle at each step, and multiplies the current velocity by an inertia variable. Both the social and cognitive coefficients are used. The theory states that the possibility of creating new particles is maximized in the right direction. PSO is versatile and has been applied to a wide range of optimization problems, including function optimization, neural network training, and system design.
GJA: The GJA is inspired by the social and hunting behaviour of golden jackals. Similar to the GWO, the golden jackal algorithm mimics the hunting behaviour of jackals. Jackals can hunt both individually and in small groups. The algorithm captures this behaviour to search and converge to an optimal solution. The GJA can be applied to various optimization problems, similar to the other nature-inspired algorithms.
The specific control parameters for these algorithms are provided in Table 2. The population size refers to the number of potential solutions that the algorithms start with. In this study, the population size is 30. The maximum number of iterations is 500 iterations. Each algorithm is run 30 times to ensure that the results are consistent. By running the algorithm multiple times, the users get a better sense of its average performance.
Table 2.
Control parameters of the selected algorithms.
Algorithms | Parameters | Value |
---|---|---|
GWO | Linearly varying from 2 to 0 | |
MLGWO | Linearly varying from 2 to 0 | |
Learning rate | 0.6 | |
Discount factor | 0.8 | |
Exploration rate | 0.1 | |
Number of competitors | 5 | |
PSO | Inertial weight | 0.9 |
Coefficient and | 2 | |
SMA | 0.03 | |
GJA | 1.5 (linearly decreased over iterations) |
User grouping strategies
This sub-section presents the simulation results for various user grouping strategies aimed at optimizing spectral efficiency and achievable sum rate using the proposed MLGWO in 5G uplink communication systems. Five strategies were evaluated: random, channel-based, distance-based, equal-power, and mixed grouping. It is also aimed to demonstrate the superior performance of the proposed MLGWO algorithm and other NOMA and OMA strategies in terms of spectral efficiency.
Random grouping serves as a baseline, where users are assigned to groups without considering channel conditions, distances, or power levels. This strategy provides a reference point for evaluating the effectiveness of more sophisticated grouping methods. As expected, the performance of random grouping is suboptimal, showing significant variability due to the lack of optimization. The spectral efficiency and sum rates are the lowest among all strategies, highlighting the importance of informed user grouping. Channel-based grouping prioritizes users with stronger channel gains, assigning them to the same group. This strategy significantly enhances performance metrics, as evidenced by higher spectral efficiency and achievable sum rates. The rationale behind this improvement is that users with stronger channels can better utilize the available power, leading to higher data rates. This strategy is particularly effective in environments with substantial channel gain variations, where exploiting these gains results in superior performance. Distance-based grouping focuses on user proximity to the base station, grouping users with similar distances together. This strategy aims to minimize path loss and signal degradation. The performance improvements are moderate compared to the channel-based approach. While it provides better spectral efficiency and sum rates than random grouping, it does not fully exploit channel conditions. However, in scenarios where user distances significantly impact signal strength and interference, this strategy offers noticeable benefits. The equal-power grouping strategy seeks to balance power distribution among groups, ensuring each group receives an equitable share of the available power. This approach results in some performance gains over random grouping, but it falls short of the channel-based and mixed strategies. The main limitation is that it does not consider channel gains or user distances, leading to suboptimal use of the available resources. Consequently, the spectral efficiency and sum rates are lower than those achieved by strategies that leverage more comprehensive information. Mixed grouping combines the strengths of channel-based and distance-based strategies. By considering both channel gains and user proximity, this approach adapts to varying conditions more effectively. The mixed strategy consistently yields the best performance, with the highest spectral efficiency and sum rates among all strategies. This is because it can dynamically adjust to different scenarios, exploiting strong channels while also minimizing path loss and interference. The synergy of channel and distance information results in balanced and optimal user grouping.
In channel-based grouping, the MLGWO algorithm efficiently identifies users with strong channels and groups them to maximize spectral efficiency. By dynamically adjusting the positions of search agents (representing user groupings), MLGWO ensures that the grouping strategy quickly converges to an optimal or near-optimal solution. In distance-based grouping, the MLGWO helps minimize path loss by effectively grouping users based on their distances from the base station. The algorithm iteratively refines the user groupings to ensure that users with similar distances are grouped together, enhancing signal strength and reducing interference. In the mixed grouping, the MLGWO algorithm's adaptive nature allows it to balance the influence of channel gains and user proximity. By iteratively adjusting the groupings based on real-time feedback from the objective function, MLGWO ensures that the mixed strategy can dynamically adapt to varying conditions, providing robust performance across different scenarios. The spectral efficiency of different user grouping strategies is presented in Fig. 4.
Fig. 4.
Spectral efficiency achieved by MLGWO for different user grouping strategies.
Since random grouping does not rely on optimization, the impact of MLGWO is minimal. The performance remains relatively low, serving as a baseline for comparison. In channel-based grouping, the proposed MLGWO optimizes the selection of users with strong channel gains, leading to higher spectral efficiency and sum rates. The algorithm ensures that the best possible groupings are identified quickly and effectively, leveraging the channel conditions to maximize performance. In distance-based grouping, the proposed MLGWO improves the grouping of users based on their proximity to the base station. While the impact is less pronounced than in channel-based grouping, the algorithm still enhances performance by minimizing path loss and improving signal strength. In equal power grouping, the MLGWO helps balance power distribution among groups more effectively than a non-optimized approach. However, the lack of consideration for channel gains and distances limits the overall impact of MLGWO compared to other strategies. The proposed MLGWO has the most significant impact on mixed grouping. By balancing channel gains and user distances, the algorithm ensures optimal groupings that adapt to varying conditions. The iterative refinement process of MLGWO leads to the highest spectral efficiency and sum rates among all strategies.
From the results, it is observed that channel-based and mixed strategies consistently outperform others, demonstrating significant improvements in spectral efficiency. Their ability to improve channel gains and user proximity results in superior performance. The channel-based and mixed strategies converge faster, indicating their efficiency in optimizing user grouping. The distance-based and equal-power strategies offer moderate improvements but do not fully improve available information for optimal performance. Random grouping serves as a baseline, illustrating the need for informed user grouping to achieve higher performance metrics.
Experimental results
This study aimed to demonstrate the superior performance of the proposed MLGWO algorithm over other existing algorithms and other NOMA and OMA strategies in terms of spectral efficiency. To achieve this, the study conducted a series of simulations and plotted the results to represent the performance metrics visually. The bandwidth settings in the simulations are chosen to reflect typical values used in 5G and beyond networks. Bandwidths of 10 MHz, 20 MHz, and higher are commonly employed to simulate realistic communication scenarios. These settings are in line with the bandwidth allocations specified for 5G networks by regulatory bodies such as the 3rd generation partnership project98,99. In the study, a comparison is made between the achievable sum rate of two communication approaches: NOMA and OMA. This comparison is visually represented in Fig. 5. The SNR from 10 to 60 dB used in the simulations cover a broad spectrum of realistic channel conditions. This range allows us to assess the system performance from moderate to high SNR scenarios, providing a comprehensive analysis of the achievable sum rate in NOMA uplink systems. The choice of this range is supported by established literature100–103, which frequently employs similar SNR ranges to evaluate the performance of communication systems. Using SNR from 10 to 60 dB enables us to evaluate the performance of the proposed MLGWO across a wide range of operational conditions. This comprehensive evaluation is crucial for understanding how the algorithm performs under varying signal quality, which directly impacts the achievable sum rate. By covering both moderate and high SNR scenarios, it is ensured that the simulation results provide a realistic and thorough assessment of the algorithm's effectiveness. The chosen SNR is reflective of practical scenarios encountered in real-world wireless communication systems. Higher SNR values (up to 60 dB) are particularly relevant for assessing performance in environments with strong signal strength and low noise, which are common in advanced 5G and beyond networks. This range allows us to demonstrate the robustness and scalability of our proposed algorithm under optimal conditions, as well as its effectiveness in less favourable conditions. By using these SNR values in the simulations, it is ensured that the results are both realistic and comparable with existing literature, providing a robust foundation for evaluating the performance of the proposed MLGWO in optimizing spectral efficiency in NOMA uplink systems.
Fig. 5.
Achievable sum rate in bps/Hz by the proposed MLGWO.
The spectral-efficiency metric is crucial in communication systems as it measures how efficiently the spectral bandwidth is utilized. A higher spectral efficiency indicates that more data can be transmitted over a given bandwidth, making the communication system more efficient. From the data presented, it is evident that the NOMA approach outperforms the OMA approach in terms of spectral efficiency. Specifically, the spectral efficiency achieved by the NOMA scheme is significantly higher than that of the OMA scheme. This superiority of NOMA over OMA in spectral efficiency is a notable finding. The study also introduces a sub-optimal approach for NOMA. Interestingly, the spectral efficiency of this sub-optimal approach is very close to what would be considered the optimal value. This means that even though the approach might not be the absolute best possible, it is very close to achieving the highest spectral efficiency. The proposed MLGWO plays a pivotal role in this study. The MLGWO is designed to find near-optimal solutions. One of the standout features of the MLGWO is that it achieves this near-optimal performance without requiring extensive computational resources and makes the proposed MLGWO both efficient and effective. As the number of users in the system increases, the computational demands of certain algorithms also rise. The exhaustive search algorithm, for instance, sees a significant increase in its computational cost as more users are added. In contrast, the MLGWO maintains a relatively low computational complexity, making it a more scalable and practical choice for systems with a large number of users. For NOMA uplink systems, a benchmark is set using the power control approach detailed in the study by35. The spectral efficiency values achieved in their study are close to the optimal value, making it a valuable reference point. However, it is crucial to note a limitation in35: it is designed to work effectively only for systems with two user-pairings. Given this limitation, the proposed scheme in the current study offers a significant advantage. It is adept at efficiently grouping multiple users, not just two. This capability ensures that the proposed scheme is versatile and can handle scenarios with more than just a pair of users, making it a more comprehensive solution for NOMA uplink systems. Therefore, the study provides a thorough comparison of NOMA and OMA approaches, highlighting the superior spectral efficiency of NOMA. The proposed sub-optimal approach and the MLGWO further enhance the efficiency and effectiveness of the NOMA system, especially when dealing with multiple users.
In addition, the study has also presented an achievable sum rate, i.e., spectral efficiency obtained from various competitive algorithms. Figure 6a–d present a comprehensive evaluation of the performance of four distinct algorithms: the GWO, the GJA, the PSO, and the SMA. The primary metric used to assess the performance of these algorithms is spectral efficiency. An interesting observation from the study is the closeness in performance between the optimal and sub-optimal solutions. In many scenarios, the difference between an optimal solution and a sub-optimal solution can be significant. However, in this context, the spectral efficiency of both solutions is remarkably close to each other. This suggests that even if the system does not achieve the absolute best solution, the performance drop is not significant, and the system still operates near its peak efficiency.
Fig. 6.
Achievable sum rate in bps/Hz by: (a) GWO, (b) GJA, (c) PSO, (d) SMA.
The study by35 provides a power control scheme specifically designed for the NOMA uplink system. This scheme is employed as a benchmark or reference point in the current study. Benchmarks are essential as they offer a standard against which new or alternative solutions can be compared. Upon analysis, a noteworthy observation emerges that the spectral efficiency achieved by GWO, PSO, and SMA surpasses the results of both the power control scheme from35 and the OMA scheme. However, the performance of GJA is worse compared to all algorithms and the benchmark results but better than the OMA scheme. This is a significant finding as it underscores the effectiveness and superiority of the evaluated algorithms in optimizing spectral efficiency. Therefore, Fig. 4 offers a deep dive into the performance of four nature-inspired algorithms in the realm of spectral efficiency. The results highlight the prowess of the NOMA scheme over OMA and emphasize the near-par performance of optimal and sub-optimal solutions. Furthermore, the assessed algorithms not only meet but exceed the benchmark set by35 (except GJA), showcasing their potential to enhance communication systems.
Figure 7 shows the spectral efficiency versus the number of users for different iterations. In this plot, the study compared the spectral efficiency against the number of users for different iterations, specifically for 100, 200, 300, 400, and 500 iterations. The proposed algorithm consistently achieved higher spectral efficiency across all user counts when compared to other algorithms. As the number of iterations increased, the spectral efficiency of the MLGWO algorithm showed a more pronounced improvement, indicating the algorithm's ability to optimize better over time. Figure 8 shows the rates for specific iterations, i.e., 100. By focusing on a specific iteration, it is possible to delve deeper into the performance distinctions. Even at specific iterations, the MLGWO consistently outperformed others, further solidifying its superiority. Figure 9 shows the heatmaps of rates, and it provides a visual representation of the rate distributions over iterations and power levels. The colour intensity in the heatmap corresponding to the MLGWO was consistently higher, indicating better rate achievements across the board.
Fig. 7.
Performance of MLGWO over different iterations for both NOMA and OMA.
Fig. 8.
Rates obtained for 100 iterations by MLGWO for the NOMA and OMA.
Fig. 9.
Heatmaps generated obtained for 500 iterations by MLGWO: (a) sub-optimal NOMA, (b) optimal NOMA, (c) OMA.
Figure 10 shows the rates over iterations for specific power levels, and this plot provides a detailed view of rate variations over iterations for a chosen power level. The proposed algorithm's curve was consistent, showcasing its ability to maintain superior performance across iterations. Figure 11a–c present a meticulous comparison of various algorithms and approaches in the context of spectral efficiency. Upon analyzing Fig. 11a–c, it becomes evident that the performance of the proposed optimal, sub-optimal, and OMA approaches, when evaluated using the algorithms GWO, GJA, PSO, and SMA, are remarkably close to each other. This means that there is no significant difference in spectral efficiency among these approaches when assessed using the mentioned algorithms. Such a close performance indicates that each approach, regardless of being optimal, sub-optimal, or traditional (OMA), can achieve near-similar results when optimized using these algorithms. However, the spectral efficiency achieved by the proposed MLGWO is remarkably better than all the selected algorithms and traditional approaches. The study further delves into the performance of these algorithms at higher values of the parameter, specifically when this value exceeds 20 dB. In such scenarios, a standout observation is made: the optimal approach, when evaluated using the MLGWO, exhibits superior spectral efficiency compared to all other algorithms.
Fig. 10.
Rates at =30 dB were obtained for 500 iterations by MLGWO for sub-optimal NOMA, optimal NOMA and OMA.
Fig. 11.
Performance comparison among all algorithms: (a) Optimal NOMA, (b) Sub-optimal NOMA, (c) OMA.
This suggests that at higher values of , the MLGWO algorithm, when applied to the optimal approach, is the most efficient in maximizing spectral efficiency. Figure 11 offers a detailed comparison of various communication approaches using different optimization algorithms. While the performance of these approaches is closely matched across the algorithms, the MLGWO shines in scenarios with higher values of , showcasing its potential to enhance spectral efficiency in such conditions.
Figure 12 provides an in-depth analysis of the convergence rates of all algorithms applied to the NOMA uplink system. Upon analyzing Fig. 12, it becomes evident that the MLGWO, GWO, GJA, PSO, and SMA algorithms all exhibit similar convergence rates when applied to the NOMA uplink system. This means that these algorithms, when tasked with finding the optimal or near-optimal solution for the system, take roughly the same number of iterations. However, a notable exception is observed with the MLGWO. The MLGWO takes longer to converge, specifically requiring over 200 iterations, compared to the other algorithms. This might initially seem like a disadvantage, but there is a significant improvement after 200 iterations: despite its slower convergence, the spectral efficiency achieved by MLGWO is significantly higher. Among all the algorithms evaluated, the proposed MLGWO stands out in terms of performance. When assessing spectral efficiency, which measures how efficiently the spectral bandwidth is utilized, the MLGWO outshines the GWO, GJA, PSO, and SMA algorithms. This superior performance indicates that the MLGWO is adept at maximizing the use of available bandwidth, leading to more efficient data transmission. Furthermore, the MLGWO offers another crucial advantage: stability. While some algorithms might exhibit fluctuations or drops in performance, the MLGWO consistently provides stable results. Moreover, it ensures that the minimum rate requirement for the system is met without any significant dips in performance. This stability, combined with its high spectral efficiency, underscores the potential of the MLGWO as a robust and efficient optimization algorithm for the NOMA uplink system.
Fig. 12.
Convergence curves obtained by all selected algorithms.
One of the most insightful plots in this study is the comparison of spectral efficiency against the number of users. This plot is crucial as it demonstrates how the MLGWO algorithm performs as the network becomes denser, which is a common scenario in real-world applications. As the number of users increased, the MLGWO algorithm maintained a high spectral efficiency, indicating its robustness in denser networks. In contrast, other algorithms showed a decline in performance with an increase in the number of users. This is indicative of their inability to handle increased network complexity as efficiently as the proposed MLGWO algorithm. The performance gap between the MLGWO algorithm and others became more pronounced with a higher number of users, emphasizing the scalability of the proposed approach. The comprehensive analysis, including the spectral efficiency against the number of users, has provided compelling evidence of the superiority of the proposed algorithm. Whether it is handling denser networks, achieving faster convergence, or optimizing spectral efficiency, the proposed algorithm consistently outperforms its counterparts. This makes it a promising candidate for real-world applications where network density and efficiency are of paramount importance. In order to compare the computational time of the proposed algorithm and other algorithms, Table 3 is provided with the computational time.
Table 3.
Computational time of all algorithms for different strategies.
Strategy | Computation time in minutes | ||||
---|---|---|---|---|---|
MLGWO | GWO | PSO | SMA | GJA | |
Optimal NOMA | 3.475 | 3.211 | 3.178 | 3.971 | 3.593 |
Sub-optimal NOMA | 3.417 | 3.114 | 3.076 | 3.848 | 3.509 |
OMA | 3.227 | 3.077 | 2.974 | 3.796 | 3.414 |
Table 3 compares the computational time of the proposed MLGWO with other state-of-the-art algorithms across different strategies, such as optimal NOMA, sub-optimal NOMA, and OMA. Optimal NOMA requires the highest computational time because it involves the most comprehensive and detailed optimization process. This strategy necessitates precise power allocation and user grouping to maximize spectral efficiency, which involves evaluating a larger solution space and more intricate calculations to achieve near-perfect SIC. Sub-optimal NOMA, on the other hand, takes slightly less time as it simplifies some of the optimization criteria, reducing the complexity of the calculations compared to the Optimal NOMA. While it still aims to enhance spectral efficiency, it employs approximations or less stringent conditions that require fewer iterations and computational resources than the optimal strategy. OMA takes the least computational time since it involves a simpler resource allocation method that does not require the same level of detailed optimization as NOMA strategies. The simpler nature of OMA means fewer calculations and iterations are needed, leading to reduced computational time. The results indicate that while MLGWO exhibits slightly higher computational times per iteration compared to GWO and PSO, it consistently outperforms SMA and GJA. For instance, MLGWO requires 3.475 min for the Optimal NOMA strategy, which is marginally higher than GWO (3.211 min) and PSO (3.178 min) but significantly lower than SMA (3.971 min) and GJA (3.593 min). Despite the marginally higher computational time per iteration, the advanced learning mechanisms in MLGWO, such as competitive learning, Q-learning, and greedy selection, ensure faster overall convergence and superior optimization performance, justifying the slight increase in computational time. These computational times are reasonable given the significant improvements in solution quality and convergence speed achieved by MLGWO, making it a highly effective and efficient algorithm for optimizing spectral efficiency in NOMA uplink systems.
Further discussions
The MLGWO integrates three mechanisms, as discussed earlier, which effectively balances exploration and exploitation. Competitive Learning maintains diversity by allowing agents to compete and adapt their strategies, preventing premature convergence. Q-learning leverages past experiences to guide the search process, ensuring that the optimizer learns from previous iterations and avoids redundant exploration of suboptimal regions. Greedy Selection retains the best solutions at each iteration, ensuring that high-quality solutions are not lost. This enhanced balance is a significant factor in the superior performance of MLGWO, as it ensures a comprehensive search of the solution space while refining the best solutions. The adaptive learning mechanisms in MLGWO, particularly Q-learning, allow the algorithm to adjust its search strategy based on historical performance data dynamically. This adaptability ensures that the optimizer can respond effectively to the evolving search landscape, improving convergence speed and solution quality. In contrast, traditional methods such as standard GWO, GJA, PSO, and SMA lack such adaptive learning capabilities, making them more prone to getting stuck in local optima or requiring extensive parameter tuning.
The integration of Greedy Selection and Q-learning in MLGWO leads to more efficient use of computational resources. By focusing computational efforts on high-potential solutions and avoiding redundant calculations, MLGWO reduces the overall computational burden. This efficiency is reflected in faster convergence times and lower computational demands compared to other state-of-the-art algorithms, which often involve more complex iterative processes and higher computational overheads. Competitive Learning introduces a mechanism where agents compete based on their performance, ensuring that a diverse set of solutions is explored. This diversity is crucial for avoiding premature convergence and ensuring that the optimizer can explore a wide range of potential solutions. Traditional methods like standard GWO and PSO often suffer from reduced diversity as the search progresses, leading to suboptimal performance. GJA and SMA, while innovative, still struggle with maintaining an optimal balance between exploration and exploitation without the structured competition provided by Competitive Learning. The empirical results demonstrate that MLGWO consistently achieves higher spectral efficiency and lower system complexity compared to standard GWO, GJA, PSO, and SMA. The observed performance improvements are attributed to the algorithm's ability to dynamically adapt its search strategy, maintain solution diversity, and efficiently exploit high-quality solutions. The quantitative metrics, including convergence speed, solution quality, and computational time, provide strong evidence of superior performance.
Conclusions
The NOMA approach has attracted significant interest lately due to its notable spectral efficiency, positioning it as a key player in enhancing the capacity of future networks. Effective user grouping and a robust power control strategy are pivotal in optimizing the performance of communication systems. The proposed study delves into a combined optimization challenge, focusing on user grouping and power control while ensuring minimum rate requirements. This is done within the context of uplink NOMA transmissions, aiming to strengthen spectral efficiency. A linear programming methodology is employed to handle the power control challenge. Additionally, the task of user pairing and grouping is addressed using five distinct algorithms: proposed MLGWO, GWO, GJA, PSO, and SMA. Simulation results indicate that the achieved spectral efficiency closely matches the optimal solution and surpasses the performance of OMA systems. Notably, the MLGWO algorithm, when applied to this integrated challenge for uplink transmissions, outperforms the traditional OMA in terms of spectral efficiency. Moreover, the suggested MLGWO demonstrates superior results compared to GWO, GJA, PSO, SMA, and other existing algorithms from previous studies. Better performance is achieved while maintaining a reduced system complexity and adhering to constraints specific to uplink NOMA systems. Furthermore, the results highlight that as the user count increases, the combined optimization problem becomes increasingly complicated, demanding more network resources for resolution.
The following future extensions provide a roadmap for continued research and development in the NOMA uplink systems, ensuring that the technology remains cutting-edge and reports the growing demands of communication networks. While the MLGWO algorithm has shown promising results, there is a potential to explore other nature-inspired algorithms or hybrid models that combine the strengths of multiple algorithms. This could further optimize the uplink NOMA transmissions. Machine learning techniques could be integrated to predict user behaviour and network demands, allowing for dynamic adjustments in user grouping and power control, leading to even more efficient resource utilization. As highlighted, the complexity increases with the number of users. Future research could focus on scalability solutions, ensuring that as networks grow, the performance remains optimal. The integration of NOMA with other emerging technologies, such as Massive MIMO or mmWave, could be explored to see how they complement each other in enhancing network performance. With increased spectral efficiency and user grouping, security challenges might arise. Future extensions could delve into ensuring that as performance is optimized, security is not compromised. Alongside spectral efficiency, energy consumption is a critical concern for future networks. Research could be directed towards making NOMA systems more energy-efficient without sacrificing performance. Instead of focusing solely on the physical layer, future research could explore optimizations that span multiple layers of the communication protocol stack, ensuring holistic improvements.
Acknowledgements
The authors appreciate the anonymous reviewers for their valuable insights and constructive feedback, which significantly enhanced the paper's quality.
Author contributions
SS: Conceptualization, Methodology, Software, Data curation, Formal analysis, Resources, Visualization, and Writing-original draft preparation. CK: Conceptualization, Software, Resources, Data curation, Validation, Visualization, and Writing-original draft preparation. MPK: Methodology, Software, Formal analysis, Investigation, Validation, Visualization, and Writing-review and editing. SR: Methodology, Software, Formal analysis, Validation, Visualization, and Writing-review and editing.
Funding
Open access funding provided by Manipal Academy of Higher Education, Manipal.
Data availability
The datasets used during the current study are available from the corresponding author upon reasonable request.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Sobana Sikkanan, Email: sobanaa@gmail.com.
Premkumar Manoharan, Email: mprem.me@gmail.com.
Sowmya Ravichandran, Email: sowmya.pr@manipal.edu.
References
- 1.Ghafouri, N., Movahhedinia, N. & Khayyambashi, M. R. A novel user grouping algorithm for downlink NOMA. Wirel. Pers. Commun.126(2), 1895–1914. 10.1007/S11277-022-09827-3/METRICS (2022). 10.1007/S11277-022-09827-3/METRICS [DOI] [Google Scholar]
- 2.Tian, Y. A non-orthogonal multiple access and interference mitigation combined strategy in multi-user networks. Wirel. Pers. Commun.104(1), 111–128. 10.1007/s11277-018-6011-z (2019). 10.1007/s11277-018-6011-z [DOI] [Google Scholar]
- 3.Wang, P., Xiao, J. & Ping, L. Comparison of orthogonal and non-orthogonal approaches to future wireless cellular systems. IEEE Veh. Technol. Mag.1(3), 4–11. 10.1109/MVT.2006.307294 (2006). 10.1109/MVT.2006.307294 [DOI] [Google Scholar]
- 4.Ding, Z. et al. A survey on non-orthogonal multiple access for 5G networks: Research challenges and future trends. IEEE J. Sel. Areas Commun.35(10), 2181–2195. 10.1109/JSAC.2017.2725519 (2017). 10.1109/JSAC.2017.2725519 [DOI] [Google Scholar]
- 5.Agiwal, M., Roy, A. & Saxena, N. Next generation 5G wireless networks: A comprehensive survey. IEEE Commun. Surv. Tutor.18(3), 1617–1655. 10.1109/COMST.2016.2532458 (2016). 10.1109/COMST.2016.2532458 [DOI] [Google Scholar]
- 6.Zhang, X., Gao, Q., Gong, C. & Xu, Z. User grouping and power allocation for noma visible light communication multi-cell networks. IEEE Commun. Lett.21(4), 777–780. 10.1109/lcomm.2016.2642921 (2017). 10.1109/lcomm.2016.2642921 [DOI] [Google Scholar]
- 7.Zhang, Y. & Ge, J. Impact analysis for user pairing on noma-based energy harvesting relaying networks with imperfect csi. IET Commun.12(13), 1609–1614. 10.1049/iet-com.2017.1120 (2018). 10.1049/iet-com.2017.1120 [DOI] [Google Scholar]
- 8.Liaqat, M., Noordin, K. A., AbdulLatef, T. & Dimyati, K. Power-domain non orthogonal multiple access (PD-NOMA) in cooperative networks: An overview. Wirel. Netw.26(1), 181–203. 10.1007/s11276-018-1807-z (2020). 10.1007/s11276-018-1807-z [DOI] [Google Scholar]
- 9.Holland, J. H. Genetic algorithms. Sci. Am.267(1), 66–72. 10.1038/SCIENTIFICAMERICAN0792-66 (1992).1411454 10.1038/SCIENTIFICAMERICAN0792-66 [DOI] [Google Scholar]
- 10.Bratton, D. & Kennedy, J. Defining a standard for particle swarm optimization. In Proceedings of the 2007 IEEE Swarm Intelligence Symposium, SIS 2007, 120–127, 10.1109/SIS.2007.368035 (2007).
- 11.Mirjalili, S., Mirjalili, S. M. & Lewis, A. Grey Wolf Optimizer. Adv. Eng. Softw.69, 46–61. 10.1016/j.advengsoft.2013.12.007 (2014). 10.1016/j.advengsoft.2013.12.007 [DOI] [Google Scholar]
- 12.Karaboga, D. & Basturk, B. Artificial bee colony (ABC) optimization algorithm for solving constrained optimization problems. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 4529 LNAI, 789–798, 10.1007/978-3-540-72950-1_77 (2007).
- 13.Dorigo, M., Birattari, M. & Stutzle, T. Ant colony optimization. IEEE Comput. Intell. Mag.1(4), 28–39. 10.1109/MCI.2006.329691 (2006). 10.1109/MCI.2006.329691 [DOI] [Google Scholar]
- 14.Kumar, B. V., Oliva, D. & Suganthan, P. N. eds. Differential Evolution: From Theory to Practice, Vol. 1009 10.1007/978-981-16-8082-3 (2022) .
- 15.Yang, X.-S. & Deb, S. Cuckoo search via levy flights. In World Congress on Nature & Biologically Inspired Computing (NaBIC), Coimbatore, India, 210–214. http://arxiv.org/abs/1003.1594 (IEEE, 2010).
- 16.Rashedi, E., Nezamabadi-pour, H. & Saryazdi, S. GSA: A gravitational search algorithm. Inf. Sci.179(13), 2232–2248. 10.1016/j.ins.2009.03.004 (2009). 10.1016/j.ins.2009.03.004 [DOI] [Google Scholar]
- 17.Johari, N. F., Zain, A. M., Noorfa, M. H. & Udin, A. Firefly algorithm for optimization problem. Appl. Mech. Mater.421, 512–517. 10.4028/WWW.SCIENTIFIC.NET/AMM.421.512 (2013). 10.4028/WWW.SCIENTIFIC.NET/AMM.421.512 [DOI] [Google Scholar]
- 18.Mirjalili, S. et al. Salp Swarm Algorithm: A bio-inspired optimizer for engineering design problems. Adv. Eng. Softw.114, 163–191. 10.1016/j.advengsoft.2017.07.002 (2017). 10.1016/j.advengsoft.2017.07.002 [DOI] [Google Scholar]
- 19.Mirjalili, S. & Lewis, A. The Whale Optimization Algorithm. Adv. Eng. Softw.95, 51–67. 10.1016/j.advengsoft.2016.01.008 (2016). 10.1016/j.advengsoft.2016.01.008 [DOI] [Google Scholar]
- 20.Faramarzi, A., Heidarinejad, M., Mirjalili, S. & Gandomi, A. H. Marine Predators Algorithm: A nature-inspired metaheuristic. Expert Syst. Appl.152, 113377. 10.1016/j.eswa.2020.113377 (2020). 10.1016/j.eswa.2020.113377 [DOI] [Google Scholar]
- 21.Faramarzi, A., Heidarinejad, M., Stephens, B. & Mirjalili, S. Equilibrium optimizer: A novel optimization algorithm. Knowl. Based Syst.191, 105190. 10.1016/j.knosys.2019.105190 (2020). 10.1016/j.knosys.2019.105190 [DOI] [Google Scholar]
- 22.Li, S., Chen, H., Wang, M., Asghar, A. & Mirjalili, S. Slime mould algorithm: A new method for stochastic optimization. Future Gener. Comput. Syst.111, 300–323. 10.1016/j.future.2020.03.055 (2020). 10.1016/j.future.2020.03.055 [DOI] [Google Scholar]
- 23.Ahmadianfar, I., Bozorg-haddad, O. & Chu, X. Gradient-based optimizer: A new metaheuristic optimization algorithm. Inf. Sci.540, 131–159. 10.1016/j.ins.2020.06.037 (2020). 10.1016/j.ins.2020.06.037 [DOI] [Google Scholar]
- 24.Chopra, N. & Mohsin Ansari, M. Golden jackal optimization: A novel nature-inspired optimizer for engineering applications. Expert Syst. Appl.198, 116924. 10.1016/J.ESWA.2022.116924 (2022). 10.1016/J.ESWA.2022.116924 [DOI] [Google Scholar]
- 25.Abdollahzadeh, B., Gharehchopogh, F. S., Khodadadi, N. & Mirjalili, S. Mountain Gazelle optimizer: A new nature-inspired metaheuristic algorithm for global optimization problems. Adv. Eng. Softw.174, 103282. 10.1016/J.ADVENGSOFT.2022.103282 (2022). 10.1016/J.ADVENGSOFT.2022.103282 [DOI] [Google Scholar]
- 26.Kumar, S., Jangir, P., Tejani, G. G. & Premkumar, M. MOTEO: A novel physics-based multiobjective thermal exchange optimization algorithm to design truss structures. Knowl. Based Syst.10.1016/J.KNOSYS.2022.108422 (2022). 10.1016/J.KNOSYS.2022.108422 [DOI] [Google Scholar]
- 27.Fathollahi-Fard, A. M., Hajiaghaei-Keshteli, M. & Tavakkoli-Moghaddam, R. Red deer algorithm (RDA): A new nature-inspired meta-heuristic. Soft Comput.24(19), 14637–14665. 10.1007/S00500-020-04812-Z (2020). 10.1007/S00500-020-04812-Z [DOI] [Google Scholar]
- 28.Abualigah, L. et al. Aquila Optimizer: A novel meta-heuristic optimization algorithm. Comput. Ind. Eng.157, 107250. 10.1016/j.cie.2021.107250 (2021). 10.1016/j.cie.2021.107250 [DOI] [Google Scholar]
- 29.Abualigah, L., Elaziz, M. A., Sumari, P., Geem, Z. W. & Gandomi, A. H. Reptile Search Algorithm (RSA): A nature-inspired meta-heuristic optimizer. Expert Syst. Appl.10.1016/J.ESWA.2021.116158 (2022). 10.1016/J.ESWA.2021.116158 [DOI] [Google Scholar]
- 30.Abualigah, L., Diabat, A., Mirjalili, S., AbdElaziz, M. & Gandomi, A. H. The arithmetic optimization algorithm. Comput. Methods Appl. Mech. Eng.376, 113609. 10.1016/J.CMA.2020.113609 (2021). 10.1016/J.CMA.2020.113609 [DOI] [Google Scholar]
- 31.Abdel-Basset, M., Abdel-Fatah, L. & Sangaiah, A. K. Metaheuristic algorithms: A comprehensive review. In Computational Intelligence for Multimedia Big Data on the Cloud with Engineering Applications, 185–231, 10.1016/B978-0-12-813314-9.00010-4 (2018).
- 32.Hussain, K., MohdSalleh, M. N., Cheng, S. & Shi, Y. Metaheuristic research: A comprehensive survey. Artif. Intell. Rev.52(4), 2191–2233. 10.1007/S10462-017-9605-Z (2018). 10.1007/S10462-017-9605-Z [DOI] [Google Scholar]
- 33.ur Rehman, B. et al. Joint user grouping and power control using whale optimization algorithm for NOMA uplink systems. PeerJ Comput. Sci.8, e882. 10.7717/PEERJ-CS.882/SUPP-1 (2022). 10.7717/PEERJ-CS.882/SUPP-1 [DOI] [Google Scholar]
- 34.Rehman, B. U. et al. Joint power control and user grouping for uplink power domain non-orthogonal multiple access. Int. J. Distrib. Sens. Netw.10.1177/15501477211057443/ASSET/IMAGES/LARGE/10.1177_15501477211057443-FIG3.JPEG (2021). 10.1177/15501477211057443/ASSET/IMAGES/LARGE/10.1177_15501477211057443-FIG3.JPEG [DOI] [Google Scholar]
- 35.Sedaghat, M. A. & Muller, R. R. On user pairing in uplink noma. IEEE Trans. Wirel. Commun.17(5), 3474–3486. 10.1109/twc.2018.2815005 (2018). 10.1109/twc.2018.2815005 [DOI] [Google Scholar]
- 36.MS Ali, H. T. E. H. Dynamic user clustering and power allocation for uplink and downlink non-orthogonal multiple access (NOMA) systems. IEEE Access4, 6325–6343 (2016). [Google Scholar]
- 37.Albataineh, Z., Hayajneh, K. F., Bany Salameh, H., Al Athamneh, R. & Jararweh, Y. Joint power control and user grouping mechanism for efficient uplink non-orthogonal multiple access-based 5G communication: Utilising the Lèvy-flight firefly algorithm. IET Netw.10.1049/NTW2.12084 (2023). 10.1049/NTW2.12084 [DOI] [Google Scholar]
- 38.Muhammed, A. J., Ma, Z., Diamantoulakis, P. D., Li, L. & Karagiannidis, G. K. Energy-efficient resource allocation in multicarrier NOMA systems with fairness. IEEE Trans. Commun.67(12), 8639–8654. 10.1109/TCOMM.2019.2938963 (2019). 10.1109/TCOMM.2019.2938963 [DOI] [Google Scholar]
- 39.Naeem, M., Illanko, K., Karmokar, A., Anpalagan, A. & Jaseemuddin, M. Optimal power allocation for green cognitive radio: fractional programming approach. IET Commun.7(12), 1279–1286. 10.1049/IET-COM.2012.0604 (2013). 10.1049/IET-COM.2012.0604 [DOI] [Google Scholar]
- 40.Biyoghe, J. & Balyan, V. Users’ grouping algorithm for fairness improvement of NOMA-based multi-beams satellite networks intended for 5G. IET Commun.17(15), 1780–1790. 10.1049/CMU2.12653 (2023). 10.1049/CMU2.12653 [DOI] [Google Scholar]
- 41.Huang, H., Shi, Y., Liang, L., Zhang, X. & Wang, C. On the outage probability of energy harvested cooperative multiuser cognitive NOMA network. IET Commun.17(13), 1465–1477. 10.1049/CMU2.12635 (2023). 10.1049/CMU2.12635 [DOI] [Google Scholar]
- 42.Wang, B. & Liu, K. J. R. Advances in cognitive radio networks: A survey. IEEE J. Sel. Top. Signal Process.5(1), 5–23. 10.1109/JSTSP.2010.2093210 (2011). 10.1109/JSTSP.2010.2093210 [DOI] [Google Scholar]
- 43.Pan, Y. et al. Cooperative trajectory planning and resource allocation for UAV-enabled integrated sensing and communication systems. IEEE Trans. Veh. Technol.73(5), 6502–6516. 10.1109/TVT.2023.3337106 (2024). 10.1109/TVT.2023.3337106 [DOI] [Google Scholar]
- 44.Cui, Y., Liu, F., Jing, X. & Mu, J. Integrating sensing and communications for ubiquitous IoT: Applications, trends, and challenges. IEEE Netw.35(5), 158–167. 10.1109/MNET.010.2100152 (2021). 10.1109/MNET.010.2100152 [DOI] [Google Scholar]
- 45.Aung, N., Zhu, T., Mahmoud El-Gayar, M. & Nasser Ajour, M. Resource allocation in UAV-enabled NOMA networks for enhanced six-G communications systems. Electronics12(24), 5033. 10.3390/ELECTRONICS12245033 (2023). 10.3390/ELECTRONICS12245033 [DOI] [Google Scholar]
- 46.Zhou, Y., Liu, X., Zhai, X., Zhu, Q. & Durrani, T. S. UAV-Enabled integrated sensing, computing, and communication for internet of things: Joint resource allocation and trajectory design. IEEE Internet Things J11(7), 12717–12727. 10.1109/JIOT.2023.3335937 (2024). 10.1109/JIOT.2023.3335937 [DOI] [Google Scholar]
- 47.Linh, N. T., Son, P. N. & Bao, V. N. Intelligent reflecting surface-assisted beamforming-NOMA networks for short-packet communications: Performance analysis and deep learning approach. IET Commun.17(16), 1940–1954. 10.1049/CMU2.12667 (2023). 10.1049/CMU2.12667 [DOI] [Google Scholar]
- 48.Ranjha, A., Naboulsi, D., El Emary, M. & Gagnon, F. Facilitating URLLC vis-á-vis UAV-enabled relaying for MEC systems in 6-G networks. IEEE Trans. Reliab.10.1109/TR.2024.3357356 (2024). 10.1109/TR.2024.3357356 [DOI] [Google Scholar]
- 49.Al-Abbasi, Z. Q. & So, D. K. C. Resource allocation in non-orthogonal and hybrid multiple access system with proportional rate constraint. IEEE Trans. Wirel. Commun.16(10), 6309–6320. 10.1109/TWC.2017.2721936 (2017). 10.1109/TWC.2017.2721936 [DOI] [Google Scholar]
- 50.Wei, Z., Ng, D. W. K., Yuan, J. & Wang, H. M. Optimal resource allocation for power-efficient MC-NOMA with imperfect channel state information. IEEE Trans. Commun.65(9), 3944–3961. 10.1109/TCOMM.2017.2709301 (2017). 10.1109/TCOMM.2017.2709301 [DOI] [Google Scholar]
- 51.Wei, Z., Ng, D. W. K. & Yuan, J. Power-efficient resource allocation for MC-NOMA with statistical channel state information. In 2016 IEEE Global Communications Conference, GLOBECOM 2016 - Proceedings, 10.1109/GLOCOM.2016.7842161 (2016).
- 52.Sun, Y., Ng, D. W. K., Ding, Z. & Schober, R. Optimal joint power and subcarrier allocation for MC-NOMA systems. In 2016 IEEE Global Communications Conference, GLOBECOM 2016 - Proceedings, 10.1109/GLOCOM.2016.7842087 (2016).
- 53.Wang, J., Peng, Q., Huang, Y., Wang, H. M. & You, X. Convexity of weighted sum rate maximization in NOMA systems. IEEE Signal. Process. Lett.24(9), 1323–1327. 10.1109/LSP.2017.2722546 (2017). 10.1109/LSP.2017.2722546 [DOI] [Google Scholar]
- 54.Suraci, C. et al. Joint power and channel allocation for non-orthogonal multiple access in 5G networks and beyond. Sensors23(19), 8040. 10.3390/S23198040 (2023). 10.3390/S23198040 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Ranjha, A., Javed, M. A., Srivastava, G. & Asif, M. Quasi-optimization of resource allocation and positioning for solar-powered UAVs. IEEE Trans Netw Sci Eng10(6), 4071–4081. 10.1109/TNSE.2023.3282870 (2023). 10.1109/TNSE.2023.3282870 [DOI] [Google Scholar]
- 56.Panda, S. Joint user patterning and power control optimization of MIMO–NOMA systems. Wirel. Pers. Commun.112(4), 2557–2573. 10.1007/S11277-020-07164-X/METRICS (2020). 10.1007/S11277-020-07164-X/METRICS [DOI] [Google Scholar]
- 57.Gandomi, A. H. & Kashani, A. R. Construction cost minimization of shallow foundation using recent swarm intelligence techniques. IEEE Trans. Indu.Inform.14(3), 1099–1106. 10.1109/TII.2017.2776132 (2018). 10.1109/TII.2017.2776132 [DOI] [Google Scholar]
- 58.Zeng, M., Nguyen, N. P., Dobre, O. A., Ding, Z. & Poor, H. V. Spectral-and energy-efficient resource allocation for multi-carrier uplink NOMA systems. IEEE Trans. Veh. Technol.68(9), 9293–9296. 10.1109/TVT.2019.2926701 (2019). 10.1109/TVT.2019.2926701 [DOI] [Google Scholar]
- 59.Ranjha, A. et al. Toward facilitating power efficient URLLC systems in UAV networks under jittering. IEEE Trans. Consum. Electron.70(1), 3031–3041. 10.1109/TCE.2023.3305550 (2024). 10.1109/TCE.2023.3305550 [DOI] [Google Scholar]
- 60.Endo, Y., Kishiyama, Y. & Higuchi, K. Uplink non-orthogonal access with MMSE-SIC in the presence of inter-cell interference. In Proceedings of the International Symposium on Wireless Communication Systems, 261–265, 10.1109/ISWCS.2012.6328370 (2012) .
- 61.Li, A., Benjebbour, A., Chen, X., Jiang, H. & Kayama, H. Uplink non-orthogonal multiple access (NOMA) with single-carrier frequency division multiple access (SC-FDMA) for 5G systems. IEICE Trans. Commun.5(8), 1426–1435. 10.1587/TRANSCOM.E98.B.1426 (2015). 10.1587/TRANSCOM.E98.B.1426 [DOI] [Google Scholar]
- 62.Rehman, B. U. et al. Uplink power control scheme for spectral efficiency maximization in NOMA systems. Alex. Eng. J.64, 667–677. 10.1016/J.AEJ.2022.11.030 (2023). 10.1016/J.AEJ.2022.11.030 [DOI] [Google Scholar]
- 63.Pham, Q.-V. et al. Swarm intelligence for next-generation networks: Recent advances and applications. J. Netw. Comput. Appl.191, 103141. 10.1016/J.JNCA.2021.103141 (2021). 10.1016/J.JNCA.2021.103141 [DOI] [Google Scholar]
- 64.Afridi, A., Hameed, I., Garcia, C. E. & Koo, I. Throughput maximization of wireless powered IoT network with hybrid NOMA-TDMA scheme: A genetic algorithm approach. IEEE Access12, 65241–65253. 10.1109/ACCESS.2024.3396497 (2024). 10.1109/ACCESS.2024.3396497 [DOI] [Google Scholar]
- 65.Mohar, S. S., Goyal, S. & Kaur, R. Localization of sensor nodes in wireless sensor networks using bat optimization algorithm with enhanced exploration and exploitation characteristics. J. Supercomput.78(9), 11975–12023. 10.1007/S11227-022-04320-X/TABLES/8 (2022). 10.1007/S11227-022-04320-X/TABLES/8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Bacanin, N., Antonijevic, M., Bezdan, T., Zivkovic, M. & Rashid, T. A. Wireless Sensor Networks Localization by Improved Whale Optimization Algorithm, 769–783, 10.1007/978-981-16-6332-1_62 (2022).
- 67.Tuba, E., Tuba, M. & Beko, M. Node localization in ad hoc wireless sensor networks using fireworks algorithm. In International Conference on Multimedia Computing and Systems -Proceedings, Vol. 0, 223–229 10.1109/ICMCS.2016.7905647 (2017).
- 68.Cheng, J. & Xia, L. An effective cuckoo search algorithm for node localization in wireless sensor network. Sensors16(9), 1390. 10.3390/S16091390 (2016). 10.3390/S16091390 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Paul, A. K. & Sato, T. Localization in wireless sensor networks: A survey on algorithms, measurement techniques, applications and challenges. J. Sens. Actuator Netw.6(4), 24. 10.3390/JSAN6040024 (2017). 10.3390/JSAN6040024 [DOI] [Google Scholar]
- 70.Kaur, G., Jyoti, K., Mittal, N., Mittal, V. & Salgotra, R. Optimized approach for localization of sensor nodes in 2D wireless sensor networks using modified learning enthusiasm-based teaching–learning-based optimization algorithm. Algorithms16(1), 11. 10.3390/A16010011 (2022). 10.3390/A16010011 [DOI] [Google Scholar]
- 71.Hendrarini, N., Asvial, M. & Sari, R. F. Wireless sensor networks optimization with localization-based clustering using game theory algorithm. Int. J. Technol.13(1), 213–224. 10.14716/IJTECH.V13I1.4850 (2022). 10.14716/IJTECH.V13I1.4850 [DOI] [Google Scholar]
- 72.Doherty, L., Pister, K. S. J. & El Ghaoui, L. Convex position estimation in wireless sensor networks. Proc. IEEE INFOCOM3, 1655–1663. 10.1109/INFCOM.2001.916662 (2001). 10.1109/INFCOM.2001.916662 [DOI] [Google Scholar]
- 73.Goudos, S. K, Diamantoulakis, P. D., Boursianis, A. D., Papanikolaou, V. K. & Karagiannidis, G. K. Joint user association and power allocation using swarm intelligence algorithms in non-orthogonal multiple access networks. In 2020 9th International Conference on Modern Circuits and Systems Technologies, MOCAST 2020, 10.1109/MOCAST49295.2020.9200248 (2020).
- 74.Nadimi-Shahraki, M. H., Taghian, S. & Mirjalili, S. An improved grey wolf optimizer for solving engineering problems. Expert Syst. Appl.166, 113917. 10.1016/J.ESWA.2020.113917 (2021). 10.1016/J.ESWA.2020.113917 [DOI] [Google Scholar]
- 75.Alawad, N. A. & Abed-alguni, B. H. Discrete island-based cuckoo search with highly disruptive polynomial mutation and opposition-based learning strategy for scheduling of workflow applications in cloud environments. Arab. J. Sci. Eng.46(4), 3213–3233. 10.1007/S13369-020-05141-X/METRICS (2021). 10.1007/S13369-020-05141-X/METRICS [DOI] [Google Scholar]
- 76.Wang, H., Li, K. & Pedrycz, W. An elite hybrid metaheuristic optimization algorithm for maximizing wireless sensor networks lifetime with a sink node. IEEE Sens. J.20(10), 5634–5649. 10.1109/JSEN.2020.2971035 (2020). 10.1109/JSEN.2020.2971035 [DOI] [Google Scholar]
- 77.Fang, F., Cheng, J. & Ding, Z. Joint energy efficient subchannel and power optimization for a downlink NOMA heterogeneous network. IEEE Trans. Veh. Technol.68(2), 1351–1364. 10.1109/TVT.2018.2881314 (2019). 10.1109/TVT.2018.2881314 [DOI] [Google Scholar]
- 78.Fang, F., Zhang, H., Cheng, J. & Leung, V. C. M. Energy-efficient resource allocation for downlink non-orthogonal multiple access network. IEEE Trans. Commun.64(9), 3722–3732. 10.1109/TCOMM.2016.2594759 (2016). 10.1109/TCOMM.2016.2594759 [DOI] [Google Scholar]
- 79.Yang, Y. & Pesavento, M. A parallel algorithm for energy efficiency maximization in massive MIMO networks. In 2016 IEEE Global Communications Conference, GLOBECOM 2016 - Proceedings10.1109/GLOCOM.2016.7841532 (2016).
- 80.Li, J., Gao, T., He, B., Zheng, W. & Lin, F. Power allocation and user grouping for NOMA downlink systems. Appl. Sci.13(4), 2452. 10.3390/APP13042452 (2023). 10.3390/APP13042452 [DOI] [Google Scholar]
- 81.Devi, R. P. & Prabakaran, N. Efficient NOMA system: Hybrid heuristic-based network parameter optimization for spectral and energy efficiency with QoS maximization. J. Opt. Commun.10.1515/JOC-2023-0075/MACHINEREADABLECITATION/RIS (2023). 10.1515/JOC-2023-0075/MACHINEREADABLECITATION/RIS [DOI] [Google Scholar]
- 82.Luo, Y. et al. Throughput optimization for NOMA cognitive relay network with rf energy harvesting based on improved bat algorithm. Mathematics10(22), 4357. 10.3390/MATH10224357 (2022). 10.3390/MATH10224357 [DOI] [Google Scholar]
- 83.Sadiq, A. S., Dehkordi, A. A., Mirjalili, S. & Pham, Q. V. Nonlinear marine predator algorithm: A cost-effective optimizer for fair power allocation in NOMA-VLC-B5G networks. Expert Syst. Appl.203, 117395. 10.1016/J.ESWA.2022.117395 (2022). 10.1016/J.ESWA.2022.117395 [DOI] [Google Scholar]
- 84.Xu, Z., Petrunin, I., Li, T. & Tsourdos, A. Efficient allocation for downlink multi-channel NOMA systems considering complex constraints. Sensors21(5), 1833. 10.3390/S21051833 (2021). 10.3390/S21051833 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Altunbas, Y. & Turk, K. Power allocation with meta-heuristic algorithms for indoor MIMO-NOMA based VLC systems. Wirel. Pers. Commun.136(1), 617–630. 10.1007/S11277-024-11340-8/FIGURES/4 (2024). 10.1007/S11277-024-11340-8/FIGURES/4 [DOI] [Google Scholar]
- 86.Ahmad, T., Chai, R., Adnan, M. & Chen, Q. Low-complexity heuristic algorithm for power allocation and access mode selection in M2M networks. IEEE Internet Things J9(2), 1095–1108. 10.1109/JIOT.2021.3079213 (2022). 10.1109/JIOT.2021.3079213 [DOI] [Google Scholar]
- 87.Van Truong, T. & Nayyar, A. System performance and optimization in NOMA mobile edge computing surveillance network using GA and PSO. Comput. Netw.223, 109575. 10.1016/J.COMNET.2023.109575 (2023). 10.1016/J.COMNET.2023.109575 [DOI] [Google Scholar]
- 88.Younis, A. J. M., Wadday, A. G., Aljaafari, M. A. & Abedi, F. Resource Allocation optimization of NOMA network via metaheuristic algorithms. In IICETA 2022 - 5th International Conference on Engineering Technology and its Applications, 45–51, 10.1109/IICETA54559.2022.9888750 (2022).
- 89.Zhang, J. et al. Optimal and sub-optimal uplink NOMA: Joint user grouping, decoding order, and power control. IEEE Wirel. Commun. Lett.9(2), 254–257. 10.1109/lwc.2019.2951765 (2020). 10.1109/lwc.2019.2951765 [DOI] [Google Scholar]
- 90.Al-Betar, M. A., Awadallah, M. A., Faris, H., Aljarah, I. & Hammouri, A. I. Natural selection methods for Grey Wolf Optimizer. Expert Syst. Appl.113, 481–498. 10.1016/J.ESWA.2018.07.022 (2018). 10.1016/J.ESWA.2018.07.022 [DOI] [Google Scholar]
- 91.Mehmood, K., Chaudhary, N. I., Khan, Z. A., Cheema, K. M. & Raja, M. A. Z. Variants of chaotic grey wolf heuristic for robust identification of control autoregressive model. Biomimetics8(2), 141. 10.3390/BIOMIMETICS8020141 (2023). 10.3390/BIOMIMETICS8020141 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Aala Kalananda, V. K. R. & Komanapalli, V. L. N. A competitive learning-based Grey wolf Optimizer for engineering problems and its application to multi-layer perceptron training. Multimed. Tools Appl.10.1007/S11042-023-15146-X/TABLES/12 (2023). 10.1007/S11042-023-15146-X/TABLES/12 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Yin, S. et al. Reinforcement-learning-based parameter adaptation method for particle swarm optimization. Complex Intell. Syst.10.1007/S40747-023-01012-8/FIGURES/12 (2023). 10.1007/S40747-023-01012-8/FIGURES/12 [DOI] [Google Scholar]
- 94.Gokalp, O., Tasci, E. & Ugur, A. A novel wrapper feature selection algorithm based on iterated greedy metaheuristic for sentiment classification. Expert Syst. Appl.146, 113176. 10.1016/J.ESWA.2020.113176 (2020). 10.1016/J.ESWA.2020.113176 [DOI] [Google Scholar]
- 95.Li, W. & Gu, Y. Improvement of stochastic competitive learning for social network. Comput. Mater. Continua63(2), 755–768. 10.32604/CMC.2020.07984 (2020). 10.32604/CMC.2020.07984 [DOI] [Google Scholar]
- 96.Lu, L., Zheng, H., Jie, J., Zhang, M. & Dai, R. Reinforcement learning-based particle swarm optimization for sewage treatment control. Complex Intell. Syst.7(5), 2199–2210. 10.1007/S40747-021-00395-W/TABLES/4 (2021). 10.1007/S40747-021-00395-W/TABLES/4 [DOI] [Google Scholar]
- 97.Mazyavkina, N., Sviridov, S., Ivanov, S. & Burnaev, E. Reinforcement learning for combinatorial optimization: A survey. Comput. Oper. Res.134, 105400. 10.1016/J.COR.2021.105400 (2021). 10.1016/J.COR.2021.105400 [DOI] [Google Scholar]
- 98.Lin, X., Rommer, S., Euler, S., Yavuz, E. A. & Karlsson, R. S. 5G from space: An overview of 3GPP non-terrestrial networks. IEEE Commun. Stand. Mag.5(4), 147–153. 10.1109/MCOMSTD.011.2100038 (2021). 10.1109/MCOMSTD.011.2100038 [DOI] [Google Scholar]
- 99.Lin, X. et al. 5G new radio: Unveiling the essentials of the next generation wireless access technology. IEEE Commun. Stand. Mag.3(3), 30–37. 10.1109/MCOMSTD.001.1800036 (2019). 10.1109/MCOMSTD.001.1800036 [DOI] [Google Scholar]
- 100.Zhang, Y., Yang, Z., Feng, Y. & Yan, S. Performance analysis of a novel uplink cooperative NOMA system with full-duplex relaying. IET Commun.12(19), 2408–2417. 10.1049/IET-COM.2018.5500 (2018). 10.1049/IET-COM.2018.5500 [DOI] [Google Scholar]
- 101.Doorbash, Z. & Jamshidi, A. Performance analysis of the cooperative uplink-downlink NOMA and OMA scheme in cognitive radio network with max-min relay selection strategy. AEU Int. J. Electron. Commun.142, 154014. 10.1016/J.AEUE.2021.154014 (2021). 10.1016/J.AEUE.2021.154014 [DOI] [Google Scholar]
- 102.Zhang, L. et al. Performance analysis and optimization in downlink NOMA systems with cooperative full-duplex relaying. IEEE J. Sel. Areas Commun.35(10), 2398–2412. 10.1109/JSAC.2017.2724678 (2017). 10.1109/JSAC.2017.2724678 [DOI] [Google Scholar]
- 103.Banafaa, M. K. et al. A comprehensive survey on 5G-and-beyond networks with UAVs: Applications, emerging technologies, regulatory aspects, research trends and challenges. IEEE Access12, 7786–7826. 10.1109/ACCESS.2023.3349208 (2024). 10.1109/ACCESS.2023.3349208 [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The datasets used during the current study are available from the corresponding author upon reasonable request.