Abstract
The rapid proliferation of connected vehicles equipped with both Vehicle-to-Vehicle (V2V) sidelink and cellular interfaces creates new opportunities for real-time vehicular applications, yet achieving ultra-reliable communication without prohibitive cellular costs remains challenging. This paper addresses reliable inter-vehicle video streaming for safety-critical applications such as See-Through for Passing and Obstructed View Assist, which require stringent Service Level Objectives (SLOs) of 50 ms latency with 99% reliability. Through measurements in Seoul urban environments, we characterize the complementary nature of V2V and Vehicle-to-Network-to-Vehicle (V2N2V) paths: V2V provides ultra-low latency (mean 2.99 ms) but imperfect reliability (95.77%), while V2N2V achieves perfect reliability but exhibits high latency variability (: 120.33 ms in centralized routing) that violates target SLOs. We propose a hybrid framework that exploits V2V as the primary path while selectively retransmitting only lost packets via V2N2V. The key innovation is a dual loss detection mechanism combining gap-based and timeout-based triggers leveraging Real-Time Protocol (RTP) headers for both immediate response and comprehensive coverage. Trace-driven simulation demonstrates that the proposed framework achieves a 99.96% packet reception rate and 99.71% frame playback ratio, approaching lossless transmission while maintaining cellular utilization at only 5.54%, which is merely 0.84 percentage points above the V2V loss rate. This represents a cost reduction versus PLR Switching (4.2 GB vs. 28 GB monthly) while reducing video stalls by . These results demonstrate that packet-level selective redundancy enables cost-effective ultra-reliable V2X communication at scale.
Keywords: V2V communication, V2N2V, connected vehicles, hybrid multipath, video streaming, QoE, cellular cost optimization
1. Introduction
As the adoption of connected vehicles accelerates, vehicular communication technologies are reaching a critical inflection point. In 2024, the majority of passenger cars sold globally were equipped with embedded 4G/5G cellular connectivity as standard, and by 2028, over half of newly sold vehicles are expected to include 5G connectivity. In parallel, the standardization of Vehicle-to-Vehicle (V2V) direct communication has progressed steadily: Long-Term Evolution (LTE) Vehicle-to-Everything (V2X) was incorporated into 3GPP Release 14 (2017) and 5G-V2X into Release 16 (2020). Policy initiatives such as the U.S. National Highway Traffic Safety Administration (NHTSA)’s proposed V2V mandate in 2025 are expected to accelerate V2V deployment, resulting in vehicles equipped with both cellular-based V2N (Vehicle-to-Network) and direct-communication-based V2V interfaces [1].
This dual-interface architecture significantly expands the feasibility of various V2X applications requiring real-time data exchange between vehicles. The 5G Automotive Association (5GAA) defines multiple real-time video streaming use cases for inter-vehicle communication, as shown in Table 1. Applications such as See-Through for Passing, Obstructed View Assist, and Non-analyzed Sensor Signal Sharing commonly specify Service Level Objectives (SLOs) of less than 50 ms end-to-end latency with 99% reliability [2].
Table 1.
5GAA-defined real-time video-based V2X use cases (SLO: 50 ms latency, 99% reliability) [2].
| Use Case | Description |
|---|---|
| See-Through for Passing | Streaming real-time video from a leading vehicle to a host vehicle to visualize the road ahead, specifically to support safe passing maneuvers. |
| Obstructed View Assist | Providing alternative views from infrastructure (CCTV) or other vehicles to resolve blind spots caused by obstacles, typically at intersections or driveways. |
| Non-analysed Sensor Signal Sharing | Sharing raw sensor data streams (e.g., LiDAR, Camera) instead of processed object lists. Used to enhance the host vehicle’s sensor fusion and perception reliability. |
Inter-vehicle data sharing for these use cases can be realized through two fundamentally different paths. The first is direct communication via a V2V sidelink, and the second is network-relayed communication via cellular infrastructure, which we refer to as Vehicle-to-Network-to-Vehicle (V2N2V). These two paths exhibit contrasting characteristics. V2V direct communication provides ultra-low latency of a few milliseconds with minimal jitter through single-hop wireless links, but it suffers from fundamental reliability limitations due to wireless interference, signal blockage, and dynamic topology changes. In contrast, V2N2V communication achieves high reliability through managed cellular infrastructure, but it incurs relatively higher end-to-end latency and latency variability under network load [3], and it may impose an economic burden for the sustained exchange of large-volume sensor data over extended periods, depending on the selected data plan.
For the V2N2V path, achieving the 50 ms/99% SLO presents fundamental challenges in conventional cellular deployments. Research by Coll-Perales et al. [3] reports that in centralized multi-Mobile Network Operator (MNO) environments, the 90th percentile latency reaches approximately 58 ms, which is attributed to delays induced by Remote Peering Points for inter-operator traffic exchange and latency variability in the public Internet segment. Our measurements from actual urban driving environments confirm these findings: while the mean latency of 39.40 ms appears acceptable, the reaches 59.86 ms and reaches 120.33 ms, which is more than double the SLO threshold. This latency is fundamentally incompatible with ultra-reliable real-time V2X applications.
To address this limitation, 3GPP standardized Mobile Edge Computing (MEC) architectures specifically for Ultra-Reliable Low-Latency Communication (URLLC) applications in Release 17 [4]. MEC@gNB deployment with local User Plane Function (UPF) breakout reduces end-to-end latency by eliminating remote routing: traffic terminates at the base station edge rather than traversing the core network and Internet. Coll-Perales et al. [3] demonstrate that MEC@gNB reduces latency from 58 ms to 3.6 ms, making the 50 ms/99% SLO achievable. Consequently, this paper assumes MEC-based edge architecture as the target deployment environment and proposes a hybrid data-sharing strategy that exploits the complementary characteristics of V2V and V2N2V in this architecture (Figure 1).
Figure 1.
Communication paths from A to B: V2V vs. V2N2V.
The core idea of this paper is a selective redundancy strategy that utilizes V2V as the low-latency, cost-free primary path while performing selective retransmission via V2N2V only when packet loss occurs on V2V. Unlike existing mode-switching approaches that incur high costs by shifting all traffic to cellular upon V2V link quality degradation, the proposed framework achieves both cost efficiency and reliability by recovering only lost packets through V2N2V.
The main contributions of this paper are as follows:
Measurement-based path characterization: Through real-world urban driving experiments, we quantify the complementary nature of V2V (low latency, imperfect reliability) and V2N2V (high reliability, variable latency) paths.
Hybrid framework design: We introduce an on-demand retransmission strategy that utilizes V2N2V as a backup path only upon packet loss on the V2V link, simultaneously achieving data reliability and minimizing cellular overhead.
Performance validation: Through trace-driven simulation, we demonstrate that the proposed framework achieves 99.96% packet reception rate and 99.71% frame playback success rate while providing approximately 7× cellular cost reduction compared to existing approaches [5].
2. Related Work
Recent studies highlight the fundamental role of V2X communication in establishing robust Intelligent Transportation Systems (ITSs). Comprehensive surveys and taxonomies demonstrate how V2X technologies are evolving to enhance traffic efficiency and safety through interconnected vehicular frameworks [6,7]. In parallel, the adoption of Mobile Edge Computing (MEC) is becoming increasingly critical. Research into future vehicular networks, including 6G and edge-based streaming optimization, validates the necessity of edge-assisted architectures to support bandwidth-intensive and intelligent applications [8,9]. Against this backdrop of evolving infrastructure, ensuring reliable real-time transmission remains a primary challenge. Autonomous and connected vehicles require diverse V2X applications with varying QoS requirements, often demanding high bandwidth, high reliability, and ultra-low latency simultaneously. These multifaceted requirements are difficult to satisfy with a single Radio Access Technology (RAT), motivating extensive research on multi-RAT vehicular networks. Below, we categorize existing approaches by their use of cellular infrastructure.
2.1. Multi-RAT V2V Without Cellular Infrastructure
Sepulcre and Gozalvez [10] proposed a context-aware algorithm where vehicles exchange Channel Busy Ratio (CBR) information and select the RAT with the lowest load among those satisfying reliability requirements, with hysteresis to prevent frequent switching. Jacob et al. [11] demonstrated that redundant transmission over IEEE 802.11p [12] and LTE-V2X PC5 simultaneously can significantly extend the communication range and reception success rate even under high congestion. Yacheur et al. [13,14] applied Deep Reinforcement Learning to determine the optimal transmission mode (single, redundant, or split) per message in ITS-G5/LTE-PC5 environments.
However, these approaches rely solely on V2V direct paths, which have inherent reliability limitations under challenging propagation conditions, motivating our hybrid approach utilizing V2N2V as a supplementary backup.
2.2. Cellular-Supported Multi-RAT Without Cost Consideration
Jacob et al. [15] proposed a Hybrid Communications Management layer for DSRC and LTE-V2X (PC5 and Uu), defining communication profiles as the selection unit and enabling adaptive multiprofile transmission for throughput or reliability enhancement. Albonda et al. [16] proposed dynamic selection between Cellular (Uu) and Sidelink (PC5) based on SINR and resource availability at the cluster level with spatial resource reuse between distant clusters. González et al. [17] presented a Multi-RAT Dual Connectivity architecture where vehicles connect to both LTE and NR base stations, using PC5 and Uu independently or in parallel for reliability enhancement through packet duplication.
2.3. Cellular-Supported Multi-RAT with Cost Consideration
Since commercial cellular networks incur usage-based costs, several studies explicitly minimize cellular infrastructure usage.
Brahim et al. [5] activated cellular only when the DSRC packet loss exceeds a threshold with minimum retention time and hysteresis for stable switching. Hui et al. [18] modeled inter-vehicle cooperation as a Coalition Formation Game, deriving Nash-stable access strategies that minimize download costs while suppressing unnecessary cellular usage. Mir et al. [19,20] proposed QoS-aware RAT selection that first adjusts the beacon rate during DSRC congestion and utilizes LTE only when the QoS remains unsatisfied. Altahrawi et al. [21] combined LSTM-based RSSI prediction with an analytic hierarchy process for service-oriented RAT selection optimizing efficiency and cost. Mir et al. [22] proposed traffic steering maintaining DSRC as the default and offloading to LTE only during congestion, combining uplink unicast and downlink eMBMS for cellular efficiency. Khalid et al. [23] proposed a decision tree-based selection of optimal technology and transmission frequency based on context (distance, latency, speed, traffic density). Chowdhury et al. [24] prioritized WiFi and DSRC-based multi-hop V2V cooperation, restricting cellular to minimum representative vehicles for CDN-based live streaming. Bréhon-Grataloup et al. [25] combined location-based QoS grids with PC5 relay for connection survivability, minimizing cellular fallback in urban NLOS environments.
Table 2 summarizes these multi-RAT multipath approaches.
Table 2.
Comparison of Multi-RAT V2X communication strategies.
| Prior Work | Assumed RATs | Multi-RAT Usage | Cellular Use Minimization | Cellular Usage Logic | Target Usecase |
|---|---|---|---|---|---|
| Sepulcre [10] | DSRC, WiFi, TVWS | Select | N/A (No Uu) | N/A | Cooperative Perception |
| Jacob [11] | DSRC, PC5 | Duplicate | N/A (No Uu) | N/A | Platooning |
| Yacheur [13,14] | DSRC, PC5 | Select, Duplicate, Split | N/A (No Uu) | N/A | Platooning, See-through |
| Jacob [15] | DSRC, PC5, Uu | Select, Duplicate, Split | No | Use cellular (Uu) when selected communication profile requires Uu either alone or in combination with V2V | Connected Autonomous Driving |
| Albonda [16] | PC5, Uu | Select | No | Select Uu when uplink/downlink SINR exceeds the sidelink SINR if sufficient resource is available | Safety Messaging |
| González [17] | PC5, Uu | Select, Duplicate | No | Use Uu for centralized sidelink resource scheduling and V2N services or when redundantly transmitting data alongside PC5 | Advanced V2X Services |
| Brahim [5] | DSRC, Uu | Select | Yes | Use Uu only when DSRC PLR crosses threshold; revert after n good windows | Real-time V2V video |
| Hui [18] | DSRC, Uu | Select | Yes | Select Uu if the cellular link results in a lower total cost than V2I/V2V communication | On-Demand Content Downloading |
| Mir [19,20] | DSRC, Uu | Select | Yes | Switch to Uu only when DSRC congestion persists despite rate control allowed by QoS requirements | Periodic Safety Messaging |
| Altahrawi [21] | DSRC, WiFi, Uu | Select | Yes | Use Uu only to maintain service continuity when RSU-based DSRC/WiFi links are unavailable or insufficient to meet QoS requirements | V2I Download |
| Mir [22] | DSRC, Uu | Select | Yes | Offload to Uu only when DSRC becomes congested while the cellular network has sufficient capacity | Periodic Safety Messaging |
| Khalid [23] | DSRC, PC5, Uu | Select, Duplicate | Yes | Use Uu if DSRC/PC5 cannot satisfy communication requirements | Periodic Safety Messaging |
| Chowdhury [24] | WiFi, DSRC, Uu | Select | Yes | Use WiFi/DSRC first; cellular only if no alternative source can deliver content | CDN-Based Live Video Streaming |
| Bréhon [25] | DSRC, PC5, Uu | Select | Yes | Select Uu as a fallback when DSRC/PC5 fail to meet predefined latency bound | Periodic Safety Messaging |
(1) RATs: DSRC = IEEE 802.11p; PC5 = 3GPP sidelink interface; Uu = 3GPP cellular uplink/downlink interface; TVWS = TV white space spectrum. (2) Multi-RAT Usage: Select = using only a single RAT at a time; Duplicate = transmitting identical packets over multiple RATs; Split = distributing data streams or packets across different RATs. (3) Cellular Use Minimization: N/A = no cellular path involved; No = cellular is used without cellular usage minimization; Yes = implicit or explicit cellular usage minimization or cost optimization.
3. Problem Description
This section examines the characteristics of V2N2V paths that connected vehicles can provide compared to V2V paths. Understanding the characteristics of these two paths, which will be used in combination, is essential for finding reliable and economically rational solutions for V2V applications, which is our goal. This will also provide an objective perspective on the various multipath combination approaches we reviewed in the previous section.
3.1. Characteristics of V2V vs. V2N2V Paths
V2V and cellular network communication (V2N2V) paths exhibit distinct latency and reliability characteristics. To quantitatively characterize V2V and V2N2V paths, we conducted measurement experiments for each path during actual driving.
3.1.1. Experimental Setup
Driving experiments were conducted in Seoul’s inner city, which represents typical urban driving environments.
Equipment and Configuration
The two vehicles participating in the experiments were equipped with vehicular communication devices. Figure 2 shows the configuration and placement of this equipment. Each vehicle had a 5G modem for Uu interface and a sidelink on-board unit (OBU) for the PC5 interface, respectively. They were connected to a laptop for data processing along with a front-facing camera. All communication equipment was placed inside the cabin except the V2V antenna that was mounted on the roof.
Figure 2.
Experimental setup for real-world driving experiments. (a) Devices configuration: antennas, PC5 modems, and Uu modems; (b) V2V antenna installation; (c) In-vehicle devices installation.
V2V Link Setup: For the V2V communication measurement, we used Ettifos’ Software-Defined Radio (SDR)-based 5G NR OBU. Table 3 summarizes the NR Sidelink Mode 2 configuration used in our experiments. We employed a non-periodic resource allocation scheme by setting the Resource Reservation Period (RRP) to zero, which results in dynamic resource selection for each transmission. This configuration is well suited for aperiodic video traffic, as it allows the transmitter to flexibly select resources in response to irregular frame arrivals unlike semi-persistent resource reservations with fixed periodicity. Furthermore, in the current real-world driving environment, neighboring vehicles rarely perform concurrent V2X transmissions, making resource collisions negligible. Therefore, while more advanced resource allocation and coordination mechanisms will be necessary in future high-density scenarios, the adopted configuration is sufficient for evaluating the intrinsic performance characteristics of the V2V link.
V2N2V Path Setup: For V2N2V communication measurement, we employed 5G modems from two different mobile operators (Korea Telecom and LGU+). In commercial cellular network environments, direct Internet Protocol (IP) communication between terminals is restricted due to Network Address Translation (NAT) structures where terminals (i.e., 5G modems) are given private IP addresses. Accordingly, to enable end-to-end communication between the two 5G modems, we configured a communication path via a Traversal Using Relays around NAT (TURN) server on the Internet.
Table 3.
NR Sidelink Mode 2 configuration parameters.
| Parameter | Value |
|---|---|
| Center Frequency | 5875 MHz |
| Bandwidth | 20 MHz |
| Subcarrier Spacing (SCS) | 15 kHz () |
| Subchannel Size | 15 RBs |
| Number of Subchannels | 7 |
| Resource Reserve Period (RRP) | 0 (Dynamic resource selection) |
| MCS Table/Index | 64QAM/26 |
| Max HARQ Transmissions | 4 |
| PSFCH Period | 1 slot |
| Tx Power | 20 dBm |
Video Traffic Coding and Transmission
In the measurements, real-time video with a resolution of at 30 frames per second (fps) was encoded using the H.264 codec, implemented via FFmpeg, and transmitted via the Real-time Transport Protocol (RTP). To satisfy real-time constraints, the encoder was configured to disable B-frames, set the Group of Pictures (GoP) size to 30, and enable zero-latency mode.
Since NR Sidelink resource allocation (Mode 2) has fixed periodicity, transmitting frame-level video traffic as is cannot be naturally accommodated by the Sidelink. It is because a single video frame has to be transmitted in multiple RTP packets due to the relatively small sidelink resource sizes. Consequently, an instantaneous packet burst occurs upon every video frame arrival. To cope with this issue, we set the resource period to 1 ms (aligned with the 1 ms slot structure in Numerology ) and applied packet pacing that limited the RTP packet transmission interval to 1 ms. This approach is essential for stably loading traffic onto the sidelink channel despite variable video frame sizes. Additionally, Hybrid Automatic Repeat Request (HARQ) functionality was enabled in sidelink transmission, performing up to three retransmissions excluding the initial transmission.
3.1.2. V2V Path: Low Latency and Imperfect Reliability
V2V communication achieves low latency through direct transmission. Our urban driving experiments show that while the V2V path delivers ultra-low latency with minimal jitter, it falls short of the reliability requirements for our target applications.
Low Latency
The cumulative distribution function (CDF) in Figure 3a visually demonstrates the excellent latency characteristics of the V2V path. Despite using SDR equipment, the mean latency was only 2.99 ms, and the 99th percentile () was merely 3.65 ms. The standard deviation representing latency variability (jitter) was also very low at 0.32 ms, showing stable latency performance. The maximum latency of 12.30 ms observed in the tail region is attributed to HARQ retransmission operations to recover transient errors.
Figure 3.
Measured characteristics of communication paths: (a) V2V latency, (b) inter-vehicle distance, (c) V2V burst loss length, and (d) V2N2V latency.
Reliability Limitations
In contrast, data reliability revealed problems. As shown in Figure 3b, the two vehicles maintained close proximity, ensuring conducive wireless channel conditions, with an average distance of 17.19 m (standard deviation 10.99 m) throughout the experiment. Despite this highly favorable communication environment where the 99th percentile () remained within 59.98 m, the Packet Reception Ratio (PRR) was only 95.77%. This residual loss occurred despite the HARQ retransmission mechanism operating up to three times, suggesting that V2V links alone cannot satisfy the ultra-reliability required by target services. The experimental results demonstrate that V2V communication reliability must be conservatively estimated depending on environmental factors such as complex urban structures and vehicle blocking.
3.1.3. V2N2V Path: High Reliability and Large, Variable Latency
The V2N2V path available to connected vehicles has high reliability through the managed cellular infrastructure. However, the longer path results in higher latency compared to V2V, and latency jitter is also inevitably high due to dynamic resource scheduling influenced by variable cell user counts and loads.
High Reliability
In inter-vehicle video transmission experiments using 5G modems, a total of 243,597 RTP packets transmitted during approximately 19 min of driving achieved a 100% reception rate without a single loss. This demonstrates that commercial 5G cellular networks can provide high reliability even for video data communication.
Latency Limitations
Despite high reliability, the V2N2V path exhibits clear latency limitations. As shown in Figure 3d, the mean latency of 39.40 ms appears to satisfy the SLO (50 ms). However, the tail distribution reveals a different picture: the 90th percentile () reaches 59.86 ms, and the 99th percentile () reaches 120.33 ms. This indicates that achieving our target requirement, latency within 50 ms with 99% reliability is structurally infeasible. The exceeds the threshold by 2.4 times, and even already surpasses 50 ms. In other words, more than 10% of packets experience latency spikes beyond the allowable limit, demonstrating that V2N2V as is cannot guarantee ultra-low latency.
This finding aligns with prior research. Coll-Perales et al. [3] report that the tail latency induced by jitter in 5G end-to-end communication is substantial, making it practically difficult to ensure that more than 99% of packets arrive within 50 ms, especially in multi-MNO environments or centralized cloud architectures. Our measured latency of 120.33 ms confirms this limitation. The experimental setup, where 5G modems from different operators (Korea Telecom, LGU+) communicated via a TURN server on the public Internet, represents a typical multi-MNO deployment without edge computing support. In such centralized architectures, inter-vehicle packets traverse a lengthy path: vehicle A → gNB → core network → Internet peering point → core network → gNB → vehicle B, accumulating substantial queuing and propagation delays. Therefore, in this paper, we assume an MEC-based architecture where the application server is deployed at the network edge, significantly reducing the V2N2V path latency to complement V2V reliability limitations. Table 4 shows that the MEC@gNB architecture can achieve the latency reduction necessary for ultra-reliable V2X applications.
Table 4.
V2N2V path latency: measured vs. standard architectures.
Fortunately, MEC deployment at gNB with local UPF breakout, standardized in 3GPP Release 17 [4], eliminates this lengthy routing path. Therefore, for performance evaluation in Section 5, we will adopt the MEC@gNB latency model from Coll-Perales et al. [3] as our target deployment architecture. Specifically, we utilize the multi-MNO scenario provided by this model, which incorporates local peering latency, to ensure a realistic evaluation of inter-operator connectivity.
It is important to note that in our framework, the MEC node serves solely as a low-latency relay point for packet forwarding rather than a computation platform; thus, edge computing power is not a critical factor. Consequently, we posit MEC coverage as a structural prerequisite for the V2N2V path, as standard cellular routing cannot meet the strict latency constraints of the target application.
3.2. Implications of V2V-Only Transmission on QoE
By analyzing the V2V trace, this section considers the QoE implication of using the V2V path alone without the hybrid V2V+V2N2V framework proposed in this paper. V2V packet loss does not directly translate to QoE degradation in real-time video transmission. Structured codecs like H.264 exhibit packet-to-frame dependency, inter-frame dependency, and error propagation, which combine to amplify single packet losses nonlinearly.
3.2.1. Packet-to-Frame Dependency
A single video frame encoded with H.264 is fragmented into multiple RTP packets for transmission. Since all packets constituting a frame must be received for successful decoding, even a single packet loss causes decoding failure of the entire frame. In our V2V measurement trace, the 4.23% packet loss rate was amplified to 8.83% frame loss rate: only 41,708 out of 45,464 video frames could be decoded.
3.2.2. Inter-Frame Dependency
H.264’s inter-frame prediction achieves high compression efficiency but introduces structural vulnerability to data loss. When an I-frame or preceding P-frame is lost, subsequent P-frames referencing them become undecodable even if their own packets arrive intact. Of the 41,708 decodable frames, only 38,745 were playable due to reference dependencies, yielding a frame loss rate of 14.78%, which is 3.5 times the original packet loss rate. During the 26-minute experiment, 1522 playback stalls occurred, averaging one interruption per second.
3.2.3. Error Propagation
Even when decoding succeeds, error propagation causes continuous quality degradation. Regions restored through error concealment propagate residual errors to subsequent frames via H.264’s motion compensation, causing artifacts to persist until a new I-frame refreshes the Group of Pictures (GoP). The V2V-only path recorded an average SSIM of 0.8625, which is classified as poor quality [26].
3.3. Summary
Using V2V alone, the 4.23% packet loss rate was amplified to 14.78% frame loss with 1522 playback stalls and an average SSIM of 0.8625. We need to resolve this problem for the targeted applications. After the proposed solution approach to the problem is presented in Section 4, the V2V-only results will serve as the baseline for evaluation in Section 5.3.
4. Solution Approach
This section proposes a practical strategy that utilizes both V2V and V2N2V paths to address the problems raised in the previous section while reflecting the characteristics of both paths observed in Section 3.1 and minimizing monetary costs by limiting the use of the paid V2N2V path to essential cases only.
4.1. V2V Data Transmission Augmented with V2N2V Retransmission
The core idea of the proposed framework is to use V2V as the main data path and parsimoniously use the V2N2V path for only retransmission requests and reply on the packet level. Since the framework operates at the application layer, it is agnostic to the underlying RAT, aligning with the ITS station architecture [27], which is designed to accommodate heterogeneous access technologies (e.g., IEEE 802.11p for V2V and 3GPP Uu for V2N).
4.1.1. Overall Workflow
The detailed operational procedure of the proposed framework is described in Figure 4. The algorithm consists of dual-path management at the sender side and loss detection and request logic at the receiver side.
Figure 4.
Flow chart of the proposed algorithm; B: packet buffer at sender side, R: set of sequence numbers to be retransmitted, H: set of sequence numbers of packets already requested for retransmission, : last successfully received sequence number, : intra-frame retransmission timeout, : inter-frame retransmission timeout.
At Sender: When a video frame is generated, the sender terminal encodes it, packetizes it into RTP packets, transmits via V2V, and simultaneously stores in local buffer (B) for retransmission handling. When retransmission requests (R) are received via V2N2V, only the corresponding packets are transmitted via the V2N2V path by referencing the buffer.
At Receiver: The receiver terminal operates based on packet reception events and timeout events. When a packet is received, the RTP sequence number is examined to verify continuity with the previously received packet. When a gap in sequence numbers is detected, the missing sequence numbers are determined as lost packets and added to set R, and retransmission is immediately requested via the V2N2V path. Simultaneously, the RTP Marker bit of the received packet is checked to identify frame boundaries, and retransmission timers are set according to the Marker value. When the Marker is 0, the timer is set to respond to intra-frame packet loss; when the Marker is 1, the timer is set for inter-frame loss detection with each timer being reset and restarted accordingly. When timeout occurs, packets that have not yet arrived after the last successfully received sequence number are considered lost and added to R, and retransmission is requested.
4.1.2. Loss Detection Mechanisms
This framework applies two complementary trigger mechanisms utilizing RTP header information for rapid and accurate packet loss detection. They are illustrated in Figure 5.
Figure 5.
Loss detection mechanisms: (a) gap-based trigger, (b) intra-frame timeout, and (c) inter-frame timeout.
Gap-Based Trigger
The most immediate loss detection method is examining the RTP sequence number continuity. When a gap () occurs between the previously received packet () and the current packet (), the receiver determines packets as lost and immediately requests retransmission. Indeed, our 26-minute V2V trace (173,673 packets) confirmed zero out-of-order arrivals, with 100% of sequence numbers arriving in monotonically increasing order with occasional losses. This is expected in the V2V sidelink due to single-hop transmission without intermediate routing nodes and MAC-layer HARQ preserving the original sequence order. Even if sporadic reordering were to occur, a consequent false gap detection and retransmission request would only cause a duplicate packet delivery, which would be safely discarded. Figure 5a shows an example.
Timeout-Based Trigger
Since gap-based detection requires subsequent packet arrivals, it cannot identify loss when no packets arrive for some duration. To address this, we use parallel timeout triggers with values differentiated by the RTP Marker bit, which indicates the last packet of a video frame.
Intra-frame Timeout (): When the Marker bit is 0, this indicates that constituent packets of the currently receiving frame still remain. Since the transmission interval between packets within a frame is very short (1 ms), a short timeout () is applied to respond quickly to intra-frame loss. Figure 5b shows an example of this case.
Inter-frame Timeout (): When the Marker bit is 1, this indicates that the transmission of the frame is complete. Since the next packet will not arrive until the next video frame is generated, a relatively long timeout () considering the video frame rate is applied to prevent unnecessary retransmission requests. That is, expires only when it can be determined that the first packet of the next frame has not arrived. Figure 5c shows an example of this case.
4.1.3. Request Optimizations
To minimize unnecessary cellular usage, we apply two optimizations:
The receiver maintains a request history set (H) to filter duplicate requests. Packets already in H do not generate additional requests even if detected as lost again by subsequent triggers.
Since the exact length of the loss interval cannot be known in advance in timeout-based retransmission requests, packets after that were not actually lost may be requested. To mitigate this problem, the number of retransmitted packets is limited in timeout-based retransmission requests.
4.1.4. Other System Details
The sender maintains a packet buffer B for selective retransmission. Given the 50 ms SLO and 1 ms pacing interval, at most 50 packets (≈75 KB) need to be buffered. Packets are evicted once they exceed the SLO window. At the receiver, packets whose end-to-end network latency exceeds 50 ms are regarded as invalid and are discarded by comparing the packet transmission timestamp with the final reception time. When duplicate packets are received, the receiver consistently uses the first-arriving packet and discards subsequent copies.
Each retransmission request consists of IP/UDP headers (28 bytes) and a payload specifying the starting RTP sequence number and count. Gap-based requests may include an optional exclusion field for packets already requested; timeout-based requests always specify contiguous ranges. The request overhead is negligible: for instance, over a 26-min experiment, gap-based detection generated 307.69 KB of request traffic (0.16% of video data), while the gap-and-timeout triggers generated 334.12 KB (0.18%).
5. Performance Evaluation
This section develops an analytical model for the proposed hybrid V2V+V2N2V recovery scheme, validates it through trace-driven simulation, and analyzes system performance comprehensively.
5.1. Analytical Framework
5.1.1. System Model
We consider a video stream encoded using H.264/AVC with a Group of Pictures (GoP) structure. Each GoP consists of one I-frame followed by P-frames. The I-frame serves as a reference and can be decoded independently, while each P-frame depends on all preceding frames in the GoP for successful decoding. This dependency structure implies that if frame k fails to decode, all subsequent frames in the same GoP become unplayable. Let and denote the average number of packets in an I-frame and P-frame, respectively. Packets are transmitted with a pacing interval of ms.
Table 5 lists the baseline parameter values used throughout the analysis.
Table 5.
Base parameter set.
| Symbol | Description | Value |
|---|---|---|
| Video Parameters | ||
| I-frame size | 20.41 packets | |
| P-frame size | 2.69 packets | |
| G | GoP size | 30 frames |
| Frame interval | 33.33 ms | |
| Channel Parameters | ||
| p | Packet loss rate | 0.0423 |
| Burstiness factor | (fitted) | |
| q | Burst termination rate | 0.656 |
| Timing Parameters | ||
| Latency deadline | 50 ms | |
| Inter-frame timeout | 60 ms | |
| Intra-frame timeout | 2.5 ms | |
| Packet pacing interval | 1.0 ms | |
| Recovery Parameters | ||
| N | Max packets per request | 3 |
| Min V2N2V delay | 2.5 ms | |
| Log-normal location | ||
| Log-normal scale | 0.68 | |
5.1.2. Channel Error Model
Vehicular wireless channels exhibit bursty error patterns due to multipath fading, shadowing, and interference. Under an independent and identically distributed (IID) packet loss model with loss probability p, the frame failure probability would be
| (1) |
However, vehicular channels typically exhibit significantly more bursty behavior than the IID model predicts. When losses are bursty, multiple packet losses concentrate within the same frame rather than spreading across different frames. This concentration effect reduces the number of frames that experience failure compared to IID losses. To capture this burstiness, we introduce a burstiness factor :
| (2) |
Other channel parameters are directly measurable from the V2V trace:
- Packet loss rate p is determined from the measured PRR:
(3) The burst termination rate q is estimated from the burst length CDF in Figure 3c, which shows that 95.92% of bursts contain three or fewer packets. Assuming a geometric distribution for burst length, we obtain , corresponding to an expected burst length of packets.
5.1.3. V2N2V Backup Path Model
We model the V2N2V round-trip time as a shifted log-normal distribution:
| (4) |
The probability that V2N2V recovery completes within budget B is
| (5) |
where is the standard normal CDF.
5.1.4. Frame Playback Rate Analysis
V2V-Only Baseline
In the V2V-only scheme, a frame is successfully decoded only if all its packets arrive. The frame decode probabilities are
| (6) |
| (7) |
Due to the GoP dependency structure, frame k is playable only if all preceding frames are decoded. The frame playback rate is
| (8) |
Proposed Scheme
The proposed scheme adds V2N2V recovery when packet loss is detected on the V2V path. A frame is successfully decoded if either (a) no packet loss occurs, or (b) all lost packets are successfully recovered via V2N2V within the SLO deadline.
Detection Mechanism Interaction: A critical design constraint arises from the interaction between loss detection mechanisms. With intra-frame timeout ms and packet pacing interval ms, the detection trigger depends on burst length m:
: The next packet arrives within ms , triggering gap-based detection. All m lost packets are identified and requested.
: The timeout fires at ms before the next packet arrives at ms, triggering timeout-based detection. Only packets are requested.
This threshold determines whether full or partial recovery is possible.
Burst Length Distribution: Based on the analysis of the real-life V2V trace, we assume that the burst length follows a geometric distribution with termination probability :
| (9) |
Key probabilities:
| (10) |
| (11) |
| (12) |
Recovery Analysis by Scenario: We analyze recovery success based on the burst length and position within the frame.
Scenario 1: Gap-based detection ( and ). When burst length and the burst does not extend to the frame boundary (), the next packet (index ) arrives within ms, triggering gap-based detection before ms expires. The recovery budget is
| (13) |
For I-frames (), the worst case yields ms. For P-frames (), the worst case yields ms. Both provide sufficient time:
| (14) |
Scenario 2: Intra-frame timeout (, mid-frame). When and the burst occurs mid-frame (, ), triggers after packet . Only packets are requested. The recovery budget is
| (15) |
Recovery succeeds only if both conditions hold: (a) V2N2V completes within budget, and (b) burst length :
| (16) |
With , only 65.6% of timeout-triggered bursts are recoverable.
Scenario 3: Tail loss (, ). When the burst extends to the frame boundary, triggers. The tail loss size is . Recovery requires :
| (17) |
Only positions where allow recovery.
Scenario 4: Frame-start burst (). When the burst begins at the first packet, inter-frame timeout triggers after the previous frame completes. The detection delay is
| (18) |
The recovery budget becomes
| (19) |
For frames following an I-frame (),
| (20) |
For frames following a P-frame (),
| (21) |
Recovery requires both V2N2V completion and :
| (22) |
Combined Recovery Probability: Let denote the probability of gap-based detection. For a frame of size , the overall recovery probability combines all scenarios weighted by their occurrence probabilities.
For mid-frame bursts ():
| (23) |
Substituting and ,
| (24) |
With baseline parameters yielding :
| (25) |
The overall recovery probability for a frame averages over burst start positions:
| (26) |
Frame Playback Rate: The frame decode probability with recovery is
| (27) |
where is the frame failure probability.
For I-frames and P-frames:
| (28) |
| (29) |
Then, Equation (8) can be used to compute the FPR for the proposed scheme.
Dominant Failure Modes The analysis identifies two primary failure modes:
Long bursts (): When and timeout-based detection triggers, only three packets are recovered. The remaining packets are lost, causing frame failure. This occurs with probability of all bursts.
Tight timeout budget after I-frames: The recovery budget ms is marginal for a V2N2V round-trip, reducing the recovery success probability for frame-start bursts following I-frames.
5.1.5. Sensitivity Analysis
We analyze how FPR varies with key parameters. Figure 6 shows the results, where is set to the fitted value determined in Section 5.4.
Figure 6.
FPR sensitivity to system parameters; red dashed: V2V-only baseline. Green solid: proposed scheme. (a) FPR vs. p; (b) FPR vs. ; (c) FPR vs. ; (d) FPR vs. ; (e) FPR vs. G; (f) FPR vs. N.
Packet Loss Rate (p)
The packet loss rate has the strongest impact on V2V-only performance. As p increases from 0.01 to 0.15, the V2V-only FPR drops from 0.95 to below 0.55. In contrast, the proposed scheme maintains FPR above 0.97 across the entire range. This demonstrates the robustness of V2N2V backup: even when the direct V2V path experiences significant degradation, the cellular backup path successfully recovers most lost frames.
Minimum V2N2V Delay ()
The FPR curve for exhibits three distinct drop regions, each corresponding to a critical recovery budget threshold. To understand this behavior, we first identify the three recovery budgets in the system:
ms: budget for frame-start bursts following an I-frame;
ms: budget for frame-start bursts following a P-frame;
ms: budget for mid-frame bursts.
Recovery via V2N2V is possible only when , since the minimum round-trip time cannot exceed the available budget. As increases, it sequentially exceeds each threshold:
First drop ( ms). When exceeds ms, frame-start burst recovery becomes impossible for frames following an I-frame. This is the tightest constraint because I-frames are large ( packets), causing the inter-frame timeout to trigger late:
| (30) |
This leaves only ms for recovery. The FPR drop at this point is relatively small because frame-start bursts are infrequent (probability for each frame).
Second drop ( ms). When exceeds ms, frame-start burst recovery also fails for frames following P-frames. Since P-frames are smaller ( packets), ms provides more recovery time. Beyond this threshold, all frame-start bursts become unrecoverable.
Third drop ( ms). When exceeds ms, even mid-frame burst recovery fails. Since mid-frame bursts constitute the majority of loss events (probability ), this causes a dramatic FPR collapse to the V2V-only level (≈0.85).
The multi-step degradation pattern reveals the relation between the network latency and application timing constraints with I-frame-related recovery being the most vulnerable due to the large I-frame size.
Latency Deadline ()
The FPR curve for exhibits a distinctive three-stage staircase pattern as the deadline increases. This behavior arises because each recovery budget becomes viable (i.e., exceeds ) at a different threshold. Beyond ms, all three recovery budgets comfortably exceed , and for all scenarios. The FPR curve saturates at approximately 0.993, which is limited only by the residual probability of
Bursts longer than packets (undetectable within the request window);
V2N2V delays exceeding the available budget (tail of the log-normal distribution).
Design implications. The staircase pattern reveals that requirements are driven primarily by the I-frame bottleneck. The baseline value of ms is positioned just above the third threshold (48.6 ms), ensuring all recovery mechanisms are active with minimal margin.
For applications requiring lower latency, the analysis suggests that
ms is sufficient for most recovery scenarios (Stages 1–2), achieving FPR ;
ms is necessary for full recovery capability (all three stages), achieving FPR .
Inter-Frame Timeout ()
The inter-frame timeout is designed to detect frame-start losses: cases where the first packet of a new frame is lost, leaving no gap-based detection opportunity. However, even without any loss, there is a natural delay between the last packet of frame k and the first packet of frame :
| (31) |
where ms is the frame interval, is the encoder processing variability, and is the network jitter. Our V2V trace measurements show that the 99th percentile of inter-frame packet arrival intervals is 59.08 ms. If is set below this value, the timeout will trigger even when no loss has occurred, causing false-positive retransmission requests.
In Figure 6d, the sharp transition at –80 ms occurs when approaches ms. At ms, ms , making frame-start burst recovery impossible for all frame types. Beyond 80 ms, FPR stabilizes at a reduced level because only mid-frame recovery (with budget ms, unaffected by ) remains functional. Beyond 80 ms, FPR stabilizes at a reduced level because only mid-frame recovery remains functional. The system does not collapse to V2V-only performance because provides sufficient margin for the majority of burst events.
GoP Size (G)
The FPR curves for G reveal a contrast: while V2V-only FPR degrades significantly as G increases, the proposed scheme maintains nearly constant performance. In Equation (8), the summation represents the expected number of consecutive decodable P-frames before a failure. Its behavior depends critically on how close is to unity:
If : The sum converges to as , making FPR ;
If : The sum , making FPR (independent of G).
With V2N2V recovery, the P-frame decode probability reaches . At this level, the term remains close to unity even for large G, so
| (32) |
This explains why the proposed scheme’s FPR curve is nearly flat, decreasing only slightly from 0.996 at to 0.989 at .
Without recovery, however, we have
| (33) |
As G increases, the summation saturates at . Thus,
| (34) |
This inverse relationship explains the steady decline from 0.92 at to 0.77 at . It shows the importance of trying to maintain the decodability of P-frames as close to unity as possible through the V2N2V retransmissions.
N (Max Packets per Request)
The parameter N affects recovery whenever the receiver cannot determine the exact burst length through gap-based detection. They are long mid-frame bursts (), long end-of-frame bursts (), and all frame-start bursts. We assumed that the burst length follows a geometric distribution with termination probability . Consequently, saturates quickly. By , the probability already reaches 0.959, leaving only 4% of bursts unrecoverable. Additional increases yield marginal gains: captures 99.5%, and captures effectively 100%. Moreover, 88% of bursts are short enough for gap-based detection (see Equation (10)), and only 12% require timeout-based recovery where N is relevant. Combined with the rapid saturation of , this explains why FPR sensitivity to N diminishes quickly.
5.1.6. Discussion on Analytical Model
The sensitivity analysis reveals five key insights about the proposed scheme:
Robustness to channel degradation: The proposed scheme maintains high FPR even when the packet loss rate increases significantly, which is valuable for vehicular environments where channel conditions vary rapidly.
Parsimonious retransmission suffices: Even with , FPR reaches 0.99, demonstrating that selective packet-level recovery is highly cost-effective.
Edge support is essential: The proposed scheme outperforms V2V-only only when ms, with full recovery requiring ms, necessitating MEC@gNB deployment.
Coding efficiency is preserved: Unlike V2V-only where FPR , the proposed scheme maintains stable FPR even at , enabling aggressive compression.
Timing constraints are coupled: The parameters , , and are interdependent; e.g., ms or ms can preclude recovery for certain burst scenarios.
5.2. Trace-and-Model-Driven Simulation
We conducted trace-and-model-driven simulation based on V2V communication traces from Section 3.1, which are collected in actual urban driving environments. Figure 7 shows the simulator architecture. It emulates dynamic V2V packet loss patterns by replaying the collected traces. When a V2V packet loss is detected, the MEC@gNB delay model [3] is consulted to determine the V2N2V latency for retransmission, reflecting a target deployment scenario aligned with current 3GPP standardization for URLLC-oriented V2X services. This hybrid approach taken in this paper is necessitated by the unavailability of MEC-based V2N2V traces.
Figure 7.
Trace-and-model-driven simulation architecture.
The simulator applies the logic presented in Section 4.1 to each packet arrival event to determine whether loss was detected by sequence number gap or timeout. When loss is detected, two delay intervals occur in the recovery process via the V2N2V path: (1) required for the retransmission request to reach the sender via the V2N2V path, and (2) required for the requested packet to be retransmitted and reach the receiver. To reflect this, the simulator sequentially extracts two delay samples from the MEC@gNB delay model and adds them. That is, the final packet arrival time is calculated as
where we set when there is no loss. Then, the simulator checks whether each packet’s arrival time () satisfies the SLO (50 ms). When all packets belonging to the same frame are received within the SLO, the resulting frame is input to the codec to attempt decoding. If the decoding is successful for final playback, QoE evaluation is performed.
5.2.1. Compared Strategies
We compared and analyzed the following four transmission strategies:
V2V Only (Baseline): transmission via a V2V single path only without any retransmission mechanism.
Gap Only: strategy that detects only discontinuity (gap) in received sequence numbers to request retransmission of lost packets.
Gap+Timeout (Proposed): the proposed framework using both a gap trigger and timeout trigger.
PLR Switching [5]: strategy that switches the transmission path based on application SLO (loss rate < 1%). When the V2V path Packet Loss Ratio (PLR) exceeds the threshold, the entire path switches to V2N2V, and it returns to V2V when the V2V path PLR stabilizes for a certain period. It is a hysteresis-based approach, which maintains cellular connectivity for a minimum retention time after switch.
Among cellular-supported multi-RAT approaches with cost consideration, we compare against Brahim et al. [5] as it is the only prior work addressing cost-effective inter-vehicle real-time video streaming. Some approaches target disparate application settings: Internet content download with inter-vehicle cooperation [18,24], periodic beacon transmission [19,20,22,23,25], and V2I-based download scenarios [21]. Others take approaches that are not easily comparable, such as exploiting a location-based QoS map [25] or entirely offloading traffic to cellular upon DSRC congestion [22].
5.2.2. Parameters Configuration
Timeout-based loss detection and retransmission parameters were set based on the statistics from the V2V communication traces.
Intra-frame timeout (): V2V experiment results showed that the of inter-packet arrival intervals within a frame was 2.11 ms. It reflects the 1 ms packet pacing interval with jitter. Considering this, ms was used. In real-world deployments, this parameter can be flexibly configured by considering the expected packet inter-arrival time determined by the resource allocation strategy and adding a margin for network jitter, which varies depending on the specific hardware performance and the driving environment.
Inter-frame timeout (): In a 30 fps video streaming experiment, the value of inter-frame packet arrival intervals including the software encoder (e.g., FFmpeg) processing delay and protocol stack delay was 59.08 ms. Accordingly, ms was used. In practical applications, this parameter can be adaptively updated based on the target video frame rate (1/FPS) and the specific processing capabilities of the system in use.
The maximum number of packets N that can be requested in timeout-based retransmission is limited to 3. Under our video encoding configuration (640 × 480, H.264, RTP), a single video frame averages 3.28 packets with a median of two. Since burst losses primarily occur among packets within the same frame rather than across frames, and Figure 3c shows that 95.92% of loss events involve three or fewer consecutive packets, limiting timeout-based retransmission requests to at most three packets effectively covers the majority of loss patterns in practical V2V environments. However, N is a design parameter that should be adjusted according to video resolution, encoding scheme, and frame structure to suit specific application requirements.
In the considered MEC@gNB deployment, the mean end-to-end V2N2V latency is 3.22 ms. Consequently, retransmission round trips typically remain within approximately 10 ms, which is well below the 50 ms delay SLO. Thus, retransmission latency itself does not dominate packet reception outcomes; rather, the packet recovery performance depends primarily on when loss is detected. Packet arrival timing may deviate from expectations due to V2V HARQ retransmissions and variability in frame capture and packetization, affecting the time available for recovery within the SLO. The simulation processes over 170,000 packets across more than 45,000 video frames, providing sufficient coverage to reflect these timing-related effects.
5.2.3. Metrics
We evaluate performance using two categories of metrics. The first category consists of network-layer metrics that characterize the packet and frame delivery:
Packet Reception Ratio (PRR): the ratio of packets successfully received within the SLO to the total packets transmitted.
Cellular Network Utilization: the fraction of total video traffic transmitted via the cellular path for retransmission. Lower utilization indicates a more efficient use of cellular resources.
The second category consists of application-layer QoE metrics that reflect the end-user viewing experience:
Frame Playback Ratio (FPR): the ratio of frames successfully decoded and displayed to the total frames transmitted, accounting for codec dependencies where the loss of a reference frame renders dependent frames undecodable.
Stall Count: the number of playback interruptions caused by missing frames at their scheduled display time.
Structural Similarity Index (SSIM): a perceptual quality metric quantifying the visual similarity between transmitted and received video frames, ranging from 0 (no similarity) to 1 (identical).
5.3. Simulation Results
5.3.1. PRR
This section evaluates PRR based on 5GAA’s V2V video streaming use case requirements (end-to-end latency < 50 ms) [2].
Overall Reliability
Figure 8a shows the PRR for the entire trace. V2V Only recorded 95.77% PRR, failing to meet target application requirements (99%). PLR Switching achieved a high PRR of 99.50%, but it has limitations of excessive cellular data consumption, as will be shown in the subsequent cost analysis. In contrast, Gap+Timeout achieved 99.96% PRR, perfectly compensating for V2V reliability limitations with V2N2V to demonstrate nearly lossless performance.
Figure 8.
Transmission reliability analysis comparison: (a) PRR of each strategy, and (b) conditional PRR across V2V PLR quantiles.
Robustness in Deep Fading Conditions
The difference between Gap Only (99.90%) and Gap+Timeout (99.96%) may appear negligible in PRR numbers. However, an in-depth analysis by channel state clearly reveals the robustness difference of the proposed framework. To precisely analyze the V2V channel loss rate variability, we segmented the entire trace into PLR intervals using the Binary Segmentation algorithm [28] and analyzed the conditional PRR for each interval, and the results are shown in Figure 8b. Analysis results show that in severe channel degradation intervals (Q75, top 25% high-loss interval) where the V2V PLR exceeds 19%, Gap Only’s PRR drops to 98.90%, failing to achieve SLO (99%). This is because Gap Only depends on subsequent packet arrivals and cannot quickly recover losses occurring in consecutive packet loss (burst loss) or traffic tail loss. In contrast, Gap+Timeout maintained a stable reliability of 99.66% even under such adverse conditions through the proactive timeout mechanism operation. Particularly in the worst communication environment Q90 (PLR ≥ 32.1%) interval, it recorded 99.49% PRR, demonstrating its robustness.
5.3.2. Resource Efficiency and Cost
Cellular Network Utilization
Pure data loss on the V2V path measured in the experimental interval is 4.70% of total traffic. This corresponds to the theoretical lower bound of data that must be invested for data integrity. In Figure 9, Gap Only recorded 4.87% cellular link utilization. This is close to the theoretical lower bound even including the retransmission request overhead, demonstrating its lean operation. In contrast, PLR Switching recorded 37.23%, wasting times the actual required amount (4.70%). Compared to this, Gap+Timeout recorded 5.54% utilization. This is only a 0.84% increase over the theoretical minimum, demonstrating maximized resource efficiency while achieving lossless-level reliability (99.96% PRR).
Figure 9.
Cellular link utilization comparison, demonstrating the efficiency of the proposed framework.
Monetary Cost
To quantify the economic value of data efficiency, we assume a vehicle operating 2 h daily for commuting with the above utilization efficiency. Considering a typical high-definition video streaming bitrate of approximately 4 Mbps, continuously serving video to surrounding vehicles for 2 h produces approximately 3.8 GB of data. Of this, Gap Only must transmit 177 MB, Gap+Timeout 209 MB, and PLR Switching 1.4 GB via the cellular path. When consuming data at this rate for approximately 20 days per month, these three strategies transmit 3.5 GB, 4.2 GB, and 28 GB, respectively, via the cellular path. If the 5G sidelink wireless environment is much worse than that of downtown Seoul measured in our study, the cellular path usage would exceed the data amounts presented above, making economic considerations non-negligible for connected vehicles.
The data consumption difference (e.g., 4.2 GB vs. 28 GB monthly) has varying economic impact depending on data plans:
Unlimited plans with throttling (assume 22 GB threshold used by AT&T): Gap+Timeout stays below threshold; PLR Switching triggers rate limiting, affecting concurrent services.
Tiered plans (e.g., 10/20/30 GB tiers): Gap+Timeout remains in lower tiers; PLR Switching forces tier upgrades ( $20–40/month).
Pay-per-GB plans (e.g., $10/GB overage): direct savings of $240/month per vehicle.
Thus, these economic considerations could be decisive in whether vehicle owners allow their cellular connectivity to support V2X, ultimately shaping whether the promising applications arising from the integration of V2N and V2X technologies will become a reality. Beyond monetary cost, reducing cellular utilization benefits network operators by enabling a higher vehicle density per cell, which is increasingly critical as hybrid transmission becomes widespread.
While the preceding analysis focused on network-level performance, we also evaluated user-perceived quality through QoE metrics: the Frame Playback Ratio (FPR), stall count, and video quality (SSIM). Figure 10 shows the QoE performance comparison results for each transmission strategy.
Figure 10.
Comparison of QoE metrics: (a) Frame Playback Ratio (FPR); (b) stall count (during 26 minutes of live vehicle-to-vehicle video streaming simulation); (c) average SSIM.
5.3.3. Frame Playback Ratio
Due to the packet-to-frame dependency and the structural dependency of the video frames discussed in Section 3.2, packet loss is amplified nonlinearly in FPR. Figure 10a shows the FPR for each strategy. As established in Section 3.2, the V2V Only results clearly reveal single-path reliability limitations. The 85.22% FPR corresponds to four to five frames being continuously lost per second for 30 fps video, causing video discontinuity that could impede driver situation awareness and pose a safety threat.
Hybrid strategies significantly improved performance, but gaps exist among them. PLR Switching recorded 99.50% PRR but only 97.05% FPR, while the proposed Gap+Timeout achieved 99.71% FPR from 99.96% PRR. Notably, a mere 0.46% difference in PRR expanded to 2.66% in FPR (over 5× amplification), demonstrating that minor network-level reliability differences can result in substantial application-level impact. Consequently, the proposed framework saves approximately 50 additional frames per minute compared to PLR Switching, providing drivers with seamless visual information during high-speed driving.
5.3.4. Stall Count
Playback stalls impair user immersion and disrupt information delivery. Figure 10b shows the total stall count during 26 min of simulation experiments. V2V Only recorded 1522 stalls as reported in Section 3.2. In contrast, PLR Switching generated 497 stalls, while the proposed Gap+Timeout reduced this to 49, which is a 10× reduction from that of PLR Switching. Due to the high inter-packet dependency in video data, even minor network-level losses can cause stalls. These results indicate that our proposed framework, by immediately recovering lost data through the reliability-guaranteed V2N2V path, is effective in maintaining video streaming QoE.
5.3.5. Video Quality
Figure 10c shows the average SSIM. V2V Only recorded 0.8625 as reported in Section 3.2, which is substantially lower than the hybrid strategies, indicating that drivers would clearly perceive a significant degradation in video quality. Among hybrid strategies, the proposed Gap+Timeout achieved the highest SSIM of 0.9431, slightly exceeding Gap Only (0.9423) and notably outperforming PLR Switching (0.9228). Flynn et al. [29] reported that the Just Noticeable Difference (JND) threshold for SSIM is approximately 0.95, below which human observers begin to perceive quality degradation. While PLR Switching falls well below this threshold, Gap+Timeout approaches it closely, suggesting that modest further improvements could achieve perceptually lossless quality. This result demonstrates that the proposed framework minimizes error propagation from packet loss and provides quality closest to the original video.
5.4. Model Validation
We validate the analytical model developed in Section 5.1 by comparing its predictions against the simulation results presented above.
5.4.1. Parameter Fitting
The burstiness factor is determined by fitting the V2V-only FPR model to match the simulation result:
| (35) |
With from simulation, fitting yields , indicating highly bursty error patterns.
5.4.2. Numerical Evaluation
Using the fitted with baseline parameters:
Frame failure: , ;
Recovery (I-frame): (limited by tight and cases);
Recovery (P-frame): (better due to smaller frame size and larger ).
Substituting into Equations (28) and (29), the frame decode probabilities become
| (36) |
| (37) |
Substituting into Equation (8) yields .
5.4.3. Validation Results
Table 6 compares the model predictions with the simulation results.
Table 6.
Model validation results.
| V2V-Only | Proposed | |||
|---|---|---|---|---|
| Metric | Model | Simulation | Model | Simulation |
| PRR | 0.9577 | 0.9577 | 0.9999 | 0.9996 |
| FPR | 0.8522 (fitted) | 0.8522 | 0.9973 | 0.9971 |
The V2V-only results match exactly by construction: p is measured from the trace and is fitted to FPR. The meaningful validation is whether these parameters predict the proposed scheme’s performance.
For the proposed scheme:
PRR: Model predicts 0.9999 vs. simulation 0.9996 (error: );
FPR: Model predicts 0.9973 vs. simulation 0.9971 (error: ).
Both metrics are slightly overestimated, indicating that the model is optimistic about recovery success. This bias arises from simplifying assumptions: a uniform distribution of burst start positions, independence between consecutive bursts, and perfect V2N2V reliability.
Despite this small prediction error, the model correctly captures the key qualitative insight: V2N2V backup recovery transforms moderate V2V reliability (PRR 95.77%, FPR 85.22%) into near-perfect performance (PRR 99.96%, FPR 99.71%).
We acknowledge that the validation is limited by having only one simulation scenario. A more rigorous validation would require multiple channel conditions, testing whether the fitted parameters generalize. We leave this extension for future work.
5.5. Qualitative Analysis on Scalability
While our evaluation focuses on a single vehicle pair, the architectural characteristics of the proposed framework suggest robust scalability in multi-vehicle scenarios.
In high-density scenarios, the Sidelink Channel Busy Ratio (CBR) increases, leading to higher packet loss. Conventional strategies, such as PLR Switching, typically offload the entire video stream to the cellular network when loss exceeds a threshold. This behavior can cause a sudden surge in cellular bandwidth demand. In contrast, by design, our framework retransmits only the lost packets. Therefore, even as sidelink congestion worsens, the cellular traffic load generated by our method increases only in proportion to the loss rate rather than spiking to 100%. This structural efficiency is expected to significantly reduce the aggregate load on the cellular network compared to full-offloading schemes.
Regarding the V2N2V path, multi-vehicle traffic may increase the processing load on the MEC server. However, the MEC@gNB architecture fundamentally reduces network latency by eliminating the need for traffic to traverse the core network and the internet. This architectural advantage provides significant latency headroom compared to centralized routing. Even if queuing delays occur due to high server load, this inherent margin allows the system to absorb the delays while remaining within the 50 ms SLO.
6. Conclusions
This paper addressed the challenge of achieving reliable and cost-effective real-time video streaming between connected vehicles equipped with both V2V sidelink and cellular (V2N2V) interfaces. Through measurement studies in Seoul urban driving environments, we characterized the complementary nature of these paths: V2V provides ultra-low latency (mean 2.99 ms) but imperfect reliability (95.77% PRR), while V2N2V achieves perfect reliability but suffers from high latency variability that can violate stringent V2X timing requirements. We proposed a hybrid framework that uses V2V as the primary path while selectively utilizing V2N2V only for retransmitting lost packets. The key innovation is a dual loss detection mechanism combining gap-based and timeout-based triggers that leverage RTP header information. The gap-based trigger provides immediate detection upon sequence number discontinuity, while the timeout-based trigger handles burst and tail losses using differentiated intra-frame (2.5 ms) and inter-frame (60 ms) timeouts. Trace-driven simulation demonstrated the effectiveness of the proposed approach. Gap+Timeout achieved 99.96% PRR and 99.71% FPR, reduced stalls by 10× compared to PLR Switching (49 vs. 497), and improved video quality (SSIM: 0.9431 vs. 0.9228). Most importantly, the framework maintained cellular utilization at only 5.54% (merely 0.84%p above the theoretical lower bound) while PLR Switching consumed 37.23%, representing approximately a 7× cost reduction. Minimizing cellular usage is critical for bridging V2N and V2X technologies, as economic pressures may ultimately determine whether vehicle owners permit their cellular connectivity to support V2X services. By demonstrating that reliable video streaming is achievable with minimal cellular cost, this work contributes toward enabling the promising applications that arise from integrating connected vehicles with V2X capabilities.
Author Contributions
Conceptualization, H.K.; methodology, S.H.; software, S.H. and S.K.; validation, S.H., H.K. and Y.-S.S.; formal analysis, S.H. and H.K.; investigation, S.H. and S.K.; resources, H.K.; data curation, S.H.; writing—original draft preparation, S.H. and H.K.; writing—review and editing, S.H. and H.K.; visualization, S.H.; supervision, H.K.; project administration, H.K. and Y.-S.S.; funding acquisition, H.K. and Y.-S.S. All authors have read and agreed to the published version of the manuscript.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Data available on request due to restrictions.
Conflicts of Interest
The authors declare no conflicts of interest.
Funding Statement
This work was supported by the Institute of Information & Communications Technology Planning & Evaluation (IITP) grant funded by the Korean government (MSIT) (RS-2025-02283230, Development of autonomous cooperative driving technology based on seamless communication of heterogeneous V2X for autonomous driving).
Footnotes
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
References
- 1.U.S. Department of Transportation, Intelligent Transportation Systems Joint Program Office . Vehicle-to-Everything (V2X) Technology Executive Briefing. U.S. Department of Transportation; Washington, DC, USA: 2024. Technical Report. [Google Scholar]
- 2.5G Automotive Association (5GAA) C-V2X Use Cases and Service Level Requirements (Vols. I, II and III) 5G Automotive Association; Munich, Germany: 2025. Technical Report. [Google Scholar]
- 3.Coll-Perales B., Lucas-Estañ M.C., Shimizu T., Gozalvez J., Higuchi T., Avedisov S., Altintas O., Sepulcre M. End-to-end V2X latency modeling and analysis in 5G networks. IEEE Trans. Veh. Technol. 2022;72:5094–5109. doi: 10.1109/TVT.2022.3224614. [DOI] [Google Scholar]
- 4.3GPP . 5G System Enhancements for Edge Computing; Stage 2 (Release 17) 3rd Generation Partnership Project (3GPP); Sophia Antipolis, France: 2022. Technical Specification TS 23.548. [Google Scholar]
- 5.Brahim M.B., Mir Z.H., Znaidi W., Filali F., Hamdi N. QoS-aware video transmission over hybrid wireless network for connected vehicles. IEEE Access. 2017;5:8313–8323. doi: 10.1109/ACCESS.2017.2682278. [DOI] [Google Scholar]
- 6.Zhang X., Li J., Zhou J., Zhang S., Wang J., Yuan Y., Liu J., Li J. Vehicle-to-Everything Communication in Intelligent Connected Vehicles: A Survey and Taxonomy. Automot. Innov. 2025;8:13–45. doi: 10.1007/s42154-024-00310-2. [DOI] [Google Scholar]
- 7.Naeem M.A., Chaudhary S., Meng Y. Road to efficiency: V2v enabled intelligent transportation system. Electronics. 2024;13:2673. doi: 10.3390/electronics13132673. [DOI] [Google Scholar]
- 8.Naseh D., Shinde S.S., Tarchi D. Enabling intelligent vehicular networks through distributed learning in the non-terrestrial networks 6G vision; Proceedings of the 28th European Wireless Conference on European Wireless 2023, VDE; Rome, Italy. 2–4 October 2023; pp. 136–141. [Google Scholar]
- 9.Naseh D., Bozorgchenani A., Tarchi D. Deep Reinforcement Learning for Edge-DASH-Based Dynamic Video Streaming; Proceedings of the 2025 IEEE Wireless Communications and Networking Conference (WCNC); Milan, Italy. 24–27 March 2025; Piscataway, NJ, USA: IEEE; 2025. pp. 1–6. [Google Scholar]
- 10.Sepulcre M., Gozalvez J. Heterogeneous V2V communications in multi-link and multi-RAT vehicular networks. IEEE Trans. Mob. Comput. 2019;20:162–173. doi: 10.1109/TMC.2019.2939803. [DOI] [Google Scholar]
- 11.Jacob R., Anwar W., Fettweis G., Pohlmann J. Exploiting multi-RAT diversity in vehicular ad-hoc networks to improve reliability of cooperative automated driving applications; Proceedings of the 2019 IEEE 90th Vehicular Technology Conference (VTC2019-Fall); Honolulu, HI, USA. 22–25 September 2019; Piscataway, NJ, USA: IEEE; 2019. pp. 1–7. [Google Scholar]
- 12.IEEE Standard for Information Technology–Local and Metropolitan Area Networks–Specific Requirements–Part 11: Wireless LAN Medium Access Control (MAC) and Physical Layer (PHY) Specifications Amendment 6: Wireless Access in Vehicular Environments. IEEE; New York, NY, USA: 2010. [Google Scholar]
- 13.Yacheur B.Y., Ahmed T., Mosbah M. DRL-based RAT selection in a hybrid vehicular communication network; Proceedings of the 2023 IEEE 97th Vehicular Technology Conference (VTC2023-Spring); Florence, Italy. 20–23 June 2003; Piscataway, NJ, USA: IEEE; 2023. pp. 1–5. [Google Scholar]
- 14.Yacheur B.Y., Ahmed T., Mosbah M. Efficient DRL-based selection strategy in hybrid vehicular networks. IEEE Trans. Netw. Serv. Manag. 2023;20:2400–2411. doi: 10.1109/TNSM.2023.3300653. [DOI] [Google Scholar]
- 15.Jacob R., Franchi N., Fettweis G. Hybrid V2X communications: Multi-RAT as enabler for connected autonomous driving; Proceedings of the 2018 IEEE 29th Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC); Bologna, Italy. 9–12 September 2018; Piscataway, NJ, USA: IEEE; 2018. pp. 1370–1376. [Google Scholar]
- 16.Albonda H.D.R., Pérez-Romero J. A new mode selection and resource reuse strategy for V2X in future cellular networks; Proceedings of the 2020 IEEE 91st Vehicular Technology Conference (VTC2020-Spring); Antwerp, Belgium. 25–28 May 2020; Piscataway, NJ, USA: IEEE; 2020. pp. 1–6. [Google Scholar]
- 17.González E.E., Garcia-Roger D., Monserrat J.F. LTE/NR V2X communication modes and future requirements of intelligent transportation systems based on MR-DC architectures. Sustainability. 2022;14:3879. doi: 10.3390/su14073879. [DOI] [Google Scholar]
- 18.Hui Y., Su Z., Luan T.H., Cai J. A Game Theoretic Scheme for Optimal Access Control in Heterogeneous Vehicular Networks. IEEE Trans. Intell. Transp. Syst. 2019;20:4590–4603. doi: 10.1109/TITS.2019.2894716. [DOI] [Google Scholar]
- 19.Mir Z.H., Toutouh J., Filali F., Alba E. QoS-aware radio access technology (RAT) selection in hybrid vehicular networks; Proceedings of the International Workshop on Communication Technologies for Vehicles; Sousse, Tunisia. 6–8 May 2015; Berlin/Heidelberg, Germany: Springer; 2015. pp. 117–128. [Google Scholar]
- 20.Mir Z.H., Toutouh J., Filali F., Ko Y.B. Enabling DSRC and C-V2X integrated hybrid vehicular networks: Architecture and protocol. IEEE Access. 2020;8:180909–180927. doi: 10.1109/ACCESS.2020.3027074. [DOI] [Google Scholar]
- 21.Altahrawi M., Abdullah N.F., Nordin R. Service-oriented LSTM multi-criteria RAT selection scheme for vehicle-to-infrastructure communication. IEEE Access. 2022;10:110261–110284. doi: 10.1109/ACCESS.2022.3214852. [DOI] [Google Scholar]
- 22.Mir Z.H., Dreyer N., Kürner T., Filali F. Investigation on cellular LTE C-V2X network serving vehicular data traffic in realistic urban scenarios. Future Gener. Comput. Syst. 2024;161:66–80. doi: 10.1016/j.future.2024.07.002. [DOI] [Google Scholar]
- 23.Khalid I., Maglogiannis V., Naudts D., Shahid A., Moerman I. Optimizing hybrid v2x communication: An intelligent technology selection algorithm using 5g, c-v2x PC5 and dsrc. Future Internet. 2024;16:107. doi: 10.3390/fi16040107. [DOI] [Google Scholar]
- 24.Chowdhury D.R., Nandi S., Goswami D. Cost-effective live video streaming for internet of connected vehicles using heterogeneous networks. Ad Hoc Netw. 2024;153:103334. doi: 10.1016/j.adhoc.2023.103334. [DOI] [Google Scholar]
- 25.Bréhon-Grataloup L., Kacimi R., Beylot A.-L. Reliable multi-RAT connectivity in urban V2X architectures: An experimental campaign. Ad Hoc Netw. 2025;179:104007. doi: 10.1016/j.adhoc.2025.104007. [DOI] [Google Scholar]
- 26.Zanforlin M., Munaretto D., Zanella A., Zorzi M. SSIM-based video admission control and resource allocation algorithms; Proceedings of the 2014 12th International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOpt); Hammamet, Tunisia. 12–16 May 2014; Piscataway, NJ, USA: IEEE; 2014. pp. 656–661. [Google Scholar]
- 27.Intelligent Transport Systems—Station and Communication Architecture. International Organization for Standardization; Geneva, Switzerland: 2020. [Google Scholar]
- 28.Truong C., Oudre L., Vayatis N. Selective review of offline change point detection methods. Signal Process. 2020;167:107299. doi: 10.1016/j.sigpro.2019.107299. [DOI] [Google Scholar]
- 29.Flynn J.R., Ward S., Abich J., IV, Poole D. Image Quality Assessment Using the SSIM and the Just Noticeable Difference Paradigm; Proceedings of the Engineering Psychology and Cognitive Ergonomics. Understanding Human Cognition (EPCE 2013); Las Vegas, NV, USA. 21–26 July 2013; Berlin/Heidelberg, Germany: Springer; 2013. pp. 23–30. Lecture Notes in Computer Science. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Data available on request due to restrictions.










