Skip to main content
Springer logoLink to Springer
. 2025 Dec 17;7(1):3. doi: 10.1007/s42979-025-04533-w

To Signal or Not to Signal? A Non-cooperative Game-Theoretic Approach to Discretionary Communication Between Road Users

Isam Bitar 1,, Albert Solernou Crusat 1, Richard Romano 1, David Watling 1
PMCID: PMC12711921  PMID: 41426655

Abstract

Reciprocal communication between road users is a vital element of road user interaction. Non-cooperative game theory is an effective framework for modelling and characterising communicative behaviour between road users, which enables the study of emergent benefits for both the issuer and recipient of communicative signals. In this paper, we introduce discretionary communication to gain an advantage over the other road user by masking one’s intent if beneficial to do so. We conduct a series of experiments with simulated interactions and compare interaction outcomes where communication is mandatory against those where communication is discretionary. Our findings further support the premise that non-cooperative game theory is an effective paradigm for modelling and producing emergent behaviours which benefit the network. Moreover, we see that including a layer of discretionary communication reaps benefits in interaction outcome to the communicator. It also provides benefits in safety to all parties involved above and beyond the benefits seen from mandatory communication.

Keywords: Game theory, Communication, Cheap talk, Non-cooperative games, Bayesian games, Emergent cooperation, Discretionary communication, First mover advantage

Introduction

The reciprocal interaction between road users in which they engage in competitive, cooperative, and communicative behaviours to negotiate priority and road space is an integral part of navigating the road network. Properly understanding and modelling these interactions is a growing field, especially as autonomous vehicles get closer to technological and market maturity.

Until recently, research on modelling communication as an active component of road user interaction rejected the game-theoretic approach due in part to its perception as a framework that does not lend itself to communicative behaviour. For example, the researchers in [1] relied on an underlying assumption of the existence of a shared goal between interacting players to model communicative behaviour. However, we have provided a proof of concept in [2] that non-cooperative game theory (where players only seek to maximise one’s own utility) can indeed provide a robust framework for modelling road user communication descriptively and prescriptively, without the need for an underlying shared goal.

In our recent paper [2], we concluded that issuing and receiving communication to influence interaction outcomes from a non-cooperative game-theoretic perspective could feasibly occur, and that it could produce emergent, population-wide benefits.

The current paper seeks to expand on that paradigm by exploring whether discretionary communication further improves a road user’s utility in an interaction. The original model confines communicative behaviour to the Main-Lane Vehicle Inline graphic to communicate its intent to the Joining Vehicle Inline graphic. It is implied that Vehicle Inline graphic makes its intent clear to Vehicle Inline graphic prior to the interaction, hence Vehicle Inline graphic is always the vehicle to move first. The enhanced model we propose in this paper allows Vehicle Inline graphic to make a discretionary choice between the behaviour implied in the original model (signalling the desire to join ahead) or to forego a signal in favour of a ‘surprise manoeuvre’ to attempt to force a lane change.

Thus, we aim to explore the validity of the following two hypotheses as part of this study.

  • Vehicles which engage in discretionary communication have an advantage (better payoff) over vehicles in the same situation which always communicate their intent to their opponents

  • Communication in a non-cooperative game-theoretic framework can make interactions safer (fewer crashes and dangerous interactions) and more efficient (better payoffs for all parties involved), even when this communication is optional

One way in which we can measure interaction efficiency is by studying the occurrence of non-ideal outcomes. Non-ideal outcomes are an important metric to measure in the context of road user communication, since they reflect either miscommunication or misreading of one or both road users in an interaction. Such outcomes are non-ideal because they often result in an action by one vehicle that is opposite to what the other intended. For example, at a T-junction, the main-road vehicle may choose to yield to the minor-road vehicle, which chooses not to accept right of way anyway. Thus, the major-road vehicle loses time and momentum, and the minor-road vehicle waits unnecessarily. As such, these outcomes often return worse payoffs to one or both vehicles than what would have been achieved if either vehicle took a different decision. Outcomes like these effectively leave some utility ‘on the table’, and are commonly referred to in game theory as Pareto inefficient [3]. We posit that access to better information (via communication) should incrementally reduce the occurrence of non-ideal outcomes. This concept is known as Pareto improvement.

To this end, we build on the experimental design we’ve developed in [2], where we simulate the interaction between vehicles in a non-cooperative game-theoretic setting. We develop one set of simulations which forces the agent vehicle into communicating its intent every time, and another which allows it to choose whether to communicate, based on its assessment of the benefit of doing so. By comparing the results, we can gauge the effect of discretionary communication on both the issuer and the recipient of communication, as well as the effect of this behaviour on the safety and efficiency of the interaction in general.

Literature Review

Historically, the domain of road user interaction has been left as an accessory to microsimulation models of different multi-agent driving scenarios, such as car following [4, 5] and lane changing [6]. In these models, road user interaction is often restricted to collision avoidance. Increasingly, game-theoretic models have emerged to take a more in-depth look at the interaction element itself, especially from an autonomous vehicle’s perspective [7, 8].

Lane-change modelling is a topic which has been explored in depth in the literature. Traditional lane changing models use preset rules to determine the necessity and feasibility of lane changing, irrespective of individual road users’ preferences or constraints [9, 10]. The general formula is that an incentive criterion competes with a safety criterion to dictate whether a lane change occurs. Incentive is often some form of speed or space advantage, whilst safety concerns the risk of collision. Such models at first assumed homogeneity amongst road users, hence the use of a global set of rules. Later models introduced some individuality to the lane change interaction [1113]. For example, [12] introduced a ‘politeness’ factor which considers the disutility to the rest of the traffic population should the agent carry out a lane change. Conversely, [13] introduced an ‘aggressiveness’ factor which influences an agent’s preference for space over safety. Neither model, however, attempted to build a framework in which an agent advertises these preferences to other road users.

Increasingly, the premise of interaction between two or more agents has become the domain of game theory. One of the first to introduce a game-theoretic lane changing model is Kita [14]. Kita employed a simple, two-player non-cooperative game with complete information, where each player chooses from a set of two strategies, validated and calibrated against real-world data.

Later models extended the game-theoretic approach in several directions. Some built on Kita’s original framework by incorporating variation in vehicle kinematics and more robust payoff functions [15]. Others introduced hierarchical structures to capture interaction at multiple levels. For example, [16] separated long-horizon strategic reasoning from short-horizon tactical games, whilst [17] modelled bounded rationality through “level-k” reasoning, where agent sophistication in anticipating the opponent is layered recursively with every next level.

Another common approach is the adoption of repeated games. Repeated games allow behaviour to unfold across multiple stages, enabling history-dependent strategies such as reciprocity [18]. Kang and Rakha [19] followed this approach to capture ongoing tactical adjustment, whereas [20] applied a receding-horizon method where the game is rebuilt at each timestep without persistent memory or cumulative payoffs. The former enabled the emergence of reciprocal behaviours, whilst the latter emphasised interaction safety.

A more seldom explored but promising paradigm is evolutionary game theory. Iwamura and Tanimoto [21] combined evolutionary game theory with a cellular automaton to demonstrate how emergent stable strategies vary under varying traffic conditions. Bitar et al. [22] extended this line by analysing how spatial factors such as cluster size and vehicle range shape the evolution and success of emergent strategies.

Finally, extensive-form Bayesian games have gained traction as a means of capturing bounded rationality [23]. By allowing agents to update their beliefs about their opponents’ states and preferences, these models move closer to real-world conditions. Applications include [24] on mandatory versus discretionary lane changes, [25] on connected versus non-connected environments, and [2], where we introduced communication itself, through implicit and explicit signals, as an active component of interaction in a game-theoretic setting.

Thus, [2] adds road user interaction to the growing body of fields in which non-cooperative game theory is used to describe and explain emergent cooperative and communicative behaviour [18, 2634]. This concept carries its own implications regarding interaction with autonomous vehicles, given the general tendency for humans to behave less cooperatively towards machines [3537]. This means that there is a question to be raised on whether autonomous vehicles should consider if it is beneficial to advertise their intent to other road users. Indeed, evidence suggests that communication can be used to deceive other players when there is an asymmetry in available information [34]. We have previously shown that autonomous vehicles would need to perform better than human-driven vehicles in terms of interaction outcomes to survive in a mixed population [22, 38]. Therefore, exploring this aspect in the context of road user communication may be of use, especially in the broader context of autonomous vehicle interaction with human road users. This paper builds on that conclusion by looking at the operational/tactical level behaviour, such as discretionary signalling.

Research suggests that most instances of communication between road users are implicit [3941]. However, explicit communication, less common as it may be, remains an emphatic conveyor of information and road user intent [42]. In our recent work [2], we conceptualised a non-cooperative game as a lane-change scenario in which a Joining Vehicle Inline graphic desires to change lanes ahead of a Main-Lane Vehicle Inline graphic. The paper concluded that both vehicles benefit from Vehicle Inline graphic issuing such helpful communication. It is important to note that neither vehicle earned nor lost payoff directly from this communication. That is, the communication did not have an intrinsic utilitarian value. In game theory, this form of communication is known as cheap talk, where providing and receiving information is free [43]. The paper’s model also assumed that the vehicles received and interpreted communication perfectly. In the real world, communication may be obscured, ignored, or misunderstood. In fact, Bayesian game-theoretic models exist that are entirely dedicated to the utterance, receipt and understanding of communication between players [44, 45]. Though such a paradigm would add interesting complexity to an interaction model, it was beyond the scope of that study. The paper also assumed that the Joining Vehicle Inline graphic would implicitly but unambiguously make its intent clear that it wishes to join. As such, the question of whether there’s merit to either vehicle to mask its intent from the other remains open.

Masking one’s intent may be considered a form of deception. Deception is a game-theoretic concept in which the deceiving player limits, distorts, or alters information about the game (usually one’s own attributes, preferences, actions or payoffs) to trick the opponent into taking action that favours the deceiver at the expense of the deceived [46, 47]. The topic of deception has been explored in various applications, including sociology [48], politics [49], animal behaviour [50, 51] and even cyber security [52, 53]. By masking its intent, the Joining Vehicle Inline graphic robs the Main-Lane Vehicle Inline graphic of the ability to anticipate (and potentially block) Vehicle Inline graphic’s join attempt. To our knowledge, this concept is yet to be explored in the context of road user interaction.

We believe there is merit in investigating the effect of deception in road user communication in a non-cooperative game-theoretic setting. To date, research on the topic of road user communication has not explored this aspect. There is a particular interest in studying this concept in the context of the interaction between autonomous vehicles and other road users. Research shows that human road users are likely to behave less cooperatively towards autonomous vehicles. Perhaps, then, masking its intent may prove a useful tool in the autonomous vehicle’s toolbelt, which would help it navigate a potentially unfriendly environment. We aim to explore the feasibility of this form of behaviour in a non-cooperative game-theoretic setting, whether such behaviour would bring benefit to the agent vehicle, and what impact such behaviour may have on general interaction safety and efficiency.

Method

We conceptualise a discretionary lane change scenario between a Joining Vehicle Inline graphic and a Main-Lane Vehicle Inline graphic. In this section, we describe an expanded model which we built on top of the model we developed in [2]. In the original model, Vehicle Inline graphic moves first to allow or block Inline graphic, followed by Vehicle Inline graphic’s response. In the present paper, this basic model is expanded to accommodate an additional, pre-emptive move by Vehicle Inline graphic, where Inline graphic may decide to signal its intent to join ahead or forego a signal to try to force a join instead.

We discuss in this section the game-theoretic interaction model, the kinematic framework used to play an interaction, the payoff functions, and the communication element, which is the focus of this paper. We then detail the experimental design adopted in this paper to study the hypotheses we outlined in the introduction.

We employ a simple ‘lane change’ operation based on the bi-directional General Motors Car Following Model [54]. Figure 1 illustrates the conceptual layout of the proposed scenario. We use the General Motors model for its relative simplicity and reliance on Newtonian kinematics. This allows us to easily identify and isolate individual parameters to aid in interaction development.

Fig. 1.

Fig. 1

Conceptual layout of the interaction between main-lane vehicle Inline graphic and joining vehicle Inline graphic

The Game-Theoretic Interaction Model

The interaction model employed is a two-player, sequential, non-cooperative Bayesian game between Vehicle Inline graphic and Vehicle Inline graphic. The extensive form of the model (game tree) is illustrated in Fig. 2. Vehicle Inline graphic has two stochastic properties whose values are based on predetermined probabilities. These two properties are Inline graphic (possible values: Inline graphic) and Inline graphic (possible values: Inline graphic) . Inline graphic relates to Vehicle Inline graphic’s awareness of and responsiveness to Vehicle Inline graphic’s movement. Literature on attention in the context of game-theoretic interaction is scarce [4]. For example, [55] extends the Gipps car following model to incorporate differences in reaction time but falls short of modelling distraction. Inline graphic relates to whether Vehicle Inline graphic would attempt to punish a lane-change it did not agree to. This is a common concept in game-theoretic interaction models and manifests in different forms [6, 8]. Vehicle Inline graphic knows whether it is Inline graphic or Inline graphic (solid horizontal line between the branches) but does not know its Inline graphic level (dashed line between the branches). In game-theoretic sequential models, such properties are modelled as moves by Nature. Inline graphic is a third-party entity which selects the values of the stochastic elements of a sequential game based on predetermined probabilities. These elements create uncertainty about the information available to one or more players, which warrant the establishment of beliefs. Beliefs are probabilistic assumptions about one or more properties relevant to a player or the interaction. We elaborate further on this concept later in this section.            

Fig. 2.

Fig. 2

The sequential game between the main-lane vehicle Inline graphic and the Joining Vehicle Inline graphic (game tree)

Vehicle Inline graphic begins the interaction by choosing whether to Inline graphic its intent to Vehicle Inline graphic (e.g., turn signal) or attempt to Inline graphic the join without warning. If Vehicle Inline graphic chooses to Inline graphic, Vehicle Inline graphic may choose to Inline graphic Vehicle Inline graphic to join or Inline graphic its attempt. However, Vehicle Inline graphic does not get the opportunity to do so if the interaction is forced, or if Inline graphic’s Inline graphic state is Inline graphic (note the absence of this step from the respective branches of the game tree). Once Inline graphic has made its first decision (if applicable), Vehicle Inline graphic chooses whether to Inline graphic ahead of Inline graphic or Inline graphic until Inline graphic has passed. If Inline graphic had previously attempted to Inline graphic the join, it may at this stage choose to Inline graphic its original decision and continue, or Inline graphic the manoeuvre and wait instead. Finally, Vehicle Inline graphic must take a further action if Vehicle Inline graphic chooses to Inline graphic or Inline graphic. This final action depends on Vehicle Inline graphic’s Inline graphic and Inline graphic states. If Inline graphic’s Inline graphic state is Inline graphic, it will adopt a Inline graphic trajectory when Inline graphic and a Inline graphic trajectory when Inline graphic. Inline graphic entails tailgating Vehicle Inline graphic to induce a negative headway utility (see Inline graphic in the Payoffs section). If Vehicle Inline graphic’s Inline graphic state is Inline graphic or Inline graphic, it will Inline graphic Vehicle Inline graphic and continue moving as if it were in free flow and will not take measures to prevent a collision if one were imminent. This state only lasts for a finite amount of time, after which Vehicle Inline graphic employs Inline graphic or Inline graphic as appropriate.    

The Kinematic Model

The movement model is a discretised approach, where at each timestep each vehicle’s kinematic properties are evaluated. Acceleration is governed by a modified version of the bi-directional General Motors Car Following Model [2, 54] subject to the relevant acceleration and deceleration constraints for each vehicle. The general formula is illustrated in Eq. (1). All other kinematic properties are governed by Newtonian equations of motion.

graphic file with name d33e883.gif 1

where Inline graphic is the acceleration of Vehicle Inline graphic at the start of the next timestep Inline graphic. Inline graphic is the velocity of Vehicle Inline graphic at the current timestep Inline graphic. Inline graphic is the distance between Vehicle Inline graphic and its car-following target at timestep Inline graphic. Inline graphic is the velocity difference between Vehicle Inline graphic and its target at timestep Inline graphic. Inline graphic is a sensitivity factor which governs Vehicle Inline graphic’s acceleration rate to maintain car-following behaviour. Higher Inline graphic means more conservative movement. Inline graphic are parametric factors which in this study are set to 1.    

Inline graphic is constrained by the agent vehicle’s acceleration preferences and physical limitations (see Table 2 for the value ranges used in this paper).

Table 2.

Values and value ranges used in the simulation (based on values used in [2])

Property/constant Description Value/value range
Inline graphic Maximum comfortable acceleration (0.20, 2.00)Inline graphic  
Inline graphic   Maximum allowable acceleration (2.50, 3.50) Inline graphic [59]
Inline graphic Maximum comfortable deceleration (− 0.50, − 1.50)Inline graphic  
Inline graphic   Minimum acceptable time headway (0.50, 3.50)Inline graphic  
Inline graphic Decision time (0.50, 1.50)Inline graphic     
Inline graphic   Punitive sensitivity factor (exclusive to Inline graphic) (0.15, 0.35)
Inline graphic Initial velocity Inline graphic: (8, 18) Inline graphic; Inline graphic: (4, 10)Inline graphic  
Inline graphic Initial distance between the vehicles (14, 89)Inline graphic  
Inline graphic Desired velocity (exclusive to Inline graphic) (0.75, 1.50) × Inline graphicInline graphic  
Inline graphic Inline graphic penalty reduction factor (0.10, 0.20)
Inline graphic   Total number of distraction timesteps 10 (5–8 Inline graphic)
Inline graphic   Maximum number of timesteps 63 (31.5–34.5 Inline graphic)
Inline graphic   Lane change duration 5 [60, 61]
Inline graphic   General motors sensitivity factors 1
Inline graphic Timestep Inline graphic (from Inline graphic onward) 0.5 Inline graphic
Inline graphic   Phantom vehicle time headway * 4 Inline graphic
Inline graphic   Crash penalty − 250
Inline graphic   Maximum safe deceleration − 4.5 Inline graphic [59, 62]

*Used in free-flow mode to simulate gradual return to initial/desired velocity

Vehicle Inline graphic’s Inline graphic movement is governed by Eq. (2), derived from Newtonian equations of motion. It is the only acceleration value which is not governed by Eq. (1).

graphic file with name d33e979.gif 2

where Inline graphic is the acceleration of Vehicle Inline graphic at the start of the next timestep Inline graphic. Inline graphic is Vehicle Inline graphic’s maximum acceleration. Inline graphic is the distance between Vehicle Inline graphic and Vehicle Inline graphic at timestep Inline graphic. Inline graphic, Inline graphic are Vehicle Inline graphic and Vehicle Inline graphic’s velocities, respectively, at timestep Inline graphic. Inline graphic is the lane change duration.

A brief description of the model’s timesteps is outlined below.

  • Inline graphic = 0; begins the interaction the moment Vehicle Inline graphic takes its first action

  • Inline graphic = Vehicle Inline graphic’s Decision-Reaction Time Inline graphic; Vehicle Inline graphic reacts to Vehicle Inline graphic’s initial action (Inline graphic if Inline graphic, Inline graphic if Inline graphic)    

  • Inline graphic = Vehicle Inline graphic’s Decision-Reaction Time Inline graphic; Vehicle Inline graphic takes its second decision (Inline graphic if first decision was Inline graphic, Inline graphic if first decision was Inline graphic)

  • Inline graphic = Inline graphic; Vehicle Inline graphic reacts to Vehicle Inline graphic’s second decision (Inline graphic)

  • Inline graphic = 0.5 seconds; regular timesteps where the vehicles employ the modified General Motors Car Following Model as the interaction resolves

  • Inline graphic; the final timestep in an interaction, which is reached if a crash is detected, a predefined maximum duration is reached, or when all of the following interaction conclusion conditions [2] are met:
    • Inline graphic  
    • Inline graphic  
    • If Vehicle Inline graphic’s second action is Inline graphic or Inline graphic, Vehicle Inline graphic’s current time headway Inline graphic is greater than or equal to Vehicle Inline graphic’s minimum acceptable time headway Inline graphic  

Payoff Functions

The payoff for each vehicle is a function of several components. These are listed and described below.

Ride Comfort Inline graphic

Ride comfort is best represented as the change in acceleration over time (jerk). Acceleration is a common element in game payoffs in the literature [13, 14, 24]. In this model, we base ride comfort on both the acceleration values at each timestep (with respect to the vehicle’s comfortable acceleration value Inline graphic) and a measure of jerk over the interaction period (measured as the standard deviation of acceleration about its mean). The jerk element of the payoff calculation allows for a utilitarian distinction between different strategies. For example, a failed Inline graphic attempt where Vehicle Inline graphic accelerates then sharply decelerates will yield a worse payoff than a sustained sharp deceleration of equal magnitude.

Ride comfort Inline graphic applies to all interaction possibilities, and it is calculated as prescribed in Eq. (3).

graphic file with name d33e1259.gif 3

where Inline graphic is the total count of the interaction’s timesteps. Inline graphic is the agent vehicle’s acceleration at timestep Inline graphic. Inline graphic is the mean of Inline graphic. Inline graphic, Inline graphic are the vehicle’s maximum comfortable acceleration and deceleration, respectively. Inline graphic is used in the denominator of the first term if Inline graphic, else Inline graphic. Inline graphic is the duration of timestep Inline graphic.

Time Headway Inline graphic

Time headway is another common metric in many interaction models and, together with time to collision (TTC), forms the basis of interaction safety [14, 56]. In this study, we define Inline graphic as a function of the minimum time headway achieved during the interaction with respect to the agent vehicle’s minimum acceptable headway Inline graphic. Time headway Inline graphic applies to all interactions where Vehicle Inline graphic chooses or attempts to join at any point. It is calculated as prescribed in Eq. (4).

graphic file with name d33e1348.gif 4

where Inline graphic, Inline graphic are the current timestep and the set of all timesteps, respectively. Inline graphic is the disutility from being involved in a crash. Inline graphic is the vehicle’s time headway with respect to the lead vehicle at timestep Inline graphic. Inline graphic is the vehicle’s minimum acceptable time headway.        

Speed Difference Inline graphic

Speed difference concerns the steady states before and after an interaction. It is often used as part of the incentive function which triggers a lane change [15] or as part of the reward for changing lanes [24, 57].

For Vehicle Inline graphic, the speed difference payoff is based on the difference between the vehicle’s initial and final (stable) velocities, where a lower final velocity (caused by following a slower Inline graphic) brings a negative utility to Inline graphic. Similarly, if Vehicle Inline graphic opts to Inline graphic, Inline graphic becomes the difference between Inline graphic’s desired velocity after the lane change and Inline graphic’s stable velocity. This would mean that Inline graphic would incur a penalty if it is forced to join behind a Vehicle Inline graphic that is slower than Vehicle Inline graphic’s target (desired) velocity. Speed difference Inline graphic is applicable to either vehicle when it is the lag vehicle. This is in a variation to the original application of Inline graphic in [2], where it also applied to Vehicle Inline graphic as a lead vehicle if at any point during the interaction it was forced to accelerate beyond its initial or desired velocities. We have opted to remove Vehicle Inline graphic’s payoff function when it is the lead vehicle from the current study, since Vehicle Inline graphic’s highest achieved velocity did not factor into its own Inline graphic, and so it did not seem to make sense that we consider this element for Vehicle Inline graphic but not for Vehicle Inline graphic. Thus, Inline graphic is calculated according to Eqs. (5) and (6) below.

graphic file with name d33e1496.gif 5
graphic file with name d33e1500.gif 6

where Inline graphic is Vehicle Inline graphic’s initial (and stable) velocity, Inline graphic is Vehicle Inline graphic’s desired velocity after the lane change.

Time Penalty Inline graphic

Many game-theoretic models consider some form of time penalty in their payoffs, typically represented as time spent in the undesirable lane [14, 15, 56]. In the proposed model, Vehicle Inline graphic is subject to Inline graphic when it chooses to Inline graphic. It is a function of the amount of time Inline graphic needs to wait for Inline graphic to pass before it can Inline graphic behind it. It is calculated as the time until Inline graphic is equal to or greater than Inline graphic’s minimum acceptable time headway Inline graphic. If an interaction ends before this condition is met, the remainder is estimated based on Vehicle Inline graphic’s velocity and position at the end of the interaction relative to Vehicle Inline graphic. The payoff value is then multiplied by a factor Inline graphic that is specific to that instance of Inline graphic. Inline graphic represents Vehicle Inline graphic’s sensitivity to losing time. The higher, the less tolerant Inline graphic is to waiting its turn and the more likely it is to make riskier merges to avoid the wait. The calculation is mathematically represented in Eq. (7).

graphic file with name d33e1612.gif 7

*Inline graphic is used here to prevent Vehicle Inline graphic from waiting indefinitely for a slower Inline graphic to pass

where Inline graphic is the total count of the interaction’s timesteps. Inline graphic, Inline graphic are Vehicle Inline graphic’s time headway with respect to Vehicle Inline graphic at timestep Inline graphic and the final interaction timestep, respectively. Inline graphic is Vehicle Inline graphic’s minimum acceptable time headway. Inline graphic, Inline graphic are Vehicle Inline graphic’s and Vehicle Inline graphic’s initial velocities, respectively. Inline graphic, Inline graphic are Vehicle Inline graphic’s and Vehicle Inline graphic’s final positions, respectively. Inline graphic is Vehicle Inline graphic’s final velocity. Inline graphic is a sensitivity factor which represents Vehicle Inline graphic’s tolerance to losing time.        

Decision Making

The goal for each vehicle is rooted in fundamental non-cooperative game theory: to maximise one’s own payoff. The structure of the payoff functions ensures that the best interaction is no interaction (maximum possible payoff is zero). This would prevent vehicles from seeking conflict when one is unnecessary. Each vehicle will simulate an interaction with its opponent to establish expected payoffs using assumptions made about the opponent’s attributes, which are discussed later in the experimental design section. For Vehicle Inline graphic, this is done for both the Inline graphic and Inline graphic actions, then backward induction is used to determine the best action. For Vehicle Inline graphic, the expected payoffs for each of its actions are calculated for all Inline graphic’s possible Inline graphic and Inline graphic combinations. Each is then multiplied by the appropriate probability value to produce the total expected payoff per action.

Table 1 provides a summary of the applicable payoff components depending on the action(s) taken.

Table 1.

Payoff composition for each vehicle given the appropriate action set

Action set(s) Vehicle M Vehicle J

Inline graphic  

Inline graphic  

Inline graphic Inline graphic  

Inline graphic  

Inline graphic  

Inline graphic Inline graphic  

Inline graphic, Inline graphic are the probability that Inline graphic’s Inline graphic state is Inline graphic and Inline graphic is Inline graphic, respectively

Communication

Our work in [2] provided a pathway for Vehicle Inline graphic to update its prior beliefs on Vehicle Inline graphic’s stochastic states via receiving useful communication from Vehicle Inline graphic. The main contribution of this paper is to expand on that model by allowing Vehicle Inline graphic to choose whether to communicate its intent to join and explore the impact of this discretionary communication on the interaction outcome. Communication in this study takes place in three distinct stages.

Stage 0: Initial Communication from Vehicle Inline graphic to Vehicle Inline graphic

Before Vehicle Inline graphic begins the interaction, it will observe Vehicle Inline graphic’s current state and process any communication Vehicle Inline graphic may engage in. We characterise eye contact as a form of (explicit) communication that is available at this stage. That is, Vehicle Inline graphic will perceive eye contact with Vehicle Inline graphic as an explicit signal that Vehicle Inline graphic has seen Vehicle Inline graphic. We discuss how this is quantified later in the experimental design.

Stage 1: Discretionary Communication from Vehicle Inline graphic to Vehicle Inline graphic

At the beginning of the interaction, Vehicle Inline graphic has the option to issue communication to Vehicle Inline graphic about its intent to join ahead. We have designed the game in a way where Vehicle Inline graphic is incentivised to force-join where possible. However, if Vehicle Inline graphic decides against forcing the join, it will Inline graphic its intent to Vehicle Inline graphic and await a response. This gives Vehicle Inline graphic the opportunity to Inline graphic or Inline graphic the request.

Stage 2: Further Communication from Vehicle Inline graphic to Vehicle Inline graphic

Following Vehicle Inline graphic’s initial decision (Inline graphic), Vehicle Inline graphic will signal its intent explicitly and implicitly to Vehicle Inline graphic. We use the same signalling paradigm in this case as that in [2], which consists of a mixture of implicit signals (acceleration) and explicit signals (e.g., eye contact, a gesture, flashing of headlights). We elaborate further on these signals in the experimental design section.

Updating Beliefs: Bayesian Inference

Vehicle Inline graphic’s Inline graphic and Inline graphic states are assigned upon the vehicle’s instantiation. We outline the base probabilities used for these states in the experimental design section. The base probabilities are known to Vehicle Inline graphic as prior beliefs. Vehicle Inline graphic will use the communication it receives from Vehicle Inline graphic in stages 0 and 2 to incrementally update these beliefs in accordance with Bayes’ Theorem [58].

graphic file with name d33e2056.gif 8

where Inline graphic is the probability that Inline graphic is true given Observation Inline graphic (posterior belief), Inline graphic is the base probability that Inline graphic is true within the population (prior belief), Inline graphic is the probability of observing Inline graphic given that Property Inline graphic is true (likelihood), Inline graphic is the total probability of observing Inline graphic within the population

In the original model [2], Vehicle Inline graphic assigned a single Bayesian probability to both Inline graphic states. In the expanded model, Vehicle Inline graphic assigns a separate Bayesian probability to Vehicle Inline graphic’s probability to be Inline graphic, or to be Inline graphic. We believe this improves the Bayesian inference process in the interaction.

Experimental Design

To properly investigate the impact of two-way discretionary communication on interaction outcomes, we run a simulation as a Inline graphic where neither vehicle engages in any form of explicit communication, nor does either vehicle read any implicit signals from the other. In addition, we run two test groups: Inline graphic and Inline graphic, each with a different style of communication.

Each vehicle’s attributes and kinematic conditions are generated from set ranges prior to the interaction itself. These are outlined and described in Table 2. The base probability values assigned to Vehicle Inline graphic’s Inline graphic and Inline graphic states are outlined in Table 3. Table 4 gives a breakdown of the different communicable signals employed in this experiment and how these signals translate to Bayesian likelihoods. The likelihoods are in turn based on the signalling probabilities shown in Table 5.

Table 3.

The base probability for each stochastic property of Inline graphic (based on values used in [2])

Property/state Base probability (J's Prior elief)
Inline graphic   0.75
Inline graphic   0.6
Inline graphic   0.5

Table 4.

Probabilities of Vehicle Inline graphic issuing various communicative signals

Signal category Description Probability of occurrence
Attentive cooperative Attentive punitive Distracted cooperative Distracted punitive
Implicit: acceleration Inline graphic alters its velocity as appropriate 1 1 0.5 0.5
Explicit: attention e.g. eye contact Inline graphic makes eye contact with Inline graphic 0.9 0.9 0.05 0.05
Explicit: intention e.g. gestures Inline graphic issues a cooperative signal (if Inline graphic) 0.8 0.2 0.1 0.05
Inline graphic issues a threatening signal (if Inline graphic) 0.1 0.8 0.05 0.1

Table 5.

Breakdown of the likelihoods of each signal given vehicle Inline graphic's different possible stochastic attributes

Signal category Value Description Inline graphic *
Attentive cooperative Attentive punitive Distracted cooperative Distracted punitive
Implicit: accelera-tion 0 Inline graphic observes no acceleration from Inline graphic 0.05 0.05 0.55 0.55
1 Inline graphic observes deceleration from Inline graphic 0.55 0.3 0.25 0.15
− 1 observes acceleration from 0.4 0.65 0.2 0.3
Explicit: Inline graphic e.g. eye contact 0 Inline graphic is unable to make eye contact with 0.1 0.1 0.95 0.95
1 Inline graphic makes eye contact with 0.9 0.9 0.05 0.05
Explicit: Inline graphic e.g. gestures 0 Inline graphic does not observe an intention signal from Inline graphic 0.585 0.47 0.9275 0.9225
1 Inline graphic observes a positive-intent signal from Inline graphic 0.36 0.09 0.045 0.0225
− 1 Inline graphic observes a negative-intent signal from Inline graphic 0.055 0.44 0.0275 0.055

*Inline graphic values are based on results obtained from a pilot simulation run of 30,000 interactions

In the Inline graphic, Vehicle Inline graphic relies solely on the base probabilities of Vehicle Inline graphic’s Inline graphic and Inline graphic states as prior beliefs. Furthermore, Vehicle Inline graphic does not engage in any action at Inline graphic. The interaction effectively begins at Inline graphic, at Vehicle Inline graphic’s Inline graphic decision node.

In Inline graphic, Vehicle Inline graphic always advertises its intent to join and Vehicle Inline graphic employs the full suite of communication signals described in Table 4 during its first decision phase. Vehicle Inline graphic can interpret all signals issued implicitly or explicitly by Vehicle Inline graphic. As with the Inline graphic, the interaction effectively begins at Inline graphic. This group is analogous to [2]’s Test Group B. We include this test group in this experiment to benchmark our expanded model’s results against the original findings of [2], and to provide a second benchmark for the main test group of this paper. All communication in Inline graphic is mandatory and takes place the moment Vehicle Inline graphic makes its Inline graphic decision.

Inline graphic is the main test group of this paper. In this test group, Vehicle Inline graphic begins the interaction at Inline graphic by choosing whether to Inline graphic (signal intent as usual) or Inline graphic without a signal. In this scenario, Inline graphic is equivalent to Inline graphic and the Inline graphic in that it passes on the first decision to Vehicle Inline graphic. Choosing Inline graphic allows Vehicle Inline graphic to take control of the interaction by moving first. Communication in Inline graphic follows the stages outlined earlier in this section under Communication.

Each simulation group (Inline graphic, Inline graphic and Inline graphic) comprises ten simulations of 30,000 interactions each. Every interaction involves a unique instance of Vehicle Inline graphic and Vehicle Inline graphic. Each vehicle is spawned with attributes and preferences generated randomly from a uniform distribution of the preset ranges shown in Table 2. Each of the ten simulations uses a predefined random generator seed, which is repeated in all the three experiment groups. Using ten different random seeds per simulation group ensures that the findings are repeatable across different rolls of the randomiser dice. Reusing each random seed across all three experiment groups ensures that every resultant interaction has a corresponding mirror in the other experiment groups. That is, interactions can be paired and compared using pairwise statistics, such as the paired samples t-test.

The experiment is completed under two different rulesets. We use the same rulesets set out in the original model [2], which are briefly described below.

Ruleset 1 (Transparent): both vehicles have full knowledge of each other’s attributes and preferences, apart from the stochastic elements of Inline graphic and Inline graphic. This is a game of near-complete information, where the uncertainty is confined to these two elements and allows for the study of the effect of communication without noise.

Ruleset 2 (Blind): neither vehicle has any knowledge of the other’s attributes and preferences. They assume that their opponent has the same attributes as they do. The only accurate information that’s available is on the other vehicle’s velocity and position. This game of incomplete information allows for the study of communication in a more noisy/uncertain setting.

A visual representation of the simulation suite is shown in Fig. 3.

Fig. 3.

Fig. 3

Schematic of the composition of the simulation suite

Hardware and software requirement

The simulations are conducted in a purpose-built simulation suite developed in Python 3.11.0 by the authors and run on a Windows 11 PC with a 2.9-GHz, six-core processor. Please refer to the Data availability section under Statements and declarations for the source code.

Results

All simulations were completed successfully, with no aborted or incomplete runs. The results from the simulations are aggregated and presented in Table 6. Overall, the simulations produced interactions where vehicles behaved according to their characteristics, preferences and physical positioning. Vehicles also generally favoured safer interactions and avoided taking catastrophic risks. There were no recorded crashes under Ruleset 1, and the average number of recorded crashes under Ruleset 2 was 113 crashes per 300,000 interactions (0.04%). The distribution of starting conditions was balanced, as most scenarios resulted in a relatively even split between Inline graphic and Inline graphic outcomes. (average 55.54% and 44.46%, respectively across both rulesets). Non-ideal outcomes, i.e. Inline graphic and Inline graphic (where applicable) were low in number, but non-trivial (1.89% and 0.97%, respectively across both rulesets).

Table 6.

Summary of simulation results

Metric Ruleset 1 (transparent) Ruleset 2 (blind)
Control Group I Group II Control Group I Group II
allow/join* 49.36% 49.76% 63.07% 49.93% 50.39% 64.91%
allow/wait 1.96% 1.57% 1.62% 2.37% 1.91% 1.92%
block/join 1.15% 0.67% 0.00% 2.20% 1.82% 0.00%
block/wait 47.52% 48.00% 35.30% 45.50% 45.89% 33.17%
Near misses 0.44% 0.14% 0.08% 0.63% 0.43% 0.19%
Crashes 0.00% 0.00% 0.00% 0.07% 0.04% 0.01%
Average payoff (vehicle Inline graphic) − 0.751 − 0.724 − 0.758 − 1.059 − 0.952 − 0.809
One-tailed paired t-test (vs Control) < 0.01 < 0.01 < 0.01 < 0.01
One-tailed paired t-test (vs Group I) < 0.01 < 0.01
Average payoff (vehicle Inline graphic) − 0.511 − 0.492 − 0.403 − 0.761 − 0.654 − 0.435
One-tailed paired t-test (vs Control) < 0.01 < 0.01 < 0.01 < 0.01
One-tailed paired t-test (vs Group I) < 0.01 < 0.01

*For test/group II, allow/join also includes forced joins, regardless of vehicle Inline graphic's Inline graphic

Ruleset 1 (Transparent)

Ruleset 1 produced safer and more efficient interactions than Ruleset 2. 2.63% of all interactions under Ruleset 1 had non-ideal outcomes (Inline graphic or Inline graphic). Ruleset 1 had no crashes and an average near-miss rate (defined as having a time headway of less than half a second at any point during the interaction) of 0.22%.    

Compared to the Inline graphic, Inline graphic came with a 68.4% decrease in near-misses and a 3.6% and 3.7% improvement of average utility (payoff) for Vehicle Inline graphic and Vehicle Inline graphic, respectively. These figures are statistically significant (Inline graphic < 0.01).  

Inline graphic of Ruleset 1 further improves the interaction outcomes, providing an 81.6% decrease in near misses on the Inline graphic, and a 21% improvement of average utility for Vehicle Inline graphic. Conversely, however, Inline graphic saw a 4.8% worsening of average utility for Vehicle Inline graphic compared to the Inline graphic. The figures are statistically significant against both the Inline graphic and Inline graphic (Inline graphic < 0.01).        

Ruleset 2 (Blind)

Ruleset 2 had a higher occurrence of non-ideal outcomes (4.08%) and near misses (0.78%). Unlike Ruleset 1, some interactions under Ruleset 2 resulted in a crash (0.07%). Furthermore, Ruleset 2 showed a markedly worse average utility for both Vehicle Inline graphic and Vehicle Inline graphic compared to Ruleset 1 (41.06% and 48.93% worse, respectively).

Inline graphic’s incidence of near misses saw a 31% reduction in near misses compared to the Inline graphic. Similarly, Inline graphic’s crash rate went down by 41%. The average utility improvement compared to the Inline graphic for Vehicle Inline graphic and Vehicle Inline graphic was 10% and 14%, respectively. The improvements are statistically significant (Inline graphic < 0.01).

Inline graphic continued this trend by further improving interaction safety and efficiency across the board. Thus, Inline graphic saw crashes and near misses reduced by 87% and 70%, respectively, compared to the Inline graphic. As for average utility, an improvement of 43% compared to the Inline graphic is seen for Vehicle Inline graphic. Unlike in Ruleset 1, Ruleset 2’s Inline graphic also improved Vehicle Inline graphic’s average utility by 15% compared to the Inline graphic.    

Discussion

We draw a comparison between our results and the results presented in the original paper [2]. We also compare between the different rulesets and simulation groups which form the experimental design of this paper, discuss the different patterns and trends that emerge and pit the results against our hypotheses to draw conclusions.

Comparison with the Results of the Original Paper [2]

Comparing Inline graphic and Inline graphic to their equivalent in [2], this work produces consistently similar results. The Inline graphic of both rulesets exhibited interaction outcomes that are on par with what is seen in the Control Groups of the original paper [2], if not slightly safer. The marginal increase in safety may be attributable to the addition of the extra interaction timestep described at the beginning of the Methods section, which allows more time for vehicles to react to one another. Similarly, Inline graphic under both rulesets corroborate the general trends seen in [2] in the equivalent Test Groups B.    

Both Rulesets produce progressively safer and more efficient interactions with better communication. Ruleset 2 returns the largest percent improvement in safety across the board compared to Ruleset 1. These findings corroborate and further reinforce the findings in [2]. Namely that communication improves interaction safety and vehicle payoffs in a statistically significant manner, and that the effect is more pronounced when information is more limited (as in Ruleset 2). We explore some of these findings in more detail below.

Effect on Interaction Advantage (Payoffs)

The main exception to the trend of improvement with communication is Vehicle Inline graphic’s average payoff under Ruleset 1. Inline graphic saw a statistically significant worsening compared to both the Inline graphic and Inline graphic. This suggests that Vehicle Inline graphic finds itself at a relative disadvantage when Inline graphic has the option to force a join, compared to the scenarios where it does not. This is especially evident when there are few other benefits to be gained (there were no crashes in any of the test groups under Ruleset 1, the reduction of which would have helped offset this disadvantage). Yet, the total interaction payoff (payoff of Inline graphic + payoff of Inline graphic) is higher in Inline graphic than Inline graphic and Inline graphic. This suggests that the interaction overall is more effective. Interestingly, however, Vehicle Inline graphic sees a significant improvement in its own payoff in Inline graphic compared to Inline graphic and Inline graphic under Ruleset 2. The reduction in other utility-damaging factors such as crashes raises Vehicle Inline graphic’s average payoff into a net improvement. This is an important result, since it shows that even when Vehicle Inline graphic engages in seemingly aggressive behaviour, benefits could be had for both parties in the interaction.        

In contrast, Vehicle Inline graphic’s average payoff significantly improved under both Rulesets in Inline graphic compared to Inline graphic and Inline graphic. This suggests that Vehicle Inline graphic can see an advantage from masking its intent and bullying its way into the merge. More broadly, Vehicle Inline graphic’s clear advantage over Vehicle Inline graphic under both Rulesets’ Inline graphic is a testament to what is known as the first mover advantage. In game theory, the first mover advantage is the advantage a player gains by being the first to carry out an action in a sequential game. An example of this from economic game theory is the competitive advantage gained by the first company to enter a certain market [63]. Our interaction design allows Vehicle Inline graphic such an advantage by limiting Vehicle Inline graphic’s response in return, i.e. Inline graphic can only Inline graphic a forced join but not Inline graphic it. When deceiving Vehicle Inline graphic into inaction by not advertising its intent, Vehicle Inline graphic secures its first mover advantage. The positive effect of this is evident in the rate of improvement Vehicle Inline graphic’s average payoff has between Inline graphic and Inline graphic compared to that of Vehicle Inline graphic. This rate of improvement averages just 5% for Vehicle Inline graphic across both Rulesets, whilst Vehicle Inline graphic sees a 26% improvement in turn. Thus, Vehicle Inline graphic secures a relative advantage. This advantage may carry important ramifications to the success of autonomous vehicles if they are able to capitalise on their inherent advantages in reaction time to secure the first move. Literature suggests that in Stackelberg Oligopoly games, an aggressive first move by the Leader Firm can often induce the rival Follower Firm to take a more ‘submissive’ action that favours the Leader Firm [64]. Of course, this also comes with its own pitfalls. By forcing the join, Vehicle Inline graphic is committing to joining ahead of Inline graphic. This commitment can often prove costly if Vehicle Inline graphic is unable to follow through with it. Indeed, we observe the negative impact of the failure to maintain a commitment in our own results. The average payoff for Vehicle Inline graphic in Inline graphic interactions is approximately 2.6 times worse than the average payoff in Inline graphic interactions in both rulesets.            

Effect on Interaction Safety

Given the findings in [2], we expected to see a reduction in the occurrence of crashes and near misses with communication. Our observations indeed demonstrate this trend. We see Ruleset 1’s Inline graphic reduce near misses by 68% compared to the Inline graphic, whilst Ruleset 2’s Inline graphic reduces crashes and near misses by 31% and 41%, respectively, compared to the Inline graphic. What is worthy of note is that compared to Inline graphic, Inline graphic in both rulesets delivered a far greater reduction in near misses under Ruleset 1 (82%) and more than double the reduction under Ruleset 2 (70% and 87% for near misses and crashes, respectively). This suggests a profound positive impact on interaction safety from making Inline graphic’s communication discretionary. This is an interesting observation, as one would expect the apparently more risk tolerant approach of choosing to force an interaction to have more dangerous consequences. We examined the data by comparing all 115 Inline graphic crashes which occurred under Ruleset 2, Inline graphic, against their mirror occurrences in Inline graphic. Whilst none resulted in a crash, we discovered that in 87 out of the 115 interactions in Inline graphic, Vehicle Inline graphic would have Inline graphic Vehicle Inline graphic to join in retrospect. This is the Stackelberg Oligopoly phenomenon described earlier in action; Vehicle Inline graphic’s aggressive first move coaxed Vehicle Inline graphic into more submissive behaviour, which in these 87 cases prevented a crash.

Effect on Interaction Efficiency (Non-ideal Outcomes)

We see a clear reduction in non-ideal outcomes as more information is made available via communication, especially with the Inline graphic outcome. We note the slight increase in Inline graphic outcomes in the Inline graphic runs compared to Inline graphic (+0.05% and + 0.01% under Rulesets 1 and 2, respectively), despite the over-all reduction compared to the Inline graphic. This can be explained by considering Vehicle Inline graphic’s added ability to change its mind in Inline graphic. That is, to back out of a forced join. If we exclude these interactions, the percentage of Inline graphic outcomes goes down to 1.25%. This is a more appropriate direct comparison since Vehicle Inline graphic does not have the option to change its mind in the other simulation groups.

Notable Observations

Despite the appeal and tangible benefit of masking its intent from Vehicle Inline graphic, Vehicle Inline graphic did not adopt Inline graphic as a pure strategy. In fact, Vehicle Inline graphic chose to signal its intent to Vehicle Inline graphic in 38% of all interactions across both rulesets. This means that Vehicle Inline graphic still found an advantage in communicating its intent under the right circumstances in a significant number of cases.

Interestingly, whilst it is noted that Ruleset 2 generally performed worse than Ruleset 1, the gap between the two rulesets is significantly reduced as more communication is introduced. For example, Ruleset 2’s Inline graphic’s average utility (for both vehicles) was 45% worse than Ruleset 1’s Inline graphic. This gap is narrowed to 32% in Inline graphic and down to just 7.25% in Inline graphic. This convergence in average utility suggests that intelligence on the other vehicle’s current state (in this case, Inline graphic and Inline graphic) can go a long way in counteracting the effect of having no knowledge of the other vehicle’s kinematic attributes and preferences. This finding should be treated with caution, however, as we have not conducted enough trials to test its sensitivity to preset parameters. Nevertheless, we can observe a clear correlation between communication and improved utility. This improvement is much more pronounced when at least one vehicle can choose when and if to communicate.

On Bayesian Statistics and Bounded Rationality

Whilst our model adopts Bayesian inference as the basis for vehicles’ decision-making, we acknowledge that this represents an idealised form of rationality. Bayesian inference is often used to solve games with limited information; thus, it is a form of addressing bounded rationality in and of itself. Indeed, human road users typically operate under conditions of bounded rationality, relying on heuristics, limited information, and satisficing strategies rather than precise probabilistic reasoning. Bayesian inference is ultimately a principled, mathematical approach in which humans generally do not engage—at least not on a conscious level. Nevertheless, Bayesian games do offer a transparent and principled method to represent uncertainty, capture the influence of available information on interaction outcomes, and formalise the updating of beliefs as new information comes to light. This makes Bayesian games particularly suitable as a modelling baseline, especially given its widespread adoption in autonomous driving research. Our use of Bayesian inference is thus not intended to suggest that human drivers are perfectly rational, but rather to provide a systematic and extensible foundation for comparing interaction outcomes under different communication strategies.

Conclusion

We set out in this paper to examine whether discretionary communication can enhance the outcome of road user interaction from a game-theoretic perspective. We investigated two hypotheses. First, that vehicles which communicate selectively achieve better payoffs than those which communicate unconditionally. Second, that within a non-cooperative game-theoretic framework, communication (even when selective) can yield safer and more efficient interactions.

Our experiments reinforce our previous findings that non-cooperative game theory is a viable framework to model the exchange of communication between road users [2]. Furthermore, our introduction of discretionary communication to allow the joining vehicle to gain first mover advantage has shown promising results. Namely that the joining vehicle is able to complete more interactions in its favour, thus demonstrating that there is advantage in masking one’s own intent under the right circumstances. We also find that by behaving more ‘aggressively’, the joining vehicle elicits more ‘submissive’ behaviour from the main-lane vehicle. This creates an emergent phenomenon where interaction safety is improved as conflicts are reduced. We conclude that non-cooperative communication can produce emergent benefits in safety and efficiency for all parties involved.

We amplified the occurrence rate of explicit communication in this experiment to facilitate comparison. Thus, future work should investigate the sensitivity of our results to the occurrence rate of explicit signals. Our work would also benefit from sensitivity analysis of factors such as the crash penalty, the wait penalty factor Inline graphic and the punitive sensitivity factor Inline graphic. By better understanding how the different variables influence the results of the simulation, one can draw wider conclusions on how human and autonomous preferences can shape an interaction. Real-world validation of parameters would also aid in supporting the assumptions of this model and expanding its application in real-world settings.    

In this paper, we only explored one aspect of deception: masking intent. Future iterations on the model should investigate other forms of deception, such as giving misleading information, exaggeration of vehicle capabilities or feigning inattention to discourage an opponent from action. With that said, there is a clear conceptual distinction between simply exercising discretion with what to communicate and actively communicating misleading information. This is particularly important in the context of autonomous vehicles. Unlike human drivers, autonomous vehicles are expected to conform to strict safety and transparency standards, which makes the intentional use of deception problematic from both regulatory and societal trust perspectives. Thus, although our model allows for the exploration of deceptive signalling as a theoretical construct, we emphasise that its application to autonomous vehicles must be framed within clear ethical boundaries and accountability structures.

Funding

This work was supported by the Engineering and Physical Sciences Research Council Doctoral Training Partnership, Grant No. EP/R513258/1.

Data Availability

The model source code and the generated data used to support the findings of this paper are available from the University of Leeds at 10.5518/1608.

Declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Siebinga O, Zgonnikov A, Abbink DA. Modelling communication-enabled traffic interactions. R Soc Open Sci. 2023;10(5):230537. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Bitar I, Crusat AS, Watling DP. Modelling implicit and explicit communication between road users from a non-cooperative game-theoretic perspective: an exploratory study. In: Proceedings of the 10th international conference on vehicle technology and intelligent transport systems—VEHITS. 2024, SciTePress: Angers, France. pp 456–64.
  • 3.Osborne MJ. An introduction to game theory. New York: Oxford University Press; 2003. p. 552. [Google Scholar]
  • 4.Saifuzzaman M, Zheng Z. Incorporating human-factors in car-following models: a review of recent developments and research needs. Transp Res Part C Emerg Technol. 2014;48:379–403. [Google Scholar]
  • 5.Zhang T et al. Car-following models: a multidisciplinary review. arXiv, 2023.
  • 6.Rahman M, et al. Review of microscopic lane-changing models and future research opportunities. IEEE Transp Intell Trans Syst. 2013;14(4):1942–56. [Google Scholar]
  • 7.Elvik R. A review of game-theoretic models of road user behaviour. Accid Anal Prev. 2014;62:388–96. [DOI] [PubMed] [Google Scholar]
  • 8.Ji A, Levinson D. A review of game theory models of lane changing. Transp Res A Transp Sci. 2020;16(3):1628–47. [Google Scholar]
  • 9.Gipps PG. A model for the structure of lane-changing decisions. Transp Res Part B Methodol. 1986;20(5):403–14. [Google Scholar]
  • 10.Hidas P. Modelling vehicle interactions in microscopic simulation of merging and weaving. Transp Res Part C Emerg Technol. 2005;13(1):37–62. [Google Scholar]
  • 11.Ahmed K et al. Models of freeway lane changing and gap acceptance behavior. In: Proceedings of the 13th International Symposium on Transportation and Traffic Theory. 1996. Lyon, France.
  • 12.Kesting A, Treiber M, Helbing D. General lane-changing model MOBIL for car-following models. Transp Res Rec. 2007;1999(1):86–94. [Google Scholar]
  • 13.Yu H, Tseng HE, Langari R. A human-like game theory-based controller for automatic lane changing. Transp Res Part C Emerg Technol. 2018;88:140–58. [Google Scholar]
  • 14.Kita H. A merging–giveway interaction model of cars in a merging section: a game theoretic analysis. Transp Res Part A Policy Pract. 1999;33(3):305–12. 10.1016/S0965-8564(98)00039-1. [Google Scholar]
  • 15.Liu et al. A game theoretical approach for modeling merging and yielding behavior at freeway on-ramp section. In: Proceedings of the 17th international symposium on transportation and traffic theory. 2007.
  • 16.Fisac J et al. Hierarchical Game-Theoretic Planning for Autonomous Vehicles In: 2019 International Conference on Robotics and Automation, ICRA 2019. 2018, Institute of Electrical and Electronics Engineers Inc.: Montreal. pp 9590–9596.
  • 17.Li N et al. Hierarchical reasoning game theory based approach for evaluation and testing of autonomous vehicle control systems. In: 2016 IEEE 55th conference on decision and control (CDC). 2016.
  • 18.Axelrod R, Hamilton WD. The evolution of cooperation. Science. 1981;211(4489):1390. [DOI] [PubMed] [Google Scholar]
  • 19.Kang K, Rakha HA. A repeated game freeway lane changing model. Sensors. 2020;20(6):1554. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Meng F et al. Dynamic decision making in lane change: game theory with receding horizon. In: 2016 UKACC 11th international conference on control (CONTROL). 2016.
  • 21.Iwamura Y, Tanimoto J. Complex traffic flow that allows as well as hampers lane-changing intrinsically contains social-dilemma structures. J Stat Mech Theory Exp. 2018;2018:023408. [Google Scholar]
  • 22.Bitar I, Watling D, Romano R. Sensitivity analysis of the spatial parameters in modelling the evolutionary interaction between autonomous vehicles and other road users. SN Comput Sci. 2023;4(4):336. [Google Scholar]
  • 23.Sent E-M. Rationality and bounded rationality: you can’t have one without the other. Eur J History Econ Thought. 2018;25(6):1370–86. [Google Scholar]
  • 24.Talebpour A, Mahmassani HS, Hamdar SH. Modeling lane-changing behavior in a connected environment: a game theory approach. Transp Res Procedia. 2015;7:420–40. [Google Scholar]
  • 25.Ali Y, et al. A game theory-based approach for modelling mandatory lane-changing behaviour in a connected environment. Transp Res Part C Emerg Technol. 2019;106:220–42. [Google Scholar]
  • 26.Bendor J, Swistak P. Types of evolutionary stability and the problem of cooperation. Proc Natl Acad Sci U S A. 1995;92(8):3596–600. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Nowak MA. Five rules for the evolution of cooperation. Science. 2006;314(5805):1560–3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Altman E, et al. The evolution of transport protocols: an evolutionary game perspective. Comput Netw. 2009;53(10):1751–9. [Google Scholar]
  • 29.Rubenstein DR, Kealey J. Cooperation, conflict, and the evolution of complex animal societies. Nat Educ Knowl. 2010; 78.
  • 30.He J, et al. Spatial games and the maintenance of cooperation in an asymmetric Hawk-Dove game. Chin Sci Bull. 2013;58(18):2248–54. [Google Scholar]
  • 31.Stewart AJ, Plotkin JB. From extortion to generosity, evolution in the Iterated Prisoner’s Dilemma. Proc Natl Acad Sci U S A. 2013;110(38):15348–53. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Fernández Domingos E, et al. Emerging cooperation in N-person iterated prisoner’s dilemma over dynamic complex networks. Comput Inform. 2017;36:493–516. [Google Scholar]
  • 33.Gilles RP, Mallozzi L, Messalli R. Emergent collaboration in social purpose games. arXiv [cs.GT], 2021.
  • 34.Orzan N et al. Emergent cooperation and deception in public good games. In: 2023 adaptive and learning agents workshop at AAMAS. 2023.
  • 35.Harris CM. Autonomous vehicle decision-making: should we be bio-inspired? in towards autonomous robotic systems. 2017. Cham: Springer International Publishing.
  • 36.Millard-Ball A. Pedestrians, autonomous vehicles, and cities. J Plann Educ Res. 2018;38(1):6–12. [Google Scholar]
  • 37.Sun H, Ge Y, Qu W. Greater prosociality toward other human drivers than autonomous vehicles: Human drivers’ discriminatory behavior in mixed traffic. Accid Anal Prev. 2024;203:107623. 10.1016/j.aap.2024.107623. [DOI] [PubMed] [Google Scholar]
  • 38.Bitar I, Watling D, Romano R. How can autonomous road vehicles coexist with human-driven vehicles? An evolutionary-game-theoretic perspective. In: Proceedings of the 8th international conference on vehicle technology and intelligent transport systems-VEHITS. 2022, SciTePress. pp 376–83.
  • 39.Dey D, Terken J. Pedestrian interaction with vehicles: roles of explicit and implicit communication. In: Proceedings of the 9th international conference on automotive user interfaces and interactive vehicular applications. 2017, Association for Computing Machinery: Oldenburg, Germany. pp 109–13.
  • 40.Harkin AM, Harkin KA, Petzoldt T. What to rely on—implicit communication between pedestrians and turning automated vehicles. Transp Res Part F Traffic Psychol Behav. 2023;98:297–317. [Google Scholar]
  • 41.Lee YM, et al. Road users rarely use explicit communication when interacting in today’s traffic: implications for automated vehicles. Cogn Technol Work. 2021;23(2):367–80. [Google Scholar]
  • 42.Lee YM, Sheppard E. The effect of motion and signalling on drivers’ ability to predict intentions of other road users. Accid Anal Prev. 2016;95:202–8. [DOI] [PubMed] [Google Scholar]
  • 43.Durlauf SN, Blume LE. Cheap Talk. In: Durlauf SN, Blume LE, editors. Game theory. London: Palgrave Macmillan UK; 2010. p. 38–47. [Google Scholar]
  • 44.Parikh P. The use of language. Stanford: CSLI Publications; 2001. [Google Scholar]
  • 45.Allott N. Game theory and communication. In: Benz A, Jäger G, van Rooij R, editors. Game theory and pragmatics. London: Palgrave Macmillan UK; 2006. p. 123–52. [Google Scholar]
  • 46.Brams SJ. Deception in 2 × 2 games. J Peace Sci. 1977;2(2):171–203. [Google Scholar]
  • 47.Tao Z, Zhu Q. A game-theoretic foundation of deception: knowledge acquisition and fundamental limits. ArXiv, 2018.
  • 48.Sarkadi Ş, et al. The evolution of deception. R Soc Open Sci. 2021;8(9):201032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Zagare FC. The Geneva Conference of 1954: a case of tacit deception. Int Stud Q. 1979;23(3):390–411. [Google Scholar]
  • 50.Fallis D, Lewis PJ. Animal deception and the content of signals. Stud Hist Philos Sci. 2021;87:114–24. [DOI] [PubMed] [Google Scholar]
  • 51.Adams ES, Caldwell RL. Deceptive communication in asymmetric fights of the stomatopod crustacean Gonodactylus bredini. Anim Behav. 1990;39(4):706–16. [Google Scholar]
  • 52.Ferguson-Walter K et al. Game theory for adaptive defensive cyber deception. In: Proceedings of the 6th annual symposium on hot topics in the science of security. 2019, Association for Computing Machinery: Nashville, Tennessee, USA. p. Article 4.
  • 53.Carroll TE, Grosu D. A game theoretic investigation of deception in network security. In: 2009 proceedings of 18th international conference on computer communications and networks. 2009.
  • 54.Jin PJ, et al. Bidirectional control characteristics of General Motors and optimal velocity car-following models: implications for coordinated driving in a connected vehicle environment. Transp Res Rec. 2013;2381(1):110–9. [Google Scholar]
  • 55.Bevrani K, Chung E. A safety adapted car following model for traffic safety studies. Adv Human Asp Road Rail Transp. 2012; 550–59.
  • 56.Yulong P, Huizhi X. The control mechanism of lane changing in jam condition. In: 2006 6th world congress on intelligent control and automation. 2006.
  • 57.Wang M, et al. Game theoretic approach for predictive lane-changing and car-following control. Transp Res Part C Emerg Technol. 2015;58:73–92. [Google Scholar]
  • 58.Joyce J. Bayes’ theorem. 2021 [cited 2024 2024-02-22]; Available from: https://plato.stanford.edu/archives/fall2021/entries/bayes-theorem/.
  • 59.Bokare PS, Maurya AK. Acceleration-deceleration behaviour of various vehicle types. Transp Res Procedia. 2017;25:4733–49. [Google Scholar]
  • 60.Finnegan P, Green P. Time to change lanes: a literature review. 1990.
  • 61.Salvucci DD, Liu A. The time course of a lane change: driver control and eye-movement behavior. Transp Res Part F Traffic Psychol Behav. 2002;5(2):123–32. [Google Scholar]
  • 62.AASHTO. A policy on geometric design of highways and streets, 6th edition. Washington.: American Association of State Highway and Transportation Officials; 2011. [Google Scholar]
  • 63.Tarver E. First mover: What it means, examples, and first mover advantages. [cited 2024 15 September]; Available from: https://www.investopedia.com/terms/f/firstmover.asp.
  • 64.Heifetz A. Commitment. In: Heifetz A, Yalon-Fortus J, editors. Game theory: interactive strategies in economics and management. Cambridge: Cambridge University Press; 2012. p. 333–52. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The model source code and the generated data used to support the findings of this paper are available from the University of Leeds at 10.5518/1608.


Articles from Sn Computer Science are provided here courtesy of Springer

RESOURCES