To Signal or Not to Signal? A Non-cooperative Game-Theoretic Approach to Discretionary Communication Between Road Users

Isam Bitar; Albert Solernou Crusat; Richard Romano; David Watling

doi:10.1007/s42979-025-04533-w

. 2025 Dec 17;7(1):3. doi: 10.1007/s42979-025-04533-w

To Signal or Not to Signal? A Non-cooperative Game-Theoretic Approach to Discretionary Communication Between Road Users

Isam Bitar ^1,^✉, Albert Solernou Crusat ¹, Richard Romano ¹, David Watling ¹

PMCID: PMC12711921 PMID: 41426655

Abstract

Reciprocal communication between road users is a vital element of road user interaction. Non-cooperative game theory is an effective framework for modelling and characterising communicative behaviour between road users, which enables the study of emergent benefits for both the issuer and recipient of communicative signals. In this paper, we introduce discretionary communication to gain an advantage over the other road user by masking one’s intent if beneficial to do so. We conduct a series of experiments with simulated interactions and compare interaction outcomes where communication is mandatory against those where communication is discretionary. Our findings further support the premise that non-cooperative game theory is an effective paradigm for modelling and producing emergent behaviours which benefit the network. Moreover, we see that including a layer of discretionary communication reaps benefits in interaction outcome to the communicator. It also provides benefits in safety to all parties involved above and beyond the benefits seen from mandatory communication.

Keywords: Game theory, Communication, Cheap talk, Non-cooperative games, Bayesian games, Emergent cooperation, Discretionary communication, First mover advantage

Introduction

The reciprocal interaction between road users in which they engage in competitive, cooperative, and communicative behaviours to negotiate priority and road space is an integral part of navigating the road network. Properly understanding and modelling these interactions is a growing field, especially as autonomous vehicles get closer to technological and market maturity.

Until recently, research on modelling communication as an active component of road user interaction rejected the game-theoretic approach due in part to its perception as a framework that does not lend itself to communicative behaviour. For example, the researchers in [1] relied on an underlying assumption of the existence of a shared goal between interacting players to model communicative behaviour. However, we have provided a proof of concept in [2] that non-cooperative game theory (where players only seek to maximise one’s own utility) can indeed provide a robust framework for modelling road user communication descriptively and prescriptively, without the need for an underlying shared goal.

In our recent paper [2], we concluded that issuing and receiving communication to influence interaction outcomes from a non-cooperative game-theoretic perspective could feasibly occur, and that it could produce emergent, population-wide benefits.

The current paper seeks to expand on that paradigm by exploring whether discretionary communication further improves a road user’s utility in an interaction. The original model confines communicative behaviour to the Main-Lane Vehicle Inline graphic to communicate its intent to the Joining Vehicle . It is implied that Vehicle makes its intent clear to Vehicle prior to the interaction, hence Vehicle is always the vehicle to move first. The enhanced model we propose in this paper allows Vehicle to make a discretionary choice between the behaviour implied in the original model (signalling the desire to join ahead) or to forego a signal in favour of a ‘surprise manoeuvre’ to attempt to force a lane change.

Thus, we aim to explore the validity of the following two hypotheses as part of this study.

Vehicles which engage in discretionary communication have an advantage (better payoff) over vehicles in the same situation which always communicate their intent to their opponents
Communication in a non-cooperative game-theoretic framework can make interactions safer (fewer crashes and dangerous interactions) and more efficient (better payoffs for all parties involved), even when this communication is optional

One way in which we can measure interaction efficiency is by studying the occurrence of non-ideal outcomes. Non-ideal outcomes are an important metric to measure in the context of road user communication, since they reflect either miscommunication or misreading of one or both road users in an interaction. Such outcomes are non-ideal because they often result in an action by one vehicle that is opposite to what the other intended. For example, at a T-junction, the main-road vehicle may choose to yield to the minor-road vehicle, which chooses not to accept right of way anyway. Thus, the major-road vehicle loses time and momentum, and the minor-road vehicle waits unnecessarily. As such, these outcomes often return worse payoffs to one or both vehicles than what would have been achieved if either vehicle took a different decision. Outcomes like these effectively leave some utility ‘on the table’, and are commonly referred to in game theory as Pareto inefficient [3]. We posit that access to better information (via communication) should incrementally reduce the occurrence of non-ideal outcomes. This concept is known as Pareto improvement.

To this end, we build on the experimental design we’ve developed in [2], where we simulate the interaction between vehicles in a non-cooperative game-theoretic setting. We develop one set of simulations which forces the agent vehicle into communicating its intent every time, and another which allows it to choose whether to communicate, based on its assessment of the benefit of doing so. By comparing the results, we can gauge the effect of discretionary communication on both the issuer and the recipient of communication, as well as the effect of this behaviour on the safety and efficiency of the interaction in general.

Literature Review

Historically, the domain of road user interaction has been left as an accessory to microsimulation models of different multi-agent driving scenarios, such as car following [4, 5] and lane changing [6]. In these models, road user interaction is often restricted to collision avoidance. Increasingly, game-theoretic models have emerged to take a more in-depth look at the interaction element itself, especially from an autonomous vehicle’s perspective [7, 8].

Lane-change modelling is a topic which has been explored in depth in the literature. Traditional lane changing models use preset rules to determine the necessity and feasibility of lane changing, irrespective of individual road users’ preferences or constraints [9, 10]. The general formula is that an incentive criterion competes with a safety criterion to dictate whether a lane change occurs. Incentive is often some form of speed or space advantage, whilst safety concerns the risk of collision. Such models at first assumed homogeneity amongst road users, hence the use of a global set of rules. Later models introduced some individuality to the lane change interaction [11–13]. For example, [12] introduced a ‘politeness’ factor which considers the disutility to the rest of the traffic population should the agent carry out a lane change. Conversely, [13] introduced an ‘aggressiveness’ factor which influences an agent’s preference for space over safety. Neither model, however, attempted to build a framework in which an agent advertises these preferences to other road users.

Increasingly, the premise of interaction between two or more agents has become the domain of game theory. One of the first to introduce a game-theoretic lane changing model is Kita [14]. Kita employed a simple, two-player non-cooperative game with complete information, where each player chooses from a set of two strategies, validated and calibrated against real-world data.

Later models extended the game-theoretic approach in several directions. Some built on Kita’s original framework by incorporating variation in vehicle kinematics and more robust payoff functions [15]. Others introduced hierarchical structures to capture interaction at multiple levels. For example, [16] separated long-horizon strategic reasoning from short-horizon tactical games, whilst [17] modelled bounded rationality through “level-k” reasoning, where agent sophistication in anticipating the opponent is layered recursively with every next level.

Another common approach is the adoption of repeated games. Repeated games allow behaviour to unfold across multiple stages, enabling history-dependent strategies such as reciprocity [18]. Kang and Rakha [19] followed this approach to capture ongoing tactical adjustment, whereas [20] applied a receding-horizon method where the game is rebuilt at each timestep without persistent memory or cumulative payoffs. The former enabled the emergence of reciprocal behaviours, whilst the latter emphasised interaction safety.

A more seldom explored but promising paradigm is evolutionary game theory. Iwamura and Tanimoto [21] combined evolutionary game theory with a cellular automaton to demonstrate how emergent stable strategies vary under varying traffic conditions. Bitar et al. [22] extended this line by analysing how spatial factors such as cluster size and vehicle range shape the evolution and success of emergent strategies.

Finally, extensive-form Bayesian games have gained traction as a means of capturing bounded rationality [23]. By allowing agents to update their beliefs about their opponents’ states and preferences, these models move closer to real-world conditions. Applications include [24] on mandatory versus discretionary lane changes, [25] on connected versus non-connected environments, and [2], where we introduced communication itself, through implicit and explicit signals, as an active component of interaction in a game-theoretic setting.

Thus, [2] adds road user interaction to the growing body of fields in which non-cooperative game theory is used to describe and explain emergent cooperative and communicative behaviour [18, 26–34]. This concept carries its own implications regarding interaction with autonomous vehicles, given the general tendency for humans to behave less cooperatively towards machines [35–37]. This means that there is a question to be raised on whether autonomous vehicles should consider if it is beneficial to advertise their intent to other road users. Indeed, evidence suggests that communication can be used to deceive other players when there is an asymmetry in available information [34]. We have previously shown that autonomous vehicles would need to perform better than human-driven vehicles in terms of interaction outcomes to survive in a mixed population [22, 38]. Therefore, exploring this aspect in the context of road user communication may be of use, especially in the broader context of autonomous vehicle interaction with human road users. This paper builds on that conclusion by looking at the operational/tactical level behaviour, such as discretionary signalling.

Research suggests that most instances of communication between road users are implicit [39–41]. However, explicit communication, less common as it may be, remains an emphatic conveyor of information and road user intent [42]. In our recent work [2], we conceptualised a non-cooperative game as a lane-change scenario in which a Joining Vehicle Inline graphic desires to change lanes ahead of a Main-Lane Vehicle . The paper concluded that both vehicles benefit from Vehicle issuing such helpful communication. It is important to note that neither vehicle earned nor lost payoff directly from this communication. That is, the communication did not have an intrinsic utilitarian value. In game theory, this form of communication is known as cheap talk, where providing and receiving information is free [43]. The paper’s model also assumed that the vehicles received and interpreted communication perfectly. In the real world, communication may be obscured, ignored, or misunderstood. In fact, Bayesian game-theoretic models exist that are entirely dedicated to the utterance, receipt and understanding of communication between players [44, 45]. Though such a paradigm would add interesting complexity to an interaction model, it was beyond the scope of that study. The paper also assumed that the Joining Vehicle Inline graphic would implicitly but unambiguously make its intent clear that it wishes to join. As such, the question of whether there’s merit to either vehicle to mask its intent from the other remains open.

Masking one’s intent may be considered a form of deception. Deception is a game-theoretic concept in which the deceiving player limits, distorts, or alters information about the game (usually one’s own attributes, preferences, actions or payoffs) to trick the opponent into taking action that favours the deceiver at the expense of the deceived [46, 47]. The topic of deception has been explored in various applications, including sociology [48], politics [49], animal behaviour [50, 51] and even cyber security [52, 53]. By masking its intent, the Joining Vehicle Inline graphic robs the Main-Lane Vehicle of the ability to anticipate (and potentially block) Vehicle ’s join attempt. To our knowledge, this concept is yet to be explored in the context of road user interaction.

We believe there is merit in investigating the effect of deception in road user communication in a non-cooperative game-theoretic setting. To date, research on the topic of road user communication has not explored this aspect. There is a particular interest in studying this concept in the context of the interaction between autonomous vehicles and other road users. Research shows that human road users are likely to behave less cooperatively towards autonomous vehicles. Perhaps, then, masking its intent may prove a useful tool in the autonomous vehicle’s toolbelt, which would help it navigate a potentially unfriendly environment. We aim to explore the feasibility of this form of behaviour in a non-cooperative game-theoretic setting, whether such behaviour would bring benefit to the agent vehicle, and what impact such behaviour may have on general interaction safety and efficiency.

Method

We conceptualise a discretionary lane change scenario between a Joining Vehicle Inline graphic and a Main-Lane Vehicle . In this section, we describe an expanded model which we built on top of the model we developed in [2]. In the original model, Vehicle moves first to allow or block , followed by Vehicle ’s response. In the present paper, this basic model is expanded to accommodate an additional, pre-emptive move by Vehicle Inline graphic , where may decide to signal its intent to join ahead or forego a signal to try to force a join instead.

We discuss in this section the game-theoretic interaction model, the kinematic framework used to play an interaction, the payoff functions, and the communication element, which is the focus of this paper. We then detail the experimental design adopted in this paper to study the hypotheses we outlined in the introduction.

We employ a simple ‘lane change’ operation based on the bi-directional General Motors Car Following Model [54]. Figure 1 illustrates the conceptual layout of the proposed scenario. We use the General Motors model for its relative simplicity and reliance on Newtonian kinematics. This allows us to easily identify and isolate individual parameters to aid in interaction development.

Fig. 1 — Conceptual layout of the interaction between main-lane vehicle and joining vehicle

The Game-Theoretic Interaction Model

The interaction model employed is a two-player, sequential, non-cooperative Bayesian game between Vehicle Inline graphic and Vehicle . The extensive form of the model (game tree) is illustrated in Fig. 2. Vehicle has two stochastic properties whose values are based on predetermined probabilities. These two properties are (possible values: ) and (possible values: ) . relates to Vehicle ’s awareness of and responsiveness to Vehicle Inline graphic ’s movement. Literature on attention in the context of game-theoretic interaction is scarce [4]. For example, [55] extends the Gipps car following model to incorporate differences in reaction time but falls short of modelling distraction. relates to whether Vehicle would attempt to punish a lane-change it did not agree to. This is a common concept in game-theoretic interaction models and manifests in different forms [6, 8]. Vehicle Inline graphic knows whether it is or (solid horizontal line between the branches) but does not know its level (dashed line between the branches). In game-theoretic sequential models, such properties are modelled as moves by Nature. is a third-party entity which selects the values of the stochastic elements of a sequential game based on predetermined probabilities. These elements create uncertainty about the information available to one or more players, which warrant the establishment of beliefs. Beliefs are probabilistic assumptions about one or more properties relevant to a player or the interaction. We elaborate further on this concept later in this section.

Fig. 2 — The sequential game between the main-lane vehicle and the Joining Vehicle (game tree)

Vehicle Inline graphic begins the interaction by choosing whether to its intent to Vehicle (e.g., turn signal) or attempt to the join without warning. If Vehicle chooses to , Vehicle may choose to Vehicle to join or its attempt. However, Vehicle does not get the opportunity to do so if the interaction is forced, or if Inline graphic ’s state is (note the absence of this step from the respective branches of the game tree). Once has made its first decision (if applicable), Vehicle chooses whether to ahead of or until has passed. If had previously attempted to the join, it may at this stage choose to its original decision and continue, or Inline graphic the manoeuvre and wait instead. Finally, Vehicle must take a further action if Vehicle chooses to or . This final action depends on Vehicle ’s and states. If ’s state is , it will adopt a trajectory when and a trajectory when . entails tailgating Vehicle to induce a negative headway utility (see Inline graphic in the Payoffs section). If Vehicle ’s state is or , it will Vehicle and continue moving as if it were in free flow and will not take measures to prevent a collision if one were imminent. This state only lasts for a finite amount of time, after which Vehicle employs or as appropriate.

The Kinematic Model

The movement model is a discretised approach, where at each timestep each vehicle’s kinematic properties are evaluated. Acceleration is governed by a modified version of the bi-directional General Motors Car Following Model [2, 54] subject to the relevant acceleration and deceleration constraints for each vehicle. The general formula is illustrated in Eq. (1). All other kinematic properties are governed by Newtonian equations of motion.

where Inline graphic is the acceleration of Vehicle at the start of the next timestep . is the velocity of Vehicle at the current timestep . is the distance between Vehicle and its car-following target at timestep . is the velocity difference between Vehicle and its target at timestep . is a sensitivity factor which governs Vehicle Inline graphic ’s acceleration rate to maintain car-following behaviour. Higher means more conservative movement. are parametric factors which in this study are set to 1.

Inline graphic is constrained by the agent vehicle’s acceleration preferences and physical limitations (see Table 2 for the value ranges used in this paper).

Table 2.

Values and value ranges used in the simulation (based on values used in [2])

Property/constant	Description	Value/value range
	Maximum comfortable acceleration	(0.20, 2.00)
	Maximum allowable acceleration	(2.50, 3.50) [59]
	Maximum comfortable deceleration	(− 0.50, − 1.50)
	Minimum acceptable time headway	(0.50, 3.50)
	Decision time	(0.50, 1.50)
	Punitive sensitivity factor (exclusive to )	(0.15, 0.35)
	Initial velocity	: (8, 18) ; : (4, 10)
	Initial distance between the vehicles	(14, 89)
	Desired velocity (exclusive to )	(0.75, 1.50) ×
	penalty reduction factor	(0.10, 0.20)
	Total number of distraction timesteps	10 (5–8 )
	Maximum number of timesteps	63 (31.5–34.5 )
	Lane change duration	5 [60, 61]
	General motors sensitivity factors	1
	Timestep (from onward)	0.5
	Phantom vehicle time headway *	4
	Crash penalty	− 250
	Maximum safe deceleration	− 4.5 [59, 62]

Open in a new tab

*Used in free-flow mode to simulate gradual return to initial/desired velocity

Vehicle Inline graphic ’s movement is governed by Eq. (2), derived from Newtonian equations of motion. It is the only acceleration value which is not governed by Eq. (1).

where Inline graphic is the acceleration of Vehicle at the start of the next timestep . is Vehicle ’s maximum acceleration. is the distance between Vehicle and Vehicle at timestep . , are Vehicle and Vehicle ’s velocities, respectively, at timestep . is the lane change duration.

A brief description of the model’s timesteps is outlined below.

= 0; begins the interaction the moment Vehicle takes its first action
= Vehicle ’s Decision-Reaction Time ; Vehicle reacts to Vehicle ’s initial action ( if , if )
= Vehicle ’s Decision-Reaction Time ; Vehicle takes its second decision ( if first decision was , if first decision was )
= ; Vehicle reacts to Vehicle ’s second decision ()
= 0.5 seconds; regular timesteps where the vehicles employ the modified General Motors Car Following Model as the interaction resolves
; the final timestep in an interaction, which is reached if a crash is detected, a predefined maximum duration is reached, or when all of the following interaction conclusion conditions [2] are met:
- If Vehicle ’s second action is or , Vehicle ’s current time headway is greater than or equal to Vehicle ’s minimum acceptable time headway

Payoff Functions

The payoff for each vehicle is a function of several components. These are listed and described below.

Ride Comfort

Ride comfort is best represented as the change in acceleration over time (jerk). Acceleration is a common element in game payoffs in the literature [13, 14, 24]. In this model, we base ride comfort on both the acceleration values at each timestep (with respect to the vehicle’s comfortable acceleration value Inline graphic ) and a measure of jerk over the interaction period (measured as the standard deviation of acceleration about its mean). The jerk element of the payoff calculation allows for a utilitarian distinction between different strategies. For example, a failed attempt where Vehicle accelerates then sharply decelerates will yield a worse payoff than a sustained sharp deceleration of equal magnitude.

Ride comfort Inline graphic applies to all interaction possibilities, and it is calculated as prescribed in Eq. (3).

where Inline graphic is the total count of the interaction’s timesteps. is the agent vehicle’s acceleration at timestep . is the mean of . , are the vehicle’s maximum comfortable acceleration and deceleration, respectively. is used in the denominator of the first term if , else . is the duration of timestep Inline graphic .

Time Headway

Time headway is another common metric in many interaction models and, together with time to collision (TTC), forms the basis of interaction safety [14, 56]. In this study, we define Inline graphic as a function of the minimum time headway achieved during the interaction with respect to the agent vehicle’s minimum acceptable headway . Time headway applies to all interactions where Vehicle chooses or attempts to join at any point. It is calculated as prescribed in Eq. (4).

where Inline graphic , are the current timestep and the set of all timesteps, respectively. is the disutility from being involved in a crash. is the vehicle’s time headway with respect to the lead vehicle at timestep . is the vehicle’s minimum acceptable time headway.

Speed Difference

Speed difference concerns the steady states before and after an interaction. It is often used as part of the incentive function which triggers a lane change [15] or as part of the reward for changing lanes [24, 57].

For Vehicle Inline graphic , the speed difference payoff is based on the difference between the vehicle’s initial and final (stable) velocities, where a lower final velocity (caused by following a slower ) brings a negative utility to . Similarly, if Vehicle opts to , becomes the difference between ’s desired velocity after the lane change and Inline graphic ’s stable velocity. This would mean that would incur a penalty if it is forced to join behind a Vehicle that is slower than Vehicle ’s target (desired) velocity. Speed difference is applicable to either vehicle when it is the lag vehicle. This is in a variation to the original application of Inline graphic in [2], where it also applied to Vehicle as a lead vehicle if at any point during the interaction it was forced to accelerate beyond its initial or desired velocities. We have opted to remove Vehicle ’s payoff function when it is the lead vehicle from the current study, since Vehicle Inline graphic ’s highest achieved velocity did not factor into its own , and so it did not seem to make sense that we consider this element for Vehicle but not for Vehicle . Thus, is calculated according to Eqs. (5) and (6) below.

where Inline graphic is Vehicle ’s initial (and stable) velocity, is Vehicle ’s desired velocity after the lane change.

Time Penalty

Many game-theoretic models consider some form of time penalty in their payoffs, typically represented as time spent in the undesirable lane [14, 15, 56]. In the proposed model, Vehicle Inline graphic is subject to when it chooses to . It is a function of the amount of time needs to wait for to pass before it can behind it. It is calculated as the time until is equal to or greater than ’s minimum acceptable time headway . If an interaction ends before this condition is met, the remainder is estimated based on Vehicle Inline graphic ’s velocity and position at the end of the interaction relative to Vehicle . The payoff value is then multiplied by a factor that is specific to that instance of . represents Vehicle ’s sensitivity to losing time. The higher, the less tolerant is to waiting its turn and the more likely it is to make riskier merges to avoid the wait. The calculation is mathematically represented in Eq. (7).

* Inline graphic is used here to prevent Vehicle from waiting indefinitely for a slower to pass

where Inline graphic is the total count of the interaction’s timesteps. , are Vehicle ’s time headway with respect to Vehicle at timestep and the final interaction timestep, respectively. is Vehicle ’s minimum acceptable time headway. , are Vehicle ’s and Vehicle ’s initial velocities, respectively. Inline graphic , are Vehicle ’s and Vehicle ’s final positions, respectively. is Vehicle ’s final velocity. is a sensitivity factor which represents Vehicle ’s tolerance to losing time.

Decision Making

The goal for each vehicle is rooted in fundamental non-cooperative game theory: to maximise one’s own payoff. The structure of the payoff functions ensures that the best interaction is no interaction (maximum possible payoff is zero). This would prevent vehicles from seeking conflict when one is unnecessary. Each vehicle will simulate an interaction with its opponent to establish expected payoffs using assumptions made about the opponent’s attributes, which are discussed later in the experimental design section. For Vehicle Inline graphic , this is done for both the and actions, then backward induction is used to determine the best action. For Vehicle , the expected payoffs for each of its actions are calculated for all ’s possible and combinations. Each is then multiplied by the appropriate probability value to produce the total expected payoff per action.

Table 1 provides a summary of the applicable payoff components depending on the action(s) taken.

Table 1.

Payoff composition for each vehicle given the appropriate action set

Action set(s)

Vehicle M

Vehicle J

Inline graphic

Open in a new tab

Inline graphic , are the probability that ’s state is and is , respectively

Communication

Our work in [2] provided a pathway for Vehicle Inline graphic to update its prior beliefs on Vehicle ’s stochastic states via receiving useful communication from Vehicle . The main contribution of this paper is to expand on that model by allowing Vehicle to choose whether to communicate its intent to join and explore the impact of this discretionary communication on the interaction outcome. Communication in this study takes place in three distinct stages.

Stage 0: Initial Communication from Vehicle to Vehicle

Before Vehicle Inline graphic begins the interaction, it will observe Vehicle ’s current state and process any communication Vehicle may engage in. We characterise eye contact as a form of (explicit) communication that is available at this stage. That is, Vehicle will perceive eye contact with Vehicle as an explicit signal that Vehicle Inline graphic has seen Vehicle . We discuss how this is quantified later in the experimental design.

Stage 1: Discretionary Communication from Vehicle to Vehicle

At the beginning of the interaction, Vehicle Inline graphic has the option to issue communication to Vehicle about its intent to join ahead. We have designed the game in a way where Vehicle is incentivised to force-join where possible. However, if Vehicle decides against forcing the join, it will its intent to Vehicle and await a response. This gives Vehicle Inline graphic the opportunity to or the request.

Stage 2: Further Communication from Vehicle to Vehicle

Following Vehicle Inline graphic ’s initial decision (), Vehicle will signal its intent explicitly and implicitly to Vehicle . We use the same signalling paradigm in this case as that in [2], which consists of a mixture of implicit signals (acceleration) and explicit signals (e.g., eye contact, a gesture, flashing of headlights). We elaborate further on these signals in the experimental design section.

Updating Beliefs: Bayesian Inference

Vehicle Inline graphic ’s and states are assigned upon the vehicle’s instantiation. We outline the base probabilities used for these states in the experimental design section. The base probabilities are known to Vehicle as prior beliefs. Vehicle will use the communication it receives from Vehicle in stages 0 and 2 to incrementally update these beliefs in accordance with Bayes’ Theorem [58].

where Inline graphic is the probability that is true given Observation (posterior belief), is the base probability that is true within the population (prior belief), is the probability of observing given that Property is true (likelihood), is the total probability of observing within the population

In the original model [2], Vehicle Inline graphic assigned a single Bayesian probability to both states. In the expanded model, Vehicle assigns a separate Bayesian probability to Vehicle ’s probability to be , or to be . We believe this improves the Bayesian inference process in the interaction.

Experimental Design

To properly investigate the impact of two-way discretionary communication on interaction outcomes, we run a simulation as a Inline graphic where neither vehicle engages in any form of explicit communication, nor does either vehicle read any implicit signals from the other. In addition, we run two test groups: and , each with a different style of communication.

Each vehicle’s attributes and kinematic conditions are generated from set ranges prior to the interaction itself. These are outlined and described in Table 2. The base probability values assigned to Vehicle Inline graphic ’s and states are outlined in Table 3. Table 4 gives a breakdown of the different communicable signals employed in this experiment and how these signals translate to Bayesian likelihoods. The likelihoods are in turn based on the signalling probabilities shown in Table 5.

Table 3.

The base probability for each stochastic property of Inline graphic (based on values used in [2])

Property/state	Base probability (J's Prior elief)
	0.75
	0.6
	0.5

Open in a new tab

Table 4.

Probabilities of Vehicle Inline graphic issuing various communicative signals

Signal category	Description	Probability of occurrence
Signal category	Description	Attentive cooperative	Attentive punitive	Distracted cooperative	Distracted punitive
Implicit: acceleration	alters its velocity as appropriate	1	1	0.5	0.5
Explicit: attention e.g. eye contact	makes eye contact with	0.9	0.9	0.05	0.05
Explicit: intention e.g. gestures	issues a cooperative signal (if )	0.8	0.2	0.1	0.05
Explicit: intention e.g. gestures	issues a threatening signal (if )	0.1	0.8	0.05	0.1

Open in a new tab

Table 5.

Breakdown of the likelihoods of each signal given vehicle Inline graphic 's different possible stochastic attributes

Signal category	Value	Description	*
Signal category	Value	Description	Attentive cooperative	Attentive punitive	Distracted cooperative	Distracted punitive
Implicit: accelera-tion	0	observes no acceleration from	0.05	0.05	0.55	0.55
	1	observes deceleration from	0.55	0.3	0.25	0.15
	− 1	observes acceleration from	0.4	0.65	0.2	0.3
Explicit: e.g. eye contact	0	is unable to make eye contact with	0.1	0.1	0.95	0.95
Explicit: e.g. eye contact	1	makes eye contact with	0.9	0.9	0.05	0.05
Explicit: e.g. gestures	0	does not observe an intention signal from	0.585	0.47	0.9275	0.9225
	1	observes a positive-intent signal from	0.36	0.09	0.045	0.0225
	− 1	observes a negative-intent signal from	0.055	0.44	0.0275	0.055

Open in a new tab

* Inline graphic values are based on results obtained from a pilot simulation run of 30,000 interactions

In the Inline graphic , Vehicle relies solely on the base probabilities of Vehicle ’s and states as prior beliefs. Furthermore, Vehicle does not engage in any action at . The interaction effectively begins at , at Vehicle ’s decision node.

In Inline graphic , Vehicle always advertises its intent to join and Vehicle employs the full suite of communication signals described in Table 4 during its first decision phase. Vehicle can interpret all signals issued implicitly or explicitly by Vehicle . As with the , the interaction effectively begins at Inline graphic . This group is analogous to [2]’s Test Group B. We include this test group in this experiment to benchmark our expanded model’s results against the original findings of [2], and to provide a second benchmark for the main test group of this paper. All communication in is mandatory and takes place the moment Vehicle Inline graphic makes its decision.

Inline graphic is the main test group of this paper. In this test group, Vehicle begins the interaction at by choosing whether to (signal intent as usual) or without a signal. In this scenario, is equivalent to and the in that it passes on the first decision to Vehicle . Choosing allows Vehicle Inline graphic to take control of the interaction by moving first. Communication in follows the stages outlined earlier in this section under Communication.

Each simulation group ( Inline graphic , and ) comprises ten simulations of 30,000 interactions each. Every interaction involves a unique instance of Vehicle and Vehicle . Each vehicle is spawned with attributes and preferences generated randomly from a uniform distribution of the preset ranges shown in Table 2. Each of the ten simulations uses a predefined random generator seed, which is repeated in all the three experiment groups. Using ten different random seeds per simulation group ensures that the findings are repeatable across different rolls of the randomiser dice. Reusing each random seed across all three experiment groups ensures that every resultant interaction has a corresponding mirror in the other experiment groups. That is, interactions can be paired and compared using pairwise statistics, such as the paired samples t-test.

The experiment is completed under two different rulesets. We use the same rulesets set out in the original model [2], which are briefly described below.

Ruleset 1 (Transparent): both vehicles have full knowledge of each other’s attributes and preferences, apart from the stochastic elements of Inline graphic and . This is a game of near-complete information, where the uncertainty is confined to these two elements and allows for the study of the effect of communication without noise.

Ruleset 2 (Blind): neither vehicle has any knowledge of the other’s attributes and preferences. They assume that their opponent has the same attributes as they do. The only accurate information that’s available is on the other vehicle’s velocity and position. This game of incomplete information allows for the study of communication in a more noisy/uncertain setting.

A visual representation of the simulation suite is shown in Fig. 3.

Hardware and software requirement

The simulations are conducted in a purpose-built simulation suite developed in Python 3.11.0 by the authors and run on a Windows 11 PC with a 2.9-GHz, six-core processor. Please refer to the Data availability section under Statements and declarations for the source code.

Results

All simulations were completed successfully, with no aborted or incomplete runs. The results from the simulations are aggregated and presented in Table 6. Overall, the simulations produced interactions where vehicles behaved according to their characteristics, preferences and physical positioning. Vehicles also generally favoured safer interactions and avoided taking catastrophic risks. There were no recorded crashes under Ruleset 1, and the average number of recorded crashes under Ruleset 2 was 113 crashes per 300,000 interactions (0.04%). The distribution of starting conditions was balanced, as most scenarios resulted in a relatively even split between Inline graphic and outcomes. (average 55.54% and 44.46%, respectively across both rulesets). Non-ideal outcomes, i.e. and (where applicable) were low in number, but non-trivial (1.89% and 0.97%, respectively across both rulesets).

Table 6.

Summary of simulation results

Metric	Ruleset 1 (transparent)			Ruleset 2 (blind)
Metric	Control	Group I	Group II	Control	Group I	Group II
allow/join*	49.36%	49.76%	63.07%	49.93%	50.39%	64.91%
allow/wait	1.96%	1.57%	1.62%	2.37%	1.91%	1.92%
block/join	1.15%	0.67%	0.00%	2.20%	1.82%	0.00%
block/wait	47.52%	48.00%	35.30%	45.50%	45.89%	33.17%
Near misses	0.44%	0.14%	0.08%	0.63%	0.43%	0.19%
Crashes	0.00%	0.00%	0.00%	0.07%	0.04%	0.01%
Average payoff (vehicle )	− 0.751	− 0.724	− 0.758	− 1.059	− 0.952	− 0.809
One-tailed paired t-test (vs Control)	–	< 0.01	< 0.01	–	< 0.01	< 0.01
One-tailed paired t-test (vs Group I)	–	–	< 0.01	–	–	< 0.01
Average payoff (vehicle )	− 0.511	− 0.492	− 0.403	− 0.761	− 0.654	− 0.435
One-tailed paired t-test (vs Control)	–	< 0.01	< 0.01	–	< 0.01	< 0.01
One-tailed paired t-test (vs Group I)	–	–	< 0.01	–	–	< 0.01

Open in a new tab

*For test/group II, allow/join also includes forced joins, regardless of vehicle Inline graphic 's

Ruleset 1 (Transparent)

Ruleset 1 produced safer and more efficient interactions than Ruleset 2. 2.63% of all interactions under Ruleset 1 had non-ideal outcomes ( Inline graphic or ). Ruleset 1 had no crashes and an average near-miss rate (defined as having a time headway of less than half a second at any point during the interaction) of 0.22%.

Compared to the Inline graphic , came with a 68.4% decrease in near-misses and a 3.6% and 3.7% improvement of average utility (payoff) for Vehicle and Vehicle , respectively. These figures are statistically significant ( < 0.01).

Inline graphic of Ruleset 1 further improves the interaction outcomes, providing an 81.6% decrease in near misses on the , and a 21% improvement of average utility for Vehicle . Conversely, however, saw a 4.8% worsening of average utility for Vehicle compared to the . The figures are statistically significant against both the Inline graphic and ( < 0.01).

Ruleset 2 (Blind)

Ruleset 2 had a higher occurrence of non-ideal outcomes (4.08%) and near misses (0.78%). Unlike Ruleset 1, some interactions under Ruleset 2 resulted in a crash (0.07%). Furthermore, Ruleset 2 showed a markedly worse average utility for both Vehicle Inline graphic and Vehicle compared to Ruleset 1 (41.06% and 48.93% worse, respectively).

Inline graphic ’s incidence of near misses saw a 31% reduction in near misses compared to the . Similarly, ’s crash rate went down by 41%. The average utility improvement compared to the for Vehicle and Vehicle was 10% and 14%, respectively. The improvements are statistically significant ( < 0.01).

Inline graphic continued this trend by further improving interaction safety and efficiency across the board. Thus, saw crashes and near misses reduced by 87% and 70%, respectively, compared to the . As for average utility, an improvement of 43% compared to the is seen for Vehicle . Unlike in Ruleset 1, Ruleset 2’s Inline graphic also improved Vehicle ’s average utility by 15% compared to the .

Discussion

We draw a comparison between our results and the results presented in the original paper [2]. We also compare between the different rulesets and simulation groups which form the experimental design of this paper, discuss the different patterns and trends that emerge and pit the results against our hypotheses to draw conclusions.

Comparison with the Results of the Original Paper [2]

Comparing Inline graphic and to their equivalent in [2], this work produces consistently similar results. The of both rulesets exhibited interaction outcomes that are on par with what is seen in the Control Groups of the original paper [2], if not slightly safer. The marginal increase in safety may be attributable to the addition of the extra interaction timestep described at the beginning of the Methods section, which allows more time for vehicles to react to one another. Similarly, Inline graphic under both rulesets corroborate the general trends seen in [2] in the equivalent Test Groups B.

Both Rulesets produce progressively safer and more efficient interactions with better communication. Ruleset 2 returns the largest percent improvement in safety across the board compared to Ruleset 1. These findings corroborate and further reinforce the findings in [2]. Namely that communication improves interaction safety and vehicle payoffs in a statistically significant manner, and that the effect is more pronounced when information is more limited (as in Ruleset 2). We explore some of these findings in more detail below.

Effect on Interaction Advantage (Payoffs)

The main exception to the trend of improvement with communication is Vehicle Inline graphic ’s average payoff under Ruleset 1. saw a statistically significant worsening compared to both the and . This suggests that Vehicle finds itself at a relative disadvantage when has the option to force a join, compared to the scenarios where it does not. This is especially evident when there are few other benefits to be gained (there were no crashes in any of the test groups under Ruleset 1, the reduction of which would have helped offset this disadvantage). Yet, the total interaction payoff (payoff of Inline graphic + payoff of ) is higher in than and . This suggests that the interaction overall is more effective. Interestingly, however, Vehicle sees a significant improvement in its own payoff in compared to and under Ruleset 2. The reduction in other utility-damaging factors such as crashes raises Vehicle Inline graphic ’s average payoff into a net improvement. This is an important result, since it shows that even when Vehicle engages in seemingly aggressive behaviour, benefits could be had for both parties in the interaction.

In contrast, Vehicle Inline graphic ’s average payoff significantly improved under both Rulesets in compared to and . This suggests that Vehicle can see an advantage from masking its intent and bullying its way into the merge. More broadly, Vehicle ’s clear advantage over Vehicle under both Rulesets’ is a testament to what is known as the first mover advantage. In game theory, the first mover advantage is the advantage a player gains by being the first to carry out an action in a sequential game. An example of this from economic game theory is the competitive advantage gained by the first company to enter a certain market [63]. Our interaction design allows Vehicle Inline graphic such an advantage by limiting Vehicle ’s response in return, i.e. can only a forced join but not it. When deceiving Vehicle into inaction by not advertising its intent, Vehicle secures its first mover advantage. The positive effect of this is evident in the rate of improvement Vehicle Inline graphic ’s average payoff has between and compared to that of Vehicle . This rate of improvement averages just 5% for Vehicle across both Rulesets, whilst Vehicle sees a 26% improvement in turn. Thus, Vehicle secures a relative advantage. This advantage may carry important ramifications to the success of autonomous vehicles if they are able to capitalise on their inherent advantages in reaction time to secure the first move. Literature suggests that in Stackelberg Oligopoly games, an aggressive first move by the Leader Firm can often induce the rival Follower Firm to take a more ‘submissive’ action that favours the Leader Firm [64]. Of course, this also comes with its own pitfalls. By forcing the join, Vehicle Inline graphic is committing to joining ahead of . This commitment can often prove costly if Vehicle is unable to follow through with it. Indeed, we observe the negative impact of the failure to maintain a commitment in our own results. The average payoff for Vehicle in interactions is approximately 2.6 times worse than the average payoff in Inline graphic interactions in both rulesets.

Effect on Interaction Safety

Given the findings in [2], we expected to see a reduction in the occurrence of crashes and near misses with communication. Our observations indeed demonstrate this trend. We see Ruleset 1’s Inline graphic reduce near misses by 68% compared to the , whilst Ruleset 2’s reduces crashes and near misses by 31% and 41%, respectively, compared to the . What is worthy of note is that compared to , in both rulesets delivered a far greater reduction in near misses under Ruleset 1 (82%) and more than double the reduction under Ruleset 2 (70% and 87% for near misses and crashes, respectively). This suggests a profound positive impact on interaction safety from making Inline graphic ’s communication discretionary. This is an interesting observation, as one would expect the apparently more risk tolerant approach of choosing to force an interaction to have more dangerous consequences. We examined the data by comparing all 115 crashes which occurred under Ruleset 2, Inline graphic , against their mirror occurrences in . Whilst none resulted in a crash, we discovered that in 87 out of the 115 interactions in , Vehicle would have Vehicle to join in retrospect. This is the Stackelberg Oligopoly phenomenon described earlier in action; Vehicle ’s aggressive first move coaxed Vehicle Inline graphic into more submissive behaviour, which in these 87 cases prevented a crash.

Effect on Interaction Efficiency (Non-ideal Outcomes)

We see a clear reduction in non-ideal outcomes as more information is made available via communication, especially with the Inline graphic outcome. We note the slight increase in outcomes in the runs compared to (+0.05% and + 0.01% under Rulesets 1 and 2, respectively), despite the over-all reduction compared to the . This can be explained by considering Vehicle ’s added ability to change its mind in . That is, to back out of a forced join. If we exclude these interactions, the percentage of Inline graphic outcomes goes down to 1.25%. This is a more appropriate direct comparison since Vehicle does not have the option to change its mind in the other simulation groups.

Notable Observations

Despite the appeal and tangible benefit of masking its intent from Vehicle Inline graphic , Vehicle did not adopt as a pure strategy. In fact, Vehicle chose to signal its intent to Vehicle in 38% of all interactions across both rulesets. This means that Vehicle still found an advantage in communicating its intent under the right circumstances in a significant number of cases.

Interestingly, whilst it is noted that Ruleset 2 generally performed worse than Ruleset 1, the gap between the two rulesets is significantly reduced as more communication is introduced. For example, Ruleset 2’s Inline graphic ’s average utility (for both vehicles) was 45% worse than Ruleset 1’s . This gap is narrowed to 32% in and down to just 7.25% in . This convergence in average utility suggests that intelligence on the other vehicle’s current state (in this case, and ) can go a long way in counteracting the effect of having no knowledge of the other vehicle’s kinematic attributes and preferences. This finding should be treated with caution, however, as we have not conducted enough trials to test its sensitivity to preset parameters. Nevertheless, we can observe a clear correlation between communication and improved utility. This improvement is much more pronounced when at least one vehicle can choose when and if to communicate.

On Bayesian Statistics and Bounded Rationality

Whilst our model adopts Bayesian inference as the basis for vehicles’ decision-making, we acknowledge that this represents an idealised form of rationality. Bayesian inference is often used to solve games with limited information; thus, it is a form of addressing bounded rationality in and of itself. Indeed, human road users typically operate under conditions of bounded rationality, relying on heuristics, limited information, and satisficing strategies rather than precise probabilistic reasoning. Bayesian inference is ultimately a principled, mathematical approach in which humans generally do not engage—at least not on a conscious level. Nevertheless, Bayesian games do offer a transparent and principled method to represent uncertainty, capture the influence of available information on interaction outcomes, and formalise the updating of beliefs as new information comes to light. This makes Bayesian games particularly suitable as a modelling baseline, especially given its widespread adoption in autonomous driving research. Our use of Bayesian inference is thus not intended to suggest that human drivers are perfectly rational, but rather to provide a systematic and extensible foundation for comparing interaction outcomes under different communication strategies.

Conclusion

We set out in this paper to examine whether discretionary communication can enhance the outcome of road user interaction from a game-theoretic perspective. We investigated two hypotheses. First, that vehicles which communicate selectively achieve better payoffs than those which communicate unconditionally. Second, that within a non-cooperative game-theoretic framework, communication (even when selective) can yield safer and more efficient interactions.

Our experiments reinforce our previous findings that non-cooperative game theory is a viable framework to model the exchange of communication between road users [2]. Furthermore, our introduction of discretionary communication to allow the joining vehicle to gain first mover advantage has shown promising results. Namely that the joining vehicle is able to complete more interactions in its favour, thus demonstrating that there is advantage in masking one’s own intent under the right circumstances. We also find that by behaving more ‘aggressively’, the joining vehicle elicits more ‘submissive’ behaviour from the main-lane vehicle. This creates an emergent phenomenon where interaction safety is improved as conflicts are reduced. We conclude that non-cooperative communication can produce emergent benefits in safety and efficiency for all parties involved.

We amplified the occurrence rate of explicit communication in this experiment to facilitate comparison. Thus, future work should investigate the sensitivity of our results to the occurrence rate of explicit signals. Our work would also benefit from sensitivity analysis of factors such as the crash penalty, the wait penalty factor Inline graphic and the punitive sensitivity factor . By better understanding how the different variables influence the results of the simulation, one can draw wider conclusions on how human and autonomous preferences can shape an interaction. Real-world validation of parameters would also aid in supporting the assumptions of this model and expanding its application in real-world settings.

In this paper, we only explored one aspect of deception: masking intent. Future iterations on the model should investigate other forms of deception, such as giving misleading information, exaggeration of vehicle capabilities or feigning inattention to discourage an opponent from action. With that said, there is a clear conceptual distinction between simply exercising discretion with what to communicate and actively communicating misleading information. This is particularly important in the context of autonomous vehicles. Unlike human drivers, autonomous vehicles are expected to conform to strict safety and transparency standards, which makes the intentional use of deception problematic from both regulatory and societal trust perspectives. Thus, although our model allows for the exploration of deceptive signalling as a theoretical construct, we emphasise that its application to autonomous vehicles must be framed within clear ethical boundaries and accountability structures.

Funding

This work was supported by the Engineering and Physical Sciences Research Council Doctoral Training Partnership, Grant No. EP/R513258/1.

Data Availability

The model source code and the generated data used to support the findings of this paper are available from the University of Leeds at 10.5518/1608.

Declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1.Siebinga O, Zgonnikov A, Abbink DA. Modelling communication-enabled traffic interactions. R Soc Open Sci. 2023;10(5):230537. [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Bitar I, Crusat AS, Watling DP. Modelling implicit and explicit communication between road users from a non-cooperative game-theoretic perspective: an exploratory study. In: Proceedings of the 10th international conference on vehicle technology and intelligent transport systems—VEHITS. 2024, SciTePress: Angers, France. pp 456–64.
3.Osborne MJ. An introduction to game theory. New York: Oxford University Press; 2003. p. 552. [Google Scholar]
4.Saifuzzaman M, Zheng Z. Incorporating human-factors in car-following models: a review of recent developments and research needs. Transp Res Part C Emerg Technol. 2014;48:379–403. [Google Scholar]
5.Zhang T et al. Car-following models: a multidisciplinary review. arXiv, 2023.
6.Rahman M, et al. Review of microscopic lane-changing models and future research opportunities. IEEE Transp Intell Trans Syst. 2013;14(4):1942–56. [Google Scholar]
7.Elvik R. A review of game-theoretic models of road user behaviour. Accid Anal Prev. 2014;62:388–96. [DOI] [PubMed] [Google Scholar]
8.Ji A, Levinson D. A review of game theory models of lane changing. Transp Res A Transp Sci. 2020;16(3):1628–47. [Google Scholar]
9.Gipps PG. A model for the structure of lane-changing decisions. Transp Res Part B Methodol. 1986;20(5):403–14. [Google Scholar]
10.Hidas P. Modelling vehicle interactions in microscopic simulation of merging and weaving. Transp Res Part C Emerg Technol. 2005;13(1):37–62. [Google Scholar]
11.Ahmed K et al. Models of freeway lane changing and gap acceptance behavior. In: Proceedings of the 13th International Symposium on Transportation and Traffic Theory. 1996. Lyon, France.
12.Kesting A, Treiber M, Helbing D. General lane-changing model MOBIL for car-following models. Transp Res Rec. 2007;1999(1):86–94. [Google Scholar]
13.Yu H, Tseng HE, Langari R. A human-like game theory-based controller for automatic lane changing. Transp Res Part C Emerg Technol. 2018;88:140–58. [Google Scholar]
14.Kita H. A merging–giveway interaction model of cars in a merging section: a game theoretic analysis. Transp Res Part A Policy Pract. 1999;33(3):305–12. 10.1016/S0965-8564(98)00039-1. [Google Scholar]
15.Liu et al. A game theoretical approach for modeling merging and yielding behavior at freeway on-ramp section. In: Proceedings of the 17th international symposium on transportation and traffic theory. 2007.
16.Fisac J et al. Hierarchical Game-Theoretic Planning for Autonomous Vehicles In: 2019 International Conference on Robotics and Automation, ICRA 2019. 2018, Institute of Electrical and Electronics Engineers Inc.: Montreal. pp 9590–9596.
17.Li N et al. Hierarchical reasoning game theory based approach for evaluation and testing of autonomous vehicle control systems. In: 2016 IEEE 55th conference on decision and control (CDC). 2016.
18.Axelrod R, Hamilton WD. The evolution of cooperation. Science. 1981;211(4489):1390. [DOI] [PubMed] [Google Scholar]
19.Kang K, Rakha HA. A repeated game freeway lane changing model. Sensors. 2020;20(6):1554. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Meng F et al. Dynamic decision making in lane change: game theory with receding horizon. In: 2016 UKACC 11th international conference on control (CONTROL). 2016.
21.Iwamura Y, Tanimoto J. Complex traffic flow that allows as well as hampers lane-changing intrinsically contains social-dilemma structures. J Stat Mech Theory Exp. 2018;2018:023408. [Google Scholar]
22.Bitar I, Watling D, Romano R. Sensitivity analysis of the spatial parameters in modelling the evolutionary interaction between autonomous vehicles and other road users. SN Comput Sci. 2023;4(4):336. [Google Scholar]
23.Sent E-M. Rationality and bounded rationality: you can’t have one without the other. Eur J History Econ Thought. 2018;25(6):1370–86. [Google Scholar]
24.Talebpour A, Mahmassani HS, Hamdar SH. Modeling lane-changing behavior in a connected environment: a game theory approach. Transp Res Procedia. 2015;7:420–40. [Google Scholar]
25.Ali Y, et al. A game theory-based approach for modelling mandatory lane-changing behaviour in a connected environment. Transp Res Part C Emerg Technol. 2019;106:220–42. [Google Scholar]
26.Bendor J, Swistak P. Types of evolutionary stability and the problem of cooperation. Proc Natl Acad Sci U S A. 1995;92(8):3596–600. [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Nowak MA. Five rules for the evolution of cooperation. Science. 2006;314(5805):1560–3. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Altman E, et al. The evolution of transport protocols: an evolutionary game perspective. Comput Netw. 2009;53(10):1751–9. [Google Scholar]
29.Rubenstein DR, Kealey J. Cooperation, conflict, and the evolution of complex animal societies. Nat Educ Knowl. 2010; 78.
30.He J, et al. Spatial games and the maintenance of cooperation in an asymmetric Hawk-Dove game. Chin Sci Bull. 2013;58(18):2248–54. [Google Scholar]
31.Stewart AJ, Plotkin JB. From extortion to generosity, evolution in the Iterated Prisoner’s Dilemma. Proc Natl Acad Sci U S A. 2013;110(38):15348–53. [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Fernández Domingos E, et al. Emerging cooperation in N-person iterated prisoner’s dilemma over dynamic complex networks. Comput Inform. 2017;36:493–516. [Google Scholar]
33.Gilles RP, Mallozzi L, Messalli R. Emergent collaboration in social purpose games. arXiv [cs.GT], 2021.
34.Orzan N et al. Emergent cooperation and deception in public good games. In: 2023 adaptive and learning agents workshop at AAMAS. 2023.
35.Harris CM. Autonomous vehicle decision-making: should we be bio-inspired? in towards autonomous robotic systems. 2017. Cham: Springer International Publishing.
36.Millard-Ball A. Pedestrians, autonomous vehicles, and cities. J Plann Educ Res. 2018;38(1):6–12. [Google Scholar]
37.Sun H, Ge Y, Qu W. Greater prosociality toward other human drivers than autonomous vehicles: Human drivers’ discriminatory behavior in mixed traffic. Accid Anal Prev. 2024;203:107623. 10.1016/j.aap.2024.107623. [DOI] [PubMed] [Google Scholar]
38.Bitar I, Watling D, Romano R. How can autonomous road vehicles coexist with human-driven vehicles? An evolutionary-game-theoretic perspective. In: Proceedings of the 8th international conference on vehicle technology and intelligent transport systems-VEHITS. 2022, SciTePress. pp 376–83.
39.Dey D, Terken J. Pedestrian interaction with vehicles: roles of explicit and implicit communication. In: Proceedings of the 9th international conference on automotive user interfaces and interactive vehicular applications. 2017, Association for Computing Machinery: Oldenburg, Germany. pp 109–13.
40.Harkin AM, Harkin KA, Petzoldt T. What to rely on—implicit communication between pedestrians and turning automated vehicles. Transp Res Part F Traffic Psychol Behav. 2023;98:297–317. [Google Scholar]
41.Lee YM, et al. Road users rarely use explicit communication when interacting in today’s traffic: implications for automated vehicles. Cogn Technol Work. 2021;23(2):367–80. [Google Scholar]
42.Lee YM, Sheppard E. The effect of motion and signalling on drivers’ ability to predict intentions of other road users. Accid Anal Prev. 2016;95:202–8. [DOI] [PubMed] [Google Scholar]
43.Durlauf SN, Blume LE. Cheap Talk. In: Durlauf SN, Blume LE, editors. Game theory. London: Palgrave Macmillan UK; 2010. p. 38–47. [Google Scholar]
44.Parikh P. The use of language. Stanford: CSLI Publications; 2001. [Google Scholar]
45.Allott N. Game theory and communication. In: Benz A, Jäger G, van Rooij R, editors. Game theory and pragmatics. London: Palgrave Macmillan UK; 2006. p. 123–52. [Google Scholar]
46.Brams SJ. Deception in 2 × 2 games. J Peace Sci. 1977;2(2):171–203. [Google Scholar]
47.Tao Z, Zhu Q. A game-theoretic foundation of deception: knowledge acquisition and fundamental limits. ArXiv, 2018.
48.Sarkadi Ş, et al. The evolution of deception. R Soc Open Sci. 2021;8(9):201032. [DOI] [PMC free article] [PubMed] [Google Scholar]
49.Zagare FC. The Geneva Conference of 1954: a case of tacit deception. Int Stud Q. 1979;23(3):390–411. [Google Scholar]
50.Fallis D, Lewis PJ. Animal deception and the content of signals. Stud Hist Philos Sci. 2021;87:114–24. [DOI] [PubMed] [Google Scholar]
51.Adams ES, Caldwell RL. Deceptive communication in asymmetric fights of the stomatopod crustacean Gonodactylus bredini. Anim Behav. 1990;39(4):706–16. [Google Scholar]
52.Ferguson-Walter K et al. Game theory for adaptive defensive cyber deception. In: Proceedings of the 6th annual symposium on hot topics in the science of security. 2019, Association for Computing Machinery: Nashville, Tennessee, USA. p. Article 4.
53.Carroll TE, Grosu D. A game theoretic investigation of deception in network security. In: 2009 proceedings of 18th international conference on computer communications and networks. 2009.
54.Jin PJ, et al. Bidirectional control characteristics of General Motors and optimal velocity car-following models: implications for coordinated driving in a connected vehicle environment. Transp Res Rec. 2013;2381(1):110–9. [Google Scholar]
55.Bevrani K, Chung E. A safety adapted car following model for traffic safety studies. Adv Human Asp Road Rail Transp. 2012; 550–59.
56.Yulong P, Huizhi X. The control mechanism of lane changing in jam condition. In: 2006 6th world congress on intelligent control and automation. 2006.
57.Wang M, et al. Game theoretic approach for predictive lane-changing and car-following control. Transp Res Part C Emerg Technol. 2015;58:73–92. [Google Scholar]
58.Joyce J. Bayes’ theorem. 2021 [cited 2024 2024-02-22]; Available from: https://plato.stanford.edu/archives/fall2021/entries/bayes-theorem/.
59.Bokare PS, Maurya AK. Acceleration-deceleration behaviour of various vehicle types. Transp Res Procedia. 2017;25:4733–49. [Google Scholar]
60.Finnegan P, Green P. Time to change lanes: a literature review. 1990.
61.Salvucci DD, Liu A. The time course of a lane change: driver control and eye-movement behavior. Transp Res Part F Traffic Psychol Behav. 2002;5(2):123–32. [Google Scholar]
62.AASHTO. A policy on geometric design of highways and streets, 6th edition. Washington.: American Association of State Highway and Transportation Officials; 2011. [Google Scholar]
63.Tarver E. First mover: What it means, examples, and first mover advantages. [cited 2024 15 September]; Available from: https://www.investopedia.com/terms/f/firstmover.asp.
64.Heifetz A. Commitment. In: Heifetz A, Yalon-Fortus J, editors. Game theory: interactive strategies in economics and management. Cambridge: Cambridge University Press; 2012. p. 333–52. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The model source code and the generated data used to support the findings of this paper are available from the University of Leeds at 10.5518/1608.

[CR1] 1.Siebinga O, Zgonnikov A, Abbink DA. Modelling communication-enabled traffic interactions. R Soc Open Sci. 2023;10(5):230537. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR2] 2.Bitar I, Crusat AS, Watling DP. Modelling implicit and explicit communication between road users from a non-cooperative game-theoretic perspective: an exploratory study. In: Proceedings of the 10th international conference on vehicle technology and intelligent transport systems—VEHITS. 2024, SciTePress: Angers, France. pp 456–64.

[CR3] 3.Osborne MJ. An introduction to game theory. New York: Oxford University Press; 2003. p. 552. [Google Scholar]

[CR4] 4.Saifuzzaman M, Zheng Z. Incorporating human-factors in car-following models: a review of recent developments and research needs. Transp Res Part C Emerg Technol. 2014;48:379–403. [Google Scholar]

[CR5] 5.Zhang T et al. Car-following models: a multidisciplinary review. arXiv, 2023.

[CR6] 6.Rahman M, et al. Review of microscopic lane-changing models and future research opportunities. IEEE Transp Intell Trans Syst. 2013;14(4):1942–56. [Google Scholar]

[CR7] 7.Elvik R. A review of game-theoretic models of road user behaviour. Accid Anal Prev. 2014;62:388–96. [DOI] [PubMed] [Google Scholar]

[CR8] 8.Ji A, Levinson D. A review of game theory models of lane changing. Transp Res A Transp Sci. 2020;16(3):1628–47. [Google Scholar]

[CR9] 9.Gipps PG. A model for the structure of lane-changing decisions. Transp Res Part B Methodol. 1986;20(5):403–14. [Google Scholar]

[CR10] 10.Hidas P. Modelling vehicle interactions in microscopic simulation of merging and weaving. Transp Res Part C Emerg Technol. 2005;13(1):37–62. [Google Scholar]

[CR11] 11.Ahmed K et al. Models of freeway lane changing and gap acceptance behavior. In: Proceedings of the 13th International Symposium on Transportation and Traffic Theory. 1996. Lyon, France.

[CR12] 12.Kesting A, Treiber M, Helbing D. General lane-changing model MOBIL for car-following models. Transp Res Rec. 2007;1999(1):86–94. [Google Scholar]

[CR13] 13.Yu H, Tseng HE, Langari R. A human-like game theory-based controller for automatic lane changing. Transp Res Part C Emerg Technol. 2018;88:140–58. [Google Scholar]

[CR14] 14.Kita H. A merging–giveway interaction model of cars in a merging section: a game theoretic analysis. Transp Res Part A Policy Pract. 1999;33(3):305–12. 10.1016/S0965-8564(98)00039-1. [Google Scholar]

[CR15] 15.Liu et al. A game theoretical approach for modeling merging and yielding behavior at freeway on-ramp section. In: Proceedings of the 17th international symposium on transportation and traffic theory. 2007.

[CR16] 16.Fisac J et al. Hierarchical Game-Theoretic Planning for Autonomous Vehicles In: 2019 International Conference on Robotics and Automation, ICRA 2019. 2018, Institute of Electrical and Electronics Engineers Inc.: Montreal. pp 9590–9596.

[CR17] 17.Li N et al. Hierarchical reasoning game theory based approach for evaluation and testing of autonomous vehicle control systems. In: 2016 IEEE 55th conference on decision and control (CDC). 2016.

[CR18] 18.Axelrod R, Hamilton WD. The evolution of cooperation. Science. 1981;211(4489):1390. [DOI] [PubMed] [Google Scholar]

[CR19] 19.Kang K, Rakha HA. A repeated game freeway lane changing model. Sensors. 2020;20(6):1554. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR20] 20.Meng F et al. Dynamic decision making in lane change: game theory with receding horizon. In: 2016 UKACC 11th international conference on control (CONTROL). 2016.

[CR21] 21.Iwamura Y, Tanimoto J. Complex traffic flow that allows as well as hampers lane-changing intrinsically contains social-dilemma structures. J Stat Mech Theory Exp. 2018;2018:023408. [Google Scholar]

[CR22] 22.Bitar I, Watling D, Romano R. Sensitivity analysis of the spatial parameters in modelling the evolutionary interaction between autonomous vehicles and other road users. SN Comput Sci. 2023;4(4):336. [Google Scholar]

[CR23] 23.Sent E-M. Rationality and bounded rationality: you can’t have one without the other. Eur J History Econ Thought. 2018;25(6):1370–86. [Google Scholar]

[CR24] 24.Talebpour A, Mahmassani HS, Hamdar SH. Modeling lane-changing behavior in a connected environment: a game theory approach. Transp Res Procedia. 2015;7:420–40. [Google Scholar]

[CR25] 25.Ali Y, et al. A game theory-based approach for modelling mandatory lane-changing behaviour in a connected environment. Transp Res Part C Emerg Technol. 2019;106:220–42. [Google Scholar]

[CR26] 26.Bendor J, Swistak P. Types of evolutionary stability and the problem of cooperation. Proc Natl Acad Sci U S A. 1995;92(8):3596–600. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR27] 27.Nowak MA. Five rules for the evolution of cooperation. Science. 2006;314(5805):1560–3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR28] 28.Altman E, et al. The evolution of transport protocols: an evolutionary game perspective. Comput Netw. 2009;53(10):1751–9. [Google Scholar]

[CR29] 29.Rubenstein DR, Kealey J. Cooperation, conflict, and the evolution of complex animal societies. Nat Educ Knowl. 2010; 78.

[CR30] 30.He J, et al. Spatial games and the maintenance of cooperation in an asymmetric Hawk-Dove game. Chin Sci Bull. 2013;58(18):2248–54. [Google Scholar]

[CR31] 31.Stewart AJ, Plotkin JB. From extortion to generosity, evolution in the Iterated Prisoner’s Dilemma. Proc Natl Acad Sci U S A. 2013;110(38):15348–53. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR32] 32.Fernández Domingos E, et al. Emerging cooperation in N-person iterated prisoner’s dilemma over dynamic complex networks. Comput Inform. 2017;36:493–516. [Google Scholar]

[CR33] 33.Gilles RP, Mallozzi L, Messalli R. Emergent collaboration in social purpose games. arXiv [cs.GT], 2021.

[CR34] 34.Orzan N et al. Emergent cooperation and deception in public good games. In: 2023 adaptive and learning agents workshop at AAMAS. 2023.

[CR35] 35.Harris CM. Autonomous vehicle decision-making: should we be bio-inspired? in towards autonomous robotic systems. 2017. Cham: Springer International Publishing.

[CR36] 36.Millard-Ball A. Pedestrians, autonomous vehicles, and cities. J Plann Educ Res. 2018;38(1):6–12. [Google Scholar]

[CR37] 37.Sun H, Ge Y, Qu W. Greater prosociality toward other human drivers than autonomous vehicles: Human drivers’ discriminatory behavior in mixed traffic. Accid Anal Prev. 2024;203:107623. 10.1016/j.aap.2024.107623. [DOI] [PubMed] [Google Scholar]

[CR38] 38.Bitar I, Watling D, Romano R. How can autonomous road vehicles coexist with human-driven vehicles? An evolutionary-game-theoretic perspective. In: Proceedings of the 8th international conference on vehicle technology and intelligent transport systems-VEHITS. 2022, SciTePress. pp 376–83.

[CR39] 39.Dey D, Terken J. Pedestrian interaction with vehicles: roles of explicit and implicit communication. In: Proceedings of the 9th international conference on automotive user interfaces and interactive vehicular applications. 2017, Association for Computing Machinery: Oldenburg, Germany. pp 109–13.

[CR40] 40.Harkin AM, Harkin KA, Petzoldt T. What to rely on—implicit communication between pedestrians and turning automated vehicles. Transp Res Part F Traffic Psychol Behav. 2023;98:297–317. [Google Scholar]

[CR41] 41.Lee YM, et al. Road users rarely use explicit communication when interacting in today’s traffic: implications for automated vehicles. Cogn Technol Work. 2021;23(2):367–80. [Google Scholar]

[CR42] 42.Lee YM, Sheppard E. The effect of motion and signalling on drivers’ ability to predict intentions of other road users. Accid Anal Prev. 2016;95:202–8. [DOI] [PubMed] [Google Scholar]

[CR43] 43.Durlauf SN, Blume LE. Cheap Talk. In: Durlauf SN, Blume LE, editors. Game theory. London: Palgrave Macmillan UK; 2010. p. 38–47. [Google Scholar]

[CR44] 44.Parikh P. The use of language. Stanford: CSLI Publications; 2001. [Google Scholar]

[CR45] 45.Allott N. Game theory and communication. In: Benz A, Jäger G, van Rooij R, editors. Game theory and pragmatics. London: Palgrave Macmillan UK; 2006. p. 123–52. [Google Scholar]

[CR46] 46.Brams SJ. Deception in 2 × 2 games. J Peace Sci. 1977;2(2):171–203. [Google Scholar]

[CR47] 47.Tao Z, Zhu Q. A game-theoretic foundation of deception: knowledge acquisition and fundamental limits. ArXiv, 2018.

[CR48] 48.Sarkadi Ş, et al. The evolution of deception. R Soc Open Sci. 2021;8(9):201032. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR49] 49.Zagare FC. The Geneva Conference of 1954: a case of tacit deception. Int Stud Q. 1979;23(3):390–411. [Google Scholar]

[CR50] 50.Fallis D, Lewis PJ. Animal deception and the content of signals. Stud Hist Philos Sci. 2021;87:114–24. [DOI] [PubMed] [Google Scholar]

[CR51] 51.Adams ES, Caldwell RL. Deceptive communication in asymmetric fights of the stomatopod crustacean Gonodactylus bredini. Anim Behav. 1990;39(4):706–16. [Google Scholar]

[CR52] 52.Ferguson-Walter K et al. Game theory for adaptive defensive cyber deception. In: Proceedings of the 6th annual symposium on hot topics in the science of security. 2019, Association for Computing Machinery: Nashville, Tennessee, USA. p. Article 4.

[CR53] 53.Carroll TE, Grosu D. A game theoretic investigation of deception in network security. In: 2009 proceedings of 18th international conference on computer communications and networks. 2009.

[CR54] 54.Jin PJ, et al. Bidirectional control characteristics of General Motors and optimal velocity car-following models: implications for coordinated driving in a connected vehicle environment. Transp Res Rec. 2013;2381(1):110–9. [Google Scholar]

[CR55] 55.Bevrani K, Chung E. A safety adapted car following model for traffic safety studies. Adv Human Asp Road Rail Transp. 2012; 550–59.

[CR56] 56.Yulong P, Huizhi X. The control mechanism of lane changing in jam condition. In: 2006 6th world congress on intelligent control and automation. 2006.

[CR57] 57.Wang M, et al. Game theoretic approach for predictive lane-changing and car-following control. Transp Res Part C Emerg Technol. 2015;58:73–92. [Google Scholar]

[CR58] 58.Joyce J. Bayes’ theorem. 2021 [cited 2024 2024-02-22]; Available from: https://plato.stanford.edu/archives/fall2021/entries/bayes-theorem/.

[CR59] 59.Bokare PS, Maurya AK. Acceleration-deceleration behaviour of various vehicle types. Transp Res Procedia. 2017;25:4733–49. [Google Scholar]

[CR60] 60.Finnegan P, Green P. Time to change lanes: a literature review. 1990.

[CR61] 61.Salvucci DD, Liu A. The time course of a lane change: driver control and eye-movement behavior. Transp Res Part F Traffic Psychol Behav. 2002;5(2):123–32. [Google Scholar]

[CR62] 62.AASHTO. A policy on geometric design of highways and streets, 6th edition. Washington.: American Association of State Highway and Transportation Officials; 2011. [Google Scholar]

[CR63] 63.Tarver E. First mover: What it means, examples, and first mover advantages. [cited 2024 15 September]; Available from: https://www.investopedia.com/terms/f/firstmover.asp.

[CR64] 64.Heifetz A. Commitment. In: Heifetz A, Yalon-Fortus J, editors. Game theory: interactive strategies in economics and management. Cambridge: Cambridge University Press; 2012. p. 333–52. [Google Scholar]

PERMALINK

To Signal or Not to Signal? A Non-cooperative Game-Theoretic Approach to Discretionary Communication Between Road Users

Isam Bitar

Albert Solernou Crusat

Richard Romano

David Watling

Abstract

Introduction

Literature Review

Method

Fig. 1.

The Game-Theoretic Interaction Model

Fig. 2.

The Kinematic Model

Table 2.

Payoff Functions

Ride Comfort

Time Headway

Speed Difference

Time Penalty

Decision Making

Table 1.

Communication

Stage 0: Initial Communication from Vehicle to Vehicle

Stage 1: Discretionary Communication from Vehicle to Vehicle

Stage 2: Further Communication from Vehicle to Vehicle

Updating Beliefs: Bayesian Inference

Experimental Design

Table 3.

Table 4.

Table 5.

Fig. 3.

Hardware and software requirement

Results

Table 6.

Ruleset 1 (Transparent)

Ruleset 2 (Blind)

Discussion

Comparison with the Results of the Original Paper [2]

Effect on Interaction Advantage (Payoffs)

Effect on Interaction Safety

Effect on Interaction Efficiency (Non-ideal Outcomes)

Notable Observations

On Bayesian Statistics and Bounded Rationality

Conclusion

Funding

Data Availability

Declarations

Conflict of interest

Footnotes

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases