1. INTRODUCTION
A truel is a duel-like competition between three players, in which the players fight for survival or for a prize by targeting (shooting at) opponents. Only one opponent can be targeted at a time. Originally, the sequential truel appeared as a brain teaser and has been part of many collections of mathematical puzzles (Phillips, 1937; Kinnaird, 1946; Gardner, 1966; Mosteller, 1987). The problem statement can be written as follows. Each of the three players is endowed with marksmanship  the probability of eliminating the opponent with a shot. Three players are different, with marksmanships of , , and  (or , , and  ). Whom should they target? Who wins? These problems have a paradoxical answer  the weakest player has the highest chance of surviving while the strongest player (a sure shot) has the lowest survival odds. There is also an extra layer to the problem  the weakest player might do even better by intentionally abstaining, e.g., by shooting in the ground or in the air.
M. Shubik (Shubik, 1964, 4346) coined the term truel and described it as an example of a game in which the pursuit of individual goals leads to a paradoxical result. D. Kilgour (Kilgour, 1975) initiated a game-theoretic analysis of sequential truels as infinitely repeated games with and without abstentions. He considered undominated Nash equilbria in stationary strategies and showed that if the players marksmanships are different from each other, then the equilibrium is essentially unique, and for many values of the parameters the weakest player has the strongest survival odds. This happens because for two stronger opponents it is optimal to target each other, while the weakest player can simply stay aside and wait until one of the stronger opponents is eliminated. Many variations of truels have been considered: sequential truels with arbitrary duel values (Kilgour, 1978), simultaneous truels (Kilgour, 1972), truels with random order of shooting, finitely many rounds or bullets, possibilities for cooperation, target-undetectable or silent truels (Kilgour, Brams, 1997; Bossert, Brams, Kilgour, 2002). P. Amengual and R. Toral (Amengual, Toral, 2006) analyse different versions of the sequential game, and analyse them as Markov chains with three absorbing states (i.e. the possible victories of each of the players). A qualitatively robust prediction appears to follow from existing results: whenever there is a fight, the two strongest opponents shoot at each other, and, unless the marksmanships are dramatically different, the weakest player has the highest survival odds.
R. Shubik (Shubik, 1954) used a three-person duel to model ‘what sort of individual is best suited to survive, when every man acts for himself and by himself’. He pointed out immediate applications of truels to the analysis of elections, political competition, and multi-country interaction, arguing that a weaker country can benefit if it gets itself in the middle of a non-cooperative conflict between two stronger countries. Truels and  -els were used for modeling decisions in intense conflict situations (Cole, Phillips, Hartman, 1977), evolutionary biology to explain persistent extensive variation in competitive skills (Archetti, 2012), dynamic targeted competition, where actions can be directed at a particular opponent (Dubovik, Parakhonyak, 2014), negative advertising, especially in political campaigning (Skaperdas, Grofman, 1995; Richman, 2020). (Toral, Amengual, 2005) consider a variant of a truel  a game of opinion making or dynamic persuasion, in which players try to convince the others in their opinion on some matter and so all players are active until the consensus is reached. In all these applications the central feature is the advantage of the weakest player. Recently, in the article M. Wegener and E. Mutlu (Wegener, Mutlu, 2021) offered a model of network formation and evolution, in which players of three possible types (marksmanships) engage in truels with their neighbors. The authors assume that players target the strongest among the remaining opponents but not of the similar type, and show that the evolutionary pressures over network actually lead to strongest types surviving more often.
In this paper we challenge the existing findings regarding the survival of the weakest. We present a novel subgame-perfect equilibrium construction for a sequential truel in which the strongest player has the largest survival odds. We depart from the existing literature in three key elements: 1) we allow players to use non-stationary (and non-Markovian) strategies; 2) we consider mixed strategies; 3) we explicitly define the conditions and outcomes for a ‘peace’ scenario  the game ends if all the three players abstain in a row. This equilibrium exists for a specific order of play, in which the two stronger opponents act before the weakest one. When it exists, there are multiple subgame-perfect equilibria including the existing stationary construction in which two stronger opponents target each other.
In equilbrium, the strongest opponent, player  abstains if the middle-skilled player  has not targeted  previously; player  abstains if the weaker player  targeted  in the previous round, and randomizes between abstaining and targeting , if  shot at . If both stronger players abstain, the weaker player  has to target someone and randomizes between the opponents. The player  ’s incentives to target  are provided by the expectation that  then targets  with some positive probability, in which case  ends up in a duel shooting first. In turn, randomization by  is needed to provide incentives for  to abstain. In essence, the two stronger players tacitly collude to force  to target someone. The strongest player benefits when  targets , which can happen in a subgame-perfect non-stationary equilibrium!
2. SEQUENTIAL TRUEL
There are three players competing for being the sole survivor (winner) in a sequential duel-like game. At the beginning of the truel all players are alive. At each turn, a player whose turn is to shoot can target any of the remaining alive opponents or abstain from shooting (delope, e.g. shoot at the air or shoot at the ground). A targeted player is eliminated with probability equal to the marksmanship of the shooter. All the players observe who was the intended target and what is the outcome of the shot. The order of shooting is chosen in advance, and once and for all. No player can have two turns while some other (alive) player had none. Eliminated players do not shoot. The truel ends when all but one player was eliminated or when all alive players abstained from shooting one after another. The single survivor obtains the ‘survivor’ prize, the payoff to which is normalized to 1, while the payoff of all the other players is 0. In the case the truel ends due to ‘inactivity’, all the players receive the payoff of 0. The solution concept is subgame perfect equilibrium (SPE).
In our game-theoretic analysis we consider not only pure strategies for players that specify a particular target (or abstaining) after any history that can arise but also mixed strategies, where each player can randomize over its pure strategies and, in particular, randomly choose its target (or abstain) with some probabilities when called to act. The payoff from a mixed strategy is an expectation of the profits from pure strategies[1].
We refer to the players by their marksmanship, , , , and suppose that ,  Let  and  be the probabilities of surviving a shot from a respective player. Denote by  the order (sequence) of shooting. For instance, if , then player  shoots first, then player  shoots (if alive), then player , then again player  and so on. We write  with  0, 1 or 2 to highlight the number of successive abstentions prior to the first player in  shooting. Finally, let  and  be, respectively, the lowest and the highest SPE payoffs to player  if the sequence of shooting is seq. For instance,  is the minimal SPE payoff of player  for the sequence  for the subgame following one abstention (by player 
Some of our parametric assumptions, such as , unequal marksmanships, and 0 payoffs to the ‘peace’ outcome, are made to simplify the exposition. For example, all results (with slight modification in statements) are true for  in small neighbourhood of 1, but will require more technical details.
Some other parametric assumptions, such as three abstentions mean ‘peace’ are needed to complete specification of the game, which in turn allows for analysis of mixed-strategy equilibria. It would be interesting to know, how equilibria are affected when the number of abstentions that terminate the game or the ‘peace’ payoff vary, but this is left for future analysis.
There are six possible sequences of players, which can be divided in two groups: either  acts immediately after  or vise versa. Since our main goal is to present an explicit construction of equilibria in which the strongest player obtains high payoffs we will concentrate on the first case. Once we present the new equilibrium for sequence  in Section 6, as a corollary, we obtain a similar construction for sequence , in which  acts immediately after . Since we need to stimulate player  to shoot in our equilibrium, it is crucial that  must be the last player to shoot.
Lemma 1. The minimal and maximal SPE payoffs of players  and  satisfy the following inequalities 
   
(Dots here denote any of the appropriate order of shooting.)
Proof. The minimal SPE payoff of player  (or  ) cannot be lower than what the player  (or  ) can obtain by targeting  and, if successful, ending up in a duel shooting second. Clearly, there would be no abstentions in any duel. Similarly, the maximal possible payoff a player can get cannot be higher than that from a duel against the weakest of the opponents shooting first.■
3. PLAYER γ STRATEGIES 
Lemma 2. In any SPE player  does not target  if  is alive. Player  ’s minimal SPE payoff for any sequence of moves in which  shoots first is bounded from below: 
Proof. If player  shoots at another player, the target is eliminated and  ends up in a duel with the remaining player where that player shoots first. A duel with the weakest opponent is preferred. Thus, player  can guarantee payoff of  by targeting .■
Can  obtain a higher payoff than  in equilibrium when  is the first to act? For this to happen,  has to shoot in the air and one of the opponents has to target the other one with positive probability in the future, so that  reaches a duel shooting first. Otherwise, if players  and  target  or abstain each time they act, player  has to target someone to end up in a duel (shooting second) to obtain positive payoff, which is at most .
4. PLAYER β STRATEGIES 
So, from here on we consider sequences of players in which  shoots immediately after 
Lemma 3. For all sequences of players in which player  shoots immediately after , player  never targets . 
Proof. Suppose not, and that there exists an SPE in which  targets . Consider a subgame where this happens. If  eliminates , then  eliminates  in the next round with probability 1. Therefore,  has to expect a substantial continuation payoff  following a miss:  or, from Lemma 1 
                                                               (1)
Let us focus on  and the equilibrium in which  obtains . As follows from Lemma 2, player  has to abstain with positive probability for  to get a positive payoff as otherwise  eliminates . Player  then receives at least , while  who moves next obtains at least . Combined, the sum of payoffs of all three players has to be at least  From Lemma 1 and inequality (1), this sum exceeds 
since   that is a contradiction.
Thus, we have shown that for considered sequences of players in any SPE neither  nor  target . This is consistent with previous findings (Shubik, 1954; Kilgour, 1975; Toral, Amengual, 2005), yet here we have derived it without assumptions of stationarity, pure strategies, or necessity of targeting someone.
5. PLAYER α STRATEGIES 
We explore first, what player  can get by abstaining.
Lemma 4. For any SPE, for any subgame in which player  has a chance to abstain without ending the game, we have (here  or 1) 
In addition, for sequences  and  player  targets , while for sequence  player  targets . 
Proof. Suppose player  abstains. If  also abstains, then  targets  (who obtains payoff  ), while  ends up in the duel with  shooting first, getting payoff . Thus,  will not abstain, and target  instead.
If  eliminates , then  obtains . If  misses and  targets , then  obtains payoff . Finally, if  misses and  abstains, we come to the subgame in which  can again abstain, and, as above, either guarantee herself at least  or reach a subgame in which she can abstain, and so on. In expectation over all possible outcomes, player  obtains at least  in any SPE whenever she (he) can abstain.
Player  does not abstain if  can abstain next round, and thus  targets  for sequence . Indeed, since  can abstain and guarantee herself payoff , by targeting , player  obtains . If  abstains, then  obtains a positive payoff, in this case, what remains to  is strictly smaller than .
In turn, if player  knows that  is able to abstain, she will expect that  abstains and then  targets . So, player  should target  and not abstain in this case.■
Thus, having an option to abstain allows player  to guarantee herself a decent payoff as the other players target each other. Will  actually abstain in equilibrium? Suppose that  and  target each other also when  targets one of them (as in (Kilgour, 1975)). Then, for  it is better to target , since the duel with the weaker of the two opponents is preferred. By targeting , player  obtains payoff  and by abstaining she gets 
Therefore, targeting  is better for , if 
; ; ; 
; 
We have shown the following lemma.
Lemma 5. A subgame perfect equilibrium in which neither  nor  abstain exists. In this equilibrium, player  targets , player  targets , and player  targets  if  and abstains if . 
It is trivial to check that the described strategies indeed form an SPE. Note that if  or , player  abstains no matter what  is.
6. NOVEL SPE CONSTRUCTION: SURVIVAL OF THE STRONGEST
Can it happen in equilibrium that players  or  or both abstain? Consider sequence  To abstain, player  should expect that  will also abstain with high probability in the next round and that  later (not necessarily immediately in round 3) will target , hoping for a duel with  in which  shoots first. In turn, for player  to abstain, player  should expect that  will be shooting at  with positive probability, as otherwise  ’s expected payoff is strictly below . For both  and  to abstain, player  has to target someone!
Theorem 1 (main result). For sequence  and , there exists a mixed SPE in which players  and  abstain with positive probabilities, while player  mixes between the opponents. The equilibrium utilities  of the players (  ) are given by the following formulas: 
where 
Proof. Consider the following profile of strategies  a candidate for the equilibrium. Player  abstains for  (following an abstention by  if it is the only one in a row) and targets  in any other subgame. Player  abstains in round 1 and for any subgame in which  targeted and missed  in the previous round. If  targeted and missed , then  mixes between abstaining and targeting . (This will provide incentives for  to target .) For all other subgames, player  targets . Finally, player  mixes shooting at  and  for  and abstains in all other subgames.
For the transition matrix between subgames for the proposed profile of strategies see Figure. Each decision node is marked by the acting player with the number of previous abstentions in the brackets. Only the transitions following misses are shown. Horizontal arrows point to the intended target,  stands for abstention. In figure 1  and  are the probabilities with which players  and  target  in their mixed strategies.
 
Figure. Transitions in equilibrium
 
For the suggested profile to be an SPE, the following set of conditions has to be satisfied. Here,  is the equilibrium utility of player , , and . By  we denote the utility of player  using pure strategy shooting . 
Probability  can be found from the condition for player  to be indifferent between targeting  and abstaining:
 
Similarly, from the indifference condition for player  we can find : 
;
; ;
.
Finally, we need to check that  prefers abstaining from targeting . From the expression for  above we can compute 
We want to show that , when . We have 
If , then , , and , . Therefore, the difference of products in the above expression is positive. ■
The presented equilibrium construction clearly favors the strongest player as compared to the equilibrium in Lemma 5. In this equilibrium player  had to rely on both  and  missing their shots to survive. In the new construction, the same scenario following  targeting  occurs as well, but with probability less than one, while in a complementary event of  targeting  and making a shot,  wins the prize. The probability of this complimentary event is maximized in this equilibrium.
For a specific example, consider  , . Then ,  and players payoffs are  In the equilibrium from lemma 5, 
Corollary 2. If  and the sequence of players is , then there exists a continuum of subgame perfect equilibria in which player  earns the same expected payoff as in the equilibrium in Theorem 1, and payoffs of players  and  vary:  takes any value from  to the value from the Theorem 1. 
Proof. Consider the equilibrium construction in the Theorem. Player  in the first round can instead shoot at  with some probability . The higher is  the higher is  (and lower  ). For  we obtain the payoffs as in Lemma 5 with . ■
Corollary 3. For sequence  and sufficiently high , there exists an SPE with players  and  abstaining (with probability 1), and  earning the highest expected payoff. 
Proof. The equilibrium construction here is similar to the one in Theorem 1 with one correction to account for  shooting immediately after  and not before. Player  is going to abstain if  did not abstain and  did not target  previously. Player  is going to mix targets, hoping that  will target  with positive probability. Differently from above, player  is going to abstain with probability  whenever  abstained in the previous round and  did not target  in the round before. If  targets ,  survives,  abstains, then  is going to randomize between abstaining and shooting at . Similarly to Theorem 1, this will provide incentives for  to shoot at  ■
Condition  in Theorem 1 is a simple sufficient condition for the equilibrium we present. The exact condition on  and  jointly is for the expression at the end of the proof of Theorem 1 to be positive to guarantee that . If , when shooting first,  can get payoff  (simply shooting  ) and therefore also get highest payoff among the players.
Clearly our equilibrium construction also extends for values of  less than but close to 1 (while . When  is close to 1, the payoffs of all the players are similar to those in Theorem 1, and the equilibrium is supported by probabilities  and , that make players  and  to randomize their actions. Certainly,  and  should be adjusted depending on , but as long as they are within  the equilibrium construction works. 
With our assumptions of three abstentions leading to “peace” outcome and  similar mixed strategy equilibria do not exist for other sequences of play, in particular, when  is the first player to act and there are no previous abstentions. If we consider a different assumption on abstentions, for instance, if the “peace” realizes once some player abstains for the second time following abstentions by every player, then the presented equilibrium construction exists for all sequences of play. Player  randomizes over  and , and if abstains, the other two players abstain as well, forcing player  to pick a target.
Corollary 4. Suppose that there is no limit on how many abstentions lead to the “peace” outcome. “Peace” is resched only if all three players abstain forever. Then, for any sequences of players and sufficiently high , there exists an SPE with players  and  abstaining (with probability 1), and  earning the highest expected payoff. 
Proof. The above constructions can be amended by specifying that both players  and  necessarily abstain if player  abstains. Clearly then, abstaining forever is not optimal for player  ■
7. CONCLUSION
Truels and  -els are fascinating conflicts with numerous applications that bring in new insights with each new wave of attention to them. The theory of truelts started with the survival of the weakest observation which was and continues to be the main driver of interest in truels. It continued with abstentions as a strategic choice which the weakest player is likely to make. An abstention is a practically feasible strategy so it has to be modeled formally. This eventually lead to a cooperative element in the analysis (Bossert, Brams, Kilgour, 2002)  players can potentially cooperate even if they are fighting for survival and only one survives. Our novel equilibrium construction can also be interpreted like that: two strongest players cooperate to force the weakest player to shoot hoping that one of them survives and be the first to shoot in the ensuing duel. In this paper we show: it is not necessary that the stronger opponents target each other in equilibrium and that other types of SPE equilibria exist, with the strongest player having largest odds of surviving and in which multiple people abstain from shooting with positive probability.
Using mixed strategies is another important element of our construction. This allows to break the curse of the strongest player, viz. being everyone’s target. In our construction, using a mixed strategy by player  provides the incentive for the strongest player to abstain from shooting, while using a mixed strategy by player  conditional on some histories of playing provides the incentive for  to mix. It is easy to overlook such possibilities if the strategies are assumed to be stationary.
Many questions remain for further research. In particular, it would be valuable to characterize the whole set of achievable subgame perfect equilibrium payoffs for different values of parameters for truels. Are there any other kinds of equilibria in  -els with  players?
 
 
[1] For example, consider two possible strategies of the player : always target  (if not eliminated, otherwise target  ) and always target  (if not eliminated, otherwise target  ). Fix some strategies of the players  and . Payoffs in the game (expected probability of being a sole survivor) are well defined once all the strategies are specified, and let  and  be the payoffs to the player  from the two considered pure strategies. Suppose that the player  considers a mixed strategy: flip a coin and if heads target  always and if tails target  always. Then the player  ’s payoff from this strategy is simply . Of course, more complex mixed strategies can be considered, e.g., those that implement randomization at any time a player acts. In the example, the player  can flip a coin between two players (if available) any time it has a shot. In equilibrium, a player’s mixed strategy has to give the maximal and identical payoff to the pure strategies it randomizes over.