Suplemento
Pigeons’ choice between shared and unshared feeding sites in game situations
Elección de palomas entre fuentes de Alimento compartidas y no en situaciones de juego compartidas
Pigeons’ choice between shared and unshared feeding sites in game situations
Revista Mexicana de Análisis de la Conducta, vol. 45, núm. 2, pp. 434-453, 2019
Sociedad Mexicana de Análisis de la Conducta
Recepción: 15 Febrero 2019
Aprobación: 18 Octubre 2019
Financiamiento
Fuente: JSPS KAKENHI
Nº de contrato: 21530764
Financiamiento
Fuente: Osaka City University (OCU) Strategic Research Grant 2019
Abstract: Cooperative behavior in nonhuman animals has been studied within the framework of game theory, typically by using the prisoner’s dilemma game. Previous studies on cooperation by pigeons using this game have revealed that, under these conditions, the animals did not learn the tit-for-tat strategy played by their opponents. In many cases, animals fail to choose cooperation and in so doing do not maximize their gains. The present experiment examined pigeons’ cooperative choices in the prisoner’s dilemma game situation by using a different type of apparatus than that used in previous studies: Subjects moved to choose one of two feeding sites, one of which was shared by another, stooge, pigeon whose choices were controlled by a computer and the other of which was not shared by other pigeons. In this choice situation, the presence of the stooge pigeon increased the subjects’ choices of the shared feeding site significantly. Further, the pigeons learned the other player’s choice strategy (tit-for-tat and random), showing that choice proportions for the shared feeding site were significantly higher in the tit-for-tat condition than in the random condition. These results suggest that the presence of a conspecific at the feeding site is a reinforcer for choosing it and that the choice situation constituted by the apparatus used in the present experiment could promote learning of the opponent’s choice strategy.
Keywords: cooperation, prisoner’s dilemma game, tit-for-tat, stooge, pigeons.
Resumen: El comportamiento cooperativo en animales no humanos se ha estudiado en el marco de la teoría de juegos, generalmente mediante el uso del dilema del prisionero. Estudios previos sobre la cooperación en palomas usando este procedimiento han revelado que, bajo estas condiciones, los animales no aprendieron la estrategia de tit for tat utilizada por sus oponentes. En muchos casos, los animales no eligieron cooperar y, al no hacerlo, no maximizaron sus ganancias. El presente experimento examinó las elecciones cooperativas de palomas en la situación del dilema del prisionero mediante el uso de un tipo diferente de aparato que el utilizado en estudios anteriores: los sujetos se movieron para elegir uno de los dos sitios de alimentación, uno de los cuales fue compartido con una paloma observadora cuyas opciones fueron controladas por una computadora y la otra no fue compartida por otras palomas. En esta situación de elección, la presencia de la paloma observadora aumentó significativamente las elecciones de los sujetos del sitio de alimentación compartido. Además, las palomas aprendieron la estrategia de elección del otro jugador (tit for tat y aleatorio), mostrando que las proporciones de elección para el sitio de alimentación compartido fueron significativamente mayores en la condición de tit for tat que en la condición aleatoria. Estos resultados sugieren que la presencia de otro organismo de la misma especie en el sitio de alimentación funciona como reforzador para elegirlo y que la situación de elección constituida por el aparato utilizado en el presente experimento podría promover el aprendizaje de la estrategia de elección del oponente.
Palabras clave: cooperación, dilema del prisionero, tit for tat, observador, palomas.
Introduction
In many research areas related to the social behavior of human and nonhuman animals, cooperative choices have been analyzed using a framework of game theory, which originated in economics. Of the many types of game structures, the prisoner’s dilemma (PD) game may be the most popular one that has been applied to animal choice in social situations (Clements & Stephens, 1995; Flood, Lendenmann, & Rapoport, 1983; Gardner, Corbin, Beltramo, & Nickell, 1984; Green, Price, & Hamburger, 1995; Stephens, McLinn, & Stevens, 2002). The present experiment examined factors that promote pigeons’ cooperative choices in the PD game, especially the effects of the presence of a stooge pigeon and the use of a new apparatus in which pigeons choose between sharing and not sharing feeding sites.
In the PD game, two players choose between cooperation and defection without knowing the opponent’s choices. The reinforcer amount given to each player is determined by both players’ choices. Figure 1A illustrates a payoff matrix showing the combination of the reinforcer amounts given to each player for different choices (Green et al., 1995) as labeled by Tucker (see Poundstone, 1993). When both players choose cooperation, each receives three units of the reinforcer (labeled Reward). When both players choose defection, each player receives one unit of the reinforcer (labeled Punishment, even though it does not always meet the functional definition of punishment in behavior analysis). When one player chooses cooperation and the other chooses defection, the player who chose cooperation receives zero units of the reinforcer (labeled Sucker) and the player who chose defection receives five units of the reinforcer (labeled Temptation). Any payoff matrix of the PD game should satisfy the following two inequalities: (1) Temptation > Reward > Punishment > Sucker and (2) Reward > (Temptation + Sucker) / 2 (Poundstone, 1993). The payoff matrix in Figure 1A satisfies these inequalities because 5 > 3 > 1 > 0 and 3 > (5+0)/2. At a group level, the combination of choices leading to the greatest amount of reinforcement is the choice of cooperation by both players. The rational choice at the individual level, however, is defection because this leads to greater amounts of reinforcement than choosing cooperation, regardless of the other player’s choices. By comparison, in the chicken game (Figure 1B), which is created by exchanging the reinforcer amounts between Sucker (0) and Punishment (1), mutual defection leads to the least amounts of reinforcement at both the individual and group level.

Payoff matrices of (A) the prisoner’s dilemma game and (B) chicken game. Left and right numbers in each cell represent reinforcer amount (number of food pellets) given to Players 1 and 2.
If, however, the choice trial is not one shot, but instead trials are repeated, the optimal choice strategy in the PD game for a player would depend on the strategies taken by the other player (opponent). Choosing defection is optimal when the opponent chooses cooperation and defection randomly (RND), whereas choosing cooperation (except for the final trial) is optimal when the opponent adopts a tit-for-tat (TFT) strategy (Axelrod & Hamilton, 1980). The TFT strategy is choosing cooperation in the first trial and, during the following trials, choosing the same alternative the opponent chose on the previous trial. It has been reported that, with human participants, preference for cooperation is greater when the opponent’s strategy is TFT than when it is RND (Baker & Rachlin, 2001; Silverstein, Cross, Brown, & Rachlin, 1998). It is difficult, however, for nonhuman animals to learn an opponent’s strategy, although not impossible (Baker & Rachlin, 2002; Sanabria, Baker, & Rachlin, 2003).
Green et al. (1995) examined the effects on pigeons’ choice strategies of payoff matrices, including the PD and chicken game, and choice strategies taken when a computer is the opponent (TFT and RND). The pigeon chose between two response keys, one red and the other green, with one associated with cooperation and the other with defection. After a choice, a blue or yellow cue lamp was lit to represent the opponent computer’s choice. After that, food pellets prescribed by the payoff matrices were presented to the subject. The payoff matrices of the PD and chicken game used in Green et al. (1995) were the same as in Figure 1A and B, respectively, except that the reinforcers were not presented to the opponent (computer). The pigeons strongly preferred defection in the PD game in both the TFT and RND conditions (Hall, 2003). In most cases, choice proportion for cooperation was about 0.1 in both the TFT and RND conditions; in the chicken game condition, pigeons’ choice proportion for cooperation increased to 0.5. These results indicate that the pigeons did not learn the TFT strategy that the computer adopted. They learned, however, the difference in the payoff matrices between the PD game and chicken game to increase their amounts of reinforcement. After the Green et al. (1995) study, it has been reported that short intertrial intervals (Baker & Rachlin, 2002) and presentation of stimuli that represent the subject’s previous choice (Sanabria et al., 2003) increase pigeons’ cooperative choices. However, in these latter two experiments, the reinforcer amounts of Sucker were greater than 0, a variable that also might promote cooperation.
Stephens et al. (2002) examined cooperative choice in blue jays in the PD game by using two adjoining operant chambers each of which housed a single bird. During each trial, birds moved to one of the perches to choose between cooperation and defection. After a choice was made, each bird could see its opponent through a transparent wall. One of the blue jays was a stooge that had been trained to choose between cooperation and defection indicated by a computer. They examined the effects of the TFT and All-D (choosing defection in all trials) strategies taken by the stooge. The choice proportion for cooperation in the TFT condition (about 0.4) was higher than that in the All-D condition (less than 0.1). Because they did not use a no-stooge condition, the effect of the stooge conspecific cannot be determined.
From the previous experiments on cooperative choice in nonhuman animals, it can be concluded that cooperation is rarely observed in the condition where the opponent player takes the TFT strategy, even though a subject can increase its reinforcer amount by choosing cooperation. That the level of cooperative choices was found to be higher by Stephens et al. (2002) than by Green et al. (1995) might be the result of the use of a stooge conspecific and the apparatus in which the subjects physically moved to choose between alternatives. The present experiment therefore examined pigeons’ cooperative choice in the PD game by using an apparatus in which subjects’ choice responses were indicated by walking to one of two alternatives in the presence of a stooge pigeon whose choice strategies were controlled by a computer. The alternatives also were changed from cooperation and defection to shared and unshared feeding site: At the shared feeding site, subjects received reinforcers simultaneously with the other, stooge, pigeon (although each pigeon had its own feeder), and at the unshared feeding site, subjects received reinforcers alone. Sharing can be thought as a kind of cooperative behavior (de Waal and Berger, 2000; Wilkinson, 1984). Using this apparatus, the effects of social context (the presence or absence of the stooge pigeon, selected by choosing the shared or unshared feeding site, respectively), opponent’s strategy (TFT and RND), and payoff matrices (PD game and chicken game) were examined.
Method
Subjects
Seventeen male pigeons (Columba livia) were used (P01-P17, see Table 1). They, and two other pigeons, used as stooges, were housed together in a flying cage (1.9 m high, 2.5 m long, and 2.0 m wide). The pigeons received food pellets (20 mg, Bio-Serv) during each experimental session, and, after the session, they individually received sufficient mixed grain in the flying cage to maintain their body weights at approximately 80 % of their free-feeding weights. When feeding, each pigeon was isolated in a small box to ensure that only that pigeon was fed. In the flying cage, grit and water were continuously available. The light/dark cycle of the flying cage was 12 h/12 h, and the light period started at 0700. The experiment was conducted during the light period.
Apparatus
Two experimental chambers as shown in Figure 2 were used. The apparatus had two shared feeding sites, each 32.5 cm high, 23.0 cm long, and 25.0 cm wide, connected to two compartments. Each array had a neutral zone (32.5 cm high, 25.0 cm long, and 25.0 cm wide) with an unshared feeding site the same size as the shared feeding sites at the opposite end of the compartment. Six other chambers around the shared feeding site (labeled A to F in Figure 2) were not used. All chamber walls were gray polyvinyl chloride, except the common wall between the two neutral zones, which was transparent acrylic (see Figure 2), as were the ceilings. The floors were stainless wire mesh. A pigeon placed in either array could walk from its neutral zone to either its shared or unshared feeding site. Each pigeon’s locomotion was detected by microswitches (V-154-1A5, OMRON) placed under the floor of each feeding site. A pigeon in one compartment could see the other pigeon in the neutral zone and into the aperture at the shared feeding site (explained below).

Figure 3 shows a layout of LEDs and the aperture on the front wall of each feeding site. At the shared feeding sites, each of which had an aperture (12.0 cm high and 4.4 cm wide, labeled “Aperture” in Figure 3) on the front wall (28.0 cm high and 6.0 cm wide), pigeons could receive food pellets delivered from pellet dispensers (H14-22M-2D, Coulbourn) located above the feeding site. There was a transparent acrylic wall under the two pellet dispensers so that food pellets prescribed by the payoff matrices were delivered to each pigeon in such a way that they could see the other pigeon collecting its pellets. Two feeder lamps (24-V dc) located within the aperture were lit when reinforcers were delivered. Both of the unshared feeding sites were equipped with the same pellet delivery system described for the shared sites.
On the wall containing the feeder-access aperture at both the shared and unshared feeding sites, a white LED (24-V dc) lamp (“Trial lamp” in Figure 3), located 25.0 cm above the floor and in the center of the right and left edges of the wall, was turned on to signal that a trial was in effect. Two other LEDs (24-V dc), each of which could be green and red, also were located on the front wall horizontally 22 cm above the floor, 4 cm apart. Only the right LED (“Feedback lamp” in Figure 3) was used during the experiment, to indicate the opponent’s choices. Events were controlled and recorded by a program written in Visual Basic 6.0 (Microsoft) and run on an IBM compatible computer (PCG-4D2N, SONY).

A schematic diagram of the choice procedure used in the condition where the color of the LED when the opponent chooses the shared feeding site is green. Black lamps and aperture indicate that they were not lit; otherwise, they were lit. “G” and “R” indicate that the LED was lit green and red, respectively.
Procedure
The experimental design was a 2 (social context: stooge and no-stooge conditions) by 2 (opponent’s strategy: TFT and RND conditions) by 2 (payoff matrix: PD game and chicken game conditions) factorial. All subjects were exposed to all (eight) conditions. The order of the conditions was counterbalanced across subjects and is shown in Table 1. Opponent choices were controlled by the computer independently of the behavior of the stooge pigeon. Before starting the experiment proper, pretraining sessions for both subject and stooge pigeons were conducted.
Pretraining for subject pigeons. For six of the 17 subject pigeons, shaping was not needed because they had already had experience in the same apparatus. For the remaining 11, the response of moving from the neutral zone to the two feeding sites was individually trained. At first, 10 food pellets were put in the apertures at both of the shared and unshared feeding sites; the pigeon could move between the two feeding sites to consume the pellets. After that, a pretraining session of 100 trials was conducted for each subject, where reinforcement was assigned to one of the two feeding sites in a random order. When the session started, the trial lamp was lit. After the subject moved to the feeding site to which the reinforcement was assigned, the trial lamp was extinguished and a food pellet was delivered immediately to the aperture at the chosen feeding site. The feeder lamps were lit for 3-s during the reinforcement. If the subject moved to the other feeding site, no food was delivered at that other site. The next trial started only when the subject returned to the neutral zone. Each subject pigeon was exposed to one such training session.
Pretraining for stooge pigeons. The two stooge pigeons also had prior experience in the apparatus, so shaping was not needed. They were trained individually to choose the feeding site illuminated by the right LED. Pretraining sessions of 100 trials each were continued until >90% of each stooge pigeon’s choices were for the site illuminated by the right LED.
General procedure. Daily sessions consisted of eight forced-choice trials followed by 60 free-choice trials. In the forced-choice trials, subjects’ choices were limited to the feeding site that the computer selected. Each subject was exposed to each of the four combinations of the payoffs determined by the payoff matrix condition twice in a random order.
Both forced- and free-choice trials consisted of an intertrial interval (ITI) (3 s), choice period (3 s), feedback period (2 s), and reinforcement period (15 s) as shown in the diagram in Figure 3. During the ITI, the subject had to stay in the neutral zone for 3 s. If they moved to either feeding site during the ITI, it was prolonged until the 3-s criterion was met.
A choice period started with the illumination of the trial lamps. The subject pigeon then had 3 s in which to choose one or the other of the feeding sites. The subject could move to either feeding site during the free-choice trials; however, during forced-choice trials, it had to move to the feeding site that the computer selected. An effective moving response ended the choice period and started the feedback period.
During the feedback period, the feedback lamp indicating the opponent’s choice (green or red; counterbalanced across the subjects, see Table 1) was lit for 2 s at the feeding site that the subject chose. After the feedback period, the reinforcement period started: the feeder lamps were lit for 15 s and the pellet dispenser was operated to deliver food pellets at the chosen feeding site. However, according to the payoff matrices, when the subject chose the shared and the opponent chose the unshared feeding site in the PD game condition and when both the subject and opponent chose the unshared feeding site in the chicken game condition, the aperture at the feeding site chosen by the subject was lit, but food pellets were not delivered. After the nominal reinforcement period, the feedback lamps and feeder lamps were extinguished and the next trial started.
Social context. In the stooge condition, the timing of the onset of each period of the trial, described above, was the same for both stooge and subject pigeon: Not only the subject pigeon, but also the stooge pigeon had to stay in the neutral zone for 3-s during the ITI; it had to move from the neutral zone to one of the feeding sites where the right wall lamp was lit within 3 s, determined by the opponent’s strategy condition in the choice period. When the stooge pigeon moved to the feeding site where the lamp was not lit, the trial lamp extinguished and the ITI of that same trial reset. During the reinforcement period, the feeder lamps were lit and the pellet feeder was operated at the feeding site where the stooge pigeon as well as the subject pigeon stood. Reinforcer amounts for the stooge pigeon were determined by the payoff matrix condition. The right-left position of the chambers that two pigeons were placed was fixed in the apparatus as shown in Figure 2.
The no-stooge condition was the same in all respects as the stooge condition, except for the absence of the stooge pigeon. Thus, in this condition, only the subjects chose between the feeding sites.
Opponent’s strategy. In the TFT condition, the computer as the opponent chose the shared feeding site on the first trial and, during subsequent trials, the computer’s choice mimicked the subject’s choice on the previous trial. In the RND condition, the opponent’s choices were random (p = .5).
Payoff matrix. The choice of the shared and unshared feeding sites corresponded to cooperation and defection, respectively, in the standard PD game. The payoff matrix of the PD game was the same as Green et al. (1995; see Figure 1A): Reward = 3 food pellets, Temptation = 5 food pellets, Punishment = 1 food pellet, and Sucker = 0 food pellets. The payoff matrix for the chicken game, also were the same as Green et al. (1995; see Figure 1B): Reward=3 food pellets, Temptation = 5 food pellets, Punishment = 0 food pellets and Sucker = 1 food pellet.
Stability criteria. Each condition was in effect for a minimum of 14 and a maximum of 25 sessions for each subject. After 14 sessions for each condition, the choice proportion for the shared feeding site obtained during the free-choice trials was calculated for each session. Choice proportions were considered stable if, for the last six sessions, means of the choice proportions over successive two-session blocks showed neither an increasing nor decreasing trend and if the difference between the highest and lowest mean was within 0.1. When the stability criteria were not met in 25 sessions, the condition was ended and the next condition was started. There were only two conditions in which the stability criteria were not satisfied within 25 sessions for P05. Table 1 shows three cases of 25 sessions (two cases for P05 and one case for P14). Of these cases, one for P14 satisfied the stability criteria on the 25 th session.

Color of feedback lamp when the opponent chose the shared feeding site, opponent's strategy, stooge condition, game type, number of sessions, and mean choice proportion for the shared feeding site for each subject.



Results
Table 1 shows mean choice proportions for the shared feeding site obtained from the last six sessions for each condition for each subject. Group mean choice proportions for the shared feeding site are shown in Figure 4. One-sample . tests were conducted after the arcsine transformation for each choice proportion data, to examine whether the choice proportions for the shared feeding site were different from 0.5 (indifference). Mean choice proportion was significantly higher than 0.5 in all of the stooge conditions (PD game/TFT condition: . [16] = 2.872, . < .05; PD game/RND condition: . [16] = 2.411, . < .05; Chicken game/TFT: . [16] = 7.895, . < .05; Chicken game/RND: . [16] = 3.684, . < .05). In the no-stooge conditions, choice proportion was significantly higher than 0.5 in the TFT conditions (PD game condition: . [16] = 2.228, . < .05; chicken condition: . [16] = 5.410, . < .05); however, in the no-stooge/RND conditions, choice proportion was significantly lower than 0.5 (PD game condition: . [16] = 2.629, . < .05) or was not significantly different from 0.5 (chicken game condition: . [16] = 0.773, ns). These results suggest that pigeons preferred cooperative choices under the conditions where they could see their opponent when choosing cooperation and/or when their opponent adopted the TFT strategy.
In Figure 4, mean choice proportion tended to be higher in the stooge than in the no-stooge condition, higher in the TFT than in the RND condition, and higher in the chicken game than in the PD game condition. A three-way ANOVA conducted after the arcsine transformation for each choice proportion showed that two main effects of the social context (. [1,16] = 10.70, . < .01) and opponent’s strategy . [1,16] = 17.22, . < .005) were statistically significant, whereas the main effect of the payoff matrix and interactions were not significant. That is, the presence of the other pigeon as an opponent and the TFT strategy that the opponent adopted increased the subject pigeons’ cooperative choices significantly. However, contrary to the results of Green et al. (1995), the pigeons’ cooperative choices were not different between the PD and chicken game conditions. The mean proportion of correct responses by the stooge pigeons was > 90 % across all conditions.

Group mean of choice proportion for the shared feeding site. Error bars represent standard deviations. A broken line shows the indifference point for choices.
Discussion
In this experiment, subject pigeons’ cooperative choices in two PD-related games increased when they chose between the shared and unshared feeding site by moving between them, when the stooge pigeon was present, and when the opponent adopted the TFT strategy. The effects of these factors have not been reported in the previous studies.
The presence of a conspecific strongly affected the choice of the feeding site. Unlike previous experiments using dyads (Flood et al., 1983; Gardner et al., 1984) and stooge animals (Stephens et al., 2002), in the present experiment, the subject pigeon could see its co-actor only when it chose the shared feeding site. The result that the choice proportion for the shared feeding site was significantly higher in the stooge than in the no-stooge condition suggests that the sight of a co-actor was a reinforcer for the subjects to choose the sharing feeding site. However, to clarify the reinforcement value of the sight of the co-actor, cooperative choices should be examined under the condition where the subject pigeons can see their co-actors when they choose the unshared feeding site in future research.
Social contact between the subject and stooge pigeons might have an effect on cooperative choices. Some experiments report a positive relation between social contact and cooperation between animals (Rutte & Taborsky, 2008; St-Pierre, Larose, & Dubois, 2009). In the present experiment, the subject and stooge pigeons lived in the same flying cage. It is possible that living in the same cage increased the reinforcer value of the sight of the stooge pigeons. Whether the extent of the social interaction with other pigeons would affect their reinforcer value also seems a suitable topic for future research.
Feral pigeons forage in flock (Giraldeau & Caraco, 2000; Giraldeau & Lefebvre, 1986, 1987) and such social foragers can be attracted by feeding conspecifics (Beauchamp, Bélisle, & Giraldeau, 1997). This suggests that the conspecifics at feeding sites could function as discriminative stimuli as well as reinforcers. In the present study, however, the stooge pigeon might not function as the discriminative stimulus because the subject and stooge pigeons chose between the feeding sites almost simultaneously; the subject pigeons could not choose the feeding sites after confirming the opponent’s choices.
In the present experiment, choice proportions for the shared feeding site (i.e., cooperative choice) were significantly higher in the TFT than RND condition. The effect of the opponent’s strategy has not been found in the previous studies where the same payoff matrices as in the present experiment were used (Hall, 2003; Green et al., 1995). From the results of the present experiment, we can conclude that the opponent’s strategy affected the subject pigeons’ choices independently of the social context because the ANOVA showed that the interactions were not significant. The differences in results between the present experiment and previous ones could be related to the different apparatuses used. In the present experiment, the pigeons chose between feeding sites by moving to one of them, not by pecking one of two response keys; furthermore, there were two places where reinforcers were delivered, which is different from the standard operant chamber with one aperture for the delivery of all reinforcers. These apparatus difference might affect pigeons’ discrimination between alternatives. Future research might attempt to isolate the contributions of the different variables (topography of responses and number of places for the reinforcer delivery) to the increases in cooperative responses.
Further, in the present experiment, the length of the ITI was relatively short (3-s) compared to that used in previous experiments (10-s). The short ITI could promote cooperative choices in the TFT condition in comparison with the RND condition. In the TFT condition, to increase the cooperative choices, pigeons have to learn that the opponent’s choice in the present trial should be same as their choices in the previous trial, that is, they have to learn the relation between trials (Baker & Rachlin, 2002; Green et al., 1995). Baker & Rachlin (2002) reported that pigeons’ cooperative choices in the TFT condition were significantly higher when a 0-s ITI was used than when the ITI was 18-s. Accordingly, the use of a short ITI could contribute to the relatively high proportion of the cooperative choices obtained in the present experiment.
In the present experiment, there was no significant effect of the payoff matrix, although Green et al. (1995) reported that pigeons showed more cooperative choices in the chicken game than in the PD game. This result also might be related to the present use of an apparatus where pigeons moved between feeding sites. In the study of the ideal-free distribution in pigeons (Baum & Kraft, 1998), pigeons choosing by moving between two feeding sites to which different amounts of reinforcement were assigned undermatched (Baum, 1974), that is, more pigeons visited the less efficient feeding site than the prediction derived from perfect matching (optimal distribution). If it is common that animals show undermatching in choosing between feeding sites that provide different amounts of reinforcement, pigeons in the chicken game condition in the present experiment might choose the unshared feeding site more than pigeons in the standard operant chamber. This possibility should be also examined in future research.
Lastly, as a factor promoting pigeons’ cooperative choices, we point out the resemblance between the choice situation used in the present experiment and that in the natural setting. Studies in behavioral ecology have reported that wild animals show reciprocal altruism (e.g., Krama, Krams, Mänd, & Igaune, 2009; Wilkinson, 1984). In the natural setting, animals typically choose among feeding sites by moving to one of them and see their conspecifics at the feeding sites where they share food with them. These factors (moving responses, sharing feeding sites, and sights of conspecific) were included in the choice situation used in the present experiment, and some of them might promote cooperative choices. Experimental examinations of these factors would lead to better understanding of cooperation in nonhuman animals in the wild as well as in the laboratory.
Acknowledgments
This research was supported by JSPS KAKENHI Grant Number 21530764 and (in part) by the Osaka City University (OCU) Strategic Research Grant 2019 for top priority researches.
Address correspondence to Daisuke Saeki, Department of Psychology, Graduate School of Literature and Human Sciences, Osaka City University, Sugimoto 3-3-138, Sumiyoshi-ku, Osaka, 558-8585 Japan.
References
Axelrod, R., & Hamilton, W. D. (1981). The evolution of cooperation. Science, 211, 1390-1396.
Baker, F., & Rachlin, H. (2001). Probability of reciprocation in repeated prisoner’s dilemma games. Journal of Behavioral Decision Making, 14, 51-67.
Baker, F., & Rachlin, H. (2002). Self-control by pigeons in the prisoner’s dilemma. Psychonomic Bulletin & Review, 9, 482-488.
Baum, W. M. (1974). On two types of deviation from the matching law: Bias and undermathing. Journal of the Experimental Analysis of Behavior, 22, 231-242.
Baum, W. M., & Kraft, J. R. (1998). Group choice: Competition, travel, and the ideal free distribution. Journal of the Experimental Analysis of Behavior, 69, 227-245.
Beauchamp, G., Bélisle, M., & Giraldeau, L.-A. (1997). Influence of conspecific attraction on the spatial distribution of learning foragers in a patchy habitat. Journal of Animal Ecology, 66, 671-682.
Clements, K. C., & Stephens, D. W. (1995). Testing models of non-kin cooperation: mutualism and the Prisoner’s Dillemma. Animal Behaviour, 50, 527-535.
de Waal, F. B. M., & Berger, M. L. (2000) Payment for labour in monkeys. Nature, 404, 563.
Flood, M., Lendenmann, K., & Rapoport, A. (1983). 2×2 games played by rats: Different delays of reinforcement as payoffs. Behavioral Science, 28, 65-78.
Gardner, R. M., Corbin, T. L., Beltramo, J. S., & Nickell, G. S. (1984). The prisoner’s dilemma game and cooperation in the rat. Psychological Reports, 55, 687-696.
Giraldeau, L.-A., & Caraco, T. (2000). Social foraging theory. NJ: Princeton University Press.
Giraldeau, L.-A., & Lefebvre, L. (1986). Exchangeable producer and scrounger roles in a captive flock of feral pigeons: a case for the skill pool effect. Animal Behaviour, 34, 797-803.
Giraldeau, L.-A., & Lefebvre, L. (1987). Scrounging prevents cultural transmission of food-finding behaviour in pigeons. Animal Behaviour, 35, 387-394.
Green, L., Price, P. C., & Hamburger, M. E. (1995). Prisoner’s dilemma and the pigeon: Control by immediate consequences. Journal of the Experimental Analysis of Behavior, 64, 1-17.
Hall, S. S. (2003). Transitions between cooperative and non-cooperative responding in the ‘Pigeon’s Dilemma’. Behavioural Processes, 60, 199-208.
Krama, T., Krams, I., Mänd, R., & Igaune, K. (2009). Reciprocal altruism in birds. Saarbrücken: VDM Verlag Dr. Müller.
Poundstone, W. (1993). Prisoner’s dilemma: John von Neumann, game theory, and the puzzle of the bomb. New York: Anchor Books.
Rutte, C., & Taborsky, M. (2008). The influence of social experience on cooperative behaviour of rats (Rattus norvegicus): direct vs generalized reciprocity. Behavioral Ecology and Sociobiology, 62, 499-505.
Sanabria, F., Baker, F., & Rachlin, H. (2003). Learning by pigeons playing against tit-for-tat in an operant prisoner’s dilemma. Learning & Behavior, 31, 318-331.
Silverstein, A., Cross, D., Brown, J., & Rachlin, H. (1998). Prior experience and patterning in a prisoner’s dilemma game. Journal of Behavioral Decision Making, 11, 123-138.
Stephens, D. W., McLinn, C. M., & Stevens, J. R. (2002). Discounting and reciprocity in an iterated prisoner’s dilemma. Science, 298, 2216-2218.
St-Pierre, A., Larose, K., & Dubois, F. (2009). Long-term social bonds promote cooperation in the iterated Prisoner’s Dilemma. Proceedings of the Royal Society of London B: Biological Sciences, 276, 4223-4228.
Wilkinson, G. S. (1984). Reciprocal food sharing in the vampire bat. Nature, 308, 181-184.