Skip to main content
PLOS One logoLink to PLOS One
. 2023 Jun 23;18(6):e0287402. doi: 10.1371/journal.pone.0287402

In the absence of extensive initial training, cleaner wrasse Labroides dimidiatus fail a transitive inference task

Leonore Bonin 1,*, Redouan Bshary 1
Editor: Rachael Miller (Harrison)2
PMCID: PMC10289426  PMID: 37352163

Abstract

Transitive inference (TI) is a reasoning capacity that allows individuals to deduce unknown pair relationships from previous knowledge of other pair relationships. Its occurrence in a wide range of animals, including insects, has been linked to their ecological needs. Thus, TI should be absent in species that do not rely on such inferences in their natural lives. We hypothesized that the latter applies to the cleaner wrasse Labroides dimidiatus and tested this with 19 individuals using a five-term series (A > B > C > D > E) experiment. Cleaners first learned to prefer a food-rewarding plate (+) over a non-rewarding plate (-) in four plate pairs that imply a hierarchy from plate A to plate E (A+B-, B+C-, C+D-, D+E-), with the learning order counterbalanced between subjects. We then tested for spontaneous preferences in the unknown pairs BD (transitive inference task) and AE (as a control for anchors), interspersed between trials involving a mix of all known adjacent pairs. The cleaners systematically preferred A over E and showed good performance for A+B- and D+E- trials. Conversely, cleaners did not prefer B over D. These results were unaffected by the reinforcement history, but the order of learning of the different pairs of plates had a main impact on the remembrance of the initial training pairs. Overall, cleaners performed randomly in B+C- and C+D- trials. Thus, a memory constraint may have prevented subjects from applying TI. Indeed, a parallel study on cleaner wrasse provided positive evidence for TI but was achieved following extensive training on the non-adjacent pairs which may have over-ridden the ecological relevance of the task.

Introduction

Animal cognition involves a suite of mechanisms through which an individual acquires, stores, and uses information from the environment [1]. Many learned abilities among animals may be explained with Pavlovian or operant conditioning, where individuals either learn to associate a primarily involuntary action with an external stimulus [2] or associate a voluntary action with a consequence [3]. However, it remains a major challenge to identify more complex cognitive processes, such as transitive inference (TI), in non-human animals. TI is defined as the ability to deduce the relative value of two objects that were never observed together—and thus not subject to potential reinforcement learning—from other previously learned pair relationships. For instance, in a list of three objects: A, B, and C; if A is greater than B, and B is greater than C, one can deduce that A is greater than C. Despite its supposed cognitive complexity, evidence of TI has been suggested in a wide variety of non-human vertebrates, including birds (e.g., jays [4, 5]; crows [6, 7]; pigeons [8, 9]; geese [10]), mammals (e.g., chimpanzees [11]; macaques [12]; lemurs [13]; mice [14], and fish (e.g., brook trout [15]; cichlids [16]; cleaner wrasse [17]), as well as some invertebrates (e.g., paper wasps [18]). The occurrence of this capacity has been largely linked to specific aspects of social structure [4, 5, 19]. For example, in large and complex social groups, TI may be used to minimize competitive interactions and thus the cost of fighting and injuries [16, 20]. This is achieved because TI allows an individual to deduce dominance relationships from observations and avoid fighting stronger individuals.

The ecological approach to cognition [1, 21, 22] predicts a tight link between ecological needs and performance in cognitive tasks. The positive results from studies involving social dominances in fishes [15, 16] and paper wasps [18] support the ecological approach to cognition by showing a link between the ecological need and the presence of TI in species that lack highly derived brain structures like a mammalian neocortex or an avian neostriatum. More recently, intraspecific differences observed in TI capacity that are directly linked to sex and rank further demonstrate the important modulation of TI by social need [23]. The ecological approach to cognition also predicts that in the absence of a need to represent hierarchical orders in the brain, TI should be absent, again irrespective of brain anatomy. This is exemplified by a study yielding negative results for honeybees [24] compared to positive results in paper wasps [18]. However, few negative results are ever reported, and hence more studies on TI are needed on species where the ecological approach predicts failure.

Here, we tested the presence or absence of TI in the bluestreak cleaner wrasse Labroides dimidiatus. This protogynous species has a size-based social hierarchy [25], where the largest individual in a group is a male that has a harem of smaller females. Males have a higher reproductive output than females, and sex change is socially suppressed via aggression [25] from the male to the targeted female. While this has yet to be studied in female–female cleaner wrasse interactions, size-based hierarchies are apparently stabilized by individuals suppressing the growth of subordinate individuals through the threat of aggression [26, 27]. Cleaner wrasse harems are of small size (typically 3–6 adult females in [25] (mean 3.5 females per harem [28]). Thus, cleaner wrasse moving between harems can use absolute information (size) to assess the relative strength of a few conspecifics without the need for TI. Similarly, interactions with clients allow direct assessment of client size, mucus quality, and parasite load, so that TI is not needed to establish a hierarchy of client quality as a food patch. Therefore, if the presence of TI is driven by ecological needs [1, 21, 22], we hypothesize that we should not find it in cleaners. Note that our interpretation of the cleaner wrasse social system differs from that of colleagues who independently and almost in parallel to us, also studied TI in cleaner wrasse [17]. While we do not see an ecological need for TI in cleaner wrasse, they have a complex interspecific social life, with about 2000 interactions per day with ‘client’ fishes [29]. Clients visit to have ectoparasites removed, but cleaners prefer to eat client mucus [30]. The resulting conflict of interest has apparently led to the evolution of high strategic sophistication in cleaners [30, 31] allowing them to match or even outperform mammals in various tasks [3133]. Also, cleaners have an impressive cognitive tool kit for an ectotherm vertebrate that includes generalized rule learning [34], long-term memory retention of single events [35], and mirror self-recognition [36, 37]. Thus, we decided to test a “smart” species that we predict, based on our understanding of its ecological needs, should not possess TI mechanism. This will help elucidate whether TI could be an emergent property of a complex social life (see [38] for the similarities between intra- and interspecific social interactions) that involves cooperation, defection, and competition and thus warrants social competence [39].

To test for the presence of TI in cleaner wrasse, we used a standard experimental setup that is based on a five-term series test where the individual must learn and understand the transitive link between five objects (e.g., A > B > C > D > E). Previous studies investigating TI varied the number of objects from three to seven [18, 39, 40]. Three objects (A > B > C, testing AC pair [40]) are not sufficient for assessing TI presence/absence with this paradigm, since success in the AC test can be explained as a simple associative learning component (i.e., A was always rewarded and C never, so I choose A). Even with five objects and the crucial test B vs D, researchers have reported various potential alternative explanations for transitive-like responses [41, 42]. These explanations are linked to different types of learning, such as reinforcement history, value transfer mechanisms, and absolute knowledge. For example, TI experiments may yield a systematic difference in reinforcement history between BC and DE combinations. This is because B is losing against the most dominant object (A) and its value needs to be re-established, while E is never rewarded and hence learning D > E is relatively straightforward and achievable in fewer trials than B > C. These differences between pairs may cause individuals to favor B over D simply because B has been rewarded more often until the learning criterion had been reached, without any implication of transitive inference. To control for these alternative explanations/confounding variables, we explicitly tested for effects of the order of pair presentations and the number of trials for each object pair on individual performance in the experimental trials. Note that at the time of data collection, we were unaware that the parallel study obtained positive evidence for TI, using the same general paradigm (see below) but different training methods [17]. In light of diverging results, we dedicate an important part of the discussion to compare the methods.

Material and methods

The study was conducted in Australia, at the Lizard Island Research Station, from February to March 2020. Due to the loss of one individual during the training phase, a total of 19 fish were tested. All individuals were released at the site of capture after the experiment. The manipulations were approved by Animal Ethics Committee (AEC), permit number CA 2019/11/1336.

Training

The training phase aimed to make fish understand the relative value of each plate compared to the others. To do this, 4 pairs of plates (hereafter referred to as the training pairs) were presented: A+B-, B+C-, C+D, D+E-, where plus and minus indicate the reinforced and the non-reinforced choice, respectively. On the reinforced plate, a small food item of mashed prawn was put on the back of the plate. The non-reinforced plate did not contain any food. Cleaners had to choose between the two simultaneously presented plates, which were separated by an opaque barrier (Fig 1). A choice was scored once the cleaner went behind one of the plates. The cleaners could hence always inspect the back of the chosen plate, which showed the same color pattern as the front to allow a post-choice feedback. Thus, they could eat the food item if the choice had been correct, and afterward still access the non-rewarding plate to see that there was no food. In contrast, the correct plate was removed and hence inaccessible if the choice had been wrong. In order to prevent any systematic preferences in color patterns and/or biases due to the sequence of presentation of pairs of plates, we counterbalanced these variables across subjects (Fig 2).

Fig 1. Experimental setup.

Fig 1

The fish is put behind a see-through barrier (dotted lines) while the plates are inserted and separated by an opaque barrier (full line and grey rectangle). If the fish goes to the right one, he gets the reward and can inspect the other plate (green “V”) whereas both were removed directly if he chose the wrong one (red cross). After the training, we performed the transitive inference task to test whether fish can infer the value of plates from new combinations (i.e. test combinations): B+D- and A+E-. During the task, the training combinations were randomly exposed to the fish and the test combinations were also randomly presented once in a while.

Fig 2. Representation of the four experimental colors and order of learning groups.

Fig 2

The 20 initial fish were equally divided between four experimental groups that differed with respect to the color rank of the plates: Normal, reverse, counter-balanced 1 and 2. Within each group, training pairs are learned in 4 different orders. Each color group included the 4 order of learning groups. After one loss, the counter-balanced group 2 had only 4 fish and the order 1 was not represented anymore.

To facilitate learning, subjects could inspect both plates during the first five trials for each pair of plates, irrespective of their initial choice. With this addition, we ensured early exposure to a rewarding plate that had just before been the non-rewarding plate in the previous combination. Subjects were considered to have learned the correct choice if they reached one of the following criteria: 3 times at least 7 correct choices out of 10 trials, 2 times at least 8 correct choices out of 10 trials, or 10 correct out of 10 trials. Once subjects had subsequently reached learning criteria for all four combinations, we ran a single set of four trials in which each pair was presented once. We conducted 10 trials per fish per day.

Transitive inference task

Prior to the task, four reminding trials were performed in order to expose individuals to each training pair again right before the task. For the task, we tested the subjects’ spontaneous preferences when confronted with two novel combinations: plate A versus plate E, and plate B versus plate D. As A had previously only been used as a rewarding stimulus, and E only as a non-rewarding stimulus, we expected a clear preference for plate A when presented with E. In contrast, both plate B and plate D had been rewarding in one plate combination, and non-rewarding in the second plate combination. Thus, only if subjects had formed a hierarchical representation of the five plates, the use of transitive inference would make them spontaneously prefer plate B over plate D. Otherwise, plate choice was expected to be random. Subjects were tested with both AE and BD combinations three times each. To avoid habituation to such conditions, the six test trials were interspersed in between a sequence of learned plate pairs (AB, BC, CD, DE) where the correct choice yielded the standard reward of a prawn item. More precisely, a single test trial was included in a sequence of 4 training pairs trials. Thus, a total of 30 trials were conducted to obtain our six experimental trials. Moreover, the plate pair presented in the trial before a test was varied as much as possible within individuals and counterbalanced across individuals.

Statistical analyses

All the analyses were carried out on R v.4.0.2.

To check whether the fish performed above chance for the entire set of training pairs during the task, we first calculated the proportion of successful trials for each individual. The % values were then used for a generalized linear mixed effect model that compared observed values to the expected binomial distribution (50% success) based on the null hypothesis that no learning had taken place. We included the fish ID as random variable to avoid pseudo-replication. The significance of the intercept was the indicator of the overall learning of the training pairs or not.

While every fish passed the learning criterion for every training pair and was hence ready for testing, we also analyzed whether all the training pairs were equally remembered during the actual TI task. We performed a Friedman rank sum test on the count of success per training pair, considering the fish identity. We then conducted pairwise comparisons between all training pairs using the Wilcoxon rank sum test with continuity correction. We built different binomial generalized linear mixed-effects models to test the different possible explanatory variables. The choice at each trial was the response variable, and the training pair was always included. Because the amount of data was not sufficient to include both the color groups and the order of learning groups plus all the interaction terms, we had to perform separate models. After model selection using variances analyses, and because of the experimental design, the order of learning groups as well as the interaction term with the training pair were kept in the model while the color group was not.

Finally, to check which training pair was remembered above chance, we performed individual Wilcoxon rank sum tests with continuity corrections for each training pair after graphically checking that the data were homogeneously distributed around the median. The same analyses were performed on each training pair for each order of learning and repeated separately with the fish having an overall memory above 65% on the training pairs.

To compare the test pairs AE and BD, we started by investigating the data of the first trial of the task only, using a two-tailed binomial test for each pairing. When considering the three trials of the task, we applied the same method as we used on the training pairs to see whether the fish succeeded above chance and to investigate the potential roles of the different variables. In this case, the final model included only the test pair as an explanatory variable, since the amount of data did not allow to run the model including the variables considered in our other analyses (i.e., color and order of learning).

To evaluate the extent to which reinforcement history during the initial training phase may have affected choices in the BD test pair, we calculated the ratio between the number of times that B or D plate was rewarded versus non-rewarded: for B the number of succeeded BC trials over failed AB trials, for D the number of succeeded DE trials over failed CD trials. The difference between these “Rb” and “Rd” was calculated and analyzed in a non-parametric Kruskal-Wallis test (because of the small sample size) to see whether there was a systematic difference in reinforcement history between B and D, or a difference between each group of learning order. Furthermore, to test whether variation in the ratio difference could be correlated to a difference in individual success in the BD task, we used a Spearman correlation test on the ratio difference and the counts of success for BD.

All the data and R codes are publicly accessible on figshare (https://figshare.com/projects/No_evidence_for_transitive_Inference_in_cleaner_wrasse_data/158132).

Results

Training pairs

Overall, the fish remembered the training pairs above chance, succeeding on average in 61.4% of trials (null binomial glmer, intercept: p < 0.0001). However, the training pairs were not all equally remembered (Friedman chi-squared = 10.34, df = 3, p = 0.02, Fig 3A). AB (median = 5 with se = 0.359) was remembered significantly better than BC (median = 2 with se = 0.413, pairwise Wilcoxon rank sum test with Bonferroni correction, p = 0.008), and DE (median = 4 with se = 0.350) tended to be remembered better than BC (p = 0.07). The anchoring pairs (AB and DE) were remembered above chance levels (Wilcoxon signed rank test with continuity correction, respectively V = 143 with p = 0.001 and V = 117 with p = 0.01, Fig 3A), while the performances in trials involving the two middle pairs (BC and CD) did not differ significantly from chance (respectively V = 50.5 with p = 0.37 and V = 72 with p = 0.51). A closer inspection of the most successful fish (overall performance above 65%, 6 individuals) showed similar tendencies, with the middle pairs (BC and CD) not equally remembered (Friedman rank sum test, chi-squared = 11.509 of 3 df, p = 0.009). Due to the limited sample size of this subset of fish, the assumption for the Wilcoxon rank sum tests was not met (homogeneity around the median) and no further analyses were conducted.

Fig 3. Graphical representations of the count of choices for each training pair globally (A) and for each order of learning separately (B) during the task.

Fig 3

The A boxplot shows the number of correct choices fish made during the task. Each test pair was presented 6 times. The large horizontal lines indicate the medians, boxes are limited by the upper and the lower quartiles, and the vertical lines indicate the deciles. On the A graph, the large black dotted line indicates the random percentage of success of 50%. The stars inside the boxes show the significance of the difference from the random threshold. Significance code: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1. The B boxplots contain the same information for each order of learning separately. In addition, the clear grey full-lined circles point the pair lastly learned and the darker dotted circles show the group of the pairs learned in the first and the last position.

We also found that there was a significant effect of learning order on remembrance of the training pairs during the task (anova on binomial glmer, Chisq = 103.8 of 9 df, p < 0.005, Fig 3B). All groups were able to reach the chance level and above for AB and DE (mostly at chance for DE), while middle pairs (BC and CD) were only remembered above chance when learned in the last position. For example, BC was the last pair to be learned for group 4 and they remembered it in 80% of trials (Wilcoxon rank sum test, W = 22.5, p = 0.006, Fig 3B), whereas the three other groups show performances at or lower than chance level.

Test pairs

For the AE pair on the first trial of the task only, 19 of 19 subjects chose the A plate (two-sided binomial test: probability of success = 1 and p < 0.0001, Fig 4A). In contrast, only 8 of 19 cleaners chose B over D (two-sided binomial test, probability of success = 0.42 and p = 0.65, Fig 4A).

Fig 4. Graphical representations of the success of all fish for each test pair separately including on the first trial of the task (A) and including the 3 trials (B).

Fig 4

The stars above the boxes show the significance of the difference from it. The large horizontal lines indicate the medians, boxes are limited by the upper and the lower quartiles and the vertical lines indicate the deciles. Significance code: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ‘ 1.

To compare our results to the study of Hotta et al. 2020, we checked whether the fish who learned the training pair in the rank-descending order (from highest to lowest) as their subjects had done (our order 1: AB, BC, CD, DE) would show similar performances. Among these 4 individuals, two chose B during the first trial, compared to 4 of 4 in their study (Note that Hotta et al. erroneously reported 3 of 4 correct first choices; the archived data show 4 of 4).

Considering all 3 trials per task, the results remain unchanged. For the AE pair, fish significantly preferred plate A over plate E (mean of success = 2.6, Wilcoxon signed rank test with continuity correction, V = 186.5, p < 0.001), while they did not solve the BD pair above chance (mean of success = 1.47, V = 92, p = 0.92). The four fish trained in a similar order to those from Hotta et al. (2020) made five correct choices out of twelve trials. This strongly contrasts with the findings of Hotta et al. (2020), which reported a strong preference for D (our B) in the BD pair in all subjects, with a minimum of 10 of 12 correct choices per fish.

Because of the difference among groups of learning order for the training pairs, even if the learning order appeared non-significant overall in the analyses of the test pairs, we checked for the group that remembered the training pair BC the most (73.3% of all trials were correct during the task). It is also the group that learned BC last. In this group of fish (N = 5, learning order 4), 3 of 5 fish chose B over D in the 3 trials. The 2 last fish succeeded only in the last trial and one of these successfully remembered the training pairs 42% of the time (highlighting a more general weakness in the remembrance of the pairs). Note also that DE was equally remembered in this group (around 75%, does not appear significantly different from the random threshold, probably due to the small sample size). Due to the small sample size (N = 5), no subsequent analyses were carried out for these groups.

In our final analysis, we evaluated the reinforcement history of plates B and D. The differences between Rb and Rd ratios were not similar among the four groups of learning order (Kruskal-Wallis, Chisq = 9.95, p = 0.02, Fig 5). The first group significantly differed from group 2 and 4 (pairwise comparisons using Fisher’s least significant difference, for both: difference = 10.5, p = 0.01, Fig 5) and tended to differ from group 3 in the same way (Fisher’s least significant difference, difference = 7.5, p = 0.06, Fig 5).

Fig 5. Graphical representations of the difference between Rb and Rd ratios in each of the four groups of learning orders.

Fig 5

Rb and Rd are the ratio between the number of times that B or D were rewarded and non-rewarded during the training. The Y axis is the calculated difference between those 2 ratios. The large horizontal lines indicate the medians, boxes are limited by the upper and the lower quartiles and the vertical lines indicate the deciles. N indicates the number of fish for each group. The stars above the boxes show the significance of the comparison between the boxes indicated by the arrows. Significance code: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1.

Finally, the individual differences in the reinforcement history of B and D did not correlate with the success in the BD task (Spearman’s correlation test, rho = -0.05, p = 0.8).

Discussion

An increasing number of studies have found evidence for TI among birds (e.g., jays [4, 5]; crows [6, 7]; pigeons [8, 9]; geese [10]), mammals (e.g., chimpanzees [11]; macaques [12]; lemurs [13]; mice [14]), and fishes (e.g., brook trout [15]; cichlids [16]; cleaner wrasses [17]) that is predicted to derive from the ecological needs of the species of interest. However, potential alternative explanations associated with positive findings, as well as a paucity of published negative results (but see [24]), make it important to also test species where the ecological approach to cognition predicts that they should lack TI. In this study, we tested whether cleaner fish use TI to organize information in their brain. We hypothesized that cleaners are lacking ecological conditions where TI would be a useful cognitive tool and thus should lack this cognitive ability according to the ecological approach to cognition. Our view contrasts with that of Hotta et al. [17], who proposed that the ecology of cleaner wrasse predicts the presence of TI. Readers need to decide whether they would expect TI to be present in a protogynous species living in harems that very rarely contain more than six females, with a size-based hierarchy. While they found positive evidence for TI in cleaner fish in their almost simultaneously conducted study [17], we found no evidence for it in our study. A large part of our discussion therefore focuses on potential explanations for the diverging results. As we see it, differences in training provide the most likely explanation.

Importantly, failure to show TI in an experiment could be due to a number of factors, which may or may not be mutually exclusive. One potential explanation would be that the species tested lacks the cognitive process. Alternatively, an inability to memorize the relationship in every training pair may prevent the subject from exhibiting their capacity for TI. The latter is evident in our results, as cleaners lacked the remembrance of the middle pairs BC and CD that were presented between the actual TI trials. This was not expected, since all fish successfully reached the learning criterion during initial training for each pair. Thus, cleaners could not properly remember all four pairs. Instead, our results highlight two well-known effects in TI studies that contribute to this middle pairs outcome: (1) the recency effect (i.e., an item is favored if it was seen just before) and (2) the anchoring effect (i.e., the first and the last pair of the list are better remembered than the middle pairs). The memory performances on our training pairs were highly impacted by the recency effect, as the better-remembered pair was the one that was learned the last. Moreover, the observed U-shaped pattern of the remembrance of each training pair, in addition to the failure to remember the middle pairs above chance, are indicative of the anchoring effect. This U-shaped pattern has been previously reported in both unsuccessful [24] and successful [4346] TI experiments. In previous successful studies [4346], the middle pairs were still remembered above chance, unlike in the present study. The U-shaped pattern provides a potential explanation for lower preferences in the BD pair compared to the AE pair [47] and could be explained by the serial position effect (SPE), which predicts that performances on further objects in a series are higher compared to the performances on neighboring objects [41, 43, 44, 4850]. In our case, the U-shaped pattern could not be found when the results from the 4 groups of learning order were looked at independently. Only the combination of the results from all of these groups showed the U-shaped pattern. When we looked at each order of learning independently, we could see again the anchoring effect and clearly observe the recency effect. Indeed, by comparing orders of learning 1 and 2 to orders 3 and 4, we saw that the difference between the first and last learned pairs was reduced when those were the anchoring pairs AB and DE. We observed that the better-remembered training pair for each group of learning order was the pair that was learned the last, which is a clear sign of the recency effect. These effects had the same importance for the 6 fish with an overall memory equal to or above 65%, which indicates that the failure in BC and CD was not influenced by the overall success in the memorization of the training pairs. These memory differences between the anchoring and the middle pairs can explain both the precision with which subjects preferred to approach A over E and the lack of preference in BD trials.

Several other potentially confounding variables reported in previous studies were absent in our study, and our sample size allowed us to control for different factors that could have biased the results. Our color groups avoided a particular value attribution to a specific plate (as in [16] with artificial dominance). Also, using 4 groups of counterbalanced learning order made it unnecessary to have a “bias reversal procedure” that consists of over-reinforcing D over B. Then, based on reinforcement history, the individual would significantly choose D in the BD task [6, 7]. Hotta et al. (2020) [17] used the same sequence of training for their four subjects. Our group of four subjects that experienced the same order of learning as the fish in the study by Hotta et al. [17] still showed considerably different results. Moreover, in our study, the pair of plates presented in the trial before a test was varied as much as possible within individuals and counterbalanced across individuals to avoid a systematic bias that could result from a recency effect [5], which could, for example, influence an individual to choose B over D if exposed to B+C- in the previous trial. Furthermore, it appears that the success/failure of our fish was not affected by differences in the reinforcement history of the various plate combinations, which have been suggested as means to yield transitive-like responses in TI tasks [8, 51]. In contrast, reinforcement history could explain the success of three out of four fish used by Hotta et al. [17]. In summary, it appears that no systematic bias could have produced positive results with our study design; poor memorization of the middle pairs (BC and CD) emerges as the main reason for the cleaners’ poor performance. The reason for this failure is still interesting as it sticks to ecological relevancy, so we discuss it later.

Comparison to Hotta et al. 2020

When comparing our study to the work of Hotta et al. [17], there are several methodological differences that may have contributed to our divergent findings (presented in Table 1). One potential explanation, which we consider unlikely, is that the positive results reported by Hotta et al. [17] were due to a combination of chance and small sample size. The authors only tested four cleaners compared to our 19, but all these four cleaners consistently preferred the B plate over the D plate from trial 1 onwards over a series of 12 trials (note that the authors erroneously report only 3 of 4 correct first choices; their published data show 4 of 4 correct choices). Thus, while their sample size was much smaller than ours, we consider it parsimonious to conclude that the differences in performance are due to differences in methods.

Table 1. Main differences in methods and results between Hotta et al. 2020 and the present study.

Difference Hotta et al. 2020 This study
Methods Plates Different colors Colors and pattern differences
Reward Smeared prawn on the front, get one if succeeded or none if failed One prawn item hidden in the back, get it if succeeded or none if failed
Time between trials 1 or 2min between the trials, 2 or 3h between the sessions 20 to 50min between trials
Sample size N = 4 N = 19
Training
  • Phase 1:

    A-B+, B-C+, A-C+, C-D+, D-E+, C-E+

    Criteria: 2 x 5 or 6/6

  • Phase 2:

    A-B+, B-C+, C-D+, D-E+

    Criteria: 1 x 5 or 6/6

  • Phase 3:

    Sessions of 8 trials, each pair presented twice. Criteria: at least 1/2 for each pair per session and over 6/8 in two consecutive sessions

  • 5 trials to present each new pair.

  • Learning:

    A+B-, B+C-, C+D-, D+E-

    Criteria: 3 x 7/10; 2 x 8,9 or 10/10, 9 or 10/10 + 7/10

  • Reminding trials:

    4 trials, each pair presented once before testing

Test 3 x 4 trials for BD 3 x BD and 3 x AE, randomly presented within training pairs trials [10 trials per day, 1 test pair presented per 5 trials]
Counterbalancing BD colors Colors, order of learning, which training plate was presented prior to a test pair
Results Training pairs All remembered equally Not remembered equally
Test B-D+ succeeded B+D- failed.
A+E- succeeded
Reinforcement history Could explain the transitive response for 3 out of 4 fish No global difference for B and D but variations depending on the group of learning order. Not correlated with BD success

Note: the list is not exhaustive, only the bigger ones were listed. Readers are advised to refer to Hotta et al. 2020 to get more details on terminology or else.

A major difference between our two studies involves the number of training pairs. Hotta et. al [17], as well as other studies [10, 12], added non-adjacent pairs to their training (i.e., AC and CE). These combinations may have helped to consolidate the subjects’ memory of the BC and CD combinations. One obvious prerequisite for the ability to demonstrate TI is that subjects memorize the ranks within all pairs so that they can deduce the ranking within novel pairs. In our series of 30 experimental trials, this prerequisite was fulfilled only for the anchoring pairs AB and DE. By adding those non-adjacent pairs, subjects obtained complete information about the rank position of plate C; it is non-rewarding when paired with A or B, and it is rewarding when paired with D and E. This extended knowledge could be necessary for the fish to establish the link between the different plates and be able to infer the value of B over D, potentially by helping to the establishment of a hierarchical list of the plates. An individual then has absolute knowledge about the plates located in both parts of the final list (i.e., A, B, C and C, D, E) that they can use to infer the new BD relationship. Without this supplementary knowledge, it is possible that the list could not be established, so the different pairs would remain as unlinked objects, preventing TI to be applied. The fact that some studies have yielded positive results using a similar experimental design to that used here—without the non-adjacent pairs training—could be a sign that the establishment of the list is already a cognitively demanding step that necessitates extended training for fish compared to some other species.

An interesting middle ground between these two cleaner studies could be to present only the four adjacent pairs but with additional rounds of BC and CD trials until all correct choices are properly memorized. On the other hand, we think that it is an important result of our study that cleaners find it challenging to memorize the middle pairs. This result raises an interesting question—what ecological conditions would yield enough training for memorization of frequently occurring pairs before TI can be used to spontaneously resolve a unique new combination? The advantage of TI is that it is a capacity that allows one to get maximum information with minimum learning. Our negative results suggest that cleaners cannot apply TI without extensive prior training in the four combinations, which negates its major advantage. While the positive results by Hotta et al. [17] seem to suggest that cleaners can apply TI provided that they receive extensive training, we reiterate that we consider their additional training on the AC and CE combinations as a potentially confounding factor. In any case, we consider the fact that our cleaners failed to remember all combinations after they all succeeded the initial learning also points at the ecological irrelevancy of the TI task. Nevertheless, we also acknowledge that replicating the social paradigm used in other species of fishes [15, 16] could help to clarify whether cleaners could succeed more easily (i.e., without an additional non-adjacent pair training) in this type of TI task compared to that utilized in a foraging context. The cognitive process of TI might still be present in cleaners, even if our study did not find evidence for it in this context.

Reinforcement history

It has been noted several times in the literature that a difference in reinforcement history could lead to a transitive-like response in TI tasks [8, 51]. For example, fish subjected to five-item tasks in our study might on average need more reinforcement trials to learn BC than DE because B only loses against a plate with a pure positive association (plate A), while D only wins against a plate with a pure negative association (plate E). More reinforcement trials on B could thus potentially lead to a preference for B over D. That said, we did not find such inequality in the number of reinforcement trials (except for one group of learning order) in our study. Moreover, individual differences in reinforcement history ratios did not correlate with a preference for B or D.

Transitive inference tests

The types of evidence needed to adequately test for the presence of TI continue to be debated [42, 52, 53], but the generally accepted paradigm typically involves information on social dominance—where few observations of pairs fighting may reveal information of relative strength without subjects receiving any rewards [15, 16]. In contrast, our paradigm can be criticized on the basis that a plate’s ‘state’ is 1 (food) or 0 (no food) depending on the plate it is paired with. Such a presentation does not match on some continuous scale. Unfortunately, experiments involving reinforcement learning cannot use a continuous scale (like fighting ability) as subjects would obtain absolute information on food reward (e.g., A offers more food than B, which offers more food than C, etc.) so that TI is not needed for subjects to prefer B over D. Nevertheless, we note that the food paradigm has been used on various species ([9, 11, 24, 54]; mix with social hierarchy: [13]), often yielding positive results. These differences are interesting and interpretable. The comparison between paper wasps and bees [18, 24], as well as within-species variations in chicks [23], fits the ecological approach to cognition, and we argue that our negative results do so as well. Cleaner fish obtain absolute information about client value as a food source in nature, and their social hierarchies in each harem are based on relative body size (i.e., also based on absolute information). Thus, we would argue that cleaners should not need to employ transitive inference under natural conditions.

Conclusion and outlook

The ecological approach to cognition has helped to identify various high performances in species that are phylogenetically distant from humans, as demonstrated in specific tasks that correspond to relevant ecological pressures. A classic example involves the selective spatial memory abilities of Clarke’s nutcrackers compared to closely related species that rely less on seed caching [55]. The ecological approach has also helped to identify suitable experimental paradigms to test for advanced cognitive processes, for example by using food caching to test for episodic-like memory in scrub-jays [56]. The cleaner wrasse L. dimidiatus provides another suitable species for the ecological approach as their foraging conflict with clients has apparently selected for the ability to adjust to context-specific situations—i.e., whether the interaction is observed [57], whether they clean alone or with a partner [58], their own physiological state [59], client species features [60] or fish densities [33, 35]. This fine-tuned strategic sophistication raises questions regarding the underlying cognitive processes that can explain these behaviors. As it stands, cleaners perform well in tasks testing for two executive functions, namely self-control [32] and flexibility [61]. While these functions are clearly useful for their interactions with clients, transitive inference is not needed as far as we can judge. Given this, we would argue that designing studies where failure is predicted is an important complement to test the ecological approach to cognition. This is because the ecological approach implicitly assumes that brains function in rather modular ways. Thus, positive results where the ecological approach predicts failure would identify aspects of brain functioning that are more general-purpose tools. Our study suggests that cleaner fish would not use the general-purpose tool of transitive inference in their daily life, though our data do not allow us to distinguish whether this is due to memory constraints or to the absence of the TI process. We also propose that our data can be reconciled with the results of Hotta et al. [17] in that only additional training and performance control before the crucial test may yield positive results, albeit with the caveat that associative learning cannot be excluded as a simpler mechanism. It would be interesting to test larger-brained endotherm vertebrate species that also do not need TI in their daily lives to see whether this cognitive tool might eventually emerge as a by-product of larger computing power.

Acknowledgments

The staff from the Lizard Island Research Station. N. Truskanov and Z. Yanai for helping with the setup and manipulations, the statistician R. Slobodeanu for his advice and answers, and C. Clements for his review of the manuscript.

Data Availability

All the data and R codes are publicly accessible on figshare (https://figshare.com/projects/No_evidence_for_transitive_Inference_in_cleaner_wrasse_data/158132).

Funding Statement

Founder: The Swiss National Science Foundation The recipient: Professor Redouan Bshary. Grant numbers: 310030B_173334 and 310030_192673 website: https://snf.ch/en/FKhU9kAtfXx7w9AI/page/home The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Shettleworth SJ. Cognition, evolution, and behavior. Oxford university press; 2009. [Google Scholar]
  • 2.Pavlov PI. Conditioned reflexes: an investigation of the physiological activity of the cerebral cortex. Ann Neurosci. 2010;17[3]:136. doi: 10.5214/ans.0972-7531.1017309 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Thorndike EL. The law of effect. Am J Psychol. 1927;39[1/4]:212–22. [Google Scholar]
  • 4.Paz-y-Miño C G, Bond AB, Kamil AC, Balda RP. Pinyon jays use transitive inference to predict social dominance. Nature. août 2004;430[7001]:778–81. doi: 10.1038/nature02723 [DOI] [PubMed] [Google Scholar]
  • 5.Bond AB, Kamil AC, Balda RP. Social complexity and transitive inference in corvids. Anim Behav. mars 2003;65[3]:479–87. [Google Scholar]
  • 6.Lazareva OF, Smirnova AA, Bagozkaja MS, Zorina ZA, Rayevsky VV, Wasserman EA. Transitive responding in hooded crows requires linearly ordered stimuli. J Exp Anal Behav. 2004;82[1]:1–19. doi: 10.1901/jeab.2004.82-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Lazareva OF, Wasserman EA. Effect of stimulus orderability and reinforcement history on transitive responding in pigeons. Behav Processes. 2006;72[2]:161–72. doi: 10.1016/j.beproc.2006.01.008 [DOI] [PubMed] [Google Scholar]
  • 8.Couvillon P, Bitterman M. A Conventional Conditioning Analysis of « Transitive Inference » in Pigeons. J Exp Psychol Anim Behav Process. 1 juill 1992;18:308–10. [Google Scholar]
  • 9.Zentall TR, Peng D, Miles L. Transitive inference in pigeons may result from differential tendencies to reject the test stimuli acquired during training. Anim Cogn. 2019;22[5]:619–24. doi: 10.1007/s10071-019-01257-2 [DOI] [PubMed] [Google Scholar]
  • 10.Weiß BM, Kehmeier S, Schloegl C. Transitive inference in free-living greylag geese, Anser anser. Anim Behav. 2010;79[6]:1277–83. [Google Scholar]
  • 11.Gillan DJ. Reasoning in the chimpanzee: II. Transitive inference. J Exp Psychol Anim Behav Process. 1981;7[2]:150. [Google Scholar]
  • 12.Jensen G, Alkan Y, Munoz F, Ferrera VP, Terrace HS. Transitive inference in humans and rhesus macaques after massed training of the last two list items. bioRxiv. 2016;055335. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Tromp D, Meunier H, Roeder JJ. Transitive inference in two lemur species [Eulemur macaco and Eulemur fulvus]. Am J Primatol. 2015;77[3]:338–45. doi: 10.1002/ajp.22349 [DOI] [PubMed] [Google Scholar]
  • 14.Silverman JL, Gastrell PT, Karras MN, Solomon M, Crawley JN. Cognitive abilities on transitive inference using a novel touchscreen technology for mice. Cereb Cortex. 2015;25[5]:1133–42. doi: 10.1093/cercor/bht293 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.White SL, Gowan C. Brook trout use individual recognition and transitive inference to determine social rank. Behav Ecol. 2013;24[1]:63–9. [Google Scholar]
  • 16.Grosenick L, Clement TS, Fernald RD. Fish can infer social rank by observation alone. Nature. 2007;445[7126]:429–32. doi: 10.1038/nature05511 [DOI] [PubMed] [Google Scholar]
  • 17.Hotta T, Ueno K, Hataji Y, Kuroshima H, Fujita K, Kohda M. Transitive inference in cleaner wrasses [Labroides dimidiatus]. PloS One. 2020;15[8]:e0237817. doi: 10.1371/journal.pone.0237817 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Tibbetts EA, Agudelo J, Pandit S, Riojas J. Transitive inference in Polistes paper wasps. Biol Lett. 2019;15[5]:20190015. doi: 10.1098/rsbl.2019.0015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.MacLean EL, Merritt DJ, Brannon EM. Social complexity predicts transitive reasoning in prosimian primates. Anim Behav. 2008;76[2]:479–86. doi: 10.1016/j.anbehav.2008.01.025 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Bond AB, Wei CA, Kamil AC. Cognitive representation in transitive inference: A comparison of four corvid species. Behav Processes. oct 2010;85[3]:283–92. doi: 10.1016/j.beproc.2010.08.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Kamil AC. On the proper definition of cognitive ethology. In: Animal cognition in nature. Elsevier; 1998. p. 1–28. [Google Scholar]
  • 22.Hutchins E. Cognitive Ecology. Top Cogn Sci. 2010;2[4]:705–15. doi: 10.1111/j.1756-8765.2010.01089.x [DOI] [PubMed] [Google Scholar]
  • 23.Daisley JN, Vallortigara G, Regolin L. Low-rank Gallus gallus domesticus chicks are better at transitive inference reasoning. Commun Biol. 9 déc 2021;4[1]:1–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Benard J, Giurfa M. A Test of Transitive Inferences in Free-Flying Honeybees: Unsuccessful Performance Due to Memory Constraints. Learn Mem. 5 janv 2004;11[3]:328–36. doi: 10.1101/lm.72204 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Robertson DR. Social control of sex reversal in a coral-reef fish. Science. 1972;177[4053]:1007–9. doi: 10.1126/science.177.4053.1007 [DOI] [PubMed] [Google Scholar]
  • 26.Buston P. Size and growth modification in clownfish. Nature. juill 2003;424[6945]:145–6. [DOI] [PubMed] [Google Scholar]
  • 27.Booth DJ. Juvenile Groups in a Coral-Reef Damselfish: Density-Dependent Effects on Individual Fitness and Population Demography. Ecology. janv 1995;76[1]:91–106. [Google Scholar]
  • 28.Sakai Y, Kohda M. Spawning timing of the cleaner wrasse, Labroides dimidiatus, on a warm temperate rocky shore. Ichthyol Res. 2001;48:23–30. [Google Scholar]
  • 29.Grutter A. Parasite removal rates by the cleaner wrasse Labroides dimidiatus. Mar Ecol Prog Ser. 1996;130:61–70. [Google Scholar]
  • 30.Grutter AS, Bshary R. Cleaner wrasse prefer client mucus: support for partner control mechanisms in cleaning interactions. Proc R Soc Lond B Biol Sci [Internet]. 7 nov 2003. [cité 25 janv 2023];270[suppl_2]. Disponible sur: https://royalsocietypublishing.org/doi/10.1098/rsbl.2003.0077 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Salwiczek LH, Prétôt L, Demarta L, Proctor D, Essler J, Pinto AI, et al. Adult cleaner wrasse outperform capuchin monkeys, chimpanzees and orang-utans in a complex foraging task derived from cleaner–client reef fish cooperation. PLoS One. 2012;7[11]:e49068. doi: 10.1371/journal.pone.0049068 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Aellen M, Dufour V, Bshary R. Cleaner fish and other wrasse match primates in their ability to delay gratification. Anim Behav. 1 juin 2021;176:125–43. [Google Scholar]
  • 33.Triki Z, Li Richter XY, Demairé C, Kurokawa S, Bshary R. Marine cleaning mutualism defies standard logic of supply and demand. Am Nat. 2022;199[4]:455–67. doi: 10.1086/718315 [DOI] [PubMed] [Google Scholar]
  • 34.Wismer S, Grutter A, Bshary R. Generalized rule application in bluestreak cleaner wrasse [Labroides dimidiatus]: using predator species as social tools to reduce punishment. Anim Cogn. 1 juill 2016;19[4]:769–78. doi: 10.1007/s10071-016-0975-4 [DOI] [PubMed] [Google Scholar]
  • 35.Triki Z, Wismer S, Rey O, Ann Binning S, Levorato E, Bshary R. Biological market effects predict cleaner fish strategic sophistication. Behav Ecol. 2019;30[6]:1548–57. [Google Scholar]
  • 36.Kohda M, Hotta T, Takeyama T, Awata S, Tanaka H, Asai J ya, et al. If a fish can pass the mark test, what are the implications for consciousness and self-awareness testing in animals? PLoS Biol. 2019;17[2]:e3000021. doi: 10.1371/journal.pbio.3000021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Kohda M, Sogawa S, Jordan AL, Kubo N, Awata S, Satoh S, et al. Further evidence for the capacity of mirror self-recognition in cleaner fish and the significance of ecologically relevant marks. PLoS Biol. 2022;20[2]:e3001529. doi: 10.1371/journal.pbio.3001529 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Oliveira RF, Bshary R. Expanding the concept of social behavior to interspecific interactions. Ethology. 2021;127[10]:758–73. [Google Scholar]
  • 39.Taborsky B, Oliveira RF. Social competence: an evolutionary approach. Trends Ecol Evol. 1 déc 2012;27[12]:679–88. doi: 10.1016/j.tree.2012.09.003 [DOI] [PubMed] [Google Scholar]
  • 40.Gazes RP, Hampton RR, Lourenco SF. Transitive inference of social dominance by human infants. Dev Sci [Internet]. mars 2017. [cité 24 janv 2023];20[2]. Disponible sur: https://onlinelibrary.wiley.com/doi/10.1111/desc.12367 doi: 10.1111/desc.12367 [DOI] [PubMed] [Google Scholar]
  • 41.Vasconcelos M. Transitive inference in non-human animals: An empirical and theoretical analysis. Behav Processes. 2008;78[3]:313–34. doi: 10.1016/j.beproc.2008.02.017 [DOI] [PubMed] [Google Scholar]
  • 42.Allen C. Transitive inference in animals: Reasoning or conditioned associations? In: Hurley S, Nudds M, éditeurs. Rational Animals? [Internet]. Oxford University Press; 2006. [cité 24 janv 2023]. p. 175–86. Disponible sur: https://academic.oup.com/book/5590/chapter/148575184 [Google Scholar]
  • 43.Bryant PE, Trabasso T. Transitive Inferences and Memory in Young Children. Nature. août 1971;232[5311]:456–8. doi: 10.1038/232456a0 [DOI] [PubMed] [Google Scholar]
  • 44.Woocher FD, Glass AL, Holyoak KJ. Positional discriminability in linear orderings. Mem Cognit. 1978;6[2]:165–73. [Google Scholar]
  • 45.Mikolasch S, Kotrschal K, Schloegl C. Transitive inference in jackdaws [Corvus monedula]. Behav Processes. 2013;92:113–7. doi: 10.1016/j.beproc.2012.10.017 [DOI] [PubMed] [Google Scholar]
  • 46.Daisley JN, Vallortigara G, Regolin L. Logic in an asymmetrical [social] brain: Transitive inference in the young domestic chick. Soc Neurosci. 2 juin 2010;5[3]:309–19. doi: 10.1080/17470910903529795 [DOI] [PubMed] [Google Scholar]
  • 47.Von Fersen L, Wynne CD, Delius JD, Staddon JE. Transitive inference formation in pigeons. J Exp Psychol Anim Behav Process. 1991;17[3]:334. [DOI] [PubMed] [Google Scholar]
  • 48.McGonigle B, Chalmers M. The selective impact of question form and input mode on the symbolic distance effect in children. J Exp Child Psychol. 1984;37[3]:525–54. doi: 10.1016/0022-0965(84)90075-4 [DOI] [PubMed] [Google Scholar]
  • 49.Lazareva OF, Paxton Gazes R, Elkins Z, Hampton R. Associative models fail to characterize transitive inference performance in rhesus monkeys [Macaca mulatta]. Learn Behav. 2020;48[1]:135–48. doi: 10.3758/s13420-020-00417-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Wu X, Levy WB. Simulating symbolic distance effects in the transitive inference problem. Neurocomputing. 2001;38:1603–10. [Google Scholar]
  • 51.Wynne CD, Von Fersen L, Staddon JE. Pigeons’ inferences are transitive and the outcome of elementary conditioning principles: A response. 1992; [Google Scholar]
  • 52.Guez D, Audley C. Transitive or not: a critical appraisal of transitive inference in animals. Ethology. 2013;119[9]:703–26. [Google Scholar]
  • 53.Galizio A, Doughty AH, Williams DC, Saunders KJ. Understanding behavior under nonverbal transitive-inference procedures: Stimulus-control-topography analyses. Behav Processes. juill 2017;140:202–15. doi: 10.1016/j.beproc.2017.05.010 [DOI] [PubMed] [Google Scholar]
  • 54.Davis H. Transitive inference in rats [Rattus norvegicus]. J Comp Psychol Wash DC 1983. 1 déc 1992;106:342–9. doi: 10.1037/0735-7036.106.4.342 [DOI] [PubMed] [Google Scholar]
  • 55.Romonchuk Wayne Joseph. The role of memory in cache-recovery in Clark’s nutcrackers and pinyon jays. Northern Arizona University; 1995.
  • 56.Clayton NS, Dickinson A. Episodic-like memory during cache recovery by scrub jays. Nature. sept 1998;395[6699]:272–4. doi: 10.1038/26216 [DOI] [PubMed] [Google Scholar]
  • 57.Pinto A, Oates J, Grutter A, Bshary R. Cleaner wrasses Labroides dimidiatus are more cooperative in the presence of an audience. Curr Biol. 2011;21[13]:1140–4. doi: 10.1016/j.cub.2011.05.021 [DOI] [PubMed] [Google Scholar]
  • 58.Bshary R, Grutter AS, Willener AST, Leimar O. Pairs of cooperating cleaner fish provide better service quality than singletons. Nature. oct 2008;455[7215]:964–6. doi: 10.1038/nature07184 [DOI] [PubMed] [Google Scholar]
  • 59.Binning SA, Rey O, Wismer S, Triki Z, Glauser G, Soares MC, et al. Reputation management promotes strategic adjustment of service quality in cleaner wrasse. Sci Rep. 21 août 2017;7[1]:8425. doi: 10.1038/s41598-017-07128-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Roche DG, Jornod M, Douet V, Grutter AS, Bshary R. Client fish traits underlying variation in service quality in a marine cleaning mutualism. Anim Behav. 2021;175:137–51. [Google Scholar]
  • 61.Triki Z, Bshary R. Fluctuations in coral reef fish densities after environmental disturbances on the northern Great Barrier Reef. PeerJ. 2019;7:e6720. doi: 10.7717/peerj.6720 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Rachael Miller (Harrison)

21 Feb 2023

PONE-D-23-02374No evidence for transitive inference in cleaner wrasse Labroides dimidiatusPLOS ONE

Dear Dr. Bonin,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

In particular, please carefully address the issues raised by Reviewer 2, including the overall framing and motivation of the manuscript, and Reviewer 1's point regarding the need for increased emphasis on the possibility that the fish failure reflects that of training, rather than inference.

Please submit your revised manuscript by Apr 07 2023 11:59PM​. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Rachael Miller (Harrison)

Academic Editor

PLOS ONE

Journal requirements:

When submitting your revision, we need you to address these additional requirements.

1.  Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf  and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. In your Methods section, please provide additional information regarding the permits you obtained for the work. Please ensure you have included the full name of the authority that approved the field site access and, if no permits were required, a brief statement explaining why.

3. We note that you have included the phrase “data not shown” in your manuscript. Unfortunately, this does not meet our data sharing requirements. PLOS does not permit references to inaccessible data. We require that authors provide all relevant data within the paper, Supporting Information files, or in an acceptable, public repository. Please add a citation to support this phrase or upload the data that corresponds with these findings to a stable repository (such as Figshare or Dryad) and provide and URLs, DOIs, or accession numbers that may be used to access these data. Or, if the data are not a core part of the research being presented in your study, we ask that you remove the phrase that refers to these data.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: No

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: I believe this is a very good paper, well-written, clear, with properly analyzed results. It seems to me that the authors should stress more perhaps that fish showed a failure of training more than a failure of transfer (inference), i.e. that the latter is mostly the outcome of the former. At any event, I believe this paper can be published as it stands.

Reviewer #2: The authors set out to test transitive inference (TI) in cleaner wrasse which are known for their social cognitive complexity. They reason the task is not ecologically relevant, so even though the fish are smart in other contexts, this particular task is irrelevant to them. The reasoning is sound with the exception of a previous paper on the same species and methodology which not only found evidence of TI but also justified why they expect this species to be capable of doing it on ecological grounds. So which expectation is true?? Moreover, TI is typically examined in a social context (agonistic interactions) but here the methodology is a foraging task. It is not immediately apparent if it is appropriate to expect that TI is transferable across contexts. At least some of the introduction needs to address “the cleaner fish in the closet”. Lastly, the test seems to be set up in a discrete way, whereas my intuition says that TI would best work with continuous variables.

The authors seem to be claiming that they found no evidence for TI, but this is not really a true picture of the results. In fact, the fish basically failed the pre-training prerequisites and should never have been tested in the TI test at all. In other words, they were destined to fail. This is particularly problematic because the middle training pairs which are essential to the ultimate TI test were simply not learned above random. The authors mention this in the discussion, but ultimately it’s a fatal problem.

Having said that, I commend the authors for so thoroughly investigating their results, because it clearly highlights potential issues with how these tests are conducted. For me this is a major strength of the paper (lessons on what not to do). Both anchoring and recency effects were present in the training data sets.

I did not find the motivation for this study to be terribly convincing. It is just not clean. One paper says TI is to be expected given the fish’s ecology. Here the authors say its not. Would it not be simpler to just revisit Hotta and say you’ve greatly increased the sample size? That sounds like a better justification to me.

So overall my feeling is the motivation for the study needs to change, as does the take-home message: They failed because we failed… lessons learned.

Specific comments:

Abstract: It is a bit odd that the opening statement suggest that you do no expect to find that cleaner wrasse are capable of AI given their ecology but your last sentence says there is already evidence of TI in this species. This is a little confusing.

L60: evidence of TI has been

L67: I guess the question here is, does it make sense to test TI outside of social hierarchies? The other cleaner fish paper did as I recall but the other fish ones used agonistic interactions.

L70: This is fine, but 2 of the 3 studies done on fish support the ecological explanation. The other doesn’t, although the introduction of Hotta et al 2020 claims their task is ecologically relevant. Here you claim it isn’t. So who to believe??

L86: growth through the threat of aggression

L87: So here you are suggesting they don’t need TI because size is a true predictor of hierarchy position. Do you know if the hierarchy in your model is truly linear? How do you then explain the previous result which used pretty much the same approach? Moreover, im not sure you can transpose TI from a social context to a feeding context. This is an interesting question in its own right.

L135: I have niggling doubts about this task. In my mind TI would be particularly useful for determining where things lie on a sliding scale, A is bigger than B which is bigger than C. But here the procedure is a little bit abstract. In some cases B has a reward (B+C1), in others it doesn’t (A+B-). It’s quite complicated.

L137: in previous iterations of this test the “non preferred” plate would have flake food. Why no reward as apposed to a non-preferred award?

L204: between the number of times that

L228L sooo the fish failed to learn the middle associations? Is there any point in testing for TI given this result? Especially since your test involved contrasts with D

L235: Again this is a problem since the outcome of the TI will be dependent on what came before it.

L262: earned BC last

L281: Yeah im not sure you can sell this idea too hard given Hora et al’s results. What you could say is you have repeated that test with greater sample size. I think this would be a more convincing motivation for the study.

L282: organize information in their

L287: My feeling is that your training regime failed, so the test for TI was not valid. You must seriously consider this as the main reason for your negative finding.

L296: (45) and unsuccessful

L356 & L381: see my earlier comments about this being a discreet test as opposed to a continuous test.

L389: except they don’t because your training failed. This is likely a false negative.

L409: I also think expectations of failure are valuable in the literature, but beware false negatives.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: Culum Brown

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2023 Jun 23;18(6):e0287402. doi: 10.1371/journal.pone.0287402.r002

Author response to Decision Letter 0


13 Apr 2023

Journal requirements:

1. Please ensure that your manuscript meets PLOS ONE's style requirements

Those have been checked

2. In your Methods section, please provide additional information regarding the permits you obtained for the work.

The permit number has been added

3. We note that you have included the phrase “data not shown” in your manuscript.

It was badly used. I just did not display any figure for it, but the data are obviously accessible at the indicated figshare link. I deleted this sentence to avoid any misunderstanding, and just indicated the test results.

Response to reviewer:

We made all the demanded changes in wording and clarified our point of view.

Title:

In the absence of extensive initial training, cleaner wrasse Labroides dimidiatus fail a transitive inference task.

Reviewer #1: I believe this is a very good paper, well-written, clear, with properly analyzed results. It seems to me that the authors should stress more perhaps that fish showed a failure of training more than a failure of transfer (inference), i.e. that the latter is mostly the outcome of the former. At any event, I believe this paper can be published as it stands.

Thank you very much for this positive assessment. We agree that the title can be misleading as our paper is more a highlight of the biases of this TI task and a discussion on the methodological comparisons between our and Hotta et al. study. In light of the comments by referee 2, we also highlight more the importance of failure after the initial training in which they all managed to learn the different pairs.

Reviewer #2: The authors set out to test transitive inference (TI) in cleaner wrasse which are known for their social cognitive complexity. They reason the task is not ecologically relevant, so even though the fish are smart in other contexts, this particular task is irrelevant to them. The reasoning is sound with the exception of a previous paper on the same species and methodology which not only found evidence of TI but also justified why they expect this species to be capable of doing it on ecological grounds. So which expectation is true?? Moreover, TI is typically examined in a social context (agonistic interactions) but here the methodology is a foraging task. It is not immediately apparent if it is appropriate to expect that TI is transferable across contexts. At least some of the introduction needs to address “the cleaner fish in the closet”. Lastly, the test seems to be set up in a discrete way, whereas my intuition says that TI would best work with continuous variables.

We agree that we did not explain well the issue of ecological validity. In the new version, we quote what the Japanese team used as an argument. We disagree with them. We guess it is up to the reader to decide whether their logic or our logic is more parsimonious. They argue with the hierarchy within harems, where new members can use TI to infer the hierarchy without seeing each pair fighting. We think that this is not a good argument because cleaners (as other haremic sex changers) have a size-based hierarchy. So size alone predicts position within the hierarchy. A > B > C is absolute knowledge, no transitive inference is needed (A is always the “winner” and C is always the “looser”). The same logic holds for clients and their value as a function of body size. This contrasts with Grosenick et al. 2007 and their work on an African cichlid fish, where the authors used translocation to create different hierarchies with the very same fish. That was a brilliant design, and TI was absolutely necessary to get it right. Also note that cleaner harem size is rather small compared to the shoaling cichlids, as well as brook trout used by White and Gowan 2013, which can live in groups up to 27 individuals organized in a hierarchy impacted by different factors so as fish must adapt regularly.

Regarding our foraging paradigm (and the discrete reward scheme), we note that the majority of studies on TI have used it before us, including the Japanese team. We added a part just before the “reinforcement history” paragraph in the discussion to introduce the possibility that cleaners could potentially succeed in a social TI paradigm. It is true that the discrete side of could be a problem as the relationship must be continuous (e.g. stronger, bigger ..). We have it in our discussion (section “transitive inference test”) but we did not intend to discuss it more as multiple papers already discussed this problem (see Allen 2006, Guez and Audley 2013 for instance). Even if the relationship between the objects is not transitive (or let’s say, continuous), the response can still be transitive (see the reviews) Importantly, there are many positive results with this approach, including Hotta et al. 2020. Thus, our negative results are meaningful for a comparison.

One extra point that we now highlight in the methods section: our study was conducted in parallel with the Japanese group: we planned the study and collected the data without knowing about their study. Hence, the entire logic in the introduction is the logic we used to plan our study. We did not conduct the experiments to disprove them. We are just slower in publishing… to be precise, they submitted a first version to the journal on 23.3.20; we finished data collection on 15.3.20.

The authors seem to be claiming that they found no evidence for TI, but this is not really a true picture of the results. In fact, the fish basically failed the pre-training prerequisites and should never have been tested in the TI test at all. In other words, they were destined to fail. This is particularly problematic because the middle training pairs which are essential to the ultimate TI test were simply not learned above random. The authors mention this in the discussion, but ultimately it’s a fatal problem.

We adjusted the title to address this comment. However, it is incorrect to say that cleaners failed the pre-training prerequisites. They all learned to prefer the rewarding plate in all 4 plate combinations. Thus, we could have run the TI tests without any further testing. We interspersed the TI trials into a sequence of trials because we wanted a) have several trials for each individual on AE and BD to increase statistical power and mixing those trials with training ones seemed a good tool to lower chances of learning, and b) to look at performance in the 4 plate combinations when they are mixed. Those trials are the ones for which the data are presented. Note that it was a surprise to us that cleaners did not remember above chance the middle pairs. As numerously reported in the literature, we were expecting lower success (although the different order of learning groups should have flattened the curve a little) but the “not above chance” part was not expected. Only because of this design are we able to explain why the cleaners failed at the BD trials. What we explain better now is that they fail at the TI TASK, which doesn’t necessarily mean that they lack the cognitive process (explanation added just before the “reinforcement history” paragraph in the discussion). However, if you cannot memorize 4 relationships, then there is a memory constraint that prevents them from applying any TI even if they would in principle have the process in their repertoire..

In addition, our paper ultimately focuses on the why of the failure, mainly the biases leading to failure in the test of the training pairs and methodological comparisons with Hotta et al, which bring a better understanding of the cleaner wrasse cognition and of potential methodological improvements to be made in general. This is clarified by a change in the title.

We do more than mentioning this problem as the bigger part of the discussion focuses on this particular failure. The end of the first paragraph of the discussion is now clarifying the initial difference in our hypothesis so that it is easier for the reader to keep it in mind.

I did not find the motivation for this study to be terribly convincing. It is just not clean. One paper says TI is to be expected given the fish’s ecology. Here the authors say its not. Would it not be simpler to just revisit Hotta and say you’ve greatly increased the sample size? That sounds like a better justification to me.

This was initially not based on Hotta et al. as the paper came out after we had collected the data. So saying this would be wrong and misleading, especially that there is this entire part of the discussion summarizing the differences between both methodologies. We adapted to their publication and compared both our studies to also allow more debate and discussion about the TI paradigm we used. We also explain why we think the ecology predicts negative results. We do not believe that the Hotta et al. justification is relevant. But their paper has been published and cannot be rewritten anymore. It does not necessarily imply that they are right and we are wrong. Also, the methods differed substantially, so talking about a revisit would be a little misleading.

Abstract: It is a bit odd that the opening statement suggest that you do no expect to find that cleaner wrasse are capable of AI given their ecology but your last sentence says there is already evidence of TI in this species. This is a little confusing.

We have to agree that it is a bit confusing but that is what empirical research can sometimes be about. As explained above, we follow the initial motivation to conduct the study. The abstract is too short to explain the differences between the 2 studies. In our view, the biggest methodological problem of the other study is that fish were also trained on AC and CE pairs. Nobody has done this before. In our view, this gives extra information about the order. It fixes ACE, and makes B and D embedded in different triangles. In summary our abstract says: “we don’t expect TI, we don’t find it, but as another study says yes, we discuss why there might be diverging results”. We do not believe this is misleading, but more of a scientific debate.

L67: I guess the question here is, does it make sense to test TI outside of social hierarchies? The other cleaner fish paper did as I recall but the other fish ones used agonistic interactions.

The referee raises one of the most fundamental questions in cognition research, i.e. whether experimental designs should be rather abstract or capture as much ecological validity as possible. We think a combination of both types would be most powerful. If TI were a general tool, then subjects should be able to apply TI also in abstract tasks like the one we used. As noted earlier, most experiments on TI used the design we used in our study, including the Japanese colleagues.

L70: This is fine, but 2 of the 3 studies done on fish support the ecological explanation. The other doesn’t, although the introduction of Hotta et al 2020 claims their task is ecologically relevant. Here you claim it isn’t. So who to believe??

This has now been clarified in different places in the paper, and it remains up to the reader to decide which justification makes sense for him/her. Picking up on the referee’s comment regarding L67: the experimental design for cleaners is not ecologically relevant. Therefore, independently of whether or not cleaners could benefit from using TI in their social environment, the task used by the Japanese colleagues and us tests for an abstract presence of TI.

L87: So here you are suggesting they don’t need TI because size is a true predictor of hierarchy position. Do you know if the hierarchy in your model is truly linear? How do you then explain the previous result which used pretty much the same approach? Moreover, im not sure you can transpose TI from a social context to a feeding context. This is an interesting question in its own right.

This comment goes in two directions and picks up earlier points. An honest answer is that nobody has precise data on the strictness of the correlation between size and dominance. But we are talking about ever growing females in a sex changing species, living in harems of relatively small number of individuals. The probability of joining a harem that includes five females of almost similar size (Thus that size alone does not allow a prediction of the hierarchy) seems to be almost zero to us. Regarding the second part: indeed, both studies rely on cleaners being able to apply TI to an abstract scenario in order to succeed.

L135: I have niggling doubts about this task. In my mind TI would be particularly useful for determining where things lie on a sliding scale, A is bigger than B which is bigger than C. But here the procedure is a little bit abstract. In some cases B has a reward (B+C1), in others it doesn’t (A+B-). It’s quite complicated.

You are absolutely right, and we also put this in our discussion (see “transitive inference task” section). The main point is that a binary paradigm is, by definition, not representing a transitive (or sliding scale, such as bigger, stronger etc..) relationship. But as we also explained, there are more complications to this. Mainly absolute knowledge, which is then also a major problem to talk about transitive inference. We chose a commonly used paradigm, but we think that some other research group would be more relevant to investigate this question in detail (it actually already being debated in the literature, so the question is absolutely known and raised already)

L137: in previous iterations of this test the “non preferred” plate would have flake food. Why no reward as apposed to a non-preferred award?

There are multiple complications in using preferred vs non-preferred food items. The individual preferences / tolerance are not equal and fix so we would need to be able to adapt quickly to avoid biasing the learning. The food/no food is easier to keep unchanged for the entire duration of this experiment (the data collection lasted for a bit less than 40 days for the slower individuals). And also, again, we used a standardized method to allow comparison. I hope the answer is relevant as I am not sure which iterations you are talking about. Hotta et. al did not use flake either.

L228L sooo the fish failed to learn the middle associations? Is there any point in testing for TI given this result? Especially since your test involved contrasts with D

See the method sections. Those data were collected simultaneously to the TI data so it was unknown. In addition, they all passed the training so we actually did not have any obligation to analyse those. We just wanted to make sure we had all the possible informations to understand the process. As we point out now very clearly, cleaners fail the TASK. Which is not the same as the conclusion that they lack TI as a mechanism. Indeed, they may fail the task simply because they cannot remember four combinations.

L235: Again this is a problem since the outcome of the TI will be dependent on what came before it.

See reply above.

L281: Yeah im not sure you can sell this idea too hard given Hora et al’s results. What you could say is you have repeated that test with greater sample size. I think this would be a more convincing motivation for the study.

As explained above this would not be true; it is not a repeat study. Therefore, the training methods differ. And we are pretty confident that Hotta et.al had positive results because of their overtraining.

L287: My feeling is that your training regime failed, so the test for TI was not valid. You must seriously consider this as the main reason for your negative finding.

It is presented as is in the discussion. You are right that the failure may be due to a lack of memory rather than a lack of TI. But then TI would become rather useless if memory limits prevent fish from using it.

L356 & L381: see my earlier comments about this being a discreet test as opposed to a continuous test.

See replies above.

L389: except they don’t because your training failed. This is likely a false negative.

Again, they all passed the learning. The fact that it does not remain in their memory goes in the same direction: they do not need it. Apparently, an intense and extensive training such as the one Hotta et al. used can overcome this, but we are then totally outside the frame of what TI is helpful for: diminishing the amount of exposure needed to get information. This is also now added in the second part on the discussion.

Attachment

Submitted filename: Response to Reviewers.docx

Decision Letter 1

Rachael Miller (Harrison)

19 Apr 2023

PONE-D-23-02374R1In the absence of extensive initial training, cleaner wrasse Labroides dimidiatus fail a transitive inference task.PLOS ONE

Dear Dr. Bonin,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

==============================

ACADEMIC EDITOR: Thank you for revising the original manuscript. The reviewer (who kindly agreed to do a second review following their suggested changes in round 1) and I feel you have now adequately addressed all major comments. We both recommend minor revisions - to do a further proof-read paying specific attention to grammar corrections (some of which have been highlighted by the reviewer), particularly of the new revised text.  

==============================

Please submit your revised manuscript by Jun 03 2023 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript: 

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Rachael Miller (Harrison)

Academic Editor

PLOS ONE

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #2: The authors have done a great job addressing my earlier comments. The MS could still use a proof read by a native English speaker to improve the grammar. I found multiple grammatical errors (see below) but there are others.

L38: I suggest rephrasing the final sentence of the abstract: Indeed, a parallel study on cleaner wrasse provided positive evidence for TI but was achieved following extensive training on the non-adjacent pairs which may have over-ridden the ecological relevance of the task.

L46: “stores”, not “stocks”

L57: insert bracket after “mice”

L57 & 311: brook trout (trout is singular and plural)

L82: 3.5

Ll314-5: delete “in their brain”

335: replace “who” with “that”. Delete “the”

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: Yes: Culum Brown

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2023 Jun 23;18(6):e0287402. doi: 10.1371/journal.pone.0287402.r004

Author response to Decision Letter 1


1 Jun 2023

The rephrasing has been used as is in the new version of the manuscript. In addition, grammar and clarity have been re-checked by both an appropriated software and a native speaker.

Attachment

Submitted filename: Response to Reviewers.docx

Decision Letter 2

Rachael Miller (Harrison)

5 Jun 2023

In the absence of extensive initial training, cleaner wrasse Labroides dimidiatus fail a transitive inference task.

PONE-D-23-02374R2

Dear Dr. Bonin,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Rachael Miller (Harrison)

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

No further comments

Reviewers' comments:

None

Acceptance letter

Rachael Miller (Harrison)

13 Jun 2023

PONE-D-23-02374R2

In the absence of extensive initial training, cleaner wrasse Labroides dimidiatus fail a transitive inference task.

Dear Dr. Bonin:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Rachael Miller (Harrison)

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Attachment

    Submitted filename: Response to Reviewers.docx

    Attachment

    Submitted filename: Response to Reviewers.docx

    Data Availability Statement

    All the data and R codes are publicly accessible on figshare (https://figshare.com/projects/No_evidence_for_transitive_Inference_in_cleaner_wrasse_data/158132).


    Articles from PLOS ONE are provided here courtesy of PLOS

    RESOURCES