|
Gibson, B. M., Juricevic, I., Shettleworth, S. J., Pratt, J., & Klein, R. M. (2005). Looking for inhibition of return in pigeons. Learn Behav, 33(3), 296–308.
Abstract: We conducted four experiments in order to investigate whether pigeons' responses to a recently attended (i.e., recently pecked) location are inhibited. In Experiments 1 and 2, stimulus displays were similar to those used in studies of inhibition of return (IOR) with humans; responses to cued targets tended to be facilitated rather than inhibited. In Experiments 3 and 4, birds were presented with stimulus displays that mimicked clusters of small grains and were relatively localized, which should have been more appropriate for detecting IOR in pigeons. The results from these experiments again provided evidence for facilitation of responding to cued targets, rather than for IOR.
|
|
|
Hendriksen, P., Elmgreen, K., & Ladewig, J. (2011). Trailer-loading of horses: Is there a difference between positive and negative reinforcement concerning effectiveness and stress-related signs? J. Vet. Behav., 6(5), 261–266.
Abstract: The traditional way to train horses is by the application of negative reinforcement (NR). In the past few years, however, the use of positive reinforcement (PR) has become more common. To evaluate the effectiveness and the possible stressor effect of the 2 training methods, 12 horses showing severe trailer-loading problems were selected and exposed to trailer-loading. They were randomly assigned to one of the 2 methods. NR consisted of various degrees of pressure (lead rope pulling, whip tapping). Pressure was removed as soon as the horse complied. PR horses were exposed to clicker training and taught to follow a target into the trailer. Heart rate (HR) was recorded every 5 seconds and behavior denoting discomfort was observed using one-zero sampling with 10 seconds sampling intervals. Training was completed when the horse could enter the trailer upon a signal, or was terminated after a maximum of 15 sessions. Of the 12 horses, 10 reached the criterion within the 15 sessions. One horse was eliminated from the study because of illness and 1 PR horse failed to enter the trailer. A Mann-Whitney U-test indicated that the horses trained with NR displayed significantly more discomfort behavior per training session than horses trained with PR (NR: 13.26 ± 3.25; PR: 3.17 ± 8.93, P < 0.0001) and that horses in the PR group spent less time (second) per session to complete the training criterion (NR: 672.9 ± 247.12; PR: 539.81 ± 166.37, P < 0.01). A Mann-Whitney U-test showed that no difference existed in mean HR (bpm) between the 2 groups (NR: 53.06 ± 11.73 bpm; PR: 55.54 ± 6.7 bpm, P > 0.05), but a Wilcoxon test showed a difference in the PR group between the baseline of HR and mean HR obtained during training sessions (baseline PR: 43 ± 8.83 bpm; PR: 55.54 ± 6.7 bpm, P < 0.05). In conclusion, the PR group provided the fastest training solution and expressed less stress response. Thus, the PR procedure could provide a preferable training solution when training horses in potentially stressing situations.
|
|
|
Heyes, C. M. (1994). Social learning in animals: categories and mechanisms. Biol. Rev., 69(2), 207–231.
Abstract: There has been relatively little research on the psychological mechanisms of social learning. This may be due, in part, to the practice of distinguishing categories of social learning in relation to ill-defined mechanisms (Davis, 1973; Galef, 1988). This practice both makes it difficult to identify empirically examples of different types of social learning, and gives the false impression that the mechanisms responsible for social learning are clearly understood. It has been proposed that social learning phenomena be subsumed within the categorization scheme currently used by investigators of asocial learning. This scheme distinguishes categories of learning according to observable conditions, namely, the type of experience that gives rise to a change in an animal (single stimulus vs. stimulus-stimulus relationship vs. response-reinforcer relationship), and the type of behaviour in which this change is detected (response evocation vs. learnability) (Rescorla, 1988). Specifically, three alignments have been proposed: (i) stimulus enhancement with single stimulus learning, (ii) observational conditioning with stimulus-stimulus learning, or Pavlovian conditioning, and (iii) observational learning with response-reinforcer learning, or instrumental conditioning. If, as the proposed alignments suggest, the conditions of social and asocial learning are the same, there is some reason to believe that the mechanisms underlying the two sets of phenomena are also the same. This is so if one makes the relatively uncontroversial assumption that phenomena which occur under similar conditions tend to be controlled by similar mechanisms. However, the proposed alignments are intended to be a set of hypotheses, rather than conclusions, about the mechanisms of social learning; as a basis for further research in which animal learning theory is applied to social learning. A concerted attempt to apply animal learning theory to social learning, to find out whether the same mechanisms are responsible for social and asocial learning, could lead both to refinements of the general theory, and to a better understanding of the mechanisms of social learning. There are precedents for these positive developments in research applying animal learning theory to food aversion learning (e.g. Domjan, 1983; Rozin & Schull, 1988) and imprinting (e.g. Bolhuis, de Vox & Kruit, 1990; Hollis, ten Cate & Bateson, 1991). Like social learning, these phenomena almost certainly play distinctive roles in the antogeny of adaptive behaviour, and they are customarily regarded as 'special kinds' of learning (Shettleworth, 1993).(ABSTRACT TRUNCATED AT 400 WORDS)
|
|
|
Hogan, D. E., Zentall, T. R., & Pace, G. (1983). Control of pigeons' matching-to-sample performance by differential sample response requirements. Am J Psychol, 96(1), 37–49.
Abstract: Pigeons were trained on a matching-to-sample task in which sample hue and required sample-specific observing behavior provided redundant, relevant cues for correct choices. On trials that involved red and yellow hues as comparison stimuli, a fixed-ratio 16 schedule (FR 16) was required to illuminate the comparisons when the sample was red, and a differential-reinforcement-of-low-rates 3-sec schedule (DRL 3-sec) was required when the sample was yellow. On trials involving blue and green hues as comparison stimuli, an FR 16 schedule was required when the sample was blue and a DRL 3-sec schedule was required when the sample was green. For some pigeons, a 0-sec delay intervened between sample offset and comparison onset, whereas other pigeons experienced a random mixture of 0-sec and 2-sec delay trials. Test trial performance at 0-sec delay indicated that sample-specific behavior controlled choice performance considerably more than sample hue did. Test performance was independent of whether original training involved all 0-sec delay trials or a mixture of 0-sec and 2-sec delays. Sample-specific observing response requirements appear to facilitate pigeons' matching-to-sample performance by strengthening associations between the observing response and correct choice.
|
|
|
Innes, L., & McBride, S. (2008). Negative versus positive reinforcement: An evaluation of training strategies for rehabilitated horses. Appl. Anim. Behav. Sci., 112(3–4), 357–368.
Abstract: Rescued equids are often exposed to rehabilitation and training (or retraining) programmes to improve their physical and psychological well-being as well as to facilitate the re-homing process. Training uses either positive or negative reinforcement learning procedures and it is considered here that, there may be welfare implications associated with using the latter technique as it has the potential to overlay acute stress on animals with a chronic stress life history. The aim of this study, therefore, was to compare these training strategies (negative versus positive reinforcement) on equine behaviour and physiology as the first step in establishing an optimal rehabilitation approach (from a welfare perspective) for equids that have been subjected to chronic stress in the form of long-term neglect/cruelty. Over a 7-week period, 16 ponies (aged 6–18 months) were trained using either positive (‘positive’) (n = 8) or negative reinforcement (‘negative’) (n = 8) techniques to lead in hand, stand to be groomed, traverse an obstacle course and load into a trailer. Heart rate was measured (5 s intervals) on days 1 and 4 of each training week, ‘Pre’- (1 h), ‘During’ (0.5 h) and ‘Post’- (1 h) training session. Ethograms (10.00–20.00 h) outside of the training period were also compiled twice weekly. In addition, weekly arena tests (as a measure of reactivity) were also performed 1 week before and during the 7 weeks of training. Results showed significant differences between the two training schedules for some measures during the latter stages of the trial and suggested that animals trained under a positive reinforcement schedule were more motivated to participate in the training sessions and exhibited more exploratory or ‘trial and error’ type behaviours in novel situations/environments. In this context, the incorporation of positive reinforcement schedules within a rehabilitation programme may be of benefit to the animal from a welfare perspective.
|
|
|
Kaiser, D. H., Zentall, T. R., & Neiman, E. (2002). Timing in pigeons: effects of the similarity between intertrial interval and gap in a timing signal. J Exp Psychol Anim Behav Process, 28(4), 416–422.
Abstract: Previous research suggests that when a fixed interval is interrupted (known as the gap procedure), pigeons tend to reset memory and start timing from 0 after the gap. However, because the ambient conditions of the gap typically have been the same as during the intertrial interval (ITI), ambiguity may have resulted. In the present experiment, the authors found that when ambient conditions during the gap were similar to the ITI, pigeons tended to reset memory, but when ambient conditions during the gap were different from the ITI, pigeons tended to stop timing, retain the duration of the stimulus in memory, and add to that time when the stimulus reappeared. Thus, when the gap was unambiguous, pigeons timed accurately.
|
|
|
Kaplan, A. I., & Borodovskii, M. I. (1989). [Alternative animal behavior: a model and its statistical characteristics]. Nauchnye Doki Vyss Shkoly Biol Nauki, (3), 29–32.
Abstract: The rats' alternative behaviour in T-maze at simultaneous two-sided food refreshment in 13 trials a day during 6 days has been studied. It has been found that in the first testing days the indexes of alternative behaviour of animals correspond to the characteristics of the random alternation. However, on the 5-6th day of testing in the overwhelming majority of rats the true deviation of alternation index above or below than the theoretical values has been revealed. A question on the existence of two strategies of cognitive behaviour alteration and perseveration in rat population is under discussion.
|
|
|
Langbein, J., Siebert, K., Nuernberg, G., & Manteuffel, G. (2007). The impact of acoustical secondary reinforcement during shape discrimination learning of dwarf goats (Capra hircus). Appl. Anim. Behav. Sci., 103(1-2), 35–44.
Abstract: The use of secondary reinforcement is widely accepted to support operant learning in animals. In farm animals, however, the efficacy of secondary reinforcement has up to now been studied systematically only in horses (“clicker training”), and the results are controversial. We investigated the impact of acoustical secondary reinforcement on voluntary, self-controlled visual discrimination learning of two-dimensional shapes in group-housed dwarf goats (Capra hircus). Learning tests were conducted applying a computer-controlled learning device that was integrated in the animals' home pen. Shapes were presented on a TFT-screen using a four-choice design. Drinking water was used as primary reinforcement. In the control group (Gcontrol, n = 5) animals received only primary reinforcement, whereas in the sound group (Gsound, n = 6) animals got additional acoustical secondary reinforcement. Testing recall of shapes which had been successfully learned by the goats 6 weeks earlier (T1), we found a weak impact of secondary reinforcement on daily learning success (P = 0.07), but not on the number of trials the animals needed to reach the learning criterion (trials to criterion, n.s.). Results in T1 indicated that dwarf goats did not instantly recall previously learned shapes, but, re-learned within 250-450 trials. When learning a set of new shapes (T2), there was a strong influence of secondary reinforcement on daily learning success and on trials to criterion. Animals in Gsound reached the learning criterion earlier (P < 0.05) and needed fewer trials (1320 versus 3700; P < 0.01), compared to animals in Gcontrol. Results suggest that acoustical secondary reinforcement supports visual discrimination learning of dwarf goats, especially when the task is new and the salience of S+ is low.
|
|
|
Lazareva, O. F., Smirnova, A. A., Bagozkaja, M. S., Zorina, Z. A., Rayevsky, V. V., & Wasserman, E. A. (2004). Transitive responding in hooded crows requires linearly ordered stimuli. J Exp Anal Behav, 82(1), 1–19.
Abstract: Eight crows were taught to discriminate overlapping pairs of visual stimuli (A+ B-, B+ C-, C+ D-, and D+ E-). For 4 birds, the stimuli were colored cards with a circle of the same color on the reverse side whose diameter decreased from A to E (ordered feedback group). These circles were made available for comparison to potentially help the crows order the stimuli along a physical dimension. For the other 4 birds, the circles corresponding to the colored cards had the same diameter (constant feedback group). In later testing, a novel choice pair (BD) was presented. Reinforcement history involving stimuli B and D was controlled so that the reinforcement/nonreinforcement ratios for the latter would be greater than for the former. If, during the BD test, the crows chose between stimuli according to these reinforcement/nonreinforcement ratios, then they should prefer D; if they chose according to the diameter of the feedback stimuli, then they should prefer B. In the ordered feedback group, the crows strongly preferred B over D; in the constant feedback group, the crows' choice did not differ significantly from chance. These results, plus simulations using associative models, suggest that the orderability of the postchoice feedback stimuli is important for crows' transitive responding.
|
|
|
McLean, A. N., & Christensen, J. W. (2017). The application of learning theory in horse training. Appl. Anim. Behav. Sci., 190, 18–27.
Abstract: The millennia-old practices of horse training markedly predate and thus were isolated from the mid-twentieth century revelation of animal learning processes. From this standpoint, the progress made in the application and understanding of learning theory in horse training is reviewed including a discussion of how learning processes are employed or otherwise under-utilised in training. This review describes the process of habituation and the most commonly applied desensitisation techniques (systematic desensitisation, counter-conditioning, overshadowing, response prevention) and propose two additional techniques (approach conditioning and stimulus blending). The salience of different types of cues, the interaction of operant and classical conditioning and the impact of stress are also discussed. This paper also exposes the inflexibility and occasional inadequacy of the terminology of learning theory when translated from the research laboratory situation to the practical setting in horse training. While learning theory provides a rich toolbox for riders and trainers, the training process is subject to the simultaneous use of multiple learning processes. In addition, learning/behavioural outcomes and trained responses are not just the result of simple stimulus-response based interactions but are further shaped by arousal, affective and attachment states. More research is needed in these areas. For the field of equitation science to progress and to improve clarity and use of learning processes, changes in nomenclature are required. In particular, the use of the terms 'positive' and 'negative' as descriptive labels in both reinforcement and punishment modalities are unacceptably misleading for everyday use. These labels inhibit the understanding and recognition of the learning processes that these terms supposedly represent, yet the learning processes they describe are vital for horse riders, handlers and trainers to understand. We therefore propose that these labels should be re-labelled more appropriately as 'addition' or 'subtraction' reinforcement/punishment. This would enlighten trainers on the correct application of learning theory, and safety and welfare benefits for people and horses would follow. Finally it is also proposed that the term 'conflict theory' be taken up in equitation science to facilitate diagnosis of training-related behaviour disorders and thus enable the emergence of improved training practices. The optimal use of learning theory should be established as a fundamental principle in equestrian education.
|
|