|
Boysen, S. T., & Berntson, G. G. (1995). Responses to quantity: perceptual versus cognitive mechanisms in chimpanzees (Pan troglodytes). J Exp Psychol Anim Behav Process, 21(1), 82–86.
Abstract: Two chimpanzees were trained to select among 2 different amounts of candy (1-6 items). The task was designed so that selection of either array by the active (selector) chimpanzee resulted in that array being given to the passive (observer) animal, with the remaining (nonselected) array going to the selector. Neither animal was able to select consistently the smaller array, which would reap the larger reward. Rather, both animals preferentially selected the larger array, thereby receiving the smaller number of reinforcers. When Arabic numerals were substituted for the food arrays, however, the selector animal evidenced more optimal performance, immediately selecting the smaller numeral and thus receiving the larger reward. These findings suggest that a basic predisposition to respond to the perceptual-motivational features of incentive stimuli can interfere with task performance and that this interference can be overridden when abstract symbols serve as choice stimuli.
|
|
|
Boysen, S. T., Berntson, G. G., Hannan, M. B., & Cacioppo, J. T. (1996). Quantity-based interference and symbolic representations in chimpanzees (Pan troglodytes). J Exp Psychol Anim Behav Process, 22(1), 76–86.
Abstract: Five chimpanzees with training in counting and numerical skills selected between 2 arrays of different amounts of candy or 2 Arabic numerals. A reversed reinforcement contingency was in effect, in which the selected array was removed and the subject received the nonselected candies (or the number of candies represented by the nonselected Arabic numeral). Animals were unable to maximize reward by selecting the smaller array when candies were used as array elements. When Arabic numerals were substituted for the candy arrays, all animals showed an immediate shift to a more optimal response strategy of selecting the smaller numeral, thereby receiving the larger reward. Results suggest that a response disposition to the high-incentive candy stimuli introduced a powerful interference effect on performance, which was effectively overridden by the use of symbolic representations.
|
|
|
Burke, D., Cieplucha, C., Cass, J., Russell, F., & Fry, G. (2002). Win-shift and win-stay learning in the short-beaked echidna (Tachyglossus aculeatus). Anim. Cogn., 5(2), 79–84.
Abstract: Numerous previous investigators have explained species differences in spatial memory performance in terms of differences in foraging ecology. In three experiments we attempted to extend these findings by examining the extent to which the spatial memory performance of echidnas (or “spiny anteaters”) can be understood in terms of the spatio-temporal distribution of their prey (ants and termites). This is a species and a foraging situation that have not been examined in this way before. Echidnas were better able to learn to avoid a previously rewarding location (to “win-shift”) than to learn to return to a previously rewarding location (to “win-stay”), at short retention intervals, but were unable to learn either of these strategies at retention intervals of 90 min. The short retention interval results support the ecological hypothesis, but the long retention interval results do not.
|
|
|
Cerutti, D. T., & Staddon, J. E. R. (2004). Immediacy versus anticipated delay in the time-left experiment: a test of the cognitive hypothesis. J Exp Psychol Anim Behav Process, 30(1), 45–57.
Abstract: In the time-left experiment (J. Gibbon & R. M. Church, 1981), animals are said to compare an expectation of a fixed delay to food, for one choice, with a decreasing delay expectation for the other, mentally representing both upcoming time to food and the difference between current time and upcoming time (the cognitive hypothesis). The results of 2 experiments support a simpler view: that animals choose according to the immediacies of reinforcement for each response at a time signaled by available time markers (the temporal control hypothesis). It is not necessary to assume that animals can either represent or subtract representations of times to food to explain the results of the time-left experiment.
|
|
|
Christensen, J. W., Rundgren, M., & Olsson, K. (2006). Training methods for horses: habituation to a frightening stimulus. Equine Vet J, 38(5), 439–443.
Abstract: REASONS FOR PERFORMING STUDY: Responses of horses in frightening situations are important for both equine and human safety. Considerable scientific interest has been shown in development of reactivity tests, but little effort has been dedicated to the development of appropriate training methods for reducing fearfulness. OBJECTIVES: To investigate which of 3 different training methods (habituation, desensitisation and counter-conditioning) was most effective in teaching horses to react calmly in a potentially frightening situation. HYPOTHESES: 1) Horses are able to generalise about the test stimulus such that, once familiar with the test stimulus in one situation, it appears less frightening and elicits a reduced response even when the stimulus intensity is increased or the stimulus is presented differently; and 2) alternative methods such as desensitisation and counter-conditioning would be more efficient than a classic habituation approach. METHODS: Twenty-seven naive 2-year-old Danish Warmblood stallions were trained according to 3 different methods, based on classical learning theory: 1) horses (n = 9) were exposed to the full stimulus (a moving, white nylon bag, 1.2 x 0.75 m) in 5 daily training sessions until they met a predefined habituation criterion (habituation); 2) horses (n = 9) were introduced gradually to the stimulus and habituated to each step before the full stimulus was applied (desensitisation); 3) horses (n = 9) were trained to associate the stimulus with a positive reward before being exposed to the full stimulus (counter-conditioning). Each horse received 5 training sessions of 3 min per day. Heart rate and behavioural responses were recorded. RESULTS: Horses trained with the desensitisation method showed fewer flight responses in total and needed fewer training sessions to learn to react calmly to test stimuli. Variations in heart rate persisted even when behavioural responses had ceased. In addition, all horses on the desensitisation method eventually habituated to the test stimulus whereas some horses on the other methods did not. CONCLUSIONS AND POTENTIAL RELEVANCE: Desensitisation appeared to be the most effective training method for horses in frightening situations. Further research is needed in order to investigate the role of positive reinforcement, such as offering food, in the training of horses.
|
|
|
Clement, T. S., & Zentall, T. R. (2002). Second-order contrast based on the expectation of effort and reinforcement. J Exp Psychol Anim Behav Process, 28(1), 64–74.
Abstract: Pigeons prefer signals for reinforcement that require greater effort (or time) to obtain over those that require less effort to obtain (T. S. Clement, J. Feltus, D. H. Kaiser, & T. R. Zentall, 2000). Preference was attributed to contrast (or to the relatively greater improvement in conditions) produced by the appearance of the signal when it was preceded by greater effort. In Experiment 1, the authors of the present study demonstrated that the expectation of greater effort was sufficient to produce such a preference (a second-order contrast effect). In Experiments 2 and 3, low versus high probability of reinforcement was substituted for high versus low effort, respectively, with similar results. In Experiment 3, the authors found that the stimulus preference could be attributed to positive contrast (when the discriminative stimuli represented an improvement in the probability of reinforcement) and perhaps also negative contrast (when the discriminative stimuli represented reduction in the probability of reinforcement).
|
|
|
Clement, T. S., Feltus, J. R., Kaiser, D. H., & Zentall, T. R. (2000). “Work ethic” in pigeons: reward value is directly related to the effort or time required to obtain the reward. Psychon Bull Rev, 7(1), 100–106.
Abstract: Stimuli associated with less effort or with shorter delays to reinforcement are generally preferred over those associated with greater effort or longer delays to reinforcement. However, the opposite appears to be true of stimuli that follow greater effort or longer delays. In training, a simple simultaneous discrimination followed a single peck to an initial stimulus (S+FR1 S-FR1) and a different simple simultaneous discrimination followed 20 pecks to the initial stimulus (S+FR20 S-FR20). On test trials, pigeons preferred S+FR20 over S+FR1 and S-FR20 over S-FR1. These data support the view that the state of the animal immediately prior to presentation of the discrimination affects the value of the reinforcement that follows it. This contrast effect is analogous to effects that when they occur in humans have been attributed to more complex cognitive and social factors.
|
|
|
Coleman, K., Tully, L. A., & McMillan, J. L. (2005). Temperament correlates with training success in adult rhesus macaques. Am. J. Primatol., 65(1), 63–71.
Abstract: In recent years there has been a marked increase in awareness of issues involving the psychological well-being of nonhuman primates (NHPs) used in biomedical research. As a result, many facilities are starting to train primates to voluntarily cooperate with veterinary, husbandry, and research procedures, such as remaining still for blood draws or injections. Such training generally reduces the stress associated with these procedures, resulting in calmer animals and, ultimately, better research models. However, such training requires great investments in time, and there can be vast individual differences in training success. Some animals learn tasks quickly, while others make slower progress in training. In this study, we examined whether temperament, as measured by response to a novel food object, correlated with the amount of time it took to train 20 adult female rhesus macaques to perform a simple task. The monkeys were categorized as “exploratory” (i.e., inspected a novel object placed in the home cage within 10 sec), “moderate” (i.e., inspected the object within 10-180 sec), or “inhibited” (i.e., did not inspect the object within 3 min). We utilized positive reinforcement techniques to train the monkeys to touch a target (PVC pipe shaped like an elbow) hung on their cage. Temperament correlated with training success in this study (Pearson chi2=7.22, df=2, P=0.03). We easily trained over 75% of the animals that inspected the novel food (i.e., exploratory or moderate individuals) to touch the target. However, only 22% of the inhibited monkeys performed the task. By knowing which animals may not respond to conventional training methods, we may be able to develop alternate training techniques to address their specific needs. In addition, these results will allow us to screen monkeys to be assigned to research projects in which they will be trained, with the goal of obtaining the best candidates for those studies.
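Note: The reported P value can be checked from the reported statistic alone. The following is a minimal sketch, not the authors' analysis. It relies on the fact that the upper-tail probability of a chi-square variable with 2 degrees of freedom is exp(-x/2). The function name `chi2_sf_df2` is hypothetical and introduced only for illustration.

```python
import math


def chi2_sf_df2(x: float) -> float:
    """Upper-tail probability P(X >= x) for a chi-square variable with df = 2.

    For df = 2 the chi-square distribution is an exponential distribution
    with mean 2, so the tail probability has the closed form exp(-x / 2).
    """
    if x < 0:
        raise ValueError("chi-square statistic must be non-negative")
    return math.exp(-x / 2.0)


# Statistic as reported in the abstract (chi2 = 7.22, df = 2).
p_value = chi2_sf_df2(7.22)
print(round(p_value, 3))  # prints 0.027
```

The result, about 0.027, agrees with the reported P=0.03 after rounding to two decimal places.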
|
|
|
Cooper, J. J. (1998). Comparative learning theory and its application in the training of horses. Equine Vet J Suppl, (27), 39–43.
Abstract: Training can best be explained as a process that occurs through stimulus-response-reinforcement chains, whereby animals are conditioned to associate cues in their environment with specific behavioural responses and their rewarding consequences. Research into learning in horses has concentrated on their powers of discrimination and on primary positive reinforcement schedules, where the correct response is paired with a desirable consequence such as food. In contrast, a number of other learning processes that are used in training have been widely studied in other species, but have received little scientific investigation in the horse. These include: negative reinforcement, where performance of the correct response is followed by removal of, or decrease in, intensity of an unpleasant stimulus; punishment, where an incorrect response is paired with an undesirable consequence, but without consistent prior warning; secondary conditioning, where a natural primary reinforcer such as food is closely associated with an arbitrary secondary reinforcer such as vocal praise; and variable or partial conditioning, where once the correct response has been learnt, reinforcement is presented according to an intermittent schedule to increase resistance to extinction outside of training.
|
|
|
Dorrance, B. R., & Zentall, T. R. (2001). Imitative learning in Japanese quail (Coturnix japonica) depends on the motivational state of the observer quail at the time of observation. J Comp Psychol, 115(1), 62–67.
Abstract: The 2-action method was used to examine whether imitative learning in Japanese quail (Coturnix japonica) depends on the motivational state of the observer quail at the time of observation of the demonstrated behavior. Two groups of observers were fed before observation (satiated groups), whereas 2 other groups of observers were deprived of food before observation (hungry groups). Quail were tested either immediately following observation or after a 30-min delay. Results indicated that quail in the hungry groups imitated, whereas those in the satiated groups did not, regardless of whether their test was immediate or delayed. The results suggest that observer quail may not learn (through observation) behavior that leads to a reinforcer for which they are unmotivated at the time of test. In addition, the results show that quail are able to delay the performance of a response acquired through observation (i.e., they show deferred imitation).
|
|