Abstract
In three experiments using rats, we examined the role of a discriminative stimulus (S) in governing the relation between a response (R) and an outcome (O) in an appetitive instrumental learning paradigm. In each experiment, we attempted to distinguish between a simple S-O association and a hierarchical relation in which S is associated with the R-O association. We used three variations on discriminative training procedures and three different assessment techniques for revealing the hierarchical structure. In Experiment 1, we employed a training procedure in which S signaled a change in the R-O relation but no change in the likelihood of O. Although such an arrangement should not produce an excitatory S-O association, it nevertheless generated an S that controlled responding and transferred that control to other responses. In Experiment 2, we used a discrimination procedure in which two Ss each had the same two Rs and Os occur in their presence but each S signaled that a different R-O combination would be in effect. This design provided the opportunity for equivalent pairwise associations among S, R, and O but unique hierarchical relations. The subjects learned the hierarchical structure, as revealed by the specific depressive effect of a subsequent lithium-chloride-induced devaluation of O on responding only in the presence of the S in which that response had led to that outcome. In Experiment 3, one S signaled two different R-O outcomes. Then, two new stimuli were presented with the original S; the R-O relations were retained in the presence of one of the added stimuli but were rearranged in the presence of the other. The added S came to control less responding when it was redundant with respect to the R-O relations than when it was informative. Although all of the results were of modest size and each has an alternative interpretation, together they provide converging evidence for the hierarchical role of S in controlling an R-O association.
Additional information
This research was supported by National Science Foundation Grant BNS 83-08176.
About this article
Cite this article
Colwill, R.M., Rescorla, R.A. Evidence for the hierarchical structure of instrumental learning. Animal Learning & Behavior 18, 71–82 (1990). https://doi.org/10.3758/BF03205241