46 cards generated

在牌组消失前保存

这些记忆卡还没保存——离开页面后会消失。创建一个免费账户来保留它们,并解锁下面的所有功能。

保存并复习
  • Save this deck to your account
  • Study with spaced repetition
  • Export to Anki (.apkg) or PDF
更大更强的生成
  • Process documents up to 100 pages
  • Images extracted from your PDFs
  • Sharper text extraction & a more advanced AI model
Sign up free → Free forever · No credit card

Flashcards in this deck (46)

正在搜索...
  • What is instrumental conditioning?


    Instrumental conditioning is the learning of a contingency between a behaviour and a consequence.

  • What does 'stamping in' mean in instrumental conditioning?


    'Stamping in' means determining which behaviours are maintained (reinforced).

  • What does 'stamping out' mean in instrumental conditioning?


    'Stamping out' means determining which behaviours are eliminated (not reinforced).

  • How does instrumental conditioning differ from classical conditioning?


    Instrumental conditioning considers overt behaviours operated by an actor that lead to a reinforcer, unlike classical conditioning.

  • What is a reinforcer?


    A reinforcer is any stimulus presented after a response that impacts the frequency that the response is performed.

  • What is reward training in instrumental conditioning?


    Reward training is the presentation of a positive reinforcer that increases behaviour.

  • What is escape training in instrumental conditioning?


    Escape training is the removal of a negative reinforcer that increases behaviour.

  • What is punishment training as described in the text?


    Punishment training is the presentation of a negative reinforcer that decreases behaviour.

  • What is omission training?


    Omission training is the removal of a positive reinforcer that decreases behaviour.

  • What is acquisition in instrumental conditioning?


    Acquisition is learning the relationship between a response and a consequence.

  • What is autoshaping (as named in the text)?


    Autoshaping is listed as a form of acquisition in instrumental conditioning.

  • What is chaining in behavioural training?


    Chaining develops a sequence of behaviours where each behaviour is reinforced with the opportunity to perform the next behaviour.

  • How does shaping differ from chaining?


    Shaping reinforces improvement toward a target response; chaining reinforces performing behaviours in the correct order.

  • Give an example of an SD and corresponding behaviour and reinforcement.


    • Behaviour: child eating vegetables
    • Reinforcement: child gets dessert
    • SD: at parents' home
  • Give an example of an S− using the vegetable-eating scenario.


    • S−: grandparents' home
    • Effect: vegetable eating will not lead to dessert
  • What does discriminative stimulus generalization mean?


    Responding to stimuli similar to the original discriminative stimulus, producing a varied response rate.

  • What is continuous reinforcement?


    When a response leads to a reinforcer on every single trial.

  • Why is continuous reinforcement rare in the real world?


    Because some good deeds are unrewarded and bad deeds go unpunished, so reinforcement is not given every time.

  • What determines reinforcement delivery in partial reinforcement schedules?


    Reinforcement is determined either by the total number of responses or by time.

  • How does a ratio schedule determine reinforcement?


    Reinforcement is based on the number of responses made by the subject.

  • How does an interval schedule determine reinforcement?


    Reinforcement is based on the time since the last response that was reinforced.

  • What do the symbols \(FR\), \(VR\), \(FI\), and \(VI\) stand for?


    • \(FR\): fixed ratio
    • \(VR\): variable ratio
    • \(FI\): fixed interval
    • \(VI\): variable interval
  • What is an example of an FR-1 schedule?


    A reward with food for each pecking response (every response is reinforced).

  • Give a real-world example of a variable ratio (VR) schedule.


    Slot machines deliver reinforcement after a random number of responses around a characteristic mean.

  • How does the slope of a VR schedule relate to the schedule mean?


    A VR schedule with a smaller mean (e.g., VR-10) has a steeper slope than one with a larger mean (e.g., VR-40).

  • What characteristic pattern does a cumulative record show for a fixed interval (FI) schedule?


    A characteristic scallop pattern in the cumulative record.

  • Provide an example illustrating the FI scallop pattern.


    Weekly quizzes: low responding right after a quiz and high responding right before the next weekly quiz.

  • Provide an example of a VI schedule and its response pattern.


    Pop quizzes produce a consistent response rate under a VI schedule.

  • What is ratio strain?


    When a fixed schedule is too demanding (too stingy) and the subject stops responding.

  • What is the pause-and-run pattern after reinforcement?


    Following reinforcement, the participant pauses with inactivity before beginning the next run of responding.

  • Give an example explaining the pause-and-run pattern in animals.


    A pigeon receiving food may pause because it is full and lacks motivation before resuming work for food.

  • What is the effect of a partial reinforcement (PRF) schedule compared to continuous reinforcement (CRF)?


    Partial reinforcement schedules produce more robust behavior that is more resistant to extinction than continuous reinforcement schedules.

  • Define extinction in operant conditioning.


    Extinction is the stopping of a desired behavior once reinforcement is no longer given.

  • Why are variable reinforcement schedules more resistant to extinction than fixed schedules?


    Variable schedules create fewer expectations about when reinforcement will come, so it takes longer to realize reinforcement has stopped.

  • State the Law of Effect.


    A response followed by a satisfying effect is strengthened and likely to occur again, while a response followed by an unsatisfying effect is weakened and less likely to occur again.

  • What is a primary reinforcer?


    A primary reinforcer is a reinforcer with intrinsic value such as food, water, or a mate.

  • What is a secondary reinforcer?


    A secondary reinforcer is a reinforcer that can be exchanged for a primary reinforcer, for example money for humans.

  • What is an operant chamber (Skinner box)?


    An operant chamber is a special chamber with a lever or mechanism by which an animal can respond to produce a reinforcer.

  • How does the operant chamber differ from Thorndike's puzzle box?


    Operant chambers allow shorter trials, no constraints on responding, and the animal remains free to respond again and again, unlike Thorndike's puzzle box.

  • What does a cumulative recorder measure in instrumental conditioning?


    A cumulative recorder records the cumulative response rate during an instrumental conditioning experiment.

  • What are discriminative stimuli in operant conditioning?


    Discriminative stimuli signal when a given response-reinforcer relationship is valid and can indicate the presence (S+) or absence (S-) of that relationship.

  • What is shaping in operant conditioning?


    Shaping breaks a complex desired behavior into smaller steps, reinforcing successive approximations until the full behavior is learned.

  • What is negative contrast in reward value?


    Negative contrast occurs when a response originally receiving a high reward is shifted to a lower reward, resulting in reduced responding.

  • What is positive contrast in reward value?


    Positive contrast occurs when a response originally receiving a low reward is shifted to a high reward, resulting in increased responding.

  • What is the overjustification effect?


    The overjustification effect is when a newly introduced reward for a previously unrewarded task alters perception so the task shifts from intrinsic value to being seen as work with extrinsic value.

  • What is a post-reinforcement pause and when does it occur?


    A post-reinforcement pause is a brief period when the organism stops responding after reinforcement; it occurs after reinforcement on a fixed ratio schedule.

学习笔记

Overview

  • Instrumental (operant) conditioning: learning that a specific behaviour (an action by an actor) leads to a consequence that changes the future probability of that behaviour.
  • Core idea: behaviours that produce satisfying outcomes are strengthened; those producing aversive outcomes are weakened (Law of Effect).

Key definitions

  • Reinforcer: any stimulus presented after a response that changes how often the response occurs.
  • Punishment: presentation of an aversive stimulus that reduces behaviour.
  • Escape (negative reinforcement): behaviour removes a negative stimulus, increasing that behaviour.
  • Omission (response cost): removal of a positive stimulus to decrease behaviour.
  • Discriminative stimulus (S+ / S−): signals when a response–reinforcer contingency is valid (S+) or not valid (S−).
  • Acquisition: learning the response–consequence relationship.
  • Extinction: decline in responding when reinforcement stops.

Types of instrumental outcomes (how behaviour changes)

  • Increase behaviour:
  • Positive reinforcement: present a reward after the response.
  • Negative reinforcement (escape): remove an aversive event when the response occurs.
  • Decrease behaviour:
  • Punishment: present an aversive stimulus after the response.
  • Omission/response cost: remove a positive stimulus after the response.

Acquisition techniques

  • Autoshaping: responses appear because of contingent pairing, often without explicit shaping.
  • Shaping (successive approximations): reinforce incremental improvements toward a complex target behaviour.
  • Chaining: build a sequence of behaviours where each step is reinforced by access to the next step (useful for ordered tasks).
  • Shaping vs chaining: shaping rewards approximations; chaining rewards correct order and completion of steps.

Stimulus control: generalization and discrimination

  • Discriminative stimulus (S+) signals when a response will be reinforced; S− signals the contingency is absent.
  • Generalization: responding to stimuli similar to the original S+; response rate usually varies with similarity.
  • Practical example: child eats vegetables at home (S+) to get dessert, but not at grandparents' (S−).

Schedules of reinforcement (how and when reinforcement is delivered)

  • Two dimensions: whether schedule is based on number of responses (ratio) or time since last reinforcement (interval), and whether it is fixed or variable.
  • Four basic schedules: \(FR\text{-}\#\), \(VR\text{-}\#\), \(FI\text{-}\#\), \(VI\text{-}\#\).
  • Ratio schedules (based on responses):
    • \(FR\text{-}n\) (fixed ratio): reinforcement after every \(n\) responses (e.g. \(FR\text{-}1\) = every response).
    • \(VR\text{-}n\) (variable ratio): reinforcement after a variable number of responses averaging \(n\) (e.g. slot machines).
  • Interval schedules (based on time):
    • \(FI\text{-}t\) (fixed interval): first response after a fixed time \(t\) is reinforced (produces a "scallop" pattern).
    • \(VI\text{-}t\) (variable interval): first response after varying intervals averaging \(t\) is reinforced (produces steady rates; e.g. pop quizzes).
  • Pause-and-run: on fixed ratio schedules, organisms often pause after reinforcement then respond rapidly.
  • Ratio strain: if a fixed-ratio requirement increases too much, responding may break down.

Partial vs continuous reinforcement

  • Continuous reinforcement (CRF): every response is reinforced (rare in real life).
  • Partial (intermittent) reinforcement (PRF): only some responses are reinforced; determined by ratio or interval schedules.
  • Partial-reinforcement effect: behaviours learned under PRF are more resistant to extinction than those learned under CRF.
  • Variable schedules are generally more resistant to extinction than fixed schedules because reinforcement is less predictable.

Extinction and related phenomena

  • Extinction: stopping reinforcement causes decline of the behaviour, but decline is slower after PRF training.
  • Post-reinforcement pause: pause after reinforcement, common on fixed-ratio schedules.
  • Contrast effects:
  • Negative contrast: shifting from a high reward to a lower reward reduces responding.
  • Positive contrast: shifting from low to high reward increases responding.
  • Overjustification effect: adding an extrinsic reward for an activity that had intrinsic value can reduce intrinsic motivation.

Experimental tools and classic concepts

  • Operant (Skinner) chamber: controlled box with manipulandum (lever, key) used to study instrumental conditioning.
  • Cumulative recorder: device that records response rate over time, useful for visualizing patterns (e.g., scallops).
  • Primary reinforcers: stimuli with innate value (food, water, sex).
  • Secondary reinforcers: learned reinforcers that predict or can be exchanged for primary ones (money, tokens).

Practical study tips & examples

  • Remember the Law of Effect: satisfying outcomes strengthen behaviour; unsatisfying outcomes weaken it.
  • Map each real example to: response, consequence, schedule, and discriminative stimuli.
  • Examples to practice:
  • Slot machine → \(VR\text{-}n\) (variable ratio, high steady rate, high resistance to extinction).
  • Weekly quiz preparation → \(FI\text{-}1\ \text{week}\) (scalloped responding: low after quiz, higher before next).
  • Pop quizzes → \(VI\text{-}t\) (steady moderate responding).

Quick-reference table

  • FR (fixed ratio): reward after fixed number of responses → high rate, post-reinforcement pause.
  • VR (variable ratio): reward after varying responses (avg) → very high, steady rate, resistant to extinction.
  • FI (fixed interval): reward for first response after fixed time → scallop pattern.
  • VI (variable interval): reward for first response after varying intervals → steady, moderate rate.

Summary: what to memorize

  • Definitions: reinforcement, punishment, escape, omission, discriminative stimulus.
  • Four schedule types and their characteristic response patterns.
  • Shaping vs chaining and practical uses.
  • Partial-reinforcement effect and why variable schedules resist extinction.