Module III....Learning
Module III....Learning
Behavior Module
III
LEARNING
LEARNING
Learning is one of the important psychological processes that determine the human
behaviour. All human behaviours either directly or indirectly are affected by learning
process. Learning can be conceptualised as acquiring new behaviour in an
interactional environment. It is the process of acquiring, assimilating and
internalizing cognitive, motor or behavioural inputs for their varied uses as and
when required.
Definitions of Learning
With every new experience and every new event or situation, we learn something new. But, this concept of
learning is purely theoretical. In real terms, learning can be described as the modification of one’s
behaviour through practice, training and experience. A few definitions of learning are given below to
understand the nature of learning.
“Learning is a relatively permanent change in behaviour that occurs as a result of prior experience.”
“Learning has taken place if an individual behaves, reacts, and responds as a result of experience in a
manner different from the way he formerly behaved.”
“Learning can be defined as relatively permanent change in behaviour potentiality that results from
reinforced practice or experience.”
Theories of Learning
1. Classical conditioning theory of
learning
Classical conditioning is defined as association of one event with another desired event resulting in a behaviour. The most
popular experiment on the classical conditioning was conducted by a famous Russian psychologist Ivan Pavlov.
Pavlov conducted an experiment on dogs and tried to establish a stimulus-response connection amongst them. Pavolv was
honored with a Nobel Prize for his experiment on this subject. In his experiment on dogs, Pavlov made an attempt to
relate salivation of dogs with ringing of bell. The amount of saliva secretions were measured with the help of a surgical
procedure.
In the first stage of experiment, Pavlov presented a piece of meat in front of the dog. The dog secreted increased amount of
saliva in response to the meat. This response was unconditional. Then, he withheld the presentation of meat and rang a bell
in front of dog at the same time when meat was presented to him. The dog did not exhibit any salivation or we can say that
there was no response.
In the next stage of the experiment, he started ringing the bell at the same time when meat was presented to him. By doing
so he linked meat with ringing of the bell. Thus an association between two stimuli viz. the bell and meat was established.
He continued with this process for some time. After some time, he found that the ringing of bell alone elicited the saliva of
dog even without presentation of meat. Hence, the bell became conditioned stimulus resulting into conditioned or learned
response. The experiment is shown in the following exhibit:
As given in the exhibit, following observations can be drawn from this experiment:
Salivation in response to food was unlearned and natural response or reflex. This response was called unconditioned response.
Operant conditioning was first described by psychologist B.F. Skinner. His theory was based on two
assumptions. First, the cause of human behavior is something in a person’s environment. Second, the
consequences of a behavior determine the possibility of it being repeated. Behaviors followed by a
pleasant consequence are likely to be repeated and those followed by an unpleasant consequence are
less likely to be repeated.
Through his experiments, Skinner identified three types of responses that followed behavior:
• Neutral responses. They are responses from the environment that produce no stimulus other than
focusing attention. They neither increase nor decrease the probability of a behavior being repeated.
• Reinforcers. They are responses from the environment that increase the likelihood of a behavior being
repeated. They can either be positive or negative.
• Punishers. These are negative operants that decrease the likelihood of a behavior. Punishment weakens
behavior.
Types of operant behaviors
B.F. Skinner divided behavior into two different types: respondent and operant.
Respondent behavior. This is the type of behavior that you can’t control. It’s Skinner’s term for what
happened with Pavlov’s dogs -- when they heard a bell, they responded by salivating. It was a reflex, not a
choice. People have respondent behaviors, too. If someone puts your favorite food in front of you, you
likely will start salivating, just like Pavlov’s dogs.
Operant behavior. These are voluntary behaviors that you choose to do based on previous consequences.
You choose to behave in a certain way to get an expected result. For example, you study hard in
anticipation of a reward from your parents. Or if you get punished for talking back to your parents, you are
more likely to choose not to do that in the future.
Positive Reinforcement
Positive reinforcement involves providing a pleasant stimulus to increase the likelihood of a
behavior happening in the future. For example, if your child does chores without being asked,
you can reward them by taking them to a park or giving them a treat.
Skinner used a hungry rat in a Skinner box to show how positive reinforcement works. The box
contained a lever on the side, and as the rat moved about the box, it would accidentally knock
the lever. Immediately after it did so, a food pellet would drop into a container next to the lever.
The consequence of receiving food every time the rat hit the lever ensured that the animal
repeated the action again and again.
Negative Reinforcement
Reinforcement
Positive
Avoidance Extinction Punishment
reinforcement
Principles of Learning
Principles of learning help the managers gain maximum efficiency in a learning situation. However, blind
observance to these principles can cause damaging effects than the gain. Therefore, each principle should be
carefully interpreted and applied after considering the nature of task being learned and the context in which it is
being learned. Some important principles of learning are explained as below:
i. Motivation: Without motivation learning cannot take place. Motivation is the basic principle of learning.
Motivation can be seen at different levels of complexity in different situations. In an experiment conducted by
Edward Tolman, the hungry rat learnt the path through a complex maze to the food. In the experiment
conducted by Thorndike, desire to escape from the puzzle box motivated the animals to learn the way to escape
from the box.
ii. Knowledge of Results: It is a common fact that knowledge of one’s own performance is essential for learning.
Feedback of the performance stimulates the individual to take corrective action if there is any deviation in his
performance.
iii. Reinforcement: Reinforcement is the most important principle of learning. Reinforcement can be defined as
anything that both increases the strength of response and tends to induce repetitions of the behaviour that preceded
the reinforcement. In simple words, reinforcement refers to the consequences of behaviour. Reinforcement is
external environment based. It is described as external environmental events that follow a response. There are
four basic forms of reinforcement viz. positive reinforcement, avoidance, punishment and extinction.
a. Positive reinforcement: Positive reinforcement strengthens and increases the behavior by offering desirable
consequences. In simple words, positive reinforcement is the reward for a desired behavior. The reward should
be such that which stimulates individual’s desired behavior. For example, if an employee performs extremely well in the
organization and receives awards for the same, the probability of repeating the same behavior will increase.
b. Avoidance: It is also known as negative reinforcement. It is another alternative of increasing the desired
behavior. In this, instead of associating a reward with the desired behavior, the individual gets an opportunity to
avoid the punishment by displaying the desired behavior. For example, a manager usually criticizes his subordinates
for not achieving their targets. The employees will achieve their targets to avoid the criticism.
c. Extinction: Whereas positive reinforcement and negative reinforcement are the ways to increase the occurrence
of desired behavior, extinction refers to decreasing the frequency of undesirable behavior. When the rewards are
withdrawn from the behaviors which were earlier associated with the rewards, frequency of such behaviors
will tend to decrease or eventually extinct.
d. Punishment: It is also used to decrease the occurrence of undesired behavior. Punishment refers to the
presentation of an unpleasant or an aversive consequence of undesirable behavior. For example, in an
organization written warnings, pay cuts or terminations are given for the undesirable behaviors like theft,
financial embezzlement, damaging the property and coming late consistently etc.
iv. Schedules of Learning: Reinforcement does not always follow a particular response. An employee, who
always finishes his assignment on time, sometimes receives recognition and appreciation and sometimes is
ignored. Schedules of reinforcement refer to the pattern or timing of administering the reinforcers for the desired
behavior. Schedules of reinforcement are of two types: Continuous and partial.
Schedules of
reinforcement
Continuous Partial
Variable
Fixed ratio Variable ratio Fixed interval
interval
Schedules of Reinforcement
• Continuous schedule: Continuous schedule reinforces the desired behavior every time it is displayed. The reinforcement
is
immediate.
• Partial schedule: In partial or intermittent schedule, the behavior is not reinforced every time it is displayed but,
reinforcement is often enough to make the behavior worth repeating.
Fixed interval schedule: When reinforcement is given at fixed time intervals.
Variable interval schedule: When reinforcement is given at variable times. In other words, reinforcement is given at random
times so that nobody can predict the timing of reinforcement.
Fixed ratio schedule: In this method, reward is given after the fixed number of desired responses is displayed by the
employee.
Variable ratio schedule: In this method, rewards are offered after variable amount of desired responses.