Chapter 9
Chapter 9
Chapter 9
- refers to the complex concept by which heredity and environment are presumed to interact and
influence the development of one’s intelligence
FACTOR-ANALYTIC THEORIES
-the focus is squarely on identifying THE ABILITY OR GROUPS OF ABILITIES deemed to constitute
intelligence.
INFORMATION-PROCESSING THEORIES
-the focus is on identifying the specific mental processes that constitute intelligence
As early as 1904, British psychologist Charles Spearman developed new methods to measure how
different tests are related. He discovered that intelligence tests often show some level of correlation
with each other. In 1927, Spearman formalized this idea into a theory of general intelligence, proposing
that a general intellectual ability (called "g") is present in all mental abilities to some extent. This theory,
known as the two-factor theory of intelligence, suggests that "g" represents the common factor in all
intelligence tests, while the remaining differences are due to specific factors ("s") or errors ("e").
Factor Analysis - is a group of statistical techniques designed to determine the existence of underlying
relationships between sets of variables, including test scores.
REPHRASED:
Factor analysis is a set of statistical methods used to identify connections between different variables, like
test scores. In efforts to define intelligence, researchers have used factor analysis to examine how tests
of different abilities, which are thought to reflect intelligence, are related.
All of the perfect 1.00 correlations in the table show that each item perfectly correlates with itself. In the
actual data analysis, these perfect correlations are ignored, and only the correlations below the
diagonal line of 1.00 correlations are analyzed.
Using the correlations shown in Table 1, factor analysis suggests that the behavior rating scale
measures two factors. The high correlations between Items 1 through 3 indicate they all strongly relate
to the first factor, while Items 4 through 6 correlate highly with the second factor. These correlations
with the factors are called factor loadings, and they are shown in Table 2.
Factor analysis helps identify which items relate to which factors, but it doesn't explain what those
factors mean. Researchers usually review the items connected to each factor and, using their intuition
or theoretical knowledge, decide on a name for the factor. For example, Factor 1 could be named
Conduct Problems, Acting Out, or Externalizing Behaviors, while Factor 2 could be called Mood
Problems, Negative Affectivity, or Internalizing Behaviors. This means that the issues on this behavior
rating scale can be effectively summarized with just two scores instead of six. While this reduction may
seem small in this example, in practice, factor analysis can significantly simplify complex behavior
rating scales.
The Cattell-Horn and Carroll models are alike in many ways, such as both identifying broad abilities that
include several narrower abilities. However, merging these two models would need to address their
differences. One key difference is the role of a general intellectual (g) factor. Carroll's model includes g as
the top-level factor, covering broad abilities like fluid intelligence (Gf) and crystallized intelligence (Gc). In
contrast, the Cattell-Horn model does not include g at all. Another difference is how each model treats
"quantitative knowledge" and "reading/writing ability." In the Cattell-Horn model, these are broad abilities,
while Carroll considers them narrow abilities. There are also differences in how they label and define
abilities and how they group factors related to memory.
To bridge these differences, Kevin S. McGrew proposed an integrated model in 1997, later modified by
McGrew and Flanagan in 1998. The current McGrew-Flanagan CHC model includes ten broad abilities,
each covering multiple narrow abilities. These broad abilities are fluid intelligence (Gf), crystallized
intelligence (Gc), quantitative knowledge (Gq), reading/writing ability (Grw), short-term memory (Gsm),
visual processing (Gv), auditory processing (Ga), long-term storage and retrieval (Glr), processing speed
(Gs), and decision/reaction time or speed (Gt).
Notably, the CHC model does not include the general intellectual ability factor (g). The reason for this is
rooted in the model's purpose: to improve psychological assessments in education by identifying and
using tests from different sources to give a complete picture of a student's abilities. The authors felt that g
wasn't useful for these educational evaluations, which is why it was left out of the model.
The greater the MAGNITUDE of g in a test, the better the test was thought to predict overall a specific
variable.
GROUP FACTORS
- is neither as general as g nor as specific as s
- example of these broad group factors include linguistic, mechanical, and arithmetical abilities
INTERPERSONAL INTELLIGENCE
- is the ability to understand other people: what motivates them, how they work, how to work
cooperatively with them
- successful sales people, politicians, teachers, clinicians, and religious leader are all likely to be
individuals with high degrees of interpersonal intelligence
INTRAPERSONAL INTELLIGENCE
- a 7th kind of intelligence
- is a correlative ability, turned inward
- it is a capacity to form an accurate, vertical model of oneself and to be able to use that model to
operate effectively in life
Some factor-analytic studies are conducted on the subtests of a single test or from two or more tests
The type of factor analysis employed by a theorist may well be the tool that presents that theorist’s
conclusions in the possible light.
2 MAJOR TYPES OF COGNITIVE ABILITIES ACCORDING TO CATELL: crystallized intelligence & fluid
intelligence
MAINTAINED ABILITIES
-abilities that tend to not decline with age and may return to preinjury levels follow brain damage
second stratum - composed of eight abilities and processes: fluid intelligence (Gf), crystallized
intelligence (Gc), general memory and learning (Y), broad visual perception (V), broad auditory perception
(U), broad retrieval capacity ( R ) broad cognitive speediness (S), and processing/decision speed (T)
Below each of the abilities in the second stratum are many “level factors” and/or “speed factors” each
different, depednign on the second-level stratum to which they are linked.
For example, three factors linked to Gf are general reasoning, quantitative reasoning, and Piagetian
reasoning. A speed factor linked to Gf is speed of reasoning. Four factors linked to Gc are language
development, comprehension, spelling ability, and communication ability. Two speed factors linked to
Gc are oral fluency and writing ability
HIERARCHICAL MODEL
-meaning that all of the abilities listed in a stratum are subsumed by or incorporated in the strata above
The model was the product of efforts designed to improve the practice of psychological assessment in
education (aka PSYCHOEDUCATIONAL ASSESSMENT)by identifying tests from different batteries that
could be used to provide a comprehensive assessment of a student’s abilities.
CROSS-BATTERY ASSESSMENT
-assessment that employs tests from different tests batteries and entails interpretation of data from
specified subtests to provide a comprehensive assessment
For Thorndike, one’s ability to learn is determined by the number and speed of the bonds that can be
marshaled.
No major tests of intelligence were ever developed based on Thorndike’s multifactor theory.
PASS MODEL
-(planning, attention, simultaneous, successive)
MEASURING INTELLIGENCE
-entails sampling an examinee’s performance on different types of tests and tasks as a function of
developmental level
Measuring intelligence involves testing a person's performance on various tasks based on their
developmental stage. This process also allows for a standardized way to observe how the person
approaches these tasks. The insights gained from this assessment can be very useful in different
settings, such as schools, the military, and businesses.
In infancy (the period from birth through 18 months), intellectual assessment consists primarily of
measuring sensorimotor development - measurement of nonverbal motor responses such as turning over,
lifting the head, sitting up, following a moving object with the eyes, imitating gestures, reaching for a group
of objects
According to Wechsler, adult intelligence scales should tap abilities such as retention of general
information, quantitative reasoning, expressive language and memory, and social judgment
-Tests of Intelligence are more administered in a clinical setting to obtain clinically relevant information or
some measure of learning potential and skill acquisition.
- It is seldom administered for purposes of EDUCATIONAL PLACEMENT
- can be also used to evaluate an impaired individual for the purpose of judging that person’s competency
to make important decisions
- it can also be used to help make decisions about vocational and career decisions and transitions
From the test user’s standpoint, several considerations figure into a test’s appeal:
● the theory (if any) on which the test is based
● the ease with which the test can be administered
● the ease with which the test can be scored
● the ease with which results can be interpreted for a particular purpose
● the adequacy and appropriateness of the norms
● the acceptability of the published reliability and validity indices
● the test’s utility in terms of costs versus benefits
SUBTEST DESCRIPTION
-was the first published intelligence test to provide organized and detailed administration and scoring
instructions
*Lewis Madison Terman - authored the English Translation of the Binet-Simon Test
*The first edition of the Stanford-Binet test had lack of representativeness of the standardization sample
ALTERNATE ITEM
-an item to be substituted for a regular item under specified conditions
*Innovations in the 1937 scale included the development of 2 EQUIVALENT FORMS, labeled
L (Lewis) M (Maud)
as well as new types of tasks for use with preschool-level and adult test takers
*A serious criticism of the test remained: lack of representation of minority groups during the test’s
development
Another revision of the Stanford-Binet was well under way at the time of Terman’s death at age 79 in
1956. This edition of the Stanford-Binet, the 1960 revision, consisted of only a single form (labeled L-M)
and included the items considered to be the best from the two forms of the 1937 test, with no new items
added to the test. A major innovation, however, was the use of the deviation IQ tables in place of the ratio
IQ tables. Earlier versions of the StanfordBinet had employed the ratio IQ, which was based on the
concept of mental age.
MENTAL AGE
-the age level at which an individual appears to be functioning intellectually as indicated by the level
of items responded to correctly
RATIO IQ
- is the ratio of the test taker’s mental age divided by his or her chronological age, multiplied by 100 to
eliminate decimals
A child whose mental age and chronological age were EQUAL would thus have an IQ of 100.
DEVIATION IQ
-reflects a comparison of the performance of the individual with the performance of others of the same
age in the standardization sample
*Essentially, test performance is converted into a standard score with a mean of 100 and a standard
deviation of 16. Meaning that if a performance of a standard deviation above the mean for the
examinee’s age group, the deviation IQ is 116.
AGE SCALE
POINT SCALE
-is a test organized into subtest by category of item, not by age at which most test takers are presumed
capable of responding in the way that is keyed correct
TEST COMPOSITE
-a test score or index derived from the combination of, and/or a mathematical transformation of, one
or more subtest scores
The fifth edition of the Stanford-Binet was designed for administration to assessees as young as 2
and as old a 85 (or older)
-It measures 5 CHC factors by different types of tasks and subsets at different levels.
CHC FACTOR NAME: Fluid Intelligence (Gf)
BRIEF DEFINITION: Novel problem solving; understanding of relationships that are not culturally
bound
BRIEF DEFINITION: Skills and knowledge acquired by formal and informal education
SAMPLE SB5 SUBTEST: Verbal Quantitative Reasoning (verbal) Nonverbal Quantitative Reasoning
(nonverbal)
BRIEF DEFINITION: Ability to see patterns and relationships and spatial orientation as well as the
gestalt among diverse visual stimuli
BRIEF DEFINITION: Cognitive process of temporarily storing and then transforming or sorting
information in memory
The test yields a number of composite scores, including a Full Scale IQ derived from the administration of
10 subsets.
All composite scores of the SB5 have a mean set at 100 and a standard deviation of 15.
To check the reliability of the SB5 Full Scale IQ, a formula for multiple tests was used, showing
consistently high reliability across different ages (.97 to .98) and a good reliability for the Abbreviated
Battery IQ (average .91). Test-retest reliability was high, with a shorter interval of 5 to 8 days compared to
other tests. Inter-scorer reliability varied from .74 to .97, with a median of .90. Poorly agreed-upon items
were removed during development.
ROUTING TEST
-is administered after the examiner has established rapport with the test taker
-it directs or routes an examinee to a particular level of questions that have a high probability of being at
an optimal level of difficulty
TEACHING ITEMS
-are designed to illustrate the task required and assure the examiner that the examinee understands
FLOOR
-refers to the lowest level of the items on a subtest
e.g; at one end of the spectrum and the other at the former end
CEILING
-highest level item of the subtest
If and when examinees fail a certain number of items IN A ROW, a ceiling level is said to have been
reached and testing is discontinued.
*For each subtest on the SB5, there are explicit rules for where to start, where to reverse, and where to
stop (or discontinue).
*Test takers do not earn formal credit for passing the more difficult items. Rather, the examiner would
simply note on the protocol that testing the limits was conducted with regard to a particular subtest and
then record the findings.
For example, an examiner might start at the examinee’s estimated present ability level. The examiner
might reverse if the examinee scores 0 on the first two items from the start point. The examiner would
discontinue testing (stop) after a certain number of item failures after reversing.
If a vague or ambiguous response (open to more than one interpretation) is given on some verbal items
in items in subtests such as Vocabulary, Verbal Absurdities, or Verbal Analogies, the examiner is
encourage to give the examinee a prompt such as “Tell me more”.
The test was constructed this way to accommodate test takers with special needs and to fit the item
response theory model used to calibrate the difficulty of items.
SB5 has a test administration protocol that could be characterized as adaptive in nature.
ADAPTIVE TESTING (tailored testing, sequential testing, branch testing, and response-contingent
testing)
-testing individually tailored (customized) to the test taker
- Adaptive testing might entail beginning a subtest with a question in the middle range of difficulty. If the
test taker responds correctly to the item, an item of greater difficulty is posed next. If the test taker
responds incorrectly to the item, an item of lesser difficulty is posed
Computerized Adaptive Testing is designed to “mimic automatically what a wise examiner would
do”.
Adaptive testing helps ensure that the early test or subtest items are not so difficult as to frustrate the test
taker and not so easy as to lull the testtaker into a false sense of security or a state of mind in which the
task will not be taken seriously enough.
(1) It allows the test user to collect the maximum amount of information in the minimum amount of
time
(2) it facilitates rapport
(3) it minimizes the potential for examinee fatigue from being administered too many items
ANDERSON “intelligence is
two-dimensional and based
on individual differences in
information-processing speed
and executive functioning
influenced largely by inhibitory
processes
PSYCHOMETRIC APPROACH
- the oldest
-examines the ELEMENTAL STRUCTURE OF A TEST
*Following the psychometric approach, we examine the properties of a test through an evaluation of its
correlated and underlying dimensions.
INFORMATION-PROCESSING
-we examine the processes that underlie how we learn and solve problems
COGNITIVE TRADITION
-focuses on how humans adapat to real-world demands
EXTRA-TEST BEHAVIOR
-the way the examinees cope with frustration; how the examinee reacts to items considered very easy;
the amount of support the examinee seems to require; the general approach to the task; how anxious,
fatigued, cooperative, distractible, or compulsive the examinee appears to be—these are the types of
behavioral observations that will supplement formal scores.
120-129 Superior
90-109 Average
WAIS-IV
- is the current Wechsler adult scale that is made up of subtests that are designated either as core or
supplemental.
-contains 10 core subtest (Block Design, Similarities, Digit Span, Matrix Reasoning, Vocabulary,
Arithmetic, Symbol Search, Visual Puzzles, Information, and Coding) and 5 Supplemental Subtest
(Letter-Numbering Sequencing, Figure Weights, Comprehension, Cancellation, and Picture Completion)
CORE SUBTEST
-is one that is administered to obtain a composite score
The success of the Wechsler adult intelligence scale led to the development of similar tests for children,
including the Wechsler Intelligence Scale for Children (WISC), first published in 1949 and now in its fifth
edition, and the Wechsler Preschool and Primary Scale of Intelligence (WPPSI), first published in 1967
and now in its fourth edition.
In general, the Wechsler tests have been evaluated favorably from a psychometric standpoint.
Although the coefficients of reliability will vary as a function of the specific type of reliability
assessed, reported reliability estimates for the Wechsler tests in various categories (internal
consistency, test-retest reliability, inter-scorer reliability) tend to be satisfactory and, in many
cases, more than satisfactory. Wechsler manuals also typically contain a great deal of
information on validity studies, usually in the form of correlational studies or factor-analytic
studies.
From a psychometric perspective, the accuracy of a test (its validity) is closely related to its consistency
(reliability). If changes are made to a test that reduce its reliability, its validity may also decrease. For
example, shortening a test by reducing the number of items often leads to lower reliability, which can
make the test less accurate. Therefore, decisions based on short forms of tests should be made carefully.
If a short form suggests the need for intervention or placement, it might be better to administer the full test
to get more accurate results.
Given the demand for shorter tests, the Wechsler Abbreviated Scale of Intelligence (WASI) was
introduced in 1999. It was designed to quickly assess intellectual ability in individuals aged 6 to 89 years.
The WASI is available in two versions: a two-subtest form that takes about 15 minutes and a four-subtest
form that takes about 30 minutes. These subtests were chosen because they had strong correlations with
the Full Scale IQ on other Wechsler tests, and they measure a broad range of cognitive abilities. The
WASI provides scores for Verbal IQ, Performance IQ, and Full Scale IQ, with a mean of 100 and a
standard deviation of 15. Although the WASI is generally considered reliable and useful, some experts
have expressed concerns about how its validity was tested and reported.
In 2011, a revised version called the WASI-2 was released, with improvements aimed at better integration
with other Wechsler tests, user-friendliness, and enhanced psychometric reliability. Despite these
improvements, users are still cautioned that an abbreviated test like the WASI-2 may not be as clinically
accurate as a full-length test.
The term SHORT FORM refers to a test that has been abbreviated in length, typically to reduce the time
needed for test administration, scoring, and interpretation.
-Sometimes, when the test taker is believed to have an atypically short attention span or other problems
that would make administration of the complete test impossible, a sampling of representative subtests is
administered.
-Ryan and Ward advised that anytime a short form is used, the score should be reported on the official
record with the abbreviation “Est” next to it, indicating that the report value is only an estimate.
GROUP TESTS OF INTELLIGENCE
The tests give a reliable index of a man’s ability to learn, to think quickly and accurately, and to
comprehend instructions. They do not measure loyalty, bravery, dependability, or the emotional traits that
make a man “carry on”. A man’s value to the service is measured by his intelligence plus other necessary
qualifications.
In general, group tests are useful screening tools when large numbers of examinees must be evaluated
either simultaneously or within a limited time frame. We qualify group testing with traditional
because more contemporary forms of group testing, especially testing with all test takers seated at a
computer station, might more aptly be termed individual assessment simultaneously administered in a
group rather than group testing.