Avaliaçao Psicologica: Interamerican Journal of Psychological Assessment, 2012
This paper revisits the classic texts in psychometrics and presents the mathematical foundations ... more This paper revisits the classic texts in psychometrics and presents the mathematical foundations of the classical test theory. It discusses the mathematical model of factor analysis, the classical linear model, the derivation of the reliability and types of calculation of the reliability coefficient, the standard error of measurement, the integration of validity with factor analy sis and, finally, item analysis procedures. The text concerns those who want to deepen their knowledge in the concepts of psychometrics, understanding the origin of the main formulas that we use when doing psychometric analysis of tests and scales.
Avaliaçao Psicologica: Interamerican Journal of Psychological Assessment, 2003
EnglishThe paper offers a historical view and the basic rationale of the modern theory in psychom... more EnglishThe paper offers a historical view and the basic rationale of the modern theory in psychometrics: item response theory (IRT). This theory has its roots in the 1930, but it was fully developed only in the 1950's, and became the standard theory in psychometrics in the 1980's. The IRT is one of the latent trait modeling theories that appeared in the 1930's. Latent trait modeling theories assume that the human behavior, called observable variables, is dependent and caused by latent traits, the hypothetical variables. The IRT assume this modeling and expresses the relationship between these two types of variables through a mathematical equation called the logistic equation. This equation produces a curve called the item characteristic curve (ICC). This curve defines the items parameters (difficulty, discrimination) in terms of the level of the latent trait, symbolized as theta (q). The paper also discusses the advantages that IRT offers over the traditional theory in p...
Este trabajo tiene por objeto probar la utilidad de la Bateria para la Evaluacion de la Superdota... more Este trabajo tiene por objeto probar la utilidad de la Bateria para la Evaluacion de la Superdotacion (BaSH/S, por sus siglas en portugues) para identificar diferentes grupos de alumnos superdotados en las aereas de talento academico y artistico. La bateria valora cuatro factores latentes: (a) inteligencia fluida, (b) produccion de metaforas (creatividad verbal), (c) fluidez figurativa (creatividad figurativa), y (d) calidad del pensamiento divergente figurativo (creatividad figurativa). Se tomo una muestra de 987 alumnos adolescentes, 464 chicos y 523 chicas de edades de 8 a 17 anos, que pertenecian a dos grupos: alumnos no superdotados (N=866) y alumnos superdotados (N= 67 habilidades academicas, N=34 habilidades artisticas y N=20 no identificados en un dominio especifico). El grupo de superdotados academicos presento las puntuaciones mas altas en razonamiento y podian producir metaforas mas originales y remotas, eran figurativamente mas fluidos y sus dibujos eran mas originale...
One of key points in psychological tests is related to the meaning of test scores, that is, the i... more One of key points in psychological tests is related to the meaning of test scores, that is, the interpretation of different levels of achievement. Usually the scores intelligibility is accomplished by the use of three procedures: (a) norm reference, content reference and criterion reference. The most common method, norm reference, informs the relative standing of a specific score in relation to a group of reference. Although, the main limitation of this procedure is the lack of information about what kind of attainments a person is capable to achieve. Behavioural scaling, defined by John B. Carroll, is a procedure based on Item Response Theory which overcame this limitation. Using this method it is possible to state, in behavioural terms, the implications of test results in respect to what the subject knows or is capable to realize. This paper discusses this procedure and illustrates its application in the assessment of reading comprehension and
Whereas the structure of individual differences in many social and emotional attributes is well u... more Whereas the structure of individual differences in many social and emotional attributes is well understood in adults, much less work has been done in children and adolescents. The main goals of this research were to specify the major content domains that are assessed across multiple socioemotional instruments (self-esteem, grit, self-efficacy, strengths and difficulties, Big Five) in research in the United States and Europe, to test them in a less developed context with considerable educational challenges (Brazilian schools). We selected the five most promising instruments and studied their structure at the item level in a large sample of Brazilian school students (N = 3,023). The extracted factors to capture the major domains of child differences represented in these instruments closely resembled the Big Five personality dimensions. We discuss the contribution of our findings to the assessment of socio-emotional skills in education research, as well as limitations of the current st...
We test the utility of the Battery for Giftedness Assessment (BaAH/S) in identifying differences ... more We test the utility of the Battery for Giftedness Assessment (BaAH/S) in identifying differences in two groups of already known gifted students in the areas of academic and artistic talents. Four latent factors were assessed (a) fluid intelligence, (b) metaphor production (verbal creativity), (c) figural fluency (figural creativity), and (d) divergent thinking figural task quality (figural creativity). A sample of 987 children and adolescents, 464 boys and 523 girls, of ages ranging from 8 to 17 of two groups: regular students (N=866) and gifted students (N= 67 academic abilities, N=34 artistic abilities and N=20 no domain identified). Academic giftedness group of have higher reasoning, can produce more remote/original metaphors, high figural fluency and drawings rated as more original. Children in the group of artistic giftedness have higher reasoning, high figural fluency and drawings rated as more original. Reasoning abilities are relatively higher in academic giftedness group th...
Responding to the need for school-based, broadly applicable, low-cost, and brief assessments of s... more Responding to the need for school-based, broadly applicable, low-cost, and brief assessments of socio-emotional skills, we describe the conceptual background and empirical development of the SENNA inventory and provide new psychometric information on its internal structure. Data were obtained through a computerized survey from 50,000 Brazilian students enrolled in public school grades 6 to 12, spread across the entire State of São Paulo. The SENNA inventory was designed to assess 18 particular skills (e.g., empathy, responsibility, tolerance of frustration, and social initiative), each operationalized by nine items that represent three types of items: three positively keyed trait-identity items, three negatively keyed identity items, and three (always positively keyed) self-efficacy items, totaling a set of 162 items. Results show that the 18 skill constructs empirically defined a higher-order structure that we interpret as the social-emotional Big Five, labeled as Engaging with Oth...
Acquiescence is a commonly observed response style that may distort respondent scores. One approa... more Acquiescence is a commonly observed response style that may distort respondent scores. One approach to control for acquiescence involves creating a balanced scale and computing sum scores. Other model-based approaches may explicitly include an acquiescence factor as part of a factor analysis or multidimensional item response model. Under certain assumptions, both approaches may result in acquiescence-controlled scores for each respondent. However, the validity of the resulting scores is one issue that is sometimes ignored. In this paper, we present an application of these approaches under both balanced and unbalanced scales, and we report changes in criterion validity and respondent scores.
A Inteligencia Fluida (Gf) refere-se a capacidade geral de raciocinio em situacoes novas pouco es... more A Inteligencia Fluida (Gf) refere-se a capacidade geral de raciocinio em situacoes novas pouco estruturadas. Em termos de processos cognitivos subjacentes, estudos recentes apontam que a Gf esta associada a memoria de trabalho, especialmente as funcoes do executivo central, nomeadamente, coordenacao simultânea de tarefas e atencao seletiva/abstracao. Esse estudo verificou a estrutura fatorial de um conjunto de itens de raciocinio analogico com figuras geometricas criados sistematicamente para representarem esses dois componentes do constructo. O instrumento e informatizado e composto por: (a) Pre-teste contendo doze problemas; (b) Fase de treino na qual sao ensinados os componentes do processamento cognitivo e a estrutura geral dos problemas e (c) Pos-teste com mais doze problemas estruturalmente identicos aos do pre-teste e com feedback sobre a correcao da resposta e tres tentativas possiveis. Participaram 343 estudantes universitarios, 56,5% homens e 43,5% mulheres de cinco cursos...
O estudo buscou investigar as evidências de validade de critério de um instrumento intitulado Tri... more O estudo buscou investigar as evidências de validade de critério de um instrumento intitulado Triagem de Indicadores de Altas Habilidades/Superdotação (AH/S). A escala, respondida pelo professor, avalia o desenvolvimento do estudante em cinco áreas: capacidade intelectual geral, habilidades acadêmicas específicas, liderança, criatividade e talento artístico. A amostra foi composta por 568 participantes: 213 do grupo-controle e 355 do grupo-critério. Os resultados da análise de variância fatorial e do teste t de Student indicaram diferenças de médias significativas entre os grupos, confirmando o tipo de evidência de validade investigada. A regressão logística também foi conduzida visando identificar o quanto a pontuação em cada área específica da escala conseguiria prever a área de identificação do indivíduo com AH/S.Palavras-chave: Aluno com Altas Habilidades/Superdotação, Validade Estatística, Construção de Teste, Avaliação Psicológica. Clasificación de indicadores de altas habilid...
A presente pesquisa correlacionou dados provenientes da aplicação do teste de Inteligência BPR-5 ... more A presente pesquisa correlacionou dados provenientes da aplicação do teste de Inteligência BPR-5 com uma avaliação escolar de matemática e língua portuguesa em uma amostra de 679 alunos do nono ano do ensino fundamental de quatro escolas de uma rede particular de ensino. Os resultados dessas avaliações se mostraram fortemente correlacionados e estatisticamente significativos com escores dos testes de QI (r =,58, p < 0,01), evidenciando elevadas cargas em Inteligência Fluida (Gf). Uma análise longitudinal (5° ao 9° ano) foi aplicada através do Modelo de Curva de Crescimento Latente que investigou a média da variância inicial (intercepto) e a média de crescimento (slope) no desempenho acadêmico (DA) dos sujeitos, em dois modelos (com e sem a variável independente BPR), com o objetivo de investigar a capacidade preditiva de Gf no DA. Quando inserida a variável BPR, seu impacto no intercepto foi estimado em 20,288 e no slope, 6,381. Essas estimativas indicam o acréscimo no desempenho...
Avaliaçao Psicologica: Interamerican Journal of Psychological Assessment, 2012
This paper revisits the classic texts in psychometrics and presents the mathematical foundations ... more This paper revisits the classic texts in psychometrics and presents the mathematical foundations of the classical test theory. It discusses the mathematical model of factor analysis, the classical linear model, the derivation of the reliability and types of calculation of the reliability coefficient, the standard error of measurement, the integration of validity with factor analy sis and, finally, item analysis procedures. The text concerns those who want to deepen their knowledge in the concepts of psychometrics, understanding the origin of the main formulas that we use when doing psychometric analysis of tests and scales.
Avaliaçao Psicologica: Interamerican Journal of Psychological Assessment, 2003
EnglishThe paper offers a historical view and the basic rationale of the modern theory in psychom... more EnglishThe paper offers a historical view and the basic rationale of the modern theory in psychometrics: item response theory (IRT). This theory has its roots in the 1930, but it was fully developed only in the 1950's, and became the standard theory in psychometrics in the 1980's. The IRT is one of the latent trait modeling theories that appeared in the 1930's. Latent trait modeling theories assume that the human behavior, called observable variables, is dependent and caused by latent traits, the hypothetical variables. The IRT assume this modeling and expresses the relationship between these two types of variables through a mathematical equation called the logistic equation. This equation produces a curve called the item characteristic curve (ICC). This curve defines the items parameters (difficulty, discrimination) in terms of the level of the latent trait, symbolized as theta (q). The paper also discusses the advantages that IRT offers over the traditional theory in p...
Este trabajo tiene por objeto probar la utilidad de la Bateria para la Evaluacion de la Superdota... more Este trabajo tiene por objeto probar la utilidad de la Bateria para la Evaluacion de la Superdotacion (BaSH/S, por sus siglas en portugues) para identificar diferentes grupos de alumnos superdotados en las aereas de talento academico y artistico. La bateria valora cuatro factores latentes: (a) inteligencia fluida, (b) produccion de metaforas (creatividad verbal), (c) fluidez figurativa (creatividad figurativa), y (d) calidad del pensamiento divergente figurativo (creatividad figurativa). Se tomo una muestra de 987 alumnos adolescentes, 464 chicos y 523 chicas de edades de 8 a 17 anos, que pertenecian a dos grupos: alumnos no superdotados (N=866) y alumnos superdotados (N= 67 habilidades academicas, N=34 habilidades artisticas y N=20 no identificados en un dominio especifico). El grupo de superdotados academicos presento las puntuaciones mas altas en razonamiento y podian producir metaforas mas originales y remotas, eran figurativamente mas fluidos y sus dibujos eran mas originale...
One of key points in psychological tests is related to the meaning of test scores, that is, the i... more One of key points in psychological tests is related to the meaning of test scores, that is, the interpretation of different levels of achievement. Usually the scores intelligibility is accomplished by the use of three procedures: (a) norm reference, content reference and criterion reference. The most common method, norm reference, informs the relative standing of a specific score in relation to a group of reference. Although, the main limitation of this procedure is the lack of information about what kind of attainments a person is capable to achieve. Behavioural scaling, defined by John B. Carroll, is a procedure based on Item Response Theory which overcame this limitation. Using this method it is possible to state, in behavioural terms, the implications of test results in respect to what the subject knows or is capable to realize. This paper discusses this procedure and illustrates its application in the assessment of reading comprehension and
Whereas the structure of individual differences in many social and emotional attributes is well u... more Whereas the structure of individual differences in many social and emotional attributes is well understood in adults, much less work has been done in children and adolescents. The main goals of this research were to specify the major content domains that are assessed across multiple socioemotional instruments (self-esteem, grit, self-efficacy, strengths and difficulties, Big Five) in research in the United States and Europe, to test them in a less developed context with considerable educational challenges (Brazilian schools). We selected the five most promising instruments and studied their structure at the item level in a large sample of Brazilian school students (N = 3,023). The extracted factors to capture the major domains of child differences represented in these instruments closely resembled the Big Five personality dimensions. We discuss the contribution of our findings to the assessment of socio-emotional skills in education research, as well as limitations of the current st...
We test the utility of the Battery for Giftedness Assessment (BaAH/S) in identifying differences ... more We test the utility of the Battery for Giftedness Assessment (BaAH/S) in identifying differences in two groups of already known gifted students in the areas of academic and artistic talents. Four latent factors were assessed (a) fluid intelligence, (b) metaphor production (verbal creativity), (c) figural fluency (figural creativity), and (d) divergent thinking figural task quality (figural creativity). A sample of 987 children and adolescents, 464 boys and 523 girls, of ages ranging from 8 to 17 of two groups: regular students (N=866) and gifted students (N= 67 academic abilities, N=34 artistic abilities and N=20 no domain identified). Academic giftedness group of have higher reasoning, can produce more remote/original metaphors, high figural fluency and drawings rated as more original. Children in the group of artistic giftedness have higher reasoning, high figural fluency and drawings rated as more original. Reasoning abilities are relatively higher in academic giftedness group th...
Responding to the need for school-based, broadly applicable, low-cost, and brief assessments of s... more Responding to the need for school-based, broadly applicable, low-cost, and brief assessments of socio-emotional skills, we describe the conceptual background and empirical development of the SENNA inventory and provide new psychometric information on its internal structure. Data were obtained through a computerized survey from 50,000 Brazilian students enrolled in public school grades 6 to 12, spread across the entire State of São Paulo. The SENNA inventory was designed to assess 18 particular skills (e.g., empathy, responsibility, tolerance of frustration, and social initiative), each operationalized by nine items that represent three types of items: three positively keyed trait-identity items, three negatively keyed identity items, and three (always positively keyed) self-efficacy items, totaling a set of 162 items. Results show that the 18 skill constructs empirically defined a higher-order structure that we interpret as the social-emotional Big Five, labeled as Engaging with Oth...
Acquiescence is a commonly observed response style that may distort respondent scores. One approa... more Acquiescence is a commonly observed response style that may distort respondent scores. One approach to control for acquiescence involves creating a balanced scale and computing sum scores. Other model-based approaches may explicitly include an acquiescence factor as part of a factor analysis or multidimensional item response model. Under certain assumptions, both approaches may result in acquiescence-controlled scores for each respondent. However, the validity of the resulting scores is one issue that is sometimes ignored. In this paper, we present an application of these approaches under both balanced and unbalanced scales, and we report changes in criterion validity and respondent scores.
A Inteligencia Fluida (Gf) refere-se a capacidade geral de raciocinio em situacoes novas pouco es... more A Inteligencia Fluida (Gf) refere-se a capacidade geral de raciocinio em situacoes novas pouco estruturadas. Em termos de processos cognitivos subjacentes, estudos recentes apontam que a Gf esta associada a memoria de trabalho, especialmente as funcoes do executivo central, nomeadamente, coordenacao simultânea de tarefas e atencao seletiva/abstracao. Esse estudo verificou a estrutura fatorial de um conjunto de itens de raciocinio analogico com figuras geometricas criados sistematicamente para representarem esses dois componentes do constructo. O instrumento e informatizado e composto por: (a) Pre-teste contendo doze problemas; (b) Fase de treino na qual sao ensinados os componentes do processamento cognitivo e a estrutura geral dos problemas e (c) Pos-teste com mais doze problemas estruturalmente identicos aos do pre-teste e com feedback sobre a correcao da resposta e tres tentativas possiveis. Participaram 343 estudantes universitarios, 56,5% homens e 43,5% mulheres de cinco cursos...
O estudo buscou investigar as evidências de validade de critério de um instrumento intitulado Tri... more O estudo buscou investigar as evidências de validade de critério de um instrumento intitulado Triagem de Indicadores de Altas Habilidades/Superdotação (AH/S). A escala, respondida pelo professor, avalia o desenvolvimento do estudante em cinco áreas: capacidade intelectual geral, habilidades acadêmicas específicas, liderança, criatividade e talento artístico. A amostra foi composta por 568 participantes: 213 do grupo-controle e 355 do grupo-critério. Os resultados da análise de variância fatorial e do teste t de Student indicaram diferenças de médias significativas entre os grupos, confirmando o tipo de evidência de validade investigada. A regressão logística também foi conduzida visando identificar o quanto a pontuação em cada área específica da escala conseguiria prever a área de identificação do indivíduo com AH/S.Palavras-chave: Aluno com Altas Habilidades/Superdotação, Validade Estatística, Construção de Teste, Avaliação Psicológica. Clasificación de indicadores de altas habilid...
A presente pesquisa correlacionou dados provenientes da aplicação do teste de Inteligência BPR-5 ... more A presente pesquisa correlacionou dados provenientes da aplicação do teste de Inteligência BPR-5 com uma avaliação escolar de matemática e língua portuguesa em uma amostra de 679 alunos do nono ano do ensino fundamental de quatro escolas de uma rede particular de ensino. Os resultados dessas avaliações se mostraram fortemente correlacionados e estatisticamente significativos com escores dos testes de QI (r =,58, p < 0,01), evidenciando elevadas cargas em Inteligência Fluida (Gf). Uma análise longitudinal (5° ao 9° ano) foi aplicada através do Modelo de Curva de Crescimento Latente que investigou a média da variância inicial (intercepto) e a média de crescimento (slope) no desempenho acadêmico (DA) dos sujeitos, em dois modelos (com e sem a variável independente BPR), com o objetivo de investigar a capacidade preditiva de Gf no DA. Quando inserida a variável BPR, seu impacto no intercepto foi estimado em 20,288 e no slope, 6,381. Essas estimativas indicam o acréscimo no desempenho...
Uploads