Cohen´s Kappa Research Papers

This paper aims to estimate intercoder reliability in content analysis of international online media. In order to measure this rate of agreement, we have selected three of the twenty-five online media that configure the corpus of the... more

Bookmark
Download
- by Javier Odriozola
- •
- 5
  Content Analysis, Online Media, Intercoder Reliability, Holsti

[Contenido en castellano]. La Kappa (Cohen, 1960) es un índice estadístico que mide el acuerdo entre las evaluaciones o diagnósticos de dos jueces, evaluadores o investigadores siempre y cuando estén evaluando lo mismo con los mismos... more

Bookmark
Download
- by Jacob Sierra-Díaz
- •
- 2
  Cohen´s Kappa, Kappa de Cohen

The question of data reliability is of first importance to assess the quality of manually annotated corpora. Although Cohen ' s κ is the prevailing reliability measure used in NLP, alternative statistics have been proposed. This paper... more

Bookmark
Download
- by JY Jya
- •
- 5
  Data Quality (Computer Science), Reliability, Corpus Annotation, Cohen´s Kappa

Bookmark
Download
- by Gary Holden
- •
- 9
  Statistics, Research Methodology, Social Work Education, Research

Objective: To run logistic regression after merging two variables of raters depending on percent of agreement. Methodology: Percent of agreement identifies quantitative expression of observer variation between two raters. Correct... more

Bookmark
Download
- by Timothy Mutsvari
- •
- 18
  Statistics, Humans, Child, Markov chains

Bookmark
Download
- by David Powers
- •
- 22
  Correlation, Boosting, AdaBoost, Accuracy

The author has developed a general theory for measuring the statistical interconnections between events, called ``mathematical eventology'' \cite[2007]{Vorobyev2007}, \cite[2011]{Vorobyev2011em}. The paper represents fresh... more

Bookmark
Download
- by Zoltan Kovacs
- •
- 3
  Analytical Chemistry, Chemometrics, Cohen´s Kappa

Both empirical and mathematical demonstrations of the importance of chance-corrected measures are discussed, and a new model of learning is proposed based on empirical psychological results on association learning. Two forms of this model... more

Stack Overflow is the most popular Q&A website which is used by developers and programmers for many developments and programming purposes. Stack Overflow commits occur based on various types of programming languages. But among all the... more

Bookmark
- by Md. Raihan Talukder
- •
- 11
  Computer Science, Programming Languages, Statistics, Data Mining

Objective: Self-reported information from questionnaires is frequently used in epidemiological studies, but few of these studies provide information on the reproducibility of individual items contained in the questionnaire. We studied... more

Objective:
Self-reported information from questionnaires is frequently used in epidemiological studies, but few of these studies provide information on the reproducibility of individual items contained in the questionnaire. We studied the test–retest reliability of self-reported diabetes among 33,919 participants in Norwegian Women and Cancer Study.

Methods:
The test–retest reliability of self-reported type 1 and type 2 diabetes diagnoses was evaluated between three self-administered questionnaires (completed in 1991, 1998, and 2005 by Norwegian Women and Cancer participants) by kappa agreement. The time interval between the test–retest studies was ~7 and ~14 years. Sensitivity of the kappa agreement for type 1 and type 2 diabetes diagnoses was assessed. Subgroup analysis was performed to assess whether test–retest reliability varies with age, body mass index, physical activity, education, and smoking status.

Results:
The kappa agreement for both types of self-reported diabetes diagnoses combined was good (⩾0.65) for all three test–retest studies (1991–1998, 1991–2005, and 1998–2005). The kappa agreement for type 1 diabetes was good (⩾0.73) in the 1991–2005 and the 1998–2005 test–retest studies, and very good (0.83) in the 1991–1998 test–retest study. The kappa agreement for type 2 diabetes was moderate (0.57) in the 1991–2005 test–retest study and good (⩾0.66) in the 1991–1998 and 1998–2005 test–retest studies. The overall kappa agreement in the 1991–1998 test–retest study was stronger than in the 1991–2005 test–retest study and the 1998–2005 test–retest study. There was no clear pattern of inconsistency in the kappa agreements within different strata of age, BMI, physical activity, and smoking. The kappa agreement was strongest among the respondents with 17 or more years of education, while generally it was weaker among the least educated group.

Conclusion:
The test–retest reliability of the diabetes was acceptable and there was no clear pattern of inconsistency in the kappa agreement stratified by age, body mass index, physical activity, and smoking. The study suggests that self-reported diabetes diagnosis from middle-aged women enrolled in the Norwegian Women and Cancer Study is reliable.

Bookmark
Download
- by Mashhood Ahmed Sheikh
- •
- 12
  Type 2 Diabetes, Metabolic syndrome, Norway, Questionnaires

Evaluation often aims to reduce the correctness or error characteristics of a system down to a single number, but that always involves trade-offs. Another way of dealing with this is to quote two numbers, such as Recall and Precision, or... more

It is becoming clear that traditional evaluation measures used in Computational Linguistics (including Error Rates, Accuracy, Recall, Precision and F-measure) are of limited value for unbiased evaluation of systems, and are not... more

Bookmark
Download
- by David Powers
- •
- 40
  Cognitive Psychology, Cognitive Science, Statistics, Machine Learning

Bookmark
Download
- by Melor Md Yunus
- •
- 7
  Psychology, Linguistics, English language teaching, Cohen´s Kappa

Stack Overflow is the most popular Q&A website which is used by developers and programmers for many developments and programming purposes. Stack Overflow commits occur based on various types of programming languages. But among all the... more

Bookmark
- by Muradul Bashir
- •
- 11
  Computer Science, Programming Languages, Statistics, Data Mining

Bookmark
Download
- by Faried Banimahd
- •
- 4
  Emergency Medicine, Pediatric, Cohen´s Kappa, Inter-rater Agreement

Stack Overflow is the most popular Q&A website which is used by developers and programmers for many developments and programming purposes. Stack Overflow commits occur based on various types of programming languages. But among all the... more

Bookmark
Download
- by Raihan Talukder
- •
- 9
  Programming Languages, Statistics, Data Mining, Data Analysis

We are concerned that the quality of results produced by an NLP parser bears little, if any, relation to the percentage-results claimed by the various NLP parser-systems presently available for use. To illustrate this problem we examine... more

Cohen´s Kappa

Log In