Cohen´s Kappa
51 Followers
Recent papers in Cohen´s Kappa
This paper aims to estimate intercoder reliability in content analysis of international online media. In order to measure this rate of agreement, we have selected three of the twenty-five online media that configure the corpus of the... more
[Contenido en castellano]. La Kappa (Cohen, 1960) es un índice estadístico que mide el acuerdo entre las evaluaciones o diagnósticos de dos jueces, evaluadores o investigadores siempre y cuando estén evaluando lo mismo con los mismos... more
The question of data reliability is of first importance to assess the quality of manually annotated corpora. Although Cohen ' s κ is the prevailing reliability measure used in NLP, alternative statistics have been proposed. This paper... more
Objective: To run logistic regression after merging two variables of raters depending on percent of agreement. Methodology: Percent of agreement identifies quantitative expression of observer variation between two raters. Correct... more
The author has developed a general theory for measuring the statistical interconnections between events, called ``mathematical eventology'' \cite[2007]{Vorobyev2007}, \cite[2011]{Vorobyev2011em}. The paper represents fresh... more
Both empirical and mathematical demonstrations of the importance of chance-corrected measures are discussed, and a new model of learning is proposed based on empirical psychological results on association learning. Two forms of this model... more
Stack Overflow is the most popular Q&A website which is used by developers and programmers for many developments and programming purposes. Stack Overflow commits occur based on various types of programming languages. But among all the... more
Objective: Self-reported information from questionnaires is frequently used in epidemiological studies, but few of these studies provide information on the reproducibility of individual items contained in the questionnaire. We studied... more
Evaluation often aims to reduce the correctness or error characteristics of a system down to a single number, but that always involves trade-offs. Another way of dealing with this is to quote two numbers, such as Recall and Precision, or... more
It is becoming clear that traditional evaluation measures used in Computational Linguistics (including Error Rates, Accuracy, Recall, Precision and F-measure) are of limited value for unbiased evaluation of systems, and are not... more
Stack Overflow is the most popular Q&A website which is used by developers and programmers for many developments and programming purposes. Stack Overflow commits occur based on various types of programming languages. But among all the... more
Stack Overflow is the most popular Q&A website which is used by developers and programmers for many developments and programming purposes. Stack Overflow commits occur based on various types of programming languages. But among all the... more
We are concerned that the quality of results produced by an NLP parser bears little, if any, relation to the percentage-results claimed by the various NLP parser-systems presently available for use. To illustrate this problem we examine... more