Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2526188.2526218acmotherconferencesArticle/Chapter ViewAbstractPublication PageswebmediaConference Proceedingsconference-collections
research-article

An investigation of the relationship between the amount of extra-textual data and the quality of Wikipedia articles

Published: 05 November 2013 Publication History

Abstract

Wikipedia, a web-based collaboratively maintained free encyclopedia, is emerging as one of the most important websites on the internet. However, its openness raises many concerns about the quality of the articles and how to assess it automatically. In the Portuguese-speaking Wikipedia, articles can be rated by bots and by the community. In this paper, we investigate the correlation between these ratings and the count of media items (namely images and sounds) through a series of experiments. Our results show that article ratings and the count of media items are correlated.

References

[1]
B. T. Adler, K. Chatterjee, L. de Alfaro, M. Faella, I. Pye, and V. Raman. Assigning trust to wikipedia content. In Proceedings of the 4th International Symposium on Wikis, WikiSym '08, pages 26:1--26:12, New York, NY, USA, 2008. ACM.
[2]
Alexa. Wikipedia site info, 2013. Acesso em: 22 maio 2013.
[3]
M. Anderka and B. Stein. A breakdown of quality aws in wikipedia. In Proceedings of the 2nd Joint WICOW/AIRWeb Workshop on Web Quality, WebQuality '12, pages 11--18, New York, NY, USA, 2012. ACM.
[4]
J. E. Blumenstock. Size matters: word count as a measure of quality on wikipedia. In Proceedings of the 17th international conference on World Wide Web, WWW '08, pages 1095--1096, New York, NY, USA, 2008. ACM.
[5]
D. H. Dalip, M. A. Gonçalves, M. Cristo, and P. Calado. Automatic assessment of document quality in web collaborative digital libraries. J. Data and Information Quality, 2(3):14:1--14:30, Dec. 2011.
[6]
G. De la Calzada and A. Dekhtyar. On measuring the quality of wikipedia articles. In Proceedings of the 4th workshop on Information credibility, WICOW '10, pages 11--18, New York, NY, USA, 2010. ACM.
[7]
P. Dondio and S. Barrett. Computational trust in web content quality: A comparative evalutation on the wikipedia project. Informatica, 31(2):151--160, 2007.
[8]
R. T. S. Hanada, M. da Graça Campos Pimentel, and M. Cristo. Relacao entre métricas de analise de ligacoes e qualidade, importancia e popularidade na wikipédia. In G. Bressan, R. M. Silveira, E. V. Munson, A. Santancha, and M. da Graca Campos Pimentel, editors, Brazilian Symposium on Multimedia and the Web, WebMedia'13, Salvador, Brazil, Novembro, 2013 (to appear). ACM, 2013.
[9]
M. Hu, E.-P. Lim, A. Sun, H. W. Lauw, and B.-Q. Vuong. Measuring article quality in wikipedia: models and evaluation. In Proceedings of the sixteenth ACM conference on Conference on information and knowledge management, CIKM '07, pages 243--252, New York, NY, USA, 2007. ACM.
[10]
M. Kendall. Rank correlation methods. Griffin, London, 1948.
[11]
E.-P. Lim, B.-Q. Vuong, H. W. Lauw, and A. Sun. Measuring qualities of articles contributed by online communities. In Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence, WI '06, pages 81--87, Washington, DC, USA, 2006. IEEE Computer Society.
[12]
M. Mortazavi. Why web 2.0 has come to exist?. on the margins, 2006. Acesso em: 22 maio 2013.
[13]
S. T. Moturu and H. Liu. Evaluating the trustworthiness of wikipedia articles through quality and credibility. In Proceedings of the 5th International Symposium on Wikis and Open Collaboration, WikiSym '09, pages 28:1--28:2, New York, NY, USA, 2009. ACM.
[14]
Y. Suzuki and M. Yoshikawa. Mutual evaluation of editors and texts for assessing quality of wikipedia articles. In Proceedings of the Eighth Annual International Symposium on Wikis and Open Collaboration, WikiSym '12, pages 18:1--18:10, New York, NY, USA, 2012. ACM.
[15]
B. S. M. B. Twidale. Assessing information quality of a community-based encyclopedia. In Proceedings of the International Conference on Information Quality, pages 442--454, 2005.
[16]
S. Wang and M. Iwaihara. Quality evaluation of wikipedia articles through edit history and editor groups. In Proceedings of the 13th Asia-Pacific web conference on Web technologies and applications, APWeb'11, pages 188--199, Berlin, Heidelberg, 2011. Springer-Verlag.
[17]
Wikipedia. Wikipedia:avaliacao automatica, 2013. Acesso em: 22 maio 2013.
[18]
Wikipedia. Wikipedia:avaliacao de artigos, 2013. Acesso em: 22 maio 2013.
[19]
Wikipedia. Wikipedia:conteudo restrito, 2013. Acesso em: 29 maio 2013.
[20]
Wikipedia. Wikipedia:escolha do artigo em destaque, 2013. Acesso em: 22 maio 2013.
[21]
Wikipedia. Wikipedia:versao 1.0/avaliacao, 2013. Acesso em: 22 maio 2013.
[22]
Wikipedia. Wikipedia, 2013. Acesso em: 22 maio 2013.
[23]
T. Wohner and R. Peters. Assessing the quality of wikipedia articles with lifecycle based metrics. In Proceedings of the 5th International Symposium on Wikis and Open Collaboration, WikiSym '09, pages 16:1--16:10, New York, NY, USA, 2009. ACM.

Cited By

View all
  • (2023)Automatic Quality Assessment of Wikipedia Articles—A Systematic Literature ReviewACM Computing Surveys10.1145/362528656:4(1-37)Online publication date: 10-Nov-2023

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
WebMedia '13: Proceedings of the 19th Brazilian symposium on Multimedia and the web
November 2013
360 pages
ISBN:9781450325592
DOI:10.1145/2526188
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • SBC: Brazilian Computer Society

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 November 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. content quality
  2. correlations
  3. extra-textual data
  4. wikipedia

Qualifiers

  • Research-article

Conference

WebMedia '13
Sponsor:
  • SBC

Acceptance Rates

WebMedia '13 Paper Acceptance Rate 29 of 87 submissions, 33%;
Overall Acceptance Rate 270 of 873 submissions, 31%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 11 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Automatic Quality Assessment of Wikipedia Articles—A Systematic Literature ReviewACM Computing Surveys10.1145/362528656:4(1-37)Online publication date: 10-Nov-2023

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media