Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1822258.1822293acmotherconferencesArticle/Chapter ViewAbstractPublication PageswikisymConference Proceedingsconference-collections
research-article

Assigning trust to Wikipedia content

Published: 08 September 2008 Publication History

Abstract

The Wikipedia is a collaborative encyclopedia: anyone can contribute to its articles simply by clicking on an "edit" button. The open nature of the Wikipedia has been key to its success, but has also created a challenge: how can readers develop an informed opinion on its reliability? We propose a system that computes quantitative values of trust for the text in Wikipedia articles; these trust values provide an indication of text reliability.
The system uses as input the revision history of each article, as well as information about the reputation of the contributing authors, as provided by a reputation system. The trust of a word in an article is computed on the basis of the reputation of the original author of the word, as well as the reputation of all authors who edited text near the word. The algorithm computes word trust values that vary smoothly across the text; the trust values can be visualized using varying text-background colors. The algorithm ensures that all changes to an article's text are reflected in the trust values, preventing surreptitious content changes.
We have implemented the proposed system, and we have used it to compute and display the trust of the text of thousands of articles of the English Wikipedia. To validate our trust-computation algorithms, we show that text labeled as low-trust has a significantly higher probability of being edited in the future than text labeled as high-trust.

References

[1]
B. T. Adler and L. de Alfaro. A content-driven reputation system for the Wikipedia. In Proc. of the 16th Intl. World Wide Web Conf. (WWW 2007). ACM Press, 2007.
[2]
C. Castelfranchi and eds. Y. Tan. Trust and Deception in Virtual Societies. Kluwer Academic Publishers, 2001.
[3]
A. Cheng and E. Friedman. Sybilproof reputation mechanisms. In Proc. of the ACM SIGCOMM workshop on Economics of peer-to-peer systems. ACM Press, 2005.
[4]
T. Cross. Puppy smoothies: Improving the reliability of open, collaborative wikis. First Monday, 11(9), September 2006.
[5]
C. Dellarocas. The digitization of word-of-mouth: Promises and challenges of online reputation systems. Management Science, October 2003.
[6]
J. R. Douceur. The sybil attack. In Peer-to-Peer Systems: First Intl. Workshop, volume 2429 of Lect. Notes in Comp. Sci., pages 251--260, 2002.
[7]
W. Emigh and S. Herring. Collaborative authoring on the Web. In Proc. of HSCC, 2005.
[8]
J. Giles. Internet encyclopaedias go head to head. Nature, pages 900--901, December 2005.
[9]
J. A. Golbeck. Computing and Applying Trust in Web-Based Social Networks. PhD thesis, University of Maryland, 2005.
[10]
T. Grandison and M. Sloman. A survey of trust in internet application. IEEE Comm. Surveys Tutorials, 3(4), 2000.
[11]
R. Guha, R. Kumar, P. Raghavan, and A. Tomkins. Propagation of trust and distrust. In Proc. of the 13th Intl. Conf. on World Wide Web, pages 403--412. ACM Press, 2004.
[12]
M. Hickman and G. Roberts. Wikipedia --- separating fact from fiction. The New Zealand Herald, Feb. 13 2006.
[13]
S. D. Kamvar, M. T. Schlosser, and H. Garcia-Molina. The eigentrust algorithm for reputation management in p2p networks. In Proc. of the 12th Intl. Conf. on World Wide Web, pages 640--651. ACM Press, 2003.
[14]
R. King. Contributor ranking system, 2007. White paper available from http://trust.cse.ucsc.edu/Related_Work.
[15]
J. M. Kleinberg. Authoritative sources in a hyperlinked environment. J. ACM, 46(5):604--632, 1999.
[16]
Xavier Leroy. Objective caml. http://caml.inria.fr/ocaml/index.en.html.
[17]
B. N. Levine, C. Shields, and N. B. Margolin. A survey of solutions to the sybil attack. Technical Report Technical Report 2006-052, Univ. of Massachussets Amherst, 2006.
[18]
A. Lih. Wikipedia as participatory journalism. In Proc. 5th International Symposium on Online Journalism, 2004.
[19]
V. B. Livshits and T. Zimmerman. Dynamine: Finding common error patterns by mining software revision histories. In ESEC/FSE, pages 296--305, 2005.
[20]
P. Massa. Wikipedia trust network, 2007. http://www.gnuband.org/2007/06/26/wikipedia_trust_network/.
[21]
D. L. McGuinness, H. Zeng, P. P. da Silva, L. Ding, D. Narayanan, and M. Bhaowal. Investigation into trust for collaborative information repositories: A Wikipedia case study. In Proceedings of the Workshop on Models of Trust for the Web, 2006.
[22]
http://www.mediawiki.org/.
[23]
B. Mingus, T. Pincock, and L. Rassbach. Using natural language processing to determine the quality of Wikipedia articles. In Wikimania, Taipei, Taiwan, 2007. http://wikimania2007.wikimedia.org/wiki/Proceedings:BM1.
[24]
F. Ortega and J. M. Gonzales-Barahona. Quantitative analysis of the Wikipedia community of users. In Proc. of Wikisym. ACM Press, 2007.
[25]
P. Resnick, R. Zeckhauser, E. Friedman, and K. Kiwabara. Reputation systems. Comm. ACM, 43(12):45--48, 2000.
[26]
J.-M. Seigneur, A. Gray, and C. D. Jensen. Trust transfer: Encouraging self-recommendations without sybil attack. In Trust Management, volume 3477 of Lect. Notes in Comp. Sci. Springer-Verlag, 2005.
[27]
R. Stross. Anonymous source is not the same as open source. The New York Times, Mar. 12, 2006.
[28]
W. F. Tichy. The string-to-string correction problem with block move. ACM Trans. on Computer Systems, 2(4), 1984.
[29]
The ucsc wikipedia trust project, 2007. http://trust.cse.ucsc.edu.
[30]
F. Viégas, M. Wattenberg, and K. Dave. Studying cooperation and conflict between authors with history flow visualizations. In Proc. of the SIGCHI Conf. on Human Factors in Computing Systems, pages 575--582, 2004.
[31]
J. Voss. Measuring Wikipedia. In Proc. of ISSI, 2005.
[32]
http://stats.wikimedia.org/EN/TablesDatabaseEdits.htm.
[33]
D. Wilkinson and B. Huberman. Cooperation and quality in Wikipedia. In Proc. of WikiSym. ACM Press, 2007.
[34]
H. Zeng, M. Alhossaini, R. Fikes, and D. L. McGuinness. Mining revision history to assess trustworthiness of article fragments. In Proc. of the 2nd Intl. Conf. on Collaborative Computing: Networking, Applications, and Worksharing (COLLABORATECOM), 2006.
[35]
H. Zeng, M. A. Alhoussaini, L. Ding, R. Fikes, and D. L. McGuinness. Computing trust from revision history. In Intl. Conf. on Privacy, Security and Trust, 2006.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
WikiSym '08: Proceedings of the 4th International Symposium on Wikis
September 2008
219 pages
ISBN:9781605581286
DOI:10.1145/1822258
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • University of Porto

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 September 2008

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article

Conference

WikiSym08
Sponsor:
WikiSym08: 2008 International Symposium on Wikis
September 8 - 10, 2008
Porto, Portugal

Acceptance Rates

Overall Acceptance Rate 69 of 145 submissions, 48%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)33
  • Downloads (Last 6 weeks)2
Reflects downloads up to 15 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2023)40 Years of Designing Code Comprehension Experiments: A Systematic Mapping StudyACM Computing Surveys10.1145/362652256:4(1-42)Online publication date: 9-Nov-2023
  • (2023)Automatic Quality Assessment of Wikipedia Articles—A Systematic Literature ReviewACM Computing Surveys10.1145/362528656:4(1-37)Online publication date: 10-Nov-2023
  • (2023)A Survey of Privacy Attacks in Machine LearningACM Computing Surveys10.1145/362401056:4(1-34)Online publication date: 10-Nov-2023
  • (2023)Edit-History Vis: An Interactive Visual Exploration and Analysis on Wikipedia Edit History2023 IEEE 16th Pacific Visualization Symposium (PacificVis)10.1109/PacificVis56936.2023.00025(157-166)Online publication date: Apr-2023
  • (2023)PD-Box: A People Place Data Box for Processing Engine Anatomy2023 2nd Edition of IEEE Delhi Section Flagship Conference (DELCON)10.1109/DELCON57910.2023.10127379(1-6)Online publication date: 24-Feb-2023
  • (2023)Comparing and extending the use of defeasible argumentation with quantitative data in real-world contextsInformation Fusion10.1016/j.inffus.2022.08.02589(537-566)Online publication date: Jan-2023
  • (2022)Using natural language generation to bootstrap missing Wikipedia articles: A human-centric perspectiveSemantic Web10.3233/SW-21043113:2(163-194)Online publication date: 3-Feb-2022
  • (2022)Templates and Trust-o-meters: Towards a widely deployable indicator of trust in WikipediaProceedings of the 2022 CHI Conference on Human Factors in Computing Systems10.1145/3491102.3517523(1-17)Online publication date: 29-Apr-2022
  • (2022)Contradiction Detection Approach Based on Semantic Relations and Evidence of UncertaintyComputational Collective Intelligence10.1007/978-3-031-16014-1_19(232-245)Online publication date: 28-Sep-2022
  • (2022)An Iterative Model for Quality Assessment in Collaborative Content Generation SystemsService-Oriented Computing – ICSOC 2021 Workshops10.1007/978-3-031-14135-5_10(125-138)Online publication date: 24-Aug-2022
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media