Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1641309.1641328acmotherconferencesArticle/Chapter ViewAbstractPublication PageswikisymConference Proceedingsconference-collections
research-article

An architecture to support intelligent user interfaces for Wikis by means of Natural Language Processing

Published: 25 October 2009 Publication History
  • Get Citation Alerts
  • Abstract

    We present an architecture for integrating a set of Natural Language Processing (NLP) techniques with a wiki platform. This entails support for adding, organizing, and finding content in the wiki. We perform a comprehensive analysis of how NLP techniques can support the user interaction with the wiki, using an intelligent interface to provide suggestions. The architecture is designed to be deployed with any existing wiki platform, especially those used in corporate environments. We implemented a prototype integrating the NLP techniques keyphrase extraction and text segmentation, as well as an improved search engine. The prototype is integrated with two widely used wiki platforms: Media-Wiki and TWiki.

    References

    [1]
    M. Buffa. Intranet Wikis. Proceedings of the IntraWebs Workshop 2006 at the 15th International World Wide Web Conference, 2006.
    [2]
    M. Buffa and F. Gandon. SweetWiki: Semantic Web Enabled Technologies in Wiki. Human Factors, pages 69--78, 2006.
    [3]
    E. H. Chi, M. Gumbrecht, and L. Hong. Visual Foraging of Highlighted Text: An Eye-Tracking Study. In Human-Computer Interaction. HCI Intelligent Multimodal Interaction Environments, volume 4552 of Lecture Notes in Computer Science, pages 589--598. Springer, 2007.
    [4]
    F. Y. Y. Choi, P. Wiemer-Hastings, and J. Moore. Latent Semantic Analysis for Text Segmentation. In Proceedings of the 2001 Conference on Empirical Methods in Natural Language Processing, pages 109--117, 2001.
    [5]
    P. Cimiano. Ontology Learning and Population from Text: Algorithms, Evaluation and Applications. Springer-Verlag, New York, NY, USA, 2006.
    [6]
    H. Cunningham, D. Maynard, K. Bontcheva, and V. Tablan. GATE: A Framework and Graphical Development Environment for Robust NLP Tools and Applications. In Proceedings of the 40th Anniversary Meeting of the Association for Computational Linguistics, pages 168--175, July 2002.
    [7]
    S. Feldman and C. Sherman. The High Cost of Not Finding Information. An IDC White Paper, 2001.
    [8]
    D. Ferrucci and A. Lally. UIMA: An Architectural Approach to Unstructured Information Processing in the Corporate Research Environment. Natural Language Engineering, 10(3--4):327--348, 2004.
    [9]
    E. Frank, G. W. Paynter, I. Witten, C. Gutwin, and C. G. Nevill-Manning. Domain-Specific Keyphrase Extraction. In Proceedings of the 16th International Joint Conference on Aritificial Intelligence, pages 668--673, San Mateo, CA, 1999. Morgan Kaufmann.
    [10]
    J. Gemmell, A. Shepitsen, B. Mobasher, and R. Burke. Personalizing Navigation in Folksonomies Using Hierarchical Tag Clustering. Data Warehousing and Knowledge Discovery, 5182:196--205, 2008.
    [11]
    L. Getoor and C. P. Diehl. Link Mining: A Survey. SIGKDD Explorations, 7:3--12, 2005.
    [12]
    S. A. Golder and B. A. Huberman. Usage Patterns of Collaborative Tagging Systems. Journal of Information Science, 32(2):198--208, 2006.
    [13]
    I. Gurevych, C. Müller, and T. Zesch. What to be? - Electronic Career Guidance Based on Semantic Relatedness. In Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, pages 1032--1039, Prague, Czech Republic, Jun 2007. ACL.
    [14]
    I. Gurevych and T. Zesch. Selbstorganisierende Wikis. In Proceedings of KnowTech, pages 317--324, Frankfurt, Germany, Oct 2008. BITKOM.
    [15]
    M. Hartmann, D. Schreiber, and M. Mühlhäuser. Tailoring the Interface to Individual Users. In 5th International workshop on Ubiquitous User Modeling at IUI'08, New York, NY, USA, 2008. ACM.
    [16]
    M. A. Hearst. TextTiling: Segmenting Text Into Multi-Paragraph Subtopic Passages. Computational Linguistics, 23(1):33--64, 1997.
    [17]
    D. Jurafsky and J. H. Martin. Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics and Speech Recognition. Prentice Hall, second edition, February 2008.
    [18]
    M. Krötzsch, D. Vrandecic, M. Völkel, H. Haller, and R. Studer. Semantic Wikipedia. Journal of Web Semantics, 5:251--261, Sep 2007.
    [19]
    B. Leuf and W. Cunningham. The Wiki Way: Collaboration and Sharing on the Internet. Addison-Wesley Professional, April 2001.
    [20]
    A. Majchrzak, C. Wagner, and D. Yates. Corporate Wiki Users: Results of a Survey. In WikiSym '06: Proceedings of the 2006 International Symposium on Wikis, pages 99--104, New York, NY, USA, 2006. ACM.
    [21]
    O. Medelyan, C. Legg, D. Milne, and I. H. Witten. Mining Meaning from Wikipedia. Working Paper, arXiv:0809.4530v2, 2008.
    [22]
    R. Mihalcea and A. Csomai. Wikify! Linking Documents to Encyclopedic Knowledge. In Proceedings of the 16th ACM Conference on Information and Knowledge Management, pages 233--242, 2007.
    [23]
    R. Mihalcea and P. Tarau. TextRank: Bringing Order into Texts. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, pages 404--411, Barcelona, Spain, July 2004.
    [24]
    D. Milne and I. H. Witten. Learning to Link with Wikipedia. In Proceedings of the 17th ACM Conference on Information and Knowledge Mining, pages 509--518, New York, NY, USA, 2008. ACM.
    [25]
    C. Müller, I. Gurevych, and M. Mühlhäuser. Closing the Vocabulary Gap for Computing Text Similarity and Information Retrieval. International Journal of Semantic Computing, 2(2):(253--272), 2008.
    [26]
    C. Müller, T. Zesch, M.-C. Müller, D. Bernhard, K. Ignatova, I. Gurevych, and M. Mühlhäuser. Flexible UIMA Components for Information Retrieval Research. In Proceedings of the LREC 2008 Workshop 'Towards Enhanced Interoperability for Large HLT Systems: UIMA for NLP', pages 24--27, May 2008.
    [27]
    L. Page, S. Brin, R. Motwani, and T. Winograd. The PageRank Citation Ranking: Bringing Order to the Web. Technical Report 1999--66, Stanford InfoLab, November 1999.
    [28]
    L. Qu, C. Müller, and I. Gurevych. Using Tag Semantic Network for Keyphrase Extraction in Blogs. In ACM 17th Conference on Information and Knowledge Management, pages 1381--1382, New York, NY, USA, Oct 2008. ACM.
    [29]
    S. Schaffert. IkeWiki: A Semantic Wiki for Collaborative Knowledge Management. In WETICE '06: Proceedings of the 15th IEEE International Workshops on Enabling Technologies: Infrastructure for Collaborative Enterprises, pages 388--396, 2006.
    [30]
    N. Shadbolt, T. Berners-Lee, and W. Hall. The Semantic Web Revisited. IEEE Intelligent Systems, 21(3):96--101, 2006.
    [31]
    B. Stvilia, M. B. Twidale, L. C. Smith, and L. Gasser. Assessing Information Quality of a Community-Based Encyclopedia. In Proceedings of the 2005 International Conference on Information Quality, 2005.
    [32]
    R. Witte and T. Gitzinger. Connecting Wikis and Natural Language Processing Systems. In WikiSym '07: Proceedings of the 2007 International Symposium on Wikis, pages 165--176, 2007.
    [33]
    T. Zesch, C. Müller, and I. Gurevych. Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary. In Proceedings of the Conference on Language Resources and Evaluation, May 2008.
    [34]
    T. Zesch, C. Müller, and I. Gurevych. Using Wiktionary for Computing Semantic Relatedness. In Proceedings of AAAI, pages 861--867, 2008.

    Cited By

    View all
    • (2012)Natural language processing for MediaWikiProceedings of the Eighth Annual International Symposium on Wikis and Open Collaboration10.1145/2462932.2462946(1-10)Online publication date: 27-Aug-2012
    • (2012)PhoneCon: Voice-driven SmartPhone Controllable Wireless Sensor Networks2012 IEEE 31st International Performance Computing and Communications Conference (IPCCC)10.1109/PCCC.2012.6407654(440-447)Online publication date: Dec-2012
    • (2011)VisualWikiCuratorCHI '11 Extended Abstracts on Human Factors in Computing Systems10.1145/1979742.1979806(1549-1554)Online publication date: 7-May-2011
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    WikiSym '09: Proceedings of the 5th International Symposium on Wikis and Open Collaboration
    October 2009
    200 pages
    ISBN:9781605587301
    DOI:10.1145/1641309
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    • John Ernest Foundation

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 25 October 2009

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Wiki
    2. content organization
    3. natural language processing
    4. user interaction

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    WikiSym '09
    Sponsor:
    WikiSym '09: 2009 International Symposium on Wikis
    October 25 - 27, 2009
    Florida, Orlando

    Acceptance Rates

    WikiSym '09 Paper Acceptance Rate 16 of 45 submissions, 36%;
    Overall Acceptance Rate 69 of 145 submissions, 48%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)0

    Other Metrics

    Citations

    Cited By

    View all
    • (2012)Natural language processing for MediaWikiProceedings of the Eighth Annual International Symposium on Wikis and Open Collaboration10.1145/2462932.2462946(1-10)Online publication date: 27-Aug-2012
    • (2012)PhoneCon: Voice-driven SmartPhone Controllable Wireless Sensor Networks2012 IEEE 31st International Performance Computing and Communications Conference (IPCCC)10.1109/PCCC.2012.6407654(440-447)Online publication date: Dec-2012
    • (2011)VisualWikiCuratorCHI '11 Extended Abstracts on Human Factors in Computing Systems10.1145/1979742.1979806(1549-1554)Online publication date: 7-May-2011
    • (2011)VisualWikiCuratorProceedings of the 16th international conference on Intelligent user interfaces10.1145/1943403.1943467(367-370)Online publication date: 13-Feb-2011

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media