Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1242572.1242818acmconferencesArticle/Chapter ViewAbstractPublication PageswebconfConference Proceedingsconference-collections
Article

On building graphs of documents with artificial ants

Published: 08 May 2007 Publication History
  • Get Citation Alerts
  • Abstract

    We present an incremental algorithm for building a neighborhood graph from a set of documents. This algorithm is based on a population of artificial agents that imitate the way real ants build structures with self-assembly behaviors. We show that our method outperforms standard algorithms for building such neighborhood graphs (up to 2230 times faster on the tested databases with equal quality) and how the user may interactively explore the graph.

    References

    [1]
    H. Azzag, C. Guinot, and G. Venturini. Anttree: web document clustering using artificial ants. In R. L. de M'antaras and L. Saitta, editors, Proceedings of the 16th European Conference on Artificial Intelligence (ECAI 04), pages 480--484. IOS Press, 8 2004.
    [2]
    T. Fruchterman and E. Reingold. Graph drawing by force-directed placement. In Software -- Practice and Experience, volume 21, pages 1129--1164, 1991.
    [3]
    C. Guinot, D. J.-M. Malvy, F. Morizot, M. Tenenhaus, J. Latreille, S. Lopez, E. Tschachler, and L. Dubertret. Classification of healthy human facial skin. Textbook of Cosmetic Dermatology Third edition, 2003.
    [4]
    H. Hacid and D. A. Zighed. An effective method for locally neighborhood graphs updating. In DEXA 2005, pages 930--939, 2005.
    [5]
    E.-H. Han, D. Boley, M. Gini, R. Gross, K. Hastings, G. Karypis, V. Kumar, B. Mobasher, and J. Moore. Webace: a web agent for document categorization and exploration. In AGENTS '98: Proceedings of the second international conference on Autonomous agents, pages 408--415, New York, NY, USA, 1998. ACM Press.
    [6]
    G. Salton and C. Buckley. Term-weighting approaches in automatic text retrieval. Inf. Process. Manage., 24(5):513--523, 1988.
    [7]
    G. T. Toussaint. The relative neighborhood graphs in a finite planar set. In Pattern recognition, chapter 12, pages 261--268. 1980.

    Recommendations

    Reviews

    Suma Adabala

    The task of clustering similar or related documents is important to information retrieval systems, like search engines. This is done by building graphs, where the given set of documents form the nodes and the edges represent the similarity between the documents and nodes. The authors present a graph-building algorithm that closely follows the self-assembly behavior observed when ants build living structures by connecting their bodies together. The tabulated results show that the proposed algorithm outperforms standard methods, such as relative neighborhood graphs (RNG) methods, for building graphs, while finding more similarity, that is, creating more links between documents. The key principle of the proposed algorithm is that the graph is built incrementally. When a new document is added, it follows the path of maximum similarity: it is connected to all neighboring nodes and documents whose similarity to the new document is higher than a given similarity threshold. As this is a short poster session paper, details of the algorithm and evaluation are omitted. It may be worthwhile to investigate other related publications by the authors, as the high performance of this algorithm makes it a promising substitute for current clustering algorithms. Online Computing Reviews Service

    Access critical reviews of Computing literature here

    Become a reviewer for Computing Reviews.

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    WWW '07: Proceedings of the 16th international conference on World Wide Web
    May 2007
    1382 pages
    ISBN:9781595936547
    DOI:10.1145/1242572
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 08 May 2007

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. artificial ants
    2. clustering
    3. documents
    4. graph
    5. interactive visualization
    6. web

    Qualifiers

    • Article

    Conference

    WWW'07
    Sponsor:
    WWW'07: 16th International World Wide Web Conference
    May 8 - 12, 2007
    Alberta, Banff, Canada

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 214
      Total Downloads
    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)0

    Other Metrics

    Citations

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media