DOI: 10.1145/1551609.1551627

An adaptive online system for efficient processing of hierarchical data

Published: 11 June 2009

Abstract

Concept hierarchies greatly help in the organization and reuse of information and are widely used in a variety of information systems applications. In this paper, we describe a method for efficiently storing and querying data that is organized into concept hierarchies and dispersed over a DHT. In our method, each peer individually decides on its level of indexing according to the granularity of the incoming queries. Roll-up and drill-down operations are performed on a per-node basis in order to minimize the bandwidth required to answer queries at variable aggregation levels. We motivate our approach by applying it to a large-scale Grid system: specifically, we plan to apply our fully decentralized scheme, which creates, queries and updates large volumes of hierarchical data online, to replace the traditional centralized and strictly indexed information systems. Our extensive experimental results support this argument across many diverse configurations: our system proves very efficient under skewed workloads, over both single and multiple hierarchy levels at the same time. It adapts to sudden changes in popularity and effectively stores and updates large amounts of data at very low cost.
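
To make the terms in the abstract concrete, the following is a minimal illustrative sketch, not the authors' system. It assumes a hypothetical three-level hierarchy (country > city > site), a hash-modulo stand-in for a DHT lookup, and a simple "index at the most frequently queried level" rule. The names `LEVELS`, `Peer`, `responsible_peer` and `roll_up` are all invented for illustration; the paper's actual indexing and re-indexing policies are not described on this page.

```python
from collections import Counter
import hashlib

# Hypothetical three-level concept hierarchy, root first (an assumption).
LEVELS = ("country", "city", "site")


def key_at_level(record, level):
    """Return the record's hierarchy key truncated at the given level."""
    depth = LEVELS.index(level) + 1
    return tuple(record[name] for name in LEVELS[:depth])


def responsible_peer(key, num_peers):
    """Map a key to a peer id by hashing; a stand-in for a real DHT lookup."""
    digest = hashlib.sha1("/".join(key).encode("utf-8")).hexdigest()
    return int(digest, 16) % num_peers


class Peer:
    """Counts the levels of incoming queries and indexes at the most popular one."""

    def __init__(self, level="site"):
        self.level = level
        self.query_levels = Counter()

    def observe_query(self, level):
        self.query_levels[level] += 1

    def adapt(self):
        # Moving toward the root corresponds to a roll-up,
        # moving toward the leaves to a drill-down.
        if self.query_levels:
            self.level = self.query_levels.most_common(1)[0][0]
        return self.level


def roll_up(records, level):
    """Sum the numeric 'value' field of the records per key at the given level."""
    totals = Counter()
    for record in records:
        totals[key_at_level(record, level)] += record["value"]
    return dict(totals)


records = [
    {"country": "GR", "city": "Athens", "site": "A1", "value": 3},
    {"country": "GR", "city": "Athens", "site": "A2", "value": 4},
    {"country": "GR", "city": "Patras", "site": "P1", "value": 5},
]
```

In this sketch, a peer that mostly receives city-level queries would call `adapt()` and then store `roll_up(records, "city")` under keys routed with `responsible_peer`, so that such queries can be answered from a single aggregated entry instead of several site-level ones.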



Published In

HPDC '09: Proceedings of the 18th ACM international symposium on High performance distributed computing
June 2009
237 pages
ISBN: 9781605585871
DOI: 10.1145/1551609

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. adaptive indexing
  2. concept hierarchies
  3. distributed hash table
  4. grid information system

Qualifiers

  • Research-article

Conference

HPDC '09
Acceptance Rates

Overall Acceptance Rate 166 of 966 submissions, 17%
