An Access Scheduling Tree to Achieve Optimal Freshness in Local Repositories

Qin, Li; Atluri, Vijayalakshmi

doi:10.1007/978-3-540-45229-4_23

Li Qin⁷ &
Vijayalakshmi Atluri⁷

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2738))

Included in the following conference series:

International Conference on Electronic Commerce and Web Technologies

609 Accesses

Abstract

To achieve improved availability and performance, often, local copies of remote data from autonomous sources are maintained. Examples of such local copies include data warehouses and repositories managed by web search engines. As the size of the local data grows, it is not always feasible to maintain the freshness (up-to-dateness) of the entire data due to resource limitations. Previous contributions to maintaining freshness of local data use a freshness metric as the proportion of fresh documents within the total repository (we denote this as average freshness). As a result, even though updates to more frequently changing data are not captured, the average freshness measure may still be high. In this paper, we argue that, in addition to average freshness, it is important that the freshness metric should also include the proportion of changes captured for each document, which we call object freshness. The latter is particularly important when both the current and historical versions of information sources are queried or mined. We propose an approach by building an access scheduling tree (AST) to precisely schedule access to remote sources that achieves optimal freshness of the local data under limited availability of resources. We show, via experiments, the performance of our approach is significantly higher than a linear priority queue.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

A third-party replication service for dynamic hidden databases

Article Open access 08 January 2021

HTTP Extensions for the Management of Highly Dynamic Data Resources

$\varvec{\textsc {Orpheus}}$DB: bolt-on versioning for relational databases (extended version)

Article 20 December 2019

References

Brewington, B.E., Cybenko, G.: How Dynamic is the Web? In: 9th World Wide Web Conference, WWW9 (2000)
Google Scholar
Carrano, F.M., Prichard, J.J.: Data Abstraction and Problem Solving with C++, 3rd edn. Addison-Wesley, Reading (2001)
Google Scholar
Cho, J., Garcia-Molina, H.: The Evolution of the Web and Implications for an Incremental Crawler. In: 26th International Conference on Very Large Databases (VLDB), pp. 200–209 (2000)
Google Scholar
Cho, J., Garcia-Molina, H.: Synchronizing a Database to Improve Freshness. In: ACM SIGMOD International Conference on Management of Data, pp. 117–128 (2000)
Google Scholar
Cho, J., Garcia-Molina, H., Page, L.: Efficient Crawling Through URL Ordering. In: 7th World Wide Web Conference, WWW7 (1998)
Google Scholar
Douglis, F., Feldmann, A., Krishnamurthy, B., Mogul, J.: Rate of Change and Other Metrics: A Live Study of the World Wide Web. In: USENIX Symposium on Internetworking Technologies and Systems (December 1997)
Google Scholar
Heydon, A., Najork, M.: Mercator: A Scalable, Extensible Web Crawler. World Wide Web 2(4), 219–229 (1999)
Article Google Scholar
Qin, L., Atluri, V.: An Access Scheduling Tree to Achieve Optimal Freshness in Local Repositories. Technical report (2002)
Google Scholar
Wang, Y., DeWitt, D.J., Cai, J.-Y.: X-Diff: An Effective Change Detection Algorithm for XML Documents. In: 19th International Conference on Data Engineering, ICDE (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

MSIS Department and, Center for Information Management, Integration and Connectivity (CIMIC), Rutgers University, 180 University Avenue, Newark, NJ, 07102, USA
Li Qin & Vijayalakshmi Atluri

Authors

Li Qin
View author publications
You can also search for this author in PubMed Google Scholar
Vijayalakshmi Atluri
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Zurich, Department of Informatics (IFI), Winterthurer Stra{ß}e 190, 8057, Zurich, Switzerland
Kurt Bauknecht
Institute of Software Technology and Interactive Systems, Vienna University of Technology, Favoritenstr. 9-11/188, A-1040,, Wien, Austria
A Min Tjoa
Division of Information Technology, Engineering and the Environment, School of Computer and Information Science, University of South Australia, 5095, Mawson Lakes, SA, Australia
Gerald Quirchmayr

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Qin, L., Atluri, V. (2003). An Access Scheduling Tree to Achieve Optimal Freshness in Local Repositories. In: Bauknecht, K., Tjoa, A.M., Quirchmayr, G. (eds) E-Commerce and Web Technologies. EC-Web 2003. Lecture Notes in Computer Science, vol 2738. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45229-4_23

Download citation

DOI: https://doi.org/10.1007/978-3-540-45229-4_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40808-6
Online ISBN: 978-3-540-45229-4
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

An Access Scheduling Tree to Achieve Optimal Freshness in Local Repositories

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

A third-party replication service for dynamic hidden databases

HTTP Extensions for the Management of Highly Dynamic Data Resources

\(\varvec{\textsc {Orpheus}}\)DB: bolt-on versioning for relational databases (extended version)

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

An Access Scheduling Tree to Achieve Optimal Freshness in Local Repositories

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

A third-party replication service for dynamic hidden databases

HTTP Extensions for the Management of Highly Dynamic Data Resources

\(\varvec{\textsc {Orpheus}}\)DB: bolt-on versioning for relational databases (extended version)

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation