Proceedings of the 7th annual ACM international workshop on Web information and data management

WIDM '05: Proceedings of the 7th annual ACM international workshop on Web information and data management

November 2005

2005 Proceeding

Program Chairs:
Angela Bonifati, Icar CNR, Italy
Dongwon Lee, Penn State University, USA

Publisher:

Association for Computing Machinery, New York, NY, United States

Conference:

CIKM05: Conference on Information and Knowledge Management, Bremen, Germany, 4 November 2005

ISBN:

978-1-59593-194-8

Published:

04 November 2005

Sponsors:

ACM, SIGIR


Bibliometrics (reflects downloads up to 26 Jan 2025)

Citation Count: 719

Downloads (6 weeks)

Downloads (12 months)

Downloads (cumulative): 12,992


Abstract

The 2005 International Workshop on Web Information and Data Management (WIDM 2005) is the seventh in a series of workshops on Web Information and Data Management held in conjunction with the International Conference on Information and Knowledge Management (CIKM). The objective of the workshop is to bring together researchers, industrial practitioners, and developers to study how Web information can be extracted, stored, analyzed, and processed to provide useful knowledge to end users for various advanced database applications. WIDM 2005 is sponsored by ACM SIGIR, with the cooperation of ACM SIGMOD.

The call for papers resulted in the submission of 44 papers from 15 countries. Starting this year, a one-day workshop schedule accommodates regular papers (up to 8 pages long) along with a few short papers (up to 6 pages long). All papers were thoroughly reviewed by the program committee and external reviewers. The program committee accepted 12 papers (8 full and 4 short papers) for this year's new one-day program, resulting in a competitive 27% acceptance rate. The authors of these papers are from 7 countries. The 12 accepted papers were divided into 3 sessions: "Web Ranking and Retrieval," "XML Data Management and Web Discovery," and "Web Clustering, Filtering and Applications." In addition, the WIDM 2005 program includes an invited talk, "A Web of Data: New Architectures for New Technology?", by Prof. Donald Kossmann of ETH Zurich (Switzerland).

The workshop would not be possible without the support of the NIKE (Nittany Information, Knowledge and wEb) Research Group of The Pennsylvania State University. The group provided both the manpower and the computing resources to host the workshop Web site and to run the ConfMan paper submission and review system.



Article

A web of data: new architectures for new technology?

Donald Kossmann

Page 1
https://doi.org/10.1145/1097047.1097048

The last decade has seen a wave of new technology to publish, access, and integrate data on the Web. Furthermore, many new applications have emerged and Web technologies have penetrated almost all systems from small mobile applications to large-scale ...

Metrics
Total Citations: 1
Total Downloads: 277
Last 12 Months: 0
Last 6 weeks: 0

SESSION: Web ranking and retrieval

section

Session details: Web ranking and retrieval

A. Dekhtyar

https://doi.org/10.1145/3246250

Metrics
Total Citations: 0

Article

Web path recommendations based on page ranking and Markov models

Magdalini Eirinaki,
Michalis Vazirgiannis,
Dimitris Kapogiannis

Pages 2–9
https://doi.org/10.1145/1097047.1097050

Markov models have been widely used for modelling users' navigational behaviour in the Web graph, using the transitional probabilities between web pages, as recorded in the web logs. The recorded users' navigation is used to extract popular web paths ...

Metrics
Total Citations: 51
Total Downloads: 1,463
Last 12 Months: 16
Last 6 weeks: 1

Article

Semantic similarity methods in WordNet and their application to information retrieval on the web

Giannis Varelas,
Epimenidis Voutsakis,
Paraskevi Raftopoulou,
Euripides G.M. Petrakis,
Evangelos E. Milios

Pages 10–16
https://doi.org/10.1145/1097047.1097051

Semantic Similarity relates to computing the similarity between concepts which are not lexicographically similar. We investigate approaches to computing semantic similarity by mapping terms (concepts) to an ontology and by examining their relationships ...

Metrics
Total Citations: 205
Total Downloads: 2,952
Last 12 Months: 38
Last 6 weeks: 4

Article

DirectoryRank: ordering pages in web directories

Vlassis Krikos,
Sofia Stamou,
Pavlos Kokosis,
Alexandros Ntoulas,
Dimitris Christodoulakis

Pages 17–22
https://doi.org/10.1145/1097047.1097052

Web Directories are repositories of Web pages organized in a hierarchy of topics and sub-topics. In this paper, we present DirectoryRank, a ranking framework that orders the pages within a given topic according to how informative they are about the ...

Metrics
Total Citations: 11
Total Downloads: 429
Last 12 Months: 0
Last 6 weeks: 0

SESSION: XML data management and web discovery

section

Session details: XML data management and web discovery

E. G. M. Petrakis

https://doi.org/10.1145/3246251

Metrics
Total Citations: 0

Article

Exploiting native XML indexing techniques for XML retrieval in relational database systems

Felix Weigel,
Klaus U. Schulz,
Holger Meuss

Pages 23–30
https://doi.org/10.1145/1097047.1097054

In XML retrieval, two distinct approaches have been established and pursued without much cross-fertilization taking place so far. On the one hand, native XML databases tailored to the semistructured data model have received considerable attention, and a ...

Metrics
Total Citations: 9
Total Downloads: 1,374
Last 12 Months: 1
Last 6 weeks: 0

Article

Query translation scheme for heterogeneous XML data sources

Cindy X. Chen,
George A. Mihaila,
Sriram Padmanabhan,
Isabelle M. Rouvellou

Pages 31–38
https://doi.org/10.1145/1097047.1097055

In order to formulate a meaningful XML query, a user must have some knowledge of the schema of the XML documents to be queried. The query will succeed only if the schema of the actual documents is consistent with the user's information. When a user ...

Metrics
Total Citations: 7
Total Downloads: 209
Last 12 Months: 0
Last 6 weeks: 0

Article

Impact of XML schema evolution on valid documents

Giovanna Guerrini,
Marco Mesiti,
Daniele Rossi

Pages 39–44
https://doi.org/10.1145/1097047.1097056

In this paper we investigate the problem of XML Schema evolution. We first discuss the different kinds of changes that may be needed on an XML Schema. Then, we investigate how to minimize document revalidation, that is, detecting the document parts ...

Metrics
Total Citations: 55
Total Downloads: 629
Last 12 Months: 0
Last 6 weeks: 0

Article

A framework for semantic web services discovery

Jyotishman Pathak,
Neeraj Koul,
Doina Caragea,
Vasant G. Honavar

Pages 45–50
https://doi.org/10.1145/1097047.1097057

This paper describes a framework for ontology-based flexible discovery of Semantic Web services. The proposed approach relies on user-supplied, context-specific mappings from a user ontology to relevant domain ontologies used to specify Web services. ...

Metrics
Total Citations: 70
Total Downloads: 1,614
Last 12 Months: 1
Last 6 weeks: 0

SESSION: Web clustering, filtering and applications

section

Session details: Web clustering, filtering and applications

C. Chen

https://doi.org/10.1145/3246252

Metrics
Total Citations: 0

Article

Narrative text classification for automatic key phrase extraction in web document corpora

Yongzheng Zhang,
Nur Zincir-Heywood,
Evangelos Milios

Pages 51–58
https://doi.org/10.1145/1097047.1097059

Automatic key phrase extraction is a useful tool in many text related applications such as clustering and summarization. State-of-the-art methods are aimed towards extracting key phrases from traditional text such as technical papers. Application of ...

Metrics
Total Citations: 36
Total Downloads: 882
Last 12 Months: 3
Last 6 weeks: 1

Article

On improving local website search using web server traffic logs: a preliminary report

Qing Cui,
Alex Dekhtyar

Pages 59–66
https://doi.org/10.1145/1097047.1097060

In this paper we give a preliminary report on our study of the use of web server traffic logs to improve local search. Web server traffic logs are typically private to individual websites and, as such, are unavailable to traditional web search ...

Metrics
Total Citations: 5
Total Downloads: 572
Last 12 Months: 2
Last 6 weeks: 0

Article

Preventing shilling attacks in online recommender systems

Paul-Alexandru Chirita,
Wolfgang Nejdl,
Cristian Zamfir

Pages 67–74
https://doi.org/10.1145/1097047.1097061

Collaborative filtering techniques have been successfully employed in recommender systems in order to help users deal with information overload by making high quality personalized recommendations. However, such systems have been shown to be vulnerable ...

Metrics
Total Citations: 211
Total Downloads: 1,310
Last 12 Months: 20
Last 6 weeks: 0

Article

Looking at both the present and the past to efficiently update replicas of web content

Luciano Barbosa,
Ana Carolina Salgado,
Francisco de Carvalho,
Jacques Robin,
Juliana Freire

Pages 75–80
https://doi.org/10.1145/1097047.1097062

Since Web sites are autonomous and independently updated, applications that keep replicas of Web data, such as Web warehouses and search engines, must periodically poll the sites and check for changes. Since this is a resource-intensive task, in order to ...

Metrics
Total Citations: 13
Total Downloads: 312
Last 12 Months: 4
Last 6 weeks: 0

Article

A search result clustering method using informatively named entities

Hiroyuki Toda,
Ryoji Kataoka

Pages 81–86
https://doi.org/10.1145/1097047.1097063

Clustering the results of a search helps the user to overview the information returned. In this paper, we regard the clustering task as indexing the search results. Here, an index means a structured label list that can make it easier for the user to ...

Metrics
Total Citations: 45
Total Downloads: 960
Last 12 Months: 6
Last 6 weeks: 2


Contributors

Angela Bonifati
Claude Bernard Lyon 1 University
- Publication Years: 1999 - 2025
- Publication counts: 103
- Citation count: 1,368
- Available for Download: 77
- Downloads (cumulative): 68,418
- Downloads (12 months): 7,563
- Downloads (6 weeks): 896
- Average Downloads per Article: 889
- Average Citation per Article: 13

Dongwon Lee
Pennsylvania State University
- Publication Years: 1999 - 2024
- Publication counts: 147
- Citation count: 2,741
- Available for Download: 90
- Downloads (cumulative): 81,130
- Downloads (12 months): 15,979
- Downloads (6 weeks): 1,673
- Average Downloads per Article: 901
- Average Citation per Article: 19

