Proceedings of the 4th workshop on Workshop for Ph.D. students in information & knowledge management

PIKM '11: Proceedings of the 4th workshop on Workshop for Ph.D. students in information & knowledge management

October 2011

2011 Proceeding

Program Chairs:
Anisoara Nica
Sybase, An SAP Company, Canada
,
Fabian M. Suchanek
INRIA, France

Publisher:

Association for Computing Machinery
New York
NY
United States

Conference:

CIKM '11: International Conference on Information and Knowledge Management Glasgow Scotland, UK 28 October 2011

ISBN:

978-1-4503-0953-0

Published:

28 October 2011

Sponsors:

SIGWEB, SIGIR

Recommend ACM DL

ALREADY A SUBSCRIBER?SIGN IN

Get Alerts for this ConferenceAlerts Save to BinderBinder

Save to Binder

Create a New Binder

Name

Export CitationCitation

Share on

Next Conference

CIKM '25

Sponsor:
sigir
sigweb

The 34th ACM International Conference on Information and Knowledge Management

November 10 - 14, 2025

Seoul , Republic of Korea

Reflects downloads up to 31 Jan 2025Bibliometrics

Citation Count

Downloads (6 weeks)

Downloads (12 months)

Downloads (cumulative)

3,881

Sections

PIKM '11: Proceedings of the 4th workshop on Workshop for Ph.D. students in information & knowledge management

2011

Previous Next

Skip Abstract Section

Abstract

For the 4th time, the International Conference on Information and Knowledge Management (ACM CIKM) hosts a workshop for Ph.D. students: PIKM 2011. The goal of this workshop is two-fold: First, a Ph.D. workshop gives doctoral students an opportunity to present their work in an early stage to a global audience. This allows the students not only to crystallize their ideas into a scientific article, and to practice scientific presentation, but also to receive feedback from reviewers, from fellow students and from the general CIKM audience. Second, we believe that the research community, too, benefits from such a workshop: Ph.D. theses are the grassroots of research. They point out new research avenues and indicate current promising topics. They provide fresh viewpoints from the researchers of tomorrow. Last, we hope that the interaction with other researchers at the workshop itself, across all levels of seniority, will help propel science forward.

The PIKM workshop covers topics in all core areas of the general CIKM conference: information retrieval (IR), databases (DB), and knowledge management (KM). This includes subjects as diverse as resource monitoring, semantic search, pattern recognition, data mining, and data warehousing.

This diversity of topics was reflected in the submissions we received. The call for papers attracted 18 submissions from nearly all continents of the world. Out of these, 9 papers were accepted as full papers. In addition, 4 papers were accepted as poster papers. The papers cover proposals at various stages of the dissertation, from early outline of research plans, to in-depth investigations of acute questions and mid-term reports of work in progress. The dissertations touch all main areas of the PIKM including, for example, work on user interaction and ranking, as well as research on workflow management. Similar to past PIKM workshops, the best submission will receive a best paper award. This year's award will go to Minsuk Kahng, Sangkeun Lee and Sang-Goo Lee for their paper "Ranking Objects by Following Paths in Entity-Relationship Graphs".

As a special highlight, this year's PIKM features a keynote talk by Prof. Dr. Felix Naumann from the Hasso-Plattner-Institute, Potsdam, Germany. Prof. Naumann will talk about the challenges of "Extreme Web Data Integration" -- a task that becomes ever more challenging with the relentless growth of the Web.

Proceeding Downloads

PDF(title page, copyright, foreword, contents, organization)

PDF(author index)

Skip Table Of Content Section

Select All

Export Citations Save to Binder

SESSION: Keynote address

section

Session details: Keynote address

Fabian M. Suchanek

https://doi.org/10.1145/3253146

- 0
Metrics
Total Citations0

keynote

Extreme web data integration

Felix Naumann

Pages 1–2https://doi.org/10.1145/2065003.2065005

- 0
- 118
Metrics
Total Citations0
Total Downloads118
Last 12 Months0
Last 6 weeks0

Get Access

SESSION: Information retrieval

section

Session details: Information retrieval

Anisoara Nica

https://doi.org/10.1145/3253147

- 0
Metrics
Total Citations0

research-article

A user interaction model based on the principle of polyrepresentation

David Zellhöfer,
Ingo Schmitt

Pages 3–10https://doi.org/10.1145/2065003.2065007

Recently, the cognitively motivated principle of polyrepresentation has been shown to correlate with quantum mechanics-inspired IR models. The principle's core hypothesis is that a document is defined by different representations such as low-level ...

- 6
- 147
Metrics
Total Citations6
Total Downloads147
Last 12 Months1
Last 6 weeks0

Abstract
Get Access

research-article

Ranking objects by following paths in entity-relationship graphs

Minsuk Kahng,
Sangkeun Lee,
Sang-goo Lee

Pages 11–18https://doi.org/10.1145/2065003.2065008

In this paper, we propose an object ranking method for search and recommendation. By selecting schema-level paths and following them in an entity-relationship graph, it can incorporate diverse semantics existing in the graph. Utilizing this kind of ...

- 7
- 205
Metrics
Total Citations7
Total Downloads205
Last 12 Months0
Last 6 weeks0

Abstract
Get Access

research-article

Online conversation mining for author characterization and topic identification

Giacomo Inches,
Fabio Crestani

Pages 19–26https://doi.org/10.1145/2065003.2065009

The increasing popularity of online-based services (Twitter, Facebook, IRC, Myspace, blogs, just to mention few of them) results in a production of a huge amount of novel documents. These documents present properties that can not be found in standard ...

- 12
- 488
Metrics
Total Citations12
Total Downloads488
Last 12 Months1
Last 6 weeks0

Abstract
Get Access

SESSION: Data mining and knowledge management

section

Session details: Data mining and knowledge management

Aparna Varde

https://doi.org/10.1145/3253148

- 0
Metrics
Total Citations0

research-article

Pattern recognition in multivariate time series: dissertation proposal

Stephan Spiegel,
Brijnesh Johannes Jain,
Ernesto William De Luca,
Sahin Albayrak

Pages 27–34https://doi.org/10.1145/2065003.2065011

Nowadays computer scientists are faced with fast growing and permanently evolving data, which are represented as observations made sequentially in time. A common problem in the data mining community is the recognition of recurring patterns within ...

- 4
- 498
Metrics
Total Citations4
Total Downloads498
Last 12 Months5
Last 6 weeks0

Abstract
Get Access

research-article

Resource monitoring in industrial production with knowledge-based models and rules

Lisa Abele,
Martin Kleinsteuber,
Thorbjørn Hansen

Pages 35–42https://doi.org/10.1145/2065003.2065012

The manufacturing domain currently experiences a significant increase in resource expenses for industrial plants. However, the implementation of systems to monitor the resource consumption in such complex plants requires high investment concerning time ...

- 4
- 157
Metrics
Total Citations4
Total Downloads157
Last 12 Months3
Last 6 weeks1

Abstract
Get Access

research-article

Towards a version control model with uncertain data

Mouhamadou Lamine Ba,
Talel Abdessalem,
Pierre Senellart

Pages 43–50https://doi.org/10.1145/2065003.2065013

Content-based online collaborative platforms and office applications are widely used for collaborating and exchanging data, in particular in the form of XML-based electronic documents. Usually, a version control system is built-in in these applications ...

- 3
- 185
Metrics
Total Citations3
Total Downloads185
Last 12 Months1
Last 6 weeks0

Abstract
Get Access

SESSION: Databases

section

Session details: Databases

Anisoara Nica

https://doi.org/10.1145/3253149

- 0
Metrics
Total Citations0

research-article

Aggregation strategies for columnar in-memory databases in a mixed workload

Stephan Müller,
Hasso Plattner

Pages 51–58https://doi.org/10.1145/2065003.2065015

The recent trend towards analytics on operational data has led to an approach of reunifying online transactional processing and online analytical processing in one single database. The advent of columnar in-memory databases makes this viable and ...

- 2
- 406
Metrics
Total Citations2
Total Downloads406
Last 12 Months11
Last 6 weeks0

Abstract
Get Access

research-article

E-ETL: framework for managing evolving etl processes

Artur Wojciechowski

Pages 59–66https://doi.org/10.1145/2065003.2065016

External data sources (EDSs) being integrated in a data warehouse (DW) frequently change their data structures (schemas). As a consequence, in many cases, an already deployed ETL workflow executes with errors. Since structural changes of EDSs are ...

- 6
- 740
Metrics
Total Citations6
Total Downloads740
Last 12 Months23
Last 6 weeks2

Abstract
Get Access

research-article

Minimal data sets vs. synchronized data copies in a schema and data versioning system

Bob Wall,
Rafal Angryk

Pages 67–74https://doi.org/10.1145/2065003.2065017

In this paper, we describe a key component of our proposed data-base schema and data versioning system, ScaDaVer. The versioning system is based on common practices used to manage source code changes in software development. It allows users of a data-...

- 7
- 120
Metrics
Total Citations7
Total Downloads120
Last 12 Months4
Last 6 weeks1

Abstract
Get Access

POSTER SESSION: Posters

section

Session details: Posters

Fabian M. Suchanek

https://doi.org/10.1145/3253150

- 0
Metrics
Total Citations0

poster

Utilizing sub-topical structure of documents for information retrieval

Debasis Ganguly,
Johannes Leveling,
Gareth J.F. Jones

Pages 75–78https://doi.org/10.1145/2065003.2065019

Text segmentation in natural language processing typically refers to the process of decomposing a document into constituent subtopics. Our work centers on the application of text segmentation techniques within information retrieval (IR) tasks. For ...

- 4
- 96
Metrics
Total Citations4
Total Downloads96
Last 12 Months1
Last 6 weeks0

Abstract
Get Access

poster

Optimizing the cost of information retrieval testcollections

Mehdi Hosseini,
Ingemar Cox,
Natasa Milic-Frayling

Pages 79–82https://doi.org/10.1145/2065003.2065020

We consider the problem of optimally allocating limited resources to construct relevance judgements for a test collection that facilities reliable evaluation of retrieval systems. We assume that there is a large set of test queries, for each of which a ...

- 3
- 91
Metrics
Total Citations3
Total Downloads91
Last 12 Months1
Last 6 weeks1

Abstract
Get Access

poster

Towards semantic methodologies for automatic regulatory compliance support

Krishna Sapkota,
Arantza Aldea,
David A. Duce,
Muhammad Younas,
René Bañares-Alcántara

Pages 83–86https://doi.org/10.1145/2065003.2065021

Businesses and organizations must comply with requirements and expectations such as regulations, policies, mandates and guidelines to meet public standards and avoid hefty penalties. Checking compliance manually is a laborious, extensive and error-prone ...

- 9
- 279
Metrics
Total Citations9
Total Downloads279
Last 12 Months10
Last 6 weeks1

Abstract
Get Access

poster

RW.KNN: a proposed random walk KNN algorithm for multi-label classification

Xin Xia,
Xiaohu Yang,
Shanping Li,
Chao Wu,
Linlin Zhou

Pages 87–90https://doi.org/10.1145/2065003.2065022

Multi-label classification refers to the problem that predicts each single instance to be one or more labels in a set of associated labels. It is common in many real-world applications such as text categorization, functional genomics and semantic scene ...

- 8
- 313
Metrics
Total Citations8
Total Downloads313
Last 12 Months2
Last 6 weeks0

Abstract
Get Access

Save to Binder

Create a New Binder

Name

Contributors

Anisoara Nica
SAP SE
- Publication Years1995 - 2017
- Publication counts34
- Citation count512
- Available for Download22
- Downloads (cumulative)8,851
- Downloads (12 months)435
- Downloads (6 weeks)54
- Average Downloads per Article402
- Average Citation per Article15
View Full Profile
Fabian M Suchanek
Polytechnic Institute of Paris
- Publication Years2006 - 2024
- Publication counts89
- Citation count5,481
- Available for Download52
- Downloads (cumulative)34,400
- Downloads (12 months)3,086
- Downloads (6 weeks)395
- Average Downloads per Article662
- Average Citation per Article62
View Full Profile

Index Terms

Proceedings of the 4th workshop on Workshop for Ph.D. students in information & knowledge management
1. Information systems
2. Social and professional topics
  1. Professional topics

Index terms have been assigned to the content through auto-classification.

Comments

Recommendations

PIKM '10: Proceedings of the 3rd workshop on Ph.D. students in information and knowledge management
PIKM '15: Proceedings of the 8th Workshop on Ph.D. Workshop in Information and Knowledge Management
PIKM '14: Proceedings of the 7th Workshop on Ph.D Students

Acceptance Rates

Overall Acceptance Rate 25 of 62 submissions, 40%

Year	Submitted	Accepted	Rate
PIKM '15	16	5	31%
PIKM '14	10	4	40%
PIKM '13	13	6	46%
PIKM '10	23	10	43%
Overall	62	25	40%

Export Citations

Select Citation format

Please download or close your previous search result export first before starting a new bulk export.
Preview is not available.
By clicking download,a status dialog will open to start the export process. The process may takea few minutes but once it finishes a file will be downloadable from your browser. You may continue to browse the DL while the export process is in progress.
Download
- Download citation
- Copy citation

Save to Binder

Sections

Proceeding Downloads

Save to Binder

Index Terms

Recommendations

PIKM '10: Proceedings of the 3rd workshop on Ph.D. students in information and knowledge management

PIKM '15: Proceedings of the 8th Workshop on Ph.D. Workshop in Information and Knowledge Management

PIKM '14: Proceedings of the 7th Workshop on Ph.D Students

Acceptance Rates