DEADLINER: Building a new niche search engine

A Kruger, CL Giles, FM Coetzee, E Glover… - Proceedings of the …, 2000 - dl.acm.org
Proceedings of the ninth international conference on Information and …, 2000 - dl.acm.org
Abstract
We present DEADLINER, a search engine that catalogs conference and workshop announcements, and ultimately will monitor and extract a wide range of academic convocation material from the web. The system currently extracts speakers, locations, dates, paper submission (and other) deadlines, topics, program committees, abstracts, and affiliations. A user or user agent can perform detailed searches on these fields. DEADLINER was constructed using a methodology for rapid implementation of specialized search engines. This methodology avoids complex hand-tuned text extraction solutions, or natural language processing, by Bayesian integration of simple extractors that exploit loose formatting and keyword conventions. The Bayesian framework further produces a search engine where each user can control the false alarm rate on a field in an intuitive yet rigorous fashion.
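
The abstract does not spell out how the simple extractors are combined, so the following is only an illustrative sketch of one common way to do Bayesian integration: a naive Bayes likelihood-ratio combination that assumes the extractors fire independently given whether the field is present. The extractor names, their hit and false-alarm probabilities, and the prior are all hypothetical values chosen for illustration, not figures from the paper. The threshold on the combined log likelihood ratio plays the role of the user-controlled false alarm setting.

```python
import math
from itertools import product

# Hypothetical per-extractor statistics (illustrative, not from the paper):
# name -> (P(fires | field present), P(fires | field absent))
EXTRACTORS = {
    "keyword_deadline": (0.90, 0.20),
    "date_pattern": (0.80, 0.30),
    "near_heading": (0.60, 0.05),
}


def log_likelihood_ratio(fired):
    """Sum per-extractor log likelihood ratios.

    Assumes the extractors are conditionally independent given
    whether the field is actually present (naive Bayes assumption).
    """
    total = 0.0
    for name, (p_hit, p_false) in EXTRACTORS.items():
        if fired[name]:
            total += math.log(p_hit / p_false)
        else:
            total += math.log((1 - p_hit) / (1 - p_false))
    return total


def posterior(fired, prior=0.1):
    """Posterior probability that the field is present (prior is assumed)."""
    log_odds = math.log(prior / (1 - prior)) + log_likelihood_ratio(fired)
    return 1 / (1 + math.exp(-log_odds))


def accept(fired, threshold):
    """Accept the candidate when the combined evidence reaches the threshold."""
    return log_likelihood_ratio(fired) >= threshold


def false_alarm_rate(threshold):
    """Probability of accepting a candidate when the field is absent.

    Enumerates every firing pattern and sums the probability, under the
    'absent' hypothesis, of the patterns that would be accepted.
    """
    names = list(EXTRACTORS)
    rate = 0.0
    for pattern in product([False, True], repeat=len(names)):
        fired = dict(zip(names, pattern))
        if accept(fired, threshold):
            p = 1.0
            for name in names:
                p_false = EXTRACTORS[name][1]
                p *= p_false if fired[name] else 1 - p_false
            rate += p
    return rate


if __name__ == "__main__":
    all_fired = {name: True for name in EXTRACTORS}
    print("posterior when all extractors fire:", posterior(all_fired))
    for t in (0.0, 2.0, 4.0):
        print("threshold", t, "false alarm rate", false_alarm_rate(t))
```

Raising the threshold can only shrink the set of accepted firing patterns, so the false alarm rate is non-increasing in the threshold. That is the sense in which a user could trade recall against false alarms per field in a sketch like this one.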
ACM Digital Library