The document discusses topic modeling and classification of short texts. It describes using Latent Dirichlet Allocation (LDA) to extract hidden topics from a large universal text corpus consisting of Wikipedia and MEDLINE articles. These topics are then used as features for a maximum entropy classifier to categorize short texts like tweets and web snippets. Parallelized LDA is implemented using th
