Neural Topic Modeling via Discrete Variational Inference
Abstract
A Baseline Topic Modeling Architectures
B Latent Topics Identified by Various Models
Model | Topic ID | Topic Coherence | Top 10 nearest words |
---|---|---|---|
ProdLDA | 0 | 0.4658 | [minnesota, angeles, toronto, san, montreal, vancouver, ottawa, stanley, calgary, louis] |
1 | 0.3945 | [finish, play, team, hitter, offense, smith, defensive, ice, tie, hit] | |
2 | 0.5129 | [christian, god, scripture, interpretation, jesus, resurrection, teaching, doctrine, existence, holy] | |
3 | 0.3626 | [troops, israel, border, turks, army, israeli, fire, arab, civilian, minority] | |
4 | 0.5116 | [ram, windows, quadra, fine, speed, scsi, faster, microsoft, apple, external] | |
5 | 0.3740 | [entry, char, compile, db, file, section, variable, contest, distribution, remark] | |
6 | 0.5557 | [christian, scripture, god, teaching, resurrection, existence, jesus, christianity, doctrine, biblical] | |
7 | 0.3890 | [ibm, interface, transfer, external, virtual, cpu, ram, dec, path, default] | |
8 | 0.4294 | [fire, car, safety, btw, andy, surrender, country, rider, cold, stupid] | |
9 | 0.4889 | [wiretap, escrow, drug, agency, warrant, clipper, illegal, encryption, crime, country] | |
10 | 0.4657 | [wiretap, encryption, escrow, nsa, chip, clipper, pat, cheaper, scheme, car] | |
11 | 0.4874 | [turks, armenian, army, turkish, russian, armenia, united, mountain, armenians, director] | |
12 | 0.4587 | [music, fine, crash, ram, sl, external, simm, honda, wm, hd] | |
13 | 0.4824 | [bmw, car, bike, rider, baseball, btw, cop, ball, hit, motorcycle] | |
14 | 0.4628 | [satellite, mission, distribute, nasa, spacecraft, km, space, earth, distribution, module] | |
15 | 0.5309 | [fine, ram, windows, apple, amp, simm, external, scsi, crash, quadra] | |
16 | 0.4720 | [agency, encryption, cryptography, telephone, wiretap, enforcement, des, privacy, distribution, encrypt] | |
17 | 0.3659 | [neighbor, christian, heart, harm, building, scripture, jesus, daughter, woman, holy] | |
18 | 0.4507 | [scsi, ram, hd, external, mhz, meg, scsus, ide, fine, bus] | |
19 | 0.5172 | [sl, wm, ram, mi, external, hd, connector, mg, mb, mw] | |
ProdLDA\(^\prime\) | 0 | 0.3261 | [first, series, information, next, team, get, san, mailing, use, also] |
1 | 0.5680 | [buf, hp, exit, oname, char, printf, toolkit, mov, bh, saturn] | |
2 | 0.6156 | [wings, flyers, puck, pit, leafs, scripture, que, hitter, resurrection, penalty] | |
3 | 0.5557 | [lebanese, armenians, apartment, troops, azerbaijan, hitter, wings, coach, armenian, armenia] | |
4 | 0.3214 | [key, strong, secret, warrant, people, la, phone, des, algorithm, rights] | |
5 | 0.3737 | [power, little, league, game, head, go, team, make, turn, sport] | |
6 | 0.4611 | [lebanese, muslim, batf, bike, lebanon, arabs, troops, witness, apartment, massacre] | |
7 | 0.5425 | [coach, braves, leafs, hitter, rangers, tor, playoff, stanley, puck, pat] | |
8 | 0.5674 | [scsus, sl, mb, quadra, hd, meg, pd, byte, motherboard, workstation] | |
9 | 0.6024 | [satan, ford, simm, resurrection, bmw, bike, doctrine, gear, quadra, scripture] | |
10 | 0.6974 | [encrypt, encryption, wiretap, escrow, crypto, cipher, rsa, anonymous, pgp, cryptography] | |
11 | 0.3734 | [vehicle, police, chip, technique, transmission, dealer, cop, car, traffic, radar] | |
12 | 0.6243 | [mi, eus, spacecraft, mw, moon, mg, orbit, ah, rg, ax] | |
13 | 0.3331 | [want, buy, like, drive, solution, control, people, know, driver, anyone] | |
14 | 0.4621 | [armenia, morality, proceed, revelation, bmw, verse, turks, soul, resurrection, bike] | |
15 | 0.5764 | [xlib, toolkit, pixel, xterm, visual, motif, turbo, microsoft, printf, meg] | |
16 | 0.4564 | [bhj, spacecraft, bh, eus, wm, byte, device, ripem, wire, digital] | |
17 | 0.4783 | [mhz, mb, adapter, bhj, scsi, scsus, windows, wm, hd, isa] | |
18 | 0.3429 | [escape, window, conflict, see, document, terrorist, let, shell, motif, cambridge] | |
19 | 0.4948 | [militia, arab, palestinian, homicide, lebanese, arabs, turks, armenia, troops, armenians] | |
ETM | 0 | 0.3489 | [get, know, one, say, think, like, see, thing, people, time] |
1 | 0.3254 | [use, may, make, case, many, also, part, however, system, president] | |
2 | 0.4505 | [god, jesus, christian, say, believe, bible, one, christ, make, belief] | |
3 | 0.3682 | [gun, people, child, kill, drug, crime, weapon, police, case, claim] | |
4 | 0.5255 | [datum, space, db, launch, output, hus, widget, dod, nasa, sun] | |
5 | 0.3390 | [file, use, program, send, available, list, code, email, please, line] | |
6 | 0.4139 | [hockey, team, new, division, san, canada, nhl, toronto, york, gm] | |
7 | 0.3439 | [new, look, buy, price, good, sell, include, package, offer, like] | |
8 | 0.3851 | [car, power, use, drive, speed, engine, wire, water, fast, low] | |
9 | 0.4651 | [game, year, play, win, team, player, season, go, good, run] | |
10 | 0.3312 | [information, make, get, please, mail, use, go, file, help, take] | |
11 | 0.3378 | [book, first, one, time, science, study, earth, author, history, find] | |
12 | 0.3516 | [university, group, internet, information, computer, fax, center, year, call, research] | |
13 | 0.3628 | [thanks, david, john, appreciate, steve, mark, wonder, jim, mike, michael] | |
14 | 0.3749 | [write, article, post, question, read, opinion, ask, please, yes, answer] | |
15 | 0.3397 | [period, la, pt, vs, de, van, pp, cal, power, second] | |
16 | 0.3607 | [key, government, law, use, encryption, state, chip, public, right, security] | |
17 | 0.3354 | [go, take, back, one, day, put, get, right, also, call] | |
18 | 0.3987 | [use, drive, system, window, card, run, windows, disk, problem, image] | |
19 | 0.4854 | [israel, people, war, israeli, jews, turkish, armenians, country, armenian, government] | |
\(\text{ETM}^{\prime }\) | 0 | 0.3749 | [use, wiring, connector, code, line, voltage, get, ground, find, might] |
1 | 0.3876 | [game, read, news, times, know, beat, hear, braves, go, back] | |
2 | 0.5399 | [apple, macintosh, amiga, graphics, processor, pc, modem, computer, server, printer] | |
3 | 0.3503 | [know, want, get, really, say, see, ca, never, think, tell] | |
4 | 0.3341 | [think, know, name, really, something, people, many, feel, thing, like] | |
5 | 0.4001 | [gm, please, want, get, maybe, know, somebody, make, lot, hey] | |
6 | 0.3833 | [book, verse, copy, write, author, manual, guide, reader, story, edition] | |
7 | 0.4091 | [tax, billion, budget, pay, fee, federal, package, dollar, money, please] | |
8 | 0.3644 | [get, know, want, think, tell, go, really, take, find, ask] | |
9 | 0.3416 | [god, know, say, think, see, like, make, use, want, one] | |
10 | 0.3316 | [like, make, use, write, go, many, get, help, need, know] | |
11 | 0.3827 | [league, nhl, game, team, list, mail, international, use, first, new] | |
12 | 0.4393 | [peace, palestinian, visit, state, foreign, israel, islamic, arab, conference, muslim] | |
13 | 0.5090 | [fax, computer, pc, electronic, systems, hardware, software, graphics, nt, server] | |
14 | 0.3841 | [go, let, oh, hey, please, eat, red, hang, stay, waco] | |
15 | 0.4863 | [belief, christianity, religion, islam, faith, people, muslims, sense, believe, god] | |
16 | 0.3508 | [shell, institute, justice, research, fund, professor, microsoft, minority, secretary, panel] | |
17 | 0.3367 | [god, see, use, know, say, go, want, like, one, need] | |
18 | 0.4485 | [federal, constitution, law, amendment, state, act, government, enforcement, authority, right] | |
19 | 0.5186 | [christianity, religion, armenians, islam, jews, religious, turks, turkish, muslims, turkey] |
C Baseline Models in Case Studies
C.1 VAE-LM: Baseline Model in Case Study 1
C.2 AARM: Baseline Model in Case Study 2
References
Index Terms
- Neural Topic Modeling via Discrete Variational Inference
Recommendations
Jointly Discovering Fine-grained and Coarse-grained Sentiments via Topic Modeling
MM '14: Proceedings of the 22nd ACM international conference on MultimediaThe ever-increasing user-generated contents in social media and other web services make it highly desirable to discover opinions of users on all kinds of topics. Motivated by the assumption that individual word and paragraph in documents will deliver ...
Extractive text summarization using clustering-based topic modeling
AbstractText summarization is the process of converting the input document into a short form, provided that it preserves the overall meaning associated with it. Primarily, text summarization is achieved in two ways, i.e., abstractive and extractive. ...
Understanding Sparse Topical Structure of Short Text via Stochastic Variational-Gibbs Inference
CIKM '16: Proceedings of the 25th ACM International on Conference on Information and Knowledge ManagementWith the soaring popularity of online social media like Twitter, analyzing short text has emerged as an increasingly important task which is challenging to classical topic models, as topic sparsity exists in short text. Topic sparsity refers to the ...
Comments
Information & Contributors
Information
Published In
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
Check for updates
Author Tags
Qualifiers
- Research-article
Contributors
Other Metrics
Bibliometrics & Citations
Bibliometrics
Article Metrics
- 0Total Citations
- 388Total Downloads
- Downloads (Last 12 months)133
- Downloads (Last 6 weeks)6
Other Metrics
Citations
View Options
Get Access
Login options
Check if you have access through your login credentials or your institution to get full access on this article.
Sign inFull Access
View options
View or Download as a PDF file.
PDFeReader
View online with eReader.
eReaderFull Text
View this article in Full Text.
Full TextHTML Format
View this article in HTML Format.
HTML Format