Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1814245.1814249acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Emerging topic detection on Twitter based on temporal and social terms evaluation

Published: 25 July 2010 Publication History

Abstract

Twitter is a user-generated content system that allows its users to share short text messages, called tweets, for a variety of purposes, including daily conversations, URLs sharing and information news. Considering its world-wide distributed network of users of any age and social condition, it represents a low level news flashes portal that, in its impressive short response time, has the principal advantage.
In this paper we recognize this primary role of Twitter and we propose a novel topic detection technique that permits to retrieve in real-time the most emergent topics expressed by the community. First, we extract the contents (set of terms) of the tweets and model the term life cycle according to a novel aging theory intended to mine the emerging ones. A term can be defined as emerging if it frequently occurs in the specified time interval and it was relatively rare in the past. Moreover, considering that the importance of a content also depends on its source, we analyze the social relationships in the network with the well-known Page Rank algorithm in order to determine the authority of the users. Finally, we leverage a navigable topic graph which connects the emerging terms with other semantically related keywords, allowing the detection of the emerging topics, under user-specified time constraints. We provide different case studies which show the validity of the proposed approach.

References

[1]
Trendistic. http://trendistic.com/.
[2]
Tweet tabs. http://tweettabs.com/.
[3]
Twitter API. http://apiwiki.twitter.com/.
[4]
Twopular. http://twopular.com/.
[5]
Where-what-when. http://where-what-when.husk.org/.
[6]
S. Abrol and L. Khan. Twinner: understanding news queries with geo-content using twitter. In GIR '10: Proceedings of the 6th Workshop on Geographic Information Retrieval, pages 1--8, New York, NY, USA, 2010. ACM.
[7]
J. Allan, editor. Topic detection and tracking: event-based information organization. Kluwer Academic Publishers, Norwell, MA, USA, 2002.
[8]
M. Balabanovic and Y. Shoham. Fab: Content-based, collaborative recommendation. Communications of the ACM, 40:66--72, 1997.
[9]
K. K. Bun, M. Ishizuka, and B. M. Ishizuka. Topic extraction from news archive using tf*pdf algorithm. In Proceedings of 3rd Int'l Conference on Web Informtion Systems Engineering (WISE 2002), IEEE Computer Soc, pages 73--82. WISE, 2002.
[10]
M. Cataldi, C. Schifanella, K. S. Candan, M. L. Sapino, and L. D. Caro. Cosena: a context-based search and navigation system. In MEDES, pages 218--225. ACM, 2009.
[11]
C. C. Chen, Y.-T. Chen, Y. S. Sun, and M. C. Chen. Life cycle modeling of news events using aging theory. In ECML, pages 47--59, 2003.
[12]
J. Chen, W. Geyer, C. Dugan, M. Muller, and I. Guy. Make new friends, but keep the old: recommending people on social networking sites. In CHI '09: Proceedings of the 27th international conference on Human factors in computing systems, pages 201--210, New York, NY, USA, 2009. ACM.
[13]
J. Chen, R. Nairn, L. Nelson, M. Bernstein, and E. Chi. Short and tweet: Experiments on recommending content from information. Atlanta, USA, 2009. ACM Press.
[14]
L. Di Caro, K. S. Candan, and M. L. Sapino. Using tagflake for condensing navigable tag hierarchies from tag clouds. In KDD '08: Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 1069--1072, New York, NY, USA, 2008. ACM.
[15]
A. Favenza, M. Cataldi, M. L. Sapino, and A. Messina. Topic development based refinement of audio-segmented television news. In NLDB '08, pages 226--232, Berlin, Heidelberg, 2008. Springer-Verlag.
[16]
G. P. C. Fung, J. X. Yu, H. Liu, and P. S. Yu. Time-dependent event hierarchy construction. In KDD '07: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 300--309, New York, NY, USA, 2007. ACM.
[17]
D. Goldberg, D. Nichols, B. M. Oki, and D. Terry. Using collaborative filtering to weave an information tapestry. Communications of the ACM, 35(12):61--70, 1992.
[18]
T. L. Griffiths and M. Steyvers. Finding scientific topics. Proceedings of the National Academy of Sciences, 101(Suppl. 1):5228--5235, April 2004.
[19]
A. Hassan, D. Radev, J. Cho, and A. Joshi. Content based recommendation and summarization in the blogosphere. International AAAI Conference on Weblogs and Social Media, 2009.
[20]
Q. He, K. Chang, and E.-P. Lim. Using burstiness to improve clustering of topics in news streams. Data Mining, IEEE International Conference on, 0:493--498, 2007.
[21]
R. Jäschke, L. Marinho, A. Hotho, L. Schmidt-Thieme, and G. Stumme. Tag recommendations in folksonomies. In PKDD 2007, pages 506--514, Berlin, Heidelberg, 2007. Springer-Verlag.
[22]
J. Leskovec and C. Faloutsos. Sampling from large graphs. In KDD '06: Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 631--636, New York, NY, USA, 2006. ACM.
[23]
J. Makkonen, H. Ahonen-Myka, and M. Salmenkivi. Simple semantics in topic detection and tracking. Inf. Retr., 7(3--4):347--368, 2004.
[24]
P. Melville, R. J. Mooney, and R. Nagarajan. Content-boosted collaborative filtering. In In Proceedings of the 2001 SIGIR Workshop on Recommender Systems, 2001.
[25]
L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web. In Proceedings of the 7th International World Wide Web Conference, pages 161--172, Brisbane, Australia, 1998.
[26]
Y. Qi and K. S. Candan. Cuts: Curvature-based development pattern analysis and segmentation for blogs and other text streams. In HYPERTEXT '06, pages 1--10, New York, NY, USA, 2006. ACM.
[27]
I. Ruthven and M. Lalmas. A survey on the use of relevance feedback for information access systems. Knowl. Eng. Rev., 18(2):95--145, June 2003.
[28]
G. Salton and C. Buckley. Term-weighting approaches in automatic text retrieval. In Information Processing and Management, pages 513--523, 1988.
[29]
M. O. Takeshi Sakaki and Y. Matsuo. Earthquake shakes twitter users: Real-time event detection by social sensors. In WWW 2010, 2010.
[30]
P. Treeratpituk and J. Callan. Automatically labeling hierarchical clusters. In dg.o '06: Proceedings of the 2006 international conference on Digital government research, pages 167--176, New York, NY, USA, 2006. ACM.
[31]
C. Wang, M. Zhang, L. Ru, and S. Ma. Automatic online news topic ranking using media focus and user attention based on aging theory. In CIKM '08, pages 1033--1042, New York, NY, USA, 2008. ACM.
[32]
Y. Wu, Y. Ding, X. Wang, and J. Xu. On-line hot topic recommendation using tolerance rough set based topic clustering. Journal of Computers, 5(4), 2010.
[33]
Q. Zhao, P. Mitra, and B. Chen. Temporal and information flow based event detection from social text streams. In AAAI'07: Proceedings of the 22nd national conference on Artificial intelligence, pages 1501--1506. AAAI Press, 2007.

Cited By

View all
  • (2024)Learning Joint Topic Representation for Detecting Drift in Social Media TextInternational Journal of Uncertainty, Fuzziness and Knowledge-Based Systems10.1142/S021848852450024732:06(955-983)Online publication date: 21-Oct-2024
  • (2024)Next Topic Recommendation for Influencers on Social Media2024 IEEE International Conference on Big Data (BigData)10.1109/BigData62323.2024.10825496(723-728)Online publication date: 15-Dec-2024
  • (2024)An Adaptive Hot Ranking Algorithm for Popular Item Recommendation in the Express IndustryCognitive Computing - ICCC 202410.1007/978-3-031-77954-1_5(71-87)Online publication date: 29-Nov-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MDMKDD '10: Proceedings of the Tenth International Workshop on Multimedia Data Mining
July 2010
86 pages
ISBN:9781450302203
DOI:10.1145/1814245
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 July 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. aging theory
  2. text analysis
  3. topic detection

Qualifiers

  • Research-article

Conference

KDD '10
Sponsor:

Acceptance Rates

Overall Acceptance Rate 3 of 5 submissions, 60%

Upcoming Conference

KDD '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)60
  • Downloads (Last 6 weeks)6
Reflects downloads up to 27 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Learning Joint Topic Representation for Detecting Drift in Social Media TextInternational Journal of Uncertainty, Fuzziness and Knowledge-Based Systems10.1142/S021848852450024732:06(955-983)Online publication date: 21-Oct-2024
  • (2024)Next Topic Recommendation for Influencers on Social Media2024 IEEE International Conference on Big Data (BigData)10.1109/BigData62323.2024.10825496(723-728)Online publication date: 15-Dec-2024
  • (2024)An Adaptive Hot Ranking Algorithm for Popular Item Recommendation in the Express IndustryCognitive Computing - ICCC 202410.1007/978-3-031-77954-1_5(71-87)Online publication date: 29-Nov-2024
  • (2024)Exploring Spreaders in a Retweet Network: A Case from the 2023 Kahramanmaraş Earthquake SequenceEmerging Trends and Applications in Artificial Intelligence10.1007/978-3-031-56728-5_40(481-492)Online publication date: 30-Apr-2024
  • (2023)An Unsupervised Graph-Based Approach for Detecting Relevant Topics: A Case Study on the Italian Twitter Cohort during the Russia–Ukraine ConflictInformation10.3390/info1406033014:6(330)Online publication date: 12-Jun-2023
  • (2023)Sosyal medyada siyaset ve dezenformasyon: Haber metinlerine yönelik bir söylem analiziPolitics and disinformatipn in social media: A discourse analysis on news textsÖmer Halisdemir Üniversitesi İktisadi ve İdari Bilimler Fakültesi Dergisi10.25287/ohuiibf.123054716:3(674-692)Online publication date: 31-Jul-2023
  • (2023)The political motives of party elites and public responses in Twitter: The idea of postponing Indonesia’s 2024 electionMasyarakat, Kebudayaan dan Politik10.20473/mkp.V36I22023.288-30136:2(288-301)Online publication date: 30-Jun-2023
  • (2023)Monitoring of Public Opinion on Typhoon Disaster Using Improved Clustering Model Based on Single-Pass ApproachSage Open10.1177/2158244023120009813:3Online publication date: 29-Sep-2023
  • (2023)Social Network Analysis: A Survey on Measure, Structure, Language Information Analysis, Privacy, and ApplicationsACM Transactions on Asian and Low-Resource Language Information Processing10.1145/353973222:5(1-47)Online publication date: 9-May-2023
  • (2023)BTD: An effective business-related hot topic detection scheme in professional social networksInformation Sciences10.1016/j.ins.2022.12.081630(420-442)Online publication date: Jun-2023
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media