Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2808797.2809339acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
poster

Enriching Arabic Tweets Representation based on Web Search Engine and the Rough Set Theory

Published: 25 August 2015 Publication History

Abstract

Twitter is a popular micro-blogging service where users search for timely and social information. Users post short text messages called Tweets, which are limited in length. These Tweets are different from traditional documents in its shortness and sparseness. As a result, short text tends to be ambiguous without enough contextual information. To address these issues, we propose an efficient method to enrich the tweet's representation for the Arabic language using web search engine as a large and open corpus and the Rough Set Theory which is a mathematical tool to deal with vagueness and uncertainty. To assess the performance of the proposed system, a series of experiments has been conducted. The effectiveness of our system has been evaluated and compared in terms of the F1-measure using the Naïve Bayesian (NB) and the Support Vector Machine (SVM) classifiers in our Arabic Tweets Categorization System. The obtained results show that enriching the tweet's representation increases significantly the F1-measure of the Arabic tweets categorization system.

References

[1]
Go, A. Bhayani, R. & Huang, L. Twitter Sentiment Classification using Distant Supervision. Processing. 1-6. (2009)
[2]
B. Sriram, D. Fuhry, E. Demir, H. Ferhatosmanoglu. Short Text Classification in Twitter to Improve Information Filtering, SIGIR'10, July 19--23, 2010, Geneva, Switzerland.ACM 978-1 60558-896-4/10/07. (2010)
[3]
Jiliang TANG, Xufei WANG, Huiji GAO, Xia HU and Huan LIU.Enriching short text representation in microblog for clustering Front. Comput. Sci., 6(1) DOI 10.1007/s11704-009-0000-0. (2012)
[4]
Pawlak, Z. Rough sets: Theoretical aspects of reasoning about data, Kluwer Dordrecht. (1991)
[5]
http://social-dynamics.org/twitter-network-data/
[6]
https://developers.google.com/custom-search/docs/start
[7]
https://datamarket.azure.com/dataset/5BA839F1-12CE-4CCE-BF57-A49D98D29A44

Cited By

View all
  • (2019)Arabic Sentiment Analysis based on Topic ModelingProceedings of the New Challenges in Data Sciences: Acts of the Second Conference of the Moroccan Classification Society10.1145/3314074.3314091(1-6)Online publication date: 28-Mar-2019
  • (2018)An effective short text conceptualization based on new short text similaritySocial Network Analysis and Mining10.1007/s13278-018-0544-89:1Online publication date: 3-Dec-2018
  1. Enriching Arabic Tweets Representation based on Web Search Engine and the Rough Set Theory

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      ASONAM '15: Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2015
      August 2015
      835 pages
      ISBN:9781450338547
      DOI:10.1145/2808797
      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 25 August 2015

      Check for updates

      Author Tags

      1. Arabic Tweets
      2. Rough Set Theory
      3. Short Text
      4. Snippets
      5. Twitter
      6. Web Search Engine

      Qualifiers

      • Poster
      • Research
      • Refereed limited

      Conference

      ASONAM '15
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 116 of 549 submissions, 21%

      Upcoming Conference

      KDD '25

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)1
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 24 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2019)Arabic Sentiment Analysis based on Topic ModelingProceedings of the New Challenges in Data Sciences: Acts of the Second Conference of the Moroccan Classification Society10.1145/3314074.3314091(1-6)Online publication date: 28-Mar-2019
      • (2018)An effective short text conceptualization based on new short text similaritySocial Network Analysis and Mining10.1007/s13278-018-0544-89:1Online publication date: 3-Dec-2018

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media