Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1007/978-3-319-05693-7_1guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Big Data in Online Social Networks: User Interaction Analysis to Model User Behavior in Social Networks

Published: 24 March 2014 Publication History

Abstract

With hundreds of millions of users worldwide, social networks provide incredible opportunities for social connection, learning, political and social change, and individual entertainment and enhancement in a multiple contexts. Because many social interactions currently take place in online networks, social scientists have access to unprecedented amounts of information about social interaction. Prior to the advent of such online networks, these investigations required resource-intensive activities such as random trials, surveys, and manual data collection to gather even small data sets. Now, massive amounts of information about social networks and social interactions are recorded. This wealth of big data can allow social scientists to study social interactions on a scale and at a level of detail that has never before been possible. Our goal is to evaluate the value of big data in various social applications and build a framework that models the cost/utility of data. By considering important problems such as Trend Analysis, Opinion Change and User Behavior Analysis during major events in online social networks, we demonstrate the significance of this problem. Furthermore, in each case we present scalable techniques and algorithms that can be used in an online manner. Finally, we propose the big data value evaluation framework that weighs in the cost as well as the value of data to determine capacity modeling in the context of data acquisition.

References

[1]
New tweets per second record, and how! https://blog.twitter.com/2013/new-tweets-per-second-record-and-how
[2]
Aggarwal, C.C., Han, J., Wang, J., Yu, P.S.: A framework for clustering evolving data streams. In: Proc. 29th Int. Conf. on Very Large Data Bases, pp. 81—92. VLDB Endowment (2003)
[3]
Aggarwal, C.C., Yu, P.S.: Online analysis of community evolution in data streams. In: Proc. SIAM International Data Mining Conference (2005)
[4]
Allan, J. (ed.): Topic detection and tracking: event-based information organization. Kluwer Academic Publishers, Norwell (2002)
[5]
Alon, N., Yuster, R., Zwick, U.: Finding and counting given length cycles. Algorithmicaä1717, 209—223 (1997)
[6]
Bar-Yossef, Z., Kumar, R., Sivakumar, D.: Reductions in streaming algorithms, with an application to counting triangles in graphs. In: SODA 2002, pp. 623—632 (2002)
[7]
Becchetti, L., Boldi, P., Castillo, C., Gionis, A.: Efficient semi-streaming algorithms for local triangle counting in massive graphs. In: KDD 2008, pp. 16—24 (2008)
[8]
Charikar, M., Chen, K., Farach-Colton, M.: Finding frequent items in data streams. In: Widmayer, P., Triguero, F., Morales, R., Hennessy, M., Eidenbenz, S., Conejo, R. (eds.) ICALP 2002. LNCS, vol.ä2380, pp. 693—703. Springer, Heidelberg (2002)
[9]
Chen, B.: Topic oriented evolution and sentiment analysis. Ph.D. Dissertation, Penn State University (2011)
[10]
Chen, W., Wang, Y., Yang, S.: Efficient influence maximization in social networks. In: KDD 2009, pp. 199—208 (2009)
[11]
Chierichetti, F., Kleinberg, J., Panconesi, A.: How to schedule a cascade in an arbitrary graph. In: EC 2012, pp. 355—368 (2012)
[12]
Cormode, G., Hadjieleftheriou, M.: Finding frequent items in data streams. Proc. VLDB Endow.ä1(2), 1530—1541 (2008)
[13]
Cormode, G., Muthukrishnan, S.: What's Hot and What's Not: Tracking Most Frequent Items Dynamically. TODS 2005ä30(1), 249—278 (2005)
[14]
The curse of big data, http://www.analyticbridge.com/profiles/blogs/the-curse-of-big-data
[15]
Friedkin, N.E.: The attitude-behavior linkage in behavioral cascades. Social Psychology Quarterly, 73—196 (2010)
[16]
Glorot, X., Bordes, A., Bengio, Y.: Domain adaptation for large-scale sentiment classification: A deep learning approach. In: ICML 2011 (2011)
[17]
Hartline, J., Mirrokni, V., Sundararajan, M.: Optimal marketing strategies over social networks. In: WWW 2008, pp. 189—198 (2008)
[18]
Hong, L., Ahmed, A., Gurumurthy, S., Smola, A.J., Tsioutsiouliklis, K.: Discovering geographical topics in the twitter stream. In: WWW 2012, pp. 769—778 (2012)
[19]
Horrigan, J., Rainie, L.: When facing a tough decision, 60 million americans now seek the internet's help: The internet's growing role in life's major moments (2006), http://pewresearch.org/obdeck/?ObDeckID=19 (retrieved October 13, 2006)
[20]
Howe, J.: The rise of crowdsourcing. Northä14(14), 1—5 (2006)
[21]
Hughes, A.L., Palen, L.: Twitter adoption and use in mass convergence and emergency events. International Journal of Emergency Managementä6(3/4), 248 (2009)
[22]
Indonesia, brazil and venezuela lead global surge in twitter usage, http://www.comscore.com/Press_Events/Press_Releases/2010/8/Indonesia_Brazil_and_Venezuela_Lead_Global_Surge_in_Twitter_Usage
[23]
Jiang, L., Yu, M., Zhou, M., Liu, X., Zhao, T.: Target-dependent twitter sentiment classification. In: HLT 2011, pp. 151—160 (2011)
[24]
Jin, C., Qian, W., Sha, C., Yu, J.X., Zhou, A.: Dynamically maintaining frequent items over a data stream. In: CIKM 2003, pp. 287—294. ACM (2003)
[25]
Katz, I., Tunstrom, K., Ioannou, C., Huepe, C., Couzin, I.: Inferring the structure and dynamics of interactions in schooling fish. In: PNAS 2011, pp. 18720—18725 (2011)
[26]
Kempe, D., Kleinber, J., Tardos, E.: Maximizing the spread of influence through a social network. In: KDD 2003, pp. 137—146 (2003)
[27]
Kempe, D., Kleinberg, J.M., Tardos, É.: Maximizing the spread of influence through a social network. In: Proceedings of the Ninth ACM International Conference on Knowledge Discovery and Data Mining, pp. 137—146 (2003)
[28]
Kifer, D., Ben-David, S., Gehrke, J.: Detecting change in data streams. In: Proc. 30th Int. Conf. on Very Large Data Bases, pp. 180—191. VLDB Endowment (2004)
[29]
Kimura, M., Saito, K., Nakano, R., Motoda, H.: Extracting influential nodes on a social network for information diffusion. Data Mining and Knowledge Discoveryä20, 70—97 (2010)
[30]
Kimura, M., Saito, K., Ohara, K., Motoda, H.: Learning to predict opinion share in social networks. In: AAAI 2010, pp. 1364—1370 (2010)
[31]
Kittur, A., Kraut, R.E.: Harnessing the wisdom of crowds in wikipedia: quality through coordination. In: Proceedings of the 2008 ACM Conference on Computer Supported Cooperative Work, CSCW 2008, pp. 37—46. ACM, New York (2008)
[32]
Krishnamurthy, B., Gill, P., Arlitt, M.: A few chirps about twitter. In: Proceedings of the First Workshop on Online Social Networks, WOSN 2008, pp. 19—24. ACM (2008)
[33]
Kwak, H., Lee, C., Park, H., Moon, S.: What is Twitter, a social network or a news media. In: WWW 2010, pp. 591—600 (2010)
[34]
Leskovec, J., Backstrom, L., Kleinberg, J.: Meme-tracking and the dynamics of the news cycle. In: Proc. 15th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, pp. 497—506 (2009)
[35]
Leskovec, J., Backstrom, L., Kleinberg, J.: Meme-tracking and the dynamics of the news cycle. In: KDD 2009, pp. 497—506 (2009)
[36]
Libert, B., Spector, J.: We are smarter than me: how to unleash the power of crowds in your business, 1st edn. Wharton School Publishing (2007)
[37]
Lin, Y.R., Margolin, D., Keegan, B., Lazer, D.: Voices of Victory: A Computational Focus Group Framework for Tracking Opinion Shift in Real Time. In: WWW 2013, pp. 737—747 (2013)
[38]
MacEachren, A.M., Robinson, A.C., Jaiswal, A., Pezanov, S., Savelyev, A., Blanford, J., Mitra, P.: Geo-Twitter analytics: Application in crisis management. In: 25th International Cartographic Conference (July 2011)
[39]
Macropol, K., Singh, A.K.: Content-based modeling and prediction of information dissemination. In: ASONAM 2011, pp. 21—28 (2011)
[40]
Manku, G.S., Motwani, R.: Approximate frequency counts over data streams. In: VLDB 2002, pp. 346—357 (2002)
[41]
Mehta, R., Mehta, D., Chheda, D., Shah, C., Chawan, P.: Sentiment analysis and influence tracking using twitter. International Journal of Advanced Research in Computer Science and Electronics Engineeringä1, 72—79 (2012)
[42]
Melville, W.G.P., Lawrence, R.D.: Sentiment analysis of blogs by combining lexical knowledge with text classification. In: KDD 2009, pp. 1275—1284 (2009)
[43]
Metwally, A., Agrawal, D., El Abbadi, A.: An integrated efficient solution for computing frequent and top-k elements in data streams. ACM Trans. Database Syst.ä31(3), 1095—1133 (2006)
[44]
Metwally, A., Emekçi, F., Agrawal, D., El Abbadi, A.: Sleuth: Single-publisher attack detection using correlation hunting. Proc. VLDB Endow.ä1(2), 1217—1228 (2008)
[45]
Mudhakar, S., Srivatsa, L., Abdelzaher, T.: Mining diverse opinions. In: MILCOM 2012, pp. 1—7 (2012)
[46]
Palen, L.: Online social media in crisis events. Educause Quarterlyä(3), 76—78 (2008)
[47]
Patterson, S., Bamieh, B.: Interaction-driven opinion dynamics in online social networks. In: SOMA 2010, pp. 98—105 (2010)
[48]
Petrovic, S., Osborne, M., McCreadie, R., Macdonald, C., Ounis, I., Shrimpton, L.: Can Twitter replace Newswire for breaking news? In: ICWSM 2013, pp. 713—716 (2013)
[49]
Rosenfeld, A., Hummel, R.A., Zucker, S.W.: Scene labeling by relaxation operations. IEEE Transactions on Systems Man and Cyberneticsä6, 420—433 (1976)
[50]
Sachan, M., Contractor, D., Faruquie, T.A., Subramaniam, L.V.: Using content and interactions for discovering communities in social networks. In: WWW 2012, pp. 331—340 (2012)
[51]
Sakaki, T., Okazaki, M., Matsuo, Y.: Earthquake shakes twitter users: real-time event detection by social sensors. In: Proceedings of the 19th International Conference on World Wide Web, WWW 2010, pp. 851—860. ACM, New York (2010)
[52]
Sankaranarayanan, J., Samet, H., Teitler, B.E., Lieberman, M.D., Sperling, J.: Twitterstand: news in tweets. In: GIS 2009: Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, pp. 42—51. ACM, New York (2009)
[53]
Teitler, B.E., Lieberman, M.D., Panozzo, D., Sankaranarayanan, J., Samet, H., Sperling, J.: Newsstand: a new view on news. In: GIS 2008, pp. 1—10 (2008)
[54]
Teitler, B.E., Lieberman, M.D., Panozzo, D., Sankaranarayanan, J., Samet, H., Sperling, J.: Newsstand: a new view on news. In: GIS 2008: Proceedings of the 16th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, pp. 1—10. ACM, New York (2008)
[55]
Thelwall, M., Buckley, K., Paltoglou, G., Cai, D., Kappas, A.: Sentiment strength detection in short informal text. Journal of the American Society for Information Science and Technologyä61, 2544—2558 (2010)
[56]
Twitter, http://www.twitter.com
[57]
Wang, X., Wei, F., Liu, X., Zhou, M., Zhang, M.: Topic sentiment analysis in twitter: a graph-based hashtag sentiment classification approach. In: CIKM 2011, pp. 1031—1040 (2011)
[58]
Wu, M.: The big data fallacy and why we need to collect even bigger data. TechCrunch (2012)

Cited By

View all
  • (2022)Effects of the COVID-19 pandemic on the online learning behaviors of university students in TaiwanEducation and Information Technologies10.1007/s10639-021-10677-y27:1(469-491)Online publication date: 1-Jan-2022
  • (2019)Predicting Dynamic Embedding Trajectory in Temporal Interaction NetworksProceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining10.1145/3292500.3330895(1269-1278)Online publication date: 25-Jul-2019
  • (2017)Online Behavior Analysis-Based Student Profile for Intelligent E-LearningJournal of Electrical and Computer Engineering10.1155/2017/97203962017Online publication date: 1-Jan-2017
  • Show More Cited By

Index Terms

  1. Big Data in Online Social Networks: User Interaction Analysis to Model User Behavior in Social Networks
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Guide Proceedings
    DNIS 2014: Proceedings of the 9th International Workshop on Databases in Networked Information Systems - Volume 8381
    March 2014
    273 pages
    ISBN:9783319056920
    • Editors:
    • Aastha Madaan,
    • Shinji Kikuchi,
    • Subhash Bhalla

    Publisher

    Springer-Verlag

    Berlin, Heidelberg

    Publication History

    Published: 24 March 2014

    Author Tags

    1. Big Data
    2. Complex Networks
    3. Data Streams
    4. Social Analytics
    5. Social Networks

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 04 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)Effects of the COVID-19 pandemic on the online learning behaviors of university students in TaiwanEducation and Information Technologies10.1007/s10639-021-10677-y27:1(469-491)Online publication date: 1-Jan-2022
    • (2019)Predicting Dynamic Embedding Trajectory in Temporal Interaction NetworksProceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining10.1145/3292500.3330895(1269-1278)Online publication date: 25-Jul-2019
    • (2017)Online Behavior Analysis-Based Student Profile for Intelligent E-LearningJournal of Electrical and Computer Engineering10.1155/2017/97203962017Online publication date: 1-Jan-2017
    • (2016)I-HASTREAMProceedings of the 16th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing10.1109/CCGrid.2016.102(656-665)Online publication date: 16-May-2016
    • (2014)Big Graph AnalyticsProceedings of the 17th International Workshop on Data Warehousing and OLAP10.1145/2666158.2668454(99-101)Online publication date: 7-Nov-2014

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media