Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Predicting User Posting Activities in Online Health Communities with Deep Learning

Published: 21 July 2020 Publication History
  • Get Citation Alerts
  • Abstract

    Online health communities (OHCs) represent a great source of social support for patients and their caregivers. Better predictions of user activities in OHCs can help improve user engagement and retention, which are important to manage and sustain a successful OHC. This article proposes a general framework to predict OHC user posting activities. Deep learning methods are adopted to learn from users’ temporal trajectories in both the volumes and content of posts published over time. Experiments based on data from a popular OHC for cancer survivors demonstrate that the proposed approach can improve the performance of user activity predictions. In addition, several topics of users’ posts are found to have strong impact on predicting users’ activities in the OHC.

    References

    [1]
    André Altmann, Laura Toloşi, Oliver Sander, and Thomas Lengauer. 2010. Permutation importance: A corrected feature importance measure. Bioinformatics 26, 10 (2010), 1340--1347.
    [2]
    David M. Blei, Andrew Y. Ng, and Michael I. Jordan. 2003. Latent Dirichlet allocation. J. Mach. Learn. Res 3 (2003), 993--1022.
    [3]
    Andrea Bommert, Xudong Sun, Bernd Bischl, Jörg Rahnenführer, and Michel Lang. 2020. Benchmark for filter methods for feature selection in high-dimensional classification data. Comput. Stat. Data Anal. 143 (2020), 106839.
    [4]
    Roger Burrows, Sarah Nettleton, Nicholas Pleace, Brian Loader, and Steven Muncer. 2000. Virtual community care? Social policy and the emergence of computer mediated social support. Inf., Commun. Soc. 3, 1 (2000), 95--121.
    [5]
    Langtao Chen. 2019. A classification framework for online social support using deep learning. In Proceedings of the International Conference on Human-Computer Interaction. 178--188.
    [6]
    Munmun De Choudhury and Sushovan De. 2014. Mental health discourse on Reddit: Self-disclosure, social support, and anonymity. In Proceedings of the 8th International AAAI Conference on Weblogs and Social Media.
    [7]
    Koustuv Dasgupta, Rahul Singh, Balaji Viswanathan, Dipanjan Chakraborty, Sougata Mukherjea, Amit A. Nanavati, and Anupam Joshi. 2008. Social ties and their relevance to churn in mobile telecom networks. In Proceedings of the 11th International Conference on Extending Database Technology: Advances in Database Technology (EDBT’08). ACM, New York, NY, 668.
    [8]
    Christine Dunkel-Schetter. 1984. Social support and cancer: Findings based on patient interviews and their implications. J. Soc. Issues 40, 4 (1984), 77--98.
    [9]
    Susannah Fox. 2011. The Social Life of Health Information, 2011. Pew Internet 8 American Life Project, Washington, DC.
    [10]
    Hong Han, Xiaoling Guo, and Hua Yu. 2016. Variable selection using mean decrease accuracy and mean decrease Gini based on random forest. In Proceedings of the 2016 7th IEEE International Conference on Software Engineering and Service Science (ICSESS’16). IEEE, Los Alamitos, CA, 219--224.
    [11]
    Md Al Mehedi Hasan, Mohammed Nasser, Shamim Ahmad, and Khademul Islam Molla. 2016. Feature selection for intrusion detection using random forest. J. Inf. Secur. 07, 03 (2016), 129--140.
    [12]
    Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural Comput. 9, 8 (1997), 1735--1780.
    [13]
    Ian T. Jolliffe and Jorge Cadima. 2016. Principal component analysis: A review and recent developments. Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 374, 2065 (2016), 20150202.
    [14]
    Hamed Khanpour, Cornelia Caragea, and Prakhar Biyani. 2018. Identifying emotional support in online health communities. In Proceedings of the 32nd AAAI Conference on Artificial Intelligence.
    [15]
    Amanda Leiter, Tomasz Sablinski, Michael Diefenbach, Marc Foster, Alex Greenberg, John Holland, William K. Oh, and Matthew D. Galsky. 2014. Use of crowdsourcing for cancer clinical trial development. J. Natl. Cancer Inst. 106, 10 (2014), dju258.
    [16]
    Xi Long, Wenjing Yin, Le An, Haiying Ni, Lixian Huang, Qi Luo, and Yan Chen. 2012. Churn analysis of online social network users using data mining techniques. In Proceedings of the International MultiConference of Engineers and Computer Scientists.
    [17]
    Yingjie Lu. 2013. Automatic topic identification of health-related messages in online health community using text classification. Springerplus 2, 1 (2013), 309.
    [18]
    Diane Maloney-Krichmar and Jenny Preece. 2005. A multilevel analysis of sociability, usability, and community dynamics in an online health community. ACM Trans. Comput. Interact. 12, 2 (2005), 201--232.
    [19]
    Michael V. Mannino and Murlidhar V. Koushik. 2000. The cost-minimizing inverse classification problem: A genetic algorithm approach. Decis. Support Syst. 29, 3 (2000), 283--300.
    [20]
    Bjoern H. Menze, B. Michael Kelm, Ralf Masuch, Uwe Himmelreich, Peter Bachert, Wolfgang Petrich, and Fred A. Hamprecht. 2009. A comparison of random forest and its Gini importance with standard chemometric methods for the feature selection and classification of spectral data. BMC Bioinformatics 10, 1 (2009), 213.
    [21]
    Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. arXiv:1301.3781.
    [22]
    Blaise Ngonmang, Emmanuel Viennet, and Maurice Tchuente. 2012. Churn prediction in a real online social network using local community analysis. In Proceedings of the 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining. IEEE, Los Alamitos, CA, 282--288.
    [23]
    Jie Ning and Robert G. Beiko. 2015. Phylogenetic approaches to microbial community classification. Microbiome 3, 1 (2015), 47.
    [24]
    Jennifer L. Pearson, Michael S. Amato, George D. Papandonatos, Kang Zhao, Bahar Erar, Xi Wang, Sarah Cha, Amy M. Cohn, and Amanda L. Graham. 2018. Exposure to positive peer sentiment about nicotine replacement therapy in an online smoking cessation community is associated with NRT use. Addict. Behav. 87 (2018), 39--45.
    [25]
    Kenneth Portier, Greta E. Greer, Lior Rokach, Nir Ofek, Yafei Wang, Prakhar Biyani, Mo Yu, Siddhartha Banerjee, Kang Zhao, Prasenjit Mitra, and J. Yen. 2013. Understanding topics and sentiment in an online cancer survivor community. JNCI Monogr. 2013, 47 (2013), 195--198.
    [26]
    Jagat Sastry Pudipeddi, Leman Akoglu, and Hanghang Tong. 2014. User churn in focused question answering sites. In Proceedings of the 23rd International Conference on World Wide Web (WWW’14 Companion). ACM, New York, NY, 469--474.
    [27]
    Baojun Qiu, Kang Zhao, Prasenjit Mitra, Dinghao Wu, Cornelia Caragea, John Yen, Greta E. Greer, and Kenneth Portier. 2011. Get online support, feel better—Sentiment analysis and dynamics in an online cancer survivor community. In Proceedings of the 2011 IEEE 3rd International Conference on Privacy, Security, Risk, and Trust and the 2011 IEEE 3rd International Conference on Social Computing. IEEE, Los Alamitos, CA, 274--281.
    [28]
    Avi Rosenfeld, Sigal Sina, David Sarne, Or Avidov, and Sarit Kraus. 2018. A study of WhatsApp usage patterns and prediction models without message content. arXiv:1802.03393.
    [29]
    Matthew Rowe. 2013. Mining user lifecycles from online community platforms and their application to churn prediction. In Proceedings of the 2013 IEEE 13th International Conference on Data Mining. IEEE, Los Alamitos, CA, 637--646.
    [30]
    NehaShankar Sharma. 2015. Patient centric approach for clinical trials: Current trend and new opportunities. Perspect. Clin. Res. 6, 3 (2015), 134.
    [31]
    Fengxi Song, Zhongwei Guo, and Dayong Mei. 2010. Feature selection using principal component analysis. In Proceedings of the 2010 International Conference on System Science, Engineering Design, and Manufacturing Informatization. IEEE, Los Alamitos, CA, 27--30.
    [32]
    Vivek Veeriah, Naifan Zhuang, and Guo-Jun Qi. 2015. Differential recurrent neural networks for action recognition. In Proceedings of the IEEE International Conference on Computer Vision. 4041--4049.
    [33]
    Xi Wang, Kang Zhao, Sarah Cha, Michael S. Amato, Amy M. Cohn, Jennifer L. Pearson, George D. Papandonatos, and Amanda L. Graham. 2019. Mining user-generated content in an online smoking cessation community to identify smoking status: A machine learning approach. Decis. Support Syst. 116 (2019), 26--34.
    [34]
    Xi Wang, Kang Zhao, and Nick Street. 2014. Social support and user engagement in online health communities. In Proceedings of the International Conference on Smart Health. 97--110.
    [35]
    Xi Wang, Kang Zhao, and Nick Street. 2017. Analyzing and predicting user participations in online health communities: A social support perspective. J. Med. Internet Res. 19, 4 (2017), e130.
    [36]
    Yi-Chia Wang, Robert Kraut, and John M. Levine. 2012. To stay or leave? In Proceedings of the ACM 2012 Conference on Computer Supported Cooperative Work (CSCW’12). ACM, New York, NY, 833.
    [37]
    Ronghua Xu and Qingpeng Zhang. 2016. Understanding online health groups for depression: Social network and linguistic perspectives. J. Med. Internet Res. 18, 3 (2016), e63.
    [38]
    Ronghua Xu, Jiaqi Zhou, Qingpeng Zhang, and James A. Hendler. 2018. Research on online health communities: A systematic review. Encyclopedia of Social Network Analysis and Mining, R. Alhajj and J. Rokne (Eds). Springer, New York, NY.
    [39]
    Carl Yang, Xiaolin Shi, Luo Jie, and Jiawei Han. 2018. I know you'll be back. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, New York, NY, 914--922.
    [40]
    Christopher C. Yang and Ling Jiang. 2018. Enriching user experience in online health communities through thread recommendations and heterogeneous information network mining. IEEE Trans. Comput. Soc. Syst. 5, 4 (2018), 1049--1060.
    [41]
    Haodong Yang and Christopher C. Yang. 2015. Using health-consumer-contributed data to detect adverse drug reactions by association mining with temporal analysis. ACM Trans. Intell. Syst. Technol. 6, 4 (2015), 1--27.
    [42]
    Mi Zhang and Christopher C. Yang. 2015. Using content and network analysis to understand the social support exchange patterns and user behaviors of an online smoking cessation intervention program. J. Assoc. Inf. Sci. Technol. 66, 3 (2015), 564--575.
    [43]
    Shaodian Zhang, Edouard Grave, Elizabeth Sklar, and Noémie Elhadad. 2017. Longitudinal analysis of discussion topics in an online breast cancer community using convolutional neural networks. J. Biomed. Inform. 69, (2017), 1--9.
    [44]
    Shaodian Zhang, Lin Qiu, Frank Chen, Weinan Zhang, Yong Yu, and Noémie Elhadad. 2017. We make choices we think are going to save us. In Proceedings of the 26th International Conference on World Wide Web Companion (WWW’17 Companion). ACM, New York, NY, 1073--1081.
    [45]
    Kang Zhao, Greta E. Greer, John Yen, Prasenjit Mitra, and Kenneth Portier. 2015. Leader identification in an online health community for cancer survivors: A social network-based classification approach. Inf. Syst. E-Bus. Manag. 13, 4 (2015), 629--645.
    [46]
    Kang Zhao, Xi Wang, Sarah Cha, Amy M. Cohn, George D. Papandonatos, Michael S. Amato, Jennifer L. Pearson, and Amanda L. Graham. 2016. A multirelational social network analysis of an online health community for smoking cessation. J. Med. Internet Res. 18, 8 (2016), e233.
    [47]
    Binjun Zhu, Xiaofeng Cai, and Ruichu Cai. 2018. Answer quality evaluation in online health care community. In Proceedings of the 2018 International Conference on Network, Communication, and Computer Engineering (NCCE’18). France.

    Cited By

    View all
    • (2024)Unsupervised domain adaptation with local structure preservation for colon histopathological image classificationJournal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology10.3233/JIFS-23492046:1(1129-1142)Online publication date: 1-Jan-2024
    • (2024)Domain Generalization in Time Series ForecastingACM Transactions on Knowledge Discovery from Data10.1145/364303518:5(1-24)Online publication date: 31-Jan-2024
    • (2024)Patient influencer: the impact of homophily on value co-creation behaviour in online health communitiesBehaviour & Information Technology10.1080/0144929X.2024.2349961(1-21)Online publication date: 3-May-2024
    • Show More Cited By

    Index Terms

    1. Predicting User Posting Activities in Online Health Communities with Deep Learning

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Transactions on Management Information Systems
        ACM Transactions on Management Information Systems  Volume 11, Issue 3
        Special Section on WITS 2018 and Regular Articles
        September 2020
        140 pages
        ISSN:2158-656X
        EISSN:2158-6578
        DOI:10.1145/3407737
        Issue’s Table of Contents
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 21 July 2020
        Online AM: 07 May 2020
        Accepted: 01 February 2020
        Revised: 01 November 2019
        Received: 01 June 2019
        Published in TMIS Volume 11, Issue 3

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. Predictive model
        2. text analytics
        3. trajectory mining
        4. user churn

        Qualifiers

        • Research-article
        • Research
        • Refereed

        Funding Sources

        • National Natural Science Foundation of China

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)35
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 26 Jul 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2024)Unsupervised domain adaptation with local structure preservation for colon histopathological image classificationJournal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology10.3233/JIFS-23492046:1(1129-1142)Online publication date: 1-Jan-2024
        • (2024)Domain Generalization in Time Series ForecastingACM Transactions on Knowledge Discovery from Data10.1145/364303518:5(1-24)Online publication date: 31-Jan-2024
        • (2024)Patient influencer: the impact of homophily on value co-creation behaviour in online health communitiesBehaviour & Information Technology10.1080/0144929X.2024.2349961(1-21)Online publication date: 3-May-2024
        • (2024)Redirected transfer learning for robust multi-layer subspace learningPattern Analysis & Applications10.1007/s10044-024-01233-827:1Online publication date: 28-Feb-2024
        • (2023)ForETaxi: Data-Driven Fleet-Oriented Charging Resource Allocation in Large-Scale Electric Taxi NetworksACM Transactions on Sensor Networks10.1145/357095819:3(1-25)Online publication date: 2-Mar-2023
        • (2023)Synthetic Behavior Sequence Generation Using Generative Adversarial NetworksACM Transactions on Computing for Healthcare10.1145/35639504:1(1-23)Online publication date: 27-Feb-2023
        • (2023)Multimodal Image Synthesis and Editing: The Generative AI EraIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2023.330524345:12(15098-15119)Online publication date: 1-Dec-2023
        • (2023)Vision-Language Matching for Text-to-Image Synthesis via Generative Adversarial NetworksIEEE Transactions on Multimedia10.1109/TMM.2022.321738425(7062-7075)Online publication date: 1-Jan-2023
        • (2023)Generalizing to Unseen Domains: A Survey on Domain GeneralizationIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.317812835:8(8052-8072)Online publication date: 1-Aug-2023
        • (2023)Dynamic Classifier Alignment for Unsupervised Multi-Source Domain AdaptationIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.314442335:5(4727-4740)Online publication date: 1-May-2023
        • Show More Cited By

        View Options

        Get Access

        Login options

        Full Access

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        HTML Format

        View this article in HTML Format.

        HTML Format

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media