Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article
Public Access

Value and Misinformation in Collaborative Investing Platforms

Published: 04 May 2017 Publication History

Abstract

It is often difficult to separate the highly capable “experts” from the average worker in crowdsourced systems. This is especially true for challenge application domains that require extensive domain knowledge. The problem of stock analysis is one such domain, where even the highly paid, well-educated domain experts are prone to make mistakes. As an extremely challenging problem space, the “wisdom of the crowds” property that many crowdsourced applications rely on may not hold.
In this article, we study the problem of evaluating and identifying experts in the context of SeekingAlpha and StockTwits, two crowdsourced investment services that have recently begun to encroach on a space dominated for decades by large investment banks. We seek to understand the quality and impact of content on collaborative investment platforms, by empirically analyzing complete datasets of SeekingAlpha articles (9 years) and StockTwits messages (4 years). We develop sentiment analysis tools and correlate contributed content to the historical performance of relevant stocks. While SeekingAlpha articles and StockTwits messages provide minimal correlation to stock performance in aggregate, a subset of experts contribute more valuable (predictive) content. We show that these authors can be easily identified by user interactions, and investments based on their analysis significantly outperform broader markets. This effectively shows that even in challenging application domains, there is a secondary or indirect wisdom of the crowds.
Finally, we conduct a user survey that sheds light on users’ views of SeekingAlpha content and stock manipulation. We also devote efforts to identify potential manipulation of stocks by detecting authors controlling multiple identities.

References

[1]
Ahmed Abbasi and Hsinchun Chen. 2008. Writeprints: A stylometric approach to identity-level identification and similarity detection in cyberspace. ACM Transactions on Information Systems (TOIS) 26, 2 (2008), 7.
[2]
Lada A. Adamic, Jun Zhang, Eytan Bakshy, and Mark S. Ackerman. 2008. Knowledge sharing and yahoo answers: Everyone knows something. In Proc. of World Wide Web (WWW).
[3]
Sadia Afroz, Michael Brennan, and Rachel Greenstadt. 2012. Detecting hoaxes, frauds, and deception in writing style online. In Proc. of IEEE S8P.
[4]
Leman Akoglu, Rishi Chandy, and Christos Faloutsos. 2013. Opinion fraud detection in online reviews by network effects. In Proc. of ICWSM.
[5]
Roy Bar-Haim, Elad Dinur, Ronen Feldman, Moshe Fresko, and Guy Goldstein. 2011. Identifying and following expert investors in stock microblogs. In Proc. of EMNLP.
[6]
Luciano Barbosa and Junlan Feng. 2010. Robust sentiment detection on Twitter from biased and noisy data. In Proc. of COLING.
[7]
Fabrício Benevenuto, Gabriel Magno, Tiago Rodrigues, and Virgilio Almeida. 2010. Detecting spammers on Twitter. In Proc. of CEAS.
[8]
Mudit Bhargava, Pulkit Mehndiratta, and Krishna Asawa. 2013. Stylometric analysis for authorship attribution on Twitter. In Proc. of International Conference on Big Data Analytics. 37--47.
[9]
Johan Bollen, Huina Mao, and Xiaojun Zeng. 2011. Twitter mood predicts the stock market. Journal of Computational Science 2, 1 (2011), 1--8.
[10]
Eric D. Brown. 2012. Will Twitter make you a better investor? A look at sentiment, user reputation and their effect on the stock market. In Proc. of SAIS.
[11]
Hailiang Chen, Prabuddha De, J. Hu, and Byoung-Hyoun Hwang. 2014. Wisdom of crowds: The value of stock opinions transmitted through social media. Review of Financial Studies 27, 5 (2014), 1367--1403.
[12]
Munmun De Choudhury and others. 2008. Can blog communication dynamics be correlated with stock market activity? In Proc. of HyperText.
[13]
Douglas W. Diamond and Robert E. Verrecchia. 1987. Constraints on short-selling and asset price adjustment to private information. Journal of Financial Economics 18, 2 (1987), 277--311.
[14]
Andrea Esuli and Fabrizio Sebastiani. 2007. Pageranking wordnet synsets: An application to opinion mining. In Proc. of ACL.
[15]
Adam Feuerstein. 2014. Galena Biopharma Pays For Stock-Touting Campaign While Insiders Cash Out Millions. TheStreet News. (February 2014).
[16]
Clifton Forlines, Sarah Miller, Leslie Guelcher, and Robert Bruzzi. 2014. Crowdsourcing the future: Predictions made with a social network. In Proc. of CHI.
[17]
Gabriel Pui Cheong Fung, Jeffrey Xu Yu, and Wai Lam. 2003. Stock prediction: Integrating text mining approach using real-time news. In Proc. of CIFER.
[18]
Hongyu Gao and others. 2010. Detecting and characterizing social spam campaigns. In Proc. of IMC.
[19]
Mikros K. George and Eleni K. Argiri. 2007. Investigating topic influence in authorship attribution. In Proc. of PAN.
[20]
Eric Gilbert and Karrie Karahalios. 2010. Widespread worry and the stock market. In Proc. of ICWSM.
[21]
Namrata Godbole, Manja Srinivasaiah, and Steven Skiena. 2007. Large-scale sentiment analysis for news and blogs. In Proc. of ICWSM.
[22]
Pollyanna Gonçalves, Matheus Araújo, Fabrício Benevenuto, and Meeyoung Cha. 2013. Comparing and combining sentiment analysis methods. In Proc. of COSN.
[23]
F. Maxwell Harper, Daphne Raban, Sheizaf Rafaeli, and Joseph A. Konstan. 2008. Predictors of answer quality in online Q8A sites. In Proc. of CHI.
[24]
Investment 2013. 2013 Investment Company Fact Book. Technical Report. Investment Company Institute. Retrieved from http://www.ici.org/pdf/2013_factbook.pdf.
[25]
Michal Jacovi, Ido Guy, Shiri Kremer-Davidson, Sara Porat, and Netta Aizenbud-Reshef. 2014. The perception of others: Inferring reputation from social media in the enterprise. In Proc. of CSCW.
[26]
Yigitcan Karabulut. 2011. Can Facebook predict stock market activity? SSRN eLibrary (2011).
[27]
Joy Kim, Justin Cheng, and Michael S Bernstein. 2014. Ensemble: Exploring complementary strengths of leaders and crowds in creative collaboration. In Proc. of CSCW.
[28]
John Kimelman. 2014. An insider’s tale of a stock promotion plan. Barrons News. (March 2014).
[29]
Saijel Kishan and Kelly Bit. 2013. Hedge funds trail stocks by the widest margin since 2005. Bloomberg News. (December 2013).
[30]
Aniket Kittur, Jeffrey V. Nickerson, Michael Bernstein, Elizabeth Gerber, Aaron Shaw, John Zimmerman, Matt Lease, and John Horton. 2013. The future of crowd work. In Proc. of CSCW.
[31]
Dan Klein and Christopher D. Manning. 2003. Accurate unlexicalized parsing. In Proc. of ACL.
[32]
Q. Vera Liao, Claudia Wagner, Peter Pirolli, and Wai-Tat Fu. 2012. Understanding experts’ and novices’ expertise judgment of Twitter users. In Proc. of CHI.
[33]
Wenhui Liao, Sameena Shah, and Masoud Makrehchi. 2014. Winning by following the winners: Mining the behaviour of stock market experts in social media. In SBP. 103--110.
[34]
Bing Liu. 2012. Sentiment analysis and opinion mining. Synthesis Lectures on Human Language Technologies 5, 1 (2012), 1--167.
[35]
Tim Loughran and Bill McDonald. 2011. When is a liability not a liability? Textual analysis, dictionaries, and 10-Ks. Journal of Finance 66, 1 (2011), 35--65.
[36]
Masoud Makrehchi, Sameena Shah, and Wenhui Liao. 2013. Stock prediction using event-based sentiment analysis. In Proc. of WI-IAT.
[37]
Helen Susannah Moat, Chester Curme, Adam Avakian, Dror Y. Kenett, H. Eugene Stanley, and Tobias Preis. 2013. Quantifying wikipedia usage patterns before stock market moves. Scientific Reports 3 (2013).
[38]
Arjun Mukherjee, Abhinav Kumar, Bing Liu, Junhui Wang, Meichun Hsu, Malu Castellanos, and Riddhiman Ghosh. 2013. Spotting opinion spammers using behavioral footprints. In Proc. of SIGKDD. ACM, 632--640.
[39]
Arvind Narayanan, Hristo Paskov, Neil Zhenqiang Gong, John Bethencourt, Emil Stefanov, Eui Chul Richard Shin, and Dawn Song. 2012. On the feasibility of internet-scale author identification. In Proc. of IEEE S8P.
[40]
Jeffrey Nichols, Michelle Zhou, Huahai Yang, Jeon-Hyung Kang, and Xiao Hua Sun. 2013. Analyzing the quality of information solicited from targeted strangers on social media. In Proc. of CSCW.
[41]
Chong Oh and Olivia Sheng. 2011. Investigating predictive power of stock micro blog sentiment in forecasting future stock price directional movement. In Proc. of ICIS.
[42]
Nuno Oliveira, Paulo Cortez, and Nelson Areal. 2013. On the predictability of stock market behavior using stockTwits sentiment and posting volume. In Progress in AI. 355--365.
[43]
Judith S. Olson and Wendy A. Kellogg. 2014. Ways of Knowing in HCI. Springer.
[44]
Jahna Otterbacher. 2009. “Helpfulness” in online communities: A measure of message quality. In Proc. of CHI.
[45]
Bo Pang and Lillian Lee. 2008. Opinion mining and sentiment analysis. Foundations and Trends® in Information Retrieval 2, 1--2 (2008), 1--135.
[46]
Bo Pang, Lillian Lee, and Shivakumar Vaithyanathan. 2002. Thumbs up?: Sentiment classification using machine learning techniques. In Proc. of ACL.
[47]
Karl Pearson. 1895. Contributions to the mathematical theory of evolution. II. Skew variation in homogeneous material. Philosophical Transactions of the Royal Society A 186 (1895), 343--414.
[48]
Richard Pearson. 2014. Behind the promotion of Northwest Bio. Seeking Alpha. (July 2014).
[49]
Tobias Preis, Helen Susannah Moat, and H. Eugene Stanley. 2013. Quantifying trading behavior in financial markets using google trends. Scientific Reports 3 (2013).
[50]
Tushar Rao and Saket Srivastava. 2012. Analyzing stock market movements using Twitter sentiment analysis. In Proc. of ASONAM.
[51]
Robert P. Schumaker and Hsinchun Chen. 2009. Textual analysis of stock market prediction using breaking financial news: The AZFin text system. ACM Transactions on Information Systems (TOIS) 27, 2 (2009), 12.
[52]
SeekingAlpha. 2014. About SeekingAlpha. http://seekingalpha.com/page/about_us.
[53]
Victor S. Sheng, Foster Provost, and Panagiotis G. Ipeirotis. 2008. Get another label? Improving data quality and data mining using multiple, noisy labelers. In Proc. of KDD. 614--622.
[54]
Dae-Neung Sohn, Jung-Tae Lee, and Hae-Chang Rim. 2009. The contribution of stylistic information to content-based mobile spam filtering. In Proc. of the ACL. 321--324.
[55]
Timm O. Sprenger, Andranik Tumasjan, Philipp G. Sandner, and Isabell M. Welpe. 2013. Tweets and trades: The information content of stock microblogs. European Financial Management (2013).
[56]
StockTwits. 2014. About StockTwits. http://stocktwits.com/about.
[57]
Yu-An Sun and Christopher R. Dance. 2012. When majority voting fails: Comparing quality assurance methods for noisy human computation environment. In Proc. of Collective Intelligence.
[58]
Yla R. Tausczik, Aniket Kittur, and Robert E. Kraut. 2014. Collaborative problem solving: A study of mathoverflow. In Proc. of CSCW.
[59]
Dylan Tweney. 2013. Seeking Alpha: Who needs an acquisition when were doing so well? VentureBeat News. (October 2013).
[60]
Bimal Viswanath and others. 2010. An analysis of social network-based sybil defenses. In Proc. of SIGCOMM.
[61]
Alex Hai Wang. 2010. Don’t follow me: Spam detection in Twitter. In Proc. of SECRYPT.
[62]
Gang Wang, Konark Gill, Manish Mohanlal, Haitao Zheng, and Ben Y. Zhao. 2013a. Wisdom in the social crowd: An analysis of quora. In Proc. of WWW.
[63]
Gang Wang, Tristan Konolige, Christo Wilson, Xiao Wang, Haitao Zheng, and Ben Y. Zhao. 2013b. You are how you click: Clickstream analysis for sybil detection. In Proc. of USENIX Security.
[64]
Gang Wang, Manish Mohanlal, Christo Wilson, Xiao Wang, Miriam Metzger, Haitao Zheng, and Ben Y. Zhao. 2013c. Social turing tests: Crowdsourcing sybil detection. In Proc. of NDSS.
[65]
Gang Wang, Tianyi Wang, Bolun Wang, Divya Sambasivan, Zengbin Zhang, Haitao Zheng, and Ben Y. Zhao. 2015. Crowds on wall street: Extracting value from collaborative investing platforms. In Proc. of CSCW.
[66]
Gang Wang, Tianyi Wang, Haitao Zheng, and Ben Y. Zhao. 2014. Man vs. machine: Practical adversarial detection of malicious crowdsourcing workers. In Proc. of USENIX Security.
[67]
Gang Wang, Christo Wilson, Xiaohan Zhao, Yibo Zhu, Manish Mohanlal, Haitao Zheng, and Ben Y. Zhao. 2012. Serf and turf: Crowdturfing for fun and profit. In Proc. of WWW.
[68]
Anbang Xu, Shih-Wen Huang, and Brian Bailey. 2014. Voyant: Generating structured feedback on visual designs using a crowd of non-experts. In Proc. of CSCW.
[69]
Yahoo. 2014. Yahoo Finance API. https://code.google.com/p/yahoo-finance-managed/.
[70]
Zhi Yang and others. 2011. Uncovering social network sybils in the wild. In Proc. of IMC.
[71]
Haifeng Yu, Michael Kaminsky, Phillip B. Gibbons, and Abraham Flaxman. 2006. SybilGuard: Defending against sybil attacks via social networks. In Proc. of SIGCOMM.
[72]
Rong Zheng, Jiexun Li, Hsinchun Chen, and Zan Huang. 2006. A framework for authorship identification of online messages: Writing-style features and classification techniques. Journal of the American Society for Information Science and Technology 57, 3 (2006), 378--393.

Cited By

View all
  • (2024)Misinformation Knowledge Synthesis: Using Meta-Analysis to Develop Interdisciplinary InterventionsProceedings of the ALISE Annual Conference10.21900/j.alise.2024.1744Online publication date: 17-Oct-2024
  • (2024)Demand-driven Urban Facility Visit PredictionACM Transactions on Intelligent Systems and Technology10.1145/362523315:2(1-24)Online publication date: 22-Feb-2024
  • (2024)FollowAKOInvestorExpert Systems with Applications: An International Journal10.1016/j.eswa.2024.123522249:PBOnline publication date: 1-Sep-2024
  • Show More Cited By

Recommendations

Reviews

Salvatore F. Pileggi

Collaborative investing platforms often rely on the common "wisdom of the crowd" concept in a domain in which even highly paid, well-educated, and experienced professionals make mistakes, providing wrong or inaccurate evaluations. Under the realistic assumption of the coexistence between capable experts and average workers, the authors analyze data from two well-known platforms, focusing on the thin line that separates the direct and the indirect wisdom of crowds. Indeed, the empirical experiments, consisting in the correlation between sentiment analysis and historical performance of relevant stocks, clearly show that experts can be identified. Those people provide the real key value. The remaining part of the network plays a key role by helping to indirectly identify the valuable content through interactions. I appreciated this contribution. The authors provide clear results and seem aware of both the advantages as well as the limitations of the adopted approach. Moreover, the future work outlined in the paper looks very promising, especially concerning the possible implementation of systems underpinned by meta-reputation. Online Computing Reviews Service

Access critical reviews of Computing literature here

Become a reviewer for Computing Reviews.

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on the Web
ACM Transactions on the Web  Volume 11, Issue 2
May 2017
199 pages
ISSN:1559-1131
EISSN:1559-114X
DOI:10.1145/3079924
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 04 May 2017
Accepted: 01 December 2016
Revised: 01 September 2016
Received: 01 March 2016
Published in TWEB Volume 11, Issue 2

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Crowdsourcing
  2. sentiment analysis
  3. stock market

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)220
  • Downloads (Last 6 weeks)38
Reflects downloads up to 17 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Misinformation Knowledge Synthesis: Using Meta-Analysis to Develop Interdisciplinary InterventionsProceedings of the ALISE Annual Conference10.21900/j.alise.2024.1744Online publication date: 17-Oct-2024
  • (2024)Demand-driven Urban Facility Visit PredictionACM Transactions on Intelligent Systems and Technology10.1145/362523315:2(1-24)Online publication date: 22-Feb-2024
  • (2024)FollowAKOInvestorExpert Systems with Applications: An International Journal10.1016/j.eswa.2024.123522249:PBOnline publication date: 1-Sep-2024
  • (2023)Predicting abnormal trading behavior from internet rumor propagation: a machine learning approachFinancial Innovation10.1186/s40854-022-00423-99:1Online publication date: 3-Jan-2023
  • (2023)F$^{3}$3VeTrac: Enabling Fine-Grained, Fully-Road-Covered, and Fully-Individual- Penetrative Vehicle Trajectory RecoveryIEEE Transactions on Mobile Computing10.1109/TMC.2023.330187123:5(4975-4991)Online publication date: 4-Aug-2023
  • (2022)Analysing the controversial social media community2022 IEEE 16th International Scientific Conference on Informatics (Informatics)10.1109/Informatics57926.2022.10083476(299-303)Online publication date: 23-Nov-2022
  • (2021)Tag a Teacher: A Qualitative Analysis of WhatsApp-Based Teacher Networks in Low-Income Indian SchoolsProceedings of the 2021 CHI Conference on Human Factors in Computing Systems10.1145/3411764.3445221(1-16)Online publication date: 6-May-2021
  • (2021)Managing Users’ Behaviors on Open Content Crowdsourcing PlatformJournal of Computer Information Systems10.1080/08874417.2021.198348762:6(1125-1135)Online publication date: 22-Oct-2021
  • (2021)Assessing dynamic qualities of investor sentiments for stock recommendationInformation Processing & Management10.1016/j.ipm.2020.10245258:2(102452)Online publication date: Mar-2021
  • (2020)FollowAKOInvestor: Using Machine Learning to Hear Voices from All Kinds of Investors2020 IEEE 32nd International Conference on Tools with Artificial Intelligence (ICTAI)10.1109/ICTAI50040.2020.00137(875-882)Online publication date: Nov-2020
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Full Access

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media