research-article

The Design of Reciprocal Learning Between Human and Artificial Intelligence

Published: 18 October 2021

Abstract

The need for advanced automation and artificial intelligence (AI) in various fields, including text classification, has increased dramatically in the last decade, leaving us critically dependent on their performance and reliability. Yet, as we rely more on AI applications, their algorithms are becoming more nuanced, more complex, and less understandable, precisely at a time when we need to understand them better and trust them to perform as expected. Text classification in the medical and cybersecurity domains is a good example of a task where we may wish to keep the human in the loop. Human experts lack the capacity to deal with the high volume and velocity of data that needs to be classified, while machine learning (ML) techniques are often unexplainable and cannot capture the context needed to make the right decision and take action. We propose a new abstract configuration of Human-Machine Learning (HML) that focuses on reciprocal learning, where the human and the AI are collaborating partners. We employ design-science research (DSR) to learn and design an application of the HML configuration, which incorporates software to support combining human and artificial intelligences. We define the HML configuration by its conceptual components and their functions. We then describe the development of a system called Fusion that supports human-machine reciprocal learning. Using two case studies of text classification from the cyber domain, we evaluate Fusion and the proposed HML approach, demonstrating benefits and challenges. Our results show a clear ability of domain experts to improve the ML classification performance over time, while human and machine collaboratively develop their conceptualization, i.e., their knowledge of classification. We generalize our insights from the DSR process as actionable principles for researchers and designers of 'human in the learning loop' systems. We conclude the paper by discussing HML configurations and the challenge of capturing and representing knowledge gained jointly by human and machine, an area we feel has great potential.
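To make the idea of reciprocal learning concrete, the following is a minimal sketch of one feedback round between a classifier and a domain expert. It is not the Fusion system described in the abstract, whose design is not given here. The `KeywordClassifier` class, the `reciprocal_round` function, and the `expert` callback are all hypothetical names introduced for illustration, and the word-count scorer is a deliberately simple stand-in for a real ML model.

```python
from collections import Counter, defaultdict


class KeywordClassifier:
    """Toy bag-of-words scorer (an assumption, not the paper's model)."""

    def __init__(self):
        # label -> how often each word was seen in texts with that label
        self.counts = defaultdict(Counter)

    def learn(self, text, label):
        self.counts[label].update(text.lower().split())

    def predict(self, text):
        words = text.lower().split()
        scores = {
            label: sum(seen[w] for w in words)
            for label, seen in self.counts.items()
        }
        if not scores:
            return None  # nothing learned yet
        # sorted() makes ties resolve alphabetically, so runs are repeatable
        return max(sorted(scores), key=scores.get)

    def explain(self, text, label):
        """Words in the text that support `label` (machine-to-human direction)."""
        seen = self.counts[label]
        return [w for w in text.lower().split() if seen[w] > 0]


def reciprocal_round(model, texts, expert):
    """Machine predicts and explains; the expert confirms or corrects; machine relearns.

    `expert` is a hypothetical callback (text, guess, evidence) -> label.
    Returns the number of predictions the expert had to correct.
    """
    corrections = 0
    for text in texts:
        guess = model.predict(text)
        evidence = model.explain(text, guess) if guess is not None else []
        truth = expert(text, guess, evidence)
        if truth != guess:
            corrections += 1
        model.learn(text, truth)  # human-to-machine direction
    return corrections
```

Calling `reciprocal_round` repeatedly on incoming batches and tracking the returned correction count gives a rough view of whether the machine is improving over time. The evidence passed to `expert` is where a real system would surface explanations that help the human refine their own conceptualization.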


Published In

Proceedings of the ACM on Human-Computer Interaction  Volume 5, Issue CSCW2
CSCW2
October 2021
5376 pages
EISSN:2573-0142
DOI:10.1145/3493286
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States


Author Tags

  1. AI
  2. accuracy
  3. context
  4. cyber-security
  5. explainability
  6. feedback
  7. human intelligence
  8. text classification

Qualifiers

  • Research-article


Cited By

  • (2023) ML-Based Teaching Systems: A Conceptual Framework. Proceedings of the ACM on Human-Computer Interaction. 10.1145/3610197. 7:CSCW2 (1-25). Online publication date: 4-Oct-2023
  • (2023) Building Knowledge through Action: Considerations for Machine Learning in the Workplace. ACM Transactions on Computer-Human Interaction. 10.1145/3584947. 30:5 (1-51). Online publication date: 23-Sep-2023
  • (2023) Human-Computer Collaborative Visual Design Creation Assisted by Artificial Intelligence. ACM Transactions on Asian and Low-Resource Language Information Processing. 10.1145/3554735. 22:9 (1-21). Online publication date: 22-Sep-2023
  • (2023) Faulty or Ready? Handling Failures in Deep-Learning Computer Vision Models until Deployment: A Study of Practices, Challenges, and Needs. Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems. 10.1145/3544548.3581555. (1-20). Online publication date: 19-Apr-2023
  • (2023) What is Human-Centered about Human-Centered AI? A Map of the Research Landscape. Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems. 10.1145/3544548.3580959. (1-23). Online publication date: 19-Apr-2023
  • (2023) Assertiveness-based Agent Communication for a Personalized Medicine on Medical Imaging Diagnosis. Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems. 10.1145/3544548.3580682. (1-20). Online publication date: 19-Apr-2023
  • (2023) Human-Machine Task Allocation in Learning Reciprocally to Solve Problems. HCI International 2023 – Late Breaking Posters. 10.1007/978-3-031-49215-0_9. (65-77). Online publication date: 12-Dec-2023
  • (2023) Conducting Design Science Research in Society 5.0 – Proposal of an Explainable Artificial Intelligence Research Methodology. Design Science Research for a New Society: Society 5.0. 10.1007/978-3-031-32808-4_16. (250-265). Online publication date: 31-May-2023
  • (2022) SMARTEN—A Sample-Based Approach towards Privacy-Friendly Data Refinement. Journal of Cybersecurity and Privacy. 10.3390/jcp2030031. 2:3 (606-628). Online publication date: 15-Aug-2022
  • (2022) Reciprocal Learning in Production and Logistics. IFAC-PapersOnLine. 10.1016/j.ifacol.2022.09.519. 55:10 (854-859). Online publication date: 2022
