DOI:10.1145/3613905.3650810
Work in Progress

NLP4Gov: A Comprehensive Library for Computational Policy Analysis

Published: 11 May 2024

Abstract

Formal rules and policies are fundamental to specifying a social system: its operation, boundaries, processes, and even ontology. Recent scholarship has highlighted the role of formal policy in collective knowledge creation, game communities, the production of digital public goods, and national social media governance. Researchers have shown interest in how online communities convene tenable self-governance mechanisms to regulate member activities and distribute rights and privileges by designating responsibilities, roles, and hierarchies. We present NLP4Gov, an interactive kit to train and aid scholars and practitioners alike in computational policy analysis. The library explores and integrates methods and capabilities from computational linguistics and NLP to generate semantic and symbolic representations of community policies from text records. Versatile, documented, and accessible, NLP4Gov provides granular and comparative views into institutional structures and interactions, along with other information extraction capabilities for downstream analysis.

Published In

CHI EA '24: Extended Abstracts of the 2024 CHI Conference on Human Factors in Computing Systems
May 2024
4761 pages
ISBN:9798400703317
DOI:10.1145/3613905
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. Collective Action
  2. OSS Governance
  3. Online Communities
  4. Open Source Software
  5. Peer Production
  6. Policy Analysis

Qualifiers

  • Work in progress
  • Research
  • Refereed limited

Conference

CHI '24

Acceptance Rates

Overall Acceptance Rate 6,164 of 23,696 submissions, 26%
