research-article

Building concise logical patterns by constraining tsetlin machine clause size

AUTHORs:

K. Darshana Abeyrathna,

Ahmed A. O. Abouzeid,

Bimal Bhattarai,

Sondre Glimsdal,

Ole-Christoffer Granmo,

Jivitesh Sharma,

Svein A. Tunheim,

Xuan ZhangAuthors Info & Claims

IJCAI '23: Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence

Article No.: 378, Pages 3395 - 3403

https://doi.org/10.24963/ijcai.2023/378

Published: 19 August 2023 Publication History

Abstract

Tsetlin machine (TM) is a logic-based machine learning approach with the crucial advantages of being transparent and hardware-friendly. While TMs match or surpass deep learning accuracy for an increasing number of applications, large clause pools tend to produce clauses with many literals (long clauses). As such, they become less interpretable. Further, longer clauses increase the switching activity of the clause logic in hardware, consuming more power. This paper introduces a novel variant of TM learning - Clause Size Constrained TMs (CSC-TMs) - where one can set a soft constraint on the clause size. As soon as a clause includes more literals than the constraint allows, it starts expelling literals. Accordingly, oversized clauses only appear transiently. To evaluate CSC-TM, we conduct classification, clustering, and regression experiments on tabular data, natural language text, images, and board games. Our results show that CSC-TM maintains accuracy with up to 80 times fewer literals. Indeed, the accuracy increases with shorter clauses for TREC, IMDb, and BBC Sports. After the accuracy peaks, it drops gracefully as the clause size approaches a single literal. We finally analyze CSC-TM power consumption and derive new convergence properties.

References

[1]

K. Darshana Abeyrathna, Ole-Christoffer Granmo, and Morten Goodwin. Extending the Tsetlin Machine With Integer-Weighted Clauses for Increased Interpretability. arXiv preprint arXiv:2005.05131, 2020.

[2]

K. Darshana Abeyrathna, Ole-Christoffer Granmo, Rishad Shafik, Alex Yakovlev, Adrian Wheeldon, Jie Lei, and Morten Goodwin. A Novel Multi-Step Finite-State Automaton for Arbitrarily Deterministic Tsetlin Machine Learning. In the 40th International Conference on Innovative Techniques and Applications of Artificial Intelligence (SGAI-2020). Springer International Publishing, 2020.

[3]

K. Darshana Abeyrathna, Ole-Christoffer Granmo, Xuan Zhang, Lei Jiao, and Morten Goodwin. The Regression Tsetlin Machine - A Novel Approach to Interpretable Non-Linear Regression. Philosophical Transactions of the Royal Society A, 378, 2020.

[4]

K. Darshana Abeyrathna, Bimal Bhattarai, Morten Goodwin, Saeed Gorji, Ole-Christoffer Granmo, Lei Jiao, Rupsa Saha, and Rohan K. Yadav. Massively Parallel and Asynchronous Tsetlin Machine Architecture Supporting Almost Constant-Time Scaling. In International Conference on Machine Learning (ICML), 2021.

[5]

Ahmed Abouzeid, Ole-Christoffer Granmo, Morten Goodwin, and Christian Webersik. Label-Critic Tsetlin Machine: A Novel Self-supervised Learning Scheme for Interpretable Clustering. In International Symposium on the Tsetlin Machine (ISTM), pages 41-48. IEEE, 2022.

[6]

Abu Bakar, Tousif Rahman, Alessandro Montanari, Jie Lei, Rishad Shafik, and Fahim Kawsar. Logic-based Intelligence for Batteryless Sensors. In the Annual InternationalWorkshop on Mobile Computing Systems and Applications (HotMobile), pages 22-28. Association for Computing Machinery, 2022.

Digital Library

[7]

Abu Bakar, Tousif Rahman, Rishad Shafik, Fahim Kawsar, and Alessandro Montanari. Adaptive Intelligence for Batteryless Sensors Using Software-Accelerated Tsetlin Machines. In the 20th Conference on Embedded Networked Sensor Systems. ACM, 2022.

[8]

Bimal Bhattarai, Ole-Christoffer Granmo, and Lei Jiao. Measuring the novelty of natural language text using the conjunctive clauses of a Tsetlin machine text classifier. In International Conference on Agents and Artificial Intelligence, 2021.

[9]

Bimal Bhattarai, Ole-Christoffer Granmo, and Lei Jiao. Explainable Tsetlin Machine Framework for Fake News Detection with Credibility Score Assessment. In the 13th Conference on Language Resources and Evaluation, pages 4894-4903, 2022.

[10]

Bimal Bhattarai, Ole-Christoffer Granmo, and Lei Jiao. Word-level Human Interpretable Scoring Mechanism for Novel Text Detection Using Tsetlin Machines. Applied Intelligence, 52:17465-17489, 2022.

Digital Library

[11]

Bimal Bhattarai, Ole-Christoffer Granmo, and Lei Jiao. An interpretable knowledge representation framework for natural language processing with cross-domain application. In Advances in Information Retrieval: 45th European Conference on Information Retrieval, ECIR 2023, Dublin, Ireland, April 2-6, 2023, Proceedings, Part I, pages 167-181, 2023.

Digital Library

[12]

Bimal Bhattarai, Ole-Christoffer Granmo, Lei Jiao, Rohan Yadav, and Jivitesh Sharma. Tsetlin Machine Embedding: Representing Words Using Logical Expressions. arXiv preprint arXiv:2301.00709, 2023.

[13]

Karl Audun Kagnes Borgersen, Morten Goodwin, and Jivitesh Sharma. A Comparison Between Tsetlin Machines and Deep Neural Networks in the Context of Recommendation Systems. arXiv preprint arXiv:2212.10136, 2022.

[14]

Eric Chang, Frank Seide, Helen M Meng, Zhuoran Chen, Yu Shi, and Yuk-Chi Li. A System for Spoken Query Information Retrieval on Mobile Devices. IEEE Transactions on Speech and Audio processing, 10(8):531-541, 2002.

[15]

William J. Dally, Harting R. Curtis, and Tor M. Aamodt. Digital Design Using VHDL: a Systems Approach. Cambridge University Press, 2016.

[16]

Franca Debole and Fabrizio Sebastiani. An analysis of the relative hardness of reuters- 21578 subsets. Journal of the American Society for Information Science and technology, 56(6):584-596, 2005.

Digital Library

[17]

Charul Giri, Ole-Christoffer Granmo, Herke Van Hoof, and Christian D. Blakely. Logic-based AI for Interpretable Board Game Winner Prediction with Tsetlin Machine. In 2022 International Joint Conference on Neural Networks (IJCNN), pages 1-9, 2022.

[18]

Sondre Glimsdal and Ole-Christoffer Granmo. Coalesced Multi-Output Tsetlin Machines with Clause Sharing. arXiv preprint, arXiv:2108.07594, 2021.

[19]

Sondre Glimsdal, Rupsa Saha, Bimal Bhattarai, Charul Giri, Jivitesh Sharma, Svein Anders Tunheim, and Rohan Kumar Yadav. Focused Negative Sampling for Increased Discriminative Power in Tsetlin Machines. In 2022 International Symposium on the Tsetlin Machine (ISTM), pages 73-80, 2022.

[20]

Ole-Christoffer Granmo, Sondre Glimsdal, Lei Jiao, Morten Goodwin, Christian W. Omlin, and Geir Thore Berge. The Convolutional Tsetlin Machine. arXiv preprint arXiv:1905.09688, 2019.

[21]

Ole-Christoffer Granmo. The Tsetlin Machine - A Game Theoretic Bandit Driven Approach to Optimal Pattern Recognition with Propositional Logic. arXiv preprint arXiv:1804.01508, 2018.

[22]

Derek Greene and Pádraig Cunningham. Practical Solutions to the Problem of Diagonal Dominance in Kernel Document Clustering. In International Conference on Machine Learning (ICML), pages 377-384. ACM Press, 2006.

Digital Library

[23]

Iris Hendrickx, Su Nam Kim, Zornitsa Kozareva, Preslav Nakov, Diarmuid Ó Séaghdha, Sebastian Padó, Marco Pennacchiotti, Lorenza Romano, and Stan Szpakowicz. Semeval-2010 task 8: Multiway classification of semantic relations between pairs of nominals. In the Workshop on Semantic Evaluations: Recent Achievements and Future Directions, pages 94-99. Association for Computational Linguistics, 2009.

[24]

Lei Jiao, Xuan Zhang, and Ole-Christoffer Granmo. On the Convergence of Tsetlin Machines for the AND and the OR Operators. arXiv preprint https://arxiv.org/abs/2109.09488, 2021.

[25]

Lei Jiao, Xuan Zhang, Ole-Christoffer Granmo, and K. Darshana Abeyrathna. On the Convergence of Tsetlin Machines for the XOR operator. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(5):6072-6085, 2023.

Digital Library

[26]

Jie Lei, Tousif Rahman, Rishad Shafik, Adrian Wheeldon, Alex Yakovlev, Ole-Christoffer Granmo, Fahim Kawsar, and Akhil Mathur. Low-Power Audio Keyword Spotting Using Tsetlin Machines. Journal of Low Power Electronics and Applications, 11, 2021.

[27]

Yitao Liang and Guy Van den Broeck. Learning logistic circuits. In Proceedings of the 33rd Conference on Artificial Intelligence (AAAI), jan 2019.

Digital Library

[28]

Yitao Liang and Guy Van den Broeck. Learning logistic circuits. In the AAAI Conference on Artificial Intelligence, volume 33, pages 4277-4286, 2019.

Digital Library

[29]

Andrew Maas, Raymond E Daly, Peter T Pham, Dan Huang, Andrew Y Ng, and Christopher Potts. LearningWord Vectors for Sentiment Analysis. In the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pages 142-150, 2011.

[30]

Sidharth Maheshwari, Tousif Rahman, Rishad ShafikSenior member, Alex Yakovlev, Ashur Rafiev, Lei Jiao, and Ole-Christoffer Granmo. Redress: Generating compressed models for edge inference using tsetlin machines. IEEE Transactions on Pattern Analysis and Machine Intelligence, pages 1-16, 2023.

[31]

Rupsa Saha, Ole-Christoffer Granmo, and Morten Goodwin. Using Tsetlin Machine to Discover Interpretable Rules in Natural Language Processing Applications. Expert Systems, page e12873, 2021.

[32]

Rupsa Saha, Ole-Christoffer Granmo, Vladimir Zadorozhny, and Morten Goodwin. A Relational Tsetlin Machine with Applications to Natural Language Understanding. Journal of Intelligent Information Systems, 2022.

Digital Library

[33]

Raihan Seraj, Jivitesh Sharma, and Ole Christoffer Granmo. Tsetlin Machine for Solving Contextual Bandit Problems. In Neural Information Processing Systems (NeurIPS), 2022.

[34]

Jivitesh Sharma, Rohan Kumar Yadav, Ole-Christoffer Granmo Granmo, and Lei Jiao. Drop Clause: Enhancing Performance, Robustness and Pattern Recognition Capabilities of the Tsetlin Machine. In the AAAI Conference on Artificial Intelligence (AAAI), 2023.

[35]

Michael Lvovitch Tsetlin. On Behaviour of Finite Automata in Random Medium. Avtomat. i Telemekh, 22(10):1345-1354, 1961.

[36]

Leslie G Valiant. A Theory of the Learnable. Communications of the ACM, 27(11):1134-1142, 1984.

Digital Library

[37]

Adrian Wheeldon, Rishad Shafik, Tousif Rahman, Jie Lei, Alex Yakovlev, and Ole-Christoffer Granmo. Learning Automata based Energy-efficient AI Hardware Design for IoT. Philosophical Transactions of the Royal Society A, 2020.

[38]

Rohan Yadav, Lei Jiao, Ole-Christoffer Granmo, and Morten Goodwin. Enhancing interpretable clauses semantically using pretrained word representation. In the 4th BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2021.

[39]

Rohan Yadav, Lei Jiao, Ole-Christoffer Granmo, and Morten Goodwin. Human-Level Interpretable Learning for Aspect-Based Sentiment Analysis. In the AAAI Conference on Artificial Intelligence (AAAI), 2021.

[40]

Rohan Kumar Yadav, Lei Jiao, Ole Christoffer Granmo, and Morten Goodwin. Robust Interpretable Text Classification against Spurious Correlations Using AND-rules with Negation. In International Joint Conference on Artificial Intelligence (IJCAI), 2022.

[41]

Xuan Zhang, Lei Jiao, Ole-Christoffer Granmo, and Morten Goodwin. On the Convergence of Tsetlin Machines for the IDENTITY- and NOT Operators. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(10):6345-6359, 2022.

Digital Library

Index Terms

Building concise logical patterns by constraining tsetlin machine clause size
1. Computing methodologies
  1. Machine learning
    1. Learning paradigms
      1. Supervised learning
        Supervised learning by classification
    2. Machine learning approaches
      1. Classification and regression trees
      2. Neural networks
2. Information systems
  1. Information systems applications

Index terms have been assigned to the content through auto-classification.

Recommendations

Clause elimination for SAT and QSAT

The famous archetypical NP-complete problem of Boolean satisfiability (SAT) and its PSPACE-complete generalization of quantified Boolean satisfiability (QSAT) have become central declarative programming paradigms through which real-world instances of ...
Limitations of restricted branching in clause learning

The techniques for making decisions, that is, branching, play a central role in complete methods for solving structured instances of constraint satisfaction problems (CSPs). In this work we consider branching heuristics in the context of propositional ...
Lean clause-sets: generalizations of minimally unsatisfiable clause-sets
The renesse issue on satisfiability

We study the problem of (efficiently) deleting such clauses from conjunctive normal forms (clause-sets) which cannot contribute to any proof of unsatisfiability. For that purpose we introduce the notion of an autarky system A, which detects deletion of ...

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings

IJCAI '23: Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence

August 2023

7242 pages

ISBN:978-1-956792-03-4

Editor:
Edith Elkind

Copyright © 2023 International Joint Conferences on Artificial Intelligence.

Sponsors

International Joint Conferences on Artifical Intelligence (IJCAI)

Publisher

Unknown publishers

Publication History

Published: 19 August 2023

Qualifiers

Research-article
Research
Refereed limited

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 30 Aug 2024

Other Metrics

View Author Metrics

Citations

View Options

View options

Media

Figures

Other

Tables

View Table of Contents