Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3529466.3529485acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiciaiConference Proceedingsconference-collections
research-article

Mass Mahjong Decision System Based on Transfer Learning

Published: 04 June 2022 Publication History

Abstract

In this paper, we propose a transfer learning to solve the problem of lacking in data and the difficulty in constructing models effectively, which is typically represented by Mass Mahjong in the field of imperfect information. Design and implement the Mass Mahjong Discard model based on transfer learning. The previously well-trained Blood Mahjong Discard model on a large dataset is migrated to Mass Mahjong Discard model in a similar domain. In the subsequent model optimization, a self-play based approaching is used to improve the Mass Mahjong Discard model. The experimental results show that the transfer learning-based Mass Mahjong Discard model performs well in the situation of less data, and can fit the Mass Mahjong Discard rule. And the model won the second prize in the Mass Mahjong event of the National University Computer Gaming Competition in 2021.

References

[1]
Silver D, Huang A, Maddison C J, Mastering the game of Go with deep neural networks and tree search[J]. nature, 2016, 529(7587): 484-489.
[2]
Zha D, Xie J, Ma W, DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning[J]. arXiv preprint arXiv:2106.06135, 2021.
[3]
Silver D, Hubert T, Schrittwieser J, Mastering chess and shogi by self-play with a general reinforcement learning algorithm[J]. arXiv preprint arXiv:1712.01815, 2017.
[4]
Van der Kleij A A J. Monte Carlo tree search and opponent modeling through player clustering in no-limit Texas hold'em poker[J]. University of Groningen, The Netherlands, 2010.
[5]
Li J, Koyamada S, Ye Q, Suphx: Mastering Mahjong with Deep Reinforcement Learning[J]. arXiv preprint arXiv:2003.13590, 2020.
[6]
Gao S, Li S. Bloody Mahjong playing strategy based on the integration of deep learning and XGBoost[J]. CAAI Transactions on Intelligence Technology, 2021.
[7]
Qingyue Wang. Game of Mahjong [M]. Chengdu: Shurong Chess Publishing House.2003.
[8]
Gao S, Okuya F, Kawahara Y, Supervised Learning of Imperfect Information Data in the Game of Mahjong via Deep Convolutional Neural Networks[J]. Information Processing Society of Japan, 2018.
[9]
Gao S, Okuya F, Kawahara Y, Building a Computer Mahjong Player via Deep Convolutional Neural Networks[J]. arXiv preprint arXiv:1906.02146, 2019.
[10]
Wang M, Yan T, Luo M, A novel deep residual network-based incomplete information competition strategy for four-players Mahjong games[J]. Multimedia Tools and Applications, 2019: 1-25.
[11]
Huang G, Liu Z, Van Der Maaten L, Densely connected convolutional networks[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2017: 4700-4708.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICIAI '22: Proceedings of the 2022 6th International Conference on Innovation in Artificial Intelligence
March 2022
240 pages
ISBN:9781450395502
DOI:10.1145/3529466
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 04 June 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Artificial intelligence
  2. Machine learning
  3. Mahjong
  4. Transfer learning

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • Construction Project of computer technology specialty
  • Normal projects of General Science and Technology research program
  • Normal projects of promoting graduated education program at Beijing Information Science and Technology University

Conference

ICIAI 2022

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 59
    Total Downloads
  • Downloads (Last 12 months)9
  • Downloads (Last 6 weeks)1
Reflects downloads up to 26 Jan 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media