Publishing Service

Polishing & Checking

Frontiers of Information Technology & Electronic Engineering

ISSN 2095-9184 (print), ISSN 2095-9230 (online)

TibetanGoTinyNet: a lightweight U-Net style network for zero learning of Tibetan Go

Abstract: The game of Tibetan Go faces the scarcity of expert knowledge and research literature. Therefore, we study the zero learning model of Tibetan Go under limited computing power resources and propose a novel scale-invariant U-Net style two-headed output lightweight network TibetanGoTinyNet. The lightweight convolutional neural networks and capsule structure are applied to the encoder and decoder of TibetanGoTinyNet to reduce computational burden and achieve better feature extraction results. Several autonomous self-attention mechanisms are integrated into TibetanGoTinyNet to capture the Tibetan Go board’s spatial and global information and select important channels. The training data are generated entirely from self-play games. TibetanGoTinyNet achieves 62%-78% winning rate against other four U-Net style models including Res-UNet, Res-UNet Attention, Ghost-UNet, and Ghost Capsule-UNet. It also achieves 75% winning rate in the ablation experiments on the attention mechanism with embedded positional information. The model saves about 33% of the training time with 45%-50% winning rate for different Monte-Carlo tree search (MCTS) simulation counts when migrated from 9×9 to 11×11 boards. Code for our model is available at https://github.com/paulzyy/TibetanGoTinyNet.

Key words: Zero learning; Tibetan Go; U-Net; Self-attention mechanism; Capsule network; Monte-Carlo tree search

Chinese Summary  <7> TibetanGoTinyNet:一种应用于藏式围棋的U型网络风格的轻量级零学习模型

李霞丽1,2,张焱垠1,2,吴立成1,2,陈彦东1,2,喻俊志3
1中央民族大学民族语言智能分析与安全治理教育部重点实验室,中国北京市,100081
2中央民族大学信息工程学院,中国北京市,100081
3北京大学工学院先进制造与机器人系,中国北京市,100871
摘要:藏式围棋面临专家知识和研究文献匮乏的问题。因此,我们研究了有限计算能力资源下藏式围棋的零学习模型,并提出一种新颖的尺度不变U型网络(U-Net)风格的双头输出轻量级网络TibetanGoTinyNet。该网络的编码和解码器应用了轻量级卷积神经网络(CNN)和胶囊网络,以减少计算负担并提升特征提取效果。网络中集成了数种自注意力机制,以捕获藏式围棋棋盘的空间和全局信息,并选择有价值通道。训练数据完全由自我对弈生成。TibetanGoTinyNet在与Res-UNet,Res-UNet Attention,Ghost-UNet和Ghost Capsule-UNet 4个U-Net风格模型的对弈中获得了62%–78%的胜率。在捕获棋盘位置信息的轻量级自注意机制消融实验中,它也实现了75%的胜率。当模型从9×9棋盘直接迁移到11×11棋盘时,该模型在不同的蒙特卡洛树搜索(MCTS)次数下节省了约33%的训练时间,并获得了45%–50%的胜率。本文模型代码可在https://github.com/paulzyy/TibetanGoTinyNet上获取。

关键词组:零学习;藏式围棋;U型网络;自注意力机制;胶囊网络;蒙特卡洛树搜索


Share this article to: More

Go to Contents

References:

<Show All>

Open peer comments: Debate/Discuss/Question/Opinion

<1>

Please provide your name, email address and a comment





DOI:

10.1631/FITEE.2300493

CLC number:

TP39

Download Full Text:

Click Here

Downloaded:

622

Download summary:

<Click Here> 

Downloaded:

287

Clicked:

967

Cited:

0

On-line Access:

2024-08-27

Received:

2023-10-17

Revision Accepted:

2024-05-08

Crosschecked:

2023-12-17

Journal of Zhejiang University-SCIENCE, 38 Zheda Road, Hangzhou 310027, China
Tel: +86-571-87952276; Fax: +86-571-87952331; E-mail: jzus@zju.edu.cn
Copyright © 2000~ Journal of Zhejiang University-SCIENCE