CLC number:
On-line Access: 2024-08-27
Received: 2023-10-17
Revision Accepted: 2024-05-08
Crosschecked: 0000-00-00
Cited: 0
Clicked: 596
Yinghao LI, Heyan HUANG, Baojun WANG, Yang GAO. DRMSpell: dynamically reweighting multimodality for Chinese spelling correction[J]. Frontiers of Information Technology & Electronic Engineering, 1998, -1(-1): .
@article{title="DRMSpell: dynamically reweighting multimodality for Chinese spelling correction",
author="Yinghao LI, Heyan HUANG, Baojun WANG, Yang GAO",
journal="Frontiers of Information Technology & Electronic Engineering",
volume="-1",
number="-1",
pages="",
year="1998",
publisher="Zhejiang University Press & Springer",
doi="10.1631/FITEE.2300816"
}
%0 Journal Article
%T DRMSpell: dynamically reweighting multimodality for Chinese spelling correction
%A Yinghao LI
%A Heyan HUANG
%A Baojun WANG
%A Yang GAO
%J Journal of Zhejiang University SCIENCE C
%V -1
%N -1
%P
%@ 2095-9184
%D 1998
%I Zhejiang University Press & Springer
%DOI 10.1631/FITEE.2300816
TY - JOUR
T1 - DRMSpell: dynamically reweighting multimodality for Chinese spelling correction
A1 - Yinghao LI
A1 - Heyan HUANG
A1 - Baojun WANG
A1 - Yang GAO
J0 - Journal of Zhejiang University Science C
VL - -1
IS - -1
SP -
EP -
%@ 2095-9184
Y1 - 1998
PB - Zhejiang University Press & Springer
ER -
DOI - 10.1631/FITEE.2300816
Abstract: chinese spelling correction (CSC) is a task that aims to detect and correct the spelling errors that may occur in Chinese texts. However, the Chinese language exhibits a high degree of complexity, characterized by the presence of multiple phonetic representations known as pinyin, which possess distinct tonal variations that can correspond to various characters. In light of the complexity inherent in the Chinese language, the CSC task becomes imperative for ensuring the accuracy and clarity of written communication. Recent research has included external knowledge into the model using phonological and visual modalities. However, these methods do not effectively utilize the modality information in a targeted manner for addressing the different types of errors. In this paper, we propose a multimodal pretrained language model called DRMSpell for CSC, which takes into consideration the interaction between the modalities. A dynamically reweighting multimodality (DRM) module is introduced to reweight various modalities for obtaining more multimodal information. To fully utilize the multimodal information obtained and to further strengthen the model, an independent-modality masking strategy (IMS) is proposed to independently mask three modalities of a token in the pretraining stage. Our method achieves state-of-the-art performance on most metrics constituting widely used benchmarks. The findings of the experiments demonstrate that our method is capable of modeling the interactive information between modalities and is also robust to incorrect modal information.
Open peer comments: Debate/Discuss/Question/Opinion
<1>