Abstract
Once lossless compressed data is damaged in the transmission process, a specific fault-tolerant decompression algorithm is required to correct the error. A novel fault-tolerant decompression method for English text files has been proposed for error detection and correction in previous research work, which is trained with natural language model of English. In this paper, we transfer action scope of this fault-tolerant decompression method from English compressed files to Chinese text files. In order to apply the algorithm framework to Chinese compressed files, a N-Gram-based language model is built adhering to the compression coding rules and the grammar rules of Chinese, with which the prior information of source can be fully expressed, and the character encoding adaptation problem in the process of algorithm transfer-learning is solved. The experiment results demonstrate that the proposed algorithm can meet the requirements for fault-tolerant decompression of lossless compressed Chinese text files.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Park B, Savoldi A, Gubian P, Park J, Lee S (2008) Recovery of damaged compressed files for digital forensic purposes. In: International conference on multimedia and ubiquitous engineering, MUE 2008. IEEE
Kwon B, Gong M, Lee S (2017) Novel error detection algorithm for LZSS compressed data. IEEE Access 5(99):8940–8947
Chen YX, Zheng H, Zhao YQ et al (2010) A new LZW fault-tolerant decoding algorithm based on remaining redundancy of sources. J Wuhan Univ Technol 32(10):159–163
Wang DG, Zhao XQ, Sun QQ (2018) Novel fault-tolerant decompression method of corrupted Huffman files. Wirel Pers Commun 102(4):2555–2574
Wang DG, Zhao XQ, Sun QQ (2018) Novel fault-tolerant decompression method of corrupted LZSS files. Wirel Pers Commun 102:2499–2518
Wang DG (2018) Fault-tolerant decompression of lossless compressed files data. Tongji University, Shanghai, pp 12–100
Bengio Y, Ducharme R, Vincent P et al (2003) A neural probabilistic language model. J Mach Learn Res 3:1137–1155
Li Y, Bo L (2008) A normalized Levenshtein distance metric. IEEE Trans Pattern Anal Mach Intell 2007, 29(6):1091–1095
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Wang, Y., Zhao, X., Wang, D. (2020). Fault-Tolerant Decompression Method of Compressed Chinese Text Files. In: Liang, Q., Wang, W., Liu, X., Na, Z., Jia, M., Zhang, B. (eds) Communications, Signal Processing, and Systems. CSPS 2019. Lecture Notes in Electrical Engineering, vol 571. Springer, Singapore. https://doi.org/10.1007/978-981-13-9409-6_114
Download citation
DOI: https://doi.org/10.1007/978-981-13-9409-6_114
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-9408-9
Online ISBN: 978-981-13-9409-6
eBook Packages: EngineeringEngineering (R0)