English Pronunciation Error Detection Based on Multimedia Data
- DOI
- 10.2991/978-94-6463-262-0_91How to use a DOI?
- Keywords
- media data; English; Pronunciation; detection
- Abstract
Pronunciation is an interdisciplinary subject, and speech recognition is gradually becoming the key technology of man-machine interface in information technology. In recent years, the application of computer speech recognition has made great progress. Based on the special status of English, there have been many designs and productions of English as a second language speech database in the world. However, due to the increasing popularity of English, more and more people use English as a second language, so it is necessary to establish an English pronunciation error detection system for multimedia data. In this paper, the initial data is designed, made and trained as a model, and the data of standard phonetic database (using the existing database of AVICAR) is tested and compared with the collected phonetic database. It is found that the recognition rate of collected voice data is much lower than that of standard voice data, and the conclusion is drawn that it is important to collect voice databases from different regions. The reason of low recognition rate is analyzed. Then, the data collected in the phonetic database are compared with each other according to different regions, and the reasons for the difference in recognition rate are summarized, which provides reference experience for the design and manufacture of phonetic database.
- Copyright
- © 2024 The Author(s)
- Open Access
- Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.
Cite this article
TY - CONF AU - Lijuan Wu PY - 2023 DA - 2023/10/09 TI - English Pronunciation Error Detection Based on Multimedia Data BT - Proceedings of the 3rd International Conference on Management Science and Software Engineering (ICMSSE 2023) PB - Atlantis Press SP - 872 EP - 878 SN - 2589-4943 UR - https://doi.org/10.2991/978-94-6463-262-0_91 DO - 10.2991/978-94-6463-262-0_91 ID - Wu2023 ER -