Proceedings of the 3rd International Conference on Management Science and Software Engineering (ICMSSE 2023)

English Pronunciation Error Detection Based on Multimedia Data

Authors
Lijuan Wu1, *
1English Department, Heilongjiang International University, Harbin, China
*Corresponding author. Email: cerrity1982@sina.com
Corresponding Author
Lijuan Wu
Available Online 9 October 2023.
DOI
10.2991/978-94-6463-262-0_91How to use a DOI?
Keywords
media data; English; Pronunciation; detection
Abstract

Pronunciation is an interdisciplinary subject, and speech recognition is gradually becoming the key technology of man-machine interface in information technology. In recent years, the application of computer speech recognition has made great progress. Based on the special status of English, there have been many designs and productions of English as a second language speech database in the world. However, due to the increasing popularity of English, more and more people use English as a second language, so it is necessary to establish an English pronunciation error detection system for multimedia data. In this paper, the initial data is designed, made and trained as a model, and the data of standard phonetic database (using the existing database of AVICAR) is tested and compared with the collected phonetic database. It is found that the recognition rate of collected voice data is much lower than that of standard voice data, and the conclusion is drawn that it is important to collect voice databases from different regions. The reason of low recognition rate is analyzed. Then, the data collected in the phonetic database are compared with each other according to different regions, and the reasons for the difference in recognition rate are summarized, which provides reference experience for the design and manufacture of phonetic database.

Copyright
© 2024 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Download article (PDF)

Volume Title
Proceedings of the 3rd International Conference on Management Science and Software Engineering (ICMSSE 2023)
Series
Atlantis Highlights in Engineering
Publication Date
9 October 2023
ISBN
978-94-6463-262-0
ISSN
2589-4943
DOI
10.2991/978-94-6463-262-0_91How to use a DOI?
Copyright
© 2024 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Cite this article

TY  - CONF
AU  - Lijuan Wu
PY  - 2023
DA  - 2023/10/09
TI  - English Pronunciation Error Detection Based on Multimedia Data
BT  - Proceedings of the 3rd International Conference on Management Science and Software Engineering (ICMSSE 2023)
PB  - Atlantis Press
SP  - 872
EP  - 878
SN  - 2589-4943
UR  - https://doi.org/10.2991/978-94-6463-262-0_91
DO  - 10.2991/978-94-6463-262-0_91
ID  - Wu2023
ER  -