English Pronunciation Error Detection Based on Multimedia Data

Lijuan Wu

doi:10.2991/978-94-6463-262-0_91

<Previous Article In Volume

Next Article In Volume>

English Pronunciation Error Detection Based on Multimedia Data

Authors

Lijuan Wu¹^{, *}

¹English Department, Heilongjiang International University, Harbin, China

^*Corresponding author. Email: cerrity1982@sina.com

Corresponding Author

Lijuan Wu

Available Online 9 October 2023.

DOI: 10.2991/978-94-6463-262-0_91 How to use a DOI?
Keywords: media data; English; Pronunciation; detection
Abstract: Pronunciation is an interdisciplinary subject, and speech recognition is gradually becoming the key technology of man-machine interface in information technology. In recent years, the application of computer speech recognition has made great progress. Based on the special status of English, there have been many designs and productions of English as a second language speech database in the world. However, due to the increasing popularity of English, more and more people use English as a second language, so it is necessary to establish an English pronunciation error detection system for multimedia data. In this paper, the initial data is designed, made and trained as a model, and the data of standard phonetic database (using the existing database of AVICAR) is tested and compared with the collected phonetic database. It is found that the recognition rate of collected voice data is much lower than that of standard voice data, and the conclusion is drawn that it is important to collect voice databases from different regions. The reason of low recognition rate is analyzed. Then, the data collected in the phonetic database are compared with each other according to different regions, and the reasons for the difference in recognition rate are summarized, which provides reference experience for the design and manufacture of phonetic database.
Copyright: © 2024 The Author(s)
Open Access: Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Download article (PDF)

<Previous Article In Volume

Next Article In Volume>

Volume Title: Proceedings of the 3rd International Conference on Management Science and Software Engineering (ICMSSE 2023)
Series: Atlantis Highlights in Engineering
Publication Date: 9 October 2023
ISBN: 978-94-6463-262-0
ISSN: 2589-4943
DOI: 10.2991/978-94-6463-262-0_91 How to use a DOI?
Open Access: Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Cite this article

ris enw bib

TY  - CONF
AU  - Lijuan Wu
PY  - 2023
DA  - 2023/10/09
TI  - English Pronunciation Error Detection Based on Multimedia Data
BT  - Proceedings of the 3rd International Conference on Management Science and Software Engineering (ICMSSE 2023)
PB  - Atlantis Press
SP  - 872
EP  - 878
SN  - 2589-4943
UR  - https://doi.org/10.2991/978-94-6463-262-0_91
DO  - 10.2991/978-94-6463-262-0_91
ID  - Wu2023
ER  -

download .riscopy to clipboard