A Speech Recognition Method Based on Domain-Specific Datasets and Confidence Decision Networks

Sensors (Basel, Switzerland), 2023-06, Vol.23 (13), p.6036 [Peer Reviewed Journal]

COPYRIGHT 2023 MDPI AG; © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ISSN: 1424-8220; EISSN: 1424-8220; DOI: 10.3390/s23136036; PMID: 37447886

Title:
A Speech Recognition Method Based on Domain-Specific Datasets and Confidence Decision Networks
Author: Dong, Zhe ; Ding, Qianqian ; Zhai, Weifeng ; Zhou, Meng
Subjects: Accuracy ; Acoustics ; Automatic speech recognition ; Classifiers ; confidence decision making ; CTC ; Datasets ; Deep learning ; domain specific ; Domain specific languages ; Human-computer interaction ; Importance sampling ; Language ; Language modeling ; Methods ; Model accuracy ; Neural networks ; Neural Networks, Computer ; Speech ; speech networks ; Speech Perception ; Speech recognition ; Speech Recognition Software ; Voice recognition
Is Part Of: Sensors (Basel, Switzerland), 2023-06, Vol.23 (13), p.6036
Description: This paper proposes a speech recognition method based on a domain-specific language speech network (DSL-Net) and a confidence decision network (CD-Net). The method automatically trains on a domain-specific dataset, using pre-trained model parameters for transfer learning, to obtain a domain-specific speech model. Importance sampling weights are set for the trained domain-specific speech model, which is then integrated with the speech model trained on the benchmark dataset. This integration automatically expands the lexical content of the model to accommodate the input speech, based on the lexicon and language model. The adaptation addresses the out-of-vocabulary words that are likely to arise in most realistic scenarios and uses external knowledge sources to extend the existing language model. In doing so, the approach makes the language model more adaptable to new domains or scenarios and improves the prediction accuracy of the model. Domain-specific vocabulary is recognized with a deep fully convolutional neural network (DFCNN) and a connectionist temporal classification (CTC)-based approach. A confidence-based classifier is added to improve the accuracy and robustness of the overall approach. In the experiments, the method was tested on a proprietary domain audio dataset and compared with an automatic speech recognition (ASR) system trained on a large-scale dataset. In experimental verification, the model's accuracy in the medical domain improved from 82% to 91%. The inclusion of domain-specific datasets resulted in a 5% to 7% enhancement over the baseline, while the introduction of model confidence further improved the baseline by 3% to 5%. These findings demonstrate the significance of incorporating domain-specific datasets and model confidence in advancing speech recognition technology.
Publisher: Switzerland: MDPI AG
Language: English
Identifier: ISSN: 1424-8220
EISSN: 1424-8220
DOI: 10.3390/s23136036
PMID: 37447886
Source: Freely Accessible Journals
MEDLINE
PubMed Central
ROAD: Directory of Open Access Scholarly Resources
ProQuest Central
DOAJ Directory of Open Access Journals
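
Illustrative note on the Description: the abstract mentions CTC-based recognition and a confidence-based decision between a domain-specific model and a general model, but this record gives no implementation details. The sketch below is only a minimal illustration of those two ideas under stated assumptions, not the authors' DSL-Net or CD-Net. The blank index, the mean-of-max-probability confidence score, the fixed threshold, and the function names `ctc_greedy_decode` and `choose_hypothesis` are all hypothetical; in the paper the confidence decision is made by a trained network rather than a threshold.

```python
from typing import List, Sequence, Tuple

BLANK = 0  # assumption: index 0 is the CTC blank symbol


def ctc_greedy_decode(
    frame_probs: Sequence[Sequence[float]],
) -> Tuple[List[int], float]:
    """Greedy CTC decoding: best label per frame, merge repeats, drop blanks.

    Also returns a naive confidence score, the mean of the per-frame
    maximum probabilities (an illustrative choice, not the paper's).
    """
    labels: List[int] = []
    best_probs: List[float] = []
    previous = BLANK
    for probs in frame_probs:
        best = max(range(len(probs)), key=lambda i: probs[i])
        best_probs.append(probs[best])
        # Emit a label only when it is not blank and differs from the
        # previous frame's label (standard CTC collapse rule).
        if best != BLANK and best != previous:
            labels.append(best)
        previous = best
    confidence = sum(best_probs) / len(best_probs) if best_probs else 0.0
    return labels, confidence


def choose_hypothesis(
    domain_probs: Sequence[Sequence[float]],
    general_probs: Sequence[Sequence[float]],
    threshold: float = 0.6,  # hypothetical cut-off, not from the paper
) -> Tuple[str, List[int]]:
    """Keep the domain model's output if it is confident, else fall back."""
    domain_labels, domain_conf = ctc_greedy_decode(domain_probs)
    if domain_conf >= threshold:
        return "domain", domain_labels
    general_labels, _ = ctc_greedy_decode(general_probs)
    return "general", general_labels
```

Each input is a list of per-frame probability vectors over the label set; calling `choose_hypothesis(domain_probs, general_probs)` returns which model was used together with its decoded label indices.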
