Language:

Speech enhancement from fused features based on deep neural network and gated recurrent unit network

EURASIP journal on advances in signal processing, 2021-10, Vol.2021 (1), p.1-19, Article 104 [Peer Reviewed Journal]

The Author(s) 2021 ;COPYRIGHT 2021 Springer ;The Author(s) 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;ISSN: 1687-6180 ;ISSN: 1687-6172 ;EISSN: 1687-6180 ;DOI: 10.1186/s13634-021-00813-8

Full text available

Citations Cited by

Actions
1. Add to My Research
2. Remove from My Research
3. E-mail
4. Print
5. Permalink
6. Citation
7. EasyBib
8. EndNote
9. RefWorks
10. Delicious
11. Export RIS
12. Export BibTeX

Title:
Speech enhancement from fused features based on deep neural network and gated recurrent unit network
Author: Wang, Youming ; Han, Jiali ; Zhang, Tianqi ; Qing, Didi
Subjects: Algorithms ; Analysis ; Artificial neural networks ; Computational linguistics ; Context ; Deep learning ; Deep neural network ; Engineering ; Gated recurrent unit ; Intelligibility ; Language processing ; Machine learning ; Mapping ; Natural language interfaces ; Neural networks ; Noise ; Quantum Information Technology ; Signal,Image and Speech Processing ; Speech ; Speech enhancement ; Speech processing ; Speech quality ; Spintronics
Is Part Of: EURASIP journal on advances in signal processing, 2021-10, Vol.2021 (1), p.1-19, Article 104
Description: Speech is easily interfered by external environment in reality, which results in the loss of important features. Deep learning has become a popular speech enhancement method because of its superior potential in solving nonlinear mapping problems for complex features. However, the deficiency of traditional deep learning methods is the weak learning capability of important information from previous time steps and long-term event dependencies between the time-series data. To overcome this problem, we propose a novel speech enhancement method based on the fused features of deep neural networks (DNNs) and gated recurrent unit (GRU). The proposed method uses GRU to reduce the number of parameters of DNNs and acquire the context information of the speech, which improves the enhanced speech quality and intelligibility. Firstly, DNN with multiple hidden layers is used to learn the mapping relationship between the logarithmic power spectrum (LPS) features of noisy speech and clean speech. Secondly, the LPS feature of the deep neural network is fused with the noisy speech as the input of GRU network to compensate the missing context information. Finally, GRU network is performed to learn the mapping relationship between LPS features and log power spectrum features of clean speech spectrum. The proposed model is experimentally compared with traditional speech enhancement models, including DNN, CNN, LSTM and GRU. Experimental results demonstrate that the PESQ, SSNR and STOI of the proposed algorithm are improved by 30.72%, 39.84% and 5.53%, respectively, compared with the noise signal under the condition of matched noise. Under the condition of unmatched noise, the PESQ and STOI of the algorithm are improved by 23.8% and 37.36%, respectively. The advantage of the proposed method is that it uses the key information of features to suppress noise in both matched and unmatched noise cases and the proposed method outperforms other common methods in speech enhancement.
Publisher: Cham: Springer International Publishing
Language: English
Identifier: ISSN: 1687-6180
ISSN: 1687-6172
EISSN: 1687-6180
DOI: 10.1186/s13634-021-00813-8
Source: DOAJ Directory of Open Access Journals
AUTh Library subscriptions: ProQuest Central
Springer Nature OA Free Journals

Back to results list


INSPIRE LIBRARY - TON DUC THANG UNIVERSITY	(84-028) 37 755 057	Feedback
19 Nguyen Huu Tho St. Dist.7, HCM	thuvien@tdtu.edu.vn	Feedback

Speech enhancement from fused features based on deep neural network and gated recurrent unit network

Searching Remote Databases, Please Wait