Result Number | Material Type | Add to My Shelf Action | Record Details and Options |
---|---|---|---|
1 | Article | | AeGAN: Time-Frequency Speech Denoising via Generative Adversarial Networks; arXiv.org, 2020-06; 2020. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. EISSN: 2331-8422; DOI: 10.48550/arxiv.1910.12620; Full text available |
2 | Article | | Investigating Cross-Domain Losses for Speech Enhancement; arXiv.org, 2021-05; License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/; EISSN: 2331-8422; DOI: 10.48550/arxiv.2010.10468; Full text available |
3 | Article | | CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement; arXiv.org, 2024-05; License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/; EISSN: 2331-8422; DOI: 10.48550/arxiv.2209.11112; Full text available |
4 | Article | | RHR-Net: A Residual Hourglass Recurrent Neural Network for Speech Enhancement; arXiv.org, 2019-04; License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/; EISSN: 2331-8422; DOI: 10.48550/arxiv.1904.07294; Full text available |
5 | Article | | How Familiar Does That Sound? Cross-Lingual Representational Similarity Analysis of Acoustic Word Embeddings; arXiv.org, 2021-09; License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/; EISSN: 2331-8422; DOI: 10.48550/arxiv.2109.10179; Full text available |
6 | Article | | Towards Dog Bark Decoding: Leveraging Human Speech Processing for Automated Bark Classification; arXiv.org, 2024-04; License: http://creativecommons.org/licenses/by/4.0/; EISSN: 2331-8422; DOI: 10.48550/arxiv.2404.18739; Full text available |
7 | Article | | Sequence Segmentation Using Joint RNN and Structured Prediction Models; arXiv.org, 2016-10; License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/; EISSN: 2331-8422; DOI: 10.48550/arxiv.1610.07918; Full text available |
8 | Article | | My lips are concealed: Audio-visual speech enhancement through obstructions; arXiv.org, 2019-07; License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/; EISSN: 2331-8422; DOI: 10.48550/arxiv.1907.04975; Full text available |
9 | Article | | The Conversation: Deep Audio-Visual Speech Enhancement; arXiv.org, 2018-06; License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/; EISSN: 2331-8422; DOI: 10.48550/arxiv.1804.04121; Full text available |
10 | Article | | LAraBench: Benchmarking Arabic AI with Large Language Models; arXiv.org, 2024-02; License: http://creativecommons.org/licenses/by-nc-sa/4.0/; EISSN: 2331-8422; DOI: 10.48550/arxiv.2305.14982; Full text available |
11 | Article | | Real-Time Lightweight Chaotic Encryption for 5G IoT Enabled Lip-Reading Driven Secure Hearing-Aid; arXiv.org, 2018-09; License: http://creativecommons.org/licenses/by/4.0/; EISSN: 2331-8422; DOI: 10.48550/arxiv.1809.04966; Full text available |
12 | Article | | Contextual Audio-Visual Switching For Speech Enhancement in Real-World Environments; arXiv.org, 2018-08; License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/; EISSN: 2331-8422; DOI: 10.48550/arxiv.1808.09825; Full text available |
13 | Article | | Modular Customizable ROS-Based Framework for Rapid Development of Social Robots; arXiv.org, 2023-11; License: http://creativecommons.org/licenses/by/4.0/; EISSN: 2331-8422; DOI: 10.48550/arxiv.2311.15780; Full text available |
14 | Article | | On the Role of Visual Cues in Audiovisual Speech Enhancement; arXiv.org, 2021-02; License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/; EISSN: 2331-8422; DOI: 10.48550/arxiv.2004.12031; Full text available |
15 | Article | | Improving the Intent Classification Accuracy in Noisy Environment; arXiv.org, 2023-03; License: http://creativecommons.org/licenses/by-nc-sa/4.0/; EISSN: 2331-8422; DOI: 10.48550/arxiv.2303.06585; Full text available |
16 | Article | | FaceChat: An Emotion-Aware Face-to-face Dialogue Framework; arXiv.org, 2023-03; License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/; EISSN: 2331-8422; DOI: 10.48550/arxiv.2303.07316; Full text available |
17 | Article | | A Non-Technical Survey on Deep Convolutional Neural Network Architectures; arXiv.org, 2018-03; License: http://creativecommons.org/licenses/by/4.0/; EISSN: 2331-8422; DOI: 10.48550/arxiv.1803.02129; Full text available |
18 | Article | | Whisper in Focus: Enhancing Stuttered Speech Classification with Encoder Layer Optimization; arXiv.org, 2023-11; License: http://creativecommons.org/licenses/by/4.0/; EISSN: 2331-8422; DOI: 10.48550/arxiv.2311.05203; Full text available |
19 | Article | | The ICSTM+TUM+UP Approach to the 3rd CHIME Challenge: Single-Channel LSTM Speech Enhancement with Multi-Channel Correlation Shaping Dereverberation and LSTM Language Models; arXiv.org, 2015-10; License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/; EISSN: 2331-8422; DOI: 10.48550/arxiv.1510.00268; Full text available |
20 | Article | | Audio-Visual Target Speaker Enhancement on Multi-Talker Environment using Event-Driven Cameras; arXiv.org, 2021-02; License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/; EISSN: 2331-8422; DOI: 10.48550/arxiv.1912.02671; Full text available |