skip to main content
Language:
Search Limited to: Search Limited to: Resource type Show Results with: Show Results with: Search type Index

Results 1 - 20 of 1,440  for All Library Resources

Results 1 2 3 4 5 next page
Show only
Refined by: Database: arXiv.org remove
Result Number Material Type Add to My Shelf Action Record Details and Options
1
AeGAN: Time-Frequency Speech Denoising via Generative Adversarial Networks
Material Type:
Article
Add to My Research

AeGAN: Time-Frequency Speech Denoising via Generative Adversarial Networks

arXiv.org, 2020-06

2020. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.1910.12620

Full text available

2
Investigating Cross-Domain Losses for Speech Enhancement
Material Type:
Article
Add to My Research

Investigating Cross-Domain Losses for Speech Enhancement

arXiv.org, 2021-05

2021. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2010.10468

Full text available

3
CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement
Material Type:
Article
Add to My Research

CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement

arXiv.org, 2024-05

2024. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2209.11112

Full text available

4
RHR-Net: A Residual Hourglass Recurrent Neural Network for Speech Enhancement
Material Type:
Article
Add to My Research

RHR-Net: A Residual Hourglass Recurrent Neural Network for Speech Enhancement

arXiv.org, 2019-04

2019. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.1904.07294

Full text available

5
How Familiar Does That Sound? Cross-Lingual Representational Similarity Analysis of Acoustic Word Embeddings
Material Type:
Article
Add to My Research

How Familiar Does That Sound? Cross-Lingual Representational Similarity Analysis of Acoustic Word Embeddings

arXiv.org, 2021-09

2021. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2109.10179

Full text available

6
Towards Dog Bark Decoding: Leveraging Human Speech Processing for Automated Bark Classification
Material Type:
Article
Add to My Research

Towards Dog Bark Decoding: Leveraging Human Speech Processing for Automated Bark Classification

arXiv.org, 2024-04

2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://creativecommons.org/licenses/by/4.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2404.18739

Full text available

7
Sequence Segmentation Using Joint RNN and Structured Prediction Models
Material Type:
Article
Add to My Research

Sequence Segmentation Using Joint RNN and Structured Prediction Models

arXiv.org, 2016-10

2016. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.1610.07918

Full text available

8
My lips are concealed: Audio-visual speech enhancement through obstructions
Material Type:
Article
Add to My Research

My lips are concealed: Audio-visual speech enhancement through obstructions

arXiv.org, 2019-07

2019. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.1907.04975

Full text available

9
The Conversation: Deep Audio-Visual Speech Enhancement
Material Type:
Article
Add to My Research

The Conversation: Deep Audio-Visual Speech Enhancement

arXiv.org, 2018-06

2018. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.1804.04121

Full text available

10
LAraBench: Benchmarking Arabic AI with Large Language Models
Material Type:
Article
Add to My Research

LAraBench: Benchmarking Arabic AI with Large Language Models

arXiv.org, 2024-02

2024. This work is published under http://creativecommons.org/licenses/by-nc-sa/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://creativecommons.org/licenses/by-nc-sa/4.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2305.14982

Full text available

11
Real-Time Lightweight Chaotic Encryption for 5G IoT Enabled Lip-Reading Driven Secure Hearing-Aid
Material Type:
Article
Add to My Research

Real-Time Lightweight Chaotic Encryption for 5G IoT Enabled Lip-Reading Driven Secure Hearing-Aid

arXiv.org, 2018-09

2018. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://creativecommons.org/licenses/by/4.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.1809.04966

Full text available

12
Contextual Audio-Visual Switching For Speech Enhancement in Real-World Environments
Material Type:
Article
Add to My Research

Contextual Audio-Visual Switching For Speech Enhancement in Real-World Environments

arXiv.org, 2018-08

2018. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.1808.09825

Full text available

13
Modular Customizable ROS-Based Framework for Rapid Development of Social Robots
Material Type:
Article
Add to My Research

Modular Customizable ROS-Based Framework for Rapid Development of Social Robots

arXiv.org, 2023-11

2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://creativecommons.org/licenses/by/4.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2311.15780

Full text available

14
On the Role of Visual Cues in Audiovisual Speech Enhancement
Material Type:
Article
Add to My Research

On the Role of Visual Cues in Audiovisual Speech Enhancement

arXiv.org, 2021-02

2021. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2004.12031

Full text available

15
Improving the Intent Classification accuracy in Noisy Environment
Material Type:
Article
Add to My Research

Improving the Intent Classification accuracy in Noisy Environment

arXiv.org, 2023-03

2023. This work is published under http://creativecommons.org/licenses/by-nc-sa/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://creativecommons.org/licenses/by-nc-sa/4.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2303.06585

Full text available

16
FaceChat: An Emotion-Aware Face-to-face Dialogue Framework
Material Type:
Article
Add to My Research

FaceChat: An Emotion-Aware Face-to-face Dialogue Framework

arXiv.org, 2023-03

2023. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2303.07316

Full text available

17
A Non-Technical Survey on Deep Convolutional Neural Network Architectures
Material Type:
Article
Add to My Research

A Non-Technical Survey on Deep Convolutional Neural Network Architectures

arXiv.org, 2018-03

2018. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://creativecommons.org/licenses/by/4.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.1803.02129

Full text available

18
Whisper in Focus: Enhancing Stuttered Speech Classification with Encoder Layer Optimization
Material Type:
Article
Add to My Research

Whisper in Focus: Enhancing Stuttered Speech Classification with Encoder Layer Optimization

arXiv.org, 2023-11

2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://creativecommons.org/licenses/by/4.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2311.05203

Full text available

19
The ICSTM+TUM+UP Approach to the 3rd CHIME Challenge: Single-Channel LSTM Speech Enhancement with Multi-Channel Correlation Shaping Dereverberation and LSTM Language Models
Material Type:
Article
Add to My Research

The ICSTM+TUM+UP Approach to the 3rd CHIME Challenge: Single-Channel LSTM Speech Enhancement with Multi-Channel Correlation Shaping Dereverberation and LSTM Language Models

arXiv.org, 2015-10

2015. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.1510.00268

Full text available

20
HiFi++: a Unified Framework for Bandwidth Extension and Speech Enhancement
Material Type:
Article
Add to My Research

HiFi++: a Unified Framework for Bandwidth Extension and Speech Enhancement

arXiv.org, 2022-09

2022. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2203.13086

Full text available

Results 1 - 20 of 1,440  for All Library Resources

Results 1 2 3 4 5 next page

Personalize your results

  1. Edit

Refine Search Results

Expand My Results

  1.   

Show only

  1. Peer-reviewed Journals (1)

Searching Remote Databases, Please Wait