Results 1 - 20 of 1,437 for All Library Resources

Refined by: Database: arXiv.org
1
dEchorate: a Calibrated Room Impulse Response Database for Echo-aware Signal Processing
Material Type:
Article

arXiv.org, 2021-04 [Peer Reviewed Journal]

2021. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2104.13168

Full text available

2
High Fidelity Speech Enhancement with Band-split RNN
Material Type:
Article

arXiv.org, 2023-06

2023. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2212.00406

Full text available

3
DeFTAN-II: Efficient Multichannel Speech Enhancement with Subgroup Processing
Material Type:
Article

arXiv.org, 2023-08

2023. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2308.15777

Full text available

4
MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization
Material Type:
Article

arXiv.org, 2023-05

2023. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2305.18881

Full text available

5
Attention does not guarantee best performance in speech enhancement
Material Type:
Article

arXiv.org, 2023-02

2023. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2302.05690

Full text available

6
Speech Enhancement with Fullband-Subband Cross-Attention Network
Material Type:
Article

arXiv.org, 2022-11

2022. This work is published under http://creativecommons.org/licenses/by-sa/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://creativecommons.org/licenses/by-sa/4.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2211.05432

Full text available

7
A lightweight dual-stage framework for personalized speech enhancement based on DeepFilterNet2
Material Type:
Article

arXiv.org, 2024-04

2024. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2404.08022

Full text available

8
Mel-FullSubNet: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR
Material Type:
Article

arXiv.org, 2024-02

2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://creativecommons.org/licenses/by/4.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2402.13511

Full text available

9
Toward Universal Speech Enhancement for Diverse Input Conditions
Material Type:
Article

arXiv.org, 2024-02

2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://creativecommons.org/licenses/by/4.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2309.17384

Full text available

10
How does end-to-end speech recognition training impact speech enhancement artifacts?
Material Type:
Article

arXiv.org, 2023-11

2023. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2311.11599

Full text available

11
Magnitude-and-phase-aware Speech Enhancement with Parallel Sequence Modeling
Material Type:
Article

arXiv.org, 2023-10

2023. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2310.07316

Full text available

12
Understanding Spoken Language Development of Children with ASD Using Pre-trained Speech Embeddings
Material Type:
Article

arXiv.org, 2023-05

2023. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2305.14117

Full text available

13
MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra
Material Type:
Article

arXiv.org, 2023-05

2023. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2305.13686

Full text available

14
Exploring the Importance of F0 Trajectories for Speaker Anonymization using X-vectors and Neural Waveform Models
Material Type:
Article

arXiv.org, 2021-10

2021. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2110.06887

Full text available

15
Deep Multi-Frame Filtering for Hearing Aids
Material Type:
Article

arXiv.org, 2023-05

2023. This work is published under http://creativecommons.org/licenses/by-sa/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://creativecommons.org/licenses/by-sa/4.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2305.08225

Full text available

16
The NPU-Elevoc Personalized Speech Enhancement System for ICASSP2023 DNS Challenge
Material Type:
Article

arXiv.org, 2023-03

2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://creativecommons.org/licenses/by/4.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2303.06811

Full text available

17
Complex-valued Spatial Autoencoders for Multichannel Speech Enhancement
Material Type:
Article

arXiv.org, 2021-08

2021. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2108.03130

Full text available

18
Fast FullSubNet: Accelerate Full-band and Sub-band Fusion Model for Single-channel Speech Enhancement
Material Type:
Article

arXiv.org, 2023-03

2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://creativecommons.org/licenses/by/4.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2212.09019

Full text available

19
TridentSE: Guiding Speech Enhancement with 32 Global Tokens
Material Type:
Article

arXiv.org, 2022-10

2022. This work is published under http://creativecommons.org/licenses/by-nc-sa/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://creativecommons.org/licenses/by-nc-sa/4.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2210.12995

Full text available

20
Binaural Speech Enhancement Using STOI-Optimal Masks
Material Type:
Article

arXiv.org, 2022-09

2022. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2209.15472

Full text available
