Result Number | Material Type | Add to My Shelf Action | Record Details and Options |
---|---|---|---|
1 |
Material Type: Article
|
A Model for Speech Processing in Second Language Listening ActivitiesEnglish language teaching (Toronto), 2016-01, Vol.9 (2), p.13ISSN: 1916-4742 ;EISSN: 1916-4750 ;DOI: 10.5539/elt.v9n2p13Full text available |
|
2 |
Material Type: Article
|
A Primer on Neural Network Models for Natural Language ProcessingThe Journal of artificial intelligence research, , Vol.57, p.345-420 [Peer Reviewed Journal]2016. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the associated terms available at https://www.jair.org/index.php/jair/about ;ISSN: 1076-9757 ;EISSN: 1943-5037 ;DOI: 10.1613/jair.4992Full text available |
|
3 |
Material Type: Article
|
Gammatonegram Representation for End-to-End Dysarthric Speech Processing Tasks: Speech Recognition, Speaker Identification, and Intelligibility AssessmentarXiv.org, 2024-032024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://creativecommons.org/licenses/by/4.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2307.03296Full text available |
|
4 |
Material Type: Article
|
The X-LANCE Technical Report for Interspeech 2024 Speech Processing Using Discrete Speech Unit ChallengearXiv.org, 2024-042024. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2404.06079Full text available |
|
5 |
Material Type: Article
|
UTDUSS: UTokyo-SaruLab System for Interspeech2024 Speech Processing Using Discrete Speech Unit ChallengearXiv.org, 2024-032024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://creativecommons.org/licenses/by/4.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2403.13720Full text available |
|
6 |
Material Type: Article
|
1SPU: 1-step Speech Processing UnitarXiv.org, 2023-122023. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://creativecommons.org/licenses/by-nc-nd/4.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2311.04753Full text available |
|
7 |
Material Type: Article
|
Pre-trained Speech Processing Models Contain Human-Like Biases that Propagate to Speech Emotion RecognitionarXiv.org, 2023-102023. This work is published under http://creativecommons.org/licenses/by-nc-sa/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://creativecommons.org/licenses/by-nc-sa/4.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2310.18877Full text available |
|
8 |
Material Type: Article
|
Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech ProcessingarXiv.org, 2024-052024. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2402.15151Full text available |
|
9 |
Material Type: Article
|
Developing Speech Processing Pipelines for Police AccountabilityarXiv.org, 2023-062023. This work is published under http://creativecommons.org/licenses/by-nc-sa/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://creativecommons.org/licenses/by-nc-sa/4.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2306.06086Full text available |
|
10 |
Material Type: Article
|
Topological Data Analysis for Speech ProcessingarXiv.org, 2023-062023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://creativecommons.org/licenses/by/4.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2211.17223Full text available |
|
11 |
Material Type: Article
|
Transformers in Speech Processing: A SurveyarXiv.org, 2023-032023. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2303.11607Full text available |
|
12 |
Material Type: Article
|
Visually Grounded Models of Spoken Language: A Survey of Datasets, Architectures and Evaluation TechniquesThe Journal of artificial intelligence research, 2022-01, Vol.73, p.673-707 [Peer Reviewed Journal]2022. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the associated terms available at https://www.jair.org/index.php/jair/about ;ISSN: 1076-9757 ;EISSN: 1943-5037 ;DOI: 10.1613/jair.1.12967Full text available |
|
13 |
Material Type: Article
|
dEchorate: a Calibrated Room Impulse Response Database for Echo-aware Signal ProcessingarXiv.org, 2021-04 [Peer Reviewed Journal]2021. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2104.13168Full text available |
|
14 |
Material Type: Article
|
Open Implementation and Study of BEST-RQ for Speech ProcessingarXiv.org, 2024-052024. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2405.04296Full text available |
|
15 |
Material Type: Article
|
Compressing Transformer-based self-supervised models for speech processingarXiv.org, 2024-012024. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2211.09949Full text available |
|
16 |
Material Type: Article
|
Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech ProcessingarXiv.org, 2022-112022. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2211.01522Full text available |
|
17 |
Material Type: Article
|
A Review of Deep Learning Techniques for Speech ProcessingarXiv.org, 2023-052023. This work is published under http://creativecommons.org/licenses/by-sa/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://creativecommons.org/licenses/by-sa/4.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2305.00359Full text available |
|
18 |
Material Type: Article
|
SUPERB: Speech processing Universal PERformance BenchmarkarXiv.org, 2021-102021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://creativecommons.org/licenses/by/4.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2105.01051Full text available |
|
19 |
Material Type: Article
|
Self-supervised Rewiring of Pre-trained Speech Encoders: Towards Faster Fine-tuning with Less Labels in Speech ProcessingarXiv.org, 2022-102022. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2210.13030Full text available |
|
20 |
Material Type: Article
|
SpeechFormer++: A Hierarchical Efficient Framework for Paralinguistic Speech ProcessingarXiv.org, 2023-022023. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://arxiv.org/licenses/nonexclusive-distrib/1.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2302.14638Full text available |