
1

A Comparative Analysis on ASR System Combination for Attention, CTC, Factored Hybrid, and Transducer Models

Authors:
Bayoumi, Noureldin; Schmitt, Robin; Raissi, Tina; Zeyer, Albert; Schlueter, Ralf; Ney, Hermann
Conference:
Speech Communication - 16th ITG Conference

2

A fully Zero-shot Approach to Obtaining Specialized and Compact Audio Tagging Models

Authors:
Werning, Alexander; Haeb-Umbach, Reinhold
Conference:
Speech Communication - 16th ITG Conference

3

A Very-Low Delay High-Performance Speech Vocoder Based on the Encodec Speech Decoder

Authors:
Shi, Renzheng; Fingscheidt, Tim
Conference:
Speech Communication - 16th ITG Conference

4

Acoustical characterization and perceptual comparison of four types of 3D-printed vocal tract models for the German and Japanese vowels /a,e,i,o,u/

Authors:
Kleiner, Christian; Birkholz, Peter; Schaefer, Dominik; Arai, Takayuki
Conference:
Speech Communication - 16th ITG Conference

5

Adapting the Frechet Audio Distance as an Objective Metric for Text-to-Speech Quality Evaluation

Authors:
Zavistanavicius, Laurynas; Zalkow, Frank; Dittmar, Christian; Stevenson, Robert L.
Conference:
Speech Communication - 16th ITG Conference

6

An Improved Neural Network Architecture for Target Speech Extraction

Authors:
Joos, David; Faubel, Friedrich; Jungclaussen, Jonas; Buck, Markus; Minker, Wolfgang
Conference:
Speech Communication - 16th ITG Conference

7

Binaural Distance Estimation Using a Joint Latent Representation of Acoustic Distance and Direct Path Response

Authors:
Neudek, Daniel; Stodt, Benjamin; Getzmann, Stephan; Martin, Rainer
Conference:
Speech Communication - 16th ITG Conference

8

Blind Estimation of Head Rotations From Binaural Recordings

Authors:
Fleischhauer, Erik; Jax, Peter
Conference:
Speech Communication - 16th ITG Conference

9

Building a German-centric SpeechLLM Using Limited Data

Authors:
Maurya, Manas; Dethmann, Thomas; Walter, Oliver; Schmidt, Christoph Andreas; Koehler, Joachim
Conference:
Speech Communication - 16th ITG Conference

10

Comparison of Knowledge Distillation Methods for Low-complexity Multimicrophone Speech Enhancement using the FT-JNF Architecture

Authors:
Metzger, Robert; Ohlenbusch, Mattes; Rollwage, Christian; Doclo, Simon
Conference:
Speech Communication - 16th ITG Conference

11

Detecting COPD Exacerbations Before Onset Using Vocal Biomarkers

Authors:
Nippert, Lars; Simons, Sami O.; Hoxha, Julia
Conference:
Speech Communication - 16th ITG Conference

12

Early and Late Reflections in Acoustic Echo Control: An Experimental Study on (Neural) Kalman Filters and DNN Methods

Authors:
Seidel, Ernst; Fingscheidt, Tim
Conference:
Speech Communication - 16th ITG Conference

13

Effectiveness of Acceleration Sensors on the Thorax and Abdomen for Speech Breathing Analysis

Authors:
Kazzy, Dani; Kleiner, Christian; Fuchs, Susanne; Birkholz, Peter
Conference:
Speech Communication - 16th ITG Conference

14

Enhancement of Neural Embeddings for Speaker Identification in Ad-hoc Acoustic Sensor Networks and Multi-Speaker Scenarios

Authors:
Intek, Philipp; Becker, Luca; Koppelmann, Timm; Martin, Rainer
Conference:
Speech Communication - 16th ITG Conference

15

Error Analysis in a Modular Meeting Transcription System

Authors:
Vieting, Peter; Berger, Simon; von Neumann, Thilo; Boeddeker, Christoph; Schlueter, Ralf; Haeb-Umbach, Reinhold
Conference:
Speech Communication - 16th ITG Conference

16

Evaluating the Impact of Crowdsourced Audio Data on Speech Quality Assessment

Authors:
Shchegelskiy, Kirill; El-Tannir, Malek; Wardah, Wafaa; Kocak Bueyuektas, Tugce Melike; Moeller, Sebastian
Conference:
Speech Communication - 16th ITG Conference

17

Evaluating the Recognition Performance of the RehaLingo Speech Training System with Aphasic Speech

Authors:
Hirsch, Hans-Guenter; Tiggelkamp, Yannic; Neumann, Christian; Bolten, Tobias
Conference:
Speech Communication - 16th ITG Conference

18

Exploring In-Context Learning Capabilities of ChatGPT for Pathological Speech Detection

Authors:
Amiri, Mahdi; Shahreza, Hatef Otroshi; Kodrasi, Ina
Conference:
Speech Communication - 16th ITG Conference

19

Extending Manifold-Based MIMO System Identification to Adaptive Crosstalk Cancellation

Authors:
Hahn, Johannes; Kabzinski, Tobias; Jax, Peter
Conference:
Speech Communication - 16th ITG Conference

20

Investigation of Speech and Noise Latent Representations in Single-channel VAE-based Speech Enhancement

Authors:
Li, Jiatong; Doclo, Simon
Conference:
Speech Communication - 16th ITG Conference