Anzeige
Sortierung
Seite 1 von 3

1

A Maximum Entropy Information Bottleneck (MEIB) Regularization for Generative Speech Enhancement with HiFi-GAN

Autoren:
Sach, Marvin; Pirklbauer, Jan; Fluyt, Kristoff; Tirry, Wouter; Fingscheidt, Tim
Konferenz:
Speech Communication - 15th ITG Conference

2

Ad Hoc Distributed Microphones Clustering: A Comparative Analysis on Using Coherence and Signal-Specific Features

Autoren:
Kindt, Stijn; Meeldijk, Martijn; Madhu, Nilesh
Konferenz:
Speech Communication - 15th ITG Conference

3

Advances In End-to-End Conversational Speech Quality Prediction

Autoren:
Bleiholder, Stefan; Kettler, Christian; Rohrer, Nils; Weyer, Steffen
Konferenz:
Speech Communication - 15th ITG Conference

4

Analyzing And Improving Neural Speaker Embeddings for ASR

Autoren:
Luescher, Christoph; Xu, Jingjing; Zeineldeen, Mohammad; Schlueter, Ralf; Ney, Hermann
Konferenz:
Speech Communication - 15th ITG Conference

5

Audio-Visual Speech Enhancement with Score-Based Generative Models

Autoren:
Richter, Julius; Frintrop, Simone; Gerkmann, Timo
Konferenz:
Speech Communication - 15th ITG Conference

6

BRUDEX Database: Binaural Room Impulse Responses with Uniformly Distributed External Microphones

Autoren:
Fejgin, Daniel; Middelberg, Wiebke; Doclo, Simon
Konferenz:
Speech Communication - 15th ITG Conference

7

Comparative Analysis of the wav2vec 2.0 Feature Extractor

Autoren:
Vieting, Peter; Schlueter, Ralf; Ney, Hermann
Konferenz:
Speech Communication - 15th ITG Conference

8

Comparative Study of LC3plus and Lyra codec on DNN-based Source Localisation for Hearing Aids

Autoren:
Song, Siyuan; Kindt, Stijn; Maes, Jasper; Bohlender, Alexander; Madhu, Nilesh
Konferenz:
Speech Communication - 15th ITG Conference

9

Comparison of Different Neural Network Architectures for Spoken Language Identification

Autoren:
Bazazo, Tala; Zeineldeen, Mohammad; Plahl, Christian; Schlueter, Ralf; Ney, Hermann
Konferenz:
Speech Communication - 15th ITG Conference

10

Compression of end-to-end non-autoregressive image-to-speech system for lowresourced devices

Autoren:
Srinivasagan, Gokul; Deisher, Michael; Georges, Munir
Konferenz:
Speech Communication - 15th ITG Conference

11

CRNN-based Multi-DOA Estimator: Comparing Classification and Regression

Autoren:
Cooreman, Pieter; Bohlender, Alexander; Madhu, Nilesh
Konferenz:
Speech Communication - 15th ITG Conference

12

Design of Low-Order IIR Filters Based on Hankel Nuclear Norm Regularization for Achieving Acoustic Transparency

Autoren:
Hilgemann, Florian; Weyer, Christoph; Jax, Peter
Konferenz:
Speech Communication - 15th ITG Conference

13

Development of Hybrid ASR Systems for Low Resource Medical Domain Conversational Telephone Speech

Autoren:
Luescher, Christoph; Zeineldeen, Mohammad; Yang, Zijian; Raissi, Tina; Vieting, Peter; Le-Duc, Khai; Wang, Weiyue; Schlueter, Ralf; Ney, Hermann
Konferenz:
Speech Communication - 15th ITG Conference

14

Distribution Mismatch Correction for Acoustic Scene Classification

Autoren:
Maier, Lukas; Fuchs, Alexander; Pernkopf, Franz
Konferenz:
Speech Communication - 15th ITG Conference

15

Evaluation Metrics for Generative Speech Enhancement Methods: Issues and Perspectives

Autoren:
Pirklbauer, Jan; Sach, Marvin; Fluyt, Kristoff; Tirry, Wouter; Wardah, Wafaa; Moeller, Sebastian; Fingscheidt, Tim
Konferenz:
Speech Communication - 15th ITG Conference

16

Evaluation of HRTF Models for Binaural Cue Adaptation

Autoren:
Nagel, Sebastian; Jax, Peter
Konferenz:
Speech Communication - 15th ITG Conference

17

Exploiting an External Microphone to Improve Time-Difference-of-Arrival Estimates for Euclidean Distance Matrix-Based Source Localization

Autoren:
Bruemann, Klaus; Doclo, Simon
Konferenz:
Speech Communication - 15th ITG Conference

18

Exploratory Evaluation of Speech Content Masking

Autoren:
Williams, Jennifer; Pizzi, Karla; Noe, Paul-Gauthier; Das, Sneha
Konferenz:
Speech Communication - 15th ITG Conference

19

Exploring Shapely Values for Blood Glucose Level Prediction from Speech

Autoren:
Pompe, Simone; Mallol-Ragolta, Adria; Schauer, Nicolas; Schuller, Bjoern W.
Konferenz:
Speech Communication - 15th ITG Conference

20

Exploring Visualization Techniques for Interpretable Learning in Speech Enhancement Deep Neural Networks

Autoren:
Nustede, Eike J.; Anemueller, Joern
Konferenz:
Speech Communication - 15th ITG Conference