Information Technology Society within VDE (ITG) (Hrsg.)

ITG-Fb. 252: Speech Communication

11. ITG-Fachtagung Sprachkommunikation 24. – 26. September 2014 in Erlangen

ITG-Fachberichte

2014, 202 Seiten, Slimlinebox, CD-Rom
ISBN 978-3-8007-3640-9
Persönliche VDE-Mitglieder erhalten auf diesen Titel 10% Rabatt

Inhaltsverzeichnis

The 11th ITG conference on Speech Communication solicits contributions on theory, algorithms, and applications in the following areas of speech, audio, and spoken language processing:

  • Acoustic Echo and Noise Control
  • Bandwidth Extension and Intelligibility Enhancement
  • Source Separation, Dereverberation, and Localization
  • Speech Production and Perception
  • Automatic Speech Recognition
  • Spoken Dialogue, Diarization, and Information Retrieval Systems
  • Speech Coding, Error Protection, and Concealment
  • Speech Quality Assessment
  • Speech in Mobile and Multimodal Applications
  • Acoustic Interfaces, Assistive Devices, and Hearing Aids
  • Automotive Applications
  • Hardware and Software Tools
  • Emerging Techniques and Applications

1

Spectral Noise Tracking for Improved Nonstationary Noise Robust ASR

Autoren: Chinaev, Aleksej; Puels, Marc; Haeb-Umbach, Reinhold

2

Semi-Automatic Calibration for Dereverberation by Spectral Subtraction for Continuous Speech Recognition

Autoren: Riedhammer, Korbinian; Bocklet, Tobias; Orozco-Arroyave, Juan Rafael; Orozco-Arroyave, Juan Rafael; Noeth, Elmar

3

4

5

Robust Multimodal Human Machine Interaction using the Kinect Sensor

Autoren: Zeiler, Steffen; Cwiklak, Jan; Kolossa, Dorothea

6

Towards a Localised German Automatic Speech Recognition

Autoren: Stadtschnitzer, Michael; Schmidt, Christoph; Stein, Daniel

7

8

Effects of Resampling in Acoustic Echo CancellationWith Static Nonlinear Loudspeaker Distortion

Autoren: Schalk-Schupp, Ingo; Faubel, Friedrich; Buck, Markus

9

Combined Nonlinear Echo Cancellation and Residual Echo Suppression

Autoren: Schwarz, Andreas; Hofmann, Christian; Kellermann, Walter

10

Efficient Multi-Channel Acoustic Echo Cancellation Using Constrained Sparse Filter Updates in the Subband Domain

Autoren: Desiraju, Naveen Kumar; Doclo, Simon; Gerkmann, Timo; Wolff, Tobias

11

Selflearning Codebook Speech Enhancement

Autoren: Heese, Florian; Nelke, Christoph Matthias; Niermann, Markus; Vary, Peter

12

An Open Source Corpus and Recording Software for Distant Speech Recognition with the Microsoft Kinect

Autoren: Schnelle-Walka, Dirk; Radeck-Arneth, Stephan; Biemann, Chris; Radomski, Stefan

13

Dual MicrophoneWind Noise Reduction by Exploiting the Complex Coherence

Autoren: Nelke, Christoph Matthias; Vary, Peter

14

A Differential Microphone Array with Input Level Alignment, Directional Equalization and Fast Notch Adaptation for Handsfree Communication

Autoren: Geiser, Bernd; Geiser, Bernd; Krueger, Hauke; Krueger, Hauke; Vary, Peter; Wiese, Detlef

15

16

Modeling Graphical and Speech User Interfaces with Widgets and Spidgets

Autoren: Massonie, Dominique; Hacker, Christian; Sowa, Timo

17

The Impact of Word Alignment Accuracy on Audio-visual Word Prominence Detection

Autoren: Heckmann, Martin; Mikias, Paschalis; Kolossa, Dorothea

18

19

Impact of Coding Noise on the Convergence of Blind Source Separation

Autoren: Meier, Stefan; Kellermann, Walter

20

Audio Coding for Beamforming with Distributed Microphones

Autoren: Pawig, Matthias; Vary, Peter

21

Declipping of Speech Signals Using Frequency Selective Extrapolation

Autoren: Jonscher, Markus; Seiler, Juergen; Kaup, Andre

22

23

On Reverse Waterfilling in Closed-Loop LPC with Noise Shaping

Autoren: Krueger, Hauke; Geiser, Bernd; Vary, Peter

24

Linear Predictive Coding With Backward Adaptation and Noise Shaping

Autoren: Korse, Srikanth; Krueger, Hauke; Pawig, Matthias; Vary, Peter

25

A Multi-Stage, Multi-Channel Processing System for Overlapping Speech Separation in a Real Scenario

Autoren: Toroghi, Rahil Mahdian; Oualil, Youssef; Klakow, Dietrich

26

Towards Acoustic Event Detection for Surveillance in Cars

Autoren: Transfeld, Peter; Receveur, Simon; Fingscheidt, Tim

27

Improved Performance Measures for Voice Activity Detection

Autoren: Graf, Simon; Graf, Simon; Herbig, Tobias; Buck, Markus; Schmidt, Gerhard

28

29

Application of Frequency Shifting in In-Car Communication Systems

Autoren: Withopf, Jochen; Rohde, Sebastian; Schmidt, Gerhard

30

31

Improvement in Listener Comfort Through Noise Shaping Using a Modified Wiener Filter Approach

Autoren: Rajan, Vasudev Kandade; Baasch, Christin; Krini, Mohamed; Schmidt, Gerhard

32

33

Challenges in Acoustic Signal Enhancement for Human-Robot Communication

Autoren: Loellmann, Heinrich W.; Barfuss, Hendrik; Deleforge, Antoine; Meier, Stefan; Kellermann, Walter

34

System Identification with Perfect Sequence Excitation – Efficient NLMS vs. Inverse Cyclic Convolution

Autoren: Antweiler, Christiane; Kuehl, Stefan; Sauert, Bastian; Vary, Peter

35

I-vector Speaker Verification for Speech Degraded by Narrowband and Wideband Channels

Autoren: Fernandez Gallardo, Laura; Fernandez Gallardo, Laura; Wagner, Michael; Wagner, Michael; Moeller, Sebastian; Moeller, Sebastian

36

On Bayesian Networks in Speech Signal Processing

Autoren: Maas, Roland; Huemmer, Christian; Hofmann, Christian; Kellermann, Walter

37

Advances in Perceptual Modeling of Speech Quality in Telecommunications

Autoren: Gierlich, Hans-Wilhelm; Heute, Ulrich; Moeller, Sebastian

38

Instrumental Evaluation of In-Car Communication Systems

Autoren: Theiss, Anne; Schmidt, Gerhard; Withopf, Jochen; Lueke, Christian

39

Speech quality of VoIP: bursty packet loss revisited

Autoren: Soloducha, Michal; Raake, Alexander

40

Orthogonal Audio Analyses for Disturbed Radio Broadcast

Autoren: Reimes, Jan; Kettler, Frank; Muesch, Udo; Lepage, Marc

41

New ITG Guideline for the Usability Evaluation of Smart Home Environments

Autoren: Moeller, Sebastian; Engelbrecht, Klaus-Peter; Hillmann, Stefan; Ehrenbrink, Patrick

42

43

Generalized Multichannel Wiener Filter for Spatially Distributed Microphones

Autoren: Lawin-Ore, Toby Christian; Stenzel, Sebastian; Freudenberger, Juergen; Doclo, Simon

44

45

46

Detection of Audio Events with Repetitive Structure Using Generalized Autocorrelations

Autoren: Kurth, Frank; Cornaggia-Urrigshardt, Alessia

47

Time-frequency Dependent Multichannel Voice Activity Detection

Autoren: Stenzel, Sebastian; Freudenberger, Juergen

48

Online Observation ErrorModel Estimation for Acoustic Sensor Network Synchronization

Autoren: Schmalenstroeer, Joerg; Zhao, Weile; Haeb-Umbach, Reinhold