Papers and Book Chapters

List of publications

A B C D E F G H I J K L M N O P Q R S T U V W X Y Z

K

Karbasi, M., Zeiler, S., Freiwald, J., & Kolossa, D. (2019). Toward Robust Mispronunciation Detection via Audio-Visual Speech Recognition. In I. Rojas, G. Joya, & A. Catala (Eds.), LNCS sublibray: SL1 - Theoretical computer science and general issues: Vol. 11506. Advances in Computational Intelligence: 15th International Work-Conference on Artificial Neural Networks, IWANN 2019, Gran Canaria, Spain, June 12-14, 2019, Proceedings, Part I (Vol. 11507, pp. 655–666). Springer. (https://doi.org/10.1007/978-3-030-20518-8_54)

Freiwald, J., Karbasi, M., Zeiler, S., Melchior, J., Kompella, V., Wiskott, L., Kolossa, D., (2018). “Utilizing Slow Feature Analysis for Lipreading”, in Speech communication: 13. ITG-Fachtagung Sprachkommunikation 10.- 12. Oktober 2018 in Oldenburg, ed. Simon Doclo and Peter Jax, 191–95. ITG-Fachbericht 282. Berlin: VDE VERLAG.

Karbasi, M., Abdelaziz, A. H., Meutzner H., Kolossa, D. (2016), "Blind Non-Intrusive Speech Intelligibility Prediction using Twin-HMMs", Proc. INTERSPEECH, San Francisco, USA, September 2016.

Karbasi, M., Abdelaziz, A. H., Kolossa, D. (2016). “Twin-HMM-Based Non-Intrusive Speech Intelligibility Prediction”, in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP 2016: March 20-25, 2016 Shanghai, China, 624–28: IEEE.

Karbasi, M., Kolossa, D. (2015). “A Microscopic Approach to Speech Intelligibility Prediction using Auditory Models”, Proc. DAGA, Nürnberg, March 2015.

Anfang

Kesselheim, M. (1990). "Computergestützte Konstruktion großer Wortklassensysteme." Institut für Kommunikationsakustik, Ruhr-Universität Bochum. Dissertation, Universitätsverlag Dr. N. Brockmeyer, D-Bochum

Kesselheim, M. (1988). "Arbeiten zur linguistischen Analyse europäischer Sprachen." BIGTECH.

Kesselheim, M., Kugler-Kruse, M., Gerlach, H., Hellingrath, B., Schreiber, P. (1987). "Benutzersprache - Sprache der Maschinen." Vortrag im Rahmen der VDI-Tagung Herausforderung der Gesellschaft durch den technischen Wandel, Braunschweig.

Anfang

Kirchner, M., Hudde, H. (1995). "Ein Verfahren zur individuellen Schätzung der Parameter des Gehörgangs." Fortschritte der Akustik - DAGA 95: 227-230.

Anfang

Kleingarn, D., Nabizadeh, N., Heckmann, M., & Kolossa, D. (2019). Speaker-adapted neural-network-based fusion for multimodal reference resolution. In Proceedings of the 20th Annual SIGdial Meeting on Discourse and Dialogue (pp. 210–214). Association for Computational Linguistics. (https://www.aclweb.org/anthology/W19-5925)

Anfang

Knohl, L. (1997). "Prosodiegesteuerte Sprecher- und Umweltadaption in einer Mehrsprecher-Architektur zur machinellen Spracherkennung (Prosody Driven Speaker - and Environment Adaptation in a Multi-Speaker Speech -Recognition Architecture)." Institut für Kommunikationsakustik, Ruhr-Universität Bochum. Dissertation, VDI Verlag, D-Düsseldorf

Knohl, L. (1994). "Ein Cochlea- und Haarzellenmodell zur physiologischen Sprachvorverarbeitung." Statusseminar Neuroinformatik des BMFT, Hrsg.: Projektträger Informationstechnik des BMFT bei der Dt. Forschungsanstalt für Luft- und Raumfahrt (DLR): 383-388.

Knohl, L., Rinscheid, A. (1994). "Verfahren zur gegenseitigen Abbildung von Merkmalsräumen." Deutsches Patent, Aktenzeichen DE 43 00 159 A1.

Knohl, L., Rinscheid, A. (1993). "Speaker normalization and adaptation based on feature map projection." EUROSPEECH'93. 1: 367-370.

Knohl, L., Rinscheid, A. (1993). "Speaker normalization with self-organizing feature maps." Proc. of the Int. Joint Conference on Neural Networks (IJCnn) 1: 243-246.

Knohl, L., Rinscheid, A. (1993). "Verfahren zur kontinuierlichen Merkmalsadaptaion mittels selbstorganisierender topologie-erhaltender Merkmalskarten." Fortschritte der Akustik - DAGA'93 1004-1007.

Anfang

D. Orth, N. Steinhardt, B. Bolder, M. Dunn, D. Kolossa, M. Heckmann: "Analysis of a Speech-Based Intersection Assistant in Real Urban Trafﬁc", accepted for publication," Proc. IEEE Intelligent Transportation Systems Conference (ITSC), Maui, Hawaii, USA, Nov. 2018.

J. Freiwald, M. Karbasi, S. Zeiler, J. Melchior, V. Kompella, L. Wiskott, D. Kolossa: "Utilizing Slow Feature Analysis for Lipreading," ITG Conference on Speech Communication, Oldenburg, Germany, Oct. 2018.

M. Heckmann, D. Orth, D. Kolossa: "Gap after the next two vehicles": A Spatio-temporally Situated Dialog for a Cooperative Driving Assistant," ITG Conference on Speech Communication, Oldenburg, Germany, Oct. 2018.

C. Schymura, T. Isenberg, D. Kolossa: "Extending linear dynamical systems with dynamic stream weights for audiovisual speaker localization," International Workshop on Acoustic Signal Enhancement (IWAENC), Tokyo, Sept. 2018.

C. Schymura, P. Guo, Y. Maymon, B. Rafaely, D. Kolossa: "Exploiting Structures of Temporal Causality for Robust Speaker Localization in Reverberant Environments," best student paper award, Proc. LVA/ICA, Guildford, UK, July 2018.

C. Schymura, D. Kolossa: "Potential-field-based active exploration foracoustic simultaneous localization and mapping," ICASSP 2018, Calgary, Canada, April 2018.

Orth, D., Bolder, B., Steinhardt, N., Dunn, M., Kolossa, D., Heckmann, M. (2018). "A Speech-Based On-Demand Intersection Assistant Prototype", 2018 IEEE Intelligent Vehicles Symposium (IV), Changshu, China, June 26-29 2018.

Watanabe, S., Kolossa, D., Virtanen, T. (2018). "Application of Source Separation to Robust Speech Analysis and Recognition", in E. Vincent, T. Virtanen, and S. Gannot (ed.) "Audio Source Separation and Speech Enhancement," John Wiley & Sons, 2018.
Heckmann, M., Orth, D., Wersing, H., Kolossa, D. (2017). "Development of a personalised intersection assistant", ATZ Worldw (2017) 119: 36.

Heckmann, M., Wersing, H., Orth, D., Kolossa, D., Schömig, N., Maag, C., Dunn, M. (2017). "Towards an On Demand Intersection Assistant: Initial User Acceptance and System Development", 2017 FAST-zero ’17, Nara, Japan, September 18-22 2017.

Meutzner, H., Gupta, S., Nguyen, V.-H., Holz, T., Kolossa, D. (2017). "Towards Improved Audio CAPTCHAs Based on Auditory Perception and Language Understanding", ACM Transactions on Information and System Security, vol. 19, no. 4, February 2017.

Meutzner, H., Ma, N., Nickel, R., Schymura, C., Kolossa, D. (2017). "Improving audio-visual speech recognition using deep neural networks with dynamic stream reliability estimates", Proc. ICASSP, New Orleans, March 2017.

Orth, D., Kolossa, D., Sarria Paja, M., Schaller, K., Pech, A., Heckmann, M. (2017). "A Maximum Likelihood Method for Driver-Specific Critical-Gap Estimation", 2017 IEEE Intelligent Vehicles Symposium (IV), Redondo Beach, CA, USA, 11-14 June 2017.

Orth, D., Schömig, N., Mark, C., Jagiellowicz-Kaufmann, M., Kolossa, D., Heckmann, M. (2017). "Benefits of Personalization in the Context of a Speech-Based Left-Turn Assistant", 9th ACM International Conference on Automotive User Interfaces and Interactive Vehicular Applications (AutomotiveUI ’17), Oldenburg, Germany, September 24 - 27, 2017.

Orth, D., Kolossa, D., Heckmann, M. (2017). "Predicting driver left-turn behavior from few training samples using a maximum a posteriori method", 2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC), Yokohama, Japan, October 16-19.

Rafaely, B., Kolossa, D., Maymon, Y. (2017). "Towards acoustically robust localization of speakers in a reverberant environment," Proc. HSCMA, San Francisco, March 2017.

Rafaely, B., Kolossa, D. (2017)."Speaker localization in reverberant rooms based on direct path dominance test statistics", Proc. ICASSP, New Orleans, March 2017.

Schymura, C., Rios Grajales, J., Kolossa, C. (2017). "Monte Carlo exploration for active binaural localization", Proc. ICASSP, New Orleans, March 2017.

Schönherr, L., Orth, D., Heckmann, M., Kolossa, D. (2017). "Environmentally Robust Audio-Visual Speaker Identification", Proc. IEEE Workshop on Spoken Language Technology (SLT 2016), San Diego, USA, 13–16 December 2016

Boenninghoff, B., Nickel, R. M., Zeiler, S., Kolossa, D. (2016). "Unsupervised Classification of Voiced Speech and Pitch Tracking Using Forward-Backward Kalman Filtering", Proc. Speech Communication, 12. ITG Symposium, Paderborn, Germany, 2016.

Gergen, S., Zeiler, S., Hussen Abdelaziz, A., Kolossa, D. (2016). "New Insights into Turbo-Decoding-Based AVSR with Dynamic Stream Weights", ITG-Fachtagung Sprachkommunikation, Paderborn, Germany, Oct. 2016.

Hussen Abdelaziz, A., Kolossa, D. (2016). “General Hybrid Framework for Uncertainty-decoding-based Automatic Speech Recognition Systems”, accepted for publication, Speech Communication, 2016.

Jacobi, R., vom Bögel, G., Grabmaier, A., Kolossa, D. (2016). “Analysis of Time Variant Reliability Information Used in a Multilevel Decoding Scheme for RFID and Sensor Signals”, accepted for publication in International Journal for RF Technologies: Research and Applications 2016.

Karbasi, M., Abdelaziz, A. H., Kolossa, D. (2016). “Twin-HMM-Based Non-Intrusive Speech Intelligibility Prediction”, in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP 2016: March 20-25, 2016 Shanghai, China, 624–28: IEEE.

Schönherr, L., Orth, D., Heckmann, M., Kolossa, D. (2016). "Environmentally Robust Audio-Visual Speaker Identification", IEEE Spoken Language Technology Workshop (SLT), San Diego, USA, December 2016.

Schönherr, L., Zeiler, S., Kolossa, D. (2016). "Spoofing Detection via Simultaneous Verification of Audio-Visual Synchronicity and Transcription", IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Okinawa, Japan, December 2017.

Taghia, J., Kolossa, D., Martin, R. (2016). „ALE for robots! A single-channel approach to robot self-noise cancellation”, IEEE International Workshop on Acoustic Signal Enhancement (IWAENC), S. 1–5, 2016.

Saeidi, R., Astudillo, R., Kolossa, D. (2015). “Uncertain LDA: Including observation uncertainties in discriminative transforms”, IEEE Trans. on Pattern Analysis and Machine Intelligence, Sept. 2015.

Hussen Abdelaziz, A., Zeiler, S., Kolossa, D. (2015). “Learning Dynamic Stream Weights For Coupled-HMM-based Audio-visual Speech Recognition”, IEEE Trans. Audio Speech and Language Processing, vol. 23, no. 5, pp. 863-876, May 2015.

Hussen Abdelaziz, A., Watanabe, S., Hershey, J., Vincent, E., Kolossa, D. (2015). “Uncertainty Propagation through Deep Neural Networks,” in Proc. Interspeech, Dresden, Germany, September 2015.

Schymura, C., Winter, F., Kolossa, D., Spors, S. (2015). “Binaural Sound Source Localisation and Tracking using a dynamic Spherical Head Model", in Proc. Interspeech, Dresden, Germany, September 2015.

Jacobi, R., vom Bögel, G., Kolossa, D. (2015). “Multilevel Decoding Scheme for RFID and Sensor Signals in Inductively Coupled Systems”, Proc. of the European Conference on Smart Objects, Systems and Technologies (SmartSysTech), Aachen, Germany, June 2015.

Jacobi, R., Süss, A., vom Bögel, G., Kolossa, D. (2015). “Determination of the Optimal Carrier Frequency in Harsh Environments by Parameter Estimation”, Proc. of the European Conference on Smart Objects, Systems and Technologies (SmartSysTech), Aachen, Germany, June 2015.

Jacobi, R., Süss, A., vom Bögel, G., Kolossa, D. (2015). “Carrier Frequency Adaptation Approach”, Proc. IEEE International Conference on RFID, San Diego, USA, April 2015.

Karbasi, M., Kolossa, D. (2015). “A Microscopic Approach to Speech Intelligibility Prediction using Auditory Models”, Proc. DAGA, Nürnberg, March 2015.

Meutzner, H., Gupta, S., Kolossa, D. (2015). "Constructing Secure Audio CAPTCHAs by Exploiting Differences between Humans and Machines", Proc. ACM Conference on Human Factors in Computing Systems (CHI), Seoul, Korea, April 2015.

Raake, A., Blauert, J., Braasch, J., Brown, G., Danés, P., Dau, T., Gas, B., Argentieri, S., Kohlrausch, A., Kolossa, D., Le Goff, N., May, T., Obermayer, K. Spors, S. (2014). "Two!Ears – integral interactive model of auditory perception and experience”, Fortschr. Akust., DAGA 2014, Dtsch. Ges. Akustik, D−Oldenburg.

Blauert, J., Kolossa, D., Danès, P. (2014). “Feedback loops in engineering models of binaural listening”, ACTA ACUSTICA (submitted).

Darnstädt, M., Meutzner, H., Kolossa, D. (2014). "Reducing the Cost of Breaking Audio CAPTCHAs by Active and Semi-Supervised Learning", Proc. ICMLA, December 2014.

Heckmann, M., Mikias, P., Kolossa, D. (2014). "The Impact of Word Alignment Accuracy on Audiovisual Word Prominence Detection", Proc. ITG Fachtagung Sprachkommunikation, September 2014.

Hussen Abdelaziz, A., Kolossa, D. (2014). "Dynamic Stream Weight Estimation in Coupled-HMMbased Audio-visual Speech Recognition Using Multilayer Perceptrons", Proc. Interspeech, September 2014.

Hussen Abdelaziz, A., Zeiler, S., Kolossa, D. (2014). "A new EM estimation of dynamic stream weights for coupled-HMM-based audio-visual ASR", Proc. ICASSP, Florence, May 2014.

Jacobi, R., Grey, S., vom Bögel, G., Kolossa, D. (2014). "Digitally Controlled Analog Front End for Inductively Coupled Transponder Systems", Proc. IEEE RFID Technology and Applications, September 2014.

Kolossa, D. (2014). "Narrowing the gap: Probabilistic interfaces for signal enhancement and pattern recognition", accepted for publication, Proc. IEEE GlobalSIP - Machine Learning Applications in Speech Processing, December 2014.

Meutzner, H., Nguyen, V. H., Holz, T., Kolossa, D. (2014). "Using Automatic Speech Recognition for Attacking Acoustic CAPTCHAs: The Trade-off between Usability and Security", Proc. ACSAC, December 2014.

Schmid, D., Enzner, G., Malik, S., Kolossa, D., Martin, R. (2014). "Variational Bayesian Inference for Multichannel Dereverberation and Noise Reduction", IEEE Trans. on Audio, Speech and Language Processing, vol. 22, no. 8, pp. 1320-1335, Aug. 2014.

Schymura, C., Ma, N., Brown, G. J., Walther, T., Kolossa, D. (2014). "Binaural Sound Source Localisation using a Bayesian-network-based Blackboard System and Hypothesis-driven Feedback", in: Proc. FORUM ACUSTICUM 2014, PL-Krakow.

Zeiler, S., Cwiklak, J., Kolossa, D. (2014). "Robust Multimodal Human Machine Interaction using the Kinect Sensor", Proc. ITG Fachtagung Sprachkommunikation, September 2014.

Astudillo, F. R., Kolossa, D., Abad, A., Zeiler, S., Saeidi, R., Mowlaee, R., da Silva Neto, J. P., Martin, R. (2013). "Integration of Beamforming and Uncertainty-of-Observation Techniques for Robust ASR in Multi-Source Environments", Computer Speech and Language, Special Issue on Multisource Environments, vol. 27, no. 3, pp. 837-850, May 2013.

Blauert, J., Kolossa, D., Obermayer, K., Adiloglu, K. (2013). "Further challenges and the road ahead", in J. Blauert (ed.), The technology of binaural listening, Springer, Berlin-Heidelberg-New York, 2013.

Blauert, J., Kolossa, D., Obermayer, K., Adiloglu, K. (2013). “Further challenges and the road ahead”, in J. Blauert (ed.) The technology of binaural listening, Springer, Berlin-Heidelberg- New York, 2013.

Hussen Abdelaziz, A., Zeiler, S., Kolossa, D. (2013). ''Using Twin-HMM-Based Audio-Visual Speech Enhancement as a Front-End for Robust Audio-Visual Speech Recognition'', Proc. Interspeech, Lyon, France, August 2013.

Hussen Abdelaziz, A., Zeiler, S., Kolossa, D. (2013). ''Twin-HMM-based audio-visual speech enhancement'', Proc. ICASSP, Vancouver, Canada, May 2013.

Hussen Abdelaziz, A., Zeiler, S., Kolossa, D., Leutnant, V., Haeb-Umbach, R. (2013). ''GMM-based significance decoding'', in Proc. ICASSP, 2013.

Kohlrausch, A., Braasch, J., Kolossa, D., Blauert, J. (2013). "An introduction to binaural processing”, in J. Blauert (ed.) The technology of binaural listening, Springer, Berlin- Heidelberg-New York, 2013.

Kolossa, D., Zeiler, S., Saeidi, R., Astudillo, F. R. (2013). “Noise-Adaptive LDA: A New Approach for Speech Recognition Under Observation Uncertainty", IEEE Signal Processing Letters, vol. 20, no. 11, pp. 1018-1021, 2013.

Meutzner, H., Malik, S., Kolossa, D. (2013). "SVM-Based Preprocessing for Automatic Speech Recognition", AIA/DAGA.

Meutzner, H., Schlesinger, A., Zeiler, S., Kolossa, D. (2013). "Binaural Signal Processing for Enhanced Speech Recognition Robustness in Complex Listening Environments", Proc. 2nd CHiME Workshop on Machine Listening in Multisource Environments, Vancouver, Canada, June 2013.

Nickel, R, Astudillo, F. R., Kolossa, D., Martin, R. (2013). “Corpus-Based Speech Enhancement with Uncertainty Modeling and Cepstral Smoothing”, IEEE Trans. on Audio, Speech and Language Processing, vol. 21, no. 5, May 2013.

Martin, R., Kolossa, D. (2012). “Voice activity detection, noise estimation, and adaptive filters for acoustic signal enhancement”, in: T. Virtanen, R. Singh, B. Raj (eds.): “Techniques for Noise Robustness in Automatic Speech Recognition”, John Wiley & Sons, September 2012.

Hussen Abdelaziz, A., Kolossa, D. (2012). "Decoding of Uncertain Features Using the Posterior Distribution of the Clean Data for Robust Speech Recognition", International Speech Communication Association, (2012).

Hussen Abdelaziz, A., Zeiler, S., Kolossa, D. (2012). "Audio-Visual Speech Recognition for Uncertain Acoustical Observations", ITG Fachtagung Sprachkommunikation, (2012).

Nickel, R., Astudillo, R., Kolossa, D., Zeiler, S., Martin, R. (2012). "Inventory-Style Speech Enhancement with Uncertainty-of- Observation Techniques", ICASSP, pp. 4645-4648, Kyoto, Japan, March 2012.

Schmid, D., Thüne, P., Kolossa, D., Enzner, G. (2012). "Dereverberation Preprocessing and Training Data Adjustments for Robust Speech Recognition in Reverberant Environments," in Proc. ITG Conference Speech Communication, Braunschweig, Germany, Sep. 2012.

Astudillo, F. R., Kolossa, D. (2011). “Uncertainty Propagation”, in: Kolossa, D., Haeb-Umbach, R. (eds.): „Robust Speech Recognition of Uncertain or Missing Data - Theory and Applications“, Springer Verlag, pp. 35-64, July 2011.

Hoffmann, E., Kolossa, D., Orglmeister, R. (2011). “Recognition of Multiple Speech Sources using ICA”, in: Kolossa, D., Haeb-Umbach, R. (eds.): „Robust Speech Recognition of Uncertain or Missing Data - Theory and Applications“, Springer Verlag, pp. 319-344, July 2011.

Vorwerk, A., Zeiler, S., Kolossa, D., Astudillo, F. R., Lerch, D. (2011). “Use of Missing and Unreliable Data for Audiovisual Speech Recognition”, in: Kolossa, D., Haeb-Umbach, R. (eds.): „Robust Speech Recognition of Uncertain or Missing Data - Theory and Applications“, Springer Verlag, pp. 345-375, July 2011.

Kolossa, D. (2011). “High-Level Processing of Binaural Features”, Proc. Forum Acusticum, Aalborg, Denmark, June 27-July 1, 2011.

Kolossa, D., Astudillo, F. R., Abad, A., Zeiler, S., Saeidi, R., Mowlaee, P., da Silva Neto, J.P., Martin, R. (2011). “CHiME Challenge: Approaches to Robustness using Beamforming and Uncertainty-of-Observation Techniques”, in Proc. CHiME Workshop on Machine Listening in Multisource Environments, Florence, Italy, Sept.1, 2011.

Kolossa, D., Haeb-Umbach, R. (2011) (eds.): „Robust Speech Recognition of Uncertain or Missing Data - Theory and Applications“, Springer Verlag, 380 pages, July 2011.

Kolossa, D. (2011). „Einführung in die automatische Spracherkennung“, TU Berlin, 155 pages, 2010.

Astudillo, F. R., Kolossa D., Philipp Mandelartz, P., Orglmeister, R. (2010). "An Uncertainty Propagation Approach to Robust ASR using the ETSI Advanced Front-End", IEEE Journal of Selected Topics in Signal Processing, Special issue on Natural Interaction with Intelligent Environments, vol. 4, pp. 824 – 833, October 2010.

Kohl, F., Wübbeler, G., Kolossa, D., Bär, M., Orglmeister R., Elster, C. (2010). "Shifted factor analysis for the separation of evoked dependent MEG signals”, Phys. Med. Biol., vol. 55, pp. 4219–4230, 2010.

Kolossa, D., Astudillo, F. R., Hoffmann, E., Orglmeister, R. (2010). "Independent Component Analysis and Time-Frequency Masking for Multi-Speaker-Recognition“, EURASIP Journal on Audio, Speech, and Music Processing. vol. 2010, Article ID 651420, 13 pages, 2010.

Kolossa, D., Astudillo, F. R., Zeiler, S. , Vorwerk, A., Lerch, D., Chong, J., Orglmeister, R. (2010). “Missing Feature Audiovisual Speech Recognition under Real-Time Constraints”, ITG Fachtagung Sprachkommunikation, paper 22, 4 pages, Bochum, Germany, October 6-8, 2010.

Kolossa, D., Chong, J., Zeiler, S., Keutzer, K. (2010). “Efficient Manycore CHMM Speech Recognition for Audiovisual and Multistream Data”, Proc. Interspeech 2010, pp. 2698 – 2701, Makuhari, Japan, September 26-30, 2010.

Kohl, F. , Wübbeler, G., Kolossa, D., Elster, C., Bär, M., Orglmeister, R.(2010). "Noise adjusted PCA for finding the subspace of evoked dependent signals from MEG data”, Latent Variable Analysis and Signal Separation (LVA 2010), Lecture Notes in Computer Science, vol. 6365, pp. 442-449, September 2010.

Vorwerk, A., Wang, X., Kolossa, D., Zeiler, S., Orglmeister, R. (2010). "WAPUSK20 - A Database for Robust Audiovisual Speech Recognition", Proc. 7th Int. Conf. on International Language Resources and Evaluation (ELREC), pp. 3016 – 3019, 2010.

Astudillo, F. R., Kolossa, D., Orglmeister, R. (2009). "Accounting for the Uncertainty of Speech Estimates in the Complex Domain for Minimum Mean Square Error Speech Enhancement", Interspeech 2009, Brighton, UK, September 2009.

Kolossa, D., Zeiler, S., Vorwerk, A., Orglmeister, R.(2009). "Audiovisual Speech Recognition with Missing or Unreliable Data", Audiovisual Speech Processing Workshop (AVSP 2009), Brighton, UK, September 10-13, 2009.

Jeub, M., Kolossa, D., Astudillo, F. R., Orglmeister, R. (2009). "Performance Analysis of Wavelet-based Voice Activity Detection", invited paper, Proc. DAGA2009, pp. 407-408, Rotterdam, March 2009.

Kohl, F., Wübbeler, G., Kolossa, D., Elster, C., Bär, M., Orglmeister, R. (2009). "Non-Independent BSS: A Model for Evoked MEG Signals with Controllable Dependencies" in: Proceedings of the ICA 2009, pp. 443-450, Paraty, Brazil, March 15-18, 2009.

Hoffmann, E., Kolossa, D., Orglmeister, R. (2009). "Time Frequency Masking Strategy for Blind Source Separation of Acoustic Signals Based on Optimally-Modified LOG-Spectral Amplitude Estimator" in: Proceedings of the ICA 2009, pp. 581-588, Paraty, Brazil, March 15-18, 2009.

Astudillo, F. R., Kolossa, D., Orglmeister, R. (2008). "Uncertainty Propagation for Speech Recognition using RASTA Features in Highly Nonstationary Noisy Environments", ITG Fachtagung Sprachkommunikation, Aachen, October 2008.

Kohl, F., Wübbeler, G., Kolossa, D., Orglmeister, R. , Elster, C., Bär, M. (2008). "Performance of ICA for MEG data generated from subspaces with dependent sources", Proc. European Biomedical Engineering Congress (EMBEC), Antwerpen, Nov. 2008.

Kolossa, D., Hoffmann, E., Orglmeister, R. (2008). "ICA-Based Bayesian Time-Frequency Masking", invited paper, ITG Fachtagung Sprachkommunikation, Aachen, October 2008.

Kohl, F., Wübbeler, G., Sander, T., Trahms, L., Kolossa, D. , Orglmeister, R., Elster, C. and Bär, M. (2008). " Performance of ICA for Dependent Sources using Synthetic Stimulus Evoked MEG Data", invited paper, Workshop Biosignalverarbeitung, pp. 32-35, Potsdam, July 2008.

Kolossa, D., Araki, S. , Delcroix, M., Nakatani, T., Orglmeister, R., Makino, S. (2008). „Missing Feature Speech Recognition in a Meeting Situation with Maximum SNR Beamforming”, invited paper, Proc. ISCAS, pp. 3218-3221, Seattle, WA, May 2008.

Astudillo, F. R. , Kolossa, D. , Orglmeister, R. (2007). “Propagation of Statistical Information Through Non-Linear Feature Extractions For Robust Speech Recognition”, Proc. MaxEnt2007, pp. 245-252, 27th Int. Workshop on Bayesian Inference and Maximum Entropy Methods in Science and Engineering, Saratoga Springs, July 2007.

Hoffmann, E., Kolossa, D., Orglmeister, R. (2007). „A Soft Masking Strategy based on Multichannel Speech Probability Estimation for Source Separation and Robust Speech Recognition”, Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2007), pp. 118-121, New Paltz, NY.

Hoffmann, E., Kolossa, D., Orglmeister, R. (2007). “A Batch Algorithm for Blind Source Separation of Acoustic Signals Using ICA and Time-Frequency Masking”, Proc. ICA 2007, pp. 480-487, Springer Verlag, Berlin.

Kolossa, D., Astudillo, F. R., Orglmeister, R. (2007). “Spracherkennung im Automobil durch Verwendung von Missing Feature Techniken“, invited paper, Proc. DAGA 2007, pp. 301-302.

Kolossa, D., Sawada, H., Astudillo, F. R., Orglmeister, R., Makino, S. (2006). „Recognition of convolutive speech mixtures by missing feature techniques for ICA“, invited paper, in: Proc. 40th Asilomar Conference on Signals, Systems and Computers, pp. 1397-1401, October 29 - November 1, Pacific Grove, USA, 2006.

Maraboina, S., Kolossa, D., Bora, P., Orglmeister, R. (2006). „Multi-Speaker Voice Activity Detection using ICA and Beampattern Analysis“, in: Proc. Eusipco 2006, September 4-8, Florence, Italy.

Kolossa, D., Klimas, A., Baumann, W., Orglmeister, R. (2006). „Robuste Erkennung gestörter Sprache im Automobil durch MMSE-Störgeräuschunterdrückung und Missing-Data Spracherkennung“, invited paper, in: Proc. Daga 2006, March 20 - 23, Braunschweig, Germany.

Kolossa, D., Klimas, A., Orglmeister, R.(2005). „Separation and Robust Recognition of Noisy, Convolutive Speech Mixtures using Time-Frequency Masking and Missing Data Techniques“, in: Proceedings of the WASPAA 2005, pp. 82-85, New Paltz, NY, USA, October 16-19, 2005.

Kolossa, D., Orglmeister, R.(2004). „Nonlinear Postprocessing for Blind Speech Separation“, in: Proceedings of the ICA 2004, pp. 832–839, Granada, Spain, September 22-24, 2004.

Kolossa, D., Huo, Q. (2002). „Using Time-Stretched Pulses for Accurate Splitting of Speech Utterances Played Back in Noisy Reverberant Environments“, in: Proceedings of the International Conference on Spoken Language Processing (ICSLP), pp.1541-1544, September 16-20 2002, Denver, CO, USA.

Baumann, W. , Köhler, B.-U., Kolossa, D., Orglmeister, R. (2001). „Real Time Separation of Convolutive Mixtures“, in: Proceedings of the ICA2001, pp.65-69, San Diego, California, USA.

Kolossa, D., Köhler, B.-U., Conrath, M., Orglmeister, R. (2001). „Optimal Permutation Correction by Multiobjective Genetic Algorithms”, in: Proceedings of the ICA2001, pp. 373-378, San Diego, California, USA.

Kolossa, D., Grübel, G. (2000). „Evolutionary Computation and Nonlinear Programming in Multi-Model Robust Control Design“, In: Lecture Notes on Computer Science vol. 1803, pp. 147-157, Stefano Cagnoni et al. (Eds.), Springer Verlag, Berlin, Heidelberg, New York, 2000.

Kolossa, D. (2008). "Independent Component Analysis for Environmentally Robust Speech Recognition”, PhD Thesis, TU Berlin, 2008.

Baumann, W., Kolossa, D., Orglmeister, R. (2004). „Frequenzvariantes Beamforming zur Sprechertrennung im KFZ“, German Patent Number 10312065.3, March 2004.

Anfang

Korany, N. (2004). "Computer Modelling and Auralisation of Sound Fields in Rooms- An Overview." Invited paper for ICA 2004, Kyoto, Japan.

Korany, N., Onsy, A. A., Blauert, J. (2001). "Acoustic Simulation of Rooms with Boundaries of Partially Specular Reflectivity." J. Appl. Acoust. 62: 875-887.

Korany, N., Blauert, J., Abdel Alim, O. (2000). "Acoustic Simulation of Rooms with Boundaries of Partially Specular Reflectivity." Applied Acoustics.

Korany, N. (2000). "A Model for the Simulation of Sound Fields in Enclosures: Integrating the Geometrical and the Radiant Approaches (Ein Modell zur Simulation des Schallfeldes in geschlossenen Räumen: Integration des geometrischen und des Radiosity-Ansatzes)." University of Alexandria. Dissertation

Anfang

Köster, S. (2000). "Akustisch-phonetische Aspekte von Lombard-Sprache für verschiedene Sprechstile." DAGA 2000.

Köster, S., Pörschmann, Ch., Walter, J. (2000). "Eine Datenbank für deutsche Sprache mit Lombard-Effekt." Fortschritte der Akustik - DAGA 2000, DEGA in Fortschritte der Akustik - DAGA 2000: 356-357.

Köster, S., Mersdorf, J. (1999). "Intelligibility Enhancement of Synthetic Speech Heard via Telephone in a Noisy Environment." ACUSTICA/acta acustica 85: 1, 166.

Anfang

Kraft, V. (1997). "Verkettung natürlichsprachlicher Bausteine zur Sprachsynthese: Anforderungen, Techniken und Evaluierung." Institut für Kommunikationsakustik, Ruhr-Universität Bochum. Dissertation, VDI Verlag, D-Düsseldorf

Kraft, V., Portele, T. (1995). "Quality Evaluation of Five Speech Synthesis Systems for German." Acta Acustica 3: 351-365.

Kraft, V. (1994). "Concatenation of synthesis units: instrumental and auditive evaluation." Proc. of the Workshop Speech Quality Assessment: Standards, Perspectives, Applications: 45-46.

Kraft, V. (1994). "Does the Resulting Speech Quality Improvement Make a Sophisticated Concatenation of Time-Domain Synthesis Units Worthwhile?" Proc. of the 2nd ESCA/IEEE Workshop on Speech Synthesis: 65-68.

Kraft, V. (1994). "Untersuchung qualitätsmindernder Faktoren bei der Erzeugung synthetischer Sprache." Fortschritte der Akustik - DAGA'94: 1329-1332.

Kraft, V. (1993). "Auditory Detection of Discontinuities in Synthesis-by-Concatenation." EUROSPEECH'93: 929-932.

Kraft, V. (1993). "Automatische Extraktion und Segmentierung von Halbsilben als Sprachsynthesebausteine." Fortschritte der Akustik - DAGA'93: 1036-1039.

Kraft, V., Rinscheid, A. (1992). "Automatische Markierung von Grundperioden bei Sprachsignalen - mit und ohne Auswertung eines Laryngogramms." Fortschritte der Akustik - DAGA'92 633-636.

Anfang

Krebber, J., Drullman, R., Eneman, K., Huber, R., Jekosch, U., Luts, H., Martin, R. (2006). "Enhancement of Telephone Situations for Hearing Impaired" , Proceedings of the Conference and Workshop on Assistive Technologies for Vision and Hearing Impairment Technology for Inclusion, M.A. Hersh (ed), 2006.

Pegam, R., Krebber, J., Jekosch, U. (2005). „Benutzererwartungen im Zusammenhang mit dem Sprachdialogsystem INSPIRE“, DAGA 2005, München, pp. 251-252.

Krebber, J., Möller, S., Pegam, R., Raake, A., Melichar, M., Rajman, M. (2004). "Wizard of Oz Tests for a Dialogue System for Smart Homes." in: Proc. 7. Congrès Français d'Acoustique / 30. Dtsch. Jahrestg. f. Akust. (CFA/DAGA'04), F-Strassburg, 1149-1150.

Krebber, J., Raake, A., Möller, S. (2003). "Modellierung der Einflüsse des Übertragungskanals bei der Interaktion mit einem Smart-Home-System." Fortschritte der Akustik - DAGA 2003: Plenarvortr. u. Fachbeitr. d. 29. Dtsch. Jahrestg. f. Akust., D-Aachen, Dtsch. Ges. Akust., D-Oldenburg: 732-733.

Krebber, J., Raake, A., Möller, S., Novo, P. (2003). "Remote Access Simulation System User Manual, Deliverable 3.3, IST project INSPIRE (INfotainment management with SPeech Interaction via REmote-microp>hones and telephone interfaces, IST-2001-32746)." Institut für Kommunikationsakustik, Ruhr-Universität, D-Bochum.

Krebber, J., Möller, S., Raake, A., Rehmann, S., Berger, J., Johannsen, W. (2002). "Ein Simulationssystem zur Untersuchung des Einflusses von Übertragungskanälen bei Smart-Home-Anwendungen." Elektronische Sprachsignalverarbeitung: Tagungsband der 13. Konferenz. R. Hoffmann, w.e.b. Universitätsverlag: 75-82.

Krebber, J. (2002). "Wavelet Transformation zur Signal-Rauschabstandsverbesserung von Ausiosignalen." Fortschritte der Akustik - DAGA 2002, Dtsch. Ges. Akustik, DEGA e.V., CD-ROM.

Krebber, J., S. Möller, et al. (2004). "Speech Input and Output: Module Assessment for Remote Access to a Smart-Home Spoken Dialog System." Proc. 8th Int. Conf. on Spoken Language Processing (Interspeech 2004 - ICSLP), KR-Jeju Island, Vol. 4, 2753-2756.

Anfang

Kugler, M., Kulas, W., Rühl, H. W. (1985). "Endgraphemuntersuchungen für die Satzanalyse in einem Vorleseautomaten." Fortschritte der Akustik - DAGA'85: 583-586.

Kugler-Kruse, M., Posmyk, R. (1987). "Methods for the Simulation of Natural Intonation in the 'SYRUB' Text-to-speech System for Unrestricted German Text." European Conference on Speech Technology Vol. 2: pp. 177-180.

Kugler-Kruse, M., Jekosch, U., Kulas, W. (1986). "Erweiterung der universellen Phonemsynthesizer-Schnittstelle für marktübliche Sprachsynthetisatoren." Fortschritte der Akustik - DAGA'86: 557-560.

Anfang

Kulas, W., Kugler-Kruse, M., Jekosch, U., Kesselheim, M. (1986). "Ergonomische Gesichtspunkte bei Vorleseautomaten: Benutzerschnittstelle und Prosodiesteuerung im Bochumer SYRUB-Programmsystem zur Umsetzung von deutschem Schrifttext in Lautschrift mit Prosodieinformation." NTG-Fachberichte 94, Sprachkommunikation: 198-202.

Kulas, W., Blauert, J. (1986). German text-to-phoneme software drives any speech synthesizer. Report Speech Tech '86, Media Dimensions Inc.

Kulas, W., Kugler-Kruse, M. (1986). "Sprechrhythmussteuerung im Bochumer Vorleseautomaten." Fortschritte der Akustik - DAGA'86: 553-556.

Kulas, W., Rühl, H. W. (1985). Sprachsynthese. Zur Synthese von natürlich gesprochener Sprache aus Texten und Konzepten. Germanistische Linguistik. 79-80: 128-138.

Kulas, W., Rühl, H. W. (1984). Ein autonomes Sprachausgabegerät mit unbegrenztem Wortschatz, Tagungsbericht der Siemens-Anwenderkreis-Tagung. Dortmund.

Kulas, W., Rühl, H. W. (1984). "Eine einfache Satzintonation für ein Sprachausgabesystem mit unbegrenztem Wortschatz." Fortschritte der Akustik - DAGA'84: 821-824.

Kulas, W., Rühl, H. W. (1984). Über die Erzeugung einer Satzintonation, Sprachsynthese. G. Olms Verlag.

Kulas, W., Rühl, H. W. (1982). "Satzzerlegung für ein Sprachausgabesystem mit unbegrenztem Wortschatz." Fortschritte der Akustik - DAGA'82: 1017-1020.

Kulas, W., Rühl, H. W. (1982). "SYNTEX - Synthetische Sprache aus Text. Eine Übersicht." NTG-Sitzung über Sprachsynthese.

Anfang

Institute of Communication acoustics

Publications

List of publications