List of publications
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
K
Karbasi, M., Zeiler, S., Freiwald, J., & Kolossa, D. (2019). Toward Robust Mispronunciation Detection via Audio-Visual Speech Recognition. In I. Rojas, G. Joya, & A. Catala (Eds.), LNCS sublibray: SL1 - Theoretical computer science and general issues: Vol. 11506. Advances in Computational Intelligence: 15th International Work-Conference on Artificial Neural Networks, IWANN 2019, Gran Canaria, Spain, June 12-14, 2019, Proceedings, Part I (Vol. 11507, pp. 655–666). Springer. (https://doi.org/10.1007/978-3-030-20518-8_54)
Freiwald, J., Karbasi, M., Zeiler, S., Melchior, J., Kompella, V., Wiskott, L., Kolossa, D., (2018). “Utilizing Slow Feature Analysis for Lipreading”, in Speech communication: 13. ITG-Fachtagung Sprachkommunikation 10.- 12. Oktober 2018 in Oldenburg, ed. Simon Doclo and Peter Jax, 191–95. ITG-Fachbericht 282. Berlin: VDE VERLAG.
Karbasi, M., Abdelaziz, A. H., Meutzner H., Kolossa, D. (2016), "Blind Non-Intrusive Speech Intelligibility Prediction using Twin-HMMs", Proc. INTERSPEECH, San Francisco, USA, September 2016.
Karbasi, M., Abdelaziz, A. H., Kolossa, D. (2016). “Twin-HMM-Based Non-Intrusive Speech Intelligibility Prediction”, in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP 2016: March 20-25, 2016 Shanghai, China, 624–28: IEEE.
Karbasi, M., Kolossa, D. (2015). “A Microscopic Approach to Speech Intelligibility Prediction using Auditory Models”, Proc. DAGA, Nürnberg, March 2015.
Anfang
Kesselheim, M. (1990). "Computergestützte
Konstruktion großer Wortklassensysteme." Institut
für Kommunikationsakustik, Ruhr-Universität Bochum.
Dissertation, Universitätsverlag Dr. N. Brockmeyer, D-Bochum
Kesselheim, M. (1988). "Arbeiten zur
linguistischen Analyse europäischer Sprachen." BIGTECH.
Kesselheim, M., Kugler-Kruse, M., Gerlach,
H., Hellingrath, B., Schreiber, P. (1987). "Benutzersprache
- Sprache der Maschinen." Vortrag im Rahmen der VDI-Tagung
Herausforderung der Gesellschaft durch den technischen Wandel,
Braunschweig.
Anfang
Kirchner, M., Hudde, H. (1995). "Ein Verfahren
zur individuellen Schätzung der Parameter des Gehörgangs."
Fortschritte der Akustik - DAGA 95: 227-230.
Anfang
Kleingarn, D., Nabizadeh, N., Heckmann, M., & Kolossa, D. (2019). Speaker-adapted neural-network-based fusion for multimodal reference resolution. In Proceedings of the 20th Annual SIGdial Meeting on Discourse and Dialogue (pp. 210–214). Association for Computational Linguistics. (https://www.aclweb.org/anthology/W19-5925)
Knohl, L. (1997). "Prosodiegesteuerte Sprecher-
und Umweltadaption in einer Mehrsprecher-Architektur zur machinellen
Spracherkennung (Prosody Driven Speaker - and Environment
Adaptation in a Multi-Speaker Speech -Recognition Architecture)."
Institut für Kommunikationsakustik, Ruhr-Universität
Bochum. Dissertation, VDI Verlag, D-Düsseldorf
Knohl, L. (1994). "Ein Cochlea- und
Haarzellenmodell zur physiologischen Sprachvorverarbeitung."
Statusseminar Neuroinformatik des BMFT, Hrsg.: Projektträger
Informationstechnik des BMFT bei der Dt. Forschungsanstalt
für Luft- und Raumfahrt (DLR): 383-388.
Knohl, L., Rinscheid, A. (1994). "Verfahren
zur gegenseitigen Abbildung von Merkmalsräumen."
Deutsches Patent, Aktenzeichen DE 43 00 159 A1.
Knohl, L., Rinscheid, A. (1993). "Speaker
normalization and adaptation based on feature map projection."
EUROSPEECH'93. 1: 367-370.
Knohl, L., Rinscheid, A. (1993). "Speaker
normalization with self-organizing feature maps." Proc.
of the Int. Joint Conference on Neural Networks (IJCnn) 1:
243-246.
Knohl, L., Rinscheid, A. (1993). "Verfahren
zur kontinuierlichen Merkmalsadaptaion mittels selbstorganisierender
topologie-erhaltender Merkmalskarten." Fortschritte der
Akustik - DAGA'93 1004-1007.
Anfang
D. Orth, N. Steinhardt, B. Bolder, M. Dunn, D. Kolossa, M. Heckmann:
"Analysis of a Speech-Based Intersection Assistant in Real Urban
Traffic", accepted for publication," Proc. IEEE Intelligent
Transportation Systems Conference (ITSC), Maui, Hawaii, USA, Nov. 2018.
J. Freiwald, M. Karbasi, S. Zeiler, J. Melchior, V. Kompella, L. Wiskott, D. Kolossa: "Utilizing Slow Feature Analysis for Lipreading," ITG Conference on Speech Communication, Oldenburg, Germany, Oct. 2018.
M. Heckmann, D. Orth, D. Kolossa: "Gap after the next two vehicles": A Spatio-temporally Situated Dialog for a Cooperative Driving Assistant," ITG Conference on Speech Communication, Oldenburg, Germany, Oct. 2018.
C. Schymura, T. Isenberg, D. Kolossa: "Extending linear dynamical systems with dynamic stream weights for audiovisual speaker localization," International Workshop on Acoustic Signal Enhancement (IWAENC), Tokyo, Sept. 2018.
C. Schymura, P. Guo, Y. Maymon, B. Rafaely, D. Kolossa: "Exploiting Structures of Temporal Causality for Robust Speaker Localization in Reverberant Environments," best student paper award, Proc. LVA/ICA, Guildford, UK, July 2018.
C. Schymura, D. Kolossa: "Potential-field-based active exploration foracoustic simultaneous localization and mapping," ICASSP 2018, Calgary, Canada, April 2018.
Orth, D., Bolder, B., Steinhardt, N., Dunn, M., Kolossa, D., Heckmann, M. (2018). "A Speech-Based On-Demand Intersection Assistant Prototype", 2018 IEEE Intelligent Vehicles Symposium (IV), Changshu, China, June 26-29 2018.
Watanabe, S., Kolossa, D., Virtanen, T. (2018). "Application of Source
Separation to Robust Speech Analysis and Recognition", in E. Vincent, T.
Virtanen, and S. Gannot (ed.) "Audio Source Separation and Speech
Enhancement," John Wiley & Sons, 2018.
Heckmann, M., Orth, D., Wersing, H., Kolossa, D. (2017). "Development of a personalised intersection assistant", ATZ Worldw (2017) 119: 36.
Heckmann, M., Wersing, H., Orth, D., Kolossa, D., Schömig, N., Maag, C., Dunn, M. (2017). "Towards an On Demand Intersection Assistant: Initial User Acceptance and System Development", 2017 FAST-zero ’17, Nara, Japan, September 18-22 2017.
Meutzner, H., Gupta, S., Nguyen, V.-H., Holz, T., Kolossa, D. (2017). "Towards Improved Audio CAPTCHAs Based on Auditory Perception and Language Understanding", ACM Transactions on Information and System Security, vol. 19, no. 4, February 2017.
Meutzner, H., Ma, N., Nickel, R., Schymura, C., Kolossa, D. (2017). "Improving
audio-visual speech recognition using deep neural networks with dynamic
stream reliability estimates", Proc. ICASSP, New Orleans, March 2017.
Orth, D., Kolossa, D., Sarria Paja, M., Schaller, K., Pech, A., Heckmann, M. (2017). "A Maximum Likelihood Method for Driver-Specific Critical-Gap Estimation", 2017 IEEE Intelligent Vehicles Symposium (IV), Redondo Beach, CA, USA, 11-14 June 2017.
Orth, D., Schömig, N., Mark, C., Jagiellowicz-Kaufmann, M., Kolossa, D., Heckmann, M. (2017). "Benefits of Personalization in the Context of a Speech-Based Left-Turn Assistant", 9th ACM International Conference on Automotive User Interfaces and Interactive Vehicular Applications (AutomotiveUI ’17), Oldenburg, Germany, September 24 - 27, 2017.
Orth, D., Kolossa, D., Heckmann, M. (2017). "Predicting driver left-turn behavior from few training samples using a maximum a posteriori method", 2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC), Yokohama, Japan, October 16-19.
Rafaely, B., Kolossa, D., Maymon, Y. (2017). "Towards acoustically robust
localization of speakers in a reverberant environment," Proc. HSCMA, San
Francisco, March 2017.
Rafaely, B., Kolossa, D. (2017)."Speaker localization in reverberant rooms
based on direct path dominance test statistics", Proc. ICASSP, New
Orleans, March 2017.
Schymura, C., Rios Grajales, J., Kolossa, C. (2017). "Monte Carlo exploration for
active binaural localization", Proc. ICASSP, New Orleans, March 2017.
Schönherr, L., Orth, D., Heckmann, M., Kolossa, D. (2017). "Environmentally Robust
Audio-Visual Speaker Identification", Proc. IEEE Workshop on Spoken
Language Technology (SLT 2016), San Diego, USA, 13–16 December 2016
Boenninghoff, B., Nickel, R. M., Zeiler, S., Kolossa, D. (2016). "Unsupervised Classification of Voiced Speech and Pitch Tracking Using Forward-Backward Kalman Filtering", Proc. Speech Communication, 12. ITG Symposium, Paderborn, Germany, 2016.
Gergen, S., Zeiler, S., Hussen Abdelaziz, A., Kolossa, D. (2016). "New Insights into Turbo-Decoding-Based AVSR with Dynamic Stream Weights", ITG-Fachtagung Sprachkommunikation, Paderborn, Germany, Oct. 2016.
Hussen Abdelaziz, A., Kolossa, D. (2016). “General Hybrid Framework for
Uncertainty-decoding-based Automatic Speech Recognition Systems”, accepted for publication, Speech Communication, 2016.
Jacobi, R., vom Bögel, G., Grabmaier, A., Kolossa, D. (2016). “Analysis of Time
Variant Reliability Information Used in a Multilevel Decoding Scheme for
RFID and Sensor Signals”, accepted for publication in International
Journal for RF Technologies: Research and Applications 2016.
Karbasi, M., Abdelaziz, A. H., Kolossa, D. (2016). “Twin-HMM-Based Non-Intrusive Speech Intelligibility Prediction”, in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP 2016: March 20-25, 2016 Shanghai, China, 624–28: IEEE.
Schönherr, L., Orth, D., Heckmann, M., Kolossa, D. (2016). "Environmentally Robust Audio-Visual Speaker Identification", IEEE Spoken Language Technology Workshop (SLT), San Diego, USA, December 2016.
Schönherr, L., Zeiler, S., Kolossa, D. (2016). "Spoofing Detection via Simultaneous Verification of Audio-Visual Synchronicity and Transcription", IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Okinawa, Japan, December 2017.
Taghia, J., Kolossa, D., Martin, R. (2016). „ALE for robots! A single-channel approach to robot self-noise cancellation”, IEEE International Workshop on Acoustic Signal Enhancement (IWAENC), S. 1–5, 2016.
Saeidi, R., Astudillo, R., Kolossa, D. (2015). “Uncertain LDA: Including observation uncertainties in discriminative transforms”, IEEE Trans. on Pattern Analysis and Machine Intelligence, Sept. 2015.
Hussen Abdelaziz, A., Zeiler, S., Kolossa, D. (2015). “Learning Dynamic Stream Weights For Coupled-HMM-based Audio-visual Speech Recognition”, IEEE Trans. Audio Speech and Language Processing, vol. 23, no. 5, pp. 863-876, May 2015.
Hussen Abdelaziz, A., Watanabe, S., Hershey, J., Vincent, E., Kolossa, D. (2015). “Uncertainty Propagation through Deep Neural Networks,” in Proc. Interspeech, Dresden, Germany, September 2015.
Schymura, C., Winter, F., Kolossa, D., Spors, S. (2015). “Binaural Sound Source Localisation and Tracking using a dynamic Spherical Head Model", in Proc. Interspeech, Dresden, Germany, September 2015.
Jacobi, R., vom Bögel, G., Kolossa, D. (2015). “Multilevel Decoding Scheme for RFID and Sensor Signals in Inductively Coupled Systems”, Proc. of the European Conference on Smart Objects, Systems and Technologies (SmartSysTech), Aachen, Germany, June 2015.
Jacobi, R., Süss, A., vom Bögel, G., Kolossa, D. (2015). “Determination of the Optimal Carrier Frequency in Harsh Environments by Parameter Estimation”, Proc. of the European Conference on Smart Objects, Systems and Technologies (SmartSysTech), Aachen, Germany, June 2015.
Jacobi, R., Süss, A., vom Bögel, G., Kolossa, D. (2015). “Carrier Frequency Adaptation Approach”, Proc. IEEE International Conference on RFID, San Diego, USA, April 2015.
Karbasi, M., Kolossa, D. (2015). “A Microscopic Approach to Speech Intelligibility Prediction using Auditory Models”, Proc. DAGA, Nürnberg, March 2015.
Meutzner, H., Gupta, S., Kolossa, D. (2015). "Constructing Secure Audio CAPTCHAs by Exploiting Differences between Humans and Machines", Proc. ACM Conference on Human Factors in Computing Systems (CHI), Seoul, Korea, April 2015.
Raake, A., Blauert, J., Braasch, J., Brown, G., Danés, P., Dau, T., Gas, B., Argentieri, S., Kohlrausch, A., Kolossa, D., Le Goff, N., May, T., Obermayer, K. Spors, S. (2014). "Two!Ears – integral interactive model of auditory perception and experience”, Fortschr. Akust., DAGA 2014, Dtsch. Ges. Akustik, D−Oldenburg.
Blauert, J., Kolossa, D., Danès, P. (2014). “Feedback loops in engineering models of binaural listening”, ACTA ACUSTICA (submitted).
Darnstädt, M., Meutzner, H., Kolossa, D. (2014). "Reducing the Cost of Breaking Audio CAPTCHAs
by Active and Semi-Supervised Learning", Proc. ICMLA, December
2014.
Heckmann, M., Mikias, P., Kolossa, D. (2014). "The Impact of Word Alignment Accuracy on Audiovisual
Word Prominence Detection", Proc. ITG Fachtagung Sprachkommunikation, September 2014.
Hussen Abdelaziz, A., Kolossa, D. (2014). "Dynamic Stream Weight Estimation in Coupled-HMMbased
Audio-visual Speech Recognition Using Multilayer Perceptrons", Proc. Interspeech,
September 2014.
Hussen Abdelaziz, A., Zeiler, S., Kolossa, D. (2014). "A new EM estimation of dynamic stream
weights for coupled-HMM-based audio-visual ASR", Proc. ICASSP, Florence, May 2014.
Jacobi, R., Grey, S., vom Bögel, G., Kolossa, D. (2014). "Digitally Controlled Analog Front End for Inductively Coupled Transponder Systems", Proc. IEEE RFID Technology and Applications,
September 2014.
Kolossa, D. (2014). "Narrowing the gap: Probabilistic interfaces for signal enhancement and pattern
recognition", accepted for publication, Proc. IEEE GlobalSIP - Machine Learning Applications
in Speech Processing, December 2014.
Meutzner, H., Nguyen, V. H., Holz, T., Kolossa, D. (2014). "Using Automatic Speech Recognition for
Attacking Acoustic CAPTCHAs: The Trade-off between Usability and Security", Proc. ACSAC, December 2014.
Schmid, D., Enzner, G., Malik, S., Kolossa, D., Martin, R. (2014). "Variational Bayesian Inference for
Multichannel Dereverberation and Noise Reduction", IEEE Trans. on Audio, Speech and
Language Processing, vol. 22, no. 8, pp. 1320-1335, Aug. 2014.
Schymura, C., Ma, N., Brown, G. J., Walther, T., Kolossa, D. (2014). "Binaural Sound Source Localisation using a Bayesian-network-based Blackboard System and Hypothesis-driven Feedback", in: Proc. FORUM ACUSTICUM 2014, PL-Krakow.
Zeiler, S., Cwiklak, J., Kolossa, D. (2014). "Robust Multimodal Human Machine Interaction using the Kinect Sensor", Proc. ITG Fachtagung Sprachkommunikation, September 2014.
Astudillo, F. R., Kolossa, D., Abad, A., Zeiler, S., Saeidi, R., Mowlaee, R., da Silva Neto, J. P., Martin, R. (2013). "Integration of Beamforming and Uncertainty-of-Observation Techniques for Robust ASR in Multi-Source Environments", Computer Speech and Language, Special Issue on Multisource Environments, vol. 27, no. 3, pp. 837-850, May 2013.
Blauert, J., Kolossa, D., Obermayer, K., Adiloglu, K. (2013). "Further challenges and the road ahead", in J. Blauert (ed.), The technology of binaural listening, Springer, Berlin-Heidelberg-New York, 2013.
Blauert, J., Kolossa, D., Obermayer, K., Adiloglu, K. (2013). “Further challenges and the road
ahead”, in J. Blauert (ed.) The technology of binaural listening, Springer, Berlin-Heidelberg-
New York, 2013.
Hussen Abdelaziz, A., Zeiler, S., Kolossa, D. (2013). ''Using Twin-HMM-Based Audio-Visual Speech Enhancement as a Front-End for Robust Audio-Visual Speech Recognition'', Proc.
Interspeech, Lyon, France, August 2013.
Hussen Abdelaziz, A., Zeiler, S., Kolossa, D. (2013). ''Twin-HMM-based audio-visual speech enhancement'', Proc. ICASSP, Vancouver, Canada, May 2013.
Hussen Abdelaziz, A., Zeiler, S., Kolossa, D., Leutnant, V., Haeb-Umbach, R. (2013). ''GMM-based significance decoding'', in
Proc. ICASSP, 2013.
Kohlrausch, A., Braasch, J., Kolossa, D., Blauert, J. (2013). "An introduction to binaural
processing”, in J. Blauert (ed.) The technology of binaural listening, Springer, Berlin- Heidelberg-New York, 2013.
Kolossa, D., Zeiler, S., Saeidi, R., Astudillo, F. R. (2013). “Noise-Adaptive LDA: A New Approach for
Speech Recognition Under Observation Uncertainty", IEEE Signal Processing Letters, vol. 20,
no. 11, pp. 1018-1021, 2013.
Meutzner, H., Malik, S., Kolossa, D. (2013). "SVM-Based Preprocessing for Automatic Speech Recognition", AIA/DAGA.
Meutzner, H., Schlesinger, A., Zeiler, S., Kolossa, D. (2013). "Binaural Signal Processing for Enhanced Speech Recognition Robustness in Complex Listening Environments", Proc. 2nd
CHiME Workshop on Machine Listening in Multisource Environments, Vancouver, Canada,
June 2013.
Nickel, R, Astudillo, F. R., Kolossa, D., Martin, R. (2013). “Corpus-Based Speech
Enhancement with Uncertainty Modeling and Cepstral Smoothing”, IEEE Trans. on Audio,
Speech and Language Processing, vol. 21, no. 5, May 2013.
Martin, R., Kolossa, D. (2012). “Voice activity detection, noise estimation, and adaptive filters for acoustic signal enhancement”, in: T. Virtanen, R. Singh, B. Raj (eds.): “Techniques for Noise Robustness in Automatic Speech Recognition”, John Wiley & Sons, September 2012.
Hussen Abdelaziz, A., Kolossa, D. (2012). "Decoding of Uncertain Features Using the Posterior Distribution of the Clean Data for Robust Speech Recognition", International Speech Communication Association, (2012).
Hussen Abdelaziz, A., Zeiler, S., Kolossa, D. (2012). "Audio-Visual Speech Recognition for Uncertain Acoustical Observations", ITG Fachtagung Sprachkommunikation, (2012).
Nickel, R., Astudillo, R., Kolossa, D., Zeiler, S., Martin, R. (2012). "Inventory-Style Speech Enhancement with Uncertainty-of- Observation Techniques",
ICASSP, pp. 4645-4648, Kyoto, Japan, March 2012.
Schmid, D., Thüne, P., Kolossa, D., Enzner, G. (2012). "Dereverberation Preprocessing and Training Data Adjustments for Robust Speech Recognition in Reverberant Environments," in Proc. ITG Conference Speech Communication, Braunschweig, Germany, Sep. 2012.
Astudillo, F. R., Kolossa, D. (2011). “Uncertainty Propagation”, in: Kolossa, D., Haeb-Umbach, R. (eds.): „Robust Speech Recognition of Uncertain or Missing Data - Theory and Applications“, Springer Verlag, pp. 35-64, July 2011.
Hoffmann, E., Kolossa, D., Orglmeister, R. (2011). “Recognition of Multiple Speech Sources using ICA”, in: Kolossa, D., Haeb-Umbach, R. (eds.): „Robust Speech Recognition of Uncertain or Missing Data - Theory and Applications“, Springer Verlag, pp. 319-344, July 2011.
Vorwerk, A., Zeiler, S., Kolossa, D., Astudillo, F. R., Lerch, D. (2011). “Use of Missing and Unreliable Data for Audiovisual Speech Recognition”, in: Kolossa, D., Haeb-Umbach, R. (eds.): „Robust Speech Recognition of Uncertain or Missing Data - Theory and Applications“, Springer Verlag, pp. 345-375, July 2011.
Kolossa, D. (2011). “High-Level Processing of Binaural Features”, Proc. Forum Acusticum, Aalborg, Denmark, June 27-July 1, 2011.
Kolossa, D., Astudillo, F. R., Abad, A., Zeiler, S., Saeidi, R., Mowlaee, P., da Silva Neto, J.P., Martin, R. (2011). “CHiME Challenge: Approaches to Robustness using Beamforming and Uncertainty-of-Observation Techniques”, in Proc. CHiME Workshop on Machine Listening in Multisource Environments, Florence, Italy, Sept.1, 2011.
Kolossa, D., Haeb-Umbach, R. (2011) (eds.): „Robust Speech Recognition of Uncertain or Missing Data - Theory and Applications“, Springer Verlag, 380 pages, July 2011.
Kolossa, D. (2011). „Einführung in die automatische Spracherkennung“, TU Berlin, 155 pages, 2010.
Astudillo, F. R., Kolossa D., Philipp Mandelartz, P., Orglmeister, R. (2010). "An Uncertainty Propagation Approach to Robust ASR using the ETSI Advanced Front-End", IEEE Journal of Selected Topics in Signal Processing, Special issue on Natural Interaction with Intelligent Environments, vol. 4, pp. 824 – 833, October 2010.
Kohl, F., Wübbeler, G., Kolossa, D., Bär, M., Orglmeister R., Elster, C. (2010). "Shifted factor analysis for the separation of evoked dependent MEG signals”, Phys. Med. Biol., vol. 55, pp. 4219–4230, 2010.
Kolossa, D., Astudillo, F. R., Hoffmann, E., Orglmeister, R. (2010). "Independent Component Analysis and Time-Frequency Masking for Multi-Speaker-Recognition“, EURASIP Journal on Audio, Speech, and Music Processing. vol. 2010, Article ID 651420, 13 pages, 2010.
Kolossa, D., Astudillo, F. R., Zeiler, S. , Vorwerk, A., Lerch, D., Chong, J., Orglmeister, R. (2010). “Missing Feature Audiovisual Speech Recognition under Real-Time Constraints”, ITG Fachtagung Sprachkommunikation, paper 22, 4 pages, Bochum, Germany, October 6-8, 2010.
Kolossa, D., Chong, J., Zeiler, S., Keutzer, K. (2010). “Efficient Manycore CHMM Speech Recognition for Audiovisual and Multistream Data”, Proc. Interspeech 2010, pp. 2698 – 2701, Makuhari, Japan, September 26-30, 2010.
Kohl, F. , Wübbeler, G., Kolossa, D., Elster, C., Bär, M., Orglmeister, R.(2010). "Noise adjusted PCA for finding the subspace of evoked dependent signals from MEG data”, Latent Variable Analysis and Signal Separation (LVA 2010), Lecture Notes in Computer Science, vol. 6365, pp. 442-449, September 2010.
Vorwerk, A., Wang, X., Kolossa, D., Zeiler, S., Orglmeister, R. (2010). "WAPUSK20 - A Database for
Robust Audiovisual Speech Recognition", Proc. 7th Int. Conf. on International Language
Resources and Evaluation (ELREC), pp. 3016 – 3019, 2010.
Astudillo, F. R., Kolossa, D., Orglmeister, R. (2009). "Accounting for the Uncertainty of Speech Estimates in the Complex Domain for Minimum Mean Square Error Speech Enhancement", Interspeech 2009, Brighton, UK, September 2009.
Kolossa, D., Zeiler, S., Vorwerk, A., Orglmeister, R.(2009). "Audiovisual Speech Recognition with Missing or Unreliable Data", Audiovisual Speech Processing Workshop (AVSP 2009), Brighton, UK, September 10-13, 2009.
Jeub, M., Kolossa, D., Astudillo, F. R., Orglmeister, R. (2009). "Performance Analysis of Wavelet-based Voice Activity Detection", invited paper, Proc. DAGA2009, pp. 407-408, Rotterdam, March 2009.
Kohl, F., Wübbeler, G., Kolossa, D., Elster, C., Bär, M., Orglmeister, R. (2009). "Non-Independent BSS: A Model for Evoked MEG Signals with Controllable Dependencies" in: Proceedings of the ICA 2009, pp. 443-450, Paraty, Brazil, March 15-18, 2009.
Hoffmann, E., Kolossa, D., Orglmeister, R. (2009). "Time Frequency Masking Strategy for Blind Source Separation of Acoustic Signals Based on Optimally-Modified LOG-Spectral Amplitude Estimator" in: Proceedings of the ICA 2009, pp. 581-588, Paraty, Brazil, March 15-18, 2009.
Astudillo, F. R., Kolossa, D., Orglmeister, R. (2008). "Uncertainty Propagation for Speech Recognition using RASTA Features in Highly Nonstationary Noisy Environments", ITG Fachtagung Sprachkommunikation, Aachen, October 2008.
Kohl, F., Wübbeler, G., Kolossa, D., Orglmeister, R. , Elster, C., Bär, M. (2008). "Performance of ICA for MEG data generated from subspaces with dependent sources", Proc. European Biomedical Engineering Congress (EMBEC), Antwerpen, Nov. 2008.
Kolossa, D., Hoffmann, E., Orglmeister, R. (2008). "ICA-Based Bayesian Time-Frequency Masking", invited paper, ITG Fachtagung Sprachkommunikation, Aachen, October 2008.
Kohl, F., Wübbeler, G., Sander, T., Trahms, L., Kolossa, D. , Orglmeister, R., Elster, C. and Bär, M. (2008). " Performance of ICA for Dependent Sources using Synthetic Stimulus Evoked MEG Data", invited paper, Workshop Biosignalverarbeitung, pp. 32-35, Potsdam, July 2008.
Kolossa, D., Araki, S. , Delcroix, M., Nakatani, T., Orglmeister, R., Makino, S. (2008). „Missing Feature Speech Recognition in a Meeting Situation with Maximum SNR Beamforming”, invited paper, Proc. ISCAS, pp. 3218-3221, Seattle, WA, May 2008.
Astudillo, F. R. , Kolossa, D. , Orglmeister, R. (2007). “Propagation of Statistical Information Through Non-Linear Feature Extractions For Robust Speech Recognition”, Proc. MaxEnt2007, pp. 245-252, 27th Int. Workshop on Bayesian Inference and Maximum Entropy Methods in Science and Engineering, Saratoga Springs, July 2007.
Hoffmann, E., Kolossa, D., Orglmeister, R. (2007). „A Soft Masking Strategy based on Multichannel Speech Probability Estimation for Source Separation and Robust Speech Recognition”, Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2007), pp. 118-121, New Paltz, NY.
Hoffmann, E., Kolossa, D., Orglmeister, R. (2007). “A Batch Algorithm for Blind Source Separation of Acoustic Signals Using ICA and Time-Frequency Masking”, Proc. ICA 2007, pp. 480-487, Springer Verlag, Berlin.
Kolossa, D., Astudillo, F. R., Orglmeister, R. (2007). “Spracherkennung im Automobil durch Verwendung von Missing Feature Techniken“, invited paper, Proc. DAGA 2007, pp. 301-302.
Kolossa, D., Sawada, H., Astudillo, F. R., Orglmeister, R., Makino, S. (2006). „Recognition of convolutive speech mixtures by missing feature techniques for ICA“, invited paper, in: Proc. 40th Asilomar Conference on Signals, Systems and Computers, pp. 1397-1401, October 29 - November 1, Pacific Grove, USA, 2006.
Maraboina, S., Kolossa, D., Bora, P., Orglmeister, R. (2006). „Multi-Speaker Voice Activity Detection using ICA and Beampattern Analysis“, in: Proc. Eusipco 2006, September 4-8, Florence, Italy.
Kolossa, D., Klimas, A., Baumann, W., Orglmeister, R. (2006). „Robuste Erkennung gestörter Sprache im Automobil durch MMSE-Störgeräuschunterdrückung und Missing-Data Spracherkennung“, invited paper, in: Proc. Daga 2006, March 20 - 23, Braunschweig, Germany.
Kolossa, D., Klimas, A., Orglmeister, R.(2005). „Separation and Robust Recognition of Noisy, Convolutive Speech Mixtures using Time-Frequency Masking and Missing Data Techniques“, in: Proceedings of the WASPAA 2005, pp. 82-85, New Paltz, NY, USA, October 16-19, 2005.
Kolossa, D., Orglmeister, R.(2004). „Nonlinear Postprocessing for Blind Speech Separation“, in: Proceedings of the ICA 2004, pp. 832–839, Granada, Spain, September 22-24, 2004.
Kolossa, D., Huo, Q. (2002). „Using Time-Stretched Pulses for Accurate Splitting of Speech Utterances Played Back in Noisy Reverberant Environments“, in: Proceedings of the International Conference on Spoken Language Processing (ICSLP), pp.1541-1544, September 16-20 2002, Denver, CO, USA.
Baumann, W. , Köhler, B.-U., Kolossa, D., Orglmeister, R. (2001). „Real Time Separation of Convolutive Mixtures“, in: Proceedings of the ICA2001, pp.65-69, San Diego, California, USA.
Kolossa, D., Köhler, B.-U., Conrath, M., Orglmeister, R. (2001). „Optimal Permutation Correction by Multiobjective Genetic Algorithms”, in: Proceedings of the ICA2001, pp. 373-378, San Diego, California, USA.
Kolossa, D., Grübel, G. (2000). „Evolutionary Computation and Nonlinear Programming in Multi-Model Robust Control Design“, In: Lecture Notes on Computer Science vol. 1803, pp. 147-157, Stefano Cagnoni et al. (Eds.), Springer Verlag, Berlin, Heidelberg, New York, 2000.
Kolossa, D. (2008). "Independent Component Analysis for Environmentally Robust Speech Recognition”, PhD Thesis, TU Berlin, 2008.
Baumann, W., Kolossa, D., Orglmeister, R. (2004). „Frequenzvariantes Beamforming zur Sprechertrennung im KFZ“, German Patent Number 10312065.3, March 2004.
Anfang
Korany, N. (2004). "Computer Modelling
and Auralisation of Sound Fields in Rooms- An Overview."
Invited paper for ICA 2004, Kyoto, Japan.
Korany, N., Onsy, A. A., Blauert, J. (2001).
"Acoustic Simulation of Rooms with Boundaries of Partially
Specular Reflectivity." J. Appl. Acoust. 62: 875-887.
Korany, N., Blauert, J., Abdel Alim, O. (2000).
"Acoustic Simulation of Rooms with Boundaries of Partially
Specular Reflectivity." Applied Acoustics.
Korany, N. (2000). "A Model for the
Simulation of Sound Fields in Enclosures: Integrating the
Geometrical and the Radiant Approaches (Ein Modell zur Simulation
des Schallfeldes in geschlossenen Räumen: Integration
des geometrischen und des Radiosity-Ansatzes)." University
of Alexandria. Dissertation
Anfang
Köster, S. (2000). "Akustisch-phonetische
Aspekte von Lombard-Sprache für verschiedene Sprechstile."
DAGA 2000.
Köster, S., Pörschmann, Ch., Walter,
J. (2000). "Eine Datenbank für deutsche Sprache
mit Lombard-Effekt." Fortschritte der Akustik - DAGA
2000, DEGA in Fortschritte der Akustik - DAGA 2000: 356-357.
Köster, S., Mersdorf, J. (1999). "Intelligibility
Enhancement of Synthetic Speech Heard via Telephone in a Noisy
Environment." ACUSTICA/acta acustica 85: 1, 166.
Anfang
Kraft, V. (1997). "Verkettung natürlichsprachlicher
Bausteine zur Sprachsynthese: Anforderungen, Techniken und
Evaluierung." Institut für Kommunikationsakustik,
Ruhr-Universität Bochum. Dissertation, VDI Verlag, D-Düsseldorf
Kraft, V., Portele, T. (1995). "Quality
Evaluation of Five Speech Synthesis Systems for German."
Acta Acustica 3: 351-365.
Kraft, V. (1994). "Concatenation of
synthesis units: instrumental and auditive evaluation."
Proc. of the Workshop Speech Quality Assessment: Standards,
Perspectives, Applications: 45-46.
Kraft, V. (1994). "Does the Resulting
Speech Quality Improvement Make a Sophisticated Concatenation
of Time-Domain Synthesis Units Worthwhile?" Proc. of
the 2nd ESCA/IEEE Workshop on Speech Synthesis: 65-68.
Kraft, V. (1994). "Untersuchung qualitätsmindernder
Faktoren bei der Erzeugung synthetischer Sprache." Fortschritte
der Akustik - DAGA'94: 1329-1332.
Kraft, V. (1993). "Auditory Detection
of Discontinuities in Synthesis-by-Concatenation." EUROSPEECH'93:
929-932.
Kraft, V. (1993). "Automatische Extraktion
und Segmentierung von Halbsilben als Sprachsynthesebausteine."
Fortschritte der Akustik - DAGA'93: 1036-1039.
Kraft, V., Rinscheid, A. (1992). "Automatische
Markierung von Grundperioden bei Sprachsignalen - mit und
ohne Auswertung eines Laryngogramms." Fortschritte der
Akustik - DAGA'92 633-636.
Anfang
Krebber, J., Drullman, R., Eneman, K., Huber, R.,
Jekosch, U., Luts, H., Martin, R. (2006). "Enhancement
of Telephone Situations for Hearing Impaired"
, Proceedings
of the Conference and Workshop on Assistive Technologies for
Vision and Hearing Impairment Technology for Inclusion, M.A.
Hersh (ed), 2006.
Pegam, R., Krebber, J., Jekosch, U. (2005). „Benutzererwartungen im Zusammenhang mit dem Sprachdialogsystem INSPIRE“, DAGA 2005, München, pp. 251-252.
Krebber, J., Möller, S., Pegam, R., Raake, A.,
Melichar, M., Rajman, M. (2004). "Wizard of Oz Tests
for a Dialogue System for Smart Homes." in: Proc. 7.
Congrès Français d'Acoustique / 30. Dtsch. Jahrestg.
f. Akust. (CFA/DAGA'04), F-Strassburg, 1149-1150.
Krebber, J., Raake, A., Möller, S. (2003).
"Modellierung der Einflüsse des Übertragungskanals
bei der Interaktion mit einem Smart-Home-System." Fortschritte
der Akustik - DAGA 2003: Plenarvortr. u. Fachbeitr. d. 29.
Dtsch. Jahrestg. f. Akust., D-Aachen, Dtsch. Ges. Akust.,
D-Oldenburg: 732-733.
Krebber, J., Raake, A., Möller, S.,
Novo, P. (2003). "Remote Access Simulation System User
Manual, Deliverable 3.3, IST project INSPIRE (INfotainment
management with SPeech Interaction via REmote-microp>hones
and telephone interfaces, IST-2001-32746)." Institut
für Kommunikationsakustik, Ruhr-Universität, D-Bochum.
Krebber, J., Möller, S., Raake, A.,
Rehmann, S., Berger, J., Johannsen, W. (2002). "Ein Simulationssystem
zur Untersuchung des Einflusses von Übertragungskanälen
bei Smart-Home-Anwendungen." Elektronische Sprachsignalverarbeitung:
Tagungsband der 13. Konferenz. R. Hoffmann, w.e.b. Universitätsverlag:
75-82.
Krebber, J. (2002). "Wavelet Transformation
zur Signal-Rauschabstandsverbesserung von Ausiosignalen."
Fortschritte der Akustik - DAGA 2002, Dtsch. Ges. Akustik,
DEGA e.V., CD-ROM.
Krebber, J., S. Möller, et al. (2004).
"Speech Input and Output: Module Assessment for Remote
Access to a Smart-Home Spoken Dialog System." Proc. 8th
Int. Conf. on Spoken Language Processing (Interspeech 2004
- ICSLP), KR-Jeju Island, Vol. 4, 2753-2756.
Anfang
Kugler, M., Kulas, W., Rühl, H. W. (1985).
"Endgraphemuntersuchungen für die Satzanalyse in
einem Vorleseautomaten." Fortschritte der Akustik - DAGA'85:
583-586.
Kugler-Kruse, M., Posmyk, R. (1987). "Methods
for the Simulation of Natural Intonation in the 'SYRUB' Text-to-speech
System for Unrestricted German Text." European Conference
on Speech Technology Vol. 2: pp. 177-180.
Kugler-Kruse, M., Jekosch, U., Kulas, W.
(1986). "Erweiterung der universellen Phonemsynthesizer-Schnittstelle
für marktübliche Sprachsynthetisatoren." Fortschritte
der Akustik - DAGA'86: 557-560.
Anfang
Kulas, W., Kugler-Kruse, M., Jekosch, U., Kesselheim,
M. (1986). "Ergonomische Gesichtspunkte bei Vorleseautomaten:
Benutzerschnittstelle und Prosodiesteuerung im Bochumer SYRUB-Programmsystem
zur Umsetzung von deutschem Schrifttext in Lautschrift mit
Prosodieinformation." NTG-Fachberichte 94, Sprachkommunikation:
198-202.
Kulas, W., Blauert, J. (1986). German text-to-phoneme
software drives any speech synthesizer. Report Speech Tech
'86, Media Dimensions Inc.
Kulas, W., Kugler-Kruse, M. (1986). "Sprechrhythmussteuerung
im Bochumer Vorleseautomaten." Fortschritte der Akustik
- DAGA'86: 553-556.
Kulas, W., Rühl, H. W. (1985). Sprachsynthese.
Zur Synthese von natürlich gesprochener Sprache aus Texten
und Konzepten. Germanistische Linguistik. 79-80: 128-138.
Kulas, W., Rühl, H. W. (1984). Ein autonomes
Sprachausgabegerät mit unbegrenztem Wortschatz, Tagungsbericht
der Siemens-Anwenderkreis-Tagung. Dortmund.
Kulas, W., Rühl, H. W. (1984). "Eine
einfache Satzintonation für ein Sprachausgabesystem mit
unbegrenztem Wortschatz." Fortschritte der Akustik -
DAGA'84: 821-824.
Kulas, W., Rühl, H. W. (1984). Über
die Erzeugung einer Satzintonation, Sprachsynthese. G. Olms
Verlag.
Kulas, W., Rühl, H. W. (1982). "Satzzerlegung
für ein Sprachausgabesystem mit unbegrenztem Wortschatz."
Fortschritte der Akustik - DAGA'82: 1017-1020.
Kulas, W., Rühl, H. W. (1982). "SYNTEX
- Synthetische Sprache aus Text. Eine Übersicht."
NTG-Sitzung über Sprachsynthese.
Anfang