Mikko Kurimo

Full Professor
Full Professor
T412 Department of Information and Communications Engineering

Mikko Kurimo received his M.Sc., Lic.Tech and D.Sc.(Tech.) degrees from Helsinki University of Technology in 1992, 1994 and 1997. In his PhD thesis he developed neural networks based machine learning for automatic speech recognition (ASR). Since then he has been working as a research scientist at IDIAP, a Swiss research centre for artificial intelligence and visited as an international research fellow in a number of research groups specialized in machine learning and ASR including University of Colorado in Boulder, University of Edinburgh, SRI in Stanford, ICSI in Berkeley and NITech in Nagoya. At Aalto University Professor Kurimo has been the head of the automatic speech recognition group since his return from Switzerland in 2000. His work is internationally best known for unsupervised subword language modeling for morphologically complex languages such as Finnish, Estonian, Turkish and Arabic.  His recent achievements include the winning of the 2017 multi-genre broadcast speech recognition challenge and success in Tekes Challenge Finland competition (348 competing projects) and in EC's H2020-ICT-2017 call (115 competing projects). His research interests include deep learning methods for automatic speech recognition and spoken language modeling.

Full researcher profile
https://research.aalto.fi/...

Contact information

Phone number
+358503476221

Areas of expertise

AIRC, Multimodal Interfaces, COIN, Mobster, Simple4All, EMIME, Speech recognition

Honors and awards

The ACM Multimedia 2023 Computational Paralinguistics Challenge Prize

We have received the prize for winning the Emotion Share sub-challenge. More info at http://www.compare.openaudio.eu/winners/
Invitation or ranking in competition Speech Recognition Jan 2023

MGB3 2017: Multi Genre Broadcast challenge for recognizing Arabic dialect speech

First place in the Speech Recognition part of the Arabic Multi-Genre Broadcast Challenge 2017 (MGB3)
Invitation or ranking in competition Centre of Excellence in Computational Inference, COIN Jan 2017

ISCA Award for the best student paper of Interspeech 2011

ISCA Award for the best student paper of Interspeech 2011: Hannu Pulakka, Ulpu Remes, Santeri Yrttiaho, Kalle Palomäki, Mikko Kurimo, and Paavo Alku, “Low-Frequency Bandwidth Extension of Telephone Speech Using Sinusoidal Synthesis and Gaussian Mixture Model”.
Award or honor granted for a specific work Dept Signal Process and Acoust Jan 2011

Professeur Invité 2005-2006 at Université de Saint-Etienne Université de Saint-Etienne, France

Award or honor granted for a specific work Dept Signal Process and Acoust Jan 2006

International Short Visit Fellowship Award The Royal Society, Iso-Britannia

Award or honor granted for a specific work Dept Signal Process and Acoust Jan 2004

Research groups

  • Speech Recognition

Publications

Principled Comparisons for End-to-End Speech Recognition: Attention vs Hybrid at the 1000-hour Scale

Aku Rouhe, Tamás Grósz, Mikko Kurimo 2024 IEEE/ACM Transactions on Audio, Speech, and Language Processing

Automatic Rating of Spontaneous Speech for Low-Resource Languages

Ragheb Al-Ghezi, Yaroslav Getman, Ekaterina Voskoboinik, Mittul Singh, Mikko Kurimo 2023 2022 IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings

Automatic Speaking Assessment of Spontaneous L2 Finnish and Swedish

Ragheb Al-Ghezi, Ekaterina Voskoboinik, Yaroslav Getman, Anna Von Zansen, Heini Kallio, Mikko Kurimo, Ari Huhta, Raili Hildén 2023 Language Assessment Quarterly

Developing an AI-assisted Low-resource Spoken Language Learning App for Children

Yaroslav Getman, Nhan Phan, Ragheb Al-Ghezi, Ekaterina Voskoboinik, Mittul Singh, Tamas Grosz, Mikko Kurimo, Giampiero Salvi, Torbjorn Svendsen, Sofia Strombergsson, Anna Smolander, Sari Ylinen 2023 IEEE Access

Multi-task wav2vec2 Serving as a Pronunciation Training System for Children

Yaroslav Getman, Ragheb Al-Ghezi, Tamas Grosz, Mikko Kurimo 2023 9th Workshop on Speech and Language Technology in Education (SLaTE)

Investigating wav2vec2 context representations and the effects of fine-tuning, a case-study of a Finnish model

Tamas Grosz, Yaroslav Getman, Ragheb Al-Ghezi, Aku Rouhe, Mikko Kurimo 2023 Proceedings of Interspeech 2023

Discovering Relevant Sub-spaces of BERT, Wav2Vec 2.0, ELECTRA and ViT Embeddings for Humor and Mimicked Emotion Recognition with Integrated Gradients

Tamás Grósz, Anja Virkkunen, Dejan Porjazovski, Mikko Kurimo 2023 MuSe '23: Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and Workshop: Mimicked Emotions, Humour and Personalisation

Non-game like training benefits spoken foreign-language processing in children with dyslexia

Katja Junttila, Anna Riikka Smolander, Reima Karhila, Mikko Kurimo, Sari Ylinen 2023 FRONTIERS IN HUMAN NEUROSCIENCE

Multilingual TTS Accent Impressions for Accented ASR

Georgios Karakasidis, Nathaniel Robinson, Yaroslav Getman, Atieno Ogayo, Ragheb Al-Ghezi, Ananya Ayasi, Shinji Watanabe, David R. Mortensen, Mikko Kurimo 2023 Text, Speech, and Dialogue - 26th International Conference, TSD 2023, Proceedings

A pronunciation Scoring System Embedded into Children’s Foreign Language Learning Games with Experimental Verification of Learning Benefits

Reima Karhila, Sari Ylinen, Anna-Riikka Smolander, Aku Rouhe, Ragheb Al-Ghezi, Yaroslav Getman, Tamas Grosz, Maria Uther, Mikko Kurimo 2023 9th Workshop on Speech and Language Technology in Education (SLaTE)