Tamás Grósz

Project Employee
Project Employee
T412 Department of Information and Communications Engineering
Full researcher profile
https://research.aalto.fi/...
Phone number
+358451711834

Honors and awards

The ACM Multimedia 2023 Computational Paralinguistics Challenge Prize

We have received the prize for winning the Emotion Share sub-challenge. More info at http://www.compare.openaudio.eu/winners/
Invitation or ranking in competition Speech Recognition Jan 2023

Research groups

  • Speech Recognition, Project Employee

Publications

Comparison and analysis of new curriculum criteria for end-to-end ASR

Georgios Karakasidis, Mikko Kurimo, Peter Bell, Tamás Grósz 2024 Speech Communication

Collecting Linguistic Resources for Assessing Children's Pronunciation of Nordic Languages

Anne Marte Haug Olstad, Anna Smolander, Sofia Strömbergsson, Sari Ylinen, Minna Lehtonen, Mikko Kurimo, Yaroslav Getman, Támas Grosz, Xinwei Cao, Torbjørn Svendsen, Giampiero Salvi 2024 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings

From Raw Speech to Fixed Representations: A Comprehensive Evaluation of Speech Embedding Techniques

Dejan Porjazovski, Tamas Grosz, Mikko Kurimo 2024 IEEE/ACM Transactions on Audio Speech and Language Processing

Principled Comparisons for End-to-End Speech Recognition: Attention vs Hybrid at the 1000-hour Scale

Aku Rouhe, Tamás Grósz, Mikko Kurimo 2024 IEEE/ACM Transactions on Audio, Speech, and Language Processing

Listening like a speech-training app: Expert and non-expert listeners’ goodness ratings of children’s speech

Sofia Strömbergsson, Molly Fröjdh, Magdalena Pettersson, Tamás Grósz, Yaroslav Getman, Mikko Kurimo 2024 Clinical Linguistics and Phonetics

INVESTIGATING THE CLUSTERS DISCOVERED BY PRE-TRAINED AV-HUBERT

Anja Virkkunen, Guangpu Huang, Tamas Grosz, Mikko Kurimo 2024 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Proceedings

Developing an AI-assisted Low-resource Spoken Language Learning App for Children

Yaroslav Getman, Nhan Phan, Ragheb Al-Ghezi, Ekaterina Voskoboinik, Mittul Singh, Tamas Grosz, Mikko Kurimo, Giampiero Salvi, Torbjorn Svendsen, Sofia Strombergsson, Anna Smolander, Sari Ylinen 2023 IEEE Access

Multi-task wav2vec2 Serving as a Pronunciation Training System for Children

Yaroslav Getman, Ragheb Al-Ghezi, Tamas Grosz, Mikko Kurimo 2023 9th Workshop on Speech and Language Technology in Education (SLaTE)

Investigating wav2vec2 context representations and the effects of fine-tuning, a case-study of a Finnish model

Tamas Grosz, Yaroslav Getman, Ragheb Al-Ghezi, Aku Rouhe, Mikko Kurimo 2023 Proceedings of Interspeech 2023

Discovering Relevant Sub-spaces of BERT, Wav2Vec 2.0, ELECTRA and ViT Embeddings for Humor and Mimicked Emotion Recognition with Integrated Gradients

Tamás Grósz, Anja Virkkunen, Dejan Porjazovski, Mikko Kurimo 2023 MuSe '23: Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and Workshop: Mimicked Emotions, Humour and Personalisation