News

How to tackle scarce data and high dimensionality in machine learning? Juho Piironen brings novel techniques to predictive modeling

Doctoral candidate, MSc Juho Piironen, studied Bayesian statistical inference in supervised learning problems and proposed novel techniques to account for uncertainty and computational cost in predictive model construction and feature selection.
Juho Piironen_photo by Matti Ahlgren

Researchers in many different fields encounter research problems in which data are scarce, but the number of features is large. An illustrative example is gene expression datasets where expressions of specific genes are either high or low for cancer samples and vice versa for controls. Due to small sample sizes, these problems are usually characterized by high uncertainty. These are exactly the problems where Juho Piironen’s doctoral dissertation and its novel methods can help.

In his thesis, Juho applies Bayesian statistical inference methods, which have gained popularity during the recent years. The fundamental aim is to build models in which given new feature values can predict the associated target variable as accurately as possible (e.g. specific gene variations predict prevalence of cancer), but also to find out which features are relevant for the prediction. Bayesian methods provide a systematic framework accounting for uncertainty, but they also allow incorporation of prior information into the model.

Despite all these benefits, Bayesian inference can be computationally demanding. To mitigate this burden, Juho developed techniques that can scale to a large number of features and improve feature selection and prediction accuracy. “The most valuable contributions of my thesis are clearly methodological, but I could think of several potential application areas in both research and industry where these methods can be of great utility”, Juho says.

During his studies at Aalto University, Juho majored in applied mathematics, but selected computational science as his minor. His growing interest in machine learning led into a Master’s thesis project and finally expanded into doctoral thesis. Juho’s supervisor is Professor Aki Vehtari, who is currently a co-leader in the Probabilistic Machine Learning research group.

”The topic of my doctoral dissertation is rather narrow in the broad field of machine learning, but I have learned a lot about the field and research in general”, Juho describes his four years of doctoral studies. While finalizing his thesis, Juho worked on his open source code, which took several months. “The work was intensive but very valuable as it allows a wider research community to make use of these methods and opens new avenues for further methodological development as well”, Juho explains.

Juho hopes he will be able to work with machine learning also in the future. He has continued working in the field, but is now focusing on neural networks at Curious AI company. “Nowadays it is also possible to do research and publish while working in industry. At the moment it is just rewarding to work with the topics close to my interests and expertise”, Juho concludes.

MSc (Tech.) Juho Piironen will defend the dissertation "Bayesian Predictive Inference and Feature Selection for High-Dimensional Data" on Friday 24 May 2019 at 12 noon in Aalto University School of Science, lecture hall T2, Konemiehentie 2, Espoo. 

Link to the electronic version of the thesis

Link to Lectio Praecursoria in the defense

Contact information[email protected]

Update on 9.11.2020:

Juho Piironen has received national doctoral dissertation award (Tietotekniikan tutkimussäätiön väitöskirjapalkinto) for his dissertation "Bayesian Predictive Inference and Feature Selection for High-Dimensional Data".

Department of Computer Science

We are an internationally-oriented community and home to world-class research in modern computer science.

Computer Science building front entrance, Aalto University, photo Matti Ahlgren / Tietotekniikan talon sisäänkäynti, Aalto-yliopisto

Probabilistic Machine Learning Group (external link)

We develop new methods for probabilistic modeling, Bayesian inference and machine learning. Our current focuses are in particular learning from multiple data sources, Bayesian model assessment and selection, approximate inference and information visualization.

PML research group, Aalto University Computer Science
  • Published:
  • Updated:

Read more news

Six soil samples with germinating seeds showing various stages of sprouting in different soil types.
Cooperation, Research & Art, Studies Published:

New life for construction waste wood

In Finland, almost all waste wood from construction ends up being burned, even though it could play a major role in the development of ecological growing media. New research shows that steam-treated waste wood can replace peat as a growing medium and offers promising results in plant germination and root growth.
A group photo of pre-incubator teams on Marsio Stage at the Aalto Creative Demo Day event.
Campus, Cooperation, Research & Art Published:

Aalto Creative Demo Day: Inspiring Pitches and the Future of Creative Industry Incubation

Pre-incubator participants from Aalto Digital Creatives pitched their startup ideas on Marsio Stage. Aalto Digital Creatives will continue as the Aalto Creatives program in 2025.
Kulkutunnisteen lukija
Campus, Incidents, University Published:

Short interruptions in the operation of access tags and doors

A system update will cause the access tags and doors on Otaniemi campus 16.-20.12.2024
Illustration of van Der Waals materials in moiré
Research & Art Published:

Twisted multiferroics discovery makes new kinds of topological spintronic devices possible

Research into moiré patterns in two-dimensional van der Waals materials provides a new way to engineer quantum matter that may lead to more energy-efficient memory storage and improved quantum technologies