News

Artificial intelligence produces data synthetically to help treat diseases like COVID-19

The ability to produce data synthetically makes studying of the COVID-19 disease significantly easier.
Violetti- ja beigesävyinen kuvituskuva, jossa näkyy ihmisiä ja numeroita
The possibility to produce synthetic data solves many problems and helps develop for example better treatment methods. Illustration: Matti Ahlgren / Aalto University

Data driven technologies and 'big data' are revolutionizing many industries. However, in many areas of research – including health and drug development – there is too little data available due to its sensitive nature and the strict protection of individuals. When data are scarce, the conclusions and predictions made by researchers remain uncertain, and the coronavirus outbreak is one of these situations.

'When a person gets sick, of course, they want to get the best possible care. Then it would be important to have the best possible methods of personalized healthcare available,' says Samuel Kaski, Academy Professor and the Director of the Finnish Center for Artificial Intelligence FCAI.

However, developing such methods of personalized healthcare requires a lot of data, which is difficult to obtain because of ethical and privacy issues surrounding the large-scale gathering of personal data. 'For example, I myself would not like to give insurance companies my own genomic information, unless I can decide very precisely what the insurance company will do with the information,' says Professor Kaski.

To solve this issue, researchers at FCAI have developed a new machine learning-based method that can produce research data synthetically. The method can be useful in helping develop better treatments and to understand the COVID-19 disease, as well as in other applications. The researchers recently released an application based on the method that allows academics and companies to share data with each other without compromising the privacy of the individuals involved in the study.

Many industries want to protect their own data so that they do not reveal trade secrets and inventions to their competitors. This is especially true in drug development, which requires lots of financial risk. If pharmaceutical companies could share their data with other companies and researchers without disclosing their own inventions, everyone would benefit.

When researchers have synthetic data, we start understanding the COVID-19 better

The ability to produce data synthetically solves these problems. In their previous study, which is currently being peer-reviewed, FCAI researchers found that synthetic data can be used to draw as reliable statistical conclusions as the original data. It allows researchers to conduct an indefinite number of analyses while keeping the privacy of the individuals involved in the original experiment secure.

The application that was published at the end of June works like this: The researcher enters the original data set into the application, from which the application builds the synthetic dataset. They can then share their data to other researchers and companies in a secure way.

The application was released on the fastest possible schedule so that researchers investigating the Coronavirus pandemic would have access to it as early as possible. Researchers are further improving the application, to make it easier to use and add other functionality. 'There are still many things we don’t know about the new coronavirus: for example, we do not know well enough what the virus causes in the body and what the actual risk factors are. When researchers have synthetic data, we start understanding these things better,' says Kaski.

FCAI researchers are now working on a project in which they use synthetic data to construct a model that, based on certain biomarkers, predicts whether a test subject’s coronavirus test is positive or negative. Biomarkers can be for example certain types of molecules, cells, or hormones that indicate a disease.

'The original data set with which we do this has been publicly available. Now we are trying to reproduce the results of the original research with the help of synthetic data and build a predictive model from the synthetic data that was achieved in the original research,' explains Joonas Jälkö, doctoral researcher at Aalto University.

The research conducted at FCAI is funded by the Academy of Finland.

More information

Samuel Kaski
Director, Finnish Artificial Intelligence Center FCAI
Academy Professor
Tel. +358503058694
[email protected]

Joonas Jälkö
Doctoral Researcher, Aalto University
Tel. +358405830937
[email protected]

Preprint version of the research article

Link to the software

The website of the Finnish Center for Artificial Intelligence

  • Published:
  • Updated:

Read more news

Dave - Open house
Press releases Published:

Open demo-day in DAVE of Aalto Behavioral Laboratory

Open house event for Dynamic Audio Visual Environment of Aalto Behavioral Laboratory on 7th of May 2024, 13:00-16:00
The WAVE technique developed by the researchers is based on anticipating future movement, such as a turn. Picture: Markus Laatta
Press releases Published:

Researchers develop a new way to instruct dance in Virtual Reality

The researchers started by experimenting with visualisation techniques familiar from previous dance games. But after several prototypes and stages, they decided to try out the audience wave, familiar from sporting events, to guide the dance.
Researchers designed an algorithm that controls the direction of the air nozzle with two motors.
Press releases, Research & Art Published:

Scientists harness the wind as a tool to move objects

New approach allows contactless or remote manipulation of objects by machines or robots.
Min-Kyu Paek
Appointments Published:

Min-Kyu Paek has been appointed Assistant Professor at the Department of Chemical and Metallurgical Engineering

Min-Kyu Paek has been appointed Assistant Professor at the Department of Chemical and Metallurgical Engineering