News

Artificial intelligence produces data synthetically to help treat diseases like COVID-19

The ability to produce data synthetically makes studying of the COVID-19 disease significantly easier.
Violetti- ja beigesävyinen kuvituskuva, jossa näkyy ihmisiä ja numeroita
The possibility to produce synthetic data solves many problems and helps develop for example better treatment methods. Illustration: Matti Ahlgren / Aalto University

Data driven technologies and 'big data' are revolutionizing many industries. However, in many areas of research – including health and drug development – there is too little data available due to its sensitive nature and the strict protection of individuals. When data are scarce, the conclusions and predictions made by researchers remain uncertain, and the coronavirus outbreak is one of these situations.

'When a person gets sick, of course, they want to get the best possible care. Then it would be important to have the best possible methods of personalized healthcare available,' says Samuel Kaski, Academy Professor and the Director of the Finnish Center for Artificial Intelligence FCAI.

However, developing such methods of personalized healthcare requires a lot of data, which is difficult to obtain because of ethical and privacy issues surrounding the large-scale gathering of personal data. 'For example, I myself would not like to give insurance companies my own genomic information, unless I can decide very precisely what the insurance company will do with the information,' says Professor Kaski.

To solve this issue, researchers at FCAI have developed a new machine learning-based method that can produce research data synthetically. The method can be useful in helping develop better treatments and to understand the COVID-19 disease, as well as in other applications. The researchers recently released an application based on the method that allows academics and companies to share data with each other without compromising the privacy of the individuals involved in the study.

Many industries want to protect their own data so that they do not reveal trade secrets and inventions to their competitors. This is especially true in drug development, which requires lots of financial risk. If pharmaceutical companies could share their data with other companies and researchers without disclosing their own inventions, everyone would benefit.

When researchers have synthetic data, we start understanding the COVID-19 better

The ability to produce data synthetically solves these problems. In their previous study, which is currently being peer-reviewed, FCAI researchers found that synthetic data can be used to draw as reliable statistical conclusions as the original data. It allows researchers to conduct an indefinite number of analyses while keeping the privacy of the individuals involved in the original experiment secure.

The application that was published at the end of June works like this: The researcher enters the original data set into the application, from which the application builds the synthetic dataset. They can then share their data to other researchers and companies in a secure way.

The application was released on the fastest possible schedule so that researchers investigating the Coronavirus pandemic would have access to it as early as possible. Researchers are further improving the application, to make it easier to use and add other functionality. 'There are still many things we don’t know about the new coronavirus: for example, we do not know well enough what the virus causes in the body and what the actual risk factors are. When researchers have synthetic data, we start understanding these things better,' says Kaski.

FCAI researchers are now working on a project in which they use synthetic data to construct a model that, based on certain biomarkers, predicts whether a test subject’s coronavirus test is positive or negative. Biomarkers can be for example certain types of molecules, cells, or hormones that indicate a disease.

'The original data set with which we do this has been publicly available. Now we are trying to reproduce the results of the original research with the help of synthetic data and build a predictive model from the synthetic data that was achieved in the original research,' explains Joonas Jälkö, doctoral researcher at Aalto University.

The research conducted at FCAI is funded by the Academy of Finland.

More information

Samuel Kaski
Director, Finnish Artificial Intelligence Center FCAI
Academy Professor
Tel. +358503058694
samuel.kaski@aalto.fi

Joonas Jälkö
Doctoral Researcher, Aalto University
Tel. +358405830937
joonas.jalko@aalto.fi

Preprint version of the research article

Link to the software

The website of the Finnish Center for Artificial Intelligence

  • Updated:
  • Published:
Share
URL copied!

Read more news

Assistant Professor Patrick Fleming
Appointments Published:

Meet Patrick Fleming, assistant professor of structural and architectural engineering

Fleming believes making the most of existing buildings is key to reaching a sustainable future.
Two women standing side by side, one in a grey sweater and the other in a dark blazer with a white shirt.
Appointments Published:

Sara Hulkkonen and Johanna Wartio start as Data Agents at the School of ARTS

Aalto Open Research Network has new members, Sara Hulkkonen and Johanna Wartio. Their aim is to support data management practices at the School of ARTS.
A person wearing a beige sweater and necklace stands indoors by a window with a forest view.
Awards and Recognition Published:

Research into physics of microscopically tiny organisms lands prestigious prize

Physics Professor Matilda Backholm received this year’s Väisälä award, handed out by the Finnish Academy of Sciences and Letters.
Three white, circular lace patterns on a black background, each with a unique geometric design.
Research & Art Published:

Smart textiles are reshaping our understanding of materials – and interspecies communication

The PAST-A-BOT research project, funded by the European Research Council (ERC), is developing soft, intelligent textiles that could one day function as rescue robots, sound-sensing agricultural fabrics, or assistive clothing. At the same time, the project aims to rethink the way we approach materials research.