News

AI use makes us overestimate our cognitive performance

New research warns we shouldn’t blindly trust Large Language Models with logical reasoning –– stopping at one prompt limits ChatGPT’s usefulness more than users realise.
Left: Daniela da Silva Fernandes, right: Robin Welsch.
Daniela da Silva Fernandes on the left and Robin Welsch on the right. Photo by Matti Ahlgren.

When it comes to estimating how good we are at something, research consistently shows that we tend to rate ourselves as slightly better than average. This tendency is stronger in people who perform low on cognitive tests. It’s known as the Dunning-Kruger Effect (DKE) –– the worse people are at something the more they tend to overestimate their abilities and the “smarter” they are, the less they realise their true abilities. 

However, a study led by Aalto University reveals that when it comes to AI, specifically, Large Language Models (LLMs), the DKE doesn’t hold, with researchers finding that all users show a significant inability to assess their performance accurately when using ChatGPT. In fact, across the board, people overestimated their performance. On top of this, the researchers identified a reversal of the Dunning-Kruger Effect ––   an identifiable tendency for those users who considered themselves more AI literate to assume their abilities were greater than they really were.  

‘We found that when it comes to AI, the DKE vanishes. In fact, what’s really surprising is that higher AI literacy brings more overconfidence,’ says Professor Robin Welsch. ‘We would expect people who are AI literate to not only be a bit better at interacting with AI systems, but also at judging their performance with those systems – but this was not the case.’  

What’s really surprising is that higher AI literacy brings more overconfidence.

Robin Welsch

The finding adds to a rapidly growing volume of research indicating that blindly trusting AI output comes with risks like ‘dumbing down’ people’s ability to source reliable information and even workforce de-skilling. While people did perform better when using ChatGPT, it’s concerning that they all overestimated that performance. 

‘AI literacy is truly important nowadays, and therefore this is a very striking effect. AI literacy might be very technical, and it’s not really helping people actually interact fruitfully with AI systems’, says Welsch.  

‘Current AI tools are not enough. They are not fostering metacognition [awareness of one’s own thought processes] and we are not learning about our mistakes,’ adds doctoral researcher Daniela da Silva Fernandes. ‘We need to create platforms that encourage our reflection process.’ 

The article was published on October 27th in the journal Computers in Human Behavior. 

Why a single prompt is not enough 

The researchers designed two experiments where some 500 participants used AI to complete logical reasoning tasks from the US’s famous Law School Admission Test (LSAT). Half of the group used AI and half didn’t. After each task, subjects were asked to monitor how well they performed –– and if they did that accurately, they were promised extra compensation. 

‘These tasks take a lot of cognitive effort. Now that people use AI daily, it’s typical that you would give something like this to AI to solve, because it’s so challenging’, Welsch says. 

Current AI tools are not enough. They are not fostering metacognition and we are not learning about our mistakes

Daniela da Silva Fernandes

The data revealed that most users rarely prompted ChatGPT more than once per question. Often, they simply copied the question, put it in the AI system, and were happy with the AI’s solution without checking or second-guessing. 

‘We looked at whether they truly reflected with the AI system and found that people just thought the AI would solve things for them. Usually there was just one single interaction to get the results, which means that users blindly trusted the system. It’s what we call cognitive offloading, when all the processing is done by AI’, Welsch explains.  

This shallow level of engagement may have limited the cues needed to calibrate confidence and allow for accurate self-monitoring. Therefore, it’s plausible that encouraging or experimentally requiring multiple prompts could provide better feedback loops, enhancing users’ metacognition, he says. 

So what’s the practical solution for everyday AI users? 

‘AI could ask the users if they can explain their reasoning further. This would force the user to engage more with AI, to face their illusion of knowledge, and to promote critical thinking,’ da Silva Fernandes says.

  • Updated:
  • Published:
Share
URL copied!

Read more news

The new ultrasonic needle allows for two to three times the quantity of tissue to be sampled comparative to current needle biopsy methods. Photo: Kalle Kataila, Aalto University.
Press releases Published:

New ultrasonic needle yields samples 2–3 times larger, potentially reshaping cancer diagnostics

Developed at Aalto University over several years, a new ultrasonic needle for tumour diagnostics has been trialled in collaboration with Helsinki University Hospital (HUS). According to the resulting peer-reviewed study, salivary gland tumours could be diagnosed with far greater precision using the innovative needle.
Band performing on stage, singer in bright pink skirt, guitarist in black, crowd lights twinkling behind
Cooperation, Press releases, Research & Art Published:

Music industry stakeholders: the industry’s value will double by 2040 through large-scale equality initiatives

The industry aims to establish a self-regulatory body and double the value of the music industry, as outlined in the report “An Equal Music Industry in Finland by 2040”, to be published 11 May.
Designs for a Cooler Planet
Press releases Published:

Research becomes real-world solutions in autumn exhibition – Designs for a Cooler Planet showcases work by researchers and students

Aalto University’s largest annual exhibition, Designs for a Cooler Planet, will point the way to the future this autumn. It will feature more than 20 practical solutions, experiments and ideas from researchers and students.
Collage of workshops, group photos and presentations from the first year of the Aalto Inventors programme.
Cooperation, Research & Art Published:

Aalto Inventors turns one: A year of bridging research and real-world impact

Aalto Inventors marks its first anniversary, having engaged 190 researchers across six cohorts in fields including AI, quantum, and biomaterials. New cohorts are planned for the next academic year, stay tuned and join the waitlist.