Article published in "Nature Machine Intelligence": Uni Osnabrück

084/2025 Aug 18, 2025 Created by Dr. Oliver Schmidt

Publications IT & AI Psychology & Cognition

In a study published in the renowned journal Nature Machine Intelligence, a team of researchers proposes using language models from artificial intelligence to understand human vision. This is a new approach worldwide.

Background to the study: When we look at the world around us, our brain not only recognizes objects such as "dog" or "car", but also understands higher-level spatial, semantic relationships - what is happening, where it is happening and how everything fits together. This information is essential for our understanding of human vision, but until now scientists have lacked the tools to analyze these complex processes.

"Using language models to understand visual processing sounds nonsensical at first," explains Prof. Dr. Tim C. Kietzmann from Osnabrück University and co-first author of the study. "However, language models are extremely good at processing contextual information and at the same time have a semantically rich understanding of objects and actions. These are important ingredients that the visual system could also extract when confronted with natural scenes."

And indeed: linguistic scene descriptions, represented in large language models, show astonishing similarities to brain activity in the visual system while subjects look at the corresponding images in a magnetic resonance tomograph. So could it be that the task of the human brain's visual system is to process visual impressions in such a way that they are compatible with language? "It is conceivable that the brain tries to find a uniform language, a lingua franca, across different senses and languages. This would greatly simplify the exchange between brain areas," says Prof. Dr. Adrien Doerig, who is now a researcher at the FU Berlin.

The researchers went one step further: they trained artificial neural networks that can predict correct language model representations from images in a multi-stage process. These models, which process visual information in such a way that it can be decoded linguistically, can map the brain activity of the test subjects better than many of the currently leading AI models in the field.

The surprising correspondence between representations in AI language models and activation patterns in the brain is not only important for our understanding of complex semantic processing in the brain, but also points to possible ways in which AI systems can be improved in the future. Medical applications are also conceivable. The research team also succeeded in using AI to generate accurate descriptions of the images that the test subjects were looking at in the brain scanner. This mind reading points to possible improvements for brain-computer interfaces. Conversely, this new technology could one day also contribute to the development of visual prostheses for people with severe visual impairments.

About the paper: Adrien Doerig et al, High-level visual representations in the human brain are aligned with large language models, Nature Machine Intelligence (2025). DOI: 10.1038/s42256-025-01072-0 https://www.nature.com/articles/s42256-025-01072-0

Further information for editors:
Prof. Dr. Tim C. Kietzmann, Osnabrück University
Institute of Cognitive Science
tim.kietzmann@uni-osnabrueck.de

Related news

Two young women stand in front of a pinboard covered with colorful posters

Study choice & application Social Equity Natural sciences Psychology & Cognition

Participants of the Niedersachsen-Technikum visited Cognitive Science at Osnabrück University

Dec 5, 2025

Science, technology, engineering and mathematics (STEM) are exciting fields of study - but often with a low women's quota. As part of the Niedersachsen-Technikum, the participants visited the Institute of Cognitive Science at Osnabrück University.

to the press release

Research Psychology & Cognition

When chimpanzees converse with each other

Nov 26, 2025

Observations by scientists from Osnabrück University in Uganda show that chimpanzee mothers interact with their young in a rhythmic way - almost like a human conversation.

to the press release

A notebook with a graphic and hands in front of it.

Projects & studies IT & AI Law

Ethics and law of collective privacy in the data society

Nov 19, 2025

From credit scoring to predictive policing: data-based prediction models are used in many areas to estimate behavior, risks and profits. One project aims to investigate the associated risks.

to the press release

Many different elements, such as a microphone, an octopus and a robot, come together.

Event IT & AI Natural sciences

Science Slam in the student center

Nov 13, 2025

From AI and brains to dreams and octopuses: This and much more is on offer at a science slam on Thursday, November 20, at Osnabrück University.

to the press release

News list