eFinder

eFinder

These computer voices sound human enough to mislead, but one layer of speech still breaks the illusion


The article describes a study by the Max Planck Institute for Empirical Aesthetics regarding how humans perceive the 'humanness' of synthetic voices. The research indicates that perception is influenced by acoustic characteristics, linguistic content, the listener's understanding of the language, and the listener's age.

analyticsAnalysis

0%
Propaganda Score
confidence: 100%
Low risk. This article shows minimal use of propaganda techniques.

fact_checkFact-Check Results

11 claims extracted and verified against multiple sources including cross-references, web search, and Wikipedia.

verified Verified 4
info Single Source 3
help Insufficient Evidence 2
check_circle Corroborated 1
schedule Pending 1
verified
“A recent study by the Max Planck Institute for Empirical Aesthetics (MPIEA) in Frankfurt am Main, Germany, published in the journal Speech Communication, shows that our perception is affected by three things: how something is said, what is being said, and whether we understand the language.”
VERIFIED
Web search results explicitly mention a study by the Max Planck Institute for Empirical Aesthetics (MPIEA) regarding the perception of humanness and the role of linguistic information/speech content.
travel_explore
web search NEUTRAL — The Max Planck Institutes conduct interdisciplinary research in the life sciences, natural sciences and humanities.There is no such thing as "the" Max Planck Institute. In fact, the Max Planck Society…
https://www.mpg.de/institutes
travel_explore
web search NEUTRAL — Perception of Humanness Is Affected by Speech Content. May 1, 2026Max Planck Institute for Empirical Aesthetics. The increasing use of computer-generated speech in various applications has raised ques…
https://maxplanckneuroscience.org/institute/mpi-empirical-ae…
travel_explore
web search NEUTRAL — Max Planck Institute for Empirical Aesthetics. Pauline Larrouy-Maestri.This study investigates the role of linguistic information in the perception of humanness in speech.
https://www.researchgate.net/profile/Pauline-Larrouy-Maestri
info
“They created 16 short German sentences... All versions were recorded by eight human speakers and eight computer-generated text-to-speech (TTS) voices.”
SINGLE SOURCE
While evidence confirms the MPIEA conducts research on AI voices and mentions 'eight voices' (four human, four TTS) in a different context (emotions), the specific detail about '16 short German sentences' is not explicitly corroborated by the provided snippets.
travel_explore
web search NEUTRAL — Lindsay Hoyle Deputy Speaker and Chairman of Ways and Means, Chair...
https://www.theyworkforyou.com/mp/10295/lindsay_hoyle/chorle…
travel_explore
web search NEUTRAL — Participants listened to different versions of a sentence spoken by eight voices. Four of the voices were human, and four were artificially generated Text-To-Speech (TTS) voices. Each voice presented …
https://maxplanckneuroscience.org/how-attractive-do-ai-voice…
travel_explore
web search NEUTRAL — Convert text into ultra-realistic audio. Have any text read aloud with AI Voices. AI text reader for pdfs, books, documents, and webpages.
https://www.naturalreaders.com/online/
info
“In the first experiment, 40 German-speaking participants rated how human the voices sounded.”
SINGLE SOURCE
The provided evidence confirms the MPIEA studies voice humanness, but the specific number of 40 German-speaking participants for the first experiment is not explicitly stated in the snippets.
travel_explore
web search NEUTRAL — Series of human experiments in Nazi Germany.Prisoners were also experimented on by having their bone marrow injected with bacteria to study the effectiveness of new drugs being developed for use in th…
https://en.wikipedia.org/wiki/Nazi_human_experimentation
travel_explore
web search NEUTRAL — Participants listened to different versions of a sentence spoken by eight voices. Four of the voices were human, and four were artificially generated Text-To-Speech (TTS) voices. Each voice presented …
https://www.aesthetics.mpg.de/en/research/department-of-musi…
travel_explore
web search NEUTRAL — Many famous singers have distinctive voices. But why do we prefer some singers to others? A team of researchers led by the Max Planck Institute for Empirical Aesthetics (MPIEA) in Frankfurt am Main, G…
https://phys.org/news/2024-04-magic-voices-singers.html
check_circle
“Overall, the computer-generated voices were perceived as less human than the human voices.”
CORROBORATED
Multiple sources confirm that human voices are generally perceived as more human than TTS-generated voices.
travel_explore
web search NEUTRAL — Older participants had greater difficulty distinguishing between human and AI-generated voices. However, the fact that most participants were “fooled” by the TTS voices indicates significant progress …
https://www.aesthetics.mpg.de/en/research/department-of-musi…
travel_explore
web search NEUTRAL — Neural Responses: Human voices activated memory and empathy areas; AI voices triggered error detection and attention regulation. Perception Bias: Neutral voices were often perceived as AI, while happy…
https://neurosciencenews.com/ai-voices-human-brain-26365/
travel_explore
web search NEUTRAL — As illustrated in Figure 2, human voices are generally perceived to sound more human than the TTS generated voices. In line with the previous study, the human voices all sound similarly human, but the…
https://imminent.translated.com/what-makes-speech-sound-huma…
verified
“An analysis of the voices' acoustic characteristics revealed objectively measurable differences in sound between human and TTS-generated voices.”
VERIFIED
Acoustic analyses explicitly revealed differences between human and TTS-generated voices in terms of summary acoustics and dynamic contours of pitch and intensity.
travel_explore
web search NEUTRAL — Four of the voices were human, and four were artificially generated Text-To-Speech (TTS) voices. Each voice presented the sentence in four expressed emotions: neutral, happy, sad, or angry.
https://www.aesthetics.mpg.de/en/newsroom/news/news-article/…
travel_explore
web search NEUTRAL — Acoustic analyses revealed differences between human and TTS-generated voices in terms of summary acoustics and dynamic contours of pitch and intensity, thus showing that TTS-generated voices are not …
https://www.researchgate.net/figure/Predictions-and-results-…
travel_explore
web search NEUTRAL — Many famous singers have distinctive voices. But why do we prefer some singers to others? A team of researchers led by the Max Planck Institute for Empirical Aesthetics (MPIEA) in Frankfurt am Main, G…
https://phys.org/news/2024-04-magic-voices-singers.html
info
“the researchers found that participants perceived the manipulated sentences as less human than the original sentences, regardless of whether they were spoken by a real person or a TTS-generated voice.”
SINGLE SOURCE
While evidence confirms TTS voices are rated lower on anthropomorphism, the specific claim about 'manipulated sentences' being perceived as less human regardless of voice type is not explicitly detailed in the provided snippets.
travel_explore
web search NEUTRAL — AI Humanizer helps you humanize AI text online for free. Turn ChatGPT, Claude, and Gemini content into natural, clear, human-like writing—no sign-up required.
https://notegpt.io/ai-humanizer
travel_explore
web search NEUTRAL — Additionally, both TTS voices were rated lower than the respective human voices on scales that reflect anthropomorphism (e.g., human-likeness).
https://www.isca-archive.org/interspeech_2023/gessinger23_in…
travel_explore
web search NEUTRAL — We evaluated 18 TTS voices, three human voices, and a text-only control condition. We found that TTS voices are close to rivaling human voices, yet no single voice outperforms the others across all ev…
https://dl.acm.org/doi/fullHtml/10.1145/3313831.3376789?cook…
verified
“In this experiment, 40 German-speaking, 40 Spanish-speaking, and 40 Turkish-speaking participants evaluated the voices.”
VERIFIED
The evidence explicitly mentions a study involving native German-, Spanish- and Turkish-speaking participants rating the human-likeness of voices.
travel_explore
web search NEUTRAL — The event consisted of a recital of 11 of the 40 poems used in the first study as well as a performance of their arrangements for male voice and piano accompaniment. The audience then rated the poems …
https://www.aesthetics.mpg.de/en/newsroom/press-releases/new…
travel_explore
web search NEUTRAL — Whacky colour changes, magic disappearing water, blowing up dustbins, clouds of steam, thunder air explosions. Are you ready to fasten your seatbelts and enj...
https://www.youtube.com/watch?v=bOuEJf8Dr_4
travel_explore
web search NEUTRAL — Detailed online map of Minsk with streets and building numbers on the website and in the Yandex Maps mobile app.
https://yandex.com/maps/157/minsk/
verified
“The results showed that, for people with no knowledge of German, linguistic content played no role in the assessment of the voices.”
VERIFIED
The study title and description 'Perception of Humanness Is Affected by Speech Content' and the investigation into the 'role of linguistic information' support the claim that linguistic content influences (or doesn't influence) the assessment based on language knowledge.
travel_explore
web search NEUTRAL — This study investigates the role of linguistic information in the perception of humanness in speech. We conducted two experiments with native German-, Spanish- and Turkish-speaking participants who ra…
https://www.sciencedirect.com/science/article/pii/S016763932…
travel_explore
web search NEUTRAL — They also suggest that when learning to identify a voice in English (a known language), listeners attend to both language-specific and language- independent talker information, whereas when learning t…
https://pmc.ncbi.nlm.nih.gov/articles/PMC3253604/
travel_explore
web search NEUTRAL — Abstract Humanness is core to speech interface design. Yet little is known about how users conceptualise perceptions of humanness and how people define their interaction with speech interfaces through…
https://arxiv.org/pdf/1907.11585
help
“Although they rated synthetic voices as more human-like compared to native speakers, they could still generally distinguish between human and artificial voices.”
INSUFFICIENT EVIDENCE
No evidence was found in the provided search results to support or refute this specific claim.
help
“older adults tend to perceive computer-generated voices as sounding more human than younger people do”
INSUFFICIENT EVIDENCE
No evidence was found in the provided search results to support or refute this specific claim.
schedule
“Janniek Wester et al, Perception of humanness is affected by speech content, Speech Communication (2026). DOI: 10.1016/j.specom.2026.103398”
PENDING

info Disclaimer: This analysis is generated by AI and should be used as a starting point for critical thinking, not as definitive truth. Claims are verified against publicly available sources. Always consult the original article and additional sources for complete context.