ABOUT


ABOUT

I study where

speech loses

credibility

I study where speech loses credibility

My work explores the fracture point between AI-generated Hungarian speech and human perception. The Illusion Breaks in Hungarian is a research project examining semantic loss, timing drift, dubbing quality and speech credibility.


Through dubbing reconstruction, sync correction and ASR analysis, I examine how timing, rhythm and semantic distortion alter meaning.


Focus: speech timing, dubbing QA, ASR/TTS evaluation and Hungarian linguistic review.


Accuracy alone is not communication


Synchronization. Timing. Meaning.


Accuracy alone is not communication.

— DUBBING / TTS / SYNC QUALITY LAB

ABOUT

We rebuild

what meaning has lost.

I study where speech loses credibility

Neural dubbing, TTS evaluation and focused Hungarian adaptation for

systems where literal accuracy is not enough.


HEAR WHAT WAS MEANT →


Synchronization. Timing. Meaning.


Accuracy alone is not communication.

CASE 01FILM
BEFOREAFTER
ISSUE

Lip closure mismatch, consonant cut

FIX

Reconstructed lip closure

CASE 02SERIES
BEFOREAFTER
ISSUE

Vowel length mismatch, late mouth opening

FIX

Corrected timing & duration

CASE 03ANIMATION
BEFOREAFTER
ISSUE

Phoneme collapse, unnatural transitions

FIX

Natural phoneme transitions

CASE 04DOCUMENTARY
BEFOREAFTER
ISSUE

Breath misalignment, cut-off ending

FIX

Breath matched, natural ending

WAVEFORM
UPLOAD WAVEFORM
SPECTROGRAM
UPLOAD SPECTROGRAM
PHONEME ALIGNMENT
szeretem
szeretem
FRAME TIMING
DRIFT+62 ms
MAX ERROR112 ms
SYNC QUALITY96.3%

TIMING DRIFT

SEVERITYP2
OCCURRENCES24
IMPACTMedium
FIX APPLIED

Time-stretch & realignment

LIP-SYNC ERROR

SEVERITYP1
OCCURRENCES17
IMPACTHigh
FIX APPLIED

Phoneme-based re-synthesis

EMOTION MISMATCH

SEVERITYP2
OCCURRENCES13
IMPACTMedium
FIX APPLIED

Re-performance & tone match

VOICE CONSISTENCY

SEVERITYP3
OCCURRENCES9
IMPACTLow
FIX APPLIED

Spectral matching

MODELMOSNATURALNESSPROSODYINTELLIGIBILITYSTABILITYCOMMENTS
ElevenLabs v2 (Hungarian)4.32■■■■■■■■■■■■■■■■■■■■■■■Strong prosody, minor imbalance
Resemble Enhance4.18■■■■■■■■■■■■■■■■■■■Natural tone, slightly robotic
PlayHT 2.03.91■■■■■■■■■■■■■■■■■Good clarity, flat prosody
Google TTS (Neural2)3.62■■■■■■■■■■■■■Stable but mechanical
Azure TTS3.28■■■■■■■■■Intelligible, low expressiveness
FILM

Hungarian Lip-Sync Reconstruction

Timing correction, phoneme alignment, semantic preservation.

TTS

Neural Voice Evaluation

Hungarian prosody, intelligibility and naturalness testing.

QA

Dubbing QA Error Atlas

Error classification for localization and dubbing pipelines.

AUDIO

AUDIO

Audio-Visual

Timing & Editing

Audio-Visual Timing & Editing

Editing rhythm, sound-image synchronization

and cinematic pacing.

Editing rhythm, sound-image synchronization

and cinematic pacing.

15+ years of audiovisual work with young
classical musicians — recording, concert
documentation and promotional editing.

Focused on rhythm, pacing and perceptual
sync between sound and image.

[ VIDEO GRID ]

LIVE CONCERT RECORDING

PIANO STUDIO SESSION

MARKET MOTION STUDY

LANDSCAPE RHYTHM STUDY

SELECT A FRAME
[01] LIVE CONCERT RECORDING
[02] PIANO STUDIO SESSION
[03] MARKET MOTION STUDY
[04] TRIATHLON PACING STUDY

You either hear it or you don't








[ VIDEO GRID ]

LIVE CONCERT RECORDING

PIANO STUDIO SESSION

MARKET MOTION STUDY

LANDSCAPE RHYTHM STUDY

CLINICAL ASR FAILURE

ATLAS


CLINICAL ASR FAILURE

ATLAS


MAPPING SEMANTIC RISK IN HUNGARIAN MEDICAL SPEECH RECOGNITION

MAPPING SEMANTIC RISK IN HUNGARIAN MEDICAL SPEECH RECOGNITION

300 annotated clinical utterances

6 semantic failure classes

P1–P3 clinical severity taxonomy

Hungarian medical ASR audit

300 annotated clinical utterances

6 semantic failure classes

P1–P3 clinical severity taxonomy

Hungarian medical ASR audit

EXPLORE DATASET

EXPLORE DATASET

FAILURE CLASS I

NUMERICAL

SEMANTIC COLLAPSE

FAILURE CLASS I

NUMERICAL

SEMANTIC COLLAPSE

SOURCE: "kettő-három"

ASR: "23"

SOURCE: "kettő-három"

ASR: "23"

A clinically essential drug identifier disintegrates during speech reconstruction, introducing potential treatment risk.

A clinically essential drug identifier disintegrates during speech reconstruction, introducing potential treatment risk.

SEVERITY: P1

SEVERITY: P1

FAILURE CLASS V

NEGATION INVERSION

FAILURE CLASS V

NEGATION INVERSION

SOURCE: "vérrögoldót adnak"

ASR: "vérrögoldót NEM adnak"

SOURCE: "vérrögoldót adnak"

ASR: "vérrögoldót NEM adnak"

Clinical intent inverted.

A single inserted negation reverses treatment logic completely.

Clinical intent inverted.

A single inserted negation reverses treatment logic completely.

SEVERITY: P1

SEVERITY: P1

FAILURE CLASS I

TERMINOLOGICAL

DISINTEGRATION

FAILURE CLASS I

TERMINOLOGICAL

DISINTEGRATION

SOURCE: "Clopidogrel"

ASR: "klopi dogrel"

SOURCE: "Clopidogrel"

ASR: "klopi dogrel"

A clinically essential drug identifier disintegrates during speech reconstruction, introducing potential treatment risk.

A clinically essential drug identifier disintegrates during speech reconstruction, introducing potential treatment risk.

SEVERITY: P1

SEVERITY: P1

THE EVIDENCE BASE

300

THE EVIDENCE BASE

300

ANNOTATED CLINICAL FAILURES

ANNOTATED CLINICAL FAILURES

P1–P3 SEVERITY TAXONOMY

P1–P3 SEVERITY TAXONOMY

HUNGARIAN MEDICAL ASR AUDIT

HUNGARIAN MEDICAL ASR AUDIT

300 CLINICAL UTTERANCES

300 CLINICAL UTTERANCES

EXPLORE ATLAS

EXPLORE ATLAS

Independently built and manually annotated archive of Hungarian medical ASR failures.

ATLAS EXPLORER
FILTER ☰
SEV
CLASS
TRANSCRIPTION ERROR
IMPACT
P3
Treatment
NEGATION INVERSION
"vérhígítót adnak""vérhígítót NEM adnak"
Treatment logic reversed.
SHOWING 1–1 OF 1 LOGS
PAGE 1 OF 1
ATLAS EXPLORER
FILTER ☰
SEV
CLASS
TRANSCRIPTION ERROR
IMPACT
P3
Treatment
NEGATION INVERSION
"vérhígítót adnak""vérhígítót NEM adnak"
Treatment logic reversed.
SHOWING 1–1 OF 1 LOGS
PAGE 1 OF 1

QA / THINKING


QA / THINKING

I evaluate where
meaning survives
reconstruction

I evaluate where meaning

survives reconstruction

My QA work focuses on native Hungarian perception and semantic stability in reconstructed speech systems.


Evaluation areas:

semantic distortion: meaning shifted or lost during transcription

linguistic failure: native Hungarian patterns broken or unnatural

timing instability: speech rhythm disrupting comprehension

cognitive correction burden: extra mental effort required to restore meaning


Accuracy is not understanding

My QA work focuses on native Hungarian perception and semantic stability in reconstructed speech systems.

Evaluation areas:

semantic distortion: meaning shifted or lost during transcription

linguistic failure: native Hungarian patterns broken or unnatural

timing instability: speech rhythm disrupting comprehension

cognitive correction burden: extra mental effort required to restore meaning

Accuracy is not understanding.

P2
P1
P3
SEMANTIC STABILITY
94.2%
RECONSTRUCTION LOSS
3.1%
P2
P1
P3
SEMANTIC STABILITY
94.2%
RECONSTRUCTION LOSS
3.1%
AVAILABLE FOR
  • Lip-Sync & Dubbing Synchronization
  • Speech Timing Optimization
  • Dubbing QA Review
  • TTS Quality Evaluation
  • ASR Evaluation

2026 PentimentoArt — Concept, design & development by Orsolya Beata Eszlári • TOP ↑

2026 PentimentoArt —

Design & development by Orsolya Eszlari