Magyar beszéd Magyar beszéd
  • Language-text-speech
    • Mini dictionaryThe most important terms in speech technology and phonetics
    • Pronunciation dictionaryThe pronunciation of 1.5 million Hungarian words with sound symbols
    • Writing and speakingThe relationship between speech and writing, the phonetic transcript
    • Speech sound symbolsTables of IPA, SAMPA, and other sound symbol sets used in speech research
    • StatisticsLetter-, syllable-, word- and diphone combination statistics
  • Speech acoustics
    • BasicsA short overview of the basic topics of speech acoustics
    • Speech databasesDatabases of various speech representations for developments
    • Interactive programs6 interactive programs
    • Sentence melodiesHungarian sentence melody forms
    • Stress databaseStress of Hungarian words
  • Speech synthesis
    • 1980 - beginningsThe beginnings of Hungarian speech synthesis research, 1791-1989
    • 1988 - BME MultivoxBME TMIT Speech Technology Laboratory: Multivox family 1987-2002
    • 1996 - ProfiVox familyBME TMIT Speech Technology Laboratory: Profivox family 1998-2022
    • 2002 - FlexVoiceFlexible hybrid text to speech synthesiser by Mindmaker
    • 2020 - BME Neural ProfiVoxTechnology of the 21st century
  • Speech recognition
    • 1971 - the beginningsBeginnings in Hungary
    • 1990 - BME TMITEducation, research, developments
    • 2013 - Speech TexTechnology of the 21st century
  • Applications
    • TTSSpeech synthesis applications
    • Speech-to-text (SST)Machine speech recorder, subtitler
    • Talking headVirtual announcer and transparent articulation instructor
    • Magic box with ASRHelp children with speech and hearing disorder
    • ASR-TTSASR and TTS supported dialogue systems
  • Others
    • The maintainersWebsite creators and maintainers
    • History of the websiteShort history
    • EducationEducation related to speech technology
    • Downloadable literaturePublic books and articles
    • Related linksOther related links
    • Related softwareOther related software
    • ContactContact information

The Profivox corpus

The Profivox corpus text-to-speech system uses a flexible, long unit-search method for speech synthesis. This requires a fast computer as the computational demand is high. The procedure takes into account that speech is an event of the moment, the sound wave is constantly changing. Even uttering the same speech sound twice the produced two wave forms are not exactly the same. This gives the personal sound timbre. This method ensures the best quality synthesized speech. The owner of the sound can be recognized. This is because this technology concatenates long speech units like words, word sequences, or complete sentences, when it generates speech from text. It is used in applications where impeccable sound quality is a requirement (e.g. weather forecast). The price of this good quality is that this method can only be used in a limited topic. The synthesis database is a multi-hour speech corpus. The person reads sentences and phrases that are most likely to occur in the limited topic, for example, in weather forecast texts. The text to be read should be designed with serious, precise work. The ‘master sentence’ application procedure is used at every time when new recording is done in the studio. The speech database is labeled in detail. Speech synthesis is then performed using a search algorithm. This selects the most appropriate waveform elements from the speech database in several steps and with weighting calculations. So, the longer the selected unit, the better the quality is. This complex algorithm works in real time. The result is personal and natural-sounding speech. The speech database must be created individually for each topic.

Listen to some synthesized voices in Hungarian

Motorolla telefon hirdetés
Your browser does not support the audio element.

Samsung telefon hirdetés
Your browser does not support the audio element.

Nokia telefon hirdetés
Your browser does not support the audio element.

Számla egyenleg közlése
Your browser does not support the audio element.

Dátum megadása
Your browser does not support the audio element.

Dátum és időpont
Your browser does not support the audio element.

Időjárás jelentés automatikus felolvasása
Your browser does not support the audio element.

Időjárás jelentés automatikus felolvasása
Your browser does not support the audio element.

Pályaudvari utastájékoztatás menetrendi szövegekből
Your browser does not support the audio element.

Pályaudvari utastájékoztatás menetrendi szövegekből
Your browser does not support the audio element.
Featured
  • Pronunciation dictionary
  • Speech synthesis applications
Downloads
  • Downloadable literature
About
  • Owners
  • Contact
Magyar beszéd Magyar beszéd

Copyright 2022. Olaszy Gábor és Abari Kálmán
Utolsó frissítés: 2022. 09. 01. (Last update: 01. 09. 2022)