2024 Note on Registrations

datajake1999 · 2017-03-17 09:13:24

datajake1999
Kingdom crafter
Offline

From: Someware in space
Registered: 2015-02-17
Posts: 445
User Karma: 34

Hi,
I just released a new version of nv speechPlayer. Not speechPlayer in espeak, but the actual nv speechPlayer.
This version fixes the issues with the latest eSpeak that comes with NVDA, and I also enabled more Klatt settings.
http://grossgang.com/tts/speechPlayer%2 … nvda-addon

hurstseth405 · 2017-03-18 07:23:55

hurstseth405
red potter
Offline

Registered: 2014-08-15
Posts: 772
User Karma: 29

What do all the settings in the voice dialog mean?

Bitcoin Address:
1MeNca7h6m8du4TV3psN4m4X666p6Y36u5m

Slender · 2017-03-18 19:45:50

Slender
Hunter grunt
Offline

From: Down here!
Registered: 2015-07-24
Posts: 1,292
User Karma: 97

Many of them are settings related to Klatt synthesis, the method NV Speech Player uses. Some are related to cascade formants, some are pitch settings. It's best to play around with them and see what you get, as there are too many to explain.

Oh no! Somebody released the h key! Everybody run and hide!

datajake1999 · 2017-03-18 20:18:03

The settings are explained in frame.h if you download the source code, and I am copying the part of the code that explains the settings below.

// voicing and cascade
speechPlayer_frameParam_t voicePitch; // fundamental frequency of voice (phonation) in Hz
speechPlayer_frameParam_t vibratoPitchOffset; // pitch is offset up or down in fraction of a semitone
speechPlayer_frameParam_t vibratoSpeed; // speed of vibrato in Hz
speechPlayer_frameParam_t voiceTurbulenceAmplitude; // amplitude of voice breathiness from 0 to 1
speechPlayer_frameParam_t glottalOpenQuotient; // fraction between 0 and 1 of a voice cycle that the glottis is open (allows voice turbulence, alters f1...)
speechPlayer_frameParam_t voiceAmplitude; // amplitude of voice (phonation) source between 0 and 1.
speechPlayer_frameParam_t aspirationAmplitude; // amplitude of aspiration (voiceless h, whisper) source between 0 and 1.
speechPlayer_frameParam_t dcf1, dcb1; // change in Hz in frequency and bandwidth of cascade formant 1 in voice cycle while glottis is open
speechPlayer_frameParam_t cf1, cf2, cf3, cf4, cf5, cf6, cfN0, cfNP; // frequencies of standard cascade formants, nasal (anti) 0 and nasal pole in Hz
speechPlayer_frameParam_t cb1, cb2, cb3, cb4, cb5, cb6, cbN0, cbNP; // bandwidths of standard cascade formants, nasal (anti) 0 and nasal pole in Hz
speechPlayer_frameParam_t ca1, ca2, ca3, ca4, ca5, ca6, caN0, caNP; // amplitudes of standard cascade formants, nasal (anti) 0 and nasal pole
// fricatives and parallel
speechPlayer_frameParam_t fricationAmplitude; // amplitude of frication noise from 0 to 1.
speechPlayer_frameParam_t pf1, pf2, pf3, pf4, pf5, pf6; // parallel formant frequencies in Hz
speechPlayer_frameParam_t pb1, pb2, pb3, pb4, pb5, pb6; // parallel formant bandwidths in Hz
speechPlayer_frameParam_t pa1, pa2, pa3, pa4, pa5, pa6; // amplitude of parallel formants between 0 and 1
speechPlayer_frameParam_t parallelBypass; // amount of signal which should bypass parallel resonators from 0 to 1
speechPlayer_frameParam_t preFormantGain; // amplitude from 0 to 1 of all vocal tract sound (voicing, frication) before entering formant resonators. Useful for stopping/starting speech
speechPlayer_frameParam_t outputGain; // amplitude from 0 to 1 of final output (master volume)
speechPlayer_frameParam_t endVoicePitch; // pitch of voice at the end of the frame length

New version of nv speechPlayer

Posts: 4

#1 Topic by datajake1999 2017-03-17 09:13:24

#2 Reply by hurstseth405 2017-03-18 07:23:55

#3 Reply by Slender 2017-03-18 19:45:50 (edited by Slender 2017-03-18 19:52:28)

#4 Reply by datajake1999 2017-03-18 20:18:03
