I tryed contacting assitiveware and they said that they don't know or they wouldn't respond. So yeah. Uh, jwho is the voice of Josh, I know someone can do this! Thanks, Have a good day!
You are not logged in. Please login or register.
AudioGames.net Forum → Off-topic room → The people behind TTS voices
I tryed contacting assitiveware and they said that they don't know or they wouldn't respond. So yeah. Uh, jwho is the voice of Josh, I know someone can do this! Thanks, Have a good day!
Dude Josh and Ella are kid voices done by kids. Obviously you are not going to be able to find out who they are as they will be protected as they should be.
The entire TTS space is like this, not just Josh or Ella. The people behind these voices are often very difficult to find, plus companies don't like to disclose them. Good luck finding the voice actors behind these.
Hi guise. Does any of you know who is behind eloquence?
I'm wondering if he can pronounce those eloquence crash words lol.
I like so much Abel Folk. Is the one that created Loquendo and Vocalizer Jorge. I know this name and I well, shur I.
Hi all,
I know that the german voice of vocalizer, vocalizer Anna, is voiced by Heike Hagen.
The voice of Vocalizer Yelda is Yelda Uğurlu
Does anyone know who the voice of Microsoft Tolga is? It's the first Turkish voice microsoft has ever made
3 of the members of festival's development team lent their voices to the festival tts engine. kal is voiced by kevin lenzo (who now works at cepstral), clb is voiced by catheren broumen, and awb is voiced by scottish researcher and website maintainer alan black.
And I always asked to me how the names of loquendo voices are. Especially for the germany voices loquendo Stefan and loquendo Katrin. And the american voices loquendo Dave and loquendo Allison. These voices are awesome!
I don't remember the voice of Loquendo Dave quite well, but I've heard from someone that this same voice is now Microsoft David in Windows 8 and 10. Is it true?
That would be interesting. I have a computer with win 10, I'll test this, hope I find the point where to hear the voice there.
I need to find this link again, but there was a website that had every phrase spoken by every Festival voice as a wav file. Most of the time there were about 1132 phrases they spoke.
Wow! That link would be very interesting, I hope you'll find it again. Gosh! So many phrases!
lady parsec's voice is that of aubry anderson, who was a college student at the time her voice was recorded in 1981.
So you guys wanted the link? Well, I found it at last! There are zip files in here, just download them. The wav files are in the wav directory, and these phrases were the ones the poor Festival folks had to read. Enjoy!
wow! great posts and information. I am learning quite a bit. That dectalk interview was just epic.
I believe September Day is the voice of Ivona Sali. https://www.youtube.com/watch?v=P0jSZePDLVs
I wonder who is VW jaims. Siriusly he's voice is vary good for reading books.
owe, could this topic all reddy be 3 pages long? well, I would like to know, if there are any, persons behind all those Microsoft voices spetially Sam. He leaves such lovely memories.
Hi everyone, I am going to revive this thread to hopefully get some sort of clarification on a few things. I'm also working on something a little exciting, I hope.
I saw a PDF file that contained instructions on how to build a speech synthesiser using a Moog Synthesiser module and various electronic manipulatives. Still, I wonder what people do exactly to model someone's voice. For instance, is eSpeak modelled after someone, or is it purely formant and not based on any model? Is there an in-between category between formant and concatenative? For instance, I heard that Eloquence and Microsoft Sam were primarily formant, but they also had some concatenative elements that were processed with digital signal processing as well.
The Wikipedia article indicated that there was a third type of speech synthesis called articulatory speech synthesis, but it didn't describe hot it was made, or at least, it didn't delve into the details of the processes needed to model someone's vocal tract to make a speech synthesiser out of it.
@FlybyChow, you said before you were interested in making your own speech synthesiser using concatenation. I.E. Using small segments of your voice along with some digital signal processing, or have someone model your vocal tract by recording different things to make an analysis. I don't know if there are accessible spectograms or whatever it is they use to replicate someone's voice using tones and noise. Anyhow, look at posts 52 and 57 of this topic. http://forum.audiogames.net/viewtopic.php?id=8877
I found these recordings in the Festvox demo. http://www.festvox.org/history/klatt.html
Well, Apple did a good job with Speech synth Alex. Wonder who could be behind this. Is it based off human voices, or Apple written it completely from scratch? We never know what Apple did with this synth.
I think it is safe to say that Apple did use someone to concatenate his voice. Obviously he recorded a whole bunch of words, phrases, syllables, breath sounds, etc, sort of like Vocaloid. But to make it really high quality, that is another thing. Maybe Apple has some secret algorithm. But the fact that Apple is secretive about many things is the reason why they are one of the top leading money-making companies in the world.
Microsoft Sam is diphone based, and eloquence is formant based. They don't use any combination of both.
There is a type of synthesis where you record some speech and then the recordings are used to create an Approximation of there vocal tract.
This type of synthesis is called Hidden Markov Models and is used in engines such as modern Microsoft and Google TTS.
AudioGames.net Forum → Off-topic room → The people behind TTS voices
Generated in 0.028 seconds (50% PHP - 50% DB) with 11 queries