2024 Note on Registrations

CAE_Jones · 2012-09-26 01:47:14

CAE_Jones
A kind of legend
Offline

Registered: 2010-11-07
Posts: 8,471
User Karma: 692

It sounds like it was made via Festival...
In which case, technically, it could be made to sing, as Flinger seems to more or less be an expansion on Festival. (Assuming it still exists?)

看過來!
"If you want utopia but reality gives you Lovecraft, you don't give up, you carve your utopia out of the corpses of dead gods."
MaxAngor wrote:
George... Don't do that.

AlexN94 · 2012-09-26 01:50:32

AlexN94
Pragmatic party professional
Offline

From: Denmark
Registered: 2009-10-01
Posts: 2,772
User Karma: 125

Heh, it sounds just so awesome... Lol! I just like such old-sounding robotic synths...

To see a world in a grain of sand, and a heaven in a wild flower.
Hold infinity in the palm of your hand, and eternity in an hour.
William Blake - Auguries of Innocence, line 1 to 4

philip_bennefall · 2012-09-27 02:43:26

philip_bennefall
red potter
Offline

From: Sweden
Registered: 2007-06-07
Posts: 747
User Karma: 119

Hi all,

I have now updated the initial post with a new link to a sample that I uploaded to Dropbox. I'll be interested to hear your opinions on the differences.

Kind regards,

Philip Bennefall

tward · 2012-09-27 07:50:19

tward
Playroom playboy
Offline

From: United States
Registered: 2011-12-30
Posts: 1,586
User Karma: 151

Philip, way to go. It does sound quite a bit better than the last demonstration. As you say it does require some more work, but I like it so far.

Sincerely,
Thomas Ward
USA Games Interactive
http://www.usagamesinteractive.com

greatjobatodesk · 2012-09-27 16:52:32

greatjobatodesk
Shades of newbie
Offline

Registered: 2012-09-26
Posts: 21

hi, nice try for the voice. however, I still wait for the final result...

SLJ · 2012-09-28 09:24:36

SLJ
You have defeated this forum and won a custom rank
Offline

From: Denmark
Registered: 2007-05-25
Posts: 14,460
User Karma: 1,014

wow. Nice improvements there. How do you make a voice like this better? I thought you recorded lots and lots of sentences which you might have done, but what next? what can you do to improve the speech quallity?

Best regards SLJ.
Feel free to contact me privately if you have something in mind. If you do so, then please send me a mail instead of using the private message on the forum, since I don't check those very often.
Facebook: https://facebook.com/sorenjensen1988
Twitter: https://twitter.com/soerenjensen

musicalman · 2012-09-28 13:22:04

musicalman
Kirtwolf cubscout
Offline

Registered: 2007-12-26
Posts: 2,854
User Karma: 499

Hi,
The second voice sounds less jumpy, but needs a pretty big intonation increase in my opinion. It sounds monotone to me. But the intelligibility has improved.

Make more of less, that way you won't make less of more!
If you like what you're reading, please give a thumbs-up.

KG4RDF · 2012-09-28 17:32:35

KG4RDF
Kingdom crafter
Offline

From: Springfield, Missouri
Registered: 2012-04-12
Posts: 434
User Karma: 17

Yes, the latest version does sound less jumpy, but I must agree with Ray that it does have next to no inflection. I think the pitch should be raised slightly, not lowered, but that is just my personal prefference.

I can't wait to hear the next update! I would also be interested to know how you go about improving the overall quality of the voice.

My opinions are my own. I try not to state them as facts and if I'm not sure about something, I do whatever research I can. I feel everyone should consider doing the same.

paddy · 2013-05-09 16:34:27

paddy
Kingdom crafter
Offline

From: Stade, Lower Saxony, Germany
Registered: 2010-07-15
Posts: 482
User Karma: 7

Hi Philip!
If I listen very close, it sounds pretty much like your voice, I mean the second recording. I couldn't believe that it was your "voice" speaking. How did you make it? I may have missed something, but I heard something of "Festival", is it the developing kid for a sapi voice?

Keep up the good work!!!

Feel free to check my blog at
http://www.patrickdembinski.org
Aut enim do tibi, ut des, aut do, ut facias, aut facio, ut des, aut facio, ut facias.

camlorn · 2013-05-09 17:53:48

camlorn
You have defeated this forum and won a custom rank
Offline

From: Seattle
Registered: 2013-04-08
Posts: 10,801
User Karma: 2,083

Since someone necroed this, I've got to ask. Is this using Phonemes? Diphones? Words?
Also, are you blind, Philip? I'm just curious--everything I found made making a voice like this into a very complex and visual process (open up your audio tools, select this view, and start clipping with the mouse on a sample-perfect basis...).
And for the record,at least the Espeak with NVDA is great. I wouldn't go to anything else, now. So, so fast, and yet clear. Not natural, but clear all the same.
The one on Linux, especially with Orca, sucks: you can't change the prosity/inflection/whatever it's officially called. one of these days, I'm going to track someone down who can help me reconfigure it all to actually work well and get Libsonic support without disabling audio permanently (oopse, and yay for VM).

My Blog
Twitter: @ajhicks1992

philip_bennefall · 2013-05-10 08:26:54

philip_bennefall
red potter
Offline

From: Sweden
Registered: 2007-06-07
Posts: 747
User Karma: 119

Hello Camlorn,

Using the Festival and Festvox tools, it doesn't have to be a visual process at all. Sure you have to edit some audio, but the editing is trivial and can be automated for the most part, which suits me as I am completely blind and don't want to mess with visual interfaces either. I really just wanted to see how far I could take the voice, and since I am not particularly happy with the end result I have shelved it for now. I'll take it up again once a new version of Festvox is released that can do better prosodic models, but it's hard to tell when that will be. The method I use is called Clustergen, and creates a statistical model of my voice based on phonemes and diphones when available. So the size and phonetic balance/coverage are vital when you construct your dataset.

As for ESpeak, it's not for me. I only use it when I have absolutely no other option or if I am feeling particularly masochistic. Grins.

Kind regards,

Philip Bennefall

AlexN94 · 2013-05-10 08:31:43

AlexN94
Pragmatic party professional
Offline

From: Denmark
Registered: 2009-10-01
Posts: 2,772
User Karma: 125

Awww, sorry to hear you've shelved it, I really liked it.

To see a world in a grain of sand, and a heaven in a wild flower.
Hold infinity in the palm of your hand, and eternity in an hour.
William Blake - Auguries of Innocence, line 1 to 4

paddy · 2013-05-10 12:30:39

paddy
Kingdom crafter
Offline

From: Stade, Lower Saxony, Germany
Registered: 2010-07-15
Posts: 482
User Karma: 7

Yeah the second recording was quite well made. Of corse it's not the best ever tts, but it would work.

Feel free to check my blog at
http://www.patrickdembinski.org
Aut enim do tibi, ut des, aut do, ut facias, aut facio, ut des, aut facio, ut facias.

AlexN94 · 2013-05-10 14:08:57

AlexN94
Pragmatic party professional
Offline

From: Denmark
Registered: 2009-10-01
Posts: 2,772
User Karma: 125

Well, I love such voices that aren't "the best" as you say. Yeah, I'm weird. Heh.

To see a world in a grain of sand, and a heaven in a wild flower.
Hold infinity in the palm of your hand, and eternity in an hour.
William Blake - Auguries of Innocence, line 1 to 4

camlorn · 2013-05-10 20:47:06

camlorn
You have defeated this forum and won a custom rank
Offline

From: Seattle
Registered: 2013-04-08
Posts: 10,801
User Karma: 2,083

To be honest, I thought the first recording of them all was the best. I also now dislike Eloquence and think that the Espeak with NVDA is the best synthesis ever, so...take it with a grain of salt.
I need to look into this again. if it's using statistics, it could be possible to get the modle to be more accurate by providing more data from different people, or at least to get it sounding more interesting, and I kinda wonder if you couldn't somehow duplicate Eloquence with it by using recordings.
How are you automating audio? I'm not familiar of any audio analysis and modification scripts, save maybe Nyquist, but that's probably overkill. The rest are all geared towards music, or so it seems.
If I can get or find a good microphone and a quiet place to record, this could be a lot of fun to play with.

My Blog
Twitter: @ajhicks1992

philip_bennefall · 2017-11-14 17:14:28

philip_bennefall
red potter
Offline

From: Sweden
Registered: 2007-06-07
Posts: 747
User Karma: 119

Hi guys,

After a few years of silence, I picked up this thread again. There have been some improvements in the voice creation tools, and I regenerated the voice with them using the existing dataset. Here's a new recording for those who may be interested:

https://www.dropbox.com/s/yb3zx4dt3rmdde4/text.wav?dl=1

I plan to rebuild the voice yet again with the latest snapshot of the tools that came out just a few days ago, but that's a project for the weekend when I'm off work.

Kind regards,

Philip Bennefall

Dan Gero · 2017-11-14 21:47:29

Dan Gero
Sound stratogist
Offline

From: North Carolina
Registered: 2012-07-30
Posts: 4,643
User Karma: 494

I personally would just wait to see if Lyrebird releases their AI, as in my opinion it sounds a bit better than whatever you're using right now. Still, I definitely see myself downloading it if you made it a sapi voice or if you made it for NVDA. Plus, I don't think Lyrebird will be releasing there AI any time in the near future...

Discord: dangero#0750
Steam: dangero2000
TWITCH
YOUTUBE and YOUTUBE DISCORD SERVER

philip_bennefall · 2017-11-14 23:23:57

philip_bennefall
red potter
Offline

From: Sweden
Registered: 2007-06-07
Posts: 747
User Karma: 119

@shotgunshell I definitely agree that Lyrebird sounds better. My goal is really just to experiment with the Festvox system to see what results I can achieve. I will only release it if I get a voice that I personally consider usable, in which case I could easily make it into a Sapi voice or a DLL for NVDA or whatever other format people might want. I'm doing this in my free time, of which I don't have a lot, so I have no idea when/if I'll have something usable. I'm just playing around and wanted to revive this topic to post the current output.

Kind regards,

Philip Bennefall

queenslight · 2017-11-15 01:19:07

queenslight
Pragmatic party professional
Offline

From: Denver, Colorado
Registered: 2012-01-04
Posts: 2,617
User Karma: 168

Would you be willing to make it an Android TTS voice one day? I'd be willing to pay for it!

I myself quite like it!

Dan Gero · 2017-11-15 01:56:26

Dan Gero
Sound stratogist
Offline

From: North Carolina
Registered: 2012-07-30
Posts: 4,643
User Karma: 494

I'm kind of curious about this app myself, does it have command line parameters?

Discord: dangero#0750
Steam: dangero2000
TWITCH
YOUTUBE and YOUTUBE DISCORD SERVER

hurstseth405 · 2017-11-15 02:13:01

hurstseth405
red potter
Offline

Registered: 2014-08-15
Posts: 772
User Karma: 29

Your links don't work.

Bitcoin Address:
1MeNca7h6m8du4TV3psN4m4X666p6Y36u5m

defender · 2017-11-15 03:16:10

defender
Solara Sovereigne
Offline

From: Southwestern United States
Registered: 2012-01-13
Posts: 6,383
User Karma: 1,644

Sounds like a fuzzy bucket. :-)
Decent inflection though, and it actually does sound like you.
No noticeable artifacts in the portion I heard either, but that crazy smoother thing that makes it sound fuzzy probably hides them all anyway...
May be the way you wrote it but, it seems kind of, droning, not enough commas? The sentences don't have defined separators, no real inflection change at the ends.

PREPARE
YOUR
ANUS!
https://freesound.org/people/SilverIllu … nds/546960

Green Gables Fan · 2017-11-15 05:21:25

Green Gables Fan
Sarah's social circle
Offline

From: Over the Rainbow
Registered: 2010-05-07
Posts: 1,895
User Karma: 36

If it would be as easy as making several concatenative files with different speech samples, and the whole interface was made to be accessible, people could definitely put in the effort to make their own. As for whether or not they want to to sell it...

Ulysses, KJ7ERC
She/they
Reedsy

philip_bennefall · 2017-11-15 09:33:30

philip_bennefall
red potter
Offline

From: Sweden
Registered: 2007-06-07
Posts: 747
User Karma: 119

The main thing I'm wanting to fix is the inflection, and the endless sentences. I have solutions for both, as well as a few other tweaks I want to do. I'll post another sample when I have it.

The old links don't work, but that version sounded awful. For kicks, here they are for comparison.

The very first link, with a tiny dataset of just 500 recordings:
https://www.dropbox.com/s/upe4x3ckssv5m … 0.wav?dl=1

And the second link, with a few more recordings and a slightly different rendering method:
https://www.dropbox.com/s/0w31xv2h2utvr … l.wav?dl=1

Now that, that's fuzzy for sure. I die a little every time I go back and listen to these. They were made with very very old versions of the tools.

When it comes to platforms, as long as I'm happy with the voice itself, I'm game to try building it for whatever I can get my hands on. But again, I have not decided at all what I'm going to do with it.

Kind regards,

Philip Bennefall

ammericandad2005 · 2017-11-15 21:31:54

ammericandad2005
Great word adventurer
Offline

From: Castle Wolfenstein
Registered: 2013-12-03
Posts: 1,782
User Karma: 23

hay philop,
could you provide a tutorial on how to make a sapi voice using festvox?

be a hero and stop Coppa now!
https://docs.google.com/document/d/1Dkm … DkWZ8/edit
-id software, 1995

2024 Note on Registrations

Tts experiments at Blastbay (Page 2 of 3)

Posts: 26 to 50 of 62

#26 Reply by CAE_Jones 2012-09-26 01:47:14

#27 Reply by AlexN94 2012-09-26 01:50:32

#28 Reply by philip_bennefall 2012-09-27 02:43:26

#29 Reply by tward 2012-09-27 07:50:19

#30 Reply by greatjobatodesk 2012-09-27 16:52:32

#31 Reply by SLJ 2012-09-28 09:24:36

#32 Reply by musicalman 2012-09-28 13:22:04 (edited by musicalman 2012-09-28 13:22:34)

#33 Reply by KG4RDF 2012-09-28 17:32:35

#34 Reply by paddy 2013-05-09 16:34:27

#35 Reply by camlorn 2013-05-09 17:53:48

#36 Reply by philip_bennefall 2013-05-10 08:26:54

#37 Reply by AlexN94 2013-05-10 08:31:43

#38 Reply by paddy 2013-05-10 12:30:39

#39 Reply by AlexN94 2013-05-10 14:08:57

#40 Reply by camlorn 2013-05-10 20:47:06

#41 Reply by philip_bennefall 2017-11-14 17:14:28

#42 Reply by Dan Gero 2017-11-14 21:47:29

#43 Reply by philip_bennefall 2017-11-14 23:23:57

#44 Reply by queenslight 2017-11-15 01:19:07

#45 Reply by Dan Gero 2017-11-15 01:56:26

#46 Reply by hurstseth405 2017-11-15 02:13:01

#47 Reply by defender 2017-11-15 03:16:10 (edited by defender 2017-11-15 03:17:14)

#48 Reply by Green Gables Fan 2017-11-15 05:21:25

#49 Reply by philip_bennefall 2017-11-15 09:33:30

#50 Reply by ammericandad2005 2017-11-15 21:31:54

Posts: 26 to 50 of 62