Control over punctuation delays.
Posted: Wed Apr 11, 2007 6:01 pm
I have used many software-based voices over the years, and some are definately better than others where understandability and features are concerned. Some, like AT&T have nice features but slur the speach to the point its really unsuable...
Cepstral David that I ordered has a much better ability to pronounce words than AT&T seems to have, yet has this very annoying pause between taking in text and delivering actual speech audio, add to this, I find that the delays between ; and , and . to be somewhat exaggerated, and not adjusted with the user-specified words-per-minute.
I wonder if the delay for ; , . is even adjusted at all for a given wpm?
I am not sure if ; and , have different "weights", I'd use the same delay value, but . (period) is a little longer than a , (comma), also the
: (colon) should NOT cause delays in my opinion.
I did discover that if you output to a WAVE FILE that the delay is not at all as exaggerated! I am actually much more pleased with the performance of the Wave output then that pathetic Text-Aloud-MP3 which has become more garbage than usable for me in Windows.
Text-Aloud-MP3 crashes for no apparently reason on text files and its it nightmare to make any progress with it.
So far, saving out to wave using Cepstral under linux has been BEAUTIFUL and flawless.
However, I still urge the addition of a user-define delay in miliseconds for the pause taken up by , ; and . as it greatly enhances the users ability to absorb what is spoken more uniformly. Delays actually break my concentration! If I need that extra time, I will set it to the proper value.
I also have DecTalk ISA and DoubleTalk ISA cards; of these I prefer listening to DecTalk. DecTalk is 100% absolutely consistant in the way it pronouces words! When I time DecTalk's reading of a file, I know I can make accurate predictions how long a file will take to read through completely. And DecTalk does not suffer from exaggerated punctuation delays.
Just my 2 cents worth. Cepstal is the real winner for me, because it works
very well in Linux (CentOS and Fedora Core) and probably Win32.
Cepstral David that I ordered has a much better ability to pronounce words than AT&T seems to have, yet has this very annoying pause between taking in text and delivering actual speech audio, add to this, I find that the delays between ; and , and . to be somewhat exaggerated, and not adjusted with the user-specified words-per-minute.
I wonder if the delay for ; , . is even adjusted at all for a given wpm?
I am not sure if ; and , have different "weights", I'd use the same delay value, but . (period) is a little longer than a , (comma), also the
: (colon) should NOT cause delays in my opinion.
I did discover that if you output to a WAVE FILE that the delay is not at all as exaggerated! I am actually much more pleased with the performance of the Wave output then that pathetic Text-Aloud-MP3 which has become more garbage than usable for me in Windows.
Text-Aloud-MP3 crashes for no apparently reason on text files and its it nightmare to make any progress with it.
So far, saving out to wave using Cepstral under linux has been BEAUTIFUL and flawless.
However, I still urge the addition of a user-define delay in miliseconds for the pause taken up by , ; and . as it greatly enhances the users ability to absorb what is spoken more uniformly. Delays actually break my concentration! If I need that extra time, I will set it to the proper value.
I also have DecTalk ISA and DoubleTalk ISA cards; of these I prefer listening to DecTalk. DecTalk is 100% absolutely consistant in the way it pronouces words! When I time DecTalk's reading of a file, I know I can make accurate predictions how long a file will take to read through completely. And DecTalk does not suffer from exaggerated punctuation delays.
Just my 2 cents worth. Cepstal is the real winner for me, because it works
very well in Linux (CentOS and Fedora Core) and probably Win32.