Cepstral offers several ways to manipulate word and phrase pronunciation.
For instance:
1. First, the voice is sensitive to punctuation and word order. You might try rearranging your sentence, or verb selection, or adding commas or such to troubleshoot the issue.
2. Cepstral offers a lexicon.txt file where users can make global changes word pronunciation. Instructions are on our FAQ at:
http://cepstral.com/cgi-bin/support?page=userlex. For example, if you had a weather related application and wanted always to say "wind" in context and avoid ever confusing the wrist watch context of "wind". You would add the following to a new line in your lexicon.txt file:
- wind 0 w ih1 n d
4. You can also mix the two (Cepstral's lexicon + SSML) such that you send phonetic strings dynamically to the engine wrapped in the PHONEME tag. Examples:
- <phoneme ph="g ay1 k ow0">Geico</phoneme>
<phoneme ph="l i1 d">lead</phoneme> ... as in The lead singer is Mick Jagger
<phoneme ph="w ih1 n d">wind</phoneme> ...as in The wind outside today is 5 MPH
(http://www.phon.ucl.ac.uk/home/sampa/home.htm)
- <phoneme alphabet='sampa' ph='gaIko'>Geico</phoneme>
*Note SAMPA phones does not have spaces, and Cepstral's do.
6. Lastly, Cepstral offers professional services. We can "tune" your prompts at a lower level than what SSML can. If needed, we can even have the original human speaker add your prompts to the TTS voice database. Check with sales for this option.
Thanks,
-Cepstral Sales