After discovering Text to Trump
, I thought to myself: how cool would it be to make a text-to-speech voice that resembles my own? And then I thought: how hard would it be?
The answer to both questions is "very".
Slightly modified from this Github repository
, I created a Python program which would take entries in a modified version of the CMU Pronouncing Dictionary
, convert them into individual sounds, and play them back (as well as output them to a .wav file). The results are a bit Stephen-Hawking-ish right now, but I hope to try and improve it soon
Here is an example of "my own voice" reading the CodeWalrus intro: https://clyp.it/02ucuy3z
, and PyDub
. You do not need FFmpeg, but the program can freak out without it
Project download here
. Try it!