Text-to-speech with the voice of a popular person

I am looking for ideas, sources, algorithms, etc. to create a program like that. I was thinking about Tacotron 2, but how can I learn (and build) the deep learning needed for it?