tts-dataset
Extract phoneme-level timestamps from speeh audio.
A python library to generate speech dataset from Youtube videos