attention-network
Multilingual Automatic Speech Recognition with word-level timestamps and confidence