Automatic Speech Recognition
Speech Processing Lab, IIIT Hyderabad
LTRC, IIIT Hyderabad
A challenge on building Automatic Speech Recognition (ASR) system for the Telugu language is being organized jointly by IIIT Hyderabad, Technology Development for Indian Languages (TDIL), Ministry of Electronics and Information Technology (MeitY) as a part of the National Language Translation Mission (NLTM). In this challenge, we are releasing a 2000.8 hours Telugu Speech Database which is collected in a crowdsourced manner. The regional variations of Telugu speech are collected in three modes, namely, i.e. spontaneous, conversational, and read modes with different background conditions (clean and moderate noisy environments) and transcriptions with varying degrees of accuracy due to crowdsourcing.
The financial assistance received towards this Telugu database collection is from the Technology Development for Indian Languages (TDIL), Ministry of Electronics and Information Technology (MeitY), Government of Republic of India under the pilot project on "Crowd Sourced Large Speech Data Sets to Enable Indian Language Speech - Speech Solutions".