Development of Hausa dataset a baseline for speech recognition

Umar Adam Ibrahim; Moussa Mahamat Boukar; Muhammed Aliyu Suleiman

2025-01-17
2022-01-10

Ibrahim, Umar Adam; Boukar, Moussa Mahamat; Suleiman, Muhammed Aliyu (2022). Development of Hausa dataset a baseline for speech recognition. Data in Brief.

ISSN 2352-3409
https://doi.org/10.1016/j.dib.2022.107820
https://repository.nileuniversity.edu.ng/handle/123456789/142

The Hausa language read-speech dataset was created by recording native Hausa speakers. The recordings were made at the Nile University of Nigeria audio studio and radio broadcasting studio. The recorded speech was segmented into unigrams and bigrams. The dataset contains 47 hours of recorded audio. It can be used for automatic speech recognition, speech synthesis, text-to-speech, and speech-to-text applications.

Language: en

Keywords: Corpus; Automatic speech; NLP; Text-to-speech; Hausa corpus

Type: Article