Development of Hausa dataset a baseline for speech recognition

Umar Adam Ibrahim; Moussa Mahamat Boukar; Muhammed Aliyu Suleiman

Development of Hausa dataset a baseline for speech recognition

Files

S2352340922000324.htm (156.26 KB)

Date

2022-01-10

Authors

Umar Adam Ibrahim

Moussa Mahamat Boukar

Muhammed Aliyu Suleiman

Publisher

Data in Brief

Abstract

The Hausa language read-speech dataset was created by recording native Hausa speakers. The recording took place at Nile university of Nigeria audio studio and radio broadcasting studio. The recorded dataset was segmented into unigram and bigram. The Hausa speech dataset contain 47hr of recorded audio speech. The dataset can be used for automatic speech recognition, speech synthesis, Text-to-Speech and speech-to-text application

Keywords

Corpus, Automatic speech, NLP, Text-to-speech, Hausa corpus

Citation

Ibrahim, Umar Adam; Boukar, Moussa Mahamat; Suleiman, Muhammed Aliyu (2022). Development of Hausa dataset a baseline for speech recognition. Data in Brief,

URI

https://repository.nileuniversity.edu.ng/handle/123456789/142

Collections

Research Articles in Software Engineering

Full item page

Development of Hausa dataset a baseline for speech recognition

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

Endorsement

Review

Supplemented By

Referenced By