Our official NaijaVoices paper is out! Read it to learn how Africa's largest speech dataset was created.

Our Language is Our Strength

1,800 Audio Hours

1.9M+ Authentic Sentences

5,000+ Unique Voices

Across the globe, major voice assistants like Amazon's Alexa, Apple's Siri, and Google Home dominate the market, yet they neglect the rich tapestry of African languages. Not a single native African language is supported by today's widely used voice technologies. To change this narrative, we have taken a bold step as a community by creating the NaijaVoices dataset, which comprises over 1,800 hours of diverse speech-text data from an unprecedented 5,000+ speakers.

Explore Our Resources

NaijaVoices Dataset (DOI: 10.57967/hf/3257)

Description: The largest African speech dataset, encompassing more than 5,000 speakers.

The NaijaVoices Language Heritage Micro-Grants

Supporting community-driven projects that build, enrich, and conserve Nigeria's linguistic diversity.

Meet Our Partners


[Partner logos]

Support Us with a donation