Our Language is Our Strength
1800
Audio Hours
1.9+M
Authentic Sentences
5000+
Unique Voices
Our Language is Our Strength
Across the globe, major speech recognition platforms like Amazon’s Alexa, Apple’s Siri, and Google’s Home dominate the market, yet they neglect the rich tapestry of African languages. Not a single native African language is supported by the current voice technologies widely used. To change this narrative, we have taken a bold step as a community by creating the NaijaVoices dataset, which comprises over 1,800 hours of diverse speech-text data from an unprecedented 5,000+ speakers.Explore Our Resources

NaijaVoices Dataset (10.57967/hf/3257)
Description: The largest African speech dataset encoompassing more than 5,000 speakers.

The NaijaVoices Language Heritage Micro-Grants
Supporting community-driven projects that build, enrich, and conserve Nigeria's linguistic diversity.
Meet Our Partners





Support Us with a donation
