About the project

Improving the representation of diversity of speech patterns

The project collects speech samples from paid volunteers representing a diversity of speech patterns. Illinois researchers are using the recordings to create a private, de-identified dataset for training machine learning models to better understand a variety of speech patterns.

Improving the representation of diversity of speech patterns

Artificial intelligence and machine learning allow people to use speech recognition, such as voice assistants or translation tools, to operate technology using their voices. Speech recognition is powered by machine learning; without diverse, representative data, ML models cannot learn how to understand a diversity of speech. This project aims to change that by creating the dataset needed to more effectively train these machine learning models.

Instead of separate and duplicative initiatives by different companies and research teams, the groups are collaborating on this project to gather a set of high-quality, representative speech samples that will help accelerate the technologies that support these communities of people with diverse speech patterns.

Find answers to common questions about the Speech Accessibility Project.

Speech Accessibility Project

405 N Mathews Ave., Urbana, IL 61801

speechaccessibility@beckman.illinois.edu

Beckman Institute for Advanced Science and Technology

About the project

Improving the representation of diversity of speech patterns

Improving the representation of diversity of speech patterns

Explore Campus

Connect with Illinois

Access University Resources

Explore Campus

Connect with Illinois

Access University Resources