Frequently asked questions
What is the Speech Accessibility Project?
The Speech Accessibility Project is an interdisciplinary initiative with one shared goal: to improve speech technology for people with a range of diverse speech patterns and disabilities. Led by the University of Illinois Urbana-Champaign with support from Amazon, Apple, Google, Meta, and Microsoft, the project brings together technologists, academic researchers, and community organizations to create the diverse speech data needed to make spoken interfaces more accessible for everyone.
Why is the Speech Accessibility Project important?
Speech recognition is a powerful tool to make technology more accessible in our daily lives, especially for people with disabilities. However, it is not fully accessible for many people with speech disabilities.
Speech recognition is powered by machine learning, and without diverse, representative data, ML models cannot learn how to understand a diversity of speech. This project aims to change that by creating the dataset needed to more effectively train these machine learning models.
Instead of separate and duplicative initiatives by different companies and research teams, the groups will collaborate on this project to gather a set of high-quality, representative speech samples that will help accelerate the technologies that support these communities of people with diverse speech patterns.
Who will benefit from the Speech Accessibility Project?
Millions of people will benefit from diverse and inclusive speech recognition. This may include but isn’t limited to people with Amyotrophic Lateral Sclerosis, Parkinson's disease, cerebral palsy, Down syndrome and a wide range of medical and non-medical conditions that affect speech.
Why are so many companies supporting the Speech Accessibility Project?
The companies supporting the Speech Accessibility Project — Amazon, Apple, Google, Meta, and Microsoft — each have longstanding accessibility commitments that include delivering products, services, and experiences for people from a wide range of communities and backgrounds.
This one-of-a-kind collaboration is rooted in the belief that inclusive speech recognition should be a universal experience. Working together on the Speech Accessibility Project is a way to provide the best and most expedient path to inclusive speech recognition solutions.
What is the role of the University of Illinois Urbana-Champaign?
UIUC is leading the Speech Accessibility Project, managing data collection and curation, privacy, authenticity, and equitable access for researchers. With programs in computer science, speech, hearing, linguistics, and accessibility, UIUC not only has a historic commitment to disability inclusion, but the expertise needed to make the project a success. The project is happening at UIUC’s Beckman Institute for Advanced Science and Technology, which brings together researchers across disciplines to solve problems and break technological barriers.
How will the Speech Accessibility Project work with the disability community?
University and industry partners are committed to working closely with disability community organizations to identify areas of need, recruit participants, review the participation and consent process, and get feedback on how the project can be improved over time.
How can I participate?
We are currently screening potential participants via the Speech Accessibility App. Each participant will work with a mentor to record speech samples. Because of state privacy laws, residents of Illinois, Washington, and Texas are not currently eligible to participate.
Will I be compensated?
Participants will receive a total of $180 in three increments: $60 after recording the 200th sample, an additional $60 after recording the 400th sample, and an additional $60 after recording the 600th sample. Our payment comes in the form of Amazon eCodes.
Named caregivers assisting participants will be compensated with up to $90 in Amazon eCodes as follows: $30 after recording the 200th sentence, an additional $30 after recording the 400th sentence, and an additional $30 after recording the 600th sentence.
How will participants contribute?
Volunteer participants record speech samples, for example, by reading text prompts, or by answering questions out loud. The questions include things like, “How would you spend a rainy day?” or “What are your hobbies?”
The texts that volunteer project participants read, and the questions they respond to, were developed in collaboration with focus groups and disability community organizations in order to make sure we’re collecting data that will help researchers create datasets to more effectively train speech recognition technologies.
How long will it take to participate?
It will take different people different amounts of time to complete the project. Some people finish in a few days. For others, it may take a week or more. Participants will read 10 sets of about 45 sentences each, although the sets don’t have to be completed all at once. Some of the prompts will ask participants to read a specific sentence. Some are open-ended and ask participants to answer questions with general information. As participants read through the sets (and caregivers assist them), we’ll email them Amazon gift cards at certain points throughout the process.
When will I see the results of this project?
The Speech Accessibility Project is a multi-year research initiative. However, each of the companies supporting the Speech Accessibility Project is committed to leveraging project data to make improvements within their respective voice recognition products and services. The project is already sending de-identified data to its corporate sponsors.
How will the data generated by the Speech Accessibility Project be collected and stored?
UIUC maintains a secure, private, and encrypted tool to store and access the Speech Accessibility Project’s research data. Participant data will be de-identified to protect each participant’s privacy and personal information.
Will the data be available to other researchers?
Yes. The goal for this project is to support a range of efforts to make voice recognition technology more useful for people with a diversity of speech patterns or disabilities. The de-identified data will be made available to researchers and developers, in keeping with UIUC’s data use agreement.