Amazon Releases New Public Data Set to Help Address “Cocktail Party” Problem
- Maarten Van Segbroeck, an applied scientist in the Alexa International group and first author on the associated paper, cowrote this post with Zaid Ahmed.
- Amazon today announced the public release of a new data set that will help speech scientists address the difficult problem of separating speech signals in reverberant rooms with multiple speakers.
- Each participant was outfitted with a headset microphone, which captured a clear, speaker-specific signal.
- Also dispersed around the room were five devices with seven microphones each, which fed audio signals directly to an administrator’s laptop.
- The data set we are releasing includes both the raw audio from each of the seven microphones in each device and the headset signals.
- The headset signals provide speaker-specific references that can be used to gauge the success of speech separation systems acting on the signals from the microphone arrays.
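As a rough illustration of how a headset reference could be used to score a separation system, the sketch below computes scale-invariant signal-to-distortion ratio (SI-SDR), a metric commonly used in speech separation research. The post does not say which metric the authors use, so the choice of SI-SDR, the function name `si_sdr`, and the synthetic sine-wave signals are all assumptions made for illustration. Real recordings would also need the headset and array signals to be time-aligned first.

```python
import math


def si_sdr(reference, estimate):
    """Scale-invariant signal-to-distortion ratio in dB (illustrative helper)."""
    if len(reference) != len(estimate):
        raise ValueError("signals must have the same length")
    ref_energy = sum(r * r for r in reference)
    if ref_energy == 0.0:
        raise ValueError("reference signal is silent")
    # Project the estimate onto the reference to find the best-fitting scale.
    scale = sum(r * e for r, e in zip(reference, estimate)) / ref_energy
    target = [scale * r for r in reference]
    # Whatever the scaled reference does not explain counts as distortion.
    noise = [e - t for e, t in zip(estimate, target)]
    target_energy = sum(t * t for t in target)
    noise_energy = sum(n * n for n in noise)
    if noise_energy == 0.0:
        return math.inf
    if target_energy == 0.0:
        return -math.inf
    return 10.0 * math.log10(target_energy / noise_energy)


# Synthetic stand-ins (assumed, not from the data set): a "headset" tone,
# an interfering talker, a raw mixture, and a hypothetical separated output.
SAMPLE_RATE = 16000
N = 1600
headset = [math.sin(2 * math.pi * 220 * n / SAMPLE_RATE) for n in range(N)]
interferer = [math.sin(2 * math.pi * 330 * n / SAMPLE_RATE) for n in range(N)]
mixture = [h + 0.5 * i for h, i in zip(headset, interferer)]
separated = [h + 0.05 * i for h, i in zip(headset, interferer)]

print(f"SI-SDR of raw mixture:      {si_sdr(headset, mixture):.2f} dB")
print(f"SI-SDR of separated output: {si_sdr(headset, separated):.2f} dB")
```

A higher score for the separated output than for the raw mixture indicates that the system has suppressed the interfering speaker relative to the headset reference. Because the metric is scale-invariant, a louder or quieter copy of the same output scores the same.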