Amazon Releases New Public Data Set to Help Address “Cocktail Party” Problem

  • Maarten Van Segbroeck, an applied scientist in the Alexa International group and first author on the associated paper, cowrote this post with Zaid Ahmed.
  • Amazon today announced the public release of a new data set that will help speech scientists address the difficult problem of separating speech signals in reverberant rooms with multiple speakers.
  • Each participant was outfitted with a headset microphone, which captured a clear, speaker-specific signal.
  • Also dispersed around the room were five devices with seven microphones each, which fed audio signals directly to an administrator’s laptop.
  • The data set we are releasing includes both the raw audio from each of the seven microphones in each device and the headset signals.
  • The headset signals provide speaker-specific references that can be used to gauge the success of speech separation systems acting on the signals from the microphone arrays.

save | report | share on


Top 200 comments