Here you can find a collection of datasets for benchmarking sound source localization algorithms. For each dataset, we provide a brief description, audio and video clips for each sound instance, ground truth source locations, and other relevant metadata.
A dataset of sounds emitted by a stationary speaker at ~400 positions.
Size: 15GB
A dataset of sounds emitted by an Edison robot performing a random walk around Environment 1.
Size: 44GB
A dataset of vocalizations emitted by lone, freely-behaving adolescent gerbils in response to vocalizations presented through a speaker.
Size: 7.5GB
A dataset of spontaneous vocalizations emitted by various pairings of two gerbils.
Size: 86MB
A dataset of sounds emitted by an earbud affixed to the head of a freely-behaving adult gerbil.
Size: 1.3GB
A dataset of sounds emitted by an ultrasonic speaker affixed to a hexapod robot walking in parallel lines across Environment 2.
Size: 236GB
A dataset of sounds emitted by an earbud affixed to the head of a lone, freely-behaving mouse.
Size: 208GB
A dataset of vocalizations emitted by a freely-behaving mouse.
Size: 362MB
A dataset of vocalizations emitted by two freely-behaving mice.
Size: 2.1GB
A prismic environment with hard plastic walls, acoustic foam lining the ceiling, and an inch of bedding on the floor.
A prismic environment with hard plastic for its walls and ceiling and a layer of bedding on its floor.
A square environment with acoustically transparent walls and a Lexane floor.