Benchmark Datasets for Sound Source Localization

This page collects datasets for benchmarking sound source localization algorithms. Each dataset comes with a brief description, audio and video clips for each sound instance, ground-truth source locations, and other relevant metadata.


Datasets

Speaker-4M-E1

A dataset of sounds emitted by a stationary speaker at ~400 positions.

Size: 15GB

Edison-4M-E1

A dataset of sounds emitted by an Edison robot performing a random walk around Environment 1.

Size: 44GB

SoloGerbil-4M-E1

A dataset of vocalizations emitted by lone, freely-behaving adolescent gerbils in response to vocalizations presented through a speaker.

Size: 7.5GB

DyadGerbil-4M-E1

A dataset of spontaneous vocalizations emitted by various pairings of two gerbils.

Size: 86MB

GerbilEarbud-4M-E1

A dataset of sounds emitted by an earbud affixed to the head of a freely-behaving adult gerbil.

Size: 1.3GB

Hexapod-8M-E2

A dataset of sounds emitted by an ultrasonic speaker affixed to a hexapod robot walking in parallel lines across Environment 2.

Size: 236GB

MouseEarbud-24M-E3

A dataset of sounds emitted by an earbud affixed to the head of a lone, freely-behaving mouse.

Size: 208GB

SoloMouse-24M-E3

A dataset of vocalizations emitted by a freely-behaving mouse.

Size: 362MB

DyadMouse-24M-E3

A dataset of vocalizations emitted by two freely-behaving mice.

Size: 2.1GB
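
For planning downloads, the listing above can be tallied per environment. The following is a minimal sketch, not part of the datasets' own tooling: the names `SIZES_GB`, `parse_name`, and `size_by_environment` are hypothetical, the sizes are copied from the listing and converted to decimal gigabytes (assuming 1 GB = 1000 MB), and the `-E<k>` suffix is taken to be the environment index, which matches the descriptions of Edison-4M-E1 and Hexapod-8M-E2. The meaning of the `<n>M` token is not stated on this page, so the code keeps it as an uninterpreted integer.

```python
import re

# Sizes copied from the listing above, in decimal gigabytes
# (assumption: 86MB -> 0.086 GB, 362MB -> 0.362 GB).
SIZES_GB = {
    "Speaker-4M-E1": 15.0,
    "Edison-4M-E1": 44.0,
    "SoloGerbil-4M-E1": 7.5,
    "DyadGerbil-4M-E1": 0.086,
    "GerbilEarbud-4M-E1": 1.3,
    "Hexapod-8M-E2": 236.0,
    "MouseEarbud-24M-E3": 208.0,
    "SoloMouse-24M-E3": 0.362,
    "DyadMouse-24M-E3": 2.1,
}

# Hypothetical pattern for names of the form <Source>-<n>M-E<k>.
NAME_PATTERN = re.compile(r"^(?P<source>[A-Za-z]+)-(?P<count>\d+)M-E(?P<env>\d+)$")


def parse_name(name):
    """Split a dataset name into (source, n, environment index)."""
    match = NAME_PATTERN.match(name)
    if match is None:
        raise ValueError(f"unrecognized dataset name: {name!r}")
    # `count` is the number before "M"; its meaning is not stated in the listing.
    return match["source"], int(match["count"]), int(match["env"])


def size_by_environment(sizes):
    """Sum dataset sizes (GB) for each environment index."""
    totals = {}
    for name, size_gb in sizes.items():
        _, _, env = parse_name(name)
        totals[env] = totals.get(env, 0.0) + size_gb
    return totals


if __name__ == "__main__":
    totals = size_by_environment(SIZES_GB)
    for env in sorted(totals):
        print(f"Environment {env}: {totals[env]:.1f} GB")
    print(f"All datasets: {sum(totals.values()):.1f} GB")
```

With the sizes listed above, the Environment 1 datasets come to roughly 68 GB, Environment 2 to 236 GB, and Environment 3 to roughly 210 GB, or about 514 GB in total.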


Environments

Environment 1

A prismatic environment with hard plastic walls, acoustic foam lining the ceiling, and an inch of bedding on the floor.

Environment 2

A prismatic environment with hard plastic walls and ceiling and a layer of bedding on the floor.

Environment 3

A square environment with acoustically transparent walls and a Lexan floor.