Title

Machine Learning for Bird Song Learning (ML4BL) dataset

Description

General description

This dataset contains zebra finch decisions about the perceptual similarity of song units. The data and files are those needed to reproduce the results of the paper 'Bird song comparison using deep learning trained from avian perceptual judgments' by the same authors.

Git repo on Zenodo: https://doi.org/10.5281/zenodo.5545932
Git repo access: https://github.com/veronicamorfi/ml4bl/tree/v1.0.0

Directory organisation:

ML4BL_ZF
|_files
    |_Final_probes_20200816.csv - all trials and decisions of the birds (aviary 1 cycle 1 data are removed from experiments)
    |_luscinia_triplets_filtered.csv - triplets to use for training
    |_mean_std_luscinia_pretraining.pckl - mean and std of luscinia triplets used for training
    |_*_cons_* - % side consistency on triplets (train/test) - train set contains both train and val splits
    |_*_gt_* - cycle accuracy for triplets of the specific bird (train/test) - train set contains both train and val splits
    |_*_trials_* - number of decisions made for a triplet (train/test) - train set contains both train and val splits
    |_*_triplets_* - triplet information (aviary_cycle-acc_birdID, POS, NEG, ANC) (train/test) - train set contains both train and val splits
    |_*_low*_ - low-margin (ambiguous) triplets (train/val/test)
    |_*_high_ - high-margin (unambiguous) triplets (train/val/test)
    |_*_cycle_bird_keys_* - unique aviary_cycle-acc_birdID keys (train/test) - train set contains both train and val splits
    |_TunedLusciniaV1e.csv - pairwise distance of two recordings computed by Luscinia
    |_training_setup_1_ordered_acc_single_cons_50_70_trials.pckl - dictionary containing everything needed for training the model (keys: 'train_keys', 'train_triplets', 'val_keys', 'vali_triplets', 'test_triplets', 'test_keys', 'train_mean', 'train_std')
|_melspecs - *.pckl - melspectrograms of recordings
|_wavs - *wav - recordings
|_README.txt

Recordings

887 syllables extracted from zebra finch song recordings, with a sampling rate of 48 kHz, high-pass filtered (100 Hz), with a 20 ms intro/outro fade.

Decisions

Triplets were created from the recordings, and the birds made side-based decisions about their similarity (see 'Bird song comparison using deep learning trained from avian perceptual judgments' for further information).

Training dictionary information

Dictionary keys: 'train_keys', 'train_triplets', 'val_keys', 'vali_triplets', 'test_triplets', 'test_keys', 'train_mean', 'train_std'
train_triplets/vali_triplets/test_triplets: Aviary_Cycle_birdID, POS, NEG, ANC, Decisions, Cycle_ACC(%), Consistency(%)
train_keys/val_keys/test_keys: Aviary_Cycle_birdID
train_mean/train_std: shape: (1, mel_bins)

Open Access

This dataset is available under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.

Contact info

Please send any questions about the recordings to Lies Zandberg: Elisabeth.Zandberg@rhul.ac.uk
Please send any feedback or questions about the code and the rest of the data to Veronica Morfi: g.v.morfi@qmul.ac.uk
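Example: reading the training dictionary

The following is a minimal sketch of how the training dictionary might be read. It assumes that the .pckl file is a standard Python pickle and that each triplet entry is a sequence of seven fields in the order listed above (Aviary_Cycle_birdID, POS, NEG, ANC, Decisions, Cycle_ACC(%), Consistency(%)). The names load_training_setup, Triplet, and as_triplet are hypothetical helpers introduced for illustration and are not part of the ML4BL code; the actual storage layout should be checked against the repository.

```python
import pickle
from typing import Any, NamedTuple

# Keys documented for the training dictionary in this record.
EXPECTED_KEYS = {
    "train_keys", "train_triplets", "val_keys", "vali_triplets",
    "test_triplets", "test_keys", "train_mean", "train_std",
}


class Triplet(NamedTuple):
    """One triplet entry; the field order is an assumption taken from the record."""
    bird_key: Any          # Aviary_Cycle_birdID
    positive: Any          # POS
    negative: Any          # NEG
    anchor: Any            # ANC
    decisions: Any         # Decisions
    cycle_accuracy: Any    # Cycle_ACC(%)
    consistency: Any       # Consistency(%)


def load_training_setup(path):
    """Unpickle the training dictionary and check that the documented keys exist."""
    with open(path, "rb") as handle:
        setup = pickle.load(handle)
    missing = EXPECTED_KEYS - set(setup)
    if missing:
        raise KeyError(f"training dictionary is missing keys: {sorted(missing)}")
    return setup


def as_triplet(entry):
    """Wrap a seven-field entry in a Triplet; fail loudly on any other layout."""
    fields = tuple(entry)
    if len(fields) != len(Triplet._fields):
        raise ValueError(f"expected {len(Triplet._fields)} fields, got {len(fields)}")
    return Triplet(*fields)
```

A call such as load_training_setup("files/training_setup_1_ordered_acc_single_cons_50_70_trials.pckl") would return the dictionary, and as_triplet(setup["train_triplets"][0]) would give named access to one entry if the assumed layout holds. Because unpickling can execute arbitrary code, only files obtained from the official record should be loaded this way.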

Publication Date

10-2-2021

Publisher

Zenodo

DOI

10.5281/zenodo.5545872

Document Type

Data Set

Identifier

5545872

Embargo Date

10-2-2021

Version

1
