Skip to content

SR Evaluation Data

CN-Celeb

# Accuracy (ACC)
# Equal Error Rate (EER)

Data Description

CN-Celeb contains a voiceprint database of 1,000 Chinese celebrities with 130,109 Chinese utterances (including singers, actors, rappers, etc.) from interviews, music performances, and film works, totaling 273.72 hours of audio.
This dataset includes 11 types of real-world scenarios, covering complexities such as noise, channel effects, and pronunciation variations. It is particularly suitable for studying speaker recognition technologies in complex scenarios.

Dataset Composition and Specifications

Source Data Volume

The training set contains 111,260 speech samples from 800 speakers, and the test set contains 18,849 speech samples from 200 speakers.

Evaluation Data Volume

The test set, sourced from CN-Celeb, contains 18,849 speech samples.

Data Fields

For the ASI (Automatic Speaker Identification) task, both the training and test sets are stored in .txt files, with each line containing two fields: the audio file index and the speaker label.

text
*.txt: audio_file_path speaker_id

For the ASV (Automatic Speaker Verification) task, the training and validation sets use the same format. However, the test set is saved in .txt files with only the audio file index.

text
*.txt: audio_file_path

Dataset Example

downstream/spkrec/dataset/data/id00013/singing-03-036.flac id00013

Evaluation Metrics

  • Accuracy (ACC) is used for the speaker identification task.
  • Equal Error Rate (EER) is used for the speaker verification task.

Paper Citations

@inproceedings{fan2020cn,
  title={CN-CELEB: a challenging Chinese speaker recognition dataset},
  author={Fan, Yue and Kang, JW and Li, LT and Li, KC and Chen, HL and Cheng, ST and Zhang, PY and Zhou, ZY and Cai, YQ and Wang, Dong},
  booktitle={ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
  pages={7604--7608},
  year={2020},
  organization={IEEE}
}

@misc{li2020cn,
  title={CN-Celeb: multi-genre speaker recognition},
  author={Lantian Li and Ruiqi Liu and Jiawen Kang and Yue Fan and Hao Cui and Yunqi Cai and Ravichander Vipperla and Thomas Fang Zheng and Dong Wang},
  year={2020},
  eprint={2012.12468},
  archivePrefix={arXiv},
  primaryClass={eess.AS}
 }

Open Source License

Attribution-ShareAlike 4.0 International