Introduction
2019 NIST Speaker Recognition Evaluation Test Set -- CTS Challenge was developed by the Linguistic Data Consortium (LDC) and NIST (National Institute of Standards and Technology). It contains approximately 635 hours of Tunisian Arabic telephone recordings for development and test, answer keys, enrollment, trial files and documentation from the CTS Challenge portion of the NIST-sponsored 2019 Speaker Recognition Evaluation (SRE).
The ongoing series of SRE yearly evaluations conducted by NIST are intended to be of interest to researchers working on the general problem of text independent speaker recognition. To this end the evaluations are designed to be simple, to focus on core technology issues, to be fully supported and to be accessible to those wishing to participate.
The 2019 evaluation task was speaker detection, that is, to determine whether a specified target speaker was speaking during a segment of speech. The evaluation was conducted in two parts: (1) a leaderboard-style challenge based on conversational telephone speech from LDC's Call My Net 2 (CMN2) corpus; and (2) a separate evaluation using audio-visual material collected by LDC for the VAST (Video Annotation for Speech Technology) project. Further information about the evaluation is contained in the evaluation plan included in this release.
Data
The telephone speech data for the CTS Challenge was drawn from the CMN2 collection conducted by LDC in Tunisia in which Tunisian Arabic speakers called friends or relatives who agreed to record their telephone conversations lasting between 8-10 minutes. The speech segments include PSTN (public switched telephone network) and VOIP (voice over IP) data. This telephone speech is presented in sphere format as 8-bit a-law with a sample rate of 8000 KHz.
Use email button above to contact.
Our Community Norms as well as good scientific practices expect that proper credit is given via citation. Please use the data citation above, generated by the Dataverse.
No waiver has been selected for this dataset.
Except as otherwise provided herein, the user shall have no right to copy, redistribute, transmit, publish, sell, transfer, or otherwise use the LDC data for any purpose. The user shall give appropriate attribution to the LDC data in all scholarly or similar publications for which the LDC data or potions thereof have been used.
Only individuals who are then-current faculty, students or staff members of LDC Member institutions or consultants or individuals providing services or doing research for Member institutions shall have access to the LDC data.
The LDC data is protected by copyright as a collective work or compilation under the laws of the United States and other countries. All content, material, and other elements comprising LDC data are also copyrighted works. Users must abide by all additional copyright notices or restrictions contained in the LDC data license agreement supplements.
No guestbook is assigned to this dataset, you will not be prompted to provide any information on file download.
This file has already been deleted (or replaced) in the current version. It may not be edited.
Restricting limits access to published files. You can add or edit Terms of Access for the dataset, and allow people to Request Access to restricted files.
The file will be deleted after you click on the Delete button.
Files will not be removed from previously published versions of the dataset.
Please select one or more files.
Share this dataset on your favorite social media networks.
Citations for this dataset are retrieved from Crossref via DataCite using Make Data Count standards. For more information about dataset metrics, please refer to the User Guide.
The restricted file(s) selected may not be downloaded because you have not been granted access.
The files selected are too large to download as a ZIP.
You can select individual files that are below the 4.0 GB download limit from the files table, or use the Data Access API for programmatic access to the files.
Please select a file or files to be downloaded.
Click Continue to download the files you have access to download.
Are you sure you want to delete this dataset and all of its files? You cannot undelete this dataset.
Are you sure you want to delete this draft version? Files will be reverted to the most recently published version. You cannot undelete this draft.
Private URL can only be used with unpublished versions of datasets.
Are you sure you want to disable the Private URL? If you have shared the Private URL with others they will no longer be able to use it to access your unpublished dataset.
The file(s) will be deleted after you click on the Delete button.
This dataset contains restricted files you may not compute on because you have not been granted access.
Are you sure you want to deaccession? The selected version(s) will no longer be viewable by the public.
Are you sure you want to deaccession this dataset? It will no longer be viewable by the public.
Please select two versions to view the differences.
Please select a file or files for access request.
Select existing file tags or create new tags to describe your files. Each file can have more than one tag.
You need to Log In to request access.
???file.mapData.unpublished.message???
Please confirm and/or complete the information needed below in order to continue.
Upon downloading files the guestbook asks for the following information.
Account Information
Use the Download URL in a Wget command or a download manager to download this package file. Download via web browser is not recommended. User Guide - Downloading a Dataverse Package via URL
https://abacus.library.ubc.ca/api/access/datafile/
Please confirm and/or complete the information needed below in order to request access to files in this dataset.
You will not be able to make changes to this dataset while it is in review.
Are you sure you want to republish this dataset?
Select if this is a minor or major version update.
This dataset cannot be published until Linguistic Data Consortium is published by its administrator.
This dataset cannot be published until Linguistic Data Consortium and Abacus Data Network are published.
Return this dataset to contributor for modification.
Abacus Data Network Support
Please fill this out to prove you are not a robot.