Dataset Restricted Access

Catalan in a bilingual context (PhonCAT)

Lleó, Conxita

Audio recordings of prompted, read and spontaneous speech data from L1 Catalan speakers from Barcelona. The data is stratified according to three different city districts and three age groups. Speakers' age vary from approx. 5 to 45 years.

The Phonoprosodic corpus of spoken Catalan (PhonCAT) was compiled between July 2006 and June 2011 in the research project Phonoprosodic development of Catalan in its current bilingual context (H6, director: Conxita Lleò), part of the Collaborative Research Centre "Multilingualism", funded by the German Research Foundation (Deutsche Forschungsgemeinschaft, DFG) and hosted by the University of Hamburg. It comprises read and elicited as well as spontaneous speech data from speakers of Catalan in Barcelona. The speakers were divided into nine groups according to their age and to the district in Barcelona in which they live. For most recordings, selected words were transcribed phonetically by one to three transcribers and annotated with their orthographic target form. A smaller set of recordings was transcribed orthographically.

Benet, Ariadna, Susana Cortés, and Conxita Lleó. 2012. "Phonoprosodic Corpus of Spoken Catalan (PhonCAT)." In Multilingual Corpora and Multilingual Corpus Analysis, edited by Thomas Schmidt and Kai Wörner, pp. 215–29. Hamburg Studies in Multilingualism, 14. John Benjamins.

 

CLARIN Metadata summary for Catalan in a bilingual context (PhonCAT) (CMDI-based)

Title: Catalan in a bilingual context (PhonCAT)
Description: Audio recordings of prompted, read and spontaneous speech data from L1 Catalan speakers from Barcelona. The data is stratified according to three different city districts and three age groups. Speakers' age vary from approx. 5 to 45 years.
Data owner: Conxita Lleó, Institut für Romanistik / Von-Melle-Park 6 / D-20146 Hamburg, lleo@uni-hamburg.de
Contributors: Conxita Lleó, Institut für Romanistik / Von-Melle-Park 6 / D-20146 Hamburg, lleo@uni-hamburg.de (compiler)
Project: H6 "Phonoprosodic development of Catalan in its current bilingual context", German Research Foundation (DFG)
Keywords: adult bilingualism, bilingual society, cross-sectional data, simultaneous bilingualism, successive bilingualism, child L2 acquisition, L1 data, child bilingualism, language contact, EXMARaLDA
Language: Catalan (cat)
Size: 234 speakers (0 female, 234 male), 225 communications, 205 recordings, 8719 minutes, 875 transcriptions, 187967 words
Genre: discourse
Modality: spoken
References: Benet, Ariadna, Susana Cortés, and Conxita Lleó. 2012. "Phonoprosodic Corpus of Spoken Catalan (PhonCAT)." In Multilingual Corpora and Multilingual Corpus Analysis, edited by Thomas Schmidt and Kai Wörner, pp. 215–29. Hamburg Studies in Multilingualism, 14. John Benjamins.

 

Restricted Access

You may request access to the files in this upload, provided that you fulfil the conditions below. The decision whether to grant/deny access is solely under the responsibility of the record owner.


Restricted access: For research and teaching purposes, access can be given upon request, please contact the record owner(s) stating your purpose and personal data including affiliation.


Cite record as