Siemens Shanghai Mandarin FDB-1000
View resource name in all available languages
Base de données FDB-1000 du mandarin de Shanghai
The Shanghai Mandarin FDB-1000 database contains the recordings of 1,000 speakers (500 males, 500 females) recorded over the fixed telephone network. This acoustic database gathers Mandarin data, as spoken in Shanghai as a first or second Chinese dialect/language.
The corpus consists of read speech, including digits and application words for teleservices, recorded through an ISDN card. Chinese characters and English translation are included, as well as canonical Pinyin transcription including tone markers, and several categories of non-speech events.
Speech samples are stored as sequences of 8 bits 8 kHz A-law. Signal and annotation files are stored separately.
Each speaker uttered about 70 items, which consist of isolated digits, yes/no questions, common application words and phrases.
A pronunciation lexicon with a phonemic transcription in SAMPA is also included.
View resource description in all available languages