German SpeechDat(II) FDB-4000
The German SpeechDat(II) FDB 4000 consists of 4000 calls over the German fixed network, stored on 17 CD-ROMs in the final SpeechDat(II) database exchange format. The speech databases made within the SpeechDat(II) project were validated by SPEX, the Netherlands, to assess their compliance with the SpeechDat format and content specifications.
The following items were recorded:
- 1 isolated digit
- 1 sequence of 10 isolated digits
- prompt sheet number = 5
- 9-11 digit telephone number (read)
- 15-16 digit credit card number (read, 150 different credit card numbers were found)
- 6 digit PIN code (read)
- 1 natural number (read)
- 1 money amount (read)
- 2 yes/no questions (spontaneous, not prompted)
- 3 dates (1 spontaneous, e.g. birthday; 1 prompted text form; 1 relative and general date form)
- 1 time of day (spontaneous)
- 1 time phrase (read)
- 3 application words
- 1 word spotting phrase
- 5 directory assistance names (1 spontaneous name, e.g. forename; 1 spontaneous city name; 1 read city name, from a list of the 500 most frequent; 1 read company/agency name, from a list of the 500 most frequent; 1 read proper name, fore- and surname, from a list of 150 SDB names)
- 3 spellings (1 spontaneous, e.g. forename; 1 directory city name; 1 real/artificial word)
- 4 isolated words
- 9 phonetically rich sentences (read)
The speech files are stored as uncompressed sequences of 8-bit A-law samples at a sampling rate of 8 kHz. Each prompted utterance is stored in a separate file and has an accompanying ASCII SAM label file.
A pronunciation lexicon with a phonemic transcription in SAMPA is also included.