German SpeechDat(II) FDB-1000 – META-SHARE

Last view: 2024-07-09

18 Last view: 2024-07-09

German SpeechDat(II) FDB-1000

View resource name in all available languages

Base de données SpeechDat(II) FDB 1000 de l'allemand

http://catalog.elra.info/product_info.php?products_id=453

ID:

ELRA-S0051

The German SpeechDat(II) FDB 1000 consists of 988 calls over the German fixed network, stored on 4 CD-ROMs in the final SpeechDat(II) database exchange format. The speech databases made within the SpeechDat(II) project were validated by SPEX, the Netherlands, to assess their compliance with the SpeechDat format and content specifications.
The following items were recorded:
- 1 isolated digit (read or prompted)
- 1 sequence of 10 isolated digit
- 4 connected digits
- 4-6 digit number to identify the prompt sheet
- ca. 10 digit telephone number (read)
- 14-16 digit credit card number (read, 150 different credit card numbers were found)
- 6 digit PIN code (read)
- 1 natural number (read)
- 1 money amount (read)
- 3 spelled words (1 spontaneous name spelling, 2 read)
- 1 time of day (spontaneous)
- 1 time phrase (read)
- 1 date (spontaneous)
- 1 date (read)
- 1 relative date (read)
- 2 yes/no questions (spontaneous, not prompted)
- 3/6 common application words (read)
All application words are recorded more than 80 times.
These are:
- 1 application word phrase
- 9 phonetically rich sentences (read)
- 4 phonetically rich words (read)
- 5 directory assistance names (1 spontaneous name (e.g. forename), 1 spontaneous city name, 1 read city name (from a list of 500 most frequent), 1 read company/agency name (from a list of 500 most frequent), 1 read proper name, fore- and surname (from list of 150 SDB names).

A pronunciation lexicon with a phonemic transcription in SAMPA is also included.

View resource description in all available languages

La base de données SpeechDat(II) FDB 1000 de l'allemand est composée de 988 appels à travers le réseau téléphonique fixe allemand présentés sur 4 CD-ROM. Les bases orales réalisées lors du projet SpeechDat(II) ont été validées par SPEX, Pays-Bas, afin de contrôler leur adéquation avec le format SpeechDat et les spécifications de contenu.

Les éléments suivants ont été enregistrés :

- 1 chiffre isolé (lu ou soufflé)
- 1 séquence de 10 chiffres isolés
- 4 chiffres connectés
- 1 nombre de 4-6 chiffres permettant d'identifier la feuille de prompt
- 1 numéro de téléphone d'environ 10 chiffres (lu)
- 1 numéro de carte de crédit de 14-16 chiffres (lu, 150 numéros différents ont été trouvés)
- 1 code confidentiel de 6 chiffres (lu)
- 1 nombre entier naturel (lu)
- 1 montant (argent) (lu)
- 3 mots épelés (1 épellation spontanée de nom, 2 lues)
- 1 jour (spontané)
- 1 phrase comportant une notion de temps (lue)
- 1 date (spontanée)
- 1 date (lue)
- 1 date relative (lue)
- 2 questions oui/non (spontanées)
- 3/6 mots de commande courants (lus)

Tous les mots de commande sont enregistrés plus de 80 fois. Ceux-ci sont :

- 1 mot de commande
- 9 phrases phonétiquement riches (lues)
- 4 mots phonétiquement riches (lus)
- 5 noms provenant d'un annuaire de renseignements téléphoniques (1 nom spontané (ex : prénom), 1 nom de ville spontané, 1 nom de ville lu (provenant d'une liste de 500 noms les plus fréquents), 1 nom de compagnie lu (à partir d'une liste de 500 noms les plus fréquents), 1 nom propre, prénom et nom de famille, lu (à partir d'une liste de 150 noms).

Un lexique de prononciation avec sa transcription phonétique en SAMPA est également fourni.

You don’t have the permission to edit this resource.

DistributionAvailability

Available - Restricted Use

Start date: 04/16/1998

Licence

ELRA END USER

Restrictions: Academic - Non Commercial Use

For Non Members of ELRA

User Nature: Commercial

ELRA VAR

Restrictions: Commercial Use

For Members of ELRA

User Nature: Commercial

ELRA END USER

Restrictions: Academic - Non Commercial Use

For Members of ELRA

User Nature: Commercial

ELRA VAR

Restrictions: Commercial Use

For Members of ELRA

User Nature: Academic

ELRA END USER

Restrictions: Academic - Non Commercial Use

For Members of ELRA

User Nature: Academic

ELRA VAR

Restrictions: Commercial Use

For Non Members of ELRA

User Nature: Commercial

ELRA VAR

Restrictions: Commercial Use

For Non Members of ELRA

User Nature: Academic

ELRA END USER

Restrictions: Academic - Non Commercial Use

For Non Members of ELRA

User Nature: Academic

Contact Person

Mapelli Valérie

audio

Monolingual audio corpusLanguages

German

Linguality

Linguality type: Monolingual

Size

no size available

Resource Creation

Funding Project

SpeechDat(II)

Funding Type: Eu Funds

Metadata

Created: 05/12/2005

Version

Version: 1.0

Last Updated: 08/28/2007

People who looked at this resource also viewed the following: