SPINA Corpus ("Robots Commands") – META-SHARE

Last view: 2024-07-09

9 Last view: 2024-07-09

SPINA Corpus ("Robots Commands")

View resource name in all available languages

Corpus SPINA ("Commandes robots")

http://catalog.elra.info/product_info.php?products_id=446

ID:

ELRA-S0044

This German corpus contains read speech of 22 different speakers (6 male, 16 female). The corpus consists of 10 robot command sentences and 62 robot command words. Each speaker reads the whole corpus 5 times, except one speaker who reads the sentence corpus 16 times and the word corpus 51 times. The speakers were recorded at two different sites in Germany (University of Goettingen, University of Bochum).The corpus contains a total of 10,810 recorded utterances.
All speakers are within the age of 25-30. Two speakers are non-native speakers. One file gives information about the speakers (speaker ID, recording site, sex).The task for the speaker was to read carefully but fluently. If an error occurred, the recording was interrupted by the supervisor and the sentence was repeated. The signal files are raw files without any header, 16 bit per sample, linear, most significant byte first, 16 kHz sample frequency.
The orthography of the corpus is given in two distinct files which contain the prompted words and the prompted sentences as an ordered list.The recording conditions are as follows:
Microphone:AKG acoustics, C414B-TL, condensator microphone omnidirectional, built-in attenuator and high pass filter switched off, distance to mouth 50 cm.
Environment: Studio Quality, echo cancelled room, about 121 qqm
Preamplifier: John Hardy, M-1
Sampling rate: 48 kHz to DAT recorder, filtered to 16 kHz
Resolution: 16 Bit, most significant byte first
The speech data were digitally filtered to 8 kHz cut-off frequency and downsampled to 16 kHz.
The corpus consists of 1 volume, total size 266,361 KB uncompressed data. The signal of each utterance is stored in a separate file. Symbolic information like segmentations or labelling (e.g. Phonological Segmentation of words or Word Segmentation of sentences) are stored in files with the same prefix but with different extensions.

View resource description in all available languages

Ce corpus allemand est constitué par de la parole lue par 22 locuteurs différents (6 hommes, 16 femmes). Il comprend 10 phrases et 62 mots de commande de robot. Chaque locuteur a lu le corpus 5 fois dans son intégralité. Un locuteur a lu le corpus de phrases 16 fois et le corpus de mots 51 fois. Les enregistrements de locuteurs proviennent de deux sites différents en Allemagne (l'Université de Goettingen, l'Université de Bochum). Le corpus contient 10 810 enregistrements.
Tous les locuteurs ont entre 25 et 30 ans. Deux locuteurs n'ont pas l'allemand pour langue maternelle. Un fichier présente une description des locuteurs (identité du locuteur, lieu d'enregistrement, sexe).

L'objectif pour chaque locuteur était de lire les textes avec attention mais naturellement. Lorsque une erreur était commise, l'enregistrement était interrompu par un superviseur et la phrase était répétée. Les fichiers de signaux sont des fichiers bruts, sans en-tête, de 16 bit par échantillon, linéaires, octet de poids fort en premier, et d'une fréquence d'échantillonnage de 16 kHz.

L'orthographe du corpus est donnée dans deux fichiers distincts contenant les mots et les phrases à prononcer dans une liste ordonnée.

Les conditions d'enregistrements sont les suivantes:
Microphone: AKG acoustique, C414B-TL, microphone à condensateur omnidirectionnel, distance à la bouche: 50 cm.
Environnement: de qualité studio, en chambre anéchoïque.
Préamplificateur: John Hardy, M-1.
Taux d'échantillonnage: 48 kHz sur DAT, sous-échantillonné à 16 kHz.
Résolution: 16 bit, octet de poids fort en premier
Les données parole ont été filtrées numériquement avec une fréquence de coupure à 8 kHz et sous-échantillonnées à 16 kHz.
La taille totale du corpus est de 266 361 KB de données non compressées.
Le signal de chaque occurrence est stocké dans un fichier séparé. Les informations symboliques comme la segmentation ou l'étiquetage (ex. : segmentation phonologique des mots et segmentation des phrases) sont stockées dans des fichiers comportant le même préfixe mais une extension différente.

You don’t have the permission to edit this resource.

DistributionAvailability

Available - Restricted Use

Start date: 10/28/1997

Licence

ELRA VAR

Restrictions: Commercial Use

For Members of ELRA

User Nature: Academic

ELRA END USER

Restrictions: Academic - Non Commercial Use

For Non Members of ELRA

User Nature: Academic

ELRA VAR

Restrictions: Commercial Use

For Non Members of ELRA

User Nature: Academic

ELRA END USER

Restrictions: Academic - Non Commercial Use

For Non Members of ELRA

User Nature: Commercial

ELRA VAR

Restrictions: Commercial Use

For Non Members of ELRA

User Nature: Commercial

ELRA END USER

Restrictions: Academic - Non Commercial Use

For Members of ELRA

User Nature: Academic

ELRA END USER

Restrictions: Academic - Non Commercial Use

For Members of ELRA

User Nature: Commercial

ELRA VAR

Restrictions: Commercial Use

For Members of ELRA

User Nature: Commercial

Contact Person

Mapelli Valérie

audio

Monolingual audio corpusLanguages

German

Linguality

Linguality type: Monolingual

Size

no size available

Metadata

Created: 05/12/2005

Version

Version: 1.0

Last Updated: 05/02/2005

People who looked at this resource also viewed the following: