C-ORAL-ROM : Integrated Reference Corpora for Spoken Romance Languages.
Title:
C-ORAL-ROM : Integrated Reference Corpora for Spoken Romance Languages.
Author:
Cresti, Emanuela.
ISBN:
9789027294579
Physical Description:
1 online resource (322 pages)
Contents:
C-ORAL-ROM -- Editorial page -- Title page -- LCC data -- Table of contents -- Acknowledgements -- Preface -- 1. The C-ORAL-ROM resource -- 1.1 Introduction -- 1.2 Prosodic tagging criteria -- 1.3 Textual format -- 1.4 WinPitch Corpus. A Text-to-Speech Analysis and Alignment Tool -- 1.5 C-ORAL-ROM PoS tagging -- 1.6 Measurements of spoken language variability in the Romance languages -- Notes -- 2. The Italian corpus -- 2.1 History of the corpus within the national framework -- 2.2 Orthographic transcription -- 2.3 Morpho-syntactic tagging -- 2.4 Main data from lemmatisation -- Notes -- 3. The French corpus -- 3.1 History of the corpus within the national framework -- 3.2 Orthographic transcription -- 3.3 Morpho-syntactic tagging -- Notes -- 4. The Spanish corpus -- 4.1 History of the corpus in the national framework -- 4.2 Orthographic transcription -- 4.3 Morpho-syntactic tagging -- Notes -- 5. The Portuguese corpus -- 5.1 History of the corpus within the national framework -- 5.2 Orthographic transcription -- 5.3 Morpho-syntactic tagging -- 5.4 Main data from lemmatisation -- Notes -- 6. Notes on lexical strategy, structural strategies and surface clause indexes in the C-ORAL-ROM spoken corpora -- 6.1 Premises -- 6.2 The noun vs. verb lexical strategy in speech -- 6.3 Informational patterning -- 6.4 The verbal utterance -- 6.5 The 'non-structuring strategies' in Italian -- 6.6 The structural types of utterances -- 6.7 Some remarks on Italian Media and Telephone -- 6.8 Surface clause indexes -- 6.9 The informational positions of surface clause indexes -- 6.10 Some remarks on coordination, subordination and negation in the four Romance languages (FRLs) -- Notes -- Appendix: Evaluation of consensus on the annotation of terminal and non-terminal prosodic breaks in the C-ORAL-ROM Corpus -- A.1 Goals of the evaluation -- A.2 Evaluation background -- A.3 Experimental setting -- A.4 Selection of evaluators -- A.5 Measurements and statistics -- A.6 Results -- A.7 Discussion -- Notes -- Bibliography -- Note -- The series Studies in Corpus Linguistics.
Abstract:
The C-ORAL-ROM book and DVD provide a unique set of comparable corpora of spontaneous speech for the main Romance languages: French, Italian, Portuguese and Spanish. The corpora are accompanied by comparative linguistic studies, models and standard linguistic measures of spoken language variability. Each corpus is built to the same design using identical sampling techniques, and each corpus is presented in multimedia format, allowing simultaneous access to aligned acoustic and textual information. Texts are headed with information about provenance, participants, etc., and the transcriptions show changes of speaker. Speech acts are tagged according to the evidence of prosodic criteria. Each corpus totals 300,000 words and presents formal and informal speech in a variety of contexts of use, dialogue structure and text genres, semantic domains and speech act typologies. The corpora have great statistical relevance for spoken language structures and can address key issues in human language technology such as speech recognition in unrestricted discourse, the suitability of speech synthesis in natural prosody, and multilingual applications of the spoken language interface. The work provides new data and innovative theoretical perspectives that are relevant for corpus linguistics, Romance linguistics, syntactic theory, speech and prosody research, and second language acquisition.
Local Note:
Electronic reproduction. Ann Arbor, Michigan : ProQuest Ebook Central, 2017. Available via World Wide Web. Access may be limited to ProQuest Ebook Central affiliated libraries.