Suggestions for building an accurate oral corpus to phonetics analysis

Authors

  • Nuria Polo Cano, Universidad Nacional de Educación a Distancia (UNED), Spain

DOI:

https://doi.org/10.18778/2392-0718.05.07

Keywords:

oral corpus, analysis of phonetics, natural speech, recordings, data-mining

Abstract

Mistakes are often made when an oral corpus is collected, and some of them can make later phonetic analysis of the data impossible. To help avoid this, this paper offers advice on participants, recordings, and the tools available for building an oral corpus. Its purpose is to guide future researchers in building this kind of corpus, so that the result is accurate enough for phonetic analysis and meets current scientific quality standards.

Published

2018-03-31

How to Cite

Polo Cano, N. (2018). Suggestions for building an accurate oral corpus to phonetics analysis. E-Scripta Romanica, 5, 71–79. https://doi.org/10.18778/2392-0718.05.07