Design and development of Iberia: a corpus of scientific Spanish
Design and development of Iberia: a corpus of scientific Spanish
Zamorano, Jordi Porta; García, Emilio del Rosal; Lara, Ignacio Ahumada
2011-11-01 00:00:00
<jats:p> Iberia is a synchronic corpus of scientific Spanish designed mainly for terminological studies. In this paper, we describe its design and the infrastructure for its acquisition, processing and exploitation, including mark-up, linguistic annotation, indexing and the user interface. Two pre-processing tasks affecting a large number of words are described in detail: de-hyphenation and identification of text fragments in other languages. We also show how some of the reported statistics, namely, dispersion and association, are used for research on lexis. </jats:p>
http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.pngCorporaEdinburgh University Presshttp://www.deepdyve.com/lp/edinburgh-university-press/design-and-development-of-iberia-a-corpus-of-scientific-spanish-0U02yFzKP0
Design and development of Iberia: a corpus of scientific Spanish
<jats:p> Iberia is a synchronic corpus of scientific Spanish designed mainly for terminological studies. In this paper, we describe its design and the infrastructure for its acquisition, processing and exploitation, including mark-up, linguistic annotation, indexing and the user interface. Two pre-processing tasks affecting a large number of words are described in detail: de-hyphenation and identification of text fragments in other languages. We also show how some of the reported statistics, namely, dispersion and association, are used for research on lexis. </jats:p>
To get new article updates from a journal on your personalized homepage, please log in first, or sign up for a DeepDyve account if you don’t already have one.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.