Designing and Evaluating Language Corpora

A Practical Framework for Corpus Representativeness

Bethany Gray (author), Douglas Biber (author), Jesse Egbert (author)

Format: Hardback

Publisher: Cambridge University Press

Published: 14th Apr '22

Currently unavailable, and unfortunately no date known when it will be back


This volume introduces a new framework for conceptualizing and achieving corpus representativeness in a rigorous, yet practical way.

The use of language corpora, or large samples of natural texts, has become ubiquitous in linguistic research. Yet, there are no conceptual or methodological frameworks for corpus representativeness. This book is the first to provide the field of linguistics with a comprehensive framework for corpus design, evaluation, and representativeness.

Corpora are ubiquitous in linguistic research, yet to date, there has been no consensus on how to conceptualize corpus representativeness and collect corpus samples. This pioneering book bridges this gap by introducing a conceptual and methodological framework for corpus design and representativeness. Written by experts in the field, it shows how corpora can be designed and built in a way that is both optimally suited to specific research agendas, and adequately representative of the types of language use in question. It considers questions such as 'what types of texts should be included in the corpus?', and 'how many texts are required?' – highlighting that the degree of representativeness rests on the dual pillars of domain considerations and distribution considerations. The authors introduce, explain, and illustrate all aspects of this corpus representativeness framework in a step-by-step fashion, using examples and activities to help readers develop practical skills in corpus design and evaluation.

'A valuable guide for corpora users and designers, a must-read before beginning the process of corpora selection and design.' Ana Abigahil Flores Hernández and Pauline Moore, Tertium Linguistic Journal

ISBN: 9781107151383

Dimensions: 235mm x 158mm x 21mm

Weight: 570g

250 pages