Corpus Analysis

Contents

What is a Corpus

In linguistics and lexicography, a corpus is a body of texts, utterances, or other specimens considered more or less representative of a language, and usually stored as an electronic database.

Corpus Creation

Corpus Annotation

Corpus Analysis Tools

The following websites contain lists of analysis techniques and tools which have been grouped by the mainteiners of the pages according to their function and availability.

Corpus Encoding Standards

Markup Languages: SGML/XML

Corpus Search



Next: Repositories of Corpora
bentivo@itc.it
Last modified: Wed Feb 13 12:23:26 MET 2002