Eesti keeles

The Mixed Corpus: Horisont

Contents

This subcorpus contains texts from the popular science magazine Horisont, 260 000 words altogether. The corpus contains issues from the years 1996-2003, 230 articles in 7 files. The texts originate from the webpage http://www.horisont.ee (09.10 2003)

How can one use it?

The corpus is free for use for non-commercial purposes only.

Texts and annotation

Mark-up and annotation conform to the TEI-guidelines.

Every file begins with a header <teiheader> that contains information about the file size, used tags etc.

The rest of the file is structured as follows:

In the corpus version one can access via our corpus query, all mark-up except the tags <gap> used for the omitted material have been deleted.

Valid XHTML 1.0! Valid CSS! Webmaster    Last modified: March 24 2014 14:09:34.