
The Mixed Corpus: Horisont

Content

This subcorpus contains texts from the popular science magazine Horisont, 260 000 words in total. It covers issues from 1996-2003: 230 articles in 7 files. The texts were taken from the website http://www.horisont.ee (09.10.2003).

The corpus may be used free of charge, for non-commercial purposes only.

Texts and annotation

Mark-up and annotation conform to the TEI guidelines.

Every file begins with a header <teiheader> that contains information about the file size, the tags used, etc.
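
Because every file starts with a <teiheader> block, a script will usually want to separate that header from the text proper before processing. The following is a minimal Python sketch of one way to do that, not the corpus maintainers' own tooling. It assumes the header is closed by a matching </teiheader> tag and that tag case may vary, as SGML allows. The function name split_header and the sample string are hypothetical. Reading from disk is left out because the file encoding is not stated here.

```python
import re

# Assumption: the header is delimited by <teiheader> ... </teiheader>.
# The tag name comes from the description above; matching is case-insensitive
# because SGML tag names are often not case-sensitive.
HEADER_RE = re.compile(
    r"<teiheader\b[^>]*>(.*?)</teiheader\s*>",
    re.IGNORECASE | re.DOTALL,
)


def split_header(text):
    """Split one corpus file's contents into (header, rest).

    header is the text inside the <teiheader> element, or None if no
    header is found; rest is everything after the closing tag (or the
    whole input when there is no header).
    """
    match = HEADER_RE.search(text)
    if match is None:
        return None, text
    return match.group(1).strip(), text[match.end():].lstrip()


# Hypothetical miniature file, not taken from the corpus.
sample = (
    "<teiheader>\n<title>Example</title>\n</teiheader>\n"
    "<text><p>Tere!</p></text>"
)
header, body = split_header(sample)
print(header)
print(body)
```

A regular expression is used instead of an XML parser because the files are SGML and may contain entities or unclosed tags that a strict XML parser would reject.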

The rest of the file is structured as follows:

SGML entities

The SGML files contain the entities listed in this table


Last modified: December 21 2018 18:41:55.