Corpus of Written Estonian: the 1980s
Statistics and bibliographical references
Version without mark-up, sentence per line
Examples:
Corpus texts
NB! Accented letters and special symbols are in Latin4 code table (ISO-8859-4).
Punctuation is separated from words with a space, excl. abbreviations.
Texts were last edited in Dec. 15, 2010
Version with mark-up
Examples:
Corpus texts
-
Newspapers
(zip-fail 1,71 Mb)
-
Fiction
(zip-fail 1,13 Mb)
-
Science
(zip-fail 0,74 Mb)
-
Popular science
(zip-fail 0,71 Mb)
-
Essays and biographies
(zip-fail 0,44 Mb)
-
Hobby texts
(zip-fail 0,39 Mb)
-
Propaganda
(zip-fail 0,27 Mb)
-
Encyclopædias
(zip-fail 0,10 Mb)
-
Documents
(zip-fail 0,05 Mb)
-
Religion
(zip-fail 0,04 Mb)
NB! Accented letters and special symbols are as SGML entities.
Texts were last edited in Dec. 15, 2010
Webmaster
Last modified: December 19 2018 17:03:37.