Fri - July 28, 2006

TIGER Corpus



From the TIGER project:

The TIGER project is pleased to announce the second release of the TIGER Corpus. The treebank consists of approximately 900,000 tokens (50,000 sentences) of German newspaper text. It was semi-automatically annotated with part-of-speech tags, syntactic structures and, in addition to the first release, morphological information.

For details about the TIGER Corpus (download, documentation etc.), see:

http://www.ims.uni-stuttgart.de/projekte/TIGER/TIGERCorpus/

We provide the corpus free of charge for scientific use.

Posted at 01:36 PM

