Swedish Treebank
The Swedish Treebank is a syntactically annotated corpus. The annotation includes word and sentence boundaries, morphological information (word classes etc.), and syntactic information (phrases and grammatical functions, dependency structure). The Swedish Treebank is based on two previous corpora, Talbanken and SUC, which have been harmonised. The Swedish Treebank contains approx. 350,000 tokens.
Go to data source
Opens in a new tabhttp://stp.lingfil.uu.se/~nivre/swedish_treebank/
Citation and access
Citation and access
Data access level:
Creator/Principal investigator(s):
- Joakim Nivre - Uppsala University
- Beáta Megyesi - Uppsala University
- Bengt Dahlqvist - Uppsala University
- Anna Sågvall Hein - Uppsala University - Department of Linguistics and Philology
- Johan Hall - Uppsala University
- Jens Nilsson - Växjö University
Research principal:
Data contains personal data:
No
