SUSANNETS treebank

Author: Alastair Butler


SUSANNETS is a conversion of Geoffrey Sampson's SUSANNE Corpus into the CorpusSearch format, with tag labels changed to those of the Penn Historical Corpora scheme.

The conversion was made primarily to serve as a companion to the Treebank Semantics Parsed Corpus (TSPC), which annotates English sentences following a scheme informed by both the Penn Historical Corpora scheme (adopting its tag labels and the CorpusSearch format) and the SUSANNE scheme (adopting its construction analysis, functional and grammatical information, and formation of complex expressions).

The conversion thereby increases the availability of high-quality syntactically parsed analyses for testing the generation of predicate-logic-based meaning representations with Treebank Semantics.

Highlights include:

The parsed data and further results of analysis (e.g., derived indexing, word dependencies, generated semantic representations) are made accessible through a web-based interface.

You are invited to:


One is permitted to take copies of the SUSANNE Corpus and use it for any purpose, and these same privileges extend to this conversion.


Development of this conversion was funded by the Japan Society for the Promotion of Science (JSPS). I should like to gratefully acknowledge the original work of Geoffrey Sampson and his team at Sussex University as creators of the SUSANNE Corpus, with sponsorship from the Economic and Social Research Council (UK). As a conversion, the current work is derivative, but any problems found are solely the responsibility of this conversion work. In this regard, please consider reporting any problems to: ajb129 __AT__ hotmail __dot__ com.