SUSANNETS treebank

Author: Alastair Butler

Introduction

SUSANNETS is an automatic conversion of Geoffrey Sampson's SUSANNE treebank into the CorpusSearch format and with tag labels changed to the Penn Historical Corpora scheme.

The script used for the automatic conversion is freely available.

The conversion was made primarily to serve as a companion to the Treebank Semantics Parsed Corpus (TSPC), which annotates English sentences following a scheme informed by both the Penn Historical Corpora scheme (adopting tag labels, construction analysis, and CorpusSearch format), and the SUSANNE scheme (adopting construction analysis, functional and grammatical information, and the forming of complex expressions).

This increases the availability of high quality syntactic parsed analyses for testing the generation of predicate logic based meaning representations with Treebank Semantics.

Highlights include:

The parsed data, and further results of analysis (e.g., derived indexing, word dependencies, generated semantic representations), are made accessible through a web based interface.

You are invited to:

Explore SUSANNETS