The Treebank Semantics Parsed Corpus (TSPC)

Author: Alastair Butler

Introduction

The Treebank Semantics Parsed Corpus (TSPC) is built as a testing ground for generating predicate logic based meaning representations with Treebank Semantics.

The parsed annotation follows a scheme informed by both the Penn Historical Corpora scheme (adopting tag labels and CorpusSearch format) and the SUSANNE scheme (adopting construction analysis, functional and grammatical information, and the forming of complex expressions).

Highlights include:

The parsed data, and further results of analysis (e.g., derived indexing, word dependencies, generated semantic representations), are made accessible through a web based interface.

You are invited to:

Explore and download the TSPC

Acknowledgements

Development was funded by the Japan Science and Technology Agency (JST) and the Japan Society for the Promotion of Science (JSPS).