LOBTS treebank

Author: Alastair Butler

Introduction

LOBTS is a (growing) parsed fragment of the Lancaster-Oslo-Bergen corpus of modern English (LOB).

This corpus is being developed primarily as a companion to the Treebank Semantics Parsed Corpus (TSPC), which annotates English sentences following a scheme informed by two sources: the Penn Historical Corpora scheme (from which it adopts tag labels and the CorpusSearch format) and the SUSANNE scheme (from which it adopts construction analysis, functional and grammatical information, and the forming of complex expressions).

LOBTS increases the availability of high-quality parsed syntactic analyses for testing the generation of predicate-logic-based meaning representations with Treebank Semantics.

The parsed data and further results of analysis (e.g., derived indexing, word dependencies, generated semantic representations) are accessible through a web-based interface.

You are invited to:

Explore and download LOBTS


LOBTS is distributed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.

Please report any problems to: ajb129 __AT__ hotmail __dot__ com.

Acknowledgements

Development is funded by the Japan Society for the Promotion of Science (JSPS).