next up previous contents
Next: Constraint Grammar Up: Comparing approaches to subcategorisation Previous: Summary

Preliminary Recommendations

Syntactically annotated corpora

In this section, a brief overview will be given of the main annotation schemes used in the syntactic annotation of corpora to date. There are six main schemes in use at the moment, the majority of which are used on English only, although most are now being extended to include the annotation of other European languages. The schemes are the following:

  1. Constraint Grammar
  2. Lancaster/IBM Scheme
  3. Paris/IBM Scheme
  4. TOSCA
  5. UPenn
  6. SUSANNE

All of the above schemes use a traditional phrase structure approach to syntactic analysis, apart from Constraint Grammar, which uses a partial dependency approach. The Constraint Grammar scheme is also the only fully automated system; the other schemes are applied either manually, or in an interactive manner, involving human intervention/correction at different points in the analysis.