Showing dependency relations: layer (c)

Next: Indicating syntactic function labels: Up: Layers of annotation Previous: Labelling the category of

Recommendations

Showing dependency relations: layer (c)

This layer of information comprises head-dependent relations between words, e.g. adjectives and the nouns they modify. It is often assumed that the showing of dependency relations between words (dependency syntax) is an alternative to labelled bracketing (phrase structure syntax) as a way of indicating syntactic constituency relations, and indeed the two methods of indicating syntactic structure have overlapping functions. But they also convey different kinds of information (see below), and since both may be (independently or simultaneously) marked in a text, there is no difficulty in treating them as separate layers of annotation for the present purpose.

Dependency relations are traditionally indicated with arrows; these point from a head to a dependent, as in 6, although they may point in the opposite direction, showing the relation of a dependent to the head, as in 7. Note that there are some differences of opinion as to which words are heads and which words dependents. For example, within the NP, the noun may be regarded as the head of the determiner, or vice versa.

(6)

(7)

In many cases, dependency relations can also be represented by a tree structure, as in 8 . Heads are shown as nodes at the upper ends of branches (slanted lines) and dependents are shown at the lower end of branches. Thus, walked is analysed as the head of both he and into; into of garden; and garden of the.

(8)

An important difference between phrase structure and dependency trees is that in phrase structure trees the segments related to one another may or may not correspond to single words, whereas in dependency representations the segments related to one another are words.

As far as we know, the ENGCG parser is the only system of corpus annotation that uses a partial dependency syntax, and can be used as an exemplar. An example of ENGCG output is given in table 6; the output was obtained from LingSoft's e-mail parser. Instead of the arrows of the traditional dependency representation, the symbols < and > are used to indicate dependents; < means that the word's head precedes, > that it follows. The link is established in the syntactic function label. For example, @DN> indicates that an is a determiner (D) whose head is a noun (N) that follows. If the head is not the next noun but the second following one, the notation @DN2> is used. The symbols @+FMAINV and @-FMAINV represent a finite verb and a non-finite verb respectively.

**Table 6:** ENGCG output
"<*i>"
"i" <*> <NonMod> PRON PERS NOM SG1 SUBJ @SUBJ
"<started>"
"start" <SV> <SVO> <P/on> V PAST VFIN @+FMAINV
"<work>"
"work" N NOM SG @OBJ
"<on>"
"on" PREP @ADVL
"<an>"
"an" <Indef> DET CENTRAL ART SG @DN>
"<*english>"
"english" <*> <Nominal> A ABS @AN>
"<description>"
"description" N NOM SG @<P
"<within>"
"within" PREP @<NOM @ADVL
"<the>"
"the" <Def> DET CENTRAL ART SG/PL @DN>
"<*constraint>"
"constraint" <*> N NOM SG @NN>
"<*grammar>"
"grammar" <*> N NOM SG @NN>
"<framework>"
"framework" N NOM SG @<P
"<proposed>"
"propose" <Vcog> <SVO> <SV> PCP2 @<NOM-FMAINV
"<by>"
"by" PREP @ADVL
"<*karlsson>"
"karlsson" <*> <Proper> N NOM SG @<P
"<$[>"
"<1990>"
"1990" <1900> NUM CARD @ADVL
"<$;>"
"<1994a>"
"1994a" <1994a> NUM CARD @ADVL

Although there are a number of resemblances between phrase structure and dependency trees, it is not always easy or possible to convert one to the other. In the classical phrase structure representation, as in 5, heads are not marked at all, so a mechanical conversion will require stipulation of what the head of each dependent is in order to be able to create the dependency chain. Conversely, since in classical dependency syntax constituency plays no role (constituents as such are not indicated), the boundaries of a constituent have to be reconstructed from a subtree of a dependency tree. Thus, in 8, the part of the dependency tree dominated by Prep defines a prepositional phrase. However, dependencies may also be discontinuous, not mapping on to a context-free phrase structure tree.

Since phrase structure trees and dependency trees convey overlapping types of information, it is unusual for a syntactic annotation scheme to include both. We regard them in this report as alternative methods of indicating structural relations between parts of a sentence.

Next: Indicating syntactic function labels: Up: Layers of annotation Previous: Labelling the category of