README Oracc Home SEARCH DOCUMENTATION

Creative Commons License

ORACC Home


Introduction


XST


synattr.rnc

XTF Attributes


Resources

XST: XML Syntax Trees

(http://oracc.org/ns/xst/1.0)

Steve Tinney
Version of 2017-08-10

Introduction

XML Syntax Trees provide a generic vocabulary for expressing syntactic data which is based closely on XCES. This document describes the CDL system-level support for XST, including the facilities in the ATF notation for expressing syntactic annotation. Such annotations may range from isolated hints to a parser which generates the bulk of syntax trees automatically to complete syntax trees interwoven with lemmatization in the linguistic annotation layer of ATF. The annotations themselves are language-specific and are described in separate documents.

XST

synattr.rnc

XTF Attributes

When syntax data is procesed by the ATF processor it is stored in a set of attributes placed on word nodes in the XTF file; the actual definitions of these attributes is located in the GDL schema, along with the various word elements; the information is repeated here for convenience.

brack-before, brack-after
These attributes store bracket sequences and their annotation; the bracket sequences may occur before or after the word (represented as a lemma).
ub-before, ub-after
These attributes store unit break information given before or after the lemma. If the attribute value is + a unit break is inserted before or after the word. A value of - is only relevant if some of the unit breaks are being chosen automatically; the effect is to suppress an automatically chosen unit break.
namespace syn = "http://oracc.org/ns/syntax/1.0"
syntax.attributes = 
  (attribute syn:brack-before { text } |
   attribute syn:brack-after  { text } |
   attribute syn:ub-before  { "+" | "-" } |
   attribute syn:ub-after   { "+" | "-" } )

Resources


Questions about this document may be directed to the Oracc Steering Committee (osc at oracc dot org).