This document provides an overview of language-specific annotation conventions for Greek used in Oracc.
Greek is indicated by the language shift
Greek may be entered in Unicode without any special indication to the ATF processor; we define an ASCII transliteration as well since it may be easier for some users to employ.
Words are separated by spaces.
Matching square brackets are used for broken material, a following
question mark for unclear letters and a following exclamation mark for
corrected letters. Multiple alternative possible readings are
separated by slashes (e.g.,
Within square brackets, broken segments of relatively certain length are indicated by a period for each missing letter.
Broken segments of uncertain length are indicated by underscore
[________________________] indicates that in the first break 3
characters are missing, and in the second break more a large but
indeterminate number of characters is missing.
Spaces should be used to terminate words in breaks; words may
contain dots and underscores indicating that they are partially
[_____ _]br[_ _____].
Steve Tinney, 'GRC: Oracc Linguistic Annotation for Greek', Oracc: The Open Richly Annotated Cuneiform Corpus, Oracc, 2014 [http://oracc.museum.upenn.edu/doc/help/languages/greek/]