Oracc builder Documentation

First Steps

CDLI ATF Primer
If you are preparing texts for insertion directly into the CDLI repository you should read this document first.
ATF Primer
A very quick start to get you up and running with your first ATF texts by giving some annotated examples.
ATF Quick Reference
A quick reference sheet for the ATF format.
Akkadian Stylesheet
There are many valid ways to transliterate Akkadian in ATF, but the more similarly people transliterate texts for Oracc, the better the corpora will work together. This document sets out Oracc's recommended transliteration conventions for Akkadian.

Next Steps

ATF Structure Tutorial
This document gives a tutorial on how to type the block structure features of texts in ATF, the ASCII Transliteration Format.
ATF Inline Tutorial
This document gives a tutorial on how to type graphemes and other inline content in ATF.
Working with ATF in Emacs using atf-mode
An Emacs major mode is available which makes editing ATF files a bit easier and gives access to the template generator and checker.
Linguistic Annotation
This document provides an overview of linguistic annotation facilities, such as lemmatization, used in Oracc.

Working with L2

L2: How To Use It
This document explains the steps required to use L2, the lemmatiser used by new-style Oracc projects, for those who have been using the old lemmatiser. First we describe what you need to know about editing ATF files, then glossary management, then rebuilding the whole project.
BFFs: Byforms
BFFs are the Oracc glossary mechanism for word byforms of various kinds. The general mechanism allows the individual byforms to be treated under their own entries within the glossary and during lemmatization, but then be kept separate or grouped together when the glossaries are rendered.
F2: Forms Parser
This page gives some help with fixing L2 errors with the (f2) label.
COFs: Compound Orthographic Forms
This page describes how L2 handles Compound Orthographic Forms, or written words which contain more than one lemma.
PSUs: Phrasal Semantic Units
This page describes how L2 handles Phrasal Semantic Units, or glossary entries which consist of more than one word.
Languages and dialects
This page describes how to define the default and alternative languages of a text, and how to mark language shifts within a text, for projects using the new L2 lemmatiser. Projects using the old lemmatiser should continue as before.
L2: Signature/Lemmatization Syntax
This document describes extant and planned elements of the syntax of signatures and the lemmatization specifications that use them.
L2: Language Tags
This document provides an overview of language tag definitions and recommendations used in Oracc.
L2: How It Works
This document provides an overview of how the new lemmatizer, L2, works to help Oracc builders understand what files are used for validation and how to control lemmatization.

More ATF Documentation