The CBD format is used for glossaries and lexicons. The architecture is designed to support the inclusion of arbitrary items of information tailored to the needs of different languages or types of name books.
The CBD format is used for glossaries and lexicons, and this document describes version 2 of the CBD, a.k.a., G2. The architecture is designed to support the inclusion of arbitrary items of information tailored to the needs of different languages or types of name books.
CDF is an XML format for representing a variety of document types from text transliterations to books. The format combines narrative facilities with technical data types such as XTF data and corpus-based dictionaries.
ESP is a port of George MacKerron's Electrostatic website building software to Oracc. It is used for the portal components of all Oracc projects.
Grapheme Description Language for embedding in higher-order document types such as text editions and signlists. A formal definition with RNC schema is given interwoven with the ATF conventions for representing each element in the schema.
Sign list source files are maintained as a simple plain text file with @-commands in the manner of Oracc glossaries. This document gives an simple overview of the SL commands.
XCF is an XML vocabulary for setting program configuration values; one day there will be a more user-friendly ASCII front end.
XCL is the format used as a basis for linguistic annotation.
This document contains a preliminary description of the XFF data format.
XMD is a CDL subsystem which imports catalogs containing metadata concerning texts and compositions and creates an XML version of the data. Some validation of the catalog content is performed and sort codes are generated and added to some fields. XMD also encompasses a keyset-generator to create keys for indexing.
XSE is a schema for expressing the search expressions that are understood by the Oracc native search engine.
XML Syntax Trees provide a generic vocabulary for expressing syntactic data which is based closely on XCES. This document describes the CDL system-level support for XST, including the facilities in the ATF notation for expressing syntactic annotation. Such annotations may range from isolated hints to a parser which generates the bulk of syntax trees automatically to complete syntax trees interwoven with lemmatization in the linguistic annotation layer of ATF. The annotations themselves are language-specific and are described in separate documents.
XTC (besides being a tribute) is a very simple XML vocabulary which provides a list of texts on which downstream corpus processing can operate. The principle is that further linguistic service processing does not need to know anything system-specific, enabling the higher-order tools to operate on a variety of source corpora.
XTF is an XML format for describing the transliteration of cuneiform texts; it also encompasses facilities for other kinds of editions commonly used in cuneiform studies.
This document provides the RNG schema used by the XML version of translations and explains implementation details useful for programmers; if you are typing translations in ATF you don't need to read this page.
18 Dec 2019