This page offers an overview of the protocols which may be used in ATF documents.
Protocols are statements which are interpreted or stored by the ATF processor but are not part of the text edition proper. Protocols are all named and may trigger special processing within the ATF processor.
Protocols are indicated in ATF by a line beginning with the hash
character (#
), a known protocol name and a colon character
(:
).
With the exception of #note:
, protocols must occur on
a single line; multiple protocols do not need blank lines between them
except for multiple #note:
protocols which behave like
comments.
Protocols are divided into four classes:
#basket:
may occur in this location.#atf:
, #bib:
, #link:
,
#note:
and #version:
may occur in this
location.#note:
may occur in
this location. Other protocols are not required before
#note:
, but if they are present they must precede
it.#bib:
, #lem:
, #note:
and
#var:
may occur in this location.#bib: MSL 14, 343 1. a #lem: a[water] #note: This can only occur after any protocols other than #note:.
Introduces directives to the ATF processor. Implemented directives are:
lang <LANG>
Sets the default language for the text; see under languages in the GDL tutorial for more information.
#atf: lang qpc
use <FEATURE>
The use
directive enables less commonly used ATF
features, which are turned off by default, to be turned on.
FEATURE
must be one of: alignment-groups; legacy; lexical; math;
mylines.
#atf: use lexical
This protocol is used only by the CDLI ATF repository system; the content is a token used internally by the repository and should not be changed by users.
Bibliography information my be included using this protocol and may then be included with locator strings to provide text and publication information, e.g.:
&P312111 = Some Lexical Text 1. a #bib: MSL 14, 33
It could be rendered as: (Some Lexical Text 1 [MSL 14,
33])
but is not currently implemented.
Gives lemmatization for the line before. The format is a list of
lemmata separated by semi-colon-space (;
) and the number
of lemmata must equal the number of words in the lemmatized line:
1. a i3-nag #lem: a[water]; naj[drink]
Introduces directives to the lemmatization subsystem. Implemented directives are:
sparse do <FIELDS>
<FIELDS>
must match field names used in the
document. See the lexical documentation for an example.Introduces directives to the linkage subsystem. Implemented directives are:
def <SYMBOL> = <ID> = <NAME>
Defines a SYMBOL to refer to the text indicated by ID and NAME; interlinear links can then refer to the text via the symbol (see the linkage documentation for more information and examples):
&P123321 = Some Exemplar #link: def A = Q000002 = Archaic Lu A 1. NAMESZDA >> A 1
Notes given using this protocol are included in the rendered ATF text.
Introduces directives to the syntax processing subsystem. Implemented directives are:
line_is_unit
#atf: use
lexical
is given.Informal annotation of text variants; see the composites documentation for a more structured implementation.
Provides a location in the ATF file for a version number or version control system string.
18 Dec 2019Steve Tinney
Steve Tinney, 'ATF Protocols', Oracc: The Open Richly Annotated Cuneiform Corpus, Oracc, 2019 [http://oracc.museum.upenn.edu/doc/help/editinginatf/protocols/]