This document provides an overview of language-specific annotation conventions for Aramaic used in Oracc.
Aramaic is indicated by the language shift %arc
;
following this language shift, the ATF processor recognizes a special
set of conventions adapted to alphabetic scripts, and a special
transliteration system particular to Aramaic. The transliteration
system is that of the Comprehensive Aramaic Lexicon, and this document
has benefited from e-mail discussions with Steve Kaufman, Director of
the CAL.
) = aleph b = beth g = gimel d = daleth h = he w = waw z = zayin x = heth T = teth y = yodh k = kaph l = lamedh m = mem n = nun s = samekh ( = ayin p = pe c = tsade q = qoph r = resh $ = shin & = sin t = taw
This ASCII transliteration is converted on input to the following Unicode characters:
bgdhwzyklmnspqrt : themselves ) ʾ ( ʿ T ṭ c ṣ $ š x ḥ & ś
Words are separated by spaces.
Matching square brackets are used for broken material, a following
question mark for unclear letters and a following exclamation mark for
corrected letters. Multiple alternative possible readings are
separated by slashes (e.g., d/r
).
Within square brackets, broken segments of relatively certain length are indicated by a period for each missing letter.
Broken segments of uncertain length are indicated by underscore
characters, e.g., [...]br
[________________________]
indicates that in the first break 3
characters are missing, and in the second break a large but
indeterminate number of characters is missing.
Spaces should be used to terminate words in breaks; words may
contain dots and underscores indicating that they are partially
preserved, e.g., [_____ _]br[_ _____]
.
Steve Tinney
Steve Tinney, 'ARC: Oracc Linguistic Annotation for Aramaic', Oracc: The Open Richly Annotated Cuneiform Corpus, Oracc, 2019 [http://oracc.museum.upenn.edu/doc/help/languages/aramaic/]