This document provides an overview of language-specific annotation conventions for Greek used in Oracc.
Greek is indicated by the language shift %grc.
Greek may be entered in Unicode without any special indication to the ATF processor; we define an ASCII transliteration as well since it may be easier for some users to employ.
| alpha | α | a |
| beta | β | b |
| gamma | γ | g |
| delta | δ | d |
| epsilon | ε | e |
| zeta | ζ | z |
| eta | η | e' |
| theta | θ | th |
| iota | ι | i |
| kappa | κ | k |
| lambda | λ | l |
| mu | μ | m |
| nu | ν | n |
| xi | ξ | x |
| omicron | ο | o |
| pi | π | p |
| rho | ρ | r |
| sigma | ς, σ | s |
| tau | τ | t |
| upsilon | υ | u |
| phi | φ | ph |
| chi | χ | ch |
| psi | ψ | p' |
| omega | ω | o' |
Words are separated by spaces.
Matching square brackets are used for broken material, a following
question mark for unclear letters and a following exclamation mark for
corrected letters. Multiple alternative possible readings are
separated by slashes (e.g., d/r).
Within square brackets, broken segments of relatively certain length are indicated by a period for each missing letter.
Broken segments of uncertain length are indicated by underscore
characters, e.g., [...]br
[________________________] indicates that in the first break 3
characters are missing, and in the second break more a large but
indeterminate number of characters is missing.
Spaces should be used to terminate words in breaks; words may
contain dots and underscores indicating that they are partially
preserved, e.g., [_____ _]br[_ _____].
Steve Tinney
Steve Tinney, 'GRC: Oracc Linguistic Annotation for Greek', Oracc: The Open Richly Annotated Cuneiform Corpus, Oracc, 2019 [http://oracc.museum.upenn.edu/doc/help/languages/greek/]