GRC: Oracc Linguistic Annotation for Greek

This document provides an overview of language-specific annotation conventions for Greek used in Oracc.

Transliteration

Greek is indicated by the language shift %grc.

Greek may be entered in Unicode without any special indication to the ATF processor; we define an ASCII transliteration as well since it may be easier for some users to employ.

alpha	α	a
beta	β	b
gamma	γ	g
delta	δ	d
epsilon	ε	e
zeta	ζ	z
eta	η	e'
theta	θ	th
iota	ι	i
kappa	κ	k
lambda	λ	l
mu	μ	m
nu	ν	n
xi	ξ	x
omicron	ο	o
pi	π	p
rho	ρ	r
sigma	ς, σ	s
tau	τ	t
upsilon	υ	u
phi	φ	ph
chi	χ	ch
psi	ψ	p'
omega	ω	o'

Words are separated by spaces.

Editing Conventions

Matching square brackets are used for broken material, a following question mark for unclear letters and a following exclamation mark for corrected letters. Multiple alternative possible readings are separated by slashes (e.g., d/r).

Within square brackets, broken segments of relatively certain length are indicated by a period for each missing letter.

Broken segments of uncertain length are indicated by underscore characters, e.g., [...]br [________________________] indicates that in the first break 3 characters are missing, and in the second break more a large but indeterminate number of characters is missing.

Spaces should be used to terminate words in breaks; words may contain dots and underscores indicating that they are partially preserved, e.g., [_____ _]br[_ _____].

18 Dec 2019 osc at oracc dot org

Steve Tinney

Steve Tinney, 'GRC: Oracc Linguistic Annotation for Greek', Oracc: The Open Richly Annotated Cuneiform Corpus, Oracc, 2019 [http://oracc.museum.upenn.edu/doc/help/languages/greek/]