History
A cumulative list of changes made to words and word IDs is available on the OID History page [/epsd2/oid-history.html]
2.7.2, 2024-08-31
ePSD2 subprojects cleaned up to remove validation and lemmatization
errors.
2.7.1, 2024-07-31
ePSD 2.7.1 is the first in a series of incremental updates to ePSD
2.7 which will lead to a 2.8 release in December 2024. Since the 2.7
release in December 2022, Oracc has had upgrades to grapheme
validation (GDL/GVL) and the pager (P4) and these upgrades require
many small technical changes to ePSD2. The following sequence and
timeline is planned for 2024:
- 2.7.1 [July]: update all glossaries for ePSD2 and related
projects to pass GDL/GVL validation; rebuild ePSD2 with essential
functionality to work with P4 on oracc.museum.upenn.edu.
- 2.7.2 [August]: update all corpora for ePSD2 and related
projects to pass GDL/GVL validation; in 2.7.1 some texts will fail
to build properly because of grapheme validation errors. All such
errors will be eliminated for 2.7.2.
- 2.7.3 [September]: align all ePSD2 subprojects and ePSD2-related
projects with main ePSD2 glossaries. before 2.7.3 some references
will fail to be included in the main ePSD2 dataset because of
alignment issues. All alignment issues will be resolved for
2.7.3.
- 2.7.4 [October]: apply all queued fixes to the main ePSD2
glossaries and update the alignment of all corpora and
glossaries. Ensure ePSD catalogue and other additional facets of
ePSD such as issl build correctly.
- 2.7.5 [September-November]: induct additional available corpus
material into ePSD2, e.g., additions from CDLI and other
sources.
- 2.8 [December]: full new release of ePSD2 planned for 2024-12-21.
2.7, 2022-12-21
- 153 new words (many from admin/oakk); 179 changes to entries in total.
- Review of admin/oakk (Old Akkadian) completed (Philip Jones)
- Integration of eISL: A Corpus of First Millennium Emesal Liturgies [/eisl]
- Additional OB Emesal via /obel [/obel]
- Sumerian from RIBO inducted into ePSD
- Further sallies in the battle against emesal words tagged as emegir and vice versa
- Updates to epsd2/catalogue
- Improvements to alignment validation
- Bug fixes affecting phrases and lexical data
2.6, 2022-06-21
- 142 changes to entries including 72 new words
- Reworked epsd2/catalogue [/epsd2/catalogue] now tracks almost 155,000 Sumerian or bilingual texts
- New alignment validation implemented to eliminate mismatches
between subprojects/partner projects and main ePSD glossaries
- Two Women B added to DSSt [/dsst] (courtesy Jana Matuszak)
- Several thousand admin texts harvested from CDLI, lemmatized, and reviewed (Veldhuis/Jones/Tinney)
- admin/names Ur III normalizations (Niek Veldhuis)
- admin/oakk girsu and adab review completed (Philip Jones)
- admin/ebla removed
- Yet more separation of Emegir and Emesal forms; additions to Ershahungas via BLMS
- Inclusion of OB liturgical texts from http://oracc.org/obel [http://oracc.org/obel]; most of these are much improved
revisions of texts that were formerly in epsd2/praxis/liturgy
2.5, 2021-12-21
- 76 new words
- Review of admin/ed12 Nisaba 25 texts
- Review of admin/ed3a completed
- Review of admin/ed3b completed (Philip Jones)
- Additions to literary, admin/ur3, admin/oldbab (Niek Veldhuis)
- Improved separation of Emegir and Emesal forms
2.4, 2021-06-21
- Literary Disputes and Dialogs are now indexed from DSSt [http://oracc.org/dsst]
- Most ePSD data now downloadable as JSON--see the ePSD2 JSON page [/epsd2/json]
- Over 5000 texts (mostly Ur III) imported from BDTNS [http://bdtns.filol.csic.es/] (Niek Veldhuis)
- All 3rd Millennium/OB god lists now included
- Additions to admin/ed3a
- Further review of admin/ed3b (Philip Jones)
- ongoing additions and improvements to admin/names and admin subcorpora
2.3, 2021-03-21
- fixes and improvements to search including c: and b: prefixes, and functioning search by SIGN
- integrate searching page into main ePSD portal
- additions to earlylit, literary (Niek Veldhuis)
- first stage of review of admin/ed3a [EDATS]
- ed3b review complete and most common unlemmatized forms resolved (Philip Jones)
- additions to admin/ur3 (Niek Veldhuis)
- add about 600 royal inscriptions to epsd2/royal based on ETCSRI and CDLI
- ongoing additions and improvements to admin/names and admin subcorpora
2.2, 2020-12-21
- epsd2/earlylit lemmatised and augmented, additions to epsd2/literary (Niek Veldhuis)
- epsd2/admin/names redone with more names normalized, especially Drehem names by John Carnahan
- review of unlemmatized forms in admin subcorpora to lemmatize several thousand additional instances
- over 10000 additional proper noun forms added to epsd2/admin/names
- review of epsd2/admin/lagash2 (Niek Veldhuis)
- admin corpora updated to include additional texts from CDLI (Niek Veldhuis)
2.1, 2020-06-21
For more details on updates in 2.1 see the build-by-build descriptions below. Highlights include:
- over 10,000 more lemmatized instances
- major alignment improvements between subprojects and main glossary fixing tens of thousands of phrasal instances
- greater consistency between ePSD2 and OGSL [http://oracc.org/ogsl]
- Edubba R added to literary (Niek Veldhuis)
- admin/names glossary and proper noun colourization in admin subcorpora
- bases and senses sorted by frequency (most common at top)
2.1 RC3, 2020-06-12
- new edition of Edubba R in literary (Niek Veldhuis)
- do initial lemmatization of proper nouns in epsd2/praxis/incantations resulting in about 1,100 additional lemmatized instances
- do initial lemmatization of proper nouns in epsd2/royal resulting in about 6,000 additional lemmatized instances
- import new data from dcclt and etcsri proper noun glossaries
- add just over 600 forms to ur3/sux.glo resulting in about 4000 additional lemmatized instances
- Resolve many duplicates in epsd2/names
- fix a few score semantic/phonetic determatives
- all admin corpora now do colourization of proper nouns
- fix cuneiform display bug which affected qualified readings--just need to cuneify the qualifiers
- Fix blank forms in, e.g., http://oracc.museum.upenn.edu/epsd2/xff/o0027922 (John Halloran report)
2.1 RC2, 2020-06-04
- Some duplicates removed from main glossary
- Audit bases with qualifiers like kinₓ(|ŠE.KIN|) and enforce consistency with OGSL, adding/modifying base values as necessary
- Ongoing updates to admin by Veldhuis and Jones, and to literary by Veldhuis
2.1 RC1, 2020-05-13
- fix bugs importing PSUs from subprojects into epsd2 main listing
- fix bugs importing legitimate but partially matching subproject entries into epsd2 main listing
- Sort senses by original order in glossaries instead of sorting them alphabetically
- Sort bases by frequency--most frequent at top
- Experimental admin/names glossary included but destined for major revision in future version
2.0.1 RC1 (experimental), 2020-02-11
- Experimental release to test the new admin/names glossary and NN tagging in admin/*
2.0, 2020-01-29
- align all glossaries with main epsd2
- remove spurious translations, re-align everything until there are no mis-alignments anywhere
- improve content and formatting of OID footer
2.0 RC2, 2020-01-27
- clean up glossaries and get all projects lemmatizing cleanly against ePSD2
2.0 RC1, 2020-01-24
- implement simple merge for epsd2/names to list, e.g., Enkik and Enki together in epsd2
- tweak location of OIDs
- no longer translate J to Ŋ in search keys (general Oracc change, not only epsd2)
- compound xrefs should now be links
2.0 RC1, 2020-01-23
- Begin pre-builds for ePSD 2.0
- Initial mapping experiment in epsd2/names
- try to restore compound x-refs again
- Test inclusion of persistent ID in summary
Beta 7, 2020-01-22
- base audit of main epsd2 glossary to remove almost 400 of the most egregious base issues
- restore cross-references from compound constituents to compound verbs
- fix XFF (broken in last few RCs)
- near-final fixes to ur3 and ed3b before public 2.0
Beta 7 RC5, 2020-01-20
- widespread X-form fixing in admin corpora
Beta 7 RC4, 2020-01-16
- Over 13,000 additional proper nouns added to ur3/names
- Various lemmatization improvements to numbers and words in ur3
- Ongoing Phil Jones work on ed3b
Beta 7 RC3, 2020-01-13
- admin/u3adm renamed admin/ur3
- admin/ur3leg and admin/ur3let moved into admin/ur3 and removed as separate projects
- year names now tagged in ur3 corpus, something over 55,000 of them
- Tinney/Veldhuis work on disambiguation in admin ur3.atf
- Ongoing updates to ed3b by Jones
Beta 7 RC2, 2020-01-03
- Niek Veldhuis's review of Ur III admin glossary and corpus is complete!
Beta 7 RC1, 2019-11-26
- Beginning of the sequence that will result in ePSD 2.0--B7 will be the last Beta
- All epsd2 subprojects and related projects are now aligned with the main glossary
- Ongoing work since B6 on Ur III, ED IIIb and elsewhere
Beta 6, 2019-09-04
- UET 6 integrated into epsd2/literary; this is a revised version
of the pioneering work done by Jeremie Peterson for the Ur Online
Project
- Sample corpus of Old Babylonian liturgical texts now included in epsd2/praxis/liturgy; needs further work
- Over a dozen new literary and liturgical texts added by Niek Veldhuis
- Ongoing updates to admin by Veldhuis and Philip Jones
- Lexical Akkadian information now inducted from DCCLT
- Pronunciation glosses in bases now treated much more consistently
- Over 100 duplicates removed from main glossary
- Oracc IDs used internally by ePSD2 in preparation and infrastructure improvements to support change-tracking (not yet in use in this Beta
- Articles now have functional implementation of links in left outline pane to support jumping around in longer articles
Beta 6 RC8, 2019-08-29
- improved build procedure for epsd2/admin/* subprojects creates 00lib/approved.lst from source ATF before building project
- epsd2/literary/uet6 work ongoing
- epsd2/praxis/liturgy much improved but still in need of review
- ongoing updates to admin and literary
Beta 6 RC7, 2019-07-26
- Sub-corpora laid out as a table on home page by Niek Veldhuis
- epsd2/praxis/liturgy wrangled into initial usable state by Steve Tinney
Beta 6 RC6, 2019-07-25
- First cut at including Akkadian information from DCCLT
- Ongoing updates to ed3b and u3adm and some additional literary by Niek Veldhuis
Beta 6 RC5, 2019-07-22
- Fix build of u3adm which was broken in B6 RC4
- Ongoing updates to ed3b and u3adm
Beta 6 RC4, 2019-07-05
- Fix "Malformed OID" bug in sub-project glossaries
- Eliminate over 100 duplicates in main glossary and create several additional emesal forms in epsd2/emesal
- Improve preferred-bases (not yet displayed but used for developmental print version)
- Ongoing updates to ed3b and u3adm
Beta 6 RC3, 2019-06-15
- Fix about 300 missing + signs from phonetic glosses in bases so they make their way into the Pronunciation Data
- Fix a build bug which broke OIDs in epsd2 subprojects
- Fill in BUILD info in home.xml from news.xml so they won't accidentally out of sync any more
- Ongoing updates in glossary and admin corpora
Beta 6 RC2, 2019-06-10
- Ongoing updates in glossary and admin corpora
- Build subsidiary glossaries in epsd2/names
- Have another go at article outline jump links
Beta 6 RC1, 2019-06-04
- The content for this release is the same as Beta 5; the difference is that internally OIDs are now used for the dictionary articles which means that links like http://build-oracc.museum.upenn.edu/epsd2/o0023086 work. Don't tell anyone, though, because the OIDs are not stable yet, so they shouldn't be used outside of the ePSD2 universe.
Beta 5, 2019-05-20
- All admin texts now collected into a new umbrella corpus and glossary [/epsd2/admin]
- Initial implementation of epsd2 catalogue
- Various improvements to Pronunciation Data section
- Transliteration index in epsd1-style matrix now indexes bases
rather than forms and renamed to 'Bases' with 'B' in banner
line
- Improvements to Signs index in matrix
- list of homographs added to matrix
- Persistent Oracc IDs (OIDs) closer to being usable
- ongoing improvements to admin corpora by Niek Veldhuis, Dan Patterson and Phil Jones
- Other fixes as listed under Release Candidates below
Beta 5 RC8, 2019-05-19
- Fix phrases in forms tables
- Fix phrasal forms so continuations work properly among other things
Beta 5 RC7, 2019-05-18
- Fix a ru in epsd2/royal; needs review in other projects
- Fix rendering bug cuneiform for numbers with qualifiers (e.g., those with ₓ-index)
- Compute "preferred base" for each headword using stats on occurrences of bases and their alignment with the shape of the Citation Form
- Reference bases for compound verbs are now generated from preferred bases
- Even more work on getting Pronunciation Data right
Beta 5 RC6, 2019-05-16
- More work on getting Pronunciation Data right
- Minor formatting improvements in Pronunciation Data
Beta 5 RC5, 2019-05-15
- Redo implementation of xff and fix sort
- Restrict Pronunciation data to non-implied instances with a pronunciation column
- fix bug in generation of signnames-homographs list in epsd1-style matrix
Beta 5 RC4, 2019-05-14
- Bases with phonetic determinatives and continuation graphemes
now added to Pronunciation Data section of articles
- Transliteration index in epsd1-style matrix now indexes bases
rather than forms and renamed to 'Bases' with 'B' in banner
line
- Improvements to Signs index in matrix
- list of homographs added to matrix
- remove head tag from bases index which had caused spurious additional bases to appear
- fix epsd2 signlist to remove spurious items from "Independent"
view, improve categorization of initial/medial/final, and fix sort
order
Beta 5 RC3, 2019-05-12
- Complete review/fix of sign-name fields in lexical texts means
many more Pronunciation Data refs
- Oracc IDs (OIDs) now found for all sign names in lex sign names field
- epsd2/names and epsd2/emesal now included in Oracc IDs (OIDs)
- ongoing improvements to admin corpora by Niek Veldhuis, Dan Patterson and Phil Jones
Beta 5 RC2, 2019-04-28
Beta 5 RC1, 2019-04-22
- Bug fixes to import of @parts and @forms so more data is inducted from the 15 subprojects
- fix proxy sig bug which was dropping many sigs esp from epsd2/royal in Beta 4
- Ongoing updates to admin corpora and main glossary
- Initial implementation of epsd2 catalogue
Beta 4, 2019-02-20
- ePSD now has a signlist with references to dictionary articles
- ePSD1-style browsable views of data are now available
- the Index to the Sumerian Secondary Literature is now integrated into ePSD2
- for verbs that have prefixes a clickable list of attested prefixes is given before senses in the article
- Switch to supergenre ELA/LIT/LEX/STL in catalogue--this will be documented in portal pages in a future Beta but for now:
- ELA = Economic, Legal and Administrative
- LIT = Literary, including royal inscriptions
- LEX = Lexical
- STL = Scientific and Technical Literature -- magic, medicine, etc.
- Infrastructure improvements for more robust build and install
Beta 4 RC6, 2019-02-17
- ePSD2 signlist functional (and experimentally using persistent IDs)
- epsd2/issl now works
- Ongoing updates to lit, admin corpora and main glossary
Beta 4 RC5, 2019-02-12
- ePSD2 signlist debut
- Switch to ISSL that is integrated with ePSD2
- Ongoing updates to lit, admin corpora and main glossary
Beta 4 RC4, 2019-02-02
- Work on prefixes; also, epsd2 broke libxml2 again, so new strategy for IDs
- Ongoing updates to lit, admin corpora and main glossary
Beta 4 RC3, 2019-02-02
- Verbs now have prefixes given in article
- Ongoing updates to lit, admin corpora and main glossary
Beta 4 RC2, 2019-01-16
- First cut at ePSD2 version of ePSD1 TOC matrix
- Additions to epsd2/literary corpus by Niek Veldhuis
- Ongoing updates to admin corpora and main glossary
Beta 4 RC1, 2019-01-12
Ongoing updates to admin corpora and main glossary
Beta 3, 2018-10-23
See news under Beta 3 Release Candidates for the most important additions and improvements. The most significant thing is the alignment of epsd2/royal, epsd2/praxis and blms glossaries which has greatly reduced the number of duplicate entries in ePSD2.
Beta 3 RC6
- blms and praxis/incantations fully aligned with epsd2 main (emesal and names not quite)
- ongoing improvements in admin/u3adm, admin/lagash2, admin/ed3b
Beta 3 RC5
- credits page now has links thanks to Niek Veldhuis
- epsd2/royal fully aligned with epsd2
- epsd2/admin/u3leg and epsd2/admin/u3let fully aligned with epsd2
Beta 3 RC4
- fix bug in collo matching POS-only forms
- fix bug in lem-simplify which caused u3adm lem errors
Beta 3 RC3
- support n in @collo
- fix bug in lemmatizer RANK support
Beta 3 RC2
- augment admin/lagash2 glossary from admin/u3adm
Beta 3, Release Candidate 1
- full build functional on new dev machine
- fix NULL project in signatures
- implement ngrammer predicate <!vpr> for e2 du2-a in epsd2
- fix lemm-data generation to ensure frequencies are included and integrate into build
- more improvements on lex data aesthetics--strip POS in phrase titles
- BLMS lemmatizes cleanly against epsd2
- document @collo and ngrammer in main Oracc doco
- create new help files for people working on admin texts
Beta 2, 2018-09-07
Major improvements include the inclusion of lexical data in the
body of articles (see, e.g., a[arm], bappir[~beer]); marking of Emesal
words by (ES); the ability to search the ePSD2 lemmatized corpus
(search starting from /epsd2/pager); and bug-fixes for hiliting target
words when clicking through to instances list and on into the
reference-in-context, and for arriving at the wrong word when clicking
on a summary after searching the glossary.
- lex data now uses formatted transliteration in refs (e.g., ab[window]
Beta 2 RC4
- home page now uses relative URLs so different installations link to the proper resources
- (ES) now omitted from header and just put in summary
- fix bug that resulted in some phrases instances not propagating from sub-projects to espd2
- /epsd2/pager now supports searching the lemmatized texts
- clicking from instance list to instance-in-context now hilites word in context
Beta 2 RC3
- fixes the most serious interface bug, which resulted in the wrong entries being displayed after a search in some circumstances
- forms table support is improved
- instances from proper nouns are now included in top-level epsd2
- Emesal words and forms are now indicated by (ES).
Beta 2 RC2
- improved support for the inclusion of lexical data
Beta 2 RC1
- First version of inclusion of lexical data in body of articles