Skip to content

Releases: gcelano/LemmatizedAncientGreekXML

Lemmatized Ancient Greek Texts (v1.2.5)

28 Sep 11:09
Compare
Choose a tag to compare

Changes:

  • Lemmas for prepositions, particles, and a few clear mistakes concerning article lemmas have been corrected. This has increased dramatically the number of the lemmas available in the corpus: 21493806 lemmas against 25522507 tokens.

LemmatizedAncientGreekXML (v1.2.4)

14 Sep 15:49
Compare
Choose a tag to compare

Changes:

  • The codepoint "’" is used as apostrophe or as quotation mark. Some known issues stemming from (wrong) Betacode conversion makes tokenization for this codepoint hard to handle. This version corrects some tokenization errors to move quotation mark "’" into a separate token

LemmatizedAncientGreekXML (v1.2.3)

01 Sep 13:58
Compare
Choose a tag to compare

Changes:

  • "’" position for elision has been corrected, i.e., put in the same element of
    the elided word (This error was due to the fact that the apostrophe has been
    encoded with different codepoints)

LemmatizedAncientGreekXML (v1.2.2)

01 Sep 12:19
Compare
Choose a tag to compare

Changes:

  • In tlg0018.tlg010.opp-grc1.xml and tlg0018.tlg015.opp-grc1.xml the erroneous
    ’Kv at the beginning of the first sentence has been corrected into Ἐν
  • In tlg0018.tlg019.opp-grc1.xml the erroneous ’Η at the beginning of the first
    sentence has been corrected into Ἡ
  • "’" position has been corrected, i.e., put at the end of a sentence
  • Duplicate l1 and l2 are deleted

LemmatizedAncientGreekXML (v1.2.1)

22 Aug 13:41
Compare
Choose a tag to compare

Improved documentation in README.md
lemma redundancy deleted

LemmatizedAncientGreekXML (v1.2.0)

08 Aug 12:59
Compare
Choose a tag to compare

LemmatizedAncientGreekXML (v1.0.1)

30 Jun 16:05
Compare
Choose a tag to compare

An xml version of the Morpheus database, which has been used to retrieve lemmas, has been added.

LemmatizedAncientGreekXML (v1.0.0)

29 Jun 11:23
Compare
Choose a tag to compare

This release contains the data as they have been automatically produced, without attempting manual corrections.