Skip to content

Releases: mideind/GreynirEngine

Version 2.6.1

30 Sep 14:07
Compare
Choose a tag to compare
  • Old test corpora deleted
  • README updated
  • S inside CP-REL fixed

Version 2.6.0

23 Sep 15:53
Compare
Choose a tag to compare
  • Switched from GNU GPLv3 to MIT license, in accordance with the requirements of the Icelandic Government's 5-Year Language Technology Programme
  • Python 3.6 is now a requirement; Python 3.5 is no longer supported
  • Renamed repository from ReynirPackage to GreynirPackage
  • Many enhancements to the context-free grammar
  • Type annotations added
  • Added circuit breaker to abandon the parsing of very complex and time-consuming sentences, via a ParseError exception
  • Documentation additions and formatting improvements
  • Added force_number parameter to NounPhrase class
  • Added error attribute to the _Sentence class
  • Many other smaller enhancements

Version 2.5.0

06 Jul 17:52
Compare
Choose a tag to compare
  • Lots of type annotations added
  • Adapted to Tokenizer 2.1.0 which no longer splits a final period off an abbreviation that ends a sentence
  • Significant additions to vocabulary; minor additions to grammar
  • Added ability to dump and load _Sentence instances, including tokens, to and from JSON

Version 2.4.0

26 Jun 16:52
Compare
Choose a tag to compare
  • Added test for foreign sentences, i.e. sentences where less than 60% of the words are found in BÍN (or resolved by the composite word algorithm). Such sentences are now marked by an error code instead of being parsed.
  • More flexibility in specifying the maximum length of sentences to be parsed. The (overridable) default is now a maximum of 90 tokens in a sentence.
  • Added and enhanced functions for generating lemmas from parsed sentences. New properties include _Sentence.lemmas_and_cats and _Sentence.categories, and the method Greynir.parse_tokens() has been added.
  • Grammar and vocabulary improvements.
  • Better handling of abbreviations and multi-word prepositions.
  • Improved type annotations in the main Greynir API.
  • Added tests.
  • New features documented and examples added.

Version 2.3.1

20 May 12:36
Compare
Choose a tag to compare
  • Added type annotations to ReynirPackage's public API; now compatible with the mypy static type checker
  • Added BIN_Db.lookup_lemma() method in bindb.py
  • Enhanced pattern matching functionality in matcher.py and simpletree.py
  • Various grammar and vocabulary additions

Version 2.2.0

07 Apr 18:27
Compare
Choose a tag to compare
  • Added progress_func parameter to Greynir.submit() and Greynir.parse() functions, for progress update callbacks while a parse is ongoing
  • Performance regression under PyPy 7.3 fixed by refactoring token/terminal matching code in binparser.py.
  • Split matcher.py and moved SimpleTree class to simpletree.py.
  • Allowed category of word forms denoting prepositions to be restricted via Phrases.conf.
  • Switched to new-style CFFI callbacks from C++ code to Python for better performance and reliability

Version 2.1.0

20 Feb 16:06
Compare
Choose a tag to compare
  • Added NounPhrase class, allowing noun phrases to be easily encapsulated and inflected in various forms
  • Additions to vocabulary in ord.compressed
  • Updated documentation and README
  • Updated copyright notices

Version 2.0.1

06 Dec 16:22
Compare
Choose a tag to compare
  • Changed name of main class to Greynir (with Reynir still being available as an alias)
  • Updated documentation to use Greynir name instead of Reynir
  • Branding and styling update of Sphinx-based documentation, added custom.css style sheet
  • No change in functionality

Version 2.0.0

04 Dec 17:30
Compare
Choose a tag to compare
  • Adaptation to Tokenizer 2.0.0
  • Better handling of hyphenated words, including þingmenn og -konur, málfræði-greining, etc.
  • Added terminal types for USERNAME, SERIALNUMBER and MOLECULE tokens
  • Various bug fixes and additions to test suite

Version 1.9.0

18 Oct 16:21
Compare
Choose a tag to compare
  • Many improvements to grammar and vocabulary
  • The gender neutral pronoun hán is now supported in the grammar
  • Parse trees are now annotated with accumulative node scores
  • sequence terminals added, matching any of 1,2,3..., a,b,c..., i,ii,iii...