Releases: mideind/GreynirEngine
Releases · mideind/GreynirEngine
Version 2.6.1
- Old test corpora deleted
- README updated
- S inside CP-REL fixed
Version 2.6.0
- Switched from GNU GPLv3 to MIT license, in accordance with the requirements of the Icelandic Government's 5-Year Language Technology Programme
- Python 3.6 is now a requirement; Python 3.5 is no longer supported
- Renamed repository from
ReynirPackage
toGreynirPackage
- Many enhancements to the context-free grammar
- Type annotations added
- Added circuit breaker to abandon the parsing of very complex and time-consuming sentences, via a
ParseError
exception - Documentation additions and formatting improvements
- Added
force_number
parameter toNounPhrase
class - Added
error
attribute to the_Sentence
class - Many other smaller enhancements
Version 2.5.0
- Lots of type annotations added
- Adapted to Tokenizer 2.1.0 which no longer splits a final period off an abbreviation that ends a sentence
- Significant additions to vocabulary; minor additions to grammar
- Added ability to dump and load
_Sentence
instances, including tokens, to and from JSON
Version 2.4.0
- Added test for foreign sentences, i.e. sentences where less than 60% of the words are found in BÍN (or resolved by the composite word algorithm). Such sentences are now marked by an error code instead of being parsed.
- More flexibility in specifying the maximum length of sentences to be parsed. The (overridable) default is now a maximum of 90 tokens in a sentence.
- Added and enhanced functions for generating lemmas from parsed sentences. New properties include
_Sentence.lemmas_and_cats
and_Sentence.categories
, and the methodGreynir.parse_tokens()
has been added. - Grammar and vocabulary improvements.
- Better handling of abbreviations and multi-word prepositions.
- Improved type annotations in the main Greynir API.
- Added tests.
- New features documented and examples added.
Version 2.3.1
- Added type annotations to ReynirPackage's public API; now compatible with the
mypy
static type checker - Added
BIN_Db.lookup_lemma()
method inbindb.py
- Enhanced pattern matching functionality in
matcher.py
andsimpletree.py
- Various grammar and vocabulary additions
Version 2.2.0
- Added
progress_func
parameter toGreynir.submit()
andGreynir.parse()
functions, for progress update callbacks while a parse is ongoing - Performance regression under PyPy 7.3 fixed by refactoring token/terminal matching code in
binparser.py
. - Split
matcher.py
and movedSimpleTree
class tosimpletree.py
. - Allowed category of word forms denoting prepositions to be restricted via
Phrases.conf
. - Switched to new-style CFFI callbacks from C++ code to Python for better performance and reliability
Version 2.1.0
- Added
NounPhrase
class, allowing noun phrases to be easily encapsulated and inflected in various forms - Additions to vocabulary in
ord.compressed
- Updated documentation and README
- Updated copyright notices
Version 2.0.1
- Changed name of main class to Greynir (with Reynir still being available as an alias)
- Updated documentation to use Greynir name instead of Reynir
- Branding and styling update of Sphinx-based documentation, added
custom.css
style sheet - No change in functionality
Version 2.0.0
- Adaptation to Tokenizer 2.0.0
- Better handling of hyphenated words, including þingmenn og -konur, málfræði-greining, etc.
- Added terminal types for
USERNAME
,SERIALNUMBER
andMOLECULE
tokens - Various bug fixes and additions to test suite
Version 1.9.0
- Many improvements to grammar and vocabulary
- The gender neutral pronoun hán is now supported in the grammar
- Parse trees are now annotated with accumulative node scores
sequence
terminals added, matching any of 1,2,3..., a,b,c..., i,ii,iii...