Skip to content

Commit 11255bd

Browse files
authored
prepare version 1.9.0 (#580)
* prepare version 1.9.0 * fix history
1 parent 022ce28 commit 11255bd

File tree

3 files changed

+27
-4
lines changed

3 files changed

+27
-4
lines changed

HISTORY.md

Lines changed: 26 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,32 @@
11
## History / Changelog
22

33

4+
### 1.9.0
5+
6+
Extraction:
7+
- add markdown as explicit output (#550)
8+
- improve recall preset (#571)
9+
- speedup for readability-lxml (#547)
10+
- add global options object for extraction and use it in CLI (#552)
11+
- fix: better encoding detection (#548)
12+
- recall: fix for lists inside tables with @mikhainin (#534)
13+
- add symbol to preserve vertical spacing in Markdown (#499)
14+
- fix: table cell separators in non-XML output (#563)
15+
- slightly better accuracy and execution speed overall
16+
17+
Metadata:
18+
- add file creation date (date extraction, JSON & XML-TEI) (#561)
19+
- fix: empty content in meta tag by @felipehertzer (#545)
20+
21+
Maintenance:
22+
- restructure and simplify code (#543, #556)
23+
- CLI & downloads: revamp and use global options (#565)
24+
- eval: review code, add guidelines and small benchmark (#542)
25+
- fix: raise error if config file does not exist (#554)
26+
- deprecate `process_record()` (#549)
27+
- docs: convert readme to markdown and update info (#564, #578)
28+
29+
430
### 1.8.1
531

632
Maintenance:

setup.py

Lines changed: 0 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -19,9 +19,6 @@ def get_long_description():
1919
"Return the README"
2020
with open("README.md", "r", encoding="utf-8") as filehandle:
2121
long_description = filehandle.read()
22-
# long_description += "\n\n"
23-
# with open("CHANGELOG.md", encoding="utf8") as f:
24-
# long_description += f.read()
2522
return long_description
2623

2724

trafilatura/__init__.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@
99
__author__ = 'Adrien Barbaresi and contributors'
1010
__license__ = "Apache-2.0"
1111
__copyright__ = 'Copyright 2019-2024, Adrien Barbaresi'
12-
__version__ = '1.8.1'
12+
__version__ = '1.9.0'
1313

1414

1515
import logging

0 commit comments

Comments
 (0)