Add structure detection of news objects #17

Open
ivbeg opened this issue Aug 16, 2022 · 0 comments
Labels: breaking change (Critical change that could affect all product), enhancement (New feature or request)

Comments

ivbeg (Owner) commented Aug 16, 2022

Add structure detection and XPath reconstruction.
Instead of dynamic news detection, build pseudo-code that extracts news from the page.

It should implement analysis logic that detects:

  • the news list block container
  • the type of news list: sub-blocks or mixed list
  • the headline tag
  • the text tag or tag block
  • the date tag, if it exists
  • links, if they exist
  • images, if they exist

Is anything missing?
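
To make the idea concrete, here is a rough sketch of what the detected structure and the resulting extraction step might look like. Everything in it is an assumption for illustration: the `NewsStructure` fields, the `extract_news` function, and the sample markup are hypothetical names, not existing code in this project. It uses the standard library's `xml.etree.ElementTree`, which supports only a limited XPath subset and needs well-formed markup, so a real implementation would probably need a proper HTML parser with full XPath support. Only the "sub-blocks" list type is covered; the mixed list case is left out.

```python
from dataclasses import dataclass
from typing import Optional
import xml.etree.ElementTree as ET


@dataclass
class NewsStructure:
    """Hypothetical output of the analysis step: one path per detected part."""
    container: str               # news list block container
    list_type: str               # "sub-blocks" or "mixed"
    item: str                    # each sub-block, relative to the container
    headline: str                # headline tag, relative to the item
    text: str                    # text tag, relative to the item
    date: Optional[str] = None   # date tag, if it exists
    link: Optional[str] = None   # link tag, if it exists
    image: Optional[str] = None  # image tag, if it exists


def _text(node: ET.Element, path: Optional[str]) -> Optional[str]:
    """Return the stripped text of the first match, or None if absent."""
    found = node.find(path) if path else None
    if found is None:
        return None
    return "".join(found.itertext()).strip()


def _attr(node: ET.Element, path: Optional[str], name: str) -> Optional[str]:
    """Return an attribute of the first match, or None if absent."""
    found = node.find(path) if path else None
    return None if found is None else found.get(name)


def extract_news(root: ET.Element, s: NewsStructure) -> list:
    """Apply a detected structure to a parsed page and return news records."""
    if s.list_type != "sub-blocks":
        raise NotImplementedError("only the sub-blocks list type is sketched")
    container = root.find(s.container)
    if container is None:
        return []
    return [
        {
            "headline": _text(item, s.headline),
            "text": _text(item, s.text),
            "date": _text(item, s.date),
            "link": _attr(item, s.link, "href"),
            "image": _attr(item, s.image, "src"),
        }
        for item in container.findall(s.item)
    ]


# Made-up, well-formed sample page; the second item has no date or image.
SAMPLE = """
<html><body>
  <div class="news">
    <div class="item">
      <h2><a href="/n/1">First headline</a></h2>
      <span class="date">2022-08-16</span>
      <p>First text.</p>
      <img src="/img/1.png" />
    </div>
    <div class="item">
      <h2><a href="/n/2">Second headline</a></h2>
      <p>Second text.</p>
    </div>
  </div>
</body></html>
"""

STRUCTURE = NewsStructure(
    container=".//div[@class='news']",
    list_type="sub-blocks",
    item="./div[@class='item']",
    headline="./h2",
    text="./p",
    date="./span[@class='date']",
    link="./h2/a",
    image="./img",
)

news = extract_news(ET.fromstring(SAMPLE.strip()), STRUCTURE)
```

The point of the sketch is that the analysis runs once and produces a `NewsStructure`, and extraction afterwards is a plain lookup of those paths. Optional parts (date, link, image) come back as `None` when the tag is not present in an item.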

ivbeg added the enhancement (New feature or request) and breaking change (Critical change that could affect all product) labels Aug 16, 2022
ivbeg self-assigned this Aug 16, 2022