Title extraction is cut short because of the | character #124

Open
pgshow opened this issue May 25, 2022 · 0 comments

pgshow commented May 25, 2022

Bug description

When extracting the title of this article: https://baijiahao.baidu.com/s?id=1733755048466991904
The original title is 操盘必读|央行、银保监会部署加大信贷投放;国泰君安拟协议受让华安基金8%股权;美股涨跌不一, but only the four characters “操盘必读” were returned.
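A possible explanation, offered only as an assumption since this report does not show GNE's internals: if the extractor splits the title text on separator characters such as `|` and keeps the first piece, a headline that legitimately contains `|` loses everything after it. The sketch below reproduces that behavior on the title from this report. `SEPARATORS`, `first_segment`, and `longest_segment` are hypothetical names for illustration, not GNE's API.

```python
import re

# Hypothetical separator pattern; the pattern GNE actually uses may differ.
SEPARATORS = re.compile(r"[-_|]")


def first_segment(title: str) -> str:
    """Naive strategy: keep only the text before the first separator."""
    return SEPARATORS.split(title)[0].strip()


def longest_segment(title: str) -> str:
    """Alternative heuristic: keep the longest separator-delimited piece."""
    parts = [part.strip() for part in SEPARATORS.split(title)]
    return max(parts, key=len)


title = "操盘必读|央行、银保监会部署加大信贷投放;国泰君安拟协议受让华安基金8%股权;美股涨跌不一"

print(first_segment(title))    # prints: 操盘必读
print(longest_segment(title))  # prints the part after the separator
```

Keeping the longest piece is only a heuristic. It would pick the wrong piece for a page whose site name is longer than its headline, so comparing the candidates against the page's `h1` text may be a safer choice.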

How to reproduce

  1. Target URL: https://baijiahao.baidu.com/s?id=1733755048466991904
  2. How you called GNE:

gne_info = extractor.extract(response.text)
title = gne_info['title']

Environment:

  • OS: [e.g. Ubuntu 19.04/Windows 10/macOS]
  • Python version: [e.g. 3.8]
@pgshow pgshow added the bug Something isn't working label May 25, 2022