Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Python package to parse news from various news website

License

Notifications You must be signed in to change notification settings

hee0624/extract_news

Folders and files

NameName
Last commit message
Last commit date

Latest commit

History

9 Commits

Repository files navigation

extract_news

Getting started

新闻图片

Requirements

You need to have a 3.6 version of Python installed. See the page for more information.

Installation

  • python setup.py install
  1. online extract
import extract_news
res = extract_news.online_parse(url)
# 获取新闻正文
news_content = res.get('news_content')
# 获取新闻发布时间
news_pubtime = res.get('news_pubtime')
# 获取新闻标题
news_title = res.get('news_title')
  1. offline extract
# html为新闻网页字符串
res = extract_news.offline_parse(html)

Questions, comments?

Send them to He Chen.

About

Python package to parse news from various news website

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

AltStyle によって変換されたページ (->オリジナル) /