Goose3

A Python 3 compatible version of Goose. Documentation: http://goose3.readthedocs.io/en/latest/index.html

Popularity: 4.4 (Stable)
Activity: 6.3 (Growing)

Description

Goose was originally an article extractor written in Java that was most recently (Aug 2011) converted to a Scala project.

This is a complete rewrite in Python. The aim of the software is to take any news article or article-type web page and extract not only the main body of the article but also all metadata and the most probable image candidate.
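To make the kind of output concrete, here is a minimal standard-library sketch of pulling a title, a meta description, an image candidate, and body text out of an HTML page. It is not Goose's algorithm and does not use the goose3 API: the class `ArticleSketch`, the function `extract_sketch`, the dictionary keys, and the sample page are all hypothetical names chosen for illustration, and treating `og:image` as the image candidate and every `<p>` as body text is a simplifying assumption.

```python
from html.parser import HTMLParser


class ArticleSketch(HTMLParser):
    """Collects a title, meta tags, and paragraph text (illustrative only)."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.meta = {}        # meta name/property -> content
        self.paragraphs = []  # accumulated text of each <p> element
        self._in_title = False
        self._in_p = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta":
            # Meta tags identify themselves by either name= or property=.
            key = attrs.get("name") or attrs.get("property")
            if key and attrs.get("content"):
                self.meta[key] = attrs["content"]
        elif tag == "p":
            self._in_p = True
            self.paragraphs.append("")

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False
        elif tag == "p":
            self._in_p = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data
        elif self._in_p:
            self.paragraphs[-1] += data


def extract_sketch(html):
    """Return a dict with the fields an article extractor typically reports."""
    parser = ArticleSketch()
    parser.feed(html)
    parser.close()
    body = "\n\n".join(p.strip() for p in parser.paragraphs if p.strip())
    return {
        "title": parser.title.strip(),
        "meta_description": parser.meta.get("description", ""),
        "top_image": parser.meta.get("og:image", ""),  # assumption: og:image
        "cleaned_text": body,
    }


# Hypothetical sample page used only to exercise the sketch.
SAMPLE = """<html><head><title>Example story</title>
<meta name="description" content="A short summary.">
<meta property="og:image" content="https://example.com/lead.jpg">
</head><body><p>First paragraph.</p><p>Second paragraph.</p></body></html>"""

result = extract_sketch(SAMPLE)
print(result["title"])
print(result["cleaned_text"])
```

A real extractor has to do considerably more than this, for example separating article text from navigation and comments and ranking several candidate images, which is the part a library such as Goose3 takes care of.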

Goose will try to extract the following information:

Programming language: Python
License: Apache License 2.0
Latest version: v3.1.12

