Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

python2and3developer/save_webpage

Folders and files

NameName
Last commit message
Last commit date

Latest commit

History

6 Commits

Repository files navigation

save_webpage

usage: save_webpage.py [-h] [-o OUTPUT] [--version] [-q] [--insecure]
 [--forbidden-urls FORBIDDEN_URLS [FORBIDDEN_URLS ...]]
 [--follow-links] [-b BASE_URL]
 [--index-html INDEX_HTML]
 [--mode {relative,absolute,nochange}]
 [--config PATH_TO_CONFIG_FILE]
 list_of_seed_urls [list_of_seed_urls ...]
Save_Webpage Save webpages and all its resources. Apply search and replace of
matched strings.
positional arguments:
 list_of_seed_urls Seed urls
optional arguments:
 -h, --help show this help message and exit
 -o OUTPUT, --output OUTPUT
 Output directory
 --version show program's version number and exit
 -q, --quite don't show verbose url get log in stderr
 --insecure Ignore the certificate
 --forbidden-urls FORBIDDEN_URLS [FORBIDDEN_URLS ...]
 Forbidden urls
 --follow-links Follow links
 -b BASE_URL, --base-url BASE_URL
 Resolves relative links using URL as the point of
 reference
 --index-html INDEX_HTML
 Default index file
 --mode {relative,absolute,nochange}
 Mode of extraction
 --config PATH_TO_CONFIG_FILE
 Path to configuration file
save_webpage.py: Takes a list of url's and download the websites with all its
internal resource files. Transforms all internal resources so that they link
to local files. Process css files exctracting new resource and converting
url's. Possibility to replace javascript and html files using custom
substitutions. Full Unicode/UTF-8 support.

Command-line usage:

Examples:

$ python save_webpage.py -h
 you are reading this help message
$ python save_webpage.py http://www.google.com
 save google url page for offline reading, keep style untainted
 the website and all its resource are saved in the 'output' folder
$ python save_webpage.py http://gabrielecirulli.github.io/2048/ --output game
 save dynamic page with Javascript example
 the 2048 game can be played offline after being saved
 the website and all its resource are saved in the 'game' folder

About

Download a webpage and all the external resources.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

Languages

AltStyle によって変換されたページ (->オリジナル) /