edwarts/s_crawler

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
testpages		testpages
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
crawl3.py		crawl3.py
data.json		data.json
get-pip.py		get-pip.py
mockserver.py		mockserver.py
pagecrawlbehaviour_test.py		pagecrawlbehaviour_test.py
requirements.txt		requirements.txt
unit_test.py		unit_test.py

Repository files navigation

A crawl by Python

Please use Python 3 to run this

How to install it

You can use pip to install the requirements as follows:

pip install -r requirements.txt

How to use it

You only need to type

python3 crawl3.py your-start-url timeout researchlevel

in your terminal. And then input the starturl ,timeout and researchlevel. The result is saved in the file named "data.json" in the same folder as well as STDOUT.

The research level can be set -1 if you want to explore the whole internet

You can run the test case by following the instruction:

1 python3 mockserver.py to run the mockserver for the test case
2 python3 -m unittest unit_test pagecrawlbehaviour_test

For fun to run on docker

docker build -t crawl
docker run crawl crawl3.py your-start-url timeout researchlevel

About

a simple crawler in python

Releases

No releases published

Packages

Contributors

Languages

Python 99.7%
Other 0.3%

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

edwarts/s_crawler

Folders and files

Latest commit

History

Repository files navigation

A crawl by Python

Please use Python 3 to run this

How to install it

How to use it

For fun to run on docker

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

A crawl by Python

Please use Python 3 to run this

How to install it

How to use it

For fun to run on docker

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages