Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

albersola/WebGraph

Folders and files

NameName
Last commit message
Last commit date

Latest commit

History

10 Commits

Repository files navigation

Web Graph

Web Graph is a program that finds relationships between web pages.

It downloads a web page, extract the urls and follow those links in order to generate connections between the webs.

It's made with C++ and Lex for a university project.

The output can be preprocessed with 'graph.py' in order to generate a graph, but because of there is a lot of data, if you want to display the graph it would be necessary to extract the most relevant data.

Example of data from Xataka:

Xataka Graph

Dependencies

  • C++ (g++)
  • flex
  • libcurl
  • python3

Usage

$ make $ ./webgraph

  • url: start url.
  • depth: number of links to follow (depth).

Output

TXT File where each line is:

You can run "python3 graph.py" and it will parse this output file and it will create a .csv file with the edges of a graph.

Visualization

Soon :)

About

Extract and show relations between web pages.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

AltStyle によって変換されたページ (->オリジナル) /