Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Commit c9ae615

Browse files
update readme.md
1 parent e2f7a93 commit c9ae615

File tree

1 file changed

+15
-2
lines changed
  • WebScrapingScripts/Movie Review Scraping

1 file changed

+15
-2
lines changed

‎WebScrapingScripts/Movie Review Scraping/README.md

Lines changed: 15 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -3,13 +3,20 @@
33
## AIM
44
> To Extract the Reviews of Movies.
55
6+
## DESCRIPTION
7+
Here is the Python Script which is used to extract the Reviews of Movies from IMDb. We have use `requests` and `bs4` packages to extract the data.
8+
9+
## PURPOSE
10+
In this project you’ll learn about HTTP requests and how to send them using the requests package and will also learn how to extract required data from HTML pages using some simple functions of beautifulsoup module. As we know Sentimental Analysis is very popular task in Machine Learning, so I have wrote a Python script to get the data for you and perform several task on this type of NLP.
11+
612
## PACKAGES USED
713
> The purpose of these packages in project
814
- `requests` - It has been to send and recieve the request in order to fetch the data from IMDB.
915
- `bs4` - It has been used to extract the HTML elements from website.
1016
- `json` - json is used as helper in order to save the list of movies and its links.
1117
- `pandas` - It is used to create and store dataframes into .csv format.
1218

19+
1320
## Workflow
1421
- Import above packages mentioned above.
1522
- Extracting movies and links
@@ -22,15 +29,21 @@
2229
- `pip install bs4`
2330
- `pip install json`
2431

32+
## COMPILATION STEPS
33+
> Go to terminal
34+
35+
> Run command : `python3 scrapy_data.py`
36+
37+
> Rest the script will do the work.
2538
## SOURCE
2639
### **IMDB**
27-
![Image](img.png)
40+
![Image](Images/img.png)
2841

2942

3043

3144
## OUTPUT
3245
### VS CODE TERMINAL
33-
![OUTPUT](terminal.png)
46+
![OUTPUT](Images/terminal.png)
3447

3548

3649
## AUTHOR

0 commit comments

Comments
(0)

AltStyle によって変換されたページ (->オリジナル) /