ohjho/recommendation_system

recommendation_system

Xccelerate Data Science Bootcamp Collaborative Project: 4 flavours of recommendation systems built on the Book-Crossing Dataset, which is included in this repo.

See the project's details in project_details.md.

Made with Python. Licensed under the MIT license.

How to Use this repo

Clone this repo:

$ git clone https://github.com/ohjho/recommendation_system.git
$ cd recommendation_system

Install the requirements. We highly recommend doing this inside a virtualenv to avoid dependency hell.

#---------------- optional ------------------
$ mkvirtualenv --python=`which python3` NameOfYourEnv
$ workon NameOfYourEnv
#--------------------------------------------
(NameOfYourEnv) $ pip install -r requirements.txt

Then run pip check and resolve any package dependency issues it reports. When everything is fine, it should say No broken requirements found.
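If you are not sure whether the virtualenv is actually active before installing, a quick check from Python can help. The snippet below is a minimal sketch and not part of this repo; the helper name in_virtualenv is made up for illustration.

```python
import sys


def in_virtualenv():
    """Return True when the current interpreter runs inside a virtual environment."""
    # Older virtualenv releases set sys.real_prefix; venv and newer virtualenv
    # releases instead make sys.prefix differ from sys.base_prefix.
    if hasattr(sys, "real_prefix"):
        return True
    return sys.prefix != getattr(sys, "base_prefix", sys.prefix)


if in_virtualenv():
    print("Virtual environment active:", sys.prefix)
else:
    print("No virtual environment detected; packages would go to", sys.prefix)
```

Run it with the same python that pip will use, for example right after workon NameOfYourEnv.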

Start Jupyter notebook

$ jupyter notebook

Data Cleaning

How to use data_cleaning.py

The script data_cleaning.py will import the datasets and clean the data.

To get 3 separate dataframes, do this:

from data_cleaning import get_clean_data
df_books, df_users, df_ratings = get_clean_data()

If the CSV files are not under data/, pass their location with the path argument.

To get one merged dataframe, do this:

from data_cleaning import get_merged_data_frame
df_merged = get_merged_data_frame(user_argv=user_threshold, isbn_argv=book_threshold)

where user_threshold is the minimum number of books a user must have rated to be kept; users with fewer ratings are filtered out. book_threshold is the counterpart for books: books with fewer than this number of ratings are filtered out. If the CSV files are not under data/, use the path argument.
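To make the two thresholds concrete, here is a minimal sketch of this kind of filtering on a toy list of ratings. It is not the code in data_cleaning.py: the function filter_by_thresholds, the tuple layout, and the toy data are all invented for illustration, and it assumes both counts are taken on the unfiltered ratings, which may differ from what the script does.

```python
from collections import Counter


def filter_by_thresholds(ratings, user_threshold, book_threshold):
    """Keep ratings whose user and book both meet a minimum rating count.

    ratings is a list of (user_id, isbn, rating) tuples, a toy stand-in
    for the ratings dataframe.
    """
    # Count ratings per user and per book on the unfiltered data (an assumption).
    user_counts = Counter(user for user, _, _ in ratings)
    book_counts = Counter(isbn for _, isbn, _ in ratings)
    return [
        (user, isbn, rating)
        for user, isbn, rating in ratings
        if user_counts[user] >= user_threshold
        and book_counts[isbn] >= book_threshold
    ]


# Toy data, invented for illustration only.
toy_ratings = [
    ("u1", "b1", 8),
    ("u1", "b2", 6),
    ("u2", "b1", 9),
    ("u3", "b3", 5),
]

filtered = filter_by_thresholds(toy_ratings, user_threshold=2, book_threshold=2)
print(filtered)  # [('u1', 'b1', 8)]
```

Only u1 has rated at least 2 books and only b1 has at least 2 ratings, so a single row survives. Higher thresholds give a denser but smaller dataset, which is the usual trade-off when preparing ratings for collaborative filtering.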

Presentation

Google Slides
