Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Commit 1e39fb4

Browse files
Update README.md
1 parent aafd4d1 commit 1e39fb4

File tree

1 file changed

+21
-11
lines changed

1 file changed

+21
-11
lines changed

‎README.md

Lines changed: 21 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -1,25 +1,35 @@
1-
## Exploration of Sentiment Analysis using Lexicon and Machine-Learning Based Methods
1+
## Exploration of Sentiment Analysis
22

33
This repo provides the submission entry for an in-class NLP sentiment analysis competition held at Microsoft AI Singapore group using techniques learned in class to classify text in identifying positive or negative sentiment.
44

5-
Data for this in-class competition comes from the `Sentiment140` dataset where the training and test data consists of randomly sampled 10% and 5% of the Sentiment140 dataset.
5+
![jpg](images/inclass-competition.jpg)
6+
7+
Recommended to install [Anaconda](https://www.anaconda.com/products/distribution), a pre-packaged Python distribution that contains all of the necessary libraries and software for this project. Alternatively, you can make use of [Google Colaboratory](https://colab.research.google.com/), which allows you to write and execute Python codes in your browser.
8+
9+
**Data**
10+
11+
Data for this in-class competition comes from the [Sentiment140](https://www.kaggle.com/datasets/kazanova/sentiment140) dataset where the training and test data consists of randomly sampled 10% and 5% of the dataset.
612

7-
- Text Pre-processing
8-
- VADER (VALENCE based sentiment analyzer)
13+
## Getting started using Lexicon and Machine Learning (ML) based methods
14+
Open `SentimentAnalysis.ipynb` on a jupyter notebook environment. Alternatively, you can view the codes in Google Colab [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://githubtocolab.com/KwokHing/SentimentAnalysis-Python-Demo/blob/master/SentimentAnalysis.ipynb).
15+
16+
- VADER (VALENCE based sentiment analyzer) (67%)
917
- Naive Bayes
10-
- Linear SVM (Support Vector Machine)
18+
- Linear SVM (Support Vector Machine) (80%)
1119
- Decision Tree
1220
- Random Forest
1321
- Extra Trees
14-
- SVC
22+
- SVC (80%)
1523

16-
![jpg](images/inclass-competition.jpg)
24+
## Exploring using Deep Learning Techniques (LSTM)
25+
Open `SentimentAnalysis_RNN.ipynb` on a jupyter notebook environment. Alternatively, you can view the codes in Google Colab [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://githubtocolab.com/KwokHing/SentimentAnalysis-Python-Demo/blob/master/SentimentAnalysis_RNN.ipynb).
26+
27+
The LSTM deep learning method (79%) did not perform better than SVC/SVM method
1728

18-
## Getting started
19-
Open `SentimentAnalysis.ipynb` on a jupyter notebook environment. Alternatively, you can view the codes in Google Colab [here](https://drive.google.com/open?id=1d_po5AQDFRovk4livi2kvv1hhjPLxqAC). The notebook consists of further technical details.
29+
## How about the BERT Transformers model
30+
Open `SentimentAnalysis_BERT.ipynb` on a jupyter notebook environment. Alternatively, you can view the codes in Google Colab [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://githubtocolab.com/KwokHing/SentimentAnalysis-Python-Demo/blob/master/SentimentAnalysis_BERT.ipynb).
2031

21-
## Improvements
22-
Could potentially explore the use of Deep Learning Techniques such as RNN and/or LSTM for sentiment analysis
32+
The State-of-the-Art transformer model performs slightly better at 82% accuracy
2333

2434
<!---
2535
# Walk-through of the submission entry:

0 commit comments

Comments
(0)

AltStyle によって変換されたページ (->オリジナル) /