hate-speech

Here are 113 public repositories matching this topic...

unitaryai / detoxify

Star 1.1k

Trained models & code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using ⚡ PyTorch Lightning and 🤗 Transformers. For access to our API, please email us at contact@unitary.ai.

nlp kaggle-competition sentence-classification bert hatespeech hate-speech toxicity toxic-comment-classification toxic-comments bert-model hate-speech-detection huggingface pytorch-lightning toxicity-classification huggingface-transformers

Updated Oct 6, 2025
Python

t-davidson / hate-speech-and-offensive-language

Star 830

Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017

nlp classifier machine-learning natural-language-processing twitter dataset abuse labeled-data offensive icwsm computational-social-science hatespeech offensive-language hate-speech

Updated Jun 12, 2023
Jupyter Notebook

kocohub / korean-hate-speech

Star 388

Korean HateSpeech Dataset

natural-language-processing dataset korean-nlp hate-speech

Updated Jul 18, 2020

hate-alert / HateXplain

Star 221

Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question.

detection lstm offensive bias hatespeech hate-speech interpretable-deep-learning attention-lstm bert-model explainability bert-fine-tuning

Updated Jun 12, 2023
Python

Hironsan / HateSonar

Star 193

Hate Speech Detection Library for Python.

python machine-learning natural-language-processing hate-speech

Updated Oct 15, 2025
Jupyter Notebook

surge-ai / toxicity

Star 187

The world's largest social media toxicity dataset.

hate-speech toxicity content-moderation hate-speech-detection

Updated Jun 10, 2022

hate-alert / DE-LIMIT

Star 109

DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.

multilingual classification bert hate-speech cnn-gru laser-embeddings

Updated Jun 12, 2023
Jupyter Notebook

manoelhortaribeiro / HatefulUsersTwitter

Star 74

Code for the paper "Characterizing and Detecting Hateful Users on Twitter"

twitter abuse-detection hate-speech

Updated Apr 20, 2021
Jupyter Notebook

ruanchaves / napolab

Star 71

The Natural Portuguese Language Benchmark (Napolab). Stay up to date with the latest advancements in Portuguese language models and their performance across carefully curated Portuguese language tasks.

python nlp transformers english spanish benchmarks question-answering semantic-similarity datasets portuguese catalan textual-entailment hate-speech text-simplification galician huggingface huggingface-transformers large-language-models

Updated Jul 28, 2025
Python

hate-alert / Hate-Speech-Reading-List

Star 44

This repository contains papers and resources pertaining to hate speech research.

counter research speech reading-list hate hatespeech hate-speech counterspeech counter-speech

Updated May 30, 2021

hate-alert / Tutorial-Resources

Star 38

Resources and tools for the tutorial "Hate speech detection, mitigation and beyond", presented at ICWSM 2021

nlp natural-language-processing tutorial twitter hatespeech abuse-detection hate-speech bert-model counterspeech hate-speech-detection huggingface xlm-roberta xlmroberta huggingface-transformers icwsm2021

Updated Feb 23, 2022
Python

phusroyal / ViHOS

Star 35

Repository for the paper "ViHOS: Vietnamese Hate and Offensive Spans Detection" (EACL 2023)

nlp benchmark machine-learning natural-language-processing deep-learning python3 dataset sequence-labeling vietnamese-nlp social-media-mining hate-speech benchmark-datasets span-prediction vietnamese-dataset span-detection vihos

Updated Nov 25, 2023
Jupyter Notebook

sidneykung / twitter_hate_speech_detection

Star 32

Capstone project to automate Twitter hate speech detection with classification modeling.

nlp twitter classification logistic-regression nlp-machine-learning hate-speech hate-speech-tweets

Updated May 17, 2021
Jupyter Notebook

undp / iVerify-Apps

Star 31

iVerify Apps: Apps that support the AI-powered iVerify platform to combat misinformation and hate speech

elections misinformation disinformation hate-speech information-pollution

Updated Jun 26, 2025
TypeScript

the-markup / investigation-youtube-ad-placements

Star 30

Data and code from our stories, "Google Has a Secret Blocklist that Hides YouTube Hate Videos from Advertisers—But It’s Full of Holes," and "Google Blocks Advertisers from Targeting Black Lives Matter YouTube Videos."

youtube social-justice hate keyword-lists hate-speech algorithm-auditing racial-justice undocumented-endpoints

Updated Aug 23, 2021
Jupyter Notebook

hate-alert / Fear-speech-analysis

Star 26

Can fear be used for polarisation and spreading negativity? Our paper, accepted at The Web Conference 2021, explores this question in the context of public WhatsApp groups.

natural-language-processing paper transformers survey dataset whatsapp facebook-ads hatespeech hate-speech fear-speech whatsapp-groups fearspeech

Updated Mar 27, 2023
Jupyter Notebook

richouzo / hate-speech-detection-survey

Star 21

Trained Neural Networks (LSTM, HybridCNN/LSTM, PyramidCNN, Transformers, etc.) & comparison for the task of Hate Speech Detection on the OLID Dataset (Tweets).