Skip to main content
Stack Overflow
  1. About
  2. For Teams
Filter by
Sorted by
Tagged with
0 votes
1 answer
73 views

I’m trying to evaluate classification models on a highly imbalanced fraud dataset using the Brier Skill Score (BSS) as the evaluation metric. The dataset has ~2133 rows and the target Fraud_Flag is ...
0 votes
0 answers
100 views

I am working on the loan default prediction data set available on Kaggle which has a highly skewed class distribution. The best model I have gotten so far is as follows using ExtraTreesClassifier: ...
0 votes
1 answer
240 views

I’m running into a frustrating issue while training a BERT-based multi-label text classification model on an imbalanced dataset. After a few epochs, the training loss suddenly becomes NaN, and I can’t ...
-2 votes
1 answer
51 views

I am working on testing accuracy and performance using deep learning models on a complex dataset but I have reached a good accuracy but I need to improve it so any suggestions other than what I did(...
0 votes
0 answers
77 views

As an exercise, I'm trying to translate a model written in Keras (https://github.com/CVxTz/ECG_Heartbeat_Classification/blob/master/code/baseline_mitbih.py) into Pytorch code. I realize in Keras much ...
1 vote
2 answers
481 views

I anticipate that I have seen the question: Keras class_weight error dictionary keys/values referring to the same problem, but the solution does not seem to help me. With this code, where I just added ...
0 votes
0 answers
69 views

I'm training and validating models for a binary classification problem in a dataset that has great class imbalance. When searching for metrics for evaluating the performance of the models, I found ...
1 vote
1 answer
1k views

After researching, I realized that scale_pos_weight is typically calculated as the ratio of the number of negative samples to the number of positive samples in the training data. My dataset has 840 ...
1 vote
0 answers
88 views

I'm fairly new to ML and now I'm in the process of predicting employee attrition in a medium sized dataset. I have been able to run everything smoothly, but, as the dataset is imbalanced, I've been ...
0 votes
0 answers
125 views

I'm working on a medical image binary segmentation problem using a U-Net in tensorflow, and my classes are extremely unbalanced (about 1 in 10,000). As a result, my model wastes a ton of time going ...
0 votes
1 answer
44 views

I am trying to solve a ML problem if a person will deliver an order or not. Highly Imbalance dataset. Here is the glimpse of my dataset [{'order_id': '1bjhtj', 'Delivery Guy': 'John', 'Target': 0}, {'...
0 votes
0 answers
36 views

I am trying to predict number of members who will discontinue their membership. The whole dataset is about 12 millions rows of data with about 40 columns. A member status can be "Continue", "Voluntary ...
-1 votes
1 answer
190 views

I have a dataset for fraud detection (i can't disclose dataset) which is extremely imbalanced, when i use SMOTE everything works, but as i have 9 categorical features i wanted to use SMOTE-NC but when ...
0 votes
0 answers
59 views

I am trying to perform a balancing between two classes, one majority and one minority. The majority class is a number of no landslide points and the minority class is landslide. I am trying to apply ...
MM-'s user avatar
  • 1
0 votes
1 answer
363 views

When dealing with imbalanced datasets, my understanding is possible solutions are subsampling or oversampling the training set. However, the test set should reflect the imbalance of the original ...

15 30 50 per page
1
2 3 4 5
...
22

AltStyle によって変換されたページ (->オリジナル) /