495 questions
- Bountied 0
- Unanswered
- Frequent
- Score
- Trending
- Week
- Month
- Unanswered (my tags)
0
votes
0
answers
26
views
Assistance with Data Processing Insurance Premiums
I have been set a task by my manager to try and predict insurance premiums based on some categories such as job description, number of people employed and turnover. I am comparing between K-Nearest ...
0
votes
0
answers
27
views
Unabel to create kafka topics for MongoDB connector
I am trying to use MongoDB Kafka official connector to create topics automatically while creating connector using sql command
CREATE SOURCE CONNECTOR logistics_n WITH (
'connector.class' = 'com....
0
votes
1
answer
58
views
Multivalued column cannot be transformed
Im working with Stackoverflow 2024 survey. In the csv file there are several multivalued variables (separated by ;). I want to apply One-hot encoding to the variables Employment and LanguageAdmire by ...
0
votes
0
answers
22
views
NaN Values After Applying IterativeImputer and Inverse Transforming LabelEncoded Data
I am using IterativeImputer from sklearn.impute to fill missing values in my dataset. One of my columns, Education_Level, is a categorical feature, so I first applied LabelEncoder to convert it into ...
0
votes
0
answers
21
views
Does Modifying an Attribute of a Custom Dataset Affect Both Subsets After random_split in PyTorch?
I am working on a binary classification task using an audio dataset, which is already divided into training and testing sets. However, I also need a validation set, so I split the training set into ...
0
votes
2
answers
60
views
Combining multiple dataframes with same number of rows and different columns in R [duplicate]
I'm trying to combine several (>2) dataframes with the same rows and different columns in R.
For example, I have 4 dataframes:
df1 <- data.frame(
x = c("A1", "A2", "A3&...
user avatar
Karina
0
votes
1
answer
50
views
Is there a way to set the data_min and the data_max in MinMaxScaler()?
I'm currently using MinMaxScaler() on my dataset. However, because my dataset is large I'm doing a first iteration pass in batches to compute the Min and Max Values for my Scaler. i'm using ...
0
votes
0
answers
74
views
Downloading MIT-BIH NSR & SCD Holter Databases from PhysioNet in Python
I am working on a deep learning project to forecast Sudden Cardiac Death (SCD) using ECG data from PhysioNet. Specifically, I need to download and preprocess the following databases:
MIT-BIH Normal ...
0
votes
0
answers
18
views
How to combine columns with nested lists with each other using pandas? [duplicate]
I'm working on a padas DataFrame that contains columns with lists and currently trying the method explode, but I'm not getting the desired output, instead, it does a Cartesian Product, combining all ...
0
votes
0
answers
54
views
How can I batch process multiple .npy files in Python for motion capture data preprocessing?
I am working on a project where I need to preprocess multiple motion capture files stored in .npy format. I am able to load and preprocess individual files, but I am facing difficulties when trying to ...
2
votes
0
answers
66
views
kernel died when I run : dataset = Dataset.from_dict(data_dict)
I am fine-tuning sam model for my dataset containing train_images and train_masks. I am able to create dict, but when calling last command i.e. to load dataset from dict, kernel dies. It happened ...
0
votes
1
answer
550
views
How to create a scaler applying log transformation and MinMaxScaler in sklearn
I want to apply log() to my DataFrame and MinMaxScaler() together.
I want the output to be a pandas DataFrame() with indexes and columns from the original data.
I want to use the parameters used to ...
0
votes
1
answer
70
views
Varying embedding dim due to changing padding in batch size
I want to train a simple neural network, which has embedding_dim as a parameter:
class BoolQNN(nn.Module):
def __init__(self, embedding_dim):
super(BoolQNN, self).__init__()
self....
0
votes
0
answers
74
views
Input file specified two times
I am using shell in Jupyter with Python programming Language. When I use to prepare a dataset, I fail to complete it on sorting by column and case sensitive.
The line is like this:
!head -n 5 $...
-1
votes
1
answer
195
views
Capitalized words in sentiment analysis
I'm currently working with data of customers reviews on products from Sephora. my task to classify them to sentiments : negative, neutral , positive .
A common technique of text preprocessing is to ...