Newest 'dataframe' Questions

Stack Overflow

148,726 questions

Advice

3 votes

6 replies

75 views

Reading in XML data in python

I am looking for some assistance with how to convert the below XML data into a dataframe. I have managed to write a working code in R (XML package, code is messy) but then I realised it might even be ...

John_dydx

asked yesterday

4 votes

2 answers

204 views

Pandas rolling window over a time period if some data might be missing within groups

I have a dataset with a column of groups, dates, day of the week and some data columns. For each date in each group, I want to work out the same day average from the last 3 weeks (l3w). I've been ...

Emi OB

3,425

asked Dec 31, 2025 at 13:46

0 votes

2 answers

89 views

Polars add elements to list

Suppose I have the following polars DataFrame: df = pl.DataFrame({"a": [["A111", "A110"], ["Z254"], ["B897", "C768", "D456"]]}) ...

robertspierre

5,516

asked Dec 31, 2025 at 6:42

0 votes

0 answers

79 views

zscore function not found [closed]

I'm working in R and have a dataframe mtcars with cars having a column wt (weight of the cars). I'm trying to calculate skewness of weights. The following is the exercise as shown in the book. The 1st ...

Baldomero123

asked Dec 31, 2025 at 4:53

4 votes

4 answers

197 views

Changing column values based on values of separate columns

I'm trying to figure out how to change values in a column (Age), based on the values of two separate columns (Species and Length). I have a dataset of fish lengths, with all of them designated either ...

Ray

asked Dec 29, 2025 at 19:09

3 votes

1 answer

123 views

Replacing several rows of data in a column efficiently using condition in a pipeline

I have the following dataframe: df <- data.frame( Form=rep(c("Fast", "Medium", "Slow"), each = 3), Parameter =rep(c("Fmax", "TMAX", "B&...

Maz

asked Dec 26, 2025 at 18:42

3 votes

1 answer

75 views

faster methods to remove substrings stored in one column from strings stored in another column

hist_df_2["time"] = hist_df_2.apply(lambda row : hist_df_2['timestamp'].replace(str(hist_df_2['date']), ''), axis=1) I tried this to remove the date part from the timestamp. However, for ...

DivineBanana

asked Dec 21, 2025 at 21:01

1 vote

1 answer

130 views

Broadcasting DataFrames across NumPy array dimensions

I'm working with a large Pandas DataFrame and a multi-dimensional NumPy array. My goal is to efficiently "broadcast" a specific column of the DataFrame across one or more dimensions of the ...

MintForge π

asked Dec 21, 2025 at 1:45

-1 votes

0 answers

83 views

Best approaches to applying a function to more than one column in df at once? [duplicate]

Say I have a pandas dataframe of > 2 columns and > 2 rows, I want to apply a function, such as a datatype conversion, to each element in at least two columns. I would like for it to be efficient,...

Jerry Sizzler

asked Dec 19, 2025 at 22:25

Best practices

0 votes

4 replies

44 views

unlink a file from dataframe

I have loaded an Excel file into a Python data frame. But once it's loaded, the file is locked against further changes. Now I want to unlink the file so that I can also make changes to the file directly.

Tarun

asked Dec 18, 2025 at 7:24

4 votes

0 answers

135 views

Filter empty string in a polars lazyframe

I am trying to filter out the URI column from a parquet file having over 50 million rows containing empty string using import polars as pl lf = pl.scan_parquet("data.parquet") lf.filter(pl....

srajan0149

asked Dec 17, 2025 at 16:45

5 votes

3 answers

250 views

How to query columns that are lists or dicts?

How can I query columns that are lists or dicts? Here is some basic JSON-like data. [ { "id": 1, "name": "John Doe", "age": 30, ...

LayneSadler

6,161

asked Dec 16, 2025 at 3:21

1 vote

3 answers

114 views

Access data frame from binary file

If I have saved a data frame using pickle in a binary file, how can I access it? def create_dataset(path): """ creates an binary file with dataset saved in it. "...

Prince Khatri

asked Dec 14, 2025 at 12:37

-3 votes

2 answers

112 views

How to print the value counts of a user-selected column in a pandas DataFrame? [closed]

I’m trying to write a Python script that allows the user to input the name of a column and then prints the value counts of that column from a pandas DataFrame. Here's what I currently have: def ...

user32044318

asked Dec 13, 2025 at 17:26

4 votes

2 answers

124 views

Is it the expected behaviour for `pl.int_ranges(scalar1, scalar2).list.sample(n)` to generate a column with a same sample filled? and why?

Given a DataFrame with a column of multiple rows, I am trying to generate a column with a different random sample for each row from the same range, so I tried to write this: >>> import polars as ...

huangjj27

asked Dec 13, 2025 at 8:47
