16,257 questions
0
votes
0
answers
19
views
Which statistical test should be used for multiple-response variables in survey data? [migrated]
I am working on an opinion barometer and analysing survey data in R.
For single-choice variables, my approach is:
...
3
votes
1
answer
104
views
How do I fix my R GLMMadaptive hurdle beta model so that it does not give me NaNs?
I just recently created an account. I am trying to fit a GLMM beta hurdle model of the parasitism success of parasitoid wasps in R, using the mixed_model function from the GLMMadaptive package.
...
Advice
1
vote
3
replies
65
views
How to handle heteroskedasticity when detecting anomalies using Z-scores on growth rates?
I’m trying to detect anomalies in a dataset using Z-scores based on the logged index change of a value between two time periods: VAL_t0 and VAL_t1.
The issue:
The variance of Z-scores decreases as ...
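A minimal sketch of the baseline the excerpt describes, assuming the "logged index change" means ln(VAL_t1 / VAL_t0) and that Z-scores are taken over all observations at once. The function name, the sample values, and the 1.5 cutoff are hypothetical, and the sketch makes no attempt to correct the heteroskedasticity the question is about:

```python
import math
import statistics

def log_change_z_scores(val_t0, val_t1):
    """Z-scores of the logged change ln(VAL_t1 / VAL_t0) across observations."""
    changes = [math.log(b / a) for a, b in zip(val_t0, val_t1)]
    mean = statistics.fmean(changes)
    sd = statistics.stdev(changes)  # sample standard deviation
    return [(c - mean) / sd for c in changes]

# Hypothetical values, for illustration only.
val_t0 = [100.0, 200.0, 50.0, 400.0, 80.0]
val_t1 = [105.0, 190.0, 55.0, 410.0, 160.0]

z = log_change_z_scores(val_t0, val_t1)
# The 1.5 cutoff is an assumption chosen for this tiny sample.
flagged = [i for i, score in enumerate(z) if abs(score) > 1.5]
```

Because a single pooled standard deviation is used, observations whose variance differs systematically (for example by the size of VAL_t0) are all judged on the same scale, which is the situation the question raises.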
Tooling
0
votes
0
replies
18
views
JASP - covariate in LMM analysis
In JASP, how do I run an LMM with time and group factors while controlling for a covariate? I tried, with time and group as fixed effects and a model that tests the time x group interaction. Then I ...
2
votes
0
answers
97
views
multinom standard errors change when scaling variable
I have the following R code that performs a multinomial logistic regression.
When scaling birthweight from grams (original data) to kg (a scale more similar to the other variables and easier to interpret) ...
Advice
0
votes
1
replies
44
views
Ranking formula for a frequency list
I'm trying to compile a list of the 5000 most common Simplified Chinese characters as found in modern fiction (books written from around 1920 onward). It should probably be mentioned that 5000 is a ...
3
votes
2
answers
154
views
I don't want some variable names to appear in the row column in expss
In the code below, I want to remove the "Miles/(US) gallon" text in the line. Because I'm already writing it as a column heading. I also don't want the "4/6/8" text to appear in ...
0
votes
1
answer
85
views
Python IQR-based outlier exclusion is not consistently right, and I am not sure why. Code design error?
The script reads the tachometer from a fan via an I2C-based sensor; this sensor at times reports a ...
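For reference, a minimal sketch of IQR-based exclusion using Tukey fences, assuming the readings are already collected in a list. The function name `iqr_filter`, the multiplier `k = 1.5`, and the sample readings are hypothetical and are not taken from the asker's script:

```python
import statistics

def iqr_filter(readings, k=1.5):
    """Keep readings inside [Q1 - k*IQR, Q3 + k*IQR] (Tukey fences)."""
    # statistics.quantiles with n=4 returns the three quartile cut points.
    q1, _, q3 = statistics.quantiles(readings, n=4)
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [r for r in readings if lower <= r <= upper]

# Hypothetical tachometer readings with one spurious spike.
rpm = [1200, 1210, 1195, 1205, 1198, 1202, 65535, 1207]
clean = iqr_filter(rpm)
```

One design point worth checking in any such filter: the quartiles here are computed from the same window that is being filtered, so a window dominated by bad readings would shift the fences themselves.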
1
vote
2
answers
97
views
Polars Rolling_mean negative values
Daily variation analysis in Polars: Rolling std for negative values only
The situation
Hi there, I have a Polars DataFrame with columns like ['Date','category','number'], already grouped. This ...
0
votes
0
answers
67
views
I can't get the coverage graph for the Wald interval. It only shows the outlines of the 99% CI but not the coverage that the Wald interval has.
wald = function(x, n, conf.level = 0.99){
  p = x/n
  sd <- sqrt(p*((1-p)/n))
  z = 2.576
  ci <- p + z*sd
  return(ci)
}
numSamples <- 10000
numTrials <- 10
probs <- seq(0.2, 0.4, 0....
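As a point of comparison, here is a minimal Python sketch of how coverage of a 99% Wald interval can be computed exactly from the binomial distribution instead of by simulation. It assumes n = 10 trials as in the excerpt; the function names and the three probabilities evaluated are hypothetical, and unlike the excerpt's function this version returns both the lower and the upper limit:

```python
import math

def wald_interval(x, n, z=2.576):
    """Two-sided Wald interval p_hat +/- z * sqrt(p_hat * (1 - p_hat) / n)."""
    p_hat = x / n
    half_width = z * math.sqrt(p_hat * (1 - p_hat) / n)
    return p_hat - half_width, p_hat + half_width

def wald_coverage(p, n, z=2.576):
    """Exact coverage: total binomial probability of outcomes whose interval contains p."""
    total = 0.0
    for x in range(n + 1):
        lower, upper = wald_interval(x, n, z)
        if lower <= p <= upper:
            total += math.comb(n, x) * p**x * (1 - p) ** (n - x)
    return total

# Coverage at a few true probabilities (values chosen for illustration).
coverage = {p: wald_coverage(p, 10) for p in (0.2, 0.3, 0.4)}
```

Plotting `coverage` against `p` gives the coverage curve; a horizontal line at 0.99 marks the nominal level for comparison.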
0
votes
1
answer
62
views
Ellipse.contains_points() in Matplotlib is not working as expected
I am using a slightly modified version of the confidence_ellipse() function given in this example on the Matplotlib documentation. The only difference is I am returning the ellipse variable itself ...
0
votes
4
answers
128
views
PostgreSQL solution to work easier on a specific row with multiple columns
I have a table where I want to compute some statistics across the columns of each row. For example:
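| id | NW2020 | NW2021 | NW2022 | NW2023 | NW2024 | CA2020 | ... |
|----|--------|--------|--------|--------|--------|--------|-----|
| 1  | -299   | 4000   | 21     | -325   | 2544   | 55     |     |
| 2  | 54     | 244    | -5     | -54    | 325    | 874    |     |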
How ...
3
votes
1
answer
102
views
Getting an actual boundary equation of LDA with two classes and two variables
I have a two-dimensional data plot that shows the sexual morphological difference of some insect species like this:
data plot.
My purpose is to obtain an estimated boundary of the linear ...
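For orientation, the textbook form of the two-class LDA boundary can be written out as below. This is a sketch under the usual LDA assumptions (a shared covariance matrix); the symbols $\mu_0, \mu_1$ (class means), $\Sigma$ (pooled covariance), and $\pi_0, \pi_1$ (class priors) are notation assumed here, not taken from the question:

```latex
% Boundary: the set of points x where the two linear discriminants are equal
w^\top x + b = 0,
\qquad
w = \Sigma^{-1}(\mu_1 - \mu_0),
\qquad
b = -\tfrac{1}{2}\,(\mu_1 + \mu_0)^\top \Sigma^{-1} (\mu_1 - \mu_0)
    + \log\frac{\pi_1}{\pi_0}

% With two variables x = (x_1, x_2) and w = (w_1, w_2), w_2 \neq 0:
x_2 = -\frac{w_1}{w_2}\,x_1 - \frac{b}{w_2}
```

With two variables the boundary is therefore a straight line whose slope and intercept follow directly from the estimated means, pooled covariance, and priors.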
9
votes
1
answer
329
views
Generating blue noise with values sampled from a log normal distribution
Aim
I am trying to generate random signals with the following two properties:
The values should be approximately log-normally distributed (any long-tailed distribution bounded from below with non-...
2
votes
1
answer
61
views
Seaborn histogram to plot survey answer frequency by gender
I have some survey question answers (letters A, B, C) I'd like to plot in a Seaborn histogram, with the frequency of each response letter grouped by gender.
import pandas as pd
answers = ['A', 'B', 'A'...