Skip to main content
Stack Overflow
  1. About
  2. For Teams
Filter by
Sorted by
Tagged with
0 votes
1 answer
59 views

Writing and reading a large data.table (~800M rows, 15 variables, 92GB) alters the data and I don't understand why! psc_all_old <- haven::read_sas("~/own/data/archive/psc_all_v_20250331....
Grillo's user avatar
  • 130
2 votes
2 answers
297 views

I'm working with 50 Parquet files (each file is of ~800MB with ~380000 rows and ~8 columns). I need to perform a grouped summarisation in R. Something like: group_by(sample_id, gene1, gene2) %>% ...
0 votes
1 answer
42 views

I am trying to run genetic distance analysis using adegenet on RStudio and I keep getting an error that some individuals are missing the -5 SNPs. I've copied a section of the code and error below. ...
1 vote
1 answer
123 views

I am trying to create a triangle heatmap of my FST distance matrix: NWAtl CIV SEN MRT MOR PRT ESP PAL LG JN TUN MLT MR AEG NWAtl 0 0 0 0 0 0,00098 0,00586 0,01953 0,01465 0,01855 ...
0 votes
1 answer
459 views

I would like to calculate some summary statistics such as FIS, FST, Ho, He for "each population" using a hierfstat object with 217 samples and 38,000 sites. Although the basic.stats() ...
Pei-Wei Sun's user avatar
0 votes
1 answer
221 views

I want to do the below, would it theoretically work? (I am working on coding it and checking but that is taking some time) Serialize a Java object of Custom class C1 (convert into byte[]) using FST ...
0 votes
0 answers
114 views

I'm reading data stored in several fst-files using tidyfst and purrr::map_dfr. The following example works as expected. library(tidyverse) library(fst) library(tidyfst) # get all fst files in current ...
mgrund's user avatar
  • 1,665
4 votes
0 answers
39 views

I am aware that this question was already posted before, but since it was asked more than eight years ago I was wondering if there is solution now. The suggested answers in that post do not provide a ...
0 votes
0 answers
711 views

This is an issue already opened on fstpackage's github, but it seems the package author is no longer actively maintaining it. Meanwhile, I need a workaround to this crash problem, which is repeatable ...
0 votes
0 answers
80 views

I have two finite state acceptors, a.fst and b.fst. They are both unweighted, simply encoding sequences of words. Their symbol (word) lists have some overlap, but are not identical. Now I want a union ...
Jiaji Huang's user avatar
0 votes
1 answer
145 views

In this paper "Direct Construction of Minimal Acyclic Subsequential Transducers" https://citeseerx.ist.psu.edu/viewdoc/download;jsessionid=690EBF51DD6D52AB9160723AAE3A9541?doi=10.1.1.24.3698&...
Chen Li's user avatar
  • 350
1 vote
0 answers
344 views

Our team is looking to integrate more of our R and python work. One part of this effort has been trying to move from fst files (using the package fst), which as far as I know cannot be read in python ...
0 votes
1 answer
88 views

I am having problems recovering lubridate::intervals when reading back from csv, and fst formats. Does anyone have a suggestion for how to do this? library(tidyverse) library(fst) library(lubridate) ...
evidently's user avatar
0 votes
2 answers
79 views

There is a set of pairs (set_pairs) and I want to create a set (set_fsts) of first elements of those pairs. I am writing it in the following way definition "set_fsts = {f . p ∈ set_piars ∧ fst p ...
S2020's user avatar
  • 1
0 votes
1 answer
692 views

Given a "large" list of patterns and a "short" text, what is the best/fastest way to search/tag those patterns in the text, where we are trying to find the pattern as a substring ...

15 30 50 per page
1
2 3

AltStyle によって変換されたページ (->オリジナル) /