Methodology
See recent articles
Showing new listings for Thursday, 2 July 2026
- [1] arXiv:2607.00128 [pdf, html, other]
-
Title: Similarity-Based Prediction for Digital Twins: Panel Data, Theory, and ApplicationsComments: 32 pages, 1 figureSubjects: Methodology (stat.ME)
Prediction from sequential panel data is central to digital-twin modeling, where new panels arrive over time and the predictive system is updated sequentially. Existing methods often rely on temporal proximity, which can fail when similar input-output patterns recur at nonadjacent times or when recent panels differ from the target panel. We propose State-Local Prediction (StaLoP), a nonparametric dynamic panel prediction framework that utilizes information through target-local predictive compatibility. StaLoP represents panels using target-local state vectors, compares historical and target panels via empirical discrepancy scores to determine relevance weights for the target point, and combines these weights with covariate localization. Theoretical results, including bias-variance characterization, asymptotic normality, simultaneous prediction bands, and a target-local-GDF-corrected MSPE criterion for panel and model selection, are developed. Extensive simulations validate the performance of StaLoP and support its theoretical properties. Applications to sequence prediction, simulator calibration, variable selection, and county-to-county migration-flow forecasting demonstrate improved out-of-sample prediction and provide scientific insights into the underlying applications.
- [2] arXiv:2607.00188 [pdf, html, other]
-
Title: Quantile regression with measurement errorsSubjects: Methodology (stat.ME)
We devise a novel estimator for a general quantile regression model with normal measurement errors in the covariates. The method is applicable to both linear and nonlinear quantile regressions and does not impose the quantile requirement on multiple quantile levels simultaneously. We circumvent the difficulties caused by discontinuity in quantile regression through kernel smoothing, and overcome the nonlinearity inherent in quantile regression via considering extension to the complex domain and moment generating functions. We show that the resulting estimator achieves the standard root-$n$ consistency and asymptotic normality under mild conditions. The performance of the proposed method is illustrated via numerical simulations and a real data example related to Cherry Blossom times in Japan in 2024. This is the first consistent estimator in a general quantile regression problem with normal measurement errors.
- [3] arXiv:2607.00222 [pdf, html, other]
-
Title: Causal Inference for All: Marginal Estimands for Outcomes Truncated by DeathSubjects: Methodology (stat.ME); Statistics Theory (math.ST)
In longitudinal studies, outcomes of interest are often truncated by death, meaning that they are only observed or well-defined conditional on intercurrent events such as survival. Existing strategies face a trade-off: causally interpretable estimands, such as survivor average causal effects, target a latent subgroup, whereas while-alive and composite summaries apply to the full population but are difficult to interpret as causal effects on the non-mortality outcome. We address these challenges by introducing methodology for a new set of estimands that (i) concern the entire population, (ii) remain causally interpretable, and (iii) leverage the longitudinal data commonly available in studies with outcomes truncated by death. The set of estimands includes single-world marginal separable effects that generalize conditional separable effects to full-population summaries. We develop identification and estimation results for these estimands and apply the methodology in a reanalysis of a prostate cancer trial, highlighting how different estimands can yield different treatment conclusions.
- [4] arXiv:2607.00350 [pdf, html, other]
-
Title: Robust Estimation and Inference with Selective Borrowing in Hybrid Controlled Trials: A Tutorial with SelectiveIntegrative and intFRTSubjects: Methodology (stat.ME)
Hybrid controlled trials (HCTs) augment randomized controlled trials (RCTs) with external controls (ECs) to improve statistical efficiency when RCTs face limited sample sizes, slow accrual, or ethical constraints. However, valid use of ECs requires careful adjustment for covariate shift and outcome drift, as inappropriate borrowing may introduce bias and compromise inference. This tutorial provides a practical workflow for estimation and inference in HCTs. We first present a statistical analysis roadmap covering estimands, identification assumptions, eligibility alignment, matching, full and selective borrowing strategies, and both asymptotic inference and randomization tests. We then demonstrate step-by-step implementation using the SelectiveIntegrative and intFRT packages. The workflow is illustrated using a synthetic lung cancer dataset included in the intFRT package that mimics the CALGB 9633 trial and ECs from the National Cancer Database. The tutorial aims to help applied statisticians conduct transparent, interpretable, and reproducible HCT analyses that improve efficiency while maintaining valid inference.
- [5] arXiv:2607.00373 [pdf, html, other]
-
Title: Confidence Intervals for the Risk Difference in Combined Unilateral and Bilateral Data Incorporating a Distribution-Based ApproachComments: 23 pages, 3 figures, 8 tablesSubjects: Methodology (stat.ME)
Combined unilateral and bilateral binary outcomes frequently arise in studies involving paired organs. The risk difference is a clinically interpretable measure for comparing treatment effects between groups. Existing confidence interval methods are primarily based on asymptotic normality and may fail to adequately reflect finite-sample distributional features, particularly skewness. To address this issue, we propose a distribution-based confidence interval derived from the probability distribution of the risk difference estimator and a modified MOVER procedure that accounts for intra-subject correlation. Their performances are compared with those of commonly used asymptotic methods through extensive simulation studies. Across a broad range of parameter settings, all methods exhibited satisfactory performance as sample size increased. The proposed distribution-based interval achieved coverage probabilities close to the nominal level with interval widths comparable to those of existing procedures. In small sample settings, it was able to capture skewness in the sampling distribution that was not reflected by methods relying on asymptotic normality. Analyses of two real-world datasets demonstrated the practical applicability of the competing methods and yielded consistent inferential conclusions. The proposed approach provides an alternative framework for interval estimation of the risk difference in studies involving combined unilateral and bilateral binary outcomes.
- [6] arXiv:2607.00376 [pdf, html, other]
-
Title: Distributed Prediction under Heterogeneity with Unidentifiable ParameterSubjects: Methodology (stat.ME); Optimization and Control (math.OC); Statistics Theory (math.ST)
Predicting a response based on covariates is a fundamental problem in statistics and machine learning. However, profound difficulties arise when the underlying low-dimensional structural parameters are unidentifiable, as typified in dimension reduction contexts. Specifically,estimating these non-identifiable parameters inherently introduces severe nonconvexity. In distributed settings, this difficulty is further compounded by the challenges of data heterogeneity and communication cost. To overcome these intertwined barriers, we propose a novel distributed semiparametric framework. We formulate an adaptive homogeneity pursuit utilizing a trace-similarity penalty to effectively address data heterogeneity. To resolve the ensuing severe nonconvexity and communication bottlenecks, we introduce an invex relaxation technique coupled with a multi-step local update algorithm, ensuring stable convergence to global optimality with significantly reduced communication overhead. Theoretically, we establish a non-asymptotic model-free prediction error bound and prove that our estimator achieves a two-phase minimax optimal convergence rate and an sharper model-free prediction error bound. Furthermore, we provide theoretical guarantees for algorithmic convergence and communication efficiency. Extensive simulations and a real-world multi-center medical application validate the superiority of our method.
- [7] arXiv:2607.00722 [pdf, html, other]
-
Title: How does academic performance affect self-efficacy? Interpretable modelling through latent academic achievementComments: Main manuscript: 25 pages (including references). Supplementary material: 19 pagesSubjects: Methodology (stat.ME); Applications (stat.AP)
There is increasing evidence of a directional relationship from academic performance to self-efficacy. We develop a Bayesian model for investigating this relationship when academic performance is measured on an ordinal scale and self-efficacy on a continuous scale. The model allows latent academic achievement to enter the self-efficacy regression as a predictor, while Bayesian variable selection identifies factors associated with either response. The resulting conditional formulation yields an interpretable regression characterisation of how latent academic achievement relates to self-efficacy. Furthermore, it enables a tailored partially collapsed Gibbs sampler that analytically integrates out the regression coefficients when updating the variable inclusion indicators. Simulation studies demonstrate that the proposed conditional formulation and tailored sampler improve sampling efficiency and variable-selection performance relative to a recent, more general joint Gaussian copula regression formulation. We apply the methodology to data from the longitudinal study of Australian children, a landmark national cohort study covering children's education, social and emotional wellbeing, health and family circumstances. The model and analysis shed light on how latent academic achievement relates to self-efficacy in Australian children, and reveal that the two outcomes differ markedly in the range of covariates associated with each outcome.
- [8] arXiv:2607.00847 [pdf, html, other]
-
Title: Transfert learning and adaptive LASSO quantileSubjects: Methodology (stat.ME); Computation (stat.CO)
We propose for a quantile regression an estimation method for transferring knowledge using two $L_1$ penalties based on an estimator obtained from a source database. The proposed transfer learning estimator satisfies the properties of consistency and sparsity. Its convergence rate and asymptotic behavior are studied in several scenarios. This knowledge transfer results in a shorter computation time than that of the standard adaptive LASSO estimator. Another advantage of our method is that it can be applied to models with non-Gaussian errors. In addition, in order to implement the computing of the adaptive transfer LASSO quantile estimator, we propose an algorithm. The simulations confirm the theoretical results and demonstrate that the adaptive learning estimator, calculated using the proposed algorithm, is more competitive than the LASSO estimators. Finally, we illustrate the practical utility of the proposed transfer learning estimator and algorithm using a real-data application involving the physicochemical properties of protein tertiary structures.
- [9] arXiv:2607.00915 [pdf, html, other]
-
Title: Simulating Node Manipulations in Gaussian Graphical Models: The GGMNIRA Framework for Continuous and Ordinal Psychological Network DataSubjects: Methodology (stat.ME)
Scientific Abstract: In psychological network analysis, centrality indices are commonly used to evaluate the importance of nodes within a network. However, centrality only captures the static topological position of a node, and there is no sufficient theoretical justification for assuming that it reflects a node's influence on network dynamics. The NodeIdentifyR Algorithm (NIRA) offers an alternative by systematically applying simulated manipulations to node intercepts within the Ising model to evaluate nodes' projected importance, but this algorithm is restricted to binary data, and the manipulated parameter lacks a clear theoretical meaning outside the context of psychopathology. To address these limitations, we propose the Gaussian Graphical Model NodeIdentifyR Algorithm (GGMNIRA), which manipulates a node's conditional mean and uses Kullback-Leibler (KL) divergence to quantify the change in network distribution before and after manipulation, thereby extending this simulated manipulation logic to the Gaussian graphical model framework, which is applicable to continuous and ordinal data. Around this algorithm, we further developed a correlation stability coefficient and a nonparametric bootstrap difference test for KL divergence, with corresponding interpretive thresholds established through simulation studies. The framework was also extended to bridge Gaussian graphical models and moderated Gaussian graphical models, enabling its application to multi-construct comorbidity networks and to contexts involving moderation effects. All methods are implemented in the R package "GGMNIRA".
- [10] arXiv:2607.00980 [pdf, html, other]
-
Title: An Instrumental Variable Approach to Account for Informative Treatment Switching in Real-world EvidenceSubjects: Methodology (stat.ME)
Reproducible and generalizable assessment of treatment decisions requires principled handling of subsequent treatment switching that may inform expected outcomes and shift across cohorts and over time. To effectively account for informative treatment switching, we propose an instrumental variable approach that characterizes the poorly documented expected outcomes at switching as unmeasured confounding. After establishing the baseline treatment as a viable instrumental variable, we constructed an estimating equation based on the association between the centered instrumental variable and a martingale style residual process that identifies the treatment effect under structural cumulative survival model. Our proposed method is doubly robust, i.e., valid whenever either of baseline propensity model or no-switching outcome model is consistently estimated. A co-training of treatment effect parameter and survival outcome regression model eliminated the requirement of observing a no-switching subset under semi-parametric additive hazards models. We further developed an baseline-survival-corrected cross-fitting approach to incorporate general machine learning models for estimating nuisance models. Numerical results demonstrated the validity of our method in various settings when a basket of benchmark solutions produced biased or contradictory results. We applied our method to comparison of high-efficacy vs standard efficacy disease modifying treatments as the second line therapy of multiple sclerosis.
New submissions (showing 10 of 10 entries)
- [11] arXiv:2607.00312 (cross-list from econ.EM) [pdf, html, other]
-
Title: Post-selection inference for network structureSubjects: Econometrics (econ.EM); Methodology (stat.ME)
Researchers often use the density of connections between groups of agents, such as communities, blocs, or markets, to characterize the structure of a social or economic network. In many cases, these groups are selected using the network data, making conventional fixed-group inference procedures potentially invalid. To address this issue, we develop two new confidence intervals that are universally valid post-selection in the sense that they guarantee simultaneous coverage asymptotically over all pairs of groups whose relative sizes do not vanish. Our first interval builds on a strategy of \cite{berk2013valid}. Our second interval is based on a Talagrand-type concentration inequality for empirical processes. Both intervals are simple to compute and scalable to large networks, but a key technical contribution of our paper is show that only the second interval achieves the best-possible width asymptotically up to a constant factor. Three empirical illustrations show that accounting for selection can matter in practice. Some evidence for homophily in a social network and a hub-and-spoke structure in a trade network survives our correction, while evidence for disjoint market segments in a worker transition network does not.
Cross submissions (showing 1 of 1 entries)
- [12] arXiv:2406.00730 (replaced) [pdf, html, other]
-
Title: Assessing survival models by interval testing with Poisson-binomial distributionsComments: Main: 13 pages. Total: 15 pagesSubjects: Methodology (stat.ME)
Selecting appropriate parametric survival models is often a pivotal part of a regulatory submission for new pharmaceutical products. With recent developments in complex survival approaches, the number of suitable models is increasing, making model selection more challenging. Common approaches to model selection include AIC, BIC, and expert opinion on survival extrapolation. However, these approaches primarily assess relative goodness-of-fit, providing limited insight into where, and to what extent, a fitted model is incompatible with the observed data. We propose evaluating survival models using Poisson-binomial distributions across specified time intervals. Two interval selection approaches, censor-defined intervals and 10 evenly-spaced intervals, are presented with worked examples. A simulation exercise, targeting two proposed test statistics across 12 standard scenarios (with different data maturity and patient numbers), demonstrated that for every scenario the empirical Type I error did not exceed the nominal 5% level. Our proposed model selection technique goes beyond classical approaches by highlighting time intervals where models perform poorly.
- [13] arXiv:2406.11584 (replaced) [pdf, html, other]
-
Title: Modeling cyclicality and intransitivity in paired comparisons dataComments: 49 pages, 5 tables, 3 FiguresSubjects: Methodology (stat.ME)
Paired comparison data arise in ranking problems, decision analysis, sports analytics, recommendation systems, and many other applications in which alternatives are evaluated by comparing two items at a time. Standard models typically impose a transitive preference profile induced by a vector of merits. In many empirical settings, however, preference relations exhibit cyclic and intransitive patterns that cannot be adequately represented by a global ranking. This paper develops a framework for modeling cyclicality and departures from transitivity. The proposed approach decomposes a preference profile into orthogonal transitive and cyclic components and provides a geometric characterization of the associated parameter space. The cyclic component is represented using an overcomplete dictionary of elementary cycles, so that identifying cyclic structure and the intransitivities it may induce becomes a sparse model selection problem. We propose a method for recovering sparse cyclic structure and establish large--sample guarantees for estimation and model recovery. The analysis clarifies the relationship between cyclicality, intransitivity, and several notions of transitivity used in paired comparison theory. By explicitly modeling cyclic structure, the proposed framework can improve estimation, ranking, interpretation, and prediction. The methodology is evaluated through simulations and illustrated with an empirical application.
- [14] arXiv:2504.15388 (replaced) [pdf, other]
-
Title: Deep learning with missing dataComments: 57 pages, 13 figuresSubjects: Methodology (stat.ME); Machine Learning (cs.LG); Statistics Theory (math.ST); Machine Learning (stat.ML)
In the context of multivariate nonparametric regression with missing covariates, we propose Pattern Embedded Neural Networks (PENNs), which can be applied in conjunction with any existing imputation technique. In addition to a neural network trained on the imputed data, PENNs pass the vectors of observation indicators through a second neural network to provide a compact representation. The outputs are then combined in a third neural network to produce final predictions. Our main theoretical result exploits an assumption that the observation patterns can be partitioned into cells on which the Bayes regression function behaves similarly, and belongs to a compositional Hölder class. It provides a finite-sample excess risk bound that holds for an arbitrary missingness mechanism, and in combination with a complementary minimax lower bound, demonstrates that our PENN estimator attains in typical cases the minimax rate of convergence as if the cells of the partition were known in advance, up to a poly-logarithmic factor in the sample size. Numerical experiments on simulated, semi-synthetic and real data confirm that the PENN estimator consistently improves, often dramatically, on standard neural networks without pattern embedding. Code to reproduce our experiments, as well as a tutorial on how to apply our method, is publicly available.
- [15] arXiv:2508.00937 (replaced) [pdf, html, other]
-
Title: A General Approach to Visualizing Uncertainty in Statistical GraphicsSubjects: Methodology (stat.ME); Graphics (cs.GR); Machine Learning (cs.LG)
We present a general approach to visualizing uncertainty in static 2-D statistical graphics. If we treat a visualization as a function of its underlying quantities, uncertainty in those quantities induces a distribution over images. We show how to aggregate these images into a single visualization that represents the uncertainty. The approach can be viewed as a generalization of sample-based approaches that use overlay. Notably, standard representations, such as confidence intervals and bands, emerge with their usual coverage guarantees without being explicitly quantified or visualized. As a proof of concept, we implement our approach in the IID setting using resampling, provided as an open-source Python library. Because the approach operates directly on images, the user needs only to supply the data and the code for visualizing the quantities of interest without uncertainty. Through several examples, we show how both familiar and novel forms of uncertainty visualization can be created. The implementation is not only a practical validation of the underlying theory but also an immediately usable tool that can complement existing uncertainty-visualization libraries.
- [16] arXiv:2511.21534 (replaced) [pdf, html, other]
-
Title: A Sensitivity Analysis Framework for Causal Inference Under InterferenceSubjects: Methodology (stat.ME)
In many applications of causal inference, the treatment received by one unit may influence the outcome of another, a phenomenon referred to as interference. Although there are several frameworks for conducting causal inference in the presence of interference, practitioners often lack the data necessary to adjust for its effects. In this paper, we propose a weighting-based sensitivity analysis framework that can be used to assess the systematic bias arising from ignoring interference. Unlike most of the existing literature, we allow for the presence of unmeasured confounding, and show that the combination of interference and unmeasured confounding is a notable challenge to causal inference. We also study a third factor contributing to systematic bias: lack of transportability. Our framework enables practitioners to assess the impact of these three issues simultaneously through several easily interpretable sensitivity parameters that can reflect a wide range of intuitions about the data.
- [17] arXiv:2603.20467 (replaced) [pdf, html, other]
-
Title: Goal-oriented learning of stochastic differential equations using error bounds on path-space observablesSubjects: Methodology (stat.ME); Machine Learning (cs.LG); Dynamical Systems (math.DS)
Stochastic differential equations (SDEs), which serve as the governing equations for dynamical systems in a broad range of applications, can become cost-prohibitive for numerical simulation at scales necessary for quantifying key properties. Surrogate models of the drift function of an SDE, learned from data of the high-fidelity system, are routinely used to increase the efficiency of simulation and prediction of properties. However, standard choices of loss function for learning the surrogate model fail to provide error guarantees in certain path-dependent observables, such as transition times. This paper introduces an error bound for path-space observables and employs it as a novel variational loss for the goal-oriented learning of the drift function of a SDE. We show the error bound holds for a broad class of observables, including mean first hitting times on unbounded time domains. We derive an analytical gradient of the goal-oriented loss by leveraging the formula for Fréchet derivatives of expected path functionals, which remains tractable for implementation in stochastic gradient descent schemes. We demonstrate that surrogate models of overdamped Langevin systems developed via goal-oriented learning achieve improved accuracy in predicting the statistics of a first hitting time observable and robustness to distributional shift in the data.
- [18] arXiv:2605.03264 (replaced) [pdf, html, other]
-
Title: Efficient Propose-Test-Release for Optimal Differentially Private EstimationComments: 20 pages, 3 figuresSubjects: Methodology (stat.ME)
Differential privacy (DP) is a rigorous framework that protects the participation of individuals in a dataset by controlling information leakage through released estimators. It brings a challenge for statisticians: DP uniformly considers all possible datasets, whereas statistical practice often downweights atypical or rare outcomes. The conceptual challenge is especially pronounced in sensitivity analysis, where atypical datasets introduces markedly high sensitivity, even for a basic estimator such as ordinary least square. Standard DP recipe adds a noise governed by this large overall sensitivity, which causes excessive loss in accuracy. We introduce an efficient Propose-Test Release (ePTR) pipeline, which tests the dataset via a user-designed Safety Lower Bound, and then probabilistically releases the estimator based on local sensitivity level. This flexible pipeline enables substantially simple DP mechanisms for many problems. To illustrate, we study basic estimators for Bayes classification, linear regression, and kernel regression. Each estimator can be highly sensitive to atypical datasets, yet admits simple ePTR-based algorithms that achieve minimax optimality. In numerical studies, these ePTR estimators demonstrate improved accuracy against popular DP baselines under privacy guarantees.
- [19] arXiv:2605.26608 (replaced) [pdf, html, other]
-
Title: Maximum-Likelihood Estimation of Hyperedge-Triggered Hawkes Processes via a Closed-Form EM AlgorithmComments: 13 pages, 6 figures, 2 tables; revised version with updated figures and layoutSubjects: Methodology (stat.ME)
Hypergraph effects in event streams are difficult to estimate because a group-level burst can often be explained either by direct higher-order excitation or by a collection of ordinary pairwise Hawkes interactions. This paper studies maximum-likelihood estimation for a hyperedge-triggered Hawkes process, in which the conditional intensity is excited both by individual past events and by the completion of a multi-node firing pattern within a short temporal window. We derive a closed-form EM algorithm based on latent branching responsibilities and a piecewise compensator for the most-recent-anchor hyperedge mechanism. The compensator corrects the naive integral that overcounts superseded pattern completions. For independently parameterised candidate hyperedges, the EM updates are closed form; when a low-rank CP parameterisation is imposed, the hyperedge factors are updated by block-coordinate ascent on the same expected complete-data objective, yielding a generalised EM implementation. Synthetic experiments show near-unbiased recovery under a time-rescaling-validated simulator, stable EM convergence, identifiable trigger-window structure, and the expected O(n^2) event-count scaling of the prototype implementation. The main statistical limitation is not numerical optimisation but identifiability: when pairwise and hyperedge components are supported on the same co-firing events, likelihood gains can be hard to attribute. Held-out analyses on retina and primary visual-cortex spike-train datasets show stable positive candidate-count BIC differences for the two cortical datasets and more fragile evidence for the retina dataset as the candidate set expands. Code and reproducibility scripts are available at this https URL.
- [20] arXiv:2605.29200 (replaced) [pdf, html, other]
-
Title: Approximating full conformal prediction: distribution free guarantees via the tournament correctionComments: 23 pages, 2 figuresSubjects: Methodology (stat.ME)
Conformal prediction is a framework for providing prediction intervals with distribution-free validity, guaranteeing predictive coverage for data drawn from any distribution. Its two main variants are full conformal prediction and split conformal prediction (also called transductive and inductive). Full conformal prediction is widely considered to be statistically more efficient (since split conformal prediction requires data splitting, and therefore can lead to wider prediction intervals due to the resulting loss in sample size), but its implementation is computationally prohibitive, as it requires the underlying model to be refit for every candidate value in the response space. Existing computational shortcuts, such as using a discrete grid of values to approximate the full conformal prediction construction, frequently lack theoretical guarantees on marginal coverage and can fail in practice.
To address this limitation, we introduce a novel class of approximations to the full conformal prediction method, based on the idea of \emph{tournaments}, which enables the construction of prediction sets with a rigorous marginal coverage guarantee of 1ドル-2\alpha$. Under stability conditions, the theoretical coverage guarantee tightens to approximately 1ドル-\alpha$. This new framework generalizes the existing method of leave-one-out cross-conformal prediction, while allowing for flexible use of various existing approximation strategies. - [21] arXiv:2606.31190 (replaced) [pdf, html, other]
-
Title: Semiparametric Efficiency in Sequential Experiments: Characterization and Design via Average PropensitySubjects: Methodology (stat.ME)
Modern experiments, including evaluations of AI-enabled services and platform interventions, often depart from independent and identically distributed (i.i.d.) sampling because assignments may be adaptive, balanced across covariates, or subject to rollout constraints such as exposure, fairness, and budget limits. This paper studies the efficiency benchmark for estimating causal targets in such sequential experiments. We show that every non-anticipating design induces an average propensity score, and we establish a semiparametric lower bound: for regular locally unbiased estimators, attainable precision is bounded by the i.i.d. efficiency benchmark evaluated at this induced score. The average propensity score thereby serves as a common benchmark and design target, allowing sequential experimental design to be viewed as choosing or learning an efficient allocation rule, with operational constraints entering through the admissible set when present. We then develop implementable batched adaptive designs that approach this benchmark through two complementary mechanisms. The first uses regression adjustment based on efficient influence functions; for general smooth estimands it attains the benchmark under standard nuisance-rate conditions, while for linear functionals of outcome means it achieves a sharp second-order rate. The second uses adaptive covariate balancing to attain the same benchmark through the assignment mechanism, enabling simple moment-based estimation. Both routes require only a small number of policy updates, making them compatible with delayed feedback and easier to monitor in operational deployments. Numerical experiments and an empirical study of AI medical-assistant evaluation demonstrate the practical efficiency gains, including in multi-treatment settings. Overall, the paper provides a unified framework for characterizing and designing efficient sequential experiments.
- [22] arXiv:2212.04814 (replaced) [pdf, html, other]
-
Title: The Generalized Falsification Adaptive Set for Violations of the Exclusion Restriction and ExogeneitySubjects: Econometrics (econ.EM); Methodology (stat.ME)
The falsification adaptive set (FAS) as proposed by Masten and Poirier (2021) provides an identified set for a treatment effect when the baseline model is falsified, assuming invalid instruments violate exclusion only. We show that whether an invalid instrument is a confounder or collider has important consequences: incorrect treatment can cause the FAS to exclude the true parameter. We derive pattern-specific falsification adaptive sets for each combination of violations and propose a generalized FAS as their union, containing the true parameter value if any instrument is valid. We illustrate our results with the roads and trade application of Duranton et al. (2014).
- [23] arXiv:2410.02050 (replaced) [pdf, html, other]
-
Title: A fast, flexible simulation framework for Bayesian adaptive designs -- the R package BATSSSubjects: Computation (stat.CO); Methodology (stat.ME)
The use of Bayesian adaptive designs for randomised controlled trials has been hindered by the lack of software readily available to statisticians. We have developed a new software package (Bayesian Adaptive Trials Simulator Software - BATSS for the statistical software R, which provides a flexible structure for the fast simulation of Bayesian adaptive designs for clinical trials. We illustrate how the BATSS package can be used to define and evaluate the operating characteristics of Bayesian adaptive designs for various different types of primary outcomes (e.g., those that follow a normal, binary, Poisson or negative binomial distribution) and can incorporate the most common types of adaptations: stopping treatments (or the entire trial) for efficacy or futility, and Bayesian response adaptive randomisation - based on user-defined adaptation rules. Other important features of this highly modular package include: the use of (Integrated Nested) Laplace approximations to compute posterior distributions, parallel processing on a computer or a cluster, customisability, adjustment for covariates and a wide range of available conditional distributions for the response.
- [24] arXiv:2510.06995 (replaced) [pdf, html, other]
-
Title: Root Cause Analysis of Outliers in Unknown Cyclic GraphsSubjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Methodology (stat.ME)
We study the propagation of outliers in cyclic causal graphs with linear structural equations, tracing them back to one or several "root cause" nodes. We show that it is possible to identify a short list of potential root causes provided that the perturbation is sufficiently strong and propagates according to the same structural equations as in the normal mode. This shortlist consists of the true root causes together with those of its parents lying on a cycle with the root cause. Notably, our method does not require prior knowledge of the causal graph and yields encouraging results on simulated data and real data from biology and cloud computing.
- [25] arXiv:2604.18742 (replaced) [pdf, html, other]
-
Title: JASPER: Joint Bayesian Analysis of Spatial Expression via RegressionComments: 43 pages; 5 figuresSubjects: Applications (stat.AP); Methodology (stat.ME)
Spatially resolved transcriptomics is a fast-developing set of technologies that enables the measurement of localized gene expression across spatial locations in a sample. Detecting spatially varying genes is critical for analyzing such data, yet existing methods often fail to account for inter-gene correlations, leading to inflated false positive and false negative rates. Additionally, most prominent methods rely on predefined spatial covariance kernels, making them sensitive to the complexity of spatial expression patterns. Motivated by a human breast cancer dataset, we address these limitations in existing literature through JASPER (Joint Bayesian Analysis of SPatial Expression via Regression), a Bayesian framework that jointly models spatial expression patterns across multiple genes using a spatial basis function regression approach. We demonstrate the superior performance of JASPER compared to existing methods in several real-world spatial transcriptomic datasets and supporting simulation experiments. JASPER identifies genes with stronger spatial correlation and greater biological relevance, as validated by overlap comparison, enrichment analysis, and pathway analysis using independent biological databases. Our results highlight the ability of JASPER to improve the statistical and biological interpretability of spatial transcriptomics data, making it a powerful tool for uncovering spatial gene expression patterns in complex biological systems.
- [26] arXiv:2606.03665 (replaced) [pdf, html, other]
-
Title: Sparse Tree-Based Aggregation for Time Series RegressionsSubjects: Econometrics (econ.EM); Methodology (stat.ME)
High-dimensional time series regressions are often regularized to produce sparse coefficients. We show that temporal aggregation provides a powerful alternative to reduce dimensionality in high-order autoregressions and mixed-frequency regressions. To this end, we propose StarTime (Sparse Tree-based Aggregation for Time Series), a convex penalization method that uses a temporal tree to arrange lags hierarchically from high to low frequency. StarTime then flexibly selects coefficients to be aggregated at possibly varying frequencies, sparse or a combination thereof. We provide new error bounds for StarTime, demonstrate improved estimation accuracy and recovery of aggregation and sparsity in simulations relative to benchmarks, and illustrate StarTime's relevance for financial and macroeconomic applications.
- [27] arXiv:2606.21639 (replaced) [pdf, html, other]
-
Title: A new classification method based on Minimum Spanning TreesSubjects: Machine Learning (cs.LG); Methodology (stat.ME); Machine Learning (stat.ML)
Minimum Spanning Trees have been used in unsupervised learning, particularly in clustering tasks, due to their ability to recognize clusters by removing edges that are considered inconsistent in defining those clusters. This paper aims to study the use of Minimum Spanning Trees in supervised learning. Specifically, we propose a classification algorithm based on Minimum Spanning Trees. To improve its performance, we introduce a robust version of the method that is also computationally more efficient. We evaluate the effectiveness of our proposed method through an extensive simulation study. We also apply the proposed methodology to a real-world case study involving aircraft trajectories.