Unable to connect, retrying...

feeds
popular
recent
reader
about
story
technologies
what is rss
connect
dmca
contact

Online collaborative whiteboard. Powerful, engaging with timer, emoji's, commenting and voting.

Feed Preview stat.ML updates on arXiv.org

stat.ML updates on arXiv.org

stat.ML updates on the arXiv.org e-print archive.

Feed: Related:

Learning Sparse High-Dimensional Matrix-Valued Graphical Models From Dependent Data

Thu January 1st, 1970

arXiv:2404.19073v1 Announce Type: new Abstract: We consider the problem of inferring the conditional independence graph (CIG) of a sparse, high-dimensional, stationary matrix-variate Gaussian tim...

https://arxiv.org/abs/2404.19073

Landmark Alternating Diffusion

Thu January 1st, 1970

arXiv:2404.19649v1 Announce Type: cross Abstract: Alternating Diffusion (AD) is a commonly applied diffusion-based sensor fusion algorithm. While it has been successfully applied to various probl...

https://arxiv.org/abs/2404.19649

Tabular Data Contrastive Learning via Class-Conditioned and Feature-Correlation Based Augmentation

Thu January 1st, 1970

arXiv:2404.17489v2 Announce Type: replace-cross Abstract: Contrastive learning is a model pre-training technique by first creating similar views of the original data, and then encouraging the dat...

https://arxiv.org/abs/2404.17489

Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data

Thu January 1st, 1970

arXiv:2404.01413v2 Announce Type: replace-cross Abstract: The proliferation of generative models, combined with pretraining on web-scale data, raises a timely question: what happens when these mo...

https://arxiv.org/abs/2404.01413

An extended asymmetric sigmoid with Perceptron (SIGTRON) for imbalanced linear classification

Thu January 1st, 1970

arXiv:2312.16043v3 Announce Type: replace-cross Abstract: This article presents a new polynomial parameterized sigmoid called SIGTRON, which is an extended asymmetric sigmoid with Perceptron, and...

https://arxiv.org/abs/2312.16043

Any-dimensional equivariant neural networks

Thu January 1st, 1970

arXiv:2306.06327v2 Announce Type: replace-cross Abstract: Traditional supervised learning aims to learn an unknown mapping by fitting a function to a set of input-output pairs with a fixed dimens...

https://arxiv.org/abs/2306.06327

Kernel Density Matrices for Probabilistic Deep Learning

Thu January 1st, 1970

arXiv:2305.18204v3 Announce Type: replace-cross Abstract: This paper introduces a novel approach to probabilistic deep learning, kernel density matrices, which provide a simpler yet effective mec...

https://arxiv.org/abs/2305.18204

E-Valuating Classifier Two-Sample Tests

Thu January 1st, 1970

arXiv:2210.13027v2 Announce Type: replace-cross Abstract: We introduce a powerful deep classifier two-sample test for high-dimensional data based on E-values, called E-value Classifier Two-Sample...

https://arxiv.org/abs/2210.13027

Orthonormal Expansions for Translation-Invariant Kernels

Thu January 1st, 1970

arXiv:2206.08648v4 Announce Type: replace-cross Abstract: We present a general Fourier analytic technique for constructing orthonormal basis expansions of translation-invariant kernels from ortho...

https://arxiv.org/abs/2206.08648

Sparse Interaction Neighborhood Selection for Markov Random Fields via Reversible Jump and Pseudoposteriors

Thu January 1st, 1970

arXiv:2204.05933v4 Announce Type: replace-cross Abstract: We consider the problem of estimating the interacting neighborhood of a Markov Random Field model with finite support and homogeneous pai...

https://arxiv.org/abs/2204.05933

Estimating Causal Effects with Double Machine Learning -- A Method Evaluation

Thu January 1st, 1970

arXiv:2403.14385v2 Announce Type: replace Abstract: The estimation of causal effects with observational data continues to be a very active research area. In recent years, researchers have develop...

https://arxiv.org/abs/2403.14385

Conditional validity of heteroskedastic conformal regression

Thu January 1st, 1970

arXiv:2309.08313v2 Announce Type: replace Abstract: Conformal prediction, and split conformal prediction as a specific implementation, offer a distribution-free approach to estimating prediction ...

https://arxiv.org/abs/2309.08313

Causal Inference with Differentially Private (Clustered) Outcomes

Thu January 1st, 1970

arXiv:2308.00957v2 Announce Type: replace Abstract: Estimating causal effects from randomized experiments is only feasible if participants agree to reveal their potentially sensitive responses. O...

https://arxiv.org/abs/2308.00957

KAN: Kolmogorov-Arnold Networks

Thu January 1st, 1970

arXiv:2404.19756v1 Announce Type: cross Abstract: Inspired by the Kolmogorov-Arnold representation theorem, we propose Kolmogorov-Arnold Networks (KANs) as promising alternatives to Multi-Layer P...

https://arxiv.org/abs/2404.19756

The lazy (NTK) and rich ($mu$P) regimes: a gentle tutorial

Thu January 1st, 1970

arXiv:2404.19719v1 Announce Type: cross Abstract: A central theme of the modern machine learning paradigm is that larger neural networks achieve better performance on a variety of metrics. Theore...

https://arxiv.org/abs/2404.19719

PCA for Point Processes

Thu January 1st, 1970

arXiv:2404.19661v1 Announce Type: cross Abstract: We introduce a novel statistical framework for the analysis of replicated point processes that allows for the study of point pattern variability ...

https://arxiv.org/abs/2404.19661

Attacking Bayes: On the Adversarial Robustness of Bayesian Neural Networks

Thu January 1st, 1970

arXiv:2404.19640v1 Announce Type: cross Abstract: Adversarial examples have been shown to cause neural networks to fail on a wide range of vision and language tasks, but recent work has claimed t...

https://arxiv.org/abs/2404.19640

Scalable Bayesian Inference in the Era of Deep Learning: From Gaussian Processes to Deep Neural Networks

Thu January 1st, 1970

arXiv:2404.19157v1 Announce Type: new Abstract: Large neural networks trained on large datasets have become the dominant paradigm in machine learning. These systems rely on maximum likelihood poi...

https://arxiv.org/abs/2404.19157

Be Aware of the Neighborhood Effect: Modeling Selection Bias under Interference

Thu January 1st, 1970

arXiv:2404.19620v1 Announce Type: cross Abstract: Selection bias in recommender system arises from the recommendation process of system filtering and the interactive process of user selection. Ma...

https://arxiv.org/abs/2404.19620

Inexact subgradient methods for semialgebraic functions

Thu January 1st, 1970

arXiv:2404.19517v1 Announce Type: cross Abstract: Motivated by the widespread use of approximate derivatives in machine learning and optimization, we study inexact subgradient methods with non-va...

https://arxiv.org/abs/2404.19517

Online and Offline Robust Multivariate Linear Regression

Thu January 1st, 1970

arXiv:2404.19496v1 Announce Type: cross Abstract: We consider the robust estimation of the parameters of multivariate Gaussian linear regression models. To this aim we consider robust version of ...

https://arxiv.org/abs/2404.19496

Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning

Thu January 1st, 1970

arXiv:2404.19292v1 Announce Type: cross Abstract: This work designs and analyzes a novel set of algorithms for multi-agent reinforcement learning (MARL) based on the principle of information-dire...

https://arxiv.org/abs/2404.19292

Training-free Graph Neural Networks and the Power of Labels as Features

Thu January 1st, 1970

arXiv:2404.19288v1 Announce Type: cross Abstract: We propose training-free graph neural networks (TFGNNs), which can be used without training and can also be improved with optional training, for ...

https://arxiv.org/abs/2404.19288

Statistical Mechanics Calculations Using Variational Autoregressive Networks and Quantum Annealing

Thu January 1st, 1970

arXiv:2404.19274v1 Announce Type: cross Abstract: In statistical mechanics, computing the partition function is generally difficult. An approximation method using a variational autoregressive net...

https://arxiv.org/abs/2404.19274

Orthogonal Bootstrap: Efficient Simulation of Input Uncertainty

Thu January 1st, 1970

arXiv:2404.19145v1 Announce Type: cross Abstract: Bootstrap is a popular methodology for simulating input uncertainty. However, it can be computationally expensive when the number of samples is l...

https://arxiv.org/abs/2404.19145

A model-free subdata selection method for classification

Thu January 1st, 1970

arXiv:2404.19127v1 Announce Type: cross Abstract: Subdata selection is a study of methods that select a small representative sample of the big data, the analysis of which is fast and statisticall...

https://arxiv.org/abs/2404.19127

Hidden Synergy: $L_1$ Weight Normalization and 1-Path-Norm Regularization

Thu January 1st, 1970

arXiv:2404.19112v1 Announce Type: cross Abstract: We present PSiLON Net, an MLP architecture that uses $L_1$ weight normalization for each weight vector and shares the length parameter across the...

https://arxiv.org/abs/2404.19112

Unifying Simulation and Inference with Normalizing Flows

Thu January 1st, 1970

arXiv:2404.18992v1 Announce Type: cross Abstract: There have been many applications of deep neural networks to detector calibrations and a growing number of studies that propose deep generative m...

https://arxiv.org/abs/2404.18992

Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo

Thu January 1st, 1970

arXiv:2404.17546v1 Announce Type: cross Abstract: Numerous capability and safety techniques of Large Language Models (LLMs), including RLHF, automated red-teaming, prompt engineering, and infilli...

https://arxiv.org/abs/2404.17546

Neural Dynamic Data Valuation

Thu January 1st, 1970

arXiv:2404.19557v1 Announce Type: new Abstract: Data constitute the foundational component of the data economy and its marketplaces. Efficient and fair data valuation has emerged as a topic of si...

https://arxiv.org/abs/2404.19557

Statistics and explainability: a fruitful alliance

Thu January 1st, 1970

arXiv:2404.19301v1 Announce Type: new Abstract: In this paper, we propose standard statistical tools as a solution to commonly highlighted problems in the explainability literature. Indeed, lever...

https://arxiv.org/abs/2404.19301

Regression for matrix-valued data via Kronecker products factorization

Thu January 1st, 1970

arXiv:2404.19220v1 Announce Type: new Abstract: We study the matrix-variate regression problem $Y_i = sum_{k} beta_{1k} X_i beta_{2k}^{top} + E_i$ for $i=1,2dots,n$ in the high dimensional regime...

https://arxiv.org/abs/2404.19220

Control randomisation approach for policy gradient and application to reinforcement learning in optimal switching

Thu January 1st, 1970

arXiv:2404.17939v2 Announce Type: replace-cross Abstract: We propose a comprehensive framework for policy gradient methods tailored to continuous time reinforcement learning. This is based on the...

https://arxiv.org/abs/2404.17939