arXiv:2404.19073v1 Announce Type: new Abstract: We consider the problem of inferring the conditional independence graph (CIG) of a sparse, high-dimensional, stationary matrix-variate Gaussian tim...
arXiv:2404.19649v1 Announce Type: cross Abstract: Alternating Diffusion (AD) is a commonly applied diffusion-based sensor fusion algorithm. While it has been successfully applied to various probl...
arXiv:2404.17489v2 Announce Type: replace-cross Abstract: Contrastive learning is a model pre-training technique by first creating similar views of the original data, and then encouraging the dat...
arXiv:2404.01413v2 Announce Type: replace-cross Abstract: The proliferation of generative models, combined with pretraining on web-scale data, raises a timely question: what happens when these mo...
arXiv:2312.16043v3 Announce Type: replace-cross Abstract: This article presents a new polynomial parameterized sigmoid called SIGTRON, which is an extended asymmetric sigmoid with Perceptron, and...
arXiv:2306.06327v2 Announce Type: replace-cross Abstract: Traditional supervised learning aims to learn an unknown mapping by fitting a function to a set of input-output pairs with a fixed dimens...
arXiv:2305.18204v3 Announce Type: replace-cross Abstract: This paper introduces a novel approach to probabilistic deep learning, kernel density matrices, which provide a simpler yet effective mec...
arXiv:2210.13027v2 Announce Type: replace-cross Abstract: We introduce a powerful deep classifier two-sample test for high-dimensional data based on E-values, called E-value Classifier Two-Sample...
arXiv:2206.08648v4 Announce Type: replace-cross Abstract: We present a general Fourier analytic technique for constructing orthonormal basis expansions of translation-invariant kernels from ortho...
arXiv:2204.05933v4 Announce Type: replace-cross Abstract: We consider the problem of estimating the interacting neighborhood of a Markov Random Field model with finite support and homogeneous pai...
arXiv:2403.14385v2 Announce Type: replace Abstract: The estimation of causal effects with observational data continues to be a very active research area. In recent years, researchers have develop...
arXiv:2309.08313v2 Announce Type: replace Abstract: Conformal prediction, and split conformal prediction as a specific implementation, offer a distribution-free approach to estimating prediction ...
arXiv:2308.00957v2 Announce Type: replace Abstract: Estimating causal effects from randomized experiments is only feasible if participants agree to reveal their potentially sensitive responses. O...
arXiv:2404.19756v1 Announce Type: cross Abstract: Inspired by the Kolmogorov-Arnold representation theorem, we propose Kolmogorov-Arnold Networks (KANs) as promising alternatives to Multi-Layer P...
arXiv:2404.19719v1 Announce Type: cross Abstract: A central theme of the modern machine learning paradigm is that larger neural networks achieve better performance on a variety of metrics. Theore...
arXiv:2404.19661v1 Announce Type: cross Abstract: We introduce a novel statistical framework for the analysis of replicated point processes that allows for the study of point pattern variability ...
arXiv:2404.19640v1 Announce Type: cross Abstract: Adversarial examples have been shown to cause neural networks to fail on a wide range of vision and language tasks, but recent work has claimed t...
arXiv:2404.19157v1 Announce Type: new Abstract: Large neural networks trained on large datasets have become the dominant paradigm in machine learning. These systems rely on maximum likelihood poi...
arXiv:2404.19620v1 Announce Type: cross Abstract: Selection bias in recommender system arises from the recommendation process of system filtering and the interactive process of user selection. Ma...
arXiv:2404.19517v1 Announce Type: cross Abstract: Motivated by the widespread use of approximate derivatives in machine learning and optimization, we study inexact subgradient methods with non-va...
arXiv:2404.19496v1 Announce Type: cross Abstract: We consider the robust estimation of the parameters of multivariate Gaussian linear regression models. To this aim we consider robust version of ...
arXiv:2404.19292v1 Announce Type: cross Abstract: This work designs and analyzes a novel set of algorithms for multi-agent reinforcement learning (MARL) based on the principle of information-dire...
arXiv:2404.19288v1 Announce Type: cross Abstract: We propose training-free graph neural networks (TFGNNs), which can be used without training and can also be improved with optional training, for ...
arXiv:2404.19274v1 Announce Type: cross Abstract: In statistical mechanics, computing the partition function is generally difficult. An approximation method using a variational autoregressive net...
arXiv:2404.19145v1 Announce Type: cross Abstract: Bootstrap is a popular methodology for simulating input uncertainty. However, it can be computationally expensive when the number of samples is l...
arXiv:2404.19127v1 Announce Type: cross Abstract: Subdata selection is a study of methods that select a small representative sample of the big data, the analysis of which is fast and statisticall...
arXiv:2404.19112v1 Announce Type: cross Abstract: We present PSiLON Net, an MLP architecture that uses $L_1$ weight normalization for each weight vector and shares the length parameter across the...
arXiv:2404.18992v1 Announce Type: cross Abstract: There have been many applications of deep neural networks to detector calibrations and a growing number of studies that propose deep generative m...
arXiv:2404.17546v1 Announce Type: cross Abstract: Numerous capability and safety techniques of Large Language Models (LLMs), including RLHF, automated red-teaming, prompt engineering, and infilli...
arXiv:2404.19557v1 Announce Type: new Abstract: Data constitute the foundational component of the data economy and its marketplaces. Efficient and fair data valuation has emerged as a topic of si...
arXiv:2404.19301v1 Announce Type: new Abstract: In this paper, we propose standard statistical tools as a solution to commonly highlighted problems in the explainability literature. Indeed, lever...
arXiv:2404.19220v1 Announce Type: new Abstract: We study the matrix-variate regression problem $Y_i = sum_{k} beta_{1k} X_i beta_{2k}^{top} + E_i$ for $i=1,2dots,n$ in the high dimensional regime...
arXiv:2404.17939v2 Announce Type: replace-cross Abstract: We propose a comprehensive framework for policy gradient methods tailored to continuous time reinforcement learning. This is based on the...