DISPCA : A hybrid iterative-sequential approach for the identification of errors-in-variables model of linear DAE systems

Published: 2026-06-22 19:32:57

Authors: Deepanjhan Das, Vishwesh Ramanathan, Shankar Narasimhan

Categories: eess.SY

Abstract:
The dynamic behavior of numerous engineering processes is effectively characterized through differential-algebraic equations (DAEs), commonly referred to as descriptor systems. While substantial progress has been achieved in identifying dynamic models governed by ordinary differential equations (ODEs), limited research has addressed the identification of descriptor systems from measured data. This work presents a systematic methodology for identifying the DAE model of a linear descriptor system in discrete difference equation form under errors-in-variables (EIV) setting, where both input and output measurements are corrupted by random noise. The proposed methodology generalizes the identification framework to handle scenarios where the system contains multiple algebraic and different ordered differential relations. The key innovation involves a partial stacking procedure of lagged data matrix with a sequentially increasing lag window that identifies all the differential relations individually. This is preceded by an iterative estimation of the measurement error covariance matrix that is diagonal and heteroskedastic, under large sample conditions. The algorithm simultaneously estimates the number of differential and algebraic relations, observability indices and delay parameters of the differential equations, and all the model coefficients directly from measured data without requiring prior specification from the user. The framework addresses the increased complexity arising from multiple dynamic coupled interactions while maintaining computational tractability through systematic decomposition of the identification problem. Effectiveness of the proposed methodology is demonstrated through several simulation studies.

arXiv Page | PDF

Score: 0

Order restricted estimation of the parameter functions in an additive hazard model

Published: 2026-06-22 19:28:27

Authors: Dragi Anevski, ElBatoul Manel Merai

Categories: math.ST

Abstract:
In this paper we propose estimators of the parameter functions in an Aalen additive hasard regression model. The estimators are the individual and componentwise $l^2$ projections of the naive estimators resulting from the ordinary least squares estimator in the Aalen additive hazard model on the space of monotone functions. We provide pointwise limit distribution results for the resulting estimators, that exhibit $n^{-1/3}$ rate of convergence and the Chernoff distribution as the limit distribution.

arXiv Page | PDF

Score: 0

JupOtter: Cell-Level Bug Detection in Jupyter Notebooks

Published: 2026-06-22 19:23:59

Authors: Lukas Ottenhof, Thibaud Lutellier

Categories: cs.SE, cs.AI

Abstract:
Jupyter Notebooks are an increasingly popular coding environment used across many domains, especially in Python-based data science and scientific computing. Originally used for prototyping and interactive exploration, notebooks are increasingly used to develop more complex programs, leading to a rapid rise in buggy notebooks on platforms like GitHub. To address this trend, we present JupOtter, a bug detection system designed specifically for Jupyter Notebooks. JupOtter features three novel contributions: (1) a notebook-specific tokenization strategy that preserves cell structure, (2) a cell-level bug prediction technique, and (3) a new labeled dataset, OtterDataset, containing over 21,000 notebooks annotated for fine-grained cell-level bug detection. JupOtter achieves cell-level bug detection F1 scores that surpass static analyzers and large language models in two out of three evaluation datasets.

arXiv Page | PDF

Score: 0

On the Semantics of Generative SPARQL

Published: 2026-06-22 19:21:41

Authors: Ratan Bahadur Thapa, Steffen Staab

Categories: cs.DB

Abstract:
We extend SPARQL with a generative query construct, called \tx{GenOp}, whose evaluation calls a language model and produces typed solution mappings. We define the semantics of the GenOp in the query in a way that maintains the fixed-dataset assumption, on which formal semantics of SPARQL build, and extend solution mappings with values generated by the language model. We formalize the semantics of the extended language over these mappings using a compatibility relation that generalizes equality and supports similarity-based matching between RDF terms and generated values. We analyze the semantic consequences of generative query patterns, focusing on mapping-level recursion induced by the reuse of generated bindings. Under deterministic bounded generation and finite candidate coverage assumptions, we characterize acyclic and stratified fragments with fixpoint semantics, establish algebraic equivalence and semantics-preserving rewrite rules, and provide an executable evaluation method; and we show that data and combined complexity coincide with those of standard SPARQL.

arXiv Page | PDF

Score: 0

Federated Survival Analysis in Healthcare: A Multi-Model Evaluation on Cross-Institutional Heterogeneous Breast Cancer Data

Published: 2026-06-22 19:14:51

Authors: Natalia Moreno-Blasco, Anusha Ihalapathirana, Pekka Siirtola, Miguel Fernandez-de-Retana

Categories: cs.LG, q-bio.QM, stat.ME, stat.ML

Abstract:
Survival analysis is central to clinical decision-making, yet reliable time-to-event models require large, diverse cohorts that are rarely available at a single institution, while privacy regulations restrict the centralization of patient data. Federated learning (FL) offers a privacy-preserving alternative by training shared models without exchanging raw data, but its effectiveness for survival modeling under realistic, heterogeneous conditions remains insufficiently understood. This paper presents a systematic, multi-model evaluation of federated survival analysis on a cross-institutional breast cancer cohort with naturally heterogeneous distributed clients. Three representative survival models, the Cox Proportional Hazards model, DeepSurv, and Random Survival Forest (RSF), are compared across centralized, local, and federated training, and three federated optimization strategies (FedAvg, FedProx, and FedAdam) are assessed for the gradient-based models. Results show that FL consistently outperforms local training and approaches, and occasionally exceeds, centralized performance, while RSF offers the best overall balance of discrimination, calibration, and robustness across heterogeneous clients. We further find that performance depends on the diversity of client distributions, and that FedAvg and FedProx are stronger and more stable than FedAdam. Based on these findings, we derive practical, decision-oriented guidelines mapping data, privacy, interpretability, and resource constraints to recommended model and training-paradigm choices for federated survival modeling in healthcare.

arXiv Page | PDF

Score: 0

Canopies: A Generalization of Vines and Vineyards for Parameterized Persistence

Published: 2026-06-22 18:52:57

Authors: Barbara Giunti, Elizabeth Munch

Categories: cs.CG

Abstract:
In this paper, we provide a new construction for studying parameterized persistence, called a canopy. We give two versions of this construction: the A-canopy, retaining all information about points on the diagonal of the persistence diagram; and the D-canopy, encoding the information of the "standard" persistence diagram. We do this by making a simple but major modification in the persistence bundle representation information: namely, rather than tracking a point in the persistence diagram, we instead track some choice of pairs of simplices that created said point. This viewpoint is a combinatorial version of tracking the chain complex information rather than just the output of persistence. We show how to construct the canopies from any filtered filtration function, proving, using the algebraic structure of filtered chain complexes, that different choices of pairs result in homeomorphic structures. Finally, we showcase the power of our approach by using canopies to define vines even in the presence of points with multiplicity; to discuss monodromy; and to obtain some immediate results linking non-trivial monodromy in the persistent homology transform with the existence of non-Hausdorff points in the canopy.

arXiv Page | PDF

Score: 0

Sesame: Structure-Aware Molecular Generation via Spatial Density-Map Conditioning

Published: 2026-06-22 18:48:10

Authors: Konstantin Yatsenko, Arvind Thiagarajan

Categories: cs.LG

Abstract:
Generative molecular models for drug design are a promising direction with much active research. In the next phase of computational drug design, such models will need to understand small molecule structure and protein-ligand interactions, and they will need to possess the machinery to generate molecules \textit{de novo}. Incorporating each feature poses a critical challenge. Equally important, yet often treated as secondary, is the ability to grow a molecule from a partial starting point -- a scaffold or fragment supplied by a chemist -- which is the central operation of lead optimization. We present Sesame (Spatial Evoformer for a Structure-Aware Molecular Engine), a diffusion-based molecular generation model that leverages a novel spatial pairformer module to condition on partial molecular structure and the surrounding protein pocket, both expressed as continuous spatial density maps. This single conditioning mechanism supports both \textit{de novo} generation and fragment-conditioned lead optimization, letting a medicinal chemist prune a hit to a scaffold and have Sesame grow it in productive ways. In addition to this module, we also introduce a diffusion framework for joint denoising of atom types, bond types, and positions, along with a trajectory finetuning scheme that trains on the model's own sampling rollouts to improve generation quality. Sesame is trained on a large corpus of ligand-only and protein-ligand datasets.

arXiv Page | PDF

Score: 0

A periodic flow with high emergence

Published: 2026-06-22 18:27:59

Authors: Odylo Costa

Categories: math.DS

Abstract:
We construct a smooth nonsingular periodic flow on a compact manifold with high emergence, in sharp contrast with the low statistical complexity of periodic self-maps. The construction is based on a modification of the Epstein--Vogt counterexample to the Periodic Orbit Conjecture and on the high-emergence mechanism of Berger--Bochi.

arXiv Page | PDF

Score: 0

Embodied Explainability and Ontological Obstacles: Why We Struggle to Explain the Answers of Large Language Models (LLMs)

Published: 2026-06-22 18:22:53

Authors: Marvin Pafla, Jesse Hoey, Kate Larson, Mark Hancock

Categories: cs.HC

Abstract:
Explainability is often framed as a property of an AI model, with explanations extracted from its internals and shown to users. In this argument paper, we instead provide an embodied account of explainability based on Dourish and enactivist cognition: understanding is created in use as people act on affordances in shared practice. Using demonstrations and conceptual analysis, we reveal ontological obstacles when "looking inside" large language models: surrogates import external abstractions that can be mistaken for the model's, and focusing on internal reasoning misses that explainers participate in their own understanding. We discuss these obstacles in XAI practice, arguing that many explanations are misnamed, which skews their purpose and can increase overreliance. Finally, we highlight how embodied explanations reorganize sense-making by making what matters publicly available for action, and argue that explainability claims should be reserved for designs that provide affordances to probe, coordinate, and repair behaviour in situated practice.

arXiv Page | PDF

Score: 0

Wavelet Matrix Product States for Quantum Fields

Published: 2026-06-22 18:07:17

Authors: Molly Kaplan, Antoine Tilloy

Categories: quant-ph, cond-mat.str-el, hep-th

Abstract:
We introduce a variational method to solve continuum quantum models with discrete tensor network techniques. The method leverages wavelet matrix product states (wMPS): matrix product states built on top of sufficiently regular ($N\geq 6$) Daubechies scaling functions. These states live in the continuum field theory Fock space, have finite energy density, and can be optimized with standard algorithms, without restriction to free theories. Further, exploiting the multi-resolution analysis built into wavelets, and its quantum circuit description, we can iteratively refine wMPS to obtain accurate approximations at arbitrarily fine length-scales. We showcase the efficiency of the method on the Lieb-Liniger model, computing energy density and correlation functions.

arXiv Page | PDF

Score: 0

Calibrating angular momentum transport in intermediate-mass stars from gravity-mode asteroseismology. II. Modelling 2937 BAF-type stars

Published: 2026-06-22 18:01:57

Authors: J. S. G. Mombarg, S. Mathis

Categories: astro-ph.SR

Abstract:
Asteroseismology of gravity (g)-mode pulsators covering BAF-type stars have shown that angular momentum is redistributed during the main sequence. The efficiency of the transport, however, remains largely uncalibrated. This paper aims at exploiting a sample of 2937 characterized g-mode pulsators (the largest one to-date) to place constraints on the efficiency of angular momentum transport by assuming an effective viscosity or an Eddy-viscosity based on the Tayler-Spruit dynamo within a fully-diffusive framework. We compute grids of rotating stellar evolution models that we then use to simulate a population of stars by sampling from these grids with prior distributions on the mass, age and initial rotation rate. We then compare these simulated distributions of rotation frequencies and specific angular momentum ($J/M$) to the ones of the sample of observed stars. We find that a fully-diffusive framework for the transport of angular momentum during the main sequence is sufficient to explain the observed evolution of near-core rotation frequencies, the observed differential rotation, and the observed mass-dependence of $J/M$ when the effective viscosity (assumed constant) is $10^6\,{\rm cm^2\,{\rm s^{-1}}}$ or larger. Viscosities predicted by the Tayler-Spruit dynamo are in general far above this value and can explain the data as well. Future observational studies of main sequence g-mode pulsators are encouraged to measure core-to-surface rotation rates, particularly of B-type stars. In this work we have exploited the constraining potential of near-core rotation frequencies alone, while the contrast with the surface rotation would allow us to unravel the mechanisms driving the transport further.

arXiv Page | PDF

Score: 0

Nuclei in high-energy neutrino sources: A multimessenger study of in-source propagation

Published: 2026-06-22 18:01:01

Authors: AmirFarzan Esmaeili, Arman Esmaili, Pasquale Dario Serpico

Categories: hep-ph, astro-ph.HE

Abstract:
The joint observation of astrophysical sources in gamma rays and neutrinos can provide invaluable insight into the physical conditions of the source, including its size, particle densities, and acceleration and production mechanisms. In this work, we investigate the role of nuclear composition in high-energy astrophysical environments. Using NGC 1068 as a representative example, we perform detailed Monte Carlo simulations of nuclear and electromagnetic cascades within the source and study the imprints of the injected nuclear composition on the resulting neutrino and gamma-ray emissions. We further discuss the importance of MeV-GeV gamma-ray observations for constraining the source composition in the context of future gamma-ray experiments. A dedicated re-analysis of archival COMPTEL observations is also presented.

arXiv Page | PDF

Score: 0

Universal Dynamical Response to Slow Driving in Chaotic Systems

Published: 2026-06-22 18:00:20

Authors: Nachiket Karve, Nathan Rose, David Campbell, Anatoli Polkovnikov

Categories: cond-mat.stat-mech, nlin.CD, quant-ph

Abstract:
We propose a unified perspective on classical and quantum chaos based on the stability of a system's stationary states under slow driving. We probe this sensitivity via the system's susceptibility to the average protocol speed, which we call the ``speed-Fisher information," and relate it to irreversible entropy production in the system. We show that chaotic dynamics manifests as a divergence of the speed-Fisher information with the protocol time, and that this response is controlled by the perturbation's low-frequency spectral weight. This approach to chaos applies to both classical and quantum Hamiltonian systems, and naturally extends to non-Hamiltonian classical flows. We illustrate this framework with simple classical and quantum examples, along with a non-Hamiltonian flow that qualitatively exhibits analogous low-frequency spectral behavior.

arXiv Page | PDF

Score: 0

First measurement of narrow-line flux ratios for a lensed quasar with JWST/NIRSpec IFS

Published: 2026-06-22 18:00:10

Authors: Hadrien Paugnat, Tommaso Treu, Anna M. Nierenberg, Anowar J. Shajib, Shawn Knabel, Daniel Gilman

Categories: astro-ph.GA, astro-ph.CO

Abstract:
Strong gravitational lensing is a powerful probe of dark matter (DM) structure on subgalactic scales: in particular, statistics of flux-ratio anomalies (discrepancies between mass model predictions and observed flux ratios) in quadruply imaged quasars are sensitive to perturbations by low-mass DM halos down to $\sim 10^6 M_\odot$. Studies leveraging these anomalies require high-quality flux-ratio measurements from an emission region insensitive to stellar microlensing. In this paper, we present the first measurement of narrow-line flux ratios for a gravitationally lensed quasar using JWST/NIRSpec with Integral Field Spectroscopy (IFS), targeting the well-studied system RXJ1131$-$1231. Flux ratios are extracted from the [S III] 9071/9533 $Å$ narrow-line doublet - the first use of this doublet for substructure studies - by performing a full lens model reconstruction to isolate the unresolved nuclear emission from extended narrow-line emission. The resulting spectra are jointly modeled using $\texttt{lensqso-specfit}$, a publicly available software package introduced in this work for the simultaneous spectral fitting of multiple lensed quasar images. We achieve $\sim$ 5% uncertainties on the flux ratios, comparable to the precision of JWST/MIRI warm dust measurements, and detect a clear anomaly in the cusp images relative to a standard smooth lens model. Our results are in good agreement with previous narrow-line measurements and broadly consistent with JWST/MIRI warm dust flux ratios, with marginal ($\sim 2-3σ$) deviations. We demonstrate how such shifts between differently sized emission regions may be enhanced by small ($\sim 10$ pc) spatial offsets. Our method is generalizable to other systems with existing or future IFS observations, and the combination of narrow-line and warm dust flux ratios offers a new avenue for improving DM constraints with flux-ratio anomaly statistics.

arXiv Page | PDF

Score: 0

Chiralization of Quiver Varieties

Published: 2026-06-22 18:00:08

Authors: Ioana Coman, Myungbo Shim, Masahito Yamazaki, Yehao Zhou

Categories: math.AG, hep-th, math-ph, math.QA, math.RT

Abstract:
Given a quiver Q with gauge dimension $\bf v$ and framing dimension $\bf w$, one can define the extended quiver variety $\widetilde{\mathcal M}(\mathbf v,\mathbf w)$, which is a smooth family of deformations of the Nakajima quiver variety $\mathcal M(\mathbf v,\mathbf w)$. In this paper we discuss two vertex algebras which chiralize the geometry $\widetilde{\mathcal M}(\mathbf v,\mathbf w)$. We construct a sheaf of $\hbar$-adic vertex superalgebras $\mathscr D^{\mathrm{ch}}_{\widetilde{\mathcal M}(\mathbf v,\mathbf w),\hbar}$ on $\widetilde{\mathcal M}(\mathbf v,\mathbf w)$ which quantizes the jet bundle of $\widetilde{\mathcal M}(\mathbf v,\mathbf w)$, and define a vertex algebra $\mathsf D^{\mathrm{ch}}(\widetilde{\mathcal M}(\mathbf v,\mathbf w))$ to be the $\hbar=1$ specialization of the $\mathbb C^{\times}$-finite part of the vector space of global sections $Γ(\widetilde{\mathcal M}(\mathbf v,\mathbf w), \mathscr D^{\mathrm{ch}}_{\widetilde{\mathcal M}(\mathbf v,\mathbf w),\hbar})$. We define another vertex superalgebra $\mathcal V(\mathbf v,\mathbf w)$ by BRST reduction of the tensor product of the $βγbc$-system and Heisenberg VOA associated to the quiver Q, and show that there exists a natural vertex superalgebra map from $\mathcal V(\mathbf v,\mathbf w)$ to $\mathsf D^{\mathrm{ch}}(\widetilde{\mathcal M}(\mathbf v,\mathbf w))$. Under certain technical assumptions, we prove that the negative degree BRST cohomologies of the tensor product of $βγbc$-systems and Heisenberg VOA associated to the quiver Q are zero, and under stronger assumptions, that the aforementioned vertex superalgebra map is injective. Physically, the vertex superalgebra $\mathcal V(\mathbf v,\mathbf w)$ is closely related to the boundary VOA of the H-twisted 3D $\mathcal N=4$ quiver gauge theory associated to the quiver Q with gauge and framing dimension vectors $\bf v$ and $\bf w$.

arXiv Page | PDF

Score: 0

Lift4D: Harmonizing Single-View 3D Estimation for 4D Reconstruction In-the-Wild

Published: 2026-06-22 17:59:54

Authors: Yehonathan Litman, Xiaoxuan Ma, Manan Shah, Nicolas Ugrinovic, Kris Kitani, Fernando De la Torre, Shubham Tulsiani

Categories: cs.CV

Abstract:
Reconstructing dynamic non-rigid objects from monocular video requires integrating visual cues from direct observations with data-driven priors over geometry and appearance. Prior approaches either learn to directly predict 4D representations from visual input or initialize a 3D representation that is subsequently deformed and refined based on video evidence. However, the former are constrained by the scarcity of 4D training data, while the latter leverage priors only for the initial reconstruction and rely solely on video supervision thereafter; neither handles complex in-the-wild scenarios with large deformations and occlusions well. We present Lift4D, a test-time optimization framework that addresses both limitations. First, we adapt an existing single-view 3D reconstruction model to yield temporally consistent per-frame predictions via causal latent conditioning, providing a coherent initialization for a deformable 3D Gaussian Splatting representation. We then ``sculpt'' this representation to match the input video through an occlusion-aware optimization that faithfully recovers visible surface details while completing unobserved regions using a view-conditioned diffusion prior. We demonstrate that Lift4D clearly improves over prior 4D reconstruction methods, particularly on challenging in-the-wild sequences with severe occlusions and non-rigid motion.

arXiv Page | PDF

Score: 0

Randomized YaRN Improves Length Generalization for Long-Context Reasoning

Published: 2026-06-22 17:59:53

Authors: Manas Mehta, Fangcong Yin, Greg Durrett

Categories: cs.CL

Abstract:
Large language models (LLMs) are typically pretrained on short sequences and then extended to work on longer sequences with additional training. However, such LLMs still struggle to further generalize to very long sequences. We propose Randomized YaRN, a training method that improves length generalization by combining YaRN-based positional extrapolation with randomized positional encoding and a length curriculum. During training on short context data, tokens are assigned YaRN positional encodings sampled from a larger position range, exposing the model to out-of-distribution positional representations even on short-context inputs. We evaluate Randomized YaRN on two challenging long-context reasoning benchmarks, BABILong and Multi-Round Coreference Resolution (MRCR). When training on data with <8K context, Randomized YaRN consistently improves reasoning performance on context lengths from 16K to 128K and outperforms standard fine-tuning, with the largest gains appearing at far out-of-distribution lengths. Our results suggest that progressively exposing models to OOD positional distributions provides an effective recipe for generalizable long-context reasoning.

arXiv Page | PDF

Score: 0

LIBERO-Safety: A Comprehensive Benchmark for Physical and Semantic Safety in Vision-Language-Action Models

Published: 2026-06-22 17:59:53

Authors: Rongxu Cui, Zongzheng Zhang, Jingrui Pang, Haohan Chi, Jinbang Guo, Saining Zhang, Shaoxuan Xie, Xin Jin, Yao Mu, Jiaolong Yang, Guocai Yao, Xianyuan Zhan, Ya-Qin Zhang, Hao Zhao

Categories: cs.RO

Abstract:
Despite the impressive manipulation capabilities of Vision-Language-Action (VLA) models, their operational safety under strict constraints remains largely unverified. To address this, we introduce a parametric safety benchmark to procedurally generate safety-critical scenarios with comprehensive stochasticity. To overcome the scalability bottlenecks of human teleoperation, we develop a novel keypose-driven data generation pipeline. Leveraging this infrastructure, we curate a large-scale dataset of 19,664 strictly collision-free demonstrations with extensive domain randomization. We then conduct a systematic cross-paradigm evaluation of eight VLA and two embodied foundation models. Our analysis reveals a critical generalization-safety tension: although high-diversity training fosters safer trajectories, task success remains fundamentally bottlenecked by sub-optimal trajectory synthesis and semantic misalignment. By providing a scalable pipeline, a robust dataset, and profound failure-mode insights, LIBERO-Safety establishes a crucial foundation for developing safe and reliable VLA models.

arXiv Page | PDF

Score: 0

LaST-HD: Learning Latent Physical Reasoning from Scalable Human Data for Robot Manipulation

Published: 2026-06-22 17:59:52

Authors: Jiaming Liu, Yinxi Wang, Chenyang Gu, Siyuan Qian, Xiangju Mi, Hao Chen, Jiawei Chen, Qingpo Wuwu, Xiaoqi Li, Nuowei Han, Yiming Zhang, Xuheng Zhang, Yang Yue, Yeqing Yang, Lei Wang, Peng Jia, Hao Tang, Shanghang Zhang

Categories: cs.RO

Abstract:
Human-hand demonstrations provide a direct and scalable source of physical interaction data for robot learning. While manual retargeting is indispensable for establishing kinematic action correspondence across different morphologies, robust transfer requires going beyond geometry to address the underlying alignment of physical dynamics between human and robot manipulation. To address this, we introduce LaST-HD, a novel human-to-robot action learning paradigm that extends reasoning-before-acting VLA by aligning human-hand and robot demonstrations in a shared latent reasoning space. Rather than mimicking human kinematics, LaST-HD trains an auxiliary action-conditioned world model on unpaired human-hand and robot trajectories to synthesize unified latent targets. After aligning cross-embodiment representations in this shared forward-dynamics space, these targets supervise LaST-HD's latent reasoning process, enabling it to internalize shared physical dynamics and drive efficient human-hand action learning. Moreover, we develop Out-of-Lab (OOL) Glove, a low-cost motion-capture glove tailored to LaST-HD for human-hand data collection. The captured human data provide precise keypoints and serve as universal action supervision across grippers and dexterous hands. Armed with the aligned latent space and high-fidelity human-hand data, we develop a progressive mixed-to-human training recipe comprising mixed human-robot co-training and human-hand online correction post-training. Through mixed co-training, LaST-HD improves generalization to novel objects, scenes, and positions using only human-hand demonstrations. With online correction, LaST-HD further adapts to novel environments and achieves over 90\% accuracy using only 20 minutes of OOL glove data.

arXiv Page | PDF

Score: 0

Keep The Essentials: Efficient Reference Conditioned Generation via Token Dropping

Published: 2026-06-22 17:59:28

Authors: Rishubh Parihar, Ayush Raina, R. Venkatesh Babu, Or Patashnik

Categories: cs.CV

Abstract:
Reference-based diffusion models enable highly controllable image generation by leveraging elements from input images to guide prompt-driven synthesis. However, these models are computationally expensive in runtime, and their cost scales severely with the number of input references. While the efficiency of diffusion models has been extensively studied in the context of prompt-driven generation, it remains largely under-explored in the realm of reference-based models. This setting presents unique challenges not addressed by methods focusing solely on generation. In particular, the wasteful representation of references as dense token grids offers significant opportunities for improvement. In this work, we present Sparse Context, a method for constructing sparse reference representations by retaining only a reduced subset of reference tokens. We observe that even without modifying the model, dropping a significant portion of reference tokens at inference time largely preserves its generation capabilities. To fully realize this potential, we fine-tune the model with random token dropping at varying ratios, encouraging robustness to partial reference representations. Crucially, this training strategy decouples the model from any specific token selection rule, allowing flexible control at inference time. At inference time, instead of random dropping, we apply task-aware token selection strategies that prioritize the most informative regions of the reference images, adapting the token budget to the input and task requirements. Extensive experiments show our method achieves a 4x increase in inference speed for multi-reference generation and an 2x for single reference generation. Importantly, this efficiency is achieved without compromising visual quality across both spatially-aligned editing and subject-driven generation.

arXiv Page | PDF

Score: 0

Tapered Language Models

Published: 2026-06-22 17:56:25

Authors: Reza Bayat, Ali Behrouz, Aaron Courville

Categories: cs.LG, cs.AI, cs.CL

Abstract:
Modern language models, including transformer, recurrent, and memory-based variants, share a common chassis: a stack of identical layers in which parameters are allocated uniformly across depth. This is a default inherited from the original transformer and largely unchanged since, yet a growing body of evidence suggests that layers contribute non-uniformly to the final output, with later layers refining the residual stream rather than transforming it. We ask whether parameter capacity should reflect this asymmetry. Our controlled experiment shows that, under a fixed budget, allocating more capacity to earlier layers and less to later layers improves perplexity over a uniform-width baseline, while the reverse allocation hurts. Building on this result, we introduce Tapered Language Models (TLMs), an architectural principle in which a parameter-bearing component is monotonically tapered across depth under a fixed total budget. MLPs are the natural site for this instantiation: they dominate parameter count across all modern LM families and expose width as a single, clean axis of variation. Across three model scales and four architectures (Transformer, Gated Attention, Hope-attention, and Titans), tapering MLP width via a smooth cosine schedule consistently improves perplexity and downstream benchmark performance over uniform baselines, at no additional parameter or compute cost. These findings establish depth-aware capacity allocation as a simple, architecture-agnostic axis of language model design, a free lever hidden in plain sight.

arXiv Page | PDF

Score: 0

On the Limits of Prompt-Conditioned Language Models as General-Purpose Learners

Published: 2026-06-22 17:52:59

Authors: David Mguni, Julian Ma, Jun Wang

Categories: cs.LG

Abstract:
Large Language Models (LLMs) are frequently portrayed as general-purpose solvers capable of solving arbitrary tasks. We argue that this view overlooks a fundamental constraint: language is a compressed and capacity-limited interface for conveying task information. Modelling User--System interaction as a bilevel \emph{cheap-talk} game, we analyse how latent tasks are encoded into prompts and reinterpreted under alignment and safety constraints. We introduce a conceptual decomposition separating task inference from execution and derive PAC-Bayes bounds that distinguish finite-sample estimation error from irreducible structural limitations. Our first main result establishes an \emph{expressivity floor}: language acts as a capacity-limited communication channel, and whenever the informational complexity of a task family exceeds the capacity of that channel, distinct tasks become unavoidably indistinguishable to the Solver, inducing a strictly positive error floor that cannot be eliminated by additional data, optimisation, or model scaling alone. We then establish an \emph{objective-misalignment floor}: when alignment constraints restrict the admissible output set, the User-ideal distribution may lie outside the feasible class, inducing an irreducible distortion. Together, these results yield a formal negative conclusion: prompt-conditioned LLMs are not universal problem solvers through prompting alone, as there exist task families for which correct behaviour is provably unattainable even in the infinite-data regime. More broadly, our analysis shows the limits of prompt-based generalisation arise from information-constrained communication and alignment-constrained objectives. This suggests that interfaces beyond natural language, including multimodal observations and, external memory, may reduce the inherent LLM limitations by increasing the task-relevant information available to the System.

arXiv Page | PDF

Score: 0

The strong Arnol'd chord conjecture for the boundary of a uniformly convex domain in $\mathbb{R}^{4}$

Published: 2026-06-22 17:48:32

Authors: Dylan Cant

Categories: math.SG

Abstract:
Following the idea of Jungsoo Kang and Jun Zhang, we prove the strong Arnol'd chord conjecture for the boundary of a uniformly convex domain in $\mathbb{R}^{4}$, using an ellipsoid embedding construction due to Oliver Edtmair. We prove a general structural result for Legendrians $L$ which are eventually equivariantly essential (E3), in the sense that the $k$th Gutt-Hutchings capacity $c_{k}(D^{*}TL)$ is infinite for $k$ large enough. We show that any E3 Legendrian in the boundary of a Liouville domain $Ω$ bounds a chord of length at most $\liminf c_{k}(Ω)/k$.

arXiv Page | PDF

Score: 0

A Resolution of Erdős Problem 550 on Tree versus Complete Multipartite Ramsey Numbers

Published: 2026-06-22 17:45:05

Authors: Eric Li

Categories: math.CO

Abstract:
We resolve Erdős Problem 550, originally asked as question (2) of Erdős, Faudree, Rousseau, and Schelp. Precisely, for fixed integers $k\geq 2$ and $1\leq m_1\leq \cdots \leq m_k$, we prove that, for every sufficiently large $n$ and every $n$-vertex tree $T$, $R(T,K_{m_1,\ldots,m_k}) \leq (k-1)(R(T,K_{m_1,m_2})-1)+m_1$. The proof combines a new off-Turán tree-embedding theorem with a compactness-and-rounding theorem for represented bounded-rank hypergraph obstructions. The embedding theorem follows from Szemerédi regularity and a local regular-matching embedding lemma of Hladký and Piguet. The compactness argument uses shadow hypergraphs to retain obstructions whose vertices escape along the limiting sequence.

arXiv Page | PDF

Score: 0

Lightweight Neural Framework for Robust 3D Volume and Surface Estimation from Multi-View Images

Published: 2026-06-22 17:39:18

Authors: Diego E. Farchione, Ramzi Idoughi, Peter Wonka

Categories: cs.CV

Abstract:
Accurate volume and surface area estimation is critical for diverse applications, from marine ecology to medical diagnostics. However, existing methods often suffer from high computational costs and poor performance with sparse and noisy data. We propose a fully feed-forward framework that regresses scale-normalized volume and surface area and their associated uncertainties directly from multi-view images. By fusing 3D point cloud reconstructions with view-aligned 2D features through a graph-based decoder, our model bypasses iterative optimization, ensuring exceptional scalability and rapid inference. Experimental results demonstrate that our approach outperforms state-of-the-art methods, particularly when operating with a low number of input images. Validated across coral monitoring, dietary analysis, and anthropometry, our proposed framework provides a robust, adaptable solution for quantitative shape analysis. This architecture provides a high-speed, scalable alternative for precise geometric estimation from visual data, maintaining high performance even in resource-constrained or sparse-view scenarios.

arXiv Page | PDF

Score: 0

Which Waveguide Network Realizes a Prescribed Transmission Profile? An Exact Forward Construction

Published: 2026-06-22 17:32:40

Authors: Tristan M. Lawrie

Categories: math-ph

Abstract:
We introduce an analytically invertible framework for wavefront construction based on the scattering properties of periodic waveguide networks governed by a gauge-shifted Helmholtz operator. By determining the exact transmission coefficients of the network, we express the lattice reactance as a Fourier expansion whose coefficients are analytically mapped onto the underlying graph architecture, allowing the required bond connections, refractive indices, lengths, and gauge phases to be determined directly from a prescribed target transmission coefficient. In contrast to conventional inverse-design approaches, the present formulation provides a closed-form route from desired wave transmission profiles to physically realisable structures. The framework extends naturally from one-dimensional angular filtering to two-dimensional image synthesis, where arbitrary transmitted intensity patterns are reconstructed through exact spectral control of the network scattering response.

arXiv Page | PDF

Score: 0

Flatness Preserves Instruction Following in Vision-Language-Action Models

Published: 2026-06-22 17:30:29

Authors: Haochen Zhang, Yonatan Bisk

Categories: cs.RO

Abstract:
Vision-language-action (VLA) models have the potential for open-world generalization by leveraging pretrained vision-language representations, yet downstream finetuning on limited robot data often degrades these representations, leading to brittle policies that ignore language instructions in favor of visual shortcuts, a failure mode we term instruction blindness. We hypothesize that standard finetuning with limited data applies gradients to a sparse set of points, which manifests as a sharp loss landscape with high-curvature minima. We propose to address this directly through flatness-preserving optimization while finetuning on the exact same data, where learning a flatter landscape results in a model more robust to perturbations in the weight space. Specifically, we demonstrate that simply applying sharpness-aware minimization during VLA finetuning significantly improves instruction following by over 60% across multiple simulation and real-world benchmarks without additional data, architectural modification, or retraining. We further analyze the effect of selective sharpness, quantify its effects, and show that our approach is complementary to existing guidance techniques. Project page can be found at https://haochenz11.github.io/papers/flatness-vla/.

arXiv Page | PDF

Score: 0

Learning Process Rewards via Success Visitation Matching for Efficient RL

Published: 2026-06-22 17:30:24

Authors: Raymond Tsao, Andrew Wagenmaker, Sergey Levine

Categories: cs.LG, cs.AI, cs.RO, stat.ML

Abstract:
In many modern applications of reinforcement learning (RL), the natural reward for a task of interest is inherently sparse: a reward of 0 is given everywhere except when the task is completed, when a reward of +1 is given. Training a policy to maximize such a sparse reward requires solving a challenging credit assignment problem, leading to slow or ineffective RL improvement. We propose a simple approach to transform a sparse outcome reward into a dense process reward. Our approach relies on training a discriminator to distinguish between previous successful and unsuccessful episodes, and using this discriminator to incentivize the RL-learned policy to match the state-action visitations of successful episodes, while avoiding those of unsuccessful episodes. By incentivizing the policy to match the visitations over all states, not just those that correspond to task success, this reward provides dense feedback on whether progress is being made towards task completion, and, we show, provably achieves this without changing the optimal policy. Focusing on finetuning of robotic control policies, we demonstrate that our approach leads to significantly faster RL finetuning performance on both simulated and real-world manipulation tasks, as compared to simply maximizing the sparse outcome reward.

arXiv Page | PDF

Score: 0

Ergodicity of the bicentralizer flow and Kadison's problem

Published: 2026-06-22 17:27:28

Authors: Amine Marrakchi

Categories: math.OA

Abstract:
We show that the relative bicentralizer flow of a type $\mathrm{III}_1$ irreducible subfactor with expectation is always ergodic. As a consequence, every irreducible subfactor with expectation in a factor with separable predual contains a maximal abelian subalgebra. This completes the solution to Kadison's problem on maximal abelian subalgebras from 1967.

arXiv Page | PDF

Score: 0

Pose Anything Anywhere:Model-free Object Poses from Arbitrary References

Published: 2026-06-22 17:23:57

Authors: Hongli Xu, Jiaqi Hu, Junwen Huang, Boyang Zhong, Peter KT Yu, Nassir Navab, Benjamin Busam, Slobodan Ilic

Categories: cs.CV

Abstract:
Estimating the 6D pose of unseen objects is a fundamental yet challenging problem for open-world robotics and embodied perception. Model-based methods are accurate but depend on CAD assets or heavy onboarding, while most model-free approaches are still limited to pairwise single-anchor matching and thus fail under occlusion and large viewpoint changes with low query-reference overlap. Therefore, we present PANY, a unified model-free framework that seamlessly supports both RGB and RGB-D inputs, operates on one or sparse pose-free reference views, and generalizes effectively to novel objects. Built on a multi-view transformer geometry backbone, PANY moves beyond pairwise matching by learning view-consistent geometry and cross-view alignment cues that remain stable under wide baselines and limited overlap. When additional unposed assist views are available, PANY aggregates them via pose-graph canonical registration to increase geometric coverage and reinforce the final pose. Extensive experiments show that PANY achieves state-of-the-art performance across multiple benchmarks, substantially outperforming existing model-free methods, improving pose accuracy by +12% on YCB-V and over +20% on LM-O. Furthermore, PANY consistently performs well under both single-reference and sparse-reference settings, demonstrating strong robustness in real-world environments.

arXiv Page | PDF

Score: 0

Why Machines Misread Pedagogical Quality: Human-Machine Alignment in LLM-Based Pretest Question Evaluation

Published: 2026-06-22 17:22:22

Authors: Pei-Yu Tseng, Mahir Akgun, Peng Liu

Categories: cs.HC

Abstract:
Designing effective pretest questions is challenging at scale: high-quality questions require careful calibration of openness, cognitive depth, and alignment with learning objectives, yet generating and evaluating them manually is time-consuming. We present an AI-assisted workflow for pretest question development that combines automated generation, rubric-based evaluation, and iterative selection. Because the workflow relies on machine evaluation to filter questions at scale, we investigate the alignment between human and machine judgments across a 2x2 design varying rubric operationalization and evaluation mode. Our findings show that human-machine disagreements are systematic rather than random, that rubric revision has a larger effect on alignment than rationale-first evaluation, and that the two interventions are complementary. These findings highlight that scalable AI-assisted pretesting depends not only on generation capability but on how pedagogical quality is operationalized for machine interpretation.

arXiv Page | PDF

Score: 0

DiT-Reward: Generative Representations for Text-to-Image Reward Modeling

Published: 2026-06-22 17:19:58

Authors: Yuanming Yang, Guoqing Ma, Bo Wang, Yuan Zhang, Wei Tang, Chenyi Li, Haoyang Huang, Nan Duan

Categories: cs.LG, cs.AI

Abstract:
Can representations learned for image generation also support the evaluation of generated images? We study text-to-image reward prediction as a downstream task of generative representation learning. To this end, we introduce DiT-Reward, which converts a pretrained text-to-image Diffusion Transformer into a reward model by processing near-clean image latents and aggregating text-conditioned image representations across transformer layers. Under the same training data mixture as HPSv3, DiT-Reward outperforms HPSv3 on all four evaluated preference benchmarks, reaching 85.6% on HPDv2 and 77.6% on HPDv3. When the generative backbone is frozen, a lightweight learned head can still extract meaningful preference predictions from its representations. Probing across depth further reveals that downstream reward performance is strongest in the middle-to-late layers and benefits from combining representations across different stages. We also observe consistent positive scaling with generative backbone capacity. Finally, when used to optimize Stable Diffusion 3.5 Large with Flow-GRPO, DiT-Reward outperforms HPSv3 along the matched training trajectory, with particularly clear gains in realism. Direct latent scoring also achieves a 1.65x inference speedup over HPSv3 with comparable peak memory. These results show that pretrained generative DiTs provide transferable representations for reward modeling and policy optimization.

arXiv Page | PDF

Score: 0

Learning to See While Learning to Act: Diffusion Models for Active Perception in Robot Imitation

Published: 2026-06-22 17:19:57

Authors: Kuancheng Wang, Vaibhav Saxena, Shuo Cheng, Yotto Koga, Danfei Xu

Categories: cs.RO

Abstract:
Most imitation learning methods assume full observability in table-top settings. In practice, objects are often occluded, requiring robots to both search and act, and learning this coupled behavior from limited demonstrations remains challenging. We propose See2Act, an imitation learning approach that conditions action prediction on a sequence of actively-inferred viewpoints at test time, by coupling action denoising with viewpoint refinement. The policy is trained using camera poses anchored to keyframe actions from offline demonstrations, enabling implicit learning of where to see, while learning how to act. We empirically demonstrate that in Ravens the policy recovers informative viewpoints under severe occlusions, and on RLBench tasks it improves performance by up to 34% over prior methods. In the real world, we collect 50 demonstrations in a digital twin and achieve zero-shot sim-to-real transfer on pick-and-place tasks using depth observations. The policy handles significant occlusions, showing that learned viewpoint reasoning enables robust manipulation under partial observability.

arXiv Page | PDF

Score: 0

Autonomous Subsea Cable Search and Tracking with Graph-Optimised Priors and Visual Tracking

Published: 2026-06-22 17:08:10

Authors: Ibrahim Fadhil Djauhari, Adrian Bodenmann, Samuel Simmons, Cailei Liang, David White, Susan Gourvenec, Tom Bennetts, Darryl Newborough, Blair Thornton

Categories: cs.RO, cs.CV, eess.SY

Abstract:
Global communications rely on subsea cable infrastructure that remains vulnerable to damage from natural hazards and human activity. Autonomous underwater vehicles (AUVs) offer an efficient means to inspect long sections of exposed cable, but uncertainty in cable route maps, small cable diameters and partial burial makes continuous tracking a challenge. This paper presents a novel cable search and tracking method that leverages uncertain prior cable route maps. Graph-based optimisation continuously update the cable route to remain consistent with visual observations. Route uncertainty is constrained as a function of distance from observations using physics-based catenary models that account for cable parameters (i.e., lay depth, diameter, and density), bounding the search space to physically feasible regions and improving search efficiency. Cable detection is performed using a semi-supervised classifier running in real-time on-board a camera-equipped AUV. These detections both update the graph-based optimisation and enable visual cable tracking. When tracking is lost due to misclassification, burial or imperfect control, the bounded search space enables efficient recovery. The approach was demonstrated in field trials using the University of Southampton's Smarty200 AUV. The system successfully located the cable despite deliberate errors in it initial cable route map, updating this to be consistent with observations and using visual tracking to inspect up to 59% of a 120m test cable, with successful recovered after tracking loss.

arXiv Page | PDF

Score: 0

SPIRAL: Learning to Search and Aggregate

Published: 2026-06-22 17:02:09

Authors: Jubayer Ibn Hamid, Ifdita Hasan Orney, Michael Y. Li, Omar Shaikh, Yoonho Lee, Dorsa Sadigh, Chelsea Finn, Noah Goodman

Categories: cs.AI

Abstract:
Language model reasoning can be substantially improved at test time via scaffolds that scale inference compute across different primitives -- sequential reasoning within a trace, independently sampled parallel traces, and aggregation of multiple reasoning traces into a final response. During post-training, however, language models are optimized only for sequential reasoning within a single trace. We introduce Sequential-Parallel-Aggregative Reinforcement Learning (SPIRAL), a framework in which a language model is trained to use all three primitives, as part of a unified inference compute pipeline. Concretely, the language model first samples a set of independent traces in parallel, each produced through sequential chain-of-thought reasoning, and then generates a final aggregation trace conditioned on those traces; all components are optimized end-to-end against the reward of the final aggregated response. To train this system, SPIRAL uses set reinforcement learning to teach models to produce a set of traces that are collectively useful for an aggregator and standard reinforcement learning to teach models to aggregate the set into improved final responses. Our experiments on reasoning tasks show that SPIRAL effectively scales with inference compute, outperforming GRPO by up to 11$\times$ scaling efficiency and 15% higher performance when all three compute primitives are scaled.

arXiv Page | PDF

Score: 0

KEMO: Event-Driven Keyframe Memory for Long-Horizon Robot Manipulation with VLA Policies

Published: 2026-06-22 16:57:43

Authors: Yihan Zeng, Minghao Ye, Yiyuan Chen, Yide Shentu, Philipp Wu, Zike Yan, Zhongyu Li

Categories: cs.RO

Abstract:
Long-horizon robot manipulation remains challenging because similar observations may occur at different execution stages, while the appropriate action depends on previously completed operations. Memory can address this ambiguity by enabling policies to infer task progress from execution history. However, existing memory-augmented approaches often either retain dense histories that require compression or rely primarily on recent context that may discard earlier task-relevant events. In this work, we propose propose KEMO, a lightweight plug-in memory framework that automatically selectively preserves keyframes associated with task-relevant state changes for VLA policies. KEMO combines robot kinematics with visual filtering to detect events, encodes the selected keyframes as compact temporally ordered memory tokens, and integrates them with current visual features through cross-attention and gated residual fusion for VLA training. The detected events also define higher-weight training samples near critical transitions. We evaluate KEMO on various real-world dual-arm manipulation tasks spanning 2 to 6 scored subtasks, and trajectory length ranging from 830 steps to 2846 execution steps (durations from 28 to 95 seconds). Compared with the memory-free baseline (e.g., $π_{0.5}$), KEMO improves aggregate Task Success Rate by 23.6\% and Stage Completion Rate by 34.1\%. Ablations show that event-driven keyframe selection outperforms uniform sampling and recent-frame retention, while the proposed gated fusion and keyframe-aligned loss weighting provide complementary gains.

arXiv Page | PDF

Score: 0

A Generative Model for Closed-Loop Microsimulation of Signalized Intersections

Published: 2026-06-22 16:57:10

Authors: Yash Ranjan, Rahul Sengupta, Anand Rangarajan, Sanjay Ranka

Categories: cs.RO, cs.AI

Abstract:
Traffic microsimulators rely on hand-crafted behavior models that reproduce aggregate flow but miss the heterogeneous interactions between vehicles at signalized intersections. Learned trajectory predictors capture richer interactions but are short-horizon and tend to be unstable when run in closed loop. We present Enactor, an actor-centric generative model for closed-loop intersection microsimulation. The model focuses on vehicles; pedestrians are included as context that can influence vehicle decisions but not predicted. Dynamic actors and lane polylines are encoded in polar coordinates referenced to the intersection center. A transformer with separate spatial and temporal attention blocks predicts a distribution over each actor's next-step motion ($s$, $α$). Training uses a closed-loop curriculum so the model is exposed to its own predictions. We evaluate Enactor in two regimes. In a 4000-second simulation-in-the-loop test at two intersection geometries, Enactor controls every dynamic vehicle against a continuously refreshing actor set rather than the fixed cohort that learned trajectory predictors are usually evaluated against. It recovers the SUMO data generator's speed and travel-time distributions with KL divergence over an order of magnitude lower than a recent transformer baseline on travel time, and substantially lower on speed (roughly $5\times$ lower at Site 1), and reduces red-light violations relative to the same baseline by more than an order of magnitude. An ablation isolates the leader rear-bumper feature as the change with the largest effect on intersection-aware safety metrics. We also evaluate on real-world field data and apply the same architecture to naturalistic vehicle trajectories from a fish-eye camera at a signalized intersection and evaluate it on multi-horizon predictive tasks. Enactor outperforms a constant-velocity baseline at every horizon evaluated.

arXiv Page | PDF

Score: 0

Kamera: Unified Position-Invariant Multimodal KV Cache for Training-Free Reuse

Published: 2026-06-22 16:47:00

Authors: Bole Ma, Jan Eitzinger, Harald Koestler, Gerhard Wellein

Categories: cs.DC, cs.AI, cs.CV

Abstract:
Multimodal agents repeatedly re-examine the same video frames, UI screenshots, and rendered artifacts as their context window slides and reasoning iterates, yet every look-back re-encodes from scratch, because prefix caches serve reuse only at a fixed leading position. We show this recompute is avoidable, and identify exactly what naive KV reuse loses: the cross-chunk conditioning a chunk absorbs from its neighbours. This loss is asymmetric. The direct readout of a cached chunk is recovered exactly and for free by the standard state-merge. What remains is a diffuse, low-rank residue concentrated in deep layers, invisible to single-hop retrieval but precisely what multi-hop reasoning binds on. Blind reuse therefore leaves single-hop recall intact while halving multi-hop accuracy; this is the failure mode prior position-independent caches, designed for single-context or single-image reuse, do not address. We repair it with a small, training-free low-rank conditioning patch stored alongside each position-free chunk. Reuse reduces to one operator across MLA, GQA, and MHA: exact RoPE re-rotation to any target position, plus the patch that restores cross-chunk binding. This makes three window operations cheap: reorder (one patch serves every ordering of a cached set), sliding-window survival (surviving chunks relocate via rotation only, zero re-encode), and recall (an evicted chunk is rehydrated by its patch, never re-encoded). A rank-m patch recovers full task accuracy on cross-chunk-binding benchmarks, MM-NIAH across two attention families and two-page doc-QA, at a fraction of the KV footprint, and reconstructs re-prefill KV to within bf16 rounding in a production SGLang kernel across six backbones. The conditioning signal is strongest in redundant vision and video streams, making our solution most impactful where multimodal agents spend their recompute budget.

arXiv Page | PDF

Score: 0

Quantification of the Flavor Diagonal Hadronic CP Violation

Published: 2026-06-22 16:46:22

Authors: Nodoka Yamanaka

Categories: hep-ph, hep-ex, hep-th, nucl-ex, nucl-th

Abstract:
The flavor diagonal CP violation of elementary particle physics contributes to the atomic, nuclear, and nucleon electric dipole moments (EDMs), T-violating neutron optics, and to the angular correlations of beta decay. In this contribution, we review the basics and the importance of CP violation in the search for new physics beyond the standard model, the recent progress in the quantification of the hadron level CP violation contributing to the aforementioned observables, and finally the current attempt to solve the strong CP problem without additional interactions and fields.

arXiv Page | PDF

Score: 0

Irrelevance of Anomalous Breaking of Axial U(1) Symmetry and the U(1) Problem

Published: 2026-06-22 16:46:15

Authors: Nodoka Yamanaka

Categories: hep-ph, hep-ex, nucl-th

Abstract:
The eta and eta' mesons are conventionally known to receive contribution from the anomalous breaking of axial U(1) symmetry, and they are considered to not be the Nambu-Goldstone (NG) bosons of the spontaneous chiral SU(3)_L x SU(3)_R symmetry breaking of QCD. However, it has recently been shown that this axial U(1) anomaly is not actually physical. In this contribution, we first review this statement and then propose a mechanism in which eta and eta' mesons are indeed NG bosons while being consistent with the axial U(1) problem.

arXiv Page | PDF

Score: 0

Genuine Global Kochen-Specker Contextuality as Classical Coordination Cost

Published: 2026-06-22 16:45:23

Authors: Ming Yang

Categories: quant-ph, cs.CC

Abstract:
Classical simulations of quantum correlations can fail because no low-communication local hidden-variable model exists, or because no single noncontextual hidden state can explain all compatible measurement contexts. This manuscript studies a third regime: genuine global Kochen-Specker contextuality, where local subsystems are noncontextual and the tested multipartite blocks are generalized-Bell-local, but the whole empirical model admits no global noncontextual hidden-variable explanation. We propose a coordination-cost framework in which communication, memory, and local computation are treated as different ways for a classical simulator to maintain a global classical explanation from locally available information. We introduce coordination bits, global contextual covering numbers, scaling laws for task families, and an abstract lifting theorem showing how classical simulation lower bounds for KS-contextual seed families can be transferred to genuinely global-KS models. As worked examples, we analyze a polarization-path Hardy obstruction and postselected KCBS-type tasks.

arXiv Page | PDF

Score: 0

A Spectral Theory of Normalized Corrected GNN Propagation

Published: 2026-06-22 16:38:50

Authors: Qihan Chen, Wei Li, Meng Qin, Jianfeng Hou

Categories: cs.LG

Abstract:
We develop a spectral theory for \emph{normalized corrected GNN propagation}. The object of study is the symmetric normalized adjacency with its degree-stationary component removed, matching the normalization used by standard GCN-style models while isolating the stationary direction most directly tied to oversmoothing. The central theoretical question is whether this corrected normalized operator preserves class-discriminative signal after many propagation layers. Our main result is a high-probability exact-recovery theorem for the binary Contextual Stochastic Block Model after \(k=O(\log n)\) propagation steps in the dense polylogarithmic regime \(p\ge C\log^B n/n\), for any fixed \(B>4\), under explicit graph-signal and feature-SNR conditions. We also establish a multi-class partial recovery theorem showing contraction toward class centers for most nodes. Synthetic and real node-classification experiments are included as empirical checks of the theory's predicted dependence on depth, graph signal, and feature noise.

arXiv Page | PDF

Score: 0

HoloAgent-0: A Unified Embodied Agent Framework with 3D Spatial Memory

Published: 2026-06-22 16:31:48

Authors: Xiaolin Zhou, Liu Liu, Tingyang Xiao, Wei Feng, Fa Fu, Xinrui Meng, Xinjie Wang, Jialiang Han, Boyang Yu, Yun Du, Wei Sui, Zhizhong Su

Categories: cs.RO, cs.CV

Abstract:
LLM agents follow a practical execution loop in digital environments: they reason over structured states, invoke tools, inspect feedback, and revise actions. Extending this loop to physical robots is difficult because physical execution is continuous, embodiment-dependent, uncertain, and constrained by safety. Existing embodied-AI systems have advanced manipulation, spatial understanding, navigation, and humanoid control, but these capabilities often remain specialized modules or loosely coupled decision loops. In this work, we introduce HoloAgent-0, a unified embodied agent framework for real-world robot deployment. Embodied AgentOS converts language instructions into executable skill graphs, schedules robot resources, monitors execution, and triggers clarification or re-planning from runtime feedback. HoloAgent-0 organizes heterogeneous robot models and controllers through three coupled layers: Embodied AgentOS for closed-loop execution, 3D spatial memory for physical world grounding, and embodied skills for robot action. We deploy HoloAgent-0 on real hardware and evaluate its spatial memory, long-horizon navigation, and closed-loop execution across motion generation, object search, cross-robot coordination, and mobile manipulation.

arXiv Page | PDF

Score: 0

Computing Gaussian and exponential integrals in ${\Bbb R}^n$

Published: 2026-06-22 16:28:05

Authors: Alexander Barvinok

Categories: cs.DS, math-ph, math.CA, math.PR

Abstract:
We consider expectations of the type $E\ \exp \left\{\sum_{i=1}^m φ_i \right\}$, where $φ_i: {\Bbb R}^n \longrightarrow {\Bbb C}$ are functions, each depending on a few coordinates of a point in ${\Bbb R}^n$, and the expectation is taken with respect to the standard Gaussian or symmetric exponential probability measures. We prove sufficient conditions, in terms of the Lipschitz constants of $φ_i$ and the combinatorics of their dependencies, for the integral to be separated from 0, and, consequently, to be amenable to a computationally efficient approximation. We discuss applications to computing volumes of bodies and statistics on integer points in polyhedra in ${\Bbb R}^n$.

arXiv Page | PDF

Score: 0

Exponential stability of abstract boundary-coupled positive systems

Published: 2026-06-22 16:23:03

Authors: Yassine El Gantouh, Mahyar Mahinzaeim

Categories: math.OC

Abstract:
In this paper, we study the well-posedness and exponential stability of a class of abstract boundary-coupled positive systems. Within a general semigroup framework, we establish well-posedness in terms of existence, uniqueness, and regularity of solutions. More concretely, we derive some simple and readily verifiable tests on the boundary coupling operators for the verification of the well-posedness of the coupled system and, at the same time, provide spectral criteria that guarantee exponential stability of the solutions. The analysis is based on two complementary approaches: the feedback-theoretic method and perturbation techniques for resolvent positive operators on Banach lattices (i.e., Banach spaces equipped with a compatible order structure). This unified framework allows for a systematic treatment of a broad class of boundary couplings in abstract dynamical systems. Several examples are presented to illustrate the applicability of the theoretical results.

arXiv Page | PDF

Score: 0

AwakeForest: An Interactive Geospatial Platform for Large-Scale Forest Imagery

Published: 2026-06-22 16:17:29

Authors: Suraj Prasai, Kangning Cui, Rongkun Zhu, Sarra Alqahtani, Ying Zhang, Victor Paul Pauca, Miles R. Silman, Fan Yang

Categories: cs.CV, cs.SE

Abstract:
Forest imagery analysis often involves multiple tightly coupled vision tasks, which must be performed under substantial variation in geographic regions, sensors, and acquisition conditions. However, practitioners often lack a unified tool that is geospatial-native, cloud-optimized, and ML-integrated for end-to-end workflows spanning annotation, prediction, visualization, and downstream analysis at scale. We present AwakeForest, an interactive end-to-end platform designed for large-scale forest imagery that integrates model-assisted inference, automatic annotation, and human-in-the-loop refinement within a single workflow. Our platform supports plug-and-play integration of pretrained models and enables scalable interaction with forest imagery ranging from standard aerial scenes to large orthomosaics that can span several gigabytes to hundreds of gigabytes. AwakeForest produces analysis-ready outputs that can be directly used for downstream analysis and to support iterative model and annotation updates on new scenes. We demonstrate the system on the PALMS dataset and illustrate how AwakeForest supports an end-to-end workflow for practical forest management and analysis.

arXiv Page | PDF

Score: 0

SQLConductor: Search-to-Policy Learning for Step-wise Text-to-SQL Orchestration

Published: 2026-06-22 16:13:39

Authors: Yizhang Zhu, Zhangyang Peng, Boyan Li, Yuyu Luo

Categories: cs.DB, cs.AI, cs.LG

Abstract:
Text-to-SQL enables users to access relational databases via natural language, but real-world settings remain challenging due to coordinated reasoning over complex database environments. Existing systems often use multi-stage pipelines or reasoning models specialized for individual stages. However, fixed pipelines rely on predefined stage orders, limiting their adaptivity to query demands and intermediate evidence. Recent orchestration-based methods provide flexibility by composing specialized modules for each query, but typical plan-then-execute approaches still commit to a complete workflow before execution and cannot adapt to intermediate artifacts and feedback. In this paper, we propose SQLConductor, a step-wise orchestration learning framework for Text-to-SQL. SQLConductor formulates Text-to-SQL subtasks as specialized actions for workflow composition and trains a policy model to select the next action based on intermediate artifacts and feedback. To learn this policy, SQLConductor introduces Search-to-Policy Learning, which uses Monte Carlo Tree Search to explore candidate workflows and stability estimation to identify robust supervision. The policy model is trained with Stability-weighted Supervised Fine-tuning to prioritize high-quality orchestration patterns and further enhanced through Curriculum Reinforcement Learning. This transforms offline workflow search into a deployable policy for step-wise orchestration at inference time. Experiments on BIRD-Dev and out-of-distribution datasets show that SQLConductor achieves superior execution accuracy and strong generalization, reaching 73.2% EX on BIRD-Dev with a compact orchestration policy coordinating frozen larger action models, outperforming prior methods that directly train comparable or larger Text-to-SQL backbones. Further analyses show that the learned policy adapts orchestration to diverse query demands.

arXiv Page | PDF

Score: 0

Towards an Automated Reasoning Tool for Complexity Analysis of Automated Reasoners

Published: 2026-06-22 16:03:34

Authors: Louis Rustenholz, Manuel V. Hermenegildo, Pedro Lopez-Garcia, Alessio Mansutti, Félix Ridoux, Niki Vazou

Categories: cs.LO

Abstract:
We present the theory underpinning a complexity analysis tool (under development) that aims at automating tedious parts of the analysis of complex algorithms originating from the field of automated reasoning. Examples are given by super-exponential quantifier elimination procedures in real and integer arithmetics. Our tool implements the following pipeline. * Together with the algorithm to be analysed, the user (expert, e.g. the algorithm designer) can provide key metrics to track, and lemmas to improve the analysis. In pen-and-paper proofs, these correspond to the "non-tedious" and "creative" parts of the complexity analysis, that require human ingenuity. * The second step consists in the extraction of (generalised) recurrence equations. Here, we rely on a novel higher-order abstract interpretation technique, built on the concept of operator semantics. It enables (optimal) abstract compilation of symbolic programs to different kinds of purely numerical recursive representations, such as recurrence equations on interval-valued functions or numerical logic programs. * Finally, our tool solves the recurrence equations. We propose to go beyond the direct usage of computer algebra systems (CAS), and use pre/postfixpoint-based techniques to discover and verify candidate bounds on the solution. This approach makes use, in turn, of recent progress in SMT solvers, and can also be improved by techniques originating in termination analysis research.

arXiv Page | PDF

Score: 0

Atomic-scale theory of robust out-of-plane ferroelectricity in ultrathin films

Published: 2026-06-22 15:56:23

Authors: Fengbo Yuan, Yujia Teng, Karin M. Rabe, Yubo Qi

Categories: cond-mat.mtrl-sci

Abstract:
Ferroelectricity in ultrathin films, characterized by robust switchable out-of-plane polarization, is key to next-generation nanoelectronics. Although the macroscopic theory of ferroelectricity suggests that ferroelectricity is inevitably suppressed as the film thickness decreases, recent studies have demonstrated robust ultra thin-film ferroelectricity, for certain ferroelectric materials, specifically HfO$_2$-based oxides and bismuth-based oxides. In this work, we develop an atomic-scale theoretical framework for understanding ferroelectricity in this limiting regime. By considering the work function of the termination layers of the film, we find that robust ferroelectricity arises from ``self-polarizing'' and ``switchable role of the termination layer'' effects strongly correlated to the ``characteristic structure.'' This theory also provides further insights on the importance of top electrodes in stabilizing ferroelectricity for this class of materials in the ultrathin limit. This work aims to develop a comprehensive theoretical framework for thin-film ferroelectricity, providing fundamental insights that can guide the design of next-generation nanoscale devices.

arXiv Page | PDF

Score: 0

Analysis of quantum-related courses and textbooks for potential integration of quantum sensing

Published: 2026-06-22 15:56:03

Authors: Namitha Pradeep, Ben Zwickl

Categories: physics.ed-ph

Abstract:
The second quantum revolution is driving advancements in quantum computing, communication, and sensing. While quantum computing has gained significant attention in education, quantum sensing remains largely overlooked. In order to find ways of integrating sensing into existing quantum-related curricula, we performed an analysis of six of the most commonly used textbooks in modern physics, quantum mechanics, and quantum computing. We identified a set of keywords related to quantum sensing and tagged all excerpts in which these concepts appeared. For each excerpt, we also recorded the context in which it was presented within the textbook. Network maps were constructed to visualize which keywords appeared in which contexts and the frequency of these occurrences within each subject area. We then developed an analytic rubric to evaluate the conceptual and mathematical depth of these excerpts as well as the extent of sensing-related discussions. Our results show that there is significant variation in how different subjects address these concepts, both in the nature of the content covered and the depth of coverage. We also observe notable differences in how spin-first and position-first textbooks discuss these keywords, particularly in the coverage and contexts in which core concepts such as superposition and entanglement appear. Additionally, we analyzed course titles and descriptions from a database of over 8,000 quantum-related courses, focusing on those that mentioned ``sensing'' or ``sensor''. Together, these analyses of textbook content and course descriptions inform the quantum information science and engineering education community about potential opportunities to integrate quantum sensing topics into quantum-related courses in physics and adjacent disciplines.

arXiv Page | PDF

Score: 0