420

arXiv:2512.13102v2 Announce Type: replace
Abstract: Large Language Models (LLMs) excel at static interactions, where they answer user queries by retrieving knowledge encoded in their parameters. However, in many real-world settings, such as educational tutoring or medical assistance, relevant infor…
342

arXiv:2503.07982v3 Announce Type: replace
Abstract: High-quality instance and panoptic segmentation has traditionally relied on dense instance-level annotations such as masks, boxes, or points, which are costly, inconsistent, and difficult to scale. Unsupervised and weakly-supervised approaches red…
329

arXiv:2512.19331v1 Announce Type: new
Abstract: Whole Slide Images (WSIs) are typically analyzed using multiple instance learning (MIL) methods. However, the scale and heterogeneity of WSIs generate highly redundant and dispersed information, making it difficult to identify and integrate discrimina…
332

arXiv:2508.06831v2 Announce Type: replace
Abstract: Adapting person re-identification (reID) models to new target environments remains a challenging problem that is typically addressed using unsupervised domain adaptation (UDA) methods. Recent works show that when labeled data originates from sever…
319

Demis Hassabis, CEO of Google DeepMind, summed it up in three words: “This is embarrassing.”   Hassabis was replying on X to an overexcited post by Sébastien Bubeck, a research scientist at the rival firm OpenAI, announcing that two mathematicians had used OpenAI’s latest large language model, GPT-5…
320

arXiv:2512.18073v1 Announce Type: new
Abstract: Multimodal LLMs (MLLMs) have gained significant traction in complex data analysis, visual question answering, generation, and reasoning. Recently, they have been used for analyzing the biometric utility of iris and face images. However, their capabili…
232

Join Stephen Cass, Dina Genkina, and Kohava Mendelsohn as they discuss whether AI spells the end of distinct programming languages as we know it. IEEE Spectrum publishes a respected annual ranking of the year’s Top Programming Languages—but could this year be our last? This recording of the live web…
222

arXiv:2512.18068v1 Announce Type: new
Abstract: Imitation learning (IL) has shown immense promise in enabling autonomous dexterous manipulation, including learning surgical tasks. To fully unlock the potential of IL for surgery, access to clinical datasets is needed, which unfortunately lack the ki…
222

arXiv:2507.17383v2 Announce Type: replace
Abstract: Trustworthy robot behavior requires not only high levels of task success but also that the robot can reliably quantify how likely it is to succeed. To this end, we present a first-of-its-kind study of confidence calibration in vision-language-acti…
227

arXiv:2512.18551v1 Announce Type: new
Abstract: In language modeling, neologisms are new tokens trained to represent a concept not already included in a given model's vocabulary. Neologisms can be used to encourage specific behavior in models, for example by appending prompts with "Give me a neolog…
213

Lately, everywhere I scroll, I keep seeing the same fish-eyed CCTV view: a grainy wide shot from the corner of a living room, a driveway at night, an empty grocery store. Then something impossible happens. JD Vance shows up at the doorstep in a crazy outfit. A car folds into itself like paper and dr…
213

arXiv:2512.18312v1 Announce Type: new
Abstract: The creation of high-fidelity, physically-based rendering (PBR) materials remains a bottleneck in many graphics pipelines, typically requiring specialized equipment and expert-driven post-processing. To democratize this process, we present MatE, a nov…