Papers

12094 papers

ICLR2025

NovelQA: Benchmarking Question Answering on Documents Exceeding 200K Tokens

Summary pending...

Long-contextLarge Language ModelsQuestion Answering
ICLR2025

Stealthy Shield Defense: A Conditional Mutual Information-Based Approach against Black-Box Model Inversion Attacks

Summary pending...

AI securitymodel inversion attackinformation bottleneck
ICLR2025

Arithmetic Transformers Can Length-Generalize in Both Operand Length and Count

Summary pending...

Length GeneralizationTransformersScratchpad
ICLR2025

TabReD: Analyzing Pitfalls and Filling the Gaps in Tabular Deep Learning Benchmarks

Summary pending...

Tabular DataBenchmarksReality Check
ICLR2025

Be More Diverse than the Most Diverse: Optimal Mixtures of Generative Models via Mixture-UCB Bandit Algorithms

Summary pending...

Multi-Armed BanditsEvaluation of generative modelsKernel-based evaluation scores
ICLR2025

LASeR: Towards Diversified and Generalizable Robot Design with Large Language Models

Summary pending...

Robot Design AutomationLarge Language ModelVoxel-Based Soft Robot
ICLR2025

Long Context Compression with Activation Beacon

Summary pending...

Context CompressionLong Context LLMsLLM Memory
ICLR2025

Understanding and Enhancing Safety Mechanisms of LLMs via Safety-Specific Neuron

Summary pending...

Large Language ModelsAlignmentSafety
ICLR2025

SOO-Bench: Benchmarks for Evaluating the Stability of Offline Black-Box Optimization

Summary pending...

Offline OptimizationBlack-Box OptimizationStability
ICLR2025

Distribution-Free Data Uncertainty for Neural Network Regression

Summary pending...

deep learninguncertainty quantificationregression uncertainty
ICLR2025

OCEAN: Offline Chain-of-thought Evaluation and Alignment in Large Language Models

Summary pending...

chain-of-thoughtlarge language modelsoffline policy evaluation
ICLR2025

Asymptotic Analysis of Two-Layer Neural Networks after One Gradient Step under Gaussian Mixtures Data with Structure

Summary pending...

deep learning theoryrandom featuresGaussian equivalence
ICLR2025

NetMoE: Accelerating MoE Training through Dynamic Sample Placement

Summary pending...

Mixture of ExpertsAll-to-All communicationDistributed training
ICLR2025

Rapid Selection and Ordering of In-Context Demonstrations via Prompt Embedding Clustering

Summary pending...

in-context learningorder sensitivityLLMs
ICLR2025

Think Then React: Towards Unconstrained Action-to-Reaction Motion Generation

Summary pending...

Human Reaction Generation3D Human MotionLarge Language Model
ICLR2025

UniCBE: An Uniformity-driven Comparing Based Evaluation Framework with Unified Multi-Objective Optimization

Summary pending...

evaluationefficientscalability
ICLR2025

Language Imbalance Driven Rewarding for Multilingual Self-improving

Summary pending...

Large Language ModelSelf-ImprovingMultilinguality
ICLR2025

HyperFace: Generating Synthetic Face Recognition Datasets by Exploring Face Embedding Hypersphere

Summary pending...

Face RecognitionHypersphere OptimizationPrivacy
ICLR2025

Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation

Summary pending...

Web AgentWorld ModelDigital Agent
ICLR2025

DPLM-2: A Multimodal Diffusion Protein Language Model

Summary pending...

protein foundation modeldiffusion language modelmultimodal language model