Papers

12094 papers

ICLR2025

Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models

Mechanistic Interpretability, Hallucinations, Language Models
ICLR2025

Provable weak-to-strong generalization via benign overfitting

benign overfitting, spiked covariance models, overparameterized models
ICLR2025

HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics

math, benchmark, dataset
ICLR2025

STAR: Synthesis of Tailored Architectures

alternative architectures, deep signal processing, language models
ICLR2025

Last-Iterate Convergence Properties of Regret-Matching Algorithms in Games

Last-Iterate Convergence, Minty solution, Regret Matching
ICLR2025

HQ-Edit: A High-Quality Dataset for Instruction-based Image Editing

Image-to-image translation, Image Editing
ICLR2025

MCNC: Manifold-Constrained Reparameterization for Neural Compression

Model Compression, LoRA, PEFT
ICLR2025

Residual-MPPI: Online Policy Customization for Continuous Control

Policy customization, Combination of learning- and planning-based approaches, Model predictive control
ICLR2025

Machine Unlearning Fails to Remove Data Poisoning Attacks

machine unlearning, data poisoning
ICLR2025

RankSHAP: Shapley Value Based Feature Attributions for Learning to Rank

Feature attributions, Shapley values, Information Retrieval
ICLR2025

Misspecified $Q$-Learning with Sparse Linear Function Approximation: Tight Bounds on Approximation Error

misspecification error, reinforcement learning theory, sample complexity
ICLR2025

Steering Large Language Models between Code Execution and Textual Reasoning

Large Language Models, Code Interpreter, Code/text generation
ICLR2025

Can Large Language Models Understand Symbolic Graphics Programs?

Large Language Models, Symbolic Graphics Programs
ICLR2025

Beyond Worst-Case Dimensionality Reduction for Sparse Vectors

dimensionality reduction, sparsity, johnson lindenstrauss
ICLR2025

First-Person Fairness in Chatbots

fairness, large language models, chatbots
ICLR2025

Tamper-Resistant Safeguards for Open-Weight LLMs

ai safety, large language models, tamper-resistance
ICLR2025

E(n) Equivariant Topological Neural Networks

Topological Deep Learning, Equivariance, Equivariant Neural Networks
ICLR2025

eQMARL: Entangled Quantum Multi-Agent Reinforcement Learning for Distributed Cooperation over Quantum Channels

quantum machine learning, multi-agent reinforcement learning, quantum entanglement
ICLR2025

Reconciling Model Multiplicity for Downstream Decision Making

model multiplicity, multi-calibration, decision-making
ICLR2025

Transformer Block Coupling and its Correlation with Generalization in LLMs

large language models, transformers, hidden representations