Papers

12094 papers

ICLR2025

Constraint-Conditioned Actor-Critic for Offline Safe Reinforcement Learning

Summary pending...

Offline Safe Reinforcement LearningConstraint-conditioned Actor-CriticData Generation

Paper

ICLR2025

Conformal Language Model Reasoning with Coherent Factuality

Summary pending...

language modelsreasoningconformal prediction

Paper

ICLR2025

NeSyC: A Neuro-symbolic Continual Learner For Complex Embodied Tasks In Open Domains

Summary pending...

Embodied AINeuro-symbolic AI

Paper

ICLR2025

ACC-Collab: An Actor-Critic Approach to Multi-Agent LLM Collaboration

Summary pending...

Multi-Agent CollaborationLLM AgentsPreference Optimization

Paper

ICLR2025

Dissecting Adversarial Robustness of Multimodal LM Agents

Summary pending...

LM agentsmultimodal agentssafety

Paper

ICLR2025

SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators

Summary pending...

Model CompressionLarge Language ModelsPost-Training Quantization

Paper

ICLR2025

CL-MFAP: A Contrastive Learning-Based Multimodal Foundation Model for Molecular Property Prediction and Antibiotic Screening

Summary pending...

Contrastive LearningMultimodal Foundation ModelAntibiotic Property Prediction

Paper

ICLR2025

Towards Principled Evaluations of Sparse Autoencoders for Interpretability and Control

Summary pending...

mechanistic interpretabilitysparse autoencodersevaluations

Paper

ICLR2025

Commit0: Library Generation from Scratch

Summary pending...

code generationlanguage modelevaluation

Paper

ICLR2025

VVC-Gym: A Fixed-Wing UAV Reinforcement Learning Environment for Multi-Goal Long-Horizon Problems

Summary pending...

Reinforcement Learning EnvironmentDemonstrationsGoal-Conditioned Reinforcement Learning

Paper

ICLR2025

MrT5: Dynamic Token Merging for Efficient Byte-level Language Models

Summary pending...

NLPByT5T5

Paper

ICLR2025

Simple is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented Generation

Summary pending...

Knowledge GraphsLarge Language ModelsRetrieval-Augmented Generation

Paper

ICLR2025

RazorAttention: Efficient KV Cache Compression Through Retrieval Heads

Summary pending...

LLMsKV cache compressionLLM inference acceleration

Paper

ICLR2025

Efficient Top-m Data Values Identification for Data Selection

Summary pending...

data selectiondata valuationtop-m arms identification

Paper

ICLR2025

Finally Rank-Breaking Conquers MNL Bandits: Optimal and Efficient Algorithms for MNL Assortment

Summary pending...

Active online assortment optimizationPreference feedbackSubsetwise utility maximization

Paper

ICLR2025

Collab: Controlled Decoding using Mixture of Agents for LLM Alignment

Summary pending...

AlignmentDecodingRLHF

Paper

ICLR2025

DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life

Summary pending...

language modelmoral dilemmamodel alignment

Paper

ICLR2025

Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation Systems

Summary pending...

Retrieval-Augmented GenerationSecurityPrivacy

Paper

ICLR2025

SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?

Summary pending...

Language modelsNatural language processingSoftware engineering

Paper

ICLR2025

PEARL: Towards Permutation-Resilient LLMs

Summary pending...

In-Context LearningLarge Language ModelsInstruction Tuning

Paper