Papers
12094 papers
ICLR2025
Constraint-Conditioned Actor-Critic for Offline Safe Reinforcement Learning
Summary pending...
Offline Safe Reinforcement LearningConstraint-conditioned Actor-CriticData Generation
ICLR2025
Conformal Language Model Reasoning with Coherent Factuality
Summary pending...
language modelsreasoningconformal prediction
ICLR2025
NeSyC: A Neuro-symbolic Continual Learner For Complex Embodied Tasks In Open Domains
Summary pending...
Embodied AINeuro-symbolic AI
ICLR2025
ACC-Collab: An Actor-Critic Approach to Multi-Agent LLM Collaboration
Summary pending...
Multi-Agent CollaborationLLM AgentsPreference Optimization
ICLR2025
Dissecting Adversarial Robustness of Multimodal LM Agents
Summary pending...
LM agentsmultimodal agentssafety
ICLR2025
SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators
Summary pending...
Model CompressionLarge Language ModelsPost-Training Quantization
ICLR2025
CL-MFAP: A Contrastive Learning-Based Multimodal Foundation Model for Molecular Property Prediction and Antibiotic Screening
Summary pending...
Contrastive LearningMultimodal Foundation ModelAntibiotic Property Prediction
ICLR2025
Towards Principled Evaluations of Sparse Autoencoders for Interpretability and Control
Summary pending...
mechanistic interpretabilitysparse autoencodersevaluations
ICLR2025
Commit0: Library Generation from Scratch
Summary pending...
code generationlanguage modelevaluation
ICLR2025
VVC-Gym: A Fixed-Wing UAV Reinforcement Learning Environment for Multi-Goal Long-Horizon Problems
Summary pending...
Reinforcement Learning EnvironmentDemonstrationsGoal-Conditioned Reinforcement Learning
ICLR2025
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
Summary pending...
NLPByT5T5
ICLR2025
Simple is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented Generation
Summary pending...
Knowledge GraphsLarge Language ModelsRetrieval-Augmented Generation
ICLR2025
RazorAttention: Efficient KV Cache Compression Through Retrieval Heads
Summary pending...
LLMsKV cache compressionLLM inference acceleration
ICLR2025
Efficient Top-m Data Values Identification for Data Selection
Summary pending...
data selectiondata valuationtop-m arms identification
ICLR2025
Finally Rank-Breaking Conquers MNL Bandits: Optimal and Efficient Algorithms for MNL Assortment
Summary pending...
Active online assortment optimizationPreference feedbackSubsetwise utility maximization
ICLR2025
Collab: Controlled Decoding using Mixture of Agents for LLM Alignment
Summary pending...
AlignmentDecodingRLHF
ICLR2025
DailyDilemmas: Revealing Value Preferences of LLMs with Quandaries of Daily Life
Summary pending...
language modelmoral dilemmamodel alignment
ICLR2025
Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation Systems
Summary pending...
Retrieval-Augmented GenerationSecurityPrivacy
ICLR2025
SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
Summary pending...
Language modelsNatural language processingSoftware engineering
ICLR2025
PEARL: Towards Permutation-Resilient LLMs
Summary pending...
In-Context LearningLarge Language ModelsInstruction Tuning