Papers

12094 papers

ICLR2025

DPLM-2: A Multimodal Diffusion Protein Language Model

Summary pending...

protein foundation modeldiffusion language modelmultimodal language model
ICLR2025

Efficient Residual Learning with Mixture-of-Experts for Universal Dexterous Grasping

Summary pending...

dexterous graspingresidual policy learningreinforcement learning
ICLR2025

Quantum-PEFT: Ultra parameter-efficient fine-tuning

Summary pending...

parameter-efficient fine-tuningloraquantum machine learning
ICLR2025

TopoLM: brain-like spatio-functional organization in a topographic language model

Summary pending...

language modelingtopographyfMRI
ICLR2025

Language Models are Advanced Anonymizers

Summary pending...

privacyanonymizationlarge language models
ICLR2025

SWE-Search: Enhancing Software Agents with Monte Carlo Tree Search and Iterative Refinement

Summary pending...

agentsLLMSWE-agents
ICLR2025

How new data permeates LLM knowledge and how to dilute it

Summary pending...

fine-tuninghallucinationsknowledge injection
ICLR2025

Reliable and Diverse Evaluation of LLM Medical Knowledge Mastery

Summary pending...

LLM EvaluationMedical EvaluationLarge Language Model
ICLR2025

Clique Number Estimation via Differentiable Functions of Adjacency Matrix Permutations

Summary pending...

Graph neural networkdistant supervision
ICLR2025

SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text Generation

Summary pending...

faithfulnesshallucinationconditional text generation
ICLR2025

ADAM: An Embodied Causal Agent in Open-World Environments

Summary pending...

embodied agentcausalitylarge language model
ICLR2025

Ward: Provable RAG Dataset Inference via LLM Watermarks

Summary pending...

llmwatermarksdataset inference
ICLR2025

ProAdvPrompter: A Two-Stage Journey to Effective Adversarial Prompting for LLMs

Summary pending...

jailbreaking attacks; large language model
ICLR2025

Expected Return Symmetries

Summary pending...

multi-agent reinforcement learningzero-shot coordination
ICLR2025

Herald: A Natural Language Annotated Lean 4 Dataset

Summary pending...

Lean 4AutoformalizingLLM
ICLR2025

Black-Box Detection of Language Model Watermarks

Summary pending...

llmwatermarking
ICLR2025

Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization

Summary pending...

mixture of expertslarge language modelscontinual pre-training
ICLR2025

Oracle efficient truncated statistics

Summary pending...

truncated statisticsexponential familystatistical learning
ICLR2025

Neural Interactive Proofs

Summary pending...

interactive proofsgame theoryneural networks
ICLR2025

SimulPL: Aligning Human Preferences in Simultaneous Machine Translation

Summary pending...

simultaneous machine translationsimultaneous preference optimizationhuman preferences