Papers

12094 papers

ICLR2025

Stiefel Flow Matching for Moment-Constrained Structure Elucidation

Summary pending...

3D molecular generative modelsflow matchingStiefel manifold
ICLR2025

Robust Weight Initialization for Tanh Neural Networks with Fixed Point Analysis

Summary pending...

Weight initializationSignal propagationPhysics informed neural networks
ICLR2025

LongMamba: Enhancing Mamba's Long-Context Capabilities via Training-Free Receptive Field Enlargement

Summary pending...

Large Language ModelsState Space ModelsLong Context Understanding
ICLR2025

Learning to Search from Demonstration Sequences

Summary pending...

planningreasoninglearning to search
ICLR2025

Self-Supervised Diffusion MRI Denoising via Iterative and Stable Refinement

Summary pending...

Diffusion based modelsSelf-supervised MRI denoising
ICLR2025

Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images

Summary pending...

Copyright TrackingLarge Vision-Language ModelsAdversarial Attacks
ICLR2025

Adversarial Generative Flow Network for Solving Vehicle Routing Problems

Summary pending...

Generative Flow NetworkAdversarial TrainingVehicle Routing Problem
ICLR2025

Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence

Summary pending...

llm agentmulti-agent
ICLR2025

Semantic Loss Guided Data Efficient Supervised Fine Tuning for Safe Responses in LLMs

Summary pending...

Large Language ModelSafe Large Language ModelEarth Mover Distance
ICLR2025

Agent S: An Open Agentic Framework that Uses Computers Like a Human

Summary pending...

Large Vision and Language ModelAgentsRetrieval Augmented Generation
ICLR2025

Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning

Summary pending...

Reinforcement Learning
ICLR2025

Context Steering: Controllable Personalization at Inference Time

Summary pending...

personalizationcontextlarge language model
ICLR2025

Searching for Optimal Solutions with LLMs via Bayesian Optimization

Summary pending...

searchoptimizationLLMs
ICLR2025

Online Preference Alignment for Language Models via Count-based Exploration

Summary pending...

Reinforcement Learning from Human FeedbackRLHFPreference Alignment
ICLR2025

Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning

Summary pending...

Actor-CriticExplorationReinforcement Learning
ICLR2025

Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits

Summary pending...

Linear banditThompson samplingGreedy
ICLR2025

Diverse Preference Learning for Capabilities and Alignment

Summary pending...

alignmentdiversitynatural language processing
ICLR2025

Making Text Embedders Few-Shot Learners

Summary pending...

large language modelembedding modelin-context learning
ICLR2025

BrainACTIV: Identifying visuo-semantic properties driving cortical selectivity using diffusion-based image manipulation

Summary pending...

brainselectivityvisual cortex
ICLR2025

SafeDiffuser: Safe Planning with Diffusion Probabilistic Models

Summary pending...

Diffusion modelSafety guaranteesPlanning and control