Papers
12094 papers
ICLR2025
Stiefel Flow Matching for Moment-Constrained Structure Elucidation
Summary pending...
3D molecular generative modelsflow matchingStiefel manifold
ICLR2025
Robust Weight Initialization for Tanh Neural Networks with Fixed Point Analysis
Summary pending...
Weight initializationSignal propagationPhysics informed neural networks
ICLR2025
LongMamba: Enhancing Mamba's Long-Context Capabilities via Training-Free Receptive Field Enlargement
Summary pending...
Large Language ModelsState Space ModelsLong Context Understanding
ICLR2025
Learning to Search from Demonstration Sequences
Summary pending...
planningreasoninglearning to search
ICLR2025
Self-Supervised Diffusion MRI Denoising via Iterative and Stable Refinement
Summary pending...
Diffusion based modelsSelf-supervised MRI denoising
ICLR2025
Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images
Summary pending...
Copyright TrackingLarge Vision-Language ModelsAdversarial Attacks
ICLR2025
Adversarial Generative Flow Network for Solving Vehicle Routing Problems
Summary pending...
Generative Flow NetworkAdversarial TrainingVehicle Routing Problem
ICLR2025
Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence
Summary pending...
llm agentmulti-agent
ICLR2025
Semantic Loss Guided Data Efficient Supervised Fine Tuning for Safe Responses in LLMs
Summary pending...
Large Language ModelSafe Large Language ModelEarth Mover Distance
ICLR2025
Agent S: An Open Agentic Framework that Uses Computers Like a Human
Summary pending...
Large Vision and Language ModelAgentsRetrieval Augmented Generation
ICLR2025
Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning
Summary pending...
Reinforcement Learning
ICLR2025
Context Steering: Controllable Personalization at Inference Time
Summary pending...
personalizationcontextlarge language model
ICLR2025
Searching for Optimal Solutions with LLMs via Bayesian Optimization
Summary pending...
searchoptimizationLLMs
ICLR2025
Online Preference Alignment for Language Models via Count-based Exploration
Summary pending...
Reinforcement Learning from Human FeedbackRLHFPreference Alignment
ICLR2025
Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
Summary pending...
Actor-CriticExplorationReinforcement Learning
ICLR2025
Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits
Summary pending...
Linear banditThompson samplingGreedy
ICLR2025
Diverse Preference Learning for Capabilities and Alignment
Summary pending...
alignmentdiversitynatural language processing
ICLR2025
Making Text Embedders Few-Shot Learners
Summary pending...
large language modelembedding modelin-context learning
ICLR2025
BrainACTIV: Identifying visuo-semantic properties driving cortical selectivity using diffusion-based image manipulation
Summary pending...
brainselectivityvisual cortex
ICLR2025
SafeDiffuser: Safe Planning with Diffusion Probabilistic Models
Summary pending...
Diffusion modelSafety guaranteesPlanning and control