Papers

12094 papers

ICLR2025

Single Teacher, Multiple Perspectives: Teacher Knowledge Augmentation for Enhanced Knowledge Distillation

Summary pending...

TeKAPTeacher Knowledge AugmentationTeacher Knowledge Perturbation
ICLR2025

Model-based Offline Reinforcement Learning with Lower Expectile Q-Learning

Summary pending...

model-based offline reinforcement learningoffline reinforcement learninglower expectile return
ICLR2025

Unify ML4TSP: Drawing Methodological Principles for TSP and Beyond from Streamlined Design Space of Learning and Search

Summary pending...

Neural Combinatorial OptimizationTravelling salesman problem
ICLR2025

Learning Partial Graph Matching via Optimal Partial Transport

Summary pending...

Graph MatchingOptimal Transport
ICLR2025

SynFlowNet: Design of Diverse and Novel Molecules with Synthesis Constraints

Summary pending...

GFlowNetsde novo molecular generationsynthesizable molecular design
ICLR2025

Continuous Diffusion for Mixed-Type Tabular Data

Summary pending...

synthetic data generationdiffusion modelgenerative model
ICLR2025

SpaceGNN: Multi-Space Graph Neural Network for Node Anomaly Detection with Extremely Limited Labels

Summary pending...

Node Anomaly DetectionGraph Neural NetworkMultiple Spaces
ICLR2025

Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning

Summary pending...

Offline Reinforcement LearningDiffusion ModelsActor-critic Learning
ICLR2025

Everything is Editable: Extend Knowledge Editing to Unstructured Data in Large Language Models

Summary pending...

Knowledge EditingLarge Language Models
ICLR2025

A Solvable Attention for Neural Scaling Laws

Summary pending...

self-attentionscaling lawssolution of learning dynamics
ICLR2025

Conformalized Survival Analysis for General Right-Censored Data

Summary pending...

conformal predictionsurvival analysisPAC
ICLR2025

A Stochastic Approach to the Subset Selection Problem via Mirror Descent

Summary pending...

Nonconvex OptimizationSubset SelectionStochastic
ICLR2025

Improving Long-Text Alignment for Text-to-Image Diffusion Models

Summary pending...

Long Text AlignmentDiffusion ModelsPreference Optimization
ICLR2025

Toward Exploratory Inverse Constraint Inference with Generative Diffusion Verifiers

Summary pending...

Inverse Reinforcement LearningGenerative Diffusion Model
ICLR2025

Unified Parameter-Efficient Unlearning for LLMs

Summary pending...

Large Language Model Unlearning; Machine Unlearning; Influence Function
ICLR2025

Exploring the Effectiveness of Object-Centric Representations in Visual Question Answering: Comparative Insights with Foundation Models

Summary pending...

Object-centric LearningFoundation Models
ICLR2025

ADMM for Nonconvex Optimization under Minimal Continuity Assumption

Summary pending...

Nonconvex OptimizationProximal Linearized ADMMNonsmooth Optimization
ICLR2025

MAST: model-agnostic sparsified training

Summary pending...

dropout theorysparse training
ICLR2025

DOCS: Quantifying Weight Similarity for Deeper Insights into Large Language Models

Summary pending...

Weight SimilarityLarge Language ModelsDistribution of Cosine Similarity
ICLR2025

Cross-Domain Off-Policy Evaluation and Learning for Contextual Bandits

Summary pending...

Off-Policy EvaluationOff-Policy LearningImportance Weighting