Papers

12094 papers

ICLR2025

Optimization by Parallel Quasi-Quantum Annealing with Gradient-Based Sampling

Summary pending...

Combinatorial OptimizationDiscrete OptimizationLearning for Combinatorial Optimization
ICLR2025

Few for Many: Tchebycheff Set Scalarization for Many-Objective Optimization

Summary pending...

multi-objective optimizationmany-objective optimizationTchebycheff scalarization
ICLR2025

VICtoR: Learning Hierarchical Vision-Instruction Correlation Rewards for Long-horizon Manipulation

Summary pending...

reward learningreinforcement learninglong-horizon robot learning
ICLR2025

HeadMap: Locating and Enhancing Knowledge Circuits in LLMs

Summary pending...

Large Language ModelParameter-Efficient Fine-TuningInterpretability
ICLR2025

Ultra-Sparse Memory Network

Summary pending...

Large language modelsparse modelscaling law
ICLR2025

AdaRankGrad: Adaptive Gradient Rank and Moments for Memory-Efficient LLMs Training and Fine-Tuning

Summary pending...

low rank adaptation; low rank gradient training; memory efficient fine tuning; memory optimization; adaptive rank; foundation large language models;
ICLR2025

From Tokens to Words: On the Inner Lexicon of LLMs

Summary pending...

DetokenizationLarge Language ModelsLLM
ICLR2025

OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces

Summary pending...

multimodal representations
ICLR2025

ThinkBot: Embodied Instruction Following with Thought Chain Reasoning

Summary pending...

Embodied Instruction Following (EIF)Large Language ModelChain-of-thought Reasoning
ICLR2025

Looking Backward: Streaming Video-to-Video Translation with Feature Banks

Summary pending...

Streaming video translationdiffusion modelsfeature banks
ICLR2025

Making Transformer Decoders Better Differentiable Indexers

Summary pending...

Generative RetrievalGenerative IndexEnd-to-end Recommender System
ICLR2025

Sharpness-Aware Black-Box Optimization

Summary pending...

Black-box OptimizationSharpness-Aware Minimization
ICLR2025

Online-to-Offline RL for Agent Alignment

Summary pending...

reinforcement learningagent alignment
ICLR2025

Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates

Summary pending...

Large Language ModelsCheatingAutomatic LLM Benchmarks
ICLR2025

Revisiting text-to-image evaluation with Gecko: on metrics, prompts, and human rating

Summary pending...

text-to-image evaluation; text-to-image alignment; human evaluation;
ICLR2025

ELBOing Stein: Variational Bayes with Stein Mixture Inference

Summary pending...

variational Bayesparticle-based inferencemixture models
ICLR2025

Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization

Summary pending...

large language modelsalignmentpreference optimization
ICLR2025

Deriving Causal Order from Single-Variable Interventions: Guarantees & Algorithm

Summary pending...

causalitycausal inferencecausal discovery
ICLR2025

DECO: Unleashing the Potential of ConvNets for Query-based Detection and Segmentation

Summary pending...

DetectionSegmentationConvNet
ICLR2025

MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering

Summary pending...

benchmarkevalsevaluations