Papers
12094 papers
ICLR2025
Optimization by Parallel Quasi-Quantum Annealing with Gradient-Based Sampling
Summary pending...
Combinatorial OptimizationDiscrete OptimizationLearning for Combinatorial Optimization
ICLR2025
Few for Many: Tchebycheff Set Scalarization for Many-Objective Optimization
Summary pending...
multi-objective optimizationmany-objective optimizationTchebycheff scalarization
ICLR2025
VICtoR: Learning Hierarchical Vision-Instruction Correlation Rewards for Long-horizon Manipulation
Summary pending...
reward learningreinforcement learninglong-horizon robot learning
ICLR2025
HeadMap: Locating and Enhancing Knowledge Circuits in LLMs
Summary pending...
Large Language ModelParameter-Efficient Fine-TuningInterpretability
ICLR2025
Ultra-Sparse Memory Network
Summary pending...
Large language modelsparse modelscaling law
ICLR2025
AdaRankGrad: Adaptive Gradient Rank and Moments for Memory-Efficient LLMs Training and Fine-Tuning
Summary pending...
low rank adaptation; low rank gradient training; memory efficient fine tuning; memory optimization; adaptive rank; foundation large language models;
ICLR2025
From Tokens to Words: On the Inner Lexicon of LLMs
Summary pending...
DetokenizationLarge Language ModelsLLM
ICLR2025
OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces
Summary pending...
multimodal representations
ICLR2025
ThinkBot: Embodied Instruction Following with Thought Chain Reasoning
Summary pending...
Embodied Instruction Following (EIF)Large Language ModelChain-of-thought Reasoning
ICLR2025
Looking Backward: Streaming Video-to-Video Translation with Feature Banks
Summary pending...
Streaming video translationdiffusion modelsfeature banks
ICLR2025
Making Transformer Decoders Better Differentiable Indexers
Summary pending...
Generative RetrievalGenerative IndexEnd-to-end Recommender System
ICLR2025
Sharpness-Aware Black-Box Optimization
Summary pending...
Black-box OptimizationSharpness-Aware Minimization
ICLR2025
Online-to-Offline RL for Agent Alignment
Summary pending...
reinforcement learningagent alignment
ICLR2025
Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates
Summary pending...
Large Language ModelsCheatingAutomatic LLM Benchmarks
ICLR2025
Revisiting text-to-image evaluation with Gecko: on metrics, prompts, and human rating
Summary pending...
text-to-image evaluation; text-to-image alignment; human evaluation;
ICLR2025
ELBOing Stein: Variational Bayes with Stein Mixture Inference
Summary pending...
variational Bayesparticle-based inferencemixture models
ICLR2025
Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization
Summary pending...
large language modelsalignmentpreference optimization
ICLR2025
Deriving Causal Order from Single-Variable Interventions: Guarantees & Algorithm
Summary pending...
causalitycausal inferencecausal discovery
ICLR2025
DECO: Unleashing the Potential of ConvNets for Query-based Detection and Segmentation
Summary pending...
DetectionSegmentationConvNet
ICLR2025
MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering
Summary pending...
benchmarkevalsevaluations