Papers

12094 papers

ICLR2025

MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions

Summary pending...

Human AlignmentLarge Language ModelsReinforcement Learning
ICLR2025

Generalization and Distributed Learning of GFlowNets

Summary pending...

GFlowNets
ICLR2025

TexTailor: Customized Text-aligned Texturing via Effective Resampling

Summary pending...

3D texture synthesisdiffusion modelresampling
ICLR2025

MMTEB: Massive Multilingual Text Embedding Benchmark

Summary pending...

natural language processingbenchmarksentence embeddings
ICLR2025

Mind the GAP: Glimpse-based Active Perception improves generalization and sample efficiency of visual reasoning

Summary pending...

visual reasoningactive visionout-of-distribution
ICLR2025

Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics

Summary pending...

Turn-takingConversation AIAudio Foundation Models
ICLR2025

Real-time design of architectural structures with differentiable mechanics and neural networks

Summary pending...

Differentiable physicsmechanical designphysics-in-the-loop neural networks
ICLR2025

BEEM: Boosting Performance of Early Exit DNNs using Multi-Exit Classifiers as Experts

Summary pending...

Early Exits; Expert-based exiting
ICLR2025

Scaling Laws for Adversarial Attacks on Language Model Activations and Tokens

Summary pending...

adversarial attackslanguage modelsscaling laws
ICLR2025

Locally Connected Echo State Networks for Time Series Forecasting

Summary pending...

Time Series AnalysisTime Series ForecastingTSF
ICLR2025

Reinforcement Learning for Control of Non-Markovian Cellular Population Dynamics

Summary pending...

optimal drug dosingfractional differential equationsreinforcement learning
ICLR2025

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

Summary pending...

Retrieval benchmarkReasoning
ICLR2025

Accelerated training through iterative gradient propagation along the residual path

Summary pending...

optimizationefficient training
ICLR2025

Hierarchical Autoregressive Transformers: Combining Byte- and Word-Level Processing for Robust, Adaptable Language Models

Summary pending...

transformerautoregressivegenerative
ICLR2025

U-shaped and Inverted-U Scaling behind Emergent Abilities of Large Language Models

Summary pending...

large language modelsemergent abilitiesscaling laws
ICLR2025

DELIFT: Data Efficient Language model Instruction Fine-Tuning

Summary pending...

Data Efficient Instruction Fine-Tuning; Data Subset Selection; Submodular Functions
ICLR2025

TULIP: Token-length Upgraded CLIP

Summary pending...

Vision-Language ModelsCLIPPosition Encodings
ICLR2025

Sketching for Convex and Nonconvex Regularized Least Squares with Sharp Guarantees

Summary pending...

SketchingRandom ProjectionMinimax Rates
ICLR2025

OvercookedV2: Rethinking Overcooked for Zero-Shot Coordination

Summary pending...

multi-agent reinforcement learningreinforcement learningmulti-agent systems
ICLR2025

On the Learn-to-Optimize Capabilities of Transformers in In-Context Sparse Recovery

Summary pending...

TransformerIn-context learningLearning-to-optimize