Papers
12094 papers
ICLR2025
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
Human Alignment; Large Language Models; Reinforcement Learning
ICLR2025
TexTailor: Customized Text-aligned Texturing via Effective Resampling
3D texture synthesis; diffusion model; resampling
ICLR2025
MMTEB: Massive Multilingual Text Embedding Benchmark
natural language processing; benchmark; sentence embeddings
ICLR2025
Mind the GAP: Glimpse-based Active Perception improves generalization and sample efficiency of visual reasoning
visual reasoning; active vision; out-of-distribution
ICLR2025
Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics
Turn-taking; Conversation AI; Audio Foundation Models
ICLR2025
Real-time design of architectural structures with differentiable mechanics and neural networks
Differentiable physics; mechanical design; physics-in-the-loop neural networks
ICLR2025
BEEM: Boosting Performance of Early Exit DNNs using Multi-Exit Classifiers as Experts
Early Exits; Expert-based exiting
ICLR2025
Scaling Laws for Adversarial Attacks on Language Model Activations and Tokens
adversarial attacks; language models; scaling laws
ICLR2025
Locally Connected Echo State Networks for Time Series Forecasting
Time Series Analysis; Time Series Forecasting; TSF
ICLR2025
Reinforcement Learning for Control of Non-Markovian Cellular Population Dynamics
optimal drug dosing; fractional differential equations; reinforcement learning
ICLR2025
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
Retrieval benchmark; Reasoning
ICLR2025
Accelerated training through iterative gradient propagation along the residual path
optimization; efficient training
ICLR2025
Hierarchical Autoregressive Transformers: Combining Byte- and Word-Level Processing for Robust, Adaptable Language Models
transformer; autoregressive; generative
ICLR2025
U-shaped and Inverted-U Scaling behind Emergent Abilities of Large Language Models
large language models; emergent abilities; scaling laws
ICLR2025
DELIFT: Data Efficient Language model Instruction Fine-Tuning
Data Efficient Instruction Fine-Tuning; Data Subset Selection; Submodular Functions
ICLR2025
TULIP: Token-length Upgraded CLIP
Vision-Language Models; CLIP; Position Encodings
ICLR2025
Sketching for Convex and Nonconvex Regularized Least Squares with Sharp Guarantees
Sketching; Random Projection; Minimax Rates
ICLR2025
OvercookedV2: Rethinking Overcooked for Zero-Shot Coordination
multi-agent reinforcement learning; reinforcement learning; multi-agent systems
ICLR2025
On the Learn-to-Optimize Capabilities of Transformers in In-Context Sparse Recovery
Transformer; In-context learning; Learning-to-optimize