Papers

12094 papers

ICLR2025

Safety-Prioritizing Curricula for Constrained Reinforcement Learning

Summary pending...

curriculum learningconstrained reinforcement learning
ICLR2025

Compute-Optimal LLMs Provably Generalize Better with Scale

Summary pending...

generalization boundslanguage modelsscaling laws
ICLR2025

Optimizing $(L_0, L_1)$-Smooth Functions by Gradient Methods

Summary pending...

$(L_0L_1)$-smoothnessgradient methods
ICLR2025

Competition Dynamics Shape Algorithmic Phases of In-Context Learning

Summary pending...

In-Context LearningCircuit CompetitionMarkov Chains
ICLR2025

DiscoveryBench: Towards Data-Driven Discovery with Large Language Models

Summary pending...

scientific discoverydata-driven discoverydata analysis
ICLR2025

Predictive Uncertainty Quantification for Bird's Eye View Segmentation: A Benchmark and Novel Loss Function

Summary pending...

Uncertainty QuantificationEvidential Deep LearningBird's Eye View (BEV) Segmentation
ICLR2025

L-WISE: Boosting Human Visual Category Learning Through Model-Based Image Selection and Enhancement

Summary pending...

Human-aligned modelsrobust neural networksvisual perception
ICLR2025

What Secrets Do Your Manifolds Hold? Understanding the Local Geometry of Generative Models

Summary pending...

GeometryDiffusion modelsVAE
ICLR2025

A Little Goes a Long Way: Efficient Long Context Training and Inference with Partial Contexts

Summary pending...

Long-Context LLMEfficient LLMContext Extension
ICLR2025

EFFICIENT JAILBREAK ATTACK SEQUENCES ON LARGE LANGUAGE MODELS VIA MULTI-ARMED BANDIT-BASED CONTEXT SWITCHING

Summary pending...

JailBreakAI SecurityLLM Vunlnerability
ICLR2025

Adaptive Rank Allocation: Speeding Up Modern Transformers with RaNA Adapters

Summary pending...

Large Language ModelsAdaptive computeRank adapters
ICLR2025

MetaDesigner: Advancing Artistic Typography through AI-Driven, User-Centric, and Multilingual WordArt Synthesis

Summary pending...

MetaDesigner
ICLR2025

DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models

Summary pending...

Visual Mathematical BenchmarkVision Language Models
ICLR2025

Should VLMs be Pre-trained with Image Data?

Summary pending...

vision language modelspre-trainingfine-tuning
ICLR2025

Model Editing as a Robust and Denoised variant of DPO: A Case Study on Toxicity

Summary pending...

model editingmechanistic interpretabilityai safety
ICLR2025

A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement

Summary pending...

AlignmentPreference OptimizationLarge Language Model
ICLR2025

How Gradient descent balances features: A dynamical analysis for two-layer neural networks

Summary pending...

learning theoryover-parameterizationlearning dynamics
ICLR2025

ManiSkill-HAB: A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks

Summary pending...

benchmarkdatasetsimulation
ICLR2025

DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory

Summary pending...

Document-Level TranslationLarge Language ModelsAutonomous Agents
ICLR2025

RTop-K: Ultra-Fast Row-Wise Top-K Selection for Neural Network Acceleration on GPUs

Summary pending...

row-wise topk selectionGPUCUDA