Papers

12094 papers

ICLR2025

What Secrets Do Your Manifolds Hold? Understanding the Local Geometry of Generative Models

Summary pending...

GeometryDiffusion modelsVAE
ICLR2025

A Little Goes a Long Way: Efficient Long Context Training and Inference with Partial Contexts

Summary pending...

Long-Context LLMEfficient LLMContext Extension
ICLR2025

EFFICIENT JAILBREAK ATTACK SEQUENCES ON LARGE LANGUAGE MODELS VIA MULTI-ARMED BANDIT-BASED CONTEXT SWITCHING

Summary pending...

JailBreakAI SecurityLLM Vunlnerability
ICLR2025

Adaptive Rank Allocation: Speeding Up Modern Transformers with RaNA Adapters

Summary pending...

Large Language ModelsAdaptive computeRank adapters
ICLR2025

MetaDesigner: Advancing Artistic Typography through AI-Driven, User-Centric, and Multilingual WordArt Synthesis

Summary pending...

MetaDesigner
ICLR2025

DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models

Summary pending...

Visual Mathematical BenchmarkVision Language Models
ICLR2025

Should VLMs be Pre-trained with Image Data?

Summary pending...

vision language modelspre-trainingfine-tuning
ICLR2025

Model Editing as a Robust and Denoised variant of DPO: A Case Study on Toxicity

Summary pending...

model editingmechanistic interpretabilityai safety
ICLR2025

A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement

Summary pending...

AlignmentPreference OptimizationLarge Language Model
ICLR2025

How Gradient descent balances features: A dynamical analysis for two-layer neural networks

Summary pending...

learning theoryover-parameterizationlearning dynamics
ICLR2025

ManiSkill-HAB: A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks

Summary pending...

benchmarkdatasetsimulation
ICLR2025

DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory

Summary pending...

Document-Level TranslationLarge Language ModelsAutonomous Agents
ICLR2025

RTop-K: Ultra-Fast Row-Wise Top-K Selection for Neural Network Acceleration on GPUs

Summary pending...

row-wise topk selectionGPUCUDA
ICLR2025

Semi-Parametric Retrieval via Binary Bag-of-Tokens Index

Summary pending...

information retrievalefficient retrievalretrieval-agumented applications
ICLR2025

Safety-Prioritizing Curricula for Constrained Reinforcement Learning

Summary pending...

curriculum learningconstrained reinforcement learning
ICLR2025

Attributing Culture-Conditioned Generations to Pretraining Corpora

Summary pending...

culture biaspretraining datamemorization
ICLR2025

Reasoning of Large Language Models over Knowledge Graphs with Super-Relations

Summary pending...

Knowledge GraphsLarge Language ModelsQuestion Answering
ICLR2025

Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems

Summary pending...

pretraininglanguage modelerror correction
ICLR2025

Causal Order: The Key to Leveraging Imperfect Experts in Causal Inference

Summary pending...

Causal OrderImperfect ExpertsCausal Inference
ICLR2025

Optimizing Neural Network Representations of Boolean Networks

Summary pending...

Neural NetworksBoolean NetworksLossless Optimization