Papers

12094 papers

ICLR2025

Pursuing Feature Separation based on Neural Collapse for Out-of-Distribution Detection

Summary pending...

Out-of-Distribution DetectionFeature SeparationNeural Collapse
ICLR2025

To Tackle Adversarial Transferability: A Novel Ensemble Training Method with Fourier Transformation

Summary pending...

robustnessdiversityensemble training
ICLR2025

Flow matching achieves almost minimax optimal convergence

Summary pending...

flow matchinggenerative modelconvergence rate
ICLR2025

MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning

Summary pending...

Multimodal LLM
ICLR2025

Adam Exploits $\ell_\infty$-geometry of Loss Landscape via Coordinate-wise Adaptivity

Summary pending...

Adamcoordinate-wise adaptivityadaptive algorithms
ICLR2025

Discriminator-Guided Embodied Planning for LLM Agent

Summary pending...

LLM AgentEmbodied PlanningDiscriminator
ICLR2025

Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks

Summary pending...

GFlowNetGenerative Modelsf-Divergence
ICLR2025

Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models

Summary pending...

DataData FilteringData Pruning
ICLR2025

MoDeGPT: Modular Decomposition for Large Language Model Compression

Summary pending...

LLMmodel compressionmatrix decomposition
ICLR2025

What Does It Mean to Be a Transformer? Insights from a Theoretical Hessian Analysis

Summary pending...

HessianTransformers
ICLR2025

A Non-Contrastive Learning Framework for Sequential Recommendation with Preference-Preserving Profile Generation

Summary pending...

sequential recommendationnon-contrastive learning
ICLR2025

ADBM: Adversarial Diffusion Bridge Model for Reliable Adversarial Purification

Summary pending...

diffusion modelsadversarial robustness
ICLR2025

PeriodWave: Multi-Period Flow Matching for High-Fidelity Waveform Generation

Summary pending...

Conditional Flow MatchingNeural VocoderSpeech Synthesis
ICLR2025

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer

Summary pending...

Video GenerationDiffusion modelPretraining
ICLR2025

Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents

Summary pending...

large language modelsLLM agentssoftware engineering
ICLR2025

CrossMPT: Cross-attention Message-passing Transformer for Error Correcting Codes

Summary pending...

Cross-attentionError correcting codesMessage-passing decoder
ICLR2025

Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon

Summary pending...

memorizationontologieslanguage modelling
ICLR2025

Selective Label Enhancement Learning for Test-Time Adaptation

Summary pending...

label enhancementtest-time adaptationdistribution shift
ICLR2025

InverseBench: Benchmarking Plug-and-Play Diffusion Priors for Inverse Problems in Physical Sciences

Summary pending...

inverse problembenchmarkdiffusion model
ICLR2025

MambaExtend: A Training-Free Approach to Improve Long Context Extension of Mamba

Summary pending...

MambaLong Context GeneralizationDiscretization Step