Papers
12094 papers
ICLR2025
Pursuing Feature Separation based on Neural Collapse for Out-of-Distribution Detection
Summary pending...
Out-of-Distribution DetectionFeature SeparationNeural Collapse
ICLR2025
To Tackle Adversarial Transferability: A Novel Ensemble Training Method with Fourier Transformation
Summary pending...
robustnessdiversityensemble training
ICLR2025
Flow matching achieves almost minimax optimal convergence
Summary pending...
flow matchinggenerative modelconvergence rate
ICLR2025
MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning
Summary pending...
Multimodal LLM
ICLR2025
Adam Exploits $\ell_\infty$-geometry of Loss Landscape via Coordinate-wise Adaptivity
Summary pending...
Adamcoordinate-wise adaptivityadaptive algorithms
ICLR2025
Discriminator-Guided Embodied Planning for LLM Agent
Summary pending...
LLM AgentEmbodied PlanningDiscriminator
ICLR2025
Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks
Summary pending...
GFlowNetGenerative Modelsf-Divergence
ICLR2025
Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models
Summary pending...
DataData FilteringData Pruning
ICLR2025
MoDeGPT: Modular Decomposition for Large Language Model Compression
Summary pending...
LLMmodel compressionmatrix decomposition
ICLR2025
What Does It Mean to Be a Transformer? Insights from a Theoretical Hessian Analysis
Summary pending...
HessianTransformers
ICLR2025
A Non-Contrastive Learning Framework for Sequential Recommendation with Preference-Preserving Profile Generation
Summary pending...
sequential recommendationnon-contrastive learning
ICLR2025
ADBM: Adversarial Diffusion Bridge Model for Reliable Adversarial Purification
Summary pending...
diffusion modelsadversarial robustness
ICLR2025
PeriodWave: Multi-Period Flow Matching for High-Fidelity Waveform Generation
Summary pending...
Conditional Flow MatchingNeural VocoderSpeech Synthesis
ICLR2025
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer
Summary pending...
Video GenerationDiffusion modelPretraining
ICLR2025
Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents
Summary pending...
large language modelsLLM agentssoftware engineering
ICLR2025
CrossMPT: Cross-attention Message-passing Transformer for Error Correcting Codes
Summary pending...
Cross-attentionError correcting codesMessage-passing decoder
ICLR2025
Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon
Summary pending...
memorizationontologieslanguage modelling
ICLR2025
Selective Label Enhancement Learning for Test-Time Adaptation
Summary pending...
label enhancementtest-time adaptationdistribution shift
ICLR2025
InverseBench: Benchmarking Plug-and-Play Diffusion Priors for Inverse Problems in Physical Sciences
Summary pending...
inverse problembenchmarkdiffusion model
ICLR2025
MambaExtend: A Training-Free Approach to Improve Long Context Extension of Mamba
Summary pending...
MambaLong Context GeneralizationDiscretization Step