Papers

12094 papers

ICML2025

Non-Asymptotic Length Generalization

Summary pending...

Large Language ModelLength GeneralizationMachine Learning Theory
ICML2025

Strengthen Out-of-Distribution Detection Capability with Progressive Self-Knowledge Distillation

Summary pending...

Out-of-distribution DetectionSelf-distillation
ICML2025

Diversifying Policy Behaviors with Extrinsic Behavioral Curiosity

Summary pending...

Quality DiversityReinforcement LearningImitation Learning
ICML2025

Subgoal-Guided Policy Heuristic Search with Learned Subgoals

Summary pending...

Tree searchheuristic searchpolicy tree search
ICML2025

Assessing Safety Risks and Quantization-aware Safety Patching for Quantized Large Language Models

Summary pending...

Large Language ModelPreference AlignmentSafety Evaluation
ICML2025

Constrained Exploitability Descent: An Offline Reinforcement Learning Method for Finding Mixed-Strategy Nash Equilibrium

Summary pending...

offline reinforcement learningadversarial Markov gamemixed-strategy Nash equilibrium
ICML2025

Linear Mode Connectivity between Multiple Models modulo Permutation Symmetries

Summary pending...

Linear mode connectivitydeep learningpermutation symmetry
ICML2025

Do Not Mimic My Voice : Speaker Identity Unlearning for Zero-Shot Text-to-Speech

Summary pending...

machine unlearningzero-shot ttsvoice privacy
ICML2025

SPMC: Self-Purifying Federated Backdoor Defense via Margin Contribution

Summary pending...

Backdoor DefenseFederated LearningGame Theory
ICML2025

Training Software Engineering Agents and Verifiers with SWE-Gym

Summary pending...

AgentsSoftware Engineering AgentsPost-training
ICML2025

Origin Identification for Text-Guided Image-to-Image Diffusion Models

Summary pending...

Diffusion ModelsOrigin Identification
ICML2025

In-Context Denoising with One-Layer Transformers: Connections between Attention and Associative Memory Retrieval

Summary pending...

attentionin-context learningdenoising
ICML2025

Offline Model-based Optimization for Real-World Molecular Discovery

Summary pending...

AI for ScienceOffline Model-based OptimizationMolecular Optimization
ICML2025

Neural Solver Selection for Combinatorial Optimization

Summary pending...

neural combinatorial optimization
ICML2025

Overcoming Non-monotonicity in Transducer-based Streaming Generation

Summary pending...

streaming generationsimultaneous translationTransducer
ICML2025

Cost-efficient Collaboration between On-device and Cloud Language Models

Summary pending...

Local-remote collaborationreasoning
ICML2025

The Role of Sparsity for Length Generalization in LLMs

Summary pending...

Length generalizationTransformersPositional Encoding
ICML2025

The Polynomial Stein Discrepancy for Assessing Moment Convergence

Summary pending...

Stein DiscrepancyApproximate MCMCGoodness-of-fit
ICML2025

Efficient Heterogeneity-Aware Federated Active Data Selection

Summary pending...

Federated learningActive learningLeverage score sampling
ICML2025

Pixel2Feature Attack (P2FA): Rethinking the Perturbed Space to Enhance Adversarial Transferability

Summary pending...

TransferabilityAdversarial ExampleAI Security