Papers

12094 papers

ICML2024

Hard Tasks First: Multi-Task Reinforcement Learning Through Task Scheduling

Summary pending...

ICML2024

Precise Accuracy / Robustness Tradeoffs in Regression: Case of General Norms

Summary pending...

ICML2024

Prompting is a Double-Edged Sword: Improving Worst-Group Robustness of Foundation Models

Summary pending...

ICML2024

Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption

Summary pending...

ICML2024

When Will Gradient Regularization Be Harmful?

Summary pending...

ICML2024

Conditional Common Entropy for Instrumental Variable Testing and Partial Identification

Summary pending...

ICML2024

Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming

Summary pending...

ICML2024

Multi-View Clustering by Inter-cluster Connectivity Guided Reward

Summary pending...

ICML2024

On Discrete Prompt Optimization for Diffusion Models

Summary pending...

ICML2024

Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning

Summary pending...

ICML2024

Implicit meta-learning may lead language models to trust more reliable sources

Summary pending...

ICML2024

Copyright Traps for Large Language Models

Summary pending...

ICML2024

Learning in Feature Spaces via Coupled Covariances: Asymmetric Kernel SVD and Nyström method

Summary pending...

ICML2024

Evaluation of Trajectory Distribution Predictions with Energy Score

Summary pending...

ICML2024

Causal Discovery with Fewer Conditional Independence Tests

Summary pending...

ICML2024

Referee Can Play: An Alternative Approach to Conditional Generation via Model Inversion

Summary pending...

ICML2024

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal

Summary pending...

ICML2024

Adversarial Attacks on Combinatorial Multi-Armed Bandits

Summary pending...

ICML2024

Exact Conversion of In-Context Learning to Model Weights in Linearized-Attention Transformers

Summary pending...

ICML2024

Reshape and Adapt for Output Quantization (RAOQ): Quantization-aware Training for In-memory Computing Systems

Summary pending...