Papers

12094 papers

ICML2024

Reducing Item Discrepancy via Differentially Private Robust Embedding Alignment for Privacy-Preserving Cross Domain Recommendation

Summary pending...

ICML2024

Neural Collapse for Cross-entropy Class-Imbalanced Learning with Unconstrained ReLU Features Model

Summary pending...

ICML2024

The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks

Summary pending...

ICML2024

Nash Learning from Human Feedback

Summary pending...

ICML2024

Unsupervised Domain Adaptation for Anatomical Structure Detection in Ultrasound Images

Summary pending...

ICML2024

StableMask: Refining Causal Masking in Decoder-only Transformer

Summary pending...

ICML2024

Feedback Loops With Language Models Drive In-Context Reward Hacking

Summary pending...

ICML2024

Sample-Efficient Multiagent Reinforcement Learning with Reset Replay

Summary pending...

ICML2024

Plug-and-Play image restoration with Stochastic deNOising REgularization

Summary pending...

ICML2024

Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast

Summary pending...

ICML2024

Graph Out-of-Distribution Detection Goes Neighborhood Shaping

Summary pending...

ICML2024

Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models

Summary pending...

ICML2024

Autonomous Sparse Mean-CVaR Portfolio Optimization

Summary pending...

ICML2024

Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-constraint

Summary pending...

ICML2024

Hierarchical Integral Probability Metrics: A distance on random probability measures with low sample complexity

Summary pending...

ICML2024

The Stronger the Diffusion Model, the Easier the Backdoor: Data Poisoning to Induce Copyright BreachesWithout Adjusting Finetuning Pipeline

Summary pending...

ICML2024

Reflective Policy Optimization

Summary pending...

ICML2024

Safe and Robust Subgame Exploitation in Imperfect Information Games

Summary pending...

ICML2024

Multi-Source Conformal Inference Under Distribution Shift

Summary pending...

ICML2024

Langevin Policy for Safe Reinforcement Learning

Summary pending...