Papers

12094 papers

ICLR2024

Provable Memory Efficient Self-Play Algorithm for Model-free Reinforcement Learning

Summary pending...

model-free RLself-playmemory efficiency
ICLR2024

Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation

Summary pending...

off-policy evaluationoffline reinforcement learningoffline policy selection
ICLR2024

Modulated Phase Diffusor: Content-Oriented Feature Synthesis for Detecting Unknown Objects

Summary pending...

Unsupervised out-of-distribution object detection; OOD data synthesis; Modulated phase diffusion
ICLR2024

On Double Descent in Reinforcement Learning with LSTD and Random Features

Summary pending...

Regularized Least-Square Temporal Differencedouble descentover-parameterization
ICLR2024

Measuring Vision-Language STEM Skills of Neural Models

Summary pending...

BenchmarkSTEMMultimodal
ICLR2024

MAP IT to Visualize Representations

Summary pending...

Visualization; Representation learning; Dimensionality reduction; Divergence
ICLR2024

FedLoGe: Joint Local and Generic Federated Learning under Long-tailed Data

Summary pending...

Personalized federated learningData heterogeneityLong-tailed Learning
ICLR2024

The Wasserstein Believer: Learning Belief Updates for Partially Observable Environments through Reliable Latent Space Models

Summary pending...

pomdpguaranteesrepresentation learning
ICLR2024

Decoupling Weighing and Selecting for Integrating Multiple Graph Pre-training Tasks

Summary pending...

Multi-tasking learningGraph Neural NetworksAutoML
ICLR2024

Towards Imitation Learning to Branch for MIP: A Hybrid Reinforcement Learning based Sample Augmentation Approach

Summary pending...

hybrid RLSample AugmentationLearning to branch
ICLR2024

Retrieval is Accurate Generation

Summary pending...

Artificial IntelligenceNatural Language ProcessingLanguage Models
ICLR2024

Scalable and Effective Implicit Graph Neural Networks on Large Graphs

Summary pending...

graph neural networksimplicit graph neural networksimplicit models
ICLR2024

AutoChunk: Automated Activation Chunk for Memory-Efficient Deep Learning Inference

Summary pending...

Long Sequence InferenceInferenceCompiler
ICLR2024

Uncertainty-aware Constraint Inference in Inverse Constrained Reinforcement Learning

Summary pending...

Inverse Constrained Reinforcement LearningConstrained Reinforcement LearningInverse Reinforcement Learning
ICLR2024

Two-timescale Extragradient for Finding Local Minimax Points

Summary pending...

Minimax optimizationNonconvex-nonconcave optimizationExtragradient method
ICLR2024

Algorithms for Caching and MTS with reduced number of predictions

Summary pending...

ML-Augmented AlgorithmsCachingMetrical Task Systems
ICLR2024

Interpretable Diffusion via Information Decomposition

Summary pending...

Diffusion ModelsInformation TheoryInterpretable Machine Learning
ICLR2024

Generative Learning for Solving Non-Convex Problem with Multi-Valued Input-Solution Mapping

Summary pending...

Non-convex optimizationMulti-valued solution mappingGenerative model
ICLR2024

Constructing Adversarial Examples for Vertical Federated Learning: Optimal Client Corruption through Multi-Armed Bandit

Summary pending...

Vertical Federated LearningAdversarial ExamplesAdaptive Client Corruption
ICLR2024

Text2Reward: Reward Shaping with Language Models for Reinforcement Learning

Summary pending...

reinforcement learning; large language models; robotics