Papers
12094 papers
ICLR2024
Provable Memory Efficient Self-Play Algorithm for Model-free Reinforcement Learning
Summary pending...
model-free RLself-playmemory efficiency
ICLR2024
Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation
Summary pending...
off-policy evaluationoffline reinforcement learningoffline policy selection
ICLR2024
Modulated Phase Diffusor: Content-Oriented Feature Synthesis for Detecting Unknown Objects
Summary pending...
Unsupervised out-of-distribution object detection; OOD data synthesis; Modulated phase diffusion
ICLR2024
On Double Descent in Reinforcement Learning with LSTD and Random Features
Summary pending...
Regularized Least-Square Temporal Differencedouble descentover-parameterization
ICLR2024
Measuring Vision-Language STEM Skills of Neural Models
Summary pending...
BenchmarkSTEMMultimodal
ICLR2024
MAP IT to Visualize Representations
Summary pending...
Visualization; Representation learning; Dimensionality reduction; Divergence
ICLR2024
FedLoGe: Joint Local and Generic Federated Learning under Long-tailed Data
Summary pending...
Personalized federated learningData heterogeneityLong-tailed Learning
ICLR2024
The Wasserstein Believer: Learning Belief Updates for Partially Observable Environments through Reliable Latent Space Models
Summary pending...
pomdpguaranteesrepresentation learning
ICLR2024
Decoupling Weighing and Selecting for Integrating Multiple Graph Pre-training Tasks
Summary pending...
Multi-tasking learningGraph Neural NetworksAutoML
ICLR2024
Towards Imitation Learning to Branch for MIP: A Hybrid Reinforcement Learning based Sample Augmentation Approach
Summary pending...
hybrid RLSample AugmentationLearning to branch
ICLR2024
Retrieval is Accurate Generation
Summary pending...
Artificial IntelligenceNatural Language ProcessingLanguage Models
ICLR2024
Scalable and Effective Implicit Graph Neural Networks on Large Graphs
Summary pending...
graph neural networksimplicit graph neural networksimplicit models
ICLR2024
AutoChunk: Automated Activation Chunk for Memory-Efficient Deep Learning Inference
Summary pending...
Long Sequence InferenceInferenceCompiler
ICLR2024
Uncertainty-aware Constraint Inference in Inverse Constrained Reinforcement Learning
Summary pending...
Inverse Constrained Reinforcement LearningConstrained Reinforcement LearningInverse Reinforcement Learning
ICLR2024
Two-timescale Extragradient for Finding Local Minimax Points
Summary pending...
Minimax optimizationNonconvex-nonconcave optimizationExtragradient method
ICLR2024
Algorithms for Caching and MTS with reduced number of predictions
Summary pending...
ML-Augmented AlgorithmsCachingMetrical Task Systems
ICLR2024
Interpretable Diffusion via Information Decomposition
Summary pending...
Diffusion ModelsInformation TheoryInterpretable Machine Learning
ICLR2024
Generative Learning for Solving Non-Convex Problem with Multi-Valued Input-Solution Mapping
Summary pending...
Non-convex optimizationMulti-valued solution mappingGenerative model
ICLR2024
Constructing Adversarial Examples for Vertical Federated Learning: Optimal Client Corruption through Multi-Armed Bandit
Summary pending...
Vertical Federated LearningAdversarial ExamplesAdaptive Client Corruption
ICLR2024
Text2Reward: Reward Shaping with Language Models for Reinforcement Learning
Summary pending...
reinforcement learning; large language models; robotics