Papers

12094 papers

UAI2024

Decentralized Online Learning in General-Sum Stackelberg Games

Stackelberg games; Bandits; Online learning
UAI2024

Graph Feedback Bandits with Similar Arms

online learning; bandit
UAI2024

Low-rank Matrix Bandits with Heavy-tailed Rewards

contextual bandit
UAI2024

Trusted re-weighting for label distribution learning

label distribution learning
UAI2024

Partial Identification with Proxy of Latent Confoundings via Sum-of-ratios Fractional Programming

causal effect
UAI2024

SMuCo: Reinforcement Learning for Visual Control via Sequential Multi-view Total Correlation

reinforcement learning; visual control; multi-view total correlation
UAI2024

Fast Reliability Estimation for Neural Networks with Adversarial Attack-Driven Importance Sampling

Deep Neural Networks; Reliability; Adversarial Attacks
UAI2024

Learning from Crowds with Dual-View K-Nearest Neighbor

Crowdsourcing; Label integration; K-Nearest Neighbor
UAI2024

BanditQ: Fair Bandits with Guaranteed Rewards

Bandits; fairness; regret bounds
UAI2024

Efficient Monte Carlo Tree Search via On-the-Fly State-Conditioned Action Abstraction

Monte Carlo tree search; Context-specific independence
UAI2024

Towards Minimax Optimality of Model-based Robust Reinforcement Learning

Robust MDPs; Robust Reinforcement Learning; Sample complexity
UAI2024

Neighbor Similarity and Multimodal Alignment based Product Recommendation Study

Neighbor similarity graph convolutional network; Multimodal alignment and fusion; User preference information enhancement; Multimodal recommendation
UAI2024

Differentiable Pareto-Smoothed Weighting for High-Dimensional Heterogeneous Treatment Effect Estimation

treatment effect estimation; Pareto smoothing
UAI2024

Calibrated and Conformal Propensity Scores for Causal Effect Estimation

Causal Inference; Conformal Prediction; Propensity Scores
UAI2024

Hybrid CtrlFormer: Learning Adaptive Search Space Partition for Hybrid Action Control via Transformer-based Monte Carlo Tree Search

Deep Reinforcement Learning; Hybrid Action Space Control; Transformer
UAI2024

Online Policy Optimization for Robust Markov Decision Process

reinforcement learning
UAI2024

ContextFlow++: Generalist-Specialist Flow-based Generative Models with Mixed-variable Context Encoding

normalizing flows; contexts; discrete
UAI2024

Inference for Optimal Linear Treatment Regimes in Personalized Decision-making

Linear treatment regime; Double robustness; Cube root asymptotics; Bootstrapping
UAI2024

Two Facets of SDE Under an Information-Theoretic Lens: Generalization of SGD via Training Trajectories and via Terminal States

Generalization; information-theoretic generalization bound; SGD
UAI2024

Inference in Probabilistic Answer Set Programs with Imprecise Probabilities via Optimization

probabilistic answer set programming; statistical relational artificial intelligence; imprecise probabilities