Papers

12094 papers

UAI2024

Decentralized Online Learning in General-Sum Stackelberg Games

Stackelberg games; Bandits; Online learning

UAI2024

Graph Feedback Bandits with Similar Arms

online learning; bandit

UAI2024

Low-rank Matrix Bandits with Heavy-tailed Rewards

contextual bandit

UAI2024

Trusted re-weighting for label distribution learning

label distribution learning

UAI2024

Partial Identification with Proxy of Latent Confoundings via Sum-of-ratios Fractional Programming

causal effect

UAI2024

SMuCo: Reinforcement Learning for Visual Control via Sequential Multi-view Total Correlation

reinforcement learning; visual control; multi-view total correlation

UAI2024

Fast Reliability Estimation for Neural Networks with Adversarial Attack-Driven Importance Sampling

Deep Neural Networks; Reliability; Adversarial Attacks

UAI2024

Learning from Crowds with Dual-View K-Nearest Neighbor

Crowdsourcing; Label integration; K-Nearest Neighbor

UAI2024

BanditQ: Fair Bandits with Guaranteed Rewards

Bandits; fairness; regret bounds

UAI2024

Efficient Monte Carlo Tree Search via On-the-Fly State-Conditioned Action Abstraction

Monte Carlo tree search; Context-specific independence

UAI2024

Towards Minimax Optimality of Model-based Robust Reinforcement Learning

Robust MDPs; Robust Reinforcement Learning; Sample complexity

UAI2024

Neighbor Similarity and Multimodal Alignment based Product Recommendation Study

Neighbor similarity graph convolutional network; Multimodal alignment and fusion; User preference information enhancement; Multimodal recommendation

UAI2024

Differentiable Pareto-Smoothed Weighting for High-Dimensional Heterogeneous Treatment Effect Estimation

treatment effect estimation; Pareto smoothing

UAI2024

Calibrated and Conformal Propensity Scores for Causal Effect Estimation

Causal Inference; Conformal Prediction; Propensity Scores

UAI2024

Hybrid CtrlFormer: Learning Adaptive Search Space Partition for Hybrid Action Control via Transformer-based Monte Carlo Tree Search

Deep Reinforcement Learning; Hybrid Action Space Control; Transformer

UAI2024

Online Policy Optimization for Robust Markov Decision Process

reinforcement learning

UAI2024

ContextFlow++: Generalist-Specialist Flow-based Generative Models with Mixed-variable Context Encoding

normalizing flows; contexts; discrete

UAI2024

Inference for Optimal Linear Treatment Regimes in Personalized Decision-making

Linear treatment regime; Double robustness; Cube root asymptotics; Bootstrapping

UAI2024

Two Facets of SDE Under an Information-Theoretic Lens: Generalization of SGD via Training Trajectories and via Terminal States

Generalization; information-theoretic generalization bound; SGD

UAI2024

Inference in Probabilistic Answer Set Programs with Imprecise Probabilities via Optimization

probabilistic answer set programming; statistical relational artificial intelligence; imprecise probabilities
