Scholar's Hub

These papers have received best paper awards or distinguished paper awards from renowned computer science conferences in the Artificial Intelligence and Theory fields.

This collection is sourced from each conference. If you notice any errors, please contact us.

AI Papers Theory Papers

AI

AAAI

Misspecification in Inverse Reinforcement Learning

Joar Skalse, A. Abate
ArXiv
December 6, 2022

The aim of Inverse Reinforcement Learning (IRL) is to infer a reward function $R$ from a policy $\pi$. To do this, we need a model of how $\pi$ relates to $R$. In the current literature, the most common models are optimality, Boltzmann rationality, and causal entropy maximisation. One of the primary motivations behind IRL is to infer human preferences from human behaviour. However, the true relationship between human preferences and human behaviour is much more complex than any of the models currently used in IRL. This means that they are misspecified, which raises the worry that they might lead to unsound inferences if applied to real-world data. In this paper, we provide a mathematical analysis of how robust different IRL models are to misspecification, and answer precisely how the demonstrator policy may differ from each of the standard models before that model leads to faulty inferences about the reward function $R$. We also introduce a framework for reasoning about misspecification in IRL, together with formal tools that can be used to easily derive the misspecification robustness of new IRL models.

Scholar's Hub

AI

AAAI

Misspecification in Inverse Reinforcement Learning

Online certification of preference-based fairness for personalized recommender systems

ACL

From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Models

What the DAAM: Interpreting Stable Diffusion Using Cross Attention

Do Androids Laugh at Electric Sheep? Humor “Understanding” Benchmarks from The New Yorker Caption Contest

Learned Incremental Representations for Parsing

CIKM

D-HYPR: Harnessing Neighborhood Modeling and Asymmetry Preservation for Digraph Representation Learning

CVPR

Planning-oriented Autonomous Driving

Visual Programming: Compositional visual reasoning without training

Learning to Solve Hard Minimal Problems

EMNLP

Faster Minimum Bayes Risk Decoding with Confidence-based Pruning

Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking Competition

Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning

Personalized Dense Retrieval on Global Index for Voice-enabled Conversational Systems

PaperMage: A Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scientific Documents

Abstract Visual Reasoning with Tangram Shapes

HRI

Design Principles for Robot-Assisted Feeding in Social Contexts

Feminist Human-Robot Interaction: Disentangling Power, Principles and Practice for Better, More Ethical HRI

Multi-Purposeful Activities for Robot-Assisted Autism Therapy: What Works Best for Children's Social Outcomes?

Interactive Policy Shaping for Human-Robot Collaboration with Transparent Matrix Overlays

Lively: Enabling Multimodal, Lifelike, and Extensible Real-time Robot Motion

REGROUP: A Robot-Centric Group Detection and Tracking System

MIND MELD: Personalized Meta-Learning for Robot-Centric Imitation Learning

Memory-Based Personalization for Fostering a Long-Term Child-Robot Relationship

More than words: A Framework for Describing Human-Robot Dialog Designs

Exploring Machine-like Behaviors for Socially Acceptable Robot Navigation in Elevators

ICLR

Universal Few-shot Learning of Dense Prediction Tasks with Visual Token Matching

Emergence of Maps in the Memories of Blind Navigation Agents

Rethinking the Expressive Power of GNNs via Graph Biconnectivity

DreamFusion: Text-to-3D using 2D Diffusion

Expressiveness and Approximation Properties of Graph Neural Networks

Learning strides in convolutional neural networks

Analytic-DPM: an Analytic Estimate of the Optimal Reverse Variance in Diffusion Probabilistic Models

Comparing Distributions by Measuring Differences that Affect Decision Making

Hyperparameter Tuning with Renyi Differential Privacy

Bootstrapped Meta-Learning

ICML

Self-Repellent Random Walks on General Graphs - Achieving Minimal Sampling Variance via Nonlinear Markov Chains

Generalization on the Unseen, Logic Reasoning and Degree Curriculum

A Watermark for Large Language Models

Learning-Rate-Free Learning by D-Adaptation

Bayesian Design Principles for Frequentist Sequential Learning

Adapting to game trees in zero-sum imperfect information game

Causal Conceptions of Fairness and their Consequences

Solving Stackelberg Prediction Game with Least Squares Loss via Spherically Constrained Least Squares Reformulation

Privacy for Free: How does Dataset Condensation Help Privacy?

Bayesian Model Selection, the Marginal Likelihood, and Generalization

IJCAI

Levin Tree Search with Context Models

SAT-Based PAC Learning of Description Logic Concepts

Safe Reinforcement Learning via Probabilistic Logic Shields

QCDCL with Cube Learning or Pure Literal Elimination - What is best?

Completeness and Diversity in Depth-First Proof-Number Search with Applications to Retrosynthesis

Plurality Veto: A Simple Voting Rule Achieving Optimal Metric Distortion

KDD

All in One: Multi-Task Prompting for Graph Neural Networks

Improving Training Stability for Multitask Ranking Models in Recommender Systems

Learning Causal Effects on Hypergraphs

FederatedScope-GNN: Towards a Unified, Comprehensive and Efficient Package for Federated Graph Learning

NEURIPS

Privacy Auditing with One (1) Training Run

Are Emergent Abilities of Large Language Models a Mirage?

Is Out-of-Distribution Detection Learnable?

On-Demand Sampling: Learning Optimally from Multiple Distributions

Beyond neural scaling laws: beating power law scaling via data pruning

ProcTHOR: Large-Scale Embodied AI Using Procedural Generation

High-dimensional limit theorems for SGD: Effective dynamics and critical scaling

A Neural Corpus Indexer for Document Retrieval

Elucidating the Design Space of Diffusion-Based Generative Models

Using Natural Language and Program Abstractions to Instill Human Inductive Biases in Machines

SIGIR