Reinforcement Learning | Adobe Media and Data Science Research (MDSR) Laboratory

Explaining RL Decisions with Trajectories

Explanation is a key component for the adoption of reinforcement learning (RL) in many real-world decision-making problems. In the …

Shripad Vilasrao Deshmukh, Arpan Dasgupta, Balaji Krishnamurthy, Nan Jiang, Chirag Agarwal, Georgios Theocharous, Jayakumar Subramanian

Robustness and Sample Complexity of Model-Based MARL for General Sum Markov Games

Multi-agent reinforcement learning (MARL) is often modeled using the framework of Markov games (also called stochastic games or dynamic …

Jayakumar Subramanian, Amit Sinha, Aditya Mahajan

SARC: Soft Actor Retrospective Critic

The two-time scale nature of SAC, which is an actor-critic algorithm, is characterised by the fact that the critic estimate has not …

Sukriti Verma, Jayakumar Subramanian, Ayush Chopra, Nikaash Puri, Mausoom Sarkar, Piyush Gupta, Balaji Krishnamurthy

Status-quo policy gradient in Multi-Agent Reinforcement Learning

Individual rationality, which involves maximizing expected individual returns, does not always lead to high-utility individual or group …

Pinkesh Badjatiya, Mausoom Sarkar, Nikaash Puri, Jayakumar Subramanian, Abhishek Sinha, Siddharth Singh, Balaji Krishnamurthy

Medical Dead-ends and Learning to Identify High-Risk States and Treatments

Machine learning has successfully framed many sequential decision making problems as either supervised prediction, or optimal …

Fatemi M., Killian T., Subramanian J., Ghassemi M.

 An Empirical Study of Representation Learning for Reinforcement Learning in Healthcare

Reinforcement Learning (RL) has recently been applied to sequential estimation and prediction problems identifying and developing …

Killian T., Zhang H., Subramanian J., Fatemi M., Ghassemi M.

Explain Your Move: Understanding Agent Actions Using Specific and Relevant Feature Attribution

As deep reinforcement learning (RL) is applied to more tasks, there is a need to visualize and understand the behavior of learned …

Nikaash Puri, Sukriti Verma, Piyush Gupta, Dhruv Kayastha, Shripad Deshmukh, Balaji Krishnamurthy, Sameer Singh

Towards A Unified Framework for Visual Compatibility Prediction

Visual compatibility prediction refers to the task of determining if a set of items go well together. Existing techniques for …

Ayush Chopra, Kumar Ayush, Anirudh Singhal, Utkarsh Patel, Balaji Krishnamurthy

Powering Robust Fashion Retrieval with Information Rich Feature Embeddings

Visual content based product retrieval has become increasingly important for e-commerce. Fashion retrieval, in particular, is a …

Ayush Chopra, Abhishek Sinha, Mausoom Sarkar, Hiresh Gupta, Kumar Ayush, Balaji Krishnamurthy