Adobe Media and Data Science Research (MDSR) Laboratory
Adobe Media and Data Science Research (MDSR) Laboratory
People
Join us!
Publications
Collaborators
Explaining RL Decisions with Trajectories
Explanation is a key component for the adoption of reinforcement learning (RL) in many real-world decision-making problems. In the …
Shripad Vilasrao Deshmukh
,
Arpan Dasgupta
,
Balaji Krishnamurthy
,
Nan Jiang
,
Chirag Agarwal
,
Georgios Theocharous
,
Jayakumar Subramanian
PDF
Robustness and Sample Complexity of Model-Based MARL for General Sum Markov Games
Multi-agent reinforcement learning (MARL) is often modeled using the framework of Markov games (also called stochastic games or dynamic …
Jayakumar Subramanian
,
Amit Sinha
,
Aditya Mahajan
PDF
SARC: Soft Actor Retrospective Critic
The two-time scale nature of SAC, which is an actor-critic algorithm, is characterised by the fact that the critic estimate has not …
Sukriti Verma
,
Jayakumar Subramanian
,
Ayush Chopra
,
Nikaash Puri
,
Mausoom Sarkar
,
Piyush Gupta
,
Balaji Krishnamurthy
PDF
Status-quo policy gradient in Multi-Agent Reinforcement Learning
Individual rationality, which involves maximizing expected individual returns, does not always lead to high-utility individual or group …
Pinkesh Badjatiya
,
Mausoom Sarkar
,
Nikaash Puri
,
Jayakumar Subramanian
,
Abhishek Sinha
,
Siddharth Singh
,
Balaji Krishnamurthy
PDF
Medical Dead-ends and Learning to Identify High-Risk States and Treatments
Machine learning has successfully framed many sequential decision making problems as either supervised prediction, or optimal …
Fatemi M.
,
Killian T.
,
Subramanian J.
,
Ghassemi M.
PDF
An Empirical Study of Representation Learning for Reinforcement Learning in Healthcare
Reinforcement Learning (RL) has recently been applied to sequential estimation and prediction problems identifying and developing …
Killian T.
,
Zhang H.
,
Subramanian J.
,
Fatemi M.
,
Ghassemi M.
PDF
Explain Your Move: Understanding Agent Actions Using Specific and Relevant Feature Attribution
As deep reinforcement learning (RL) is applied to more tasks, there is a need to visualize and understand the behavior of learned …
Nikaash Puri
,
Sukriti Verma
,
Piyush Gupta
,
Dhruv Kayastha
,
Shripad Deshmukh
,
Balaji Krishnamurthy
,
Sameer Singh
PDF
Towards A Unified Framework for Visual Compatibility Prediction
Visual compatibility prediction refers to the task of determining if a set of items go well together. Existing techniques for …
Ayush Chopra
,
Kumar Ayush
,
Anirudh Singhal
,
Utkarsh Patel
,
Balaji Krishnamurthy
PDF
Powering Robust Fashion Retrieval with Information Rich Feature Embeddings
Visual content based product retrieval has become increasingly important for e-commerce. Fashion retrieval, in particular, is a …
Ayush Chopra
,
Abhishek Sinha
,
Mausoom Sarkar
,
Hiresh Gupta
,
Kumar Ayush
,
Balaji Krishnamurthy
PDF
Cite
×