Adobe Media and Data Science Research (MDSR) Laboratory
Explaining RL Decisions with Trajectories
Explanation is a key component for the adoption of reinforcement learning (RL) in many real-world decision-making problems. In the …
Shripad Vilasrao Deshmukh, Arpan Dasgupta, Balaji Krishnamurthy, Nan Jiang, Chirag Agarwal, Georgios Theocharous, Jayakumar Subramanian
PDF
Robustness and Sample Complexity of Model-Based MARL for General Sum Markov Games
Multi-agent reinforcement learning (MARL) is often modeled using the framework of Markov games (also called stochastic games or dynamic …
Jayakumar Subramanian, Amit Sinha, Aditya Mahajan
PDF
SARC: Soft Actor Retrospective Critic
The two-time scale nature of SAC, which is an actor-critic algorithm, is characterised by the fact that the critic estimate has not …
Sukriti Verma, Jayakumar Subramanian, Ayush Chopra, Nikaash Puri, Mausoom Sarkar, Piyush Gupta, Balaji Krishnamurthy
PDF
Status-quo policy gradient in Multi-Agent Reinforcement Learning
Individual rationality, which involves maximizing expected individual returns, does not always lead to high-utility individual or group …
Pinkesh Badjatiya, Mausoom Sarkar, Nikaash Puri, Jayakumar Subramanian, Abhishek Sinha, Siddharth Singh, Balaji Krishnamurthy
PDF
Medical Dead-ends and Learning to Identify High-Risk States and Treatments
Machine learning has successfully framed many sequential decision making problems as either supervised prediction, or optimal …
Fatemi M., Killian T., Subramanian J., Ghassemi M.
PDF
An Empirical Study of Representation Learning for Reinforcement Learning in Healthcare
Reinforcement Learning (RL) has recently been applied to sequential estimation and prediction problems identifying and developing …
Killian T., Zhang H., Subramanian J., Fatemi M., Ghassemi M.
PDF
Explain Your Move: Understanding Agent Actions Using Specific and Relevant Feature Attribution
As deep reinforcement learning (RL) is applied to more tasks, there is a need to visualize and understand the behavior of learned …
Nikaash Puri, Sukriti Verma, Piyush Gupta, Dhruv Kayastha, Shripad Deshmukh, Balaji Krishnamurthy, Sameer Singh
PDF
Towards A Unified Framework for Visual Compatibility Prediction
Visual compatibility prediction refers to the task of determining if a set of items go well together. Existing techniques for …
Ayush Chopra, Kumar Ayush, Anirudh Singhal, Utkarsh Patel, Balaji Krishnamurthy
PDF
Powering Robust Fashion Retrieval with Information Rich Feature Embeddings
Visual content based product retrieval has become increasingly important for e-commerce. Fashion retrieval, in particular, is a …
Ayush Chopra, Abhishek Sinha, Mausoom Sarkar, Hiresh Gupta, Kumar Ayush, Balaji Krishnamurthy
PDF

© 2025 Adobe. All rights reserved.
