Adobe Media and Data Science Research (MDSR) Laboratory
What do audio transformer models hear? Probing Acoustic Representations for Language Delivery and its Structure
Transformer models across multiple domains such as natural language processing and speech form an unavoidable part of the tech stack of …
Yaman Kumar Singla, Jui Shah, Rajiv Ratn Shah, Changyou Chen
Approximate Information State for Approximate Planning and Reinforcement Learning in Partially Observed Systems
We propose a theoretical framework for approximate planning and learning in partially observed systems. Our framework is based on the …
Jayakumar Subramanian, Amit Sinha, Raihan Seraj, Aditya Mahajan
Automated Speech Scoring System Under The Lens: Evaluating and interpreting the linguistic cues for language proficiency
English proficiency assessments have become a necessary metric for filtering and selecting prospective candidates for both academia and …
Manraj Grover, Pakhi Bamdev, Yaman Kumar Singla, Payman Vafaee, Mika Hama, Rajiv Ratn Shah
CoSe-Co: Text Conditioned Generative CommonSense Contextualizer
Pre-trained Language Models (PTLMs) have been shown to perform well on natural language tasks. Many prior works have leveraged …
Rachit Bansal, Jivat Meet Kaur, Milan Aggarwal, Sumit Bhatia, Balaji Krishnamurthy
CyCLIP: Cyclic Contrastive Language-Image Pretraining
Recent advances in contrastive representation learning over paired image-text data have led to models such as CLIP that achieve …
Shashank Goel, Hritik Bansal, Sumit Bhatia, Ryan A. Rossi, Vishwa Vinay, Aditya Grover
Data Instance Prior for Transfer Learning in GANs
Recent advances in generative adversarial networks (GANs) have shown remarkable progress in generating high-quality images. However, …
Puneet Mangla, Nupur Kumari, Mayank Singh, Vineeth N. Balasubramanian, Balaji Krishnamurthy
DynamicTOC: Persona-based Table of Contents for Consumption of Long Documents
Long documents like contracts, financial documents, etc., are often tedious to read through. Linearly consuming (via scrolling or …
Himanshu Maheshwari, Nethraa Sivakumar, Shelly Jain, Tanvi Karandikar, Vinay Aggarwal, Navita Goyal, Sumit Shekhar
Harmonized Banner Creation from Multimodal Design Assets
Designing aesthetically pleasing single-page graphic designs (“banners”) that appeal to the target recipients is non-trivial and …
Praneetha Vaddamanu, Vinay Aggarwal, Bhanu Prakash Reddy Guda, Balaji Vasan Srinivasan, Niyati Chhaya
LM-CORE: Language Models with Contextually Relevant External Knowledge
Large transformer-based pre-trained language models have achieved impressive performance on a variety of knowledge-intensive tasks and …
Rachit Bansal, Jivat Meet Kaur, Milan Aggarwal, Sumit Bhatia, Balaji Krishnamurthy
Minimal: Mining Models for Universal Adversarial Triggers
It is well known that natural language models are vulnerable to adversarial attacks, which are mostly input-specific in nature. …
Yaman Kumar Singla, Swapnil Parekh, Somesh Singh, Changyou Chen, Balaji Krishnamurthy, Rajiv Ratn Shah