Adobe Media and Data Science Research (MDSR) Laboratory
Adobe Media and Data Science Research (MDSR) Laboratory
News
Publications
People
Join us!
Collaborators
MixBoost: Synthetic Oversampling with Boosted Mixup for Handling Extreme Imbalance
Training a classification model on a dataset where the instances of one class outnumber those of the other class is a challenging …
Anubha Kabra
,
Ayush Chopra
,
Nikaash Puri
,
Pinkesh Badjatiya
,
Sukriti Verma
,
Piyush Gupta
,
Balaji Krishnamurthy
PDF
Multi-Modal Association based Grouping for Form Structure Extraction
Document structure extraction has been a widely researched area for decades. Recent work in this direction has been deep …
Milan Aggarwal
,
Mausoom Sarkar
,
Hiresh Gupta
,
Balaji Krishnamurthy
PDF
Retrospective Loss: Looking Back to Improve Training of Deep Neural Networks
Deep neural networks (DNNs) are powerful learning machines that have enabled breakthroughs in several domains. In this work, we …
Surgan Jandial
,
Ayush Chopra
,
Mausoom Sarkar
,
Piyush Gupta
,
Balaji Krishnamurthy
,
Vineeth Balasubramanian
PDF
ShapeVis: High-dimensional Data Visualization at Scale
We present ShapeVis, a scalable visualization technique for point cloud data inspired from topological data analysis. Our method …
Nupur Kumari
,
Siddarth R.
,
Akash Rupela
,
Piyush Gupta
,
Balaji Krishnamurthy
PDF
SieveNet: A Unified Framework for Robust Image-based Virtual Try-On
Image-based virtual try-on for fashion has attracted considerable attention recently. The task requires trying on the desired clothing …
Ayush Chopra
,
Surgan Jandial
,
Kumar Ayush
,
Mayur Hemani
,
Balaji Krishnamurthy
PDF
SimPropNet: Improved Similarity Propagation for Few-shot Image Segmentation
Few-shot segmentation (FSS) methods perform image segmentation for a particular object class in a target (query) image, using a small …
Siddhartha Gairola
,
Mayur Hemani
,
Ayush Chopra
,
Balaji Krishnamurthy
PDF
Towards A Unified Framework for Visual Compatibility Prediction
Visual compatibility prediction refers to the task of determining if a set of items go well together. Existing techniques for …
Ayush Chopra
,
Kumar Ayush
,
Anirudh Singhal
,
Utkarsh Patel
,
Balaji Krishnamurthy
PDF
Harnessing the Vulnerability of Latent Layers in Adversarially Trained Models
Neural networks are vulnerable to adversarial attacks – small visually imperceptible crafted noise which when added to the input …
Mayank Singh
,
Abhishek Sinha
,
Nupur Kumari
,
Harshitha Machiraju
,
Balaji Krishnamurthy
,
Vineeth N Balasubramanian
PDF
Hush-Hush Speak: Speech Reconstruction Using Silent Videos
Speech Reconstruction is the task of recreation of speech using silent videos as input. In the literature, it is also referred to as …
Shashwat Uttam
,
Yaman Kumar Singla
,
Dhruva Sharawat
,
Mansi Aggarwal
,
Debanjan Mahata
,
Rajiv Ratn Shah
,
Amanda Stent
PDF
Lipper: Synthesizing Thy Speech using Multi-View Lipreading
Lipreading has a lot of potential applications such as in the domain of surveillance and video conferencing. Despite this, most of the …
Yaman Kumar Singla
,
Rohit Jain
,
Khwaja Mohd. Salik
,
Rajiv Ratn Shah
,
Yifang Yin
,
Roger Zimmerman
PDF
«
»
Cite
×