Conference | Adobe Media and Data Science Research (MDSR) Laboratory

Harnessing GANs for Zero-shot Learning of New Classes in Visual Speech Recognition

Visual Speech Recognition (VSR) is the process of recognizing or interpreting speech by watching the lip movements of the speaker. …

Yaman Kumar Singla, Dhruva Sharawat, Shubham Maheshwari, Debanjan Mahata, Rajiv Ratn Shah, Yifang Yin, Roger Zimmermann, Amanda Stent

Keyphrase Extraction as Sequence Labeling Task using Transformers

In this paper, we formulate keyphrase extraction from scholarly articles as a sequence labeling task solved using a BiLSTM-CRF, where …

Dhruva Sahrawat, Debanjan Mahata, Raymond Zhang, Mayank Kulkarni, Agniv Sharma, Rakesh Gosangi, Amanda Stent, Yaman Kumar Singla, Rajiv Ratn Shah, Roger Zimmermann

Learning based Methods for Code Runtime Complexity Prediction

Predicting the runtime complexity of a programming code is an arduous task. In fact, even for humans, it requires a subtle analysis and …

Jagriti Sikka, Kushal Satya, Yaman Kumar Singla, Shagun Uppal, Rajiv Ratn Shah, Roger Zimmermann

MixBoost: Synthetic Oversampling with Boosted Mixup for Handling Extreme Imbalance

Training a classification model on a dataset where the instances of one class outnumber those of the other class is a challenging …

Anubha Kabra, Ayush Chopra, Nikaash Puri, Pinkesh Badjatiya, Sukriti Verma, Piyush Gupta, Balaji Krishnamurthy

Multi-Modal Association based Grouping for Form Structure Extraction

Document structure extraction has been a widely researched area for decades. Recent work in this direction has been deep …

Milan Aggarwal, Mausoom Sarkar, Hiresh Gupta, Balaji Krishnamurthy

Retrospective Loss: Looking Back to Improve Training of Deep Neural Networks

Deep neural networks (DNNs) are powerful learning machines that have enabled breakthroughs in several domains. In this work, we …

Surgan Jandial, Ayush Chopra, Mausoom Sarkar, Piyush Gupta, Balaji Krishnamurthy, Vineeth Balasubramanian

ShapeVis: High-dimensional Data Visualization at Scale

We present ShapeVis, a scalable visualization technique for point cloud data inspired from topological data analysis. Our method …

Nupur Kumari, Siddarth R., Akash Rupela, Piyush Gupta, Balaji Krishnamurthy

SieveNet: A Unified Framework for Robust Image-based Virtual Try-On

Image-based virtual try-on for fashion has attracted considerable attention recently. The task requires trying on the desired clothing …

Ayush Chopra, Surgan Jandial, Kumar Ayush, Mayur Hemani, Balaji Krishnamurthy

SimPropNet: Improved Similarity Propagation for Few-shot Image Segmentation

Few-shot segmentation (FSS) methods perform image segmentation for a particular object class in a target (query) image, using a small …

Siddhartha Gairola, Mayur Hemani, Ayush Chopra, Balaji Krishnamurthy

Towards A Unified Framework for Visual Compatibility Prediction

Visual compatibility prediction refers to the task of determining if a set of items go well together. Existing techniques for …

Ayush Chopra, Kumar Ayush, Anirudh Singhal, Utkarsh Patel, Balaji Krishnamurthy