Adobe Media and Data Science Research (MDSR) Laboratory
Adobe Media and Data Science Research (MDSR) Laboratory
News
Publications
People
Join us!
Collaborators
Social Agents: Collective Intelligence Improves LLM Predictions
In human society, collective decision making has often outperformed the judgment of individuals. Classic examples range from estimating …
Aanisha Bhattacharyya
,
Abhilekh Borah
,
Yaman Kumar Singla
,
Rajiv Ratn Shah
,
Changyou Chen
,
Balaji Krishnamurthy
PDF
Cite
ALPHA: Action-Based Learning for Pluralistic Human Alignment in Large Language Models
Large language models are widely used, but aligning them with societal values remains challenging. Current approaches often rely on …
Aanisha Bhattacharyya
,
Susmit Aggarwal
,
Yaman Kumar Singla
,
Tarun Menta
,
Nikitha SR
,
Rajiv Ratn Shah
,
Changyou Chen
,
Balaji Krishnamurthy
PDF
Code
Dataset
Project
Poster
BrandFusion: Aligning Image Generation with Brand Styles
While recent text-to-image models excel at generating realistic content, they struggle to capture the nuanced visual characteristics …
Parul
,
Varun Khurana
,
Yaman Kumar Singla
,
Balaji Krishnamurthy
,
Abhinav Dhall
PDF
Cite
SPRO: Improving Image Generation via Self-Play
Recent advances in diffusion models have dramatically improved image fidelity and diversity. However, aligning these models with …
Ritika Jha
,
Aanisha Bhattacharyya
,
Yaman Kumar Singla
,
Rajiv Ratn Shah
,
Changyou Chen
,
Balaji Krishnamurthy
PDF
Cite
Project
Evaluating Variance in Visual Question Answering Benchmarks
Multimodal large language models (MLLMs) have emerged as powerful tools for visual question answering (VQA), enabling reasoning and …
Nikitha SR
PDF
HIRE: Lightweight High-Resolution Image Feature Enrichment for Multimodal LLMs
The integration of high-resolution image features in modern multimodal large language models has demonstrated significant improvements …
Nikitha SR
,
Aradhya Neeraj Mathur
,
Tarun Ram Menta
,
Rishabh Jain
,
Mausoom Sarkar
PDF
Learning Together to Perform Better: Teaching Small-Scale LLMs to Collaborate via Preferential Rationale Tuning
LLMs such as GPT-4 have shown a remarkable ability to solve complex questions by generating step-by-step rationales. Prior works have …
Sohan Patnaik
,
Milan Aggarwal
,
Sumit Bhatia
,
Balaji Krishnamurthy
PDF
Cite
Code
EOPose: Exemplar-based object reposing using Generalized Pose Correspondences
Reposing generic objects without the use of 3D models poses a significant challenge due to the absence of a standardized pose …
Sarthak Mehrotra
,
Rishabh Jain
,
Mayur Hemani
,
Balaji Krishnamurthy
,
Mausoom Sarkar
PDF
AesthetiQ: Enhancing Graphic Layout Design via Aesthetic-Aware Preference Alignment
Numerous pose-guided human editing methods have been explored by the vision community due to their extensive practical applications. …
Sohan Patnaik
,
Rishabh Jain
,
Balaji Krishnamurthy
,
Mausoom Sarkar
PDF
Cite
Project
LLaVA Finds Free Lunch: Teaching Human Behavior Improves Content Understanding Abilities Of LLMs
Communication is defined as “Who says what to whom with what effect”. A message from a communicator generates downstream …
Somesh Singh
,
S I Harini
,
Yaman Kumar Singla
,
Veeky Baths
,
Rajiv Ratn Shah
,
Changyou Chen
,
Balaji Krishnamurthy
PDF
Cite
»
Cite
×