Smaller Beats Bigger: Developing a Custom VLM for Marketing Image Evaluation

Date: Monday, May 11

Start Time: 2:40 pm

End Time: 3:10 pm

What makes an image effective for marketing is subtle, and we’ve found that general-purpose vision-language models often miss the cues practitioners care about. In this talk, we present MarketingGenie, a domain-specialized VLM trained on ~20K marketing images annotated by experts for composition, lighting, emotional appeal and storytelling. MarketingGenie is ~100x smaller than GPT-4o yet scores significantly higher on marketing-specific evaluations. We’ll share the techniques that made it work: how we defined “marketing quality” and converted expert labels into consistent QA pairs, why fine-tuning an open model (LLaVA-8B) beat using a large API model for cost and controllability and how a multi-encoder design (CLIP plus aesthetic and human-preference encoders) with learnable adapters improved critique quality. We’ll also cover our data-mixture strategy to avoid catastrophic forgetting, a calibrated scoring head that grounds numeric ratings in reference images and how we measure scoring accuracy against human judgments.

Track

2:40 PM

Session Speakers

Shradha Agrawal
ML Lead, Adobe

Shradha Agrawal is an ML Lead for Generative AI at Adobe, where she drives applied research and production AI systems powering Adobe Firefly and Adobe GenStudio—Adobe’s flagship generative AI products for enterprise marketers. With ten-plus years building at the frontier of AI, her work spans vision-language models, diffusion architectures, agentic systems and human-calibrated evaluation frameworks. Her research contributions include multiple publications at CVPR and ten-plus USPTO patents in image generation, content personalization and brand-aware AI. Shradha is an Institute Gold Medalist from the Indian Institute of Technology (BHU) Varanasi and holds an MS in Computer Science from UC San Diego. Outside Adobe, she is a regular at AI hackathons. Shradha is deeply bullish on AI systems that don't just generate, but understand and reason about content quality—and is actively building toward that frontier.

Smaller Beats Bigger: Developing a Custom VLM for Marketing Image Evaluation

Track

Session Speakers

Shradha Agrawal

See you May 11-13, 2026 in Silicon Valley, California

Sponsors and Exhibitors

Get in Touch

Share