A multimodal search feature that uses AI to enhance the way customers search for products
[Formats]: text, images, audio, or video
Origin
GenAI Innovation Days: a two-day annual hackathon themed around GenAI,
where [Senior Data Scientist Surbhi Mathur] introduced a multimodal image-and-text input search.
- The query consists of a seed image accompanied by a relative text input describing the desired changes from the seed image.
- Because the query was composed of both image and text, we chose a multimodal model architecture such as Contrastive Language–Image Pre-training (CLIP) from OpenAI, which embeds images and text in the same space.
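Because image and text land in the same embedding space, the composed query described above can be treated as a single vector and compared against catalog embeddings. The following is a minimal sketch of that idea, not the team's actual implementation: the toy 3-dimensional vectors stand in for real CLIP embeddings, and the weighted-sum fusion, the `text_weight` parameter, and all function and product names are assumptions made for illustration.

```python
import math

def normalize(vec):
    """Scale a vector to unit length so dot products equal cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]

def compose_query(image_emb, text_emb, text_weight=0.5):
    """Fuse a seed-image embedding with a text embedding (assumed: weighted sum)."""
    if len(image_emb) != len(text_emb):
        raise ValueError("embeddings must share the same dimension")
    img = normalize(image_emb)
    txt = normalize(text_emb)
    fused = [(1 - text_weight) * i + text_weight * t for i, t in zip(img, txt)]
    return normalize(fused)

def rank_catalog(query_emb, catalog):
    """Return (product_id, cosine similarity) pairs, best match first."""
    scored = []
    for product_id, emb in catalog.items():
        unit = normalize(emb)
        similarity = sum(q * p for q, p in zip(query_emb, unit))
        scored.append((product_id, similarity))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)

# Toy stand-ins for CLIP outputs (hypothetical values, 3 dimensions only).
seed_image = [1.0, 0.0, 0.0]    # e.g. a photo of a blue short-sleeve shirt
change_text = [0.0, 1.0, 0.0]   # e.g. "with long sleeves"
catalog = {
    "blue_short_sleeve": [1.0, 0.0, 0.0],
    "blue_long_sleeve": [1.0, 1.0, 0.0],
    "red_dress": [0.0, 0.0, 1.0],
}

query = compose_query(seed_image, change_text)
ranking = rank_catalog(query, catalog)
for product_id, similarity in ranking:
    print(f"{product_id}: {similarity:.3f}")
```

In this toy setup the product that matches both the seed image and the requested change ranks first, ahead of the product that matches the seed image alone. In practice the embeddings would come from the image and text encoders of a CLIP-style model, and the fusion step could be learned rather than a fixed weighted sum.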
Search Feature Changes Over Time
- Exact matches between user queries and our product catalog
- Click-based model, leveraging clickstream data to better understand user intent
- Demand-based approach, using data from high-demand products or categories
- For lower-demand products or categories, where demand data is limited, a hybrid approach combining text-based and click-based data
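One way to picture the hybrid step is a score that leans on text matching when click data is sparse and shifts toward click data as it accumulates. The sketch below is purely illustrative and is not taken from the source: the token-overlap text score, the blending formula, the `full_trust_clicks` and `max_click_weight` parameters, and the sample products are all assumptions.

```python
def text_score(query, title):
    """Fraction of query tokens that also appear in the product title."""
    query_tokens = set(query.lower().split())
    title_tokens = set(title.lower().split())
    if not query_tokens:
        return 0.0
    return len(query_tokens & title_tokens) / len(query_tokens)

def hybrid_rank(query, products, clicks, full_trust_clicks=50, max_click_weight=0.7):
    """Rank products by a blend of text match and click popularity.

    products: dict of product_id -> title
    clicks:   dict of product_id -> clicks recorded for this query
    The click weight grows with the total clicks seen for the query, so
    low-demand queries fall back almost entirely on the text score.
    """
    counts = {pid: clicks.get(pid, 0) for pid in products}
    total_clicks = sum(counts.values())
    top_clicks = max(counts.values(), default=0)
    click_weight = min(total_clicks / full_trust_clicks, 1.0) * max_click_weight

    scored = []
    for pid, title in products.items():
        # Relative popularity in [0, 1]; zero when there is no click data at all.
        click_part = counts[pid] / top_clicks if top_clicks else 0.0
        score = (1 - click_weight) * text_score(query, title) + click_weight * click_part
        scored.append((pid, score))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)

# Hypothetical catalog and click logs.
products = {
    "p1": "red running shoes",
    "p2": "red shoes for men",
    "p3": "blue shoes",
}
low_demand = hybrid_rank("red shoes", products, {"p3": 2})
high_demand = hybrid_rank("red shoes", products, {"p2": 80, "p1": 10})
print(low_demand)
print(high_demand)
```

With only two recorded clicks, the ranking is driven almost entirely by text overlap, so the weak text match stays last despite its clicks. With ninety recorded clicks, the most-clicked product moves to the top.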
Flipkart’s journey into generative AI technology began with the introduction of Flippi, its AI-powered shopping chat assistant.