Tag: Visual Discovery

  • Boost Engagement with Multi-Image Shopping Ads on Microsoft

    Boost Engagement with Multi-Image Shopping Ads on Microsoft

    I’ve discovered that Microsoft Advertising is rolling out a captivating new feature that could transform how we see Shopping campaigns in Bing search results. These multi-image ads offer eCommerce brands a unique opportunity to showcase their products more vividly, potentially capturing shopper attention even before they click.

    What’s new. Now, I can include multiple product images in a single Shopping ad, allowing shoppers to preview various angles, styles, or variations directly within the search results. This approach could be a game-changer for advertisers.

    The design is crafted to enhance visual engagement and provide more informative ads. It allows consumers like myself to quickly compare options without the need to leave the results page.

    How it works:

    • I can upload additional images using the optional additional_image_link attribute in the product feed.
    • There is an option to include up to 10 images, which I can separate by commas.
    • The images will appear alongside pricing and retailer information in Shopping results.
    ```json
{
  "alt": "Online shopping results for Reebok Nano X5 Edge sneakers showing various styles and prices.",
  "caption": "Explore a range of Reebok Nano X5 Edge sneakers in different colors, available from multiple retailers at competitive prices.",
  "description": "The image displays online shopping results for Reebok Nano X5 Edge sneakers via a search engine. It showcases multiple sneaker styles, including white, grey, and black versions, with prices ranging from $184.91 to $220.00. Retailers like The Iconic and Amazon AU are highlighted, offering these products with different features, such as free shipping. Keywords: Reebok, Nano X5 Edge, sneakers, online shopping, footwear."
}
```

    Why we care. From my perspective, multi-image ads have the potential to boost engagement and purchase intent by offering a more comprehensive visual representation of a product. More imagery can highlight features, colors, and design elements that a single image might miss.

    Discovery. This feature was initially noticed by digital marketer Arpan Banerjee, who shared it on LinkedIn.

    The bottom line. For retailers like you and me, multi-image Shopping ads provide more creative freedom and give shoppers a richer context immediately. This shift has the potential to enhance ad performance and reshape how products are presented in search results.


    Inspired by this post on Search Engine Land.


    crushpress.ai community screenshot
  • Make Your Products Stand Out in Multimodal AI Search

    Make Your Products Stand Out in Multimodal AI Search

    As an ecommerce enthusiast, I know how crucial it is for our products to be easily understandable by AI systems. In today’s visually-driven market, designing images that AI can interpret accurately, from OCR-ready labels to visuals aligned with sentiment, is essential.

    The power of images and videos to tell complex stories instantly is unparalleled. In our digital store, these visuals are not just content—they are tools that aid in making purchase decisions.

    Generative search systems capture objects, embedded text, and style to deduce potential use cases. Language Learning Models (LLMs) then bring to light the assets that best respond to a shopper’s inquiries. Essentially, each image becomes structured data that breaks down buying barriers, amplifying discoverability in multimodal searches when someone takes a photo or uploads a screenshot.

    Visual search as a shopping behavior

    Our customers often use visual search for quick decision-making: snapping photos, scanning labels, or comparing products to decide “Will this work for me?” It’s vital that our photos fulfill this need, showing scale, size cues, real colors, and comparisons.

    Multimodal search reshaping behaviors

    With visual search on the rise, Google Lens handling 20 billion monthly queries mostly from younger users, it’s a clear sign of changing behaviors. These behaviors fall into distinct intent categories.

    Quick capture and identification

    ```json
{
  "alt": "Screenshot of Mark Williams-Cook's LinkedIn post discussing SEO intent and PAA results with questions about Dr. Martens and inclusivity.",
  "caption": "Unlocking SEO Potential: Mark Williams-Cook advises on using intent-focused strategies with PAA results, highlighting inclusivity inquiries for Dr. Martens.",
  "description": "This image shows a LinkedIn post by Mark Williams-Cook discussing SEO strategies using 'People Also Ask' results. He suggests focusing on user intent rather than keywords, with questions about Dr. Martens' inclusivity and representation of women over 40. The post emphasizes exploring multi-modal communication methods and ensuring inclusive marketing strategies, particularly related to LGBTQ support and visibility across platforms."
}
```

    Taking a photo to identify an item (like “What plant is this?”) helps with quick recognition and troubleshooting, accelerating issue resolution and product verification.

    Visual comparison

    By showing a product and asking systems to “find a dupe” or analyze “room style,” we bypass complex descriptions, promoting faster cross-category shopping and suitability checks.

    Information processing

    Displaying ingredient lists or foreign texts prompts real-time data conversion, avoiding manual reentry or the need for alternative instruction sources.

    Modification search

    Asking for product variations like “this but in blue” allows for specific attribute searches without chasing model numbers, indicating a shift from text-based navigation to visual exploration.

    ```json
{
  "alt": "Comparison of original and updated Cetaphil product labels with text on branding strategy.",
  "caption": "Cetaphil updates product details to align better with language models, ensuring clear relay of brand information.",
  "description": "The image showcases the original and updated labels for a Cetaphil product. The updates include more detailed information on the product's benefits, emphasizing its gentle formulation for sensitive skin, and a focus on compatibility with digital language models. The surrounding text highlights the brand's strategy to enhance online product listing communication."
}
```

    Multimodal AI has made instant recognition, decision support, and creative exploration accessible, reducing friction in ecommerce and information journeys.

    You can check a detailed table of multimodal visual search types here.

    Further Reading: How multimodal discovery is redefining SEO in the AI era

    Prioritizing content and quality for purchase decisions

    We must ensure that our product images spotlight the details customers care about, like pockets or stitching. Images convey these abstract ideas authentically, prompting shoppers to answer questions such as whether a particular style is suitable for them.

    Original images are crucial; they highlight effort, uniqueness, and skill, making our content more personable and credible.

    Making products machine-readable for image vision

    ```json
{
  "alt": "Shelf with brown packages of oat protein labeled 'UPFRONT' and 'WORSE,' Google Lens translation overlay.",
  "caption": "Exploring the challenge of using Google Lens to translate oat protein package text, highlighting issues with current machine vision capabilities.",
  "description": "The image shows two brown packages labeled 'UPFRONT' and 'WORSE', marketed as oat protein, displayed on a store shelf. Above the packages, a Google Lens overlay shows an attempt to translate the text from Dutch to English. The photo highlights the limitations of machine vision in reading product packaging. The surrounding social media discussion on the right reflects on multi-modal search experiences and the struggles faced by AI in interpreting such text, emphasizing the potential barriers in product information accessibility."
}
```

    For products to be machine-readable, all visual elements need to be easily interpreted by AI. This begins with the design of images and packaging.

    Products and packaging as landing pages

    Ecommerce packaging should be crafted like a digital asset, thriving in a world driven by multimodal AI searches.

    If AI or search engines fail to read packaging, the product might as well be invisible at the peak of consumer interest.

    Designing for OCR-friendliness and authenticity

    Google Lens and leading LLMs employ optical character recognition (OCR) to extract and index data from physical goods. Therefore, text and visuals on our packaging need to be OCR-friendly.

    Use high-contrast color schemes—black text on white backgrounds is ideal. Ensure that critical information is in clean, sans-serif fonts on solid backgrounds without patterns. Treat physical product labeling with the same care as a landing page, much like Cetaphil does.

    ```json
{
  "alt": "Two people discussing a screen about ChatGPT product origins with statistics and razor product listings.",
  "caption": "Discover how ChatGPT's product sourcing is changing the landscape: 36% of products from original brands, while 64% link to other merchants. What does this mean for consumers?",
  "description": "This image shows a video call between two individuals discussing the sourcing of products in ChatGPT, highlighted by a yellow screen with text stating 36% of products originate from the brand's own site, while 64% reference another merchant. The screen also displays product listings for electric razors from Best Buy and Walmart as examples. This discussion highlights the importance of understanding how consumers are being directed to different merchants."
}
```

    Avoid these common errors:

    • Low contrast.
    • Decorative or script fonts.
    • Busy patterns.
    • Curved or creased surfaces.
    • Glossy materials that disrupt text visibility.

    Document OCR fail points and analyze why they occur. Run a grayscale test to ensure text remains legible without color.

    Add a QR code to each product for direct access to a webpage with structured, machine-readable HTML information.

    High-resolution, multi-angle product images are optimal, especially for items needing authenticity checks. Genuine photos excel in accuracy and credibility, outperforming AI-generated images.

    Dive deeper: How to make ecommerce product pages work in an AI-first world

    Managing your brand’s visual knowledge graph

    ```json
{
  "alt": "L'Oréal Glycolic Gloss product search results showing videos and articles on suitability for fine wavy hair.",
  "caption": "Discover if L'Oréal Glycolic Gloss is the right pick for your fine wavy hair with insights and reviews.",
  "description": "The image displays search results for L'Oréal Glycolic Gloss, highlighting its effectiveness for fine wavy hair. The results include video thumbnails and article snippets that discuss product usage, benefits, and reviews. It's suggested for those seeking shine and smoothness without weighing down fine hair. Keywords: L'Oréal, Glycolic Gloss, fine hair, wavy hair, product reviews."
}
```

    In an AI-driven context, it’s about more than just your product. AI builds contextual databases, examining every object in an image, which helps infer the brand’s market position.

    Elements like props, backgrounds, and adjacent items fine-tune our brand’s digital persona. With each visual placement, we send out signals—be it luxury, sportiness, or utility—all influencing the brand’s perception machine-wise.

    Guarding these adjacency signals is now intrinsic to brand management. Strategic curation helps AI accurately interpret our brand’s value, setting us up to appear in high-value conversational queries.

    Conduct a co-occurrence audit for brand context

    We should set up processes to evaluate brand context for multimodal AI searches systematically. Using tools like AI Modes, ChatGPT searches, or similar LLM models, gather relevant lifestyle or product photos to input into these systems. A prompt like:

    • “List each object in the image. From these, describe the potential owner.”

    This step enriches our understanding of the machine’s narrative, helping us adjust any disconnects, like misaligned perception due to unintended signals. From there, we craft specific guidelines for props, contextual elements, and visual do’s and don’ts for our creative teams to safeguard brand narrative.

    ```json
{
  "alt": "Google search results for 'Helly Hansen Nazi' with Wikipedia snippet about clothing brand appropriation.",
  "caption": "A Google search reveals concerns over the appropriation of the Helly Hansen logo by extremist groups, reflecting brand challenges in managing reputation.",
  "description": "This image shows a screenshot of Google search results for 'Helly Hansen Nazi.' The result highlights a Wikipedia entry discussing how the Helly Hansen clothing brand has been appropriated by far-right and neo-Nazi groups. The snippet points out that these groups have interpreted the brand's 'HH' logo in a controversial manner. The page includes navigation options like Products, Images, and Videos, with the Wikipedia link prominently displayed. This raises questions about brand image and reputation management in the digital age."
}
```

    Refining this alignment ensures that machines perceive our brand consistently with our strategic goals, bolstering our presence in new-gen search settings.

    Brand control across the visual layers

    Using the brand control quadrant, we efficiently manage brand visibility through machine interpretation, focusing on four key layers—some we own outright, others we can influence.

    Known brand layers

    Here, we have visuals like official logos and branded imagery, which are typically controlled and recognized by both our audience and AI.

    Visual strategy:

    • Create a visual knowledge database.
    • Regularly evaluate adjacent objects in brand visuals.
    • Develop an “Object Bible” to avoid narrative misalignment, ensuring lifestyle cues uphold our brand image.
    ```json
{
  "alt": "Google search results for 'helly hansen nazi' with Reddit link discussing the brand.",
  "caption": "Exploring the Helly Hansen brand's perception with Google search results and a Reddit discussion on possible controversies.",
  "description": "A Google search screenshot for 'helly hansen nazi' reveals a Reddit link discussing if the brand Helly Hansen is banned in Germany. The search snippet indicates a conversation about brands linked to Nazi associations. The results page includes multiple queries related to extremist fashion and brand perception. This image highlights discussions and controversies surrounding brand identity in social and political contexts. Keywords: Helly Hansen, Nazi, Reddit, brand controversy, Google search results."
}
```

    Latent brand

    These include “wild” images like user photos and social posts that can lead to unexpected inferences about our brand’s standing.

    • Audit these occurrences to prevent unintended associations.

    Shadow brand

    This involves old brand assets and materials that could be unintentionally made public, influencing AI’s interpretation of us.

    • Audit all public archives for outdated visuals; remove or update them.
    • Ensure that current branded visuals reflect our strategies.

    AI-narrated brand

    ```json
{
  "alt": "Screenshot of a search result on how to use L'Oréal Glycolic Gloss with video thumbnails and text instructions.",
  "caption": "Discover the secrets to smooth, glossy hair with L'Oréal Glycolic Gloss. Watch tutorials and follow detailed steps for salon-like results at home.",
  "description": "This image is a screenshot of a search result page on using L'Oréal Glycolic Gloss. It includes clickable video thumbnails, such as tutorials and reviews. Text instructions are provided in French, explaining how to apply the product for optimal hair care results. The image highlights related products and advice on achieving 'glass hair.' Great for anyone looking to enhance their hair care routine with professional tips."
}
```

    AI synthesizes narratives by blending visual and emotional cues with text, which could introduce competitor tones or mismatched perceptions.

    Visual strategy:

    • Use AI tools like Google Cloud Vision to verify tonal alignment.
    • Adjust mismatched assets to ensure narrative cohesion.

    Sentiment alignment: balancing visual tone and emotional context

    Beyond supplying information, images capture emotion and attention within moments, shaping customer perceptions.

    In AI-driven searches, this emotional resonance becomes a direct signal, evaluated for emotional tone, sentiment, and context.

    The affective quality of each image is assessed by LLMs, along with sentiment and contextual tone to match content with the user’s emotional state and intent.

    ```json
{
  "alt": "Smiling woman in an off-shoulder blue dress with highlighted facial recognition analysis.",
  "caption": "Capturing joy with accuracy! A woman beams joyfully in a stylish blue dress, as her facial expression is analyzed with remarkable confidence.",
  "description": "This image presents a woman wearing an elegant off-shoulder blue dress, smiling broadly. Facial recognition analysis rates her expression as very likely joyful, with minimal indicators of other emotions. The technical overlay includes a confidence score of 99% and slight facial orientation adjustments: roll 7°, tilt -4°, pan 7°. Ideal for fashion, emotion analytics, and photography discussions."
}
```

    We need to deliberately design and inspect our imagery’s emotional tone, using tools like Microsoft Azure’s Computer Vision API to:

    • Score emotions in images broadly.
    • Assess facial expressions for emotion probabilities, allowing imagery to be accurately targeted—like promoting calmness in a yoga line or confidence in business wear.

    Align image emotion with marketing targets. Ensure the imagery arouses the right emotions and resonates with our audience.

    Start by recognizing the emotional baseline in your imagery, rigorously testing for consistency with AI tools.

    Matching your brand narrative with AI perception

    We must focus on authenticity in product photos, ensuring every asset is designed for machine-readability and maintaining visual context and sentiment meticulously.

    Treat packaging and online visuals as digital assets; conduct regular audits for object proximity, emotional tone, and clear identification.

    AI will craft a narrative for our brand with or without guidance, so it’s essential to ensure every visual aligns with the intended story.


    Inspired by this post on Search Engine Land.


    crushpress.ai community screenshot
  • ChatGPT’s New Visual Feature: Transforming Search with Images

    ChatGPT’s New Visual Feature: Transforming Search with Images

    As I explore the latest updates to ChatGPT, I’m excited to share that it now incorporates more images into its answers, bringing a fresh, multimodal approach to search. This enhancement makes images just as vital as text for exploring brands and products.

    OpenAI has unveiled this visual upgrade, which pulls images from the web to enrich answers about a variety of topics, such as people, places, and products. It’s a fascinating development that shifts ChatGPT from providing simple text responses to offering a more interactive search experience.

    How it works. With this update, ChatGPT becomes more than just a text generator. It now offers a search experience similar to what I’m used to:

    • Images will appear when they add clarity to the information.
    • These images, sourced from the web, align with the most relevant text.
    • If I’m curious about an image, clicking on it expands it to its original size and shows the source.

    Where it’s live. The rollout of this update is occurring globally, and I’ve noticed it gradually becoming available across all ChatGPT plans that I access:

    • I’ve used it on web, iOS, and Android platforms.
    • It’s important to note that it only works with responses created by GPT 5.1.

    Why we care. I realize that search is evolving to be more multimodal, integrating text, images, videos, and audio. Beyond ensuring that my brand is part of AI-driven replies, it’s crucial to consider how our visuals show up when ChatGPT responds to queries.

    OpenAI’s announcement. For further details, you can check More images in answers


    Inspired by this post on Search Engine Land.


    crushpress.ai community screenshot
  • Unleash Google Ads Demand Gen for Maximum Impact

    Unleash Google Ads Demand Gen for Maximum Impact

    I understand that today’s consumers are constantly bombarded online.

    I mean, I too find myself scrolling YouTube Shorts, tracking TikTok influencers, navigating Gmail promotions, and doubting if that viral Facebook video is real or AI-driven—all before I even have lunch!

    The path from intent to conversion used to be straightforward, but now, in this attention-driven economy, making purchase decisions has become a complex affair.

    Yet, many advertisers haven’t adapted to this reality. They still focus solely on search-based intent, missing out on entire audiences who don’t make it to the search bar.

    Google’s Demand Gen campaigns are my secret weapon here, allowing me to escape this trap by fostering discovery and condensing the sales funnel.

    Success isn’t complicated, but it requires mastering three elements: engaging creative content, strategic audience outreach, and rigorous testing methods.

    The Demand Gen Opportunity

    I see Demand Gen as the perfect blend of Google’s visual placements like YouTube, Gmail, and Discover matched with refined audience targeting and creative optimization.

    Think of it as social advertising uniquely adapted for Google’s ecosystem. These campaigns tap into users’ browsing habits rather than their search activities, making them ideal for raising brand awareness.

    Consumer behavior has undeniably shifted towards visual discovery, demanding more consumer touchpoints before sealing the deal.

    YouTube, after all, is a largely visual platform and is now the second-most-used social media platform with a whopping 2.6 billion users worldwide.

    In this new landscape, the purchase funnel is not only noisier but also more complex.

    Unfortunately, many marketers still treat Demand Gen like search, expecting instant conversions—a mindset that misses the point.

    To me, Demand Gen is about breaking consumption patterns, igniting interest, and nurturing intent over time.

    Marketers who can shift their mindset will see their performance compound, growing stronger with each impression.

    This is my go-to guide for nailing Demand Gen campaigns right from the start.

    Element 1: Creative That Commands Attention

    Thanks to modern tools, creating high-quality assets no longer requires expensive agencies.

    And this matters—a lot. Visual content is a major conversion driver.

    YouTube viewers are twice as likely to purchase something they’ve seen in a video and four times more likely to seek new products on the platform.

    If advertisers don’t master visual storytelling, they’ll miss speaking the language of today’s consumers.

    The Four-Part Framework for Demand Gen Creative

    Crafting successful creative assets doesn’t have to be a guessing game. The best assets adhere to a four-part framework:

    • Grab attention immediately: Capture interest within the first three seconds to stop that scroll.
    • Build brand recognition: Maintain a consistent visual identity across all placements to fortify brand recall.
    • Create emotional resonance: Make the viewer feel something meaningful.
    • Provide clear direction: Guide viewers on what to do after watching.

    Testing Creative Approaches

    I believe testing is pivotal in refining creative content. Experiment with various types like educational, product-focused, and testimonial formats.

    Educational content is great for awareness at the funnel’s top, while testimonials enhance consideration mid-funnel and product-focused creatives encourage conversion at its base.

    Finding what resonates with your audience is key, and optimizing for each unique platform—what works on YouTube may not on Gmail—is crucial.

    Dig deeper: Google’s Demand Gen upgrade: Key changes and success strategies

    Element 2: An Audience Strategy That Matches Intent

    I always think of audience strategy as an extension of creative development. Every audience is unique and should be addressed differently at various funnel stages.

    Before spending a dime, I make sure to identify who my audience is and the actions I want them to take.

    To do this, I start with the classic reporter’s questions:

    • Who is your target audience?
    • What are you trying to convey?
    • Where do they find their information?
    • Why would they care about your message?

    Once audiences are defined, I align messages to their respective stages, aiming to guide them smoothly through the journey.

    My goal is to nudge them to the next step without rushing them into a conversion.

    Get the newsletter search marketers rely on.

    MktoForms2.loadForm(“https://app-sj02.marketo.com”, “727-ZQE-044”, 16298, function(form) { });

    Audience Targeting Recommendations

    After honing in on audience and creative message, it’s time to target them effectively within Demand Gen campaigns.

    To start, I often create custom audiences, as they provide optimal control and granularity.

    These can be built using keywords, URLs, or app usage, focusing on those most likely to take the desired action.

    Happily, lookalike audiences have returned to Demand Gen campaigns, allowing targeting of prospects similar to existing customers.

    Affinity and in-market audiences can also be targeted, enabling outreach to those with both broad interests and those in active consideration phases.

    Campaign Structure Best Practices

    When launching Demand Gen campaigns, I adhere to a few best practices:

    • Start with separate campaigns for remarketing and prospecting as they have distinct goals and targeting options.
    • Allow campaigns to run for 30 days at least before making changes.
    • Consider dedicated campaigns for specific placements like Gmail, Discover, or YouTube.
    • Test shorts-only campaigns, noting the format converts differently on mobile due to instantaneous decisions.

    Consistent messaging and visuals across platforms is a must for building brand recall and reducing touchpoints before purchase.

    Dig deeper: Google pushes Demand Gen deeper into performance marketing

    Element 3: Testing and Optimization

    Having set up my Demand Gen ads, it’s time to delve into testing and optimization.

    Variables abound in these campaigns; hence, I meticulously test one element at a time for clarity and precision.

    This endeavor isn’t about pinpointing one solution but focuses on persistent optimization. Trends change, and what works today may need tweaking in a few months.

    Establishing Testing Parameters

    I typically classify my testing into three main categories:

    • Creative: Discover which creative elements resonate more. This could include content types, hooks, or video styles.
    • Placement: Determine which approaches work where by testing on Gmail, Discover, and YouTube.
    • Audience: Compare performances across differing audiences, such as custom vs. lookalike or remarketing vs. prospecting.

    As I continue testing, performance trends inform future creative, messaging, and placement choices.

    Consistently successful approaches allow scaling through budget increases for particular placements or audiences.

    Set Realistic Time Horizons

    Initial Demand Gen outcomes don’t reflect longer-term impact. Brand awareness takes time to build.

    I advise allowing a 60 to 90-day period for campaigns to stabilize and gain traction.

    Why Demand Gen Campaigns Fail

    Failures in Demand Gen execution are rare. More often, it’s mismeasured and prematurely abandoned campaigns that falter.

    This leads many away from Demand Gen entirely.

    Here’s how I steer clear of prevalent missteps:

    Unrealistic Expectations

    Many start Demand Gen campaigns expecting similar returns to those of direct search campaigns.

    Once those high expectations aren’t met, campaigns get abandoned.

    The remedy is setting realistic expectations from the start.

    Demand Gen builds brands and fills sales funnels, providing compound results if given the room to operate.

    Measurement Myopia

    This often accompanies unrealistic expectations. Relying solely on last-click attribution undervalues Demand Gen’s impact.

    I suggest considering these alternatives:

    • Use platform comparables: A Google Ads metric similar to social ads’ view-through method.
    • Observation mode: Incorporate Demand Gen audiences into search campaigns to track if brand searches rise.
    • Holistic brand metrics: Evaluate if brand growth is happening across channels, indicative of brand awareness.

    If only last-click returns are considered, you undervalue your efforts.

    Unrealistic Timelines

    Don’t halt campaigns within 30 days if results disappoint, and avoid hasty changes.

    I stay committed to a 60 to 90-day evaluation period while managing stakeholder expectations regarding timing.

    Master Discovery to Win the Future

    Attention is at its peak, and the progression of paid media leans towards visuals and discovery.

    Brands sticking to search will face growth challenges.

    Success in this terrain relies on three pillars:

    • Engaging creative.
    • Thoughtful audience targeting.
    • Consistent testing.

    Together, they foster performance and grow brand awareness.

    The competitive edge will favor those mastering discovery today.

    Large budgets aren’t essential for starting. Commitment to principles and patience with results suffice.

    Demand Gen campaigns can embed your brand in your audience’s daily online life.


    Inspired by this post on Search Engine Land.


    crushpress.ai community screenshot