Skip to main content
SEO & Contentapprox. 9 min. readMarch 05, 2026Daniel Müller

Multimodal Search: How to Optimise for Image, Voice and Text Search.

Google Lens, Voice Search, Circle to Search — search is going multimodal. We show you how to optimise your content for all search formats and unlock new traffic channels.

Multimodal Search Optimisation — Image, voice and text search for better SEO visibility

Multimodal search enables searching with different input formats: text, image, voice or combinations thereof. Google Lens processes 15 billion search queries per month. Voice search is used regularly by over 40% of Swiss users. Circle to Search on Android allows circling objects on the screen. These channels are growing — and offer new SEO opportunities.

Image SEO: The Underrated Channel

With the rise of Google Lens and visual search, image SEO is becoming a standalone traffic channel. The most important measures:

Alt texts: Descriptive and contextually relevant. Not: 'image1.jpg'. Instead: 'Modern web design for a Zurich law firm — minimalist layout with a dark colour scheme'.

File names: Keywords in file names: 'webdesign-zurich-law-firm.webp' instead of 'IMG_20260315.jpg'.

Image quality: High-resolution, well-lit, in WebP or AVIF format for optimal file size.

Structured data: Schema.org Product, Recipe or ImageObject markup helps Google understand your images in the right context.

Voice Search Optimisation

Voice search differs fundamentally from text search: voice queries are longer (29 words vs. 3–4), more natural-sounding and formulated as questions. Optimisation: FAQ sections with natural questions and direct answers. Target long-tail keywords ('How much does a website cost in Zurich?' instead of 'website costs Zurich'). Implement Speakable schema. Maintain Google Business Profile (50%+ of voice searches have local intent).

Circle to Search on Android devices enables: circling an object on the screen and immediately receiving information. This is particularly relevant for e-commerce businesses: users can search for products from screenshots, social media posts or photos. Optimisation: product photos on the website with a clear background, Google Merchant Center with up-to-date product data and structured product data (Schema.org Product).

The Multimodal Search Checklist

Implement immediately: 1) All images have descriptive alt texts. 2) Images in WebP/AVIF with optimised file names. 3) FAQ sections with natural-language questions. 4) Google Business Profile complete (for local voice and visual searches). Medium-term: 5) Implement Speakable schema. 6) Video content with transcripts and subtitles. 7) Product data in Google Merchant Center. Long-term: 8) Diversify content formats (text + image + video) for maximum search surface. DLM Digital helps with implementing a comprehensive multimodal search strategy.

DM

Daniel Müller

Senior Developer at DLM Digital – 10+ years of experience in web development, SEO and digital strategy for Swiss SMEs

Tags:Multimodal SearchVoice SearchGoogle LensImage SEOSEO
Share article:

Frequently Asked Questions: Multimodal Search: How to Optimise for Image, Voice and Text Search

Increasingly important. Google Lens processes 15 billion search queries/month — trending upward. Image SEO is no longer 'nice to have' but a standalone traffic channel. Fundamentals: descriptive alt texts, optimised file names, WebP/AVIF format, and Schema.org markup. For e-commerce businesses, visual search is particularly relevant.

Yes, gradually. The good news: most voice search optimisations also improve your classic SEO. FAQ sections, clear question-and-answer structures, long-tail keywords and fast loading times help with both text and voice search simultaneously. Specifically for voice: implement Speakable schema and optimise Google Business Profile for local voice searches.

Google Lens is Google's visual search: point your smartphone camera at an object and Google recognises it — products, plants, animals, text, buildings, QR codes. Relevant for businesses: product photos must be high-quality and distinctive. Google Merchant Center data must be up to date. Alt texts and file names must be descriptive. Tip: test whether Google Lens recognises your products.

More organic traffic for your business?

We bring your website to the top of search results — with data-driven SEO strategy.

Request SEO Consultation