Skip to main content
Skip to main content
Michael Pilgram, Founder & Digital Strategist
Michael Pilgram
Founder & Digital Strategist
17 min read

Modern SEO: Optimising for AI Search Engines in 2025

ChatGPT, Perplexity, and Google AI Overviews changed how users find content. Learn to optimise for AI search without sacrificing traditional SEO.

seoai-searchstructured-datacontent-strategy

Search has changed dramatically in the past few years. In 2020, Google held over 92% of the market and we all optimised for one engine. Today, ChatGPT serves 800 million weekly active users, Perplexity has become the research tool of choice for millions, and Google's market share has dropped below 90% for the first time since 2015.

AI search traffic is up 527% year-over-year. Most websites now see between 0.5-3% of their total traffic coming from AI search engines. But AI search engines don't crawl websites the way traditional search engines do.

You can have a site perfectly optimised for Google that's completely invisible to ChatGPT or Perplexity. The difference? Structured data, semantic markup, and content architecture specifically designed for AI extraction.

How AI Search Changed Everything

When Brighton SEO analysed over 41 million AI search results earlier this year, they uncovered something fascinating about how different AI platforms select and cite sources.

Each Platform Has Its Preferences

ChatGPT heavily favours established sources with comprehensive structured data. We're talking about Wikipedia (1.3M citations), G2 (196K citations), Forbes (181K citations), and Amazon (133K citations).

ChatGPT citations often come from URLs ranking beyond position 21+ on Google. Domain-level authority matters more than specific page rankings for AI visibility.

Perplexity takes a completely different approach, gravitating towards user-generated and social content. Reddit leads with 3.2M citations, followed by YouTube (906K citations) and LinkedIn (553K citations).

This means your optimisation strategy needs to consider which platforms your audience actually uses. B2B content performs very differently to consumer content across these AI engines.

The Surprising Content Format Winner

Comparative listicles dominate AI citations, accounting for nearly a third of all citations. For years, conventional SEO wisdom favoured long-form, comprehensive content. Whilst that's still true for traditional SEO, AI systems substantially prefer well-structured comparative content over comprehensive deep-dives.

Structure and clarity beat length and depth for AI citations.

What AI Search Engines Actually Want

Traditional SEO was about keywords, meta descriptions, and backlink profiles. AI search engines want something different entirely—they need to extract, interpret, and cite your content with confidence.

1. Structured Data Isn't Optional Anymore

When Microsoft's Principal Product Manager publicly stated earlier this year that "Schema Markup helps Microsoft's LLMs understand content," it confirmed what we'd been seeing in practice.

Only 12.4% of websites have implemented Schema.org structured data properly. If you get this right now, you're ahead of nearly 90% of your competition.

The numbers:

  • AI models using knowledge graphs achieve 300% higher accuracy compared to those relying on unstructured data
  • Pages with properly implemented schema markup see approximately 30% higher click-through rates
  • Content lacking structured data often gets skipped entirely, even when it ranks well organically

2. Semantic HTML Provides the Context AI Needs

Semantic HTML5 elements work as a translation layer between your content and AI crawlers. Without them, AI systems struggle to understand your content hierarchy:

<article>
  <header>
    <h1>Complete Guide to AI Search Optimisation</h1>
    <div class="author-info">
      <p>By <a href="/authors/michael-pilgram">Michael Pilgram</a></p>
      <time datetime="2025-10-23">October 23, 2025</time>
    </div>
  </header>

  <section>
    <h2>Understanding AI Search Engines</h2>
    <p>AI search engines extract and synthesise information...</p>

    <h3>ChatGPT Search Mechanics</h3>
    <p>ChatGPT uses web search to access current information...</p>
  </section>
</article>

Use proper semantic elements—<article>, <section>, <header>, <footer>—and maintain a logical heading hierarchy (H1→H2→H3). Avoid "div soup" where everything's wrapped in generic containers that obscure meaning.

3. Be Direct and Specific

AI extraction works well with clear, specific answers. Vague marketing speak? Not so much.

❌ This doesn't help AI systems:

Our platform helps businesses improve efficiency through innovative solutions.

✓ This does:

Our platform reduces manual data entry time by 75% through automated form processing and intelligent data extraction, processing 10,000 forms per hour with 99.2% accuracy.

Always quantify your benefits. Answer "what," "how," and "why" explicitly. AI models excel at extracting and citing concrete information—they struggle with ambiguity.

Implementing Schema Markup Properly

Schema markup has evolved beyond rich snippets. It's now fundamental to AI visibility. Here's how we implement it on our projects.

Article Schema: Your Foundation

Every blog post and article needs comprehensive Article schema with all required fields. Here's what we use on this very blog:

{
  "@context": "https://schema.org",
  "@type": "Article",
  "headline": "Modern SEO: Optimising for AI Search Engines in 2025",
  "description": "Complete guide to optimising content for ChatGPT, Perplexity, and Google AI Overviews",
  "author": {
    "@type": "Person",
    "name": "Michael Pilgram",
    "url": "https://www.numentechnology.co.uk/authors/michael-pilgram",
    "jobTitle": "Technical Director",
    "sameAs": [
      "https://www.linkedin.com/in/michaelpilgram",
      "https://github.com/michaelpilgram"
    ]
  },
  "publisher": {
    "@type": "Organization",
    "name": "Numen Technology",
    "url": "https://www.numentechnology.co.uk",
    "logo": {
      "@type": "ImageObject",
      "url": "https://www.numentechnology.co.uk/logo.png"
    }
  },
  "datePublished": "2025-10-08",
  "dateModified": "2025-10-23",
  "image": "https://www.numentechnology.co.uk/blog/ai-search-seo.jpg"
}

The critical fields you can't skip:

  • datePublished and dateModified signal freshness to AI systems
  • author with sameAs links establishes credibility
  • publisher information builds domain authority
  • image provides visual content for social sharing and AI understanding

FAQ Schema: The Highest ROI Implementation

In our experience working with clients, FAQ schema consistently delivers the best return on investment. AI systems absolutely love the direct question-answer format. Here's a working example:

{
  "@context": "https://schema.org",
  "@type": "FAQPage",
  "mainEntity": [
    {
      "@type": "Question",
      "name": "How does structured data improve AI search visibility?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Structured data helps AI systems understand your content's purpose, context, and relationships. AI models using knowledge graphs achieve 300% higher accuracy than those working with unstructured data. Pages with proper schema markup see approximately 30% higher click-through rates and are more likely to be cited in AI-generated responses."
      }
    },
    {
      "@type": "Question",
      "name": "What percentage of websites use Schema.org markup?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Only 12.4% of registered web domains have implemented Schema.org structured data as of 2025. This represents a massive opportunity for businesses ready to dominate AI-driven search results before the majority catch on."
      }
    },
    {
      "@type": "Question",
      "name": "How long does it take to see results from AI search optimisation?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Quick wins like fixing structured data errors and adding FAQ schema typically show results in 2-6 weeks with 50-100% traffic increases. Comprehensive implementation usually delivers 100-300% traffic increases within 2-4 months. Long-term strategies over 6 months can achieve 300-2000%+ increases."
      }
    }
  ]
}

Best practices for implementation:

  • Keep answers under 2-3 paragraphs for optimal AI extraction
  • Use conversational, human-friendly tone that reads naturally
  • Focus on genuine questions your customers actually ask
  • Include 5-15 FAQs per topic page for comprehensive coverage

According to Search Engine Land's research, FAQ schema remains actively supported by Google and proves particularly powerful for AI platform optimisation, even though traditional rich results are now restricted to authoritative sites.

Organisation Schema: Building Domain Trust

Your organisation schema builds domain-level trust with AI systems. Think of it as your digital business card for AI:

{
  "@context": "https://schema.org",
  "@type": "Organization",
  "name": "Numen Technology",
  "url": "https://www.numentechnology.co.uk",
  "logo": "https://www.numentechnology.co.uk/logo.png",
  "description": "Expert software development and AI search optimisation services",
  "foundingDate": "2020",
  "sameAs": [
    "https://www.linkedin.com/company/numentechnology",
    "https://github.com/numentechnology"
  ],
  "contactPoint": {
    "@type": "ContactPoint",
    "contactType": "Customer Service",
    "email": "info@numentechnology.co.uk",
    "availableLanguage": ["English"]
  }
}

Validation Isn't Negotiable

Don't just add schema and hope for the best. Validate it until you achieve zero errors:

  1. Google's Rich Results Test shows how Google parses your markup
  2. Schema.org Validator ensures compliance with standards
  3. Google Search Console monitors site-wide structured data issues at scale

Malformed or incomplete schema is worse than no schema at all—AI crawlers may skip your content entirely if they can't parse your markup properly.

Author Credibility: Why E-E-A-T Matters More Than Ever

Google now explicitly prioritises content linked to verifiable human authors with proven credentials. This extends to all AI search platforms, not just Google.

Earlier this year, the Columbia Journalism Review published research showing that AI-powered search tools frequently provide incorrect answers with "alarming confidence." This finding heightened the importance of credibility signals across the industry.

Person Schema with Real Credentials

Every author on your site needs comprehensive Person schema. Here's the structure we use:

{
  "@context": "https://schema.org",
  "@type": "Person",
  "name": "Michael Pilgram",
  "jobTitle": "Technical Director",
  "url": "https://www.numentechnology.co.uk/authors/michael-pilgram",
  "image": "https://www.numentechnology.co.uk/authors/michael-pilgram.webp",
  "description": "Michael is a software architect with 15+ years of experience in SEO and web development. He specialises in AI search optimisation and has helped over 30 clients improve their AI search visibility.",
  "sameAs": [
    "https://www.linkedin.com/in/michaelpilgram",
    "https://github.com/michaelpilgram",
    "https://twitter.com/michaelpilgram"
  ],
  "alumniOf": {
    "@type": "EducationalOrganization",
    "name": "University of Technology"
  },
  "worksFor": {
    "@type": "Organization",
    "name": "Numen Technology"
  }
}

The sameAs array is absolutely critical here—it provides external validation of the author's identity and expertise through LinkedIn profiles, GitHub contributions, and other verifiable sources.

What Makes a Proper Author Bio

Don't do this:

  • ❌ Generic "Marketing Team" attribution
  • ❌ One-sentence bios with no substance
  • ❌ Missing credentials or background information
  • ❌ No profile photos

Do this instead:

  • ✓ Real author names with actual credentials
  • ✓ Detailed author bio pages (150-300 words minimum)
  • ✓ Professional headshots
  • ✓ Links to social profiles and published works
  • ✓ Clear areas of expertise
  • ✓ Short bio (2-3 sentences) at article bottom linking to full bio

Research shows that anonymous content gets deprioritised by AI systems. Content with clear expertise attached is 3x more likely to be cited.

Measuring Your AI Search Success

Traditional analytics don't automatically capture AI referral traffic. Here's how to track it properly:

Setting Up GA4 Custom Channel Groups

Create a dedicated "AI Traffic" channel group in Google Analytics 4:

Step-by-step setup:

  1. Navigate to Admin > Data display > Channel groups
  2. Create new channel group called "AI Traffic"
  3. Add rule: Medium equals referral AND Source matches regex
  4. Use this regex pattern:
(.*gpt.*|.*chatgpt.*|.*openai.*|.*perplexity.*|.*gemini.*google.*|.*copilot.*|.*claude.*|.*you\.com.*)
  1. Move this rule ABOVE "Referral" in your channel group order

This configuration captures traffic from:

  • ChatGPT (chatgpt.com, chat.openai.com)
  • Perplexity (perplexity.ai)
  • Google Gemini (gemini.google.com)
  • Microsoft Copilot (copilot.microsoft.com)
  • Claude (claude.ai)
  • You.com

Important caveat: Many AI clicks show up as "direct" traffic because AI platforms don't always pass referrer information. Your actual AI traffic is likely 1.5-2x what analytics show.

The Key Metrics Worth Monitoring

  1. AI referral traffic volume - Total sessions from AI sources
  2. Engagement rate - AI referrals often show higher intent with 2-3x conversion rates
  3. Platform distribution - Which AI engines actually drive your traffic
  4. Top landing pages - What content gets cited most frequently
  5. Citation frequency - Manual testing or specialised tools like Profound or BrightEdge

Semrush's 2025 AI traffic study found that companies now see 0.5% to 3% of total website traffic from AI systems, with ChatGPT consistently driving 60% of AI referrals and Perplexity accounting for 25%.

Real Results from Case Studies

Industry case studies show actual numbers from AI search optimisation:

B2B Industrial Products: 2,300% Traffic Increase

A B2B industrial products manufacturer went from zero AI search visibility to dominant AI citations within six months.

What they implemented:

  • Comprehensive Schema.org markup across their entire product catalogue
  • FAQ schema on all product pages (10-12 FAQs per product)
  • Proper author credibility signals on all content
  • Structured product specifications

The results:

  • 2,300% jump in traffic from AI platforms
  • Became the primary source cited by ChatGPT for their industry queries
  • 15% of their total website traffic now comes from AI search

Source: The Search Initiative case study

E-Commerce: 753% LLM Traffic Surge

PlushBeds, a mattress e-commerce company, implemented comprehensive product schema and FAQ markup across their site.

Results within 5 months:

  • 753% surge in LLM traffic
  • 950% lift in AI Overview visibility
  • Significant increase in organic conversions from AI referrals

Source: ResultFirst AI SEO case studies

Common Pattern: High Google Score, Zero AI Traffic

A common pattern emerges: sites ranking brilliantly on Google (90+ SEO scores) receiving zero traffic from AI search engines.

Typical issues found:

  • Malformed Schema.org markup that AI can't parse
  • FAQ sections present but not properly marked up with schema
  • Article schema missing critical fields like datePublished and author
  • No clear content hierarchy for AI to understand

After implementing fixes:

  • 100-340% increases in AI search traffic within weeks
  • Featured prominently in ChatGPT responses for industry queries
  • 15-25% of demo requests originating from AI referrals

What Success Actually Looks Like

Across multiple industries, properly implemented AI search optimisation delivers:

  • Traffic increases: 100% to 2,300% range
  • Conversion rate improvements: 2-3x higher than organic search
  • Timeline: 2-6 weeks for initial gains, compounding over time
  • ROI: Most implementations costing £4K-£20K deliver 150-400% traffic increases

The common thread in every successful implementation: comprehensive, error-free structured data combined with semantic content structure and genuine author credibility.

Server-Side Rendering: The Technical Foundation

Google's AI crawler handles JavaScript reasonably well, but non-Google AI tools often struggle with client-side rendering.

Why SSR Matters for AI Visibility

Server-side rendering ensures universal compatibility across all AI platforms:

  • Complete HTML delivered to all crawlers immediately
  • No rendering delays or timeout issues to worry about
  • Structured data guaranteed to be present in initial response
  • Works with all AI platforms, not just Google's

This proves critical for:

  • ChatGPT web crawling
  • Perplexity indexing
  • Third-party AI search engines
  • Emerging AI platforms we haven't seen yet

Whilst Search Engine Journal reports that Googlebot handles JavaScript reasonably well, SSR offers the most dependable results for AI search visibility across all platforms.

What Should Be Server-Side Rendered

Always server-side render these elements:

  • Main article content
  • Headings and content structure
  • Schema markup (JSON-LD in <head>)
  • Author information
  • Publication dates
  • FAQ sections

Can safely be client-side:

  • Interactive features (shopping cart, forms)
  • Dynamic updates (real-time data)
  • User-specific content

Frameworks like Next.js (which we use for this site) make hybrid approaches straightforward—SSR for SEO-critical content, client-side rendering for interactivity.

Common Mistakes That Kill AI Visibility

1. Partial Schema Implementation

Half-implemented schema is worse than no schema at all. AI systems need comprehensive, error-free markup to work with.

Wrong approach: Add Article schema to blog posts only Right approach: Article schema + FAQ schema + Person schema + Organisation schema site-wide

2. JavaScript-Only Content

Critical content that only exists after JavaScript executes may never be seen by AI crawlers.

Test your content like this:

curl -A "Mozilla/5.0" https://yoursite.com | grep "@type"

This shows what exists in the initial HTML before JavaScript runs. Your schema should appear here.

3. Generic Marketing Speak

❌ Avoid these vague statements:

  • "We provide innovative solutions"
  • "Industry-leading platform"
  • "Best-in-class service"

✓ Use specific, quantifiable statements:

  • "Reduce customer support costs by 40% through AI-powered chatbots"
  • "Process 10,000 forms per hour with 99.2% accuracy"
  • "Deploy in 2-4 weeks with zero downtime"

Specificity, quantification, and concrete benefits win every time.

4. Missing Author Credentials

Anonymous or poorly attributed content gets deprioritised consistently:

❌ Don't do this:

  • By "Admin"
  • By "Marketing Team"
  • No author information at all

✓ Do this:

  • By "Michael Pilgram, Technical Director"
  • Detailed bio with actual credentials
  • Person schema with sameAs links to verify identity

5. Ignoring Mobile

Mobile-first indexing means AI crawlers see your mobile version first:

  • Same content on mobile and desktop (no hiding content)
  • No hidden content in collapsed sections unless it's in the HTML
  • Identical schema markup across devices
  • Fast mobile performance is essential

Traditional SEO Still Matters

AI search optimisation complements traditional SEO—it doesn't replace it.

Still absolutely essential:

  • Page speed - Core Web Vitals impact rankings and crawl efficiency
  • Mobile responsiveness - Mobile-first indexing applies to AI crawlers too
  • Clean URLs - Descriptive, hierarchical URL structure
  • Sitemaps - Help crawlers discover your content
  • Technical SEO - Fix 404s, redirect chains, crawl errors
  • Backlinks - Domain authority signals trust to AI systems

Lumar's 2025 AI search report found that sites with strong traditional SEO foundations consistently see better AI search performance. The two strategies reinforce each other rather than competing.

Your Implementation Roadmap

Ready to optimise for AI search? Here's the phased approach we use with clients:

Phase 1: Foundation (Week 1-2)

  • Audit existing structured data for errors
  • Fix all validation errors (target: zero errors)
  • Implement Organisation schema
  • Add Person schema for all authors
  • Ensure server-side rendering for critical content

Phase 2: Content Schema (Week 3-4)

  • Add Article/BlogPosting schema to all content
  • Implement FAQ schema on relevant pages
  • Include datePublished and dateModified on all articles
  • Add author bylines with links to bio pages
  • Create comprehensive author bio pages

Phase 3: Semantic Structure (Week 5-6)

  • Audit heading hierarchy (ensure logical H1→H2→H3 flow)
  • Use semantic HTML5 elements (<article>, <section>, etc.)
  • Rewrite vague content to be specific and quantified
  • Add clear topic sentences to paragraphs
  • Structure content with comparative lists where appropriate

Phase 4: Measurement (Week 7-8)

  • Set up GA4 Custom Channel Group for AI traffic
  • Create AI traffic dashboard
  • Establish baseline metrics
  • Manual test: Query your topics in ChatGPT, Perplexity, Gemini
  • Document initial citation appearances

Phase 5: Optimisation (Ongoing)

  • Monthly: Check Search Console for structured data errors
  • Monthly: Manual testing of key queries across AI platforms
  • Quarterly: Update content and dateModified fields
  • Quarterly: Expand FAQ sections based on customer questions
  • Ongoing: Monitor AI traffic trends and adjust strategy

The Window of Opportunity

With only 12.4% of websites using structured data properly, early movers gain substantial advantages:

  1. First-mover citations - AI systems remember and favour established sources
  2. Compounding visibility - Citations lead to more backlinks and authority
  3. Lower competition - Most competitors haven't optimised yet
  4. Higher conversion rates - AI referrals show 2-3x better conversion
  5. Future-proofing - AI search will only continue growing

The window won't stay open indefinitely. Brighton SEO's analysis of 41 million results showed AI search traffic increased 527% year-over-year with no signs of slowing down.

What You Should Remember

  1. AI search is happening now, not later - 800M ChatGPT users and 527% YoY growth proves it
  2. Structured data is mandatory - Only 12.4% adoption means massive opportunity
  3. Real results are dramatic - 100-2,300% traffic increases are documented and verified
  4. FAQ schema delivers highest ROI - Consistently effective across all industries
  5. Author credibility significantly matters - 3x more likely to be cited with proper attribution
  6. Traditional SEO still complements AI SEO - Do both, not one or the other
  7. Server-side rendering ensures compatibility - Critical for non-Google AI platforms
  8. Validation must achieve zero errors - Malformed schema worse than no schema
  9. Specificity always beats vagueness - Quantify benefits and be concrete
  10. The opportunity gap won't last - Act now before everyone catches on

Getting Started

The shift to AI-powered search represents the most significant change in search since Google's rise. Businesses that adapt now will dominate AI citations and recommendations. Those that delay risk invisibility in the fastest-growing traffic channel.

Start with these fundamentals:

  1. Fix existing structured data errors
  2. Implement comprehensive FAQ schema
  3. Add proper author attribution
  4. Validate everything to zero errors
  5. Track AI traffic in GA4

Then expand to comprehensive schema coverage, semantic content optimisation, and ongoing measurement.


Ready to optimise for AI search? Contact us for an AI search readiness audit. We'll analyse your current structured data implementation and provide a roadmap for AI search visibility.

Share this article

Related Articles