<![CDATA[

XML sitemaps are roadmaps for search engines. They list the URLs on your site that you want indexed, along with metadata about each URL — last modification date, change frequency, and priority. A well-structured sitemap accelerates indexation and ensures important pages aren’t missed by crawlers.

XML Sitemap Basics

A sitemap is an XML file listing URLs with optional metadata:

<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url>
    <loc>https://example.com/page/</loc>
    <lastmod>2026-03-01</lastmod>
    <changefreq>weekly</changefreq>
    <priority>0.8</priority>
  </url>
</urlset>

Sitemap Best Practices

Include Only Indexable Pages

Only list URLs you want indexed — pages that return 200 status codes, are not blocked by robots.txt, and don’t have noindex tags. Listing non-indexable URLs wastes crawl budget and sends conflicting signals.

For more on this topic, see our guide on url structure seo best practices.

Keep Sitemaps Under 50,000 URLs

Each sitemap file is limited to 50,000 URLs or 50MB uncompressed. For larger sites, use a sitemap index file that references multiple sitemap files.

Segment by Content Type

Split sitemaps by content type for better monitoring and control:

  • sitemap-pages.xml — Core pages
  • sitemap-blog.xml — Blog posts and articles
  • sitemap-products.xml — Product pages
  • sitemap-images.xml — Image-specific sitemap
  • sitemap-videos.xml — Video content

Use Accurate lastmod Dates

Only update the lastmod date when the page content actually changes. Falsely updating lastmod to trigger re-crawling can erode Googlebot’s trust in your sitemap data.

Submit in Search Console

After creating your sitemap, submit it in Google Search Console (Sitemaps section). Monitor the submitted vs. indexed URL count — a large gap indicates indexation problems worth investigating.

Dynamic Sitemaps

For sites that publish frequently, generate sitemaps dynamically. Most CMS platforms and frameworks (WordPress, Next.js) support automatic sitemap generation that updates as content is published or modified.

Sitemap Troubleshooting

  • Submitted but not indexed: Check for crawl blocks, noindex tags, thin content, or quality issues
  • Sitemap errors in Search Console: Validate XML syntax, check for non-200 URLs, verify encoding
  • New content not appearing: Verify the sitemap auto-updates and lastmod reflects the publish date

A sitemap doesn’t guarantee indexation — it guarantees discovery. Google still evaluates each URL for quality before indexing it. But without a sitemap, important pages may take weeks or months longer to be discovered and indexed.

]]>