Last updated: January 30, 2026
2.26 min read

Google indexing explained: why pages are stored, ignored, or forgotten

Key takeaways

  • A practical model of Google’s indexing decision (discovery → crawl → dedupe/canonical → store → refresh), plus the core entry pages that explain why URLs fail at the storage layer

If your pages don’t appear in search, most people jump straight to “rankings”.

Most of the time it’s not.

It’s indexing: Google discovered the URL, maybe even crawled it, and then decided it’s not worth keeping (yet), or that another URL should represent the same content.

This page is the pillar for the indexing cluster on this site: a simple model + the main “entry points” that Google can classify.

Start with the anchors (single-intent entry pages)

If your problem is not storage but visibility after storage, switch to the neighboring hub:

If you want the status map layer:

Google doesn’t “index everything” — it evaluates

Google can fetch far more URLs than it wants to keep. So it evaluates:

  • Cost: how expensive is it to crawl, render, dedupe, and refresh this URL?
  • Value: does this URL add something the index doesn’t already have?
  • Risk: is this site predictable and trustworthy enough to keep indexing deeply?

That’s why “request indexing” is not a magic button: it can speed up a fetch, but it doesn’t change the value/risk model.

The indexing decision (five gates)

Think of Google’s index as a curated store, not a backup drive.

For each URL, Google roughly asks:

  1. Can I fetch it reliably? (status codes, redirects, robots)
  2. Can I render/parse it? (HTML, JS, blocked resources)
  3. Is it a duplicate of something I already have? (canonicalization + near-duplicates)
  4. If it’s not a duplicate, is it worth keeping? (priority: site trust + internal hierarchy + incremental value)
  5. If I keep it, what is the canonical representative URL? (Google-selected canonical)

Most SEO advice starts at (4) “rankings”. Your real bottleneck is often (1)–(3).

Crawled ≠ Indexed: why this happens

“Crawled” means: Google fetched and processed the URL.

“Indexed” means: Google decided it’s worth storing and serving for queries.

That gap is where most modern SEO work lives.

Deep dives:


System context

Next step

If you want the fastest “entry page” for search intent, start with: