What is Passage embedding?

Question

Accepted Answer

A passage embedding is the engine's vector representation of a single chunk inside a page, not of the page as a whole. As of mid-2026, every major AI engine stores one embedding per chunk of roughly 600–900 characters in its retrieval substrate. Queries are matched against chunk embeddings rather than page embeddings, and the highest-scoring chunk wins the citation. The shift from page-level to passage-level embedding is the structural mechanism behind every other passage-level dynamic: chunk size targeting, heading boundary discipline, self-anchoring opening sentences, and the citation URL's text-fragment anchor all follow from the engine indexing one vector per chunk. Brands that write for page-level embedding (the 2022 mental model) underperform brands that write for passage embedding (the mid-2026 reality) on citation share, even at equivalent content quality.
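The mechanism described above can be illustrated with a minimal sketch. It rests on several assumptions: engines do not publish their chunkers or embedding models, so `chunk_page`, `embed`, `best_chunk`, and `text_fragment_url` are hypothetical names, the sentence-packing chunker is only one plausible strategy, and the bag-of-words `embed` is a toy stand-in for a learned embedding model. What it does show is the shape of the pipeline: one vector per chunk, a query scored against every chunk, and a text-fragment anchor built from the winning chunk's opening words.

```python
import math
import re
from collections import Counter
from urllib.parse import quote

MAX_CHARS = 900  # upper end of the ~600-900 character range cited in the text


def chunk_page(text, max_chars=MAX_CHARS):
    """Greedily pack whole sentences into chunks of about max_chars.

    A single sentence longer than max_chars is kept whole (assumption).
    """
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


def embed(text):
    """Toy stand-in for a real embedding model: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def best_chunk(query, chunks):
    """Score the query against every chunk vector; return (index, score) of the winner."""
    q = embed(query)
    scores = [cosine(q, embed(c)) for c in chunks]
    best = max(range(len(chunks)), key=scores.__getitem__)
    return best, scores[best]


def text_fragment_url(page_url, chunk, words=6):
    """Build a text-fragment URL (#:~:text=...) from the chunk's opening words."""
    start = " ".join(chunk.split()[:words])
    # '-' is syntax inside a text directive, so it is percent-encoded as well.
    encoded = quote(start, safe="").replace("-", "%2D")
    return f"{page_url}#:~:text={encoded}"
```

Because the anchor is built from the chunk's first few words, a chunk that opens with a self-contained sentence produces a fragment that still makes sense when a reader lands on it, which is one way to read the "self-anchoring opening sentences" point above.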
