Definition
Content uniqueness is how distinct a given page's body is from every other page on the site. It is measured via n-gram overlap, cosine similarity of embeddings, or a simple string-diff percentage.
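As a rough illustration of the n-gram option, the sketch below scores two page bodies by the share of word n-grams they do not have in common. The choice of word trigrams, whitespace tokenization, and Jaccard-style overlap are assumptions made for the example, and the names word_ngrams and ngram_difference are hypothetical.

```python
def word_ngrams(text: str, n: int = 3) -> set[tuple[str, ...]]:
    """Return the set of word n-grams in text (lowercased, split on whitespace)."""
    words = text.lower().split()
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def ngram_difference(a: str, b: str, n: int = 3) -> float:
    """Return 1 minus the Jaccard overlap of the two n-gram sets.

    0.0 means the n-gram sets are identical, 1.0 means they share nothing.
    Assumption: two bodies too short to yield any n-gram count as identical.
    """
    grams_a, grams_b = word_ngrams(a, n), word_ngrams(b, n)
    union = grams_a | grams_b
    if not union:
        return 0.0
    return 1.0 - len(grams_a & grams_b) / len(union)


print(ngram_difference("the quick brown fox jumps",
                       "the quick brown fox sleeps"))  # 0.5
```

The two sample sentences share two of four distinct trigrams, so the score is 0.5. Embedding similarity would catch paraphrased duplicates that this kind of surface comparison misses, at the cost of needing a model.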
Why it matters
Search engines and LLM crawlers treat near-duplicate pages as redundant. In pSEO at scale, uniqueness is the difference between 500 indexed pages and 500 deindexed pages.
How we think about it
We require at least a 40% n-gram difference between any two pages in a generated set, and we enforce it in CI. If a set fails, either the generator rewrites the pages or the axis was wrong. What we don't do is ship the duplicates and move on.
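A minimal sketch of what such a CI gate could look like, assuming pages arrive as a mapping from slug to body text and that difference is computed as 1 minus Jaccard overlap of word trigrams. The 0.40 threshold comes from the rule above; the n-gram size, the overlap formula, and the names shingles and failing_pairs are assumptions for illustration, not the actual pipeline.

```python
from itertools import combinations

THRESHOLD = 0.40  # minimum n-gram difference required between any two pages


def shingles(text: str, n: int = 3) -> set[tuple[str, ...]]:
    """Word n-grams of a page body (lowercased, split on whitespace)."""
    words = text.lower().split()
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def failing_pairs(pages: dict[str, str],
                  threshold: float = THRESHOLD,
                  n: int = 3) -> list[tuple[str, str, float]]:
    """Return (slug_a, slug_b, difference) for every pair below the threshold."""
    shingled = {slug: shingles(body, n) for slug, body in pages.items()}
    failures = []
    for (slug_a, grams_a), (slug_b, grams_b) in combinations(shingled.items(), 2):
        union = grams_a | grams_b
        # Assumption: pages too short to produce any n-gram count as duplicates.
        diff = 1.0 - len(grams_a & grams_b) / len(union) if union else 0.0
        if diff < threshold:
            failures.append((slug_a, slug_b, diff))
    return failures
```

A CI step could call failing_pairs on the generated set and exit non-zero when the list is non-empty, printing the offending slugs so it is clear which pages need a rewrite. The pairwise loop is quadratic in the number of pages, so a very large set would likely need a cheaper candidate filter first.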