Content
Content audit: methodology and step-by-step process

A content audit is a systematic inventory and evaluation of every page on a site. Without one, you invest in new content while existing pages quietly lose rankings. We walk through the full process: from crawling to the Keep / Update / Consolidate / Delete decision matrix.
Most websites accumulate dozens or hundreds of pages over time that generate no traffic, earn no links, and drive no conversions. They dilute crawl budget, create keyword cannibalism, and drag down overall domain authority. A content audit is the tool that makes all of this visible — and gives you an action plan.
What is a content audit
A content audit is a structured process of inventorying, evaluating, and categorising every piece of content on a website. The goal is to determine what is working, what needs improvement, what should be merged, and what should be removed.
A content audit answers three questions: what content exists on the site, how well it serves business goals, and which specific actions will maximise the return on the existing content inventory.
Pages with zero traffic
Median for sites older than 3 years, per Ahrefs data
Organic growth
After a full audit and implementation of findings
Crawling
Removing dead weight helps Googlebot reach priority pages sooner
Full audit
Minimum recommended frequency for active sites
Types of content audit
Not all content audits are the same. The type is determined by the problem you are trying to solve.
SEO audit
Focuses on rankings, clicks, impressions, and crawlability. Identifies pages with falling traffic, cannibalism, and technical barriers.
UX audit
Analyses engagement: scroll depth, time on page, bounce rate. Surfaces pages users abandon quickly.
Quality audit
Manual evaluation of accuracy, completeness, freshness, and E-E-A-T alignment. Essential for medical, financial, and expert-heavy niches.
Conversion audit
Looks at each page's contribution to conversions: assisted conversions in GA4, user paths to the target action.
Step 1: inventory — a complete URL list
Before you can evaluate content, you need to know what exists. A crawler shows you what Googlebot sees — a more honest picture than an XML sitemap alone.
Crawling the site
- Run Screaming Frog (or Sitebulb) in Spider mode. The free version handles up to 500 URLs.
- Exclude technical URLs: /cart, /checkout, /account, /search?q=*, UTM variants.
- Also export the URL list from your XML sitemap and from GSC (Coverage report — all indexed pages).
- Merge both lists and remove duplicates. This is your complete inventory.
Inventory spreadsheet columns
| Field | Source | Purpose |
|---|---|---|
| URL | Crawler | Unique page identifier |
| Title / H1 | Crawler | Quick topic identification |
| Page type | Manual / segmentation | Blog, category, landing page, utility |
| Depth | Crawler | Crawl priority and architectural weight |
| Canonical | Crawler | Is there a self-referential canonical or does it point elsewhere? |
| Publication date | CMS / GSC | Content freshness signal |
| Indexable? | Crawler / GSC | indexable / non-indexable status |
Step 2: data collection — what the numbers say
The inventory is ready. Now you need to pull metrics for each URL from three sources: Google Analytics 4, Google Search Console, and a backlink tool.
Google Search Console
- Impressions (last 12 months) — shows how well Google understands the page
- Clicks (organic) — real traffic from search
- Average Position — average rank across all queries for the page
- CTR — an indirect signal of title and snippet quality
Google Analytics 4
- Sessions / Users from the organic channel (Medium = organic)
- Engagement Rate (≥ 60% is considered healthy)
- Average Engagement Time — how long users actually engage with the content
- Conversions — assisted conversions (if goal events are configured)
Backlinks
Pages with quality backlinks must never be deleted without a redirect — you will permanently lose link equity. From Ahrefs or Semrush, export: number of referring domains, DR/UR signal, and top anchors.
Step 3: evaluation criteria
Evaluate each page across six dimensions. Score each from 1 to 3 — the total gives you a priority score for the next step.
| Criterion | Low (1) | Medium (2) | High (3) |
|---|---|---|---|
| Organic traffic | < 50 sessions/mo | 50–500 sessions/mo | > 500 sessions/mo |
| Search position | 51+ | 11–50 | 1–10 |
| Engagement | < 40% engagement rate | 40–65% | > 65% |
| Link equity | 0 referring domains | 1–5 | > 5 |
| Conversions | 0 | Assisted | Direct |
| Freshness | Not updated > 2 years | 1–2 years | < 12 months |
Pages score between 6 (minimum across all criteria) and 18 (maximum). Score ≥ 14: keep without question. Score ≤ 8: candidate for consolidation or deletion. Range 9–13: requires manual judgement.
Step 4: keep / update / consolidate / delete decision matrix
The KUCD matrix is the primary output of a content audit. Every page lands in one of four categories.
The page is performing steadily: traffic is not declining, engagement is high, information is current. Monitor quarterly — no action needed.
KeepThe page has potential (positions 5–20, impressions exist) but traffic has declined or content is outdated. Action: rewrite sections, add current data, improve structure, refresh internal links.
UpdateMultiple similar pages compete for the same queries (cannibalism) or a page is too thin to stand alone. Action: merge into one strong page, set up 301 redirects from all old URLs.
ConsolidateThe page generates no traffic, earns no links, drives no conversions, and is not technically required. Action: delete, set up a 301 redirect to the nearest relevant URL or the section root.
DeleteDecision rules
- Has backlinks → always redirect (even when deleting)
- Drives conversions → Update before any other decision
- Two equally matched pages cannibalising each other → Consolidate into the stronger one
- Indexed but no traffic, no links for > 12 months → Delete
- Utility / legal pages (privacy policy, terms) → Keep, do not touch
Step 5: prioritisation and action plan
Once every page has a decision, you need to sequence the work. Not everything can be done simultaneously — start where the impact potential is highest.
Priority 1: pages ranking positions 5–20 with high impressions and low CTR. A quick title and lead improvement drives more clicks without a full rewrite.
Priority 2: cannibalising pages. While they compete with each other, neither reaches its ceiling. Consolidation removes the blocker.
Priority 3: pages with falling traffic (trend down 20% or more over six months). This signals content ageing — a rewrite with updated data is needed.
Priority 4: removing dead weight. This frees up crawl budget and improves overall domain quality, but does not produce an immediate traffic lift.
Tools for a content audit
| Tool | Task | Cost |
|---|---|---|
| Screaming Frog | Crawling, on-page data collection | Free up to 500 URLs, £259/year |
| Google Search Console | Traffic, positions, impressions | Free |
| Google Analytics 4 | Engagement, conversions | Free |
| Ahrefs Site Audit | Backlinks, technical issues, content metrics | From $129/mo |
| Semrush Content Audit | Semi-automated KUCD classification | From $139/mo |
| Sitebulb | Advanced architecture visualisation | From $13.50/mo |
| Google Sheets | Master audit spreadsheet, decision tracking | Free |
For most projects, the free stack is sufficient: Screaming Frog (up to 500 URLs) + GSC + GA4 + Google Sheets. Paid tools speed things up for large sites but do not replace analytical thinking.
How often to run a content audit
Frequency depends on the site's size and content publication pace.
- Full audit
- Once a year. Covers the entire inventory and refreshes the KUCD matrix for every page.
- Mini-audit of top pages
- Quarterly. Top 50–100 pages by traffic. Tracks trends without rebuilding the full spreadsheet.
- Drop monitoring
- Monthly. GSC alerts for pages with clicks down > 20% MoM. Enables quick reaction without waiting for a big audit cycle.