The average B2B site wastes roughly 40% of its crawl budget on pages that generate zero organic traffic. A content audit fixes that. It's a structured process: inventory every page, score each one against clear performance criteria, and assign a concrete action. Keep it, update it, merge it, or cut it. This guide walks you through all six steps, from setting audit goals to building a recurring cadence. You'll also find a scoring rubric that covers both traditional SEO signals and AI visibility, so your content performs in search and gets cited by tools like ChatGPT and Perplexity.

Most content teams are sitting on a graveyard of pages they've forgotten about. A content audit is how you find out what's buried there, and what to do about it.
A content audit is a systematic process of inventorying every page on your site, evaluating each one against defined performance and quality criteria, and assigning a clear action: keep, update, merge, or cut. That last part is what separates an audit from a simple content inventory.
Nielsen Norman Group draws a clean line between the two. A content inventory is the "what exists" layer: every URL, format, and creation date in a list. A content audit is the judgment layer on top. It assesses quality, spots gaps, and decides what gets updated, merged, or cut. You need both. The inventory without the audit is just a very long spreadsheet.
So why do most teams skip it?
Here's the kicker: skipping your audit is now twice as costly as it used to be. Ahrefs analyzed 17 million citations across ChatGPT, Perplexity, Gemini, and Copilot and found that AI assistants cite content that is 25.7% fresher than what ranks in organic Google results. Stale content doesn't just slip down the rankings. It gets ignored entirely by the AI tools your buyers use to research decisions.
Semrush recommends updating blog content every three to six months to stay accurate and aligned with current search intent. Without a recurring audit process, that cadence is impossible to maintain at scale.
The six-step process in this guide gives you a repeatable system: define your goals, build your inventory, pull performance data, score every page, assign actions, and build a cadence that keeps your content from going stale again.
There's no single right answer, but there are five clear triggers that should put an audit on your calendar.
1. Quarterly - if you're running a content-heavy site with 500+ pages (large blogs, news sites, e-commerce), quarterly audits keep decay from compounding before it costs you.
2. Every 6-12 months - the right cadence for most B2B and SaaS teams. Long enough to accumulate meaningful data, short enough to catch problems before they hurt rankings.
3. Before a website redesign or CMS migration - don't carry dead weight into a new build. An audit tells you what's worth migrating and what to leave behind.
4. After a major Google core algorithm update - when rankings shift overnight, an audit is your diagnostic tool. It tells you which pages were hit, and why.
5. After a sudden organic traffic drop - same logic. Don't guess. Audit first, then act.
Portent recommends an annual audit of your most valuable pages as a minimum baseline. Treat that as a floor, not a ceiling.
The smarter move is to build a rolling audit cadence into your content calendar so it's never a fire drill. For teams producing AI-generated content at scale, that cadence needs to be tighter. Volume accumulates faster than quality can be monitored, and the gap between "published" and "performing" widens quickly.
A content audit is a powerful diagnostic tool. But it's not a magic fix, and treating it like one is how teams end up disappointed.
Here's what a content audit is built to do:
Here's what it cannot do:
The audit surfaces the problems. Your team still has to solve them.
Most content audits die in a spreadsheet. Not because the data was wrong, but because nobody agreed on what they were looking for before they started pulling it.
Your audit goal determines which metrics you collect, how you weight them in scoring, and what a "good" outcome looks like. Skip this step and you'll end up with 800 rows of data and zero consensus on what to do next.
Write your goal as a single sentence before you open any tool. Something like: "We want to identify all pages that have lost more than 30% of their organic traffic in the last 12 months and decide whether to refresh or consolidate them." That one sentence tells you exactly which metrics matter and which ones don't.
Here are the four most common audit goal types and the metrics each one prioritizes:
The goal type you choose shapes everything downstream. HubSpot's 2026 Marketing Statistics report found that 74% of marketers say content marketing helped generate demand and leads. But that's only true when content is actively maintained, not left to decay quietly in a CMS nobody checks.
Pick one primary goal. Note secondary goals if you want, but don't try to optimize for all four at once or the audit loses focus before it starts.
Before you can score a single page, you need a complete list of what you're working with. Most teams underestimate this step and end up auditing 60% of their site by accident.
Three ways to pull your URL list:
The most complete picture comes from combining a crawl with your GSC export. The crawl finds every page; GSC tells you which ones Google has actually seen.
Your minimum spreadsheet columns:
If you're on WordPress or Webflow, use a CMS bulk export to pull page data without crawling. It won't give you SEO metrics, but it's a fast starting point for the structural columns.
For large sites (1,000+ pages), don't try to audit everything at once. Ahrefs recommends following the 80/20 rule: audit the top 20% of pages by organic traffic or backlinks first. You'll capture roughly 80% of the value without the full time investment.
What's in scope vs. out:
Blog posts, landing pages, and product or service pages are always in scope. Tag pages, author archives, and pagination are usually excluded unless they're pulling in meaningful organic traffic. When in doubt, check GSC clicks before deciding.
Trying to audit everything at once is how audits die on the spreadsheet.
Before you pull a single URL, decide how wide to cast the net. Three practical options:
For most teams running their first audit, start with the segment approach. Pick your blog, get the process right, then expand.
Nielsen Norman Group recommends a "start-small mindset" and timeboxing the initial effort to six weeks. That's enough time to surface real issues and build momentum, without the whole project collapsing under its own weight.
A URL list is just a list. What turns it into a decision-making tool is data. Pull from four sources and merge everything into one sheet.
1. Google Search Console
Export clicks, impressions, average CTR, and average position for each URL. Pull two date ranges: the last 3 months and the last 12 months. The 12-month view catches seasonal content that looks dead but spikes every winter. The 3-month view catches pages in active decay before they fall off a cliff.
2. Google Analytics 4
Pull sessions, engaged sessions, average engagement time, and conversions per page. A page with zero organic clicks might still drive hundreds of direct or email visits that convert well. GA4 tells you the full traffic picture, not just the search slice.
3. Backlink tool (Ahrefs, Semrush, or Moz)
Export referring domain count per URL. Here's the kicker: a page with low traffic but 40 referring domains is sitting on link equity you can't afford to throw away. Cutting or redirecting that page carelessly bleeds authority you've spent years building.
4. CMS or crawl data
Pull word count, last modified date, and internal link count. These tell you how fresh the content is, how thin it might be, and how well it's connected to the rest of your site.
Merging it all together
Use VLOOKUP or INDEX/MATCH in Google Sheets, keying every join on the URL column. The catch: URL normalization issues will silently break your merges. A trailing slash, an `http` vs `https` mismatch, or `www` vs non-www will cause rows to fail with no error message.
Fix this before you merge. Add a helper column that strips trailing slashes and lowercases every URL, then use that normalized column as your join key.
If you want to skip the manual merging entirely, Screaming Frog connects directly to both GSC and GA4 via its API integrations and exports everything in a single crawl file. For large sites, this alone saves hours.
Most content audits drown in data. The fix is a tiered approach: not every metric deserves equal weight, and pulling the wrong numbers first wastes hours.
Tier 1 - SEO value (must-have)
These four metrics answer one question: does this page have any organic value?
Pull these from Google Search Console and Ahrefs (or your preferred backlink tool). A page with zero clicks, zero impressions, and no referring domains has no SEO footprint. That's your first filter.
Tier 2 - Non-SEO value (important context)
Some pages earn their keep outside of organic search. Before you cut anything, check:
A page with thin organic traffic but strong conversion data is a keeper. Don't let SEO metrics alone make the call.
Tier 3 - Quality signals (build-to-rank indicators)
These signals tell you whether the page is built to rank and get cited by AI systems. Thin, unstructured, anonymous content is a liability in both channels.
Spotting content decay vs. dead content
This distinction matters. A page that drove strong traffic 12-18 months ago but has lost 30%+ of clicks in the last six months is a decay candidate, not a delete candidate. As Ahrefs notes, decay is gradual and reversible. It needs a refresh, not a funeral.
A page that has never had meaningful traffic? That's a stronger candidate for merge or cut. The history (or lack of it) is the signal.
Gut feel is the enemy of a good content audit. Without a scoring system, you'll spend hours debating whether a page is "good enough" and end up keeping things you should cut. A weighted rubric fixes that by turning a subjective call into a number.
Here's a 5-dimension framework that scores every page out of 100 points.
1. SEO Performance - 30 points This is your heaviest weight because organic traffic is the clearest signal of real-world value. Score based on organic clicks and average position in Google Search Console. A page ranking in positions 1-10 with 100+ monthly clicks earns 25-30 pts. A page with zero organic clicks in the last 90 days scores 0.
2. Link Equity - 20 points Backlinks are hard to rebuild once lost. Score based on referring domain count. Ten or more referring domains earns 15-20 pts. Zero referring domains scores 0. This dimension protects pages that may be underperforming on traffic but carry real authority worth preserving.
3. Business Relevance - 20 points This one is manual, and that's intentional. Ask: does this page support a current product, service, or target customer? A page about a discontinued feature scores 0. A page directly tied to a core use case scores 20. No algorithm can make this call for you.
4. Content Quality and Freshness - 20 points Score based on four signals: last modified date, word count relative to the SERP average, presence of E-E-A-T markers (author bio, first-person experience, cited sources), and structured data. A stale, thin page with no author and no schema scores near 0. A well-sourced, recently updated piece with markup scores 15-20.
5. GEO/AI Citability - 10 points This dimension is new, and it's becoming harder to ignore. Score based on: does the page have FAQ schema? Does it open with a clear, quotable definition in the first 100 words? Has it been cited by any AI tool? Was it updated in the last 12 months?
Why it matters: Ahrefs analyzed 17 million citations across seven AI platforms and found that AI assistants cite content that is 25.7% fresher than typical organic Google results. ChatGPT's preference is even sharper. It cites URLs that are 393 to 458 days newer than what ranks in organic search. If your content is stale, AI tools are already skipping it.
Score thresholds for decisions:
A page scoring below 20 is almost certainly hurting your site more than helping it, diluting crawl budget, splitting authority, and cluttering your topical map. The rubric just makes that visible.
Most content teams score pages for SEO and call it done. That's a blind spot. Generative engines like ChatGPT, Perplexity, Google AI Overviews, and Gemini use a partially different set of signals to decide what gets cited.
The signals overlap with traditional SEO, but four factors stand out.
1. Freshness. AI assistants strongly prefer recently updated content. Ahrefs' analysis of 17 million citations found that AI-cited content is 25.7% fresher than organic Google results, with the average cited URL being 1,064 days old versus 1,432 days for organic SERPs. If a page hasn't been touched in over a year, it's already at a disadvantage.
2. Quotability. Pages with a clear, self-contained definition in the opening paragraph, statistics tied to named sources, and structured answers are far more likely to be extracted and cited. AI systems are essentially looking for content they can lift cleanly.
3. E-E-A-T signals. Author credentials, first-person experience markers, and cited external sources signal trustworthiness to both Google and AI systems. A byline with no bio is a quiet credibility killer.
4. Structured data. FAQ, Article, and HowTo schema help AI systems parse and extract content accurately. Without it, even well-written pages get passed over.
Quick GEO audit checklist for each page:
Any page that fails three or more of these checks scores low on GEO citability. Flag it for update before anything else.
Your scoring process will surface two patterns that quietly bleed rankings for years before anyone notices them.
Keyword cannibalization happens when two or more pages target the same primary keyword. Google splits ranking signals between them, and neither page reaches its potential. The fix sounds obvious once you see it, but most teams never look.
To detect it: export all target keywords from your inventory and flag duplicates with a pivot table or conditional formatting. You can also open Google Search Console, filter by a keyword, and check which URLs appear in the top results. If two of your own pages compete for the same query, that's cannibalization. The usual fix is to merge the weaker page into the stronger one with a 301 redirect, or to clearly differentiate the intent of each page so they're no longer chasing the same query.
Content decay is the slow, often invisible decline of a page that once ranked well. Ahrefs describes it as a gradual drop in organic traffic and rankings caused by outdated information, stronger competitor content, or a shift in search intent. Here's the kicker: it's easy to miss until significant ground is already lost.
To detect it: compare 12-month GSC data against 3-month GSC data. Pages where clicks have dropped more than 30% year-over-year, with no seasonal explanation, are decay candidates.
The fix is a targeted refresh. Update statistics, add new sections, strengthen E-E-A-T signals, and update the publish date. Don't treat these pages as failures. Decay pages often carry backlinks and residual authority that make them far cheaper to recover than to replace.
That's the real difference between decay and a page that never performed: one has equity worth saving.
Your scores are meaningless without a decision attached to them. This is where the audit pays off: every page gets one of four labels, and each label comes with a specific action.
KEEP - Score 70-100
The page is performing, relevant to your business, and reasonably current. Don't touch it. Add it to a monitor list and schedule a review in six months. The only risk here is over-editing pages that are already working, so resist the urge.
UPDATE - Score 40-69
This is your biggest opportunity bucket. The page has real SEO value (traffic, backlinks, or a solid keyword position) but it's losing ground. Staleness, thin sections, or missing GEO signals are usually the culprit.
Assign a refresh priority based on score and traffic volume:
Typical refresh tasks include updating statistics and examples, expanding thin sections, adding FAQ schema, rewriting the introduction to be more quotable for AI citations, and updating the publish date once changes are live.
MERGE - Score 20-39 with a related page scoring higher
The page has some link equity or topical relevance but can't stand on its own. Consolidate: pull the best content into the stronger page, implement a 301 redirect from the weaker URL, and update internal links pointing to the old URL.
One warning: don't just delete and redirect. Read the weaker page carefully before you touch it. It may contain a unique example, a data point, or a perspective the stronger page is missing. Preserve what's worth keeping, then redirect.
As Search Engine Land notes, consolidating pages into thorough coverage reduces keyword cannibalization and sends clearer authority signals to AI search agents. That's a double win.
CUT - Score 0-19 with no backlinks and no non-organic value
This page is dead weight. It dilutes your crawl budget, contributes to index bloat, and drags down your site's overall quality signals with Google.
Your options:
The 'useful for another purpose' exception: Some pages should survive regardless of their organic traffic score. Privacy policies, terms of service, and active paid campaign landing pages aren't judged by SEO metrics. Flag these early and exclude them from the cut list.
A scored spreadsheet with 400 rows is just a different kind of paralysis. You need a filter.
Use a simple 2x2 matrix. One axis is effort to fix (low vs. high). The other is potential traffic impact (low vs. high). Every page lands in one of four buckets:
Once you've sorted pages into buckets, batch similar action types together. Do all 301 redirects in one session. Handle all meta description updates in another. Save full rewrites for a dedicated sprint.
Context-switching between redirect work and long-form editing kills productivity. Batching keeps momentum visible and makes it easy to report progress to stakeholders without a status meeting.
Most content audit guides stop at the action list. That's where the real work starts.
Assign ownership before anything else. Every action item in your audit spreadsheet needs three things: an owner, a due date, and a status column. Without those, your audit becomes a document people open once and quietly forget.
For update tasks, create a one-page brief for each URL that includes:
For merge tasks, document the source URL, destination URL, redirect type (301), and any content worth migrating before the redirect goes live. Doing this after the fact is how teams lose good content forever.
Once refreshes go live, monitor GSC clicks and impressions over the following 4-8 weeks. The timeline matters: traffic from refreshed pages often moves within days of re-indexing, while brand-new content typically takes weeks to months to gain traction. Ahrefs found that AI assistants cite content that is 25.7% fresher than standard organic results, which means a timely refresh does double duty , it can recover search rankings and improve your chances of being cited in AI-generated answers.
Set up a simple tracking tab in your audit spreadsheet with these columns:
That's it. Simple enough that someone will actually use it.
A one-time audit is better than nothing. A rolling cadence is what separates high-performing content programs from ones that stagnate between annual reviews.
Use a tiered schedule:
The third tier is the one most teams skip. A sudden traffic drop on a high-value page is a fire, not a quarterly agenda item.
Teams using a content platform with built-in GSC integration, like Content Pipeline, get these optimization signals surfaced automatically , without manually re-running the full audit each cycle. The cadence becomes part of the workflow, not a separate project you have to schedule.
Most tool guides dump a list of names and call it a day. This one doesn't. The right tool depends on where you are in the audit process, so here's what to reach for at each stage.
Inventory and crawling
Performance data
Scoring and analysis
GEO and AI visibility checks
Ongoing monitoring
---
| Tool | Stage | Free Tier | Best For |
|---|---|---|---|
| Screaming Frog | Crawl | Yes (500 URLs) | Technical crawl data |
| Ahrefs / Semrush | Crawl + Performance | No | All-in-one audit data |
| Google Search Console | Performance | Yes | Organic traffic and rankings |
| GA4 | Performance | Yes | Non-organic traffic, conversions |
| Google Sheets / Airtable | Scoring | Yes | Audit spreadsheet and rubric |
| Rich Results Test | GEO Check | Yes | Structured data validation |
| Schema Markup Validator | GEO Check | Yes | FAQ, HowTo, Article schema |
| Content Pipeline | Ongoing Monitoring | - | GSC-connected content optimization |
A content audit lives or dies by its spreadsheet. Here's the exact five-tab structure to build in Google Sheets so your whole team works from one source of truth.
Tab 1 - Master Inventory Capture every URL before you score anything. Columns: URL, page title, content type, content format, target keyword, word count, publish date, last modified date, author, canonical URL (self or other), meta description (yes/no), structured data (yes/no).
Tab 2 - Performance Data Pull this from Google Search Console and GA4. Columns: URL (join key), organic clicks (90d), organic impressions (90d), average position, CTR, referring domains, GA4 engaged sessions (90d), conversion events (90d).
Tab 3 - Scoring This is where decisions get made. Columns: URL (join key), SEO Performance score (0-30), Link Equity score (0-20), Business Relevance score (0-20), Content Quality & Freshness score (0-20), GEO Citability score (0-10), Total Score (0-100), Recommended Action (Keep/Update/Merge/Cut), Priority (High/Medium/Low), Owner, Due Date, Status.
Tab 4 - Redirect Map Track every URL you cut or merge. Columns: Source URL, Destination URL, Redirect Type (301/noindex/delete), Date Implemented, GSC Coverage Status post-implementation.
Tab 5 - Results Tracking Close the loop on every action. Columns: URL, Action Taken, Date Implemented, Clicks Before, Clicks 30d After, Clicks 90d After, Delta %.
One Google Sheet with all five tabs means stakeholders can filter by owner or priority without touching your raw data, and you track progress without rebuilding the file every quarter.
Most content audits don't fail because of bad data. They fail because of bad habits.
Here are the six mistakes that derail teams most often, and how to sidestep each one.
1. Auditing everything at once On a large site, scoring every page at once creates spreadsheet paralysis. Use the 80/20 rule: start with the top 20% of pages by traffic or backlinks. That slice typically drives the majority of your results.
2. Using organic traffic as your only signal A low-traffic page might still carry strong backlink equity or convert at a high rate. Scoring on traffic alone means cutting pages that are quietly doing real work. Use a multi-dimensional rubric that weighs backlinks, conversions, and engagement alongside organic visits.
3. Deleting pages without 301 redirects This is the most expensive mistake on the list. Removing a page without a redirect destroys its accumulated link equity and creates 404 errors that waste crawl budget. Always identify a redirect target before you remove anything.
4. Merging pages without reading both first The destination page often ends up thinner than either original. Before you merge, read both pages and carry any unique points, examples, or cited data into the final version.
5. Treating the audit as a one-time project Search Engine Land describes content decay as a slow erosion that affects any published content over time. A single audit gives you a snapshot, not a system. Build a rolling audit cadence into your content calendar so decay gets caught early.
6. Ignoring GEO signals Auditing only for Google rankings misses a growing share of visibility. Search Engine Land's 2026 GEO guide notes that Google AI Overviews now reach over 2 billion monthly users. If your scoring rubric doesn't include AI citability and structured data checks, you're optimizing for a shrinking slice of the search landscape.
An SEO audit and a GEO audit aren't the same checklist with a different label. They share a foundation, but the signals that matter diverge sharply once you factor in AI visibility.
For an SEO-focused audit, the primary signals are familiar: clicks, impressions, average position, backlinks, and on-page fundamentals like title tags, meta descriptions, heading structure, and internal links. The goal is ranking in Google's blue links.
For a GEO-focused audit, the signals shift:
Here's the kicker: pages that score well on SEO criteria often score reasonably well on GEO criteria too, because both reward authoritative, well-structured content. But the GEO layer adds specific requirements a traditional SEO audit won't catch. Freshness and structured data are the two biggest gaps. If your audit template doesn't score for both, you're flying half-blind.
The audit is done. You have a spreadsheet full of keep/update/merge/cut decisions. Now what?
That prioritized list is your content roadmap. Don't let it sit in a shared drive for three months. Translate it into a 90-day execution plan the week you finish scoring.
Weeks 1-2: Handle the technical cuts first. Implement 301 redirects and noindex tags for every page marked for removal. It's the fastest, lowest-effort work in the entire process, and it pays off immediately. According to LinkedIn SEO research, 50-75% of indexed pages on enterprise sites generate zero traffic and actively drag down overall site performance. Clearing that dead weight is pure upside.
Weeks 3-4: Attack the high-ROI updates. Focus on pages sitting in positions 11-20 with strong backlink profiles or high impressions. These are the pages closest to page one. A focused refresh, a stronger intro, a better internal link structure, and they can move. Fastest return of anything in your roadmap.
Month 2: Work through medium-priority updates and merges. Consolidating near-duplicate content takes more effort than a simple refresh, so give it proper time. Merges require redirects, rewritten copy, and updated internal links. Don't rush them.
Month 3: Create net-new content for genuine gaps. The audit will have surfaced topic clusters where you have zero coverage or where competitors clearly dominate. This is the only moment you should commission new content: when the audit has confirmed the gap exists.
Here's the uncomfortable truth the audit also tells you: if you already have 12 posts on a topic and most are underperforming, the answer is consolidation, not a 13th post.
Teams using Content Pipeline by Content Pipeline can feed audit findings directly into a 90-day content calendar, assign AI-assisted rewrites to update tasks, and publish refreshed content straight to WordPress or Webflow. The audit stops being a document and starts being a production queue, without adding headcount.
For most teams, once or twice a year is the right cadence. Semrush reports that 61% of marketers run audits at least twice a year. High-volume publishers may need quarterly reviews; smaller sites can get away with an annual pass. The trigger matters as much as the calendar: a significant traffic drop, a major algorithm update, or an upcoming site redesign all warrant an immediate audit.
It depends on your site's size and how much data you need to pull. A focused audit of 100-200 blog posts can take a few days with the right tooling. Brain Traffic estimates a manual reviewer can assess roughly 5-7 pages per hour, so a 500-page site could take one person several weeks without automation. Using a crawler like Screaming Frog alongside Google Search Console data cuts that time significantly.
An SEO audit focuses on technical and on-page signals: crawlability, site speed, broken links, and metadata. A content audit goes deeper into the substance of each page: whether it matches search intent, covers the topic well, and serves a clear purpose in the user journey. Think of an SEO audit as checking the plumbing. A content audit checks whether the rooms are worth living in.
Neither is always right. Pages with zero traffic, no backlinks, and no strategic purpose are safe to cut or redirect. Pages that rank but are outdated are better candidates for a refresh. Content Marketing Institute notes that old content doesn't just sit idle in 2025 and 2026 - it actively trains AI systems and shapes how your brand appears in AI-generated answers. That makes the update-vs-delete decision more consequential than it used to be.
Not necessarily. Google Search Console and Google Analytics are free and cover the core performance data you need. Screaming Frog (free up to 500 URLs) handles URL crawling and metadata extraction. A well-structured spreadsheet ties it all together. Paid platforms like Semrush or Ahrefs add keyword ranking data and backlink analysis, which helps when you're scoring pages at scale. Start with what you have, then add tools as the complexity grows.
A content audit isn't a one-time cleanup. It's how you stop publishing into a void and start making deliberate decisions about every page on your site. Score your inventory, act on the findings, and build a cadence that keeps your content working.
Once your audit surfaces what to keep, update, and create, you need a system to execute at scale. Content Pipeline by Content Pipeline plans, writes, optimizes, and publishes on-brand content straight to your CMS - so audit action items actually ship.
See Content Pipeline in Action
See the Content Pipeline platform, explore SEO and GEO, or compare us in AirOps alternatives.