QuickSEO
PricingToolsBlog
Get Started

Free Soft 404 Detector

Detect pages that return HTTP 200 but display error-like content. Enter a sitemap URL or paste a list of URLs to find soft 404s that waste crawl budget and hurt your SEO.

Check Pages for Soft 404s

Enter a sitemap URL (e.g. https://example.com/sitemap.xml) or paste a list of page URLs (one per line, up to 50).

What Is a Soft 404?

A soft 404 is a page that returns an HTTP 200 (OK) status code but contains little or no meaningful content. Instead of properly returning a 404 or 410 status code, the server tells search engines the page is fine — even though it shows an error message, an empty page, or placeholder text. This is one of the most common and most overlooked technical SEO issues.

Google specifically calls out soft 404s in Search Console because they waste crawl budget. When Googlebot encounters a soft 404, it still crawls and processes the page, consuming resources that could be spent discovering and indexing your real content. At scale, hundreds or thousands of soft 404s can significantly slow down how quickly Google finds your new and updated pages.

Detection Signals This Tool Checks

Our soft 404 detector analyzes each page for multiple signals and assigns a confidence rating:

  • Word count: Pages with fewer than 50 words of main content are flagged as likely soft 404s. Pages under 150 words get a medium-confidence warning.
  • 404-like phrases: Common error phrases such as "page not found", "no longer available", "does not exist", and "sorry" in the page body.
  • Title tag analysis: Title tags containing "404", "not found", "error", or "unavailable" are strong soft 404 signals.
  • Content-to-HTML ratio: Pages where the visible text is less than 5-10% of the total HTML code are likely template-only pages with no real content.
  • Empty or boilerplate-only content: Pages where all visible text comes from navigation, header, and footer elements with no unique main content.
  • Noindex signals: Pages with HTTP 200 and a noindex meta tag may be intentionally hidden error pages.

Why Soft 404s Hurt Your SEO

Soft 404s cause real damage to your search performance in several ways:

  • Wasted crawl budget: Googlebot has a finite crawl budget for your site. Every soft 404 it crawls is a page of real content it could have discovered instead.
  • Diluted site quality: Google uses aggregate quality signals. Pages with no meaningful content lower your overall site quality score.
  • Index bloat: Soft 404s can get indexed, adding low-quality pages to Google's index under your domain. This dilutes your topical authority.
  • Poor user experience: Users who land on soft 404s from search results will bounce immediately, sending negative engagement signals back to Google.
  • Missed redirect opportunities: Deleted product pages and old content URLs often become soft 404s when they should be 301-redirected to relevant alternatives.

Common Causes of Soft 404s

  • Deleted products or content: E-commerce sites and blogs often remove items without setting proper 404/410 status codes or redirects.
  • Empty search result pages: Internal search pages with no results that still return 200 OK.
  • Custom error pages returning 200: Web servers or CMS platforms configured to show a "not found" page without the correct HTTP status code.
  • Template pages with no content: CMS-generated pages that have a header, footer, and sidebar but no main body content.
  • Expired campaign pages: Landing pages for ended promotions that show "this offer is no longer available" but still return 200.
  • Broken dynamic pages: Pages that rely on database content but the underlying data has been deleted, resulting in empty templates.

How to Fix Soft 404s

  1. Return a proper 404 or 410 status code — If the content no longer exists and there is no relevant replacement, configure your server to return a real 404 (Not Found) or 410 (Gone) status code. This clearly tells search engines to remove the URL from the index.
  2. Set up 301 redirects — If a similar or replacement page exists, redirect the old URL to the new one with a 301 (Moved Permanently) redirect. This preserves any backlink equity pointing to the old URL.
  3. Add meaningful content — If the page should exist, add real content. Ensure the page has a proper title, headings, body text, and serves the user's intent.
  4. Fix your CMS error handling — Many CMS platforms return 200 for missing content by default. Configure your application to return proper HTTP error codes when content is not found.
  5. Remove from sitemap — Ensure soft 404 URLs are not included in your XML sitemap. Having them in the sitemap encourages Google to crawl them repeatedly.

How to Use This Tool

  1. Enter your URLs — Paste a sitemap URL to scan all listed pages, or enter individual page URLs (one per line, up to 50).
  2. Click "Detect Soft 404s" — The tool fetches each page, analyzes the content, and checks for multiple soft 404 signals.
  3. Review the score and summary — The overall score tells you how healthy your pages are. Summary cards show the breakdown of soft 404s by confidence level.
  4. Inspect individual pages — Each page shows its word count, content-to-HTML ratio, title tag, and specific signals that triggered the detection.
  5. Filter and export — Use filters to focus on soft 404s only, and export results as CSV for your team or audit report.

Frequently Asked Questions

What is a soft 404?

A soft 404 is a page that returns an HTTP 200 (OK) status code but displays error-like content such as "page not found" messages, empty pages, or very thin content. Unlike a real 404 which tells search engines the page does not exist, a soft 404 misleads crawlers into thinking the page is valid, wasting crawl budget and potentially harming SEO.

How does Google detect soft 404s?

Google uses multiple signals to detect soft 404s, including very low word count, common 404 phrases in the page content or title tag, low content-to-HTML ratio, and pages that look similar to the site's actual 404 error page. Google Search Console reports soft 404s under the Coverage or Pages section.

Why are soft 404s bad for SEO?

Soft 404s waste crawl budget because Googlebot spends time crawling pages with no real content. They can dilute your site quality signals, confuse search engines about which pages are important, and prevent real content from being discovered and indexed quickly. Fixing soft 404s improves crawl efficiency and overall site health.

How do I fix soft 404 pages?

You have three main options: (1) Return a proper 404 or 410 status code so search engines know the page does not exist. (2) Add meaningful content to the page if it should exist. (3) Set up a 301 redirect to a relevant alternative page. The best choice depends on whether the URL should exist with content or should be removed entirely.

How many URLs can I check at once?

You can check up to 50 URLs at once, either by pasting them directly (one per line) or by entering a sitemap URL. The tool will fetch each page, analyze the content, and report whether it shows soft 404 signals with a confidence rating.

Track Your Brand Across Google & AI

QuickSEO connects your Google Search Console data with AI visibility tracking across ChatGPT, Claude, and Gemini — all in one dashboard.

Try QuickSEO →

Related Tools

Bulk HTTP Status Checker

Check HTTP status codes for up to 100 URLs at once.

Thin Content Checker

Analyze page word count and content depth to identify thin content.

Content-to-HTML Ratio Checker

Analyze the text-to-code ratio of any page to identify code-heavy pages.

QuickSEO

HomeBlog

Other

Python Indexing ScriptBlog Idea GeneratorBlog Outline GeneratorSitemap URLs ExtractorSitemap ValidatorFavicon CheckerRobots.txt Validator

Track

ChatGPTClaudeGemini

Legal

Terms of usePrivacy policy

Contacts

support@quickseo.ai