Join our community of websites already using SEOJuice to automate the boring SEO work.
See what our customers say and learn about sustainable SEO that drives long-term growth.
Explore the blog →TL;DR: We crawled thousands of independent sites and measured six core SEO dimensions. The median site scores 84/100 overall — but only 66/100 on technical SEO, and 30.6% of images have no alt text. If you have one thing to fix, it's the plumbing, not the prose.
Every year, someone publishes "The Ultimate SEO Statistics for [Year]" and it's the same 80 facts recycled from Backlinko, HubSpot, and a Google blog post from 2019. The studies behind those numbers were run on Fortune 500 sites, enterprise CMS platforms, and media companies with dedicated SEO teams. If you're running a bootstrapped SaaS, an independent publication, or a small agency, those numbers describe a world you don't live in.
So we ran our own crawl. Thousands of independent, mostly small sites — the long tail of the web, not the enterprise layer. We pulled the latest snapshot per site, measured six dimensions, and cross-checked against published third-party studies. The results are messier and more honest than the usual roundups. Which is exactly why they're more useful.
(I should be upfront about what this isn't: a random sample of the entire web. Our corpus skews toward the kinds of sites that sign up for an SEO tool: small publishers, indie SaaS products, agency client sites, niche blogs. That's a bias worth knowing about before you cite these numbers.)
Each site in our dataset gets a full crawl: pages, images, metadata, load timing, and link structure. From that crawl we compute four composite scores: overall, content, technical, and accessibility. Each runs from 0 to 100. We also surface issue rates as percentages of pages or images affected.
The six metrics we publish here are the ones we're confident enough in to share externally: the four scores, the image alt-text rate, and page load time. We excluded a handful of metrics that showed suspiciously clean numbers (zero percent on some issue categories), which are likely measurement artifacts, not reality.
We're not claiming this is the definitive picture of the web. We're claiming it's an honest picture of a specific slice of it: the independent, often under-resourced sites that make up most of the web by count, even if not by traffic.
The median site in our corpus scores 84/100 overall. That sounds good, and in some ways it is. But the aggregate hides a structural split.
| Dimension | Median Score | Mean Score |
|---|---|---|
| Overall SEO | 84 / 100 | 83 / 100 |
| Content | 82 / 100 | 80 / 100 |
| Accessibility | 100 / 100 | 89 / 100 |
| Technical SEO | 66 / 100 | 69 / 100 |
The 18-point gap between the overall score (84) and the technical score (66) is the story. Content is close to overall — most sites that care enough to monitor their SEO have their writing in reasonable shape. Technical SEO is where the debt accumulates, quietly, without triggering any obvious alarm.
A median of 66 on technical SEO is one thing. What I find more telling is that 14% of sites in our corpus score below 50 on technical — a level where structural issues are almost certainly affecting crawlability and rankings. On the overall score, almost no sites fall below 50. Technical is where the tail gets heavy.
Page load is a major contributor. 29% of pages in our corpus load slowly, and the median load time is 2.54 seconds. For context, the HTTP Archive Web Almanac 2025 found that only 48% of mobile sites passed Core Web Vitals, and LCP (Largest Contentful Paint) was the hardest metric, with just 62% of mobile pages passing it. A 2.54s median load puts a lot of sites right at the edge of the acceptable zone, where a single unoptimized image or render-blocking script pushes them into the failing band.
Why is technical always the weak spot? My working theory: content feels urgent, technical debt doesn't. Writing a new post is visible. Fixing your canonical tags, compressing your images, or cleaning up your redirect chains takes an hour with no obvious immediate reward. So it accumulates.
The irony is that technical fixes compound faster than content fixes. A crawlability problem that's been suppressing 20 pages for a year doesn't care how good those pages are.
This is the number that surprised me most: 30.6% of images across our corpus have no alt text at all. Across millions of images crawled. One in three.
What makes this finding credible is the triangulation. We're not alone:
Our number (30.6%) is higher than WebAIM's (16.2%) for an obvious reason: WebAIM measured the top million sites, which skew toward larger organizations with dedicated teams. We measured thousands of independent sites. The gap between 16% and 30% is probably the gap between "has an SEO person" and "doesn't."
Alt text is, practically speaking, the cheapest SEO fix on this list. It doesn't require a developer. It doesn't require a deployment. It doesn't require understanding Core Web Vitals. It just requires going through your images and writing what they are. If you run an audit and find a high alt-text gap, that's your first fix.
There's also an increasingly relevant AI angle here. Image search is a real traffic channel, and vision models that crawl content for AI citations read alt text. Leaving it blank isn't just an accessibility failure — it's a discoverability failure.
I'll be honest: I'm not sure how much image alt text moves rankings directly versus accessibility-adjacent signals. The evidence I've seen is mixed. But I'm confident it's not neutral, and it's so cheap to fix that the cost-benefit calculation is obvious.
These three numbers get less attention than load time and alt text, but they show up consistently:
For context on meta descriptions: the Web Almanac 2025 found that about one-third of pages ship with no meta description at all. Our 17.3% "weak or missing" rate is lower, probably because our sites are SEO-aware enough to have filled something in, even if the content isn't strong. A description that's 20 characters of generic filler counts as present but weak.
Internal linking at 15.2% is the one that worries me most from a rankings standpoint. Meta descriptions don't directly affect rankings. Internal links absolutely do — they're how PageRank flows through a site and how Google understands what pages matter. A site that has good content but poor internal linking is leaving authority on the table. If you write something good, link to it from five other relevant pages. That's the whole strategy.
We cut the data by industry and the honest finding is: it barely matters. The range runs from roughly 82 to 87 across industries, a five-point band. Entertainment and media sites tend toward the higher end; sustainability and environmental sites toward the lower.
| Industry | Median SEO Score | Position in Band |
|---|---|---|
| Entertainment | ~87 / 100 | Top of band |
| Technology & SaaS | ~86 / 100 | Upper band |
| Media & Publishing | ~86 / 100 | Upper band |
| Education | ~86 / 100 | Upper band |
| Real Estate | ~86 / 100 | Upper band |
| E-commerce & Retail | ~85 / 100 | Upper-mid |
| Digital Marketing | ~85 / 100 | Upper-mid |
| Business Services | ~84 / 100 | Mid |
| Healthcare | ~84 / 100 | Mid |
| Lifestyle | ~84 / 100 | Mid |
| Finance | ~83 / 100 | Lower band |
| Sustainability | ~82 / 100 | Bottom of band |
A five-point spread across twelve industries is not a story about industry. It's a story about SEO fundamentals being universal. Don't let your vertical become an excuse. The site in the sustainability sector at 82 isn't struggling because sustainability SEO is uniquely hard — it's struggling because its technical score is probably 60 and a third of its images have no alt text. Same problem, same fix, regardless of niche.
(Side note: the industry classifications here are self-reported or inferred from site content. They're directional, not precise. I'd treat these as "roughly this order" rather than a ranking you'd cite in a board deck.)
For a deeper cut of how your industry compares, we maintain a separate SEO benchmark page that updates as the corpus grows.
Our crawl data covers technical execution: what sites are actually doing. To frame why any of this matters in 2026, you need the demand side: what's happening to search traffic itself.
The zero-click story has gotten materially worse. SparkToro and Similarweb tracked 68% of US Google searches ending without a click in early 2026, up from 60% in 2024. That's a structural shift in two years. AI Overviews are accelerating it: Ahrefs' study of 300,000 keywords found 58% lower CTR for the top-ranking page when an AIO was present. Pew Research found users clicked a result on 8% of visits when an AI summary appeared, versus 15% without (though Google disputed that study's methodology, so I'd treat the exact figures as illustrative rather than definitive).
AI Overviews themselves appeared for somewhere between 6% and 25% of queries through 2025, settling near 16% in Semrush's tracking of 10 million keywords, though that number is volatile and changes with Google's rollout decisions month to month.
What does this mean for the technical and on-page gaps we found? A few things. First, ranking signals haven't gotten easier. With fewer clicks distributed across results, being in positions 1-3 matters more than it did when position 6 still got meaningful traffic. Technical issues that suppress your rankings have higher stakes now. Second, structured, crawlable content that answers questions directly is what ends up in AI summaries. Sites with thin internal linking, missing alt text, and weak meta descriptions are structurally less likely to be cited. Third (and I say this carefully), the sites that get cited and linked to as sources still matter enormously even in a zero-click world. Being a named source is different from being a search result.
We wrote a longer piece on the traffic implications of AI Overviews if you want to dig into that specifically.
The accessibility numbers deserve their own paragraph because the distribution is unusual. The median site scores 100/100 on accessibility — perfect. The mean is 89. And yet 10.2% of sites score below 50.
That's a bimodal distribution: most sites either pass cleanly or ignore it entirely. There's not much middle ground. WebAIM's 2026 scan found WCAG failures on 95.9% of the top million home pages, which sounds contradictory until you realize that "some WCAG failure" includes very minor issues, while our scoring weights failures differently. The pattern is the same: a majority with passable accessibility, a meaningful minority with significant gaps.
For SEO purposes, accessibility is increasingly load-bearing. Google explicitly uses accessibility signals, and the overlap with technical SEO (alt text, semantic HTML, crawlable structure) is large. Fixing accessibility isn't separate work from fixing SEO. On many sites it's the same work. See our deeper take at accessibility for SEO.
If I had to prioritize fixes from this data, here's the order I'd work through:
If you want to check where your site sits against these benchmarks specifically, run a free audit. It's the same crawl and scoring methodology behind the numbers in this article. You'll see your four scores, the specific issues flagged, and how you compare to the median in your industry via our benchmark data.
You can also check your domain authority to see how your backlink profile sits alongside the technical fundamentals.
From our own crawl of thousands of independent sites analyzed through SEOJuice as of mid-2026. Each site gets a full crawl of its pages, images, metadata, link structure, and load timing. We compute four composite scores (overall, content, technical, accessibility) plus issue rates. This is proprietary data: not a survey, not a citation from another study. The corpus skews toward small and independent sites, which is different from enterprise-focused datasets you'll find elsewhere.
For the patterns we're highlighting, yes, with caveats. Issue rates like "30.6% of images missing alt text" or "14% of sites below 50 on technical" are stable across sample sizes once you're into the thousands. They won't shift dramatically with more data. What they won't tell you is what's true for enterprise CMS platforms, media sites with SEO teams, or Fortune 500 properties. Those require different datasets. We're specific about what we measured so you can calibrate accordingly.
Mostly because technical SEO debt is invisible until something breaks. Content improvements feel urgent: you publish a post and see a reaction. Technical issues like slow page loads, missing canonical tags, and crawl errors accumulate silently for months or years. Content gets refreshed; technical foundations get ignored. The gap we see (66 vs 82) is probably a proxy for "how much attention does this site get from an SEO practitioner" rather than anything specific about difficulty.
Run a free audit at /tools/seo-audit/. It uses the same scoring methodology as the corpus in this article. You'll see your overall, content, technical, and accessibility scores, plus specific issues flagged. The /seo-benchmark/ page lets you compare your scores against the median for your industry.
By our scoring, anything above 80 is in the healthy range — you're in the top half of the sites we've analyzed. Above 90 puts you in the top tier. Below 70 on any single dimension (especially technical) usually means there are active issues affecting crawlability or rankings. A score of 66 on technical SEO, which is the median in our data, means the average site has real room to improve even if it doesn't feel broken.
Related reading:
no credit card required