Data study · June 15, 2026
We probed 9,992,781 of the world's most popular domains and labelled each one alive, redirect, blocked, or dead. The real dead figure is 14.1%, not the 27.6% a naive crawl reports, because most of what a naive crawl calls "dead" is a live site blocking the crawler.
14.1% of the 9,992,781 probed domains are genuinely dead: no DNS, no connection, nothing answers. That is the real dead-web figure, not the 27.6% a naive crawl reports.
8.9% answer but block automated clients (403/429/challenge) from a datacenter IP. They are alive, just not to a bot, yet naive scans count them as dead.
10.3% of all domains (1,027,492) no longer resolve in DNS, the dominant cause of true death.
33% of .cn domains are dead, the highest rate of any common TLD. Institutional and cheap-registration TLDs rot fastest, well above the 12.8% .com baseline.
1.1% of responders are parked.
Homepage-level reachability from a datacenter IP: a lower bound.
Every probed domain, by outcome
A naive 2024 crawl of the same top-10M list reported 27.6% dead. Probe honestly, separating genuine death from anti-bot blocking and answered errors, and the real figure is 14.1%. Here is where the difference goes.
Naive crawl (27.6%): DNS failure, anti-bot 403s, 404/5xx and timeouts all lumped together
Honest probe (14.1%): no DNS, connection refused, or nothing accepts a connection
Where the "dead" really goes
The same domains, probed by an honest bot and by a browser-like client (real Chrome TLS/JA3). Where the browser row is lower on dead or blocked, the site is reachable and the bot just wasn't let in.
| Probe arm | Probed | Alive | Blocked | Dead | Dead % |
|---|---|---|---|---|---|
| Polite bot | 9,992,781 | 7,657,422 | 891,517 | 1,412,544 | 14.1% |
| Reachability (browser) | 9,997,315 | 7,743,245 | 819,599 | 1,412,889 | 14.1% |

| Country (ccTLD) | Dead % |
|---|---|
| China (.cn) | 33% |
| India (.in) | 25.8% |
| United States of America (.us) | 22% |
| Brazil (.br) | 20.9% |
| Spain (.es) | 16.6% |
| Japan (.jp) | 15.6% |
| United Kingdom (.uk) | 15.3% |
| Australia (.au) | 15% |
| Russia (.ru) | 14.8% |
| France (.fr) | 14.5% |
| Canada (.ca) | 14.1% |
| Italy (.it) | 13.5% |
| Poland (.pl) | 13.1% |
| Sweden (.se) | 11.6% |
| Switzerland (.ch) | 9.8% |
| Netherlands (.nl) | 9.7% |
| Austria (.at) | 8.6% |
| Germany (.de) | 7.6% |
| Czechia (.cz) | 7.2% |
The gap between 27.6% and 14.1% is mostly a measurement choice. A crawler that stops at the first response sees only 45.9% return a clean 200; follow the redirects and read the bodies, and 71.9% are alive. Here is where every first response ends up.
| First response → Outcome | Domains (share) |
|---|---|
| 200 OK → Alive | 4,584,611 (46.3%) |
| 3xx redirect → Alive | 2,677,304 (27%) |
| No response → Dead | 1,413,013 (14.3%) |
| 403 / 429 → Blocked | 410,511 (4.1%) |
| 3xx redirect → Blocked | 365,368 (3.7%) |
| 404 → Alive | 236,685 (2.4%) |
| No response → Blocked | 105,222 (1.1%) |
| 5xx → Alive | 85,728 (0.9%) |
| 3xx redirect → Redirect | 31,267 (0.3%) |
| 3xx redirect → Dead | 1,775 (0%) |
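The flow table can be re-aggregated by outcome to see how it relates to the headline numbers. The sketch below is illustrative only: `FLOWS` and `totals_by_outcome` are hypothetical names, and the counts are copied from the rows above. The listed flows sum to 9,911,484 rather than the 9,992,781 domains probed, and the percentages in the table appear to be shares of that smaller sum.

```python
from collections import defaultdict

# (first response, final outcome, domains), copied from the table above.
FLOWS = [
    ("200 OK", "alive", 4_584_611),
    ("3xx redirect", "alive", 2_677_304),
    ("No response", "dead", 1_413_013),
    ("403 / 429", "blocked", 410_511),
    ("3xx redirect", "blocked", 365_368),
    ("404", "alive", 236_685),
    ("No response", "blocked", 105_222),
    ("5xx", "alive", 85_728),
    ("3xx redirect", "redirect", 31_267),
    ("3xx redirect", "dead", 1_775),
]


def totals_by_outcome(flows):
    """Sum the domain counts of every flow that ends in the same outcome."""
    totals = defaultdict(int)
    for _first_response, outcome, count in flows:
        totals[outcome] += count
    return dict(totals)


totals = totals_by_outcome(FLOWS)
listed = sum(totals.values())  # domains covered by the listed flows

# Print each outcome with its count and its share of the listed flows.
for outcome, count in sorted(totals.items(), key=lambda kv: -kv[1]):
    print(f"{outcome:<9}{count:>10,}  {count / listed:.1%}")
```

Grouping by the second field rather than the first is the point of the table: a 3xx first response ends up in all four outcomes, so the first status alone says little about whether a domain is alive.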
Split the 10 million by popularity and the dead rate climbs more than 20×, from 0.8% in the top 1,000 to 16.1% past rank 5 million, while blocked runs the other way, peaking at the popular head.
99.8% of dead domains sit below rank 100,000. The popular top-100K, where most web traffic lives, is only 2.2% dead against 14.1% for the full top 10M, so weighted by attention the dead web nearly disappears.
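The 99.8% figure follows from numbers already in the text. A minimal check, assuming the 2.2% rate applies to exactly 100,000 domains and taking 1,412,544 as the total dead count from the polite-bot arm (the function name is illustrative):

```python
def share_of_dead_below_rank(total_dead: int, head_size: int, head_dead_rate: float) -> float:
    """Fraction of all dead domains that rank below the popular head."""
    dead_in_head = head_size * head_dead_rate  # about 2,200 dead domains in the top 100K
    return 1 - dead_in_head / total_dead


share = share_of_dead_below_rank(1_412_544, 100_000, 0.022)
print(f"{share:.1%}")
```

Roughly 2,200 of the 1.4 million dead domains sit in the popular head, which is why the dead rate barely registers once domains are weighted by attention.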
The same figures behind the charts above, as plain tables: easy to copy, and machine-readable for search engines and AI answer engines that can't parse a chart.
| TLD | Domains | Dead | Dead % |
|---|---|---|---|
| .com | 4,403,688 | 563,545 | 12.8% |
| .org | 878,764 | 114,092 | 13% |
| .io | 363,234 | 10,113 | 2.8% |
| .de | 348,251 | 26,446 | 7.6% |
| .net | 347,414 | 69,151 | 19.9% |
| .ru | 301,639 | 44,780 | 14.8% |
| .jp | 253,187 | 39,563 | 15.6% |
| .uk | 244,776 | 37,525 | 15.3% |
| .fr | 135,021 | 19,567 | 14.5% |
| .edu | 128,968 | 28,357 | 22% |
| .it | 107,638 | 14,564 | 13.5% |
| .ca | 106,814 | 15,023 | 14.1% |
| .br | 99,202 | 20,693 | 20.9% |
| .au | 92,764 | 13,883 | 15% |
| .nl | 86,383 | 8,400 | 9.7% |
| .pl | 70,063 | 9,212 | 13.1% |
| .es | 67,984 | 11,305 | 16.6% |
| .eu | 67,354 | 10,582 | 15.7% |
| .ch | 64,850 | 6,350 | 9.8% |
| .in | 63,614 | 16,442 | 25.8% |
| .info | 61,440 | 17,362 | 28.3% |
| .co | 60,670 | 9,264 | 15.3% |
| .cz | 53,177 | 3,853 | 7.2% |
| .app | 48,858 | 2,054 | 4.2% |
| .se | 46,523 | 5,398 | 11.6% |
| .gov | 43,435 | 11,251 | 25.9% |
| .at | 43,092 | 3,696 | 8.6% |
| .cn | 42,827 | 14,144 | 33% |
| .us | 41,645 | 9,146 | 22% |
| .me | 39,043 | 5,501 | 14.1% |

| Failure reason | Domains |
|---|---|
| ok | 7,195,864 |
| dns_failed | 1,027,492 |
| timeout | 441,714 |
| forbidden | 441,603 |
| not_found | 294,610 |
| tls_error | 192,304 |
| server_error | 128,756 |
| rate_limited | 86,240 |
| connection_error | 75,364 |
| client_error | 38,180 |
| redirect | 18,761 |
| connection_refused | 17,453 |
| too_many_redirects | 12,537 |
| auth_required | 9,527 |
| connection_reset | 6,879 |
| anti_bot | 5,154 |
| unavailable_legal | 330 |

| Outcome | Domains |
|---|---|
| alive | 7,657,422 |
| dead | 1,412,544 |
| blocked | 891,517 |
| redirect | 31,298 |
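The headline percentages can be recomputed from these counts. A short sketch with illustrative names: the four outcome counts sum to exactly the 9,992,781 domains probed, and `DNS_FAILED` is taken from the failure-reason table above.

```python
# Outcome counts from the table above (polite-bot arm).
OUTCOMES = {
    "alive": 7_657_422,
    "dead": 1_412_544,
    "blocked": 891_517,
    "redirect": 31_298,
}
DNS_FAILED = 1_027_492  # dns_failed row of the failure-reason table

probed = sum(OUTCOMES.values())


def pct(count: int, total: int) -> float:
    """Share of total as a percentage, rounded to one decimal place."""
    return round(100 * count / total, 1)


shares = {name: pct(count, probed) for name, count in OUTCOMES.items()}
print(probed, shares, pct(DNS_FAILED, probed))
```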
The full dataset is on GitHub. The rows below are a sample, ranks 3,601 to 3,650.
| Rank | Domain | Outcome | Reason | Status | Final URL |
|---|---|---|---|---|---|
| 3,601 | www.bmi.bund.de | Alive | client_error | 400 | https://www.bmi.bund.de/cookie-check-d973?l=%2F&v=61405663&m=572902497&h=1dnOVtf7DYeS%2B255ird5%2FS8bU5MVKTyUPe%2Bai7bygNg |
| 3,602 | www.emeraldinsight.com | Blocked | forbidden | 403 | https://www.emerald.com/ |
| 3,603 | tubitv.com | Alive | ok | 200 | https://tubitv.com |
| 3,604 | www.britishcouncil.org | Blocked | forbidden | 403 | https://www.britishcouncil.org |
| 3,605 | www.insta360.com | Blocked | forbidden | 403 | https://www.insta360.com/ |
| 3,606 | adblockplus.org | Alive | ok | 200 | https://adblockplus.org |
| 3,607 | dictionary.reference.com | Dead | dns_failed | n/a | n/a |
| 3,608 | distrowatch.com | Alive | ok | 200 | https://distrowatch.com |
| 3,609 | whitney.org | Alive | ok | 200 | https://whitney.org |
| 3,610 | fs.blog | Alive | ok | 200 | https://fs.blog |
| 3,611 | www.universiteitleiden.nl | Alive | ok | 200 | https://www.universiteitleiden.nl |
| 3,612 | www.garanteprivacy.it | Alive | ok | 200 | https://www.garanteprivacy.it |
| 3,613 | en-gb.wordpress.org | Alive | ok | 200 | https://en-gb.wordpress.org |
| 3,614 | java.sun.com | Alive | ok | 200 | https://www.oracle.com/java/technologies/ |
| 3,615 | www.commonsensemedia.org | Alive | ok | 200 | https://www.commonsensemedia.org |
| 3,616 | www.nhm.ac.uk | Alive | ok | 200 | https://www.nhm.ac.uk |
| 3,617 | www.nature.org | Alive | ok | 200 | https://www.nature.org/en-us/ |
| 3,618 | ipgeolocation.io | Alive | ok | 200 | https://ipgeolocation.io/ |
| 3,619 | a16z.com | Alive | ok | 200 | https://a16z.com |
| 3,620 | www.listennotes.com | Alive | ok | 200 | https://www.listennotes.com |
| 3,621 | balsamiq.com | Alive | ok | 200 | https://balsamiq.com |
| 3,622 | obamawhitehouse.archives.gov | Alive | ok | 200 | https://obamawhitehouse.archives.gov |
| 3,623 | surfshark.com | Alive | ok | 200 | https://surfshark.com |
| 3,624 | www.wellsfargo.com | Alive | ok | 200 | https://www.wellsfargo.com |
| 3,625 | www.aliyun.com | Alive | ok | 200 | https://www.alibabacloud.com/en?_p_lc=1&f-8DC992C756BA= |
| 3,626 | jsonlines.org | Alive | ok | 200 | https://jsonlines.org |
| 3,627 | www.flexjobs.com | Blocked | timeout | n/a | n/a |
| 3,628 | printify.com | Alive | ok | 200 | https://printify.com |
| 3,629 | www.liquidweb.com | Alive | ok | 200 | https://www.liquidweb.com |
| 3,630 | www.jitbit.com | Alive | ok | 200 | https://www.jitbit.com/ |
| 3,631 | theintercept.com | Alive | ok | 200 | https://theintercept.com |
| 3,632 | amzn.com | Alive | ok | 202 | https://www.amazon.com/ |
| 3,633 | adguard.com | Alive | ok | 200 | https://adguard.com/en/welcome.html |
| 3,634 | www.journals.uchicago.edu | Blocked | forbidden | 403 | https://www.journals.uchicago.edu |
| 3,635 | www.duden.de | Alive | ok | 200 | https://www.duden.de:443/ |
| 3,636 | www.jeju.go.kr | Alive | ok | 200 | https://www.jeju.go.kr/index.htm |
| 3,637 | www.fupa.net | Alive | ok | 200 | https://www.fupa.net |
| 3,638 | newsroom.fb.com | Alive | ok | 200 | https://www.meta.com/about/?utm_source=about.facebook.com&utm_medium=redirect |
| 3,639 | bitmovin.com | Blocked | forbidden | 403 | https://bitmovin.com |
| 3,640 | www.bmwi.de | Alive | client_error | 400 | https://validate.perfdrive.com/?ssa=02019ec8-388e-420b-a2c4-d33e917ef5ec&ssb=65233281062&ssc=https%3A%2F%2Fwww.bmwi.de%2F&ssi=1a6a9e87-dxzy-4ac2-b433-9c910a241fb1&[email protected]&ssm=69446146611080287107579523855317&ssn=45b1a096729bd7466ddf08d9ad6bc66019a87c88c08a-a019-4df5-b1c2a0&sso=f416e528-1221262c15f1c69230c0a765c2ef67a4ad960c2a8a68da47&ssp=83927137481781572652178155585860520&ssq=64228882394919638771823949408315608697357&ssr=MTYyLjIwOS4xMjUuNjE=&sst=Mozilla/5.0 (compatible; TenMillionDomainsBot/2.0; +https://github.com/Crawlora-org/ten-million-domains)&ssu=&ssv=&ssw=&ssx=eyJ1em14IjoiN2Y5MDAwZGY4MTc3MGMtNWIzMS00ZTFiLWE4YWUtZDAzZmU0MWFiYjllMS0xNzgxNTIzOTQ5NTQzMC04MjRmZjQ0ODZlNmY0MmMzMTAiLCJyZCI6ImV2YWx1YXRpb25lbi1ibXdrLmRlIiwiX191em1mIjoiN2Y5MDAwN2M4OGMwOGEtYTAxOS00ZGY1LWI1MjgtMTIyMTI2MmMxNWYxMS0xNzgxNTIzOTQ5NTQzMC0wMDA1N2M3MTYwYWNlYTQwMjVmMTAifQ== |
| 3,641 | underscorejs.org | Alive | ok | 200 | http://underscorejs.org |
| 3,642 | dropbox.com | Alive | ok | 200 | https://www.dropbox.com/ |
| 3,643 | www.agoda.com | Alive | ok | 200 | https://www.agoda.com |
| 3,644 | hubspot.com | Alive | ok | 200 | https://www.hubspot.com/ |
| 3,645 | unctad.org | Alive | ok | 200 | https://unctad.org |
| 3,646 | pan.baidu.com | Alive | ok | 200 | https://pan.baidu.com |
| 3,647 | www.honda.co.jp | Alive | ok | 200 | https://www.honda.co.jp |
| 3,648 | www.jsonline.com | Alive | ok | 200 | https://www.jsonline.com |
| 3,649 | www.elgato.com | Alive | ok | 200 | https://www.elgato.com/us/en |
| 3,650 | oklahoma.gov | Alive | ok | 200 | https://oklahoma.gov |
We probe a top-popularity domain list HTTPS-first from a datacenter IP, following redirects, and label each domain alive, redirect, blocked, or dead by the evidence the probe captures: a final HTTP status, or a transport error plus whether a raw TCP connect still succeeds. A served 404, a 5xx, or a Cloudflare 52x is alive (the host answered); a 403/429 or anti-bot challenge is blocked; only no DNS, a refused/reset connection, or nothing accepting a connection is dead. Every domain is probed twice, as a polite bot and as a browser-like client (real Chrome TLS/JA3), and the full per-domain dataset is open.
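The labelling rules in this paragraph can be sketched as a small decision function. This is a reading of the prose, not the project's actual code: `ProbeResult`, its fields, and the error labels are hypothetical (the labels borrow names from the failure-reason table). Two branches are assumptions the text does not spell out: an unresolved redirect maps to redirect, and a transport error on a host that still accepts a raw TCP connection maps to blocked.

```python
from dataclasses import dataclass
from typing import Optional

# Transport errors that the text treats as genuine death.
DEAD_ERRORS = {"dns_failed", "connection_refused", "connection_reset"}


@dataclass
class ProbeResult:
    status: Optional[int] = None   # final HTTP status after redirects, if any
    error: Optional[str] = None    # transport error label when no status came back
    tcp_connect_ok: bool = False   # did a raw TCP connect still succeed?
    challenge: bool = False        # anti-bot challenge page detected


def classify(r: ProbeResult) -> str:
    """Label one probe as alive, redirect, blocked, or dead."""
    if r.status is not None:
        if r.challenge or r.status in (403, 429):
            return "blocked"
        if 300 <= r.status < 400:
            return "redirect"      # assumption: the redirect chain never resolved
        return "alive"             # 200, 404, 5xx, Cloudflare 52x: the host answered
    if r.error == "too_many_redirects":
        return "redirect"          # assumption: redirect loops count as redirect
    if r.error in DEAD_ERRORS:
        return "dead"
    # Assumption: any other transport error (for example a timeout) is blocked
    # when the host still accepts TCP, and dead when nothing accepts a connection.
    return "blocked" if r.tcp_connect_ok else "dead"


print(classify(ProbeResult(status=404)))
print(classify(ProbeResult(status=403)))
print(classify(ProbeResult(error="dns_failed")))
print(classify(ProbeResult(error="timeout", tcp_connect_ok=True)))
```

Checking the HTTP status before the transport error keeps the central rule explicit: any answer from the host, even an error page, rules out dead.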
This measures whether the domain itself still resolves and answers, a different question from Pew Research's 2024 link-rot study (25% of pages from 2013-2023 are gone; 38% of 2013 pages) and Ahrefs' link-rot study (66.5% of links have rotted), which measure broken links inside living pages. It is also not the "dead internet theory": that is a claim about AI-generated content, not domain reachability.
Cite this
Crawlora (2026). Dead-Web Index 2026. 14.1% of 9,992,781 top domains are genuinely dead; 8.9% answer but block automated clients. https://crawlora.net/dead-web-index.
14.1% of the top 9,992,781 domains are genuinely dead: 1,412,544 sites that no longer resolve in DNS or refuse every connection. That is far below the often-quoted "27.6% of the web is dead," which counted anti-bot blocks and answered errors as death.
A dead site never answers: no DNS record, or nothing accepts a TCP connection. A blocked site is alive and answering, it just refuses an automated client (a 403, 429, or anti-bot challenge). 8.9% of the top web (891,517 sites) is blocked, not dead, a distinction naive crawlers miss.
This is not the dead internet theory. That theory is a claim that AI-generated content and bots have replaced human activity on the living web. This index measures the opposite and concrete thing: how many domains have gone completely dark and unreachable, with DNS gone, connection refused, server gone.
Earlier top-10M crawls counted three non-dead things as dead: anti-bot 403/429 blocks, 404/5xx pages served by a live server, and domains a single flaky DNS resolver failed to look up. Classifying honestly, where dead means genuinely unreachable, brings the real figure to 14.1%.
.cn has the highest death rate among common TLDs at 33%. Institutional TLDs like .gov and .edu also rank high, matching Pew Research's finding that government and reference pages suffer the worst link rot.
Anti-bot systems (Cloudflare, DataDome, and others) serve a 403 or a challenge to a datacenter IP while letting a real browser through. A matched browser TLS/JA3 fingerprint reaches the site where a naive bot is blocked, which is exactly why this index probes every domain twice, as a polite bot and as a browser-like client.
8.9% of the top web answers but blocks a naive bot. Crawlora escalates from a plain request to a real browser fingerprint only as far as a site demands, and bills on success, so you reach the live web: everything outside the 14.1% that is genuinely dead.
| Failure reason | Domains |
|---|---|
| informational | 12 |
| unknown | 1 |