The WebAIM Million
The 2022 report on the accessibility of the top 1,000,000 home pages
For the fourth consecutive year, WebAIM conducted an accessibility evaluation of the home pages for the top 1,000,000 web sites. The evaluation was conducted using the WAVE stand-alone API (with additional tools to collect site technology and sector parameters). The results provide an overview of and insight into the current state of web accessibility for individuals with disabilities and trends over time.
Sites without home pages, pages that returned errors (404, etc.), or pages with fewer than 10 HTML elements were not included.
The WAVE accessibility engine was used to analyze the rendered DOM of all pages after scripting and styles are applied. WAVE detects end-user accessibility barriers and Web Content Accessibility Guidelines (WCAG) conformance failures. All automated tools, including WAVE, have limitations—not all conformance failures can be automatically detected. Absence of detected errors does not indicate that a site is accessible or compliant. Although based on a subset of accessibility issues, this report provides a quantified and reliable representation of the current state of the inaccessibility of the most influential pages on the web.
Across the one million home pages, 50,829,406 distinct accessibility errors were detected—an average of 50.8 errors per page. The number of detected errors decreased by 1.1% since the February 2021 (51.4 errors/page) analysis. "Errors" are WAVE-detected accessibility barriers having notable end user impact, and which have a very high likelihood of being WCAG 2 Level A/AA conformance failures.
Home Page Complexity
Home page complexity increased significantly in 12 months, from an average of 887 elements in February 2021 to 955 elements per home page in February 2022 - a 7.7% increase.
The following chart shows the number of home page elements detected over the last four WebAIM Million studies:
Home page elements have increased over 22% in the last 3 years. It is promising that detected errors have decreased slightly while page complexity has increased notably, but it seems that accessibility progress could be greatly accelerated if home pages were not growing at such alarming rates.
5.3% of all home page elements had a detected accessibility error. Users with disabilities would expect to encounter errors on 1 in every 19 home page elements with which they engage.
Error density (number of errors divided by number of page elements) is provided in the site lookup, but is an unreliable metric of site accessibility. A significant number of page elements (
<span> elements, for example) may result in a lower error density (suggesting better accessibility), when in fact many new accessibility errors may have also been introduced. This report focuses on average number of detected errors—likely end user barriers—present as opposed to error densities (how diluted those errors are within page elements).
96.8% of home pages had detected WCAG 2 failures! This improved very slightly from 97.4% in February 2021 and 97.8% in 2019. These are only automatically detected errors that align with WCAG conformance failures with a high level of reliability. Because automatic testing cannot detect all possible WCAG failure types, this means that the actual WCAG 2 A/AA conformance level was certainly lower.
This chart shows the percentage of home pages with detected WCAG conformance failures over time:
While the rate of pages with no detectable errors was very low, 21.6% of pages had 5 or fewer detected errors and 31.3% had 10 or fewer. Most conformance failures fall into just a few categories, as described below.
|WCAG Failure Type||% of home pages in 2022||% of home pages in 2021||% of home pages in 2020||% of home pages in 2019|
|Low contrast text||83.9%||86.4%||86.3%||85.3%|
|Missing alternative text for images||55.4%||60.6%||66.0%||68.0%|
|Missing form input labels||46.1%||54.4%||53.8%||52.8%|
|Missing document language||22.3%||28.9%||28.0%||33.1%|
96.5% of all errors detected fall into these six categories. These most common errors have been the same for the last 4 years. Addressing just these few types of issues would significantly improve accessibility across the web.
Low Contrast Text
Low contrast text, below the WCAG 2 AA thresholds, was found on 83.9% of home pages. This was the most commonly-detected accessibility issue. On average, each home page had 31.6 distinct instances of low-contrast text.
Images and Alternative Text
There were 39,750,181 images in the sample, or 39.8 images per home page on average. The number of images increase 4.7% since 2021. 23.2% of all home page images (9.2 per page on average) had missing alternative text (not counting
alt=""). Nearly half of the images missing alternative text were linked images—resulting in links that were not descriptive.
10.4% of images with alternative text had questionable or repetitive alternative text—such as alt="image", "graphic", "blank", a file name, etc., or alternative text identical to adjacent text or alternative text.
These data show that one may expect over one third of the images on popular home pages to have missing, questionable, or repetitive alternative text.
39% of the 4.4 million form inputs identified were not properly labeled (either via
title). This is a notable improvement from 45% in 2021, 55% in 2020, and 59% in 2019.
Nearly 23.2 million headings were detected (over 23 on average per home page). The number (and prevalence) of heading levels were:
- 1.7 million
- 7.2 million
- 8.2 million
- 3.8 million
- 1.6 million
- .8 million
The number of headings present on analyzed home pages increased 8% from 21.5 million in 2021. The prevalence of all heading levels also increased.
Because headings are the primary mechanism used by screen reader users to navigate content, their proper implementation is important. 19.6% of home pages had more than one
<h1>—an increase from 18.4% in 2021. There were 1,092,097 instances of skipped heading levels (e.g., jumping from
<h4>) and 1 in every 21 headings was improperly structured. Skipped headings were present on 40.4% of all pages (up from 38.4% in 2021), and 9.9% of pages had no headings present at all (down from 10.6% in 2021).
75.3% of home pages had at least one region (or ARIA landmark) defined—an increase from 69.1% in 2021, 68.9% in 2020 and 62.4% in 2019. A
<main> element or main landmark was present on 36% of home pages, up from 30.1% in 2021, 27.8% in 2020 and 23.5% in 2019. 19.9% of home pages had a "search" landmark, up from 17.6% in 2021.
96% of home pages with a main region/landmark had only one instance (which is a correct implementation). Pages with a navigation region/landmark present averaged 2.3 of them per page.
60,227,207 ARIA attributes were detected—over 60 per page on average! ARIA code usage increased 26% in just one year and has nearly tripled since 2019.
Home pages averaged 11.4
aria-describedby attributes. The prevalence of ARIA labels and descriptions nearly doubled in 2 years.
74.6% of the one million home pages used ARIA (excluding ARIA landmark roles)—an increase from 68.1% in 2021, 64.6% in 2020, and 60.1% in 2019. Home pages with ARIA present averaged 70% more detected errors than those without ARIA—one would expect to encounter an additional 23 potential barriers on home pages with ARIA present.
Increased ARIA usage on pages correlated to higher detected errors. The more ARIA attributes that were present, the more detected accessibility errors could be expected. This does not necessarily mean that ARIA introduced these errors (these pages are more complex), but pages typically had more errors when ARIA was present.
7.6% of home pages had an ARIA menu (
role="menu"), but 33% of ARIA menus introduced accessibility barriers due to the lack of necessary ARIA menu markup and interactions.
Home pages also averaged 11
aria-hidden="true" attributes (up from 8.9 in 2021 and 6.6 in 2020) and 2.5
role="button" attributes (up from 2.2 in 2021 and 1.3 in 2020) per page. Over 16.6 million (16.6 per page on average) instances of
tabindex=1 were present (up 62% from 2020).
14% of home pages had a "skip" link present. However, one out of every eleven "skip" links were broken—either they were hidden in a way that made them inaccessible or the link target was not present in the page.
86.1% of home pages had a valid HTML5 doctype—a notable increase from 79.1% in 2021. Pages with a valid HTML5 doctype had nearly double the page elements (average of 1013 vs. 595) and 15% more errors (average of 51.8 vs. 45) than those with other doctypes. 700 unique doctypes were encountered in the million-page sample.
We're grateful for the support of webshrinker.com in providing us the site category data. Their support makes this valuable information available.
The home pages were categorized based on content into IAB Content Taxonomy categories. The table below shows the number of home pages in each category (some sites may be in more than one category), average number of errors in that category, and the percent difference in errors for that category from the average of 50.8 errors for the entire million-page sample. In other words, the percentage difference is how much better or worse that category is than the average home page.
|Category||# of home pages||# of errors|
(difference from average)
|Law, Government, and Politics||30,570||29.5 (−41.9%)|
|Social Media||7,874||39.2 (−22.9%)|
|Technology and Computing||175,396||44.2 (−13.0%)|
|Food and Drink||27,438||46.4 (−8.7%)|
|Health and Fitness||52,941||47.1 (−7.3%)|
|Family and Parenting||5,585||48.9 (−3.9%)|
|Personal Finance||31,452||50.2 (−1.2%)|
|Religion and Spirituality||12,047||52.2 (2.7%)|
|Home and Garden||23,876||53.0 (4.3%)|
|Arts and Entertainment||45,883||55.8 (9.7%)|
|Hobbies and Interests||84,437||57.5 (13.1%)|
|Style and Fashion||16,823||64.1 (26.1%)|
|Real Estate||14,516||68.2 (34.2%)|
|Adult Content||18,048||91.4 (79.7%)|
There were notable differences in accessibility errors for sites in different categories. Home pages in the Food and Drink category were most improved since 2020 with errors reduced from 66.1 to 46.4 errors on average. This improvement may be partially attributed to the significant increase in litigation regarding web accessibility in this sector. Shopping sites, which were also highly subject to accessibility complaints and lawsuits, were greatly improved from 90.5 errors to 72.9 errors on average over two years, yet this category remains among the least accessible.
793 unique top-level domains (e.g., com, .tv, .fashion, etc.) were represented in the million pages analyzed. Home pages with .com (492,206), .org (81,625), and .net (42,296) were the most common.
The table below shows the most common TLDs (n>5,000) with page counts, average number of errors, and percentage difference between from the average of 50.8 errors for the entire million-page sample.
|Top-level Domain||# of home pages||# of errors|
(difference from average)
This shows notable differences between TLDs. Home pages with .gov (21.3 errors), .edu (30.1 errors), and .us (31.0 errors), which are all affiliated with U.S.-based entities, had among the lowest number of average accessibility errors of all common (n>2000) TLDs.
77% of pages specified a document language. Despite being a WCAG failure, pages without a language defined had fewer errors on average than most pages with a language defined.
This table shows the specified page language for the most common languages, number of pages in the sample, average number of errors, and percentage difference in errors from the overall average.
|Specified Language||# of home pages||# of errors|
(difference from average)
|No language specified||216,277||48.2 (−5.2%)|
As with TLDs, there are significant differences in accessibility of pages in various languages. Chinese and Farsi pages had nearly twice as many detectable errors than pages in English.
Over 1,200 different types of web technologies were detected on the one million home pages. Common technologies detected on more than 5,000 home pages are listed below, ordered from "best" to "worst". Note that correspondence of additional errors with a technology cannot automatically be attributed to that technology.
Content Management Systems
|CMS||# of home pages||Avg. # of errors||% difference|
There was a wide diversity in the impact that the CMS choice appeared to have on accessibility with home pages. Blogger, for example, had 6 times as many errors on average as home pages using Squarespace.
|Framework||# of home pages||Avg. # of errors||% difference|
The adoption of most of these common frameworks corresponded with more accessibility errors than the average home page. This does not necessarily mean that the frameworks caused these errors, but home pages with these frameworks had more errors than on average.
|Library||# of home pages||Avg. # of errors||% difference|
|Web Framework||# of home pages||Avg. # of errors||% difference|
Home pages in the sample that utilize the popular Bootstrap framework had 9.6 more accessibility errors on average than those that did not. We can't know from these data if Bootstrap introduced these errors, but there was a correspondence of increased errors when Bootstrap was present.
|Platform||# of home pages||Avg. # of errors||% difference|
As in other categories, there were large disparities in detected errors across Ecommerce platforms, with home pages using Magento, for example, having over four times as many errors as pages using Squarespace.
|Ad Network||# of home pages||Avg. # of errors||% difference|
Pages that utilized any of these popular ad systems had more errors on average than those that did not. The data suggest that ads were among the strongest harbingers of accessibility errors. Home pages that utilize the common Google AdSense system had 23.9 more errors on average than other pages.
Other common technologies also correlated to more errors. 7.1% of pages had ReCAPTCHA and averaged 13.8 more errors than those without. 36.5% of pages had Google Fonts and averaged 4.8 more errors. 2.7% of pages had Google Maps and averaged 10.7 more errors.
Here are some fun facts regarding this research:
- With 5 analyses of over one million pages each, the WebAIM Million database is approaching four billion data points.
- Despite being 2022, 9,152 home pages had
<marquee>and 373 home pages had blinking content (
- 1,298,423 tables were observed (down from 1,533,402 in 2021) on 179,966 pages. Only 175,700 (13.5%) of the tables had valid data table markup.
- The most errors detected on a single home page was 68,826!
While our 2022 analysis saw small decreases in the number of detected accessibility errors and WCAG conformance failures, significant work remains to be done to make the web accessible to everyone. We at WebAIM hope that this report will help influence improved accessibility.