Discover How Many Websites Use Each Schema Type with Schema.org

```json
{
  "alt": "Abstract graphic showing colorful 3D bar charts and circular shapes on a blue background.",
  "caption": "Dive into data with this vibrant 3D abstract design, showcasing colorful bar charts and circular elements against a deep blue canvas.",
  "description": "This image presents a vibrant abstract illustration featuring a series of colorful 3D bar charts and circular shapes. The bars vary in height and color, including red, yellow, blue, green, and orange, while circular elements add depth and texture. The rich blue background enhances the contrast and modern aesthetic. This graphic is ideal for concepts related to data visualization, business analytics, and creative design."
}
```

Have you ever been curious about how many sites use a specific type of structured data? Now, you have the chance to find out.

I recently discovered that Schema.org is now sharing aggregated usage statistics for its terms across the public web. This means you can see exactly how many domains are using a particular schema or structured data element.

According to a Schema.org announcement, they are excited to offer a new dataset providing these statistics. Updated monthly, the data is aggregated at the domain level and categorized into popularity range buckets, which helps to filter daily noise while emphasizing meaningful adoption trends for researchers and tool developers.

What’s the appearance like? Take a look at a snapshot of two Schema.org pages, featuring author schema and event schema, displaying the usage statistics prominently at the top:

Image

Delving deeper into the data. Schema.org has further detailed the usage statistics. Here’s a brief overview:

  • Schema.org term frequencies are evaluated within Google’s public web crawling infrastructure. The aggregation occurs at the domain level (e.g., example.com), not page by page. If you use the same term on 100 pages, it still only counts as one domain using it.
  • Rather than displaying exact numbers, which can fluctuate daily, websites are categorized into range buckets (e.g., “10K – 100K” domains). This approach stabilizes the data and respects website privacy.
  • The raw data files can be accessed on GitHub under the Google Public Stats dataset. Both JSON and CSV formats are available, alongside a JSON summary format offering aggregated bucket distributions, all updated monthly.
  • Term Type: Specifies whether the term is a Type (e.g., “Person” or “Event”) or a Property (e.g., “price” or “telephone”).
  • URI: Shows the official URI of the term, such as http://schema.org/Person.
  • Domain Count Bucket: The range of unique domains utilizing the term, for instance, 100K - 1M domains.
```json
{
  "alt": "GitHub repository page showing a CSV file preview in schemaorg project.",
  "caption": "A glimpse into the schema.org GitHub repository, showcasing a CSV file preview detailing Schema.org statistics.",
  "description": "This image captures a GitHub repository page titled 'schemaorg/schemaorg'. It features a preview of a CSV file named '2026_05.csv' located within the 'data/public_stats/google' directory. The file contains several schema types such as EventVenue and TVClip, along with their domain usage statistics. The header section shows navigation tabs including Code, Issues, Pull requests, and more. The page is part of a public repository highlighted by the Schema.org Stats Bot update."
}
```

If you’re interested, here’s a peek at GitHub:

Why is this important? Well, besides my love for data, understanding the popularity of a specific schema element might just convince your development team to incorporate that schema code on your site.


Inspired by this post on Search Engine Land.


crushpress.ai community screenshot

Comments

Leave a Reply

Your email address will not be published. Required fields are marked *