Google Custom Search (GCS) data uses the totalResults returned by API for search queries of the legal tool URLs (quoted and using linkSite for accuracy), countries codes, and language codes.
The results indicate there are approximately 33,658,633,295 online works in the commons--documents that are licensed or put in the public domain using a Creative Commons (CC) legal tool.
Thank you Google for providing the Programable Search Engine: Custom Search JSON API!
Plots showing Creative Commons (CC) legal tool product totals and percentages.
Plots showing Creative Commons (CC) legal tool status totals and percentages.
Plots showing latest Creative Commons (CC) legal tool totals and percentages.
Plots showing prior Creative Commons (CC) legal tool totals and percentages.
The unit names have been normalized ( => CC BY-ND-NCCC BY-NC-ND).
Plots showing retired Creative Commons (CC) legal tools total and percentages.
For more information on retired legal tools, see Retired Legal Tools - Creative Commons.
Plots showing countries with the highest useage of the latest Creative Commons (CC) legal tools.
The latest tools include Licenses version 4.0 (CC BY 4.0, CC BY-NC 4.0, CC BY-NC-ND 4.0, CC BY-NC-SA 4.0, CC-BY-ND 4.0, CC BY-SA 4.0), CC0 1.0, and the Public Domain Mark (PDM 1.0).
The complete data set indicates there are a total of 2,624,944,398 online works using a latest CC legal tool.
Plots showing languages with the highest useage of the latest Creative Commons (CC) legal tools.
The latest tools include Licenses version 4.0 (CC BY 4.0, CC BY-NC 4.0, CC BY-NC-ND 4.0, CC BY-NC-SA 4.0, CC-BY-ND 4.0, CC BY-SA 4.0), CC0 1.0, and the Public Domain Mark (PDM 1.0).
The complete data set indicates there are a total of 1,310,359,425 online works using a latest CC legal tool.
Plots showing Approved for Free Cultural Works legal tool usage.
Understanding Free Cultural Works - Creative Commons:
Using [the Freedom Defined definition of a "Free Cultural Work"], material licensed under CC BY or BY-SA is a free cultural work. (So is anything in the worldwide public domain marked with CC0 or the Public Domain Mark.) CC’s other licenses– BY-NC, BY-ND, BY-NC-SA, and BY-NC-ND–only allow more limited uses, and material under these licenses is not considered a free cultural work.
The GitHub data, below, uses the total_count returned by API for search queries of the various legal tools.
The results indicate that 483877 (0.17%) of the 289935546 total public repositories on GitHub that use a CC legal tool. Additionally, many more use a non-CC use a Public domain equivalent legal tools.
The GitHub data showcases the different level of rights reserved on repositories We have Public domain which includes works released under CC0, 0BSD and Unlicense meaning developers have waived all their rights to a software. Allowing anyone to freely use, modify, and distribute the code without restriction. See more at Public-domain-equivalent license. While a Permissive category of license contains works under MIT-0 and CC BY 4.0 allows users to reuse the code with some conditions and attribution Permissive license and Copyleft contains works under CC BY-SA 4.0. which requires any derivative works to be licensed under the same terms. Copyleft.
Thank you GitHub for providing public API access to repository metadata!
Plots showing totals by license type. This shows the distribution of different CC license and non CC license used in GitHub repositories. Allowing Commons to evaluate how freely softwares on GitHub are being used, modified, and shared and how developers choose to share their works. See more at SPDX License List
Plots showing totals by different levels of restrictions. This shows the distribution of Public domain, Permissive, and Copyleft licenses used in GitHub repositories.
The Smithsonian Institute data returns the overall statistics of CC0 legal tool records. It serves as the main legal tool used by Smithsonian Institute. The results indicate a total record of 15,058,654 objects, with a breakdown of 14,395,246 objects without CC0 Media and 5,205,203 objects with CC0 Media, taking a percentage of 16.31% in each institute member. There are 38 unique units in the data representing museums, libraries, zoos and other institutions with a minimum of 57 objects.
Plots showing totals by units. This shows the distribution of top 10 institute member across Smithsonian Institute with an average of 1,279,971.8 objects across the top 10 Institute members.
Plots showing totals by units.
This shows the distribution of lowest 10 institute member across Smithsonian Institute with an average of 3264.5 objects across the lowest 10 institute members.
Plots showing totals by CC0 records. This is the top 10 units with a breakdown of CC0 records without media, CC0 records with media and records that are not associated with CC0.
Plots showing totals by CC0 records. This is the lowest 10 units with a breakdown of CC0 records without media, CC0 records with media and records that are not associated with CC0.
This report provides insights into the usage of the Creative Commons Attribution 4.0 International across the different language editions of Wikipedia. The Wikipedia data, below, uses the Count field from the Wikipedia API to quantify the number of articles in each language edition of Wikipedia.
The total number of Wikipedia articles across 352 languages is 66,542,690. The top 10 languages account for 31,729,774 articles, which is 47.68% of the total articles. The average number of articles per language is 189,041.73.
Thank you to the volunteers who curate this data and the Wikimedia Foundation for making it publicly available!
Plots showing the language representation across different language editions of Wikipedia. This shows how many languages are underrepresented (below average number of articles) versus represented (above average number of articles).
Plots showing the most represented languages across the differentlanguage editions of Wikipedia.
Plots showing the least represented languages across the different language editions of Wikipedia.
This report was generated as part of:
creativecommons/quantifying: quantify the size and diversity of the commons--the collection of works that are openly licensed or in the public domain
The data used to generate this report is available in that repository at the following locations:
| Resource | Location |
|---|---|
| Fetched data: | 1-fetch/ |
| Processed data: | 2-process/ |
| Report data: | 3-report/ |
The Creative Commons (CC) icons and logos are for use under the Creative Commons Trademark Policy (see Policies - Creative Commons). They aren't licensed under a Creative Commons license (also see Could I use a CC license to share my logo or trademark? - Frequently Asked Questions - Creative Commons).

















