Automating Quantifying the Commons
- Mentors
- Timid Robot
- Organization
- Creative Commons
- Technologies
- python, git, os, pandas, Matplotlib, YAML, GitPython, pathlib
- Topics
- visualization, big data, CI/CD Automation, Directory Manipulation
Quantifying the Commons — an initiative emerging from the UC Berkeley Data Science Discovery Program — aims to quantify the frequency of open domain and CC license usage for future accessibility and analysis purposes. To date, previous advancements have not included automation or combined reporting, which are crucial for minimizing human error and ensuring timely updates — especially when engaging with substantial streams of data. Therefore, the primary objective of this project is to develop automation software for data collection, processing, and report generation, ensuring that the Quantifying reports are consistently updated and never more than three months out-of-date.