As of January 2025, there is no consolidated inventory of all U.S. Federal open-source code repositories. This project surveys all Federal open-source code across GitHub.
Despite growing adoption of open-source practices, there is no single, comprehensive hub for U.S. Federal agencies to manage and track source code—from research projects to software and web development. Code.gov was launched to address this need but has faced challenges with compliance and broader adoption. This list maintained by GitHub offers better coverage but mixes in non-Federal organizations and omits many smaller offices or research arms within the government.
This project seeks to expand the open-source movement in government by:
Given over 192.1 million GitHub users, evaluating every account is infeasible. Instead, this project focuses on organizations that associate themselves with a .gov
domain in their registered email or listed URL. There are 8,003,003 organizations listed on GitHub as of December 2024. Among them 3,203 organizations indicated a .gov
domain in at least one of the following fields: email
, blog
, description
, company
, location
, or name
. From these 1,599 .gov
-affiliated organizations that were US-based, 1,151 organizations with at least one public repository remained.
Organizations were categorized by their primary ownership. If an organization was perceived as government-run or self-identified as such, it was included. The 404 errors likely resulted from phishing organizations set up to imitate a government agency (they often had very recent creation dates). Government research programs (e.g., MoTrPAC) were counted when it appeared that the governing entity was the US government.
Limitations: This project does not cover repositories hosted outside of GitHub or those not connected to a
.gov
organization.
Within these 775 U.S. Federal organizations, there are 25,276 repositories in total. After excluding forks and repositories without at least one GitHub star, 12,468 repositories remained.
Within these repositories there were 27,382 unique contributors. Some of these are clearly provisioned automated bots, while others indicate extremely prolific human users.
Topics were determined by examining the top 1,500 repositories and then applied using a structured GPT query.
Category | Cumulative Stars |
---|---|
🌟 Open Source Software Development | 232781 |
🛡️ Cybersecurity and Threat Analysis | 93242 |
📊 Data Integration and Analytics | 66825 |
🚀 High-Performance Computing and Simulation | 47508 |
🪐 Space and Planetary Exploration Technologies | 45059 |
🎨 Web and Design Standards | 42425 |
🤖 Embedded Systems and Robotics | 36247 |
🌍 Geospatial and Earth Observation Technologies | 29900 |
🌱 Environmental and Energy Applications | 24976 |
🧠 Artificial Intelligence and Machine Learning | 24069 |
All repositories have been archived via a "shallow copy." Full data is available upon request.