Commit count is far from a perfect barometer of dev progress. It earned its place onto my 2020 list of "five worst software metrics agitating developers." Commit count is trivial to game. Unless you work in an organization that's dogmatic about its commit definition, every developer will have their own interpretation of "how much work goes into a single commit."
You could do worse. Compared to "lines of code," the signal/noise of "commit count" is at a clear advantage:
Even more important than its correlation advantage, "commit count" is much less prone to short-term spiking. Whereas "lines of code" can sometimes vaguely resemble "repo evolution" if you stand back and squint your eyes, "commit count" is a comparatively stable day-to-day artifact of developer activity underway.
For these reasons, we wanted to revisit a popular piece we posted a few years ago, git stats on dev activity, based on 295,000 of dev-years. This year, we have accumulated 878,592 developer-years of commit activity to analyze (<-- we'll use that coloring elsewhere to highlight key findings if you hope to skim this 7-minute research in one minute).
The data presented below is, as of its publishing in 2024, the largest aggregation of git commit activity known to exist online (if you know a larger-sourced research paper on this subject, leave a comment? 🙏). In the tables & graphs below, we'll present a picture of what an "average," "good," and "exceptional" year of commit progress looks like.
Every quarter, GitClear publishes a new original research paper in the space of developer experience, developer happiness, and developer productivity. Since we don't have a professional writer on staff, our research contribution is driven largely by leaning on the data that we collect in the natural course of processing commits for thousands of customers, and hundreds of open source projects.
Some metadata about the research that went into collecting the data we're presenting:
Gathered: Between 2020-2024
Source: Publicly visible data on GitHub profiles (e.g., your humble author had ~5k during 2023)
Inclusion criteria: A committer-year was included if and only if the year included between 100 (why?) and 20,000 commits (why?). Note that an average American software engineer works around 241 days per year.
Data size (n): 878,592 developer-years
Distribution: Global, all GitHub users who have not disabled profile stats
Recall that the bounds of this data is from 100 to 20,000 commits per year. We surmise, without any proof but with informed conjecture, that 80% or more of those who contribute 100+ commits per year work in a job related to software development. So it seems fair to label these stats as "Software Developer Commit Count Percentiles." So, let's do it:
As produced by this console query:
Percentile | Commit count | Commits per work day | Days active |
10th | 130 | 42 | |
20th | 171 | 58 | |
30th | 229 | 76 | |
40th | 308 | 97 | |
50th | 417 | 112 | |
60th | 564 | 149 | |
70th | 767 | 176 | |
80th | 1,071 | 204 | |
90th | 1,647 | 235 | |
99th | 4,307 | 324 |
Number of commits made annually among those who made at least 100 commits per year
To the extent that you believe that full-time developers average sometimes go a year with one commit every 2.4 days, the median developer between 2020-2024 contributed 417 commits per year. Thus, the median software developer contributes 1.73 commits per working day
The lower bound of "100" figures to include some percent of developers that work on code sparingly. Maybe they maintain a small open source project in their spare time, or they are an Upwork contractor who alternates between phases of "being engaged in tasks" and awaiting the next gig. Another way to look at the data is to update "Inclusion Criteria" to require at least 241 commits (one per working day) up to 10,000 commits (to better exclude accounts with automated commit activity) per year.
This more narrow band of eligible developer-years reduces the data set to 599,872 developer-years. To get a sense for the size of this data set, gpt-4o's best guess is that there are about 180,000 developers at Apple, Amazon, Meta, Microsoft and Google, combined, in 2024. That makes the size of this data set roughly equivalent to 3.3 years of developer stats for the five tech companies combined.
Percentile | Commit count | Commits per work day |
10th | 295 | |
20th | 362 | |
30th | 444 | |
40th | 546 | |
50th | 673 | |
60th | 834 | |
70th | 1050 | |
80th | 1,376 | |
90th | 1,978 | |
99th | 4,716 |
The table data above, graphed to visualize how annual commit count varies by percentile
To the extent that you believe Career Developers average at least one commit per working day, the median Career Developer between 2020-2024 contributed 673 commits. Thus, the median Career Developer averages 2.8 commits per day.
To produce a strata of "programmer days active per year" for the "Career Developer" cohort, we used a basic query to find 578,917 developer-years worth of data to analyze. Here was what our data set reported:
Percentile | Days with a commit |
10th | 75 |
20th | 97 |
30th | 117 |
40th | 136 |
50th | 156 |
60th | 176 |
70th | 196 |
80th | 217 |
90th | 246 |
99th | 332 |
This one is pretty interesting! Let's graph it:
The median developer in the data set has around 70 work days per year without a commit
Among those who average a commit per work day, only 20% of developers actually make a commit every work day. The median Career Developer has about 70 days per year where no commit is made, around 30% of the total work days. This assumes that the average developer will have around 230 work days per year (blending Americans with ~240 work days with Europeans and their ~220 work days annually).
Which countries are mentioned in the profile's "Location" field among developer-years with 1,000 commits or more?
Location | 1k dev years | All pro years | Percent 1k years |
Shenzhen | 22 | 156 | 14% |
Raleigh | 46 | 239 | 19% |
Sofia | 45 | 203 | 22% |
Rio De Janeiro | 52 | 234 | 22% |
Shanghai | 127 | 535 | 24% |
China | 84 | 351 | 24% |
Oakland | 51 | 206 | 25% |
Toulouse | 37 | 142 | 26% |
Moscow | 76 | 288 | 26% |
Nashville | 40 | 150 | 27% |
Russia | 78 | 287 | 27% |
Mountain View | 50 | 175 | 29% |
United States | 367 | 1274 | 29% |
Pittsburgh | 83 | 280 | 30% |
Bangalore | 150 | 501 | 30% |
Vancouver | 251 | 835 | 30% |
Redmond | 46 | 151 | 30% |
Ottawa | 101 | 328 | 31% |
Miami | 48 | 154 | 31% |
São Paulo | 201 | 644 | 31% |
Seattle | 469 | 1492 | 31% |
Norway | 95 | 298 | 32% |
Minneapolis | 94 | 293 | 32% |
Nyc | 145 | 447 | 32% |
Portugal | 106 | 324 | 33% |
Toronto | 453 | 1377 | 33% |
New York City | 129 | 387 | 33% |
Ukraine | 203 | 607 | 33% |
New York | 619 | 1827 | 34% |
Sweden | 199 | 587 | 34% |
San Diego | 121 | 356 | 34% |
Netherlands | 225 | 660 | 34% |
Munich | 185 | 541 | 34% |
San Francisco | 915 | 2651 | 35% |
Warsaw | 146 | 423 | 35% |
Spain | 135 | 389 | 35% |
Portland | 217 | 625 | 35% |
Singapore | 239 | 675 | 35% |
Paris | 592 | 1664 | 36% |
New Zealand | 81 | 227 | 36% |
Poland | 199 | 557 | 36% |
Budapest | 84 | 234 | 36% |
Zurich | 115 | 320 | 36% |
Vienna | 133 | 370 | 36% |
Brooklyn | 208 | 576 | 36% |
United Kingdom | 306 | 832 | 37% |
Prague | 177 | 480 | 37% |
Taiwan | 112 | 302 | 37% |
The Netherlands | 169 | 454 | 37% |
Stockholm | 365 | 966 | 38% |
Remote | 121 | 320 | 38% |
Switzerland | 186 | 489 | 38% |
Taipei | 102 | 268 | 38% |
Uk | 308 | 797 | 39% |
Oxford | 46 | 119 | 39% |
Dublin | 133 | 344 | 39% |
Colorado | 86 | 221 | 39% |
Philadelphia | 126 | 319 | 39% |
Zürich | 78 | 197 | 40% |
Montreal | 198 | 498 | 40% |
Sydney | 353 | 878 | 40% |
Oslo | 220 | 531 | 41% |
Melbourne | 359 | 862 | 42% |
Milan | 67 | 159 | 42% |
Seoul | 247 | 581 | 43% |
Belgium | 200 | 470 | 43% |
Utrecht | 67 | 156 | 43% |
Perth | 64 | 142 | 45% |
Valencia | 67 | 132 | 51% |
Tokyo | 1131 | 2058 | 55% |
Vilnius | 63 | 110 | 57% |
A lot of these locations don't have a huge sample of those who divulged their location, but still pretty interesting to see how there is almost a 2x difference between the density of 1k annual developers in Tokyo vs. Shenzen. Also, if, like the author, you were curious where Vilnius would be found on a map:
An extremely dense enclave of high-activity software engineers, but also, beware the tiny sample size (n=63)
In the full data set we analyzed, 1,000 commit years were fairly rare: only the top 30% of developers who average one commit per work day manage to reach 1,000 commits per year. But, as an employer of US-based software engineers, more than half of the hundred developers that our startup-sized company has employed were able to reach this threshold. This is to say, among professional Seattle-based programmers earning $150-250k annually, 1k commit years are much more common than in the overall developer population. Let's look at where the stats land for this particular demographic.
Our database finds 193,651 developer-years in the data set that accumulated 1,000 up to 15,000 annual commits:
Percentile | Commit count | Commits per work day | Diff Delta per year | Days with a commit |
10th | 1,080 | 23,500 | 163 | |
20th | 1,174 | 27,647 | 186 | |
30th | 1,283 | 32,625 | 201 | |
40th | 1,410 | 40,446 | 212 | |
50th | 1,563 | 51,130 | 222 | |
60th | 1,757 | 65,778 | 232 | |
70th | 2,021 | 90,510 | 244 | |
80th | 2,426 | 131,031 | 260 | |
90th | 3,196 | 215,281 | 285 | |
99th | 7,217 | 660,217 | 358 |
The median developer in the Senior Developer cohort contributes 1,563 commits for 51,130 Diff Delta annually (context for that much Diff Delta: it's enough to build two sites like Noteapps.info). How many days do developers have to be active to reach these heights?
Most developers with 1,000+ commits per year contribute on 200-250 days
For those aspiring to be an L6 or L7 developer at the tier 1 tech organizations, it's reasonable to ballpark this cohort as the people you compete against in the top positions. For those in support of work-life balance, it is encouraging to see that, even in this competitive group of upper-end talent, the median developer is active only on work days. It's not until one reaches the 70th percentile that we encounter the developers who don't let PTO or weekends stop them from engaging in their programmer pursuits. 🤓
Finally, let's see if we can find evidence that the proliferation of AI that began in 2022 has changed the volume of commits made by developers. To compare apples-to-apples, we'll consider only developers for which we have data before-and-after the 2022 epoch of Copilot and AI ubiquity. Specifically, this data set includes 60,658 developers who authored at least 100 commits every year between 2020 and 2023. Do they support the narrative that AI is boosting developer productivity?
Year | Median Commit Count | Average Commit Count |
2020 | 705 | 945 |
2021 | 678 | 871 |
2022 | 663 | 912 |
2023 | 638 | 908 |
Here were the database queries used to produce this data. Here are how the numbers look graphed:
So much for the unambiguous takeoff of developer productivity
It's hard to imagine how AI might actually be reducing commit activity. Still, from a correlative standpoint, there is no ambiguity that the 60k developers who have posted at least 100 commits per year have seen a gradual decline in their annual commits over the last four years. Compared to 2020, the median developer authored 9.5% less commits in 2023 than they did in 2020.
Since this finding is contrary to what common sense would predict, we consulted Diff Delta (a higher-correlating metric with "software effort" in our research) to check whether it agrees that the median developer is contributing less in 2023 than 2020. The count of developers with Diff Delta stats for each of the years totaled 339. Their median and average Diff Delta per year were:
Year | Median Diff Delta | Average Diff Delta | Days Active |
2020 | 69,721 | 137,646 | 106 |
2021 | 88,146 | 153,433 | 130 |
2022 | 88,786 | 149,344 | 130 |
2023 | 96,904 | 163,700 | 132 |
GitClear customers have gradually grown their Diff Delta each of the past four years -- more in line with what we would expect to see as powerful AI tools proliferate. It's interesting that, like the larger sample GitHub data, we do not see evidence that developers are more active with their commits than years past. Rather, it seems that, in our smaller sample of developers, each is squeezing greater amounts of code change into each commit than they had in years' past.
When attempting to understand developer activity based on commit count, much depends on where the lower bound is set. As we see in the analysis above, the commit count of the median developer depends largely on what cohort of developers is being analyzed:
All developers with at least 100 commits per year: Median: 417 commits per year over 125 days.
Career developers who average at least one commit per working day: Median 673 commits per year over 156 days.
Senior developers who average at least 1,000 commits per year. Median 1,563 commits per year over 222 days.
Median commit activity by constituency, analyzing 666,867 dev years
Among the cohort we have labeled "Senior Developers," for locations with at least 100 developers reporting, the highest concentration of Senior Developers were observed in Tokyo (55%), Belgium (43%) and Seoul (43%).
In terms of predictions, this data evaluation suggests that AI assistants are not increasingly the volume of commits made by the average developer. Rather, the data we've analyzed suggests that the "Age of AI" is thus far correlating with an increase in commit change density, but not in commit count.
If further research supports these findings, we would predict that:
Commits and PRs will see growth in the expected size of their commit message/PR description. This trend will also be influenced by the growing availability of AI description-generating tools.
Organizations will advocate more frequent committing. Between the benefits for manager transparency, and the lower cost of reviewing smaller units of work, we predict that organizations will grow more proactive in encouraging developers to find stopping points to push the growing amount of work each developer can produce with modern tooling.
Startup sizes will be smaller. This prediction supposes that a) it will take less code changes to implement big ideas in the era of LLMs b) the 90th percentile Senior Developers will continue to boost how much they can contribute, given improving tools. Already, our data suggests that a single 99th percentile developer will produce more than 10x the code change volume of a median career developer (29 commits per day vs 2.8). Add in the efficiency gains of operating with smaller teams, and it makes sense to imagine that it will become more common for teams of 5 or less developers to build large, popular products in 2025 and beyond.
What do you make of the data? Feel free to drop us a line hello@gitclear.com if you have other ideas for how we can answer your management questions using our terabytes of developer performance data.