Trends and Insights from GitHub 2022
We analyzed more than 5,000,000,000 rows of GitHub event data and got the results here. In this report, you'll get interesting findings about open source software on GitHub in 2022, including:
Top languages in the open source world over the past four years
This chart ranks programming languages yearly from 2019 to 2022 based on the ratio of new repositories using these languages to all new repositories.
Rankings of back-end programming languages
The programming languages used in a pull request reflect which languages developers used. To find out the most popular back-end programming languages, we queried the distribution of programming languages by new pull requests from 2019 to 2022 and took the top 10 for each year.
Geographic distribution of developer behavior
We queried the number of various events that occurred throughout the world from January 1 to September 30, 2022 and identified the top 10 countries by the number of events triggered by developers in these countries. The chart displays the proportion of each event type by country or region.
Developer behavior distribution on weekdays and weekends
We queried the distribution of each event type over the seven days of the week.
The distribution of specific events
Popular open source topics
Each year, technology introduces new buzz words. Can we gain insight into technical trends through the open source repositories behind the hot words? We investigated five technical areas: Low Code, Web3, GitHub Actions, Database, and AI.
Activity levels of popular topics
We queried the number of open source repositories associated with each technical area, as well as the percentage of active repositories in 2022.
Popular topics over the years
We queried the following items for each technical area from 2015 to 2022:- The annual increment of repositories- The annual increment of collaborative events- The number of developers participating in collaborative events- The annual increment of starsThen, we calculated the growth rate for each year which can reflect new entrants, developer engagement in this technical field, and the industry's interest in this area. For 2022, we compare its first nine months with the first nine months of 2021.
The most popular repositories in 2022
The number of stars is the most visible indication of the popularity of open source projects. We looked at the 50 projects that received the most stars from January 1 to September 30, 2022.
- * Suspected to be iterations of bots.
The most active repositories over the past four years
Here we looked up the top 20 active repositories per year from 2019 to 2022 and counted the total number of listings per repository. The activity of the repository is ranked according to the number of developers participating in collaborative events.
Who gave the most stars in 2022
We queried the developers who gave the most stars in 2022, took the top 20, and filtered out accounts of suspected bots. If a developer's number of star events divided by the number of starred repositories is equal to or greater than 2, we suspect this user to be a bot.
- 1Butters3388214136 starsper day
- 2frankfanslc133 starsper day
- 3mitoksim76 starsper day
The most active developers since 2011
We queried the top 20 most active developers per year since 2011. This time we didn't filter out bot events.
About GitHub events
GitHub events are triggered by user actions, like starring a repository or pushing code.
About time range
In this report, the data collection range of 2022 is from January 1, 2022 to September 30, 2022. When comparing data of 2022 with another year, we use year-on-year analysis.
About bot events
Bot-triggered events account for a growing percentage of GitHub events. However, these events are not the focus of this report. We filtered out most of the bot-initiated events by matching regular expressions.
How we classify technical fields by topics
We do exact matching and fuzzy matching based on the repository topic. Exact matching means that the repository topics have a topic that exactly matches the word, and fuzzy matching means that the repository topics have a topic that contains the word.
|Topic||Exact matching||Fuzzy matching|
|GitHub Actions||actions||github-action, gh-action|
|Low Code||low-code, lowcode, nocode, no-code|
|Database||db||database, databases nosql, newsql, sql mongodb,neo4j|
|AI||ai, aiops, aiot||artificial-intelligence, machine-intelligence computer-vision, image-processing, opencv, computervision, imageprocessing voice-recognition, speech-recognition, voicerecognition, speechrecognition, speech-processing machinelearning, machine-learning deeplearning, deep-learning transferlearning, transfer-learning mlops text-to-speech, tts, speech-synthesis, voice-synthesis robot, robotics sentiment-analysis natural-language-processing, nlp language-model, text-classification, question-answering, knowledge-graph, knowledge-base gan, gans, generative-adversarial-network, generative-adversarial-networks neural-network, neuralnetwork, neuralnetworks, neural-network, dnn tensorflow PyTorch huggingface transformers seq2seq, sequence-to-sequence data-analysis, data-science object-detection, objectdetection data-augmentation classification action-recognition|