Mined repositories languages #20
Replies: 7 comments
-
Our crawler uses the However, due to GitHub API issues, sometimes we get repositories written in other languages than what we asked for and that may be the cause of confusion. In such cases, we stick with what we searched for and classifies such repositories under the language we filtered on. Let me know if you still have any doubts. |
Beta Was this translation helpful? Give feedback.
-
My doubt was if those languages are the most present ones in decreasing order or a '(semi-)arbitrary' subset. |
Beta Was this translation helpful? Give feedback.
-
Okay, they are chosen based on their popularity, so you can call it a semi-arbitrary design decision. |
Beta Was this translation helpful? Give feedback.
-
Just noticed that Smalltalk/Pharo was not there, but not sure if it can be considered relevant. |
Beta Was this translation helpful? Give feedback.
-
It would be nice to have such a list of languages |
Beta Was this translation helpful? Give feedback.
-
Not sure if it's exactly what we were talking about but here is a list of languages apparently known to GitHub: https://github.com/github/linguist/blob/master/lib/linguist/languages.yml |
Beta Was this translation helpful? Give feedback.
-
Great, I also leave this here for future references: https://madnight.github.io/githut |
Beta Was this translation helpful? Give feedback.
-
Is there a way to see if some important languages are excluded from the mining?
I have seen the language stats report in the link 'Mined Projects'.
Are these the 13 most widespread languages and everything else is 'below Kotlin' or there are holes with widespread languages in between?
Beta Was this translation helpful? Give feedback.
All reactions