Your Open Wikipedia Ranking project is a great resource - thanks for making its data publicly available.
I've written my own code to compute the number of links between Wikipedia pages; this corresponds to your concept of "indegree". However I'm seeing significant differences between my counts and yours. Sometimes my count is higher than yours, and other times it is lower. I'm working with a Wikipedia dump from August 1st (specifically enwiki-20230801-pages-articles.xml). I know that you may have generated your 2023 data from an earlier dump, but I think that probably doesn't explain the large differences I'm seeing.
$ grep -c 'Classification of finite simple' enwiki-20230801-pages-articles.xml
54
Perhaps you are also counting links to pages that redirect to this page? My own program isn't smart enough to take those into consideration.
Is your software that generates the Wikipedia link graph available as open source? If so, I could look at it to try to understand these differences. Your site says that "The conversion is performed using a combination of classes from MG4J and WebGraph.", however I briefly looked at the documentation of those projects and it's not obvious to me that the Wikipedia link crawling code is in one of them.
Or, if the software is not available, could you provide a precise definition of what counts as a link? Your site says that "We do not consider links in infoboxes". What about links to pages that redirect? Thanks -
Adam