Hi Joseph,
> Is there a sudden spike in articles?
there shouldn't be spikes in the number of articles and also no
exponential grows. During 2021 and 2022 until now, the number of
articles (all languages) in the news data set [1] crawled per month
ranges between 15 and 18 million.
Of course, on the level per news site there may be spikes if a
feed/sitemap was lost and/or (re)discovered. There is also noise
caused by ad links in feeds/sitemaps or by expired news domains
now hosting different types of content.
In doubt, could you share details how your numbers were achieved?
Best,
Sebastian
[1]
https://commoncrawl.org/2016/10/news-dataset-available/