Number of documents in Corpus?

29 views
Skip to first unread message

Gordon V. Cormack

unread,
Aug 13, 2021, 4:06:16 PM8/13/21
to TREC Health Misinformation Track
I see 1_063_805_381.  

4667 of the jsonl files have 148_410 lines.
2501 of the jsonl files have 148_411 lines.

4667*148410 + 2501*148411 = 1063805381

I managed to extract that many json records without error.
Reply all
Reply to author
Forward
0 new messages