In the fiscal year 2021, the NDL undertook an optical character recognition (OCR) text conversion project for digitized materials and created OCR text data for 2.47 million digitized materials (223 million images). This accounts for almost all digitized materials in the National Diet Library Digital Collections as of the end of 2020. As a service that utilize these OCR text data, the “NDL Ngram Viewer” was released from the NDL Lab website on May 31, 2022. As of August 2022, the service provides visualization function of search results for approximately 280,000 text data of books whose copyright protection period have expired.
Jim Breen
unread,
Mar 12, 2026, 1:24:08 AM (3 days ago) Mar 12
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to edict-...@googlegroups.com
Thanks for passing this on. I was vaguely aware of it, but had never
taken the time to look at it. Interesting to see the changing use of
terms over time.