Switching Reference List to BNC_COCA_25000?


Terence Patrick Murphy

Aug 23, 2025, 11:53:14 PM
to AntWordProfiler-Discussion
Dear Professor Anthony,

This is perhaps a naive question, but I was wondering if it is possible to switch from the current set of 3 reference lists to a subdivided list with substantially more coverage?

I am trying to work with some literary texts in order to quantify literary style or register. The two texts that I am currently interested in are "The Short Happy Life of Francis Macomber" by Ernest Hemingway and "Winter Dreams" by F. Scott Fitzgerald.

I have a briefcase containing the BNC_COCA_25000 on my desktop, but I am not sure how to make AntWordProfiler access it. To be honest, I also cannot find the location on my computer of the current three-way reference list that you are using (1_gsl_1st_1000, 2_gsl_2nd_1000 and 3_awl_570).

Obviously, I would then need to slice up the BNC_COCA_25000. Ideally, I think slicing the list into 9 subdivisions would be great. But that might not be possible, of course.
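
As a rough illustration of the nine-way split described above, here is a minimal Python sketch. It assumes the list is organised as 25 frequency bands of 1,000 word families each, as the name BNC_COCA_25000 suggests, and it groups band numbers only; it does not read any list files. The function name `group_bands` is hypothetical.

```python
def group_bands(n_bands: int = 25, n_groups: int = 9) -> list[list[int]]:
    """Split band numbers 1..n_bands into n_groups contiguous,
    near-equal groups (earlier groups get the extra bands)."""
    base, extra = divmod(n_bands, n_groups)
    groups: list[list[int]] = []
    start = 1
    for i in range(n_groups):
        size = base + (1 if i < extra else 0)
        groups.append(list(range(start, start + size)))
        start += size
    return groups


if __name__ == "__main__":
    for number, bands in enumerate(group_bands(), start=1):
        print(f"subdivision {number}: bands {bands[0]}-{bands[-1]}")
```

With 25 bands and 9 groups this yields seven subdivisions of three bands followed by two of two bands. Equal-sized groups are only one choice; finer cuts among the high-frequency bands and coarser ones among the rare bands may suit a stylistic comparison better.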

On a related note, it would be amazing if a future version of AntWordProfiler could graph the distribution of the words after they have been sorted against a large corpus-based list like the BNC_COCA. My strong impression is that the graphs would trace a variety of curves that obey Zipf's Law, and would allow us to see, for example, that Fitzgerald draws on a much greater range of rarer vocabulary items than Hemingway does.
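
The Zipf's Law impression can be explored without a built-in graphing feature. The sketch below is an illustration only, not something AntWordProfiler does: it assumes plain-text input, uses a crude regex tokenizer, and estimates the slope of log(frequency) against log(rank) by ordinary least squares. Under Zipf's Law that slope is roughly -1. The function names are hypothetical.

```python
import math
import re
from collections import Counter


def rank_frequency(text: str) -> list[tuple[int, int]]:
    """Return (rank, frequency) pairs, most frequent word first."""
    words = re.findall(r"[a-z']+", text.lower())  # crude tokenizer (assumption)
    counts = sorted(Counter(words).values(), reverse=True)
    return list(enumerate(counts, start=1))


def zipf_slope(pairs: list[tuple[int, int]]) -> float:
    """Least-squares slope of log(frequency) on log(rank).
    Needs at least two distinct ranks."""
    xs = [math.log(rank) for rank, _ in pairs]
    ys = [math.log(freq) for _, freq in pairs]
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


if __name__ == "__main__":
    toy = "the cat and the dog and the bird"  # toy text, not from either story
    pairs = rank_frequency(toy)
    print(pairs)
    print(zipf_slope(pairs))
```

Running the two functions over the full text of each story and comparing the slopes, or plotting the pairs on log-log axes, would be one way to check whether the two authors' curves differ in the tail.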

At any rate, as I slowly become more familiar with AntWordProfiler, I can see we are once again deeply in your debt, Professor. The programs you have set up are a major advance on coding in R or RStudio for a whole range of tasks!

All best wishes,
Terence Murphy

Full Professor of Rhetoric and Composition
Yonsei University
Seoul, KOREA 

Brian Gallagher

Aug 24, 2025, 2:14:54 AM
to antword...@googlegroups.com, Laurence Anthony
Dear Terence,
I believe you have contacted the wrong Anthony; I think you are looking for Laurence.
Good luck with it all.

Regards,
Anthony Brian Gallagher


--
Brian Gallagher,
M.A.ODE(Open), PGCODE(Open), ProGCE, B.Sc.(Hons)
ガラカー ブライアン
Okinawa JALT 2025 Publications & Publicity Officer https://okijalt.org/
JALTCALL Conference 2025 Scheduling Officer https://jaltcall.org/
JALT PANSIG Conference (Executive Member) Forums and Posters Chair 2025 
JALT PIE SIG Assistant Program Chair 2025 https://jaltpiesig.org/
JALTCALL Post-Conference Publication Associate Editor https://jaltcall.org/publications/

Assistant Professor *Specially Appointed, Faculty of Foreign Studies, Meijo University, Nagoya Dome Mae Campus, Aichi-ken, Nagoya-shi, Higashi Ku, Yada Minami 4-102-9
Nagoya Dome Mae Campus, 4-102-9 Yada Minami, Higashi-ku, Nagoya 461-8534. Tel: 0528321151


Terence Patrick Murphy

Aug 26, 2025, 5:18:54 PM
to AntWordProfiler-Discussion
I believe you are right. I had the oddest sense as soon as I sent it that the original email had got lost in Webland. At any rate, I am glad that it has miraculously turned up in the right place!

Laurence Anthony

Aug 29, 2025, 2:56:46 AM
to antword...@googlegroups.com
Hi Terence,

First, you can get a nicely subdivided version of the BNC_COCA list here:
https://laurenceanthony.net/software/wordfamilyfinder/

This is the official home of the BNC_COCA list, including versioning. All the newest official versions of the list (from Paul Nation) will appear here.

You can swap out the inbuilt lists with the BNC_COCA via the file menu in AntWordProfiler. Check the help page if you are not sure how to do this. The inbuilt lists are stored in a resource folder inside the app itself, which is a little difficult to find. 

As for plotting graphs, AntWordProfiler produces nicely formatted tables that can be pasted directly into Excel via copy/paste, which you can then use to plot all the graphs that you want. I suggest you try that first.
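
For anyone who prefers a script to a spreadsheet, the same copied table can be turned into cumulative coverage figures ready for plotting. This is a minimal sketch under stated assumptions: the table is tab-separated with a header row, and the column names `LEVEL` and `TOKEN` are hypothetical stand-ins for whatever headers the copied table actually has. The sample numbers are made-up placeholders, not results for either story.

```python
import csv
import io


def cumulative_coverage(tsv_text: str) -> list[tuple[str, float]]:
    """Return (level, cumulative % of tokens) for each row of a
    tab-separated table with LEVEL and TOKEN columns (assumed names)."""
    rows = list(csv.DictReader(io.StringIO(tsv_text), delimiter="\t"))
    total = sum(int(row["TOKEN"]) for row in rows)
    running = 0
    result = []
    for row in rows:
        running += int(row["TOKEN"])
        result.append((row["LEVEL"], round(100 * running / total, 1)))
    return result


if __name__ == "__main__":
    # Placeholder counts for illustration only.
    sample = "LEVEL\tTOKEN\n1\t800\n2\t150\n3\t50\n"
    for level, percent in cumulative_coverage(sample):
        print(level, percent)
```

Plotting the cumulative percentage against level for each text on the same axes would show how quickly each author's vocabulary is covered by the frequency bands.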

I hope this helps!

Laurence.

###############################################################
Laurence ANTHONY, Ph.D.
Professor of Applied Linguistics
Faculty of Science and Engineering
Waseda University
3-4-1 Okubo, Shinjuku-ku, Tokyo 169-8555, Japan
E-mail: antho...@gmail.com
WWW: http://www.laurenceanthony.net/
###############################################################


Terence Patrick Murphy

Sep 1, 2025, 7:01:36 AM
to antword...@googlegroups.com
Hi Laurence,
Thanks very much! This is very helpful.
All best wishes,
Terry
