Word frequency counter

2,593 views
Skip to first unread message

Bill Amatneek

unread,
Nov 16, 2020, 12:29:56 PM11/16/20
to indesi...@googlegroups.com
For ID cc 14.0.3 is there a Word Frequency Counter app/script, one that counts the frequency usage of each word in a (102,000 word) .indd file?

MTIA.

Best regards to all,
Bill Amatneek
Heart of a Man: Men’s Stories for Women

David Blatner

unread,
Nov 16, 2020, 4:15:25 PM11/16/20
to InDesign talk
Not sure if this will still work, but here's one idea:

Or there are also good ways to export all the text in a story, and then you could use a variety of tools (e.g. Excel) to get a word count.

If you want a word cloud, you could use this:

best wishes,
David Blatner
Publisher, InDesign Magazine, CreativePro.com

John Kramer

unread,
Nov 16, 2020, 4:40:57 PM11/16/20
to indesi...@googlegroups.com
You want a *concordance*, Bill, no?
Quick search yields a few options. Among them:
No personal experience with this.



--
You received this message because you are subscribed to the Google Groups "InDesign talk" group.
To unsubscribe from this group and stop receiving emails from it, send an email to indesign-tal...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/indesign-talk/c8403be5-9d0c-4c0e-b285-a86de8e75585n%40googlegroups.com.

Bill Amatneek

unread,
Nov 18, 2020, 1:00:58 AM11/18/20
to indesi...@googlegroups.com
Thank you, David and John.
I will look at all these ideas.

Bill Amatneek

unread,
Nov 18, 2020, 10:24:18 PM11/18/20
to indesi...@googlegroups.com
David & John, many thanks for your responses.

It appears that the apps that David suggests list each word that appears in a doc, 
but do not list the frequency usage of each word that appears in that doc.

And John, I don’t think I need a concordance, which is:
"an alphabetical list of the words (especially the important ones) present in a text, usually with citations of the passages in which they are found.”

I’d like a list that shows:

WORD FREQUENCY
a 1,234
about 120
alphabet  1
an 1,010

etc.

Better yet, it should be ordered by frequency, not alphabetized.

I see a number of online tools for this when searching on 
word frequency counter
but I’m strangely reluctant to put my entire book, 101,000 words, into an online app. 
I’d rather have something on my desktop to do this.

Again my thanks to David and John for your help.

Best regards to all,
Bill Amatneek


Not sure if this will still work, but here's one idea:
http://www.kasyan.ho.ua/indesign/all/word_gathering_and_sorting_in_dictionary.html
 
Or there are also good ways to export all the text in a story, and then you
could use a variety of tools (e.g. Excel) to get a word count.
 
If you want a word cloud, you could use this:
https://creativepro.com/creating-a-word-cloud-in-indesign/
 
best wishes,
David Blatner
Publisher, InDesign Magazine, CreativePro.com
On Monday, November 16, 2020 at 9:29:56 AM UTC-8 Bill Amatneek wrote:
 
John Kramer <johnkramer...@gmail.com>: Nov 16 04:40PM -0500

You want a *concordance*, Bill, no?
Quick search yields a few options. Among them:
http://textworld.com/scp/index.html
No personal experience with this.
 
 
 
You received this digest because you're subscribed to updates for this group. You can change your settings on the group membership page.
To unsubscribe from this group and stop receiving emails from it send an email to indesign-tal...@googlegroups.com.

Rick Gordon

unread,
Nov 29, 2020, 4:17:12 AM11/29/20
to indesi...@googlegroups.com
A script could do the following:

* Create an array (JS) / list (AS) of every word in every story (so if
a word is used 20 times, it exists 20 times in that list).

* Render that list into a block of text, one word followed by a return.

* Sort the paragraphs.

* Do a search for how many repeated lines for each entry, processing
the result for each word into a list containing the word and the
number of repeats (formatted as desired), or perhaps the number of
repeats first, for easier sorting, though if processed as
*word-tab-repeatNumber*, it could be brought into Excel and sorted
there easily.

Rick Gordon

--------------------
On November 29, 2020 at 1:01:24 AM [-0800], Bill Amatneek wrote in an
email entitled "[ID] Re: Word frequency counter":
>> <mailto:indesi...@googlegroups.com> wrote:
>>
>> indesi...@googlegroups.com
>> <https://groups.google.com/forum/?utm_source=digest&utm_medium=email#%21forum/indesign-talk/topics>
>> Google Groups
>> <https://groups.google.com/forum/?utm_source=digest&utm_medium=email/#%21overview>
>> <https://groups.google.com/forum/?utm_source=digest&utm_medium=email/#%21overview>
>>
>> Topic digest
>> View all topics
>> <https://groups.google.com/forum/?utm_source=digest&utm_medium=email#%21forum/indesign-talk/topics>
>>
>>
>> * Word frequency counter <x-msg://6/#group_thread_0> - 2 Updates
>>
>> Word frequency counter
>> <http://groups.google.com/group/indesign-talk/t/8cb80cbe97fbb1cc?utm_source=digest&utm_medium=email>
>>
>> David Blatner <da...@63p.com <mailto:da...@63p.com>>: Nov 16 01:15PM
>> -0800
>>
>> Not sure if this will still work, but here's one idea:
>> http://www.kasyan.ho.ua/indesign/all/word_gathering_and_sorting_in_dictionary.html
>>
>> Or there are also good ways to export all the text in a story, and
>> then you
>> could use a variety of tools (e.g. Excel) to get a word count.
>>
>> If you want a word cloud, you could use this:
>> https://creativepro.com/creating-a-word-cloud-in-indesign/
>>
>> best wishes,
>> David Blatner
>> Publisher, InDesign Magazine, CreativePro.com <http://CreativePro.com>
>> On Monday, November 16, 2020 at 9:29:56 AM UTC-8 Bill Amatneek wrote:
>>
>> John Kramer <johnkramer...@gmail.com
>> <mailto:johnkramer...@gmail.com>>: Nov 16 04:40PM -0500
>>
>> You want a *concordance*, Bill, no?
>> Quick search yields a few options. Among them:
>> http://textworld.com/scp/index.html
>> No personal experience with this.
>>
>>
>>
>> Back to top <x-msg://6/#digest_top>
>> You received this digest because you're subscribed to updates for
>> this group. You can change your settings on the group membership page
>> <https://groups.google.com/forum/?utm_source=digest&utm_medium=email#%21forum/indesign-talk/join>.
>> To unsubscribe from this group and stop receiving emails from it send
>> an email to indesign-tal...@googlegroups.com
>> <mailto:indesign-tal...@googlegroups.com>.
>>
>
> --
> You received this message because you are subscribed to the Google
> Groups "InDesign talk" group.
> To unsubscribe from this group and stop receiving emails from it, send
> an email to indesign-tal...@googlegroups.com
> <mailto:indesign-tal...@googlegroups.com>.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/indesign-talk/A94B8EF9-42F1-4905-B33C-1027AF5736DC%40vineyardspress.com
> <https://groups.google.com/d/msgid/indesign-talk/A94B8EF9-42F1-4905-B33C-1027AF5736DC%40vineyardspress.com?utm_medium=email&utm_source=footer>.
___________________________________________
RICK GORDON
EMERALD VALLEY GRAPHICS AND CONSULTING
___________________________________________
WWW: http://www.shelterpub.com

Rick Gordon

unread,
Nov 29, 2020, 5:08:03 AM11/29/20
to indesi...@googlegroups.com
To scale that to process an book, you'd just process each document in
turn, adding words to the list, and then do the sorting and repeated
line analysis.

--------------------
On November 29, 2020 at 2:04:34 AM [-0800], Rick Gordon wrote in an
email entitled "Re: [ID] Re: Word frequency counter":
> A script could do the following:
>
> * Create an array (JS) / list (AS) of every word in every story (so if
> a word is used 20 times, it exists 20 times in that list).
>
> * Render that list into a block of text, one word followed by a return.
>
> * Sort the paragraphs.
>
> * Do a search for how many repeated lines for each entry, processing
> the result for each word into a list containing the word and the
> number of repeats (formatted as desired), or perhaps the number of
> repeats first, for easier sorting, though if processed as
> *word-tab-repeatNumber*, it could be brought into Excel and sorted
> there easily.
Reply all
Reply to author
Forward
0 new messages