Groups
Conversations
All groups and messages
Send feedback to Google
Help
Training
Sign in
Groups
nltk-users
Conversations
About
Location of sent_tokenize
46 views
Skip to first unread message
Julius Hamilton
unread,
Feb 6, 2022, 7:23:28 AM
2/6/22
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to nltk-...@googlegroups.com
Hey,
In which file in the source code is sent_tokenize defined?
Thanks very much,
Julius
Julius Hamilton
unread,
Feb 6, 2022, 7:23:34 AM
2/6/22
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to nltk-...@googlegroups.com
I found it in __init__.py.
I see it loads the “Punkt” sentence tokenizer from a pickle file.
Why is the tokenizer a downloaded pickle file and not just part of the NLTK module internally?
Thanks very much,
Julius
Reply all
Reply to author
Forward
0 new messages