Groups
Conversations
All groups and messages
Send feedback to Google
Help
Sign in
Groups
nltk-users
Conversations
About
nltk-users
1–30 of 1910
Welcome to NLTK-Users, a mailing list for discussion about using the Natural Language Toolkit
Mark all as read
Report abusive group
0 selected
Ged D
Jan 27
Analysing a dataset of emails about projects to assess project health and status
Can anyone point me at a dataset, project, research paper or documentation resource that deals with
unread,
Analysing a dataset of emails about projects to assess project health and status
Can anyone point me at a dataset, project, research paper or documentation resource that deals with
Jan 27
Bertrand De Longueville
2
Jan 16
Job opportunity : NLP Scientific Officer at EC JRC - Ispra, Italy
DEADLINE HAS BEEN EXTENDED TO 23/01/2023 - don't miss it! :-) Le samedi 24 décembre 2022 à 14:07:
unread,
Job opportunity : NLP Scientific Officer at EC JRC - Ispra, Italy
DEADLINE HAS BEEN EXTENDED TO 23/01/2023 - don't miss it! :-) Le samedi 24 décembre 2022 à 14:07:
Jan 16
Philip Leclerc
, …
Wilson
6
12/3/22
Generating sentences from an fcfg?
Hello, everyone. This is an old question. I wonder if any of you managed to solve this? Kind regards,
unread,
Generating sentences from an fcfg?
Hello, everyone. This is an old question. I wonder if any of you managed to solve this? Kind regards,
12/3/22
Monali Mavani
11/23/22
problem with nltk.lm module
Hello all, I am facing a problem with nltk.lm module. After running the following code multiple times
unread,
problem with nltk.lm module
Hello all, I am facing a problem with nltk.lm module. After running the following code multiple times
11/23/22
Jason Miller
,
Alex Rudnick
3
10/22/22
Newbie question
Thank you so much Alex! On Fri, Oct 21, 2022 at 9:26 AM Alex Rudnick <alex.r...@gmail.com>
unread,
Newbie question
Thank you so much Alex! On Fri, Oct 21, 2022 at 9:26 AM Alex Rudnick <alex.r...@gmail.com>
10/22/22
oleg kapanadze
,
Alexis
2
8/12/22
FBCFG for Georgian
Hi Oleg, A bit late, so I hope you have found the answer already; but here it is just in case (and
unread,
FBCFG for Georgian
Hi Oleg, A bit late, so I hope you have found the answer already; but here it is just in case (and
8/12/22
Negar Arabzadeh
8/7/22
Call for Participants NeurIPS IGLU competition
We kindly invite you to consider the IGLU Neurips competition with the goal of interactive embodied
unread,
Call for Participants NeurIPS IGLU competition
We kindly invite you to consider the IGLU Neurips competition with the goal of interactive embodied
8/7/22
JuHyun Park
6/24/22
license of punkt in nltk_data
Hi. Is it possible to use punkt in nltk_data for commercial use ? I cannot find out the license of
unread,
license of punkt in nltk_data
Hi. Is it possible to use punkt in nltk_data for commercial use ? I cannot find out the license of
6/24/22
Jessica Wade
5/18/22
Book and other resources!!
Hello! Is there a hardcopy available instead of an electronic copy found here -> https://www.nltk.
unread,
Book and other resources!!
Hello! Is there a hardcopy available instead of an electronic copy found here -> https://www.nltk.
5/18/22
hari....@vedavit-ps.com
,
Atul Kr. Ojha
2
5/4/22
Tool to know the process of Sanskrit word formation
You may check this online site(s): https://sanskrit.uohyd.ac.in/scl/skt_gen/compounds/start.html or
unread,
Tool to know the process of Sanskrit word formation
You may check this online site(s): https://sanskrit.uohyd.ac.in/scl/skt_gen/compounds/start.html or
5/4/22
ines faravelli
,
Jordi Carrera
3
4/21/22
doubts about use of raw() and others functions
great! I'll try with the code, thank you very much Em quinta-feira, 21 de abril de 2022 às 13:26:
unread,
doubts about use of raw() and others functions
great! I'll try with the code, thank you very much Em quinta-feira, 21 de abril de 2022 às 13:26:
4/21/22
Baba Dook
,
Jordi Carrera
3
3/11/22
How to remove stopwords from tokenized sentences?
Thank you for your useful answer, I apply your code as the following for sent in sentences:
unread,
How to remove stopwords from tokenized sentences?
Thank you for your useful answer, I apply your code as the following for sent in sentences:
3/11/22
Praveen Kumar
,
Baba Dook
2
3/5/22
Add Synonyms in NLTK corpus
Hi there, I know there is WordNet so you can find synonyms in there. El lunes, 10 de enero de 2022 a
unread,
Add Synonyms in NLTK corpus
Hi there, I know there is WordNet so you can find synonyms in there. El lunes, 10 de enero de 2022 a
3/5/22
Julius Hamilton
2
2/6/22
Location of sent_tokenize
I found it in __init__.py. I see it loads the “Punkt” sentence tokenizer from a pickle file. Why is
unread,
Location of sent_tokenize
I found it in __init__.py. I see it loads the “Punkt” sentence tokenizer from a pickle file. Why is
2/6/22
Robert Schafish
, …
Alexis
4
1/3/22
Tokenize paragraphs
And here is the nltk answer: Use the predefined BlanklineTokenizer, which uses a blank line as the
unread,
Tokenize paragraphs
And here is the nltk answer: Use the predefined BlanklineTokenizer, which uses a blank line as the
1/3/22
Catarina Runa
, …
Bryan Finlayson
3
11/25/21
Where to start?
I would suggest checking out the NLTK Book, found at https://www.nltk.org/book/. Or marry a linguist!
unread,
Where to start?
I would suggest checking out the NLTK Book, found at https://www.nltk.org/book/. Or marry a linguist!
11/25/21
poornima shetty
11/19/21
dataset..
could you please help in downloading tweets from twitter?!!!!!!!!!
unread,
dataset..
could you please help in downloading tweets from twitter?!!!!!!!!!
11/19/21
Tanish Gupta
11/5/21
NLTK not working in Buildozer Android KivyMD app
I am making a basic app with KivyMD, which uses NLTK. The program works well in Pycharm IDE, but
unread,
NLTK not working in Buildozer Android KivyMD app
I am making a basic app with KivyMD, which uses NLTK. The program works well in Pycharm IDE, but
11/5/21
Philip Gillißen
10/27/21
Add German abbreviation with a blank ("z. B.") to PunktSentenceTokenizer
Dear all, I'm currently trying to optimize the ntlk PunktSentenceTokenizer in my project. I'm
unread,
Add German abbreviation with a blank ("z. B.") to PunktSentenceTokenizer
Dear all, I'm currently trying to optimize the ntlk PunktSentenceTokenizer in my project. I'm
10/27/21
Sree Ganesh
10/15/21
NLTK in Indian Languages
Dear Members, Good morning. With you happy Vijayadasami. Are there any one who isimplementing NLTK
unread,
NLTK in Indian Languages
Dear Members, Good morning. With you happy Vijayadasami. Are there any one who isimplementing NLTK
10/15/21
Brian Ó Broin
9/17/21
HTML Stripper not working
This question is based on chapter 4 of the NLTK book online: "Writing Structured Programs",
unread,
HTML Stripper not working
This question is based on chapter 4 of the NLTK book online: "Writing Structured Programs",
9/17/21
Brian Ó Broin
,
James Yang
2
9/15/21
NLTK Book Exercises - Sun, Moon, Stars
I was wondering the same. But after doing a few, I'm pretty sure that it's exactly what you
unread,
NLTK Book Exercises - Sun, Moon, Stars
I was wondering the same. But after doing a few, I'm pretty sure that it's exactly what you
9/15/21
Julius Hamilton
9/13/21
Identify natural language fragment
Hey, I'd like some sort of script that can recognize fairly accurately if some line of text is a
unread,
Identify natural language fragment
Hey, I'd like some sort of script that can recognize fairly accurately if some line of text is a
9/13/21
Mahesh Abnave
2
9/5/21
Semcor gives different output than is shown in the docs
I knew that semcor uses wordnet senses to tag to subset of brown corpus. But I was not aware that
unread,
Semcor gives different output than is shown in the docs
I knew that semcor uses wordnet senses to tag to subset of brown corpus. But I was not aware that
9/5/21
Rowan Jacobs
, …
Manuel Souto Pico
4
8/25/21
Identifying French text
Hi Rowan, From what I can read diagonally, I think you already have a very good answer from Jordi and
unread,
Identifying French text
Hi Rowan, From what I can read diagonally, I think you already have a very good answer from Jordi and
8/25/21
P Resnik
8/24/21
Using agreement metrics with missing data?
The documentation in the Agreement Metrics package indicates that for computation of inter-annotator
unread,
Using agreement metrics with missing data?
The documentation in the Agreement Metrics package indicates that for computation of inter-annotator
8/24/21
Steve
8/24/21
Looking for a NLP specialist(Remote)
Hi All, We are looking for a NLP specialist(Remote, flexible hours) with a proven NLP background and
unread,
Looking for a NLP specialist(Remote)
Hi All, We are looking for a NLP specialist(Remote, flexible hours) with a proven NLP background and
8/24/21
mary...@gmail.com
7/30/21
Looking for an NLP specialist to apply summarisation techniques
Hi everyone, In the EU agency where I work, we are looking for a specialist with a proven NLP
unread,
Looking for an NLP specialist to apply summarisation techniques
Hi everyone, In the EU agency where I work, we are looking for a specialist with a proven NLP
7/30/21
sara ali
,
Alexis
2
7/30/21
help with Arabic language tokenizer
This should be handled well by Python itself, you do not need a tokenizer for letters. 1. Make sure
unread,
help with Arabic language tokenizer
This should be handled well by Python itself, you do not need a tokenizer for letters. 1. Make sure
7/30/21
Julius Hamilton
, …
Jordi Carrera
4
7/27/21
Concordance
Hey Julius, I'm attaching some code to do what you describe, take a look and see if this is what
unread,
Concordance
Hey Julius, I'm attaching some code to do what you describe, take a look and see if this is what
7/27/21