Groups
Groups
Sign in
Groups
Groups
jiebaR 中文分词
Conversations
Labels
About
Send feedback
Help
关于TF-IDF的问题
25 views
Skip to first unread message
Hope
unread,
Jan 1, 2019, 8:51:18 AM
1/1/19
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to jiebaR
敬爱的jiebaR作者:
您好!最近在研究如何进行关键词进行提取,想要使用jiebaR作为主要解决方案。我想要问一下,分词结束之后,能直接用提取关键词的函数来提取么?
我总觉得不对劲,是不是包里面自带了以前在《人民日报》中训练得到的idf值,然后作为参考来求TF-IDF,不然为什么一句话也能求关键词。如果我们个人使用,是不是应该根据自己的文档来求TF-IDF,那么这个时候jiebaR是否还有直接求得tf-idf的函数?还是说我们需要自己在R中另外求得?
谢谢!
黄天元
复旦大学
runner alice
unread,
Apr 29, 2022, 3:25:22 AM
4/29/22
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to jiebaR 中文分词
我也遇到了相似的问题,我猜想可能是需要先get_idf()获得自己文本的idf值,并存储为txt文件,在提取关键词时,设置worker()的idf路径,然后提取关键词。不知道是否是这样?欢迎交流
Reply all
Reply to author
Forward
0 new messages