Issue 16 in judou: setup.py is broken

ju...@googlecode.com

Apr 23, 2012, 8:06:54 AM
to ju...@googlegroups.com
Status: New
Owner: ----
Labels: Type-Defect Priority-Medium

New issue 16 by lyxint: setup.py is broken
http://code.google.com/p/judou/issues/detail?id=16

setup.py is broken

junyi sun

Sep 18, 2012, 6:05:28 AM
to ju...@googlegroups.com
Hi, All

I recently put together a new pure-Python word segmentation lib: snailseg. URL: https://github.com/fxsjy/snailseg



snailseg

Chinese Word Segmentation Library in Python: a simple library for segmenting Chinese text into words

Usage

  • Place the snailseg directory in the current directory or in the site-packages directory
  • import snailseg

Code example

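import snailseg

# Segment the sentence and print one word per line.
# print(w) with parentheses runs on both Python 2 and Python 3.
words = snailseg.cut("南京市长江大桥")
for w in words:
    print(w)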

Algorithm

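  • The algorithm estimates how likely each single character is to appear at each position within a word, then picks the most probable segmentation
  • The algorithm is simple: only 100 lines of pure Python code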
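
To make the idea above concrete, here is a minimal sketch of one common way to segment by character-position probabilities. It is not snailseg's actual code: the B/M/E/S tagging scheme (begin, middle, end, single), the add-one smoothing, the Viterbi search, the tiny `TOY_WORDS` table, and the function names `train` and `cut_sketch` are all assumptions made for illustration.

```python
import math
from collections import defaultdict

# Hypothetical toy word counts, standing in for a real dictionary or corpus.
TOY_WORDS = {"南京": 5, "南京市": 4, "市长": 3, "长江": 5,
             "大桥": 4, "长江大桥": 3, "市": 2, "江": 1}

# Which position tag may follow which (B=begin, M=middle, E=end, S=single).
ALLOWED = {"B": "ME", "M": "ME", "E": "BS", "S": "BS"}
# Characters never seen in training get a uniform distribution over tags.
UNKNOWN = {t: math.log(0.25) for t in "BMES"}


def position_tags(word):
    """Return the position tag of every character in a word."""
    if len(word) == 1:
        return ["S"]
    return ["B"] + ["M"] * (len(word) - 2) + ["E"]


def train(word_counts):
    """Estimate log P(tag | character) with add-one smoothing."""
    counts = defaultdict(lambda: defaultdict(float))
    for word, n in word_counts.items():
        for ch, tag in zip(word, position_tags(word)):
            counts[ch][tag] += n
    model = {}
    for ch, tag_counts in counts.items():
        total = sum(tag_counts.values()) + 4  # 4 tags, add-one smoothing
        model[ch] = {t: math.log((tag_counts.get(t, 0) + 1) / total)
                     for t in "BMES"}
    return model


def cut_sketch(sentence, model):
    """Pick the highest-scoring valid tag sequence and split on E/S tags."""
    if not sentence:
        return []
    # best[i][tag] = (score of best path ending in tag at i, previous tag)
    emit = model.get(sentence[0], UNKNOWN)
    best = [{t: (emit[t], None) for t in "BS"}]  # a word must start with B or S
    for i in range(1, len(sentence)):
        emit = model.get(sentence[i], UNKNOWN)
        row = {}
        for prev, (score, _) in best[i - 1].items():
            for t in ALLOWED[prev]:
                cand = score + emit[t]
                if t not in row or cand > row[t][0]:
                    row[t] = (cand, prev)
        best.append(row)
    # The sentence must end on a word boundary, so the last tag is E or S.
    last = max((t for t in "ES" if t in best[-1]),
               key=lambda t: best[-1][t][0])
    tags = [last]
    for i in range(len(sentence) - 1, 0, -1):
        tags.append(best[i][tags[-1]][1])
    tags.reverse()
    words, start = [], 0
    for i, t in enumerate(tags):
        if t in "ES":  # a word ends at this character
            words.append(sentence[start:i + 1])
            start = i + 1
    return words


if __name__ == "__main__":
    model = train(TOY_WORDS)
    for w in cut_sketch("南京市长江大桥", model):
        print(w)
```

With such a small training table the result depends heavily on the toy counts; a real library would estimate the probabilities from a large dictionary or corpus.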

Performance

  • 700 KB/second
  • Test Env: Intel(R) Core(TM) i7-2600 CPU @ 3.4GHz; input text: 《围城》.txt (the novel Fortress Besieged)
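
For anyone who wants to reproduce a KB/second figure like the one above on their own machine, a rough sketch follows. How the original number was measured is not stated in the post, so everything here is an assumption: the helper name `throughput_kb_per_s` is hypothetical, size is counted in UTF-8 bytes, and the best of a few runs is reported.

```python
import time


def throughput_kb_per_s(segment, text, repeats=3):
    """Best-of-N throughput of segment(text), in KB of UTF-8 input per second.

    `segment` is any callable that takes a string and returns an iterable
    of words (for example snailseg.cut, if the library is installed).
    """
    size_kb = len(text.encode("utf-8")) / 1024.0
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        for _ in segment(text):  # consume the output in case it is lazy
            pass
        best = min(best, time.perf_counter() - start)
    # Guard against a zero elapsed time on very small inputs.
    return size_kb / max(best, 1e-9)


if __name__ == "__main__":
    # Stand-in segmenter (splits into single characters) so the sketch runs
    # without snailseg; swap in the real cut function and a real text file.
    sample = "南京市长江大桥" * 10000
    print("%.0f KB/second" % throughput_kb_per_s(list, sample))
```

Numbers obtained this way are only comparable when the input text and hardware are the same.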



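--
================================================
Judou: an open Chinese word segmentation project
================================================

Main links
==========

* Judou home page: http://judou.org

Usage
=====
* For discussion, send email to ju...@googlegroups.com
* For more, visit http://groups.google.com/group/judou

* To unsubscribe, send email to judou+unsubscribe@googlegroups.com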

twinsant

Sep 18, 2012, 10:25:39 PM
to ju...@googlegroups.com
Added to the wiki:
http://trac.judou.org/trac.judou.org/wiki

Thanks.

2012/9/18 junyi sun <ccn...@gmail.com>
