HTML-parser/ content extractor purposals

98 views
Skip to first unread message

greelgorke

unread,
Sep 27, 2012, 5:36:56 AM9/27/12
to nod...@googlegroups.com
Hi folks,

is there any lib out there, that can made abstracts from a page like i.E. Google Reader?

any suggestions?

cheers

Gregor

Stanley Stuart

unread,
Sep 27, 2012, 11:00:29 AM9/27/12
to nod...@googlegroups.com
Mikeal's request module + jQuery would probably work really well.

Matt

unread,
Sep 27, 2012, 11:12:57 AM9/27/12
to nod...@googlegroups.com
http://libots.sourceforge.net/

You probably need something like https://github.com/mikeal/request and the command line html2text to convert the HTML to plain text first.

Matt.

--
Job Board: http://jobs.nodejs.org/
Posting guidelines: https://github.com/joyent/node/wiki/Mailing-List-Posting-Guidelines
You received this message because you are subscribed to the Google
Groups "nodejs" group.
To post to this group, send email to nod...@googlegroups.com
To unsubscribe from this group, send email to
nodejs+un...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/nodejs?hl=en?hl=en

greelgorke

unread,
Sep 28, 2012, 2:47:49 AM9/28/12
to nod...@googlegroups.com
i just found this ones:

thanks for your suggestions

alFReD NSH

unread,
Sep 28, 2012, 6:16:32 AM9/28/12
to nod...@googlegroups.com
You can have a look At cheerio: https://github.com/MatthewMueller/cheerio

alFReD NSH

unread,
Sep 28, 2012, 6:16:32 AM9/28/12
to nod...@googlegroups.com
Reply all
Reply to author
Forward
0 new messages