Account Options

  1. Sign in
The old Google Groups will be going away soon, but your browser is incompatible with the new version.
Google Groups Home
« Groups Home
Message from discussion HTML-parser/ content extractor purposals

Received: by 10.58.69.11 with SMTP id a11mr2010489veu.30.1348814880220;
        Thu, 27 Sep 2012 23:48:00 -0700 (PDT)
X-BeenThere: nodejs@googlegroups.com
Received: by 10.220.224.8 with SMTP id im8ls2546830vcb.4.gmail; Thu, 27 Sep
 2012 23:47:50 -0700 (PDT)
Received: by 10.52.93.132 with SMTP id cu4mr106942vdb.14.1348814869993;
        Thu, 27 Sep 2012 23:47:49 -0700 (PDT)
Date: Thu, 27 Sep 2012 23:47:49 -0700 (PDT)
From: greelgorke <greelgo...@gmail.com>
To: nodejs@googlegroups.com
Message-Id: <2ae9f765-fd32-4626-aaa4-d9afe4abbe61@googlegroups.com>
In-Reply-To: <CAPJ5V2bo81U3eLFZxo_AiJ+4EMJRZx9cbh4+Ro=hAbhkF7SYpA@mail.gmail.com>
References: <381ead56-4a8e-42ec-ad8d-5ab335017f64@googlegroups.com>
 <CAPJ5V2bo81U3eLFZxo_AiJ+4EMJRZx9cbh4+Ro=hAbhkF7SYpA@mail.gmail.com>
Subject: Re: [nodejs] HTML-parser/ content extractor purposals
MIME-Version: 1.0
Content-Type: multipart/mixed; 
	boundary="----=_Part_387_19975494.1348814869403"

------=_Part_387_19975494.1348814869403
Content-Type: multipart/alternative; 
	boundary="----=_Part_388_16048784.1348814869404"

------=_Part_388_16048784.1348814869404
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

i just found this ones:
ifrins/articlefinder =C2=B7 GitHub <https://github.com/ifrins/articlefinder=
>
Network Graph =C2=B7 saturngod/node-readability<https://github.com/saturngo=
d/node-readability/network>

thanks for your suggestions

Am Donnerstag, 27. September 2012 17:13:08 UTC+2 schrieb Matt Sergeant:
>
> http://libots.sourceforge.net/
>
> You probably need something like https://github.com/mikeal/request and=20
> the command line html2text to convert the HTML to plain text first.
>
> Matt.
>
> On Thu, Sep 27, 2012 at 5:36 AM, greelgorke <greel...@gmail.com<javascrip=
t:>
> > wrote:
>
>> Hi folks,
>>
>> is there any lib out there, that can made abstracts from a page like i.E=
.=20
>> Google Reader?
>>
>> any suggestions?
>>
>> cheers
>>
>> Gregor
>>
>> --=20
>> Job Board: http://jobs.nodejs.org/
>> Posting guidelines:=20
>> https://github.com/joyent/node/wiki/Mailing-List-Posting-Guidelines
>> You received this message because you are subscribed to the Google
>> Groups "nodejs" group.
>> To post to this group, send email to nod...@googlegroups.com<javascript:=
>
>> To unsubscribe from this group, send email to
>> nodejs+un...@googlegroups.com <javascript:>
>> For more options, visit this group at
>> http://groups.google.com/group/nodejs?hl=3Den?hl=3Den
>>
>
>
------=_Part_388_16048784.1348814869404
Content-Type: text/html; charset=utf-8
Content-Transfer-Encoding: quoted-printable

i just found this ones:<div><a href=3D"https://github.com/ifrins/articlefin=
der">ifrins/articlefinder =C2=B7 GitHub</a><br></div><div><a href=3D"https:=
//github.com/saturngod/node-readability/network">Network Graph =C2=B7 satur=
ngod/node-readability</a></div><div><br></div><div>thanks for your suggesti=
ons<br><br>Am Donnerstag, 27. September 2012 17:13:08 UTC+2 schrieb Matt Se=
rgeant:<blockquote class=3D"gmail_quote" style=3D"margin: 0;margin-left: 0.=
8ex;border-left: 1px #ccc solid;padding-left: 1ex;"><a href=3D"http://libot=
s.sourceforge.net/" target=3D"_blank">http://libots.sourceforge.net/</a><di=
v><br></div><div>You probably need something like&nbsp;<a href=3D"https://g=
ithub.com/mikeal/request" target=3D"_blank">https://github.com/<wbr>mikeal/=
request</a> and the command line&nbsp;html2text to convert the HTML to plai=
n text first.</div>
<div><br></div><div>Matt.<br><br><div class=3D"gmail_quote">On Thu, Sep 27,=
 2012 at 5:36 AM, greelgorke <span dir=3D"ltr">&lt;<a href=3D"javascript:" =
target=3D"_blank" gdf-obfuscated-mailto=3D"oei6kPMIrKcJ">greel...@gmail.com=
</a>&gt;</span> wrote:<br>
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex">Hi folks,<div><br></div><div>is there any li=
b out there, that can made abstracts from a page like i.E. Google Reader?</=
div>
<div><br></div><div>any suggestions?</div><div><br></div><div>cheers</div><=
div><br></div><div>Gregor</div><span><font color=3D"#888888">

<p></p>

-- <br>
Job Board: <a href=3D"http://jobs.nodejs.org/" target=3D"_blank">http://job=
s.nodejs.org/</a><br>
Posting guidelines: <a href=3D"https://github.com/joyent/node/wiki/Mailing-=
List-Posting-Guidelines" target=3D"_blank">https://github.com/joyent/<wbr>n=
ode/wiki/Mailing-List-<wbr>Posting-Guidelines</a><br>
You received this message because you are subscribed to the Google<br>
Groups "nodejs" group.<br>
To post to this group, send email to <a href=3D"javascript:" target=3D"_bla=
nk" gdf-obfuscated-mailto=3D"oei6kPMIrKcJ">nod...@googlegroups.com</a><br>
To unsubscribe from this group, send email to<br>
<a href=3D"javascript:" target=3D"_blank" gdf-obfuscated-mailto=3D"oei6kPMI=
rKcJ">nodejs+un...@<wbr>googlegroups.com</a><br>
For more options, visit this group at<br>
<a href=3D"http://groups.google.com/group/nodejs?hl=3Den?hl=3Den" target=3D=
"_blank">http://groups.google.com/<wbr>group/nodejs?hl=3Den?hl=3Den</a><br>
</font></span></blockquote></div><br></div>
</blockquote></div>
------=_Part_388_16048784.1348814869404--

------=_Part_387_19975494.1348814869403--