wget vs lynx


Anas Husseini

Feb 17, 2008, 4:00:11 AM
Hi,
 
wget is receiving a 403 (Forbidden) error when trying to retrieve a particular web page,
while lynx (and other text and graphical web browsers) retrieve the page successfully.
 
What may be the cause?
 
P.S. wget is behaving properly with other websites.

--
To move or to be moved
 

Carlos Rivera Cordero

Feb 17, 2008, 4:00:12 AM
On 17/02/08 10:48, Anas Husseini wrote:
> wget is receiving a 403 error (Forbidden) when trying to retrieve a special
> web page, while lynx (and other text and graphical web browsers) succeeded
> to get this page.

If you give the URL, it should be easy for anyone to check and figure out what's going on.

cheers.

--
You can find my public key at:
Public key: http://keyserv.nic-se.se:11371/pks/lookup?op=get&search=0x70E0573FC7D69ABE
···································································
Please do NOT use closed file formats such as DOC and XLS to exchange
documents; prefer HTML, RTF, TXT, CSV, ODT or any other format that
does not require a proprietary program to open it.


Carlos Rivera Cordero

Feb 17, 2008, 4:30:11 AM
On 17/02/08 10:48, Anas Husseini wrote:
> wget is receiving a 403 error (Forbidden) when trying to retrieve a special
> web page, while lynx (and other text and graphical web browsers) succeeded to
> get this page.
>
> What may be the cause?

The server does not like wget's default user agent; the request works if you tell
wget to send a different one. For example, with the Iceweasel user agent it works:

wget --user-agent="Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.8.1.12) Gecko/20080129 Iceweasel/2.0.0.12 (Debian-2.0.0.12-0etch1)" http://www.unix.org.ua/

cheers!


Chris Bannister

Feb 17, 2008, 10:50:09 PM

Thanks for that! Those 403 errors are a pain, especially when you are
using wget like:

wget -a logfile -i file_with_urls

and you don't notice till you are offline. :-(
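
One way to catch these before going offline is to scan the log once the batch has finished. A minimal sketch, assuming the log was written with "-a logfile" as above and that this wget version reports refusals on lines containing "ERROR 403"; count_403 is a hypothetical helper name, not part of wget:

```shell
#!/bin/sh
# count_403 LOGFILE: hypothetical helper, prints how many lines of a wget
# log contain "ERROR 403".
# Assumption: wget logs a refusal as a line like "ERROR 403: Forbidden.".
count_403() {
    # grep -c prints the match count but exits non-zero when the count is 0,
    # so "|| true" keeps the function from reporting failure in that case.
    grep -c 'ERROR 403' "$1" || true
}
```

Running "count_403 logfile" right after the download prints the number of refused requests, so a non-zero result is a cue to retry those URLs with a different user agent.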

--
Chris.
======

Michelle Konzack

Feb 19, 2008, 9:00:18 AM
On 2008-02-18 16:45:12, Chris Bannister wrote:
> On Sun, Feb 17, 2008 at 10:18:25AM +0100, Carlos Rivera Cordero wrote:
> > wget --user-agent="Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.8.1.12) Gecko/20080129 Iceweasel/2.0.0.12 (Debian-2.0.0.12-0etch1)" http://www.unix.org.ua/
>
> Thanks for that! Those 403 errors are a pain, especially when you are
> using wget like:
>
> wget -a logfile -i file_with_urls
>
> and you don't notice till you are offline. :-(

Note: You can put the user-agent string directly into your ".wgetrc" as

user-agent = "Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.8.1.12) Gecko/20080129 Iceweasel/2.0.0.12 (Debian-2.0.0.12-0etch1)"

which works wonders...
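
For anyone who prefers to script this, here is a minimal sketch of adding the setting only when it is missing. Assumptions: the wget manual lists this setting as "user_agent", which is the spelling used here; the value is written without surrounding quotes, on the unverified assumption that wget might otherwise send the quotes as part of the header; set_wget_user_agent is a hypothetical helper name.

```shell
#!/bin/sh
# set_wget_user_agent FILE STRING: hypothetical helper that appends a
# user_agent line to a wgetrc-style FILE unless one is already present.
# Assumption: the value is written unquoted (see the note above the block).
set_wget_user_agent() {
    rcfile=$1
    ua=$2
    # Leave the file alone if it already has a user_agent / user-agent line.
    if [ -f "$rcfile" ] &&
       grep -Eiq '^[[:space:]]*user[_-]?agent[[:space:]]*=' "$rcfile"; then
        echo "user agent already set in $rcfile" >&2
        return 0
    fi
    printf 'user_agent = %s\n' "$ua" >> "$rcfile"
}
```

It could be called as: set_wget_user_agent "$HOME/.wgetrc" 'Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.8.1.12) Gecko/20080129 Iceweasel/2.0.0.12 (Debian-2.0.0.12-0etch1)'. Running wget with -d afterwards should show the request headers actually sent, which is an easy way to confirm the result.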

Thanks, greetings, and have a nice day
Michelle Konzack


--
Linux-User #280138 with the Linux Counter, http://counter.li.org/
##################### Debian GNU/Linux Consultant #####################
Michelle Konzack Apt. 917 ICQ #328449886
+49/177/9351947 50, rue de Soultz MSN LinuxMichi
+33/6/61925193 67100 Strasbourg/France IRC #Debian (irc.icq.com)


Ron Johnson

Feb 19, 2008, 10:00:44 AM
On 02/19/08 05:10, Michelle Konzack wrote:
> On 2008-02-18 16:45:12, Chris Bannister wrote:
>> On Sun, Feb 17, 2008 at 10:18:25AM +0100, Carlos Rivera Cordero wrote:
>>> wget --user-agent="Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.8.1.12) Gecko/20080129 Iceweasel/2.0.0.12 (Debian-2.0.0.12-0etch1)" http://www.unix.org.ua/
>> Thanks for that! Those 403 errors are a pain, especially when you are
>> using wget like:
>>
>> wget -a logfile -i file_with_urls
>>
>> and you don't notice till you are offline. :-(
>
> Note: You can write the "USER_AGENT" string directly into your ".wgetrc" as
>
> user-agent = "Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.8.1.12) Gecko/20080129 Iceweasel/2.0.0.12 (Debian-2.0.0.12-0etch1)"
>
> which do wonders...

Thanks! That's better than a huge alias!

--
Ron Johnson, Jr.
Jefferson LA USA

PETA - People Eating Tasty Animals

Eduardo M KALINOWSKI

Feb 19, 2008, 11:50:16 AM
On Feb 19, 2008 11:53 AM, Ron Johnson <ron.l....@cox.net> wrote:
> On 02/19/08 05:10, Michelle Konzack wrote:
> > Note: You can write the "USER_AGENT" string directly into your ".wgetrc" as
> >
> > user-agent = "Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.8.1.12) Gecko/20080129 Iceweasel/2.0.0.12 (Debian-2.0.0.12-0etch1)"
> >
> > which do wonders...
>
> Thanks! That's better than a huge alias!

For the record, some sites (all the ones I've had problems with, at
least) seem only to check for the presence of "Mozilla"; setting the
user-agent string to simply "Mozilla" fools them. But, especially in
the case of setting it in the .wgetrc file, it is much more convenient
to set it to a full string like that.
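
A quick way to find out which kind of site you are dealing with is to try the short string first and fall back to the full one. A minimal sketch, assuming wget exits non-zero when the download fails (for example on a 403); fetch_with_ua_fallback is a hypothetical helper name, and the full string is simply the Iceweasel one quoted earlier in the thread:

```shell
#!/bin/sh
# fetch_with_ua_fallback URL: hypothetical helper that tries the short
# "Mozilla" user agent first and a full browser string second.
fetch_with_ua_fallback() {
    url=$1
    full='Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.8.1.12) Gecko/20080129 Iceweasel/2.0.0.12 (Debian-2.0.0.12-0etch1)'
    for ua in 'Mozilla' "$full"; do
        # Assumption: a refused or failed download makes wget exit non-zero.
        if wget -q --user-agent="$ua" "$url"; then
            echo "fetched with user agent: $ua"
            return 0
        fi
    done
    echo "download failed with every user agent tried for $url" >&2
    return 1
}
```

For example, "fetch_with_ua_fallback http://www.unix.org.ua/" reports which of the two strings the server accepted, if either.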
