Is there some way to accomplish with free tools?
Assuming what you want to do is legal, Perl is free. Depending on how
many "a lot" is, it may take longer to write the script than it takes to
grab each image manually.
firefox and the DownThemAll! extension.
--
Jake Jarvis
You can look at the 'wget' utility.
--
Manuel Collado - http://lml.ls.fi.upm.es/~mcollado
This is a very nice addon.
Thanks for recommending it.
Thanks for the recommendation, but this looks more
like work than is needed for my purposes.
>>> I would like to download a lot of images from a remote site.
...
>>> Is there some way to accomplish with free tools?
>>
>> You can look at the 'wget' utility.
>
> Thanks for the recommendation, but this looks more
> like work than is needed for my purposes.
Is that a joke?
wget -A jpg,jpeg,gif,png http://some.site.of.interest
is "more like work than is needed"? Geez, you must be a lazy fucker!
Of course just scanning "wget --help" probably defeats you.
It doesn't defeat me (I've just described my interests as "anything with
a reference manual") but I have to factor in a day of reading the man
page whenever I have a new task for wget.
I was going to suggest wget, It's what I would have used, after a day or
two.
--
Steve Swift
http://www.swiftys.org.uk/swifty.html
http://www.ringers.org.uk
> Is that a joke?
No, it's a concern.
> wget -A jpg,jpeg,gif,png http://some.site.of.interest
>
> is "more like work than is needed"? Geez, you must be a lazy fucker!
That depends on whether wget * will work for what you need, which
depends on remote server config. If the paths to the images are simple
and the server supports directory listing, then wget is fine. If the
image URLs aren't obvious, then it won't. In that case DownThemAll is
a better option, as it uses the starting page as the source of the
URLs, so doesn't care about directory listings. It also has quite good
abilities to traverse beyond that, also (and very usefully) good
ignore facilities so that you can download the "content" assets
without also downloading all of the layout furniture.
.
> Geez, you must be a lazy fucker!
Also slow and stupid. I've got it ALL!
> Of course just scanning "wget --help" probably defeats you.
I would have had to download and install it first.
(MORE work!)
Thanks for the sample command line.
Now that I have a shortcut into its use, I will definitely
give it a try.
Well, I find myself needing this function, but
wget -A jpg http://swiftys.org.uk
Downloaded index.html and nothing else.
Admittedly there are no *.jpg files visible on that site (via index.html
and linked documents), but I don't understand why it downloaded an
"unacceptable" file extension.
Presumably it would also need some sort of recursion specification.
| Greg Russell wrote:
| > wget -A jpg,jpeg,gif,png http://some.site.of.interest
|
| Well, I find myself needing this function, but
|
| wget -A jpg http://swiftys.org.uk
|
| Downloaded index.html and nothing else.
|
| Admittedly there are no *.jpg files visible on that site (via index.html
| and linked documents), but I don't understand why it downloaded an
| "unacceptable" file extension.
|
| Presumably it would also need some sort of recursion specification.
That would be "wget -r" then I suppose :)
<http://linuxreviews.org/man/wget/>
Try the following:
wget -q -r --accept=*.jpg 'url'
--
David Postill
Dance your Life - Biodanza in Holland - <http://www.danceyourlife.eu>
That created a directory corresponding to the url (of my website) that I
specified and downloaded just robots.txt
I suspect that wget is honouring my robots.txt - I'll find the switch to
make it ignore that.
| David Postill wrote:
| > Try the following:
| >
| > wget -q -r --accept=*.jpg 'url'
|
| That created a directory corresponding to the url (of my website) that I
| specified and downloaded just robots.txt
|
| I suspect that wget is honouring my robots.txt - I'll find the switch to
| make it ignore that.
Try adding the option:
--user-agent="Mozilla/4.0 (compatible; MSIE 5.5; Windows NT 5.0)"
Cheers,
Thanks. I'll try that if the "native" mechanism of "-e robots=off --wait
1" fails.
This is just an intellectual exercise now, so it may be a day or two
before I do anything (just in case anyone is holding their breath).
For those of you not holding your breath� Happy New Year!