Saving the Clojure.org webiste

2 views
Skip to first unread message

Kei Suzuki

unread,
May 19, 2009, 3:49:02 PM5/19/09
to Clojure
I wanted to save the Clojure.org website so that I can read it when
I'm off-line. The problem is that none of the website downloader tools
I found is satisfactory; the pages don't look right and links are
broken (I think I know now why they don't work by looking into the
html and css files of the website). So I wrote a downloader in
Clojure. It's a bit slow and inefficient (but I don't care). Besides
it depends on the way the website is written and organized. But it
does what I want, so I'm happy...until the website changes radically.

I'll upload the code to the Clojure Google Groups file area. The file
name is save_clojure.org.tar.bz2. Hope you find it useful too.

Andrew Wagner

unread,
May 19, 2009, 3:53:28 PM5/19/09
to clo...@googlegroups.com
Can I get it in a bedtime-story format too? :)

Trevor Caira

unread,
May 19, 2009, 4:49:18 PM5/19/09
to Clojure
I haven't personally tried it on clojure.org, but wget -m tends to
work well for this kind of task.

Trevor

Rohan Nicholls

unread,
May 19, 2009, 5:07:16 PM5/19/09
to clo...@googlegroups.com
I tried that and various other combinations of wget, but no luck.

Vagif Verdi

unread,
May 19, 2009, 8:12:26 PM5/19/09
to Clojure
i saved it with wget and then fixed the files with sed to point to
right resources and urls.

Emeka

unread,
May 20, 2009, 8:36:44 AM5/20/09
to clo...@googlegroups.com
What about the zip version of  save_clojure.org.tar.bz2?

Emeka


Kei Suzuki

unread,
May 20, 2009, 3:05:44 PM5/20/09
to Clojure
I should have uploaded the file in the .zip format for ease of
extraction. Since I don't know how to replace it with a .zip version
and I don't want to clutter the file area, I don't upload the zip
version. Mac and Linux users should have no problem of extracting the
files, and there should be bunch of free .tar.bz2 extraction tools
available for Windows. Sorry.

Michael Wood

unread,
May 21, 2009, 3:22:11 AM5/21/09
to clo...@googlegroups.com

Windows users can use 7-zip:
http://www.7-zip.org/

--
Michael Wood <esio...@gmail.com>

Tom Faulhaber

unread,
May 21, 2009, 2:47:20 PM5/21/09
to Clojure
Here's the magic incantation for using wget to pull a useful copy (no
postprocessing required!):

wget -krmnp -E -X/page,/message --no-check-certificate -P <target>
https://clojure.org

replace target with the directory where you want the output and you're
off to the races.

Thanks to Kresimir Sojat for working this out.
Reply all
Reply to author
Forward
0 new messages