Request - Add site "www.siye.co.uk" to available sites.

34 views
Skip to first unread message

widget

unread,
Dec 28, 2011, 8:35:04 PM12/28/11
to Fanfiction Downloader
What is the possibility that the site "Sink Into Your Eyes" (at
http://www.siye.co.uk/siye/) could get added to the list of parse-able
sites. Using a story URL pulls up either the story (in the case of
single chapter one-shots) or a table of contents (in the case of multi-
chapter-ed stories). It also appear that the story contents are
contained within a set of <table></table> HTML tags.

http://www.siye.co.uk/siye/viewstory.php?sid=127268 (complete story
or multi-chapter table of contents)
http://www.siye.co.uk/siye/viewstory.php?sid=127268&chapter=1
(chapter of multi-chapter)

Cordelia Hunter

unread,
Dec 31, 2011, 8:32:42 AM12/31/11
to Fanfiction Downloader
In case HTML format is what you're looking for, all chapters of a
story on SIYE can be downloaded as a single HTML file by using the
"Print Story" link.

http://www.siye.co.uk/siye/viewstory.php?action=printable&textsize=0&sid=127268&chapter=all

Hope this helps,
Cordelia

dain bramage

unread,
Dec 31, 2011, 12:16:07 PM12/31/11
to fanfic-d...@googlegroups.com
I have been using the "print chapter/story" feature of the site in
conjunction with the site http://www.2epub.com/ to create epub files
(which in my opinion read better on my ebook reader of choice that html
files). Given the seemingly consistent structure of the stories
themselves and the URLs I thought that it could be possible to cut the
middle man, as it were, and have this utility parse the story directly.

Jim Miller

unread,
Dec 31, 2011, 10:59:45 PM12/31/11
to fanfic-d...@googlegroups.com

www.siye.co.uk, while certainly not the worst we've tried to support, is
also not easiest. Parsing out the metadata and chapter text relies on a
lot of assumptions and heuristics.

Still, I've implemented it. Give a try on the web server first and make
sure it works for you.

http://4-2-0.fanfictiondownloader.appspot.com/

I'll release new CLI and plugin zips and switch the default web version
after we've confirmed it works for more than just me.

Jim

--
Jim Miller
Retie...@gmail.com

Majid HUSSAIN

unread,
Jan 2, 2012, 7:23:03 AM1/2/12
to Fanfiction Downloader
hello
I am unable to download this story from siye.co.uk
http://www.siye.co.uk/siye/viewstory.php?sid=127442
as far as i no, this is the only story that has not downloaded
proppley
when I attempted to get the story, it just gives me the chapter titles
and not the story afterwords
thanks for reading and thank you for adding this website.
Majid Hussain
> RetiefJ...@gmail.com

Jim Miller

unread,
Jan 2, 2012, 11:48:58 AM1/2/12
to fanfic-d...@googlegroups.com

Majid,

That story was a formatted a little different, so it needed a tweak to
handle it. Thanks for pointing that out.

(<p> tags inside <span> tags parse differently than <br/> inside <span>
in BeautifulSoup vs BeautifulStoneSoup.)

Anybody else using www.siye.co.uk downloads with success or failure?

Jim


--
Jim Miller
Retie...@gmail.com

Dain Bramage

unread,
Jan 2, 2012, 12:15:57 PM1/2/12
to fanfic-d...@googlegroups.com
I have downloaded a couple of stories from www.siye.co.uk and the only problem I have ran into I can attribute to the epub reader i am using on my tablet. I plan on going through my favorites shortly to get a broader selection to test with.

Just out of curiosty, would the "printable" versions of the story been easier to parse?

Jim Miller

unread,
Jan 2, 2012, 12:17:10 PM1/2/12
to fanfic-d...@googlegroups.com
On 1/2/2012 11:15 AM, Dain Bramage wrote:
> I have downloaded a couple of stories fromwww.siye.co.uk and the only problem I have ran into I can attribute to the epub reader i am using on my tablet. I plan on going through my favorites shortly to get a broader selection to test with.

Great, let me know if you see any problems.

> Just out of curiosty, would the "printable" versions of the story been easier to parse?

Not really. The sites that are easiest to parse have a <div
id='storybody'> or something like that. The printable page on SIYE is
pretty much the same once you get down to the chapter text.

Reply all
Reply to author
Forward
0 new messages