That was actually my same question. I've been looking for the past
hour and I just found these:
http://news.ycombinator.com/item?id=1781013
http://amix.dk/blog/post/19574
So you need to use the votes, gravity weight and age of the article to
get the basic ranking.
Looking further:
There are a bunch of variables involved that pg didn't provide defs
for in his post. But, thank god for open source, you can get the
algorithms by downloading arc. Here's what I've figured out so far:
Numeric Variables you need to get the frontpage algo for a post:
Votes, search given
Age, search given
Gravity weight, constant given
Contro-factor, calculated
Light-weight factor, calculated
Front threshold, given
True/false variables
Whether it has a url, search given
Whether it is controversial, not sure yet
Whether it is lightweight, not sure yet
You won't be able to clone hn exactly, since pg has more info than you
do. Plus, news.yc shows he uses some random functions to make sure
that the front page keeps flowing. But you can get close enough that
most people wouldn't be able to tell the difference if they didn't
know what was going on.
Alternative: you could just pull news.ycombinator and break it into
pieces that you wanted. I thought about doing a HN Sans where it
filters out things from the front page currently, but that's been done
and it doesn't use the api much at all.
Good luck!
-Zack