how to run the dataset "medwiki.gz"

27 views
Skip to first unread message

metic...@gmail.com

unread,
Oct 6, 2013, 1:29:29 AM10/6/13
to graph...@googlegroups.com
Hello, doctor~
     I have download the dataset 'medwiki.gz' from graphlab.org
    and use 'head medwiki.gz' found:
"%%MatrixMarket matrix coordinate real general
 4364329 3333799 513883084
...."
in the first two lines..

how can i run it by using svd ? 
should i remove the first two line before i run it?
 

Danny Bickson

unread,
Oct 6, 2013, 1:35:32 AM10/6/13
to graph...@googlegroups.com
Hi, 
You can simply remove the first two lines.
You will need to give the row and col number as part of the svd command line.
Something like
./svd --matrix=medwiki --rows=4364329 --cols=3333799 --nsv=6 --nv=12 --max_iter=10

Best, 

Danny Bickson
Co-Founder
GraphLab Inc.


--
You received this message because you are subscribed to the Google Groups "GraphLab API" group.
To unsubscribe from this group and stop receiving emails from it, send an email to graphlabapi...@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

metic...@gmail.com

unread,
Oct 6, 2013, 2:39:21 AM10/6/13
to graph...@googlegroups.com
Thanks u~
actually, the dataset is too large to open it with vim smoothly...
is there any way to remove the first two lines quickly?

在 2013年10月6日星期日UTC+8下午1时35分32秒,BiXolVer写道:

Danny Bickson

unread,
Oct 6, 2013, 3:24:38 AM10/6/13
to graph...@googlegroups.com
Use the command "tail -n XX medwiki"
where XX is the number of desired lines (counted from the end)

Danny Bickson
Co-Founder
GraphLab Inc.


Reply all
Reply to author
Forward
0 new messages