Deadline nearing,

Dent Earl

Mar 2, 2012, 2:51:14 PM
to Alignathon
Hi all,

I just wanted to remind everyone that the deadline is approaching:
March 9th is one week from today. I haven't heard of any problems so I
assume that everyone's work is progressing nicely. Please contact me
directly if your group needs more time. I'll send an update next week
with details on how to submit and the issues you should address in
your submission write up.

Best regards,

d

Minmei Hou

Mar 5, 2012, 2:09:03 PM
to align...@googlegroups.com
Dent,

I'm preparing TBA and revised multiz alignments. To build the
multiple alignments, I first need to create pairwise alignments. I'm
using lastz from Penn State. A few of the fly pairs run for a really
long time: specifically, droVir3 vs. droWil1, droBip vs.
droAna3, dp4 vs. droPer1, and droSim1 vs. droSec1. These pairs have
run for many days. I've asked Bob Harris for lastz parameters that
would speed things up, but the new processes have also been running
for several days and I don't know when they will finish. Do you know
of any groups that use lastz and have finished running these pairs?
I may need more time.

Thanks,

Minmei


--
Minmei Hou, Ph.D.
Assistant Professor
Department of Computer Science
Northern Illinois University

Dent Earl

Mar 5, 2012, 2:25:13 PM
to align...@googlegroups.com
Hey Minmei,

I know that Cactus uses lastz, as does UCSC's Multiz pipeline; maybe those guys can chime in here with tips. I'm not sure whether team Cactus has finished the flies, but I believe team Multiz has.

I'll chat with you off-list about a time extension. For everyone: limited extensions are possible; please contact me if you're not going to make the deadline.

d

Glenn Hickey

Mar 5, 2012, 7:01:40 PM
to align...@googlegroups.com
Hi Minmei

We have noticed that there are some unmasked repeats in the flies that
have slowed down our Cactus pipeline. I can't say for sure that this
is your problem, but the self-masking option in the newest version of
lastz has helped us out considerably:

http://www.bx.psu.edu/~rsharris/lastz/newer/

http://www.bx.psu.edu/~rsharris/lastz/newer/README.lastz-1.03.02.html#adv_selfmasking

cheers
-Glenn

Minmei Hou

Mar 6, 2012, 1:53:50 AM
to align...@googlegroups.com
Thanks Glenn! I'm trying this approach to mask droVir3. I used the
same parameters as the example in the link, except that I set it to
report masking progress after every query. So each query is 200 bases.
From what I have seen, most queries take < 1s while some take
several minutes. Since there are 1878794 such fragments for droVir3,
I doubt that the process will finish in a practical amount of time on
my machine. Did you observe something similar? Could you let me know
what parameters you used to run these programs for masking? Thank you
very much.
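
To put the fragment count above in perspective, here is a rough back-of-the-envelope sketch. It assumes a flat 1 second per fragment, which is only an approximate ceiling for the fast fragments and ignores the ones that take minutes, so the real serial time would be longer. The function names and the worker count are illustrative assumptions.

```python
FRAGMENTS = 1878794              # fragment count quoted above for droVir3
SECONDS_PER_DAY = 24 * 60 * 60


def serial_days(seconds_per_fragment, fragments=FRAGMENTS):
    """Days needed to process every fragment one after another."""
    return fragments * seconds_per_fragment / SECONDS_PER_DAY


def parallel_days(seconds_per_fragment, workers, fragments=FRAGMENTS):
    """Idealized days if the fragments are spread evenly over `workers` processes."""
    return serial_days(seconds_per_fragment, fragments) / workers


# Assumed cost of 1 s per fragment (slow fragments ignored).
print(round(serial_days(1.0), 1))       # roughly three weeks on one core
print(round(parallel_days(1.0, 8), 1))  # idealized figure for 8 hypothetical workers
```

Even under this optimistic assumption a single process needs about three weeks, which is consistent with the doubt expressed above.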

--Minmei

Bob Harris

Mar 6, 2012, 11:57:58 AM
to align...@googlegroups.com
Howdy,

The self-masking process as described in the lastz README is simplified to make the description readable. I've sent a message to Minmei describing speedup details, mainly using parallelization, but with some other ideas too.

For others who may be interested in those details, I'll post them in a message to the lastz mailing list later today.
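
The speedup details themselves are not reproduced in this thread. Purely as a generic illustration of the parallelization idea, one option is to pack the sequences into batches of roughly equal total length so that each batch can be handed to a separate process. The function name and the scaffold names and lengths below are hypothetical, and no lastz options are shown or implied.

```python
import heapq


def balance_batches(seq_lengths, n_batches):
    """Greedily pack sequences into n_batches with similar total length.

    seq_lengths: dict mapping sequence name -> length in bases.
    Returns a list of lists of sequence names, one list per batch.
    """
    if n_batches < 1:
        raise ValueError("n_batches must be at least 1")
    # Min-heap of (total_bases_so_far, batch_index); the lightest batch is on top.
    heap = [(0, i) for i in range(n_batches)]
    heapq.heapify(heap)
    batches = [[] for _ in range(n_batches)]
    # Placing the longest sequences first keeps the batch totals close together.
    ordered = sorted(seq_lengths.items(), key=lambda kv: (-kv[1], kv[0]))
    for name, length in ordered:
        total, idx = heapq.heappop(heap)
        batches[idx].append(name)
        heapq.heappush(heap, (total + length, idx))
    return batches


# Hypothetical scaffold names and lengths, for illustration only.
example = {"scaffold_1": 900, "scaffold_2": 500, "scaffold_3": 400, "scaffold_4": 100}
print(balance_batches(example, 2))
```

Each resulting batch could then be written to its own file and aligned or masked independently, with the outputs concatenated afterwards.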

Bob H

Victor Solovyev

Mar 9, 2012, 7:22:28 AM
to dent...@gmail.com, align...@googlegroups.com
Dear Dent,

Please let us know how to submit our alignments, or where to read about it.

Sorry if I missed the email with the instructions.

Regards, Vict

Dent Earl

Mar 9, 2012, 12:53:40 PM
to Victor Solovyev, align...@googlegroups.com
Hi Victor,

Perfect question for today! :) I think the easiest way to do the submission will be for teams to compress their submission, place it on the web, and then email me directly (off-list) both the link and an md5 (or sha1) checksum. If anyone is unable to place an archive on the web, contact me and we'll work out an alternative (dropbox, ftp, etc). Eventually the data sets will be made public, but not immediately.

The form of the short write-up portion of the submission is still being composed; I'll ping the list and the submitters with it when it is complete.
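
For teams that want to compute the checksum in a script, a minimal sketch using Python's standard `hashlib` module might look like the following. The function name and the archive filename in the usage note are assumptions for illustration; the command-line `md5sum` or `sha1sum` tools give the same digests.

```python
import hashlib


def file_checksums(path, chunk_size=1 << 20):
    """Return (md5_hex, sha1_hex) for the file at `path`.

    The file is read in chunks so that large archives do not need to fit in memory.
    """
    md5 = hashlib.md5()
    sha1 = hashlib.sha1()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            md5.update(chunk)
            sha1.update(chunk)
    return md5.hexdigest(), sha1.hexdigest()
```

Calling `file_checksums("submission.tar.gz")` (a hypothetical filename) returns both hex digests, either of which can be pasted into the email next to the download link.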

d

Илья Минкин

Oct 7, 2013, 7:35:08 AM
to align...@googlegroups.com, Victor Solovyev
To the authors of the Multiz and Cactus submissions: how many resources did it take to compute the alignments for the flies dataset? Approximate time and memory would be enough. It looks like you used parallelization; how many threads/processes did you run?

Thank you.

On Saturday, March 10, 2012, at 1:53:40 UTC+8, Dent Earl wrote: