Luis-Miguel Rodr�guez Rojas wrote:
> Hi Yvan,
>
> The GIs from GenBank are assigned consecutively [1], and therefore you
> can use any random numbers generator to pick a number between 2 (there
> is no gi:1) and the latest GI (currently 269784792). To update the
> latest GI, you can use the Sequence Revision History [2], and try
> increasing your GI (the "first seen at NCBI" date is particularly
> useful for this end).
>
> [1]
http://www.ncbi.nlm.nih.gov/Sitemap/sequenceIDs.html
> [2]
http://www.ncbi.nlm.nih.gov/entrez/sutils/girevhist.cgi
>
> Hope it's useful,
>
> Regards,
>
> On Fri, Nov 20, 2009 at 8:49 AM, Fred <
frederi...@gmail.com
> <mailto:
frederi...@gmail.com>> wrote:
>
> Hello Yvan,
>
> I know my response is too late 5Sorry I was in vacation) but anyway
> here is what would be my suggestion using php.
> 1) Downloading all the Uniprot ID. (Should be available at :
>
http://www.uniprot.org/downloads).
> 2) Put all the Uniprot ID on a php array called $arrUniprotId.
> 3) Use the php function array_rand (
http://php.net/manual/en/
> function.array-rand.php
> <
http://php.net/manual/en/%0Afunction.array-rand.php>) to get
> randomly an id : $myRandomId =
> array_rand ($arrUniprotId);
> 4) Then create a set od url using a loop.
> for($i = 0; $i <= 100; $i++) {
> $myRandomUrl = "
http://www.uniprot.org/uniprot/" . $myRandomId ;
> }
>
> Hope this helps,
>
> Fred
>
> On Nov 20, 9:42 am, yvan <
yvan.str...@gmail.com
> > > <mailto:
yvan.str...@gmail.com <mailto:
yvan.str...@gmail.com>>>
> > > <mailto:
group4bioi...@googlegroups.com
> > > <mailto:
group4bioinforma...@googlegroups.com
> <
http://www.bioinformatics.fr>
> > Hello,
> >
> > I am writing some tests to find weak points or exceptions for a
> > webservice and/or a web application. The application takes a
> sequence in
> > fasta format and apply some "magic" on it. In order to find
> situations
> > which could crash the services, I am sending hundreds sequences
> > completely randomly chosen to the application.
> > Brad and Peter from the biopython community gave me good pointers,
> > especially Brad who suggested to use the random entry retrieval
> URL from
> > expasy.
> >
> >
http://www.expasy.org/cgi-bin/get-random-entry.pl?S
> >
> > So now the problem is solved ;-)
> >
> > But thanks anyway for asking.
> > Cheers
> > yvan- Hide quoted text -
> >
> > - Show quoted text -
>
> --
> Related website :
www.bioinformatics.fr
> <
http://www.bioinformatics.fr>
> <
http://bioinf.uniandes.edu.co/%7Emiguel/>]
> ---------------------------------
> Unidad de Bioinform�tica del Laboratorio de Micolog�a y Fitopatolog�a
>
luis...@uniandes.edu.co <mailto:
luis...@uniandes.edu.co>
>
lmrodr...@gmail.com <mailto:
lmrodr...@gmail.com>
What I am doing is:
1) fetch a random ID from expasy via this url
http://www.expasy.org/cgi-bin/get-random-entry.pl?S
2) fetch the records from this ID
3) manipulate the records in a way they are usable , for example switch
format, here swiss-prot to fasta
Maybe it is a little tedious, two fetches, but so I am sure I get a
existing, "real" protein. The code is completely dependant from expasy
and the consistency of the url's.
But it works ;-)
Thanks everyone for your help
Cheers,
yvan