DIR Return Create A Forum - Home
---------------------------------------------------------
nCoV_info
HTML https://ncovinfo.createaforum.com
---------------------------------------------------------
*****************************************************
DIR Return to: genetics
*****************************************************
#Post#: 10--------------------------------------------------
genbank 2020/01/28
By: gsgs Date: January 29, 2020, 8:59 pm
---------------------------------------------------------
genbank release 235 of 2019/12/15 had 35 files with viral
sequences
see
HTML https://ftp.ncbi.nlm.nih.gov/genbank/gbrel.txt
this gave xxxxx genbank records with 15.1 GB in total
31686 of these had the keyword "orona" in them , making 284 MB
for the records,
120.2MB for the nucleotid sequences
one was removed, which had >300000 nucleotides
file oro2.nuc , 31685 nucleotide sequences of lengths
40-33578 2911 of these with lengths > 25000
8766 of the 31685 were filtered as having some common epitope
with the new Wuhan strain.
(my program epifox from 2013 , epifox wuhan oro2.nuc 28 15 m5 )
file oro3.nuc
better,stronger filtering gives 366 in file oro4.nuc , lengths
130-30256 , 255 of these of length > 25000
*****************************************************