update db build script to new number of NCBI NRP data files
oschwengers committed Feb 10, 2023
1 parent cd752a4 commit c3e709d
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion db-scripts/buid-db.sh
@@ -179,7 +179,7 @@ rm -rf profiles ko_list.gz kofam* hmmsearch.kofam.* hmms*
printf "\n12/19: download RefSeq nonredundant proteins and clusters ...\n"
wget https://ftp.ncbi.nlm.nih.gov/genomes/CLUSTERS/PCLA_proteins.txt
wget https://ftp.ncbi.nlm.nih.gov/genomes/CLUSTERS/PCLA_clusters.txt
-for i in {1..1573}; do
+for i in {1..360}; do
wget https://ftp.ncbi.nlm.nih.gov/refseq/release/bacteria/bacteria.nonredundant_protein.${i}.protein.faa.gz
pigz -dc bacteria.nonredundant_protein.${i}.protein.faa.gz | seqtk seq -CU >> refseq-bacteria-nrp.trimmed.faa
rm bacteria.nonredundant_protein.${i}.protein.faa.gz
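The loop bound in this hunk is hard-coded, so it has to be edited whenever the number of NCBI release files changes. A minimal sketch of one way to derive it instead, assuming the release directory's index page lists the file names and that the files are numbered contiguously from 1. `count_nrp_files` is a hypothetical helper and is not part of the script:

```shell
#!/usr/bin/env bash

# Hypothetical helper (not in the original script): reads a directory
# listing on stdin and prints the highest index N found among names of the
# form bacteria.nonredundant_protein.N.protein.faa.gz.
# Prints nothing if no such name is present.
count_nrp_files() {
    grep -oE 'bacteria\.nonredundant_protein\.[0-9]+\.protein\.faa\.gz' \
        | sed -E 's/.*nonredundant_protein\.([0-9]+)\.protein.*/\1/' \
        | sort -n \
        | tail -n 1
}

# Possible usage (needs network access; assumes the server returns a listing
# that contains the file names, which is not confirmed by this commit):
#
#   n=$(wget -qO- https://ftp.ncbi.nlm.nih.gov/refseq/release/bacteria/ | count_nrp_files)
#   for i in $(seq 1 "${n:?could not determine file count}"); do
#       wget https://ftp.ncbi.nlm.nih.gov/refseq/release/bacteria/bacteria.nonredundant_protein.${i}.protein.faa.gz
#   done
```

The `${n:?...}` expansion aborts the script with a message if the count could not be determined, rather than silently downloading nothing.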
