Release date
Release ID
234
Data source
NCBI Refseq release 234
Number of genomes
15375
Number of proteins
715490
VOGDB group
Number of VFAM: 39776 (Virus protein families)
Number of VOG: 48605 (Virus orthologous groups)
Number of VFOLD: 33135 (Virus protein structural folds)
Base URL
VOGDB File
vog.raw_algs.alistat.txt (Statistics of multiple alignments): 3,268,283 bytes, MD5 checksum 2b1b0489580596b81dd7e7ef08bb0165
https://fileshare.csb.univie.ac.at/vog/vog234/vog.raw_algs.alistat.txt
vog.annotations.tsv.gz (Funcational annotations of groups): 370,631 bytes, MD5 checksum f565c2a22efce43e34736cacd002d502
https://fileshare.csb.univie.ac.at/vog/vog234/vog.annotations.tsv.gz
vfam.members.tsv.gz (Member protein ids of groups): 4,528,104 bytes, MD5 checksum b7713e33155ccfe728f7ba92839803b4
https://fileshare.csb.univie.ac.at/vog/vog234/vfam.members.tsv.gz
vfam.virusonly.tsv.gz (Specificity if groups to Viruses): 104,819 bytes, MD5 checksum 84fd9b10595fbcb3bfa74156c7fcb477
https://fileshare.csb.univie.ac.at/vog/vog234/vfam.virusonly.tsv.gz
vfam.lca.tsv.gz (Last common aencestors of groups): 497,240 bytes, MD5 checksum 10307a214a66c2b4ed092f4ca806378b
https://fileshare.csb.univie.ac.at/vog/vog234/vfam.lca.tsv.gz
vog.hmm.tar.gz (Hidden Markov Models of groups): 455,121,855 bytes, MD5 checksum 223d72ed8cd8e5f966b23c16c5f25641
https://fileshare.csb.univie.ac.at/vog/vog234/vog.hmm.tar.gz
vfam.annotations.tsv.gz (Funcational annotations of groups): 294,527 bytes, MD5 checksum b090bf532b771ab225242a404827af16
https://fileshare.csb.univie.ac.at/vog/vog234/vfam.annotations.tsv.gz
vog.faa.tar.gz (Protein sequences of groups): 61,850,606 bytes, MD5 checksum 07af77d916f7d0ad85902418b69bc3e0
https://fileshare.csb.univie.ac.at/vog/vog234/vog.faa.tar.gz
vogdb.host.txt (Host information and classification for genomes): 474,175 bytes, MD5 checksum db3fe76efd019b4980ffe1da31b590b5
https://fileshare.csb.univie.ac.at/vog/vog234/vogdb.host.txt
vfold.faa.tar.gz (Protein sequences of groups): 60,231,702 bytes, MD5 checksum ee11501baead71126e0632f2d9481b09
https://fileshare.csb.univie.ac.at/vog/vog234/vfold.faa.tar.gz
vfam.raw_algs.tar.gz (Multiple sequence alignments of groups): 53,614,182 bytes, MD5 checksum 13fe0af64b19d7ec9a472301e317c685
https://fileshare.csb.univie.ac.at/vog/vog234/vfam.raw_algs.tar.gz
vog.members.tsv.gz (Member protein ids of groups): 4,600,511 bytes, MD5 checksum ade0ee66d81818431d09e4e6b47efbb3
https://fileshare.csb.univie.ac.at/vog/vog234/vog.members.tsv.gz
vfold.members.tsv.gz (Member protein ids of groups): 4,486,189 bytes, MD5 checksum 35bf0abad8598081e18867dc4d69a78b
https://fileshare.csb.univie.ac.at/vog/vog234/vfold.members.tsv.gz
vfam.hmm.tar.gz (Hidden Markov Models of groups): 369,989,880 bytes, MD5 checksum 957278e2919e99df7a30f01701b9d454
https://fileshare.csb.univie.ac.at/vog/vog234/vfam.hmm.tar.gz
vfold.annotations.tsv.gz (Funcational annotations of groups): 243,163 bytes, MD5 checksum 903415806333d3b88ebfe08c1fadc38b
https://fileshare.csb.univie.ac.at/vog/vog234/vfold.annotations.tsv.gz
vfam.representatives.colabfold_predictions.tar.gz (Protein structure predictions): 5,039,012,334 bytes, MD5 checksum 4a0e33d499e4afe20ebd87aa23ba56e0
https://fileshare.csb.univie.ac.at/vog/vog234/vfam.representatives.colabfold_predictions.tar.gz
vogdb.proteins.all.fa.gz (Protein sequences of all genomes): 105,862,961 bytes, MD5 checksum 18e59f8c11c3051f3523a01fc183575e
https://fileshare.csb.univie.ac.at/vog/vog234/vogdb.proteins.all.fa.gz
vogdb.genes.all.fa.gz (Gene sequences of all genomes): 170,331,744 bytes, MD5 checksum 323a1bb2649925aa4e396f26b7da7a3f
https://fileshare.csb.univie.ac.at/vog/vog234/vogdb.genes.all.fa.gz
vfam.raw_algs.alistat.txt (Statistics of multiple alignments): 2,718,578 bytes, MD5 checksum a75ac83a979d7c43fdbfe2313184e07e
https://fileshare.csb.univie.ac.at/vog/vog234/vfam.raw_algs.alistat.txt
vfam.representatives.colabfold_mean_plddt.txt (Mean pLDDT values of protein structure predictions): 635,376 bytes, MD5 checksum 85eebde06ddaff3f1378ade092c496e4
https://fileshare.csb.univie.ac.at/vog/vog234/vfam.representatives.colabfold_mean_plddt.txt
vog.virusonly.tsv.gz (Specificity if groups to Viruses): 126,157 bytes, MD5 checksum 76a8cd85eca073dcb25ff871036b5eb8
https://fileshare.csb.univie.ac.at/vog/vog234/vog.virusonly.tsv.gz
vog.raw_algs.tar.gz (Multiple sequence alignments of groups): 53,412,286 bytes, MD5 checksum 62821e83b7597611354dcd74fcf205e3
https://fileshare.csb.univie.ac.at/vog/vog234/vog.raw_algs.tar.gz
vfold.lca.tsv.gz (Last common aencestors of groups): 412,291 bytes, MD5 checksum 1c5ffe6944d34f415c67474a84881f71
https://fileshare.csb.univie.ac.at/vog/vog234/vfold.lca.tsv.gz
vogdb.functional_categories.txt (Lettercodes of functional categories): 308 bytes, MD5 checksum 6b816cc49c17d0095da91bad4e7552fa
https://fileshare.csb.univie.ac.at/vog/vog234/vogdb.functional_categories.txt
vfold.virusonly.tsv.gz (Specificity if groups to Viruses): 86,788 bytes, MD5 checksum 14fc309cf96eea8b0c8b89dedcee23ba
https://fileshare.csb.univie.ac.at/vog/vog234/vfold.virusonly.tsv.gz
vog.lca.tsv.gz (Last common aencestors of groups): 614,139 bytes, MD5 checksum c514540736876d48652130c795db2858
https://fileshare.csb.univie.ac.at/vog/vog234/vog.lca.tsv.gz
vogdb.species.txt (Virus genomes used for VOG construction): 799,590 bytes, MD5 checksum 5159318ce8353b41fa8c28a20a5e52d5
https://fileshare.csb.univie.ac.at/vog/vog234/vogdb.species.txt
vogdb.taxonomy.krona.html (Distribution of virus genome taxonomies): 6,827,733 bytes, MD5 checksum a94e77047774ca55f4e36cc7cf5b46bf
https://fileshare.csb.univie.ac.at/vog/vog234/vogdb.taxonomy.krona.html
vfam.faa.tar.gz (Protein sequences of groups): 61,338,977 bytes, MD5 checksum 08e557885dad321d5f45054c9af0779d
https://fileshare.csb.univie.ac.at/vog/vog234/vfam.faa.tar.gz