Release date
Release ID
233
Data source
NCBI RefSeq release 233
Number of genomes
15251
Number of proteins
712553
VOGDB group
Number of VFAM: 39975 (Virus protein families)
Number of VOG: 48870 (Virus orthologous groups)
Number of VFOLD: 33351 (Virus protein structural folds)
Base URL
VOGDB File
vog.raw_algs.alistat.txt (Statistics of multiple alignments): 3,275,692 bytes, MD5 checksum c9989df9950c7e46bd5665f1364d6e6c
https://fileshare.csb.univie.ac.at/vog/vog233/vog.raw_algs.alistat.txt
vog.annotations.tsv.gz (Functional annotations of groups): 372,611 bytes, MD5 checksum 9c026f506b4d3c8ae976953f751d8df9
https://fileshare.csb.univie.ac.at/vog/vog233/vog.annotations.tsv.gz
vfam.members.tsv.gz (Member protein ids of groups): 4,482,163 bytes, MD5 checksum 8b77ff081d3a6693c4b1beeb985b2eeb
https://fileshare.csb.univie.ac.at/vog/vog233/vfam.members.tsv.gz
vfam.virusonly.tsv.gz (Specificity of groups to viruses): 105,454 bytes, MD5 checksum 9baf698714d25d413a5a57484c6193bb
https://fileshare.csb.univie.ac.at/vog/vog233/vfam.virusonly.tsv.gz
vfam.lca.tsv.gz (Last common ancestors of groups): 481,829 bytes, MD5 checksum 9a370a760752debff0cb477885b305f9
https://fileshare.csb.univie.ac.at/vog/vog233/vfam.lca.tsv.gz
vog.hmm.tar.gz (Hidden Markov Models of groups): 567,041,630 bytes, MD5 checksum ebedf027930d9a08cb42c719a49a91d7
https://fileshare.csb.univie.ac.at/vog/vog233/vog.hmm.tar.gz
vfam.annotations.tsv.gz (Functional annotations of groups): 295,486 bytes, MD5 checksum f3ad23daf30b0d2773a321228507155c
https://fileshare.csb.univie.ac.at/vog/vog233/vfam.annotations.tsv.gz
vog.faa.tar.gz (Protein sequences of groups): 64,843,583 bytes, MD5 checksum d939c2d9492dffc57c654c17300354c1
https://fileshare.csb.univie.ac.at/vog/vog233/vog.faa.tar.gz
vogdb.host.txt (Host information and classification for genomes): 470,314 bytes, MD5 checksum 33eab4d6c347ae04c57aca27a43d61e1
https://fileshare.csb.univie.ac.at/vog/vog233/vogdb.host.txt
vfold.faa.tar.gz (Protein sequences of groups): 62,779,721 bytes, MD5 checksum 8bc9eac2359e27ebc6978bc8a10faa2b
https://fileshare.csb.univie.ac.at/vog/vog233/vfold.faa.tar.gz
vfam.raw_algs.tar.gz (Multiple sequence alignments of groups): 61,408,483 bytes, MD5 checksum d41b56cbaed24e4b1968b17c57f676e7
https://fileshare.csb.univie.ac.at/vog/vog233/vfam.raw_algs.tar.gz
vog.members.tsv.gz (Member protein ids of groups): 4,567,453 bytes, MD5 checksum b861867c08f48e278defb44b476f460f
https://fileshare.csb.univie.ac.at/vog/vog233/vog.members.tsv.gz
vfold.members.tsv.gz (Member protein ids of groups): 4,441,152 bytes, MD5 checksum 913eba92b5303d7d614208a4467c50cd
https://fileshare.csb.univie.ac.at/vog/vog233/vfold.members.tsv.gz
vfam.hmm.tar.gz (Hidden Markov Models of groups): 461,508,597 bytes, MD5 checksum 8ebb9d6eb001c35383bbd9c5a7fc668d
https://fileshare.csb.univie.ac.at/vog/vog233/vfam.hmm.tar.gz
vfold.annotations.tsv.gz (Functional annotations of groups): 244,844 bytes, MD5 checksum f2d411483673dc3bc6624962c83291ed
https://fileshare.csb.univie.ac.at/vog/vog233/vfold.annotations.tsv.gz
vfam.representatives.colabfold_predictions.tar.gz (Protein structure predictions): 7,751,723,340 bytes, MD5 checksum 6e58c3e365bfc23f112d36487525a94a
https://fileshare.csb.univie.ac.at/vog/vog233/vfam.representatives.colabfold_predictions.tar.gz
vogdb.proteins.all.fa.gz (Protein sequences of all genomes): 105,515,219 bytes, MD5 checksum 9eafc5d43d1baa07d5177a7ffa7b17c0
https://fileshare.csb.univie.ac.at/vog/vog233/vogdb.proteins.all.fa.gz
vogdb.genes.all.fa.gz (Gene sequences of all genomes): 169,745,636 bytes, MD5 checksum 3ab17122f5a2af8638d6c7c1c9f44703
https://fileshare.csb.univie.ac.at/vog/vog233/vogdb.genes.all.fa.gz
vfam.raw_algs.alistat.txt (Statistics of multiple alignments): 2,726,773 bytes, MD5 checksum 9eb797738ebf2ab33971c3f426bf9d16
https://fileshare.csb.univie.ac.at/vog/vog233/vfam.raw_algs.alistat.txt
vfam.representatives.colabfold_mean_plddt.txt (Mean pLDDT values of protein structure predictions): 638,576 bytes, MD5 checksum 5ce051d7d9137ac4d19e3479017a8293
https://fileshare.csb.univie.ac.at/vog/vog233/vfam.representatives.colabfold_mean_plddt.txt
vog.virusonly.tsv.gz (Specificity of groups to viruses): 126,917 bytes, MD5 checksum 3f53ff04da822cd0824dc6049e8c4127
https://fileshare.csb.univie.ac.at/vog/vog233/vog.virusonly.tsv.gz
vog.raw_algs.tar.gz (Multiple sequence alignments of groups): 60,389,774 bytes, MD5 checksum 050217b0bd5737b648eb80f3fc041b3f
https://fileshare.csb.univie.ac.at/vog/vog233/vog.raw_algs.tar.gz
vfold.lca.tsv.gz (Last common ancestors of groups): 399,308 bytes, MD5 checksum 237e5c59b00b2ad321d6f70bd457d7fe
https://fileshare.csb.univie.ac.at/vog/vog233/vfold.lca.tsv.gz
vogdb.functional_categories.txt (Letter codes of functional categories): 308 bytes, MD5 checksum 6b816cc49c17d0095da91bad4e7552fa
https://fileshare.csb.univie.ac.at/vog/vog233/vogdb.functional_categories.txt
vfold.virusonly.tsv.gz (Specificity of groups to viruses): 87,310 bytes, MD5 checksum 6078f9be67febcc34b4a0fa2c1c41ac8
https://fileshare.csb.univie.ac.at/vog/vog233/vfold.virusonly.tsv.gz
vog.lca.tsv.gz (Last common ancestors of groups): 600,254 bytes, MD5 checksum 5ae145fdc6ac76ecbc1febe1633a6006
https://fileshare.csb.univie.ac.at/vog/vog233/vog.lca.tsv.gz
vogdb.species.txt (Virus genomes used for VOG construction): 792,481 bytes, MD5 checksum fed22b344786c2cda07be40e343752a0
https://fileshare.csb.univie.ac.at/vog/vog233/vogdb.species.txt
vogdb.taxonomy.krona.html (Distribution of virus genome taxonomies): 6,776,506 bytes, MD5 checksum b6c27cae5b65545b14ead93cc63581e3
https://fileshare.csb.univie.ac.at/vog/vog233/vogdb.taxonomy.krona.html
vfam.faa.tar.gz (Protein sequences of groups): 64,042,065 bytes, MD5 checksum 7a30bbf6eca8741c671ca4352e300279
https://fileshare.csb.univie.ac.at/vog/vog233/vfam.faa.tar.gz
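
Each entry above lists a byte size and an MD5 checksum, so a downloaded file can be checked locally before use. The following is a minimal sketch of such a check in Python, not an official VOGDB tool. The function names md5_of_file and verify_download are hypothetical, chosen for illustration, and the sketch assumes the file has already been downloaded to a local path.

```python
import hashlib
from pathlib import Path


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks.

    Chunked reading keeps memory use bounded even for the multi-gigabyte
    archives in the listing.
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_md5, expected_size=None):
    """Compare a local file against the listed size (optional) and MD5.

    Hypothetical helper: the size is checked first because it is cheap,
    then the checksum is computed only if the size matches.
    """
    path = Path(path)
    if expected_size is not None and path.stat().st_size != expected_size:
        return False
    return md5_of_file(path) == expected_md5.lower()
```

For example, after saving vogdb.functional_categories.txt to the working directory, calling verify_download("vogdb.functional_categories.txt", "6b816cc49c17d0095da91bad4e7552fa", 308) should return True if the file arrived intact. The sizes in the listing are written with thousands separators, so the commas need to be removed before passing a size as an integer.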