One may download COSMIC VCF, dbSNP VCF and reference genome files required for running the somatic mutation annotator. Below is the guidance about how to fetch these files.
One may download COSMIC VCF, dbSNP VCF and reference genome files required for running the somatic mutation annotator. Below is the guidance about how to fetch these files.
In order to download COSMIC data, one is required to register first in the COSMIC website. Please go to the log-in page to register. After one has registered, please follow the instruction at the download page to download COSMIC data. There are two ways one can try to download COSMIC VCF files:
sftp username@sftp-cancer.sanger.ac.uk
. sftp>
prompt displays.
For example, if one wants to download the version 76 of COSMIC VCF file associated with GRCh37 genome build, please type:sftp> cd /cosmic/grch37/cosmic/v76/VCF
sftp> get CosmicCodingMuts.vcf.gz
sftp> cd /cosmic/grch38/cosmic/v76/VCF
sftp> get CosmicCodingMuts.vcf.gz
sftp> quit
/cosmic/grch37/cosmic/v76/VCF/CosmicCodingMuts.vcf.gz
. Notice that, since we are interested in the coding mutations, please download
CosmicCodingMuts.vcf.gz
instead of CosmicNonCodingVariants.vcf.gz
.
One can download the latest dbSNP VCF file from the dbSNP website.
For example, one can download the 146 build of dbSNP VCF file through the following command line:
$ wget ftp://ftp.ncbi.nih.gov/snp/organisms/human_9606_b146_GRCh38p2/VCF/common_all_20151104.vcf.gz
,
for GRCh38.
As for GRCh37, please download the dbSNP VCF file via the command line:
$ wget ftp://ftp.ncbi.nih.gov/snp/organisms/human_9606_b146_GRCh37p13/VCF/common_all_20151104.vcf.gz
.
Please visit the dbSNP website for the latest build.
One can download the reference genome files by following the instruction in the tutorial section here.
If one runs the somatic mutation annotator for the first time, both ANNOVAR and SnpEff will automatically download the dbNSFP database files.
For ANNOVAR, the files associated with one selected genome build will be downloaded from the ANNOVAR website
to humandb/
under the ANNOVAR directory,
whereas, for SnpEff, the following files will be downloaded to variantAnnoDatabase/dbNSFP/
under
the user's home directory. Please notice that one must put the database and index files in the same folder.