This directory contains FASTA files with a modified version
of the Feb. 2009 (GRCh37/hg19) reference human genome assembly.
The chromosomal sequences were assembled by the International Human
Genome Project sequencing centers. The assembly sequence was changed
to use IUPAC ambiguous nucleotide characters at each base covered by a
stringently filtered subset of single-base substitutions annotated by
dbSNP build 150. For example, if the assembly has an 'A' at a position
where dbSNP has annotated an A/C/T substitution SNP, the 'A' is replaced
by 'H' in the FASTA file here.
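The substitution described above can be sketched in Python. This is only an
illustration of the standard IUPAC nucleotide code table, not the program used
to build these files; the function name iupac_for_observed and the
slash-separated input format are assumptions made for the example.

```python
# Standard IUPAC nucleotide codes, keyed by the set of bases each one represents.
IUPAC_CODES = {
    frozenset("A"): "A",
    frozenset("C"): "C",
    frozenset("G"): "G",
    frozenset("T"): "T",
    frozenset("AG"): "R",
    frozenset("CT"): "Y",
    frozenset("CG"): "S",
    frozenset("AT"): "W",
    frozenset("GT"): "K",
    frozenset("AC"): "M",
    frozenset("CGT"): "B",
    frozenset("AGT"): "D",
    frozenset("ACT"): "H",
    frozenset("ACG"): "V",
    frozenset("ACGT"): "N",
}


def iupac_for_observed(observed):
    """Return the IUPAC code for a slash-separated allele string such as 'A/C/T'.

    Hypothetical helper: the input format is an assumption for this sketch.
    """
    alleles = frozenset(observed.upper().split("/"))
    return IUPAC_CODES[alleles]


# The example from the text: an A/C/T substitution is written as 'H'.
print(iupac_for_observed("A/C/T"))  # H
```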
dbSNP single-base substitutions were excluded from masking in the
following cases:
- UCSC tagged the dbSNP item with any of these exceptions (see also the
  exceptions field of the hg19.snp150 database table as well as the
  hg19.snp150ExceptionDesc table):
    - MultipleAlignments: dbSNP mapped the item to multiple locations.
    - ObservedMismatch: the reference allele does not appear in the item's
      observed alleles.
    - ObservedWrongFormat: the observed sequence has an unexpected format.
- dbSNP item class is not "single".
- dbSNP item length is not exactly one base.
- dbSNP item weight is greater than 1 (lower weight = higher confidence).
The remaining single-base substitutions were used to mask the genomic
sequence.
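The exclusion rules above can be expressed as a single predicate. The sketch
below is not the UCSC pipeline. It assumes each dbSNP item is a dict whose keys
follow hg19.snp150 column names (chromStart, chromEnd, class, weight,
exceptions) and that exceptions is a comma-separated string. The two sample
items are made up for illustration.

```python
# Exceptions that remove an item from masking, as listed in this README.
EXCLUDING_EXCEPTIONS = {
    "MultipleAlignments",
    "ObservedMismatch",
    "ObservedWrongFormat",
}


def is_used_for_masking(item):
    """Return True if a dbSNP item passes every filter described above.

    Assumption: `item` is a dict keyed by hg19.snp150 column names, with
    integer coordinates and weight, and a comma-separated exceptions string.
    """
    exceptions = {name for name in item["exceptions"].split(",") if name}
    if exceptions & EXCLUDING_EXCEPTIONS:
        return False
    if item["class"] != "single":
        return False
    if item["chromEnd"] - item["chromStart"] != 1:
        return False
    if item["weight"] > 1:
        return False
    return True


# Made-up items, for illustration only.
kept = {"chromStart": 100, "chromEnd": 101, "class": "single",
        "weight": 1, "exceptions": ""}
dropped = {"chromStart": 200, "chromEnd": 201, "class": "single",
           "weight": 3, "exceptions": "MultipleAlignments,"}

print(is_used_for_masking(kept))     # True
print(is_used_for_masking(dropped))  # False
```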
Files included in this directory:
chr*.subst.fa.gz - FASTA files with IUPAC characters for substitution SNPs
md5sum.txt - checksums of files in this directory
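Downloaded files can be checked against md5sum.txt. The following Python
sketch assumes md5sum.txt uses the usual md5sum layout (one checksum and one
file name per line, separated by whitespace); the function names are
hypothetical. Files listed in md5sum.txt but not present locally are skipped.

```python
import hashlib
import os


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def parse_md5sum_lines(lines):
    """Map file name -> expected checksum (assumed '<md5> <name>' per line)."""
    expected = {}
    for line in lines:
        parts = line.split()
        if len(parts) == 2:
            checksum, name = parts
            # md5sum marks binary-mode entries with a leading '*'.
            expected[name.lstrip("*")] = checksum.lower()
    return expected


def verify_directory(directory="."):
    """Return {file name: True/False} for listed files that exist locally."""
    with open(os.path.join(directory, "md5sum.txt")) as handle:
        expected = parse_md5sum_lines(handle)
    results = {}
    for name, checksum in expected.items():
        path = os.path.join(directory, name)
        if os.path.exists(path):
            results[name] = md5_of_file(path) == checksum
    return results
```

Calling verify_directory() in the download directory returns a dict in which
any False value marks a file whose checksum does not match.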
------------------------------------------------------------------
If you plan to download a large file or multiple files from this
directory, we recommend that you use ftp rather than downloading the
files via our website. To do so, ftp to hgdownload.soe.ucsc.edu
[username: anonymous, password: your email address], then cd to the
directory goldenPath/hg19/snp150Mask. To download multiple files, use
the "mget" command:
    mget <filename1> <filename2> ...
    - or -
    mget -a (to download all the files in the directory)
Alternatives to ftp access:
Using an rsync command to download the entire directory:
    rsync -avzP rsync://hgdownload.soe.ucsc.edu/goldenPath/hg19/snp150Mask/ .
For a single file, e.g. chr1.subst.fa.gz:
    rsync -avzP \
        rsync://hgdownload.soe.ucsc.edu/goldenPath/hg19/snp150Mask/chr1.subst.fa.gz .
Or with wget, all files:
    wget --timestamping \
        'ftp://hgdownload.soe.ucsc.edu/goldenPath/hg19/snp150Mask/*'
With wget, a single file:
    wget --timestamping \
        'ftp://hgdownload.soe.ucsc.edu/goldenPath/hg19/snp150Mask/chr1.subst.fa.gz' \
        -O chr1.subst.fa.gz
To uncompress the fa.gz files:
    gunzip <file>.fa.gz
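The files can also be read without uncompressing them first. The Python sketch
below counts the IUPAC ambiguity codes in one file; the function names are
hypothetical. It assumes that letter case may be mixed and upper-cases each
line before counting. 'N' is deliberately not counted, because in the assembly
it also marks gaps and so cannot be attributed to a SNP.

```python
import gzip
from collections import Counter

# IUPAC codes that stand for exactly two or three bases ('N' excluded, see above).
AMBIGUITY_CODES = frozenset("RYSWKMBDHV")


def count_ambiguity_codes(lines):
    """Count ambiguity codes in an iterable of FASTA lines, skipping headers."""
    counts = Counter()
    for line in lines:
        if line.startswith(">"):
            continue
        counts.update(ch for ch in line.strip().upper() if ch in AMBIGUITY_CODES)
    return counts


def count_in_file(path):
    """Open a gzip-compressed FASTA file in text mode and count its codes."""
    with gzip.open(path, "rt") as handle:
        return count_ambiguity_codes(handle)
```

For example, count_in_file("chr21.subst.fa.gz") returns a Counter keyed by
ambiguity code for that chromosome.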
Name Last modified Size Description
chr1.subst.fa.gz 2017-05-08 16:21 85M
chr1_gl000191_random.subst.fa.gz 2017-05-08 16:24 34K
chr1_gl000192_random.subst.fa.gz 2017-05-08 16:24 179K
chr2.subst.fa.gz 2017-05-08 16:24 90M
chr3.subst.fa.gz 2017-05-08 16:25 74M
chr4.subst.fa.gz 2017-05-08 16:25 71M
chr4_ctg9_hap1.subst.fa.gz 2017-05-08 16:25 232K
chr4_gl000193_random.subst.fa.gz 2017-05-08 16:26 62K
chr4_gl000194_random.subst.fa.gz 2017-05-08 16:26 67K
chr5.subst.fa.gz 2017-05-08 16:26 67M
chr6.subst.fa.gz 2017-05-08 16:26 63M
chr6_apd_hap1.subst.fa.gz 2017-05-08 16:26 925K
chr6_cox_hap2.subst.fa.gz 2017-05-08 16:26 1.8M
chr6_dbb_hap3.subst.fa.gz 2017-05-08 16:26 1.6M
chr6_mann_hap4.subst.fa.gz 2017-05-08 16:26 1.5M
chr6_mcf_hap5.subst.fa.gz 2017-05-08 16:26 1.4M
chr6_qbl_hap6.subst.fa.gz 2017-05-08 16:26 1.6M
chr6_ssto_hap7.subst.fa.gz 2017-05-08 16:26 1.6M
chr7.subst.fa.gz 2017-05-08 16:27 59M
chr7_gl000195_random.subst.fa.gz 2017-05-08 16:27 67K
chr8.subst.fa.gz 2017-05-08 16:27 55M
chr8_gl000196_random.subst.fa.gz 2017-05-08 16:27 13K
chr8_gl000197_random.subst.fa.gz 2017-05-08 16:27 12K
chr9.subst.fa.gz 2017-05-08 16:27 46M
chr9_gl000198_random.subst.fa.gz 2017-05-08 16:27 20K
chr9_gl000199_random.subst.fa.gz 2017-05-08 16:27 30K
chr9_gl000200_random.subst.fa.gz 2017-05-08 16:27 61K
chr9_gl000201_random.subst.fa.gz 2017-05-08 16:27 12K
chr10.subst.fa.gz 2017-05-08 16:22 50M
chr11.subst.fa.gz 2017-05-08 16:22 50M
chr11_gl000202_random.subst.fa.gz 2017-05-08 16:22 13K
chr12.subst.fa.gz 2017-05-08 16:22 49M
chr13.subst.fa.gz 2017-05-08 16:22 36M
chr14.subst.fa.gz 2017-05-08 16:23 33M
chr15.subst.fa.gz 2017-05-08 16:23 31M
chr16.subst.fa.gz 2017-05-08 16:23 30M
chr17.subst.fa.gz 2017-05-08 16:23 29M
chr17_ctg5_hap1.subst.fa.gz 2017-05-08 16:23 569K
chr17_gl000203_random.subst.fa.gz 2017-05-08 16:23 13K
chr17_gl000204_random.subst.fa.gz 2017-05-08 16:23 27K
chr17_gl000205_random.subst.fa.gz 2017-05-08 16:23 58K
chr17_gl000206_random.subst.fa.gz 2017-05-08 16:23 13K
chr18.subst.fa.gz 2017-05-08 16:23 28M
chr18_gl000207_random.subst.fa.gz 2017-05-08 16:23 1.5K
chr19.subst.fa.gz 2017-05-08 16:23 21M
chr19_gl000208_random.subst.fa.gz 2017-05-08 16:23 24K
chr19_gl000209_random.subst.fa.gz 2017-05-08 16:23 48K
chr20.subst.fa.gz 2017-05-08 16:24 23M
chr21.subst.fa.gz 2017-05-08 16:24 13M
chr21_gl000210_random.subst.fa.gz 2017-05-08 16:25 9.1K
chr22.subst.fa.gz 2017-05-08 16:25 13M
chrUn_gl000211.subst.fa.gz 2017-05-08 16:27 56K
chrUn_gl000212.subst.fa.gz 2017-05-08 16:27 63K
chrUn_gl000213.subst.fa.gz 2017-05-08 16:27 54K
chrUn_gl000214.subst.fa.gz 2017-05-08 16:27 44K
chrUn_gl000215.subst.fa.gz 2017-05-08 16:27 56K
chrUn_gl000216.subst.fa.gz 2017-05-08 16:27 43K
chrUn_gl000217.subst.fa.gz 2017-05-08 16:27 56K
chrUn_gl000218.subst.fa.gz 2017-05-08 16:27 57K
chrUn_gl000219.subst.fa.gz 2017-05-08 16:27 59K
chrUn_gl000220.subst.fa.gz 2017-05-08 16:27 54K
chrUn_gl000221.subst.fa.gz 2017-05-08 16:27 52K
chrUn_gl000222.subst.fa.gz 2017-05-08 16:27 60K
chrUn_gl000223.subst.fa.gz 2017-05-08 16:27 57K
chrUn_gl000224.subst.fa.gz 2017-05-08 16:27 52K
chrUn_gl000225.subst.fa.gz 2017-05-08 16:27 58K
chrUn_gl000226.subst.fa.gz 2017-05-08 16:27 2.6K
chrUn_gl000227.subst.fa.gz 2017-05-08 16:27 41K
chrUn_gl000228.subst.fa.gz 2017-05-08 16:27 33K
chrUn_gl000229.subst.fa.gz 2017-05-08 16:27 6.7K
chrUn_gl000230.subst.fa.gz 2017-05-08 16:27 14K
chrUn_gl000231.subst.fa.gz 2017-05-08 16:27 9.4K
chrUn_gl000232.subst.fa.gz 2017-05-08 16:27 14K
chrUn_gl000233.subst.fa.gz 2017-05-08 16:27 14K
chrUn_gl000234.subst.fa.gz 2017-05-08 16:27 14K
chrUn_gl000235.subst.fa.gz 2017-05-08 16:27 12K
chrUn_gl000236.subst.fa.gz 2017-05-08 16:27 14K
chrUn_gl000237.subst.fa.gz 2017-05-08 16:27 14K
chrUn_gl000238.subst.fa.gz 2017-05-08 16:27 13K
chrUn_gl000239.subst.fa.gz 2017-05-08 16:27 11K
chrUn_gl000240.subst.fa.gz 2017-05-08 16:27 14K
chrUn_gl000241.subst.fa.gz 2017-05-08 16:27 14K
chrUn_gl000242.subst.fa.gz 2017-05-08 16:27 14K
chrUn_gl000243.subst.fa.gz 2017-05-08 16:27 14K
chrUn_gl000244.subst.fa.gz 2017-05-08 16:27 13K
chrUn_gl000245.subst.fa.gz 2017-05-08 16:27 12K
chrUn_gl000246.subst.fa.gz 2017-05-08 16:27 13K
chrUn_gl000247.subst.fa.gz 2017-05-08 16:27 12K
chrUn_gl000248.subst.fa.gz 2017-05-08 16:27 13K
chrUn_gl000249.subst.fa.gz 2017-05-08 16:27 13K
chrX.subst.fa.gz 2017-05-08 16:27 54M
chrY.subst.fa.gz 2017-05-08 16:27 8.4M
md5sum.txt 2017-05-08 16:29 5.4K