Simulated Illumina BRCA1 reads in FASTQ format
Datasets usually provide raw data for analysis. This raw data often comes in spreadsheet form, but can be any collection of data, on which analysis can be performed.
Sixty-seven groups of 200,000 90bp paired-end FASTQ Illumina reads from the human BRCA1 gene (hg19). Each sequence grouping was mutated in-silico to introduce a combination of 20 SNPs and 13 INDELs.