Identification of deletion polymorphisms from haplotypes
Corona, E., Raphael, B., & Eskin, E. (2007). Identification of deletion polymorphisms from haplotypes. In T. Speed, & H. Huang (Eds.), RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, PROCEEDINGS (pp. 354-+). Springer-Verlag Berlin Heidelberg.
Numerous efforts are underway to catalog genetic variation in human populations. While the majority of studies of genetic variation have focused on single base pair differences between individuals, i.e. single nucleotide polymorphisms (SNPs), several recent studies have demonstrated that larger scale structural variation including copy number polymorphisms and inversion polymorphisms are also common. However, direct techniques for detection and validation of structural variants are generally much more expensive than detection and validation of SNPs. For some types of structural variation, in particular deletions, the polymorphism produces a distinct signature in the SNP data. In this paper, we describe a new probabilistic method for detecting deletion polymorphisms from SNP data. The key idea in our method is that we estimate the frequency of the haplotypes in a region of the genome both with and without the possibility of a deletion in the region and apply a generalized likelihood ratio test to assess the significance of a deletion. Application of our method to the HapMap Phase I data revealed 319 candidate deletions, 142 of these overlap with variants identified in earlier studies, while 177 are novel. Using Phase 11 HapMap data we predict 6730 deletions.