There is uncertainty about the true nature of predicted
single-nucleotide polymorphisms (SNPs) in segmental duplications
(duplicons) and whether these markers genuinely exist at increased
density as indicated in public databases. We explored these issues by
genotyping 157 predicted SNPs in duplicons and control regions in
normal diploid genomes and fully homozygous complete hydatidiform
moles. Our data identified many true SNPs in duplicon regions and few
paralogous sequence variants. Twenty-eight percent of the polymorphic
duplicon sequences we tested involved multisite variation, a new type
of polymorphism representing the sum of the signals from many
individual duplicon copies that vary in sequence content due to
duplication, deletion or gene conversion. Multisite variations can
masquerade as normal SNPs when genotyped. Given that duplicons
comprise at least 5\% of the genome and many are yet to be annotated in
the genome draft, effective strategies to identify multisite variation
must be established and deployed.
In conclusion, our study identifies MSVs as a new form of genome
polymorphism. Careful laboratory practice should often recognize MSVs
as aberrant markers, and MSVs may underlie the considerable fraction
of markers that fail HWE. But some MSVs are probably being interpreted
and used as unique SNPs, and HWE will not always identify these, even
if large sample numbers are used. More generally, MSVs (or rather
duplicon copy-number variation and duplicon gene conversion processes)
might underlie some common phenotypic differences between
individuals. We therefore suggest that MSVs should be specifically
targeted for evaluation in disease and pharmacogenomics research.
%0 Journal Article
%1 pmid15247918
%A Fredman, David
%A White, Stefan J
%A Potter, Susanna
%A Eichler, Evan E
%A Dunnen, Johan T Den
%A Brookes, Anthony J
%D 2004
%J Nat Genet
%K imported
%N 8
%P 861-866
%T Complex SNP-related sequence variation in segmental genome duplications
%V 36
%X There is uncertainty about the true nature of predicted
single-nucleotide polymorphisms (SNPs) in segmental duplications
(duplicons) and whether these markers genuinely exist at increased
density as indicated in public databases. We explored these issues by
genotyping 157 predicted SNPs in duplicons and control regions in
normal diploid genomes and fully homozygous complete hydatidiform
moles. Our data identified many true SNPs in duplicon regions and few
paralogous sequence variants. Twenty-eight percent of the polymorphic
duplicon sequences we tested involved multisite variation, a new type
of polymorphism representing the sum of the signals from many
individual duplicon copies that vary in sequence content due to
duplication, deletion or gene conversion. Multisite variations can
masquerade as normal SNPs when genotyped. Given that duplicons
comprise at least 5\% of the genome and many are yet to be annotated in
the genome draft, effective strategies to identify multisite variation
must be established and deployed.
In conclusion, our study identifies MSVs as a new form of genome
polymorphism. Careful laboratory practice should often recognize MSVs
as aberrant markers, and MSVs may underlie the considerable fraction
of markers that fail HWE. But some MSVs are probably being interpreted
and used as unique SNPs, and HWE will not always identify these, even
if large sample numbers are used. More generally, MSVs (or rather
duplicon copy-number variation and duplicon gene conversion processes)
might underlie some common phenotypic differences between
individuals. We therefore suggest that MSVs should be specifically
targeted for evaluation in disease and pharmacogenomics research.
@article{pmid15247918,
abstract = {
There is uncertainty about the true nature of predicted
single-nucleotide polymorphisms (SNPs) in segmental duplications
(duplicons) and whether these markers genuinely exist at increased
density as indicated in public databases. We explored these issues by
genotyping 157 predicted SNPs in duplicons and control regions in
normal diploid genomes and fully homozygous complete hydatidiform
moles. Our data identified many true SNPs in duplicon regions and few
paralogous sequence variants. Twenty-eight percent of the polymorphic
duplicon sequences we tested involved multisite variation, a new type
of polymorphism representing the sum of the signals from many
individual duplicon copies that vary in sequence content due to
duplication, deletion or gene conversion. Multisite variations can
masquerade as normal SNPs when genotyped. Given that duplicons
comprise at least 5\% of the genome and many are yet to be annotated in
the genome draft, effective strategies to identify multisite variation
must be established and deployed.
In conclusion, our study identifies MSVs as a new form of genome
polymorphism. Careful laboratory practice should often recognize MSVs
as aberrant markers, and MSVs may underlie the considerable fraction
of markers that fail HWE. But some MSVs are probably being interpreted
and used as unique SNPs, and HWE will not always identify these, even
if large sample numbers are used. More generally, MSVs (or rather
duplicon copy-number variation and duplicon gene conversion processes)
might underlie some common phenotypic differences between
individuals. We therefore suggest that MSVs should be specifically
targeted for evaluation in disease and pharmacogenomics research.
},
added-at = {2007-01-09T17:36:36.000+0100},
author = {Fredman, David and White, Stefan J and Potter, Susanna and Eichler, Evan E and Dunnen, Johan T Den and Brookes, Anthony J},
biburl = {https://www.bibsonomy.org/bibtex/2e67f235ee8a21d1039b08e30ee99a5c3/pjotrp},
interhash = {2b1cc80c9295b51f9ae282538b933639},
intrahash = {e67f235ee8a21d1039b08e30ee99a5c3},
journal = {Nat Genet},
keywords = {imported},
month = Aug,
number = 8,
pages = {861-866},
timestamp = {2007-01-09T17:36:36.000+0100},
title = {{Complex SNP-related sequence variation in segmental genome duplications}},
volume = 36,
year = 2004
}