копировать удалить добавить публикацию в буфер
Запись сообщества
посмотреть историю данной записи
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Conditioned genome reconstruction: how to avoid choosing the conditioning genome.

M. Spencer, D. Bryant, и E. Susko. Syst Biol, 56 (1): 25--43 (апреля 2007)
DOI: 10.1080/10635150601156313

Аннотация

Genome phylogenies can be inferred from data on the presence and absence of genes across taxa. Logdet distances may be a good method, because they allow expected genome size to vary across the tree. Recently, Lake and Rivera proposed conditioned genome reconstruction (calculation of logdet distances using only those genes present in a conditioning genome) to deal with unobservable genes that are absent from every taxon of interest. We prove that their method can consistently estimate the topology for almost any choice of conditioning genome. Nevertheless, the choice of conditioning genome is important for small samples. For real bacterial genome data, different choices of conditioning genome can result in strong bootstrap support for different tree topologies. To overcome this problem, we developed supertree methods that combine information from all choices of conditioning genome. One of these methods, based on the BIONJ algorithm, performs well on simulated data and may have applications to other supertree problems. However, an analysis of 40 bacterial genomes using this method supports an incorrect clade of parasites. This is a common feature of model-based gene content methods and is due to parallel gene loss.

Линки и ресурсы

ключ BibTeX: Spencer07
тип записи: article
адрес: Department of Mathematics and Statistics, Dalhousie University, Hali, Nova Scotia, B3H 3J5, Canada. m.spenser@liverpool.ac.uk
год: 2007
месяц: Apr
журнал: Syst Biol
номер: 1
страницы: 25--43
том: 56
pmid: 17366135
pst: ppublish
issn: 1063-5157 (Print)
jt: Systematic biology
edat: 2007/03/17 09:00
date-modified: 2009-01-28 13:04:38 +1300
mhda: 2007/04/28 09:00
jid: 9302532
crdt: 2007/03/17 09:00
dcom: 20070427
da: 20070316
so: Syst Biol. 2007 Feb;56(1):25-43.
stat: MEDLINE
au: Spencer, M and Bryant, D and Susko, E
pii: 771129819
sb: IM
pl: England
pt: Comparative Study; Journal Article; Research Support, Non-U.S. Gov't
own: NLM
language: eng
DOI: 10.1080/10635150601156313
url: http://dx.doi.org/10.1080/10635150601156313

тэги

Цитировать эту публикацию

@article{Spencer07, abstract = {Genome phylogenies can be inferred from data on the presence and absence of genes across taxa. Logdet distances may be a good method, because they allow expected genome size to vary across the tree. Recently, Lake and Rivera proposed conditioned genome reconstruction (calculation of logdet distances using only those genes present in a conditioning genome) to deal with unobservable genes that are absent from every taxon of interest. We prove that their method can consistently estimate the topology for almost any choice of conditioning genome. Nevertheless, the choice of conditioning genome is important for small samples. For real bacterial genome data, different choices of conditioning genome can result in strong bootstrap support for different tree topologies. To overcome this problem, we developed supertree methods that combine information from all choices of conditioning genome. One of these methods, based on the BIONJ algorithm, performs well on simulated data and may have applications to other supertree problems. However, an analysis of 40 bacterial genomes using this method supports an incorrect clade of parasites. This is a common feature of model-based gene content methods and is due to parallel gene loss.}, added-at = {2009-05-14T15:29:45.000+0200}, address = {Department of Mathematics and Statistics, Dalhousie University, Hali, Nova Scotia, B3H 3J5, Canada. m.spenser@liverpool.ac.uk}, au = {Spencer, M and Bryant, D and Susko, E}, author = {Spencer, Matthew and Bryant, David and Susko, Edward}, biburl = {https://www.bibsonomy.org/bibtex/2fc4bacea38b006861cd667011337635b/compevol}, crdt = {2007/03/17 09:00}, da = {20070316}, date-modified = {2009-01-28 13:04:38 +1300}, dcom = {20070427}, doi = {10.1080/10635150601156313}, edat = {2007/03/17 09:00}, interhash = {252e7690fbd5b259c56b531f069c9a96}, intrahash = {fc4bacea38b006861cd667011337635b}, issn = {1063-5157 (Print)}, jid = {9302532}, journal = {Syst Biol}, jt = {Systematic biology}, keywords = {*Algorithms *Models *Phylogeny Bacteria/genetics Classification/*methods Computer Genes/*genetics Genetic Genome/*genetics Simulation from:davidjamesbryant}, language = {eng}, mhda = {2007/04/28 09:00}, month = Apr, number = 1, own = {NLM}, pages = {25--43}, pii = {771129819}, pl = {England}, pmid = {17366135}, pst = {ppublish}, pt = {Comparative Study; Journal Article; Research Support, Non-U.S. Gov't}, sb = {IM}, so = {Syst Biol. 2007 Feb;56(1):25-43.}, stat = {MEDLINE}, timestamp = {2009-05-14T15:29:45.000+0200}, title = {Conditioned genome reconstruction: how to avoid choosing the conditioning genome.}, url = {http://dx.doi.org/10.1080/10635150601156313}, volume = 56, year = 2007 }

искать в

Метаданные

Последнее изменение 15 лет назад
Создан 15 лет назад

Комментарии и рецензии
(0)

Комментарии, или рецензии отсутствуют. Вы можете их написать!