Algorithms for Phylogenetic Tree Reconstruction Based on Genome Rearrangements [microform]

Algorithms for Phylogenetic Tree Reconstruction Based on Genome Rearrangements [microform]
Title Algorithms for Phylogenetic Tree Reconstruction Based on Genome Rearrangements [microform] PDF eBook
Author Bourque, Guillaume
Publisher Ann Arbor, Mich. : University Microfilms International
Pages 190
Release 2002
Genre
ISBN

Download Algorithms for Phylogenetic Tree Reconstruction Based on Genome Rearrangements [microform] Book in PDF, Epub and Kindle

Models and Algorithms for Genome Evolution

Models and Algorithms for Genome Evolution
Title Models and Algorithms for Genome Evolution PDF eBook
Author Cedric Chauve
Publisher Springer Science & Business Media
Pages 329
Release 2013-09-17
Genre Computers
ISBN 1447152980

Download Models and Algorithms for Genome Evolution Book in PDF, Epub and Kindle

This authoritative text/reference presents a review of the history, current status, and potential future directions of computational biology in molecular evolution. Gathering together the unique insights of an international selection of prestigious researchers, this must-read volume examines the latest developments in the field, the challenges that remain, and the new avenues emerging from the growing influx of sequence data. These viewpoints build upon the pioneering work of David Sankoff, one of the founding fathers of computational biology, and mark the 50th anniversary of his first scientific article. The broad spectrum of rich contributions in this essential collection will appeal to all computer scientists, mathematicians and biologists involved in comparative genomics, phylogenetics and related areas.

Enhance the Understanding of Whole-genome Evolution by Designing, Accelerating and Parallelizing Phylogenetic Algorithms

Enhance the Understanding of Whole-genome Evolution by Designing, Accelerating and Parallelizing Phylogenetic Algorithms
Title Enhance the Understanding of Whole-genome Evolution by Designing, Accelerating and Parallelizing Phylogenetic Algorithms PDF eBook
Author Zhaoming Yin
Publisher
Pages
Release 2014
Genre Algorithms
ISBN

Download Enhance the Understanding of Whole-genome Evolution by Designing, Accelerating and Parallelizing Phylogenetic Algorithms Book in PDF, Epub and Kindle

The advent of new technology enhance the speed and reduce the cost for sequencing biological data. Making biological sense of this genomic data is a big challenge to the algorithm design as well as the high performance computing society. There are many problems in Bioinformatics, such as how new functional genes arise, why genes are organized into chromosomes, how species are connected through the evolutionary tree of life, or why arrangements are subject to change. Phylogenetic analyses have become essential to research on the evolutionary tree of life. It can help us to track the history of species and the relationship between different genes or genomes through millions of years. One of the fundamentals for phylogenetic construction is the computation of distances between genomes. Since there are much more complicated combinatoric patterns in rearrangement events, the distance computation is still a hot topic as much belongs to mathematics as to biology. For the distance computation with input of two genomes containing unequal gene contents (with insertions/deletions and duplications) the problem is especially hard. In this thesis, we will discuss about our contributions to the distance estimation for unequal gene order data. The problem of finding the median of three genomes is the key process in building the most parsimonious phylogenetic trees from genome rearrangement data. For genomes with unequal contents, to the best of our knowledge, there is no algorithm that can help to find the median. In this thesis, we make our contributions to the median computation in two aspects. 1) Algorithm engineering aspect, we harness the power of streaming graph analytics methods to implement an exact DCJ median algorithm which run as fast as the heuristic algorithm and can help construct a better phylogenetic tree. 2) Algorithmic aspect, we theoretically formulate the problem of finding median with input of genomes having unequal gene content, which leads to the design and implementation of an efficient Lin-Kernighan heuristic based median algorithm. Inferring phylogenies (evolutionary history) of a set of given species is the ultimate goal when the distance and median model are chosen. For more than a decade, biologists and computer scientists have studied how to infer phylogenies by the measurement of genome rearrangement events using gene order data. While evolution is not an inherently parsimonious process, maximum parsimony (MP) phylogenetic analysis has been supported by widely applied to the phylogeny inference to study the evolutionary patterns of genome rearrangements. There are generally two problems with the MP phylogenetic arose by genome rearrangement: One is, given a set of modern genomes, how to compute the topologies of the according phylogenetic tree; Another is, given the topology of a model tree, how to infer the gene orders of the ancestor species. To assemble a MP phylogenetic tree constructor, there are multiple NP hard problems involved, unfortunately, they organized as one problem on top of other problems. Which means, to solve a NP hard problem, we need to solve multiple NP hard sub-problems. For phylogenetic tree construction with the input of unequal content genomes, there are three layers of NP hard problems. In this thesis, we will mainly discuss about our contributions to the design and implementation of the software package DCJUC (Phylogeny Inference using DCJ model to cope with Unequal Content Genomes), that can help to achieve both of these two goals. Aside from the biological problems, another issue we need to concern is about the use of the power of parallel computing to assist accelerating algorithms to handle huge data sets, such as the high resolution gene order data. For one thing, all of the method to tackle with phylogenetic problems are based on branch and bound algorithms, which are quite irregular and unfriendly to parallel computing. To parallelize these algorithms, we need to properly enhance the efficiency for localized memory access and load balance methods to make sure that each thread can put their potentials into full play. For the other, there is a revolution taking place in computing with the availability of commodity graphical processors such as Nvidia GPU and with many-core CPUs such as Cray-XMT, or Intel Xeon Phi Coprocessor with 60 cores. These architectures provide a new way for us to achieve high performance at much lower cost. However, code running on these machines are not so easily programmed, and scientific computing is hard to tune well on them. We try to explore the potentials of these architectures to help us accelerate branch and bound based phylogenetic algorithms.

Evolutionary Ancestor Inference Via Genome Rearrangement

Evolutionary Ancestor Inference Via Genome Rearrangement
Title Evolutionary Ancestor Inference Via Genome Rearrangement PDF eBook
Author Zaky Adam
Publisher
Pages 198
Release 2009
Genre University of Ottawa theses
ISBN

Download Evolutionary Ancestor Inference Via Genome Rearrangement Book in PDF, Epub and Kindle

Evaluation of Phylogeny Reconstruction Algorithms

Evaluation of Phylogeny Reconstruction Algorithms
Title Evaluation of Phylogeny Reconstruction Algorithms PDF eBook
Author Dehua Hang
Publisher
Pages 294
Release 2005
Genre Branch and bound algorithms
ISBN

Download Evaluation of Phylogeny Reconstruction Algorithms Book in PDF, Epub and Kindle

Inference of Insertion and Deletion Scenarios for Ancestral Genome Reconstruction and Phylogenetic Analyses

Inference of Insertion and Deletion Scenarios for Ancestral Genome Reconstruction and Phylogenetic Analyses
Title Inference of Insertion and Deletion Scenarios for Ancestral Genome Reconstruction and Phylogenetic Analyses PDF eBook
Author Abdoulaye Diallo
Publisher
Pages
Release 2009
Genre
ISBN

Download Inference of Insertion and Deletion Scenarios for Ancestral Genome Reconstruction and Phylogenetic Analyses Book in PDF, Epub and Kindle

Phylogenetic Tree Reconstruction with Protein Linkage

Phylogenetic Tree Reconstruction with Protein Linkage
Title Phylogenetic Tree Reconstruction with Protein Linkage PDF eBook
Author Junjie Yu
Publisher Open Dissertation Press
Pages
Release 2017-01-26
Genre
ISBN 9781361307441

Download Phylogenetic Tree Reconstruction with Protein Linkage Book in PDF, Epub and Kindle

This dissertation, "Phylogenetic Tree Reconstruction With Protein Linkage" by Junjie, Yu, 于俊杰, was obtained from The University of Hong Kong (Pokfulam, Hong Kong) and is being sold pursuant to Creative Commons: Attribution 3.0 Hong Kong License. The content of this dissertation has not been altered in any way. We have altered the formatting in order to facilitate the ease of printing and reading of the dissertation. All rights not granted by the above license are retained by the author. Abstract: Phylogenetic tree reconstruction for a set of species is an important problem for understanding the evolutionary history of the species. Existing algorithms usually represent each species as a binary string with each bit indicating whether a particular gene/protein exists in the species. Given the topology of a phylogenetic tree with each leaf representing a species (a binary string of equal length) and each internal node representing the hypothetical ancestor, the Fitch-Hartigan algorithm and the Sankoff algorithm are two polynomial-time algorithms which assign binary strings to internal nodes such that the total Hamming distance between adjacent nodes in the tree is minimized. However, these algorithms oversimplify the evolutionary process by considering only the number of protein insertions/deletions (Hamming distance) between two species and by assuming the evolutionary history of each protein is independent. Since the function of a protein may depend on the existence of other proteins, the evolutionary history of these functionally dependent proteins should be similar, i.e. functionally dependent proteins should usually be present (or absent) in a species at the same time. Thus, in addition to the Hamming distance, the protein linkage distance for some pairs/sets of proteins: whole block linkage distance, partial block linkage distance, pairwise linkage distance is introduced. It is proved that the phylogenetic tree reconstruction problem to find the binary strings for the internal nodes of a phylogenetic tree that minimizes the sum of the Hamming distance and the linkage distance is NP-hard. In this thesis, a general algorithm to solve the phylogenetic tree reconstruction with protein linkage problem which runs in O(4 DEGREESm-n) time for whole/partial block linkage distance and O(4 DEGREESm-- (m+n)) time for pairwise linkage distance (compared to the straight-forward O(4 DEGREESm- m- n) or O(4 DEGREESm- m DEGREES2-- n) time algorithm) is introduced where n is the number of species and m is the length of the binary string (number of proteins). It is further shown, by experiments, that our algorithm using linkage information can construct more accurate trees (better matches with the trees constructed by biologists) than the algorithms using only Hamming distance. DOI: 10.5353/th_b4961816 Subjects: Phylogeny Combinatorial analysis