Open Access

Downloads

Download data is not yet available.

Abstract

The problem of discovering the similar sub-sequences in a set of DNA biological sequences is an important problem of bio-technology. From the positions of similar sub sequences, we can discover the features of a group of similar function genes or the position for point mutation. In this paper, we analyze two problems of repeated and upproximate sub sequence discovery and employ the large set discovery algorithm for discovering the repeated sub-sequence and the genetic algorithm for discovering the approximate sub-sequence in a set of DNA biological sequence. In the repeated sub-sequence discovery algorithm, we consider the repeated sub-sequence as a large set and employed the Apriori-TiD algorithm (Agrawal , 1994) for discovering the maximal large set. In the approximate sub-sequence discovery algorithm, we consider the chromosome of genetic algorithm as a potential solution and employ the genetic algorithm for selecting the right solution. Two proposed algorithms work very well with large data set, therefore they satisfy the demand of the large data set of genes. Besides, we also propose a heuristic for improving the speed of solution discovery. We apply our proposed algorithms to the data of the promoters from the University of Irvine, USA and show the experiment results.



Author's Affiliation
Article Details

Issue: Vol 3 No 7&8 (2000)
Page No.: 5-11
Published: Aug 31, 2000
Section: Article
DOI: https://doi.org/10.32508/stdj.v3i7&8.3573

 Copyright Info

Creative Commons License

Copyright: The Authors. This is an open access article distributed under the terms of the Creative Commons Attribution License CC-BY 4.0., which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

 How to Cite
Kiem, H., & Phuc, D. (2000). DEVELOPING ALGORITHMS FOR FINDING THE SIMILAR MOTIF IN DNA SEQUENCES. Science and Technology Development Journal, 3(7&8), 5-11. https://doi.org/https://doi.org/10.32508/stdj.v3i7&8.3573

 Cited by



Article level Metrics by Paperbuzz/Impactstory
Article level Metrics by Altmetrics

 Article Statistics
HTML = 918 times
Download PDF   = 332 times
Total   = 332 times