1999 |
|
1. | Ramakrishna, Ramaswamy; Srinivasan, Ramachandran Gene identification in bacterial and organellar genomes using GeneScan Journal Article Computers and Chemistry, 23 (2), pp. 165–174, 1999, ISSN: 00978485. Abstract | Links | BibTeX | Tags: Fourier, GeneScan, Haemophilus, Mycoplasma, Plasmodium @article{Ramakrishna1999, title = {Gene identification in bacterial and organellar genomes using GeneScan}, author = {Ramaswamy Ramakrishna and Ramachandran Srinivasan}, url = {https://ramramaswamy.org/papers/071.pdf}, doi = {10.1016/S0097-8485(98)00034-5}, issn = {00978485}, year = {1999}, date = {1999-01-01}, journal = {Computers and Chemistry}, volume = {23}, number = {2}, pages = {165–174}, abstract = {The performance of the GeneScan algorithm for gene identification has been improved by incorporation of a directed iterative scanning procedure. Application is made here to the cases of bacterial and organnellar genomes. The sensitivity of gene identification was 100% in Plasmodium falciparum plastid-like genome (35 kb) and in 98% in the Mycoplasma genitalium genome (‚àº580 kb) and the Haemophilus influenzae Rd genome (‚àº1.8 Mb). Sensitivity was found to improve in both the Open Reading Frames (ORFs) which have been identified as genes (by homology or by other methods) and those that are classified as hypothetical. False positive assignments (at the nucleotide level) were 0.25% in H. influenzae genome and 0.3% in M. genitalium. There were no false positive assignments in the plastid-like genome. The agreement between the GeneScan predictions and GeneMark predictions of putative ORFs was 97% in M. genitalium genome and 86% in H. influenzae genome. In terms of an exact match between predicted genes/ORFs and the annotation in the databank, GeneScan performance was evaluated to be between 72% and 90% in different genomes. We predict five putative ORFs that were not annotated earlier in the GenBank files for both M. genitalium and H. influenzae genomes. Our preliminary analysis of the newly sequenced G + C rich genome of Mycobacterium tuberculosis H37Rv also shows comparable sensitivity (99%). textcopyright 1999 Elsevier Science Ltd. All rights reserved.}, keywords = {Fourier, GeneScan, Haemophilus, Mycoplasma, Plasmodium}, pubstate = {published}, tppubtype = {article} } The performance of the GeneScan algorithm for gene identification has been improved by incorporation of a directed iterative scanning procedure. Application is made here to the cases of bacterial and organnellar genomes. The sensitivity of gene identification was 100% in Plasmodium falciparum plastid-like genome (35 kb) and in 98% in the Mycoplasma genitalium genome (‚àº580 kb) and the Haemophilus influenzae Rd genome (‚àº1.8 Mb). Sensitivity was found to improve in both the Open Reading Frames (ORFs) which have been identified as genes (by homology or by other methods) and those that are classified as hypothetical. False positive assignments (at the nucleotide level) were 0.25% in H. influenzae genome and 0.3% in M. genitalium. There were no false positive assignments in the plastid-like genome. The agreement between the GeneScan predictions and GeneMark predictions of putative ORFs was 97% in M. genitalium genome and 86% in H. influenzae genome. In terms of an exact match between predicted genes/ORFs and the annotation in the databank, GeneScan performance was evaluated to be between 72% and 90% in different genomes. We predict five putative ORFs that were not annotated earlier in the GenBank files for both M. genitalium and H. influenzae genomes. Our preliminary analysis of the newly sequenced G + C rich genome of Mycobacterium tuberculosis H37Rv also shows comparable sensitivity (99%). textcopyright 1999 Elsevier Science Ltd. All rights reserved. |