A Database of Drosophila Genes & Genomes

FB2013_03, released May 7th, 2013
 

Reference Report

Reference
Citation Ladoukakis, E., Pereira, V., Magny, E.G., Eyre-Walker, A., Couso, J.P. (2011). Hundreds of putatively functional small open reading frames in Drosophila.  Genome Biol. 12(11): R118. (Export to RIS)
FlyBase ID FBrf0218190
Publication Type Research paper
PubMed ID 22118156
PubMed Abstract The relationship between DNA sequence and encoded information is still an unsolved puzzle. The number of protein-coding genes in higher eukaryotes identified by genome projects is lower than was expected, while a considerable amount of putatively non-coding transcription has been detected. Functional small open reading frames (smORFs) are known to exist in several organisms. However, coding sequence detection methods are biased against detecting such very short open reading frames. Thus, a substantial number of non-canonical coding regions encoding short peptides might await characterization.Using bio-informatics methods, we have searched for smORFs of less than 100 amino acids in the putatively non-coding euchromatic DNA of Drosophila melanogaster, and initially identified nearly 600,000 of them. We have studied the pattern of conservation of these smORFs as coding entities between D. melanogaster and Drosophila pseudoobscura, their presence in syntenic and in transcribed regions of the genome, and their ratio of conservative versus non-conservative nucleotide changes. For negative controls, we compared the results with those obtained using random short sequences, while a positive control was provided by smORFs validated by proteomics data.The combination of these analyses led us to postulate the existence of at least 401 functional smORFs in Drosophila, with the possibility that as many as 4,561 such functional smORFs may exist.
DOI 10.1186/gb-2011-12-11-r118
Related Publication(s)
Supplementary material Additional file 4: Excel file with the sequences of 401 smORFs representing our conservative estimate. [FBrf0217420]

hide Recent Updates
Description
What does this section display?
This section contains items that were added to this record for each release. It currently only tracks new links between this FlyBase report and other FlyBase data classes (e.g. genes, references, stocks) or controlled vocabulary terms (e.g. GO, anatomy terms).
What does this section not display?
This section does not currently display links that were removed or gene model changes.
Update Feed
Click the icon below to subscribe to this FlyBase record and receive updates automatically through your feed reader.
FB2013_03
FB2013_02
All updates Click here to see a list of all updates to this record from FB2010_08 and on.
hide Associated Information
Comments
Associated Files
hide Other Information
Secondary IDs
Language of Publication English
Additional Languages of Abstract
Also Published As
hide Parent Publication
Publication Type Journal
Abbreviation Genome Biol.
Title Genome Biology
Publication Year 2000-
ISBN/ISSN 1465-6906
hide Data from Reference
hideGenes (25)