Gene or accession: CG14751
Gene annotation error: Genes CG14751 and CG8643 should be merged.
Comments: I've sequenced the entire EST GH19651, and the cDNA encompasses both of these 'genes'. My EST consensus sequence matches the genomic sequence except in two places where a 'g' has been substituted for an 'a', both making amino acid changes. These two 'g's are also present in the sequence posted for an overlapping EST (GH06282).
..

Subject: Re: Re: FlyBase error report for CG14751 on Wed Jan 24 18:08:42 2001

Dear Gillian,

Here's an update. It appears that the EST GH27921 is a different splice variant from GH19651, and the region present in GH19651 that puts the two coding regions out of frame is removed in GH27921. So in effect, GH27921 codes for both of those CGs as one protein. SD02665 and GH06282 appear to have other splice variations as well, but I don't have all the sequence on those. I'll submit the sequences for GH27921 and GH19651 to GenBank in the next month or so, since I'm publishing a paper on this gene in the next few months.

Thanks,
Audra
..

Subject: update for FlyBase error report for CG14751

Dear Gillian,

..The bottom line now is that EST GH19651 is likely an incompletely processed transcript containing a small intron that disrupts the reading frame, while EST GH27921 has that intron spliced out. .. CG14751 and CG8643 .. do indeed code for one large C2H2 zinc finger protein from one transcript. I'm sequencing GH27921 in its entirety to verify the rest of the sequence against what I got for GH19651. At least now they'll know that CG14751 and CG8643 are one transcript and one continuous protein (minus the first MQ of CG8643). Once I have the sequence verified for GH27921 (in a couple of weeks), I'll submit the entire DNA and protein sequence to GenBank.

Hope that helps,
Audra
..
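The frame argument above (a retained intron in GH19651 that is spliced out in GH27921) can be checked mechanically by comparing the two cDNA sequences. The following is only a minimal sketch under stated assumptions: the function names `find_insertion` and `disrupts_frame` are hypothetical, the two sequences are short made-up examples rather than the real ESTs, and it assumes the two sequences differ by exactly one contiguous insertion with no other mismatches.

```python
def find_insertion(longer: str, shorter: str):
    """Return (start, segment) if `longer` equals `shorter` plus one
    contiguous inserted segment; return None otherwise."""
    extra = len(longer) - len(shorter)
    if extra <= 0:
        return None
    # Walk along the shared prefix until the sequences diverge.
    start = 0
    while start < len(shorter) and longer[start] == shorter[start]:
        start += 1
    # After skipping `extra` bases, the tails must be identical.
    if longer[start + extra:] != shorter[start:]:
        return None
    return start, longer[start:start + extra]


def disrupts_frame(segment: str) -> bool:
    """An inserted segment shifts the downstream reading frame
    unless its length is a multiple of three."""
    return len(segment) % 3 != 0


# Toy sequences (hypothetical, NOT the real GH27921 / GH19651 data).
spliced = "atggccaagttc"
retained = "atggcc" + "gtaagtctag" + "aagttc"

hit = find_insertion(retained, spliced)
if hit is not None:
    start, segment = hit
    print(start, segment, len(segment), disrupts_frame(segment))
```

When the first bases of the inserted segment happen to match the bases that follow it, the reported boundary can shift by a few positions, so the exact splice junction still needs to be confirmed against the genomic sequence.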

Subject: Re: update for FlyBase error report for CG14751

Hi Gillian,

..I have submitted the sequence to GenBank for the gene regular (rgr), corresponding to the locus that the EST GH27921 represents. The GenBank accession number is AF353512, and it is scheduled for release in June. From my other e-mail account (which handles attachments better) I'll send you the protein and DNA sequences for GH27921 that I submitted to GenBank, as well as the DNA sequence for GH19651, to help FlyBase clarify this region. Those attachments will be in Word.

I hope this helps,
Audra
..

Here are the sequence files for the regular locus.

Audra

<<rgr protein (GH27921).doc>> <<rgr sequence (GH27921).doc>> <<GH19651 DNA sequence.doc>>

--------------------------------------------------------------------------------
<<rgr protein (GH27921).doc>>
MMPLQESPSPSWQLEDYAFFRKCGEITVSPDVQSFGFNCAFCPAICLQFSVFMDHIRVQHTQDVSRRYETEEERCPQTD LVIMELDCPEIYPPEAPRKCSWEADMDIQLMSLVPSPPAVHIAPLPPPKTTETPEIDLVYVVENPLPPSPPPSADISVD ELPGSSPQSFDLDPFAVLHEVVTMTPPTPIAPPPGPPTPNPTPTATPAPLPRYETRRVAMQRKLRQSQNEQPKDGAVDE VAVEEHKGTEPTGQPEKSAMELTPPATPPTPLPPTSPTSSTASCADFQSKSRRELERNHEFVGCLLREYERTEKLWNPR HPDYKYNAKRSAYGDLAGPLESICHRQLSGAEIFAVLKELRCRYRRELKKVNALGGKYKSRLWYFERMDFLRCVIENRR AEREAKISNESTESEKSCETDADSCGKSSLYHEVLSFILDAFKRQECLWNPQHYDYTTCCKTELFRDISVQLQEELNYE LSGEECCNEIQKLRTRYRKELRMVIKHKGLYLPKLWCYDEMEFLQPILQEQIFNKISKKIGVVGSNQKTKFIDASSIRF DNTEKQLQFVEIYHNCSALWDVDHPDFRSNTYRSQALGQMLDEINTTFHTSYTAERLEKTLFNLRKEFSAQKRKILTES EDSSSIPLLHAKLAEFLDQNLGPFRCDICSDLVKTCDQYKVHRSAHDGTQPFICTLCGKGFQMPCNLTVHIRRHRRDFP YSCEQCDKRFATSTEVAIHLRTHTGERPYICDLCGKSFKTWSFFDIHRRRHLNQSTFHCPICAKGFYEKNRFTDHMNSH WAIRKHLCTVCGKTFTTYGNLKKHTELHLAVKKYKCGTCGKRFAQFASLRWHKKREHSSVGQAGGK

--------------------------------------------------------------------------------
<<rgr sequence (GH27921).doc>>
cagcggcgttggttcggaaaagtgcagtctccgcgtcgcgagtgcgccaataacgagaaataacgagaattgaaaaagt tccttaagaggtggagtccagtaaagcgcagcagcaatgatgccgctgcaggagtcgccgtcgcccagttggcagctgg
aggactacgctttcttccgcaagtgcggggaaatcaccgtctcgccggacgtgcagagcttcggcttcaactgcgcctt ctgtccggcgatctgcctgcagttctccgtgttcatggaccacatccgggtgcagcacacgcaggacgtgagccggcgc tacgagactgaggaggagaggtgtccgcaaaccgatctggtcatcatggaactcgactgccccgagatttatccgccgg aggcgcctaggaaatgcagctgggaggccgacatggacatccagctaatgagcctcgttccgtcgcctcccgcagtgca catagcacctctaccaccacctaaaacgaccgaaacaccagaaatagatttggtctatgtggttgaaaatccattacca ccctcgccgccgccctccgcggacatttcggtcgacgagctgccgggaagcagtccgcagtccttcgacctggatccct ttgccgtgctgcacgaggtagtaacgatgactccgcccactcccattgctccaccaccgggcccgccgacccccaatcc gactccaactgcgactccggccccattgccgcggtacgagacacgacgcgtggcaatgcagcgcaagcttcgtcagtcg caaaacgagcaacctaaagatggggcggtggacgaggtcgcggtggaggagcacaagggaacggaacccactggacagc cggagaagagtgctatggaactaacaccacctgccacgccgcccaccccactgccgcccacctcgcccacgagcagcac ggcaagctgtgcggatttccaatcgaaaagtcgcagggagctagaacgtaatcacgagttcgttggctgtctgctgagg gagtacgagcgcacggaaaagctgtggaatccccggcatccggactacaagtacaatgccaagcgcagtgcctacggtg atctggccggtccgctggagtccatttgccacaggcagctctccggagccgagatcttcgctgtgctaaaggagttgag gtgcagataccgccgcgagctgaaaaaggtgaacgccctgggtggaaagtacaagtcgcgtctgtggtactttgagagg atggactttctgcggtgtgtcatcgaaaacaggcgagccgaaagggaagccaagatttccaatgagagcacagagagtg aaaagtcgtgcgaaacggacgctgattcctgcggcaagtcatctttgtaccacgaggttctgagtttcattctggatgc cttcaagaggcaggaatgcctgtggaatccgcagcactacgactacaccacatgctgtaagacggaactgtttcgcgac atctccgtccagttgcaggaggagctcaactatgagctgagcggcgaggaatgctgcaacgagatccaaaaacttagga cacgttaccgcaaggagctgcgcatggttattaagcacaagggattgtacctacccaagttatggtgttacgatgaaat ggagttcctgcagcccatactgcaggagcagatcttcaacaagatcagcaagaaaatcggagtggtgggaagcaaccag aagaccaagttcatcgatgccagctcaattcgttttgacaatactgagaaacaactgcagtttgtagagatttaccaca actactcagctctgtgggatgttgaccatcccgacttccgatcgaatacgtatcgtagtcaggctttgggtcaaatgtt ggatgagataaacacaacctttcacacttcctacactgcggagcagttggaaaagaccttgttcaatctgcgcaaagaa ttctccgcccagaaacggaagatacttacggagtccgaagactccagcagcattccgctgctgcatgccaaactggcag agttcctagaccaaaatcttggtccttttcgttgcgatatatgctcggacctggtcaagacctgcgatcagtacaaggt 
gcatcgatccgcacacgatggcacccaacccttcatctgcactc

--------------------------------------------------------------------------------
<<GH19651 DNA sequence.doc>>
gccggttgtttggatcgcagattgaatcggaaagccagcggcgttggttcggaaaagtgcagtctccgcgtcgcgagtg cgccaataacgagaaataacgagaattgaaaaagttccttaagaggtggagtccagtaaagcgcagcagcaatgatgcc gctgcaggagtcgccgtcgcccagttggcagctggaggactacgctttcttccgcaagtgcggggaaatcaccgtctcg ccggacgtgcagagcttcggcttcaactgcgccttctgtccggcgatctgcctgcagttctccgtgttcatggaccaca tccgggtgcagcacacgcaggacgtgagccggcgctacgagactgaggaggagaggtgtccgcaaaccgatctggtcat catggaactcgactgccccgagatttatccgccggaggcgcctaggaaatgcagctgggaggccgacatggacatccag ctaatgagcctcgttccgtcgcctcccgcagtgcacatagcacctctaccaccacctaaaacgaccgaaacaccagaaa tagatttggtctatgtggttgaaaatccattaccaccctcgccgccgccctccgcggacatttcggtcgacgagctgcc gggaagcagtccgcagtccttcgacctggatccctttgccgtgctgcacgaggtagtaacgatgactccgcccactccc attgctccaccaccgggcccgccgacccccaatccgactccaactgcgactccggccccattgccgcggtacgagacac gacgcgtgcgccagcgattgtgtacgatttccagcaccacgataaaccccggacaagatcgtctggctgggcagcgctt atctcgcagagagcccaacactggcaacgttacaaatttatttatttatgcgaaatatttatacttactctatactcta tgcaggcaatgcagcgcaagcttcgtcagtcgcaaaacgagcaacctaaagatggggcggtggacgaggtcgcggtgga ggagcacaagggaacggaacccactggacagccggagaagagtgctatggaactaacaccacctgccacgccgcccacc ccactgccgcccacctcgcccacgagcagcacggcaagctgtgcggatttccaatcgaaaagtcgcagggagctagaac gtaatcacgagttcgttggctgtctgctgagggagtacgagcgcacggaaaagctgtggaatccccggcatccggacta caagtacaatgccaagcgcagtgcctacggtgatctggccggtccgctggagtccatttgccacaggcagctctccgga gccgagatcttcgctgtgctaaaggagttgaggtgcagataccgccgcgagctgaaaaaggtgaacgccctgggtggaa agtacaagtcgcgtctgtggtactttgagaggatggactttctgcggtgtgtcatcgaaaacaggcgagccgaaaggga agccaagatttccaatgagagcacagagagtgaaaagtcgtgcgaaacggacgctgattcctgcggcaagtcatctttg taccacgaggttctgagtttcattctggatgccttcaagaggcaggaatgcctgtggaatccgcagcactacgactaca ccacatgctgtaagacggaactgtttcgcgacatctccgtccagttgcaggaggagctcaactatgagctgagcggcga ggaatgctgcaacgagatccaaaaacttaggacacgttaccgcaaggagctgcgcatggttattaagcacaagggattg
tacctacccaagttatggtgttacgatgaaatggagttcctgcagcccatactgcaggagcagatcttcaacaagatca gcaagaaaatcggagtggtgggaagcaaccagaagaccaagttcatcgatgccagctcaattcgttttgacaatactga gaaacaactgcagtttgtagagatttaccacaactgctcagctctgtgggatgttgaccatcccgacttccgatcgaat acgtatcgtagtcaggctttgggtcaaatgttggatgagataaacacaacctttcacacttcctacactgcggagcggt tggaaaagaccttgttcaatctgcgcaaagaattctccgcccag
--------------------------------------------------------------------------------