Subject: More on Or19a duplication Dear Susan et al., I've spent the afternoon fighting with the apparent duplication of Or19a, about 50kbp downstream and in opposite orientation, that several of us are aware of. From the BACR11H15 sequence in GenBank I discovered a few new neat things that I think convincingly bring the DmOr number to 62. First, the duplication is much longer, extending +/-850bp upstream and +/-700 downstream for a total of around 2940bp and an average divergence of just 1%. The entire Or19a sequence is below, coding exons in upper case. Or19a \- MDISKVDSTRALVNHWRIFRIMGIHPPGKRTFWGRHYTAYSMVWNVTFHICIWVSFSVNLLQSNSLETFCESLCVTMP HT LYMLKLINVRRMRGQMISSHWLLRLLDKRLGCDDERQIIMAGIERAEFIFRTIFRGLACTVVLGIIYISASSEPTLMY PT WIPWNWRDSTSAYLATAMLHTTALMANATLVLNLSSYPGTYLILVSVHTKALALRVSKLGYGAPLPAVRMQAILVGYI HD HQIILRLFKSLERSLSMTCFLQFFSTACAQCTICYFLLFGNVGIMRFMNMLFLLVILTTETLLLCYTAELPCKEGESL LT AVYSCNWLSQSVNFRRLLLLMLARCQIPMILVSGVIVPISMKTFTVMIKGAYTMLTLLNEIRKTSLE aatagatattgaagaatagaatgagaaaaattgtttcaaattattcaaaagtaggtgtaaggaatggcgtaaccaatg tg cttttggtatggcggtagaaatttacaaaataaacaataaaacgaagaaaaatctacacatctttcaaaagtgtgagc gt ggcagtagtgggcggatggtgggcgtgccgaaaagttgtttggcatatcgataaaatctatagaactaacaaaaatgt aa aagaacatcttctatagttcctgagatcgagacgttcatacgaaatgaccgacggatggattgaaatcgccagctcga ca cggctattgatccggatcaagaatgtatatactttatatcgagtatacccttgtactctacgaggaacggatatataa aa atgtttcacttcctttgtcgcgtctcgcagccgccaaaattttgaagacacagttcaactatgcgccaccgcagctat gc agccttatcttcgttaactaagctatctcccatcgcagcgatttgccaaacaactatttgccacgcccacaaatcggc ca aacctttcacgccgttgaacacaacgctttagataaaataacgctttaaaaaaaaaatttaaacaaaagtgtgggaat aa ttataaaaaccttttcatttaccagaaagctaagtggcgctaatgatgataaggaggcgattgatttgcgaaaaatga tt gtcattgtaattgctggtggtggttcaaaatcgaactttttcggtgtaatttgcagcaatttgcataatttgcggcat ag gtggcttattggatgaatataaaaaggtcggaacgcggagagctgcttcaaagaggcgaacaaccATGGACATATCGA AG GTGGATTCAACGAGGGCTCTGGTTAACCACTGGCGCATCTTCAGGATTATGGGAATCCATCCGCCGGGCAAGAGGACC TT CTGGGGTCGCCACTACACGGCGTACTCCATGGTGTGGAACGTAACCTTCCACATCTGCATCTGGGTGTCCTTTTCGGT CA ATCTCCTGCAGTCCAATTCGCTGGAGACTTTCTGCGAGAGCCTCTGCGTGACCATGCCGCACACGCTCTACATGCTTA AG CTGATCAATGTCCGTCGGATGCGCGGCCAGATGATCAGCAGCCACTGGTTGCTCCGTCTCTTGGACAAGCGGCTTGGC TG CGACGACGAACGCCAGATCATTATGGCCGGCATCGAGCGGGCCGAGTTCATATTCCGCACCATTTTTCGCGGCCTTGC GT GCACCGTCGTCCTTGGCATCATCTACATATCCGCGTCCAGCGAGCCCACGCTGATGTACCCCACCTGGATTCCCTGGA AC TGGAGGGACAGCACCTCCGCCTACCTGGCCACCGCCATGCTGCACACGACCGCCCTCATGGCGAATGCGACACTTGTC CT CAATCTGAGCTCCTATCCGGGCACCTACCTCATCCTGGTCAGTGTCCACACCAAGGCGCTCGCCCTGCGGGTCTCCAA AT TGGGATATGGCGCGCCACTACCGGCGGTTCGGATGCAGGCCATTCTGGTTGGTTACATCCACGACCACCAGATCATTT TG CGgtgagtgtcaggaaatctcatctcccaatgcaagaacttttaaagcatttcgggagttttgaccttcatcgaaagg cg tatgtacacacactttggcgtgccaaacatcttcattgtcattgaagattattatatcctttttctcaactacagCCT CT TCAAGTCACTGGAGAGATCCCTTTCGATGACCTGCTTTCTGCAGTTCTTCAGCACGGCGTGTGCGCAGTGCACAATCT GC TACTTTCTACTCTTCGGGAACGTCGGGATCATGAGGTTCATGAATATGTTGTTCCTGCTGGTGATCCTCACCACGGAG AC CCTCCTTCTCTGCTACACGGCGGAGCTACCTTGCAAGGAAGGGGAGAGCCTCCTGACCGCTGTCTACAGCTGCAACTG GC TGTCCCAGTCGGTTAACTTTCGGAGACTCCTGCTCCTGATGCTCGCACGCTGCCAGATTCCAATGATCCTGGTCTCCG GC GTAATTGTGCCCATCAGCATGAAGACCTTCACGGTGgtgagtgctgtacaaagctaacatttacccaccttatcagcg at tttttgcagATGATTAAGGGAGCGTACACCATGCTTACTCTGCTGAATGAAATTCGTAAAACGTCCCTTGAATAGaac tg aaatcgtaggcaatatagtatatataattcatatgttcaaacacacacttgggaaacagtcgctttcctattcgcttc gc actgactttatctgagtaagaggtatccgatagtcgcagaactcggcaatggcattctctttggtctacccgcagtcg cc agaatttcctaaatttgttattttcaaacatgacaactttgaagacgcagttcagcatagacgcatcgatgaagaata aa taatcgttcgctaaagtaaataaacattaatcgtttgaaatcactacataaaaatcaatcgccttttcattgctcgaa aa aagcaaataagcattttaattatttttttttattattatttgggaataagttaacagtaatcgcatatactagatgtc ct gcaaatgttgattttggcttagtactcctcggattacatgcatgagctgaaataggtacagtatttcacattccttct cc ttttgggcaatcagcctctccatgtcctgtatttcctcatcgatgcgctggatctcttcaaaaagaaaaagcttcaat tg tcgaagaagaagcctgtaaggactgcttaccctccgtgtaagtaggcgaacttatcatattcgatacactcggctgct ca ggatccttcctcgtcaaatactcgttgtttctagttgaatatgcattctcgttcgactt Second, within the coding region there are now six single bp changes, three of which change the encoded amino acids, so this needs to be annotated as a separate slightly divergent gene, presumably Or19b by the accepted gene nomenclature for Ors. .. Or19b \- MDISKVDSTRALVNHWRIFRIMGIHPPGKRTFWGRHYTAYSMVWNVTFHICIWVSFSVNLLQSNSLETFCESLCVTMP HT LYMLKLINVRRMRGEMISSHWLLRLLDKRLGCADERQIIMAGIERAEFIFRTIFRGLACTVVLGIIYISASSEPTLMY PT WIPWNWKDSTSAYLATAMLHTTALMANATLVLNLSSYPGTYLILVSVHTKALALRVSKLGYGAPLPAVRMQAILVGYI HD HQIILRLFKSLERSLSMTCFLQFFSTACAQCTICYFLLFGNVGIMRFMNMLFLLVILTTETLLLCYTAELPCKEGESL LT AVYSCNWLSQSVNFRRLLLLMLARCQIPMILVSGVIVPISMKTFTVMIKGAYTMLTLLNEIRKTSLE aatagatattggggaatagaatgagaaaaattgtttcaaattattcaaaagtaggtgtaaggaatggcgtaaccaatg tg tttttggtatggcggtagaaatttacaaaataaacaataaaacgaagaaaaatctacacatctttcaaaagtgtgagc gt ggcagtagtgggcggatggtgggcgtgccgaaaagttgtttggcaaatcgataaaatctatagaactaacaaaaatgt aa aagaacatcttctatagttcctgagatcgagacgttcatacgaaatgaccgacggatggaccaaatcgccagctcgac ac ggctattgatccggatcaagaatgtatatactttatatcgagtatacccttttactcttcgagtaacgagtataaaaa tg tttcacttcctttgtcgcgtctcgcagccgccagaattttgaagacacagttcaactatgcgccaccgcagctatgca gc cttatcttcgttaactaagctatctcccatcgcagcgatttgccaaacaactatttgccacgcccacaaatcggccaa ac ctttcacgccgttgaacacaacgctttagataaaataacgctttaaaaaaaaaatttaaacaaaagtgtgggaataat ta taaaaaccttttcatttaccagaaagctaagtggcgctaatgatgataaggaggagattgatttgcgaaaaatgattg tc attgtaattgctggtggtggttcaaaatcgaactttttcggtgtaatttgcagcaatttgcataatttgcggcatagg tg gcttattggatgaatataaaaaggtcggaacgcggagagctgcttcaaagaggcgaacaaccATGGACATATCGAAGG TG GATTCAACGAGGGCTCTGGTTAACCACTGGCGCATCTTCAGGATTATGGGAATCCATCCGCCGGGCAAGAGGACCTTC TG GGGTCGCCACTACACGGCGTACTCCATGGTGTGGAACGTAACCTTCCACATCTGCATCTGGGTGTCCTTTTCGGTCAA TC TCCTGCAGTCCAATTCGCTGGAGACTTTCTGCGAGAGCCTCTGCGTGACCATGCCGCACACGCTCTACATGCTTAAGC TG ATCAATGTCCGTCGGATGCGCGGCGAGATGATCAGCAGCCACTGGTTGCTCCGTCTCTTGGACAAGCGGCTTGGCTGC GC CGACGAACGCCAGATCATTATGGCCGGCATCGAGCGGGCCGAGTTCATATTCCGCACCATTTTTCGCGGCCTTGCGTG CA CCGTCGTCCTTGGCATCATCTACATATCCGCGTCCAGCGAGCCCACGCTGATGTACCCCACCTGGATTCCCTGGAACT GG AAGGACAGCACCTCCGCCTACCTGGCCACCGCCATGCTGCACACGACCGCCCTCATGGCGAATGCGACACTTGTCCTC AA TCTGAGCTCCTATCCGGGCACCTACCTCATCCTGGTCAGTGTCCACACCAAGGCGCTCGCCCTGCGGGTCTCCAAATT GG GATATGGCGCGCCACTACCGGCGGTTCGGATGCAGGCCATTCTGGTTGGGTACATCCACGACCACCAGATCATTTTGC Gg taagtgtcaggaaatctcatctcccaatgcaagaacttttaaagcatttcgggagttttgaccttcatcgaaaggcgt at gtacacacactttggcgtgccaaacatcttcattgtcattgaagattattatatcctttttctcaactacagCCTCTT CA AGTCACTGGAGAGATCCCTTTCGATGACCTGCTTTCTGCAGTTCTTCAGCACGGCGTGTGCGCAGTGCACAATCTGCT AC TTTCTACTCTTCGGGAACGTCGGGATCATGAGGTTCATGAATATGTTGTTCCTGCTGGTGATCCTCACCACGGAGACC CT TCTTCTCTGCTACACGGCGGAGCTACCTTGCAAGGAAGGGGAGAGCCTCCTGACCGCTGTCTACAGCTGCAACTGGCT GT CCCAGTCGGTAAACTTTCGGAGACTCCTGCTCCTGATGCTCGCACGCTGCCAGATTCCAATGATCCTGGTCTCCGGCG TA ATTGTGCCCATCAGCATGAAGACCTTCACGGTGgtgagtgctgtacaaagctaacatttacccaccttatcagcgatt tt ttgcagATGATTAAGGGAGCGTACACCATGCTTACTCTGCTGAATGAAATTCGTAAAACGTCCCTTGAATAGaactga aa tcgtaggcaatatagtatatataattcatatgttcaaacacacacttgggaaacagtcgctttcctattcgcttcgca ct gactttatctgagtaagaggtatccgatagtcgcagaactcggcaatggcattctctttggtctacccgcagtcgcca ga atttcctaaatttgttattttcaaacatgacaactttgaagacgcagttcagcatagacgcatcgatgaagaataaat aa tcgttcgctaaagtaaataaacattaatcgtttgaaatcactacataaaaatcaatcgccttttcattgctcgaaaaa ag caaataagcattttaattatttttttttattattatttggcaataagttaacagtaatcgcatatactagatgtcctg ca aatattgattttggcttagtactcctcggattacatgcatgagctgaaataggtacagtatttcacattccttctcct tt tggacaatcagcctctccatgtcctgtatttcctcatcgatgcgctggatctcttcaaaaagaaaaagcttcaattgt cg aagaagaagcctgtaaggactgcttaccctccgtgtaagtaggcgaacttatcatattcgatacactcggctgctcag ga tccttcctcgtcaaatactcgttgtttctagttgaatatgcattcgcgttcgactt Hugh Hugh M. Robertson, Professor Department of Entomology, University of Illinois at Urbana-Champaign 320 Morrill Hall, 505 S. Goodwin Ave., Urbana, IL 61801