AGTACAACCATCCAGAGGTCGAAAATGCAATGCAATTTCACAAAGTGATCAAATCGTGAAATTACCTAAAGTGCTACGAG TCGCGGTCATTCGAGCCAATGGATCATCTATTATGCAACATGGCGGGCAACAGTCGCAATTCGTCCTACAATTGTCTCTC CTCGTGGGACTCCTATGATCGAATACCCAGAGTTCGGGTGGAGTACTATGTGAACGAAAACACATTCAAAGAAAGACTGC AACTCTATTTCATTAAGAATCAGCGTTCAAGCCTGCGCATACGAATCGCCGACTTATTCCTTAAGTTACTGTCGTGTGTT CTCTACATAATACGTGTGATATTGGATAAAAATCCAACATTTATAACATGCTATGGCTGCGAAGTGGGCAATAAGACGGA GTTCATCATCTCGGCCAAACTGACGGAGGAGGAGTTCCAGGAGAATCCAATCATTAACTGGGACGCAATACTCTGGGTGA ATCGGCCAACAGTGCTCTGGGTCCTGCAGCTGCTTCTAGCCATGGTGTCGCTAACGCAATCCTTGGTTCTCACATATCTA GGCTATAAGGGCAACATCTGGCAGCAGATACTCTCCTTTCACTTTATACTAGAATTAGTAACGACAATACCCTTTGCACT TACGATTGTCCATCCCCCGCTAAGAAATTTATTCATTCCCATTTTCCTCAATTGCTGGCTGGCCAAGCGATCGCTGGAGA ACATGTTTAATGACCTCCATCGCGCCATGCAAAAGTCCCAGTCAGCCCTCTCCCAGCAGTTGACCATTCTATCAGCCACA CTGCTGTGTTTGGTCTTCACGAGCGTTTGTGGTATCCAGCACTTTCAGCGCGCTGGCCATCGACATTTGAATCTCTTTCA GAGCACATACTATGTGGTTGTGACCTTCTCCACAGTGGGATACGGCGACTTTGTGCCGGACATTTGGCCCTCGCAGCTTT ATATGGTCATCATGATTTGTGTCGCCCTCATTGTGTTGCCCACGCAGTTTGAGCAGCTCGCCTTCACGTGGATGGAGCGC CAAAAGCTGGGCGGCAGTTACAGTTCACATCGCGCCCAAAGCGAAAAGCACGTGGTGGTGTGCTCCACCACCCTGCATGC GGACACCATAATGGACTTCCTCAACGAGTTCTATGCCCATCCCCTGCTGCAGGACTTCTATGTGGTGCTGCTCAGTCCCA TGGAGCTGGACACGACGATGCGGATGATCCTGCAGGTGCCCATTTGGGCGCAGCGCGTCATCTACATTCAGGGATCATGT CTAAAGGATGGCGACTTGGCGCGCGCCAGGATGAACGAGGCAGAGGCGTGCTTTATCCTGGCGGCCAGAAATTATGCGGA CAAAACGGCCGCCGACGAGCATACCATTCTACGTTCCTGGGCCGTCAAGGACTTCGCACCGAATGTACCGCAGTATGTGC AGATATTCAGACCTGAGCACAAGCTGCATGTGAAGTTCGCGGAGCACGTGGTCTGCGAGGACGAGTTCAAGTACGCCCTA CTGGCCAACAACTGCACCTGTCCCGGCGCCAGCACACTGGTCACCCTGCTGCTCCACACATCCCGCGGACAGGAGGGCCA GCAGAGTCCGGAGGAATGGCACCGTCTGTACGGCAAGTGTTCCGGAAACGAGATCTACCACATCGTCCTGGGCGACAGTC GATTCTTTGGCGAGTACGAGGGCAAGAGCTTCACCTACGCCAGCTTCCATTCGCATCGCAAATATGGAGTGGCCCTGGTG GGCGTGCGACCGGCGGAGCTGCCGGAATTCTACGAGGAAACCATTCTATTGAATCCCGGACCCAGGCACATTATGAAAAA GGATGACACGTGCTATTACATGAGCATCACCAAGGAAGAGAATTCCGCATTTGTCGTTAATCAAAATCAAACATCGGACC CCACGGCAGCTGCCAAGGAGGGGAGCGGGACAGGGGGCGGAGGCGGTGCCTCCTCATCCGCCTCCCACCACCACACTGCT GCAACTGCTAGCGAAAATCCGACGGCTGTGATAATCTCGGACTCTAGGCAGAACCTCAAGGACACGACGGTGACCCAGAC GGCGGCAACGATAACCACAACGACTCTGCCGCCACCGCCCCAGACGATGGGATCCCCCAGCATGGCGGGCGGATCGGGTG GATGTGGTGGAGGCGGCCTCGGCTTGGGCAGTGCCCACAGCATGGGCACCTCGCTCAGTATCACCCCAGCCACACTGACC ACCACGGGCAATCACCTGGATGTGCCCTTTGCCAACAATCCCAACCTGCTCAGCCCAGATGTGCTCAACCAGCGGAGGGG TAGTAGACGTCCCTCGATCCTACCCGTGCCCGACATGTTCACCTCCTCCTCATTCAGCATCGCCGGCAACGACGATGGTG AAGAGGGGGACGAGAGTGATGATGAGATCGACGACGAGATGCCCTGGCGTTCGCCATCCGAGAAGATTGCCTGCTTGGGC GGGCACTTTCCACAATCGCGCACCTATTCGCTCATCATGAGCTCCTCGGAGGATTCGTATCAGCGCGGTTGCAGCTTCTG CAGTGCCACTGCTTCCGCTTCCGCTGCTGCTGTTGCTGTTGCTGTTGCTGCTGCTCCTGCTCCTGCTCCTGCTCCTGCAT ATCCAAGTGATTCTGCAGGCGAGTCCGGAACTAATGCCGATTCCGCTGCCATGCCATCCGAGGAGTATCTACCGCAGTTG CGCAGGCGCGTCATGAAAAAGAGCTACAGCTGCGATAGTGAGTGCCGCTCGGTGCCCGGAATGGGAATGGGCATGGCCAT GGGTGTGGGTCTGGGTGGAGGAACTCTGGCCAGACTGGCGGCCAGAAGGCGTCAGTTGCAGCGATGTTCCTCCTGCAGTT GCTCCACTGCCACAACAACAACCACGCCGGCGGTGGCAGCGACAACAGCAATGGCAGCAGGAAGTGGGGCAACGGCATTC ACGTCGAGCAGTTCCGTAGAGACACGTCGACCGGTGCGTCCGGTTTGGGTCTACGACTACTCCTGCATCGTCAAGGGGTT TCCACCTGTGTCACCCTTCATTGGCGTTAGTCCCACGCTCTGCTATTTGCTCAAAGAGAAGAAGCCACTCTGCTGTTTGC AGTTGGCTCAGGTCTGCGAGCACTGCAGCTATCGAAACGCAAAGGAGTACCAGTGGCAGAACAAGACGATTATCCTGGCT GCGGATTACGCCTCCAATGGCATATACAACTTCATCATCCCGCTGAGAGCTCACTTCCGATCGAAGACCTCGCTGAATCC CATCATCCTGCTTCTGGAGAGAAGGCCAGATGTGGCCTTCCTAGATGCGTTGTCTTATTTTCCTTTGGTCTACTGGATGC TGGGATCGATCGACTGCCTGGACGATCTGCTGAGGGCTGGCATCACATTGGCAGAAAGTGTGGTGGTGGTCAACAAGGAG CTCTCCAATTCGGCCGAGGAGGACTCCCTCTCAGACTGCAACACCATAGTGGCCGTGCAGAATATGTTCAAATTCTTTCC CAGCATCAAGAGCATTACCGAGCTGTCGCAGAGCTCCAACATGAGATTCATGCAGTTCCGGGCCCACGACAAGTATGCCC TCCATCTGAGCAAAATGGAGAAGGAGCGCGGGTCGCATATCTCCTATATGTTCCGTCTGCCCTTTGCCGCAGGAGCCGTG TTCAGTGCCTCCATGTTGGACACCCTGCTGTACCAGGCCTTCGTGAAGGACTATGTGATTACCTTTGTGCGGTTGCTGCT TGGCATTGACCAGGCGCCGGGCAGTGGGTTCTTGACCTCGATGCGCATCACCAAGGACGACATGTGGATACGCACCTATG GTCGGCTGTACCAGAAGCTGTGCTCCACCACCTGCGAAATACCCATTGGCATTTACCGCACGCAGGACACCTCGAATGCG GACACGTCTCATTACTCCATCAATCTGGCCGACGAGGCGAGGGATAACCACGCTCAGCAGATCGAGCGGGCGGAGATCGC GAATCTCGTGCGAAGTCGGATGGAGTCTCTGAATCTACCCACGATCGACTACGACGATGTGAGCGAGAAACGCAACCACC TGTCCTATGTGATAATCAACCCGAGCTGTGATCTCAAACTGGAGGAAGGCGATCTCATATACTTGGTGAGGCCGTCGCCG TTCTCTGCCCAAAAGACATTCGAGCGACACAACTCGCGCCGCAAGTCGAACATCTCGTTCTGCTCGAACATCAACCTGGG TGCCACATGCGGCCCTCAGATGCCGCAGATGAACATGAACATGGCCAACACCGCCGTCGGGGCTGGATCGCGTCGAGGAT CCGGCATTGCCGGATTGAACCCCATGCAAATGCAGAGCGTTCAGACCTTGGCCGGGTATGGATCATCCTCGCAGCGCTGT AGTCCCCCGATGCAGCAAATTAAATCGAATTCTCTTTCTCTACCCGACAGTCCGACGGTGGTTGGCAATCAGCGAGGACG GAGTAACTCATTGCGGATCGACAACGATATACTGCTGCGTCGATCCTCGTCCCTGCGACAGGGCCTTCCCAGCGTGGGCG TCAGCCACGGCCGGAGAAAGTCGTCGCTGGAAGAGATCGGCATAAGTCACTTCACTACCCTCATGCAGGCAACGAACCAT AGTAATCCCATTAAGATATCCCTAAACGGTAGCATCGGTATGGAGAACCAGATCTCCTTGCAGGTGACGCCGCCCGAGGA GCCCACACCCATGTTAGGTGTGCCATGTATGTTAGGCGGTGGCGGTGGAGGTGGCATTAACCCATCTGGAGCGGGTTCCT CCACCGGCGGCATGCTGGGTGGTGGCTCCTCACTGGCAATCAATACCGCTGACCTGGGACCCGGACCAAGTACCTCATCG GGCGCCAGTGGATCACTGCAAGCACAGGACTCGCTGGGACAGCAGTCATCTCAGGTGTCATCGCCACAGCATTTGCAGGG AACGATCGTATGATACAACCTGATGTGGGGGCGCCGACTCCGCTCCTCCATGTCGAAAAACAAGTGGATATAGAATACGG CCAGCACACACAGACACACCCGTAGAGATAAGTTCTAGGCAGAGCGAAAGTATTAGGAACCCAACCAAATAAACCCACCC ACCCCAAAGGCTGTCTGTGGTGTGCGTTCCCGGTTAAGTCACTTCGAGTCCCATTAGATTAGTCTCCTACGTTTAATAGA TCAGTAGGAAGTGGCGCGTGCTTATCAGTTTATCGCTAGGAAGTAGGCTTATATTGTATATTGCATGTTTTGCAAATTAG TTGTAAGCTATGAACATGTCGGCTTTGCACAAAGGAAGCCGCGGCAAATTGTGTTTTGGGAACAACAAATGTAGTTAAAC TATTGGATAGGGGATTGGATATAGAAGATGATTGAGAAAACTGTATAACTGTATATATCGAATGCCTAGTTCTAAGCCAG GTGTAGCAAACTATCAAATAGATCGGGCTTAAACGCAAAAACCATGTATATACCTTAGGTATGACCACTTAGTTGGGCTT CAGCTCGGTTTAGCTTCCCCTTCCTTCGGGAGGATCTTCCTCTGACCCACGTATAACCCCATATAGATACATATATGAAG GTGCTATCAATGTCTTGTACTATACACACATTCAGATTCACATTTTGTAAAACTACTAGGAGTGTTTTACGTAATTATAG GAGAAAAGGAGGAGAAAAACAAACCAGAAAGTGAAACGGTTTTCATAACATAAATTGGAACGGGCTTTGGTATACTCTAA TGCTTTACAAAACCAACAGAAATATGTATACATATAGTAAAATAAATCAAAGAATTTGTATATAAACTATGTGAGTAAGA ACCAAACATGCATTAAAACTTAAAAAAAAAAAGGAAAAACAAACTACTAGAACTTTATGAACAAAACTTAATTCGAAAGA CAACAGAATTTAGATATAGAATTTAGAGTTAACTATTTAAGGCTAGTTCGAACCAAGGATCAAGTATCGGAAACTGTACC AAATTCAATGAGCTATGTGAACCAGCGTTCGCATCTAACACAACCCTGTAGCAAAGTGTAACTCAATTCGGTCCCAGCTA TAGGCTCTTTAGAAATTAATTAAAGCAAACTATGTACCTACTCAACCACATGCCACATGCCACATACCACACAAACATTC ACACTCACAAAAATGTTAGTTAAAGCACTTACGTCAGTTTTATAAGTAAATTTTAGTCAACGATGACGATGAAATGCATT CCAGTGTCTCGAGGGATTACCTAGTTTAGCTTATGCCTAGCACCCCAGAAACCTATTTATATAGACATATATAGTTCCGT ATATTACTAAAATGAATCTGTACATCTGCAACCACATAGGCCACATAGATCAAATCGATGTCTCAATGTACAAAAGTAGT TAACTGAGCCGCAATTGGCATTGGCTGCGGCATTAGAGATACGATTAGTGCGGACTAGGGTATAAACTTGCTGTAATATT TATCGGCGCCTTGCTCTTTAAGGCATGCCGAGCAGAACTCACAATAAACTCAAAATATAGTTGGAAATAAATATGAGCC
5' UTR based on RNA-Seq data.
Extended 3' UTR based on RNA-Seq data.