Dear Gillian,

I am the graduate student in Richard Cheney's lab who has done most of the myosin gene annotation. You recently sent Richard an e-mail about curating the Oliver et al. paper regarding the myosin gene at 29C3-D1. This gene does indeed correspond to the CG10595 transcript.

I am also attaching our most recent work, which stems from the Oliver et al. paper, entitled 'A Millennial Myosin Census.' This is a more comprehensive reference that we think identifies the full complement of Drosophila myosin genes, including a novel myosin at 95E (AE003746) that does not appear to be recognized in FlyBase. This myosin was partially predicted as CG5501 (acc. # AAF56246), but the Celera prediction lacks most of the motor domain and tail domain. I have attached my predicted amino acid sequence (Dm95E.txt) and cDNA (Dm95E_cds.txt) for this myosin in FASTA format so that you can compare them to FlyBase. I have also attached a Word document containing the GenScan output I obtained for AE003746. I did find two EST sequences (acc. # AI405551 and AA698637) that are 99-100% identical to the GenScan-predicted coding sequence, with the exception of several gaps, suggesting that the longer transcript is expressed in some form.

For the sake of completeness, you may also wish to look at a recent paper by Yamashita et al. (2000), 'Identification and analysis of the myosin superfamily in Drosophila: a database approach,' J. Muscle Res. Cell Motil. 21, 491-505, which takes an approach similar to ours and arrives at essentially the same list of genes.

Thank you for your efforts in annotating the fly myosins, and please let me know if you have any other questions.
Sincerely,
Jonathan Berg
Department of Cell and Molecular Physiology
The University of North Carolina at Chapel Hill

-------------------------------------------------------------------------------
>Dm95E AE003746|GENSCAN_predicted_CDS_26|3249_bp
atggagcaggaaatcggcacctgggactcggtactgttggagaacctgtccgaggatagt ttcataaacaacatccaccagcgctataagcgcgatcacatatatacctacattggaaca tctgttgtggctctgaatccatatcatcacatatccgagcactctctggacaatgtccgc aactatggcgataagggcattttccagctgccgccccacatatatggtctcacaaatctg gcttatcaatcgctcaaagatcagagcgaggatcagtgtgttctgctcaccggtgagagc ggagcgggcaaaacggagacttttaaaatgatcgtgaactttctgacccacatacaagat cgctcccactgccccccaacaccgaatgttttgcgcaagcaatcctcaactagctcggcc agcggattggtgatgcacgcccacaggcgagcctccagcagctgctccggcactgccaat tttattatatgcaaaaaccgggcggaaaatccgtcaggcagtgtttcacggcgacaaagt ccatcgccaggaccatcgcagcgatcgcggacgcgggccgagagcatcgagcgccaaagc aggcgccacatgcgggagaaaattgtcgactttgatttctcacaccacaagagtagcgaa aacatcagcggccttcctgaatcgcacgcccaccacatgcatccgaccaagtcgtgcttc aagcaccagcagacccaagtcagcgcctgcactgcaatgcccgcagcagccaagggatcg cccaaatacgcggtacccaccgtgtacggcggttgccgccagtgcggacacagcaagtgt gtccgtgctcagagcctggaaaaggaggagcgggatgatctacgaggcagcaactgccgc ctgtctaccatagccactgccaccaccaatccggcacatccgcatcgcggtagttgctcc aacctgatgcgccagcactcgacagagagtcagccggagcgggagcgggatcgaagctcc cttatggggtccacgcaacgtatatcgctgtacgatgcacacaagctgagtaaggttctg ggcgatctgccgccgcctccgcctagttctgtttcgcccacgccatcggccagctcttcg ctgcacaggcgtcataaatcgcccacgcaacgaatgcgagagtgcgtcacctgtgcggat gtgttcctggaggccatgggcaatgcctgcaccctgaaaaataataactctagtcgatat acccgtataacggaccctatcattggagagcgaaattttcacatcttttaccaattacta ttaggagctgatctccagttgctaaaatcgctcaagctgtatcgaaatgtggaaaagtac gagctgctccgcaacacaactgccatggaggaggaccgcatgaattttcattatacgaag aattttgtgtttcatgtgctgcgttcggagcaagagctctatattcgcgagggattggaa tggtctcgcattgactatttcgacaacgagtctatttgcgagttaatagacaaacccagc tatggtatattgagcttgattaatgaaccccatttaaatagcaacgacgctttgcttttg cgagttcagcaatgttgtgcggggcatcccaactttatgaccaccggcagcaattccatg
tgctttcagattcgtcattatgcaagtgtagtgaactactcaatacatcggtttctcgaa aagaactccgacatgctgccgaagtacataagcgctgccttttatcagagcaaactttct ttggtgcaaagcctattccccgaggggaatccccgtcgacaggttaccaaaaagcccagc acgttgagttcgaatatccgcacccaattgcagacgctgctggccatcgttaagcatcgc cgctcccactatgtgttctgtattaagcccaacgagggcaagcagccgcaccagttcgat atggctctagtgcaacatcaggtgcgctacatgtcgctcatgccgctggtccacctgtgt cgcactggccattgctaccacctgttgcacgttaagttttttcatcgctataagttgctc aacagcctgacgtggccccactttcatggcggcagtcaggtagagggtatcgccctcata atccgtaacctaccgctgccctcagcggagttcacgatcggcaccaaaaatgtgttcgtg cgtagtccccgcaccgtatatgagttggaacagtttcgccgcctgcgtattagcgagctg gccgtgcttattcaaaccatgttccgaatgtatcacgcaaggaagcgctttcagcgcatg cgacacagccagatgatcatatcgagtgcctggcgcacgtggcgggcccgcgaggagtat cggtccttgaagtacaaacgacaggtgagatgggccatcgatattataggccgctactac cgccagtggaagatcagacagttccttctgacaattcccttgcgactgccaccgaacacg ctaagcccgctctccaccgaatggccagtggctcccgcatttctggcagatgcctctcgt catcttaggtccatataccatcgttggaagtgctacatctaccgaaactcctttgatcaa acggcgcgtaatcgaatgcgggagaaggtcacagccagcattatcttcaaggatcgaaaa gcttcatatggacgaagtgtgggtcatccttttgtgggggactacgtgcgactgcgacac aaccagcagtggaaaaagatctgcgccgagaccaacgatcagtatgttgtattcgcagac ataatcaacaagatagcgcgctccagtggcaagtttgtgcccattttgctggtgctatcc acgtcatcgcttttgctgttggaccaacgaacgctgcaaattaagtacagagtgcctgca tcggagatttaccgaatgtctctgagcccctacctagatgacattgctgtgtttcactct gaatttggacggaagaagggtgatttcgtttttcaaacgggtcatgtgattgaaattgtt accaaaatgtttctggtcatacaaaatgccacaggcaaacccccggagatacacataagc actgaatttgaagcgaacttcggccagcagactgtcatcttttcgttcaaatacggcggc atgtcggacttagcacaaggcccacccaaggtcacacgcaaggcgaaccgcatggagata attgtgtga \------------------------------------------------------------------------------- >Dm95E AE003746|GENSCAN_predicted_peptide_26|1082_aa MEQEIGTWDSVLLENLSEDSFINNIHQRYKRDHIYTYIGTSVVALNPYHHISEHSLDNVR NYGDKGIFQLPPHIYGLTNLAYQSLKDQSEDQCVLLTGESGAGKTETFKMIVNFLTHIQD RSHCPPTPNVLRKQSSTSSASGLVMHAHRRASSSCSGTANFIICKNRAENPSGSVSRRQS PSPGPSQRSRTRAESIERQSRRHMREKIVDFDFSHHKSSENISGLPESHAHHMHPTKSCF 
KHQQTQVSACTAMPAAAKGSPKYAVPTVYGGCRQCGHSKCVRAQSLEKEERDDLRGSNCR LSTIATATTNPAHPHRGSCSNLMRQHSTESQPERERDRSSLMGSTQRISLYDAHKLSKVL GDLPPPPPSSVSPTPSASSSLHRRHKSPTQRMRECVTCADVFLEAMGNACTLKNNNSSRY TRITDPIIGERNFHIFYQLLLGADLQLLKSLKLYRNVEKYELLRNTTAMEEDRMNFHYTK NFVFHVLRSEQELYIREGLEWSRIDYFDNESICELIDKPSYGILSLINEPHLNSNDALLL RVQQCCAGHPNFMTTGSNSMCFQIRHYASVVNYSIHRFLEKNSDMLPKYISAAFYQSKLS LVQSLFPEGNPRRQVTKKPSTLSSNIRTQLQTLLAIVKHRRSHYVFCIKPNEGKQPHQFD MALVQHQVRYMSLMPLVHLCRTGHCYHLLHVKFFHRYKLLNSLTWPHFHGGSQVEGIALI IRNLPLPSAEFTIGTKNVFVRSPRTVYELEQFRRLRISELAVLIQTMFRMYHARKRFQRM RHSQMIISSAWRTWRAREEYRSLKYKRQVRWAIDIIGRYYRQWKIRQFLLTIPLRLPPNT LSPLSTEWPVAPAFLADASRHLRSIYHRWKCYIYRNSFDQTARNRMREKVTASIIFKDRK ASYGRSVGHPFVGDYVRLRHNQQWKKICAETNDQYVVFADIINKIARSSGKFVPILLVLS TSSLLLLDQRTLQIKYRVPASEIYRMSLSPYLDDIAVFHSEFGRKKGDFVFQTGHVIEIV TKMFLVIQNATGKPPEIHISTEFEANFGQQTVIFSFKYGGMSDLAQGPPKVTRKANRMEI IV \------------------------------------------------------------------------------- GENSCAN 1.0 Date run: 2-Aug-100 Time: 15:08:37 Sequence AE003746 : 226318 bp : 44.61% C+G : Isochore 2 (43 \- 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. 
\----- \---- \- \------ \------ \---- \-- \-- \---- \---- \----- \----- \------ 1.05 PlyA \- 297 292 6 1.05 1.04 Term \- 2142 1618 525 0 0 76 53 461 0.692 35.66 1.03 Intr \- 2571 2470 102 0 0 47 29 117 0.528 2.07 1.02 Intr \- 3211 3058 154 0 1 30 85 217 0.995 15.67 1.01 Init \- 7886 7612 275 2 2 85 113 295 0.827 28.14 1.00 Prom \- 7995 7956 40 \-7.96 2.00 Prom \+ 9051 9090 40 \-8.86 2.01 Init \+ 9315 9453 139 2 1 60 97 253 0.999 23.70 2.02 Intr \+ 9516 9809 294 1 0 60 15 523 0.991 39.18 2.03 Term \+ 9878 11271 1394 0 2 78 45 1678 0.999 153.87 2.04 PlyA \+ 11890 11895 6 1.05 3.03 PlyA \- 12262 12257 6 \-1.75 3.02 Term \- 13936 12728 1209 1 0 61 48 1431 0.908 128.06 3.01 Init \- 14666 14049 618 2 0 63 107 960 0.999 90.75 3.00 Prom \- 14835 14796 40 \-9.36 4.00 Prom \+ 15166 15205 40 \-10.35 4.01 Init \+ 15627 16380 754 2 1 28 113 854 0.999 76.77 4.02 Intr \+ 16713 16883 171 1 0 65 97 248 0.999 23.31 4.03 Intr \+ 17029 17214 186 2 0 46 119 212 0.999 19.76 4.04 Intr \+ 17424 17590 167 1 2 78 59 155 0.961 11.28 4.05 Intr \+ 17686 17938 253 0 1 38 100 181 0.697 11.31 4.06 Term \+ 17997 18742 746 1 2 111 34 971 0.999 87.62 4.07 PlyA \+ 19751 19756 6 1.05 5.06 PlyA \- 20217 20212 6 1.05 5.05 Term \- 21112 20952 161 2 2 15 39 220 0.903 7.90 5.04 Intr \- 21263 21183 81 0 0 60 62 65 0.505 0.71 5.03 Intr \- 21408 21315 94 0 1 109 94 87 0.998 10.94 5.02 Intr \- 22384 21476 909 1 0 69 115 1208 0.692 113.08 5.01 Init \- 22783 22556 228 1 0 97 121 332 0.999 35.77 5.00 Prom \- 23309 23270 40 \-6.56 6.00 Prom \+ 23463 23502 40 \-8.56 6.01 Init \+ 25450 25545 96 0 0 84 79 \-34 0.635 \-4.39 6.02 Term \+ 25604 26206 603 1 0 85 37 959 0.880 85.02 6.03 PlyA \+ 26266 26271 6 1.05 7.03 PlyA \- 26322 26317 6 \-1.95 7.02 Term \- 27584 26533 1052 0 2 33 39 1554 0.998 137.30 7.01 Init \- 27755 27641 115 2 1 68 39 208 0.999 14.29 7.00 Prom \- 28178 28139 40 \-12.21 8.00 Prom \+ 28402 28441 40 \-7.16 8.01 Sngl \+ 29165 30508 1344 1 0 85 48 1759 0.999 167.74 8.02 PlyA \+ 30687 30692 6 1.05 9.00 
Prom \+ 31440 31479 40 \-7.86 9.01 Sngl \+ 31636 32079 444 0 0 61 36 516 0.385 39.94 9.02 PlyA \+ 32182 32187 6 1.05 10.02 PlyA \- 32228 32223 6 1.05 10.01 Sngl \- 34224 32317 1908 0 0 89 31 3523 0.999 340.61 10.00 Prom \- 34464 34425 40 \-1.26 11.00 Prom \+ 35001 35040 40 \-8.76 11.01 Init \+ 35521 35919 399 0 0 32 75 866 0.999 76.37 11.02 Intr \+ 35975 36123 149 1 2 28 66 135 0.998 4.33 11.03 Intr \+ 36182 36350 169 2 1 63 12 324 0.998 22.25 11.04 Intr \+ 36409 37431 1023 0 0 60 94 1582 0.999 146.81 11.05 Intr \+ 37495 38064 570 0 0 56 64 751 0.999 62.46 11.06 Intr \+ 38131 38386 256 0 1 91 106 378 0.999 36.92 11.07 Intr \+ 38443 38747 305 2 2 53 106 388 0.999 33.41 11.08 Intr \+ 38811 38911 101 2 2 56 92 65 0.999 2.61 11.09 Term \+ 38977 39739 763 1 1 54 38 718 0.844 56.29 11.10 PlyA \+ 39910 39915 6 1.05 12.04 PlyA \- 40060 40055 6 1.05 12.03 Term \- 41180 41099 82 1 1 60 43 55 0.200 \-4.63 12.02 Intr \- 41450 41302 149 2 2 123 101 41 0.267 7.93 12.01 Init \- 49043 48981 63 2 0 51 29 208 0.746 12.35 12.00 Prom \- 56701 56662 40 \-1.26 13.00 Prom \+ 57307 57346 40 \-4.46 13.01 Init \+ 63107 63182 76 1 1 45 38 95 0.181 1.45 13.02 Term \+ 72419 72783 365 0 2 26 43 200 0.111 4.03 13.03 PlyA \+ 72843 72848 6 1.05 14.03 PlyA \- 73068 73063 6 1.05 14.02 Term \- 81562 80687 876 1 0 \-33 35 1881 0.010 163.28 14.01 Init \- 84420 83944 477 0 0 39 117 764 0.462 69.82 14.00 Prom \- 84472 84433 40 \-16.42 15.00 Prom \+ 84522 84561 40 \-16.39 15.01 Init \+ 84568 85404 837 0 0 71 64 619 0.181 52.50 15.02 Intr \+ 85888 85933 46 0 1 22 57 68 0.268 \-4.92 15.03 Intr \+ 86396 86707 312 0 0 35 55 345 0.366 22.06 15.04 Term \+ 86814 87016 203 1 2 \-8 37 389 0.739 21.75 15.05 PlyA \+ 87071 87076 6 \-1.75 16.04 PlyA \- 87093 87088 6 \-1.75 16.03 Term \- 87393 87160 234 0 0 13 37 217 0.850 5.42 16.02 Intr \- 87667 87452 216 1 0 66 \-3 181 0.807 5.50 16.01 Init \- 89441 87882 1560 2 0 81 100 1617 0.997 154.67 16.00 Prom \- 90388 90349 40 \-6.66 17.02 PlyA \- 90403 90398 6 1.05 17.01 
Sngl \- 93196 91556 1641 1 0 69 47 1661 0.999 154.09 17.00 Prom \- 93309 93270 40 \-7.06 18.00 Prom \+ 93895 93934 40 \-10.64 18.01 Init \+ 93995 94162 168 1 0 36 92 141 0.977 8.86 18.02 Intr \+ 97423 97935 513 0 0 92 105 640 0.969 59.26 18.03 Intr \+ 98166 99446 1281 2 0 73 115 2304 0.666 218.90 18.04 Intr \+ 99997 100148 152 0 2 46 52 184 0.991 9.56 18.05 Intr \+ 100205 101581 1377 2 0 52 89 2177 0.999 202.21 18.06 Intr \+ 101657 101832 176 2 2 22 96 312 0.999 24.98 18.07 Intr \+ 101900 103887 1988 0 2 44 97 3060 0.999 289.83 18.08 Intr \+ 104048 104467 420 1 0 97 77 598 0.995 53.74 18.09 Term \+ 104528 104707 180 1 0 64 44 163 0.899 7.11 18.10 PlyA \+ 104736 104741 6 \-3.84 19.07 PlyA \- 104899 104894 6 1.05 19.06 Term \- 105559 105332 228 1 0 67 48 311 0.981 21.63 19.05 Intr \- 106436 106012 425 0 2 60 62 416 0.978 29.79 19.04 Intr \- 106807 106492 316 1 1 74 113 258 0.998 22.54 19.03 Intr \- 106985 106872 114 2 0 85 58 138 0.996 11.14 19.02 Intr \- 107188 107044 145 0 1 58 44 76 0.972 0.48 19.01 Init \- 107399 107242 158 2 2 13 86 95 0.432 1.18 19.00 Prom \- 107494 107455 40 \-16.52 20.00 Prom \+ 107529 107568 40 \-8.96 20.01 Init \+ 107786 108196 411 1 0 102 113 529 0.695 53.31 20.02 Intr \+ 108258 110568 2311 2 1 71 13 2571 0.999 235.24 20.03 Intr \+ 110626 110783 158 2 2 18 97 348 0.999 28.43 20.04 Term \+ 110910 111449 540 2 0 80 39 776 0.877 66.16 20.05 PlyA \+ 111601 111606 6 1.05 21.00 Prom \+ 111769 111808 40 \-7.06 21.01 Init \+ 112049 112066 18 1 0 52 80 24 0.879 \-1.98 21.02 Intr \+ 112127 113031 905 1 2 53 101 1291 0.999 116.99 21.03 Intr \+ 113101 113688 588 1 0 34 87 694 0.999 55.64 21.04 Intr \+ 113747 113936 190 2 1 31 95 188 0.909 13.39 21.05 Intr \+ 114006 114323 318 2 0 111 119 336 0.999 35.15 21.06 Intr \+ 114396 114480 85 2 1 52 29 127 0.725 2.59 21.07 Intr \+ 116899 117006 108 2 0 104 80 208 0.993 21.86 21.08 Intr \+ 117647 117816 170 0 2 71 103 146 0.986 14.07 21.09 Intr \+ 118626 118780 155 2 2 27 84 102 0.413 2.57 21.10 Intr \+ 122313 
122990 678 0 0 66 39 422 0.124 25.83 21.11 Intr \+ 123050 123168 119 2 2 62 92 114 0.985 9.31 21.12 Intr \+ 123229 123422 194 2 2 78 95 274 0.987 26.31 21.13 Intr \+ 123486 123859 374 2 2 \-1 15 465 0.963 24.16 21.14 Intr \+ 123921 124106 186 0 0 68 24 220 0.463 12.40 21.15 Term \+ 124384 124492 109 1 1 39 38 157 0.446 3.78 21.16 PlyA \+ 124546 124551 6 \-3.24 22.02 PlyA \- 124576 124571 6 \-5.80 22.01 Sngl \- 126021 124627 1395 0 0 33 42 1378 0.967 123.76 22.00 Prom \- 126112 126073 40 \-9.06 23.02 PlyA \- 126205 126200 6 1.05 23.01 Sngl \- 127302 126232 1071 0 0 72 38 635 0.985 54.04 23.00 Prom \- 127396 127357 40 \-11.43 24.02 PlyA \- 127573 127568 6 \-0.45 24.01 Sngl \- 128822 127722 1101 2 0 45 47 1255 0.999 114.05 24.00 Prom \- 128902 128863 40 \-11.92 25.00 Prom \+ 129176 129215 40 \-7.46 25.01 Init \+ 130217 130887 671 1 2 68 101 972 0.977 91.18 25.02 Term \+ 130944 131907 964 0 1 46 49 1487 0.999 132.13 25.03 PlyA \+ 132062 132067 6 1.05 26.13 PlyA \- 132299 132294 6 1.05 26.12 Term \- 133259 133136 124 1 1 71 42 190 0.999 10.56 26.11 Intr \- 133492 133365 128 1 2 59 52 86 0.995 1.58 26.10 Intr \- 133706 133563 144 2 0 50 50 97 0.780 2.58 26.09 Intr \- 133902 133767 136 2 1 59 121 168 0.999 17.77 26.08 Intr \- 134295 133964 332 0 2 87 87 414 0.999 35.63 26.07 Intr \- 135207 134511 697 2 1 112 102 597 0.985 54.99 26.06 Intr \- 135514 135267 248 1 2 90 36 151 0.919 6.36 26.05 Intr \- 136303 136209 95 2 2 80 72 234 0.999 20.68 26.04 Intr \- 136446 136362 85 0 1 85 60 41 0.076 0.39 26.03 Intr \- 137806 136768 1039 0 1 47 92 906 0.998 77.40 26.02 Intr \- 137982 137867 116 0 2 72 66 134 0.759 8.85 26.01 Init \- 138150 138046 105 0 0 89 73 101 0.758 8.92 26.00 Prom \- 138528 138489 40 \-13.06 27.00 Prom \+ 138746 138785 40 \-7.86 27.01 Init \+ 138910 139511 602 0 2 64 78 896 0.980 80.96 27.02 Intr \+ 139572 139803 232 0 1 61 127 130 0.967 11.98 27.03 Term \+ 139860 139946 87 2 0 48 42 102 0.958 \-0.74 27.04 PlyA \+ 139950 139955 6 \-9.23 28.04 PlyA \- 139964 
139959 6 \-3.44 28.03 Term \- 140280 139991 290 1 2 20 32 378 0.944 20.94 28.02 Intr \- 140515 140338 178 1 1 62 86 161 0.557 12.79 28.01 Init \- 140605 140588 18 1 0 49 93 8 0.942 \-2.72 28.00 Prom \- 140963 140924 40 \-6.96 29.00 Prom \+ 141010 141049 40 \-9.46 29.01 Init \+ 141294 141929 636 2 0 56 109 1138 0.999 108.04 29.02 Intr \+ 141993 144206 2214 2 0 49 94 1995 0.999 183.54 29.03 Term \+ 144404 144757 354 1 0 83 43 432 0.995 32.69 29.04 PlyA \+ 144861 144866 6 \-1.95 30.06 PlyA \- 144881 144876 6 \-4.73 30.05 Term \- 145264 145058 207 1 0 115 49 436 0.999 39.74 30.04 Intr \- 145923 145594 330 0 0 33 101 546 0.970 46.23 30.03 Intr \- 146174 145977 198 2 0 35 58 244 0.943 15.65 30.02 Intr \- 146366 146251 116 0 2 67 111 60 0.976 6.37 30.01 Init \- 146573 146435 139 2 1 105 81 235 0.897 22.80 30.00 Prom \- 146905 146866 40 \-5.36 31.00 Prom \+ 149718 149757 40 \-4.66 31.01 Init \+ 152206 152307 102 0 0 42 28 99 0.131 \-0.46 31.02 Intr \+ 157413 157572 160 2 1 51 98 262 0.968 23.06 31.03 Term \+ 157707 157837 131 1 2 93 47 190 0.898 13.74 31.04 PlyA \+ 158517 158522 6 1.05 32.00 Prom \+ 160340 160379 40 \-7.36 32.01 Sngl \+ 160961 161119 159 1 0 40 52 338 0.958 17.91 32.02 PlyA \+ 161244 161249 6 1.05 33.00 Prom \+ 163071 163110 40 \-3.36 33.01 Init \+ 163601 163661 61 1 1 67 103 22 0.971 3.01 33.02 Intr \+ 163721 164271 551 0 2 79 105 803 0.999 73.69 33.03 Intr \+ 164341 164535 195 0 0 \-12 83 333 0.986 22.41 33.04 Term \+ 164594 164851 258 1 0 90 38 268 0.739 17.55 33.05 PlyA \+ 164853 164858 6 \-7.21 34.02 PlyA \- 164871 164866 6 \-1.95 34.01 Sngl \- 165712 164924 789 1 0 67 43 1293 0.939 118.63 34.00 Prom \- 166119 166080 40 \-10.84 35.12 PlyA \- 166135 166130 6 1.05 35.11 Term \- 166481 166338 144 2 0 63 32 110 0.955 0.81 35.10 Intr \- 167436 166543 894 0 0 69 79 765 0.942 65.21 35.09 Intr \- 167956 167494 463 0 1 6 23 451 0.964 23.56 35.08 Intr \- 168426 168027 400 1 1 62 99 400 0.949 32.06 35.07 Intr \- 168880 168483 398 0 2 54 75 379 0.997 27.42 35.06 
Intr \- 169108 168936 173 1 2 48 63 134 0.997 5.64 35.05 Intr \- 169244 169167 78 2 0 72 97 101 0.997 9.15 35.04 Intr \- 169438 169312 127 0 1 94 39 71 0.999 3.48 35.03 Intr \- 169771 169498 274 2 1 19 52 273 0.967 13.40 35.02 Intr \- 170045 169826 220 2 1 36 39 92 0.739 \-3.03 35.01 Init \- 170354 170103 252 2 0 49 21 249 0.796 11.66 35.00 Prom \- 170674 170635 40 \-9.26 36.00 Prom \+ 170799 170838 40 \-8.06 36.01 Init \+ 171022 171169 148 0 1 46 61 225 0.977 15.85 36.02 Intr \+ 171232 172166 935 2 2 17 48 927 0.858 72.52 36.03 Intr \+ 172227 172394 168 2 0 70 111 242 0.884 24.84 36.04 Intr \+ 172451 173614 1164 1 0 19 80 1004 0.998 81.65 36.05 Term \+ 173676 173972 297 2 0 89 38 257 0.999 16.07 36.06 PlyA \+ 173985 173990 6 \-0.45 37.09 PlyA \- 174027 174022 6 1.05 37.08 Term \- 176334 175557 778 2 1 59 43 1301 0.998 115.54 37.07 Intr \- 176708 176396 313 0 1 61 89 429 0.961 35.54 37.06 Intr \- 177072 177004 69 1 0 80 95 32 0.802 2.25 37.05 Intr \- 178106 177663 444 0 0 58 86 260 0.692 16.17 37.04 Intr \- 195453 193760 1694 2 2 60 94 1414 0.235 126.52 37.03 Intr \- 196795 196583 213 0 0 115 20 298 0.012 23.53 37.02 Intr \- 197171 196985 187 0 1 72 101 263 0.978 24.75 37.01 Init \- 197349 197235 115 0 1 27 68 193 0.974 11.57 37.00 Prom \- 197861 197822 40 \-5.56 38.00 Prom \+ 198781 198820 40 \-10.35 38.01 Init \+ 199369 199494 126 0 0 44 107 5 0.343 \-1.84 38.02 Term \+ 199556 200443 888 1 0 \-4 41 707 0.947 49.26 38.03 PlyA \+ 200533 200538 6 1.05 39.02 PlyA \- 200548 200543 6 \-1.75 39.01 Sngl \- 203675 200769 2907 2 0 72 39 3147 0.992 298.73 39.00 Prom \- 203750 203711 40 \-6.16 40.00 Prom \+ 203776 203815 40 \-8.56 40.01 Sngl \+ 204144 205238 1095 2 0 81 41 797 0.999 71.30 40.02 PlyA \+ 205556 205561 6 1.05 41.02 PlyA \- 206100 206095 6 1.05 41.01 Sngl \- 206977 206438 540 1 0 46 43 884 0.999 75.79 41.00 Prom \- 208451 208412 40 \-3.16 42.00 Prom \+ 208544 208583 40 \-10.45 42.01 Init \+ 208744 209994 1251 0 0 49 105 1328 0.839 123.13 42.02 Intr \+ 212837 
212975 139 1 1 112 97 243 0.997 27.64 42.03 Intr \+ 213047 213290 244 0 1 67 81 510 0.999 44.76 42.04 Intr \+ 213362 213533 172 2 1 35 91 334 0.610 28.35 42.05 Intr \+ 213843 214142 300 2 0 47 115 594 0.515 54.93 42.06 Intr \+ 214382 216368 1987 1 1 63 105 2327 0.951 219.23 42.07 Intr \+ 216403 216500 98 2 2 62 115 28 0.823 2.63 42.08 Intr \+ 216562 216777 216 0 0 58 92 176 0.811 13.70 42.09 Intr \+ 216840 218064 1225 2 1 49 95 1704 0.601 154.50 42.10 Intr \+ 218481 218596 116 1 2 20 92 122 0.965 5.97 42.11 Intr \+ 218663 219238 576 1 0 64 91 500 0.974 40.82 42.12 Intr \+ 219375 219564 190 2 1 54 98 293 0.625 26.06 42.13 Intr \+ 219675 224662 4988 1 2 95 75 5546 0.997 541.45 42.14 Intr \+ 224733 224909 177 2 0 49 113 279 0.977 26.62 42.15 Term \+ 224996 225034 39 1 0 77 41 60 0.885 \-2.61 42.16 PlyA \+ 225843 225848 6 1.05 Predicted peptide sequence(s): Predicted coding sequence(s): >AE003746|GENSCAN_predicted_peptide_1|351_aa MKMVDKSSKLVDKFAVISGTGGSGPGGGPLNLQSGGGGGGSGGGGGVSGSGGISGTAGLS AGAGMTTGQKPVPMKLFAAWEVDRTPPNCIPSAAGAAAAADAFPLSQRHLCIIEITGDES AAVLLAMTDDSGLRVWVTRVFVELANLLVIRVDGKVQQESQLNSMYASINQQPSHENVFA TVDSSSVLVELLLPALDFPFSISPSVFVCTRVQGEGEGAGVEAWKLEVPRHCPGMSTSSL LTRPATCAQVEFCSHTNNDRITANSGNTNGCSNSYTNNSHYNNAKDGSNCNNSRKQRASW RKLPLPQVPSSQFHVPGSMLAKSRQAKRATDVSCINRSPPDPQLYLHPRLR >AE003746|GENSCAN_predicted_CDS_1|1056_bp atgaagatggtcgataagtcgtccaagttggtggataaatttgccgtaatcagcggcacc ggcggcagcggccctggaggcggtccgctcaacctgcaaagtggcggaggaggcggcggt tccggcggcggaggaggagtctccggatctggtggcatatccggaacggctggcttgtcc gccggagcaggaatgaccactggccagaaaccggtgcccatgaagctattcgccgcctgg gaagtggaccgaacgccgcccaattgcatcccaagtgctgctggtgctgctgctgctgct gatgcttttccgctgtcccaacgccatctgtgcatcatcgagattaccggcgacgagtca gcagctgtgcttttggccatgaccgatgattcaggtcttcgagtgtgggtcacgcgggtg tttgtggagctggcaaatctcctggtcattagggtcgatggaaaagtgcagcaggaaagt caattaaattcgatgtatgccagcatcaaccagcagccgagccatgagaatgtcttcgcg accgttgactcatccagtgtgttggtcgaactgcttctgcctgcactggattttccattt 
tccatttccccctcggtttttgtttgcactcgcgtccaaggcgaaggcgaaggtgctggc gtggaagcgtggaaacttgaagtaccacgacattgtccaggcatgtcaacgtcaagtttg ttgactcgcccggcaacctgcgcccaagtggagttttgcagccacaccaacaacgaccgt atcaccgcaaacagcggcaacacgaacgggtgcagcaacagctacaccaacaacagtcac tacaacaacgccaaagacggcagcaactgcaataacagccgcaaacaacgagcaagttgg cgaaaacttccgctcccccaagttcccagttcccagttccatgttccaggttccatgctg gccaaaagcagacaggccaaacgagctacggatgtcagctgcataaatcgttcgcctcca gacccccagttgtaccttcacccccgccttcgatga >AE003746|GENSCAN_predicted_peptide_2|608_aa MADAWDIKSLKTKRNTLREKLEKRKKERIEILSDIQEDLTNPKKELVEADLEVQKEVLQA LSSCSLALPIVSTQVVEKIAGSSLEMVNFILGKLANQGAIVIRNVTIGTEAGCEIISVQP KELKEILEDTNDTCQQKEEEAKRKLEVDDVDQPQEKTIKLESTVARKESTSLDAPDDIMM LLSMPSTREKQSKQVGEEILELLTKPTAKERSVAEKFKSHGGAQVMEFCSHGTKVECLKA QQATAEMAAKKKQERRDEKELRPDVDAGENVTGKVPKTESAAEDGEIIAEVINNCEAESQ ESTDGSDTCSSETTDKCTKLHFKKIIQAHTDESLGDCSFLNTCFHMATCKYVHYEVDTLP HINTNKPTDVKTKLSLKRSVDSSCTLYPPQWIQCDLRFLDMTVLGKFAVVMADPPWDIHM ELPYGTMSDDEMRALGVPALQDDGLIFLWVTGRAMELGRDCLKLWGYERVDELIWVKTNQ LQRIIRTGRTGHWLNHGKEHCLVGMKGNPTNLNRGLDCDVIVAEVRATSHKPDEIYGIIE RLSPGTRKIELFGRPHNIQPNWITLGNQLDGIRLVDPELITQFQKRYPDGNCMSPASANA ASINGIQK >AE003746|GENSCAN_predicted_CDS_2|1827_bp atggcagatgcgtgggacataaaatcactcaagacaaagcggaacacgctccgcgagaag ctggaaaagcgcaagaaggagcgcattgagattctctcggacattcaggaggatctgacg aatcccaaaaaggaactcgttgaggctgacttggaagtacagaaggaggtgctgcaggcc ctcagttcctgctccctggccctgcccattgtctccacacaggtggtggaaaagattgcc ggcagcagcctggagatggtgaactttattctaggcaaactggccaaccagggtgcaatt gtgatccggaacgtgaccatcggcaccgaggccggctgcgaaatcatctctgtgcagccc aaggagctgaaggagatcctggaagacaccaacgatacgtgtcagcaaaaggaggaggag gccaagagaaagttggaagtcgacgatgttgatcagccacaggagaagacaataaaactg gagtccactgtagcgcggaaagagtccacaagcctggatgctccggacgatattatgatg ctgctatccatgccttccacccgcgaaaaacagagtaagcaggtgggcgaggagattctt gagctgcttactaagcctacggccaaagagcgatccgtggctgagaagtttaagtcccac ggcggagcgcaggtcatggagttttgctcccacggcacaaaagtcgagtgcctgaaggct caacaagccaccgccgagatggcggccaaaaagaaacaagaaagaagagacgaaaaggag 
cttcgcccggatgttgatgcgggcgaaaacgtcaccggtaaggtacccaagacagaatcg gcagccgaggatggagaaatcattgcggaggttataaacaattgcgaagccgagtcgcag gaatccaccgatggcagcgatacctgcagcagtgagacaacagacaagtgcaccaaactg catttcaaaaagatcatccaggctcacacggacgaatctctgggggactgcagtttccta aacacctgcttccacatggccacctgcaagtacgtgcactatgaggtggacaccctgccg catataaacacaaacaagcccacggatgtgaaaaccaaattgagcctcaagcgcagcgta gactccagttgcacgctctatccgccgcaatggattcaatgcgacctacgcttcttggac atgacggtcctgggaaagttcgctgtggtgatggccgatcctccctgggatatccacatg gaattgccctacggcacaatgtcggacgatgaaatgagagcactgggcgtgccggcactt caggatgacggcttgatctttctgtgggtcactggacgagccatggaacttggccgcgac tgcctcaagttgtggggctatgagcgcgtggacgaactcatctgggtaaagaccaaccaa ctgcagcgaattattcgcactgggcgcactggtcattggctaaaccacggcaaggagcac tgtttggtgggcatgaaaggcaatcccacaaacctgaaccgcggactcgattgcgatgtg atcgttgcggaggtgcgggccacctctcacaagccggatgagatttatggtattatcgag cgtctaagcccgggtactcgcaagatcgagctctttgggcgtccgcacaacatccaaccg aactggataaccctgggaaaccaactggatggcattcgactggtggatccggagctaatt acgcagttccagaagcgttatccagatggtaactgcatgtcgccagcttctgccaatgcg gcgtcgatcaatggaatacaaaagtag >AE003746|GENSCAN_predicted_peptide_3|608_aa MGQYTASQRKNVRILLVGDAGVGKTSLILSLVSEEYPEEVPPRAEEITIPANVTPEQVPT SIVDFSAVEQSEDALAAEINKAHVVCIVYAVDDDDTLDRITSHWLPLVRAKCNPSLDGEG DAEAEAEGDTQREPIRKPIVLVGNKIDLIEYSTMDSVLAIMEDYPEIESCVECSAKSLHN ISEMFYYAQKAVLHPTSPLYMMEEQELTSACKKSLVRIFKICDIDGDNLLNDYELNLFQR RCFNTPLQPQILDEVKAVIQKNVPDGIYNDAVTLKGFLFLHCLFIQRGRNETTWAVLRRF GYNDQLEMCQEYLRPPLKIPPGSSTELSHRGQQFLIAVFERYDRDGDGALSPEEHKMLFS TCPAAPWSYSTDIRKSCPINETTGWVTLHGWLCRWTLMTLIDVVKTMEYLAYLGFNVHEN DSQLAAIHVTRERRIDLAKRQSSRSVYKCHVIGPKGSGKTGMCRGFLVEDMHKLIGKEFK TNVVNCINSVQVYGQEKHLILRDIDVRHALDPLQPQEVNCDVACLVYDSSNPRSFEYVAR IYIKYYAESKIPVMIVGTKCDMDERRQDYLMQPSEFCDKYKLLPPHLFSLKTNKKELYTK LATMAAFP >AE003746|GENSCAN_predicted_CDS_3|1827_bp atgggacagtacacggcgtcgcagcgcaagaatgttaggatcctgctcgtcggcgacgcc ggggtgggtaagacgtcgttgattctgtctctggtcagtgaggagtacccggaggaggtt ccgcctagggctgaggagattaccattccggcgaacgtgacgcccgagcaggtgcccacc 
agcatagttgacttctcggccgtggagcagtcggaggatgccctggccgccgaaattaac aaggcgcacgtggtgtgcatagtgtacgccgtggacgatgacgacactttggatcggatc acatcccattggctgccgctcgtccgggccaagtgcaatccctccctggacggcgagggt gatgccgaagcggaggcggaaggagacacacaacgagagcccattcggaagccaatcgtt ctggtgggtaacaaaatagacctcatcgagtattccaccatggacagcgtgctggccatc atggaagactaccccgagattgagagttgcgtggagtgctctgccaagtcgctgcacaac atctccgagatgttttactacgcccagaaggctgtgctgcacccgacttctcccctgtat atgatggaagaacaggagctcacatccgcctgcaagaagtcgctggtgcgcattttcaag atctgcgacattgacggggacaatctgttgaacgattacgagctgaatttattccagcga cgctgtttcaacacaccactccagccgcaaatccttgatgaggtgaaggccgtgatacag aaaaatgtgcccgatggcatatacaacgatgcggtcaccctgaagggcttcctcttctta cactgccttttcattcagcggggaaggaacgagacaacctgggcggtgttacggcgcttt ggctataacgaccagttggagatgtgccaggagtatcttaggccaccgctgaaaataccg cctggcagcagcacagaactctcgcaccgcggacagcagtttctgattgccgtgtttgag cgctatgatcgcgatggcgacggagccttatcacctgaggagcacaagatgctcttcagc acatgcccggctgctccgtggtcctactccaccgacatacgcaagtcctgcccgatcaac gagacaactggatgggtgacactgcacgggtggctatgtcggtggacactgatgacgctg attgatgtggtcaaaacaatggagtatttggcctatttgggcttcaatgttcatgaaaac gacagccagttggcggcaattcacgtaactcgagagcgccgcatcgatttggccaagcgc caaagcagtcggtccgtttacaagtgtcatgtgattggaccaaagggatcaggaaagact ggaatgtgcaggggattcctggtggaggatatgcacaaactaatcggaaaagagtttaaa acgaatgtggttaattgcatcaactctgtgcaggtgtatggccaggaaaagcacctcatt ctgcgtgacatagacgtcaggcatgccctcgatcccctccagccacaagaagtcaattgc gatgttgcctgcctagtctacgactcatctaatccccgttcctttgagtacgtggcccgc atctacatcaagtattatgcggagagcaagattccagtgatgatagtcggcaccaagtgc gacatggacgaacgccggcaggactatctaatgcagccctctgaattctgtgacaagtac aagttgctgcctccgcatctcttcagcctgaaaaccaacaaaaaagaactgtataccaag ctggccacaatggcagcgtttccgtga >AE003746|GENSCAN_predicted_peptide_4|758_aa MVRTKNQSSSSSASSSSTKSPIKSSSGAGSSGGGLGGRQSTHRSSSASNVAAVVAGGSSA AGGGSSSNRRSPGSSPDGDDDTTTTDDLTPTTCSPRSGHHHSYGGYSSSVHKQNLYVVSF PIIFLFNVLRSLIYQLFCIFRYLYGASTKVIYRPHRRDCNIEIVVQNSSKEQQQSLNHPS ELNREGDGQEQQLSNQPQRFRPIQPLEMAANRPGGGYSPGPGDPLLAKQKHHHRRAFEYI 
SKALKIDEENEGHKELAIELYRKGIKELEDGIAVDCWSGRGDVWDRAQRLHDKMQTNLSM ARDRLHFLALREQDLQMQRLSLKEKQKEEAQSKPQKTREPMLAGMTNEPMKLRVRSSGYG PKATTSAQPTASGRKLTIGSKRPVNLAVANKSQTLPRNLGSKTSVGAVQRQPAKTAATPP AVRRQFSSGRNTPPQRSRTPINNNGPSGSGASTPVVSVKGVEQKLVQLILDEIVEGGAKV EWTDIAGQDVAKQALQEMVILPSVRPELFTGLRAPAKGLLLFGPPGNGKTLLARAVATEC SATFLNISAASLTSKYVGDGEKLVRALFAVARHMQPSIIFIDEVDSLLSERSSSEHEASR RLKTEFLVEFDGLPGNPDGDRIVVLAATNRPQELDEAALRRFTKRVYVSLPDEQTRELLL NRLLQKQGSPLDTEALRRLAKITDGYSGSDLTALAKDAALEPIRELNVEQVKCLDISAMR AITEQDFHSSLKRIRRSVAPQSLNSYEKWSQDYGDITI >AE003746|GENSCAN_predicted_CDS_4|2277_bp atggtacgcactaaaaaccagtcgtcctcctccagcgccagcagcagcagcaccaagtca ccaataaagtccagcagcggggcaggatcatccggcggaggactcggtggtcgccagtcc actcaccgctcgtcgagcgcctcaaacgttgctgccgttgttgccggcggttcatctgcc gccggtggaggctcctcatcgaatcgccgtagtccgggcagctcacccgacggcgacgac gacaccaccaccacagatgacctgacccccacaacgtgctccccacgcagcggccaccac cattcgtacggcggctactcgtcctctgtgcacaaacagaacctctacgtcgtctcgttc cccataatctttcttttcaacgttctgcgctcgctgatttatcagctgttttgtattttt cgctacctatacggtgcgagcacaaaggtgatataccgcccgcaccgacgcgactgtaat attgaaatcgtggtgcagaattcctccaaggagcaacagcagtcgttgaaccatccttcg gagctgaaccgtgaaggtgacggacaggaacagcagttatccaatcagccacagcgtttc aggcccattcaaccgttggagatggccgccaatcgtccgggaggagggtattcacctggt cccggcgatccattgctggccaagcagaagcatcaccatcgacgcgccttcgagtacatc tccaaggcgctcaaaattgatgaggagaacgaaggtcacaaagagctggcaattgagctc taccgaaaaggtatcaaggagctagaggatggcatagctgtcgattgctggagtggacgt ggtgatgtgtgggatcgggctcagcgcctgcacgacaagatgcagacaaacctttcgatg gcgcgggatcgtctccattttctagctctgcgtgagcaggatttgcaaatgcagcgcctc tccttaaaggagaagcagaaagaagaggctcaaagcaagccgcagaagaccagggagccc atgctggcaggaatgaccaacgaaccaatgaaactaagggtgcgcagcagtggctatggg cccaaggccaccaccagtgcccaacccactgcatctggccgcaaactgacaattggatcc aaacgacccgtcaacttggccgtggccaacaaatcacaaacgctgccccgcaaccttggc tccaaaacatctgtaggcgcagtccaaaggcaaccagccaagacagctgcaacgcctccc gccgttcggcgacaattttcttcggggcgtaatacgccgccccagcgttcgcggactccc ataaacaacaacggacccagtggcagtggcgccagcactccggtggttagtgtgaaggga 
gtggaacagaagctggtgcaacttatactcgacgagatcgtcgagggtggggccaaagtg gagtggactgacattgccggtcaggatgtggccaagcaggctctacaggaaatggtcatt ctaccctccgtccgaccggaactctttacggggcttcgtgcaccggctaagggtttattg ctgtttggtcctcccggaaatggcaagacactgctggcccgcgccgtggctactgagtgt agtgccaccttcctgaacatttcggctgcctcgctgaccagcaagtacgtgggggatggc gagaaactggtgagggctctcttcgccgtggcccgtcacatgcagccctccattattttc atcgacgaggtggactcgttgctttcagagcgaagcagcagcgagcatgaagcatcgcgt cgcctaaagaccgagtttctagtggagttcgatgggctgcctggaaatccagacggagac aggatcgtggtgctggccgccaccaatcgaccgcaggaattggacgaggcagccctgcgt cggttcacaaagcgcgtttacgtctcactgcccgacgagcaaacccgcgagctgctcctc aatcgactgctccagaagcaaggcagtccgttggataccgaggcgttgcgtcgccttgca aagataacagacggatactcgggctccgacttaacggcactggccaaggacgccgcattg gagcccattcgagagttgaatgtggagcaagtgaagtgtctggacatcagtgcaatgcgt gcaatcacagagcaggactttcacagttcgctcaagcgcatcaggcgctcggtggcgccg caaagcctcaactcatatgagaagtggtcgcaagattatggcgacatcaccatctag >AE003746|GENSCAN_predicted_peptide_5|490_aa MDESQPKTLYVGNLDSSVSEDLLIALFSTMGPVKSCKIIREPGNDPYAFIEYSNYQAATT ALTAMNKRLFLEKEIKVNWATSPGNQPKTDISSHHHIFVGDLSPEIETETLREAFAPFGE ISNCRIVRDPHTMKSKGYAFVSFVKKAEAENAIQAMNGQWIGSRSIRTNWSTRKLPPPRE PSKGGGQGGGMGGGPGNGSGVKGSQRHTFEEVYNQSSPTNTTVYCGGFPPNVISDDLMHK HFVQFGPIQDVRVFKDKGFSFIKFVTKEAAAHAIEHTHNSEVHGNLVKCFWGKENGGDNS ANNLNAAAAAAAASANVAAVAAANAAVAAGAGMPGQMMTQQQIAAATGAAIPGQMMTPQQ IAAQYPYAYQQMGYWYPPATYPTTQMQTQYMQQGYYPYAYPTSAQQAGGVPCIQFSAAGY RMVPPNVAWGVPGTVVPVLRCHNTRPNELRCDASFDAVCSIREVAPSRSHNHLCCISFRT CSSCKRITHV >AE003746|GENSCAN_predicted_CDS_5|1473_bp atggacgagtcgcaaccgaagaccctatacgtgggcaacctggatagctcagtgtccgag gacctgctaattgccctcttcagcaccatggggcccgtcaaaagctgcaaaatcattcgg gaaccgggcaacgatccatatgccttcatcgaatattccaactaccaggcagccacaaca gctctgaccgccatgaataaacgcctattcctcgaaaaggaaatcaaggtcaactgggcc accagtcccggcaatcagccgaagacagacatcagttcgcaccaccacatattcgtgggc gacctcagtcccgagattgagacagaaacactgcgcgaggctttcgccccattcggagag atctccaactgtcgcattgtgcgcgaccctcacaccatgaagtcaaagggttacgccttc gtgtcgtttgtgaaaaaggcggaggcagagaacgccatccaggcgatgaacggccagtgg 
attggctcgcgctcgatacgcaccaactggtccacgcgcaagctgccaccaccacgcgag ccttccaagggcggaggccagggaggcggaatgggtggcggaccgggcaatgggtccggt gtaaagggaagtcaacgccacaccttcgaggaagtgtataaccagtcgagccccaccaac accaccgtatactgtggcggattcccgccgaatgtcatcagtgacgacctgatgcacaag cacttcgtccagtttggtcccatccaggacgtgcgggtcttcaaggacaagggcttctcg ttcatcaagtttgttaccaaggaggcagccgcccacgccatcgagcacacgcacaacagc gaggtacatggaaacctggtaaagtgcttctggggcaaagagaacggaggcgataactcg gccaataacctcaatgccgccgctgccgcggcagcagcctctgccaatgttgccgccgtt gcggcagccaatgctgcggttgccgctggagcgggtatgcccggtcagatgatgacgcag caacagatcgccgccgcaacaggagcggcgatacccggccaaatgatgacaccccagcag attgcggcgcagtatccatacgcgtaccagcagatgggctactggtatccacctgcgact tatcccacaacccagatgcagacgcaatacatgcagcagggctactatccctacgcctac cctaccagtgctcagcaagcgggaggagtcccttgcatacaattttcagcggctggatac cgcatggtgccgccgaatgtagcatggggcgtgcccggaactgtggtgcccgtgctgcga tgccacaataccagacccaatgagctgagatgtgacgccagcttcgacgccgtttgcagc atccgggaagtcgcccccagccgcagccacaaccacctctgctgcatcagcttccgcacc tgcagcagttgcaagcgcattacgcacgtttag >AE003746|GENSCAN_predicted_peptide_6|232_aa MGINIFINSKYIISCGDHVKCDELYSRIAEKTNLQPGEYYLVSNGKRLEGEISSGDVHCV LRQLGGKGGFGSMLRAIGAQIEKTTNREACRDLSGRRLRDINEEKRVRAWLEKQGERERE AEERKKRKIEKLLAVPKHDFKDDKYEEARANLTEKVNDAFEEGLKQAEENKEKGVEEATS SGTKRKSPAVDKTKAKKKKKGTLWIDDDISGSDSDSDDSEEEPKTQKKAIQN >AE003746|GENSCAN_predicted_CDS_6|699_bp atgggcataaatatttttataaacagtaaatatataatttcttgtggagatcacgttaaa tgtgatgagctttacagtcggattgcggagaaaacaaacctgcaaccaggggaatactac ttagtcagcaatggcaagcgcttggagggagaaatttcatctggagatgttcactgcgtt ctccgccagctcggtggcaaaggaggatttggttccatgcttcgagccattggtgcgcaa attgaaaagacaaccaatcgcgaggcttgccgcgatctaagtggtcgccgtttgcgggat atcaacgaggagaagcgggtgcgcgcctggctggagaagcagggcgaacgggaacgggag gccgaggagcgtaaaaagcgaaaaatcgagaaactgctggccgtgcccaagcacgacttt aaggacgataagtacgaggaagccagggctaatctgaccgagaaggtaaacgacgccttc gaggagggactcaagcaggccgaggagaacaaggagaaaggcgtcgaggaagcaacttcc agtggcacaaagaggaaatcccccgccgtagacaagaccaaggcaaaaaagaagaaaaag 
ggcacactctggatagacgatgacatctctggatccgactctgactctgacgacagcgag gaggagccaaaaacacagaaaaaagctatacaaaactag >AE003746|GENSCAN_predicted_peptide_7|388_aa MSEAEKQAVSFACQRCLQPIVLDEQLEKISVHAMAELSLPIYGDNGNTLDPQDASSFDHF VPPYRLTDSINGTGFMLVSDGRDNKKMSAAFKLKAELFDCLSSNSEIDHPLCEECADSML EIMDRELRIAEDEWDVYKAYLDELEQQRVAPNVEALDKELDELKRSEQQLLSELKELKKE EQSLNDAIAEEEQEREELHEQEESYWREYTKHRRELMLTEDDKRSLECQIAYSKQQLDKL RDTNIFNITFHIWHAGHFGTINNFRLGRLPSVSVDWSEINAAWGQTVLLLSALARKIGLT FERYRVVPFGNHSYVEVLGENRELPLYGSGGFKFFWDTKFDAAMVAFLDCLTQFQKEVEK RDTEFLLPYKMEKGKIIDPSTGNSYSIK >AE003746|GENSCAN_predicted_CDS_7|1167_bp atgagtgaggcggaaaagcaggcggtgtccttcgcctgtcagcgctgcctgcagcccatc gtcctggacgagcagctggagaagattagcgtgcacgcaatggcggagttatctttgccc atctacggagacaatggcaatacattggacccgcaggacgccagcagcttcgaccacttt gtgccgccctacaggcttacggactctataaatggcactggttttatgctggtttccgat ggcagggacaacaagaaaatgagtgctgcttttaagctgaaagcggagctgtttgactgc ctctcctccaactctgagattgaccatccgctgtgcgaagagtgtgccgactccatgctg gaaatcatggacagggaactgcgcatcgctgaggacgagtgggatgtgtacaaggcttat ttggatgaactagagcaacagcgtgtagcacccaacgttgaggccctggacaaggagctc gacgaactaaagcgcagcgagcaacagcttctgtcggagctaaaagagctcaaaaaggag gaacaatcgctaaatgatgccattgccgaggaggaacaggagcgagaggagctgcacgag caggaggaaagctactggcgcgagtataccaagcacaggcgtgagctaatgctcaccgag gatgacaaacgaagtctggagtgtcagatcgcctactcgaaacagcagctggacaaactg cgcgacaccaacatattcaacatcacctttcacatctggcatgccgggcatttcggtacc attaataactttaggctgggtcgattgccctctgtatccgtggactggtcagagatcaac gccgcttggggccagacggtgcttctgctctctgcgcttgctcgtaagatagggctcacc ttcgagcggtatcgtgtagtaccctttggaaatcattcgtatgttgaggtgctcggcgag aatcgagagctcccgctttatggcagcggcggatttaagtttttctgggacaccaagttt gatgctgccatggtagcttttctcgactgtcttacccagttccagaaggaggtcgagaag cgcgacaccgagttcctgctgccctacaagatggagaagggcaagatcatcgatccctcc acgggaaattcctattctattaagtag >AE003746|GENSCAN_predicted_peptide_8|447_aa MVSYFVPRGRFLLKAGNLRQVVQQQHQPAQLQLQPIKGPQPQAQNASLPVARHLRQFSSN PASKEAPLHHRRPQHKQQPNPSQELAQIRRNILSRWTGFLLRWAPMGICVFGAIEWQLQK NRCEKEGKPRTASELQSRIYCSLPLRIISRCWGWLAACYLPPSLRPYVYGWYSNTFDVNL 
SEAMYPEYEHYNSLAEFFTRPLKEGVRVIDQQAPLVSPADGKVLHFGSASDSLIEQVKGV SYSIEDFLGPLETVEQANSGASYAQALKKKSDGSTELYQCVIYLAPGDYHRFHSPTAWKP TIRRHFSGELLSVSPKVAGWLPGLFCLNERVLYMGQWKHGFFSYTAVGATNVGSVEIYMD ADLKTNRWTGFNVGKHPPSTYEYDELVLNKELTEAPKEFGKGDLVGQFNMGSTIVLLFEA PKNFKFDIIAGQKIRVGESLGHIVGSK >AE003746|GENSCAN_predicted_CDS_8|1344_bp atggtttcctacttcgtacctcgtggccgcttcctgctgaaggccggcaatctcagacag gtggtgcagcagcagcatcagccggcccagctacagctgcagcccattaaaggaccacag ccgcaggctcagaatgccagtctgccggtggcccgtcacttgcgccaattctcctcgaat ccagcctccaaggaggcaccattacaccaccggcgcccgcaacataaacaacaaccgaat cctagtcaggagttagctcagattcggcgcaatatactctcacggtggacgggcttcctg ttgcgttgggctcccatgggcatctgtgtgtttggcgccatcgagtggcagttgcagaag aaccgctgcgaaaaggaaggaaaacctcggacagcgtccgagctccagtcgcgcatttac tgctccctgccactgcgcataatcagccgttgctggggctggctggccgcttgctacttg cctcccagtctgcgtccctacgtctacggatggtattccaacacgtttgatgttaatttg agcgaggccatgtatccggagtatgagcactacaatagtctggctgagttctttaccagg ccacttaaggagggcgttcgtgtcatcgatcagcaggctccactggtctcccccgccgac ggtaaggttctacattttggcagcgcctcggactcgctaatagagcaggtcaagggtgtt agctacagtatcgaggacttccttggcccgctggagactgtggagcaggcaaattccggt gcctcctatgcccaggccctcaaaaagaagagcgatggttctacggagctgtaccagtgc gtgatatatctggctcccggagattaccatcgattccactctcctaccgcttggaagccc accattcgtcgtcacttctccggcgaactgctgtccgtgagccccaaagttgccggctgg ctgcctggtctgttttgcctcaacgaacgtgtgctgtacatgggccagtggaagcacgga ttcttctcctacaccgccgtgggtgccacaaacgtgggatccgtcgaaatctacatggat gccgatctgaagacgaaccgttggactggattcaatgtcggcaagcatccgcccagcacc tacgagtacgatgaactggtcttgaacaaagaactaacagaagcgcccaaggaattcggc aagggggatctggtgggccagttcaatatgggcagcaccatagtcctgctcttcgaggcg ccaaagaatttcaaatttgatattatcgccggtcagaagatccgcgttggcgagtccctc ggccacatcgtcggctccaaatga >AE003746|GENSCAN_predicted_peptide_9|147_aa MALRVARSQIPFSTARNTQSNLLQRFYSQAPQIGIVDYDVVKKLPSEPQKLLIDVREPEE LKETGQIPASINIPLGVVSQELAASEQLFKSKYGREKPKPETEIIFHCKIGKRSLKAAEA AAALGFKNVKNYQGSWLDWAEREGLPK >AE003746|GENSCAN_predicted_CDS_9|444_bp atggccttgagggtcgctagatcccagatcccgttcagcactgccaggaatactcagagc 
aatctcctccaacgtttttacagccaggctcctcaaattggaatcgtggactacgacgtg gtcaagaaactacccagcgaaccacagaaactgctcatcgatgtccgggagccggaagag ctgaaggaaaccggacaaatccccgccagcatcaacatccctctgggcgtggttagtcag gaattggcggccagtgagcagctttttaaatccaaatatggccgggaaaaaccgaagcca gaaacggagattatattccactgcaagattggcaaaagaagccttaaggctgcagaagct gccgccgcattgggattcaagaatgtaaagaactaccagggatcttggctggattgggcc gaacgagagggcctgcccaagtaa >AE003746|GENSCAN_predicted_peptide_10|635_aa MPAIGIDLGTTYSCVGVFQYGKVEIIANDQGNRTTPSYVAFTDSERLIGDAAKNQVAMNP KNSVFDAKRLIGRRFDDSKIQEDIKHWPFKVINDNGKPKISVEFKGANKCFSPEEISSMV LTKMKETAEAYLGTTVKDAVITVPAYFNDSQRQATKDAGAIAGINVLRIINEPTAAALAY GLDKNLKGERNVLIFDLGGGTFDVSILTIDEGSLFEVRSTAGDTHLGGEDFDNRLVNHFA EEFKRKYKKDLRSNPRALRRLRTAAERAKRTLSSSTEASLEIDALYEGHDFYSKVSRARF EELCGDLFRNTLEPVEKALKDAKMDKSQIHDIVLVGGSTRIPKVQNLLQNFFGGKTLNLS INPDEAVAYGAAIQAAILSGDKSSEIKDVLLVDVAPLSLGIETAGGVMTKLIERNSRIPC KQSKTFTTYADNQPAVTIQVFEGERALTKDNNVLGTFDLTGVPPAPRGVPKIDVTFDLDA NGILNVTAKEQGTGNAKNITIKNDKGRLSQADIDRMLSEAEKYAEEDERHRQRIAARNQL ETYLFGVKEAAENGGDRISAADKSSIVERCSEAMKWLDSNTTAEKEEYEYKLKELEQFCS PIMTKMHKGGGDGQQAPNFGQQAGGYKGPTVEEVD >AE003746|GENSCAN_predicted_CDS_10|1908_bp atgccagccattggaatcgatttgggcaccacatactcctgcgtgggagtcttccagtac ggaaaagtggagatcattgccaacgaccagggtaaccgtaccacaccatcgtacgtggcc ttcaccgactcggaacgccttattggagatgccgccaagaaccaggtggccatgaacccc aagaactctgtgttcgatgccaaacgcctgattgggcgtcgtttcgacgattccaagatc caggaggacattaagcactggccgttcaaagtgatcaacgacaacggcaaaccaaagata agcgtggagttcaagggcgcgaataagtgcttctctcccgaggagattagctcgatggta ctcaccaaaatgaaggagactgcagaagcctacctgggcactacagtgaaggacgctgtc atcacagtgccggcctacttcaacgactcccagcgccaggcaacaaaggatgctggtgcc atcgctggcatcaatgtgctccgcatcatcaacgagccaacagcggcggctctggcctac ggcctggacaagaatctgaagggagagcgcaatgtgctgattttcgatttgggcggtggt acttttgacgtctccatcttgaccattgacgagggatccctgtttgaggtgcgctcgact gcgggagacacacatctgggtggcgaggacttcgacaaccgattggttaaccactttgcc gaggagtttaagcgcaagtacaaaaaggatctgcgctccaacccacgcgcactgcgtcgt cttcgcacggcagccgagcgcgcaaagcgcaccctttcctccagcacggaggcttctttg 
gagattgacgccctgtacgagggacacgacttctactcgaaggtgagccgcgccagattc gaggaactttgcggtgacctcttccgcaacacactggagccagttgagaaggcactcaag gacgctaaaatggacaagagccagatccatgacatagtcctggttggaggctccactcgt attcccaaggttcagaacctgctgcagaacttcttcggcggaaagaccctgaacttgtca atcaatccggacgaggcggtggcctacggagcagccatccaggcagccattctgtcgggc gacaagagcagcgagatcaaggatgtcctactggtcgatgttgccccactctcgttgggc atagaaaccgccggcggggtgatgaccaagctgattgagcgcaacagccgcattccatgc aagcagtccaagaccttcaccacctatgccgacaaccagccggcggtgaccattcaagtg tttgagggcgagagggctctgaccaaggacaacaatgtattgggcacattcgatctcact ggcgttccacccgcaccccgtggagtgcccaagatcgacgttactttcgatttggacgca aacggtatcctgaatgtgaccgccaaggagcagggcactggcaacgccaagaacattacc atcaagaacgacaagggtcgtctgtcgcaggcggacatcgaccgcatgctcagtgaggcg gagaagtacgccgaggaggacgagcgccatcgccagcggatcgccgcccgcaatcaactg gagacctatttgtttggtgtaaaggaggcggccgagaatggtggcgatcgcatcagtgca gccgacaagagcagtattgtggagcgttgcagcgaggcgatgaagtggttggacagcaac accaccgccgagaaggaggagtacgagtacaaactgaaggaactggaacagttctgcagc cccatcatgaccaagatgcacaagggaggtggagatggccagcaggctccaaactttgga cagcaagctggcggttataagggtcccaccgtcgaggaggtggactaa >AE003746|GENSCAN_predicted_peptide_11|1244_aa MTEEDDDVAQRVATAPVRKPDDETAFLEYIEMENFKSYRGHIVVGPLKQFNAVIGPNGSG KSNFMDAISFVMGEKTSSLRVKRLNDLIHGSSIGKPVSRSCYVTAKFVLNEERHMDFQRA VIGGSSEYRINGESVSSSTYLNKLEKIGINVKAKNFLVFQGAVENIAMKTPKERTALFEE ISGSGLLKDDYNRLKQEMIVAEEETQFTYQKKKGIAAERKEAKHEKMEADRYTRLQNEYN EKQVEYQLFRLFHVERDIRKFTSDLEVRQQEVKAVEQRKEAADEILREKKKDAGKITRDL AKIDQEIREFETQMNKRRPLYIKAKEKVTHCKKKLISLQKTLETAREADNAHQSDIRKLE KQLADVEALKKRFEDEIENESQRRGKSVNMEEGLVQEYDRLKQEAEATATQYRSELDSVN REQKSEQDTLDGETNRRASVEESFKKLTLQREEAVKRRDKLMDHIKSSQAALEEQNRIKD ELRRDVGTSKEKIAEKQRELEDVRDQLGDAKSDKHEDARRKKKQEVVELFKKQVPGVYDR MINMCQPTHKRYNVAVTKVLGKFMEAIIVDTEKTARHCIQILKEQMLEVETFLPLDYLQV KPLKERLRNISDPRNVRLVFDVLKFEPQEIERAVLFATGNALVCETPEDAMKVAYEIDRS RFDALALDGTFYQKSGLISGGSHDLARKAKRWDEKHMAQLKMQKERLQEELKELVKKSRK QSELATVESQIKGLENRLKYSMVDLESSKKSISQYDNQLQQVQSQLDEFGPKILEIERRM QNREEHIQEIKENMNNVEDKVYASFCRRLGVKNIRQYEERELVMQQERARKRAEFEQQID 
SINSQLDFEKQKDTKKNVERWERSVQDEEDALEGLKLAEARYLKEIDEDKEKMEKFKQDK QAKKQAVDDMEEDISKARKDVANLAKEIHNVGSHLSAVESKIEAKKNERQNILLQAKTDC IVVPLLRGSLDDAVRQSDPDVPSTSAAMENIIEVDYSSLPREYTKLKDDSAFKKTHEMLQ KDLQSKLDVLERIQTPNMKALQKLDAVTEKVQSTNEEFENARKKAKRAKAAFERVKNERS SRFVACCQHISDAIDGIYKKLARNEAAQAYIGPDNPEEPYLDGINYNCVAPGKRFQPMNN LSGGEKTIAALALLFSTHRYADIIFFHILRVNINIFLCSFHPAPFFVLDEIDAALDNTNI GKVASYIRDHTTNLQTIVISLKEEFYGHADALVGITPGVRTQLV >AE003746|GENSCAN_predicted_CDS_11|3735_bp atgaccgaagaggacgacgatgtggcccaaagggtggcgacggcgcccgtccgcaagccg gacgacgagacggccttcctggagtacatcgaaatggagaacttcaagtcctaccgcggt cacatagttgtgggtcctctgaagcaattcaacgctgttattgggcccaacggatcaggc aagtcaaacttcatggatgccatcagtttcgtgatgggcgagaagaccagcagtttgcga gtaaagcgcctgaacgacctcatccatggctcctccattggcaagcccgtttcccgcagt tgctacgtgaccgccaaatttgtgctgaacgaggagcgccacatggacttccagagggcg gttatcggcggctcctcggaatatcgtatcaatggagagagcgtatcgagcagcacgtac ttgaacaagctggagaaaattggcatcaatgttaaggccaagaactttttagtattccag ggagctgttgagaacatagccatgaagacacccaaggaacgcactgctctatttgaggaa attagcggttctggcctgcttaaggacgactataatcgacttaaacaagaaatgattgtt gccgaggaggagacccagtttacttaccaaaagaagaagggcatcgcggcggaaagaaaa gaagccaagcacgagaagatggaagctgatcgctacactcgcctgcaaaatgaatacaat gaaaaacaagtggagtatcaattatttaggctattccacgtggagagggacatccggaag tttacgagcgatttggaagtgaggcaacaggaagttaaagcagtcgaacagcgaaaagaa gctgccgatgaaatcctgcgcgaaaagaagaaagacgccggaaaaatcacccgagacctg gccaaaatcgatcaggaaatcagagagtttgaaacccagatgaacaaacgcaggcccctt tacatcaaggccaaagagaaagttacccactgcaagaagaagctcatttccctacagaag actctggaaacagccagggaggcggacaatgcccatcagtcggacatacggaaactggag aagcagctggctgatgtggaggccctgaaaaaacgtttcgaggacgagatcgagaacgag tcgcagcgccgcggcaaaagcgttaacatggaagagggccttgtgcaagagtacgatcgc ctgaagcaggaggcggaagccacagctacgcagtaccgttcagagctggactcagtaaac cgggagcaaaaatccgaacaggacacgcttgatggggagacaaatcgtcgcgcctccgta gaggagtccttcaagaagctcacattacagcgcgaggaagctgtaaagcgtcgagacaaa ctgatggatcacatcaaatcatcacaggctgccttggaggaacagaaccggatcaaggac gagctccggcgagatgttggcacatccaaggagaagatagccgaaaagcaacgcgaattg 
gaggacgtacgcgatcaactgggcgatgccaagagcgacaagcacgaggatgctcgtagg aaaaagaagcaggaggtggtggagctcttcaaaaaacaggttcccggagtgtacgaccgt atgattaacatgtgtcagccgacgcataaacgctacaatgtggccgtcaccaaagttttg ggcaagttcatggaagccattattgtggacaccgaaaagacggctagacactgtattcag atcttgaaggagcaaatgctggaagtggaaaccttcttgcccttggactatctgcaagtt aagcctctgaaggaacgacttcgtaacatcagcgacccgcggaacgtgcgattggttttc gatgtactaaagtttgagcctcaagagatcgaacgggctgtgctcttcgccacaggcaat gctctcgtttgcgagactcccgaggacgccatgaaagtggcttacgagatagaccgatca cgtttcgatgctctggccctggacggaacattctaccagaaatcgggtctcatatctggc ggtagtcacgatctggctcgcaaagctaagcgatgggacgagaagcacatggctcagctg aaaatgcagaaggagcgccttcaagaagagctcaaggagctggtgaaaaagtcacgtaag cagagtgagctcgccactgtggagtcgcagatcaagggtcttgaaaacaggcttaaatat agtatggtcgatttggagtcctccaagaagtcaattagtcagtatgacaatcagttgcag caagtccaatcgcagttggacgaatttggacccaagatccttgagatcgagcgtcgtatg caaaaccgcgaggagcacatccaggaaatcaaagaaaacatgaacaatgtggaggacaaa gtatatgcctctttctgccgtcgcttgggcgtgaagaacatacgacaatacgaggagcgt gagctcgtcatgcagcaagagcgagcgcgtaaacgggctgagttcgagcagcagattgat tccataaactcgcagctggacttcgagaaacagaaagacactaaaaaaaacgttgagcgc tgggagcgcagcgtgcaggatgaggaagatgccctagagggactcaaattggctgaggca cgctatttaaaggaaatcgacgaggataaggagaaaatggaaaaattcaagcaggacaag caggccaagaagcaagccgtggatgacatggaagaggacatatctaaagctcgcaaggat gtggcaaatttggccaaggagatccataacgtgggcagtcatttatctgcggtagagtct aagatcgaggccaagaaaaacgaacgtcagaacatactgttgcaagcaaaaaccgattgc atcgtggtaccactgttgcgcggctcgctggacgacgcagtgcgacaaagcgatccggat gtcccatccacctctgcagcgatggaaaatattattgaagtggattattcatctctacca cgagagtataccaagctgaaggatgactccgctttcaaaaagacccacgaaatgcttcaa aaggacctgcaaagcaagctggacgtcttggagcgcatacagacacccaacatgaaagca ctgcagaaactcgatgctgtaacggaaaaagtgcaatccaccaatgaggagtttgagaat gcgcgcaagaaagcaaagagggccaaggcggcatttgaacgggttaagaacgaacgctcc tcgcgattcgttgcgtgttgccagcacatatcggacgccattgacggcatctacaagaaa ttggctcgcaacgaggcagcccaggcttatatcggtcccgacaatcctgaggaaccatat ctggatggtattaactataattgtgtggcgccaggcaaacgtttccaacccatgaacaac 
ttgagtggtggcgaaaaaacaatagcagccttggctctgctcttctcgacccacaggtat gcagatataatatttttccacattttgagagtaaatattaatatatttttatgcagcttt catccagcgccgttctttgtccttgacgagattgatgccgccttggacaacacgaatatt ggcaaagtcgcttcgtatataagggatcacacaaccaacctgcaaaccatcgtcatttcc ctgaaggaagagttctatggtcatgctgatgctcttgtgggcattacgcctggggtacgt actcaactggtgtaa >AE003746|GENSCAN_predicted_peptide_12|97_aa MCTDDDNLDDDDDVDDDDEDNEDSLSVWHRTATDMTSTTRIRDTLRRILNIPLTTALEYK IHCDSLKYVSMPRRQNHKLGNLFYRNRYGFNSRYGKS >AE003746|GENSCAN_predicted_CDS_12|294_bp atgtgtactgatgacgacaatctggatgatgatgatgatgttgatgatgatgatgaggac aacgaggatagcttatcagtatggcaccggactgccactgatatgaccagtacaacacgt atccgtgatacgttacgcaggatattaaatatacctctaacaacggcattggagtataag atacactgtgatagccttaaatatgtttcaatgccgagacgtcaaaatcataagctggga aaccttttttatcgaaatcgatacggctttaattcaagatacggaaaatcatga >AE003746|GENSCAN_predicted_peptide_13|146_aa MFYRRSNTVETKKILEKLRLYRTRLVQIHEYHHFNRVISSKVNSTLQLSIHSEERRSAQP TSTLFRSYKMKMEDVSVEKRVRKTNRKSGVLLCGVRSSRSNWRLHTERLCQQLNQPQNHK KYPHFRLFLLTASNASILQIAVMRFS >AE003746|GENSCAN_predicted_CDS_13|441_bp atgttttatcgacgcagcaacacagtagagacaaagaaaatcttggaaaaactgcgacta tatcgcacgcgtttggtgcaaatccatgaataccaccattttaaccgagtaatatcatca aaagtgaactcgacattgcagctttccattcattcagaagaaagacgttccgcgcagcca actagcactttgttccgcagctacaagatgaaaatggaagacgtcagcgtggaaaagcga gtgaggaaaactaacaggaaaagcggagttttgttgtgcggtgtgcgtagcagcagaagc aactggcgactgcacactgagaggttatgccagcaattgaatcagccacagaaccataag aaatacccgcactttcgtctctttcttctgactgcttctaacgcttctatccttcagatt gccgttatgcggtttagttaa >AE003746|GENSCAN_predicted_peptide_14|450_aa MYSRYRKNWPDIVDSDDSDVDNQIDVDNLPPLEVGPGENRLQHTYCLWFSRKETQRAAAD YSKSLHMVGRCASVQQWWSLYSHLIRPTALKPYRELLLFKQGIIPMWEDPANSKGGQWLI RLRKNKVDRAWENVCMAMLGEQFLVGDEICGVVLQTKYPMTKDRLAALHAAQSDDEEETE VAVNVDGHDSYMDDFFAQVEEIRGMIDKVQDNVEEVKKKHSAILSAPQTDEKTKQELEDL MADIKKNANRVRGKLKGIEQNIEQEEQQNKSSADLRIRKTQHSTLSRKFVEVMTEYNRTQ TDYRERCKGRIQRQLEITGRPTNDDELEKMLEEGNSSVFTQGIIMETQQAKQTLADIEAR HQDIMKLETSIKELHDMFMDMAMLVESQGEMIDRIEYHVEHAMDYVQTATQDTKKALKYQ SKARRKKIMILICLTVLGILAASYVSSYFM 
>AE003746|GENSCAN_predicted_CDS_14|1353_bp atgtactcgcgttacaggaaaaactggccagatattgtcgacagcgacgacagcgatgtg gataatcagatagatgtggacaacctgccaccactggaggtgggtcccggcgagaaccgg ctgcagcacacatactgcctctggttctctcgcaaggagacgcagcgcgcggccgccgac tacagcaagtcgctgcacatggtcggccggtgcgccagcgtgcagcagtggtggtcgctc tactcgcacctcatccggcccaccgccctgaagccctaccgggagctcctcctcttcaag cagggcatcataccgatgtgggaggacccggcgaacagcaagggcggccagtggttgata cgactacgcaagaacaaggtcgaccgggcctgggagaacgtttgtatggcgatgctcggg gagcagttcctcgtcggcgacgagatatgcggagtcgtgctacagacgaaatatccgatg actaaagacagattagccgctctccatgccgcccaatccgatgacgaggaggagacggag gtggccgtcaatgtggatggccatgattcctacatggacgacttcttcgcccaggtggag gagatccgcgggatgatcgacaaggtgcaggataacgtcgaggaggtaaagaagaagcac tcggccatcctgtccgccccacaaacggacgagaagaccaagcaagagctggaggatctg atggccgacatcaagaagaatgccaatcgcgttcgcggcaagctcaagggcatcgagcag aacatcgagcaggaagagcagcagaataagtcgtcggcggatctgagaattcggaagacg cagcactcgacgttgtcacggaaattcgtcgaagtgatgaccgagtacaatcgcacgcag accgactaccgagagcgctgcaagggaaggatacagcgtcaactggagattaccggacga ccgaccaacgacgatgagctggagaagatgctggaggagggcaactcgtctgtgttcacg cagggcatcatcatggagacgcagcaggccaaacagacgctggcggacattgaggcccgc caccaggacatcatgaagctggagacatcgatcaaggagctgcacgacatgttcatggac atggccatgctggtggagtcgcagggcgagatgatcgatcgcattgagtaccatgtggag cacgctatggactatgtgcagacggcaactcaggacaccaagaaggcgctcaagtaccag agtaaagcccgacgaaagaagatcatgatactgatctgcctcactgtgctgggcatctta gcggcctcatatgttagcagttatttcatgtaa >AE003746|GENSCAN_predicted_peptide_15|465_aa MKLSIRMLDQRTITLEMNESQEVRALKQKLGNLPEVAMPAENLQLIYSGRIMEDAMPLSE YRIAEDKIIVLMGKKKVDKSSPEEKVAPTPPLAAGPNVLRTEDVVPSLAPNDQWVSDLMS MGYGEEEVRSALRASFNHPERAIEYLINGIPQEVVSEQGLAAIPSVQTSDQLQQLMADLN ITRMREMINQNPELIHRLMNRLAETDPATFEVFQRNQEELMNMISGGASRTPNEIEHLQI TLTAEETAAVGRLEALGFERVMAVQAYLACDKDEQLAAEPLFRGAVRYGSAALSECSNTA DTTYPTFGYRVSEYRIFAVILQKSVCLRSPASRIMGKSEKKSKKKHKEKRREHKEKHKSK KSKKHNRKKEESPPIASPTPIQPERVNQEEEDDFAIPIEEYQRRQSQIRKEVDPVTGRVR LIKGDSEVLEEIVTKERHLEINKKATRGDGEFYEARSLDAAKRRK >AE003746|GENSCAN_predicted_CDS_15|1398_bp 
atgaagctgtctatacgcatgctggaccaacgcaccatcactttggagatgaacgaatcg caggaggtgagggctctgaagcagaaattgggcaatttacccgaagtcgccatgcccgcg gagaaccttcagctgatatacagtggccgcattatggaggatgccatgcccctcagtgaa tatcgtatagccgaggacaagatcattgtgttgatgggtaagaagaaggttgataagagc tcgccagaggagaaggttgccccgacaccaccgttggccgctggcccaaatgttttgcgc acagaggatgtggtgccttcactagctcccaatgatcagtgggtgagcgatctcatgtca atgggatatggcgaagaggaggtacgctcagccctccgggcgagctttaatcatccggaa agggctatagagtatttgattaatgggattcctcaggaggttgtttcagagcagggatta gctgcaatcccgagcgtacagacaagtgatcaattgcagcaattaatggcagatcttaac attacacggatgcgtgagatgattaatcagaatccagaactaatacacagactaatgaac agactggctgaaaccgatccggctaccttcgaagtctttcagcgtaaccaggaggagtta atgaacatgatttcaggcggcgcaagtcgcaccccgaacgagattgaacatttacagatt actttaaccgccgaagaaaccgccgccgtagggcgtttggaggcactgggtttcgaacgt gtgatggccgttcaggcctatctggcctgcgacaaggacgagcagctggccgcagagccg ctttttcggggagcggttcgctacgggtccgccgctctctctgaatgcagcaacaccgct gacacaacataccccacatttggttacagggtatcagagtatcgaatttttgctgtcatt cttcaaaaatcagtttgtttacgttcacctgcaagccgcataatgggaaaaagtgagaaa aaatcaaagaaaaagcataaggagaagcgcagggaacacaaggaaaagcacaagtcaaag aaatccaagaaacacaaccgaaaaaaggaagaatctccgccgatagcttcaccaacacca atacagcctgaaagggtcaaccaggaggaggaggatgactttgccataccaatagaggaa taccagcgccgtcagagccagatccgcaaagaggtggacccggtgacgggtcgcgtccgg ctcatcaaaggcgacagtgaggtgctggaagagattgtgaccaaagagcgccacctggag atcaacaagaaggccacccgcggtgatggagagttctacgaagccagatctctggacgcc gccaagcgtcgcaaatag >AE003746|GENSCAN_predicted_peptide_16|669_aa MNNSPKDPYRGGGGGSGPQDGAGSTGVSINIESDDNDSTTAEHDYLLPASSSSVTSGGGV GGGGVVSGGSTSTSVNLDHQHVPTARSSSLSGGNVRHGNILSTFLARQRSYTPASNSSSP VRVTTQMNRRLQQSRSMGANTGGSLEEPSSAGAAGGPGASPGTAAGVRQMGFWRVNLNDV FSLSTASSTEALRSFISHSNFRYPPATTAAPAAPGPQPAANPVDNSHILLGQQQPSMPEP VSPLRYGRVSAVSGASSAGAMGRSISLREGEMRHNHSLNGGRHNASAIGLNSNPAAFDER EANESQDLDESDGNVENGGGGGNPPEPAENNPEDEHLISDDMVVQILSHFVRYLPLIFIL FFKFLHDHLLGIVDLLVLQTVMYNVNRSVRNQVARLAQKNYAVMVRDTFLVAVVVTVRLF LATSPPDPFGLIVPPSRKSVFIEVTALPFHTSSEGDSPTTSSEPIKTNMYKDTYKSLNVI 
PLGMLLYYIAVSDLIIKLLTMLVKLIITMLPHHLMRLKVRARLYVLVEYISQFYRAMTPI TQWLLFLYESYSGLEVVSGGLFSAMYLGAKIFELVERGKSLKKAIVTFRKNIDSERPPTK DELDAAGALCPICHDAFNTPTVLECGHIFCDECVQTWFKREQTCPMCRAKVSDDPAWQDG STTFFHQLY >AE003746|GENSCAN_predicted_CDS_16|2010_bp atgaacaacagccccaaggatccctatcgtggaggaggaggaggttctggaccccaggat ggcgctggatcgaccggtgtgtctatcaacatcgagagcgacgacaatgactcgacgacg gcggaacacgactacctgctgcctgcctcctcatcttcggtgacctctggtggtggcgtc ggaggtggaggcgtcgtctctggaggatcgaccagcaccagcgtaaacctggaccaccaa cacgttcccacggcgcggagcagcagccttagtggtggtaatgtccgtcatggtaacatt ctatccacatttttggcccgtcagcgctcgtacacgccagccagcaacagcagttccccg gtgcgggtgaccacacagatgaaccgtcgtcttcagcagtcccgtagtatgggcgccaac accggtggcagtctcgaggagccgtccagcgccggagcagcaggtggaccaggagcatcg ccaggaactgcggcaggtgttcgacaaatgggcttttggcgtgtcaacctcaacgatgtc ttttcgctttcaactgcctcgtcgacagaggcgctccgatccttcatctcacattccaat tttagatatccacctgcaaccactgcagcccccgcggcaccaggacctcagccagccgcc aatcccgtagacaactcgcacatactgctcggccagcagcaaccatcaatgcctgagcca gtcagccctctgcgatacgggcgcgtttcggccgtatctggagccagttcagctggagct atgggtagaagtatctcgctgcgtgagggtgagatgcgacacaatcacagccttaacgga ggacggcacaatgccagtgccataggactaaacagcaatccggcggccttcgacgaacgc gaagcaaatgagagccaagaccttgacgaaagcgatggcaacgtggagaacggaggtgga ggtggcaatccaccagagccagcagagaataatccagaggatgaacatctcatctcggat gacatggtggtccagattctcagccattttgtgcgatatctgccgcttatctttatactg tttttcaagtttttacacgaccacctgctgggcattgtggatcttctggtgctgcagact gtgatgtataatgttaaccgatcagtacgcaatcaggtagcccgtctagctcagaagaac tacgccgtgatggtgcgggatactttcctcgttgccgtggtggttacagtgcgtctattt ttggccacttcgccgccagatccgtttggactgattgtgccgccttccagaaaatccgtt ttcatagaggtcacagcgctgccttttcacactagctcggaaggcgacagccccacaaca tcgtccgaacccatcaagaccaacatgtacaaagatacgtacaaatctctaaatgtgatc ccactgggcatgttgctgtactacattgctgtgagcgatctcattatcaaactgctgaca atgctggtcaaacttattattaccatgctgccgcaccacttgatgagacttaaagtgcgg gctaggctttatgtgttggtggagtacatttcgcagttctaccgggcaatgacacccatt acccagtggctcctatttctatacgagtcctactccggcctggaggtggtttccggagga 
ctcttttccgcaatgtacctgggtgccaagatatttgagctggtggagcgcggcaagtcg ctaaagaaagccattgtgaccttcagaaaaaatattgactctgagcggccgccgaccaaa gacgagctggatgctgcgggtgccctgtgtccaatctgccatgacgccttcaacacaccc actgtgttggagtgcggccacattttctgtgatgagtgcgtccagacctggttcaagcgc gaacagacctgtccgatgtgcagggcgaaggttagcgatgatcccgcctggcaggacggc agcactaccttcttccatcagttgtactag >AE003746|GENSCAN_predicted_peptide_17|546_aa MSQTLDDLLLRQEHARQLRQATRSKFRRINSLDLIPEHPSQSEEEEDKESQTAHKHFVTL RRQRTADGNLLRHQFQTQSPPQPNLGTRVLLFGPQLLMRLVLTILRYVLYIPLSIAAPSF WLSALLWIFWKLLRVPIALVKWLLSGEEELGAVQRQKTILLSCGSTIQTLHLARNFYGSG ARVVVFEFEGLFGLARFSTAVDKFYVVPRPTASNPDQYIAALCHIVKKERPSVYIPVCAT SPAYYDSLARPHLEVLGCASFIPGVQETLQLDDCLQLFQRCEQQQMALPAHVVLTAPRQL QQIYESGFVGSYRNILMAAGMQGVLERHKYILPNRRAELKLNQHDISERQPWLVVRDQPG YHHYVTCTTVKDSRVVANVSCRVEHHTKNLIPVPRDDEAQIELWLRSFFAKVRFQRPING HISFRLVKSPAHGGQFVPLGTRLGVALPYICHNRSHAQLLCRAMKCIHRRGLPEDELPNW SWSALERTTSTTALDKREALFAYWDPLPYCAYYHFQLPLKNVKLFLQRRNRSATKTLSPR ITVPVH >AE003746|GENSCAN_predicted_CDS_17|1641_bp atgtcccagaccctggacgatctgctgttgcgccaggagcacgcccggcaattgcgccag gccacgcgctccaagttccggcgcatcaattcgctggatctaataccggagcatcccagc caatcggaggaggaggaggacaaggaatcccagacggcccacaagcactttgtcaccctg agacgacagcggacggctgatggcaatctgctgaggcatcagtttcagacgcagtcgccg ccacaacccaatttaggcacacgggtcctgctcttcggaccacaactgctgatgcgtctg gtgctgaccatcttgagatacgtcctctacataccactgtccattgcggcgccaagcttt tggttgtccgccctgctctggatcttttggaaactgctgcgggtacccattgcgttggtc aagtggctactgagcggcgaggaggagctgggtgcagtgcagcgacagaagaccatactc ctcagctgcggcagcaccatacagaccctacatttggccaggaacttctacggatccggc gcccgtgtggtggtctttgagttcgagggtttgttcggattggccagattttccacagcg gtggacaaattctatgtggtgccacggccaacggccagtaatccggatcagtacatagcc gccctgtgccacatcgtgaagaaggaacgaccctcggtctatatacccgtttgtgccacc agtccggcatactatgattccctggcccgaccacatctggaggtcctgggctgtgccagt ttcatacccggcgtgcaggagacactgcagctggatgactgcctccagctctttcagaga tgcgaacagcaacagatggctctgccagcccatgtggtgctgacagctccacggcaattg cagcagatctacgagagcggtttcgtgggcagctataggaacatactcatggccgcaggg 
atgcagggagtcctggagcggcacaagtacatactgcccaataggcgggcggaactaaag ctgaatcagcacgatatcagcgagcggcaaccgtggctagtggtaagggatcagccgggc tatcaccactatgtgacctgcaccaccgtcaaggattcacgggtggtggccaatgtgagt tgtcgggttgagcatcataccaagaatctgatcccagtgcccagggacgatgaggcgcag atagaactctggttgcgctccttctttgccaaggtgcgcttccagaggcccatcaatggg catataagcttccgcctggtcaagagccccgcccatggtggtcagttcgtgcctctgggc acgcgactgggcgtggccctgccctacatatgccacaatcgatcgcatgcccagttgctg tgccgtgcgatgaagtgcatccacaggcggggactgcccgaggatgaactgcccaactgg agctggtcggcgctggagagaaccacttcgacgacggcgctggacaagcgggaggcattg tttgcctactgggatccactaccctattgcgcatactatcatttccagctgcccctgaag aacgtgaagctgtttttgcagcgccggaatcgcagcgcgaccaaaaccttatcgccacgc atcacggtgccggttcattga >AE003746|GENSCAN_predicted_peptide_18|2084_aa MFIVANSRDDPLSSVRLRFLAIGYKSSSPFEGSSEAEVAPVAKRAGKSPSEPKISIMQAY RDNFKQTPCPSAVDLQAAGPAHQSATASRLPFSRSQRGRDPSASGASVSASVAGGLTPRM MSPGPPGAGGGGGGGVSGGGSGDPSALLRQNQELRQRLADESHSYRRRLDTYKQAQHNQA NLVSRLQSKIQQYRQRCSDLEERMHETIKPTAGVGPKLTTGPTNQVLCSTSLTLGQSSLP CSSSLDSPPPSCSRDYVDDVLVTGGGAGAAELCRKLEEEHQRCEQIVAQNSALRQQLEES NRTNEALTNDLQKLTNDWASLRDELLIKEDEFKEEEQAFKDYYNSEHNRLLKMWREVVAV KRSFKEMQTAMKAEVAKMGQEINCVGKDINGSNATVAFAVQQAKRAADDELKQSQRSNDE LQNQLATLKVQYESARHEIMERDQRLLELMNQLKKLEDRCAQAESQAALASRYSDEIERL NNSMREIAQAVVQDAENADREADAEVTGGVMQHMHLTRDAASVVGGAGGAGSTAGGGGKS PRRNSTRASQAFAEGTISAVQAALHKYQLALHDMQVKFQNTSETLRTTKAQLETSEGTKQ LLTTKMQQLTEKLDSSNSKLSELLQERESLQRGLDDIRVQKQQSEMGRADINSAFENLSS DYEKMQLNCGKLQKRIDSMEEDKKAVELEIQRILKDKNITELNLRSEEDRSSRLREETIS LREELNRVSLNRDLLEQQRIESDNLINLLEKQKSDLEYDLDKLLLEKCDLQEKHEKLSNN SCSTSDELKSVQNCLQEAQEERKKLRIQSVDQCNEIGELKKELAILDKARLELETDNLSA GEKLKCLQLEKEKILQDLACVTRDRGDIHNQLTAMCRKKEALNEELMRTRQRLEQTTETN SRLNRNLEEMVKDVEEKQVVIDLHEKDTHRLNELLAALRSEKESLESVLFDTNTSLEATE ERRSQLERDLQEALVREESLKNHVARLQKELEQCQRKAQETKTQLLNAARAAESDFNQKI ANLQACAEEAAKRHGEEILQLRNALEKRMQQALQALQTAKDDEIEKLQERLATLQAHLES LVQQHEEALIRAESEKQQALLIAHRDKQAVAERLEAVSRDLKTEQESLDRSRREANARDE KQRAAIAQLKDEMVQMRTKEEEHKIKLEECIRKQELQLSSLREERESLCRVSEELKMEIR 
LKEDRMESTNNELQDALRKSKEGEGFIDSLRKELTDCRRQLADSNIERDKYSGSNKELRD HVKRVESAKREQARAIEEALQKISNLEDTKNSLENERTRLSTILKETENHFTKTTQDLNA TKAQLQKAQVEFAQKDEGGKELQCKLVAEVELKERAQQELCQIKKQLSDLEANLCATRQE LGRARCQNNQEEHRFHAREQELAQRLEEGRGREKRLEDQKHNLEVCLADATQQIQELKAR LGGAEGRIRALDEQLSCVELHKRDTEQKLSSVVHTLRRIAGIQVDGSVNLSHRLLSPSRR FSPSRSCGDYDNRSTSQCPDGPIDVDPDLVRKGVRNLMHQVAQLEREKDDYKSQLGAAKK QLQDAADQQLRCDAKLGKLQAMLRNLQEEKSNLETDRKMKISAIQALEEKLKHRNDECQM LRERLAQTEMQLAATSEENGQNEERLEKSRQQCSKLDNEKRQLQEELAKVEGRASKLELQ RVAMEGDLTRLQMALQEKDCSIRQMAERLENQNRALTQLEDRCTALKSTVDQLKERLQKS AVSETQLRGEIKTLQKELSEQGHCSQANEDKLKLVQKSLQTAENEKRILTERLDSAQTNL NELRRSQQAQLDGNQRLQEQVTDLEVQRSALESQLRIAKWNQESGGDKDLTNGNGGGNGE EELSRQLKSSQREKSELRSKLQTLQDKVKQLECDRKSKFSGGNAYDRAEKSNSFYGGAAE SGEFDSNRYDVGGGNAGGGSFNCGLDHSVIEQETRDLRLKVRRLETLLAEKESELARCKA RMNDSAKCHDGLDGDRYRSAQMHAEKLLDAREQSHRQQVLRLENQISMLREQLAQEAKRR QQYILRSSKANREMQHLRSTLGDSLRNVSQHPVDPHLLESESRR >AE003746|GENSCAN_predicted_CDS_18|6255_bp atgtttattgtcgccaactctcgggatgatccgctctcctccgttcgtctgagatttttg gcgatcgggtataaatcatcgtcgccgtttgaaggttccagcgaggcggaggtagcgccc gttgccaagagagccgggaaatctccgtcggagccgaagatcagtataatgcaggcgtat cgcgataacttcaagcaaacgccctgcccatcggccgtcgaccttcaggcggcgggacca gcccaccaatcggccaccgcctctcggctgcccttctcgcgcagccagcgtgggcgtgac ccctcggcaagtggcgcctcagtttccgcctccgttgctggtggtctaacgcccaggatg atgagtccgggtccgccgggcgccggaggaggaggaggaggaggagtgagcggcggcggc agtggagatccctcggccctgttgcgccagaatcaggagctgcgccaacggctggccgac gagtcgcatagctatcgacgccgcctggacacctacaagcaggcgcagcacaaccaggcc aacttggtcagccggctgcagtcgaagatccagcagtatcgacagcggtgtagcgacctg gaggagcgcatgcacgagaccattaagccgacggcgggagtaggacccaagctgaccacg ggtcccaccaaccaagtgctgtgttcaacttctttaactctgggacagagcagcttgccc tgcagctcctcactggactcgccgccgcccagttgcagtcgtgactatgttgacgatgtc ttggtcaccggaggcggcgccggtgcggcggaactgtgccgcaaactggaggaggagcac cagcgctgcgagcagattgtcgcccagaacagcgcactgcgtcagcagctggaggagtcg aatcgcaccaacgaggcgctcaccaacgacctacagaagctgaccaacgactgggcgagt ctgcgggatgagctgctgatcaaggaggatgagttcaaggaggaggagcaggccttcaag 
gactactacaacagcgagcacaatcgcctgctgaagatgtggcgcgaagttgtggccgtc aagagatccttcaaggagatgcagacggccatgaaggcggaggtagccaagatgggtcag gagatcaattgtgtgggcaaggacatcaatggctccaacgcaacggtcgcctttgccgtc cagcaagccaagcgggctgcggatgatgaactgaagcaatcgcagcgcagcaacgatgaa ctccagaatcaattggccaccctgaaggtgcagtacgagagtgcccggcacgagatcatg gagcgggatcagcgactactggagctaatgaatcagttgaagaagctggaggatcgctgc gcccaagccgaatcccaagcagctctggccagtcgctatagcgacgagatcgagcgactg aacaattccatgcgagaaatcgcgcaggccgtcgttcaagatgctgagaacgcagatcgc gaagcagacgccgaggtcaccggcggtgtcatgcagcacatgcacctcacgcgtgacgcc gcctctgttgtgggcggagcaggtggagcgggcagcaccgccggcggcggagggaaatca ccgcgtcgcaactcgacacgcgcctctcaagccttcgccgagggcaccatctcagccgtc caggcggcgctccacaaataccagctggccctgcacgacatgcaggtgaaattccagaac accagcgagaccctgcgcaccaccaaggcccagctggagaccagcgagggtaccaaacag ctgctgaccaccaagatgcagcagctcaccgagaaactggacagcagcaactccaagcta tcggaattgctgcaggaaagggagagtctgcagcgcggactggacgatatccgtgtccag aagcagcagtctgagatgggacgagccgatatcaatagtgcgttcgagaatctgagcagt gattatgagaagatgcagctgaactgtggtaaactccaaaaacgtatcgattccatggag gaggacaaaaaggcagtggagctggagatccaacgtatactgaaggacaagaacataacc gagttgaatttgaggtctgaggaagatcgcagtagtcgtttgcgggaggaaaccatatcc ttgcgcgaggagcttaaccgagtgagcctgaatcgtgatcttctcgagcagcagcgcatc gagtccgataatttgatcaatctgctcgagaaacagaagtccgacctggagtacgatctg gacaagctgttgctggagaagtgcgatctgcaggagaagcacgagaagctatccaacaat agctgctccaccagtgatgagctgaagagcgttcaaaattgccttcaggaggcgcaggag gagcgcaagaagctccgtattcagtccgtcgatcagtgcaatgaaatcggagagcttaag aaggagctggcgatactggacaaggcacgactcgaactggagacggacaatctgtcggct ggggaaaagctcaagtgtctgcagctggagaaggagaaaattctgcaagacttggcctgc gtcacccgggatcgtggtgacatccacaatcagctaacggcgatgtgtcgcaaaaaggag gctctgaatgaggaacttatgcggactcggcagcgtctggagcaaaccaccgagaccaat agccggctgaatagaaatctggaggagatggtgaaggatgtggaggagaagcaagtggtc atcgatctgcacgagaaggacacacatcgcttgaacgaactcctggccgccctgcgttcg gagaaggaatccctggaatcggtgctcttcgatacaaacacctcactggaggccaccgag gagcgacgcagtcagctggagcgggatctgcaggaggctctggtgcgtgaggagagccta 
aagaatcatgtggctcgcttgcaaaaggagctggagcagtgtcagcgcaaggcccaagag accaagacgcagctgcttaacgccgcccgtgcggctgagagtgacttcaaccagaaaatc gccaatctgcaggcttgtgcagaggaggcggccaaacgacatggcgaggagattctacag ttgcgaaatgccttggagaagcgaatgcaacaggctctgcaagcgttgcagacggccaag gatgatgagatcgagaagttgcaggagcgtctggccaccttgcaggcgcatctcgagagc cttgtccagcagcatgaggaggcactgattcgggcggagagcgagaagcagcaagccctt ttgattgcccaccgggataagcaagcggtggccgagcgtttggaggccgtatcccgggat ctcaagaccgaacaggagtccctcgaccggagcagacgggaggccaatgcgcgcgatgag aagcagagggctgccattgcccagctgaaggacgagatggtgcagatgcgcaccaaggag gaggagcacaagattaagttggaggaatgcatccggaagcaggagctgcagttgagcagc ttgcgcgaagaacgcgaatccttgtgccgtgtgagtgaggaactaaagatggagattcgt ctgaaggaggacagaatggagagcaccaacaatgagttgcaagatgcgctgcgcaagtcc aaggagggtgagggcttcatcgatagcctgcgcaaagagttgaccgactgtcgccgccaa ctggcggacagcaacatcgagcgggacaagtattccggcagcaacaaggagctgcgcgac cacgtcaagcgtgtggagagcgccaagcgggagcaggcgcgcgccatcgaggaggctctg cagaagatcagcaatctggaggataccaagaactcgttggagaacgaacgcactcgattg agcaccatactgaaggagacggagaatcactttacaaagaccacccaggatctgaatgct accaaggcgcagctgcagaaggctcaagtggagttcgcccagaaggacgagggcggcaag gagttgcagtgcaagctggtcgccgaggtggaattgaaggagcgggcacagcaggagctc tgccagattaagaagcaattatcggatctggaagccaatctgtgtgccactcgccaggaa ttgggcagggctcggtgtcagaacaaccaggaggagcatcgcttccatgccagggagcag gagttggcccagcgcttggaggagggtcgtggtagggagaagcgcctggaggatcagaag cacaacctggaggtctgcctagccgatgccactcagcagatccaggagttgaaggcccgt ttgggcggcgccgaaggtcgcatccgtgctttggatgagcagttgtcctgcgtggaacta cacaagcgggataccgaacagaagctatcctcggtggttcacactctgcgccggattgct ggcatccaagtggacggcagtgtaaatttgtctcatcgcttgctgagtccctcgcgaaga ttcagtccgtctcgcagctgcggagactatgacaacaggagcacatcacaatgcccagat ggaccaattgatgtggacccggatctggttaggaagggtgtccgcaacctgatgcatcag gtggctcagttggagcgcgagaaggatgactacaaatcccaattgggagcagctaagaag caactccaagatgctgctgaccagcagctacgatgcgatgccaagctgggcaaactgcag gccatgctaagaaatctccaagaggagaagagcaatctggaaacggaccgcaagatgaaa atctccgccatccaggcgctagaggagaagctcaagcatcgcaacgatgagtgtcagatg 
ctaagagaacgtttggcccaaacagagatgcaactggctgccacatccgaggagaatggc cagaacgaggaacgactggagaagagccgacagcagtgctccaaactggacaatgagaag cgccagctgcaggaggaattggccaaggttgagggccgggccagcaaactcgaactgcag cgtgttgccatggagggcgatcttaccaggctacaaatggccctacaagagaaggactgc agcattcgtcaaatggctgagcggctggagaaccaaaaccgtgccttgacccaattggag gatcgctgcaccgcccttaagtccaccgtcgatcagctgaaggagcgcctccagaagtcc gctgtgagtgaaacccaactgcggggcgagataaagacgctccaaaaggagctttccgag cagggtcactgctcccaggccaacgaggataagctgaagttggtgcaaaagtctctgcag accgctgagaacgagaagcgcatcctcaccgagcgcctagatagcgctcagaccaatctc aacgagctgcgacgtagccaacaggcccagctggatggtaaccagcgcctgcaggaacag gtgaccgatctggaggttcagcgttcggcactggagtcccaacttcggattgccaagtgg aaccaggagagtggcggcgacaaggatctgacaaacggcaacggaggcggaaatggcgag gaggagctcagcaggcagctgaagtcctcgcagcgggaaaagtcggagctacgcagcaaa ctgcaaaccctacaggacaaagtcaaacagttggagtgcgaccggaaaagcaagttttcg ggtggaaacgcctatgatcgggctgagaagtccaactccttttacggtggcgcagctgaa tccggcgagttcgactccaatcgctacgatgtgggtggaggaaatgctggcggaggctct ttcaactgcggattggatcacagtgtaatcgagcaggagacgcgcgatctgcgactcaag gtgcgccgcctggagacattgttggcggagaaggagtccgagttggcgcgctgcaaggcg cgaatgaacgacagcgccaagtgccatgatggcttagatggagatcgttatcgcagtgct cagatgcacgcagagaagcttcttgacgccagggagcagtcgcatcggcaacaagtgctg cgcctggagaaccagatctcaatgctgcgcgagcagttggcccaggaggccaaacggcga cagcagtacattctacgcagctcgaaggccaatagagaaatgcagcatctgaggagcacc ctaggagattccctgcgcaatgtctcccagcacccggtggatccccatctcttggaaagc gagagccggcggtga >AE003746|GENSCAN_predicted_peptide_19|461_aa MSGSFGRNQNNISCFGATFCQFRCECYNRCKIMHYSIIRKLVTAPKLANVPKRKWKSVVG LEVHAQIASASKLFSGSGTSFGAPLNSSVAYFDASIPGTLPVLNRKCVESGIKTSLALGC RVNEVSMFDRKHYFYADLPNGYQITQQRAALANDGKMTFPVITPGKKVYYKTAKLLQLQL EQDSGKSLHDDYLKRSLVDLNRAGLPLMELVFAPDLETGEEAASLVKELILILRRLQTCS CKMEEGALRVDANISIHQEGDPLGVRTEVKNIGSVRSISQAITYEINRQLETVANGGVIT NETRNWDAENRRTVAMRDKEVLQDYRFMPEPNLPPLHVNLKPGSMSTEDLLSVAALSEEI PELPEDTRQRLVEQHNLNAETAIILVLIELHSLQQICSPDEIENLCQLAIANQAKAVEQY QKGKAKALFAIAGEVAKLSSQKANMKLVVQRLEKLLKPTNK >AE003746|GENSCAN_predicted_CDS_19|1386_bp 
atgtccggcagctttggccgcaatcaaaacaacatcagctgttttggcgcaactttttgc caattccggtgcgaatgttataatcgatgtaaaataatgcattactcaataatccgaaaa ctggtaacggcgccaaagctagccaacgttcccaaaaggaaatggaaaagcgttgtgggt ttggaggtgcacgcacagattgccagtgcgtccaaactgttttccggcagtggcacatcc tttggagcaccacttaactcttcggtggcgtattttgatgcctccataccgggaacattg ccagttctcaacagaaaatgtgtggaatccggcattaagacatcacttgctttgggatgt cgggtgaacgaagtgtccatgtttgaccgcaagcactacttctatgcagatttgcctaat ggctaccaaatcacgcagcagcgcgccgctttggccaatgatggaaaaatgactttcccc gtgataacaccaggcaaaaaagtttactacaagaccgccaaactactgcagttgcaattg gaacaggatagtggtaaatctctgcacgatgattatctcaaaaggagcctggttgacctt aatcgtgctggacttcctctaatggagctggtttttgcaccagatttagaaacgggcgaa gaagcggcatcgctggtcaaggaattaatactgatactaaggcgtctgcagacatgcagt tgtaaaatggaagagggcgccctgcgtgtggatgccaacatatccattcaccaagaaggc gatcccttgggagtccgcaccgaagtaaaaaacattggctcggttcgaagcatttcgcaa gcaattacgtatgaaattaatagacagctagaaactgtggctaatggcggtgtaattaca aatgagactcgcaactgggatgcggagaaccggcgcacagtggccatgcgtgacaaagag gtgctgcaggactacagattcatgccggagccgaatctaccaccccttcatgtaaaccta aagcctggatcaatgtcaacagaggatttactttcagtggctgctctaagcgaggaaatt ccagaattaccagaggacaccaggcaacgtttggtggagcagcacaacctgaatgcggaa actgccatcattttagtgctaatcgaactgcacagcctgcagcaaatctgcagtccggac gagatcgagaacctgtgccagctggctatcgccaaccaggccaaggcagtcgaacagtac cagaaaggcaaggcaaaagctttgttcgcgatcgcaggcgaagttgccaaactgtcgtcc caaaaggccaacatgaagctggtagtgcagcgtctggaaaagctgctaaaacccaccaat aagtaa >AE003746|GENSCAN_predicted_peptide_20|1139_aa MVIEKIIGDLESNMTLENEEAKRKLVELLSQSESSPVSVNMPPIPTYHFPTDKEQWVVKF MLDYFFTTGSQRILEVLVKAQAPHDGYIFDKLDDCLKQSQHRVQSLQVFCFIVRHHPTWL YKIEKHRLIKSVFKLMTHEKEIVPLMSALLCIITLLPIIPNSVPNFLNDLFEVFGHLASW KLQNSNKLPDEKLVHLQLGLQMLFHRLYGMYPCSFIAYLVEFIKRGNGGGIFQHTIKPLL NTVRVHPMLVTATPETEVNNTRWKEMEPHDVVMECANLSLPVLLPETSNEDGSYAYPMTP GYSRMTSNTSNTDYSYQLREFQQSRNVYTRFDSFASGDDVGPIWSPHNEIATTSSGIPLT PTTSFILPLQPAMNSQLMVGMTGSSPPEAAVEATPETTPLKDMRDIKQPGRAVNSHAVRA IFAVSQPSSPMRKDQQSQFSFPDVSREAEESSHSYLEVNRGTAYDRRLSQVIQDRHNVER 
SVNTPCPSSLPEINSDLSLVGGSVYPSVTQEVAAVCGECNETDRNLCSVGGLHMPTSRSM HQLAKKRRNRMASYSGNGSCADSRSSAAKKASWSTEAENPMRRTKSCSALSGMRQQHLEE NDDEADCSSQRQRGENGNTQKTGSRLQRSGRNLAISAPKDTARSCTHASTQTVEGLDSAP AQYENWLIELLLECKEQRIDYERNLLYPQDILDEYIKHAIKANESFDAEQGQLMCLQLEY ESYRRSIHAERNRRLMGRSRDKRSLEMERDRLREQLKNFDAKNKDLANKMDQAIRLANER QNIHQEELGEMRAKYQHELEEKKCLRQANDDLQTRLTSELARHKEMNYELESLRGQVFSL GTELQHTQQQADIGLQCKQELARLEAEFIIMGEVQVRCRDRLAEIDNFRARDEELQMLQE SSNLELKDLRHSLDEKTSQLESMKHKISDLQAQLANSEKAMTEQKRLLSTVKDEYEEKFK SVNKKYDVQKKIIMQMEEKLMMMMQQPQGTTGHNTCSPDTDRTGECKSCFAAKEYLLLHS PTDIASSIERNSPLSTSLASSESLSASLRSTELKNLHQLVDTPTIPDVLNSMAGGAQFED EVRPPAVDLASSASTASAINIVPHALDLPSTSGGIGHTLTHPHPHPHLHLQQQQQDQLQ >AE003746|GENSCAN_predicted_CDS_20|3420_bp atggtgattgagaagatcattggtgacctggagtccaacatgacgctggagaacgaggag gccaagcgcaagcttgtggagttgctatcccagagtgagtcctcgccagtttcagttaat atgccacctattccaacataccatttccccacagacaaggagcagtgggtggtaaagttc atgctggactacttctttacaactggatctcagcgcattttggaggtactggtcaaagcc caggcacctcacgatgggtacatctttgacaagctggacgactgcctaaagcagtcgcag caccgagtgcagagcctccaggtgttctgcttcattgtgcgccatcaccctacttggctg tacaagatcgagaaacaccggctgatcaaaagtgtttttaagcttatgacgcacgagaag gagatagttccgctgatgagcgccctgttgtgcataattactctgctgccgatcataccg aattctgtgcccaactttcttaacgatctgtttgaggtgttcgggcatttggcctcgtgg aagctgcagaatagcaataaactgccggacgagaagctcgtccacctgcagttgggtcta cagatgctatttcaccgcctgtacggcatgtatccgtgcagctttattgcctatttagtg gagttcatcaagcgaggcaacggcgggggcatcttccagcatacaatcaagccgctgttg aacactgtgcgagtgcatcccatgctggtgacggccacgccagagactgaggtaaacaat acgcgatggaaggagatggagccgcatgacgtggttatggagtgcgccaacctatcgctg cccgtcctcttgcccgagacgagcaacgaagacggcagctatgcgtatcccatgacgcca ggatacagtcgcatgacttcaaatacctcgaatacggactacagctatcagctgagggag tttcagcaatcgagaaatgtctacacccgcttcgattcgtttgcctcgggtgatgatgtg ggtccgatctggagtccgcataacgagattgccacgaccagtagcggcataccgcttaca cccaccacatcgtttattttgccactacaaccggctatgaactctcagcttatggttggc atgactggctcctcaccgcccgaagcagctgtggaggccacaccggaaaccaccccctta aaggatatgagggatatcaagcagccgggacgtgcggtcaattcgcatgctgtgagagct 
atcttcgccgtaagccagccttcttcacccatgcgcaaggaccagcagagtcagttcagt ttcccggatgtctctcgcgaggcggaggagagcagccactcatatctggaggtcaacaga ggaactgcctatgaccgtcgcctgtcgcaggtcatccaggacaggcataacgtggagcga tctgtaaacacaccttgtccaagcagcctgccagaaattaactccgatttatcccttgtt ggtggttccgtctatccatctgtcacgcaggaggtcgctgcagtttgtggcgagtgcaac gagacggataggaacctctgcagtgtgggtggacttcatatgcccaccagccgatccatg caccagctggcaaagaagcgccgcaatcgtatggcaagctacagtggaaatggttcctgt gcggacagcagaagttcggcggcaaagaaggcaagttggagtactgaggcagagaaccca atgcgacgaaccaaatcctgctcggccctttccggaatgcggcagcagcatctggaggag aatgatgacgaggccgattgttcgagccaaagacaaagaggggagaatggaaatacgcaa aagactggcagccgcctgcagaggagcggccggaacctggccatttcggcgcccaaggat acggctagaagctgcacccatgcctccacccagacggtggaaggactggacagtgctcca gcgcagtacgagaattggcttattgaactcctgctggagtgcaaggagcaaagaatcgac tatgaaaggaaccttctgtacccgcaagatattctagacgaatacattaagcatgcgatc aaggccaatgagtcctttgacgccgagcagggtcaactgatgtgcctacagctggaatac gaaagctaccgtcgatccattcacgcagagcgcaatcgacgactcatggggcgaagcagg gacaagcgcagcctggaaatggagcgggatcggttaagggagcagcttaagaacttcgat gcgaagaacaaggatctggcaaacaaaatggatcaggccattcggttggccaacgagcgc cagaacatccaccaggaggagctgggcgaaatgagggctaagtaccagcacgaactggag gaaaagaagtgcctgcggcaggcaaacgatgacctgcagacgcgactcaccagcgagttg gcgcgccacaaggagatgaactatgaactggagtctctgcgaggtcaggtcttcagtttg ggaaccgagcttcaacacacccagcagcaggcggacattgggctgcagtgcaagcaggag ctggcacgactggaggccgagtttattatcatgggtgaggtgcaagtgcgttgccgcgac cgtctggctgagatcgataacttcagggcccgcgacgaggaactgcagatgctacaagag agcagcaacctggaactgaaggatctgaggcacagcctggacgagaagacatcacagctg gaaagcatgaagcacaagatcagcgacctgcaggcccagctagccaacagcgagaaggcc atgacggagcaaaagcgacttctaagcaccgtcaaggatgagtacgaagaaaagtttaag tccgtgaacaagaagtatgacgtgcaaaagaagataattatgcagatggaggagaagctg atgatgatgatgcagcagccgcaaggaacaacaggtcataacacctgttccccggacacg gacagaactggtgagtgcaagagctgcttcgcagctaaagaatacttactattgcattct cccacagacatagcttcatccattgaacgcaactcaccgctatccacgtcgctggcctcg agcgagagcttatccgccagcctacgctccacggagctgaagaacctgcaccagctagtg 
gacacgcccactattccggatgtgctgaacagcatggccggtggcgctcagttcgaggac gaagtgcgtccgccggccgtggatctggcctcctcggcaagcaccgccagtgccatcaac atcgtgccgcacgccttggacttgccgtcgacctccggcggcatcggtcacacgctcacc cacccacatccgcatccgcacctgcacctgcagcaacagcaacaggatcaactgcagtag >AE003746|GENSCAN_predicted_peptide_21|1398_aa MEEFEQEPFEVGEFIERLTWRTNNELQNSEDFHPVALHDTFIQTIKDLKILQEKQQSKCE RLEESLRQEKESHAKKIAKLQERHQTAIDVFGQLDEKINSVAGKIMHLGEQLENVNTPRS RSVEAQKLLNFMSEFLAAGPVIVNDIFADAARLSEAADVIQKLYAISQDLPPGNFAESKR KIEKKYDEVERRLIEEFATAQKSEDIERMKTLAQILSQFKGYTQCVDAYIEQSQMQPYSG KDIFIGIVPLCKHHYEIIQKVFANPQQVMSKFILNIYQLKLHQYAMTKLEDKKDEEKYLR TLYELYSRTLKLSTDLQIYMSTIDDDLLQKLTQQIFIKHLAGYAEMETKCLTAKCSTELE KFYASKKHQKTATTKGFRRNMEVLIATRANINIAAIEDYGGETFLSEELAINMLQEAKAS LKRCRLLSNETELPGNAIKLNDILLRFLMHEHVDYALELGLQAVPLAEGRVFPQLYFFDV VQKTNIIVHLLDKLCHTSVIPCVSNTPKYSDYVFKKRILMEQIETKLDQGLDRSISAVIG WVKVYLQYEQKKTDYKPETDVDTISSAACLQVVQNLQPVIVQIKKCVDGENLQNVLTEFG TRLHRVIYDHLQTMQFNTAGAMCAICDVNEYRKCIRELDSPLVTQLFDILHALCNLLLVK PQNLQEVCTGDTLNYLDKSVVRQFIQLRTDFRIIKNTNYLKGSLYQVSYLDIESCTTLPC SMARNATIKVTVRFDDNGNGVSFLKHEVRWVFNYIKTQAAITPDPCDGDHGCIESASGGK AYWANIFVNETLPVINSSQERKSEQCGELKIKVATSDQRRRSDCVGVRDHGSRGVEPVHP KVFPLPLQSPRIGAFQMGSMQVALLALLVLGQLFPSAVANGSSSYSSTSTSASNQLQRQK LAHWFRDSNDVKDKILELQCLAKCGSNPTTKAGREQCLNKCIQELLLGPRAGSCPKIGRQ SRARLSCLDNCQYDHECPEVQKCCPSSCGPMCVEPLGVRNNTQLPPIPKILYFRRSRGHA VDLKIESSLLVYYFHVEVRSHIGRHFAARKLGPWQWQKVEKTMEENIGHSKHTYIFFHMR PGRWYEVRVAAVNAYGFRGYSEPSDPFPSTGNPKPPKSPNDSKIIGKQFDGRYMTLKLVW CPSKSNLPVEKYKITWSLYVNSAKASMITNSSYVKDTHQFEIKELLPNSSYYIQVQAISY LGSRRLKSEQWSMLFNTTLQPLEPITPLQCSGNGNRRRHHHTSSSMSSERATTSEPVALN EVSPTITNRTSAAATYEVGFRLNRKFGMIVQILGFQPHKEKVYELCPQETNCEQREFRAI RAKVSNTLDDALMGATSHMSEYFVPIGRTTGQGQISGTSLTMGPNSVLDDSRNVFTFTTP KCENFRKRFPKLQIKCSD >AE003746|GENSCAN_predicted_CDS_21|4197_bp atggaggagtttgaacaggaaccctttgaggtgggcgagttcatagagcgtctgacttgg cgcaccaacaatgaactccagaacagcgaggactttcatcctgtggccctccacgatacc ttcatccagaccattaaggacctaaagatcctgcaggagaagcaacaaagcaagtgcgaa cggctagaggagtcactgcgccaggagaaggagtcgcacgccaaaaagattgccaagctc 
caagaacgccaccaaacggccattgatgtgttcggccagctggacgaaaaaatcaattcg gtggccggcaagatcatgcacctgggcgaacagttggagaatgtgaacactccacgcagt cgttcagtggaggcccagaagttactcaattttatgtccgagtttttagccgctggccct gtgattgtcaacgatatttttgcggatgccgcaagattaagtgaggccgcagatgtgata caaaagctctacgctatctcgcaggatctgccgcccggaaactttgcggaatccaaaaga aaaatcgaaaagaaatacgatgaagtcgagcggcggttgatcgaagagtttgccaccgcc cagaagagcgaggacatcgagcgcatgaagacactggcccagatcttgtcccagttcaag ggttacactcagtgtgtagacgcatacatcgagcagagccaaatgcaaccgtacagtggc aaagacatatttataggcattgtaccgctatgcaagcatcactacgagataatccagaag gtgtttgccaatccgcagcaggttatgtccaagttcatacttaacatatatcaactgaag cttcaccagtacgccatgactaagttggaggacaaaaaagacgaggagaaatatcttcgc accctttatgagttatactcgcgcacactaaaattgtcaacagatcttcaaatctacatg tccacgatcgacgacgacttgttacagaagctgacacagcagatttttataaagcatttg gccggctacgccgaaatggagaccaaatgcctcacagccaagtgttccacagagctggaa aagttctatgccagcaagaaacaccaaaagactgcaactactaagggctttcggagaaac atggaggtgctgatagccacgcgggctaacataaatattgctgccatcgaggactatggc ggggagacgttcctatccgaagagttggccatcaacatgctgcaggaggccaaggcgtcg cttaagcgctgtcgcctgctgtccaacgagaccgaactaccaggcaatgctataaagcta aacgatatccttcttcgtttcttgatgcacgagcacgtagattacgcactggagttggga ctgcaagcagtgccactggccgagggcagggtctttccccagctctacttctttgatgtg gtgcaaaagacgaacatcattgtccatctactggacaagctgtgccacacatctgtcata ccctgtgtgagtaatacacccaagtactcggactatgtgttcaaaaaacgtattctgatg gagcaaatcgagacaaagctggaccagggtcttgatcgctctattagcgctgttattggc tgggtcaaggtatatttgcaatatgaacaaaagaaaacggactacaagccggaaacagat gtggatacaatatcttcagcggcctgcttacaagtcgttcagaatctgcagcccgtgatt gtgcagattaaaaaatgtgttgatggagagaatttgcagaatgttcttacagaatttgga actcggttgcaccgagtaatctacgatcacctgcagaccatgcagttcaacacggctggc gccatgtgtgccatctgcgacgtgaacgagtatcgcaagtgcattcgcgagctggatagt ccactggtcacgcagctgtttgacatactgcatgcattgtgcaatttactacttgttaag ccccaaaaccttcaagaagtttgcacgggtgacactctgaattatctggacaagtcggtg gtgcggcaattcattcagctgcgcactgatttcaggatcatcaagaacaccaactatctg aagggttctttgtaccaggtcagctacctggacatcgagagctgtacgacactgccctgc 
tccatggcccggaacgcgacaattaaggttactgtgcgcttcgatgataatggcaatggc gtcagctttctgaagcacgaagtccgatgggtgtttaactacatcaagacccaggcggcc ataactcccgatccctgcgacggagatcacggatgcatagagagcgcaagtggtggaaag gcctattgggccaatatctttgtgaacgaaactttgccggtgatcaacagttcccaggag cggaaatcggaacagtgcggcgaattaaagattaaagtggcaacaagcgatcaaagaagg cgctccgactgcgtgggagtacgagatcacggatcacgaggcgtggagccagtgcatccg aaagtgtttccattaccattgcaatcaccgaggatcggagcgtttcagatgggcagcatg caagtggcgctgctggcgctgcttgttctcggccagctattcccaagcgccgtggccaat ggatcctcctcctatagttccacctccacatccgcatcgaatcagctgcagcgacagaag ctggcacactggttccgggatagcaatgatgttaaggataagatcctggagctgcaatgc ctggcgaagtgtggcagcaatcccacaaccaaagctggacgggaacagtgcctgaacaag tgcatccaggagcttttgctgggacccagagccggcagttgccccaaaattggaaggcaa tcgcgtgccagactctcctgcctggacaactgtcagtacgatcatgaatgcccagaggtg cagaagtgttgtccctccagttgcggacccatgtgcgtggaacctctcggcgttaggaac aacacacagcttccgcccataccgaagattttgtatttccggagatcgcgaggtcatgct gtcgatctgaagatcgagtcctcgctactggtctactacttccatgtggaggtaagatcc cacataggacggcattttgcagccagaaaactgggtccttggcaatggcagaaggtggag aagaccatggaggagaacatcggacacagcaagcatacttacatcttctttcacatgcga cctggtcggtggtatgaggttcgagtggcagccgtaaacgcctacgggttccgtggatat tccgagccaagcgatccctttccctcgacgggcaacccaaagcccccaaagtctccgaac gattcgaagatcatcggcaagcagttcgatggacgctacatgacccttaagctggtgtgg tgcccctccaagtccaacctgcctgtcgagaagtacaagatcacctggtcattgtacgta aacagtgccaaggcctcgatgattacgaacagctcctacgttaaggatacacaccagttc gaaatcaaggaactgctacccaactcctcgtactacatccaagtgcaggccatatcctac ctgggttcgcgtcgcctcaagtccgagcagtggtcgatgctgttcaacacgacgctgcaa cctctggagccaattacaccgcttcagtgctccgggaatggcaataggcgacggcatcat cacactagcagctctatgagctcggaacgggccacaacctcggagccagttgccctcaat gaagtttcgcccaccattacaaaccggacatcggccgccgccacgtatgaagtgggtttc cggttaaaccggaagttcggcatgattgtgcagattctgggcttccagccacacaaggag aaggtctatgaactgtgtccccaggagacgaactgcgagcagcgagagttccgcgcgatt cgcgccaaagtaagtaatacccttgatgatgctctgatgggagccacgagccacatgtcg gaatacttcgtgccgattggccggaccaccgggcaagggcaaattagtggaacatcgttg 
accatgggtcccaattccgtgttggacgactccagaaatgtctttaccttcaccacgcct aaatgtgaaaatttccgcaagagatttcccaagctgcagatcaagtgcagcgactag >AE003746|GENSCAN_predicted_peptide_22|464_aa MLSLRSVLKHCLSAKKTCSRNISALYITGDKANENYVTLQPYMDFNKTFGERQFLEQSIS SRGLDIRLETVLSKYEKYKTHHAQLSKVAEERERVTKRLKELTKSGSSAVQLEELKEHGK SLRNELKALKQTLYPIEDDFIHDYLHLPNLLHVQCPVGGEEKLLYRHGIPKSENKTTSHL ARQELVHFVDNNRYYLMEQAALFDVNAMQSLARYFVNHGHFIQTANPDFVRCVLLEANAT PLSDYHLVQEEHLQNKINTAYLTGGASFESYLGAMTKLCVYPSVLPLRYVCCGRSYNRAE ADLYGPIPSLYTATQTNAVQIFVATQTDNEADSQLEHILNLATDFYKALDIPFRISYATA ADLTPAESIRAVIEVYAPSLQRYVCVGRISNYGDFVSKRILFSTRREKHYDFLHMVGGPV LYTSRLIAALVEHGVRLEDCKLLGSISQKPVHQQDLQQFKDLFT >AE003746|GENSCAN_predicted_CDS_22|1395_bp atgttgagcctgcgaagtgtattaaaacactgcctttcggcaaagaaaacgtgtagcaga aacatctccgcgctgtacataaccggcgataaagcgaacgaaaactatgtgactctgcag ccgtacatggacttcaataaaacctttggagagcggcaatttttggagcagagcatctcc agccggggattggacattcgtttggaaactgtgctcagcaagtacgagaagtataaaaca catcacgctcagctgtccaaggtggcggaggaacgcgaaagggtgaccaagcgcctaaag gagctaacgaaatctggtagtagtgcggttcagctggaggagttgaaagagcatgggaaa tcgttgcgcaatgagctaaaggctctgaagcagactctctatcccatagaggatgacttt attcatgactatctgcacttgcccaaccttctgcatgtccagtgccctgttggaggggag gaaaagcttctctatcgtcatgggatacccaaatcagaaaataagactacatcccacttg gcacgccaggaacttgtacattttgtagacaacaatcgttactacttgatggagcaggct gctctttttgacgtaaatgccatgcaatccctggcccgctatttcgtcaaccacgggcac ttcattcaaaccgccaatccggactttgtgcgttgcgttctcctggaagcaaatgcgaca cccttatccgactatcatctggttcaggaagagcacctgcaaaacaagatcaataccgcc tatttgactggaggtgcgtccttcgagagctaccttggtgccatgaccaagctgtgcgtg tatccctcggtgctgccccttcggtatgtctgctgtggtcggagctacaacagagcggaa gcggatttgtatgggcccattcctagtctttatacggcgacacaaacgaatgcggtacaa atctttgtggcaacgcaaacggacaacgaagcggactcccaactagagcatattctcaat ctggcaactgatttctacaaagcactggacataccattcaggatttcttatgcaacagca gcagatctaactccggcggagagtatccgggcagtcatcgaagtttatgccccctcactg cagcgctatgtgtgtgtgggacgtatcagcaactacggagactttgtctctaagcgaatc ctttttagcacgcggcgggagaagcactatgatttcttgcacatggtgggcggaccagtg 
ctctacacctcgcgactgatagcagctcttgttgagcatggcgtgcgcttggaagactgc aaacttttgggctctattagtcagaaacccgttcatcagcaggatctacagcagtttaag gaccttttcacgtaa >AE003746|GENSCAN_predicted_peptide_23|356_aa MPEFVRVAINESLWEFPDIYEFVRFLGGGSFGQVAKVRLRGTENYFAMKRLMRPFEREED AKGTYREIRLLKHMNHRNVISLLNVFHPPAHNMMEFQQVYLVTHLMDADLHRYSRSKRMS DQEIRIILYQILRGLKYIHSAGVVHRDLKPCNIAVNGNSEVRILDFGLSRMCADKMTDHV GTMWYLAPEIIFLRGQYTKAIDVWSVGCILAELITDRVLFRGENYVSQIRCLINIMGTPT REFITGISMERSRNYLEGYPLRQRCDFHHLFMGYDVQAIDLMEKMLEMVPEKRITAAEAM LHPYLRDLIEPHHHAEDTAPVYDQNFENMVLPVKCWKELVSHEIRNFRPDQLDLHF >AE003746|GENSCAN_predicted_CDS_23|1071_bp atgccggagttcgtgagagtggcaattaacgaaagcctttgggagttcccggatatatac gagttcgttcgttttctgggtggcggttcctttggccaggtggctaaggtgagactacga ggcactgaaaattacttcgctatgaaaaggcttatgcgcccattcgagagggaggaggat gctaagggtacctatcgcgagatccgtctgctgaagcacatgaatcatcgaaatgttatc agcctgctgaacgtcttccatccaccagcgcacaacatgatggaatttcagcaggtttat ttggtgactcacctgatggacgcggatttgcacaggtactcgcgttcgaaaaggatgagc gatcaagagattaggataatcctttaccaaatactgcggggactgaagtacatacatagt gccggggttgttcatcgagacttaaagccctgcaatatcgcagttaatggaaatagcgag gtgcgcatacttgacttcggtttatcccgtatgtgcgcagacaaaatgacggaccatgtt ggaactatgtggtatctagctccggaaattatctttttaaggggtcaatacacaaaggca attgacgtgtggtcggttggttgcattctggcggaacttatcacggatcgtgtcctgttt cgcggtgaaaactatgtaagccaaatacgatgtctgattaacataatgggtactccgacg agggagtttatcaccgggataagcatggaacgttcgcgtaattacctggaggggtacccg ttacgacaaaggtgcgattttcatcacctgtttatgggttacgatgtccaggccatcgat ttgatggaaaaaatgctcgaaatggtacccgaaaaacgtatcacagctgcggaagcaatg ctccatccataccttcgggatcttattgagccacaccatcatgccgaagacaccgcacca gtctatgatcagaacttcgaaaacatggtactgcctgtaaaatgctggaaagaacttgtt tcccacgagattcggaacttcagacctgaccaacttgatttgcatttttaa >AE003746|GENSCAN_predicted_peptide_24|366_aa MSVSITKKFYKLDINRTEWEIPDIYQDLQPVGSGAYGQVSKAVVRGTNMHVAIKKLARPF QSAVHAKRTYRELRLLKHMDHENVIGLLDIFHPHPANGSLENFQQVYLVTHLMDADLNNI IRMQHLSDDHVQFLVYQILRGLKYIHSAGVIHRDLKPSNIAVNEDCELRILDFGLARPTE NEMTGYVATRWYRAPEIMLNWMHYDQTVDIWSVGCIMAELITRRTLFPGTDHIHQLNLIM 
EMLGTPPAEFLKKISSESARSYIQSLPPMKGRSFKNVFKNANPLAIDLLEKMLELDAEKR ITAEEALSHPYLEKYAEPSVEQTSPPYDHSFEDMDLPVDKWKELIYKEVTNFKPPPSYAQ VLKDVK >AE003746|GENSCAN_predicted_CDS_24|1101_bp atgtcagtgtccattacaaaaaagttttacaagttggatataaatcgaacggaatgggag atcccggatatataccaggatctgcagcccgtgggatcgggagcttacggacaggtgtca aaggcagttgttcgtggcaccaatatgcatgtggccattaaaaagcttgccaggcctttt caatcagctgtccatgcaaagaggacgtaccgggagcttcgacttttaaagcatatggat catgagaacgtaatcggtctgctggacatattccatccacatcccgctaatggatcgctg gagaacttccaacaggtgtacttggttacccacttgatggacgcagatctgaacaacatc atacggatgcagcacttgtccgacgaccacgtccagtttttagtctaccagatactccgt ggcttgaagtatatccacagcgccggagtgatccaccgtgatcttaagccctcaaacatt gccgtcaacgaggattgcgagctgcgcattctagacttcgggctggcccgcccaacggag aacgagatgacaggctatgtggccacgcgttggtaccgggcacctgaaataatgctcaat tggatgcactacgaccaaacagtggacatctggtcggtgggctgcatcatggccgaacta attaccagacgaaccctcttcccaggcaccgaccatattcaccagctaaacctgattatg gagatgttgggcacgccacccgccgaatttttgaagaagatctcatcggaaagtgcacgt tcctacatccagtcacttccgcctatgaagggacgaagttttaaaaatgtttttaagaac gccaatccgctggccattgatttgctggaaaagatgttggagctagatgccgaaaagcgg atcacagccgaggaggctctttcccatccatatctggagaagtatgcggagcccagcgtc gagcagacctcaccaccatacgatcacagcttcgaggatatggatttgcccgtagacaaa tggaaggaattgatctacaaggaggtcaccaactttaagcccccaccatcgtatgctcag gttctaaaggatgtaaagtga >AE003746|GENSCAN_predicted_peptide_25|544_aa MTSKLLPGNIVYGGPVTERQAQDSRSLGQYILDKYKSFGDRTVLVDAVNGVEYSASFMHK SIVRLAYILQKLGVKQNDVVGLSSENSVNFALAMFAGLAVGATVAPLNVTYSDREVDHAI NLSKPKIIFASKITIDRVAKVASKNKFVKGIIALSGTSKKFKNIYDLKELMEDEKFKTQP DFTSPAANKDEDVSLIVCSSGTTGLPKGVQLTQMNLLATLDSQIQPTVIPMEEVTLLTVI PWFHAFGCLTLITTACVGARLVYLPKFEEKLFLSAIEKYRVMMAFMVPPLMVFLAKHPIV DKYDLSSLMVLLCGAAPLSRETEDQIKERIGVPFIRQGYGLSESTLSVLVQNDEFCKPGS VGVLKVGIYAKVIDPDTGKLLGANERGELCFKGDGIMKGYIGDTKSTQTAIKDGWLHTGD IGYYDDDFEFFIVDRIKELIKYKGYQVPPAEIEALLLTNDKIKDAAVIGKPDEEAGELPL AFVVKQANVQLTENEVIQFVNDNASPAKRLRGGVIFVDEIPKNPSGKILRRILREMLKKQ KSKL >AE003746|GENSCAN_predicted_CDS_25|1635_bp atgacttcaaagctactgcccggaaacattgtgtacggaggtcctgtgactgaacgacag 
gcccaggatagcagatccttgggtcagtacatcctcgacaagtacaagagctttggcgac cggacggtgctggtggatgccgtcaatggagtggagtactctgccagtttcatgcacaag tccattgtacggctggcatacatccttcaaaaactgggagtcaaacagaatgacgtcgtt ggtttgtctagcgaaaacagcgtcaacttcgccctggccatgttcgctggtctagcagtt ggggctacggttgctccccttaacgtaacatattccgatcgtgaggtggaccacgccatt aacttgtccaagccaaagatcatattcgcctctaagattaccattgatcgtgttgccaaa gtggccagcaagaataagttcgtcaagggcatcattgcgctcagtggaacttccaagaaa tttaagaacatctatgatcttaaggagctgatggaggacgagaagttcaagacacagcct gacttcacgagccctgcggccaataaggacgaggacgtgtctcttattgtgtgctcttct ggaaccaccggacttcctaaaggagtgcagctgacccaaatgaacctgctggccactctc gactcacaaatccaacccactgtcattccaatggaggaggtcactctactcactgtcatt ccctggttccacgccttcggctgtctgacgcttatcaccaccgcctgcgttggcgcacga ttggtatacctgcccaagttcgaggaaaagctcttcctttctgccattgaaaagtatcgc gtgatgatggccttcatggtgccaccactgatggtctttttggctaaacaccccatcgtg gataagtacgatttgtcctctttgatggtcctgctgtgtggagcagctccactcagtcgc gaaactgaggatcagatcaaggagcgtattggagtgccattcatccgacagggatacggc ctcagcgaatcaacgctgagtgttctggtgcagaacgatgagttctgcaagccaggcagt gtgggcgttcttaaggtgggaatctatgccaaggtgatcgatcccgacaccggcaagcta ttgggggccaacgagcgcggcgagctttgttttaaaggcgacggcatcatgaagggctac atcggagatacgaagtccacgcagaccgccatcaaggacggttggttgcatactggcgat attggctactatgatgatgattttgagttcttcatcgtggaccgcatcaaggagctgatc aaatacaagggataccaggtgccgccggcagagattgaggctctgctgctcaccaacgac aagattaaggatgcggcggtcattggaaagccagacgaggaggctggcgagctgccgctg gcatttgtggtcaaacaggctaatgttcaactgaccgagaacgaagtcattcagtttgtc aacgacaacgcctcgcccgccaagcgtctaaggggtggcgtgatctttgttgacgaaatt ccaaagaaccccagtggcaagattctgcgtcgcattctgcgggaaatgcttaagaagcaa aaatccaagttgtaa >AE003746|GENSCAN_predicted_peptide_26|1082_aa MEQEIGTWDSVLLENLSEDSFINNIHQRYKRDHIYTYIGTSVVALNPYHHISEHSLDNVR NYGDKGIFQLPPHIYGLTNLAYQSLKDQSEDQCVLLTGESGAGKTETFKMIVNFLTHIQD RSHCPPTPNVLRKQSSTSSASGLVMHAHRRASSSCSGTANFIICKNRAENPSGSVSRRQS PSPGPSQRSRTRAESIERQSRRHMREKIVDFDFSHHKSSENISGLPESHAHHMHPTKSCF KHQQTQVSACTAMPAAAKGSPKYAVPTVYGGCRQCGHSKCVRAQSLEKEERDDLRGSNCR 
LSTIATATTNPAHPHRGSCSNLMRQHSTESQPERERDRSSLMGSTQRISLYDAHKLSKVL GDLPPPPPSSVSPTPSASSSLHRRHKSPTQRMRECVTCADVFLEAMGNACTLKNNNSSRY TRITDPIIGERNFHIFYQLLLGADLQLLKSLKLYRNVEKYELLRNTTAMEEDRMNFHYTK NFVFHVLRSEQELYIREGLEWSRIDYFDNESICELIDKPSYGILSLINEPHLNSNDALLL RVQQCCAGHPNFMTTGSNSMCFQIRHYASVVNYSIHRFLEKNSDMLPKYISAAFYQSKLS LVQSLFPEGNPRRQVTKKPSTLSSNIRTQLQTLLAIVKHRRSHYVFCIKPNEGKQPHQFD MALVQHQVRYMSLMPLVHLCRTGHCYHLLHVKFFHRYKLLNSLTWPHFHGGSQVEGIALI IRNLPLPSAEFTIGTKNVFVRSPRTVYELEQFRRLRISELAVLIQTMFRMYHARKRFQRM RHSQMIISSAWRTWRAREEYRSLKYKRQVRWAIDIIGRYYRQWKIRQFLLTIPLRLPPNT LSPLSTEWPVAPAFLADASRHLRSIYHRWKCYIYRNSFDQTARNRMREKVTASIIFKDRK ASYGRSVGHPFVGDYVRLRHNQQWKKICAETNDQYVVFADIINKIARSSGKFVPILLVLS TSSLLLLDQRTLQIKYRVPASEIYRMSLSPYLDDIAVFHSEFGRKKGDFVFQTGHVIEIV TKMFLVIQNATGKPPEIHISTEFEANFGQQTVIFSFKYGGMSDLAQGPPKVTRKANRMEI IV >AE003746|GENSCAN_predicted_CDS_26|3249_bp atggagcaggaaatcggcacctgggactcggtactgttggagaacctgtccgaggatagt ttcataaacaacatccaccagcgctataagcgcgatcacatatatacctacattggaaca tctgttgtggctctgaatccatatcatcacatatccgagcactctctggacaatgtccgc aactatggcgataagggcattttccagctgccgccccacatatatggtctcacaaatctg gcttatcaatcgctcaaagatcagagcgaggatcagtgtgttctgctcaccggtgagagc ggagcgggcaaaacggagacttttaaaatgatcgtgaactttctgacccacatacaagat cgctcccactgccccccaacaccgaatgttttgcgcaagcaatcctcaactagctcggcc agcggattggtgatgcacgcccacaggcgagcctccagcagctgctccggcactgccaat tttattatatgcaaaaaccgggcggaaaatccgtcaggcagtgtttcacggcgacaaagt ccatcgccaggaccatcgcagcgatcgcggacgcgggccgagagcatcgagcgccaaagc aggcgccacatgcgggagaaaattgtcgactttgatttctcacaccacaagagtagcgaa aacatcagcggccttcctgaatcgcacgcccaccacatgcatccgaccaagtcgtgcttc aagcaccagcagacccaagtcagcgcctgcactgcaatgcccgcagcagccaagggatcg cccaaatacgcggtacccaccgtgtacggcggttgccgccagtgcggacacagcaagtgt gtccgtgctcagagcctggaaaaggaggagcgggatgatctacgaggcagcaactgccgc ctgtctaccatagccactgccaccaccaatccggcacatccgcatcgcggtagttgctcc aacctgatgcgccagcactcgacagagagtcagccggagcgggagcgggatcgaagctcc cttatggggtccacgcaacgtatatcgctgtacgatgcacacaagctgagtaaggttctg ggcgatctgccgccgcctccgcctagttctgtttcgcccacgccatcggccagctcttcg 
ctgcacaggcgtcataaatcgcccacgcaacgaatgcgagagtgcgtcacctgtgcggat gtgttcctggaggccatgggcaatgcctgcaccctgaaaaataataactctagtcgatat acccgtataacggaccctatcattggagagcgaaattttcacatcttttaccaattacta ttaggagctgatctccagttgctaaaatcgctcaagctgtatcgaaatgtggaaaagtac gagctgctccgcaacacaactgccatggaggaggaccgcatgaattttcattatacgaag aattttgtgtttcatgtgctgcgttcggagcaagagctctatattcgcgagggattggaa tggtctcgcattgactatttcgacaacgagtctatttgcgagttaatagacaaacccagc tatggtatattgagcttgattaatgaaccccatttaaatagcaacgacgctttgcttttg cgagttcagcaatgttgtgcggggcatcccaactttatgaccaccggcagcaattccatg tgctttcagattcgtcattatgcaagtgtagtgaactactcaatacatcggtttctcgaa aagaactccgacatgctgccgaagtacataagcgctgccttttatcagagcaaactttct ttggtgcaaagcctattccccgaggggaatccccgtcgacaggttaccaaaaagcccagc acgttgagttcgaatatccgcacccaattgcagacgctgctggccatcgttaagcatcgc cgctcccactatgtgttctgtattaagcccaacgagggcaagcagccgcaccagttcgat atggctctagtgcaacatcaggtgcgctacatgtcgctcatgccgctggtccacctgtgt cgcactggccattgctaccacctgttgcacgttaagttttttcatcgctataagttgctc aacagcctgacgtggccccactttcatggcggcagtcaggtagagggtatcgccctcata atccgtaacctaccgctgccctcagcggagttcacgatcggcaccaaaaatgtgttcgtg cgtagtccccgcaccgtatatgagttggaacagtttcgccgcctgcgtattagcgagctg gccgtgcttattcaaaccatgttccgaatgtatcacgcaaggaagcgctttcagcgcatg cgacacagccagatgatcatatcgagtgcctggcgcacgtggcgggcccgcgaggagtat cggtccttgaagtacaaacgacaggtgagatgggccatcgatattataggccgctactac cgccagtggaagatcagacagttccttctgacaattcccttgcgactgccaccgaacacg ctaagcccgctctccaccgaatggccagtggctcccgcatttctggcagatgcctctcgt catcttaggtccatataccatcgttggaagtgctacatctaccgaaactcctttgatcaa acggcgcgtaatcgaatgcgggagaaggtcacagccagcattatcttcaaggatcgaaaa gcttcatatggacgaagtgtgggtcatccttttgtgggggactacgtgcgactgcgacac aaccagcagtggaaaaagatctgcgccgagaccaacgatcagtatgttgtattcgcagac ataatcaacaagatagcgcgctccagtggcaagtttgtgcccattttgctggtgctatcc acgtcatcgcttttgctgttggaccaacgaacgctgcaaattaagtacagagtgcctgca tcggagatttaccgaatgtctctgagcccctacctagatgacattgctgtgtttcactct gaatttggacggaagaagggtgatttcgtttttcaaacgggtcatgtgattgaaattgtt 
accaaaatgtttctggtcatacaaaatgccacaggcaaacccccggagatacacataagc actgaatttgaagcgaacttcggccagcagactgtcatcttttcgttcaaatacggcggc atgtcggacttagcacaaggcccacccaaggtcacacgcaaggcgaaccgcatggagata attgtgtga >AE003746|GENSCAN_predicted_peptide_27|306_aa MNTDERNFRSIYYEKCQINSVEEQKSLNKLLQDDIRNLSKLKQFCMNYTVPNNNRSYLWA LVMGILPLHKASTAYVRDQRREMYEDLRRAVTVLRFTDHKQKEQPFMWLIDHKKKAQVMH TMWLIESNRLWHGNTSASLQADDMHFIEIVRTLLQIFDDNVETYWIAKGFYKYTRELKKE CVKLKEQTQNILKREDLSLLNHLELLGLFDGNSTLLDNWYITCFAGIICTTHLVKIWDKV CGGSRKIVVFLFVELVKDIRSSILKQTSLADVKRLIETVKDLDGVIIVNKAIKSLQNNSS EVEYTH >AE003746|GENSCAN_predicted_CDS_27|921_bp atgaatactgacgagcgaaacttcaggtccatctactatgaaaaatgccaaataaacagc gtcgaggaacaaaagtcgctgaacaaactactacaggatgacatccgcaacctgagcaag ctgaaacaattctgcatgaactacacagtgccgaacaacaataggagctatctgtgggcc ctggtcatgggtattctcccgctccacaaagcgtccacggcgtacgtacgcgaccaaaga cgcgaaatgtacgaggatctgcggcgagcggtaaccgtgctccggtttaccgaccataag cagaaggagcagcccttcatgtggctgatagaccataagaagaaagcacaggtcatgcac accatgtggctgatagagtccaatcgactgtggcatggcaatactagtgccagtctccag gcggacgacatgcacttcatagagattgtgcgcacgttgctgcagatattcgacgacaac gtggaaacctactggatagcaaagggattctacaagtacacccgcgaactgaagaaggag tgcgtcaagctgaaggagcaaacgcagaatatactgaagcgcgaagatttgtccttgctt aatcacctggagcttctgggcctgtttgatggcaactcaacgctactggataactggtat ataacctgctttgctggaataatctgtacaactcatttggtcaagatatgggacaaagtc tgtggagggtcccggaagattgttgttttcctgtttgtagaactagttaaggacataagg tcctcgatactaaagcaaacatctttagcagacgtcaaaagacttatagaaacggtaaag gatcttgacggtgttattatcgtaaacaaggccatcaaatcactacaaaacaacagcagc gaagtcgagtacacgcactaa >AE003746|GENSCAN_predicted_peptide_28|161_aa MNFLKKVATEVQQLSRSGFHTSSVCCRVQSGRYRITTKRNRPLTYEMANPPHFIGHRKSW NSWNTSTMKDALRPSQTAIEDVFIRKFVTGTWHALVCSEVIIKRQHNTIRIAALIRQAIT PRKMYFLIGYTEELLSYWMQCPVTLELQTVGDKKDVVFKYI >AE003746|GENSCAN_predicted_CDS_28|486_bp atgaacttcttgaaaaaggtggcgaccgaggtgcagcagctcagccggtcaggattccac accagctccgtgtgctgtcgcgtgcaatccgggcgataccgcataaccacaaagcgaaac aggcctcttacctacgagatggccaatccgccgcattttattggccaccgcaaatcgtgg 
aactcatggaacacgtcaacaatgaaggatgccctgcgtccgtcccagaccgccatagag gatgtgttcatccgcaagtttgtcaccggcacatggcatgccctcgtctgctccgaggtc attatcaagcgacagcacaacaccattaggatcgcggcccttattcggcaggcgattaca ccgcgaaaaatgtacttccttattggttacacggaggagctgctctcctattggatgcag tgtccagtgaccttggaactgcaaacggtgggcgataaaaaggacgtggtcttcaaatac atttag >AE003746|GENSCAN_predicted_peptide_29|1067_aa MTLNESDATRLELTRNFLELSRNPETCTALRSSDCIQLLVQILHANDEGLSTAKKYASQA LHNIVHNNPEEKERQREVKMLRLLDQILDYCNFLHTQLQSGGEAIADDEDRHPLAAMKLL MKASFDEEHRQTMCELGALKAIPNLVHLDHAVHGPAAGREQCNALRSYGLMALTNLTFGD ENVHNKSYLCGQRQFMEVVIAQLNTAPDELLQVLAGVLRNLSWRADKHMKTIFNELGTVT SLARAAMQNKNENTLKAILSALWNLSAHCSTNKAEFCAVDGALAFLVGMLSYEGPSKTLK IIENAGGILRNVSSHIAVCEPYRQILRRYNCLAILLQQLKSESLTVVSNSCGTLWNLSAR CPEDQQYLIDHNAIPLLRALISSKNSMIAEGSASALKNLVNFRATQELMPNGDGGSLPLD KEAGHGGTLPRRFSSLRLSSNPTGSLKKVRPSTVSTTGFLNRKCESRESIYSGKSDSTKY STKSEGAKNPFEIVTPTEEQPIDYSMKYMEHKPNSSKTFEIDLDQPTDFSARYKERRSAQ TAQPELKSETNEIRSKELQLTKSSSATELRNSPGLVAVSAAKQKIATETETETAERPINY CEEGTPGSFSRFDSLNSLTEKPEKCMPPKTPTKTAVLPVHVDGNTPQNIDSALETPLMFS RRSSMDSLVGDDETVACEDNGSVISEYSRMQSGVISPSELPDSPTQSMPQSPRRDRKVST QNNLDTPEQKPSTVFEDKLNRFHVEHTPAAFSCATSLSNLSMMDDSNANAIRGQRGNDIN GNGDAPRSYCTEDTTAVLSKAPSNSDLSILSIPNDLNANEAQPVPAPRADVTGMDTRMPA EDAISKMRCGGNALPSYLPVSDEMSKYYVEDSPCTFSVISGLSHLTVGSAKAGPVLKLPM RTAEEAQAPKLPPRRSAVQGDAEPRLPPKKSDSLSSLSMDSDDDCNLLSQAIAAGSCRPQ PSGASTSSSLANASTSTLCRENGQSKKQVEHGDKPNYSSDDSLDDDDDDARSKSLFEQCI LSGMHKSNDALESEGEPPGQRQEISARDRFVSNQVRQIESMLAGRQH >AE003746|GENSCAN_predicted_CDS_29|3204_bp atgacgctgaacgagagcgacgctactcgcctggagctgacgcgcaacttcctggaactg tctcgaaatccggaaacatgcaccgccctgcgcagctcggactgtatccagcttctggtg cagattctgcacgccaacgacgaaggcctctccacggcgaaaaagtacgccagccaggcg ctgcacaacatcgtccacaataatccggaggagaaggagcgccagcgggaggtgaagatg ctgcgcctgctggaccagatcctcgactactgtaactttctgcacacccagttgcagagc ggcggtgaggctatcgcagatgatgaagatcgtcatccgctggcggctatgaagctcctg atgaaagccagtttcgacgaggagcaccgccagactatgtgcgaactgggagccctcaag gcgattcccaatttggtccaccttgatcatgcggtccatggaccggctgccggtagggaa 
cagtgcaacgccctcaggagctacggcctcatggccctcacgaatctcaccttcggagac gagaacgtccataacaaatcgtatctgtgcggtcagcgacagttcatggaagtggtcatt gctcaattgaacacggctccggatgaactgctacaggttcttgctggtgtgcttcgcaat ctttcgtggcgcgcggacaaacacatgaagactatctttaacgagctgggtactgtgacc tccttggctcgagcagccatgcaaaacaagaacgagaacactctcaaggccatactttca gccctttggaatctctcagcgcactgcagcaccaacaaggcggagttttgtgcagtagac ggagcactggcatttttggtcggaatgcttagctacgagggtccgagtaaaactcttaag atcatcgaaaatgcaggcggcattcttcgaaatgtatcgagccatattgcggtgtgtgag ccgtaccgacaaatcctaagacggtacaattgccttgccattctgttgcaacagttgaaa tcggagagcctaaccgtggtaagcaactcctgcggaactctgtggaatctctcggcgcgc tgtcccgaggaccagcaatacctcattgaccacaatgccatcccgctcttgcgggcccta atcagctccaagaactccatgattgcggagggcagtgcctcggctttgaaaaacctagtt aatttcagggccactcaggagcttatgcccaatggagatggtgggtcactgccactagac aaggaagctggccatggaggtacgctgccacggagattcagctcactgcgcctaagctca aatcccacgggatcgcttaagaaagtgcgaccctcaacagtcagcacaactggctttctg aacagaaaatgtgagagccgagagtccatttactcgggcaaatccgattccactaaatac tcaaccaagtcggagggagcgaagaatcccttcgaaattgtgacacccactgaagagcag cccattgactactccatgaagtacatggagcacaaacccaatagcagtaagacctttgag atcgacttggatcagccaacggatttcagtgctagatataaggagagacgatccgctcag acggcacagccggagctgaagtcggagaccaatgagattagaagtaaggagttgcaactg acaaagtcctcctcggccacggagctgcgcaatagtcctggcctggtggcggtttcagca gctaagcagaaaattgccaccgaaacggagacggaaacggcagagcgaccaataaactac tgtgaagaagggactcccggcagcttcagtcgcttcgactcgctcaacagcctcacagag aaaccggagaaatgtatgccgccaaaaactccaacgaaaactgcggttctcccggtgcac gttgacggaaatacgcctcaaaacatcgattccgcactagaaactccattaatgttttcg cgtcgcagctctatggactcattggttggcgacgacgaaacggttgcctgcgaggacaat ggatccgtgatcagtgaatacagccggatgcagagtggtgttatttccccctcagagctg cccgattcacccacccagagtatgcctcaatctccgagaagggacagaaaagtgtccact caaaataacttggacacacctgaacagaagcccagtactgtgtttgaggacaagttgaac agattccacgtggagcacacgcctgctgccttctcctgcgccaccagtctaagtaatctg agcatgatggacgactctaatgcaaacgctattcgaggacagcgtggaaatgacatcaac ggcaatggggatgctcctcgcagctattgcacagaggacactactgccgtgctttccaaa 
gcgccaagcaatagtgatctgtccattttgtccatcccgaatgatctaaatgcgaatgaa gcgcagccagtgcctgctccgcgagctgatgttaccggaatggacactcggatgccggca gaagatgcaatctctaagatgcgctgcggtggcaatgcattgcccagttatctgccagtg tcggatgaaatgagcaagtactatgtcgaggacagtccctgcacgttttcggtcatctcg ggactatcccatctcactgttggctctgctaaggctgggcctgttctgaagctgccaatg aggactgcagaagaggctcaggcacccaaacttcctcccagacgtagtgccgttcaagga gatgcggagccacgcttaccgccgaagaaaagcgactcactgagctcattgtccatggac tcggatgacgactgtaatcttctaagtcaggccattgctgcgggaagttgtcgacctcag cccagcggtgccagtaccagctccagcctggcgaacgctagtaccagcactctatgcagg gaaaatgggcagtcaaagaagcaggtggagcatggcgataagccgaactacagctcagat gactcgctagacgacgacgacgatgatgcacggtccaagtcgctatttgagcagtgcatt ctgagcggcatgcataagtccaacgacgccttggagtcggagggtgagccgccggggcag cgccaggagatcagtgcccgggatcgatttgtcagtaaccaggtgcgccagattgagtcc atgttggctgggcgtcagcactag >AE003746|GENSCAN_predicted_peptide_30|329_aa MTKPTREHCSLALLALLVAPFVVLGEDMPREPDYMKREHSLVRPFQGVGVILPHWDFLGN TMVTSNYIRLTPDLQSKSGALWNYSPVMTRNWEVHVGFKVHGKGTELFGDGFAIWYTKER MQTGPVFGSKDHFSGLAIILDTYSNHNGPHNHQHPYLSAMVNNGSWSYDHDRDGTHTQLA GCEVRFRNVEYETLVSIRYENDILSVSTDLENRNEWKNCFVVANVELPTGYHFGMSATTG DLSDNHDIHSFKFYDLDLNVNHDEIIRRSNIIPNAKTFEPPREHKEDPKPGMSNAKIFFI LLFVVVVAAAVAIFAISYFKDRNARKRFY >AE003746|GENSCAN_predicted_CDS_30|990_bp atgaccaaaccgacgcgcgaacactgttccctggccctgctggcgcttctcgtcgccccg ttcgtggtgctcggtgaggatatgcccagggagccggactacatgaagcgggagcacagt ttggtgcgtccattccaaggcgtaggcgtgatcctgccacactgggacttcctgggtaac acgatggtgaccagcaactatataagactgacgccggacttacagtccaagagcggtgca ctttggaactactcgcccgtgatgactcgcaattgggaagtgcacgttggctttaaggtg cacggcaagggaactgaactgttcggcgacggatttgccatttggtacacaaaggagcgc atgcaaaccgggccggtctttggcagcaaggatcacttctccggactggccatcattctg gacacctatagcaatcacaatggtccacacaaccaccaacatccatatctcagcgccatg gtgaacaatggcagctggagctacgaccacgatcgcgatggaacgcacactcagctggcc ggctgcgaagttcgtttccgcaatgtggagtacgagacgctggttagcattcgatacgaa aacgacattctgtcggtttccacagatctggagaaccgcaacgaatggaagaactgcttt gtagtggccaacgttgagctacccacgggctaccacttcggcatgtctgcgacgacgggt 
gatctgtccgacaatcacgatattcacagtttcaagttctatgacctggacttgaacgta aatcacgatgagattatccggcgctccaatatcataccgaatgccaagacattcgagccc ccgcgcgagcacaaagaagatcccaagccgggaatgtccaacgccaagatcttcttcatc cttctcttcgtggttgtcgtggcggctgcagtggccatcttcgccatctcctacttcaag gatcgcaacgcgcgaaaacgtttctactga >AE003746|GENSCAN_predicted_peptide_31|130_aa MPKYGDDDKSTPCIITRNLVLAFAFALQFGFVLWLVTIATLCLLVAFAAGQQYFLGQFPS RTRFGFDPVALAGPSSATQVRDPRQNRGPVVFPPSPPDAVDESSGVVVGASGYGFVPPQQ SNANLFKRTV >AE003746|GENSCAN_predicted_CDS_31|393_bp atgcccaagtacggggacgacgacaagtccactccgtgcataattactcgcaatttggtt ttggcttttgcttttgctttacaatttggttttgtgctctggttagtcaccatcgccacc ctctgcctccttgtggccttcgccgcgggccagcaatactttctgggccagttcccaagc cggactcgttttggattcgatcccgtggcgctagcgggaccctcatcggccacacaggtt cgggatcctcgacagaacaggggacccgtcgtcttccccccatccccaccagacgcagtg gacgagtccagcggcgtggttgtgggtgcctccggatacggctttgtgccgccccagcaa agtaatgcgaacctctttaagcgcactgtctga >AE003746|GENSCAN_predicted_peptide_32|52_aa MLLSVRTAAAAVAVAAAGDPSTYFSTTFQTPDTAYATYNFFGSPYTTRFRYF >AE003746|GENSCAN_predicted_CDS_32|159_bp atgctcttgtctgttcgaacagcagcggcggctgtggcagtggcggctgcgggggatccg tccacgtactttagcaccaccttccagacacccgataccgcctacgcgacgtacaacttc ttcggaagtccgtacaccacgcggtttagatacttctga >AE003746|GENSCAN_predicted_peptide_33|354_aa MEQCYNRGCGQLFDPQTNNDESCRHHPGEPFFHDAYKGWSCCNKKSVDFTEFLNIKGCTL AKHSNVKPPEPEKPVKDESDKDEVIEVRAPIREALPRPPIDSPLTVIQPTVAPALKDMVF AVKTPAAQKSSDAIEVGTTCKNNGCTYSFTGNSSDFGECTYHPGVPIFHEGMKFWSCCQK RTSDFSQFMAQKGCTYGEHKWVKENDDKKVVQCRYDWHQTATNVVMAIYAKKYDYSQSVI ELNPIRLHVNLVFPEQDNARFDLDLELRGIVNVSNASAHMYGTKVEIKLPKLEPGSWSNL NFPNKKLPVVKKSQVEEKKKQEESDEEFFDLDDIKAETSFRLSEMSMQSPNNLD >AE003746|GENSCAN_predicted_CDS_33|1065_bp atggaacaatgctataacagaggctgtggccaactcttcgatccgcagaccaacaatgat gaatcttgtcggcaccatccgggtgaacctttcttccacgacgcctataagggttggtcc tgctgcaacaagaagtcggtcgacttcaccgagttcctcaacatcaagggctgcaccttg gcaaagcactcgaatgtgaagccaccagagccggagaaacctgttaaagatgagtccgac aaggatgaggtaattgaggtgcgggcacccatccgggaagccctgccgcgtccgcccatt 
gattcgccgctaaccgttatacagcccaccgtggctcctgccctgaaagacatggttttt gcagtcaaaacgccggctgctcaaaagtccagcgacgccattgaggtgggaaccacttgc aagaataacggctgcacctactcctttacgggtaacagcagcgactttggcgagtgcacc taccatccgggcgtgccaatctttcacgagggcatgaaattctggtcctgctgccagaaa cgaacctcggacttttcccagtttatggcgcaaaagggctgtacctacggtgagcacaaa tgggtcaaggagaacgacgacaaaaaggttgtgcaatgtcgctacgactggcaccagacg gccaccaatgtggtgatggccatttatgctaaaaaatacgattacagtcagagcgtgata gagctgaaccccatcaggctgcacgtcaacctggtatttcccgagcaggacaacgccagg tttgacctggacctggaattgcgcggaattgtgaatgtgagcaatgcaagcgcgcatatg tatggcaccaaagtggagattaaactgcccaagctggagcccggctcttggtccaaccta aattttcccaacaagaaactgccagtggttaagaaaagccaggtggaagagaagaaaaag caggaggagagcgatgaggagttcttcgacctggatgacattaaagcggagaccagtttc cgtctttccgaaatgagcatgcaaagcccaaacaacttagattaa >AE003746|GENSCAN_predicted_peptide_34|262_aa MSRNYKEDQTNEVEALDSIYCGDMESKYHGTSLKSISNNCFCSVLATEPHHKFQIPIATE EYSSEEPEKGLACKLVFTFTATYPDGAPVVEIEEPENFEDMFETRLLEHLQKTIEENLGM EMIFSLVSSAQEWLNERWDEHKFHQEELREQKLREIEEEERKKFEGTRVTVESFLKWKLE FEESTGIAAKREKNNVSKKQTGRELFMCDNTLNDSDIKFLLEAGENIENVKIDETLFQDI GELDLDDDDDEDWVPGADDDDD >AE003746|GENSCAN_predicted_CDS_34|789_bp atgagccgcaactacaaggaagatcagaccaacgaggtcgaggcgttggactccatatac tgtggcgatatggagagtaagtaccacggcacttccttgaaatccatatccaataattgc ttctgttcagttctcgccacggagccgcaccacaaattccagattccaattgccacggag gagtacagttcggaggagcccgagaaaggtcttgcctgcaaactggtcttcacattcaca gctacctatccggatggagcacccgtggtggaaatcgaggagccagaaaactttgaggac atgtttgaaacgcgcctcctggaacacctgcaaaagacaattgaggagaacctgggcatg gagatgatcttttcgctggtaagcagcgcccaggagtggctgaacgagcggtgggatgaa cacaagttccaccaggaggaactgcgagagcagaagctgcgggaaatcgaagaggaggaa cgcaagaagtttgagggcacccgtgtgacggtggagtccttcctcaaatggaagctcgaa ttcgaggagagcaccggcattgccgccaagcgggagaagaacaatgtgtccaagaagcag accggacgcgaactctttatgtgcgacaacacgctcaacgattcggatatcaagttcctc ctggaggcgggtgaaaacattgaaaatgtcaaaatcgatgagacgctgttccaggatatt ggcgaattggatttggatgacgatgatgatgaggattgggtgcccggggcggacgacgac gacgattaa >AE003746|GENSCAN_predicted_peptide_35|1140_aa 
MERSRRGRIRRMPSPGSESDGSTAEPTRKRSKQQFVAEAEPDFVEEETEKYANASTSQTA RSRNKQARQTTLNMSQRSVNFNLTSELSIPNAFDRCGKVISMRLTNFMCHSNLFIEFGPN INFLVGNNGSGKSAVITALALGLTSSARATNRASSIQKLIKNGEVSATISITLSNSGLRP FKADIFGPHLTVVRQIRHSSSTYDLQDARGKSVSKKVSDIRRMLLCFGINVENPIFVLNQ EAAREFLKELEPASNYKLLMKATQLDVCTSSLTECHALRRHFTQELEQLEKKKEMMIKHI AAEEEKLSILEDKEMVKENLQQCKTKLAWMAVTSYQNELNNLEHSIKLIENKKASLEQTT SKKESTQATMNQKLKEFEASKNQILATQKFQDERLKTAKKAVQDLLLEASQVKAKIGNAE RRMREDQRSYDECEKLIGNYHADFNRVNEQREENANKIEMLKKQVVKSEEIIAQLRAEQQ EIKRDITSVQERLDAVKNGRIQLHKSKQNISWEIEALSRNKSNKLSVYGEQTIQVVHALR TQYAGSNMHRMPRGPLGQYISAPNPKYRDLIENQLMHCLRSFIVGSDRERQSLRALLQNK FQGGNMPTIITSPFTDRVYDVSRNKVQPTTPNTTVLIDEISCDDPVVMNYLIDILRIETV LVTESKEIAEFLTSDTENVPPNLTRVLVPNLGLEYIPSPNYAVYSTRITPARYIQKNVDD RIRQLQMEQSDLQEKEPSLEIDYMQHKKVLENTQKVISQKSTMIGQHQSRNQKAMQKIME LQNFDYQELPEYDRLKSHLADSGEKIEKCRLEREMLQEKLLSIQHRQTELESTEAEERRA LEGINKKLTALDTEAGEVESKMRSLDLHYEENTRRFQKTLQLERKMLGEKETVLSELEKA RTEAEKLGEFIATTQTEEKIREAISRYKSKIKQVEELNYNPEELERGLAELRDELELQSR HLAVVDSVVKKLRMAYHQRAQLFQRSRHHYFTMVQFQFEVYIGLLETYYILTNAFIYFSQ QALAMRQFKVSFETSDKEKTWKINVFPPSGNETSNTRSLSGGERSFTTVSLLKGLWSTSD HPFYFLDEYDVFTDEVNRKFITEILIGEGLEWLSRQYCFLTPQDTKVEASNLITVHKYEN >AE003746|GENSCAN_predicted_CDS_35|3423_bp atggagcggagtcgcagaggaaggattcggcggatgccttcgcccggatctgaaagcgat ggctcaacggcagaacccactcgcaagaggagcaaacagcagttcgtggcggaggcggag ccggattttgtcgaagaagagacagagaaatatgcgaacgcgtctacttcgcagacggca aggtctcggaacaagcaggcgcgccaaacaactttaaacatgtcccagcgctccgtgaac tttaatctgacgtcagagctatcgataccgaacgccttcgatcgctgcggcaaagtaatt tccatgcgtctcacgaacttcatgtgccactccaatctgtttattgagtttgggcccaat attaacttcctggttggcaataatggcagtggcaagagcgctgtaatcacggcactggct ttgggtctgaccagcagcgctagggcaaccaacagggccagcagtatacagaagttaatc aagaacggcgaagttagtgccaccatctccataacactgtccaattcgggattgcggccc ttcaaagcggacatcttcgggccccaccttaccgtggtgcgtcaaatacgccactcctcc tcgacgtacgatcttcaggacgctcgaggtaaaagcgtctcgaagaaagtgtccgatatt aggcgcatgctgctctgcttcggcatcaatgtggagaatccgatttttgtgctgaatcag gaggcggcgagggagtttctaaaagaattggagccagcatcgaattacaaactgttgatg 
aaagcaactcaactggatgtttgcaccagcagtctaactgagtgccatgccctgcgacgt catttcacccaagaactggaacaattggaaaagaaaaaagaaatgatgataaagcacatt gccgcggaggaggaaaagctgtcgattcttgaggataaggaaatggtcaaggaaaaccta cagcagtgcaaaacaaagctggcatggatggccgtgactagttaccaaaatgagctcaat aatctagagcattcaattaaactgattgaaaacaagaaggccagtctggagcagacaaca tccaaaaaggagagtacgcaagccaccatgaaccaaaaattgaaagagtttgaggcttct aaaaatcaaatattagcaacccaaaagttccaggatgaaaggcttaaaaccgctaagaaa gcagtacaggatctgcttctagaagccagccaagtcaaagccaagatagggaacgcagaa cggcgcatgcgagaggatcagcgttcgtatgatgaatgtgaaaagctgataggaaactat catgccgactttaatcgggttaatgagcagcgggaagaaaacgctaacaagattgagatg ttaaagaaacaagtagtcaaaagcgaggagatcatcgcccagttgcgagcagagcagcag gagattaaacgagatataacctcagtccaggaaaggttggacgctgtaaaaaatggaaga atacagctgcataaatcaaagcaaaacatcagttgggagatagaagctctgtcccgtaac aagtccaacaaactgtccgtgtacggtgagcaaacaatacaggttgttcatgcactgcga actcagtatgccggctccaatatgcatagaatgcctcgcggcccgctgggccagtatatc agtgcgcccaatccaaagtaccgcgacctcatcgagaaccagctcatgcactgcctgcgt tcctttatcgttggctcagaccgcgagcgccagtcgctgcgggcgttgctgcaaaacaag ttccaaggtggcaatatgcccactattataaccagtccgtttacggatcgggtttacgac gtgtctaggaataaggtgcaacccactactccaaataccacagttctaatcgatgaaatc agctgcgatgatcctgtggtaatgaactaccttattgatattctgcgtatcgaaacggtt cttgtgacggagtccaaggaaattgccgaatttctcacctccgacactgagaatgtgccg cccaatctaacgcgtgtgctagtgccaaacctgggactggagtacataccatctcccaac tatgccgtctactcgactagaataacacccgcccgctatattcagaaaaacgttgatgat cggatacgacagcttcaaatggagcaaagcgatcttcaggaaaaggagccttctttagaa atagactacatgcaacacaaaaaggtactggaaaacacccaaaaagtgatttcgcagaag agtactatgattggtcagcatcaatcaaggaatcagaaggcgatgcagaagataatggag ctgcaaaattttgactatcaagagctaccggagtatgatcgtttgaaatctcatttagct gatagcggcgagaaaattgagaaatgtaggctagaacgggaaatgctgcaggaaaaactt cttagcatccaacatcgccagacagaacttgagtcaactgaagccgaagagaggcgagcc cttgaaggtattaacaaaaagcttaccgcactggacaccgaagctggcgaagtcgaaagc aagatgcgaagcctggacctccactatgaagaaaacacacgtagattccagaaaacgttg cagttggagaggaaaatgcttggcgagaaggaaaccgtgctaagcgaattggaaaaggct 
cgcaccgaggccgagaaattgggagagttcatagcgacgacgcaaacggaggagaaaata cgcgaggcaatcagtcgctacaaatcaaagatcaaacaggtggaggagctgaactacaat cctgaagagctggagagagggctggcggaattgcgagacgaattggaacttcaatctcgg cacctggccgtggttgactccgtggtcaagaagctgcgcatggcctaccatcagcgagct caacttttccagcgatcgcgacaccattacttcacaatggttcagtttcagtttgaggta tatattgggttgttggaaacctattacattcttaccaatgcttttatttacttttcacag caagctcttgctatgcgacaatttaaagttagtttcgagaccagtgacaaggagaagacg tggaagattaacgtatttccgcccagtgggaatgagacttccaataccaggagtttatcg ggcggtgaacgatcgttcacgacagtttccttgctgaaaggactttggagcacttcagat catccgttctactttttggacgaatacgatgtgtttacagatgaagtgaaccggaagttt attacggaaattctaatcggcgagggtttggagtggttatcgcgtcagtattgttttctg acgccccaggatacgaaggtcgaggctagcaatctgatcactgtgcacaagtatgaaaac taa >AE003746|GENSCAN_predicted_peptide_36|903_aa MSSSDDDWFDQDENKLLQGLEKSLKSLELQKNEEYIECPPSERKCPPSEVGEYVMQHTRF SLTELTNALKMPAIDMFLYFLSDKRDLFENQVLATDNVKRVGLFVDVLWSLCELELGGFD EVFLSAFSRQTALLDKIKNLLQAKAAVAKCDAESALILSHSKWMLLRAHKHGLLSHQGYE LVELYKKLAPSFKSDMIDGLEAFTGNFSHNVKGLIYPTLETLLGKDATKAPNEEEDEGLV SDKVVKYVNALRNLLREDFLAPLVEFVQQLRSGTDVDELKQQGLLWSDVHLTLNPQFANA QRHSLVFLKVQFTKESKNAYKTWLNSIKSGTLLCLTTSLAFDDLILASVGYTEPEKLKED CLSVQIVKQYNIGNAYNRPLIMFQAPVFFEPYLRVHNYLSTCSTEKFPMGRYIVDGQMEI PPPAYMKPGVKLSFNMKPFTLDKLPEDLHLNESQKTAFKEALCREFSIIQGPPGTGKTHL SVQLVNSLIQNAKALGTGPIIVLTYTNNSLDKFLVKISRYTQEILRFGNQSRDPQISKFN LSTTIKPELVPPRLKRIWWLVNCEYKEKFRNLQGLYANFDGSEESYQDTLAAQEKLNQVA ERIETLRMVFQFFLAREKDLLAMTTTCAARHNFLFRLLQSKCVLFEEAAEIQEAHIVACL TPHTEHVILVGDHKQLQPFSGSRKVPQISLFERLIVAGLPFSRLNLQYRMRSCISELLVP SIYDELLCSESVKEYEDIRLMSKNLYFVQHNQPEHCMSDMSIGNLYEAGVLAKLTEFLIQ KAQYKHSDIVILSPYNGQIECIKNALPQNYRSTVQVASVDSFQGLEANIVLLSLVRSNIS GRIGFLRQANRVCVALSRARWALYIVGNVTILKDTFPKIWNPIVKRLKENNAIGEAFPTI TST >AE003746|GENSCAN_predicted_CDS_36|2712_bp atgtcaagttcagatgacgattggtttgaccaggatgagaacaaactgttgcagggcctg gagaagtccttgaagtccctggagctgcagaagaatgaggagtacatcgaatgcccccca tctgagcgtaaatgccccccatctgaggtgggggagtacgtgatgcaacatacacggttc tccctgaccgagttaacaaatgccttaaaaatgccagccatcgacatgttcttatacttt 
ttgtccgataagcgagatctcttcgagaatcaagtgttggccactgacaatgtgaaacga gttggcctgttcgtggatgtcctgtggtcgctctgtgaactcgaattgggcggattcgat gaagtctttctgtccgcattcagccggcagacggcgcttctggacaagatcaagaatctt ttgcaggccaaagccgctgtggcaaaatgcgatgcggagtcggcactgatattaagccat agtaagtggatgcttctacgagcccataagcatggcctccttagtcaccagggctacgaa ttggtggaactttataagaaattggcaccttcctttaaaagcgacatgattgatggtctt gaagcattcaccggtaacttttcacataacgtcaagggcctaatttatccaacgctggag acgttactgggcaaagatgcaactaaggctcccaatgaagaagaggatgagggcttggtg tccgacaaagtagtcaaatatgtgaatgcactgcgaaatttactaagggaagatttttta gcaccactagttgagtttgtgcaacagctgcgcagcggaacggatgtcgatgagttgaag caacagggccttctgtggtccgatgtgcatctgactttaaatccacagtttgccaacgct cagcgtcatagccttgtttttttgaaggttcaatttactaaagaatccaagaatgcctat aagacttggctgaattctatcaaatctgggactctgctctgtcttaccacgagtctggcc tttgatgatttaattctggcttccgttggctacactgagccagaaaaactaaaggaggat tgcctaagtgtgcagattgtcaagcagtataatattggaaacgcctacaaccgaccactg atcatgttccaggcgcccgtgtttttcgagccttatctcagagttcacaattacctgagc acctgcagcacagagaagtttcccatgggtcgctatattgtagacggccagatggagata ccgccgccagcttacatgaaaccaggagttaagcttagtttcaatatgaagcccttcacg ctggacaaactaccggaagatctgcacttaaatgagagtcagaaaactgcattcaaggaa gctttatgcagggagtttagcatcattcaaggacctccaggcaccggaaaaacacacctt tcagtgcagttggttaacagtttgatacagaatgctaaagccctgggcacgggacccatc attgtgctgacctataccaacaattcgctggacaaattcctggtaaaaatttcgcggtac acccaagaaattcttcgctttggtaatcagtcgcgagatccgcaaatatcaaagtttaat ttgagtaccacgatcaagccagaattggttccaccgcgcctgaagcgaatttggtggcta gtcaattgtgagtacaaggaaaaattccggaacctacaaggcctgtacgcaaactttgat ggcagcgaggaaagctaccaggacactctggcggctcaggagaagctaaatcaggttgcc gagcgtatcgaaacactgcgcatggttttccagttctttctagcaagggagaaggacctg ttggccatgaccaccacgtgcgcggctcgccataactttctgttcaggttgctacaatca aagtgcgtcctctttgaggaggcagccgagatccaggaggcacacatcgtggcatgccta acgccgcacacagagcacgtgatcctcgtcggcgatcacaagcagctgcaacctttcagt ggcagcaggaaggtaccacagatctcgctctttgaacgactcattgtggcggggctgcca ttttcacggctcaatctgcagtaccgcatgcgatcctgtatttctgaactccttgtgccc 
agcatttatgacgagctgctctgttcggagtcagtgaaggaatacgaggacatccgcctg atgtctaagaacttgtacttcgttcagcacaaccaacctgagcactgcatgtctgatatg tccattgggaatctttatgaagctggagtgttggctaaattaactgagttcctaatccag aaggcccaatataagcacagcgatattgtgatattatccccttacaatggccaaatagag tgtatcaagaacgcgcttccccaaaactatcgctcgactgtgcaagtggccagtgtggat agtttccaaggccttgaggccaatatagtgctgctctcgctggtgcgcagcaatatatcc ggccgaattggcttccttcgccaagcaaatcgagtgtgcgtggctctttctcgagcccgc tgggccctgtacatcgtcggtaatgtgacgattttgaaggataccttcccaaagatttgg aatccaattgtcaagcgcttgaaagagaataatgccatcggagaggcatttccgaccata accagtacctga >AE003746|GENSCAN_predicted_peptide_37|1270_aa MSRNKDKYDSANRRQQIFLSQEDIAAGKKTNWSGLEITGCVRNISPSLWEFEHLTALYLN DNQLLRLPADVGMLTSLRTLDLSSNKLRSLPAELGELIQLRELLLNNNFLRVLPYEIGKL FHLVILGLMGNPLQKEFMNIYNEPNGTQKLLTYMLDNLSCESPPCLSKSLSMAPESAAPP PPPPNYASLRYHQHPLSHWPGSQHPAPVYRLHHGPPSAPPPNWAAQSAGQPPQPVGYGVV NQQRINECAGIPLISNASSHLSPAQRQRLLRKSALKTCPTWPPGGATMSQLAVHAQGMPL LQQNGKHHLQQQQMQHHQPLHHHHPVTNNWSGYQPMSSQPAFKLADKSRALRYQNSAPPV LSSQEKTPNNAAGQDQATSPAETCLPRIIKPRKRRKKDRKPGNGVLLKIEADMQPKMSNP LDGYTANAGASYHLPSGILQHDHTHGVCFCRECDPLRSLWDYPLRRSLSDASSSEPGGGS RESSSSTSSTHSDSSCDSSICSQSLPSSLALQEETTEEFAPITGDSSRAEKVGVIGSQRS HPSAVATPSQCLSDDSGYGDILSGINIANDLFGNCWRGGKLGNASSSISAQAETLLDASL NEISRKLIETCNAVDQAESESGSGFGSGAASDSGLDSAGSHNCGSGLVFNFEHLNLTDAT PTALDFLVDCNNNSSSTTTAAAATATTIGNGNSLGLMWHGGRQSSGGGSHDDSQVAAAKL GPDSRATPTRLILGTVASERSPNRWPSERLGSTFPLPLYRCIAERCTQFVCKPWRPLLRH NVVSNGPVTATNSRNGGAGVARKNFKPHHPHHLHHQHHLNHPHQPPHHPHHQRHPQQQQL QYRSYYAYLLQPSQIGFATPQHAPLSWRSTCPRRQASNATSSRSITVNPPPQRPWLPLAK PNKTRPACIFTVMCYNVLCDKYATRQMYGYCPSWALCWEYRKKSIIDEIRHYAADIISLQ EIETEQFYHFFLPELKNDGYEGIFSPKSRAKTMSELERKYVDGCAIFFRASKFTLIKESL IEFNQLAMANAEGSDNMLNRVMPKDNIGLAALLKVKENAWEPMSEVTQISQPLLVCTAHI HWDPEFCDVKLIQTMMLSNELKTIIDEASHSFRPGHKNDSNAVQLLLCGDFNSLPDSGVV EFLGKGRVSMDHLDFKDMGYKSCLQRLLSNDTNEFTHSFKLASAYNEDIMPHTNYTFDFK GIIDYIFYTKTGMVPLGLLGPVSNDWLRENKVVGCPHPHIPSDHFPLLVELELMHTASQQ APPNGLINRR >AE003746|GENSCAN_predicted_CDS_37|3813_bp 
atgtctcgtaacaaagacaaatacgacagcgccaatcggcggcagcagatctttctgtcg caggaggacatagctgcgggcaagaagacgaactggagtggcctagagattactggctgc gtccgcaatatcagcccgtcgctgtgggaatttgagcatctaaccgccctctacctgaac gacaaccagctgctgcgattgcccgcagacgttggcatgctgaccagcctccgaactttg gacctgtccagcaataagctaagaagtcttcccgcagagctcggcgagctcatccagttg cgggagctgttgctgaacaacaactttctgcgcgtactgccttacgagatcggcaagctg ttccacctcgtcatactcggcctcatgggcaatccgctgcaaaaggagttcatgaacatc tacaacgaaccgaacggcacgcagaaactgctcacctacatgctggacaacttgtcatgt gagtcccccccatgtctatctaaatccctgagtatggcgccagaatcagcagcaccacca ccaccgccgcccaactacgcgtccttgcgataccaccagcatccgttgagccattggccg ggtagccagcatccagctcctgtgtacaggttgcatcatggtccgccgtctgcaccaccg cccaattgggcagctcagtcggctggccaaccacctcaaccagttggctatggagtggtt aatcagcagcggatcaatgagtgcgccggcataccgctcatctcgaatgcctccagtcac ttgtcgccagcccagcgacagagattgctgaggaaatccgcccttaaaacatgtccaact tggccacctggaggggccactatgtcccagctggctgtgcatgcccagggcatgccgctg ctccagcagaatggcaagcaccatctccagcaacagcagatgcagcaccaccagccacta catcaccatcatcctgtgaccaacaactggagtggctaccagccgatgagcagtcagcct gctttcaagttggctgacaaatctcgagcccttcgctatcagaattcggcaccaccggtg ctcagtagccaagagaagacgcctaacaatgctgctggccaggatcaggccacctcgcca gcggagacctgtctgccgaggatcattaaacccaggaagcgacgaaagaaggatcgaaaa ccgggcaatggggttttgctcaaaattgaggcggatatgcaaccgaagatgtccaatccg ctagacggttacaccgcgaatgcgggagcatcttaccatctaccatctggaatattgcag catgatcatacgcacggagtctgcttctgccgagaatgtgacccacttcgttcgctttgg gattacccgctccggcgatcgctttccgatgcctcttccagcgagccaggaggcggaagc cgagagtcctcctcctcaacgtcatccactcactcggacagctcctgcgatagctccatt tgtagccagagcctgcccagctcattggccctgcaggaggagaccaccgaggaattcgct cccattaccggcgatagcagccgggcggaaaaagtgggagtaatcggatcccagcggagt catccgagcgcagtggccacaccttcgcaatgcctgagcgatgattccggctacggggat atcctgagtggaatcaacattgccaacgatctgtttggaaattgttggcgaggtgggaag ctgggcaacgcctcctcctcgatttccgcccaggcggagacgctgctcgatgcgagcctc aatgagatatcacgcaagctgatcgagacctgcaatgcagtagatcaggcggaatcggaa tcgggatcgggatttggatcgggtgcagccagcgacagtggcttggatagcgctggcagc 
cacaactgcggatcgggccttgtctttaactttgagcatttgaatctgacggatgcgacg cccacagccttggattttctcgtggattgcaacaacaatagcagcagcaccaccaccgcc gccgccgccaccgccactaccattggcaacggcaacagtttgggtttgatgtggcacggg ggaaggcagtctagtggaggtggcagccacgatgacagccaagtggcagctgccaaactg ggcccagacagccgggccacacccaccaggctgatattgggcacggtggcgtcagagagg agtccgaaccgctggccatcggagagattaggttccacattcccactgccactgtatcgc tgtatcgccgagcggtgcacccagttcgtttgcaagccgtggcgccctttgctgcggcac aatgttgtaagcaacggcccagtcacggctaccaattcgagaaacggtggcgcgggtgtg gctcgcaagaatttcaaaccgcaccacccgcaccacctgcaccaccagcaccatctgaac cacccacatcaaccgccgcatcatccgcatcatcagcgccatccgcagcaacagcagctc caatatcgttcctattatgcttacctgctgcagccgtcgcagatcggtttcgccacaccg cagcatgctcctctcagctggaggagcacatgtcccagacggcaagcaagcaatgctacc tcatcacgtagtattaccgtgaatccaccgccccagaggccctggctgccactggccaag cccaacaaaacgcgaccagcctgcatttttacggtcatgtgctataatgtgctctgcgac aagtacgcgacgcgacaaatgtacggatactgtccgtcgtgggcgctatgctgggagtac cgaaaaaagtcgattatcgacgagatacggcactatgcagcggacattatcagtctgcag gagatcgaaacggagcaattctatcacttcttcctgccggaactcaagaacgatgggtac gagggaatcttctcaccgaagtcgcgtgctaagactatgtccgagctggagaggaagtac gtcgatggctgtgcgatattcttcagggcgtccaagtttacgctgatcaaggaatcattg atcgagttcaatcagctggcaatggccaatgccgagggctccgacaacatgctgaaccgc gtaatgcctaaggataacatcggtctggccgcactgctcaaggtgaaggagaacgcatgg gagccgatgtccgaggtgacgcagatctcgcagccgctgctcgtctgcacggcgcacata cactgggaccctgagttctgcgacgtcaagctcatccagacgatgatgcttagcaatgag ttgaagacgatcatcgacgaggcgagccacagtttccgacctggtcacaagaacgactcc aatgctgtccagctgctgctgtgcggtgacttcaactcgctacccgattcaggcgttgtg gagtttctcggcaagggccgcgtttccatggatcatttggacttcaaggacatgggctac aagtcctgcctgcagcggctgctctcgaacgacaccaacgagtttacgcactcgttcaag ctagcctccgcctacaacgaggacataatgccgcacaccaactatacgttcgattttaag ggcatcatcgactacattttctacacgaagacgggcatggtgccgctgggcctgctgggt cctgtctccaatgattggctgcgcgagaataaggttgttggatgcccacatccgcatata ccctctgatcacttcccactgctggtcgagctagagctgatgcatacggctagccaacag gcgcctcctaacgggctgatcaatcgccggtag >AE003746|GENSCAN_predicted_peptide_38|337_aa 
MSGTQMSAFLRKYLADEDKKIRAQFKESDPNNKLILWMHEKTRITEEDLARPYTEDEVKE LCLRTKVKVDMTAWNCLWEAKKRFEAKGRFVNKSERFINRMYMKAVRRKMVQPYPEEFVA QRREIVAAETKKQNISRLDRWQKKKSQNLSAPESSPDAHASSNDAVQSHEDQANTNLSSL SQMNFQVEAMAPPGVSSSDLSGIGDDEDEQQQSGFQDENINRPETEINENSVRCDPINLG RMRTGCINSQANNSFRNTESDPDYYMFGTQLSTLVRPTSTQEPDDQVNCPETEMNESWVR CDQINSESLSIGPSIDSEGTITFQNTESEPIDVTSIA >AE003746|GENSCAN_predicted_CDS_38|1014_bp atgtcggggacgcaaatgtctgccttcctgagaaagtatctagccgacgaggacaaaaaa attcgtgcgcaattcaaagaaagcgatcctaacaacaaattgattctatggatgcacgaa aaaacgagaataaccgaggaggacttggcacgcccatacaccgaggacgaggtaaaggag ctttgtctacgcaccaaagtgaaggttgatatgaccgcttggaattgtctgtgggaagcc aaaaagaggtttgaagcaaaaggacgttttgtgaacaagtctgagagattcatcaaccga atgtatatgaaagcggtgcgcagaaagatggtccaaccgtatccggaggagtttgtggcc cagcgaagagaaatagttgcagccgagactaagaagcagaacatcagccgattggataga tggcaaaagaaaaagagtcaaaacctatcagcaccagaatcctctccagacgctcatgca tcttctaatgacgcggtgcagagccacgaggaccaagcaaacacaaatcttagttcactg tctcaaatgaactttcaagtggaagcgatggctccgccaggcgtgtcgtcatctgatctt agcggcatcggagacgatgaggacgaacagcagcagtcaggatttcaggatgagaacatc aaccgtccagagacagagattaatgagaattcggtgagatgtgatccaattaatttagga aggatgcggactggatgtataaattcgcaagcaaataacagtttccggaatacggagtct gatccggactactatatgtttggcacccaattgagcacattagtacgacccacgtcgact caagagcctgacgatcaggtaaactgtccagagacagagatgaatgagagctgggtaagg tgtgatcaaattaattcggaaagcttgtcgattggaccgtcaattgattcggaaggaact atcacttttcaaaatacagaatctgagccgatcgacgtcacctcaatagcctga >AE003746|GENSCAN_predicted_peptide_39|968_aa MLQYKSAASAATSAPPATPLAAGGSSKATIPPNGASAAASQTQVHGAPGTPTMQQIINIH QMPPQFAGGAVAGNGQNAGMPQNMFQIVQPMPMQTVNIDGQEAIFIPNLNAQLATAQAVN FNGQQAFITPNGQILRAPQMAANPAASNCIQLQQLNGLGQEQTQLITIPGTNIQIPVTNL IQQQQQAQQVHQGTVQQQAQGTNASGANGSGVTNSGTAGQLPGSITIPGTNLQIPTSVAA ANGLLGNISNISNLLGGGQSIKLENGQLQMRPQLVQFPAPAMPQQQQTVAVQIPVQTANG QTIYQTVHVPVQAAATSSGGLQNLMQAQSLQMPSASQMQIIPQFSQIAQIVTPNGQIQQV QLAMPYPQLPPNANIIHIQNPHQQQQQQVQQQQQQQQAQQQQQAQQQQAQQQQQQQVQAQ HQQLLQAISDASAGGQLPPNQPITITNAQGQQLTVIPAQLRPNAPTAPTPAPAGVPTPMQ 
MPNLQALPIQNIPGLGQVQIIHANQLPPNLPANFQQVLTQLPMSHPQVQTQGQVQVMPKQ EPQSPTQMITSIKQEPPDTFGPISATGNPPAPASTPNTASPQQQQIKFLHTESNSLSSLS IPASIQITALPQQATNTPNTPATTQPIPVSLPARSKVNAVTTSSTQITIAPTGGQVVSVT TQARGATASIRSTNTSTTTITTPSQSHLNMNISVASVGGAATGGGGGTATGEPKPRLKRV ACTCPNCTDGEKHSDKKRQHICHITGCHKVYGKTSHLRAHLRWHTGERPFVCSWAFCGKR FTRSDELQRHRRTHTGEKRFQCQECNKKFMRSDHLSKHIKTHFKSRSGVELIELSIKQET KGGNAPKSISTVNGIVTIEIPGGGSAAAGSGASSVAATVAGSTVTPGGATIVQLPTVEAS GGGDSFGDDEDDEEDDELTEEDDEDEELDEDDMDDCEDEEDDEDPELDSGDEVKMTIAVS EPGDNSSN >AE003746|GENSCAN_predicted_CDS_39|2907_bp atgctgcagtacaagagtgccgccagtgcggcgactagtgcgccacccgccacgcctttg gcggcaggcggttccagcaaggcgacgattccaccaaatggagctagtgcagcagcttca caaacgcaggttcacggtgctcccgggacaccaaccatgcagcagatcatcaacattcac caaatgcctcctcagtttgcaggcggagcggttgccgggaacggacagaacgctggcatg ccccaaaacatgttccagatcgtacagcccatgcccatgcaaacggtgaacattgatggc caggaggccatcttcataccaaacctcaatgcacaactggccaccgctcaggcggtcaac ttcaacgggcaacaggcctttatcacgcccaacggtcaaattctgcgggctccccagatg gcagctaatccggcagcctctaactgcattcagctgcaacagctgaacggcttgggccag gagcaaacccagctgatcaccatacccggcaccaacattcaaatacccgtcaccaatctc attcagcaacaacagcaggcacaacaagtgcaccagggtacagtgcaacagcaggcacaa ggaacaaatgcatctggagccaatggatcgggagttaccaatagtgggacagcgggtcag ctgccgggcagcatcaccataccgggcacaaacctgcagataccaacctcggtagcggcg gcgaatggattgctgggaaacatctccaatatctccaatttactaggcggcgggcaatct ataaagttggagaacggccaattgcagatgcggccgcaactggtgcagtttccggcgcca gccatgccgcagcagcaacaaacggttgctgtgcagattcccgttcagactgctaacgga cagaccatctaccagactgtacatgtgcccgtccaggcagcggcaacatcgagcggcgga ctgcaaaacctgatgcaggcgcaatccctgcagatgccctccgcctcgcagatgcaaatc atcccgcagttctcacagatagcccagattgttacgcccaacggtcagattcagcaggtg caactggcaatgccgtatcctcagctgcctccgaatgccaacattatacatatccagaat ccccaccagcagcagcaacaacaagtacaacaacaacaacagcagcagcaggcgcaacag cagcagcaagcgcaacagcaacaggcacagcagcagcaacaacaacaggtccaggcgcag caccagcagctccttcaggcgatcagcgatgcctcggcggggggacaactgccgcccaat cagcccatcaccattaccaatgcccagggccaacagctgaccgtgattcccgcccaacta 
cgcccaaatgcgcctaccgcacccactcctgctccagctggtgtgcccacaccaatgcag atgcccaatcttcaggctttacccatccagaacatcccaggcctgggtcaagtgcaaatc attcacgccaatcagctgcctcctaatctgccagccaacttccagcaagtactaacccaa ctacccatgtcgcatcctcaagtgcaaactcagggccaagttcaggtgatgcccaagcag gagccgcagagtccaacgcaaatgatcaccagcatcaagcaagagccgccggataccttt ggacccatttccgcaactggtaatcctccggcacctgcctctactccaaacacagcctca ccgcaacagcagcagatcaagttcctgcatacggaaagcaattccctgtccagcctgagc ataccagcctccattcagatcacagccctgccacagcaagcgacgaatacaccgaatacc ccagccacaacccaaccaattcccgtatcgttacctgcaagaagtaaggtcaacgctgtg accacgtcaagcacccagattacaattgcgccgactggtggccaagtggtgtccgtcacg acgcaggctaggggagcaactgccagcataaggagcacgaacaccagcacgacgactata acaacgccgtcacaaagccatctcaatatgaacataagtgtggccagcgtcggaggtgct gcaactggcggcggcggtggaacagcgactggagagcccaaaccacgcttaaagcgagta gcctgcacctgtcccaattgtacagatggcgaaaaacactcggacaagaagcgccagcat atatgccatataaccggctgccataaggtatacgggaaaacttcccatctaagggctcac ctgcgttggcatactggcgagcgaccatttgtctgctcctgggcgttttgcggcaagcgc ttcacccgctccgatgaactgcagcgccaccgacgaacgcacacgggagagaagcgtttc cagtgccaggagtgcaacaaaaagttcatgcgcagcgatcacctgtcgaagcacatcaag acgcactttaagagccgctctggcgtggagctaattgagctgagcatcaagcaggagacc aagggtggcaatgcaccgaaaagcattagcacggtgaacggcattgtgacgattgagatt ccgggcggcggttcagcggcggcgggcagtggagcctccagtgtggcggccacggttgca ggctcaactgtgacgccgggcggggccacgatcgttcagctgccgactgtggaggccagc ggcgggggtgatagtttcggcgatgatgaggacgacgaggaggatgatgaattgaccgaa gaggacgacgaagacgaggagctcgacgaggacgatatggatgattgcgaggatgaggag gatgacgaggatccagaactagacagtggcgatgaagtgaagatgacaattgcggtcagt gagcctggcgacaattcgtccaactag >AE003746|GENSCAN_predicted_peptide_40|364_aa MEPVPEEARGVGDDEGELDCLTHMIGALLLQKEPENPENGDSKETIWSGELEWEDAQILD QPKTLHTVQCKICSMVKEGQPEINTENWPNKLKTQLIPKKVLGKIGEQFLKDARMVVFRS SQGEVLNLLITAMSSGFAGCIHFPSNPNCNIKALILIYSRDHQALVGFIPNNEDSFSERL QEILQGAKRKPGVKTPKQPQPEEPPPVEEDAIINELLWTGSLNWSTQASLEEPSISHKLE CSVYIAIKNGDPGISAEDWPTDMPMVLMPSIYLGQFAGAFIKDSKLIILRSTPGEEHDSL ASSMSAGSCGCARFSSEVVCKVIMLLYSSPRNAFLGFIPRDQANFVKRLREVLDEHRQKA RNKE 
>AE003746|GENSCAN_predicted_CDS_40|1095_bp atggagccggttccagaggaagcccgaggagtgggtgatgacgaaggggaactcgattgt ctaacacatatgattggggcgttgcttttgcaaaaagaaccggaaaatcctgaaaatggt gactcaaaggaaacaatttggagtggagagctcgaatgggaggatgcacagatattggat caaccgaagactctgcacacggtccaatgcaaaatctgctccatggttaaagaaggtcag ccggagatcaacacggagaactggccaaacaagctgaaaacacaactgatacccaaaaaa gtgctgggtaaaattggcgaacaattcctaaaggatgccaggatggttgtattccgatcc agccaaggagaggtactcaatttattgatcacggcgatgagctctggtttcgctggttgc atccacttcccctccaatcccaactgcaacatcaaggctctaattctcatctactcgcgc gatcatcaagctttggttggcttcatacccaacaatgaggattcgtttagcgaacgactg caggagattcttcagggtgcaaaacgaaagccgggcgtcaaaacacccaagcagccacag ccagaagaaccgcctcctgtggaggaagatgccattataaatgaactcctatggactgga tcgctgaattggtcgacacaagcaagtttagaggagccaagcattagtcacaagcttgag tgttccgtgtacatagctataaaaaacggtgatcccgggatcagtgcggaggattggcca actgatatgcctatggttttgatgccttcaatttatcttggtcagttcgccggagcgttt ataaaggactcgaaacttataattctccgatcgacacctggagaagaacacgactcgcta gcatcatcaatgtccgccggcagttgtggctgtgcccgattttcctccgaggtcgtctgc aaggttatcatgctgctgtattcgtccccgaggaacgccttcctgggctttattccgaga gatcaagctaatttcgtcaagcgattgcgggaagtattagacgaacatcgccaaaaggcg agaaacaaagagtaa >AE003746|GENSCAN_predicted_peptide_41|179_aa MHTVLTRGNATVAYTLSVLACLTFSCFLSTVFLDYRTDANINTVRVLVKNVPDYGASREK HDLGFVTFDLQTNLTGIFNWNVKQLFLYLTAEYQTPANQLNQVVLWDKIILRGDNAVLDF KNMNTKYYFWDDGNGLKDNRNVSLYLSWNIIPNAGLLPSVQATGKHLFKFPADYATSSI >AE003746|GENSCAN_predicted_CDS_41|540_bp atgcacacggtgttaacccgaggaaacgccacggtggcatacacgctgagcgttctagct tgcctcaccttcagctgcttcctgtccaccgtctttcttgactaccgtaccgatgcaaac atcaacacggtcagagtgctggtgaagaatgtcccggattacggagcgtcccgggagaag cacgacttgggcttcgtgaccttcgatctgcaaacgaatctcacaggcatcttcaactgg aacgtgaagcagctgttcctgtatctcaccgccgagtaccagacaccggccaatcaactc aaccaggtggtactgtgggacaagattatcctgcgcggcgacaatgccgtattggacttc aagaacatgaacaccaagtactacttctgggacgacgggaatggcctgaaggacaaccgc aacgtatcgctatacctctcgtggaacatcattccgaacgcgggactccttccctccgtt caggccactggaaagcacctgttcaagttcccggccgactatgccacctcgtccatttag 
>AE003746|GENSCAN_predicted_peptide_42|3905_aa MNNDAKNHESDDLNVRSTAYFNQQTTTNQPKAPATSKNNTGSGSGSNNNNNNTNQNPNRQ LNHNLPRIAAARQSIAAALLKNSGRKILTAKNEPLTTTESSGVLTNTPLPSNSRLKVNNN NNTNNTAKMSGTSSSQSSATPTPPTASSSTTTTTTTNISTGGGGSGSSGGGGGSTTVIAN PASVTNTGAGSAAKFRAAVASAPSPALPATNAPANATAAAAIAAIATAPAPSSSSSSSSS SKKTRAAVAALKRQVALQQQQPVTGNAPNMTSKDSAHLKFATTTLLMGAAAAAADSNAGA ALGGSGAGGSGSSSSVGAVGGARMALNPAVDMANAAVLLKQKLKDAAAAASASASNRSAT SSMSSTASSLSSSAGIVNAISSALQNIITPDTDTDTEFYPQPVTTDLSESEEESVSEDDI PESDPDSCPHEGEVREDEDETEEESEDSDESEGEEEEEDEEEIDVLQDNDADDEEIDDED EEEDAPEVSSFLLDANNKRSSNISALLEAAANEKAPVLRHATHAIDETKQALTKMRCASS PRDKNSGFSRSLVAACTDNDVNTVKRLLCKGNVNLNDAAASTDDGESLLSMACSAGYYEL AQVLLAMSAAQVEDKGQKDSTPLMEAASAGHLDIVKLLLNHNADVNAHCATGNTPLMFAC AGGQVDVVKVLLKHGANVEEQNENGHTPLMEAASAGHVEVAKVLLEHGAGINTHSNEFKE SALTLACYKGHLDMVRFLLQAGADQEHKTDEMHTALMEASMDGHVEVARLLLDSGAQVNM PTDSFESPLTLAACGGHVELATLLIERGANIEEVNDEGYTPLMEAAREGHEEMVALLLSK GANINATTEETQETALTLACCGGFMEVAAFLIKEGANLELGASTPLMEASQEGHTDLVSF LLKKKANVHAETQTGDTALTHACENGHTDAAGVLLSYGAELEHESEGGRTPLMKACRAGH LCTVKFLIQKGANVNKQTTSNDHTALSLACAGGHQSVVELLLKNNADPFHKLKDNSTMLI EASKGGHTRVVELLFRYPNISPTENAASANVTQAAPTSNQPGPNQMRQKIMKQQLQHQLQ QLNAPPGLHELSEAARASNQQHFHQQQFSSAGNGSSNIVAMGTGDFLDAGELQLTATAGM SAGAGTSTTGSETGMEEYGEVGGIDLTTLGAQQQEGLIAKSRLFHLQQQQQQQQQQQQQQ QQQQQQQQQQQQQQQQPPAAGQHQLVPCKHFDLDMEHINSLQPPQKAPPAPPVLFHTVCQ QPVMQQQQQQLQPGQLKLKAMLPNRNRALKTAEVVEFIDCPVDQQQPGEQVRTQPLGEDG KTPQFACAGEDPRLQRRRGFMPELKKGELPPESSSSDPNELALKGSYSLYNSLQSGADNN QPVPTALDNSACAQIPARNSGGAITHSSEVLQSTAISDRPKVKATNKNNRKQAAAAAAAA AAAAAAAAAAAQHAQQVLPNPMVSIYNNLHLQHLQHPHLQFQQQLQLHHQRVAGLDNAAA AAAAAASSANMAYSISPASPLPSPTGSGNYVDQQLQQQSMDVALQRKTAMDDFRGMLETA VNGPRGRKDLALNTPQLNFFKDGWHMVGVHNFFGDQPKSPTETPPEMEETTMSSPTEADR LGSEPRAEMKNLATLCSAAAAAAAVAAVNKDQVEISSDLESECEDDAEGGAGADCEENTL PPEPIELAAALREDGIIVEEEEDDEEEDDDDEEQDTNSGEVDKLNYDDEDAEVDNDGEVD YIDEDEGGGEGEEEEDDADDDEFFLDEPDSDQGTGNNNNNSKSGASSLPLKQRKMATRLE NLILNSQTVCDFPPELSNSELVHVLPQISNLKAAANSNAALNSVLQQQLAAASAAAAHAK ASVVHQKQQHGEGDQQCGDVPQDAQRQANLVLLDYPMQQNIQLEQRLLDAEEMHLQQHQQ 
TPLSLLPFTDEQQQQLHHQALSNASDFQQHQQLALENDPELKQQLQQNSNARIIKAVAAQ HQQQPPTNFVYNVESGDKNAPPVQLLFQLPPHMAQHQAQQQQGVGEPLTEQQQQQLHAEQ AHLFQHRTGGQRPPTQSELEQVAQELLLQRSGQVPAGAPVVGVQAIPLKQKHFNLHPPPC PPTCVQHQASQQQQMQQNELSIWPMATPTPAPSSGVSSTKSMPGGIAKKAIDKQSRKERR CVVRQTPAGIQENTKLHLQPQVATAQQQFLVQNQLAVATTVSLDKTIEIDSETESNHDTA LTLACAGGHEELVELLINRGANIEHRDKKGFTPLILAATAGHDKVVDILLKHSAELEAQS ERTKDTPLSLACSGGRYEVVELLLSVGANKEHRNVSDYTPLSLAASGGYVNIIKLLLSHG AEINSRTGSKLGISPLMLAAMNGHTPAVKLLLDQGSDINAQIETNRNTALTLACFQGRHE VVSLLLDRRANVEHRAKTGLTPLMEAASGGYIEVGRVLLDKGADVNAAPVPTSRDTALTI AADKGHQKFVELLLSRNASVEVKNKKGNSPLWLAAHGGHLSVVELLYDHNADIDSQDNRR VSCLMAAFRKGHTKIVKWMVQYVSQFPSDQEMIRFIGTISDKELIDKCFDCMKILRSAKE AQAVKANKNASILLEELDLERTREESRKAAAARRRERKKKKKMEKKEEKRRQQQGNGPGG DDMQGDDDDASDKDDDSDKDDEDEEAAPAAAREEGDSGIDQGSCSSGDTKGARFGGSQSA QAAEAAANSVSTNSQGKKNKKQAKNKVLISVEPTQPVITSNSVLKGVCAKKHPAVEVVKQ PPATQQAAPLKRQLDVKKEEPALKKKEEKNSSSSSSSKREKENLAPKEVALPAKQQPSSS SKLQSSESASNINSSTATNTSSANTTRKEVAKPASQTASATTLNPAKRTEVDGWKEVVRK SSAQQTTAVGASGAPLPVTATSSATSVQHHPHHHLANSSSNSSSSLTTSTTTAASSVPEM TCKKVQVPVNAISRVIGRGGSNINAIRATTGAHIEVEKQGKNQSERCITIKGLTDATKQA HMLILALIKDPDVDILQMLPRINSSIKQASSGGASTPMSVGTWDNRTAAGVNAYTFSSAA STTSTSSSSSASSTTPAGASYSNAHKQHQQQPQSVKGPSGRSSTSVKSNGSSTKVSASSG SGSRSGRAGSSYLAQQQPGRSSGGGSSNGVIKSKSESSSKSLPAAQKSSTTLGKSSTVSP GAQNFAKAAAIGQSSPKKAEGGATSAVVTSAGGRSSGVVAPFGRGKPVAGQGGPAATAAS NVAQLGSVSGNSNILAGPIGTFNVADVAAVNAAAAAGAAAATNSNVKPIAPIAPPSKRVG SPTQVQQQHQTQQQQQQQLPQPAPVPGPQPQQQPLQQQQQQQAPQQQPQQPNQQQQPQTS QQNLVINTNLLNDLMAASAANTTSDSFSAQLAAKLSSAYSLFSDYQQSQWGKLGDPGIGG GAGAVGDGLPQADASKAPGYNRNILSSPVGSSKASSNHSTSPPVGNVIQQQQQQQPQSSQ QALNIITSGPGGPATAPARSPMVSANEGNPAVGQPSMNGTQGLGETAPAHSPGVIKPPTA TVPIQRHVPMPISAPEAGAPPTFGAIGSNPASGNNSAAAQAAAAAAASAMIDRQQQNLQN LQTLQNLQRMVGASQQQQPQQQLNYPMDPTSSFIVDANNVLRLNPRVIFPQGNTKPPQPP PQGGTQSNVFGGNPGRQPPGTGARQPGGAAAQRWYGGTLEYPSYTGRDMLHLENGAGGMA GMGSPSAMSPNHDDIRKMPRPIGTERAASWKYNNFNVGGPSLNMEDALASVLPPWAHELK AQPPGLQQPPPPPQSQQQQQQPLNWLKQQPQQQQYRAYNNGPYPQQQQQHEPMNMPMDYH 
NMQAPPNMSQQQQQHVNLMPSYGYQHFVGAPGAVDISAHMPDKMEVWDHHDKHMPWTNYT TNWSN >AE003746|GENSCAN_predicted_CDS_42|11718_bp atgaataatgatgcgaaaaaccatgaaagcgatgacttgaatgtgcgctccacagcgtat tttaaccaacaaaccacaaccaatcaaccgaaagcaccagcaaccagcaagaataacaca ggctctggctctggatccaataataacaataacaacaccaatcaaaaccccaacagacag ttgaatcataatttaccccgaatcgctgccgccagacaatcgatagccgccgctctattg aaaaacagcgggcggaagattctgacggccaagaatgagccactgacgacgacggagtca tcaggcgttttaaccaacacacctttacccagcaatagccgattgaaagttaacaacaac aacaacaccaataacactgccaagatgtctggaactagtagcagtcagtcctcggccacg cccacaccgcccacggccagcagcagcacaaccaccacaacaacaacgaacatcagcacc ggaggcggtgggagtggcagcagtggcggtggcggtgggagtaccacggtcattgccaat cccgcatcggtaaccaacaccggagctggaagtgccgccaagttccgtgccgccgtggcc tcggcgcccagcccagcactgcctgcaaccaacgctcctgcaaatgcaactgctgctgcg gcaatagcagcaatcgcaactgctcctgcccccagtagtagctcctccagctcctcgtcc tcgaagaagactagagcagcagtggccgccctgaagcgacaggtggccttgcaacagcag cagcctgttaccggtaatgcacccaacatgaccagcaaggattcggcgcatttgaaattc gccacaaccactctgctgatgggcgccgccgcagctgccgccgatagcaacgctggcgct gctctcggtggatcaggcgcaggaggatcaggatcatcatcatcagtaggagcggtaggc ggggccagaatggccctgaatcccgccgttgatatggccaatgccgctgtcctgcttaag caaaagctaaaggatgcggctgccgctgcctcggcctccgcctccaatcgctcagccacc tcgtccatgtcgtcaaccgcctcctcgctgtcttcgtcggcgggcatcgtgaatgccatt tcctcggcgctgcagaacatcatcacgccggatacggacaccgacaccgaattctatccc cagcccgtcaccacagacctatccgaatcggaggaggagtccgtttcagaggacgatatt ccagaatcggatccggatagctgtccacacgagggtgaggtgcgcgaggatgaggacgaa acggaggaagagtcggaagactctgatgagtctgaaggcgaagaggaagaggaggatgaa gaggagatagatgtgctacaggacaacgacgcggacgacgaggagatcgacgatgaggac gaggaagaggacgcgcccgaggtgagttcctttctcctggatgccaacaacaagcgttcc agcaatatctccgctctgctcgaggccgcggccaatgagaaggctcctgtcctgcgccac gccactcacgccatcgacgagaccaagcaggcgctgacaaagatgcgctgcgccagcagt ccccgcgataagaacagtggattctccagatccctagtggctgcctgcacggataatgat gtcaatacggtgaagcggctgctttgcaagggcaacgtgaacctgaacgacgccgccgcc tccacggatgatggcgagtccctgctctcaatggcctgctctgcgggctactacgaattg 
gctcaggttctcctggccatgtctgccgcccaggtggaggacaaagggcaaaaggactca acgcctttaatggaagctgcatccgccggtcatttggacatcgtcaaactgctgctcaac cacaacgccgatgtgaacgcccactgtgccacgggcaacaccccgctcatgtttgcttgc gccggtggtcaggtggacgtggtgaaggtgctgctcaagcacggcgccaacgtcgaggag cagaacgagaacggacacactcccttgatggaagcagcttccgccggccacgtggaggta gccaaggtgctgcttgaacatggagccggcatcaacacccactcaaatgagttcaaggag agcgccctcactctagcctgctacaagggtcacctggatatggtgcgattcctgcttcag gcaggtgcagatcaggagcacaaaaccgacgaaatgcacactgccctaatggaggcttca atggacggccatgtagaggttgctcgactgctgctggactccggtgcccaggtgaacatg cccacggactctttcgagtccccgctaacgttggcggcttgcggtggtcatgtggagttg gcaacactcttgatcgagaggggagccaacatcgaggaggtgaacgacgagggctacacc ccgctcatggaggccgctcgcgagggacacgaggagatggtagccctcttgctcagcaag ggtgcaaacatcaatgccacgaccgaggagacccaggagacagctttgacgctggcctgc tgcggtggcttcatggaggtggctgcattcctgatcaaggagggagctaatcttgagctg ggtgcttccacgcccctaatggaagcctctcaggagggacacaccgatttggtaagcttc ctgctgaagaagaaggcaaatgttcatgcagagacccagacgggggatactgccttgacg catgcctgtgagaacggacacacagatgcggccggtgtgctgctatcgtatggagctgaa ctagagcacgagtccgagggtgggcgaacgccactaatgaaagcctgtcgtgccggacac ctgtgcactgtcaagttcctcattcaaaagggcgctaatgtcaacaaacagaccaccagt aatgaccacactgccttgtcgttagcctgtgccggcggtcatcagtctgtggtggagctg ctattaaaaaacaacgccgacccgttccacaagctgaaggacaacagcaccatgttaatt gaagcctccaagggtggacacactcgtgtggtcgaactgcttttccgctatccgaacatt tcgcctacggaaaacgcagcgtctgcgaatgttacccaggcagcaccaaccagcaaccaa cctggtccaaatcagatgcgtcaaaagatcatgaagcagcagcttcagcatcagttgcag cagctgaacgctccccctggcttgcatgagttgtctgaggcggcacgtgcatccaatcaa caacatttccaccagcaacagttcagcagtgccggcaacggatcctccaacatcgtggca atgggaactggcgactttttggatgccggagaactccaacttactgctactgcgggaatg agtgcgggagccggaaccagtaccacgggcagtgagactggtatggaggagtatggcgaa gtcggaggaattgacctgactactcttggcgctcagcagcaagagggtcttattgcaaag tcgagattgttccatttgcagcagcagcagcaacaacagcagcagcagcaacaacagcag cagcagcagcaacaacaacaacagcagcagcaacaacaacaacagcagccacctgcagct ggccagcaccagttagtgccatgtaagcacttcgacctggacatggagcacatcaattct 
ctgcagccgccgcaaaaggcaccgcccgctccacctgtactcttccacaccgtttgccag cagcctgtaatgcaacagcagcagcagcagcttcagccaggtcagctcaagttgaaggcc atgctacccaaccgcaatcgcgcgttgaaaaccgccgaggtagtggagtttattgactgc ccggtggatcaacaacagcctggcgagcaggtgcgcacgcagcctttgggtgaggatgga aagactcctcagtttgcatgcgctggagaggatccacggttgcagcgccgtcgcggcttt atgccggagctgaagaagggtgaactgccgccggagagcagcagcagtgacccaaacgag ctagcccttaaaggttcttattctttgtataattctttacaatcaggagccgacaacaat cagcccgtaccgacagcactggacaatagcgcatgcgcccagattccagcgcgaaactct ggcggagcaataacccattcctccgaagttctgcagagcacagctatcagcgacaggcca aaggtaaaggcaaccaacaagaacaaccgaaagcaagcggccgcagcagcagcagctgca gcagcagcggcggcggcagcagcagcggccgcccaacacgcccagcaagtgttgccgaac ccaatggtctccatctataataacctgcatttgcagcacttgcagcatccgcatctccag tttcagcagcaacttcaactgcatcaccagcgagtagctggactggacaatgcagcggct gcagcagcagcagcggcttcatccgcgaacatggcctactctatttctccggcatctcca cttccctcgcccactggcagcggcaactatgtcgatcagcagctgcaacagcagtccatg gatgttgctctacagcgcaagacggccatggacgatttccgtggcatgttggagacggcg gtaaatggtccaaggggcagaaaagacctggctcttaacacaccacagctgaacttcttc aaggacggctggcatatggtgggagtgcacaatttctttggtgatcagccaaagtcgccc actgaaacaccgcctgaaatggaggagactaccatgtcctcaccgaccgaagcagatcgt ctcggatcggagcctcgggccgagatgaagaacttggccacgctctgctcggccgcagca gcagctgctgctgtggcagcggttaacaaggatcaggttgagattagttcggatcttgag agcgaatgcgaggatgatgcagaaggtggtgctggggcagattgcgaggaaaacacactg ccgccggagccaattgaattggcggccgctctaagggaggatggcataattgtggaggaa gaagaggatgacgaggaggaggatgatgatgatgaagagcaggataccaacagcggtgag gtcgacaagctaaactatgatgacgaagacgcggaggtggacaacgatggtgaagtagac tacatcgacgaagacgaaggtggtggagaaggcgaagaagaagaagatgatgcagatgat gacgagttcttcttggacgagcctgatagcgaccaaggaactggcaacaataataacaat tccaaaagcggcgccagttcgttgccattgaaacagcgcaaaatggccactcggttggaa aacctaatcctaaactcgcagacagtgtgcgacttcccgcctgaacttagcaactcggaa ctggttcatgtcctgccccaaataagcaatctcaaggcagcggccaacagcaacgcggct ctgaacagcgtactccagcagcagttggcagcagcctccgcggcagcggcgcacgccaaa gcgtctgtagtccaccagaagcagcagcatggagagggagatcagcaatgcggcgatgtt 
ccgcaggatgcccagcgacaggcgaatcttgtcctcctcgactatcccatgcagcaaaac atccaactagagcagcggctactcgatgctgaggaaatgcacctgcagcaacaccagcaa acaccgctctccttactgccctttacggatgagcagcagcagcagcttcatcaccaagct ttgtccaatgcatccgattttcagcaacaccaacagcttgccctggaaaacgatccagaa ctaaagcagcagcttcagcagaactccaacgcgcgcataattaaagctgttgctgcccag catcagcagcagcctccaaccaacttcgtttacaacgtggaaagcggcgacaagaatgct ccgccagtgcaattgctcttccagttgccaccacacatggcgcaacatcaggcgcagcaa cagcagggcgttggagagcctcttaccgaacagcaacagcagcagttacacgctgagcag gcgcatctctttcagcatcgaactggcggtcagcgtccgcccacccagagtgagttagag caggtggctcaagagctattgcttcagcgaagcggccaggtgccggcaggagctcctgtt gttggtgttcaggcaattccactcaagcaaaaacactttaacctgcatccgccgccgtgt ccacccacctgtgtccagcatcaggcctctcagcaacagcaaatgcaacaaaacgagctc tccatttggccgatggccacgcctactcccgcgcccagcagcggtgtaagctcgaccaag tcgatgcccggcggcattgccaaaaaggccattgacaagcagtcgcgcaaggaacgtcgt tgcgtggtgcgccagacaccagcaggcattcaagagaacaccaaactccatctacagcct caggtcgcaacagcccagcaacaatttctagtgcagaaccagttagcagttgcgaccacc gtgagtttggacaagactatcgaaatagattcagagacggaatccaaccacgacacggcg ctaaccttggcttgtgccggcggtcatgaagagctggtggaactgttgatcaatagggga gcaaacatcgagcaccgtgataagaagggattcacaccgcttatactagccgctaccgct ggccacgacaaagtcgtggacattctgctcaagcacagcgctgagttggaggctcaatca gagcgtacgaaggatacaccgttgtccctggcatgttctggcggccgatacgaggtggtg gaacttctgcttagcgttggtgccaacaaggagcaccgcaatgtatctgattacactcca ctgagcttggcagccagtgggggctatgtgaacatcattaaactgttgcttagccatgga gcagagatcaattcgcgaacgggcagcaaactgggcatttcaccgcttatgctagccgct atgaatggtcatacgccggcggttaagttgcttttagatcagggatcggacataaatgcc cagatcgagacgaatcgcaatacggccttgactttggcttgcttccagggcagacacgag gtcgtaagtctgctgcttgaccgacgggccaatgtggagcatcgagctaagacgggcctg accccactcatggaagccgcgtcaggcggttacatagaggttggtcgcgttctgctggac aagggtgcggatgtgaatgctgctccggtgccgacgtccagggatacggctctaacaatt gccgccgacaagggccatcaaaagttcgtggagctcctgctatctcgtaatgccagcgta gaggtaaaaaataagaagggtaactccccactctggctggctgcccatggtggtcacctg agtgtggttgagcttttgtacgaccataatgctgacattgactcacaggataatcgacgt 
gtttcctgtctgatggctgccttccgcaaggggcacaccaagattgtgaagtggatggtg cagtatgtgtcccagtttccctctgaccaggagatgattcgcttcattggtaccataagt gacaaggagctgatagacaagtgttttgactgcatgaagatcctgcgtagcgccaaagag gcccaggccgtcaaggccaataagaatgcttcgatccttttggaggagctggatttggag cggactcgcgaagagagccgcaaagctgccgccgcccgtcgtcgtgagcgcaagaagaag aagaagatggagaagaaagaggagaagcgccgtcaacaacagggcaatgggcctggcgga gacgatatgcaaggcgatgacgatgacgccagtgacaaggatgatgattccgacaaggac gatgaagacgaggaggcagcgccggccgccgcccgcgaggagggagactctggcatcgat cagggctcgtgctccagcggagacaccaaaggtgcgcgcttcggtggcagtcagtccgct caggctgcggaagcggcagccaattccgtgtccaccaacagtcagggaaagaagaacaag aagcaggcgaaaaacaaggtgttgatatcggtggaaccaactcaaccagtaatcacatcg aactccgtcctcaagggcgtctgtgcgaagaagcatcccgcagtggaagttgtaaagcaa cctcctgccacacaacaggctgctcctcttaagcggcaactcgatgtaaagaaggaggaa cctgcgctcaaaaagaaggaagagaaaaacagctcgtccagcagcagcagcaagcgtgag aaagagaatcttgcgcccaaggaggttgcactgccagccaagcagcagcccagtagctcc agcaaactgcagagcagcgagtctgcgagcaacataaacagcagcaccgctaccaacacc agtagcgccaatactactcggaaggaagttgcaaagccagcgtcacaaactgcgagtgcc accactttgaatcctgcaaagcgcaccgaagtcgatggctggaaggaagtagtccgcaaa agtagcgcccagcagaccacagcggtgggagcgagtggagccccactgcctgtcacagcc accagttcggccaccagtgtgcaacatcatccgcaccaccacctagccaacagctccagc aacagctcaagctccctgaccaccagcactactacagcagcgtcttcggttcccgagatg acgtgcaagaaggtgcaagtgcccgtaaatgccatctccagggttattggacgaggtgga agcaacattaacgccattcgggccaccactggtgctcacatcgaggtggagaagcagggc aagaaccaatcagagcgttgcatcacgatcaagggcttaaccgatgctacaaaacaggca catatgctcattttggcactaatcaaggatcccgatgtggacatattgcaaatgctgccc aggattaacagcagtattaagcaggcgtctagtggcggagcaagcaccccgatgtccgtg ggaacttgggacaatcgcaccgctgccggtgtgaatgcgtataccttttcttctgccgcg tccaccacctccacatcctccagctcgtcagctagctctactacaccagcgggagcttcg tacagcaatgcgcacaagcagcaccaacagcagccgcagtcagtgaagggcccaagtgga cggtcatcaacgtcggtcaagtctaatggcagtagcaccaaggtatcggcttcgagtgga tcgggttcccggagcggcagggctggcagtagctatcttgcccaacagcagcctggtcgc agctctggaggtggctcttcaaatggcgtgatcaagagcaagtcagaaagctcttccaaa 
tccctgccagctgcacaaaagagcagtaccactttgggtaaatcatcgactgtgtcaccg ggtgcacagaatttcgcaaaggcagcggctattggacagtcctcgcccaaaaaggctgag ggtggtgctacatctgcggtggtcacctctgccggtggacgcagcagtggcgtggtggct ccatttggacgtggcaagcctgtagctggccaaggaggacctgcagcaacggcggcttcc aacgttgcccagctgggaagtgtgagtggcaacagcaacatattggctggaccaattggc acctttaatgtagcggatgtggctgctgtgaatgcagccgcggcagcaggagcagcagca gctaccaacagcaatgtgaaacccattgctcccattgcaccgcccagtaagcgagttgga tctcccacccaagtccagcaacagcatcaaacacagcagcagcaacaacagcaactaccc cagcccgcaccagttcctggcccacagccacaacaacagccgcttcagcagcagcaacaa caacaagctcctcagcagcagccacaacagccaaaccagcaacagcaaccacagacgtcc cagcaaaatctcgtgatcaatacaaacctactgaacgatctgatggccgccagtgcagca aacaccaccagcgatagcttcagtgcccagctagcagccaagttgtctagcgcatattcc ctgttcagtgactaccagcagtcgcagtggggcaagttgggtgatccaggcatcggcggt ggagcaggagctgttggcgatggtctgccgcaagcggatgcttccaaggcaccaggatac aatcgcaacatccttagctcgcccgttggcagttccaaggcctcgtcgaatcactccacc tcgcctcctgtgggtaatgtgatccaacagcagcagcagcagcaaccgcaatctagccaa caagctctcaacattatcaccagtggaccaggagggcctgctacagcaccagccagatca ccgatggtgtccgccaacgagggcaatcccgctgtgggtcaaccctccatgaatggaacc caaggattgggtgagacggctcctgctcattcaccaggcgttatcaaaccgcccacggcc acagttcccatccagcgtcatgtgcccatgccgatctctgcgccggaggccggagcaccg cccacatttggagcgattggttccaacccagctagcggtaacaattctgcggctgcccaa gccgcagctgccgccgccgcctcggcgatgatcgatcgtcagcagcagaatttgcagaat ctgcagactttgcaaaatttgcaaaggatggtgggagcctcgcagcagcagcagccacaa cagcagctgaactatccaatggaccccacatcgtcgttcatcgtggatgctaataacgta ctgcgcctgaatccacgcgtcatctttccgcaaggcaacaccaagccaccgcagccaccg ccgcagggtggaacgcagtcgaatgtgtttggaggaaatccaggcagacaaccacctgga acgggtgccagacagccaggaggagcagctgcacagcgttggtatggcggcactctggag tatccttcatacacgggccgtgatatgctgcacctggagaatggtgccggcggaatggcg ggtatgggctcaccatctgccatgtcgcccaaccacgacgacattcgcaagatgccgcgc cccataggcactgagcgagccgcatcatggaagtacaacaactttaacgttggaggcccg tcacttaacatggaagatgctctagccagtgtgttgcccccatgggcacatgagcttaag gctcagcctccaggtttgcagcagccgcctcctccgccgcagtcgcagcagcaacagcaa 
caaccgctcaactggctaaagcagcagccgcagcagcagcagtacagggcctacaacaac ggaccctatccgcagcagcagcagcagcatgagccaatgaatatgcccatggactaccac aacatgcaggcacctccaaatatgagccagcagcagcagcagcacgtcaacttgatgccc tcctacggctaccagcattttgtgggcgctcctggagccgttgacatctccgcacatatg ccggacaagatggaggtgtgggatcaccacgataaacacatgccctggaccaactatacc accaactggtccaactga

Column   Description
------   -------------------------------------------------------------
Gn.Ex    gene number, exon number (for reference)
Type     Init = Initial exon
         Intr = Internal exon
         Term = Terminal exon
         Sngl = Single-exon gene
         Prom = Promoter
         PlyA = poly-A signal
S        DNA strand (+ = input strand; - = opposite strand)
Begin    beginning of exon or signal (numbered on input strand)
End      end point of exon or signal (numbered on input strand)
Len      length of exon or signal (bp)
Fr       'absolute reading frame' relative to start of sequence.
         For example, if nucleotides 1,2,3 of the sequence are read as a
         codon, that's called reading frame 0. If 2,3,4 are read as a
         codon, that's reading frame 1. If 3,4,5 are read as a codon,
         that's reading frame 2, and so on. This information, together
         with the starting and ending positions of the exon, is
         sufficient to give the amino acid sequence encoded by the exon.
         Another use of the reading frame is that if you see two adjacent
         predicted exons separated by a relatively short intron which
         share the same reading frame, it may be worth looking at the
         possibility that the intervening intron is not correct, i.e.
         that the two exons plus the intervening intron might form one
         long exon (assuming there are no inframe stops in the intron,
         of course).
Ph       'net phase' of exon (exon length modulo 3)
         For example, an exon of length 15 bp has net phase 0 since 15 is
         divisible by 3, an exon of length 16 bp has net phase 1 because
         16 divided by 3 leaves a remainder of 1, an exon of length 17 bp
         has net phase 2, and an exon of length 18 bp has net phase 0
         again. The point of this is that exons whose net phase is 0 can
         be omitted from the gene without disrupting the reading frame:
         such exons are candidates for being either 1) incorrect, or
         2) alternatively spliced.
I/Ac     initiation signal or acceptor splice site score (x 10)
         (If below zero, probably not a real acceptor site.)
Do/T     donor splice site or termination signal score (x 10)
         (If below zero, probably not a real donor site.)
CodRg    coding region score (x 10)
         Low coding region scores may indicate potentially incorrect
         predictions or genes with unusual amino acid and/or codon usage
         patterns.
P        probability of exon (sum over all parses containing exon)
         This quantity is close to the actual probability that the
         predicted exon is correct.
Tscr     exon score (depends on length, I/Ac, Do/T and CodRg scores)
         An overall measure of exon quality based on local sequence
         properties
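
The Fr and Ph definitions in the legend come down to simple arithmetic modulo 3, which can be sketched in a few lines of Python. This is only an illustration of the definitions as worded in the legend, not GenScan's own code. The function names are hypothetical, and the choice to express the frame in terms of the 1-based position of a codon's first nucleotide is an assumption taken from the legend's "nucleotides 1,2,3" example.

```python
def net_phase(exon_length: int) -> int:
    """Ph column: exon length modulo 3."""
    return exon_length % 3


def absolute_frame(codon_start: int) -> int:
    """Fr column, as worded in the legend.

    codon_start is the 1-based position (on the input sequence) of the
    first nucleotide of a codon. A codon at 1,2,3 gives frame 0, one at
    2,3,4 gives frame 1, one at 3,4,5 gives frame 2, and so on.
    """
    if codon_start < 1:
        raise ValueError("positions are 1-based")
    return (codon_start - 1) % 3


def can_be_skipped_in_frame(exon_length: int) -> bool:
    """True when dropping the exon leaves the downstream frame intact.

    Per the legend, such exons are candidates for being either
    incorrect or alternatively spliced.
    """
    return net_phase(exon_length) == 0


if __name__ == "__main__":
    # The four exon lengths used as examples in the legend.
    for length in (15, 16, 17, 18):
        print(length, net_phase(length), can_be_skipped_in_frame(length))
```

Only the Ph value is needed to flag an exon as a skipping candidate. The Fr value becomes useful when comparing two adjacent exons, as the legend describes, since a shared frame across a short intron hints that the intron itself may be mispredicted.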