Cla97C01G011962 (gene) Watermelon (97103) v2.5

Overview
NameCla97C01G011962
Typegene
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionPentatricopeptide repeat-containing protein
LocationCla97Chr01: 24723784 .. 24764411 (-)
RNA-Seq ExpressionCla97C01G011962
SyntenyCla97C01G011962
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGGAAGGAGGAAGAGTCCCTGTCATTCTTCTGAAATAGAAACTCAAAAGCGTCAGAAAGGGAGACTTCAGTCCCCATATCATCAAAGTGATTATTCATAGGCTACATAAGAGATTCCACATTGCTTGGGCTAACTTCAAATTGAACATCAAAGTCATCACACCGAAAGGAGAAAGAGGAGTTGTAGGGTCTTCTATTGAATGGATGATGTCATGAATCAAAACAAGATCTTTTGAAGAGAAATCCTTATAGCTTCAAGGATTGGGAATTGAAGGCTTTGATGGAGATATTTCAGATAAGTTCTTATTAGCCACTTCTAATACAGAGGGGATCGAGACCTGAACAACTTTCTTCAATTAAATCAAGATTCATAGTATTGTCTCCAATGAGTGAATGAGAAGAAATTATAGGTTCACTTTTCCGAGTATAGTATAAGGGATAAGTTGCATGTAACAATTTTTAATCTCTTTGACACTAAAGGGCTTTGAGGACTTGAGAGATTTAATGGATGTCTTAAAAGGGGATTCAAGAATCTCAATAGGCACCGCATGAGAAGAATGGTCATTAACAATGGCAAGTGGCTCAAAAGATTTACAAAGATCCTGAAGATTTAGAATGGGTTGTTGATGATCATGCAGACTAGACCCATAACAAACGTAAATTCCTTGTTAAGCAAATATCATCTCCCCTATTGCCAGAACTAAAGCTGGAGGATTAATTACCACATCATTCACATTATAAACTCCTTCGTCTTCCATGACTTGATCTGAACAATGCAAATCCAAAAATTTGGAAAAGTCTTTCTAGGAAAGAATGTCATTAATGATATTTGGAGGATCCAAGGCAGCAACATCACCATATCTATGAAGCGTTCTCTCTTTTTTATCATTGATTTCTAATGTAGTTGCTAGAAAACCATACAAATTCTTCTTGACTTGTGTTAGTGTATAGTGCAATCCAACATTTTAAGGGTTTGAGAAGAAATATTCTCAAGACCACCAAAATAATGACAGATGGCCTTGAATGTGCTACAATCTCAATAGGTTAAAGGTAAGTTCTTAATGGCGATCCTCCCACTATGACCTTCGATAAAATCTGAAAGACTGGGTTTTTCTCGAGACCACTTTTAAATATGCAAATGAAAATCACCATAGAGTTTCCATTTTCCTTCAAATTCTAGAGCCTTTATTGAGTTTTCACCCTTGAAATTATGTAAATCCTTGTCTGCCATGAAAGGATTAATCAAAACATTGTTCTATGTAGGTCTTGTTATACTTGATTGGAGACCGTTTTGTAGTTTGTCTCTTTTTCCATTTTTTTTTTGTGGGTTTCATTTTGTATGACCTTGTGTTTTTTCTTTTTCTTTTTTTTTTTTCTCTCAAAGAAAGTTCAATTATTAATATTTTTTCTCTTTTTTAGAAGAAAGGTGGACAATTTTGGAATAGAGGAAACTAGGAAAGGGAGCGGATTCTTTAAGGCGGAAGCAAAGTTTGTGTGCCAATTCAGGATTGCTAGGCCGTATAATAACACCATATGCTATTGCAGACATCAACAGATTATGAAGGTAACTATAAAACCAAATGTTACTTAACTCCTTGAAGTAGAAGTATTTTCTCAAGCCAACATTTTGTTGGGAAAGAAACTGAAGGCCGAAATGGAAAACCAAATATTACCAACTTCTCAAAGTAAATAGAATTTATCACTCCCACATTTTGTTGGGTTGGTCTAGTGGTAAAAAATGAGACATTGTCTCAATAAATGGCTAAGAGGTTATGGGTTCAATCTATGTTGGCCACCTACCTATGATTTAATATCTTACGAGTTTCCTTCACACCCAAATGTTGTAGAGTCAACCGGGTTCTCCCAAGATGAGATACACGTAGTGGGCTCGGACACTCATAGATATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGACATAAAGCTAAGCAACTCCTTCAGTAACAAAATGCAAGAAGGAGCTGGACGTGCCCCATTGTGTCAAGCTCGATGAAAATTTCCACGCATAACCACCAGCTCACTGCTTTTAATTACTAACCCAAAAACATTTATATTGAACTCAATCCACCGTCTTGTAATTTCAGAATACTCAAAGAAGAAAACACCGATTGGGCAGCAAAAAAAAAAAACCCTAAAAATCAAATTGATAGCTAGCGATCTAATGTCTGAACTAAATATGTATTTCACCGTCAAAGATTCAAAACCCACATACCAAATACAGTCAAATAGAAACATAGAAAGAGAGAGAGGGAGCAACATACATAATCAATCTTTAAGCTACTGGGACGACGGTAACGCTAAGATCGAAAGCTACGGCTTCTGGCTTCTGGCTTCCGGCCTCCGATCTCCGGTCTCCGGTCTCCGGGTTTTGGGTTCTGGGTTCTGGGTTCGGCAGGAAAAGCTTCTCCCTACAAGTTTGAGTGCAAGAAATCGAGTGAAAGTAAATGAAGAAATGGGGCTCGGGGGTACTTTCTATTCCGTAATGGGGAAATTTTGACCCATAAATTTGTAATTTTCACCCCACTTTCAGTTTAGTCATCTTACATTTTAAGCTCTCCTAATAATTACGATCTTAAAATTGTTAAATATGTAACGATTTTTAGTTCTCCATCAATACTTGCTCTGAATTTCAGAGATAAAATGATATAGTAAAGTTCTATTGTGTTAGCCACAAAATGCATGTGGTTTGAATCTTTTACTTAAATTATATAATATATTTATGATGGTAATTTTTGACATAATGATATGGTTAACATAAAATAAAGATAAAAGCAGCAACGTCGATAAAATAACTAAATTTGGGTCAAAGATAAGATTGTAAGCCCAAACTTGGTTCGTATCAAAACCCAAATTGATGTACTCAACATTGACCCAGGTTGAGGCTGAGCCTAGCAAAATAGTAAACGAGGCATGTCTCGTGTATGCCCCAACCTAAAACATAATTCTAAGACCAAAATGGTCCTATAGGATGATGCTCGGTCGAGGTCGATCCTAACCAAACACAAAATATGGCATGCCTCTCACTAAGATAAGTTGCATGATTATTCTAAAGTTAACTTTAATAGGAAACAACTTATGTCAATCACCTTTATAAATACATAAGTATAGCTTCACTCCATAACTTCCGCGCTCTTTCTCTCTTTTACTTATACACAGAGTGCGTGTGACAAGCACAGTAGCAGTTTGTAGATTTTCTCTTCTCTATCCACATTTTTCTCCCCTTCATACAAAGTCACTCACATGAAAGTCAAGTGTATTTCCTATATTTGAATCTTGTATATGCATGTCTGATAAAATTGATAGTTAAATTTGATTTCTTCCTCAGTTTTGTTAGAAAGCCACCAACTTGCAAAATATATAATTCATACAAATACAGCACTTGAGAGTGGTTTTTTTTTCCAGTTATTAATCAGGCGGCAATATTTTGATTTTTTTTATATATCTTTCAATTCATAAGTTTTTGTCAAACGTTGGAAATATTGTTTCTCAAAATAGCTATTGAATGTAATTAACGAGAACCATCGTTCTTTGGTTTCATGAGTGGGGCCTATTCAAGTCTTTATTTTGAAAAGAAAAAAAACATGTTGGAAGAAATGGCAATGACCTATATAATTCATACAAATACAGCACAATTAAATCAAAACAATTTCGATGCCTCAATTTTCATGCGTTGGTAAGGCTAATTGTGTTGGGAGAGTTGTTGTATGTGATGAACTGCAGCTATGCGTTGAAAGTGTCCCCATGCGTTGGAAGTGAAGCTGGAACTTGGTCAGGAGTTGTCCTTGACCTATCATGCGTTGGTAAGGCTAATTGTGTTGGAGAGTTGCTCGTCGCTGAGAAGTTGAAGAGGGAGTAGCGAAACGAGTATTGCTGGTAGCGCTGAGGGCTATGCGTCTCACAAATCTCGCTGAGTGTCGCTAGGTGTCGTTACTCAAGGTGAATCTCGCTAGTCTCGCTCACTATGATCTGGCTAGTGTTGTATATGTGTAACCCAATCTCTCTGGTAGTCTCGCTGGTTATTTATACTCTCGTTCATTATGAGTTGTTTTAGGAGAAGAGAGTCTCGGTGAAGCAATGTTGGAGTTTAATTATGCCAACTTAATGATTATGGGAAGGAAGTTTAGGGAAGTCTAACACGATAAATGTGTATTTACATGCCACGAAGAGCTAAGAAAGGTGTACAACCCGCTCAAGGCCTAGGACAAGTCGTGAGTGACTTCCTATCTAAAAGTGTTTGATTCTAACATAGTTTCTTGATAATATTTGTCGGTAAATGCTTATATGATCAAATTCTATGAACTTTATGTGAGTTCCATGTGTCGTATAATGAGTTTTTTCGATATATAATTATATACACAGACTGAGCAATAGTATGTGAACAATATGAATAAGTACTTGTGATATGGTGGTAAGGTTTAAGGAAAATATAAGAATGTTGAAGGTGTTGGAATGGTTGAGATACTATCAATTAATTTGAATGGATTTCAATGCCGAGGTACTGAGGAAAAGGCATGAATGCTATCTGTGTACATGTTTAGCTTTGTGTTAGTAATGTGTTAATTGTCATGAATTGTGTAATAAGCAACGGTTACAGAATTTCTGTGCAATCTGGAGCAACGCACAGTAATTTTGAAATAGCTGGAGAATGAAGGAGAGTTAAGCCTTGAAAGTTTGAGGTAATTTGAGAAGTTCAATTTTCTACAAGTTTGTGGAAGGGAAGAAGTTCTAAATCTGTCTGGAAATATGACTTTTAACAGATAGGAGAAAAACAAAAAGGAAAGGAGGAGTCTCTGCTGTCGGGTATCGCTGAGTACATAAAGTAAAGGTAGCGAGACGAGTGTCGCTAGGACGCAAGGGGGTTTCCATCTCAGAAAACTCGCTGGGTATCGCTAGGTATCGCTCATCAGGTAAATCACGCTAGTCTCGCTTATCCTAGTGTCACTAGTATCGTGTCCTGTTGGGTAGTTTCGCTAGTAATCTTGCTGGGAATGTGGGATTTTCGTTGGTAATCTTGCTTATTAGGTTTACTTTGGAAAAATGTTTATTTCAGGCCAAGAAGAGGTATGAAAGGCGTATAATCCACTCAAGGCCTAAGGATAAGTTGTGAGTGACACTTTTCTTGCTTCAAAACGGTTTTTGTGTACATGTTAGTTCGATTTTCCAATTGATTATGGTTTTGCAATCAATCGATTTACAAAGCATAGTTTATGTTTCTTAAAAGGTATTGAAAAGTATGTTTATGAAATGTTATTGATTTAAGGGTTGCCCTATTTTCTTATGGAAGATTTGCATATGAACTGTTTGTACAAAGTATTGCTTTCCTTAAAGGAAAATGTTTTAGAGGAGGCTCAGTGCTGAAGGACATGAGAAGGCACGTGAGTTTCCCCAGGACCCAGTACTGAAGGACATAGAGAAGATACTTGGGTAACTCGCCATGTTCCTATGTGCACATAGAGGTTATATGCTGGTGCATTGAGTTCTGCTCCGCATCGAGCCTTATTTGTTTTCGACGTTGGAGTGTTGAGTTTTGCTCCGCCTCAACTAAGGAGAAGTATTGTAACCAAAATGATTTCTTTATGAATGGCTGAAAAATCATGTATGACTTGATAAGGGCAACTAAGCATGATTTATCCAATGAATGTGAAATGATTCCATGTTTTACTCATGTTGTTGGAATTTGTTTAATCCTGATGATTTTCAGACTAAAAGTAGTCTATTAAAATTAGTCACTCACTGGGCTATTAGCTCATGTTTTCAATGTTTTCTTTTACCCCAGGTAGCGAGGATGTTCCCGATGCTTAGCCTACTCCAGCTCGCCTTGCCTAAGCTTTGAGTTTTCAAAAATTAGTCCTGTTTGAAGAGTTGTATGTGTATATCACCTTATGTAGGTGGCATGTAGACCTAGTTGTACATGTGTTGGGATGTGAGATATGGTTTTGTATAAGTACATTGTGGACGAGTTGAATGGGAGGGGGTGTGACAGACCTAGTACTGATGGACTCGGAGAAGGTACTTGGGCATCCCGAGATATTGCTATGTGCACATAGAGGTTGCGGACGTTGGGTGCATTGAGTTTTGCTCCGCCTCGACTAAGGGAAATATTTGTTTTCGACGTTGGGTGCATTGAGTTTTGCTCCGCCTCGACTAAGGGAAATATTCGTTTTCGACGTTGGATGCATTGAGTCTTGCTCCGCCTCGACTAAGGGAAATTTTAAAACAAAAGAAGATTTGATGCGTTTGTAGACTATTTTATTTAAAAGATCTAGCTTATGTTTTATCTCAAGGCTTAACAGGAGTTGTATACAAATGCTATGTTTGAGCATAATTTGATTTCTTGAGTTATCTTTTTCTGAAATGAACATATATGATTTGACAAGGGTATTTAAGCATGTATTTACAATAGATATGACTTGAACCAGTGTTTTATATGCAATGTGTTTAAATGTTTCTACTTTAATTAATTTTCGAAACAAAGGTAATTTTACAAAACCAGTCACTCAGTGGGCAACCCAGCTCATACTTTCCTATTTTTCTATTTTTCCAGGTAGCGAGCGAGTTCGGGGTGCTTAGCCTACACAAGAATCTCGTTTGGGCTGTCTTTAGTGTTTAGTAGTTGGACAGTGAACCTCACAAAGGAGTGAAGGTTGCTGTCCTCACACCTCCTCTTGGGTTATGGAGGATAATTTGGGAAGGGGTGTGACACCTCATGTGGCCTTACCTTCTGACCAAAATGGAGGAACACCGAGTGATTCTGATCGCCTACGGCACACCGAGTGCCTCTCGTAACCTCCGCCCCCGTTTCGAGAGAAAATGGTTTTAGAAAACGAGTTTAGAGGAGAAGGATTTCGCAAACGTTTGAAAGCATGTTTGAGAAATTAAGATGTGTTGTAAGTCTAGAAAGCAAAAAGGGACAACGACGACTTTATAGCTCACAACTCCGGGTATGAGGATTTGAAGGTTAGTCTCACCCCTATATAGTACATTCTGAACAATTTTAGAGATGTCACCCTTGTACAAGCAAAGTGGCGAGCGGTCACTTACAAAGAAAAATACTAGGAAAGTAAACTAACTAGCAAAACGAAATACATAAGGTGAAAGCATGGTAAAAAGGGATCGAGGCGGGCATGCGGCCATGGGCAACACGCCCTGGACAAGCATGACCGTGACATCTTGTCACACCCTACTTTTAAGAAACTGATGCGTTGAAGAGTCTATGAAGAAAAGAAGTAAAAGACCATGTTGGACGCATGGTCAAGGCTAGGTAACGCATGATAAGGGTGGACGCATGGAAGTAAACCGAGAAGGGAAAGAAGTGAAACGCATAAGCAAGAAAGGAATAAAGTTATGTTTTGAAGTGTTGAAAGAAATCTAAGTCACGGAAAGGTAGAATATTATAGCCAATTGGGAAACGCATAGACTATTAGCAAGGAATGCTCACGAGTGTTGGAAAGTGGAAGGTGAGAAGTAAGTTTAAGATTGGAATAAGAGGTGGACGCATACTACTTTATTATCAGAAAATGTTTTAGTGCAAGGAAAAAGGTTAAATGAATAACCTTGGAAAACAAGCACAAGAACGCATAGAGGGATACTTAAGGAATCCTCAAGGAAAGGAGGAAGATCATGGACGCATGATAGTATGCGGTTCAGGTTGACACGTGGTATTATGCAGTAAGACAAACACATAGTAGTGTGCGTTGAGACGTTGACGCCTAGTAGTATGCGATGGAAGAACAAGTTCAAAGGAAGTATGGCTGAGAGTTAGAAAAGTGAGTAGTAAAACTCAAACCTTAAAGGAAACTCAATGAATGCATGGGGGCGTTATGATGAATGAATAGTTGTTTATTGGAAAATGTTGGATGCATAAGATAAGTTTATAAAGGTTAGGTGGCTTCATATAGATTTGACACCTTAAAAGTAGGAGGTGTTAGAATGCATGTTAACTGCTTCATGCAAGGTGACGCTTGGATAAATGCATGGGTAATTAAGCATGCAGCGTGACATGTGGCAAACAAGTGGCAAAAGGAGAGGAAAAGTTGGATGGCAAGCTCCCTATAAATAGATCCTTGAGGGAAAAGGAGTGACTGATGGCTCTGAAAGAAGAATTCTGGAGAGAAAGGGAAAAAAGAAGGCAGAAAGGAGAAATCCTTGCGTCCGTAGTAGAAGTTTGCGGCCAAAAGAGAAGGTATTTTGACAGCTTAGCTATTTTATGTAAAAGAAGTTGTCTTGCGTTAAAAGGGGGTCGGCTTTGCGTTGAAGACAAACACTACGCTCAAGATGAACGCGACGTCCAAAGTTAGGTGATGCGTTCAAGTTGGACCTTGCATCCAAGAGAGGGTGACGCATCCGAAAGAGAAGTAAGAAAAGTTGGGATGCACTTTGCGGAAGTAATCAACCTTGGTTGAAGGGCAGGGTTTGCGGTAGATATCAACCTTACGTTGAAGGGCAGGGTTGCGTTCAACCGAGCTGACAGAGGAAAATGGAGCTTTGCGTTTCTGAGGTTAGAACGCTGTGTCCAAGAAGCTGGGGAAGAGGAAATAGAGGAAAGGAAGAAGAAGTATTCTTTGGCAAGTACAGGTGAGTAACACCTGCATTTTACAAAATATAAATGAGTTCTTTATGAATATGATTTTATGGTTTCTAATGAAATAATGCATGCGTTTTCATATATAAAGCATATGAATATGTTGTTTATAGAGATAAAAGCATGCGTTTATATATACATAAAAGAATATGAGCCAGCTGAGAGCATGCGTTTTACTTGAAAACTTTGAGAATTCTGGGTACTTGCTATGAGTTTATGAAACGCATGATGATTGATTGATACTTGTATTCATGGTTTTTCTGAAAAGCATGGTGTTATATGATTTCAAGAAAGATTTGAAAATAAATTTGTATGAAAAGCATGTTGTTTGATATTAAAGCTGGAAATTGAATTTATCATGCTGATGATGTGAGATTTGGAAACGGTTAATTCAATTAATATAATGATTCCAAAATATATATGGTGTTTTATAAAGCTTTGAAAAGGGACCCCAATGCTCAGGGACTCGTAGAAGGCATTGTGGGCAGCCCGAATTATAAAAGGTAGCCAATGCTCAAGGACTAGAGGAAGGCAAATGGGTAACCGGAAAATAGAAATGGGAACCAATGCTCAAGGATAAAAGGAAGGCATATGGGCAGCCTGAGTAATAATATCTATGTGCACATAGAAGAATATATATATACGACGTTGATGTGTTGAGATTACTCCACATAGCTAAAGATAATCGACGTTGAGTAGCGTTAGCGCCTCAACTAAGGAAAAGATAAGGAATGAATTTGACGTTGAGAAGCGTAGAAGCAATGCTCCGCCTCAACTAAGGAAAAAGTATTTTGAAGGAATGAGATTTTTATTATTTATTCCAAACAGATATGTATACATGTTATAACAGTGATTTGATCATGTTATAATAAGATTTGAACTGTTATATTTTCATGCATGGTTGAAAATATTTAGATTATACAGTTTTACAGTTAAAGTAAAATGTTTCAGAAAAATCGTTACTCACTGAGCTATTAGCTCACCCTTTTTATATGTTTTCCATTTTCAGGTAGCATAACAGTTCTCGTTGTCGATCGAAGTCGAGGTTTGTCAAGAGCGTGTTAATGGTATTACGTCCTAAAAGGATTTTAAAGGAACTAAGGTGAGATTGACAGTGGGTTACGTGACAGGGTAGTCGGTAACTGCTGTACTCACACTCCGTCTCGGGCTACGGGAGGGTAGTTTGGGGTGGGGTGTGACACATCCTCCCCCACTTAATTGGTTGACGTCCCTGTCAATAGTCTCTGCTCGAACTCAGCAATTTGAACGGTAGCTGACTGCAGGTCTTCAGCGCGCTCCCAACTGATCTCTTCTTCTAGGAGGTCTCTCCACTTCACCAGGAATTCTTTCAGATCTCGTCTTGGTCTCCCCACGCGGCGGGTTTTGTTAGCAAGTATCTCATCGATCACTTTTTCTGTTGGGTTCTTCATAGTGACGGGAGGGCGAGTCACCACATTACGTCGTTCGTCGTTGGGGTCAGGATGGTAAGGCTTCAAGTTACTCACATGTATGACAGGATGAATCTTCATCCATGAGGGTAACTGAACCCGATATGAGGCCTTTCCGACTTTCTCAATCACATCCACTGGTCCTTCGTATTTCCTCACAACCCTTTGGCCTCTTTGTCCTCGAAACCGAAACTGTTTCGGGCGCAACTTAATCAGTACTTTGTCTCCAGATTGGAACTCAAGGGCCCTGCGCTTCATGTCTGCCCACTTCTTCATGCTTCGATGCTCTCTCCAAACACACACGTGCTACCTCTGAGGATTCTTTCCACTCTTTGGTGAAGTTGTGTGCTTGCGGGCTCTTACCCGCATAGGGGTGGTCAACGACGTGCGACATGAGTGGTTGTCTGCCACATACAATCTCAAAAGGTGTCTTACCGATGGAAGAACTCTTCTGGCTGTTAAAGCTGAACTAAGCTACATCCAACATCTGGACCCAATTTTTCTATCTCGCTTCAATAAAGTGTCGGAGATACTCCTCGAGCATACTGTTGAAGCGTTCTGTCTACCCATCAGTTTGTGGATGATAGCTGGAGGAAATATTCAATGTTGAACCCAACATCTGAAATAATTCTGTCCAGAATGTCCTAGTGAATCTTGCGTTGCGGTCGCTTACAATGCTGGCAGGGATGCCCCAAAGTTTCACAATGTGTTTGAAGAACAACTGAGCCGTCATTTCGACTGAACACAATTTTGGTGCTGAGACGAATGTCGCGTATTTCGAAAACCTGTCTACAATCACAAGTATTGCCTCAAACTCCCCAACTTTTGGCAAACGTGTGATGAAATCAAGAGACACACTCTCCCATGGCCTTGAGGGCACTGGTAAGGGCTCCAGCAGACCCGCTAGTTTCAACAGCGCGTACGTTCTTTGCCAATCGTTGTGTCCAGCCCACGGCGTATCGTGACACTCTTTCAATAATAGTCTTCGTAGCTCCCCAGATTGAGGGACATACAGACGATTTCCCTTGGTGAAGAGGAGATTGTCTTCCACCCAAAACTGTCGAGTCTTGCCTTCTCTGGCTAACTAAATTAATGCTTGGGCAGCCGTGTCATTGTGTAGGTGGGTTTTGATGGTTTCCCAGACTGTTCCATTCAGGCTATTGGCTTTCAGGTGAGCAAACAAGCACAGGGCCACATGTTCACTTTTTCGACTGAGGGCGTTTGTTGCTTGGTTGACCTTACCTGGTCTATGTTCAAACTGGAAGTCGAATTCAGCCAGGCATTCCTGCCATCGAGCCTGCTTGAACGATAACTTTGGTTGGGTGAAGAAGTGATAGATCGAGCTATTGTCTGTCTTGACTACGAACTTGAACCTTAACAGATATTGTCTCCAAGCCCGTAAACAGTGAACAATTGCCAGCATTTCTTTCTCAGATGCTTTGTACCTTCTTTCAGCATCGCTTAGCTTTCTGCTCTCATAGGCGATAGGGTGGCCGTCCTGGAGAAGGACGCTGCTAGGGCAAAGTCGAATTCATCGGTCTCGACTTCGAAAGGCTTAGTCACATCAGCAATCCCGAGAACCAGTCCCTCCATCATGGCTTTCTTCAAGCCTTCAAATGCGGCTCGACACTCCGGAATCCAACTCCACTTGTGGTCTTTCTTCAACAGTTTAGTCAACGGTCCCGCCTTTTTCGAGAATCCTTCTATGAATCGCCTATAGTAATTGGCCAATCCAAGGAAGGAACGTAGTTTAGTAACAGCGGTCGGAACCTTCCAGTCTCGTATCGCATGGATTTTATTCTGTTCCATGCCAATTCGGCCACACTCAATCACGTGGCCAAGGAAGTTTATTTGCTCTTGGGCGAATGAGAACTTTTCTCTTTTCACGTATAGTTGGTTTTGTTTCAACTTCTCAAAGACCAACTAAAGGTGGTGTTGGTGTTCCTCCATGGGTCGCACTGTAGACCACAATATCATCCAGATACACCACGACAAACTTGTCAAGGTACTCGTGAAAAACCTAGTTCATCAGAGTACAGAAGGTAGCTGGGGCGTTGGTGAGGCCAAAGGGCATTACGAGGAATTCGAAGGCCCTATATCTCGTGATGCAGGTCGTCTTTGGTTCATCTCCTTCGGCAATTCGCACCTTGTAGTACCTCGATCGAAGGTCGAGTTTGGAAAAATATTTGGCGCCATGAAGGCGGTCAAATAAGTCGGTAATAATGGGCAATGGATACTTGTTGCGAACAGTGAGCTTATTTAGAGCCCGATAATCAATACACAATCAGAGGCTTCCGTCTTTCTTCTTCTGAAACAGTATTGGTGCTCCGAAGGGTGCCTTTGCAGGACGAATGAACCCTGCGTCTAGTAACTCGTCTAGCTGTTTCCGGAGTTCTGGGATGGACTCTTCTTCTGTTTCGACCGACTCTACCGGGATGACTATGAAAGTGGGCTCATCACAAGCAAGGCCTTTCTTGAGTTGAAGGGCCGAGATCATTCTTACCCCGTTGGGCTATTTAATGTTGGTCTGAACCACGGTGGGAGTGGATTCTGTAATTACCAAACACTTTGCCAGTGGCATCAGTATCACTTTGTGTTTGAGTAAGAACTCCATCCCCAAGACTACGTCAAAATCATCCATCTTGACGATCACAAAGTCAGCAGGTCCACTCCACTCTCCCAGTTAACCAGTGTTCTTTTCGCTACTCCCGTGATGGGTAAGGCTGCAGAGTTGACGGCTTTCATTTTCCCCGCATCTCTCTCCCATCGAAGGTTTAGTCGGCGTGCTTTGGTTTCAATTATGAAGTTATGGGTGGCTCCAGAGTTGACCATGGTTCTCTTAGCCGCTCTTTGGTTCACCTAGGTGTCGACAATTGTGCGGCACTTGTTCTAACTCATGAACAAGTCAGCCTAATCAGATTCTGATCGCCTATGGCACACCGAGTGCCTCTTGTAACCTCCGCCACCGTTTCGAGAGAAAATGGTTTTAGAAAATGGATTTAGAAAATGGATTTAGAGGAAAAGGATTTCGCAAACGTTTGAAAGCATGTTTGAGAAATTAAGATGTGTTGTAAGGCTAGAAAGCAAAAAGGGACAACAACGACTTTATAGCGCACAACTTCGGATATGAGGATTTGATGGTTAGTCTCACCCCCATATAGTACATTCTGAACAATTTTAGAGATGTCACCATTGCACATGCAAAGTGGCGAGCGGTCACTTACAAAGAAAAATACTAGGAAAGTAAACTAACTAGCAAAACGAAATACATAAGGTGAAAGCATGGTAAAAAGGGGTCGAGGCGGGCATGCGGCCACGGGCAACACGCCCTGGACAAGCATGACCGTGACACCAGACTACAATGGCTCATTTCAAGACCAACTCCAGAAGACACCCACCCAAAAACCTCCTCTATCCACGACGGGCCAAGCTTCCTCCTGACCCTGTCGTCGTCAACCAATTCTTGAACAACAAAATCTCTGCCCCTTCCTCATCCTTCACTGATTTAACTTCCTCTGAGATTTTCCAACTCTCCGAAGGTAAAGACAATGAGCATGAAGAAATCTATGCTTATGACTATAAGGATACTGATGTTGTTTGGGATTCAGATGAAATTGAAGCTATTTCATCACTCTGTCAAGGGAGAATTCCTCAGAAACCTGGTACATTGAACAGGGACAGATCTCTTCCTCTCCCACTTCCTCACAAGCTACGACCACCAAGACTTCCTAACCTAAAAATCCGCCTAAGAACAATGGTCTCTTCACGTGCTTTGATGTCTAAGCAAGTTTACAAGCGTCCTGATTTTCTTATTGGCCTTGCCAGGGCGATTAGAGATTTGTCCTTGGAGGAAAATGTGTCCAAGGTTCTCAATTGGTGGGGTCCTTTTTTGCAAAAGGGCTCTCTATCATTGACGATCAAGGAACTAGGTCATATGGGTCTTCCTGATAGAGTTCTAAAGACCTTCTGTTGGGCACAGGAACGACCTCGCCTCTTCCCGGATGATCGTGTTTTGGCCAAAACGGTTGAGGTCCTTGCAAGGAACCATGAACTGAAGGTACCTCTAAACTTGGAAGAGTTCACTGAACTTGCTAGTCGTGGCGTGCTTGAGGCAATGGTGAGAGGGTTTATCAAAGGTGGGAGCTTAAATCTTGCTTGGAAGCTTCTTGTAGCTGCGAAGAAGGGTAACATCCGACTAAGTGGTTTACCCTTAAATTATTATAGCCTCTTTCCTCAATTCTACCTTATGCGTTTTATCCTTCTTTTAGATTTCCTTTCAACGGATCACTATTCAACTTTAATCCCTACTTATACGTCCGACTTCTTCATAATTACCTCCTATGCGTTTCACTTCTTTCCCTTCAAGCTCTCGAGTTACTTCCATACGTCCACCCTTACCATGCGTTACCTTAGTCCTGACCATGCGTCCAACATGGTCTTTTACTTCTTTTCTTTATAGGCTCTTCAACACATCGATTTTTAAAAGTAGGGTGTGACACGCTGCGCAATTAGTAGCGTTGAGAAGGGCTATGCGTTGAAGGTTGATATGCGTTGGTGTATAGAAAAGTAGCCGAGAGGACCATGCGTTAGTCGGCTATCCGGCGATGGGGCGTTGAGCAATTAGTAGCACAGTAAGTTGAGGCTATGCTTCGCTGAAGCCATGCGCAGAAGGTTTTTGTTGGCCAACCTAAGTAGGAAGGTGATGGCAGTGCTGGCGAGTCGATGGTCTATATTTATATATAAGTGAGTGCACCGAATAATTAAATACTTTAGACATATTTTTGAACTCAGATTGATCCCAATTCGTAGACGAACAACGAGGAGAGGTGGAAAAAGGTATTTATATCGAATTTAAGTAGAATCCAGATCATTTTCATGGTGAAATTGAAGTTGGAACCAGAAGCTTGATTGTTGTCGATAAATCTGAACATCATTCAGTATTTGAAGAAGAGTGAGCTGAGGAAGTTGAAGCTAGGGCTGGAATTTTAACGTTAGTAAGTTAACTCAATGCTTAGCAACTTCCATGAAGGATGTTTAAGCTGAATTTGAGCAAAAATATGTTAAATTCTAGGTAGAAGTAATCGAAGTTGAAATGCTAGATGATTCAAAGCAGCCGGAGTCATTTGAGAATAGTTCTCAAATGAAGAAGAGTTATGGACTGAAATTTTGAGGTAAGTTAGAAAGCTCACTTTTTTACAAGTTTGTAGAAGCGAGGAAGTTCTGAATTTGGCTGGAATACGATTTTTAGCAGGTTAAAGAAAGAGTTAGAAGGGAAAGAAGAAAGTAGTGGCCATCAGGTGTCGCTAAGACAGTGGTATCCCTAAAAGAGCTGGAGAGTGAGCAACGAAACGAGTGTCGCTGGTAACGCAAAGGGCTTTGCGTCTCACAACTCTAGTTGAAGTCTCGTTGGGTGTCGCTCCTTAAGGTAATCTCGCTAGTCTTGCTAAGCCTGGTCTCACTAGTGCCGCATGGTTGTAAGTAGGTTTCACTAATAATCTCGCTGATACTTGTGGATCTCGCTCATTATGAAAAGTTGTAAACAAGAGAGGGACTAGTGATGAAACACATGTTTGAGATGTTTAAGCAAATATAGTGAATATAAAGAGAAATATTAGAAATGTCTAACACGAAAAAGGTGTGTTTATAGGCCCAGAGAAGCTAGGGAAAGCGTATAACCCGCTCGGAGCTAAGAATAAGTTGTGAGTGACATTATGGTTTGAGATAAGCTTATTTGATTACCATTTTCTATATGCTCAGTAATACCTGTGAAGCATGATTTGGGGGTGTAGAACATATCTTATATGTAAGTTAGAGCTTGAAAGTGTAAGAAGCTTTTTCCAGGAAAAATACTATCCTAAATCGTACTATGATGAGAAAAGGAATGAATTCTTGAGCTTGGTACAGGGGGCAGATGTCTGTAGCTGAGTATGAGAGGAAATTTACAGAACTTGCTAGGTATGCCTTGGCAATTGTTGCATACGAAGCAAATGCAAGCGCTTTGAGGAAGGCTTGCGAAGTTAAATCCGCACTTCAGTGACAGCAAGTACAAAATTGGTTGATTTCGCTAAACTGGTTGAAGCTGCCATGAGAGTAGAAAAGAGTTTGGTAGAAGAAAGAACAGACAGAAGTGGGAGTAAAGCTGGACAGCCACTCAGTGCACCTCGAGAGCAGATGCACCGAGGTGAGGGACGAAAATTCACACCAGGAGTTTCTGGTGGAGGAAAGTTTAAAGCCATATCTAGTGGTGGAACATATCATGCAGGTGGATCCCGAGGTGGAGGTATACAGAAAGGTTCAGGAAGGTTTCCTTTGAGGTCAATATCAACCGGTGGTAATCAATAGTCACCCTAGTGAATCAGTTAGTTCAGCAAAGAAACCATTGTGTAATAACTGTGGTAGAAAACATTGGGGTCGGTGTTTAATGGGAGCTGATGTCTGTTATAACTGTGGTCAGCTGGGACATTTTAAGAAAGATTGCCCTCAGCCTAAAGTAGGAAGAGAATCAGAACAGAAAACTATCTCACATACAGTAAGTCAACCTCGGCAGGAGATGGGAAATAAAGAAGGTAGCAGTGGGGGCAGGTTGAAGGCTCAGGTAGGTAGACTGAGACAACAGGGCAGAGTATTTGTAGTAACTCAACAGGAAGCAGAAGAGGCACCAGATGTAGTAACGGGTATGCTAATTATATGTTCTAGAAATGCATATGTATTAATAGATTCTGGAGCTACTCACTCATTTGTGTCTAGTGAGTTTGCTTTGCATATAAATAGAAAGTTAGAACCTTTGCCTGATACTTTGTTGGTACATACTCCAGTTGGTGACTCTGTTATAATAGAATATGCTTATCTTGATTGTGTGTTGGAACTTGATAGCATAGCCTTGTCAGTTGATTTGTTACCTTTGCCCTTAATAGAATTTGATGTTATTCTTGGTATGGATTTCCTGTCGAAATACCATGCTAAAGTTGATTGTTTTAAGAAGGAAGTAAAATTGATAAAACCTGATGGAGTTGGCGTGATTGTTAAGGGGAAAAGGAGGATTCTTCCTACATGTGTGATCTCGGCAGTAAAAGCCAGGAAATTGCTCAGTAAAGGTTGTGAGGCATATTTAGCCCATGTGCCGGAAGGGAAGTCGGGAAGATTGAAACCAGAAGATGTGCCTGTGGTATAAGAATTTCTTGATGTTAGACAGGATTACCCCCTGATAGAGAATTGGAGTTCACTATTGACCTGATACTTGGAACAACTCCTATTTCTCAGACCCCTTATCGTATGGCACCATCGAAACTAAAAGAGTTAAAGGTGCAGTTACAAGAGTTGGTAGATAAGGGATATATTAGACCAAGTGTATCACCATGGGTGCACCGGTGTTATTTGTAAAGAAGAAAGACGGTACTATGAGATTATGTATTAATTACCTCAGTTAAATAAGGTAACAATACGCAATAAATATCCTTTGCCTCGTATTGATGACCTATTTGATCAACTCAGAGGGGCGTCAATATTTTCTAAAATAGACTTGAGATCAGGATATTATCAGTTGAAGGTTAGGGAAATGGACATACCAAAGACAGATTTTCGAATGAGGTATGGACACTATGAATTCATAGTAATGCCTTTTGGGTTGACAAATGCACCAACTGTATTTATGGACTTAATGAACAGGATCTTTCATCCTTATTTAGATCAGTTTGTAATTGTATTTATTGATGATATTCTAGTATATTCTCGGAGATGAAAGAGAACATACTGAACACCTCAAGATTGTTTTGCAGATTTTGAGAGGAAAGAAATTATATGCAAAGTTTAGCAAATGTGAGTTCTGGTTAAAGCAGGTTGTGTTTTTTGGCATATTGTTTCAGCTGCAGGAGTTAGTGTAGATTCTCAAAAGACAGAAGCAATAGTCAAATGGGAACGACCTAAGACCGTGACTGAAATACGTAGTTTCCTAGGCCTAGCGGGATACTATAGAAGATTTGTGCAAGGATTTTCTAGAATAGCCTTGCCACTCACTATTGACTAAGAAAAGTACTGCTTTTGAGTGGAATGAAGAGTGTGAACAAAGTTTCCAGGAGTTAAAGAAAAGACTAGTGACATCACCTATTTTGGCACTTCCAGAAGCAGGAAAAGAGTTTGAAGTGTATTGTGATGCTTCTCATTAAGGGTTAGGTTGCATGTTGATGCAAGAGGGTAGGGTTATCGCCTACGCCTCTATGACAGTTGAGACCCCATGAGATTAATTATCTTACACATGACTTGGAATTAGCCGCAGTTGTACTAGCATTAAAGATATGGAATTAGCCGCACCGAATGAATTGGAGGAATGGAGACTGCAAGCTTATGAGAACACAAAAATATACAAAGAGCGAACAAATCGTTGGCACGATCAACGCATCAGTAAGAAATCTTTGCATGTAGGTCGAAAGGTCCTCTTTTTTAACTCAAGACCGTTTATTTCTAGGAAAATTAAGAACAAGATGGTCTGGTCCTTTTGTGATCAAGGAAATCTTTCCTCATGGTGCCGTAGAGCTGATGAATGAAGACGGCACCAACGCATTCAAAGTTAATGGTCAACGCGTGAAACCATATTACGGAGACTGCCTTGAACGAGACAAAGTAGTCGTTGACTTGGAAAAGATAGAATGAGGACAAGATGAACGCGTCCTGCGTTGCCAACAAGCACATATATCCTAGGGACGCTTCCTAACTTTGTACCTTAGTTGCTTTTATGTCTTTCTGTAAGCCCTAACTTGTAAGGCATGCTTAGTGTCACAATCGTAGTTCTGCGTCGGGCTAAACAATTGTGCGGCACTTGTTCTAACTCGTGAACAAGTCAGCCTGATCAAATTCTGATCACCTATGGCACACCGAGTGCCTCTCGTAACCTCCGGTCCCATTCCAAGGAAAATGGTTTTAGAAAACGGGTTTTAGAGGAAAAGGATTTCGCAAACGTTTGAAAGCATGTTTGAGAAATTAAGATGTGTTGTAAGGCTAGAAAGCAAAAAGGGACAACAACGGCTTTATAGCTCACAACTCCGGATATGAGGATTTGATGGTTAGTCTCACCCCCATATAGTACATTCTGAACAATTTTAGGATTATCACTCTTGCACATGCAAAGTGGCGAGCGGTAACTTACAAAGAAAAGTACTAGGAAAGTAAACTAACTAGCAAAATAAAATACATAAGGTTAAAGCATGATAAAAAGGGGTTGAGGCGGGCATGCGGCCACGAGCAACACGCCCTGGACAAGCATGACCGTGACATCCTCCCCCACTTAATTGGTTGACGTCCCTGTCAACCGTCTGTGCTCGAACGCTGCAATCTGTGCGGTGGCTGACTGCAGGTCTTTAGCGCGCTCCCAACTGATCTCTTCTTCTGGGAGGTCTCTCCACTTTACCAGGAACTCTTTCAGGTCTTGTTTTGGTCTCCCTACTCGGCAAATTCTGTCAGCAAATATCTCATCGACCACTTTTTCTGTTGGGTTCTTCATAGTGACAGAAGGGCGAGTCACCACATTACGTCGTTCGTCGTAGGGGTCAGGATGGTAAGGCTTCAAATTACTCACATGTATGACAGGATGAATCTTCATCCATGAGGGTAACTGAACCCGATATGAGGTCTTCCCGACTTTCTCGATCACCTCCACTGGTCCTTCGTATTTCCTCACCAGTCGTTGGTCTCTCTGCCCTCGAAATCGAAACTGTTCCGGGTGTAGCTTAATCAAAACTTTGTCTCCAGGTTGGAACTCGAGGGGCCTGCGCTTCGTATCTGCCCACTTCTTCATGCACTTCGAAGCTCTTTCCAAGCACGTACGTGCTACTTCTGAGGACTCTCTCCACTCTTTGGTGAAGTTGTGCTCTTGTGGGCTCTTACTCGCATAGGGGTGGTCAACGACGTGCGGCATGAGTGGTTGTCTACCACATACGATCTCAAAAGGTGTCTTACCAGTGGAAGAACTTTTTTGACTGTTGAAGCTAAACTGAGCCATATCCAACATCTAGACCCAGTTTTTCTATCTTGCATCGATGAAGTGGCGAAGATACTACTCGAGCATACTGTTGAAGCGTTCTGTCTGTCCATCAGTTTGTGGGTGGTAGCTGGAGGAAATATTCAATGTTGAACCTAACATCCGAAATAGTTCTGTCCAGAAAGTCCAAGTGAATCTTGCGTCGCGGTCGCTTATAATGCTGACGGGGACCCCCCAAAGTTTCACGATGTGTTTGAAGAACAACTGGGCCGTCATTTCGGTTGAACACAACTTTGGCGCCGAGATGAATGTTGCGTATTTAGAAAACCTGTCAACTATCACGAGTATTGCCTCGAACTCCCCCACTTTTGGTAAATGAGTGATGAAGTCAAGAGACATACTCTCCCAAGGTCTTGAGGGCACTGGTAAGGGTTCCAGCAGACCTGCTAGCTTCGCGCTATCCACTCTGTCCTGTTGATAGACAAGACAAGTCTTCGTGAATTGCATAACGTCGTCCCTGAGGCTCGACCAGTAGTAACCCTGTTTCAATAGCGCATACGTTTGTTGCCAACCGTTGTGTCTAGCCCACGGTGTATCGTGACACTCCTTCATTAACAGTCTTTGTAGCTCCCCAGATCGGGGACATATAGACGATTTCCCTTCGTGAGGAGGAGGTTCTCTTCTACCCAAAACTGACGAGTCTTGCCTTTTCTAGCTAATTGAATTAATGCTTGGGCAGCTGTGTCGTTGTGTAGGTGGGTTTTGATGGTTTCCCGGACAGTTCCATTCAAGCTACTGGCTTTCAGGTGAGTAAACAAGCACAGGGTCGCATGTTCACTTTTTCGGCTGAGGGCGTCTGCCGCTTGGTTAACCTTACCCGATCTATGTTCAAATTGAAAGTTGAATTCGGCCAGGCATTCCTGCCATCGAGCCTGCTTGGACGACAACTTCGGTTGGGTGAAGAAATGACAGATCGAGCTGTTGTCTGTCTTGACTACGAACTTGGAACCTAACAGATATTGTCTCCAAGCTCACAGACAGTGAACGACTGCCAGCATTTCTTTCTCAGATGCTGCATACCTTATCTCAGCATCGTTTAGCTTTCTGCTCTCGTAGGCGATGGGGTGGCCGTCCTGGAGGAGGACACCGTCCAGGGCAAAGTCAGACGCGTCAGTCTCGACTTCAAAAGGTTTAGTCACATCGGCAATTCCAAGGACCGGTCCCTCTATCATGGCCTTCTTCAAGCCTTCAAACGCAGCTTGACACTCCGGAGTCCAACACCATTTGTGGTCTTTCTTCAACAATTCGGTCAATGGTCTCGCCTTTTCCGAGAATCCTTCTACGATCGTCTGTAGTAGTTGGCCAATCCGAGGAAGGAACGTAGTTCACTAACAGCGGTCGGGATCTTCCAATCTCGTATCGCATGGATTTTGTTCTGTTCCATGCCTATTCGGCCACACTCGATCACGTGGCCAAGGAAGTTTATTCGCTCTTGAGCGAATGAGCACTTTTCTCTCTTCACGTACAGTTGGTTTTGTTTCAGCTTCTCAAAGACCAATTGAAGGTGGTGTTGGTGTTCCTCCATGGTCGCACTATAGACCACGATATCATCCAGGTACACCATGACAAACTTGTCAAGGTACTTATGAAAAACCTGGTTCATCAGAGTACAAAAGGTAGCTGGGGCGTTGGTGAGGCCAAAGGGCATTACGAGGAATTCGAAGGCCTCATATCTCGTGACGCAGGTCGTCTTTGGCTTATCTCCTTCAACAATTCGCACTTGGTAGTACCCTGATCGAAGGTCGAGTTTCGAAAAGTATTTGGCGCCATGAAGCCGGTCAAATAGGTCTGTGATAATCCACAACGGATACTTATTGCGAACAGTGAGCTTATTTAGAGCTCGATAATCTATACACAACCGAAGGCTTCCGTCCTTCTTCTTCTGAAATAGCACTGGTGCTCCGAAGGGTGCCTTTGCAGGGCGAATGAACCCTGCGTTTAGTAACTCATCTAGCTGCTTTCGGAGTTCGGCTAACTCAAGAGGCGCCATTTGATAGGCATTCTTTGCAGGAGGTTTTGCCCCTGGCAACAATTCAATTTCATGGTCGATCCCCTTTCTTGGAGGCAAGGACTTTGGCAAACTATCCGGCATAACATCCCGATATTCCTCTAAGACAACTTGGATAGCTTCGGGAATGAACTCTCCTTCTGTTTCAACTAACTCAACCGGGATGGCCATGAAAGTAGGCTCATCGCGAGCAAGGCCTTTCTTGAGTTGGAGGGCCGAGATCATTTTCACCATGTTGGGTTGCTTAATGTTGGTCTGAACCACGGTGGGGGTGGATCCTGTGATTACTAAACACTTTGCCAGCGGCATCGGTATCACTTTGTGTTCGAGTAAGAACTCCATCCCCAAGACTACGTCAAAATCATCCATCTTGACGATCACAAAGTCGGTAGGTCCACTCCACTCTCCCAACTTAACTATTGTTCTCTCCGCTACTCCCATGATGGGTAGGGCTGCAGAGTTGACGGCTTTCATTTTCCCTGCGTCTCTCTCCCATCGAAGGTTTAGTCGGCGTGCTTCCATTTTAGTCATGAAGTTATGGGTGGCACCAGAGTCGACCATGGTGCTCTTGGCCGCTCTTTGGTTCAACCAGGCATCGACGTACATGAGGCCCCGCTCGACGGGTTCTTTGGTCTCTTTGATCTTCCTTTGGAGGGCCAATAAGAATTTCAGTGCCCCCATTCGAGGGTTGTCCCCCTCTTCTGACGGCGGGGTTTCTGCCTCCAGTTGATCAGTTTGCCCATCTGCATAGTTGCCAGGGCAGCTTGGAAGGCATTAAATGCTGTTTGTTTCGGACACTCGCTCACCCTGTGAGGGCCTCTATAAAGGAAGCACTGTGGGGGTCGGTGGTAATTGTTCTGTTGATTAGGCCCCCGCCAAGTGTTCCCACTCCTAGGTTGAGGGGGTTGATAGGGCTTTCGATCTCCACTCGAAATTTTGTCTCCTCCAGCATTCCGAGGCGGGCTTGCCCGATTGTTCTTATTTCCTCCGCTCGAGGAAGTTGATTGTCGTCTAGTGGCTTGGACGTCACTGCTGAGATCGAACAATCGTTTGGCGGCCGCATATGCCGACGTGAGGTCCTGTACTCTCTGTTCGTACAGTTTGGCTTTCGCCCATGGTTTCAATCCTTCGACAAAACAAAACACTTTGTCCTTCTCGGACATATCACGAATGTCCAACATCAACCTTGCAAACTGTTTCACATAGTCTCGGATCAGGCCCGTGTGTTTCAACTTCCGGAGTTTTCGCCGTGCTAAAATCTCGACGTTTTCGGGGAAGAACTGAGAGCGTAGTTCTTGTTTCAGTTTATCCCAGGTATCAATAGTACAACAACCTTCTTGCATGTCTATGTACCGGGACCTCCACCATAGCTTGGCGTCGTCTGCTAAGTGCATCGTCGCCAAGGTAATCTTCGATTCTTCAACAGTTGTGTTCGCGGCCCTGAAATATTGTTCCATGTAAAATATGAAATTTTCCAGAGCTTTTGCGTCTCGAGCCCCACAGAAGGGCTTGGGTTCCGGGATTTTCACTCGATTCAACTGGACTGCATCCCCAGCTGGGGTTTGGTTTCCCACCGCTCTCATGGTGAGATTTACCCTCGTACTGACATCCGCAATCTCTGTCCTGACGACATCGAATGCGGCTCGAAAGTCTTCTGACAAGTCCGTCACCATCTGGATTATTGCTTTTTGAGAGCTATCTAGCTCCTCGACTCTTTCTTCCATATGGGCTATAGAGCTCGATGAGCTGTTGCCACGCTCGAAGCTACTAGGTCTCGCAGCTTTCGTCTCTAGGGTCTCAACCCTTAACATCAACTCTTTGACGAGCATCCCATGCCGGCTTCAATCTCATTAGCCTTTTCAGAGATCTCTTGTACTCGCGTTTCCAAGAAACGGAGGGCATCAGGAACTTCTCTTAGGTAGAGCATCTGCTCTTCTATCTCTACTAGTCGGTCCACTTGCGACTTGGTTAACTGTTTCGCGGTCGACATAATTCTGGCCTTCTCTCTTGTGAGCCGACTTGGCTCTGATACCAACTGTCACAATCGTAGTTCTGCGCCGAGCTAAACGATTGTGCGGAACTTGTTCTAACTCGCGAACAAGTCAGCCTGATCAAATTCTTATCGCCTACGGCACACTGAGTGCCTCTCGTAACCTCCGGCCCCGTTCCAAGCAAAATGGTTTTAGAAAACGGGTTTTAGAGGAAAAGAATTTCGCAAATGTTTGAAAGCATGTTTGAGAAATTAAGAGGTGTTGTAAGGCTAGAAAGCAAAAAGGGACAATGTTCGTTTGAATTTCTATTCGAATTAAATTGAGTTTCTTAGCTCGTTGCACAAAAATAGATTTATCATTTCCGTACAAAGATGTGTCGAGAATTACTCTAGATGTGATCATGATTAAGCAAGTTTTCCTATAAAGACCCAAGTATAAGCATTTATTGGTTTCCTGGTGAGTCCAGGGTCGAACGCAGGGATTTGTAATAACTAGGCGTTGAGATTAGGTGCTTGACTATCCGGTGAACTATACATATAATATGTGAAATTGTTGGTTGTATGTGTCTAAAACTATACATAAAGAATGAATCAAATTTCCTATAACTAACAATGAGGTGGACGCGGCGATGGGTAGTTATATGTAATTGTGTCAGATAGCATTGAGAATGAATGGAACCATGATATAGAAGTAGGTTGAACGCAAGAGCTAGTTGTTACCTAAACTTTCTAAGTACAATTACATACCTACAAACTGCGTTTGTGAATCCATCTCTCGATGTTTCGACGCCACATTAACTTATCTCTAAGACGCACGGTAAGTTTGCGTTGGCAAAAGATGAACTACATCTCTGCAACTCATCTAGTCTTAATAAATGCTAAAGATTAACTATCCACTCTCTCAAGCTCGACATTCATGCTTACTATGTTTGTTTTTTAGACAAAGTTATGCAAGCTAAAGATATGCATTAATCAAGATGTAACATACTCAGAATTGCTCAAGAAAAACTAACTTTTAACAGCTAACTCATTTAGCATTTAAACAACACGGTGTAAAGTATTTAATACGGAAATAGATGGAAGATAGTAGTCAATCAAACTCATATTAAGAAAATAAATCATTCTCTCGAATGTCATTAACAAAATATAAGAGAGAAATGGAAATAAAACAGAAATGAAGGGCGTCAAGCCGCCACTGGCAACATCTTGCTTCATGGAGCTTGAGATGGTTGCTATCTGATACATAGATTCCATAGGGTGAAGGTTCTCACTCACTGGGAGACGACTCTGAAGGTGCTCAGCATCCTTCTCTCAGAAGAGCTCGAAATGGAGGCTGGTTATGGTGAGGGAGAAACTGTGCAGAGCTTTTGCCTCTTAAACTGGTCTCTCTTCCTCCTTTTCTTGGAGAAGCTTTTGGGCTATTTCTTTGCATTAAATTTCAGCCACACTGCATAGACTTTTCAGATTGTCAGCCTATTTGTTCTGCCATGATCTGATTGGGTGTTGTCCTTTTTCAGAATGACATGACAGCTGTTTGAGCATGCGTCTGTGTCTACTATTTCTCGATGCATGGTCTCAACGCATAGGCTCGACGCATGGGTTTCCACTTGTTCTCGCAGCAGCTTCGTCTCTTTGTACTTGATTGTTTGCGCTTCTTTTCTTCTTAACCCTGCGTTAGGAACAGAAAAGCTCATCGTTTTTCAGGTATTTTCTTTTGTAAGTAGTGCCAACACATGGTGATACATTTTTCATCCTTCAACTACCAAAATCATGCGTTTAGCTCCAAAGTTACTCTAATTAAGTCAAATAGTGACTATAATAACTTGTATTTCTACAAGTTATCACACCCCCAAATTTGAGTTAATGTTTGTCCTCGAACATTTATTTGAAGATTTAGAATTATTTATTAAATTCTACAGCCCAAGTCTAAGTGAAAGCGCACCCTCTCAACATGCCTTCTAATCCTTGACTATATCAACAATCAGATAACCATACCAAGTTCTTTTTCAATCGACGCTTTGAAAGTACTTCTAATTTTTAGATGCGGTAAGCTTATTCCTCAATCTGATGTCCTCATTTTAGTAATGTGCCTTGGAGTTCATTCATTCATAAAGTCAATTTTTTTTTTTTTTGATTAGTGGACTTGCGTTGATCCAGGCTATCTACCTTCGTTTTGACATACCAACGATGAGGTGCATGGAGGCAACCAACATCGAGCAAATACCCTTTTACAACTCGGCACTTATCGCGTTCCTTTTTGGTTACGTCGACAGTGGCACCATTTATTCATTTTGCGCTGATCTTGGACTCACCTTTGATTATGGGGTGTTAAAGAGCCATTCTCAACTCAACTCTTAACAGACTAATTTATGCGTTGTCAGCGATGTTGTTTGTCCAAAGAAATATATATATATATATATATATATATATATATATATATATATATATATATATATTTATTTATTTATTTATTTATTATTTTTATTATTTTTTTTTAATTTTGAACACATGGCTCAAATATTTCTTACCCCCAAATTTGAAGGCAGCAAGGTCCTCATTGCTGAAATCAAGTTATGAGATTGCCTTGGAAAAATTTTGTAAAAGAGGTTTGAGGCAATGCTTGCAACGCATACTAACTCTTAGAAGTAATTTACCAAGTGTAATATCCTAAGGCTTGTGAGTCAATCCTCTTATTTCAATAAGTCTACAGGTGAAGCGCATAAATTTCATCATGGAGGTTGGACGCATAAACAAAAACAAGGCGGAGAAAATAATAAAGTAAGAATTAAATGAAAGTACAAACTAAAGAGAACAAAAAGATTCATGAAGATATGGCATAGATGCAAATGTACTATGCGTTGTACTAATTACAAAAAAATTCAAAGAAATAAATGTAAGAAAAGAAGGGGAAAAGTCCCATAGAACACCCCCAAATTTACGTTCTCCCCGGGGCGTTGTCATTGTTGTCCACAGAGGGATGATTTGGATCGTTGAACTGAAGGGGTTGCTGAAGGGAAGGAGGAATAACTGGCGCTGGGATGCGTTGGACGAACACTTCATAAATATAATTCATCAAAGTGCTGAATTGTTCATCCTGACGCCGAGCTTCGCGTTGAACAAAACTCCGAAGCTCCGATTGTTGCTGCGCCATCTGAGTCACTTGTTGACCCAATTGTGAAACTTGTTGAGCCAAATCTTCTTACCGCTGGAGATTTTCCATCTCCTTGCGCCGAATTTCATCCAGGTCTCCCATGACGGGGTGAAGGATCCCATCATCCAGGTACCGCAGCAAATCATCCAAATTTAAATCAACAAACGGCAGCGGAGCAGAGGCGTGCTGCAAGTGATATGGCTCTTCTTGATGATGCGTGCTGGTAGTTGCTTCTGCCTGCGGTTGAAATGTTGGGCGGGCAGAAGGTTCGGTAGTTGGTGGAGTTGGAAGATTGGAAAATTGAGGTTCATGGCGAGGCGGTGAGAATTGAAAGGGCGCTGACTCCTCAGGATCCAGAGTTAGGGTCGGCTCAAAGGTAGATGATTGAGAGGGTGAGAGGGTTAACTCTAGGTGGGGCGAAGAAGGTGGCGACACTTCCTTCGTCTCTTCATTTAAGTCTGCGATAGGAAGCTGAAGAGGCGTTGGAAGGCCGCGTTGAGTAGTGGGTAGATGAAGTGGCTCTTTTTGGATTTTAGGGGAAAGAGAAGAGGAAGTTGGTGATGAAGAGGGCTCCTCAGTGATTGGGTCGATGAGAGGAGTATAAATGGTCAAGGCTTGGGTGTAAGCGTCGAGCTGAACAGTTTCTTCGGCTTCACTAAGTTGTGAACTCGACGCCTGTATCTCAAAATTCTTGACCAGTTTCCTCTTCTTGGGTTGAGGAAGTTGAGGGGATGATGGTTCTTGTGACTTTGGCGGCCTCTTCTTTGAGTTGGCTGCGTCAAGCAGATGCGGTGAGTCCTTTAATATCCGTCTCAGACATTGCGCTGAAAGGACTCCATTCAATGCTCTGACTGGTGCATCCTCCAAGAATTGTGCGTCGGCGCACAGTCTTGTCACAAGAAAGGGGAAGAATAGCTGACCCCTTGGCTTGAAGAATAACCCCTGAATTTGTGGGACAATGATGCTCCCATGTCTAGCAGAATTCCTCGCATGATGCAGTAAATGACTATGGCACCGTCCCTTGAAATTGAGGCGTCATGCGTTGTGGGGAGCAGCGACCGCTTAATGAAATAAAGCCACAGGTTGGCCTCGGCGATCAAGCAGTTAGGCGCCAATGACTGAATGGCCTTTGGGGAAGTGTTCCACTCTGACCCCGGTTTGGCGATGATTGTGAGGGCGTCATTCATTTCGGCGTTTGTAGGCGTCGACATTATTCTATTACCTTCCGCTGCCGCGATGTTGGGAAGATCAAACAACGCGTTGATCTCTTGAGGCTCGAAGGAAATCACCTCATCTTCAACTTTGACTAAGTGTGGCTTGCGGTGAATTATTCCTTCGTAAAATGCCTCCACGGCGGTGGGTTGGATGACTTGGCCACTTTTACAAAATTGCCTCTAACCCAATGCGTCGACTCCTTGTATGATGAAGGCAGGCACCTCAATTGGCTCTGGGAAAAATCCCACTTCAGTGTAGAGCCCACTGTATTGTTTCTTTCGGCTCTTCGCCGCCTCCTTTGTTGGTTCCTGAACCAATTTTTCTTTTTCTTTCTCATTCCTACTGATGGGACTAGCAGGCGTTCTCCTTGCTCTGCTTGCATCCAAGTCCCTTATAACAACTCCAAATGCCTTAGCCACCTCAGTCTTCTTCTTTTTCTCCGGTAGGTTCTCTTTGGTGACTGTTACCTCTTTTTCTTTCTTCTGCTCCCTTATTCTTTTTATATTCTTCTTGATGCGTTCAGCTTTGGGTGGCCTCTCCGCATCACTCACTTCCAACTTTACAAATTCCGTCACAATCTCTGCTATGCGACGGAGAAACTCGGCTTTTCTAGCGGCCTTTTCAATAGCTTGTTCGCCAGTCGCAATTTCCTCTTCTCTTTTCTTCTTGGCGGCGAGAAACGACTCTGTCCTTTCTTTCATCTTTTTGATGTCTTTGAGCTTTTTGGACTTCTCTGCCACTCTCGCTGCTAATCGCTCAACAGATGATGCGGCGACCGATGATGCGATAGCCTCGGATGTCCCCGCGTCCTTTGCTTTTTCATTTTTCCTTCTTAGTGGAGGGACATCTTCCTCGGCTTCTTCTTCCCTGGCCCTTACTGGCCTGATTTTCTTTTCCTTGGCGGATTTCTTCTTTTCCACCGCCGCTATTACCTTTCCAATACGTCTTTCATCGGAGTCCGACTCCTCAAGACCGCTCATTGCAGGGACTGACCCATGCGATTCATTTTTCATCCTCTGGTCGACCAATCCGTCCAACCAGTCATCGTCTTCTTCATTGATGGTGTCTAGAATATCTCCGAAGACGGTGGATTCGTCATTCCTTCCCACCTTTTGAGCTTCAGACAGGTCCCTGGAGCGTGGCAAAGATGAGGATGATGTCGCTTCCAACTCAGGACTGTTGGGGCGAACAATGGCGAGAGGTTCAGGTGTGGTGGAGGTAGTGGACAGATTCCGACTACCCTGCCTCGAGGATGAGGCCATAGGTTTAGGGCTTTTGGAGGAAGTGGGTTGCTGGTAAGCCATGGAAATTGTATCGGAGAAAAGAGAGAAAAGTGGGTGAGAATCGGAAGATTTTTCAAACAGATCGGAAATCACGCCGGAAATCTGAGAATTCTGGGAGATGTGGAAGCGTAACTTGGAAAAGGGTTGAAGAAGGGTTTATAAAGGGTTTAGGCTGGGCACGTGGGTTATCGACTCGATGCGTGACATCATAGCTGCATTAAATGGTCTGACATCCGTCGATATCAGCTGGTTTAACCGCCGTTAGCCTTTTCGAATTCTTGATCTGAGCGATGTGTCAGAATCGCTTCAGATTCCCCTCTAGGCATCGAGATAATTGTATCGCATCGTTTTAGATGTCGACGCCTTGTAATTGAGAGTTCAGTTTTCACTCAACGCAAGATAATAGATAGCACCAAACAAAATTTTTCAACGCAAATAAAAGAATGAAATAAAATAGAATAAAATAAAATAAATTAAAAATAAACTAAAGATAAAAATAAAATAAAATCAAATAAAATGACGAAAAGAAAGGAAGAAATGAAACAATGCGGTAAGAAAAAAAAATTAGGATGCGTCCCTAGGATATAAGTGCTTGTTGTCAACGCAGGGCGCGTCCATCTTTTCCTCATTCGATCTTTGCTGGGTCAACGGTTACTTTGTCACGTTCAAGGCAGTCTCCAAAATATGGCTTCACGCGTTGACCATTAACTTTGGATGCGTTGGTGCCGTCTTCATTCATCAACTCTGCGGCACCATGAGGAAAAATTCCCGTAAAAAAGAACTCTGAAAATTTGAGATTGCTGCCGCAGCGTTCAATTTGTGAAATTTTTTGGGCTAACGTATCGTTCATTACTTTGTTGCCTGATGTTAGCCCAATTGTTGGCTCTCAAAGGCTGGTTTGCTTAATTTTTTCAATTACTTCTCTTTACAAAGTTGTCGCTGCCTTTGATGAGTTGTTTCTTTTCCGAAAGGTTCTCTCGATCTTACGGTCAGAAATGAACGAAGGCTGTTCTTCCTTGCTCATAAACTGTCAGATACGTCTTCGCCAAAACGCTCAGTACGACCCAAGATAGAGATTTTTGCATAAATCAACAAATAATATCAAAACCTAAATTATTAATTCAAATCCCCGGCAATGGCGCCAAAAACTTATTCGTTTGAATTTCTATTCGAATTAAATTGAGTTTCTTAGCTCGTTGCGCAAAAATAGATTTTATCATTTCCGTACAAGGATGCGTCGAGAATTACTCTAGATGTGATCATGATTAAGCAAGTTTTCCTATAAAGACTCAAGTATAAGCATTTATAGGTTTCCTGGTGAGTCCAGGGTCGAACGCAGGGATTTGTAATAACTAGGCGTTGAGATTAGGTGCTTGACTATGCGGTGAACTATACATGTAATATGTGCAATTGTTGGTTTGTCTAAAACTATACATAAAGAATGAATCAAATTTCCTATAACTAACAATGAGGTGGACGCAGCGATGGGTGGTCATATGAGTGGCAGATAACATGCGTTGAGAATGAATGGAACCATGCGATAGAAGTGAGAAAACAAGCTAAGAAGTCTGTGTTATGCGTTGAAAAGAGGAGTGGTGGCCGAGAGTGAAGGAGCAAAAATGGTTAGGTGTTGGCCATGGCTATGCGTTGGTGAGTTAGAAGAGTTGTTATATTTTGATAGAGCTACTATGCGTTGAAGGTGTAACCATGAAAAGTTGCTATGCGTTGAGGGTGCCACCATGCGCTGAAGGTGTCACGCGTTGGAAGGGAAGTTGGCACTTGGTGGGGGAAAAGTTGTACTTGACCTACCATGCATTGGTTAATGTTACTATGCGTTGGAGAGTGGCTGTAAGTGTTGGAATGCTGTTATGCGTTAAATGTTCCCATGCATTGAAGGCAGCTATGTGTGGATGGTTGCTGTGAGCGTTGGAGGTAGGTTGCGTCGAGGGTGCTATGCGTTGGTTTGTTGGTGAAAAGTCTGCGCCGGCCAGCTTATTTTAAAAAGGAATGAAAGTGGAGGTTAAGTCGGCAGCATATATATATGAGTGGCCCAAATATTTTATCATTTTAAACACTTTCCAAAGCTCGAGTCAGTTCCATTTTTCAGACAAACAACGAGGAAAGGTAGAAGAAGATGTTCATGTCGAATTCAAGTGAAATTCTATTCGTTTTGGCGGTGTAACTAAAGCTGGAATAGGAAGTTTGATTGTGGTCGTAAAATCCGAGCATGGTCTAGTCATTGGAAAAGTGCGAGCTGAGGAAGTGGAAGATGGGGCTGAAATTTTGAGTTTAGTAAGTTAACTCAATTCCTAACCATTTCTATGAAGGATGTTTGAGTTAAATTTGAGGGGAAACATGTAAAATTGTAGGTGGAAGTGATTGCTATTGATTTGCTGGGTAATTGAGAGCAACTGGGTGTTATTTGAGAATATCTCTCAAACGGAGATGAATCATGCTCTGAAATTTCGAGGTAAGTTAGAGAACTTACTCCTTTACAAGTTTGTAGAAGGAAAGAAGTTCTGAATTTAGCTGGAAATGCGATTTTTATCAGGTAGAAATGGGAACAAAAGGAAAGGAAGAAACTCTACTGTCGGGTGTCGCTAGGAAGCGGGTGTCGCTGAGTAAGTAAAGTATAAATAGCAAGACGAGTGTCGCTGGAACGTAGGGGGTTTTCCGTCTCACAAGTTTCGCTAGGTCTCGTTCCTCAGGTAAATCTCGCTAGTCTCGCTTATCATGGTGTCGCTAGTGTCGTGTCGTGTCGTGTTGTGTGTTTTCGCTAGTAATCTCGCTAGAAATGTGGAATCTCGTTGGTAATCAGTGACACAAAATCTCGCTTATTATGATTACTTTGGGAAAAAGGGGAGTCTCGGTGATGAAACATGATTCGAGTTTAATTATGCATACTTAATGATCATGGAAAGGAAGTTTAAGGAAGCGTAACGTGAGGAATGTGTATTTACAGACCAAGAAGAGCTCGGAAAGGCGTATAATCCGCTTAGGGCCTAAGGACAAGATGTGAGTGACACCTTTCTTTCTTCAAAGACGATTTTATTGAAACTGTTTAGTTCAGTTTTTCTGATTAAATATGGCACTGTGATGAATTGTTTTGAAAAGCATGATGTTTGATTTTATTGTTCACAAGGTATGTTATATGTTTCCTGTAAGGTGTCGGAAAGCTTGTTTATGAAAAGTTATTGATTAAAGATCTGTCTGATTTTCTATGAAAAGATTTTCATATGAAACTACATAAATGTTTTAGAGGAGGCTCAATGTTGAAGGACGTGAGAAGGCATGTGAGTTTCCCCGGGACCCAGTACTGAGGGACATGGAGAAGGTACTTGGGCAACCCGCCATATTCCTGTGTGCACATAGAGGTTGTGACGTTGGGTGTGTTGAGTTTTGCTCCACCCCAACTAAGGTTGTTTTCGACGTTGAGTGCGTTGAGTTTTGTTCCGCCTCAACTAAGGTTGTTTTTAACGTTGAGTGCGTTGAGTTTTGCTCCGCCTCAACTAAGGTTGTTTTCGATGTTGGGTGCGTTGAGTCTTGGTCCGCTCTGACTAAGGAAAAATTTAAAACAAAAGAAGACTTGATGCTTTGTAGACTGTTTTATTGAATACGATTGGAAACGATGTTTTAAATGTGAATTCATGTGATTTGATTAAGTGGTGTTATAATGGTTGTAACATGCCTATTTTTGAACAAAAGATAATTTTACAAAACAAGTCATTCACGGGGCTTGAGGAGTTGTATGTGTATATAATTTTACCTTGCCTAAGCTTTGAGGAGTTGTATGTGTATATAATTTTACCTTGCCTAAGCTTTGAGGAGTTGTATGTGTATATAACTTTATACATATGTTGAGATGTGAGATAGGATGTTGTACAAGTATATTGTCGATATGTTGAAAGTTTAATAATATAAGTTCTGGTTGTATGAGGGGATATGTATGTTATGAATTGTCAGGATTATTAGGTACAGATGGACAAGAGGGGTGATGTTTACTGTCCTCACATCTCCTCTCGGGTTGGGTAGGGTAATCTAGGAAGGGGTGTGACACTTTCTATTTTTTTTCACCTCTTTCCGCATTATTTCATTTGTTCCTTTCTTTTCTTTAATTTTTTTTATTATCTGCTTTGAAAAGTTTTTAATTGGATGCTATCTATGTCATTCTGTGATGAGTGAAATTGAACTCTTTCTTAAAAAAAAAACAAAAAAGAAACGGAAAAGAAAAACCATGGACAAGGGATCTCGGATAATTACAAGGAGTCGACATAAAAAAAAAATGCGACACGATTAGCTCAACACCTGGGGGGCAAGGTATCTGAAGCGATTTAGTCAGATCACAAGATTTGAAAAGATCGCGATCAGATCACGAATTCGAAAAGCTGACTGCGCGTACTCCGGCTGACGCCGACTGGTGTCAGATCATTTAATGCGGCTATGATGTCACACATCGAGCCGATAACCCAGGTATTTGTAATAAATTAGGTGTTGAGACTAAATACAAAAACTATGCGGTGACCATACGTATTATATTCAAGAGTTTTATGTGCCTAAAACTATACATAAAAAGAATAAATAGGAAAGAATAAATAAAATTTCCTACAATAAACAAGATGAGGTGGAAGCGGAGAATGTAGTCAAATATCATTCAGATAGCATGCGTTCAGTGACGGTAAAACCATGCGATAGTAAAGAATGCATTATTTCATTTGTTCCTTTTTTTTTTTTTTATTATCTGCTTTGAAAAGTTTGTTTGGATGCTATATATGTCATTCTGTGATTCTTTAAATAAAACTATCCAATTACAAGGCGTCGACATCAAAACAATGCCACACGATTAACTCGACGCCTGGGGGGGAAGGGATCTAAAGAGATTTAGTCAGATCACGATCATATCACTAATTCGAAAAGCTGACTGCGTGTTTTCCGGCTGACGCCGACTGGTGTCAGACCATTTAATGCGGCTATGATGTCACGCATCGAGCCGATAACCCAGGTATTTGTAATAAATTATGCGTTGAGACTAAATACAAAAACTATACAGTGACCATACGTATTATATTCAAGAGTTTTATGTGCCTAAAACTATACATAAAAAGAATAAATAAAATTTCCTACAATTAACAAGATGAGGTGGAAGCGGAGAATGTAGTAAAATATCATCTAGTCAGATAGCATGCATTGAGTGACTAGAAAACCATGCGATAGTAATAGAGTTAAACATTATTTCATTTGTTGCTTTATTTTTTTTTTTCTGCTTTGAAAAGTTCTTGTGGATGCTATCTATGTCATTCTGTAATGAGTGAAATTGAACTCTTAAATAAAACTAGGATAATTACAAGGCGTCGACATCGAAACAATGCGACATGATTAACTCGCCGCCTGGGGGGGAAGGGATCGGAAGTGATTTAGTCAGATCAAGAATTCGAAAAGCAGACTGCGTGTTCGCCGGCTGACACCGAATAGTGTCAGACCATTTAATGCGGCTATGATGCCACGCATCTAGTCGATAACCCACGCGCCCAGCTAAAAGCCTTTATAAACCCCTCTTCAACCCTTTTTCGAGTTTACGCTTTCACTCTCTAAAATCTTCGACTACCGCTGCCATTTCTGGCGTATTCGAAAACTTCCCGATTCTCACCCACTTTTCTGTCATTTTCCTTATACCGGCCCCATGGCTTCCCAGCAATCGACCTCCTCCAAAAGCCTCAAACCCATTGCCTCCAAAAAGGCCGGCATTGCCTCATTCTCGAGGCATGGTAGTCGAAATCCGTCCACTACCTCCACCACACCTGAACCCCTCGCCGTTGTCCGCCCCAACAGTCCTGAGTTGGAGGCGGCATCAACCATATCTCCGCCACGCTCCGAGGACCTGTCTGAAGCTCAAGAGGTTGAGAGGAATGACGAGGCCAACGTCTTCGGAGACATTCTGGACACAATAGACCAAGAAGACGATGACTGATTGGACGGTTTGGTCGACCAAAGGATGAAGAGTATGCCGCATGGGTCAGTCCCTGTGGTGAGTGGTCTTGAGGAGTCGGACTTCGATGACAGACCCATTGGAAAGGTAATTTCGGCAGTGGAAAGGAAGAAATCCGCCGCGAAAAGGAAAAACAGGCCAATCAGGGCCATGGAGGAAGAAGCTGAGGAAGATGTTCTTCCGCTAAGAAGAAAAAATGAAAAAGTCAGGGATGCGGGGACATCCGAGGCTGTCGCAGCATCTTCCGCCGCATCATCGATTGAGCGGCTAGCAGCGAGGGCGGCAGAGAAGTGCAAGAAGCTGAAAGACGACTTCAAAAGAATGAAAGAAAGGACGGAGGGGTTTCTCGCCGCAAAGAAGAAAAGAAAAGAGGAAATCGCATCTGGTGAACAAGCTGCCAAGGAGGCCACCAGAGAAGCTGAGTTTCTCCGTCGTATAGCAGAGATTGCGGCGGAGTTTGAAGTTGAGTTGGAAGGATGTGATGCGGAGAGGGTCCCCAGAGCCGAATGCATCAAGCGCAATATAAAGAAGATAAGGGAGGAGAAAAAGAAAGAAAAGGAGGTAGCAATAAGCAAGGAAAGGCTACCGGAGAAAAAGAAGAGGATTGAGGTGGCCAAGGCACCTGCAGTCGTAATAAGGGATTTGGACGCAGGGGGAGCAAGGAGCACGCCTGGTAGTCCCACCAATAGGAACGTGAAGGGAAAAGAGAAAATGGTTGAGGAACAACCCAAGGAGGCGGCGAAGGGCCGAAGGAAGCAGTTCTGTAGGCTCCATACTGAAGTGGGGTTTTTCCCAGAGCCGATTGAGCTGCCTGCCTTCATCATCCAAGGAGTCGACGCAATGGGTTGGAGGCAATTTTGTGAAAGTGGCCAAGTCATCCAACCCACCGCCGTGAAGGCATTTTACGAAGGAACCATTCACCGCAAGGCACACTTGGTCAAAGTTGAAGATGAGGTGATTTCCTTCGAGCCTCAGGAGATCAACGGGTTATTTGATCTTTGTTCGTTTGAATTTGTTCGTTTAATTTCTAATTAAATTGAATTGAGTTTCTTAGCTCGTTGCACAAAAATAGATTTATCATTTCCGTACAAGCATGCGTTGAGAATTACTCTAGATGTGATCATGACTAAGCAAGTTTTCCTATAAAGACCCAAGTATAAGTATTTATAGGTTTTCCTGGTGAGTCTAGGGTCGAACGCAGGGATTTGTAATAACTAGGCGTTGAGACTAAATGTATGACTATGCGGTGAATGTACATGTAAATATATATGCATTTGTTGATTGTGTATCTAAACTATACAAAAAGAATGAAAAGAATTTCCTACTACTATTAATGAAGTGGACGCGGCGTTGGGTAGTCATATCAAACAATGCGTCAGATAGCATGCGTTGAGAATGAATGGGACCATGCGATAGAGGTAGAGTTGAACGCAAGAGCTAGTTGTTACCTAAACTTTCTAAGTACAATTACATACCTACAAACTGCGTGTGTGAATCCATCTCTCGATGTTTCGACGCCACATTTATCTTATCTCTAAGATGCATGGTAAGTTTGTGTGGCAAAAGATGAACTACATCTCTGCACTCATCTAGTCTTAATTAATGCTAAAGATTAACTATCCACTCTCTCAAGCTCGACAGTGATGTTTACTACGTTTGTTTGTCAGACAAAGTTATGCAAGCTAAAAATATGCATTAATCAAGATGTAACATACTCAGAATTGCTCAAGAAAAACTAACTTTTAACATTCAACTCATTTAGCATTTAGACAGCATGGTGTAAAGTATTTTATACATAAATGGATGGAAGATGATAGTCAATCGAACTTATATTAAAGAAATAAATCATTCTCTCGAATGTTATTAACAAAATATAAGCAAGAAGGAGAGAAATGGAAATAAAATAAAGATGAAGGGCGTCAAGCCGCCACTGGCAACGTCTTACTTCATGGAGGCTTGAGATGGTTGCTATCTGATACATATATTCCATAAGGTGAAGATTCTCACTCACTGGGAGACGACTCTGAAGGTGCTCAGAATTCTTCTCTCAGAAGAGCTCGAAATGGAGGCTGGTTATGGTGAGGGAGAAATAATGCAGAGCTTTCGCCTCTTGAACTAGTCTCTCGTCCTCATTTTCTTGGAGAAGCTTGTGGACTATTTATAGGCGTAAAAAAGTTAATATCATGATGACCTGACACCTTGTGCATTAAATTTCACCCACGCTGCATAGACTTTTCAGATTGTTAGCCTATTTGTTCTGCCATGATCTGATTTGGTGTTGTCCTTTTTCATAATGACATGACAGCTGTTTGAGCATGCGTCTGTTTCTACTTTTCAGATCGTCAACGCATAGTCTCGACGCATGGTCTCAACGCATAGGCTCGACGCATGGGTTTCCACTTGTCTTTGCAGCAGCTTCGTCTCTTCGTACTCTGATTGTTAGTGCTTCTTTTCTTCTTAACCCTGCGTTAGGAACATAAAAGCTCATTGTTTTTCAGGTATTTTCTTTTGTAAATAGTGCCAACGCATGGTGATACATTTTTCATCATTTAACTACCAAAATCATGCGTTTAGCTCTAAAGTTTCTCTAATTAAATCAAATAGTGACTACAATAACTTGTATCTCTACAAGTTATCAATCTTCCCGACATTGTCGCGGTAGATGGTAACAGAATCATGTCGACGCCAACAGAAGCCGAATGATGCCCTCACAACCATCGCCAAACCGGGGTCAAAGTGGAACACTTCCTCAAAGGGCATTCATACATTGTCGCCAAACTGTTTGATTGCGGAGGCCAACCTGTGGCTTTACTTCATCAAGCGGTCGCTCCTCCCCACTACGCATGACGCCTCAATAACAAGGGACCGTACCATGGTCATTTATTGCATCATGCGGGGAATTCGGTTAGATGTTGGACGCATCATTGCCCTACAGATTCGGGGGATGTTCTTCAAGCCTAGGGGTCAGTTGTTCTTCCCCTTTCTTGTGACAAGACTGTGCGCCAACGCGGAATGGATGGAGGATGCGCCAGTCAAAGAAGTAAATGGGGTTCTCTCAGGCCAGGGTCTGAGAAGGATATTGAAAGACTCCCCGCATCTGCTTGAGGCGGCCAACACAAGAAGAGGCTAACCAAGTCACAAGAACCATCATGTGACAAGACTGTGCGCCAACGCGGCCTGTCAAGTTGTTCTTCCCCTTTCTTGTGACAAGACTGTGCGCCAACGCGGAATGGATGGAGGATGCGCCAGTCAAAGAAGTAAATGGGGTTCTCTCAGGCCAGGGTCTGAGAAGGATATTGAAAGACTCCCCGCATCTGCTTGAGGCGGCCAACACAAGAAGAGGCCAACCAAGTCACAAGAACCATCATCCCCCCAACCACCTCAACCCAAGAAGAGGAAACTTGTCAAGAAGCATTTTGAAATGAAGGCGTCGAGTTCGCAACATAGTGAAGCCGAAGAAGCTGCTCAGCTCGACGCCAACAAGAAAGCCTTGACCGTTTATACTCCTATCAACCCAACTACCGAGGAGCCCTCTTCACTGTCACCTTCTCCTTCCTCTTCCCTTAAAATCCAAAAAGAGCCACTTCCTGCACCTACCACCCAACGCAGCCTTCCTTCGCCTCTTTGCCTTCCTGTCGCAGACTTAAATAAAGAGTCGAAGGAAGTGTCGCCGCCTTCTTCACTCCACTTGGAGTTAACCCTCTCCCTGCTTCAATCATCTACCTTTCGAGAGGCAACTCCATCCTCACCACCACCACCCGTTGAGCCGACCTTGACTCTGGATCCTGAGGAGTCGGTGCCCTTTTAAGTTTCCCCGCCTCGCCATGAACCTCAACTTTCCAATCTTCCAACTCCACCAACTACTGACACTTCTGCCCGCACAACCGCTCACCCGCAGGCAGAAGCAACTACAAGCACGCATCATCAAGAGGACCCATATCACTTACGGCACGCCTCTGCTCCGCTGCAGTTTGTTGATTTGAGTCTGGATGACTTGCTGCAGTACTTGGACGATGGGATCCTTCACCCAGTCATGGGAGATCTCGATGAAGTTTGGCGCAAGGAGATGAAACTTCTCGAGCGGCAAGAAGAGTTGGCTCAACAAGTCTCACAGTTGGGTCAGCAAGTGGCTTAGATGGTGCAACAACAATTGGAGCATCGAAGTTTTGTTCAACGCCAAGCTCAGCGTCAGGACGAACAATTCGACACTTTGATGAACTACATTTATGAAGTGTTCGTCCAACGCATACCCAACGCCAGTTATTCCTCCTTGCCTTCAGCAACCCCTTCAGTTCAACGATCCAAATCCTCCCCCTGCAGACAACAATGACCACGACTCGGGGAGAACTTAAATTTAGGGGTGTTCTTTGGGATTTTTCCTTCTTTTCTTATGTTTATTCCTTTGATTCTTTTGTAAATAGTACAACGCAGAATACATTTACATCAATACAACATTCAATTCATGAATTGCTCTTTATTCTCTTTAGTTTGTGGTTTCATTTATTTCTTGCTTTATTATTTTCTCTGCCTTGTTTCTGTTCCTGCGTCCAACCTCCATGATGAAATTTATGCACTTTCCCTGTAGACATTGAAACAAAAGGATTCTCACAAGCCTTGCGATATTACACTTCTAAAAAGTTAGTATGCGTGCAAGCTTTGCCTCAAACCTCTTTTACAACATTTTTCTAAGATAATCTCATAACTCGATTTCAGCAATGGGGACCTTGCTGCCTTCCAAATTTGGGGGTAAGAAAGGTTTGAGCCATGCATTCAGAATTCATAAAATTTCCAAATCAAATCAAAAAACAAAAAAATTAAAAAAATAATGCTTCACCTTTTATAGAAATATTGCATTTAGTAAAATTAATTATGAAAATATTGTAGTTTGGAAACTATTTAGGAAGCTTGACGCCTGA

mRNA sequence

ATGAGGAAGGAGGAAGAGTCCCTAAGAAAGGTGGACAATTTTGGAATAGAGGAAACTAGGAAAGGGAGCGGATTCTTTAAGGCGGAAGCAAACTACTGGGACGACGGTAACGCTAAGATCGAAAGCTACGGCTTCTGGCTTCTGGCTTCCGGCCTCCGATCTCCGGTCTCCGGTCTCCGGGTTTTGGGTTCTGGGTTCTGGGTTCGGCAGGAAAAGCTTCTCCCTACAAGTTTGAGTGCAAGAAATCGAGTGAAAGTAAATGAAGAAATGGGGCTCGGGGGTAAAGACAATGAGCATGAAGAAATCTATGCTTATGACTATAAGGATACTGATGTTGTTTGGGATTCAGATGAAATTGAAGCTATTTCATCACTCTGTCAAGGGAGAATTCCTCAGAAACCTGGTACATTGAACAGGGACAGATCTCTTCCTCTCCCACTTCCTCACAAGCTACGACCACCAAGACTTCCTAACCTAAAAATCCGCCTAAGAACAATGGTCTCTTCACGTGCTTTGATGTCTAAGCAAGTTTACAAGCGTCCTGATTTTCTTATTGGCCTTGCCAGGGCGATTAGAGATTTGTCCTTGGAGGAAAATGTGTCCAAGGTTCTCAATTGGTGGGGTCCTTTTTTGCAAAAGGGCTCTCTATCATTGACGATCAAGGAACTAGGTCATATGGGTCTTCCTGATAGAGTTCTAAAGACCTTCTGTTGGGCACAGGAACGACCTCGCCTCTTCCCGGATGATCGTGTTTTGGCCAAAACGGTTGAGGTCCTTGCAAGGAACCATGAACTGAAGGTACCTCTAAACTTGGAAGAGTTCACTGAACTTGCTAGTCGTGGCGTGCTTGAGGCAATGGTGAGAGGGAAGCTTGACGCCTGA

Coding sequence (CDS)

ATGAGGAAGGAGGAAGAGTCCCTAAGAAAGGTGGACAATTTTGGAATAGAGGAAACTAGGAAAGGGAGCGGATTCTTTAAGGCGGAAGCAAACTACTGGGACGACGGTAACGCTAAGATCGAAAGCTACGGCTTCTGGCTTCTGGCTTCCGGCCTCCGATCTCCGGTCTCCGGTCTCCGGGTTTTGGGTTCTGGGTTCTGGGTTCGGCAGGAAAAGCTTCTCCCTACAAGTTTGAGTGCAAGAAATCGAGTGAAAGTAAATGAAGAAATGGGGCTCGGGGGTAAAGACAATGAGCATGAAGAAATCTATGCTTATGACTATAAGGATACTGATGTTGTTTGGGATTCAGATGAAATTGAAGCTATTTCATCACTCTGTCAAGGGAGAATTCCTCAGAAACCTGGTACATTGAACAGGGACAGATCTCTTCCTCTCCCACTTCCTCACAAGCTACGACCACCAAGACTTCCTAACCTAAAAATCCGCCTAAGAACAATGGTCTCTTCACGTGCTTTGATGTCTAAGCAAGTTTACAAGCGTCCTGATTTTCTTATTGGCCTTGCCAGGGCGATTAGAGATTTGTCCTTGGAGGAAAATGTGTCCAAGGTTCTCAATTGGTGGGGTCCTTTTTTGCAAAAGGGCTCTCTATCATTGACGATCAAGGAACTAGGTCATATGGGTCTTCCTGATAGAGTTCTAAAGACCTTCTGTTGGGCACAGGAACGACCTCGCCTCTTCCCGGATGATCGTGTTTTGGCCAAAACGGTTGAGGTCCTTGCAAGGAACCATGAACTGAAGGTACCTCTAAACTTGGAAGAGTTCACTGAACTTGCTAGTCGTGGCGTGCTTGAGGCAATGGTGAGAGGGAAGCTTGACGCCTGA

Protein sequence

MRKEEESLRKVDNFGIEETRKGSGFFKAEANYWDDGNAKIESYGFWLLASGLRSPVSGLRVLGSGFWVRQEKLLPTSLSARNRVKVNEEMGLGGKDNEHEEIYAYDYKDTDVVWDSDEIEAISSLCQGRIPQKPGTLNRDRSLPLPLPHKLRPPRLPNLKIRLRTMVSSRALMSKQVYKRPDFLIGLARAIRDLSLEENVSKVLNWWGPFLQKGSLSLTIKELGHMGLPDRVLKTFCWAQERPRLFPDDRVLAKTVEVLARNHELKVPLNLEEFTELASRGVLEAMVRGKLDA
Homology
BLAST of Cla97C01G011962 vs. NCBI nr
Match: XP_038893977.1 (pentatricopeptide repeat-containing protein At2g01860 [Benincasa hispida] >XP_038893978.1 pentatricopeptide repeat-containing protein At2g01860 [Benincasa hispida])

HSP 1 Score: 346.3 bits (887), Expect = 2.6e-91
Identity = 175/196 (89.29%), Postives = 183/196 (93.37%), Query Frame = 0

Query: 94  GKDNEHEEIYAYDYKDTDVVWDSDEIEAISSLCQGRIPQKPGTLNRDRSLPLPLPHKLRP 153
           G+D+EHEEI+AYDYKDTDVVWDSDEIEAISSL QGRIPQKPG LNRDR LPLPLPHKLRP
Sbjct: 85  GEDDEHEEIHAYDYKDTDVVWDSDEIEAISSLFQGRIPQKPGKLNRDRPLPLPLPHKLRP 144

Query: 154 PRLPNLKIRLRTMVSSRALMSKQVYKRPDFLIGLARAIRDLSLEENVSKVLNWWGPFLQK 213
             LP+ KIR R MVSSRAL+SKQVYKRPDFLIGLARAIRDLS EENVSKVLN WGPFLQK
Sbjct: 145 SGLPDPKIRPRIMVSSRALLSKQVYKRPDFLIGLARAIRDLSPEENVSKVLNRWGPFLQK 204

Query: 214 GSLSLTIKELGHMGLPDRVLKTFCWAQERPRLFPDDRVLAKTVEVLARNHELKVPLNLEE 273
           GSLSLTIKELGHMGLPDR LKTF WAQE+PRLFPDDRVLA TVEVLARNHELKVPL+LEE
Sbjct: 205 GSLSLTIKELGHMGLPDRALKTFSWAQEQPRLFPDDRVLASTVEVLARNHELKVPLDLEE 264

Query: 274 FTELASRGVLEAMVRG 290
           FT+LASRGVLEAMVRG
Sbjct: 265 FTKLASRGVLEAMVRG 280

BLAST of Cla97C01G011962 vs. NCBI nr
Match: KAA0052071.1 (pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK00686.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 336.3 bits (861), Expect = 2.7e-88
Identity = 170/194 (87.63%), Postives = 180/194 (92.78%), Query Frame = 0

Query: 97  NEHEEIYAYDY-KDTDVVWDSDEIEAISSLCQGRIPQKPGTLNRDRSLPLPLPHKLRPPR 156
           +EHEEI+AYDY KDTDVVWDSDEIEAISSL QGRIPQKPG LNR+R LPLPLPHKLRPPR
Sbjct: 82  DEHEEIHAYDYTKDTDVVWDSDEIEAISSLFQGRIPQKPGKLNRERPLPLPLPHKLRPPR 141

Query: 157 LPNLKIRLRTMVSSRALMSKQVYKRPDFLIGLARAIRDLSLEENVSKVLNWWGPFLQKGS 216
           LPN KIR  T VSSRAL+SK+VYKRPDFLIGLARAIRDLS EENVSKVLN WGPFLQKGS
Sbjct: 142 LPNPKIRPTTTVSSRALLSKKVYKRPDFLIGLARAIRDLSPEENVSKVLNRWGPFLQKGS 201

Query: 217 LSLTIKELGHMGLPDRVLKTFCWAQERPRLFPDDRVLAKTVEVLARNHELKVPLNLEEFT 276
           LSLTIKELGHMGLPDR LKTFCW QE+ RLFPDDRVLA TVEVL+RNHELKVP+NLEEFT
Sbjct: 202 LSLTIKELGHMGLPDRALKTFCWVQEQRRLFPDDRVLASTVEVLSRNHELKVPVNLEEFT 261

Query: 277 ELASRGVLEAMVRG 290
           +LASRGVLEAM+RG
Sbjct: 262 KLASRGVLEAMMRG 275

BLAST of Cla97C01G011962 vs. NCBI nr
Match: XP_008462173.1 (PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo] >XP_008462181.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo] >XP_008462189.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo] >XP_016902994.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo] >XP_016902996.1 PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo])

HSP 1 Score: 336.3 bits (861), Expect = 2.7e-88
Identity = 170/194 (87.63%), Postives = 180/194 (92.78%), Query Frame = 0

Query: 97  NEHEEIYAYDY-KDTDVVWDSDEIEAISSLCQGRIPQKPGTLNRDRSLPLPLPHKLRPPR 156
           +EHEEI+AYDY KDTDVVWDSDEIEAISSL QGRIPQKPG LNR+R LPLPLPHKLRPPR
Sbjct: 82  DEHEEIHAYDYTKDTDVVWDSDEIEAISSLFQGRIPQKPGKLNRERPLPLPLPHKLRPPR 141

Query: 157 LPNLKIRLRTMVSSRALMSKQVYKRPDFLIGLARAIRDLSLEENVSKVLNWWGPFLQKGS 216
           LPN KIR  T VSSRAL+SK+VYKRPDFLIGLARAIRDLS EENVSKVLN WGPFLQKGS
Sbjct: 142 LPNPKIRPTTTVSSRALLSKKVYKRPDFLIGLARAIRDLSPEENVSKVLNRWGPFLQKGS 201

Query: 217 LSLTIKELGHMGLPDRVLKTFCWAQERPRLFPDDRVLAKTVEVLARNHELKVPLNLEEFT 276
           LSLTIKELGHMGLPDR LKTFCW QE+ RLFPDDRVLA TVEVL+RNHELKVP+NLEEFT
Sbjct: 202 LSLTIKELGHMGLPDRALKTFCWVQEQRRLFPDDRVLASTVEVLSRNHELKVPVNLEEFT 261

Query: 277 ELASRGVLEAMVRG 290
           +LASRGVLEAM+RG
Sbjct: 262 KLASRGVLEAMMRG 275

BLAST of Cla97C01G011962 vs. NCBI nr
Match: XP_004139567.1 (pentatricopeptide repeat-containing protein At2g01860 [Cucumis sativus] >XP_011654198.1 pentatricopeptide repeat-containing protein At2g01860 [Cucumis sativus] >XP_031739920.1 pentatricopeptide repeat-containing protein At2g01860 [Cucumis sativus] >XP_031739926.1 pentatricopeptide repeat-containing protein At2g01860 [Cucumis sativus] >KGN64877.1 hypothetical protein Csa_022712 [Cucumis sativus])

HSP 1 Score: 331.3 bits (848), Expect = 8.7e-87
Identity = 168/194 (86.60%), Postives = 179/194 (92.27%), Query Frame = 0

Query: 97  NEHEEIYAYDY-KDTDVVWDSDEIEAISSLCQGRIPQKPGTLNRDRSLPLPLPHKLRPPR 156
           +EHEEI+A+DY KDTDVVWDSDEIEAISSL QGRIPQKPG LNR+R LPLPLPHKLRPPR
Sbjct: 82  DEHEEIHAHDYTKDTDVVWDSDEIEAISSLFQGRIPQKPGKLNRERPLPLPLPHKLRPPR 141

Query: 157 LPNLKIRLRTMVSSRALMSKQVYKRPDFLIGLARAIRDLSLEENVSKVLNWWGPFLQKGS 216
           LPN KIR  T+VSSRAL+SKQVYKRPDFLIGLAR IRDLS EENVSKVLN WGPFLQKGS
Sbjct: 142 LPNPKIRPTTVVSSRALLSKQVYKRPDFLIGLAREIRDLSPEENVSKVLNRWGPFLQKGS 201

Query: 217 LSLTIKELGHMGLPDRVLKTFCWAQERPRLFPDDRVLAKTVEVLARNHELKVPLNLEEFT 276
           LSLTIKELGHMGLPDR L TFCWAQE+ RLFPDDRVLA TVEVL+RNHELKV +NLEEFT
Sbjct: 202 LSLTIKELGHMGLPDRALNTFCWAQEQHRLFPDDRVLASTVEVLSRNHELKVAVNLEEFT 261

Query: 277 ELASRGVLEAMVRG 290
           +LASRGVLEAM+RG
Sbjct: 262 KLASRGVLEAMMRG 275

BLAST of Cla97C01G011962 vs. NCBI nr
Match: XP_022951808.1 (pentatricopeptide repeat-containing protein At2g01860 isoform X2 [Cucurbita moschata])

HSP 1 Score: 313.5 bits (802), Expect = 1.9e-81
Identity = 156/190 (82.11%), Postives = 169/190 (88.95%), Query Frame = 0

Query: 100 EEIYAYDYKDTDVVWDSDEIEAISSLCQGRIPQKPGTLNRDRSLPLPLPHKLRPPRLPNL 159
           +  +A D  D+DVVWDS+EIEAI+SL +GRIPQKPG LNR+R LPLPLPHKLRPP LPN 
Sbjct: 98  DNYFANDDNDSDVVWDSEEIEAITSLFRGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNP 157

Query: 160 KIRLRTMVSSRALMSKQVYKRPDFLIGLARAIRDLSLEENVSKVLNWWGPFLQKGSLSLT 219
           KIR RT VSSRALMSKQVYKRPDFLIGLARAIRDL  EENVSKVLN W PFLQKGSLSLT
Sbjct: 158 KIRPRTAVSSRALMSKQVYKRPDFLIGLARAIRDLKPEENVSKVLNRWAPFLQKGSLSLT 217

Query: 220 IKELGHMGLPDRVLKTFCWAQERPRLFPDDRVLAKTVEVLARNHELKVPLNLEEFTELAS 279
           IKELGHMGL DR LKTFCW QE+PRL+PDDRVLA TVEVLARNHELK+P NL+EFT+LAS
Sbjct: 218 IKELGHMGLADRALKTFCWVQEQPRLYPDDRVLASTVEVLARNHELKIPFNLDEFTKLAS 277

Query: 280 RGVLEAMVRG 290
           RGVLEAM+RG
Sbjct: 278 RGVLEAMMRG 287

BLAST of Cla97C01G011962 vs. ExPASy Swiss-Prot
Match: Q5XET4 (Pentatricopeptide repeat-containing protein At2g01860 OS=Arabidopsis thaliana OX=3702 GN=EMB975 PE=2 SV=1)

HSP 1 Score: 188.3 bits (477), Expect = 1.2e-46
Identity = 108/201 (53.73%), Postives = 143/201 (71.14%), Query Frame = 0

Query: 96  DNEHEEIYAYDYKDTD-VVWDSDEIEAISSLCQGRIPQKPGTLNRDRSLPLPLPHKLRPP 155
           D++ E++      D D VVW+ +EIEAISSL Q RIPQKP   +R R LPLP PHKLRP 
Sbjct: 74  DDDDEQVQESVNDDDDAVVWEPEEIEAISSLFQKRIPQKPDKPSRVRPLPLPQPHKLRPL 133

Query: 156 RLPNLKIRLRTMVSSRAL--MSKQVYKRPDFLIGLARAIRDL-SLEENVSKVLNWWGPFL 215
            LP  K   + ++ S AL  +SKQVYK P FLIGLAR I+ L S + +VS VLN W  FL
Sbjct: 134 GLPTPK---KNIIRSPALSSVSKQVYKDPSFLIGLAREIKSLPSSDADVSLVLNKWVSFL 193

Query: 216 QKGSLSLTIKELGHMGLPDRVLKTFCWAQERPRLFPDDRVLAKTVEVLARNHELKVPLNL 275
           +KGSLS TI+ELGHMGLP+R L+T+ WA++   L PD+R+LA T++VLA++HELK+   L
Sbjct: 194 RKGSLSTTIRELGHMGLPERALQTYHWAEKHSHLVPDNRILASTIQVLAKHHELKL---L 253

Query: 276 EEFTELASRGVLEAMVRGKLD 293
           +    LAS+ V+EAM++G ++
Sbjct: 254 KFDNSLASKNVIEAMIKGCIE 268

BLAST of Cla97C01G011962 vs. ExPASy TrEMBL
Match: A0A1S3CGD0 (pentatricopeptide repeat-containing protein At2g01860 OS=Cucumis melo OX=3656 GN=LOC103500594 PE=4 SV=1)

HSP 1 Score: 336.3 bits (861), Expect = 1.3e-88
Identity = 170/194 (87.63%), Postives = 180/194 (92.78%), Query Frame = 0

Query: 97  NEHEEIYAYDY-KDTDVVWDSDEIEAISSLCQGRIPQKPGTLNRDRSLPLPLPHKLRPPR 156
           +EHEEI+AYDY KDTDVVWDSDEIEAISSL QGRIPQKPG LNR+R LPLPLPHKLRPPR
Sbjct: 82  DEHEEIHAYDYTKDTDVVWDSDEIEAISSLFQGRIPQKPGKLNRERPLPLPLPHKLRPPR 141

Query: 157 LPNLKIRLRTMVSSRALMSKQVYKRPDFLIGLARAIRDLSLEENVSKVLNWWGPFLQKGS 216
           LPN KIR  T VSSRAL+SK+VYKRPDFLIGLARAIRDLS EENVSKVLN WGPFLQKGS
Sbjct: 142 LPNPKIRPTTTVSSRALLSKKVYKRPDFLIGLARAIRDLSPEENVSKVLNRWGPFLQKGS 201

Query: 217 LSLTIKELGHMGLPDRVLKTFCWAQERPRLFPDDRVLAKTVEVLARNHELKVPLNLEEFT 276
           LSLTIKELGHMGLPDR LKTFCW QE+ RLFPDDRVLA TVEVL+RNHELKVP+NLEEFT
Sbjct: 202 LSLTIKELGHMGLPDRALKTFCWVQEQRRLFPDDRVLASTVEVLSRNHELKVPVNLEEFT 261

Query: 277 ELASRGVLEAMVRG 290
           +LASRGVLEAM+RG
Sbjct: 262 KLASRGVLEAMMRG 275

BLAST of Cla97C01G011962 vs. ExPASy TrEMBL
Match: A0A5D3BQZ3 (Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1371G00260 PE=4 SV=1)

HSP 1 Score: 336.3 bits (861), Expect = 1.3e-88
Identity = 170/194 (87.63%), Postives = 180/194 (92.78%), Query Frame = 0

Query: 97  NEHEEIYAYDY-KDTDVVWDSDEIEAISSLCQGRIPQKPGTLNRDRSLPLPLPHKLRPPR 156
           +EHEEI+AYDY KDTDVVWDSDEIEAISSL QGRIPQKPG LNR+R LPLPLPHKLRPPR
Sbjct: 82  DEHEEIHAYDYTKDTDVVWDSDEIEAISSLFQGRIPQKPGKLNRERPLPLPLPHKLRPPR 141

Query: 157 LPNLKIRLRTMVSSRALMSKQVYKRPDFLIGLARAIRDLSLEENVSKVLNWWGPFLQKGS 216
           LPN KIR  T VSSRAL+SK+VYKRPDFLIGLARAIRDLS EENVSKVLN WGPFLQKGS
Sbjct: 142 LPNPKIRPTTTVSSRALLSKKVYKRPDFLIGLARAIRDLSPEENVSKVLNRWGPFLQKGS 201

Query: 217 LSLTIKELGHMGLPDRVLKTFCWAQERPRLFPDDRVLAKTVEVLARNHELKVPLNLEEFT 276
           LSLTIKELGHMGLPDR LKTFCW QE+ RLFPDDRVLA TVEVL+RNHELKVP+NLEEFT
Sbjct: 202 LSLTIKELGHMGLPDRALKTFCWVQEQRRLFPDDRVLASTVEVLSRNHELKVPVNLEEFT 261

Query: 277 ELASRGVLEAMVRG 290
           +LASRGVLEAM+RG
Sbjct: 262 KLASRGVLEAMMRG 275

BLAST of Cla97C01G011962 vs. ExPASy TrEMBL
Match: A0A0A0LVM0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G144300 PE=4 SV=1)

HSP 1 Score: 331.3 bits (848), Expect = 4.2e-87
Identity = 168/194 (86.60%), Postives = 179/194 (92.27%), Query Frame = 0

Query: 97  NEHEEIYAYDY-KDTDVVWDSDEIEAISSLCQGRIPQKPGTLNRDRSLPLPLPHKLRPPR 156
           +EHEEI+A+DY KDTDVVWDSDEIEAISSL QGRIPQKPG LNR+R LPLPLPHKLRPPR
Sbjct: 82  DEHEEIHAHDYTKDTDVVWDSDEIEAISSLFQGRIPQKPGKLNRERPLPLPLPHKLRPPR 141

Query: 157 LPNLKIRLRTMVSSRALMSKQVYKRPDFLIGLARAIRDLSLEENVSKVLNWWGPFLQKGS 216
           LPN KIR  T+VSSRAL+SKQVYKRPDFLIGLAR IRDLS EENVSKVLN WGPFLQKGS
Sbjct: 142 LPNPKIRPTTVVSSRALLSKQVYKRPDFLIGLAREIRDLSPEENVSKVLNRWGPFLQKGS 201

Query: 217 LSLTIKELGHMGLPDRVLKTFCWAQERPRLFPDDRVLAKTVEVLARNHELKVPLNLEEFT 276
           LSLTIKELGHMGLPDR L TFCWAQE+ RLFPDDRVLA TVEVL+RNHELKV +NLEEFT
Sbjct: 202 LSLTIKELGHMGLPDRALNTFCWAQEQHRLFPDDRVLASTVEVLSRNHELKVAVNLEEFT 261

Query: 277 ELASRGVLEAMVRG 290
           +LASRGVLEAM+RG
Sbjct: 262 KLASRGVLEAMMRG 275

BLAST of Cla97C01G011962 vs. ExPASy TrEMBL
Match: A0A6J1GIR9 (pentatricopeptide repeat-containing protein At2g01860 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111454537 PE=4 SV=1)

HSP 1 Score: 313.5 bits (802), Expect = 9.1e-82
Identity = 156/190 (82.11%), Postives = 169/190 (88.95%), Query Frame = 0

Query: 100 EEIYAYDYKDTDVVWDSDEIEAISSLCQGRIPQKPGTLNRDRSLPLPLPHKLRPPRLPNL 159
           +  +A D  D+DVVWDS+EIEAI+SL +GRIPQKPG LNR+R LPLPLPHKLRPP LPN 
Sbjct: 98  DNYFANDDNDSDVVWDSEEIEAITSLFRGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNP 157

Query: 160 KIRLRTMVSSRALMSKQVYKRPDFLIGLARAIRDLSLEENVSKVLNWWGPFLQKGSLSLT 219
           KIR RT VSSRALMSKQVYKRPDFLIGLARAIRDL  EENVSKVLN W PFLQKGSLSLT
Sbjct: 158 KIRPRTAVSSRALMSKQVYKRPDFLIGLARAIRDLKPEENVSKVLNRWAPFLQKGSLSLT 217

Query: 220 IKELGHMGLPDRVLKTFCWAQERPRLFPDDRVLAKTVEVLARNHELKVPLNLEEFTELAS 279
           IKELGHMGL DR LKTFCW QE+PRL+PDDRVLA TVEVLARNHELK+P NL+EFT+LAS
Sbjct: 218 IKELGHMGLADRALKTFCWVQEQPRLYPDDRVLASTVEVLARNHELKIPFNLDEFTKLAS 277

Query: 280 RGVLEAMVRG 290
           RGVLEAM+RG
Sbjct: 278 RGVLEAMMRG 287

BLAST of Cla97C01G011962 vs. ExPASy TrEMBL
Match: A0A6J1GIP2 (pentatricopeptide repeat-containing protein At2g01860 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111454537 PE=4 SV=1)

HSP 1 Score: 313.5 bits (802), Expect = 9.1e-82
Identity = 156/190 (82.11%), Postives = 169/190 (88.95%), Query Frame = 0

Query: 100 EEIYAYDYKDTDVVWDSDEIEAISSLCQGRIPQKPGTLNRDRSLPLPLPHKLRPPRLPNL 159
           +  +A D  D+DVVWDS+EIEAI+SL +GRIPQKPG LNR+R LPLPLPHKLRPP LPN 
Sbjct: 98  DNYFANDDNDSDVVWDSEEIEAITSLFRGRIPQKPGKLNRERPLPLPLPHKLRPPGLPNP 157

Query: 160 KIRLRTMVSSRALMSKQVYKRPDFLIGLARAIRDLSLEENVSKVLNWWGPFLQKGSLSLT 219
           KIR RT VSSRALMSKQVYKRPDFLIGLARAIRDL  EENVSKVLN W PFLQKGSLSLT
Sbjct: 158 KIRPRTAVSSRALMSKQVYKRPDFLIGLARAIRDLKPEENVSKVLNRWAPFLQKGSLSLT 217

Query: 220 IKELGHMGLPDRVLKTFCWAQERPRLFPDDRVLAKTVEVLARNHELKVPLNLEEFTELAS 279
           IKELGHMGL DR LKTFCW QE+PRL+PDDRVLA TVEVLARNHELK+P NL+EFT+LAS
Sbjct: 218 IKELGHMGLADRALKTFCWVQEQPRLYPDDRVLASTVEVLARNHELKIPFNLDEFTKLAS 277

Query: 280 RGVLEAMVRG 290
           RGVLEAM+RG
Sbjct: 278 RGVLEAMMRG 287

BLAST of Cla97C01G011962 vs. TAIR 10
Match: AT2G01860.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 188.3 bits (477), Expect = 8.5e-48
Identity = 108/201 (53.73%), Postives = 143/201 (71.14%), Query Frame = 0

Query: 96  DNEHEEIYAYDYKDTD-VVWDSDEIEAISSLCQGRIPQKPGTLNRDRSLPLPLPHKLRPP 155
           D++ E++      D D VVW+ +EIEAISSL Q RIPQKP   +R R LPLP PHKLRP 
Sbjct: 74  DDDDEQVQESVNDDDDAVVWEPEEIEAISSLFQKRIPQKPDKPSRVRPLPLPQPHKLRPL 133

Query: 156 RLPNLKIRLRTMVSSRAL--MSKQVYKRPDFLIGLARAIRDL-SLEENVSKVLNWWGPFL 215
            LP  K   + ++ S AL  +SKQVYK P FLIGLAR I+ L S + +VS VLN W  FL
Sbjct: 134 GLPTPK---KNIIRSPALSSVSKQVYKDPSFLIGLAREIKSLPSSDADVSLVLNKWVSFL 193

Query: 216 QKGSLSLTIKELGHMGLPDRVLKTFCWAQERPRLFPDDRVLAKTVEVLARNHELKVPLNL 275
           +KGSLS TI+ELGHMGLP+R L+T+ WA++   L PD+R+LA T++VLA++HELK+   L
Sbjct: 194 RKGSLSTTIRELGHMGLPERALQTYHWAEKHSHLVPDNRILASTIQVLAKHHELKL---L 253

Query: 276 EEFTELASRGVLEAMVRGKLD 293
           +    LAS+ V+EAM++G ++
Sbjct: 254 KFDNSLASKNVIEAMIKGCIE 268

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038893977.12.6e-9189.29pentatricopeptide repeat-containing protein At2g01860 [Benincasa hispida] >XP_03... [more]
KAA0052071.12.7e-8887.63pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] >TYK00686... [more]
XP_008462173.12.7e-8887.63PREDICTED: pentatricopeptide repeat-containing protein At2g01860 [Cucumis melo] ... [more]
XP_004139567.18.7e-8786.60pentatricopeptide repeat-containing protein At2g01860 [Cucumis sativus] >XP_0116... [more]
XP_022951808.11.9e-8182.11pentatricopeptide repeat-containing protein At2g01860 isoform X2 [Cucurbita mosc... [more]
Match NameE-valueIdentityDescription
Q5XET41.2e-4653.73Pentatricopeptide repeat-containing protein At2g01860 OS=Arabidopsis thaliana OX... [more]
Match NameE-valueIdentityDescription
A0A1S3CGD01.3e-8887.63pentatricopeptide repeat-containing protein At2g01860 OS=Cucumis melo OX=3656 GN... [more]
A0A5D3BQZ31.3e-8887.63Pentatricopeptide repeat-containing protein OS=Cucumis melo var. makuwa OX=11946... [more]
A0A0A0LVM04.2e-8786.60Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G144300 PE=4 SV=1[more]
A0A6J1GIR99.1e-8282.11pentatricopeptide repeat-containing protein At2g01860 isoform X2 OS=Cucurbita mo... [more]
A0A6J1GIP29.1e-8282.11pentatricopeptide repeat-containing protein At2g01860 isoform X1 OS=Cucurbita mo... [more]
Match NameE-valueIdentityDescription
AT2G01860.18.5e-4853.73Tetratricopeptide repeat (TPR)-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR46128MITOCHONDRIAL GROUP I INTRON SPLICING FACTOR CCM1coord: 96..290
NoneNo IPR availablePANTHERPTHR46128:SF179TETRATRICOPEPTIDE REPEAT-LIKE SUPERFAMILY PROTEINcoord: 96..290

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla97C01G011962.1Cla97C01G011962.1mRNA