Tan0001991 (gene) Snake gourd v1

Overview
Name: Tan0001991
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Lipase_3 domain-containing protein
Location: LG11: 59038507 .. 59079515 (+)
RNA-Seq Expression: Tan0001991
Synteny: Tan0001991
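The location field above packs the linkage group, start, end, and strand into one string. As a minimal sketch, the snippet below splits that string and derives the span length. The function name `parse_location` is hypothetical, and the length calculation assumes 1-based, inclusive coordinates (the usual GFF-style convention), which this page does not state.

```python
import re


def parse_location(text):
    """Split a location string such as 'LG11: 59038507 .. 59079515 (+)'.

    Hypothetical helper for illustration only; it is not part of the database.
    """
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumes 1-based, inclusive coordinates (an assumption, see lead-in).
        "length": end - start + 1,
    }


loc = parse_location("LG11: 59038507 .. 59079515 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 41,009 bp on the forward strand of LG11, which can be compared against the length of the gene sequence listed below.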
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGTGGAGGTGGAAAGTTGCTGCTGGAGGTAAGTTACCCGAGCATTTTGTGGCAACTAAATATTACATGATCGTCATGTTTTCTGACTGCTAATAATTTACTATTGGTCTGTAATAAGATAACTGCTTTGTGTTGAGCTTTACGATATTCGTGTCTGTTTTCTAAGTTTGTAACTCAAAGGGTCAATTTTCTTTTCTTTCTATTTCCTTTAGCATTTTAGATCTGGTAGGATAACAAACTTTTGAAGCTTATTGGTTCTTGTTGGAAAATGAAGTCTTGCTTTCTCAAAACTTGTCTGTCGTTATGTGATTGATATTTTTTTTTTGTTAATGGTGTCTCTCAGATCAAATATAGGACTTTTGATGAAATTGAAGATGACAAACGGTGGTGGAGAGTCCCCTTCATTTCTGAATTTCTTCGCAATAATGGTTTTGCATCTGCTCTAAACAAGGTTGTTGGATCTGACACTGTGCCTGTGCGTCAGTTTGTAGAATATGCTTTTGGGAAGTTAAAGTCATTCAATGATGAATACCCATCGAGTGATCATTTATTAAGCAAGAAAAAAGACAAAGAGGAAATACCTTCATACATGCAGACTAACGGTGAAGTCTCTATAACTGATATAAGCTATCCGAAAGAGAGCAATTCAGATGAGGTTGCAACAAGTGATAATACTGTGGAAAGTGGACAATTGCTGAGAGAAGTGACACAAAGTATTTTAATAAAGCAATTCGATAAACAATTTTGGACAAACTTGGCTGATGTAACAAATCAGCATATTGTCAAGAAACTTGGTCTTCCCGCCCCTGAGAAATTAAAGTGGGATGGATTTGAGTTACTAAATAAAATTGGTTTGGAAGCACGAAAGAGTGCTGAAGCTGGTTATATCGAATCGGGGCTTGCAACTCCCAAAAGCTTGGATGTTGATCATGAACAGAAGAACATTAGAATGGTGGACTCAACGCTAACTGATGTGAAGAAAGTAACAAAGGATCTACTAAGTCAAACTGAGTCTGTATTGGGGGCATTGATGGTTCTGACAGCAACAATTTCTCAATTGAACAAGGAAGCACAGCTTATAGGAAAGAAAGATACTAAAGATGAGGTCTCAAAAAAAGTGGGAGAGAAGCTGGGTAGTTCAGGGGATGGTTCCTTGTTGGATAATAGGAATTCTGAGGAAATGAAAGCGCTTTTTGCAACTGCAGAAAGTGCCATGGAAGCTTGGGCAATGCTTGCTACATCACTTGGCCATCCTAGTTTCATAAAGTCAGAATTTGAAAAGTTATGTTTCTTAGATAATGAATCTACAGACACACAGGTAAAGTATTTGGTTTCCAGAATGGCTATAATATTTGTAAAGTTGTGCCTAATTTTAGTAATTCAATGTCATAGCATCAGCAATTTTTTCCCAATTTTATTGTTTTCATTCCTAACTGTAGGAGTCTACAGGTTGCAATTTGGCGTGATTTTATGCGGAAAAGACTGGTTGTTGCCTTCAGGGGCACAGAACAAGTAGGTTATTATGTTGATTTTCTGTTCACTGGAGTATAACTTTCCTTGTAATAAGTTCTTACATTTGGACTTCCTCCAGTCAAGATGGAAGGATCTAAGAACGGACCTGATGCTAGTCCCTGCAGGGTATTTATTTTTCTTTCCCTCAAGTATTATCCTTTATCTGTTGCTTTGCATATTTATCGTTCTCCTTGTTTCTTGTGGAATCATATGGAGAAGAAAAAGAACGTTGTATACTGAATAAACCTCTCGAATATAAGGCCATATATCATGAAAACTGTGGTAGTGCAATTTATCCCATTCAGCTAATTTAAATTAAGGCAGGGGAAAGAGAACTATTGAATTTATGTCCTTTAAATCATTATACTAGCTTTTCCATTTCTGGGCACAACACATATTTGTATGTTATTCTTTTGAGGTTAAATTATCAAAAATACCACTGAACTATGAGGTTGGTTTCAATTATACCCCTAAACTTGGCAAAGT
TTCATTTTAACTTTGAATGTTGAAGTTTGTCTCAAAATAACCCCGGAAGGTTTTCTTTCGTAAAATTCATTGACAGAAAACGTTGTATTTCCATTTGTATTGTCGCTTATGTGAACATATTTGAACATATACGTTGAAAAAAAAATCAAAACATATGCAAATACATCATTTTCATCCATAAATTTAACAAAAAAAAAGCCTTCAAGGGTTATTTTGAAACAAACTTCAAAGTTCAAGGGTTAAAATGAAACTTTTTTTTTTAAGACAACCTCATCTGGATTAGTTGCAAGGCGATATAAACGAGGGAAAGCAACAAAAAAGATGCTCCGTTTTCTCTCCTCCGTTTCTCTCTCTTTCTCTTTCTCGTCCTTTCGTCTTCTCCGGTAGGCTCCCTCCGTTTTCAGGGCCTCCGATCACAGGCCGCCGGCCTCTCCGTTCTCTCCCTTCCGTAATCCCTCCCTTTGTCTTCTCTGTCGGCCTCTCTCTGTTCTCTCCCTTCCGTAACCCCTCCGGTCTCGGGTCCCGTCCCTTTGTCTTCTCCGATTGGTTTCTGTCGTGGTCTGGCCCTTCGGTCTCTGGCCGCCGGCTTCTTCTCGTCGCTTTCTTCTCTGGTTGGTTTCCTTCGCCTTCAGGCCGCCGACTTCTCCTCGCCGGATTCTTCTCGTTTGTCTCCTCCGTTCTCTACAACCTGGTTTTCGTCTCTTCTTTCCTTTTCCTTTCATAGTTGTCGAAGTCGTCCGTTTCCTTTCTTCTTCTTCGTTTTTCACCGCCGGTTTTGTCTAGGTTCTTCGTCTTCTTCCTTTCTTCTTTGTGTGTTGGTTTGTGAAATGGACCACCACTCTTGTTGTATCCAATGTTGTTAAAATCGTACGTTTCACTAATTGAATCGTACGATTCAACTCGAAACGGACCAAAATCGATTCAACTCGTGATTTTGAATCGTAAATGGGACGTATCGTACGATTCGGAAAATAAAATCGTATGTATCGCTTAAAATCGTAACAAACCGATTCAAAAAAAAAAAAATACCGTGATTCGCGGTAGTAGGAGAAGAAGACGTGCTCTTGTAATCTTGTTGCAGCCGCATGAGAAGAAGAAGACTGGAATTCTGGAAATCTAAAGTTGAAGAAGGAAGTTTGAAGGTTGAAGAAGAAGTTTAGAATGAAGGAAGTTTGAAGGTTGAAGAAGAAGTTTTCGAAAAAAGTGTTAAAGAAGAAAGAGAAGAAGAATGAGAAAGTAATGTTGTGTTGTTGGGAATTCCTAAAGGTTAGAAATTAAATAATGGCATTGAAAATATAAATGAATTGATGGCAAGTCATTTCACATTGAAAAAAGGGAGTGATATGATGACCATTAATAAATAAATATTATTTAAAATATATTGTTAAACATATTTTTTCCTCTACACGTTTTTATCAAAAGAAAAATATTCTAAATAGGTTATTTTTTATTTTAAATTATTATTTGTTTATTTTTATCAATTGGCTTTTTGTCTTCATTATAGATTATATAGAATATGAAAAGTTTTTTCTTTTATTATAAATAAAGGACAACCAACTTTCCATTAAAAAAGAAAGAAAGAACAACAACCACTATTGTTGTTTAATTTTCTTGAGTATTTTGAATTTTAAAGTTAGTTTATGTTTTGATCTTATTTTGGAATTATTTAATATTTGGAATTTTATTTCATTCTTTGTTAATTAGATTTATATTTCAATATTCTTTACTAAACTGAAACTTACGATACACGATACGTTACGATACACGATATATTAAAATTGAATTTCGATACTCGTTCCGTTTCACGAGTTAACAACCTTGGCTGTATCGATGGAGAAAGTTTTTATATGGGAAGTTTCAGCGACTCTTTCTTTGTGGAATGTTCTGACCAAGGGGTTCGACTTCCATTGTCAATGGAGAATCTTCGATGGTTATTGATCTCTTTTGATGAGATCGTCGGTTCAGTCAATCGGTTTTTTGGACCTATAAAATAAGTATAG
CTTTGGTATAAGTGGGATCTTCAAGTTTCGCTCCGATTGGAAATGATTTCTTCATTGCATCTTTTGGCCTCCCTCGGGTGGTCGAAAAAGTTCTTCGAATTGCGCTGGGACAGTTGGGAAAAGGTTGGTTAGTGTTTAAAGATATGCTTTCAGATTTCTTTTCGGGTCAGAAGTATTTTTCGTTGGATGGTAAGCTGTTGGTTGCTCCAAAGCCGATCACTCCTTCCGGTGCCTATCAGTTGCAGAGGTTTTCTTCCTGTGGTCCTGCTTATTGGATTCAAAAGTTTCAAGAGCTGGTGAAGGTGGACTTTAGCAAATTACTAGTTCTCTCTCGTTTGTTCGCTCATATCTCTTGGAGTACTGTTAAGGCATCACTTGAGAATCATCTTCAGTTAAAGGTTCAAATCAATCCTTTTATGGCCGATAAGACACTCTTCTCAGCTTTGGGAAATGGCAACTTATTGAAGACCTTCATTTGAAGGTTGAGGGTTGGTCCGATGGTCTTCATAGCCCACCCAAGTATATAGTAGGTTATGGAGGGTGGATTTCGATTCGGAACTTACCTCTTAGATTTTGGGCAGGATCGATTTTTGAAGCCATTGGTCAGAACTTAGGTGGATTAGAAGATATTGATTCAGGCACTCTAAACTTCCTTGATCTCGCATCAAAGTAAAGGAAAATTTATGTGTTTTTTTCCCAGCCACAATTGAAATTAAGGATAAGTTTCTTGGTTCGTTTTCGTTACAATTTGGTGACATTATTTCAAATGAAATTCCAACAACCCTTCATAATTCTTTATTTTTGAAAGATTTTTCCAATCCTGTGGATCTCGATCGAATTTTTGAGTTGTTGGAAGATGAGGATGTCCTTGGTGACTCTTCATCCTTCAAAGCAAAGGATGGATTAGGGTCTGATCACTCTCCCTTGGTGACTTCTAATTCCGATTTGGATCGAGATGGTTTGGGTACTTTTGCTCTCTGGGATGCCTTAGAGGTTAGTGTGGAGCCACTAGCCTCTGTTTCTATTTTGTCAGAGGTTTTGAGGAAAGATATGGGGTTGTGTTCGTTCGAAGTGCCTGCTTTACCTTCAAAGGAAGTTGTTGATAAGGTTGATAATTTGGCTTGTCCTAATTCTTCTAAGTCTCCCATTCCCTATTCGGTAAAGGAGGTCAAAGAAAGATTTTGTGCATCCTATGCAAAATACTATACAAGGAAGAAGGTCTTTGATTCGGTGGTAGCTTCTGTCCCTTTAATACTGGCAAGTTTGATGATTCTCTGTCTAGAATTCTTTTACTGGGAGAGTGCTCAAAGACGGATGGTGGTGGTTCTCTTATTCCTGAAACGCCCATTTTAAGTATTGAAAAGTGCAACTTGGGCCCAAGTAGTGTTCAATTCTTCAAGGTTTCTCTCAAGGTTGATCGACATTAGGTGGTTAGTAGTGATCCTCTTTCTCTGCGTGGTGAAGAAGTGTTTGACATTGTCTCTATTGTCAGTTTGAGTAGCCCGGAAAGTGCTAACTCACTTCATCTTGCTCAAATAGATGGGAAAGCTGATGTCTCCTCCCCGGTGGTAAATTTGGATTTGCTCTTTCAAGATTCCATTATTTCCCATGAAGCTAAAGAGCGCGGTTCGTATATCGAGGTTATGAATGAGAAGCTCCGTCTTCATTTGGTTCCGTTTTCTTCGCTTATTAAAGCGAGTGGTATGCAATTTTGGGAGATTGTTCCTTCCTCCCTTCCTCGTTAACTATTTCTTCCTTGACACAGGAAGGGGTTGAATTTGAATTGAATGGTCTCGAATGCGAATTTCTAGATTCTTTTTGAAGAACTTGGAAGCTCAGGTGGTCGACTTTTCTTTGTTTGGGAGTGCTCATTTGAGAGAAGTTCATGGGAGATTGAAGTTATTTATCTATTTTTGTAAGAGGGTTGGTGAAGCGTGTATTTTTATCACAAGTTGGGTTCACAAGCGTAGGTTCAAGTTGACTTGATTTCAGAGTCTATG
GGTTTTTGATTCCTTATCTCGGAGTCTTCTCAAGAAGTTTGATGTTTGTTGGGTTCGTGGAGTTGGGGTTACATTTCGGTTGATAGCTTCTACTTGGTGTGTCATTTCCAAGGATTTTGCTATTTTTTCTATTTCGGATCTGTCTTTCTTGGCGCCCGTTCTTTTCTTGTGTTTGTTGGATTGTAGAGTGTCTTCTGGTTTATTGTATTGCTTACAAGGGGGTGGGTGTTTGGTTTTGGCTTTCTTTTGGCTTTGGGTTTCTTTTGGCTAATTCTCTTGTTGTGTTGTCTGTTGTCTGTTTTTCTCTTTCTCTTTGTTTTTTGGGCATTAGTCTCATTTCATTTTATCAATAAAATCTCTTTTGTTGCCTTGTCAAAAAAAAATAAAAAATGAAACTTTTCAAAATTCAGAGGTGTTGGTTATTTTAGATTTTTTCCTTCTTTAGCTCAACAGAATATTTGTACTTCATGTAATATATTGCTGGAAATTTATTTCCTTATGTGTACATGGGTATTTCTTCTATTTAAGAAATCCTAATCCCACTTAATGTTATATAGAAAAAGATTATCTCTTCATATTTTTGCACATGATATCAGATCAAATCCTAATTGTTTAGAAGAGAAAAAAAAACCCTAATTTCCCCAGCAAACCAGACCTGCAAGCTTACCTTCGTGAAATGACTTCTACCCAGAAAAGTCGTCGGAGTGAGCCGCCTCCCTTTCCGCGAGCGTCTGATCCAGCGGCCAACCCGCGAAGCGCAGACCTGCGATCCAGACGCCTTTGACCCGGAACCCAGATCCGCCCAACCCGCGAAGCACAACCCACGTGTTGTTGCTGCTAACCGCAGGTCTACCCGACTGTGTTGTCGTCGTCACTGGACTACCCTCGCCGCCGCTGCCGGTTATGCTGGTTGATCGCCGCCGATTGTACTCGCCGCCATTTACGCTGCCATTCGTAGATCGATTTGGTTTGTTGGTGGCTCTATACGGTTTATCTATTCCTCTGTTTTTGGGGTTTCTTTTCTTTGCTTTGTTAGTTGTTATTGTGTGGTGATTTTCAACTGAATGTCGGACCATAAGTTGCTGCCCGTTGTAAAACCATAAGATAGTCAGATCCACTCCAACAATCCTATTGTCCAAATTACTACCATTCGACTTAATGGCGATAACTTCCTTCATAGGTCGCAAAGCGTTCGAATGTACATCCGAGTACAAGGGAAGATTGAATATTTTACTGGAGATAAATCCATGCCCACAGAAGAGGATCCCACGTTCGCTGTATGGGATGCTGAAAACTCCATGGTGATGACCTTGTTAGTAAATTCCATGGTAGAAGATATAAGTTCTAACTACATGTGTTGTTCTACTGCTAAGGAATTATGGGAGAATGTGACTCAGATGTATTCCGACCTGGGAAATCAATCACAAGTGTTTGAACTCAATCTCAAATTGGGTGATATGCGACAGGGAAATAACTCTGTTACACAATACTTTCATTTGTTAAAATGAATTTGGCAGGATCTGGACCTATTTGATACGTATGAATGGAAGTCAACTGATGATCAACAACACTACAGGAAAGTTGTAGAAGGCGGGCAGATTTATAAATTCCTCACAGGGCTTAATGTTGAATTCGATAAAGTTAGAGGGCGAATACTGGGATGATGTGTTTTCTGAATTTTGCAGGGAAGAGAGTCGTAGAAGTGTTATGATTGGCAAGAAAGTTCTTGATTCAGTTGAAAGTTCGGCTCTAACAGTTGAAGCGACAACACATAAGGCATCAAATCAGTCAACAAGGACTAATGACAAGCCTCGGGTCTGGTGTGATTATTGCAACAAATCTCGACATACTCGTGAAACGTGTTGGAAACTTCACAGGAAACCTGCTAATTGGAAGAGTAATAAGCATGACCCCACTTCATCTGGTGACAGAAGTATACATCATCACTCATCTGGTAATCAACCCTCATCTTCTGCTAATGTTGCTGATTCTAATCCATT
TAGCAAGGAGCAACTCGAACAACTCCTGAAACTGCTAAAAGCCAATTCATCTTCTGGTAATCCTAGTGTTTCCTTGGCACAAATAGGTAACTATCCTCAAGCCCTTTCTTCTCTCAATTCATCTCCATGGATTATAGATTCTGGAGCTTCTGACCATATGACTAGTTTTTCCAATCTTTCTGATTCATACTCCCCTATGTATTGTAATGAAAAAAATTAGAATCGCTGATGGTAGTTTCTTTTCTATTGCTTGGAAAGGAATTATTACCTTAACTCCACACATTACTTTACATTCTGTACTTCACGTTCCAAAACTTGCTTGTAACTTATTATCAGTTAGCAAAATTTCTAAAGATGCTAACTGTCGTGTTATATTTTGTGAAACCCACTGTATCTTTCAGGATTAGGAATCGGGAGAGACGATTGGACGTGCTAGGATGCTTGATGATCTCTATTACTTTGATGAATCGCCTACTAGTCATAAAAAAGTTCAGGGCTCAAGTAGTGTTAGTTCTTCTTCTGTTAGAGAAATTATTATGCTTTGACATCGTAGACTAGGACATCCAAGTTTCTTTTATTTAAAATACTTGTTTCCCGACCTTTTTAAAGATCTTGATTGCACTACTTTCCATTGCGAGAATTGTATCTTTGCAAAACATCACCGCTCTACTTATTTACCCAAACCTTATAAGGCTTCCACACCTTTTTATTTAATTCACAGTGATGTTTGGGTCCCTAAAAAATTTTGACACATAGTGGAAAACGGTGGTTTGTTATCTTTATTGATGATCTTACTCGTTTAACTTGGGTTTATCTGTTAACAAAAAAGTCAGAAGTAAAAGAGTTTTTTATTCATTTTTATCATATGATTGAAAATCAATATCCTGCATTCTGATAATAGTATTGAGTACTTCAATACTTGCTTAACTGATTTTTTCCAAGAAAAAGGAAATTGTGCATCAATCTACGTGTATAGACACTCCCAACAAAATGGTATTGCAGAAAGAAAAAATAGACATTTGCTTGATGTTGCTCGTGCTCTTATGTTCTCAATGCATGTGCCATCTTATTTATGGGGGTGAAGCTGTTCTTATTGCTACTTATCTAATTAATAGGATGCCTACAAAAATACTGCATTTTAAAACCCCACTCGATCATTTTAAAACCTTCTTTCCTGCCGTTCGGTTATTCTCTGATTCACCAATAAAAGTCTTTGGTTGCACTGCCTACGTTAATAACTCTAATCTCTCTAGGTCTAAACTTGATCCTAGGGCTGTCAAATGCATCTTCTTAGGTTATGTCTCGCACAAGAAGGCCTATAAATGTTTTGGTTCAGTGACTAAAAAATACTTTAAAAGTATGGATGTGTCCTTTTTGGAAAGTTAGTCTTTTTTCACCAAAACTTCTCTTCAGGGGGAGATCTCTAACTTAGAAAATAATTTTTGGGATACTTCTCCTCTTCCTAATATCATTCATCTTGAGGTTACTACTTCTAGTTTCATGCCAAGCACAAAGAATACTCCTTCAGGGGGAGAAACACTACAAAGTGATCGAAATCTTGAACTTCACGTTTATACTAGAAGGACAGTGCATCAAAGGAACCAAGACTAGATAGTTGACTTAGCACGAGAACAATCCAATGCTCCGATGAATGATTTTGAAGATCCAGGTACCATACCTTCGATTTCCAATTCTTCACCTATCCATAACTGTTTATCTGATATATCTGATCTTGACATTCCAATAGCCCATAGAAAAGGTACTCGAAACTGTACAAAATATCCTATTGCCAATTACTTGTCTTACCATAGGCTCTCTGTCTCTAGGATAATAAATCTGGTTGTTCCAAGGAATATACAGGAAGCACTAAAAGACTTTAATTGGAAGTTAGCAGTGATGGAAGAGATGAATGCTCTTGAACAAAATCAGACATGGGTCATAGTAGAATTACTAAAGGATAAGAAAAGAGTCGAATGCAAATGGGTGTTCAATATAAAGTG
CAAAACTGACGGGAGTGTTGAAAGATATAAAACCATACTTGTTGCTAAAGGCTTCACTCAGACTTATGGAATCGATTATCAAGAAACTTTTGCTCCTGTGGCTAAAATTAACTCCATTAGAGTCCTGTTATCTGTTGCTGTTAATGCAGACTGGCCCCTTTATCAACTTGATGTCAAAAATGCTTTTCTCAATGGTGAACTTGAAGAATAAGTATTTATGAATTTGCTCCATGGTTTTGAGGATGATTTTGGAAGTAACAAAGTATGCAGATTGAAGAAATCCTTATATGGTCTAAAACAATCCCCCAGAGCTTGGTTTGAAAGGTTTGCCAAAGTAGTCATGAGCTATGGTTTCTTACAGAGTCAAGCATACCATACTATCTTCTATAAACATACTGAAAATGACAAGATTGTCATTCTAATAGTTTATGTTGATAACATTATTTTTACCGGAGATGACGAGGTAGACCGAACTACCTTAAAAGAAAAACTTACCAATGAATTGCAAATCAAAGATCTGGGAGCCCTGAAATATTTCCTAGGAATGGAATTTGCTAGGTCGAAGGATGATATTCTTGACAATCAAAGAAAGTATATTATTGACTTACTTGAAGAGACAAGATTACTAAGTTGCAGGACAGCAGAAACTCCAGTTGAGCCTAATTGAAAATTGCAAGTTGCAACAGAAAAAAAAGTAAAAGATAAAGAAAAATATTAGAGACTTGTGGGAGACTTATTTACCTGTCACACACACGTCCTGACATTGCATTTGTAGTTAGTATGGTAAGTCAGTTTATGTATGCCCTAGGACCCACTCATTTAGAAGTTGTCTATAGAATTTTTAGATACCTAAAAGGTACCCTTGGGAAAGGAATACTGTTTAAGAAACATGGTCATCTGTGTTTTCTAGATTTGAGGTTGAAGAAGCTTTTAAAAAACTGCAAAATCAAGCCCCCCCCGTTTTCTCTCTTCTCTCTCCTCCCAACCTGCCCTAATTTGCTTCCTCCGATTTCGATTCCTCGCCTCCGATTTCCTCCGGTGACCGACTTCCCTCGCCTCCGATCTTCTCCGGTGTCCAGTTCCTTCTCTAGCGCTCGACCCTTAGGTCTACATTCCAGTTTGAAATTATCTCTTTCGAAGTCCCTTCAGTTCCGTCAGGTTATGAATCGCCACATCTCTTCGTTGTCGTCGTTTTCAGAGGTTGTCTGCCCCGGTACTTCCTTTTCGCTTATATCCTTGAAGAAACTTCATCCGGTCACTTGACGTTTGGACTTCCCCCGAGTTCGTTCTCAGTTTTTGAATGCGTTGTGATAGTAACTGACTGAGGAAAAGGTCCATTCTTTTTTTCCCTCCCTTCCCGTCGCCTCCCTCTTTTTCTTCTTCTCGTTGCCGCTGACGACATCGAACAGAATCCCAACCCCATCCTTCGACTTCGTTTGCTCACGCCGACAACCGCCGAGCCCTCTCCCGCCGGTTTCCTTCTGCCCTCTGCTCGCGCGCGTCAGGGAGCGCCGAGAGCTATCCCTGGAGCTTGAGTTTCGAAAAGGGTTGTTTGGGAAAGCTCGAGTTCGAGTTCGATAAGCTGTTGGGTTCGATATTTCATTCAATTGCATTGTGTTTGTTTTTGCGGGAAGTCTGTTTTGGCTAAAGTCATTTTGTCTCCCAAGCACTTCAATGAGTTTTCATTCTTGTAGCATCGGCAACAATTTGTTTAGGATTTGGAGGGAGGTTTCGTTCATCTTTCTTCAAGATGGCAGTAGTGGTTCGGTGGTCTCGTTGTCCTTGGCTCATGTACGATGGTTTTTGGAGTCCTTCTCGGCAATTGTTTCAAAGTCTTCATCTTACTTTGCTTGAAGGAAATTCAGAGATGATGCAGCAACTTTGGGATTATTCAAGGTGCGTTCGCACGCAGATTGGGATGCGGTTGGTGTTGTGTGGCCCCCTTCGGGTGGTAAGAAATCTTTTCGCGTCCCGCTGGGGAGATTTTTGAATGGTTGGAA
AGTCTCTAAGGATATGCTTGGTGACTTCATCACTTCTTTAGAGGGTTCAGGGGCGTCAAAAACTTGTGGTGTGGATATTTCTTATCCGGAGTTGGTTGGGGATGGAATTAGTAGGTCTAAAGGGGTGCGACTTCCTGTTGAAGGTCGTGGCAGGGATGATTTGGGGGTTGCGAGGGGTTGCTCGCTGCAGGGTAGTGTCTCAGCTCCAGTATTAAGGCAATGGGTCCGAAAGTCTGAAGAGGTGGTTGAGGTGGACTTCTGACAGCTCCTTGTGGTGACCCGTCTGTTTGCCCATATCCCTTGGTCGACTGTCAAGAAAGGGTTGGAGGCTCATTTTCGAGTGTTGTTTCAAATCAATCCGTACATGGCTGACAAAGCCATTCTTTTGGTTGATGGTATTGCGAGTGACTTTTCTGTTTCTGGTATTGGGAAATGGCAAGTGATTAGTAATCTCCATTTGAAAATCGAGAGATGGTCGGATCGCCTACATGGTTTACCTGGATGTAATTGGGGGTATGGTGGTTGGCTAACAAAGAAGGAATTTGCCTTTGAAGCTCTACGGAATAGGGAGGTTTTTGAAGAAATTGATCCAAACTTGGGGTTTGGTTGATATTTCTTTGGAGACCATGAATCTTCTTGATCTTTCAGGGGCTAAAATCAAGGTGGCTCAGAATTTGTGTGGCTTCCTACCGGCTTCAATAATTGTGGTTGATCCATTTCTGGGTGAGTTTTCGTTGAAGTTTGGTGATGTGGAGTGTGTGGATCCTTCTCCTTTGTTATCGGGTTCCTTGGTATTAAAGGATGTTTCAAATTCACTGGATCTGGCTCGTATCTCTGTGGTTATGGTTGATGAGATTGGGTTTAATCTGTACTCTTCCCCTGTTGTTGATTTCTTTTCCTCACAGGTTTGTTTGAATCCAGTGGTTGGGGCACCCTCTGGGATTTGGTGGGAGGTTCAGCGAAGGCCGTCATGGGGAGGGATGATCCGTTGGTTGAGTCAGGGGAGGTTAATTTGGTTCGTGGCTTGGATGGTGATGCAACCGTTTTGTCAGATGGGGCTTTAGTAGAGGGATGCGGGACTGGGGCTTGCTTGGCTGGTGGTGCGGTAAGTTCAGAGGCTCTCTTCTCCCTCGCTGGCTACTCCTGTTAAGGGAGTGAATGAGGCATTCTGTGGAAAGTTTTATTCAAGGAAATGAAATGTTAGTGCGATGAAGCTTATGGAGCAGGAGAGCGCTTTTCCTGATTTCCTGGAGCGTCCGATGAGTAGCTTGCAACCACTGATTTCTCCTCTGCCTACTGCTAGGGCTTTTGCCAAAGAGGATTCAAGCATTCCTTCCTTTACCATTGAGGAATGTCAGGTTGGTCCTAAGGGAGTCTCTTTCATTAAACCTTCTTTTCAGCATTTTATGCCAAATCCTAATGATGTCGGTGGATTCTCCCCTTTGGCTGATCTTGACTCGGTGGTGGATTCGGTGGTTAGTTTGAGCAGTCCAGAGTTAGGATCCAATTGCTTCTCCTTATCTAAGGACTTGATTGTGGATCCACCTATTATCAATCTGGCTAGTTTGTTTGATTCCCCAAGTAGTCCAAAGCTGGTGGAAAAGGTTTGTTTGACACCTCTTGTAGATCCAGTCGTGCCTCCGGAACTAAGTAGATTTGAATCGTTAATCAAAGCTAGTGGTTTGCAGTTTAGAGGAATTGACCCTACTTTTAAAGCTGCTTTGACTCATTGAATTCTCTCTCGTAAAGGGGGTTGGTTTGGGGGTGTTTCTAAGTTGGATGGACTAAAGATTAAGTTTGTTCGTGGCGTTGCTTCAGGAATCAGATTTGGTTTTATTTTGATTTTAAGTATCAAGTCTTTTTGGAGCTTGTGGATTTATGTGGGTTTTTGTCCTCAGTAGTTTGATCGATCCGCAGTTGGGCAGTGCTTGGTTGTTTTGGTTTGTGCTTTTGTTGGTTGGTTGTTGAGGTAGTTCTTTGGTCGGTTTTCTTTTTTAGCA
GTTAGTCTTGGTAGGATTCGGTTGGGTTTGGGTTACTTTTGGTTTTGTTTGGTCGTGTTGTGGTTAAGTGTGGCTTCTTTGGTTCCTTCTTGTTTCTTTCGCTTCGGCTGTCATTTGTCGTTGTTGTTTTGTTGGATTTTTTAATGTTCTCTTGGGTCTTTGCTTTGTTTTGGTCTGATTTTTCTTTTTCGTTGTAAGATTTGTACTTGGAGCGTTAGTCTCCTTTCATTATTTCAATGAAACCTTTGTTGCCTTGTAAAAAAAAAAGAAACATGGTCATCTGCATGTTGAAGTATATACCGATGCTGATTGGGCAGGCAGTACAATAGATAGAAGATCTACCTTTGGTTACTGTTCCTTTGTAGGGGAAAATCTTGTTACCTGGCGAAGTAAGAAACAAAGTGTAGTAGCTAGAAGTAATGCTGAAGTTGAATTTAAGGCCTTAGCCCATGGCATTTGCGAAGGTATATGGATAAAGAGATTGTTGGAAGAATTGAAGTTTTCTCAAGGTGCACCTATACTTATCTATTGCGACAACAAGGCAACGATTTCCATAGCTCACAATCCAGTTCTTCACGATAGGACAAAATATATTAAGGTTGATAAACATTTTATAAACGAGAAAATCGATTCAGGAGTAATCTGCATTCCCTATCTTTCTACTACAAAGCAGATTGCAGATGTATTGGCAGAAGGGCTTCCTAAGTTACAGTTTGACAAGATGATCAACAAGCTAGCCATGGAAGATATTTTCAAGCCAGTTTGAGGGGGAGTGTTGGTTATTTTAGATTTGTTCCTTATTTGGCTCAATAGAATATTTGTACTTAATGTAATATATTCCTGGAAATTTATTTCCTTATATGTACATGGGTATTTATTCTATTTAAGAAACCCTAATCCCACTTAATGATAGAAAAAGATTATCTCTTTATATTTTTGCACAAGGGGTATAATCGAAACCAACTCCATAATTCAGGGGTATTTTTGATAATTTAACCTTCTTTTGAAAAAGAACAAGACACGAAACTTTTTATTGAATAAATGAAAAGAGACTAATGCTCAAGGATACAAACTCCACGAGGAAGTGAAAGGACCAAGGACCAAACAAAAACAAAAGGAATACAGAAGCAACCCAATAGAGAACAAGAAACTTAAAGGAAATTCCCACAAAACAAGAAAATGAGCCCACTGAAGCTAAGTATAAAGATAATATATTTCTCTACTTTCATTAGGTATAAAAGGGTTATATTTATACCAGAGACTCTCAGTCTAATAAGCTGAAAATATGCAATCTATCTAAACCCATATGGACCAGAAAAATACAACTATATACAAGAAAACAATCCCAGAGATAAAACCCTAATTACAGCTTGATTTTCTACCCACCCAAAAACTTGCATCAAAAGAGCCTTATAAAAAAACACATGCAGAAGCTGCCCACTGAAAACGCCAACAACAACTTGAAGGAGAAGCTCAGAAGACAAAAGGCCAAAAAACCAAGCAACAGAGAAAACTAGCTCCAACAGAGAAAACCAAGAACAAACTTTGAAAAACTCTGAAGGATAGAGAGGACAACGAACAAGAACCAAAACCCCAACAAGACCATGAGAAAAACAACAACAAGAGCCTGGGAGAAGAAGGCGATTAAAATGAAAGCCTTGGAGAAGAAGCTGCAAAACTATGCCCGACCACTATTTCCATTAGAATTCATGGGAGAAAATTCCCGAAACATAAGGCCACAAGCTTGAAAGAGATGAAATTTTCTAGGTATGTGGATGGAGGATGAGGAAACTCATCGTCACATTCTTGAAATAAAGAACCTAGATCTTCAAGATCTTAAGTCTTCAACCACAATATCATCAAGGTTGTTTTCGGTAGGCGGCAAGGAAAATTCCACACTGCTCACGCTAACAATTAACATCTGACTTATCATCACAACATGAGGGAGAATGAGAAGAAGTAGGGTCTTTTGTAGAAAGAATGACTCTGGTGAGGACAAA
ATCAGAATCTGTTTGGGGAAGCTTGTTGGCTGCCATGGCTAGAGAGTAAACACCGAGGATGAGCATTTGTGGAAAAATTATGAGTGACAACTTGCTTAACAGAGGAAACCAAAAGACTTGAAATATTTTCTTCAACAATATCTGGATTCTCAGAATTGAACTCAGGGGAGGGAGGAGATGAGGGAAGCGACAAATTTCTTCGAGCATGGTAAGAAGGATAAGCTGTAATTAAACAATTCTTCACTTCTTTAACACTTAAAAGGCTTTGAAGACTCTAAAAGATGAAGGGATTCACCAAAGGGCATCTCGGTAGCCTCAATAATGTTTATATATTATTTTACCATTGGATATCAGGTGGGGTCATTTGACCCTTCATCCTTTGAATTTTGACAGAAAGAGTTTCCTTCTTTGTTTAAACTTTAGAAGAGTATTTCTCCATTCCATTGTTGGTATAGTTTGAACACAACTTGCTTATAATATTTTCCGCAAGCAAGGGAAGTCTTTAAAACTTATGCAAGTAAAATGCCGTTTGTAATTCTTATCAGCTTCAGTTTCTCAAACTTTTAACATGTCATCTATTTTCTAATTTTATTCTTAGGTTAAATCCTGAAAGGATAAGTGGAGACTTCAACGAGGAAGTTCAAGTAAGAAATATTGTTATTTTATTACAGGATACGTATTGTTCTGAACTCCATGCACGACCTATCTTCGTTAAATATTTAATGTTTAATGATGAAGATGTGATTATGCCAGGTTCACAGTGGGTTCTTAAGTGCGTATGATTCAGTACGAATGAGGATCATTTCCCTCATTAAAATGGCCATTAATTATAAGTAAGTCTGCTGCACCTCAATTTTGAATAAGATAAAATCTTCATTGGGTCTTAGGATATCTACACTACAATCTTTCAATGAAGAACGGTTGTGTATATTTTCGTGGAGAAAGGTTGCCTTTAAATCTAAGCCCTAGACAACTTGGCTCCAATTTTGTAAACTTCCACTCCTATTCATGTTGCTGTAGTGTAATATGTGGTATAATTAGGAATTGTTTTGTAATTTTCATTTGAGTTTGTTTTATTGTTTGTCAGTTTAATTAGTGGGCTTAAATCTTCGATGTTAGAGAATGAAATCCCTTGTGTTGGCAATCTTTGAATAAAAATATATCTAATTACAATTTAACTCTAAAGTCATAAGTCATACCCCCAAAAAAAAAAATGCACACACTCTCTCTTCTCTTCCCTCCCAAACCATGGTCGCAATCCATGAGAAATCACCAATCTTAATCTCCGACACACTTTTTTGTTTTGTGGAAAGAAAGTAGCATTTGTAGCCTCGTTGATTGGGTGAGTATCCCTAGATTATGCATTGATGTACTTGAAGATCTAATTTGCTACGTTGTTGGGAAGGGATATGATCGAAGGAGGTCTTGAGTTCGAACTCCTGTGAAGTTGTTCTCTCCCCTAATTAATATTGATTTCCACTTGTTTGGATTTTCTTCAAATTTCCAAGCCCACAAGTGAGTGAGAGTGTTAGACTATGTTGATATAATTAAATTCTAAATATTATGCTAGGAGAAAGGGTAGGGAAATTCAAAAGAAGAAGGGTCGGCAGCCATTTTTTATCAGCAATGATTTTTTGGAAGATTCTGAGGATATGTCATTAAAGGAGGATAGGCAGACAGCCGGGTCGTTTGGCCTCCAAGAGCAGGGTATCAGTAAGGGTAGGCATTCGTAGGTGATAAGCTGTGGGGCCAGGCCGTGTTGCTTCTCGAAAATTACTTTCAGAAGTCCTACAAAGGAGGCTGATCTTTCCCCCGAGTGCTCTGAAGTCGAGAAACTCTTTGATCAGGACTCAGAAGTTAGTCTAAGCAGCCCTGAGGGGGTGGGAAAGTCAGCACCGAGCAGTGTTCGTTGCAGCACTCAGATAGGGTCCCCTCTAGCAAGTGTCAGGCACCTCTTCTCTCCCTTGAAGGTTGTCACCTCATGTTTGGAGGAAGGGGAG
GCGAGGGCTGACACTGCCTTAACTGAATTTGACTCATTGATCAGGGCCTGCGGGCTGCAGTTCCGGGAGGTTGGCTGTATAGGCTCCACATCCTCGTCTAAATGAGGATCCTTTCGTGGAACACCAGGGGCTTGGGGGATCCCTCCAAGCGTTTAAGTTTGAAAAACTTCGTTCTCAAGGTCAATCCGGACATCGTTATGATACAAGAAACCAAGTTGGAAATGGTGGATGGTAAGCTCGCTAAGTCTTTATGGAGCTCCAAAGATGTTAGTTGGGAGTTGGTAGAGGCAGATGGAAGGTCGGGAGGTTTGCTGATTCTAAAACAACGCCAAGATCAAAGTTTTAGAAGTGCTAAAGGGGATCTTCTCCATTTCTAATAAGGTATTATTCGCCAATTGTCATTTGTGTTGGATTTCGAATTTTTATGGGCCAAACAGGTGTAGGGACAGGAGACGGATGTGGGAAGAGTGGAGTGCATTGGTGCCTCTCGGGAAGGACCTTGGTGTGTGGAGGGGACTTCAATGTCATCAGTGGGCTCATGAGAGGAGTCCTGAAGGGAGGCACACTAAAAGCATGAGGTCATTCAATAAGATCATCCGGGATTTGGATTTATGGGAAGTTCTCCTTAGCAATGGAAGTTATACCTGGTCTCGTTTGGGATTGGAGAACTCTAGATCTCTGTTGGATAGGTTCTTCATTAATGCAAGTTGGGACAGTGTTTTTGCGGACTCCAGGGTTTCTCGGCAGCCAAGGATTTTTCAGACCATTTTCCTCTCCTTCTGGAAGCGGGGTCTTTTCTATGGGGGCCTGCTCCCTTTCGCTTTGTTAATGCCTGGCTGGGAAGCAAGGAGTGTGTATCTTTAGTTAGCAAAGCATTGGAGGAGGATCAGTCCTATGGGTGGGCAGGTTTTACACTATCTTTAAAACTAAAAAAGAAGCTGAAAAAGGAAAACTTAAGAAGTGGGCGTAAGATCGAGATCAAAGTGCAAAAGGAAAAGAAAAGAATCGATTGACGAAATATCAATGTTGGATAAGAAGGTGGAGTCCTCTTCTCTCACCTCTCAAGAGTCATTAATCCGGCCATGGCAAAAGGGGAATTGGCTGAACTATATATGGAGAATGAGAGAAAGTTGATTAAAAATGTAAGCTAGTATGGCTCAAAGAGGAGATGAGAACTCGGGCTTCTTCCATAAATATCTAGCGACAAAAAAGCGGGCGTCCGCCCTGATCTCGGGCATCTTGTCTTGAAGGAACCCCACAGTTAAGTTTGCGGACATTGAATCGGAAATTGTAGGTTTCTTTGGTAGGCTTTATGAGCGCATCCTCGGGCTGCGCTTTGTGCCGATATTCGAGGGTTGGAAGGGGGTCTCATGGGAGGAAAATGGGGCACTCACGACCCGCCGTTGGAAGAAATTAGGATGGCGCTAAAGCGGTTGGGAAGAAATAAGGCCCCCAGGCCGATGGATTCACGATAGAATTCTACCTTAAGTTTTGGGATTTCTTTGAAATTGGACTGCTGCGTTTCTTCAAGGAGTTTCACAGCAACGACCATATCAATTCGGGGCAAAAAGAAAACTTTATATGTCTAGTTCAAAAGAAGGGTGGTGCAAGCTCAATAAAGGATTACAGGCCCATTAGTCTGGTCTCTTCTACGTACAAAATCCTCTCCAAGGTGCTAGCCGACCGCTTAAAAAAGGTAATGCCCTCAATCACCTCGGAGGTCCAAAGCGCTTTCATACAAGGTAGGCAAATTCTAGACTCCGTCCTAGTAGCTAGTGAGGTTATTGAAGATTTTAGAGTTCATAAAAGAAAGGGATGGGTTCTGAAGCTGGATTTTGAGAAAGCCTTCGACTGTGTAGACTGGGGTTTCTTGGAGGAGGTCCTCAAGCAAAAAGAATTTAGTGCCAAGTGGATTCAGTGGATGAGGGAGTGTATGGTTGGCCAAAAGTTCTCGGTAATGATAAATGGAAGGCCTAGAGGCAGGATTCTGGCTACTCGAG
GGTTGCGACAGGGGGATCCTTTGTCTCCCTTTCTGTTCGTTTTGTTGGTTGATGTTTTGAATGCCTTGGTGGGAAAAGGGATTGCTCAGGGGCTGTTTGAAGGCTTCAAGGTAGGAAGGGAGAAGGTCGAAGTGTCCATCTTGCAGTTTGCGGATGATACAATCTTGTTTTGCAAAGAAGAAGAAGGGGCGCTGGATAACCTAATCAGTGTGGTAAGATTATTTGAAGTTTGCTCGGGGCTGAAGGTGAATTGGTCAAAATCAGCCGTTTGTGGGGTTAATTTGGAAGAGAGCACGGTTGAGGAGGCTGCAAAGGTAGGGTGCAGGGGTGTCCAAATCTTAGTTTCTCTCATACACAGGGCTTCATTGGAGGTCACCCAAACTCCTCGAGTTACGCGGCCAAGTGGTAAATAGTTTTGAGAAAAGGATGGATAGATGGAAGAGGTTTAGCTTGTCACGAGGGGGTCGTTCGACACTCTGCAATGCGGTTTTGGAATCCCTCCCCCTGTATTTTTTGTCCCTGTTTCAGCTTCCAGCGAGAGTTTGTTTGGTTTTGGAAAGGCTCATCAGGACTTTTTTCTGGGAAGGGTCCGAGGGGGTAAGCTCAAACACCTGGTAAGATGGAGGGAGGTCGTCAAGCCCCGTGCCTCGGGAGGGTTAGGCATCGGGGCTTTGAAGGGCAAAAATCAGGCCCTGCTAGCCAAGTGGGGTGGAGGTTCTCGAGAGGAACCATTTTCTTTACGCGAAGGGTGATAGTCGGCATCCACTACTGTCGGACAACGGATGGGACCACGGCTATTCCGTAGTGCGGCTTGAGAAGCCCACAGGTCAACATTGTAAGGGTATGGAGGCAATTGCACCATTTGGCCCTGTTCAAGTTGGGAAGCGGCAGACTAACTAGGTTCTGGTGGGAACCATGGCTGGGAGGCACACCCCTCAAAGATAAATTCCCAGGCCTTTTTGTGATAGCAAAGGATAAGTTGGCTTCGGTGGCTGAAGTTTGGGACAGTCAGCTGGACGTCTGGGCTTTATCTTTTAGAAGACCATTGAAGGAAGAAGAAATAGAAAGTCTTGCAGAGCTTTTGGGGGTCCTCGGCAGTGTCAAGCCTTCCCAGTGCGGGACACTAGAAGTTGGGTTCTTGAGCCTTGGGGAGCTTTTCAGTGAAGTCCCTTTCAAAGCACTTGTGCCCCCACCCCCCATTGGACAAGTCGACGTCATGATTGCTTGTGGAAGGTGAAGTGTCCAAAGAGGGTTAAGGTGTTGACATGGATTCTGTTGTTTGGAAGTTTAAATACAGTCGATTTTCTTCAGAGAAGGTTGCCGCACATGTGTCTTCATCCTTGATGCCGCCACCTATGTTTGAAGAAGCGAAAATGTGTTTCCACTTGTTTGTGAGTTGCTCCTTTCTTTGGAAATTGTTGGTGAAGCTTCTCCGGAGTTCGGCCTAGGATGGGTCTTAGCAGGAAACCTCAAGGAGAGTGTGTTTCAGTTGTTGTCAGGCCTTCCGCTTTCGCTAAGGGCCTCAATTTTGTGGGGAAACGCAGTAAGGGCGCTGTTGTCAGATATTTGGTTTGAACGAAATCAGAGGGGTTTTCATGATGTTAGGAAGTCTTGGGAAATCAGCTTTGAAGCTGCTCACCTAAAGGCATCGGCTTGGAGTTCTCTTTCGTGCGCGCTATGGGATTCTCAATCTCGACATTCAAAGCCAATTGGAATGCTTTTATTTTTCCTGCTTAGGTTCAGTTAGCTATTTGGTTTTCATTTTTGTCCTTTGGTTGTGTAGGGTTGATTGAACTTTTGTATCTCTTCATTATCTTAATGAAATTGTTTTGTTACCTTGTCAAAAAAAAATTTACCCCAAATCATCAGATTAATATTTTGGGTTGATTGGTGATTTAAGACTAATTAATAATTAAGTATAAATATTTATTAAGAATATATATTTTTTTTTTTGTTTTTATCCTACAATTTAAATATACTTTTAACAAAAATATTTTTAT
TTTTATTCAATAATTTTATTTTTAAATATAAATTATATAAAAATCTTTTAGTTATAATACTATTATTTAAATTGAAATTCTTAATGTAAATTAGAATACTACATAACAATTTTTTTAATTAAAAAAATCTCCCTATCTCACTTCCACCATATGTCATTAACATTTTTTTTTTTTAATTTTTGAAATTACTTATGTTAATGCTAAAGTATGAGAGAATCAGTATGTTCTATTCCATTCACTCAAGATATATATATATACAAGTATACAGGGTTAACCTAAATAGAAGAGCGTAAAAAGACAATAAAGGACACCCTTAGAGTTATATTAACAAAATACAATTATACTAATATAAGGACTCTAACACTCCCCTCAAGTTGGAGCATATATATCCATCATGCACAACTTGTTAGAGAGGTAATCAATTCGTGCTCCGTTCAATGCTTTGGTGAAGATATCTCCTAATTGCTCTCCGGTCTTCGCATATCCCGTCGATATCAAACCTTGTTGTATCTTCTCACGCACAAAATGACAATCAACTTCAATGTGTTTAGTTCTCTCATGAAATCTTTGGATTAGATGCTATATGTAGAAGTCGCTTGATTATCACACCACAATTTGGTCGGTGTTGTGATATCAAATCCCAATTCAATAAGAAGTTGACGTATCCAAACTAATTCACGACAGTTGTGCCATTGCTCTATATTCGACTCAAAGACTTGATCGTGAAACCACATTTTGTTTCTTACTCTTCCAAGAAACCAAATTACCACCAACGAATACACAATATCCTCGACGTTGACCTTCTATCTTCCTTAGATCCCGCCCAATCAAGCATCCGAGAAACATTCAATGTTCATATGCCCATAGCTCTTATATAATAAACCACGCCCGTGAGCTTTCAAATAACACAAAATCTGCTTCCACTGCAAACCCCAATGATCAATTGTAGGAGAAGACATGTGTACCGGCTCACAATACTCACCGAATAAGCTATGTGCGGTCGAGTTCTTGTAAGATAATTGAGTTTTCCCACCAACCTCCTATATCGTTCGGGATCTTTCAATAATTCTCCCTCTTTTTGTGAGTTGTACATTAGGCATCATCGGGTACTACATGGCTTAGCCCCTAACTTCCCTGTTTAGTCAACAAGTCAATAACATACTTTCTCCGTGATAGCATGATTCCCTTCTTGCTTCTCATTACCTCAATTCCTAGAAAGTATTTCAACATTCCCAAATCTTTCGTGTGGAATTGACTATGGAGAAAGGTCTTGAGAGATTGGATGCCCAATATCGTCACATTTCCTCTCCTTCGTGTCTAGTCTCTTGTTTTGGTTCCCAAAGTTTTTGGGTGTAGTGCCTTTGTTCACATCCATCCCCAACATCGTAGCAAATTGAACCCTCGAGCCTATAAATGCATTTTCTTGGGGTACTCACCAAATCAAAGGGGTTACAGATGCTACTCTCCTCTAACAAAACGAGTAGAGAGAGAGGACAACTCGACTAACAAGCCCACGGAATACGAAGTAGATTCCGGCGAAGCACGTGCCAGCACATGGTGGTGAAAACAGAAGGTTAAGGTCGTGAGCCTCATGTGTTGCTTAGATCTAGCTCGTATGTGTGGGAATGACTGACAACAACTGGGCTTTCTTGCAGTATGAGCAGTGACAAAAGAAAATAAAGGACAAGAAATAACCAACTCGTCGTTGTCACATATGGAAGTAGGTGCAAACCCCACAGTAAATTCACAGAACCCAAGATAAAAAATCAAGATGGTCCTAAACTCTAAACTCTTAGGGGCCGTTTGTGATGCAGAAAACTCTTCAAGAACTGTAAACTTTATTCAATGAATTCAATGAAGGGGGCCAATTGGCCATATTTATACAAATGAACCAACTAACCCTAGTAACTAACTAACCCAAGTCATAACCAACTACTAACTAACCATAACCAAATAATAGGTTAACCAGTAGCTTTACAGGGTTAAAAGGTTAAAAATAA
CCCACAGTAACTATCAATAAAACATAGTAACTAGGGTAAACATAAGCCTACATCAAATCCACCCCACCAAAACCATACTCGTCCTCGAGTTTAATGAGTTAATTGGAATTCGTCAGGTGCAAAATAAGGGAAGATATCTGAAACATTGAAGGTATTACTGATGCGCATCGAGGATGAAAGTTATACACGGTAGGCATTGGGGCCAATCTTGTCCAAAATAGGGAATAGACCTATCTTCTTGTTGGTTAATTTAGAATAAATCCCAGAAGGTAAACGAGACTTCTTTAAATGTATCATGACAAGGTCGCCTACTTTGAAATCAACTGCACGTTTCTGCTCATCTGCCCGAGCCTTATAGGAAGCATTAGCCTCCTCGAGATGACGACGAACTTCTTGGTCTAGTTCTTTAACTCGATCAGCCATAAGTTCTGCTTCAACACTAATATCAATGGAAGAAGGTAAATTAGCAAGATCAAAAGTTAAACGAGGAATTCGGGTATACACAATCTCAAATGGGGACTTCCCCGTGGATCGGTTGACCATGTGATTGAAAGCAAATTACTGCTGGCGAGAGAAAGATCCCATTGTCGTGGTTTATCACCACTTAAGCAACGTAGGAGATTGCCAAGAGTTCTATTAGTGACCTCTGTTTGCCCATCCGTTTGTGGGTGACTGGTGGTGCTAAAAAGAAGGTCTGTTCCAAATCATTTCCAAAGAGATCGCCAAAAGTACCTCATAAATTTGACATCCCTATCCGAAACAATAGTCTTAGGAATCCCATGGAGTCTAACAATCTCACGAAAAAACAAGTTAGCAATATTCAAAGCATCATTATTTTTTTTGCAAGCAATAAAATGAGCCATTTTGCTAAAACGATCAACCACTACTAAAATAGAGTCAAAACCACGTTGAGTTCTAGGTAAACCTAGTACAAAATCCATTGAAAGATCTTCCCATATGGAACATGGGATGGGAAGTGGGGAATAAAGCTCTTGATTTTGGCTTGATCCCTTTGAAGTTTGACAGATAAAACACCGTTTAACAAAATTAGCAACATCCTTTTTAATTTGTGGCCAAAAAAACTTGGCAGAAAGGGAGGTAAAAGTTTTATCACGACCAAAATGACCGGCCAATCCCCAGAATGAACTTCTTTTAAAAGAGATTCAGTAAAGAGGTTTGAGGAAGACAAAGAGCATCACCCTTAAATAAAAACCATCGACAATATGAAAATCTTTGCATGGTACGTGATTGATGCATTTATACCAAATTTCTTTAAAATCGGGGTCGGAATCATAAAGTTCAGGTAGATGTGAGAAAGCTACAATTTGTCCACGAAGTAAAGTAAGAAGAGTACCTTTACGGCTTAAAGCATCAACCACTTTGTTTAGATTTCCAGGAGTATGTTTGATGACAAAGTCAAAACGTTGGAGAAATTGAACCCATCTAGCATGCAAGCGACTAACAGTCTTTTGGTTATGTAAGAACTTTAAAGAAAAATGGTCAGTAAGTAAAATAAATTCTTTATCAAGCAAATAATGTTCCCATTGTTTGAGAGCACGCACTAAAGAGTAAAGTTCTTGTTCATAAGTACTCCATTTTTGTCGTGCCTCACTCAATTTTTCACTAAAAAACTCAATTGGGTGATTACCTTGGCATAGGACAGCACCAATTCCCACACCTGAAGCATCCACACTTACCTCAAAAGGTTTGGAAAAGTCCACAGGAGTGGAGCTTAAAAGTTGTTTAAGTTGTAAAAAACTTTCTGTTTGAGAATTTCCCCAATTAAACTTCCCTTTCTTTAAACATTCAGTGAGTGGTGCAGCAACAGTGCTAAAATGTTTTATAAACTTTCGATAAAAGGAAGCAAGACCAAGAAAACTTTGTATTTCCCGAACTGTTTTAGGTTTTGGCCAATCAACTATTGCTTGAACTTTTCTAGGGTCAACAGCTACACCTTTTTCCCTAATTGAAAATCCTAAAAAGTAAATTTCATTGCTCAA
AAAGTTGCATGTTTTTATATTGATACAAAGTTCATTTTCGTTTAAAGTTGTAAACACCGACGTTAAGTGAGAAATATGTTCTTCATGAGACTTTCTATAAATCAATATATCATCAAAATAAACAACTACAAATTTATTAAGGTAGGGTAAAAGATTTGGTTCATGAGTCTCATAAAAAATCTTTGGGTGCATTGGATAGTCCAAAAGGCATTACTAACCATTCGAAAAGGCCTTCATTGGTTTTAAATGTTGTTTTCCATTCATCTCCAGGTTTTATTCGGATTTGGTGATAACCACTTTTAAGATCTATTTTAGAGAAAAATGAAGCACCTCCTAATTGATCTAATAGGTCGTTAAGTCTAGGGATTGGAAATCTATATTTTATTGTGATTTTGTTAATAGCCCTACTATCAATACAAAGTCGCCATGACCCATCTTTTTTAGGTGCAAGGAGGGCAGGAACAACACATGGACTTAAGCTAGGTTGGATATGACCCTTATCTAAAAGTTCTTGAATTTGTTCATGGAGGATTTGGTATTCTTTGGGGCTCATTCGGTAGTGGGGAAGGTGAGGGAGTGTGCTGCCTGGAATTAGATCAATATGGTGTTGAATATTTCTTAATGGAGGTAAAGAAGTTGGTGGTTCAATAATGGTAGGAAATTGGTTAAGTAGTTGAGAAATCTGATTATTAATGGTCAGTGTGTTATTGTCAAAACTATCTCCTTTGACTATGAGAGCTAACAGAGTACTAGGTTTTGCATTTAAGAAATCTTTGCCTTGTGAAAGTAAGAAAAGTTGTTTCTTTGAGTTAGAGTCCATACATACCTTTTTCTTACTCTCAGTTTTGCCCAAAGGAAGAAGGACAATTCGTTTCCCCATCCAGAAAAATTCATGTGTATTGTCGCGACCCTTGTGTATGACTTGGTTATTGTATTGCCACGGCCTACCCAGTAGGATGTGACATGCGTCCAGGTCCAGTACATCGCAAACTATTTGGTCTTTGTAATGGCGACCAATAGAAAGAGGAATGGTACAAATTTCAGTAACATTAGTTTCCCCACCTTTTTTGATCCAACTTACCTTGTATGGACTAGGATGAGCTTCTGGTTTAAGGTTGAGGGATGTAATCAACTTTTTGGAGACAATATTCTCGCTACTTCCACTTTCAATTATGACTTGGCAAACTTTGCCATTGATTGTACATCGGATACGAAATAGAGAATGTCTTTGAGTGGAAGAATCTGTTTTAGGTGTGAGAAGTACCCTTTGAAGGATACAAGATAGTTGGTCTCCATCATCCGCAGCAATATAATTGGCATCGTCAGAATCTTCTTCGGATTCTTCCTCTAAGTTATTATCATCCAATTCTCGAATTTGTAAAGTGCGTCTTTGAGGGCATTCATTAGAGAGATGTCCTTGTTGTCCGCACCTGAAACATTTACCAAGAGAAGGGCGATTGTAAGGGTTTGAATTTCGTTTAGAAGTAGAATTATCCTAACCTTTGATACTAGTAACATTCTTGGTTTGATCATCATCTTTCTTGGTCGGTTGGTTGGATGTTGAAGAATTTCCTTGAATGATGTTTCTTGTAGAATCATTTGCACTTTTTCTTGGATTATATTGTTGTTTATCCCAAGTACTCCACCTTTGGTAGGTTCTAACCGGTTTGATCTCCTCATTTTCTTCAATCTTTGAAGCCAATGTAATGGCATCCGCGAGGTAAGCAAGTGGGTGTAAATTCACCATTTCTTTAATGTCGTCTCGAAGGCCATTCACAAATCTAGAAATTTGTTGTTGTTCTGTTTCAGGGAGATTATTTCTTGCCCCTAATCGATGAAATTCTTCCGTGTAATCAATTATAGAACGACTTCCTTGTTTACATTGTTGGTATTGGTTATAAAGAATTTGTTGATAATTGAGTGGCAGGAAGCGTTTCTTCATTAATCGTAGCATCTTTGGCCACTTCTTGATAATTTGTTTACCATAGCATCTCCTGTTG
TTTTGAAGTTGATCCCACCATGCTGTAGCACCACTTTGAAGCTTTAGAGCCACTAACTTGACTTTTTTGTCTTCTGGAGTGTTTGCATAATCAAAGAAGTTCTCAACATTTTTAACCCAATCTAGGAAACTTTCCACGTTCATCTTACCATAAAAACAAGGAAGATCAATTTTCATTTTGAATTCCGGATTTTCTTGAAAGTATTGTTGTCTTCTTGGCATTCTTTCTTCCCGTTCTTGATTTCTAAAAGGAAGGATTTCATCTTCACCCGAGGATTCAGAATCTTGCATCATAAAACCGGGCATTTGTGCTTCAGGAAATGTTGTTTGTTGCTGGTTTACATGTCTTCTTGTTGGAGCTTGTGGTAATTCTTGATTATTGACATTTACCAGATTTAATCGTTGGATAATTTGACCCATCATCTGACGTAATTAACCTACTTCAGACTTGATCTCATCTATTGTTGATTCAACCTTTGCAAGTTGTTGATCTTGAGAAGAAGAAGAGGAAGCTTCCGCTGCCATACTGTTGATCCAAAAAAGAATTCGCACTTTGGATCGGGCTCTCTGATACCAATTGATGCAGAAAACTCTTCAAGAACTGTAAACTTTATTCAATGAATTCAATGAAGGGGGCCAATTGGCCATATTTATACAAATGAACCAACTAACCCTAGTAACTAACTAACCCAAGTCATAACCAACTACTAACTAACCATAACCAAATAATAGGTTAACCAGTAGCTTTACAGGGTTAAAAGGTTAAAAATAACCCACAGTAACTATCAATAAAACATAGTAACTAGGGTAAACATAAGCCTACATCAGTTTGGTTTAAGGTTTTATTGGATGGGAATGGGTAATAGATGCCTGGGAATAAGATGTCTGGGAATAATAAATGCCTGGGAATAAGATGACTGTGTTTGGTTTTACATGGGAATGTTATCAGTTTTTTATGTAAAAAATGATATTTTATATTTATTTGACGAAATTAATATATAAATCTTATGTAATATATACTTACGATCTAACTCTCTTTGCACCATCCCCCAAAACCCTCGCGTTCCACGATAACACTTTCATTACGAAATAATCCGGTCTCCACCGACCATAATTTCTCTAAATTGTAATCCGCTTGCCTTGATTAAATCACTAAATTTAGACAATTCCACTGGACAATCAGCCACAACCTTCTCCAAACTCTCCTTTTTACCCCCCAAGGGAGAATTATGCATTGCCCCTTCAAAAAACTCCGCTAAATTAACCACCGGAGAAAGAAACTCCGAAACTCTATTCCCAACACTTCTCACCGATCCAATATCTGGGCTACTTAGACTAACCTCAGAATCAGAATCGAATTCCAAATTTTGATCAAACCCCAGAGAAGAAGATGCACCATTAATTGTAGATCCTAGACGAGGCTGAAAAGAGGTTTTAAAAAACGATACCCCATTTGGGCCAATATGGCACTTTTCGATTGCAAACCCTGTGTAATTAGGAGAGTTAAGATCCCCTACCTTGAGAGAGGAAGTCAAAGGATTATACTGAAGAGACTCATCTGCTTGTTCTAAAAAATCAGGGAAATCCCCCTCTTCTGACCGAGCGTTTAAAATTCTAGGCGTTGCACCTCCCTTTCTCCTTGTATAATATTTCCCACAAAATGTGTCCTTTACCTCTTTGACACTAGAGGAGACACCAGAGGAAAAAAGAGGCTCTTCCATACCCCACTGATTAATTCCCAATTCATTATCACCATTAATTAAAGACTCAGATTTTCCCCCATCCATACCAACTGGAAAAGTGTCCCTCAGAACGCCATCATTAGAATCTAATGAAGCATCAATATTTTCCAGCAACTCATCACTCCTAAACTTCCCCTTATCGCCCCCATTAATGATCTCGCACCCTTGCCCCTTCGACTTCCTCCCACTATTCATAAAAGAAATCTCCATTGTTTGCCTTAAAGGTGCTTCTTTAAGAGTCTCAATGCATTCATTCTCA
ACACATCCACTCAAAGACCCCGTTCCCAAATCACGCCCCCTTATTTCCTCATGATGCAACCTGATCAGATTCTCTGATCCATCCTCCTTGGCCAGTATATTTGTAGTAGATTCAATGATTCCCAACGAAGAGTCATCATTTGGCAGAAGATGATCAAAACCTTCCTGATTTTCAACAACCTCCACAGTCAAAGACATCTCTCCACTTACAACATTATCAAGAAAACCCAATGGACCTGAAGGGGGTTGAAAGGGAATATTAACCATTTCGTCTTCCCATACTTGTATAACCCTTGCCATATCAATCGGATTAGTAACATCACTTAAGACTAAAGCAGTCTCCAAGCATCTAGGAGGGGTCACCTCTTCCAAACCTTCGAAACTCAACCAGAACTTTCCACAAGAAGGATCTTCAAGAGGAATGGCTGCCGGCCAAAAGCCACAAAGATTTGCAGCTACCTTGATTTTTGCCACTGAAATATCAGTAAAGTTCAAAGTATCCATAGAAATGTCTTCAAGACCCCCAAATTGCCATCCAATCCCCTCGAAAACCTTTTTCTTCCAAAATCTTAGAGGTAAGTTCTTAACTCCTATCCAACCACCATAGCCTTTAACCACCGTGGGTGAGCTATGAAAATTTCCTCCCATTTTTCCAGTTTCAGGTGAACATTCCCAACAATCTGCCATTTCCCATTTTTCAGCCTCATCTCCTCCTCTTCTGCATTCTCAAAACCTGTCTCGAATAGGATAATAGCTTTGTCATCCATAAATAGATTTATCTGCAAACTCCTTTATAATAGTCTGTGGATTTATATAATGTGTTTAGGATGTAGAGTTATTTAGTCTGTGTTATAATAATATGTGTTTGGGCGCTGAGTTATTTTACTCCAAAGATTATTATAACCCAGATTACTCTAACTTACTTAGTGTCCCAAACATGTCCTAAACATTCACAAAATCCTAGGTTAAAAAGATAAACCCTTAATTAGCCTAAAGCTCGAATACCATGTAAACGCTGAAGGTTTGAGAGAATTTGAACTATTAGAGTTCTTATATTAGTGAAACTGTATTTTGTGAATATATTAATTAGTTGTCTTTTATTGTGTTTTTTCACTGTCTTATTTAGGTTAGCCTTGTATACTTGAATATATATATATATATATATCTCTAGAGTGAATGGAATGACACATTGATTCTCTCAAACCTTTAGCATTAATATGGTAGCAAAGAATTACGTCAATTAGGATTTTTTCTTTGGTGAGCCTAGGGTTTTGAGAACTTTAGGTTTTTGAGTTTAGAACTCATTTGCTTCTTTCTGGGTTTTGTGGCTTTCTTAACCCAGTTGGTCAAATTCCGCTGTTGCCATCAGTGACCGCTGCTATTAGTGACTACCGCCGTCGCCACCGATTTGACCAATTTGTTTTCCTTTTTTTTTTGTCTCACCGCCACCAGAATTCGAAAGCCCATCCTCTGATTGGCTTATCCCCAAAATTTGGTCTCGATTGGAGTTGCGAGTGAAGCCCACAGGCCGCCCCTATTTCTGCTGCCGTTTTCACGTGCCGACGCGTGAATCACCGACGTCGACTCTTTTTCTTGGGGTTTGATCGGAATTGCATGCTCTTTCTAGTGGTGTACTTGTTTTAGTCATATATTGCTTGGTTTCGTGTCGAGAAATAATCCTTTGAAAATCTGGGGTTTTATGGTACAATGCTGTTTTGGGATTGAAATTTGAATTTTTTGGGTTGTTTTTATAGTTATGGCTGAAAAAAAGGCATTGAAAAAAAAAGCATAGTGATGTTGGAGATGATTCCTATGGTGTCAAAAATTACAGAACATAACCTTAATGAGTCTATTTTTTATGTATGGCGGATGAACATTTGTCACTATCTTTGGAGTCTTCATATGAGTTATCATGTCACCGACAAATTTCCACCGACTGATGATACTAGGTTTATTCTGCAGATAAAGAACTCTAAAGACACTCTTTCAACCTGATCAAGG
TGAGAAATCTTTAACTAGTTATTTTATGGAGTGAAAAACATTGTAGGTAGAATTTAATGCTTTGTTGCCCGTTGGTACTGATGTGAAAGTTCAGACTTTCCAACGTGAACAACTCTTCATTATGAGTTTTTTTGGTTGGCCTTGACTCTAAATATGATATGGCCAAAGATCGAGTTTTGTCTAGCTCAGAAATTTCTTCGTTGGAAGATGCTTATACTCAAATACTTCGTATTGGGAAGTCACAAACAATCTTGGCATCTAAGTCTAATAGTGCCTTGGTTGGATAGACAAGGTGAATATAAAGGAAATAGAGAGGTAAGATCCAACACGTTCAAAGAAAATTCGAACACTCAGATAGCAAACTCGGGAGATATTGTTTGCCACTATTTCCATAAGTCTGGTCATATGAAACGCGATTGCAAAGGGTTGTTGAATAAGGGACAAAGGTTTCCATCTGCACATGTTGCCTCTACTCCTAATAATCATAACAAGTCTATTTCAATTTCTGCAGAGGAGTTTTCTTAAATCTGCTAGTATCAAGAGTCCCTAAAGTTGTCATCTTCTACTCCTATTACTGCCACTACTGACTCAGGTTATACATCTACTTGTCTTCTTTTTCCTCTTCGAAATGGGTTAGAGACTCTGGCACTTCTGACCATATGACAATAATCCCAGTTTATTCTTTCATATTTTTCCGTCTAAATCTTCGCCTGATATTACTATAGTTGATGGGAGTACATCTCCTGTCTTACAATACAACATTGTCATTCTTACAAACTCAATTTCATTGTCTTAAACTCAATTTCATTGTCTTCTTTTTTAACTTACCACAATTCTCCTTTAACTTGATTTCAGTTAGTAAACTCACTCGCGATCTAAATTGTTGTGTCTTATTGTTTCCTAGTTATTGTTTGTTTCAGGAACTTTTGACGAAGAAGACTATTGGTAAAGAGCATGACTCTGGAGGTCTCTACATTTTTGAACCACAAATACACACCGCCGCTTCTGGCTATAAAGTGTCCTCTCCCTTTAAAGGACATTGTCGTTTGGATCATCCATCTTTTTCCATTTGGAAGAGTCTTCGTCCTCAATCTAATATCGCTAATATCTTGTCTTTTTTAGATTGGGAGTCATGTCAGTTTGTTACAACCTTGGATAAAACATAGTCACACGTTCAAGATCTTGTGTCAATTTACTTGTCGAAAGTAAATTACAATCCAATCCTGAGGCATGTAGTACAAAGTTAAGATATAAATTTTGAGTAACTTGAACGAGACTTGTCCCAACCACTTTGGAGTATGATACATCTACAATTTTCACAGACGAGTTAGAGGTGTTAACTTGATAGTTTTTGTAGATAATATCAAGAGTAGAATATATTATTCTTATATTCACAATGAGAGTAAAGGGTTTCTTATATAGTAGAAACCTACATTTACATACACAGAAATAAATATTCCAGCACCTAAAATCTAGTAAATACAAGGAAATCTTTATTAACACGTAAGGAAATTACCGATATTTCCTTATTAACAAATAAGGAAAAATAACCAACAATAACTAACACCCCCCTCAAGCTGGTTTGAATATATCTTCCATTGCCAGCTTGGAGATCATCTTGTCAAATTGCAATTTGGGAAGACCCTTTGTTAACACATCAGCTACCTGTTCTGTTGTAGGGAGATAAGGAGTACATATTACTCCTGCATCAATCTTCTCTTTTATGAAGTGTTTATCAACTTGAATATGCTTTGTCCTATCATGGAGGACTGGATTGTAGGCAATGGAAATCGCTGCCTTGTTGTCACAGTAAATATGCATTGGTCTATTCTGGATGATTTTCAATTCCTCCAATAATCGCTTTATCCATATGCCCTCGCAAATTCTATGAGCTAAGGCTCTAAATTCAACTTCAACACTGCTTCTAGCCACTACACTCTGTTCTTTACTTCTCCAGGTGACCAGATTGCCTCCAACAAAGGAACAATACCCAGAAGTCG
ATCTTCTATCTGTTATCATGTTTCTTAAATAGTATACCTTTTCCAGGAGTACGTTTCAGATATCTTAGGATTCTGTAAACAGCTTCAAAATGAGATTGTCCTGGGGCATGCATAAACTGACTTACCACGCTTCAGCAAAAGCAATGTTAGGAAGTGTGTGAGCCAGATAAATGAGACTTGATATTTTTCTTTGTCTTTTACATCTCCTCCTGTTGCAATCTGCAATTTCAAATTTGGCTTAATAGGAGTTTCTGATGTTCTGCACTCAAGTAAGCCTGTCTCTTTGAGTAAATCAACGATATACTACCTTTGATTCACAAGAATACCTTCTTTTGATCTAGCAAATTCCATTCCTAGAAAGTACTTCAAAGTACCCAAGTCTTTAATTTGGAACTCACTTGCTAGTTGTTTCTTCAGGGTAACCATTCCTGCTTCATCATTTTCAGTGAAAATAATATCATCAACATACATTATTAGAACAATAACTTTGTTATTTTCAGTATGTCTATAGAAAATAGTATGGTCAACTTGACTTTGACAGAATCCATAGTTCATAACTATTTTTCCAAATCGTTCAAACCATGCCCTCGGAGATTGTTTAAGTCCATATAGAGCTTTCTTTAATCTGCACACTTTGTCAATTCCAAGTTCTTTTTCAAAACCAGATGGAGGATCCATAAATACCTCTTCCTCAAGGTCCCCATTAAGAAATGCATTCTTGACATCAAGTTGATAAAGAGGCCAATCTGCATTGACAACAATAGACAATAATACTCTAATAGAGTTAATTTTAACAACAGGGGCAAATGTTTTCTGATAGTCAATCCCATAGGTGTGAGTGAAACCTTTAGCAACGAGTCTAGCTTTGTACCTCTCAATGCTCCCATCGACTTTACACTTTATAGTAAACACCCATTTACATCCTACAATCTTCTTACCTTTTGGTAATTCTACCATGTTCCATGTGCCATTTTGTTTAAGGGCATTTATCTCTTGCATAACTGCTAATCTCCAATTCGAATCATTCAGGGCCTCCTGTATATTCCTTGGAACAAATAGGTCTGTTACTCTAGAGGTGAAAGCTCTAAAATTGTCACTGTAGTGCTTTTGGTCATCAGTAGACTTCCATTCATACGTATCAAATAAATCTAGATCCTGCCAAATCCGTTTTAACACATGAAAGTATTGTGTAACAGAGTTATTTCCCTGTCGCATATCACCTAGTTTAAGATTGAGTTCAAACACCTGTGATTCATTCCCTAAGTCGGAATACATCTGAGTCACGTTCTCGCACAATTACTTAGCAGTGGAATAACACATATAATTAGAACTTATATCTTCTACCATGGAATTTACTAACCAGGTCATCACCATGGAGTTTTCAGCATCCCATATAGTGAACAGGGGGTCGTCCTCAGCAGGCATGGGTTTGTCTCCAGTAAGATAACCAATCTTCCCTTGTCCTCAGATGTACATTCGAACACTTTGCGACCAACGAAGAAAGTTATCGCCATTAAGTCGAATAGAAGTAATCTGGACAATAGAGCTATTGGCATGGATCCGGCTATCTGAAGCTTTTACAACGGGCAAATTTTTATTTTCTAACATTGTTTAGGATCAAAACTTAATGAATATAAACACTGAGATAATAGTTAGAAGATGCCAAAAAGACAATATATAAGGAGCTCAACAATTTTAGGCCTGATATTGCACACAGTCGAAAGAAGAAAAAAAAAAGGCAATAACAATCGATTCAGTGAACCAGTGGATGGCGATGGCGAGGAAGAAACCGACAAGGAGATTTGACACCAACGAAGGACCCAGTGGACGGCAACGCAACGAACAGTCTCGTGGGTCGCGATTCGTGGGTTTGCTGATTCGGGTCAGGAGTTCGCGGGTTCTCTGATGTCGGCAACGCAACGAAATCGTGGGTCTTCTCGATTGAAAACTTTCGGGAAGAACCTTCAATTTTAAGTTGCTGAATATGTCGGCAAGGTTGT
ATCACAGATGACACTTAAGAGTGGTCGGCTATGGTTACAAAACAACGACTAGGGTTTTTGGTGTTAGAAAAACTTAGGGTTTCTTACAGAATTAGGTTTTCGCTCTGATACTATGTAGATAATATCAAGAGTAGAATATATTATTCTGTATATTCACTATGAGAGTAAAGGGTTTCTTATATAGTAGAAACCTACATTTACGTACACGGAAATAAATATTCCAGCACCTATAATCTAGTAAATACTAGGAAATCTTTATTAACATATAAGGAAATTACCGATATTTCCTTATTAACAAATAAGGAAAAATAACCAACAATAACTAACAATTTTGAAATAGAGATAGATTCCCTGTCGTATGATCTGAAGCTCCAGTGTCTATAATCCAAGGTTGTAGCTCTTCTTTACTAGCTGTCAGGGCAGTAGGTGTAACTTTCTGTGCAACTAGTCCTGCTCCCTGTATTGTGTGACCAATGATTTTTTGCAGTGCATTTATTTGTTCTTTGCTAAATAGAGTTATCTCTTTAGTAATAGAGTTATCAGGAGCCACATTAGCCACGTAGGCCCTACTATCATGATCAACAACCTTTCCATGGATCTTCCAACATGTGTCTTTTATATGACCTACCCTTTTGCAATGTTCGCACTAAGGTCTCTCGTTTCTGTTTATTGTCACTTGGTACTGTTGGGTTAAAACTGCAGGTAACAAGTGCAGAGGCCTCCAAAGCCGTAGGGACTGATTCTGATTTGCCCATCATAAAGTTTTTCCTACTTTCTTCTCGCCTTACTTCAGAGAATGTTGCACGAAGATTAATCAATGGCTTGATTTCCATGATTTGACCTCGAACTTCATCAAGATCATCGTTAAGACTAAGGAGAAATTTGAATGTTCTCTTTTGCCCATCATAAAATTTCGACCTCGAACTTCATCAAGAACGTTGTTAAGACTAGGGATCCCGATAATAAGGGAGGAGGATTCACACACTCCACATCACCAAAGTTCAAAGAAAACTCTCCAAGAAGCGGATCAACCACGTTAATTGACGCCAGGAGGAACCCACATAAGTTCTGAGCCACCTTAATTCTAGCCCCTGAAAGATTGAGAAAGTTCAAGGTATCTAAAGAAATATCCTCCAAACCCCCCAAATGAGAGCCAATTAATTCGAAGGTCTCCCTATTCCAAAACTTTAAAGGCAAATTCTTAATCTCTTACCAACCTCCATACCCCCATCACTTCTGGCAAACTATGAAGACTATCCGACCATCTCTCAACTTTCAAATGAAGATTCCTAATCACTCTCCATTTGCCAATGTTAGAAAAAGAGTCCCCACCCATACTGGTACTACTATAAACCAGGAGAAGGGCTTTATCAACCATGAACGGATTGATTTGAAACTCCATCCGAAAATGAGCCTCTAACCCTTTCTTGACAATCGACCAAAGGATGTGCCCAAACAAACGAGTCACCATCAGGATTCTTGAAAAGTCAATATCCACTACCTCACCATGCTTTCGTACCCAATGCCTTTCTACCAGATTGGAAAAAGACCCCTGTAACGTCCTATCCTTTCCATCAAACAAAGCTTCCCCACCGCTTTCATGAACCGGAAGACGCTTCCCCACCGCTTTTATGAACCGGAAGACGCTTCCTCTTAGAACTAGCGAAACTTGCCACTACACAACCAGGATTAGGACAACAAGACTTTGAAGCCCCAACATCCTCCAAAGCATCAGTGAAGTCGCCAAGCATATCCTTAAAGACTTTCCAACCATTTAAAAACCTCCCCAAAGATACACGAAAAGACTGCATGCCGCCCGAGGGGGGCCATACACTTCCGACCGCATCCCAACCTTCATAAGAACACACCTTGAATAACCTCAAAGTTGCGGCAACATCCCTGAATTTTCTTCGAACAAAGTAAGAAGAAGAATTGGCAGCAATCGCCGCGAAGGAATCCAGAAACCAACGAACATGAACGAGGGACAATGAAACCACCGA
ATCGCTTAAACCATCCTGAAGAAGGATAACCGATTCATCACACCAAATCCAGAAGAATCTATTCCCCACACAACAACTAAAGTGCCTCATTTAAATAGCCAACTTCAACCTCTTAAAACAGAGAAGCAACAAATCAAGCCAAGAATTCAAGACCCCCAAAAAGAAACTACCAGAGAAGAAAGAAGAGATGTCAGAGACGAGAAGGGGAAGCCGAGAGGGCTCGGAAAGGTAAAAAGAAAAGAGAAGTGGGCTAAGGAGGCAAAAGTGGAGACAACTGGAGAAGAAAGAATGGCAGACGGAGACAAAAGAGGGAAGACGAGAAAGGGTGGACCCACGCAAAAGGGAGGCACGCGAGGGTCAACCGGAGAAGAGAGAAGGACGGCCGGAGACGAGAAGGGGGTGCCGAGGACAATGGTTATGAGATCCTAAGGAAACCGCTAAGAGAACGACGAAGGCTGTCAGGCAACCGGAAACAGACAATGGCTAACGGAAAGGGTGACCTAGATGGGGGAAGAAAAGGGTACGGAGGCCAGAAGGATGAGAGGTGGCCGAGGAAACCACCGGAAAAACGGCGGCAAGCAGATGGAAACTGGCAACGAAGAAGCTAGGGTTGGTTCGTGAGAGGAGAGAGAAGAGAGAAAAAAAAGGTGGGTGGGGGGGGGGCACTGATGGATTTTGAGATATCTAGGTACAAACCCTTTTCTAAGAAGGAAAAAGACATTTTATCCTATTTATATTCTAAGAAGGAAAGAGATAAATCATAAAACAAAATTACAAAAATGCCCTAACTTTAACCATGAGTTTCAACAAATTTATTTCATCTTCCCTCTTTTTCTGTCTCTCTTTTCTTTTTCATTCTTTCTTCTCCTTCACAATCACGATTTCCAATTTTTTTTTTTTTTTTTGACAAGAAACAAACTAACTTTTCATTGAAAGGATGAAAAAAGAGACCAATGCTCCAAAAAACATGATCAAAAAAGAAAAATTTACAAGTCGAACCTATTACTAAAGAGAACAAGTAAAGGCATTCCAGTTCAAATTAATATCAGAAATAGAAAAACCCTCAAAAGTTTTAGATAAGAGCACACCAGGAAGCAACCTTAAGATGAATAGCTTCAAAACGAGCCTCCCACTCCGAAATCAAAAAATAAACGCAAACCTGGAGGTTGCGCTTACAAACTGGTGAAGAAGCCAAGAATTTCAGATTCAATTTTGCTGAAGGAAACCAAAGCCTCACCTTCTAATGAATGAAGGTCCTAAATCACCAACCTCCGTTTCTTAGCCGCCAAAAACTAATGGAAAATCCTGAATTTTCATCCCCTTCTTTGAGCCAATTAACTCCATCAATTCTCCTTTTGCTCAACCTTAGCATCCAGAAAACTAATTTCATCAAGCAAATTCTTTTCCTTTTTCTTCCTTTCCAATTCATAATCAGCATACCAGGATTTCAGAATCAATTTCAAGTGTCTCAATTTTAAAGAAATTGCGAACCCAACCCATCCATGCGCTTTATCTTCCTTAAGAGAGTTCTCAACTAATAAGACACACTCCTTAATGGACAATCGGGGATTAAAAAACCGAAAAGGAGATGGACCCCACTGTAAAATAACGGCCTCCAACAGCAAAGGAAAATGATCTGAAAAAACCCGTGCCTGTCTTGTGACCTGTGAGTTTTCAAACAAATCATCCCATTCCACATTGACAAAGAACCTGTCAATCAAGGATCTGAAGCTTTCCGTACCCAGAGTATAACTCCCATTGCTTGAAGGGAGCTATAACAATCCAAGTTCCGCAATAATTTTATTAAAATTTCTCATCCCTCTTGTCTGCCTCCTCTCCGGAAGACGTTCATGAATCCACCGAGTAATATTAAAGTCACCGCCCACACACCAAGGCGCCTAGCAATAAGATGCTAAAGCAGCCCATTCTTCCCAAAGGAGCTTCCTTTCCCTGTACTTATTTGACCCATAAAAGTTTGAAATCCAACAAAAATTTCATTC
AGCATATAAAAATTTAATAGAAATGGAGTAGCCCCGTTTAAGTACTTCCTGAACCGTCACTTTACTTTCGTCTCACAAAATCAGCAAGCCTCCTGACTTCTCGAAAGTCTCAATGTGAACCCAGTTCACCTCTCTGGAGCTCCTAATTGACTTGACAAAAATTCTATCAATAATGCCATTTTGGTTTCTTGTATCGACACAACATCAGGACAATCTTGTCCTTCATGGTTTTCTTGGTTTTCTTGGACATTTCGATTGGCTGCTTGGATTTCTTCCTGTTGGTTCCCTACATGGGGAAGGGTGGAATCTTGATTCATTTGTCGAGTGCATTGTATAATGTGGCCTTCTGAACGTCAGCGACTCCTTTCTCCTTCATATTCATCGAGTTCTCTACCTACTTTGAGCGTCGGTCAAGGAGTTGGAATGTGAGGGGGGGGGGGGGGAGGCACGTCCATGGTGGTTGTTTCAAAGAAGCAGATTTGAAAGAAATTTCGAGCTTGGATCGTGAGCTCTTATGCCACTTGGTTTGACAAGTGAAGGGGTTTTCTTATTAAGATGACAAAAGTATTACACAAAAGGACAGCAGCCCCCTACTTATACGAAAAATCCTAAACAACCTTGGTAACTCGAAATGAGAATAAAAGGAGATATTCTCAGTCAATTTAATGGTCCCTTTCCTGGAATAGAAAAAACACTACCTTCTGCTATTCTTAAAAAGTATATATGATTTACCTAATTAAGTTTATTAACTTCTATTGTTGGGCTGTATTTGAGGTTGCTGTGGGAAAACACGCTTGATTGTTTCTATGCAATATTCACGAATCAAGATGGTTTATCAATTCAAAACTATTTTTTGTGCACTCTGTTCAACTCTTATCCCTTTTTTTGCAGTGATGATTGTGCTGAGCCACCAGTCAAATGGCATGTTTATGTTACGGGTCACAGTTTGGGTGGTGCACTAGCTACACTTCTTGCTCTTGAACTCTCATCAAGTCAACTTGCAAGGTGA

mRNA sequence

ATGGGTGGAGGTGGAAAGTTGCTGCTGGAGATCAAATATAGGACTTTTGATGAAATTGAAGATGACAAACGGTGGTGGAGAGTCCCCTTCATTTCTGAATTTCTTCGCAATAATGGTTTTGCATCTGCTCTAAACAAGGTTGTTGGATCTGACACTGTGCCTGTGCGTCAGTTTGTAGAATATGCTTTTGGGAAGTTAAAGTCATTCAATGATGAATACCCATCGAGTGATCATTTATTAAGCAAGAAAAAAGACAAAGAGGAAATACCTTCATACATGCAGACTAACGGTGAAGTCTCTATAACTGATATAAGCTATCCGAAAGAGAGCAATTCAGATGAGGTTGCAACAAGTGATAATACTGTGGAAAGTGGACAATTGCTGAGAGAAGTGACACAAAGTATTTTAATAAAGCAATTCGATAAACAATTTTGGACAAACTTGGCTGATGTAACAAATCAGCATATTGTCAAGAAACTTGGTCTTCCCGCCCCTGAGAAATTAAAGTGGGATGGATTTGAGTTACTAAATAAAATTGGTTTGGAAGCACGAAAGAGTGCTGAAGCTGGTTATATCGAATCGGGGCTTGCAACTCCCAAAAGCTTGGATGTTGATCATGAACAGAAGAACATTAGAATGGTGGACTCAACGCTAACTGATGTGAAGAAAGTAACAAAGGATCTACTAAGTCAAACTGAGTCTGTATTGGGGGCATTGATGGTTCTGACAGCAACAATTTCTCAATTGAACAAGGAAGCACAGCTTATAGGAAAGAAAGATACTAAAGATGAGGTCTCAAAAAAAGTGGGAGAGAAGCTGGGTAGTTCAGGGGATGGTTCCTTGTTGGATAATAGGAATTCTGAGGAAATGAAAGCGCTTTTTGCAACTGCAGAAAGTGCCATGGAAGCTTGGGCAATGCTTGCTACATCACTTGGCCATCCTAGTTTCATAAAGTCAGAATTTGAAAAGTTATGTTTCTTAGATAATGAATCTACAGACACACAGGTTGCAATTTGGCGTGATTTTATGCGGAAAAGACTGGTTGTTGCCTTCAGGGGCACAGAACAATCAAGATGGAAGGATCTAAGAACGGACCTGATGCTAGTCCCTGCAGGGTTAAATCCTGAAAGGATAAGTGGAGACTTCAACGAGGAAGTTCAAGTAAGAAATATTGTTATTTTATTACAGGATACTGATGATTGTGCTGAGCCACCAGTCAAATGGCATGTTTATGTTACGGGTCACAGTTTGGGTGGTGCACTAGCTACACTTCTTGCTCTTGAACTCTCATCAAGTCAACTTGCAAGGTGA

Coding sequence (CDS)

ATGGGTGGAGGTGGAAAGTTGCTGCTGGAGATCAAATATAGGACTTTTGATGAAATTGAAGATGACAAACGGTGGTGGAGAGTCCCCTTCATTTCTGAATTTCTTCGCAATAATGGTTTTGCATCTGCTCTAAACAAGGTTGTTGGATCTGACACTGTGCCTGTGCGTCAGTTTGTAGAATATGCTTTTGGGAAGTTAAAGTCATTCAATGATGAATACCCATCGAGTGATCATTTATTAAGCAAGAAAAAAGACAAAGAGGAAATACCTTCATACATGCAGACTAACGGTGAAGTCTCTATAACTGATATAAGCTATCCGAAAGAGAGCAATTCAGATGAGGTTGCAACAAGTGATAATACTGTGGAAAGTGGACAATTGCTGAGAGAAGTGACACAAAGTATTTTAATAAAGCAATTCGATAAACAATTTTGGACAAACTTGGCTGATGTAACAAATCAGCATATTGTCAAGAAACTTGGTCTTCCCGCCCCTGAGAAATTAAAGTGGGATGGATTTGAGTTACTAAATAAAATTGGTTTGGAAGCACGAAAGAGTGCTGAAGCTGGTTATATCGAATCGGGGCTTGCAACTCCCAAAAGCTTGGATGTTGATCATGAACAGAAGAACATTAGAATGGTGGACTCAACGCTAACTGATGTGAAGAAAGTAACAAAGGATCTACTAAGTCAAACTGAGTCTGTATTGGGGGCATTGATGGTTCTGACAGCAACAATTTCTCAATTGAACAAGGAAGCACAGCTTATAGGAAAGAAAGATACTAAAGATGAGGTCTCAAAAAAAGTGGGAGAGAAGCTGGGTAGTTCAGGGGATGGTTCCTTGTTGGATAATAGGAATTCTGAGGAAATGAAAGCGCTTTTTGCAACTGCAGAAAGTGCCATGGAAGCTTGGGCAATGCTTGCTACATCACTTGGCCATCCTAGTTTCATAAAGTCAGAATTTGAAAAGTTATGTTTCTTAGATAATGAATCTACAGACACACAGGTTGCAATTTGGCGTGATTTTATGCGGAAAAGACTGGTTGTTGCCTTCAGGGGCACAGAACAATCAAGATGGAAGGATCTAAGAACGGACCTGATGCTAGTCCCTGCAGGGTTAAATCCTGAAAGGATAAGTGGAGACTTCAACGAGGAAGTTCAAGTAAGAAATATTGTTATTTTATTACAGGATACTGATGATTGTGCTGAGCCACCAGTCAAATGGCATGTTTATGTTACGGGTCACAGTTTGGGTGGTGCACTAGCTACACTTCTTGCTCTTGAACTCTCATCAAGTCAACTTGCAAGGTGA

Protein sequence

MGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRNNGFASALNKVVGSDTVPVRQFVEYAFGKLKSFNDEYPSSDHLLSKKKDKEEIPSYMQTNGEVSITDISYPKESNSDEVATSDNTVESGQLLREVTQSILIKQFDKQFWTNLADVTNQHIVKKLGLPAPEKLKWDGFELLNKIGLEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTKDLLSQTESVLGALMVLTATISQLNKEAQLIGKKDTKDEVSKKVGEKLGSSGDGSLLDNRNSEEMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRKRLVVAFRGTEQSRWKDLRTDLMLVPAGLNPERISGDFNEEVQVRNIVILLQDTDDCAEPPVKWHVYVTGHSLGGALATLLALELSSSQLAR
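
The protein sequence above should correspond to a standard-code translation of the CDS. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS from its first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Unknown or ambiguous codons are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3].upper(), "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGGGTGGAGGTGGAAAGTTGCTGCTGGAG"))  # MGGGGKLLLE
```

Applied to the full CDS, the same function would be expected to stop at the terminal TGA codon.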
Homology
BLAST of Tan0001991 vs. NCBI nr
Match: XP_038876505.1 (uncharacterized protein LOC120068939 [Benincasa hispida])

HSP 1 Score: 771.9 bits (1992), Expect = 2.9e-219
Identity = 396/452 (87.61%), Positives = 414/452 (91.59%), Query Frame = 0

Query: 1   MGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRNNGFASALNKVVGSDTVPVRQFVE 60
           MGGGGKLL+EIKYRTFDEIEDDKRWWRVPFISEFLR+NGF SALNKVVGSDTVPVRQFVE
Sbjct: 238 MGGGGKLLMEIKYRTFDEIEDDKRWWRVPFISEFLRSNGFVSALNKVVGSDTVPVRQFVE 297

Query: 61  YAFGKLKSFNDEYPSSDHLLSKKKDKEEIPSYMQTNGEVSITDISYPKESNSDEVATSDN 120
           YAFGKLKSFNDEY SS HLLSK+ + E+IPSY+QTN +VSITDI YP E  SDEV  +DN
Sbjct: 298 YAFGKLKSFNDEYQSSHHLLSKQNNAEDIPSYVQTNTKVSITDIKYPNEGKSDEVEINDN 357

Query: 121 TVESGQLLREVTQSILIKQFDKQFWTNLADVTNQHIVKKLGLPAPEKLKWDGFELLNKIG 180
           TVESGQLL+EVTQS+L KQFDKQFWTNLADVTNQ+IVKKLGLPAPEK KWDGFELLNKIG
Sbjct: 358 TVESGQLLKEVTQSLLTKQFDKQFWTNLADVTNQNIVKKLGLPAPEKFKWDGFELLNKIG 417

Query: 181 LEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTKDLLSQTESVLGALM 240
           LEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTKDLLSQTESVLGALM
Sbjct: 418 LEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTKDLLSQTESVLGALM 477

Query: 241 VLTATISQLNKEAQLIGKKDTKDEVSKKVGEKLGSSGDGSLLDNRNSEEMKALFATAESA 300
           VLTATISQLNKEA+L+GKKDTKDE SKK GEKLGSSGDGSLLDNRNSEEMKALFATAESA
Sbjct: 478 VLTATISQLNKEARLVGKKDTKDEGSKKEGEKLGSSGDGSLLDNRNSEEMKALFATAESA 537

Query: 301 MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRKRLVVAFRGTEQSRWK 360
           MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMR+RLVVAFRGTEQSRWK
Sbjct: 538 MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRLVVAFRGTEQSRWK 597

Query: 361 DLRTDLMLVPAGLNPERISGDFNEEVQVRN----------------IVILLQDTDDCAEP 420
           DLRTDLMLVPAGLNPERISGDFNEEVQV +                I + +   D+CAEP
Sbjct: 598 DLRTDLMLVPAGLNPERISGDFNEEVQVHSGFLSAYDSVRMRIISLIKMAINYNDECAEP 657

Query: 421 PVKWHVYVTGHSLGGALATLLALELSSSQLAR 437
           PVKWHVYVTGHSLGGALATLLALELSSSQLAR
Sbjct: 658 PVKWHVYVTGHSLGGALATLLALELSSSQLAR 689
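
The identity and positives percentages in each HSP header are the match counts divided by the alignment length. The alignment length includes gap columns, which is why it is 452 here although the query protein is shorter. A small sketch of that arithmetic, with `hsp_percent` as a hypothetical helper name:

```python
def hsp_percent(matches, alignment_length):
    """Return matches / alignment_length as a percentage with two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100 * matches / alignment_length:.2f}"


# Values taken from the HSP header of the Benincasa hispida hit above.
print(hsp_percent(396, 452))  # 87.61 (identity)
print(hsp_percent(414, 452))  # 91.59 (positives)
```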

BLAST of Tan0001991 vs. NCBI nr
Match: XP_022155152.1 (uncharacterized protein LOC111022292 [Momordica charantia])

HSP 1 Score: 759.6 bits (1960), Expect = 1.5e-215
Identity = 392/452 (86.73%), Positives = 412/452 (91.15%), Query Frame = 0

Query: 1   MGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRNNGFASALNKVVGSDTVPVRQFVE 60
           MGGGGKL LEIKYRTFDEIEDDKRWWRVPFISEFLRN  FASALNK+VGSDTVPVRQFVE
Sbjct: 236 MGGGGKLQLEIKYRTFDEIEDDKRWWRVPFISEFLRNKNFASALNKLVGSDTVPVRQFVE 295

Query: 61  YAFGKLKSFNDEYPSSDHLLSKKKDKEEIPSYMQTNGEVSITDISYPKESNSDEVATSDN 120
           YAFGKLKSFNDEYPSSDHLLSK KDK++ PS +Q N EVSITDIS  KESNSDEVA SDN
Sbjct: 296 YAFGKLKSFNDEYPSSDHLLSKPKDKDDTPSQLQNNEEVSITDISSVKESNSDEVAASDN 355

Query: 121 TVESGQLLREVTQSILIKQFDKQFWTNLADVTNQHIVKKLGLPAPEKLKWDGFELLNKIG 180
            VE+GQ L+EVTQSIL KQFDKQFWTNLADVTNQ+IVKKLGLPAPEKLKWDGFELLNKIG
Sbjct: 356 RVENGQSLKEVTQSILAKQFDKQFWTNLADVTNQNIVKKLGLPAPEKLKWDGFELLNKIG 415

Query: 181 LEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTKDLLSQTESVLGALM 240
           LEARKSAEAGYIESGLA+PKSLD+D EQKNIRM +STLTDVKKV KDLLSQTESVLGALM
Sbjct: 416 LEARKSAEAGYIESGLASPKSLDIDQEQKNIRMAESTLTDVKKVKKDLLSQTESVLGALM 475

Query: 241 VLTATISQLNKEAQLIGKKDTKDEVSKKVGEKLGSSGDGSLLDNRNSEEMKALFATAESA 300
           VLTAT+SQLNKEAQLIGKK+TKD +SKKVGE LGSSGDGSLLDNRNSEEMKALFATAESA
Sbjct: 476 VLTATVSQLNKEAQLIGKKETKDGISKKVGETLGSSGDGSLLDNRNSEEMKALFATAESA 535

Query: 301 MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRKRLVVAFRGTEQSRWK 360
           MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMR+RLVV+FRGTEQSRWK
Sbjct: 536 MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRLVVSFRGTEQSRWK 595

Query: 361 DLRTDLMLVPAGLNPERISGDFNEEVQVRN------------IVILLQDT----DDCAEP 420
           DLRTDLMLVPAGLNPERISGDFN+EVQV +            I+ L++      DDC EP
Sbjct: 596 DLRTDLMLVPAGLNPERISGDFNKEVQVHSGFLSAYDSVRVRIISLIKKAINYKDDCGEP 655

Query: 421 PVKWHVYVTGHSLGGALATLLALELSSSQLAR 437
           PVKWHVYVTGHSLGGALATLLALELSSSQLAR
Sbjct: 656 PVKWHVYVTGHSLGGALATLLALELSSSQLAR 687

BLAST of Tan0001991 vs. NCBI nr
Match: XP_022933071.1 (uncharacterized protein LOC111439777 isoform X2 [Cucurbita moschata])

HSP 1 Score: 750.7 bits (1937), Expect = 6.9e-213
Identity = 389/452 (86.06%), Positives = 411/452 (90.93%), Query Frame = 0

Query: 1   MGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRNNGFASALNKVVGSDTVPVRQFVE 60
           MGGGGKLLLEIK+ TFDEIEDDKRWWRVPFISEFLR+NGFASALNKVVGSDTV V QFVE
Sbjct: 90  MGGGGKLLLEIKFMTFDEIEDDKRWWRVPFISEFLRSNGFASALNKVVGSDTVSVGQFVE 149

Query: 61  YAFGKLKSFNDEYPSSDHLLSKKKDKEEIPSYMQTNGEVSITDISYPKESNSDEVATSDN 120
           YAFGKLKSFNDEY SSD+LLSK+KDKE+IPSYMQTN EVSITDIS P+E  SD+ AT+DN
Sbjct: 150 YAFGKLKSFNDEYQSSDNLLSKQKDKEDIPSYMQTNAEVSITDISDPEEDESDDDATNDN 209

Query: 121 TVESGQLLREVTQSILIKQFDKQFWTNLADVTNQHIVKKLGLPAPEKLKWDGFELLNKIG 180
           T E+GQLL+EVTQSIL KQFDK FWTNLADVTNQ+IVKKLGLPAPEKLKWDGFELLNKIG
Sbjct: 210 TKETGQLLKEVTQSILAKQFDKHFWTNLADVTNQNIVKKLGLPAPEKLKWDGFELLNKIG 269

Query: 181 LEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTKDLLSQTESVLGALM 240
           LEARKSAEAGYIESGLAT KSLDVD EQKNI+MVDSTLTDVKK+TKDLLSQTESVLG LM
Sbjct: 270 LEARKSAEAGYIESGLATSKSLDVDQEQKNIKMVDSTLTDVKKITKDLLSQTESVLGGLM 329

Query: 241 VLTATISQLNKEAQLIGKKDTKDEVSKKVGEKLGSSGDGSLLDNRNSEEMKALFATAESA 300
           VLTATISQLNKE+Q IGKKDT+DE SKKVGEKLGSSGDGSLLDNRNSEEM+ALFATAESA
Sbjct: 330 VLTATISQLNKESQ-IGKKDTEDEGSKKVGEKLGSSGDGSLLDNRNSEEMRALFATAESA 389

Query: 301 MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRKRLVVAFRGTEQSRWK 360
           MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDF R+RLVVAFRGTEQSRWK
Sbjct: 390 MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFPRRRLVVAFRGTEQSRWK 449

Query: 361 DLRTDLMLVPAGLNPERISGDFNEEVQVRN----------------IVILLQDTDDCAEP 420
           DLRTDLML PAGLNPERISGDFNEE+QV +                I + +   DDCAEP
Sbjct: 450 DLRTDLMLAPAGLNPERISGDFNEEIQVHSGFLSAYDSVRMRIMSLIKMAINYNDDCAEP 509

Query: 421 PVKWHVYVTGHSLGGALATLLALELSSSQLAR 437
           PVKWHVYVTGHSLGGALATLLALEL+SSQLAR
Sbjct: 510 PVKWHVYVTGHSLGGALATLLALELTSSQLAR 540

BLAST of Tan0001991 vs. NCBI nr
Match: KAG7032442.1 (faeA, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 750.7 bits (1937), Expect = 6.9e-213
Identity = 389/452 (86.06%), Positives = 411/452 (90.93%), Query Frame = 0

Query: 1   MGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRNNGFASALNKVVGSDTVPVRQFVE 60
           MGGGGKLLLEIK+ TFDEIEDDKRWWRVPFISEFLR+NGFASALNKVVGSDTV V QFVE
Sbjct: 236 MGGGGKLLLEIKFMTFDEIEDDKRWWRVPFISEFLRSNGFASALNKVVGSDTVSVGQFVE 295

Query: 61  YAFGKLKSFNDEYPSSDHLLSKKKDKEEIPSYMQTNGEVSITDISYPKESNSDEVATSDN 120
           YAFGKLKSFNDEY SSD+LLSK+KDKE+IPSYMQTN EVSITDIS P+E  SD+ AT+DN
Sbjct: 296 YAFGKLKSFNDEYQSSDNLLSKQKDKEDIPSYMQTNAEVSITDISDPEEDESDDDATNDN 355

Query: 121 TVESGQLLREVTQSILIKQFDKQFWTNLADVTNQHIVKKLGLPAPEKLKWDGFELLNKIG 180
           T E+GQLL+EVTQSIL KQFDK FWTNLADVTNQ+IVKKLGLPAPEKLKWDGFELLNKIG
Sbjct: 356 TKETGQLLKEVTQSILAKQFDKHFWTNLADVTNQNIVKKLGLPAPEKLKWDGFELLNKIG 415

Query: 181 LEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTKDLLSQTESVLGALM 240
           LEARKSAEAGYIESGLAT KSLDVD EQKNI+MVDSTLTDVKK+TKDLLSQTESVLG LM
Sbjct: 416 LEARKSAEAGYIESGLATSKSLDVDQEQKNIKMVDSTLTDVKKITKDLLSQTESVLGGLM 475

Query: 241 VLTATISQLNKEAQLIGKKDTKDEVSKKVGEKLGSSGDGSLLDNRNSEEMKALFATAESA 300
           VLTATISQLNKE+Q IGKKDT+DE SKKVGEKLGSSGDGSLLDNRNSEEM+ALFATAESA
Sbjct: 476 VLTATISQLNKESQ-IGKKDTEDEGSKKVGEKLGSSGDGSLLDNRNSEEMRALFATAESA 535

Query: 301 MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRKRLVVAFRGTEQSRWK 360
           MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDF R+RLVVAFRGTEQSRWK
Sbjct: 536 MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFPRRRLVVAFRGTEQSRWK 595

Query: 361 DLRTDLMLVPAGLNPERISGDFNEEVQVRN----------------IVILLQDTDDCAEP 420
           DLRTDLML PAGLNPERISGDFNEE+QV +                I + +   DDCAEP
Sbjct: 596 DLRTDLMLAPAGLNPERISGDFNEEIQVHSGFLSAYDSVRMRIMSLIKMAINYNDDCAEP 655

Query: 421 PVKWHVYVTGHSLGGALATLLALELSSSQLAR 437
           PVKWHVYVTGHSLGGALATLLALEL+SSQLAR
Sbjct: 656 PVKWHVYVTGHSLGGALATLLALELTSSQLAR 686

BLAST of Tan0001991 vs. NCBI nr
Match: XP_022933070.1 (uncharacterized protein LOC111439777 isoform X1 [Cucurbita moschata] >KAG6601673.1 hypothetical protein SDJN03_06906, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 750.7 bits (1937), Expect = 6.9e-213
Identity = 389/452 (86.06%), Positives = 411/452 (90.93%), Query Frame = 0

Query: 1   MGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRNNGFASALNKVVGSDTVPVRQFVE 60
           MGGGGKLLLEIK+ TFDEIEDDKRWWRVPFISEFLR+NGFASALNKVVGSDTV V QFVE
Sbjct: 236 MGGGGKLLLEIKFMTFDEIEDDKRWWRVPFISEFLRSNGFASALNKVVGSDTVSVGQFVE 295

Query: 61  YAFGKLKSFNDEYPSSDHLLSKKKDKEEIPSYMQTNGEVSITDISYPKESNSDEVATSDN 120
           YAFGKLKSFNDEY SSD+LLSK+KDKE+IPSYMQTN EVSITDIS P+E  SD+ AT+DN
Sbjct: 296 YAFGKLKSFNDEYQSSDNLLSKQKDKEDIPSYMQTNAEVSITDISDPEEDESDDDATNDN 355

Query: 121 TVESGQLLREVTQSILIKQFDKQFWTNLADVTNQHIVKKLGLPAPEKLKWDGFELLNKIG 180
           T E+GQLL+EVTQSIL KQFDK FWTNLADVTNQ+IVKKLGLPAPEKLKWDGFELLNKIG
Sbjct: 356 TKETGQLLKEVTQSILAKQFDKHFWTNLADVTNQNIVKKLGLPAPEKLKWDGFELLNKIG 415

Query: 181 LEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTKDLLSQTESVLGALM 240
           LEARKSAEAGYIESGLAT KSLDVD EQKNI+MVDSTLTDVKK+TKDLLSQTESVLG LM
Sbjct: 416 LEARKSAEAGYIESGLATSKSLDVDQEQKNIKMVDSTLTDVKKITKDLLSQTESVLGGLM 475

Query: 241 VLTATISQLNKEAQLIGKKDTKDEVSKKVGEKLGSSGDGSLLDNRNSEEMKALFATAESA 300
           VLTATISQLNKE+Q IGKKDT+DE SKKVGEKLGSSGDGSLLDNRNSEEM+ALFATAESA
Sbjct: 476 VLTATISQLNKESQ-IGKKDTEDEGSKKVGEKLGSSGDGSLLDNRNSEEMRALFATAESA 535

Query: 301 MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRKRLVVAFRGTEQSRWK 360
           MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDF R+RLVVAFRGTEQSRWK
Sbjct: 536 MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFPRRRLVVAFRGTEQSRWK 595

Query: 361 DLRTDLMLVPAGLNPERISGDFNEEVQVRN----------------IVILLQDTDDCAEP 420
           DLRTDLML PAGLNPERISGDFNEE+QV +                I + +   DDCAEP
Sbjct: 596 DLRTDLMLAPAGLNPERISGDFNEEIQVHSGFLSAYDSVRMRIMSLIKMAINYNDDCAEP 655

Query: 421 PVKWHVYVTGHSLGGALATLLALELSSSQLAR 437
           PVKWHVYVTGHSLGGALATLLALEL+SSQLAR
Sbjct: 656 PVKWHVYVTGHSLGGALATLLALELTSSQLAR 686

BLAST of Tan0001991 vs. ExPASy TrEMBL
Match: A0A6J1DNK8 (uncharacterized protein LOC111022292 OS=Momordica charantia OX=3673 GN=LOC111022292 PE=4 SV=1)

HSP 1 Score: 759.6 bits (1960), Expect = 7.1e-216
Identity = 392/452 (86.73%), Positives = 412/452 (91.15%), Query Frame = 0

Query: 1   MGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRNNGFASALNKVVGSDTVPVRQFVE 60
           MGGGGKL LEIKYRTFDEIEDDKRWWRVPFISEFLRN  FASALNK+VGSDTVPVRQFVE
Sbjct: 236 MGGGGKLQLEIKYRTFDEIEDDKRWWRVPFISEFLRNKNFASALNKLVGSDTVPVRQFVE 295

Query: 61  YAFGKLKSFNDEYPSSDHLLSKKKDKEEIPSYMQTNGEVSITDISYPKESNSDEVATSDN 120
           YAFGKLKSFNDEYPSSDHLLSK KDK++ PS +Q N EVSITDIS  KESNSDEVA SDN
Sbjct: 296 YAFGKLKSFNDEYPSSDHLLSKPKDKDDTPSQLQNNEEVSITDISSVKESNSDEVAASDN 355

Query: 121 TVESGQLLREVTQSILIKQFDKQFWTNLADVTNQHIVKKLGLPAPEKLKWDGFELLNKIG 180
            VE+GQ L+EVTQSIL KQFDKQFWTNLADVTNQ+IVKKLGLPAPEKLKWDGFELLNKIG
Sbjct: 356 RVENGQSLKEVTQSILAKQFDKQFWTNLADVTNQNIVKKLGLPAPEKLKWDGFELLNKIG 415

Query: 181 LEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTKDLLSQTESVLGALM 240
           LEARKSAEAGYIESGLA+PKSLD+D EQKNIRM +STLTDVKKV KDLLSQTESVLGALM
Sbjct: 416 LEARKSAEAGYIESGLASPKSLDIDQEQKNIRMAESTLTDVKKVKKDLLSQTESVLGALM 475

Query: 241 VLTATISQLNKEAQLIGKKDTKDEVSKKVGEKLGSSGDGSLLDNRNSEEMKALFATAESA 300
           VLTAT+SQLNKEAQLIGKK+TKD +SKKVGE LGSSGDGSLLDNRNSEEMKALFATAESA
Sbjct: 476 VLTATVSQLNKEAQLIGKKETKDGISKKVGETLGSSGDGSLLDNRNSEEMKALFATAESA 535

Query: 301 MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRKRLVVAFRGTEQSRWK 360
           MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMR+RLVV+FRGTEQSRWK
Sbjct: 536 MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRRRLVVSFRGTEQSRWK 595

Query: 361 DLRTDLMLVPAGLNPERISGDFNEEVQVRN------------IVILLQDT----DDCAEP 420
           DLRTDLMLVPAGLNPERISGDFN+EVQV +            I+ L++      DDC EP
Sbjct: 596 DLRTDLMLVPAGLNPERISGDFNKEVQVHSGFLSAYDSVRVRIISLIKKAINYKDDCGEP 655

Query: 421 PVKWHVYVTGHSLGGALATLLALELSSSQLAR 437
           PVKWHVYVTGHSLGGALATLLALELSSSQLAR
Sbjct: 656 PVKWHVYVTGHSLGGALATLLALELSSSQLAR 687

BLAST of Tan0001991 vs. ExPASy TrEMBL
Match: A0A6J1EY43 (uncharacterized protein LOC111439777 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111439777 PE=4 SV=1)

HSP 1 Score: 750.7 bits (1937), Expect = 3.3e-213
Identity = 389/452 (86.06%), Positives = 411/452 (90.93%), Query Frame = 0

Query: 1   MGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRNNGFASALNKVVGSDTVPVRQFVE 60
           MGGGGKLLLEIK+ TFDEIEDDKRWWRVPFISEFLR+NGFASALNKVVGSDTV V QFVE
Sbjct: 90  MGGGGKLLLEIKFMTFDEIEDDKRWWRVPFISEFLRSNGFASALNKVVGSDTVSVGQFVE 149

Query: 61  YAFGKLKSFNDEYPSSDHLLSKKKDKEEIPSYMQTNGEVSITDISYPKESNSDEVATSDN 120
           YAFGKLKSFNDEY SSD+LLSK+KDKE+IPSYMQTN EVSITDIS P+E  SD+ AT+DN
Sbjct: 150 YAFGKLKSFNDEYQSSDNLLSKQKDKEDIPSYMQTNAEVSITDISDPEEDESDDDATNDN 209

Query: 121 TVESGQLLREVTQSILIKQFDKQFWTNLADVTNQHIVKKLGLPAPEKLKWDGFELLNKIG 180
           T E+GQLL+EVTQSIL KQFDK FWTNLADVTNQ+IVKKLGLPAPEKLKWDGFELLNKIG
Sbjct: 210 TKETGQLLKEVTQSILAKQFDKHFWTNLADVTNQNIVKKLGLPAPEKLKWDGFELLNKIG 269

Query: 181 LEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTKDLLSQTESVLGALM 240
           LEARKSAEAGYIESGLAT KSLDVD EQKNI+MVDSTLTDVKK+TKDLLSQTESVLG LM
Sbjct: 270 LEARKSAEAGYIESGLATSKSLDVDQEQKNIKMVDSTLTDVKKITKDLLSQTESVLGGLM 329

Query: 241 VLTATISQLNKEAQLIGKKDTKDEVSKKVGEKLGSSGDGSLLDNRNSEEMKALFATAESA 300
           VLTATISQLNKE+Q IGKKDT+DE SKKVGEKLGSSGDGSLLDNRNSEEM+ALFATAESA
Sbjct: 330 VLTATISQLNKESQ-IGKKDTEDEGSKKVGEKLGSSGDGSLLDNRNSEEMRALFATAESA 389

Query: 301 MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRKRLVVAFRGTEQSRWK 360
           MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDF R+RLVVAFRGTEQSRWK
Sbjct: 390 MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFPRRRLVVAFRGTEQSRWK 449

Query: 361 DLRTDLMLVPAGLNPERISGDFNEEVQVRN----------------IVILLQDTDDCAEP 420
           DLRTDLML PAGLNPERISGDFNEE+QV +                I + +   DDCAEP
Sbjct: 450 DLRTDLMLAPAGLNPERISGDFNEEIQVHSGFLSAYDSVRMRIMSLIKMAINYNDDCAEP 509

Query: 421 PVKWHVYVTGHSLGGALATLLALELSSSQLAR 437
           PVKWHVYVTGHSLGGALATLLALEL+SSQLAR
Sbjct: 510 PVKWHVYVTGHSLGGALATLLALELTSSQLAR 540

BLAST of Tan0001991 vs. ExPASy TrEMBL
Match: A0A6J1EYQ8 (uncharacterized protein LOC111439777 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111439777 PE=4 SV=1)

HSP 1 Score: 750.7 bits (1937), Expect = 3.3e-213
Identity = 389/452 (86.06%), Positives = 411/452 (90.93%), Query Frame = 0

Query: 1   MGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRNNGFASALNKVVGSDTVPVRQFVE 60
           MGGGGKLLLEIK+ TFDEIEDDKRWWRVPFISEFLR+NGFASALNKVVGSDTV V QFVE
Sbjct: 73  MGGGGKLLLEIKFMTFDEIEDDKRWWRVPFISEFLRSNGFASALNKVVGSDTVSVGQFVE 132

Query: 61  YAFGKLKSFNDEYPSSDHLLSKKKDKEEIPSYMQTNGEVSITDISYPKESNSDEVATSDN 120
           YAFGKLKSFNDEY SSD+LLSK+KDKE+IPSYMQTN EVSITDIS P+E  SD+ AT+DN
Sbjct: 133 YAFGKLKSFNDEYQSSDNLLSKQKDKEDIPSYMQTNAEVSITDISDPEEDESDDDATNDN 192

Query: 121 TVESGQLLREVTQSILIKQFDKQFWTNLADVTNQHIVKKLGLPAPEKLKWDGFELLNKIG 180
           T E+GQLL+EVTQSIL KQFDK FWTNLADVTNQ+IVKKLGLPAPEKLKWDGFELLNKIG
Sbjct: 193 TKETGQLLKEVTQSILAKQFDKHFWTNLADVTNQNIVKKLGLPAPEKLKWDGFELLNKIG 252

Query: 181 LEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTKDLLSQTESVLGALM 240
           LEARKSAEAGYIESGLAT KSLDVD EQKNI+MVDSTLTDVKK+TKDLLSQTESVLG LM
Sbjct: 253 LEARKSAEAGYIESGLATSKSLDVDQEQKNIKMVDSTLTDVKKITKDLLSQTESVLGGLM 312

Query: 241 VLTATISQLNKEAQLIGKKDTKDEVSKKVGEKLGSSGDGSLLDNRNSEEMKALFATAESA 300
           VLTATISQLNKE+Q IGKKDT+DE SKKVGEKLGSSGDGSLLDNRNSEEM+ALFATAESA
Sbjct: 313 VLTATISQLNKESQ-IGKKDTEDEGSKKVGEKLGSSGDGSLLDNRNSEEMRALFATAESA 372

Query: 301 MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRKRLVVAFRGTEQSRWK 360
           MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDF R+RLVVAFRGTEQSRWK
Sbjct: 373 MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFPRRRLVVAFRGTEQSRWK 432

Query: 361 DLRTDLMLVPAGLNPERISGDFNEEVQVRN----------------IVILLQDTDDCAEP 420
           DLRTDLML PAGLNPERISGDFNEE+QV +                I + +   DDCAEP
Sbjct: 433 DLRTDLMLAPAGLNPERISGDFNEEIQVHSGFLSAYDSVRMRIMSLIKMAINYNDDCAEP 492

Query: 421 PVKWHVYVTGHSLGGALATLLALELSSSQLAR 437
           PVKWHVYVTGHSLGGALATLLALEL+SSQLAR
Sbjct: 493 PVKWHVYVTGHSLGGALATLLALELTSSQLAR 523

BLAST of Tan0001991 vs. ExPASy TrEMBL
Match: A0A6J1F3W8 (uncharacterized protein LOC111439777 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439777 PE=4 SV=1)

HSP 1 Score: 750.7 bits (1937), Expect = 3.3e-213
Identity = 389/452 (86.06%), Positives = 411/452 (90.93%), Query Frame = 0

Query: 1   MGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRNNGFASALNKVVGSDTVPVRQFVE 60
           MGGGGKLLLEIK+ TFDEIEDDKRWWRVPFISEFLR+NGFASALNKVVGSDTV V QFVE
Sbjct: 236 MGGGGKLLLEIKFMTFDEIEDDKRWWRVPFISEFLRSNGFASALNKVVGSDTVSVGQFVE 295

Query: 61  YAFGKLKSFNDEYPSSDHLLSKKKDKEEIPSYMQTNGEVSITDISYPKESNSDEVATSDN 120
           YAFGKLKSFNDEY SSD+LLSK+KDKE+IPSYMQTN EVSITDIS P+E  SD+ AT+DN
Sbjct: 296 YAFGKLKSFNDEYQSSDNLLSKQKDKEDIPSYMQTNAEVSITDISDPEEDESDDDATNDN 355

Query: 121 TVESGQLLREVTQSILIKQFDKQFWTNLADVTNQHIVKKLGLPAPEKLKWDGFELLNKIG 180
           T E+GQLL+EVTQSIL KQFDK FWTNLADVTNQ+IVKKLGLPAPEKLKWDGFELLNKIG
Sbjct: 356 TKETGQLLKEVTQSILAKQFDKHFWTNLADVTNQNIVKKLGLPAPEKLKWDGFELLNKIG 415

Query: 181 LEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTKDLLSQTESVLGALM 240
           LEARKSAEAGYIESGLAT KSLDVD EQKNI+MVDSTLTDVKK+TKDLLSQTESVLG LM
Sbjct: 416 LEARKSAEAGYIESGLATSKSLDVDQEQKNIKMVDSTLTDVKKITKDLLSQTESVLGGLM 475

Query: 241 VLTATISQLNKEAQLIGKKDTKDEVSKKVGEKLGSSGDGSLLDNRNSEEMKALFATAESA 300
           VLTATISQLNKE+Q IGKKDT+DE SKKVGEKLGSSGDGSLLDNRNSEEM+ALFATAESA
Sbjct: 476 VLTATISQLNKESQ-IGKKDTEDEGSKKVGEKLGSSGDGSLLDNRNSEEMRALFATAESA 535

Query: 301 MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRKRLVVAFRGTEQSRWK 360
           MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDF R+RLVVAFRGTEQSRWK
Sbjct: 536 MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFPRRRLVVAFRGTEQSRWK 595

Query: 361 DLRTDLMLVPAGLNPERISGDFNEEVQVRN----------------IVILLQDTDDCAEP 420
           DLRTDLML PAGLNPERISGDFNEE+QV +                I + +   DDCAEP
Sbjct: 596 DLRTDLMLAPAGLNPERISGDFNEEIQVHSGFLSAYDSVRMRIMSLIKMAINYNDDCAEP 655

Query: 421 PVKWHVYVTGHSLGGALATLLALELSSSQLAR 437
           PVKWHVYVTGHSLGGALATLLALEL+SSQLAR
Sbjct: 656 PVKWHVYVTGHSLGGALATLLALELTSSQLAR 686

BLAST of Tan0001991 vs. ExPASy TrEMBL
Match: A0A6J1K066 (uncharacterized protein LOC111489848 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111489848 PE=4 SV=1)

HSP 1 Score: 746.9 bits (1927), Expect = 4.8e-212
Identity = 387/452 (85.62%), Positives = 410/452 (90.71%), Query Frame = 0

Query: 1   MGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRNNGFASALNKVVGSDTVPVRQFVE 60
           MGGGGKLLLEIK+ TFDEIEDDKRWWRVPFISEFLR+NGFASALNKVVGSDTV V QFVE
Sbjct: 90  MGGGGKLLLEIKFMTFDEIEDDKRWWRVPFISEFLRSNGFASALNKVVGSDTVSVGQFVE 149

Query: 61  YAFGKLKSFNDEYPSSDHLLSKKKDKEEIPSYMQTNGEVSITDISYPKESNSDEVATSDN 120
           YAFGKLKSFNDEY SSD+LLSK+KDKE+IPSYMQTN EVSITDIS P+E   D+ AT+DN
Sbjct: 150 YAFGKLKSFNDEYQSSDNLLSKQKDKEDIPSYMQTNAEVSITDISDPEEDELDDDATNDN 209

Query: 121 TVESGQLLREVTQSILIKQFDKQFWTNLADVTNQHIVKKLGLPAPEKLKWDGFELLNKIG 180
           T+E+GQLL+EVTQSIL KQFDK FWTNLADVTNQ+IVKKLGLPAPEKLKWDG ELLNKIG
Sbjct: 210 TMETGQLLKEVTQSILAKQFDKHFWTNLADVTNQNIVKKLGLPAPEKLKWDGLELLNKIG 269

Query: 181 LEARKSAEAGYIESGLATPKSLDVDHEQKNIRMVDSTLTDVKKVTKDLLSQTESVLGALM 240
           LEARKSAEAGYIESGLAT KSLDVD EQKNIRMVDSTLTDVKK+TKDLLSQTESVLG+LM
Sbjct: 270 LEARKSAEAGYIESGLATSKSLDVDQEQKNIRMVDSTLTDVKKITKDLLSQTESVLGSLM 329

Query: 241 VLTATISQLNKEAQLIGKKDTKDEVSKKVGEKLGSSGDGSLLDNRNSEEMKALFATAESA 300
           VLTATISQLNKE+Q  GKKDT+DE SKKVGEKLGSSGDGSLLDNRNSEEM+ALFATAESA
Sbjct: 330 VLTATISQLNKESQ-AGKKDTEDEGSKKVGEKLGSSGDGSLLDNRNSEEMRALFATAESA 389

Query: 301 MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFMRKRLVVAFRGTEQSRWK 360
           MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDF R+RLVVAFRGTEQSRWK
Sbjct: 390 MEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQVAIWRDFPRRRLVVAFRGTEQSRWK 449

Query: 361 DLRTDLMLVPAGLNPERISGDFNEEVQVRN----------------IVILLQDTDDCAEP 420
           DLRTDLML PAGLNPERISGDFNEE+QV +                I + +   DDCAEP
Sbjct: 450 DLRTDLMLAPAGLNPERISGDFNEEIQVHSGFLSAYDSVRMRIMSLIKMAINYNDDCAEP 509

Query: 421 PVKWHVYVTGHSLGGALATLLALELSSSQLAR 437
           PVKWHVYVTGHSLGGALATLLALEL+SSQLAR
Sbjct: 510 PVKWHVYVTGHSLGGALATLLALELTSSQLAR 540

BLAST of Tan0001991 vs. TAIR 10
Match: AT4G13550.1 (triglyceride lipases;triglyceride lipases )

HSP 1 Score: 430.6 bits (1106), Expect = 1.5e-120
Identity = 244/477 (51.15%), Positives = 315/477 (66.04%), Query Frame = 0

Query: 1   MGGGGKLLLEIKYRTFDEIEDDKRWWRVPFISEFLRNN-------------GFASALNKV 60
           +GGGGK+ LEIKY+ F E+E++K+WWR PF+SEFL+ N                S L  +
Sbjct: 83  IGGGGKVQLEIKYKGFGEVEEEKKWWRFPFVSEFLQRNEIKSVLKNFVDSEAVESVLKNL 142

Query: 61  VGSDTVPVRQFVEYAFGKLKSFNDEYPSSDHLLSKKKDKEEIPSYMQTNGEVSITDISYP 120
           V S+ VP RQFVEYAFG+LKS ND    +  LL+   +  E  S   ++ +   T++S  
Sbjct: 143 VDSEAVPARQFVEYAFGQLKSLNDAPLKNTELLNNTAEDSEGASSEDSSDQHRSTNLSSS 202

Query: 121 KESNSDEVATSDNTVESGQLLREVTQSILIKQFDKQFWTNLADVTNQHIVKKLGLPAPEK 180
            + + D+    D     G  L +  +S  I Q +  FW N+ D+  Q+IV+KLGLP+PEK
Sbjct: 203 GKLSKDKDGDGDG---HGNELEDDNESGSI-QSESNFWDNIPDIVGQNIVQKLGLPSPEK 262

Query: 181 LKWDGFELLNKIGLEARKSAEAGYIESGLATPKSLDVDHEQKN----IRMVDSTLTDVKK 240
           LKW+G ELL   GL++RK+AEAGYIESGLAT  + + D E+++    I    S+L D+K 
Sbjct: 263 LKWNGTELLENFGLQSRKTAEAGYIESGLATADTREADDEKEDGQVAINASKSSLADMKN 322

Query: 241 VTKDLLSQTESVLGALMVLTATISQLNKEA-------QLIGKKDTKDEVS-KKVGEKLGS 300
            T++LL Q ++V GALMVL A +  L+K++       +  G     D+VS     EK+  
Sbjct: 323 ATQELLKQADNVFGALMVLKAVVPHLSKDSVGSEKVIEKNGSSSVTDDVSGSSKTEKISG 382

Query: 301 SGDGSLLDNRNSEEMKALFATAESAMEAWAMLATSLGHPSFIKSEFEKLCFLDNESTDTQ 360
             +    D +N+EEMK LF++AESAMEAWAMLAT+LGHPSFIKSEFEKLCFL+N+ TDTQ
Sbjct: 383 LVNVDGADEKNAEEMKTLFSSAESAMEAWAMLATALGHPSFIKSEFEKLCFLENDITDTQ 442

Query: 361 VAIWRDFMRKRLVVAFRGTEQSRWKDLRTDLMLVPAGLNPERISGDFNEEVQVRN----- 420
           VAIWRD  RKR+V+AFRGTEQ++WKDL+TDLMLVPAGLNPERI GDF +EVQV +     
Sbjct: 443 VAIWRDARRKRVVIAFRGTEQTKWKDLQTDLMLVPAGLNPERIGGDFKQEVQVHSGFLSA 502

Query: 421 -------IVILLQDT----DDCAEPPVKWHVYVTGHSLGGALATLLALELSSSQLAR 437
                  I+ LL+ T    DD  E   KWHVYVTGHSLGGALATLLALELSSSQLA+
Sbjct: 503 YDSVRIRIISLLKMTIGYIDDVTEREDKWHVYVTGHSLGGALATLLALELSSSQLAK 555

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
Match Name      E-value   Identity  Description
XP_038876505.1  2.9e-219  87.61     uncharacterized protein LOC120068939 [Benincasa hispida]
XP_022155152.1  1.5e-215  86.73     uncharacterized protein LOC111022292 [Momordica charantia]
XP_022933071.1  6.9e-213  86.06     uncharacterized protein LOC111439777 isoform X2 [Cucurbita moschata]
KAG7032442.1    6.9e-213  86.06     faeA, partial [Cucurbita argyrosperma subsp. argyrosperma]
XP_022933070.1  6.9e-213  86.06     uncharacterized protein LOC111439777 isoform X1 [Cucurbita moschata] >KAG6601673...
Match Name      E-value   Identity  Description
A0A6J1DNK8      7.1e-216  86.73     uncharacterized protein LOC111022292 OS=Momordica charantia OX=3673 GN=LOC111022...
A0A6J1EY43      3.3e-213  86.06     uncharacterized protein LOC111439777 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1EYQ8      3.3e-213  86.06     uncharacterized protein LOC111439777 isoform X3 OS=Cucurbita moschata OX=3662 GN...
A0A6J1F3W8      3.3e-213  86.06     uncharacterized protein LOC111439777 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1K066      4.8e-212  85.62     uncharacterized protein LOC111489848 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
Match Name      E-value   Identity  Description
AT4G13550.1     1.5e-120  51.15     triglyceride lipases;triglyceride lipases
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description            Source       Source Term   Source Description     Alignment
IPR029058  Alpha/Beta hydrolase fold  GENE3D       3.40.50.1820  alpha/beta hydrolase   coord: 285..436; e-value: 3.1E-18; score: 67.9
IPR029058  Alpha/Beta hydrolase fold  SUPERFAMILY  53474         alpha/beta-Hydrolases  coord: 317..432
IPR002921  Fungal lipase-like domain  PFAM         PF01764       Lipase_3               coord: 349..433; e-value: 1.3E-6; score: 28.3
None       No IPR available           PANTHER      PTHR47759     OS04G0509100 PROTEIN   coord: 1..435
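
The `coord:` values in the InterPro annotations are 1-based, inclusive residue ranges on the polypeptide, so domain lengths and overlaps follow from simple arithmetic. The sketch below illustrates this under that assumption. The `Span` class and `parse_coord` helper are hypothetical names introduced here for illustration and are not part of any InterPro tooling.

```python
import re
from typing import NamedTuple


class Span(NamedTuple):
    """A 1-based, inclusive residue range (assumed convention for 'coord:')."""
    start: int
    end: int

    @property
    def length(self):
        return self.end - self.start + 1

    def contains(self, other):
        """True if `other` lies entirely inside this span."""
        return self.start <= other.start and other.end <= self.end

    def overlap(self, other):
        """Number of residues shared by the two spans (0 if disjoint)."""
        return max(0, min(self.end, other.end) - max(self.start, other.start) + 1)


def parse_coord(text):
    """Parse a string such as 'coord: 349..433' into a Span."""
    match = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no coordinate range found in {text!r}")
    return Span(int(match.group(1)), int(match.group(2)))


if __name__ == "__main__":
    gene3d = parse_coord("coord: 285..436")       # GENE3D 3.40.50.1820
    superfamily = parse_coord("coord: 317..432")  # SUPERFAMILY 53474
    pfam = parse_coord("coord: 349..433")         # PFAM PF01764 (Lipase_3)

    print(pfam.length)                # residues covered by the Lipase_3 hit
    print(gene3d.contains(pfam))      # is the Pfam hit inside the GENE3D region?
    print(superfamily.overlap(pfam))  # residues shared with the SUPERFAMILY hit
```

With the coordinates listed in the record, the Lipase_3 hit spans 85 residues and lies entirely within the GENE3D alpha/beta hydrolase region.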

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0001991.1  Tan0001991.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006629 lipid metabolic process