Lsi01G020100 (gene) Bottle gourd (USVL1VR-Ls) v1

Overview
NameLsi01G020100
Typegene
OrganismLagenaria siceraria (Bottle gourd (USVL1VR-Ls) v1)
DescriptionTOM1-like protein 5
Locationchr01: 27440455 .. 27461481 (+)
RNA-Seq ExpressionLsi01G020100
SyntenyLsi01G020100
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GAGGAGAGAGGAGAGAAAAAAAAAAAGAGGTTTTGGACAGCGTAGATCCATGGCGGTGCCGTTGCTTTCTTCTATAAGCTTTTTTGCTGAAACGCAACGCCCATTCTCTCACCAAAGGACAGAACGACACACGCATACACTTTTAGAGAGAGAGAGAGAGAGAGATTCTCATTTCTCTTCTGTCGTTCAACTCCAATCTTCTTCCCCATTATTTGCTTCTTTATTATCCTTTCAATTCCCTTCTTAATCATTCATCATTACGCCTACCCACAGAGCTTGCAACCATTTTCTTACCCTTTTTTGCCCCAACCCCATTTCTTCCTTCTTCCTCAAATGCTCATCCTCTTCCTCTTCTAGGGTTTTCAGTTCTTTTCTTTTGTGTAATCTGGTAACTTTAACTCTCTTTTCTGGTTTCTCACCTGTTTTTGAGTCTTGGGTCTTCATGTTGAGATCTGTTTTCCTACTCTTTTCATTTTGATTTAATATCCTCTTCTTTTTCACACATGGGTTTGTACTCTACTGGATTTGAAATCGAGTATCAGCTGTATTATTTTCCTTTTACTTTCATTTCCTGTGTTTTTTTTTTAACTTCTCTTTTTTTTTTCTCCATGTGCTAAATCCTATTATTGTTGTTAATTTGTTTGGTTTTTAAGCTTGGACTTCCTTTTCCCCATTTCAGAGTGGCTTATGGTTCCCACACCGGAAGGCCTTCTTGATTTCTGTCTCTGCTTCTATTGGTGCAGTATGCCTAAATTCAAGATATTGTAGAACATTCTTGGTTTTTGTCTATTCTGACGACTTGTTACTCCCATATTGACCATTCAAGTATTTAGGAGGTTGTTGATCTCTGTTAGTTATTCTTTCGCTCTCTACTTCCTCTTTTTAGTGTCAGTTTTTTCATTTTTTGTATGATAATATGCTCAATAATAGTTGTTCTTGGAAGCATTAGTAAGTGAAATTTCCATATTTTTTACTAAAGAATGACGCACCGCAGAACATAAAGATAAATCTGCAGTTGGTTCTTCCTTTAAAAGACTACGAGGCCTTTCACCATTATGCTCTGTTATGGACAGTTCATTTCAGTGATCGAATGTTGTACTTCCATTGAGTTTTTGCTTTGTCATATGTGATGAATGATAATTAGCTCTGAATTATCTCTTTTCTCTCCATTACCTTGTCTGTTGAGTAGGAACACTGCCTACATTTGGAGTAAAGATGTCAAGACTTTAGATGTAGTATAGTTTCTTCTCTTGCAAGTTTCTGTAGTTAATTGAATTTATTATTATTATTTTTAGATACTTCTTTTAGTGTTGAGGGTTCCTGTAAAAAGTTATCACTTGATTGCCTTAGCTTTTTTAGTAAATATTTTCTTAGTCTTGAAATATTATAGTCATCTTTGACTTTGGAATCTGCCTCTTCTAATCTCACATTTTATATTGCCCTTTTCTTTTTTTTTTTTTTAAAAAAAAACCCAAATTAGGGGTACTATAAAGATGGCTGCTGAATTAGTCAACTCTGCAACAAGTGAGAAACTGGCTGAAACTGATTGGATGAAGAATATCCAAATTTGTGAATTAGTTGCTCATGATCAAAGGTGTGCCATATATATTGTCAAAGTATATTTTCCCACTTTGTTTGGTTTGATAAGAAAGAAACTTATTTGGAGTCTGCCTAAAAAGTATGTGTGGGGTAGACTAGGGTTTTTATTTTTATTATACAAATATATATTTATTTATTTATTTATTTTTATATTTAAGTTCAACAACATATGGGGTGGGAGATTCGAACATCTAACCTCTTGGTGGAGGATATATGCCTTAAGTTCTTAACTATTGAGTTATGCCCAAGTTGGTGAGGATAGATTAGGTTTTATTTAAGTATGCGTTTATGCACTCCCAAGCTAGACGGCTTACATTAGATGGATGGATGACAATATCTCATTATGCCTTTATAATAACTCATTGCCACAAGTTCTAGTCTTGGAAGAGGAGATGGTCTTATTACTCTTAGTTGTAAGATTTGTGCCGACAATTTTGTATTCTATGAGAAATACATTTTATGAACGTGTCATTTTCCTTAGTATGGCTTTCTTTGTTCTATGTGTGCAGGCAAGCTAAAGAGGTCATAAAAGCGATAAAAAAACGATTAGGAAGTAAAAATGCAAATACACAACTTTATGCAGTTTTGGTAAGAAAAGATATTAGTATTGGTACATTAATTCTATTTTATCATTTAATACACGAGCATGTGCTTCCTTTCATGTTTTATGTACCAAATTCACCCTGTTAGGAAAAGGCACGCAAATGAATATTACGCATTTTATCCTGCATCCTTCCAATTTTATTGGATGTCCTCAAAGAGACACTTGTTATAATGCATCATGAGTGAGAAAGGGCGCAAAAGAGCATGTGGATGGTGAGGTGGAACTACAGTATTGGCAATATCCAAACCTTAACATCAAGAATGCAAATCTTTTAAAATATCAAATTTAGACGAAGAGTGAGATTAAAGGAGATAGAAGGTTTTTTCTTCTTTCTCCATATAGCTGGTGACACGTTGAGGAGTATATTTAGGGAAAGCCTGGAAGAACTATAATTTTTTAATCAATGTGTGATGACTACATAGTTTGGCTGGAAATGACCTTCAACAAAAGGGGAGGGGTTTGATTGAGCTTTTGTCCAACTGGTGGTTTAAAAAGAAAGCTAAGTTTTTGTGGAGAAATGCTGGTCGTGCTTTCCTATAGCTTTTATGGAAAGAAAGGAATACCGTTCTTAGATAAGTTTACCTCTTTTATTTTTTTGTTTTGTTTTGTTTTTTGTTTTTTTTTTGAAAATGTATAGTTTACCGCCTTGTCTTGGAGTGCATTACCCAACTCTTCTTGTAATACTTAGCAGTTCCTCATCAAGTTAGACATGGGGTTGTTCCTCGTTAATCCTCCGATTTAGGCTTAGGGAATAACAATTAACCTAGTGTTAGAGATTAGGCTTAGATTAAGGGGTGGGAATATGGTATAATTTTGACCGTTCCTATAAAAAAAATGGTGTAATATAGATTGCAAGTATACAACCTGTAACACTCCCCCTCAAGTTGGCACATAGATGTTTATCATGCCCAACTTGTTACAAAAATAGTCAACCCAAGACCCATTTAGAACCTTTGTAAAAATATTGCCTTGTTGTTAACTCATTCTCACATATCCTGTAGAGATTAATCCTTGTTGCAGTGTTTTCACAAACAAAGTGATAATCATTCTTGATGTGTTGGATTTGGTGCAATGTGAAGGGCTGTTTGGTTGTCACACCATAACCTATCTGACATTGAAGGGTTAAAACCCACTTTCGTGAGAAGATTGTAGATCCATACTATTTCATGCACAAATTTTGCCATTTCTCTATATTTTGATTCAACACTTGACCGAGACGCGACATTCTTTTTTTTTTTTTTGCTTTTCCAAGAGACCAAATTTCCTCTAACAAAGACACAATAATGTAGTAGATCTTCGATCGTCCTTAGGCCCCACCCAATCAACATTCGAGCAGTATGCAGTTTCACTGTGGCCTTAGGTTACTATATAGAATTTCTCGTCCGGAAGCACCTTTTAGGTAACATAATATCTGCTCAACAGCTTTCTAGTGCTCAATAGTAGGATATAACATATATTGACTAACTATACTAACCGAATATGTGATGTCTGGGCGAGTTACAATAAAGTAATTTAGTTTCCCAACCAATCTCCGGTATATTTCAGGTTTTGCAAATAATTCACCATCCTTTGTGAATTGCGAATTAGGGACCATTGAAGTGCTACATGGCTCAGAAGTTTCCATTTTTTGATAATAAATCAAGCACATATTTTCTTTGAGATAAGAAAATCCCTTTGTTGCGTCTCGTTACTTCAACATCTAGAAAATATTTTAATGTTATAGACGGTAGCTAAAACGAGCGGCGTATGGGCCCTCACTTGCCGGCACGTAGGAAGGTGGCGATGCTGGATGGCGGAGTGTGGGCTCCACACGCTGACTTTTCTAGCAAAAAGATTTCGTGACGGCTCTCGGGCTGGTGGCGGTTCTTTCTGTGGAGAAAGTGCCGCCGACAAATGGGAACTTTGGTTCCTGACCCTAACTGAAACCCTAACCCTAACGGAAGATAACTCATACCAACTTCAACCAAAAAAAAAAAAAAAATCCAAGACTCTCTGGTACCATGTAACTCAAACAATAAGAGAGTTTTCTTATACACATTTCAATAGGAATAAGTAGGTATAAATACCAATACAATGTTTCTACATAAGGGAACAACAACTAACCTAGTGCAAGAGATTAGGCTTAAACTAAGGGGTGGGAATATGGTATAATATAGATAGCAATATACAGCCTGTAACAGTCTTCTCAACAATAGAGAATTTGCTAATCAAACCAGAACAAACCTTTTCCATGCTAATTACTAGATCACCTTACAAGGGTCGACTAGATCCTTAGAATTCTGAAGAAATTCATCAACATTTTGAAAAAGACCATCCAACCTTTTTTTGTAACCAAGCTAGGACCAAAATACAATTCTTGGCATTATTTCAATGACGGGGCTGGTTTCCTTTTCAGAAAAAAAAAAAAAAATGCAATTCTTGGCACCTTCCTTAGACAAAAGCGAAATTTCAGCACACCAACCAAAATGCTTAAGACTTTTTTTGAACCCTAAAAAAACCCTTTTCACCTAACAGCAGTCTAAAGAAATGAACATTCCACCTAGCATATGGATGGAAAGAAACCAATGGGCTTTTGAAGATAAATATTGCCCTTGGGTGGACGTCTTTGATTCAGCTTGTCGCAACACCTCACTTTGGTTCTCTTTATCCAAATCTTTAGAGGGATACTCTCTTCAAGATATTGTTTTAAATTGGGCTATCTTTATTGATGCTTAACAGAGCCTTGTTCCTTCTTTTTATTTATTTATTTTCTACTTTTTTTTTTGGTATTGTATTTGTTATTTGTTACTAGTACTTCACTCCCTTGTGGAGTTTGTTTCCCTTGAACTTTAGTCTCTTTTTATTTCATCAATGAAAAGTTCTCTCTCTTGTTCAAAATTAAAAAAAAAAAAAAAAAGAAATGAACATTCCAAAGGTGAGCGAAAGCTTGCTCTATCCACATCATAGGCTTCCGTTGAAGCTTAATAAAGAAAGAATGATCTTTTTTCTTTTCTTGCAAAAGATAAGTGCCCTAGACAGTTATTAAATCACCGACTGACCACGATAAATCACCAATTGACCAAAAGACTTAGAGTCTGTTTGGATTGACTTTTTAAGTGTTTATAAACACTTTTTTACACTTATAAACACTTATTTAAACATTTAGAAAGTCAATCCAAACGCACCCTTTAGCTAATGGATTATGGTAAATTGAATTATATCAAGACTTTATTACCAAATGGAACAAAGAAAGATTTATCGGTGATGCATCAACTATTATCCATGTTTTCCATAGACTATCAAGAAACGATGACGAGAAAGTGCTGGAAAACAAACCAAAAATTATAAGCACACTTGTTTCTGCCCACTGACTTGAATTAGAGAACCAAAAAAGCTAACAAAAAGCATAAAAATTACTGCAGAAGGCTAAAAATGGTGAAATTTGGTAAGGAGGGAGGGAGAAGGAAAGGAAGGAGAAGAGAGAGAGAGGAGAGAGCATTTCCATTCTCAAACGAATATTTGGCGACAATTGCAAAGAAGTTTTTCAAAGAATCACAAAATGTTCCTCCAACCAAAATTGATTACATTATGAGAATAATAGAAAAGGAGCTTGATTGTGAACACCATGGAGAAAATATGTTTCAGACCTTCTTATTGATACGTTGAAGGCTTTGGATGGTTGGAGGAGTATGCTTACTTCAAAAGGGGGAAGAGTCACCCTTGCTCAATCAGTCCTTAGTAGTTTTCCTATAGAGAAGATTATCATAGGCTTCATATGGAATGGTGGAATATACAAACATGGTACAAATTTGGTGAAATGGGAATGGATTGCCTTACCGTGCAACATTGAGGTTTGAGGATTGGGTAAATGTTAGACCACCAATCGTCCATCAATACAAGAGGAGGGGTAAGAAGGTAATTGTGGAGGCTTTCTCACAGAGAGGGAGAAGAGTTGTATTGCCTTAGGGCAAGTAGTATCATATTGTGAGGCAATACCTTACATATCTCACCCATAAACTTCCTTGACACCTAAATGTTGTAGAGGCAGGCGGATTGTCCCGTGAGATTAGTCAAGGTGCGCGTAAGCTGGCCCGGACACTCACGGATATAAAAAAAAAATCTCACTCATAAATATCTCTTCACTTACCAAATGGATTTGAAGATTTGCTCAAGAATGAAACCCCCCCTTTGGAAAAAGGTCATTAGTAGTATTTATGGTGTTAGCTCTCAAGGGTGGATGATGAAAGAACTCAAGAAGAAAAAGGGTGTAGGCCGTTGGTGGATATTACTAAAAATGCTAAAAATGTGACATTTTTTGATCGGTTTGTTAGATCCAAAGCTCATATGGGCAAGCAAATTCGTTTTTGAGAATATATCTGGCTTGGGCAGACCTCTTTACAAACTTGTTATCCAAACCTTTACATTGTTTCTAACAAAAACAGAGCCTCAATTTAAGATTGTTGGGTCTATGTCTTTAGTCATGGGTTGTTTGATAGAGGAGTTTGTTGGGTAGTGATTGTTGATTGGCTGGATATGACTACATTGGTGATGGGTTGGATTGGGTCTTTTGGTCTCCTGAGCGATTTGGGAAGAAATCTACTACATCAGCTTGCTTGAGTCTTTCTAATTCCACTACCAAGCCTAATATTCCATTCACTAAGCTTGTTTGGTCGAGTAAGTGCCCAAAGAAAGTTAAAATCCTTTTGTGGTCGATTTCCTTTAGAAGTCTTAATATTGGTGATGTCCTCTCGAGGAAGTTTAAGCATTGGGCCTTGTCTCCATTGGGTTTTCATCTTTGTTGGAAGGATTTGGAAGATATCAACCATCTTTTTCTTCATTGCACCTTTGCCTATAGAACCTGGTGTTTTGTTCTTGATAAGGTTGGGATTGTGGCTTGCCTTCCTCGAAGGATTGATGATTGGCTGTTTGAAGGGCTTAATGGGTGGAATTGTATGGGTAAAGCTAGAATTTCGGAAAGTTGTGCTTAGAGCCACAATGTGGCATTTGTGGTCAGAAAGAAACGACAGATCCTTTCGAGACAAGTCTCTTCTTTTGGTCTCTTTTTCTAATTTAATGCAAAGTTCAGCCTCGTGGTGGACATCCTTACATTCCTCTTTTTTTTTTTTTGTGTGTGTAATTATAGCCTATGTTGATTATTAATGATTGGAAGGCTCTCTTCATGTAATTTTTGGGAGGGGTTCCCTCTACCTCTGACTCCTAGGGTTGCCTTCTTTGCCCATTTTGAATATAATCTCTCTATTTATTATGAGAAAAAAGAATAGACTGAAACATAAACACTAAACAACAAATGTCAATTAGCACAACTTACAGGAAGAGTTAGGGTTCAAATTATCTTTGGATTGATGGTCAAGTTCATAAGCCCATTCACTTCAAGGAAGAAACACTCCACAAGCTTGATGACCAATTATTTAAGAAAATTTCAAATGTCATTGATCATGATTGTTCAATTACAATTGAGAATAAGAGGCCTAGATAGCCTTGGAGTTGACAATTTCAACAAGTTACAAAAACCATAAAATTTAATAAAGAAGATTTTCAAAACATTTAGGGAAAAAATTCAACTTAAATCATAAAATGAAATAAACTTTTCCTAAATTAAATTTGCTGGAAATTTAAATTCTAATAATGAAAAATCTTTCCTAAATTCCCAAAAACTTACTTAAGAATGTAAAATATGGAAATTAGGGCAAAGGGTCTAGCGACAATGCATATTGGTTGGACAGGGCACTAGTGCATGAAGTGCAAAGGTTGCGTAGGTGTGCACGTCAGCGAATCAGGGGGCGTAGGCGCATGACCATGGGCAGATGCATAAGCGTGCACGGGGATTGTGCTGCTTGAGGTGGGGTGCGCGCGGGATGGGCAGGCGTAGGGCGTCTCAGCACTAGGCAGCAATTTGTGTGGTTGGTGTGCTTAGCCATGATGTGCAAGCGGCATGGGTGCGTAATGCGCACTTTCTGAATTCCACGGGGCCTCCTTATGCTCTTGCTAGCTTCCTTCTCCATCTTTGATGGCTTAATGTGTTTATGGTCTATTAGGATCATATTAGGTGTGTGCTTTGTTGTGGGGTACTTGGAGAGAGAGAAATAATAAAATCTTAAAAGGATTGAGAGGGAATCTTTTGATGTTTGGTCCCTTGTGAGATTTAATGATTCTCTTTGAATGTGGTGATGAGTTTCTTTTGTATTTTTCATTTTTTTCTCAATTAAAGTTGGTCTTTTATTTTAAAAAAATGCCCCTTCATCAATTTTAGTTTCTATTAAATTCATTCTAAGCCCTATAACCCAAGAAGAATTTCCAAATATTCCACATTATCAGCGATACAATCTCAATCTCAACAATTTTTTTTTTGTTTTTTTGTTTTTTTTTTAAAAGGACACAGAAATTTTCATTGATATAATGAAAATATATAATATTACAAGTAGAAGGAAACCAAAAGAGCTTGAGGATCAGACGGTGCACCCGGGCATCTCAACTAGGTTGACACCCCTTAGCACCACATTATTTCCAGGAACAATACAAGCCATAAACTGAATATGCAACTTCCAGAATAACACAATCCTGCTGGAAGTTTTTACGAGAATCATCAAAACTATAAAAAAAGAAAATGAGCTATGCAACAACACCATTACAAAGAATAGGAACATTATACGGGGAAAATAAATGCCCTCCAATTGAGGCTGAGATCTTGAATTGAGAAGTCTTCAAATGCTTTTGTAAGACAGCTCCAAGAAGATGCATTTAAACAAGCTATTTCGTATCGAACTCTCCAATCTGCTTATTTTTCATGAAAAACTCATTGATTCCTCTCGAACCAAACTTCATTTAACATAGCTTTGACCGCATTTGACCATAGAATCGATGCCTTGGAGTTCACGGCTGGCCCTACCAGAATTTGCATAATATTTTCTCAAAAAATATTTCCAAAAACCCAACTAACGTTAAAAATAGCAAACAACCGGTCCCAACAATTTCTAGAGAATGAACATAGAAAGAATAGGTGCTGTAAATCCTCAATTCTAGCAAGACAGAGAGGGCAAATTGAGGGAGAAAGACAGTGCAAGGGCAGTTTCTGTTGCATAACCGAGGAGCAGTTCACTTTTCCAAAAAGTAATATCCAAACAAGGATTTTCACTCTTCTAGGACTTCTAGTCTTCCATAAAGCTTTGCGCACCTGTTTATCTAATGGGGAAGCCACTAATAAATGTCTTGAAAGGGATTTAACCGAGAATGTCCCACTTGCTTCTAAAGTCCACATGCGCCTATCCATATTTCCATTCACTCTTTTGGAAGAAATCAATTTAAGGAGTGATTGAAAATCCACAATCTCTTCATCTTTTAATAAACTTCGAAAGGATAATGACCAAGAAGAGGATAAGGAATCCCAATGGTCTTGAACTGAGCCACTTGGGCTATTTGCTATTCTAAATAGAGTTGGGAAGCTTACTTTTAGAGGTAATTGACTTATCCAAGGGTCGGTGCAAAAGCCAATCTAGCACCCGTTTCCAAGTTTGTAGCTAGCTAGCGTGCTCACCTTTAGCCAAGTGCGAGAGATGCTTATCCAGGGACTTCAAAGGCTAAGATTTGCTTTACCGGCAGTGTGCCCGATTGTATCTTTCTTTCCCATGAATGTTGCGAACAACTTGCCGCCATAATGCATTTTCCTCATTCAAGAATCTCCACCCCCATTTGGCTAAGAGAGCAAGGTTTTTGACTTTTATATTGCTTAAACCGAGGCCTCCATCATTCTGTGAGCGGGTCACTAAGTCCCATTTGACTAAGTGATTCAACTTTCCTCCTTTGTTTCATTCCCAGAAGAACATTCTCATAATACGCTCCAATTTAGCAATTACTTTTTCAGGCATGAGAAATAAAGACATAATAATTGGGAAGATTGGATAGTACTGACTTACAAAGTGTGGCTTGACCTCCTCTTGATAAATTATATCTTCTCCATTTGTCTAGCTTCCTTTAAACTTTATCAATAACTGGTTGCCAGAATGTGACTTGTTTTGGATAAACCCCTAACGACAAACCAAGGTAAATAAAGGGTAACCGTTCTATTTTACAATTTAGCCGAGAGGCCATTGAAACAAGCTTATCTTCTTCCATATTAATGCCACAAAGGGCTGATCTCTCCTAATTCACCTTTTGTCCTGAAATCCATTCAAATAATTCAACTGATTTCTGCAAGGCATTTAGCATAGAGTCATCATCCTGGCAAAATAACAATGTGTCATCTGCAAACTGTAGTATAGAAATATGAATGTTATCCCTTCCCACCATAAAGCCTTCAAAAGAACCTTGTTCATGGATCTTCAAGATGAGAGCATTCAAGGCCTCACTAACTAGAAGAAATAGAATGGGGGATAGGGGATCCCCTTGTCTAAGTCCCCTTGAAGCAAGGATTCTTCCACGTGGTCTACCATTTATGAAAATTGAAGATCTTGGATCTTTAACACAACCCAAAATCCATGTAATCCATTTAGGATGAAAATTTTTCCAAATTAAAATCCAAAATCCATGTAATCTCAATATCAACAATTGATACCGGGATTTCAATTTTACGAATATATCAATGAATGTATTGATAACGTTTACCAAAAAAAAAAATCGATGAGTATATTTATAAAATTATTGGACGATATTTTTATAAAGTAAAAAACTATTTAAATTAATACTTTAGTTATTTTTACTTCCAAACTAAGTTAAATATTTTATTATTAGTATTCATATTCCTATGATGTTTTTTATGTTTTATGTTTATTTTTGAAAGATATTGGCGATGTCGATTACTTCTTAATATCGATGTCGAATCCTTGGATTTATGGATGTATTGACATATAGATAGATATTTCAATCCTTAACCTGGTTGACTCTTTTTATTTTATTCATAATTCAAATTTTCTTGGTTAGGAAAAAAAAAGGAGAAGCAAAGAAAGTTGAGTATTATTATATTTTGAATTTTGATGCTAAATTTGTCCCCAGTACAAAGCATAGCATACTTCACATACTAGTCTGCTGCTCTATTGCTGGACACCACACCATCAATATCTTAATCTTTTGTGCAGTTACTTGAAATGTTGATGAACAATATTGGAGAAGCAATACATAAGCAGGTGATTGATTCAGGAGTTCTCCCGATTCTTGTGAAGATAGTGAAGAAAAAGGTGTGAATTTGTTCTATTTTATTTACAACAAATAGTTGACTACTAGTAATCTTTCTTTAGAAGTATGGTGAACATGGTTATGGATATGACGGCATGTCATATTATAGAGAATTAGAACTTGAATACTACTATTAAAAGTATATCATTTAAATGTATTTGATTATTTCGTACATATCAATAAATTCACACTAGATATTGGATTTTTTTTAAAAAAAAAATTATTATTATTATTTTTAATAATATTAAAAATTTTATAATTTACAAGGCAATATTAAAGAAATGAGGTAATTTCCAATGTTTCAAGAAGTTGAAGAAACCAACAATATATATCCAATGATTCTAGTAAAGCAAAGAAATCCAAATCCAAATAAAAAATTTGTTAATTGTTGTTCTCCCCTCTTAAGGGTATTTGTATCTTTGAGCATTAGTCTCTTTTCACTATATTGTTTCCTTTTGAAAAATAAAAATAAAAATAAACAATTTGTTAAACAATTCAACATCTTGAAAAAGATGAAATTTGAAGTCAATTATTTTTTTTCAATTCTATGGTTAAAAATAAGTTTAAAATGTTTTTCACTTTAAAGTTATTATCTCTCTTACATTTTGTTTTTCTTTTTTGGAATAAGATGCAAACATTCTGTTGCTAGATGAAAGAAAAACAAAAATGTTCAGGGGTACAAACTCTATAAAGAGTAAAAACAGAACTAAAATCAGAAGAAGAAACATTCTATGACTGAGATATGAAGGCACCACAATCAATACGAAGATCAAAGGAAGAGAATTCCATGTTCTTTAGCTCAGTTGAACCTCTCTCTCTCTCTCTTCCTCTTCTTTCCAACGAGAAATGAAAATTTTTCATTGAATTCATGAAGAAAGACTAATGTTCAAAGGATACAAGCTCTACAAAGGAATATTTTGAGCATTAGTCTCTTTTCATTATATCAATGAAAAGTTTCATTTCTTTTTCAAAACAAAAATAAACTCCACATAGGAAAGAAAAGAAAAAAACTAAGAGATCCTCTTTATCAAAGAACCAAAATAACAATTCAACCAGAACACCACCGAAACTAAGAAATCCTCTTCATCAAAGAACCAGAATAACAATCCAACCAAAACACCACCACTATTCCCTACTACAAAGCCGTAACTAAGAAAGTCTTTGATTGTTATTGAACCATAACTTCAAAATGATTGACATGGTAGCAATATCCCAAACCAAGAGCATTCTTAGGGACGGAACAAATCCGATAAAGAAACTTAAAGCTTCTTCCATAGAAAGTCAGCAATGATGAGGCTTCTCCAGAGATGTAAAAGTGCTGAAGATTATTTTCTAGACCTAGGAAAACTTCTGAACCTCACAAAACTCACAACTACCAAAAAAGGCAATAATAAAGACAAACTATCCATTAAAGCAACCTTCAAAAGTTTTCTTGAAAAAATTTTCTCCAACAACGTTTTTCAAAGAGGAAAAAGAATCTAATTTAAATTTTGTGTCTAAGTGTGGTCAAAGTGCCCGTGATCCTAATTAAGGTGGTAGTGCTTCACAGCTTTTGTTAGTGGTATTTTGCTTTCTTTTTTACAATTTCCTGATTTGTCTATTTTTGCAGTCTAACTTACCGGTGCGAGAGAGAATATTTCTTCTCCTAGATGCCACACAGACAGCTCTTGGTGGTGCTTCCGGAAAGTTCCCTCAATATTATTCAGCATATTACGATTTGGTGGTAGGACATAGCCCCATATTTTGCTATATATCTCTTTGATTAGAAGAAGTTTTTGAAGCAAGTGTATATAATCATGTTAAGAGAACTAAATCCTTGTAGTTTGACACTATTTATAGTTCCAAGGGGTGGAAGAAAATTATTAGTATAGTAGATCATAGTGATATTGTATTTTTGAAGAGAGCAGGATTTCACACAATTCCTTGACCACCTATTATATACTTCTCTGTCTTGGTGTAAATTTACATCTTCTTTTTGTGATTACAGTCTTACTTCCATGCCCTTTTGTAATTTCATCCGTCAATGAAATTATTTCGAATGTTTCTCATAAAAAAAAAAAGGAAACAAAGAAGAAAAAAAAAGATATGTTTGGTTGGTTTTAATGGGTTGAAAGACATTGACTTTGAAGAAATGAGTTCAAGGCATGGTGGCCAAATATAGTGAGGCTAGATGGTTGTCTTGTGAGAGTAGTCAAGGTGTGCTAAAGCTGGCTGGGACATTCACAGATATCAAAAAGATAAAAAAAGGGTTGATGCACATTGTTAGTTTCATACAAAATTTATGATTGATATAGGTTGATGCTTGCTAAATAATAGTAGTCCTTATAGGTTCGTAGTGAAAGATATTTTGATTGTGTCTAATCTTCAACTTTGTAGCGTTGGGATTATATTTTTGGGTTTGGTGAAAATGAAGGAAGAGAGGGAAATTGGAAATTGGAGACCAGGTTAAGTGAGTCTAAGATTTCATTCCCACACATTATTGTATATTACATTAATTCCTTTTGTTATATATATATCCGTTCTTCATTTCTTATCCAAACCAAAAAAAGAAAAAAAAAGAAAAAAAGAAAAGAAAGCAAGAAAGCAAGAAAGAAGAAGAAACATTACAGCTTAGCTAATCCCCTGAAGAGTGTCATTGGAAACCCTGGTACTACGTAAAAGAAAAGATTAGGAGCTCCAAACATTGTTGCTGTTTGCTTGCTTTTGGTTTCTTTTGGTTTTTTGGATCTTAGAGCACTAATCTCTTTTCATTTCCTTAATGAAAAGTGTGGTATCATTTTCAAAAAGAGGAACTCCGAACATATGATCTATTTGTTATGATTAAATACAGTAACTATACTTTGAATGAAAAGTGTCTTTTCATTTCCTTGAATACAACACTAATCTCTTTTCATTTCCTTAATGAAAAGTGTCTGAATACTTTGATGAAGGTTTCATTTGTTTCTCAGAGTGCAGGAGTCCAGTTTCCTCAAAGGCCTCCTGCAGTTCCATCAAATAATCCTACCCAGCAGCAAAGTAATACCTCACAAAATGGAGTAATAAGATTATCTGAGCAGGAGGATGTTGCTAGAGTGGAACCTCAGATATTACCAGAATCTAGGTATATCTTGTTGAGCTTATTCTTCTGCACAGTTCTGTCTTGTTAACCACTTGTTTTTCCGTTTTATTTTTGGCATACAGAACTCTAAAGTGTTAAAGATCCTAGAAATCTGTACATGAACTATTTATGGAATTCTAGAATCCTTGTTCATAACCATAATTAAAGGATTTAGACAAGCTTCTCTATAAGAACTAGTCATCCTTATTCGTCCTCCGTGTGTGCATTTGTTTAGGGCCTTTTTCTTCTGAAAGCAACAAATTAGTCTTCACATCTAAAGTTTCATGTTAGATTCAACAACTCAGGTTGACCGTTTATATGTGACGAGGAGAATTGGAAGATAATTAGTATTATTTTAGTTTAATATTTGTTTAGTTGTTAATTTAGATTAATTAGTATTTTAGAACTAATTAGTTAATTGTTTGATTTTAGCCACTTTAGGGTTATAATAGGGAGCTTTTGATTATCCTTTGGATATTAAACTTTTATTCTGGGAGAGTCTACTCTTAACCTTAGGAATGGACTTCCCTTCTTCGGCCATGGACGTTAATACCTTTTCCGCTGCAACAATATGTTTCCTAGTTTCTAATTATTGATTACAAAAGTTTAAGGTTATTTTTTCAAAACAAGAAATAATATTTTTATTAATAAATATTTCTTACCCTTGATCTAAGCCCACATGGCATGCTTCAAGATTGTGGATTATTTTGTTCGTGGTCGTGCTCCGAAAATGAATGTTTTCCATGGAATTTTTTTTCAATCATAGAAAGAAACCACCTTATTTTCTTCATTTTTCTTTTTAGAACTTGAGGACGAGATTTTTCTTTTTGAAGGGATAGTTTTTTTTTTTTTTTTTTTTTTTGGTAAATTCATATTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTGGGTAAATTCATATTAATCTTGGTTAAATCATTACATATCCTTTTTACTTTACTTTTTGAATTTCAGATTTGCTTTTGATCCTCTATAAATAGAGAATTCCTCCTTTTGTATCCTTAACTTTGGATTGATAATGAAGCCTTTAATTTGATTCTTGGAAAATTCTCCCGGTTAATCCCTAGGCTACACTGGTCATAAAAGAATAGCAGTGGAAGAGGTTTTATTCTTGTTTTGGGTAAACTTATAGAAACATAGCAATCCTTATTGCTGATCATGCTTGCTCTTTTTCCAGTTATTATGCTTTTTCTTAGCCTCTTTCTTGCAAGTTAAACCGTTTTCCTCCTTTTAATGTTTTTTATGCAGTATAATTGAAAAGGCTAGCAATGCATTAGAAGTTCTGAAAGAAGTTCTTGATGCCGTTGATCCTCGGCATCCTGAGGTATATTATCTTGCCAGTTTTGATTCTCTCTCTCTATTGGTTGTAAATATGTTAATTAGTTGACTTATTTCTTATTCTAATTCTTTTTTTTGTGTATTGAGCCTCTCTCCTATAGAAGAAGGTTGTGTAACTTACTATTATACTATTGGAATACAAGTATTTTATTCCATTAAAATTAAGATGACATTAGAGCAGGTAAGCTCTAACTGTCCTTAAAACCCTAAAAAAAATCCTCGTCACTGCTGCTAGAGAAACCCTAATTGCTAAGTTGTTGGTATTCTAGCTGCTGATCCTAACTAGTGCAATCTACTTCTGGTCGAATTGACAAGTGAAGGTGTTCATGTTCAAGTGGTCATTTGTTGGAGTGTGTTCGCAAGAGAAGAGGCGTGTGAATGTCTTCATGATCATTGTTGCATAAACCTTTGCCTTTGCCATCTGTTATCGTGCGTCGCTTTTGCTGCCACTCCATCACAGCTTTTGGTCGTGGATGTGTTCGTATTACCGTCTTTCTGTTGTCGTGTGTTGTAGCCATTTGAGATCGTTTGGGTTGATATTGTCATTCATAAGCTTCGTGTTTGTGCTCGACGGTCGACCATCGTGACCGTCATTTGAAGGTTTTTTTTACTGTTGATCATATGTGATGCCGTTCACAATTTTTCATATGTTGGGATTGTATCCAAGTTTCTTCCGTCAAACAAGAACAAAGACTATTGGGGTGTCCTTACTTCATGTCGACATTAGGTTTAAAATATTTTGAATTCGATTTGGTTCAATTTGGTTCAACGTTTGGTTCAGTTCAGTTCAAAATACAGTTGGTTTGATTCTATTTGGTTCACGTCGGTTCGATATGGTTTGGTTCGATTATGGTTTTCTCGCACAATTGGTACATGATTTTCCATTACTCATTGTTTACTACAATGTCAAAAGAGCAAATTCTTCAAATCTTAAAATTGCTCCCAACCAATTTACCATCTAATAGTTCTAATAGTTCTTGTAATCCCCTAGGCACAAACCGGTTATTATCCTCAAGCAATCTCGAGGTGCAATTCTTCTTCATGGATTATCAATTCCAAAGAACAATTCCCTTCTTTCCACCAACTGTTGGGCCCCCAATTGTCAATACATATTCTAGGAGGGATACTAAGGGTATTGGATAGGGGACCACAAGTTAGTCGAATGTGAGAATGTTGTTAGTACATTTGTGGGTATTATTATAGCCGACTGTAGTGTGTCTTTATATTGAATAGAATAGTGGTTCTAGGGTTGAAGAGGTTACGAGCGCTCCTTAATATGTTGGGTTGTCATTGTGTTAGGTGTACATTTCCTTGCTATCGGTGATATTTTGTTAATCGATGCATCGCAAGGCAGTATCCTTACATCATTCTATTTATTGGGTAGACAGCAATGAACATTGCTTCCTTATATTAATATTTTGGAACATGCATGGACAACATTATAGCACATGCTTCTTCAAGTAAATGCTTATTTTTCTTCTAGCAATTCCCTTCTTGAGGAGTATCTTGACAAGTGAACTGATGAACAACCTTTTATCTTTCATAAAGTCACCAAAAAATTCATCAAATATTGTTAGGTACTTAAGCACCTTGGAGAACCACACCCAAAAGCCAGCTATTGAGGTGGGAGAACCAAACCACTTAAGTACCACATTGGTCATCTCATTCTAACCAATGTGGGATAGAGGTAGCCCATACTACCTTGGTTCCTAACAATACCCTATCCACGGAAAGCCGACGTCCCGGCGGCTATTCCGACGGTATTTCACTTGGCCACACCCAGTCAGAACCCTTCGTCGGTTCCGCTTCAAAACGTCAACAAACGGCTTCGATACCATTGTTAGGTACTTAAGCACCTTGGAGAACCACACCCTAAAATCCAGCTATTGAGGTGGGAGAGCCAAGCCACTTAAGTACCACATTGGTCATCCCATTCTAACCAACGTGGGACAAAGGTAGCCCATACTACCTTGGTTCCTAACAAATACTCAACTCCATTATTTGAGTGCAAAGAGCAAATTTTAGTTCAAAATTGGGTTTCAACCATATTTTAAAAATGTTAAGAGTGTCTCACTTCAAATCTCAAAGCCTCAAATGCTTTCTTGCAGCTTCTTTTGGTGTTTGGTCAGTTCCTCCTTGTTGATTGGTTCTATCTTTTCCTCTGTGTTGAAGGTGAAAATTTCCAAGAAGGTTAGTTTTTTTTTATGGCAAGTTATGTGCAGAAGAGTACACCTTGGATCGGGTCTCAGGTAGGGTGACCACCTTGGTTGGACTGTTTTATTGTACTCTTTGTAGGAAGGCGGGGGAGGATCTCGATCATATCCTTTGGAGATGTGATTTTGCAGGTTCCATTTGGAGTTGTTTCTTTGAAGCGATTGGCTTTTGCTTTTCCAGCTTTCAGAGTTGTAGAGGGACAGATTGAGAAGCTCCTTTTCCTTCTGCAGGGCTATTTTCATGCCAAGCTAGGGTGTGCACTATTTTCTGGGACCTTTGGGCGAGAGAAATAATCTAATCTTCAGGGGTTAGGAGAGGTCTTCTAGTGATATTTGGTCCCTTATATTTTATCTCTCTCTAGGCTTCGGTTGCGAGGTCTTTTTGTAATTATTCTTGTTCTCTTATTTTACTAGATTGGAGACCATGTTCTCTAGTTTGGTTCTCTTTTGTAGGTTTTATTTTTGTATGCCATTGTATTTTTCTTTTTTTTTTTTAATCCATGAAAGATTTTCATAAAAAGTCAATTGATATTTTCAACGAAAATAATGAAAGTATTTAGAGAGTTCAGATTTCACATATACGTGTTGAGATTTGAGATAAACATGCACATTTGGTGGAGGGAAAAACCAAAATTGAGTTGTAGTCTAGTTGTTCACAACGGTCATCTGATAAGAGAAGCCATTTGGTTTTTTTTTCTGGTTAAGTCACATAGAGAAGCATACCATTTTAACTGCTTTACTGCTATTTTCCAGGGGGCAAGAGATGAGTTTACCCTTGATCTAGTAGAACAGTGTTCGTTTCAGAAGCAGAAACTAATGCATCTTGTGTTGTCTTCTCGGTATGGTTCCATCTTCGCCTGTTTCTTCCACACTGCTCAGTAATATATTCCATTCAATGAATGGTTGTTTTTTTCCTATCTCGATGCTTTATGAAACAATAACGTACTATGATGAAGTTAATGGAAGGGAGCAGGAAGCCACAAAATATTGTTGATATCATAGTTGGCAATTTTTGTAAATGAAAAACTGTGCAATAATGTTTTTGTTTTTGGAACAACTTTGCGTAAAAATCTCATAGATCGCTCACCCTATTTTAACAATTGTAGATAGCTTTAAGGATGCTGGATTTGACACATCTCTAGATATCCTATAAATTAGTACATTCTTTTGAAAAAAAAAATTGTTTAATTTATAAGAAATAAGTTAAATATACAATGTTGCTACCTTTTTCTAATTTCCTGACCTAAGAACTCTCTGCTATGTGACTGTTGTTTTGAATTTCCCTTATCTTTTCATATATTGTGGTTGTTACTCTTGTTAGCCTGCCACAGACATCCACTATTAATATTGCATGAGATAGTAGTAATTGTATATTCTTTCATTTCTCTCAATAAAGCTGATTTCGTTTTTTTTATAAATAGTAATAAAATAATAATAATATGTGAATTTTTCTCTCCAAATTTCTTGTTTAATTGGAGGATTTTTTTTTTTTTTGCCTATTCTTCATTGTAATGCATGGTTCTCTTGGGATTGTACTTTTTAAGCTTTCAGGCTCTTCGCTTGTATTTTTGTAATTTGCGCAATAGTCTCTTTTCATTTTACCAATGAAAAGTTATGTTTCCTTTATTTAAAAAAATAATAATAAAATAATAATCTACTTAATTGAAGCATACTTGCAAACTTCAGTATCTTGTAATTCTATTGGTCTATCATTCAAATTTCCGGTTGCTTTCACTGTGTTTCAGGGATGAGAAGATTGTCTGTCGTGCCATTGAATTGAATGAGAAGCTCCAAAAGGTCCTTGCAAGACATGATGCCCTCCTCTCGGGTCAGTTTGTGTCAACACAAAATCAGTTCAACGGTGAAGAAGCTGGTATGTCTAGGTTGCCTGACAATCATTATAACCAAGACGAAGGAGAAGATGAAGAAGAGGCCGAGCAACTTTTCCGAAGGTAATACATAAAATCCAATGTTGAGTCCAGTATGGCGTCATGTAAGTATAAGCAAACTAAGTAGTAAGTGATATGGACAATGACAGGCTGCGAAAAGGAAAAGCTTGTGTAAGGCCTGAAGATGAAGAAGATTCTTCAGAGGAGCGACCGTCGTTAGGTTTGCTAGGTTTGTCAATTCCAGTCGAACGAGCAAGCCGTCCGATCATTCGACCTATTACCGAGAAGCTGTCAACAACATCGGATATGCAGCATGGCCAGGGTGTTTCAATACCACCACCACCAGTAAAACATGCAGAAAGGGAGAAGTTCTTTAAGGATAAAAAGATGGATGTTGGAGTTGGGCATATGAGAGGCCTTTCTTTACACAGTCGTAATGCTAGCAGCTCCCGCAGTGGGAGCGTAGATTTCAATGAGTCATGAAGGAATGGTTTGAAGAAGTGTTTGGGAGTGTGCAATTTTTATTTATTTTCACTTCAACTCATTATTTGTAAAGTTGAGTCCTTTTTCTTCTTTTCTTTCTTGGTCTTCTTTCTTTTCAATGAGTTTGATGTCTTTAAATTAATGTGTAAGGGCCAGGGACTTTTGCATATGTCCATTCCATTCGTGTATAATTTTGTAATTTAAAGGTGGTTAACTATTTCAAGTTTGCTCAGGAAGGAAAAAGAAAAAAAGGAAGATTGAAAGACCCTTTTTTTCCCTA

mRNA sequence

GAGGAGAGAGGAGAGAAAAAAAAAAAGAGGTTTTGGACAGCGTAGATCCATGGCGGTGCCGTTGCTTTCTTCTATAAGCTTTTTTGCTGAAACGCAACGCCCATTCTCTCACCAAAGGACAGAACGACACACGCATACACTTTTAGAGAGAGAGAGAGAGAGAGATTCTCATTTCTCTTCTGTCGTTCAACTCCAATCTTCTTCCCCATTATTTGCTTCTTTATTATCCTTTCAATTCCCTTCTTAATCATTCATCATTACGCCTACCCACAGAGCTTGCAACCATTTTCTTACCCTTTTTTGCCCCAACCCCATTTCTTCCTTCTTCCTCAAATGCTCATCCTCTTCCTCTTCTAGGGTTTTCAGTTCTTTTCTTTTGTGTAATCTGAGTGGCTTATGGTTCCCACACCGGAAGGCCTTCTTGATTTCTGTCTCTGCTTCTATTGGTGCAGTATGCCTAAATTCAAGATATTGTAGAACATTCTTGGTTTTTGTCTATTCTGACGACTTGTTACTCCCATATTGACCATTCAAGTATTTAGGAGGGGTACTATAAAGATGGCTGCTGAATTAGTCAACTCTGCAACAAGTGAGAAACTGGCTGAAACTGATTGGATGAAGAATATCCAAATTTGTGAATTAGTTGCTCATGATCAAAGGCAAGCTAAAGAGGTCATAAAAGCGATAAAAAAACGATTAGGAAGTAAAAATGCAAATACACAACTTTATGCAGTTTTGTTACTTGAAATGTTGATGAACAATATTGGAGAAGCAATACATAAGCAGGTGATTGATTCAGGAGTTCTCCCGATTCTTGTGAAGATAGTGAAGAAAAAGTCTAACTTACCGGTGCGAGAGAGAATATTTCTTCTCCTAGATGCCACACAGACAGCTCTTGGTGGTGCTTCCGGAAAGTTCCCTCAATATTATTCAGCATATTACGATTTGGTGGTTTCATTTGTTTCTCAGAGTGCAGGAGTCCAGTTTCCTCAAAGGCCTCCTGCAGTTCCATCAAATAATCCTACCCAGCAGCAAAGTAATACCTCACAAAATGGAGTAATAAGATTATCTGAGCAGGAGGATGTTGCTAGAGTGGAACCTCAGATATTACCAGAATCTAGTTCAGTTCAAAATACAGTTGGTTTGATTCTATTTGGTTCACGTCGGTTCGATATGGGGGCAAGAGATGAGTTTACCCTTGATCTAGTAGAACAGTGTTCGTTTCAGAAGCAGAAACTAATGCATCTTGTGTTGTCTTCTCGGTATGGTTCCATCTTCGCCTGTTTCTTCCACACTGCTCAGGATGAGAAGATTGTCTGTCGTGCCATTGAATTGAATGAGAAGCTCCAAAAGGTCCTTGCAAGACATGATGCCCTCCTCTCGGGTCAGTTTGTGTCAACACAAAATCAGTTCAACGGTGAAGAAGCTGGTATGTCTAGGTTGCCTGACAATCATTATAACCAAGACGAAGGAGAAGATGAAGAAGAGGCCGAGCAACTTTTCCGAAGGCTGCGAAAAGGAAAAGCTTGTGTAAGGCCTGAAGATGAAGAAGATTCTTCAGAGGAGCGACCGTCGTTAGGTTTGCTAGGTTTGTCAATTCCAGTCGAACGAGCAAGCCGTCCGATCATTCGACCTATTACCGAGAAGCTGTCAACAACATCGGATATGCAGCATGGCCAGGGTGTTTCAATACCACCACCACCAGTAAAACATGCAGAAAGGGAGAAGTTCTTTAAGGATAAAAAGATGGATGTTGGAGTTGGGCATATGAGAGGCCTTTCTTTACACAGTCGTAATGCTAGCAGCTCCCGCAGTGGGAGCGTAGATTTCAATGAGTCATGAAGGAATGGTTTGAAGAAGTGTTTGGGAGTGTGCAATTTTTATTTATTTTCACTTCAACTCATTATTTGTAAAGTTGAGTCCTTTTTCTTCTTTTCTTTCTTGGTCTTCTTTCTTTTCAATGAGTTTGATGTCTTTAAATTAATGTGTAAGGGCCAGGGACTTTTGCATATGTCCATTCCATTCGTGTATAATTTTGTAATTTAAAGGTGGTTAACTATTTCAAGTTTGCTCAGGAAGGAAAAAGAAAAAAAGGAAGATTGAAAGACCCTTTTTTTCCCTA

Coding sequence (CDS)

ATGGCTGCTGAATTAGTCAACTCTGCAACAAGTGAGAAACTGGCTGAAACTGATTGGATGAAGAATATCCAAATTTGTGAATTAGTTGCTCATGATCAAAGGCAAGCTAAAGAGGTCATAAAAGCGATAAAAAAACGATTAGGAAGTAAAAATGCAAATACACAACTTTATGCAGTTTTGTTACTTGAAATGTTGATGAACAATATTGGAGAAGCAATACATAAGCAGGTGATTGATTCAGGAGTTCTCCCGATTCTTGTGAAGATAGTGAAGAAAAAGTCTAACTTACCGGTGCGAGAGAGAATATTTCTTCTCCTAGATGCCACACAGACAGCTCTTGGTGGTGCTTCCGGAAAGTTCCCTCAATATTATTCAGCATATTACGATTTGGTGGTTTCATTTGTTTCTCAGAGTGCAGGAGTCCAGTTTCCTCAAAGGCCTCCTGCAGTTCCATCAAATAATCCTACCCAGCAGCAAAGTAATACCTCACAAAATGGAGTAATAAGATTATCTGAGCAGGAGGATGTTGCTAGAGTGGAACCTCAGATATTACCAGAATCTAGTTCAGTTCAAAATACAGTTGGTTTGATTCTATTTGGTTCACGTCGGTTCGATATGGGGGCAAGAGATGAGTTTACCCTTGATCTAGTAGAACAGTGTTCGTTTCAGAAGCAGAAACTAATGCATCTTGTGTTGTCTTCTCGGTATGGTTCCATCTTCGCCTGTTTCTTCCACACTGCTCAGGATGAGAAGATTGTCTGTCGTGCCATTGAATTGAATGAGAAGCTCCAAAAGGTCCTTGCAAGACATGATGCCCTCCTCTCGGGTCAGTTTGTGTCAACACAAAATCAGTTCAACGGTGAAGAAGCTGGTATGTCTAGGTTGCCTGACAATCATTATAACCAAGACGAAGGAGAAGATGAAGAAGAGGCCGAGCAACTTTTCCGAAGGCTGCGAAAAGGAAAAGCTTGTGTAAGGCCTGAAGATGAAGAAGATTCTTCAGAGGAGCGACCGTCGTTAGGTTTGCTAGGTTTGTCAATTCCAGTCGAACGAGCAAGCCGTCCGATCATTCGACCTATTACCGAGAAGCTGTCAACAACATCGGATATGCAGCATGGCCAGGGTGTTTCAATACCACCACCACCAGTAAAACATGCAGAAAGGGAGAAGTTCTTTAAGGATAAAAAGATGGATGTTGGAGTTGGGCATATGAGAGGCCTTTCTTTACACAGTCGTAATGCTAGCAGCTCCCGCAGTGGGAGCGTAGATTTCAATGAGTCATGA

Protein sequence

MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGSKNANTQLYAVLLLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSNLPVRERIFLLLDATQTALGGASGKFPQYYSAYYDLVVSFVSQSAGVQFPQRPPAVPSNNPTQQQSNTSQNGVIRLSEQEDVARVEPQILPESSSVQNTVGLILFGSRRFDMGARDEFTLDLVEQCSFQKQKLMHLVLSSRYGSIFACFFHTAQDEKIVCRAIELNEKLQKVLARHDALLSGQFVSTQNQFNGEEAGMSRLPDNHYNQDEGEDEEEAEQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERASRPIIRPITEKLSTTSDMQHGQGVSIPPPPVKHAEREKFFKDKKMDVGVGHMRGLSLHSRNASSSRSGSVDFNES
Homology
BLAST of Lsi01G020100 vs. ExPASy Swiss-Prot
Match: Q9FFQ0 (TOM1-like protein 5 OS=Arabidopsis thaliana OX=3702 GN=TOL5 PE=1 SV=1)

HSP 1 Score: 414.1 bits (1063), Expect = 2.0e-114
Identity = 253/469 (53.94%), Postives = 311/469 (66.31%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGSKNANTQLYAVL 60
           MAAELV+SATSEKLA+ DW KNI+ICEL A D+RQAK+VIKAIKKRLGSKN NTQLYAV 
Sbjct: 1   MAAELVSSATSEKLADVDWAKNIEICELAARDERQAKDVIKAIKKRLGSKNPNTQLYAVQ 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSNLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGE IHKQVID+GVLP LVKIVKKKS+LPVRERIFLLLDATQT+LGGASGKF
Sbjct: 61  LLEMLMNNIGENIHKQVIDTGVLPTLVKIVKKKSDLPVRERIFLLLDATQTSLGGASGKF 120

Query: 121 PQYYSAYYDLVVSFVSQSAGVQFPQRP---------PAVPSNNPTQQQSNTSQNGVIRLS 180
           PQYY+AYY+LV      +AGV+F QRP          AVP N   +Q ++    G     
Sbjct: 121 PQYYTAYYELV------NAGVKFTQRPNATPVVVTAQAVPRNTLNEQLASARNEGPATTQ 180

Query: 181 EQEDVARVEPQILPESSSVQNTVGLILFG-SRRFDMGARDEFTLDLVEQCSFQKQKLMHL 240
           ++E  +     IL ++S+    +  +L     +   GA+DEFTLDLVEQCSFQK+++MHL
Sbjct: 181 QRESQSVSPSSILQKASTALEILKEVLDAVDSQNPEGAKDEFTLDLVEQCSFQKERVMHL 240

Query: 241 VLSSRYGSIFACFFHTAQDEKIVCRAIELNEKLQKVLARHDALLSGQFV-----STQNQF 300
           V++SR             DEK V +AIELNE+LQ++L RH+ LLSG+       +T N +
Sbjct: 241 VMTSR-------------DEKAVSKAIELNEQLQRILNRHEDLLSGRITVPSRSTTSNGY 300

Query: 301 -----------NGEEAGMSRLPD------------NHYNQDEGEDEEEAEQLFRRLRKGK 360
                      NG++    +  +             H   +E ++EEE EQLFRRLRKGK
Sbjct: 301 HSNLEPVRPISNGDQKRELKASNANTESSSFISNRAHLKLEEEDEEEEPEQLFRRLRKGK 360

Query: 361 ACVRPEDEEDSSEERPSLGLLGLSIPVERASRPIIRPITEKLSTTSDMQHGQG--VSIPP 420
           A  RPEDEE+ S   P  GL G +I  ER +RP+IRP+  + ++     H Q   V IPP
Sbjct: 361 ARARPEDEEEPS---PPQGLPGSAIHNERLNRPLIRPLPSEEASRGGDSHSQSPPVVIPP 420

Query: 421 PPVKHAEREKFFKDKKMDVGV---GHMRGLSLHSRNASSSRSGSVDFNE 427
           PP KH EREKFFK+ K D  +   GHMRGLSLHSR+ SSSRSGSVDF++
Sbjct: 421 PPAKHVEREKFFKENKGDGALGLPGHMRGLSLHSRDGSSSRSGSVDFSD 447

BLAST of Lsi01G020100 vs. ExPASy Swiss-Prot
Match: Q6NQK0 (TOM1-like protein 4 OS=Arabidopsis thaliana OX=3702 GN=TOL4 PE=1 SV=1)

HSP 1 Score: 185.3 bits (469), Expect = 1.5e-45
Identity = 139/429 (32.40%), Postives = 208/429 (48.48%), Query Frame = 0

Query: 2   AAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGSKNANTQLYAVLL 61
           AA     AT++ L   DW  NI++C+L+  D  QAKE +K +KKRLGSKN+  Q+ A+  
Sbjct: 5   AAACAERATNDMLIGPDWAINIELCDLINMDPSQAKEAVKVLKKRLGSKNSKVQILALYA 64

Query: 62  LEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSNLPVRERIFLLLDATQTALGGASGKFP 121
           LE L  N GE +++ +ID G+L  +VKIVKKK  L VRE+I  LLD  Q A GG  G++P
Sbjct: 65  LETLSKNCGENVYQLIIDRGLLNDMVKIVKKKPELNVREKILTLLDTWQEAFGGRGGRYP 124

Query: 122 QYYSAYYDLVVSFVSQSAGVQFPQRPPAVPSNNPTQQQSNTSQNGVIRLSEQEDVARVEP 181
           QYY+AY DL      +SAG++FP R  +  S   T  Q+   ++  I+ S Q D A    
Sbjct: 125 QYYNAYNDL------RSAGIEFPPRTESSLSFF-TPPQTQPDEDAAIQASLQGDDA--SS 184

Query: 182 QILPESSSVQNTVGLILFGSRRFDMG----ARDEFTLDLVEQCSFQKQKLMHLVLSSRYG 241
             L E  S + +V +++      D G     ++E  +DLVEQC   ++++M LV      
Sbjct: 185 LSLEEIQSAEGSVDVLMDMLGAHDPGNPESLKEEVIVDLVEQCRTYQRRVMTLV------ 244

Query: 242 SIFACFFHTAQDEKIVCRAIELNEKLQKVLARHDALLSGQFVSTQNQFNGEEAGMSRLPD 301
                  +T  DE+++C+ + LN+ LQ VL RHD + +   V +  +       +  +  
Sbjct: 245 -------NTTTDEELLCQGLALNDNLQHVLQRHDDIANVGSVPSNGRNTRAPPPVQIVDI 304

Query: 302 NHYNQDEGEDEEEAEQLFRRLR----------KGKACVRPEDEEDSSEERPSLGLLGLSI 361
           NH ++D+  D+E A    R              G   +   D         S G+     
Sbjct: 305 NHDDEDDESDDEFARLAHRSSTPTRRPVHGSDSGMVDILSGDVYKPQGNSSSQGVKKPPP 364

Query: 362 PVERASRPIIRPITEKLSTTSDMQHGQGVSIPPPPVKHAEREKFFKDKKMDVG-----VG 412
           P    S     P+ +  S           ++PPPP +H +R++FF+      G      G
Sbjct: 365 PPPHTSSSSSSPVFDDASPQQSKSSEVIRNLPPPPSRHNQRQQFFEHHHSSSGSDSSYEG 411

BLAST of Lsi01G020100 vs. ExPASy Swiss-Prot
Match: Q9LPL6 (TOM1-like protein 3 OS=Arabidopsis thaliana OX=3702 GN=TOL3 PE=1 SV=1)

HSP 1 Score: 173.7 bits (439), Expect = 4.4e-42
Identity = 117/335 (34.93%), Postives = 180/335 (53.73%), Query Frame = 0

Query: 2   AAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGSKNANTQLYAVLL 61
           AA     AT++ L   DW  NI++C+++  +  QAKE +K +KKRLGSKN+  Q+ A+  
Sbjct: 5   AAACAERATNDMLIGPDWAINIELCDIINMEPSQAKEAVKVLKKRLGSKNSKVQILALYA 64

Query: 62  LEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSNLPVRERIFLLLDATQTALGGASGKFP 121
           LE L  N GE++++ ++D  +LP +VKIVKKK +L VRE+I  LLD  Q A GG+ G+FP
Sbjct: 65  LETLSKNCGESVYQLIVDRDILPDMVKIVKKKPDLTVREKILSLLDTWQEAFGGSGGRFP 124

Query: 122 QYYSAYYDLVVSFVSQSAGVQFPQR-PPAVPSNNPTQ------QQSNTSQNGVIRLSEQE 181
           QYY+AY +L      +SAG++FP R   +VP   P Q      Q + + ++  I+ S Q 
Sbjct: 125 QYYNAYNEL------RSAGIEFPPRTESSVPFFTPPQTQPIVAQATASDEDAAIQASLQS 184

Query: 182 DVARVEPQILPESSSVQNTVGLILFGSRRFD----MGARDEFTLDLVEQCSFQKQKLMHL 241
           D A      + E  S Q +V ++       D     G ++E  +DLVEQC   ++++M L
Sbjct: 185 DDA--SALSMEEIQSAQGSVDVLTDMLGALDPSHPEGLKEELIVDLVEQCRTYQRRVMAL 244

Query: 242 VLSSRYGSIFACFFHTAQDEKIVCRAIELNEKLQKVLARHDALLSGQFVSTQNQFNGEEA 301
           V             +T  DE+++C+ + LN+ LQ+VL  HD    G  V           
Sbjct: 245 V-------------NTTSDEELMCQGLALNDNLQRVLQHHDDKAKGNSVPA--------T 304

Query: 302 GMSRLPDNHYNQDEGEDEEEAE--QLFRRLRKGKA 324
             + +P    N D+ +DE + +  QL  R ++  A
Sbjct: 305 APTPIPLVSINHDDDDDESDDDFLQLAHRSKRESA 310

BLAST of Lsi01G020100 vs. ExPASy Swiss-Prot
Match: Q9C9Y1 (TOM1-like protein 8 OS=Arabidopsis thaliana OX=3702 GN=TOL8 PE=2 SV=1)

HSP 1 Score: 172.6 bits (436), Expect = 9.9e-42
Identity = 127/416 (30.53%), Postives = 202/416 (48.56%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGSKNANTQLYAVL 60
           M   LV+ ATS+ L   DW  N++IC+++ H+  Q +EV+  IKKRL S+ +  QL A+ 
Sbjct: 1   MVHPLVDRATSDMLIGPDWAMNLEICDMLNHEPGQTREVVSGIKKRLTSRTSKVQLLALT 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSNLPVRERIFLLLDATQTALGGASGKF 120
           LLE ++ N GE IH QV +  +L  +VK+ K+K N+ V+E+I +L+D  Q +  G  G+ 
Sbjct: 61  LLETIITNCGELIHMQVAEKDILHKMVKMAKRKPNIQVKEKILILIDTWQESFSGPQGRH 120

Query: 121 PQYYSAYYDLVVSFVSQSAGVQFPQRPPAVPSNNPTQQQSNTSQNGVIRLSEQEDVARVE 180
           PQYY+AY +L+       AG+ FPQRP   PS+      +   QN   R + QE +    
Sbjct: 121 PQYYAAYQELL------RAGIVFPQRPQITPSSGQNGPSTRYPQNS--RNARQEAIDTST 180

Query: 181 PQILPESS--SVQNTVGLI---------LFGSRRFDMGARDEFTLDLVEQCSFQKQKLMH 240
               P  S   +QN  G++         + G+ +   G + E  +DLV QC   KQ+++H
Sbjct: 181 ESEFPTLSLTEIQNARGIMDVLAEMMNAIDGNNK--EGLKQEVVVDLVSQCRTYKQRVVH 240

Query: 241 LVLSSRYGSIFACFFHTAQDEKIVCRAIELNEKLQKVLARHDALLSGQFVSTQNQFNGEE 300
           LV             ++  DE ++C+ + LN+ LQ++LA+H+A+ SG      +    EE
Sbjct: 241 LV-------------NSTSDESMLCQGLALNDDLQRLLAKHEAIASG-----NSMIKKEE 300

Query: 301 AGMSRLP-DNHYNQDEGEDEEEAEQLFRRLRKG-KACVRPEDEEDSSEERPSLGLLGLSI 360
                +P D     D G  E +   +      G K  +   D+ ++     SL L+ L  
Sbjct: 301 KSKKEVPKDTTQIIDVGSSETKNGSVVAYTTNGPKIDLLSGDDFETPNADNSLALVPLGP 360

Query: 361 PVERASRPIIRPITEKLSTTSDMQHGQGVSIPPPPVKHAEREKFFKDKKMDVGVGH 404
           P  + S P+ +P    +     +      S  P    HA  +K  ++     G GH
Sbjct: 361 P--QPSSPVAKP-DNSIVLIDMLSDNNCESSTPTSNPHANHQKVQQNYSNGFGPGH 385

BLAST of Lsi01G020100 vs. ExPASy Swiss-Prot
Match: Q8L860 (TOM1-like protein 9 OS=Arabidopsis thaliana OX=3702 GN=TOL9 PE=1 SV=1)

HSP 1 Score: 162.5 bits (410), Expect = 1.0e-38
Identity = 115/328 (35.06%), Postives = 173/328 (52.74%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGSKNANTQLYAVL 60
           M   +V  ATSE L   DW  N++IC+++  D  QAK+V+K IKKR+GS+N   QL A+ 
Sbjct: 1   MVNAMVERATSEMLIGPDWAMNLEICDMLNSDPAQAKDVVKGIKKRIGSRNPKAQLLALT 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSNLPVRERIFLLLDATQTALGGASGKF 120
           LLE ++ N G+ +H  V + GV+  +V+IVKKK +  V+E+I +L+D  Q A GG   ++
Sbjct: 61  LLETIVKNCGDMVHMHVAEKGVIHEMVRIVKKKPDFHVKEKILVLIDTWQEAFGGPRARY 120

Query: 121 PQYYSAYYDLVVSFVSQSAGVQFPQRPP-AVPSNNPTQQQSNTSQNGVIR-LSEQEDVAR 180
           PQYY+ Y +L+       AG  FPQR   + P   P Q Q  TS    +R      DV  
Sbjct: 121 PQYYAGYQELL------RAGAVFPQRSERSAPVFTPPQTQPLTSYPPNLRNAGPGNDVP- 180

Query: 181 VEPQILPE-----SSSVQNTVGLI---------LFGSRRFDMGARDEFTLDLVEQCSFQK 240
            EP   PE      S +QN  G++         L    + D+  + E  +DLVEQC   K
Sbjct: 181 -EPSAEPEFPTLSLSEIQNAKGIMDVLAEMLSALEPGNKEDL--KQEVMVDLVEQCRTYK 240

Query: 241 QKLMHLVLSSRYGSIFACFFHTAQDEKIVCRAIELNEKLQKVLARHDALLSG-QFVSTQN 300
           Q+++HLV             ++  DE ++C+ + LN+ LQ+VL  ++A+ SG    S+Q 
Sbjct: 241 QRVVHLV-------------NSTSDESLLCQGLALNDDLQRVLTNYEAIASGLPGTSSQI 300

Query: 301 QFNGEEAGMSRLPDNHYNQDEGEDEEEA 312
           +    E G S +  +    D G+   +A
Sbjct: 301 EKPKSETGKSLVDVDGPLIDTGDSSNQA 305

BLAST of Lsi01G020100 vs. ExPASy TrEMBL
Match: A0A6J1D6L6 (TOM1-like protein 5 OS=Momordica charantia OX=3673 GN=LOC111017804 PE=3 SV=1)

HSP 1 Score: 667.5 bits (1721), Expect = 3.6e-188
Identity = 358/434 (82.49%), Postives = 374/434 (86.18%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGSKNANTQLYAVL 60
           MAAELVNSATSEKLAETDWMKNI+ICELVA DQRQAK+V+KAIKKR+GSK ANTQLYAVL
Sbjct: 1   MAAELVNSATSEKLAETDWMKNIEICELVARDQRQAKDVVKAIKKRIGSKTANTQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSNLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGE IHKQVIDSGVLPILVK VKKKSNLPVRERIFLLLDATQTALGGASGKF
Sbjct: 61  LLEMLMNNIGEPIHKQVIDSGVLPILVKTVKKKSNLPVRERIFLLLDATQTALGGASGKF 120

Query: 121 PQYYSAYYDLVVSFVSQSAGVQFPQRPPAVPSNNPTQQQSNTSQNGVIRLSEQEDVARVE 180
           PQYYSAYYDLV      SAGVQFPQRPP+VP NNPTQQ +NT QNGVIRLSEQE  ARVE
Sbjct: 121 PQYYSAYYDLV------SAGVQFPQRPPSVPPNNPTQQHNNTLQNGVIRLSEQEGAARVE 180

Query: 181 PQILPESSSVQNTVGLILFGSRRFDM-------GARDEFTLDLVEQCSFQKQKLMHLVLS 240
           PQILPESS ++     +       D        GARDEFTLDLVEQCSFQKQ+LMHLVLS
Sbjct: 181 PQILPESSIIEKAGNALEVLKEVLDAVNPRNPEGARDEFTLDLVEQCSFQKQRLMHLVLS 240

Query: 241 SRYGSIFACFFHTAQDEKIVCRAIELNEKLQKVLARHDALLSGQFVSTQNQFNGEEAGMS 300
           SR             DEKIVCRAIELNEKLQKVLARHDALLSGQF+STQN F+ E+ G S
Sbjct: 241 SR-------------DEKIVCRAIELNEKLQKVLARHDALLSGQFISTQNNFSDEDTGRS 300

Query: 301 RLPDNHYNQDEGEDEEEAEQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERAS 360
           RLP NH N DEGEDEEEAEQLFRRLRKGKACVRPEDEEDSSEERPSLG LGLSIPVERA+
Sbjct: 301 RLPANHCNHDEGEDEEEAEQLFRRLRKGKACVRPEDEEDSSEERPSLGSLGLSIPVERAN 360

Query: 361 RPIIRPITEKLSTTSDMQHGQGVSIPPPPVKHAEREKFFKDKKMDVGVGHMRGLSLHSRN 420
           RPIIRPI EK STTSDMQHGQGVSIPPPPVKHAEREKFFKDKKMDVG GHMRGLSLHSRN
Sbjct: 361 RPIIRPINEKASTTSDMQHGQGVSIPPPPVKHAEREKFFKDKKMDVGSGHMRGLSLHSRN 415

Query: 421 ASSSRSGSVDFNES 428
           ASSSRSGS+DF+ES
Sbjct: 421 ASSSRSGSIDFSES 415

BLAST of Lsi01G020100 vs. ExPASy TrEMBL
Match: A0A1S3BMS2 (target of Myb protein 1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103491258 PE=3 SV=1)

HSP 1 Score: 622.1 bits (1603), Expect = 1.7e-174
Identity = 336/404 (83.17%), Postives = 353/404 (87.38%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGSKNANTQLYAVL 60
           MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAK+VIKAIKKRLG+KNANTQLYAVL
Sbjct: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKDVIKAIKKRLGNKNANTQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSNLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKS+LPVRERIFLLLDATQTALGGASGKF
Sbjct: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF 120

Query: 121 PQYYSAYYDLVVSFVSQSAGVQFPQRPPAVPSNNPTQ-QQSNTSQNGVIRLSEQEDVARV 180
           PQYYSAYYDLV      SAGVQFPQRPPAV SN+PTQ Q +NTSQNG+IRLSEQE+VARV
Sbjct: 121 PQYYSAYYDLV------SAGVQFPQRPPAVSSNSPTQLQTNNTSQNGIIRLSEQENVARV 180

Query: 181 EPQILPESSSVQNTVGLILFGSRRFDM-------GARDEFTLDLVEQCSFQKQKLMHLVL 240
           EPQILPESS ++     +       D        GARDEFTLDLVEQCSFQKQKLMHLVL
Sbjct: 181 EPQILPESSIIEKAGNALEVLKEVLDAVDPQHPEGARDEFTLDLVEQCSFQKQKLMHLVL 240

Query: 241 SSRYGSIFACFFHTAQDEKIVCRAIELNEKLQKVLARHDALLSGQFVSTQNQFNGEEAGM 300
           SSR             DEKIVC AIELNEKLQKVLARHDALLSGQF+STQNQFNGEE GM
Sbjct: 241 SSR-------------DEKIVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGM 300

Query: 301 SRLPDNHYNQDEGEDEEEAEQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERA 360
           SRLP NHYN DEGEDEEEAEQL+RRLRKGKACV PEDEEDSSEERPSLGLLGLSIPVERA
Sbjct: 301 SRLPANHYNHDEGEDEEEAEQLYRRLRKGKACVMPEDEEDSSEERPSLGLLGLSIPVERA 360

Query: 361 SRPIIRPITEKLSTTSDMQHGQGVSIPPPPVKHAEREKFFKDKK 397
           +RPIIRPI EK+STT ++QHGQGV IPPPPVKHAEREKFFK+KK
Sbjct: 361 NRPIIRPIDEKVSTTLEIQHGQGVGIPPPPVKHAEREKFFKEKK 385

BLAST of Lsi01G020100 vs. ExPASy TrEMBL
Match: A0A6J1EH71 (TOM1-like protein 5 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111432500 PE=3 SV=1)

HSP 1 Score: 622.1 bits (1603), Expect = 1.7e-174
Identity = 345/435 (79.31%), Postives = 361/435 (82.99%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGSKNANTQLYAVL 60
           MA+ELVNSATSEKLAETDWMKNI+ICELVA DQRQAK+VIKAIKKR+GSKN NTQLYAVL
Sbjct: 1   MASELVNSATSEKLAETDWMKNIEICELVARDQRQAKDVIKAIKKRIGSKNTNTQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSNLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNN+GE IHKQVIDSGVLP LVKIVKKKSNLPVRERIFLLLDATQTALGGASGKF
Sbjct: 61  LLEMLMNNVGETIHKQVIDSGVLPSLVKIVKKKSNLPVRERIFLLLDATQTALGGASGKF 120

Query: 121 PQYYSAYYDLVVSFVSQSAGVQFPQRPPAVPSNNPTQQQSNTSQNGVIRLSEQEDVARVE 180
           PQYY AYYDLV      SAGVQFPQRPPA PS+N TQQ +N  QNGVIRLSEQED A VE
Sbjct: 121 PQYYQAYYDLV------SAGVQFPQRPPATPSDNATQQHTNNLQNGVIRLSEQEDAASVE 180

Query: 181 PQILPESSSVQNTVGLILFGSRRFDM-------GARDEFTLDLVEQCSFQKQKLMHLVLS 240
           PQ LPESS ++     +       D        GARDEFTLDLVEQCSFQKQKLMHLVLS
Sbjct: 181 PQTLPESSIIEKASNALEILKEVLDAVDPQRPEGARDEFTLDLVEQCSFQKQKLMHLVLS 240

Query: 241 SRYGSIFACFFHTAQDEKIVCRAIELNEKLQKVLARHDALLSGQFVSTQNQFNGEEAGMS 300
           SR             DEKIVCRAIELNEKLQKVL RHDALLSGQF+ST NQFNGEE G  
Sbjct: 241 SR-------------DEKIVCRAIELNEKLQKVLERHDALLSGQFMSTHNQFNGEEVG-- 300

Query: 301 RLPDNHYNQDEGEDEEEAEQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERAS 360
           RL  NHYNQDEGED EEAEQLFRRLRKGKAC+RPEDE  SS ERPSLG LGLSIPVERA+
Sbjct: 301 RLRANHYNQDEGED-EEAEQLFRRLRKGKACIRPEDENGSS-ERPSLGSLGLSIPVERAN 360

Query: 361 RPIIRPITEKLSTTSDMQHGQGVSIPPPPVKHAEREKFFKDKKMDVGVG-HMRGLSLHSR 420
           RPIIRPI EK+STTSDMQ GQGV IPPPPVKHAEREKFFKDKK   GV  HMRGLSLHSR
Sbjct: 361 RPIIRPIDEKVSTTSDMQQGQGVVIPPPPVKHAEREKFFKDKKTGGGVNEHMRGLSLHSR 412

Query: 421 NASSSRSGSVDFNES 428
           NASSSRSGS+D +ES
Sbjct: 421 NASSSRSGSIDLSES 412

BLAST of Lsi01G020100 vs. ExPASy TrEMBL
Match: A0A6J1HRT0 (TOM1-like protein 5 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111466022 PE=3 SV=1)

HSP 1 Score: 613.6 bits (1581), Expect = 6.2e-172
Identity = 341/435 (78.39%), Postives = 358/435 (82.30%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGSKNANTQLYAVL 60
           MA+ELVNSATSEKLAETDWMKNI+ICELVA DQRQAK+VIKAIKKR+GSKN NTQLYAVL
Sbjct: 1   MASELVNSATSEKLAETDWMKNIEICELVARDQRQAKDVIKAIKKRIGSKNTNTQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSNLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNN+GE IHKQVI+SGVLP+LVKIVKKKSNLPVRERIFLLLDATQTALGGASGKF
Sbjct: 61  LLEMLMNNVGEPIHKQVIESGVLPVLVKIVKKKSNLPVRERIFLLLDATQTALGGASGKF 120

Query: 121 PQYYSAYYDLVVSFVSQSAGVQFPQRPPAVPSNNPTQQQSNTSQNGVIRLSEQEDVARVE 180
           PQYY AYYDLV      SAGVQFPQRPPA PS+N  QQ +N  QNGV RLSEQED A VE
Sbjct: 121 PQYYQAYYDLV------SAGVQFPQRPPATPSDNAIQQHTNNLQNGVKRLSEQEDAASVE 180

Query: 181 PQILPESSSVQNTVGLILFGSRRFDM-------GARDEFTLDLVEQCSFQKQKLMHLVLS 240
           PQ LPESS ++     +       D        GARDEFTLDLVEQCSFQKQKLMHLVLS
Sbjct: 181 PQALPESSIIEKASNALEILKEVLDAVDPQRPEGARDEFTLDLVEQCSFQKQKLMHLVLS 240

Query: 241 SRYGSIFACFFHTAQDEKIVCRAIELNEKLQKVLARHDALLSGQFVSTQNQFNGEEAGMS 300
            R             DEKIVCRAIELNEKLQKVL RHDALLSGQF+ST NQFNGEE G S
Sbjct: 241 FR-------------DEKIVCRAIELNEKLQKVLERHDALLSGQFMSTHNQFNGEEVGRS 300

Query: 301 RLPDNHYNQDEGEDEEEAEQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERAS 360
           R   NHYNQDEGED EEAEQLFRRLRKGKACVRPEDE  SS ERPSLG LGLSIPVERA+
Sbjct: 301 RA--NHYNQDEGED-EEAEQLFRRLRKGKACVRPEDENGSS-ERPSLGSLGLSIPVERAN 360

Query: 361 RPIIRPITEKLSTTSDMQHGQGVSIPPPPVKHAEREKFFKDKKMDVGVG-HMRGLSLHSR 420
           RPIIRPI EK+STTSDMQ GQGV IPPPPVKHAEREKFFKDKK   GV  HMRGLSLHSR
Sbjct: 361 RPIIRPIDEKVSTTSDMQQGQGVVIPPPPVKHAEREKFFKDKKTGGGVSEHMRGLSLHSR 412

Query: 421 NASSSRSGSVDFNES 428
           NASSS SGS+D +ES
Sbjct: 421 NASSSGSGSIDLSES 412

BLAST of Lsi01G020100 vs. ExPASy TrEMBL
Match: A0A1S4DXH7 (target of Myb protein 1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103491258 PE=3 SV=1)

HSP 1 Score: 582.0 bits (1499), Expect = 2.0e-162
Identity = 319/404 (78.96%), Postives = 336/404 (83.17%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGSKNANTQLYAVL 60
           MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAK+VIKAIKKRLG+KNANTQLYAVL
Sbjct: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKDVIKAIKKRLGNKNANTQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSNLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGEAIHKQ                 S+LPVRERIFLLLDATQTALGGASGKF
Sbjct: 61  LLEMLMNNIGEAIHKQ-----------------SDLPVRERIFLLLDATQTALGGASGKF 120

Query: 121 PQYYSAYYDLVVSFVSQSAGVQFPQRPPAVPSNNPTQ-QQSNTSQNGVIRLSEQEDVARV 180
           PQYYSAYYDLV      SAGVQFPQRPPAV SN+PTQ Q +NTSQNG+IRLSEQE+VARV
Sbjct: 121 PQYYSAYYDLV------SAGVQFPQRPPAVSSNSPTQLQTNNTSQNGIIRLSEQENVARV 180

Query: 181 EPQILPESSSVQNTVGLILFGSRRFDM-------GARDEFTLDLVEQCSFQKQKLMHLVL 240
           EPQILPESS ++     +       D        GARDEFTLDLVEQCSFQKQKLMHLVL
Sbjct: 181 EPQILPESSIIEKAGNALEVLKEVLDAVDPQHPEGARDEFTLDLVEQCSFQKQKLMHLVL 240

Query: 241 SSRYGSIFACFFHTAQDEKIVCRAIELNEKLQKVLARHDALLSGQFVSTQNQFNGEEAGM 300
           SSR             DEKIVC AIELNEKLQKVLARHDALLSGQF+STQNQFNGEE GM
Sbjct: 241 SSR-------------DEKIVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGM 300

Query: 301 SRLPDNHYNQDEGEDEEEAEQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERA 360
           SRLP NHYN DEGEDEEEAEQL+RRLRKGKACV PEDEEDSSEERPSLGLLGLSIPVERA
Sbjct: 301 SRLPANHYNHDEGEDEEEAEQLYRRLRKGKACVMPEDEEDSSEERPSLGLLGLSIPVERA 360

Query: 361 SRPIIRPITEKLSTTSDMQHGQGVSIPPPPVKHAEREKFFKDKK 397
           +RPIIRPI EK+STT ++QHGQGV IPPPPVKHAEREKFFK+KK
Sbjct: 361 NRPIIRPIDEKVSTTLEIQHGQGVGIPPPPVKHAEREKFFKEKK 368

BLAST of Lsi01G020100 vs. NCBI nr
Match: XP_038882891.1 (TOM1-like protein 5 [Benincasa hispida] >XP_038882892.1 TOM1-like protein 5 [Benincasa hispida] >XP_038882893.1 TOM1-like protein 5 [Benincasa hispida])

HSP 1 Score: 686.4 bits (1770), Expect = 1.6e-193
Identity = 373/435 (85.75%), Postives = 383/435 (88.05%), Query Frame = 0

Query: 2   AAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGSKNANTQLYAVLL 61
           AAELVNSATSEKL ETDWMKNIQ+CELVAHDQRQAKEVIKAIKKRLGSKNANTQLYAVLL
Sbjct: 3   AAELVNSATSEKLTETDWMKNIQVCELVAHDQRQAKEVIKAIKKRLGSKNANTQLYAVLL 62

Query: 62  LEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSNLPVRERIFLLLDATQTALGGASGKFP 121
           LEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSNLPVRERIFLLLDATQTALGGASGKFP
Sbjct: 63  LEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSNLPVRERIFLLLDATQTALGGASGKFP 122

Query: 122 QYYSAYYDLVVSFVSQSAGVQFPQRPPAVPSNNPT-QQQSNTSQNGVIRLSEQEDVARVE 181
           QYYSAYYDLV      SAGVQFPQRP AVP NNPT QQQ+NTSQNGVIRLSE+EDVAR+E
Sbjct: 123 QYYSAYYDLV------SAGVQFPQRPSAVPLNNPTSQQQNNTSQNGVIRLSEKEDVARLE 182

Query: 182 PQILPESSSVQNTVGLILFGSRRFDM-------GARDEFTLDLVEQCSFQKQKLMHLVLS 241
           PQILPESS  +     +       D        GARDEFTLDLVEQCSFQKQKLMHLVLS
Sbjct: 183 PQILPESSITEKASNALEVLKEVLDAVDPRRPEGARDEFTLDLVEQCSFQKQKLMHLVLS 242

Query: 242 SRYGSIFACFFHTAQDEKIVCRAIELNEKLQKVLARHDALLSGQFVSTQNQFNGEEAGMS 301
           SR             DEKIVCRAIELNEKLQKVLARHDALLSGQF+STQNQ NGEE GMS
Sbjct: 243 SR-------------DEKIVCRAIELNEKLQKVLARHDALLSGQFMSTQNQLNGEEVGMS 302

Query: 302 RLPDNHYNQDEGEDEEEAEQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERAS 361
           RLP NHYNQDEGEDEEEAEQLFRRLRKGKAC RPEDEEDSSEERPSLGLLGLSIPVERA+
Sbjct: 303 RLPANHYNQDEGEDEEEAEQLFRRLRKGKACARPEDEEDSSEERPSLGLLGLSIPVERAN 362

Query: 362 RPIIRPITEKLSTTSDMQHGQGVSIPPPPVKHAEREKFFKDKKMDVGV-GHMRGLSLHSR 421
           RPIIRPITEKLST SD+QHGQGVSIPPPPVKHAEREKFFKDKKMDVGV GHMR LSLHSR
Sbjct: 363 RPIIRPITEKLSTASDVQHGQGVSIPPPPVKHAEREKFFKDKKMDVGVGGHMRSLSLHSR 418

Query: 422 NASSSRSGSVDFNES 428
           NASSSRSGS+DFNES
Sbjct: 423 NASSSRSGSIDFNES 418

BLAST of Lsi01G020100 vs. NCBI nr
Match: XP_004142659.1 (TOM1-like protein 5 isoform X1 [Cucumis sativus])

HSP 1 Score: 682.2 bits (1759), Expect = 2.9e-192
Identity = 369/435 (84.83%), Postives = 385/435 (88.51%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGSKNANTQLYAVL 60
           MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLG+KNAN QLYAVL
Sbjct: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSNLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKS+LPVRERIFLLLDATQTALGGASGKF
Sbjct: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF 120

Query: 121 PQYYSAYYDLVVSFVSQSAGVQFPQRPPAVPSNNPTQQQ-SNTSQNGVIRLSEQEDVARV 180
           PQYYSAYYDLV      SAGVQFPQRPPAV SN+PTQQQ +NTSQNGVIRLSEQE+VARV
Sbjct: 121 PQYYSAYYDLV------SAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARV 180

Query: 181 EPQILPESSSVQNTVGLILFGSRRFDM-------GARDEFTLDLVEQCSFQKQKLMHLVL 240
           EPQIL ESS ++     +       D        GARDEFTLDLVEQCSFQKQKLMHLVL
Sbjct: 181 EPQILSESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVL 240

Query: 241 SSRYGSIFACFFHTAQDEKIVCRAIELNEKLQKVLARHDALLSGQFVSTQNQFNGEEAGM 300
           SSR             DEKIVC AIELNEKLQKVLARHDALLSGQF+STQNQFNGEE GM
Sbjct: 241 SSR-------------DEKIVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGM 300

Query: 301 SRLPDNHYNQDEGEDEEEAEQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERA 360
           SRLP NHYN DEGEDEEEA+QLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERA
Sbjct: 301 SRLPANHYNHDEGEDEEEADQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERA 360

Query: 361 SRPIIRPITEKLSTTSDMQHGQGVSIPPPPVKHAEREKFFKDKKMDVGVGHMRGLSLHSR 420
           +RPIIRPI EK+STT ++QHGQGVSIPPPPVKHAEREKFFKDKK+DVGVGHMRGLSLHSR
Sbjct: 361 NRPIIRPIDEKVSTTLEIQHGQGVSIPPPPVKHAEREKFFKDKKIDVGVGHMRGLSLHSR 416

Query: 421 NASSSRSGSVDFNES 428
           NASSSRSGS+DFNES
Sbjct: 421 NASSSRSGSIDFNES 416

BLAST of Lsi01G020100 vs. NCBI nr
Match: XP_022149373.1 (TOM1-like protein 5 [Momordica charantia])

HSP 1 Score: 667.5 bits (1721), Expect = 7.5e-188
Identity = 358/434 (82.49%), Postives = 374/434 (86.18%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGSKNANTQLYAVL 60
           MAAELVNSATSEKLAETDWMKNI+ICELVA DQRQAK+V+KAIKKR+GSK ANTQLYAVL
Sbjct: 1   MAAELVNSATSEKLAETDWMKNIEICELVARDQRQAKDVVKAIKKRIGSKTANTQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSNLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGE IHKQVIDSGVLPILVK VKKKSNLPVRERIFLLLDATQTALGGASGKF
Sbjct: 61  LLEMLMNNIGEPIHKQVIDSGVLPILVKTVKKKSNLPVRERIFLLLDATQTALGGASGKF 120

Query: 121 PQYYSAYYDLVVSFVSQSAGVQFPQRPPAVPSNNPTQQQSNTSQNGVIRLSEQEDVARVE 180
           PQYYSAYYDLV      SAGVQFPQRPP+VP NNPTQQ +NT QNGVIRLSEQE  ARVE
Sbjct: 121 PQYYSAYYDLV------SAGVQFPQRPPSVPPNNPTQQHNNTLQNGVIRLSEQEGAARVE 180

Query: 181 PQILPESSSVQNTVGLILFGSRRFDM-------GARDEFTLDLVEQCSFQKQKLMHLVLS 240
           PQILPESS ++     +       D        GARDEFTLDLVEQCSFQKQ+LMHLVLS
Sbjct: 181 PQILPESSIIEKAGNALEVLKEVLDAVNPRNPEGARDEFTLDLVEQCSFQKQRLMHLVLS 240

Query: 241 SRYGSIFACFFHTAQDEKIVCRAIELNEKLQKVLARHDALLSGQFVSTQNQFNGEEAGMS 300
           SR             DEKIVCRAIELNEKLQKVLARHDALLSGQF+STQN F+ E+ G S
Sbjct: 241 SR-------------DEKIVCRAIELNEKLQKVLARHDALLSGQFISTQNNFSDEDTGRS 300

Query: 301 RLPDNHYNQDEGEDEEEAEQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERAS 360
           RLP NH N DEGEDEEEAEQLFRRLRKGKACVRPEDEEDSSEERPSLG LGLSIPVERA+
Sbjct: 301 RLPANHCNHDEGEDEEEAEQLFRRLRKGKACVRPEDEEDSSEERPSLGSLGLSIPVERAN 360

Query: 361 RPIIRPITEKLSTTSDMQHGQGVSIPPPPVKHAEREKFFKDKKMDVGVGHMRGLSLHSRN 420
           RPIIRPI EK STTSDMQHGQGVSIPPPPVKHAEREKFFKDKKMDVG GHMRGLSLHSRN
Sbjct: 361 RPIIRPINEKASTTSDMQHGQGVSIPPPPVKHAEREKFFKDKKMDVGSGHMRGLSLHSRN 415

Query: 421 ASSSRSGSVDFNES 428
           ASSSRSGS+DF+ES
Sbjct: 421 ASSSRSGSIDFSES 415

BLAST of Lsi01G020100 vs. NCBI nr
Match: XP_011653749.1 (TOM1-like protein 5 isoform X2 [Cucumis sativus] >KAE8649605.1 hypothetical protein Csa_012667 [Cucumis sativus])

HSP 1 Score: 642.5 bits (1656), Expect = 2.6e-180
Identity = 352/435 (80.92%), Postives = 368/435 (84.60%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGSKNANTQLYAVL 60
           MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLG+KNAN QLYAVL
Sbjct: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGNKNANAQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSNLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGEAIHKQ                 S+LPVRERIFLLLDATQTALGGASGKF
Sbjct: 61  LLEMLMNNIGEAIHKQ-----------------SDLPVRERIFLLLDATQTALGGASGKF 120

Query: 121 PQYYSAYYDLVVSFVSQSAGVQFPQRPPAVPSNNPTQQQ-SNTSQNGVIRLSEQEDVARV 180
           PQYYSAYYDLV      SAGVQFPQRPPAV SN+PTQQQ +NTSQNGVIRLSEQE+VARV
Sbjct: 121 PQYYSAYYDLV------SAGVQFPQRPPAVSSNSPTQQQINNTSQNGVIRLSEQENVARV 180

Query: 181 EPQILPESSSVQNTVGLILFGSRRFDM-------GARDEFTLDLVEQCSFQKQKLMHLVL 240
           EPQIL ESS ++     +       D        GARDEFTLDLVEQCSFQKQKLMHLVL
Sbjct: 181 EPQILSESSIIEKAGNALEVLKEVLDAVDPRHPEGARDEFTLDLVEQCSFQKQKLMHLVL 240

Query: 241 SSRYGSIFACFFHTAQDEKIVCRAIELNEKLQKVLARHDALLSGQFVSTQNQFNGEEAGM 300
           SSR             DEKIVC AIELNEKLQKVLARHDALLSGQF+STQNQFNGEE GM
Sbjct: 241 SSR-------------DEKIVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGM 300

Query: 301 SRLPDNHYNQDEGEDEEEAEQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERA 360
           SRLP NHYN DEGEDEEEA+QLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERA
Sbjct: 301 SRLPANHYNHDEGEDEEEADQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERA 360

Query: 361 SRPIIRPITEKLSTTSDMQHGQGVSIPPPPVKHAEREKFFKDKKMDVGVGHMRGLSLHSR 420
           +RPIIRPI EK+STT ++QHGQGVSIPPPPVKHAEREKFFKDKK+DVGVGHMRGLSLHSR
Sbjct: 361 NRPIIRPIDEKVSTTLEIQHGQGVSIPPPPVKHAEREKFFKDKKIDVGVGHMRGLSLHSR 399

Query: 421 NASSSRSGSVDFNES 428
           NASSSRSGS+DFNES
Sbjct: 421 NASSSRSGSIDFNES 399

BLAST of Lsi01G020100 vs. NCBI nr
Match: XP_008449359.2 (PREDICTED: target of Myb protein 1 isoform X1 [Cucumis melo] >XP_016900684.1 PREDICTED: target of Myb protein 1 isoform X1 [Cucumis melo])

HSP 1 Score: 622.1 bits (1603), Expect = 3.6e-174
Identity = 336/404 (83.17%), Postives = 353/404 (87.38%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGSKNANTQLYAVL 60
           MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAK+VIKAIKKRLG+KNANTQLYAVL
Sbjct: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKDVIKAIKKRLGNKNANTQLYAVL 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSNLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKS+LPVRERIFLLLDATQTALGGASGKF
Sbjct: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSDLPVRERIFLLLDATQTALGGASGKF 120

Query: 121 PQYYSAYYDLVVSFVSQSAGVQFPQRPPAVPSNNPTQ-QQSNTSQNGVIRLSEQEDVARV 180
           PQYYSAYYDLV      SAGVQFPQRPPAV SN+PTQ Q +NTSQNG+IRLSEQE+VARV
Sbjct: 121 PQYYSAYYDLV------SAGVQFPQRPPAVSSNSPTQLQTNNTSQNGIIRLSEQENVARV 180

Query: 181 EPQILPESSSVQNTVGLILFGSRRFDM-------GARDEFTLDLVEQCSFQKQKLMHLVL 240
           EPQILPESS ++     +       D        GARDEFTLDLVEQCSFQKQKLMHLVL
Sbjct: 181 EPQILPESSIIEKAGNALEVLKEVLDAVDPQHPEGARDEFTLDLVEQCSFQKQKLMHLVL 240

Query: 241 SSRYGSIFACFFHTAQDEKIVCRAIELNEKLQKVLARHDALLSGQFVSTQNQFNGEEAGM 300
           SSR             DEKIVC AIELNEKLQKVLARHDALLSGQF+STQNQFNGEE GM
Sbjct: 241 SSR-------------DEKIVCGAIELNEKLQKVLARHDALLSGQFMSTQNQFNGEEVGM 300

Query: 301 SRLPDNHYNQDEGEDEEEAEQLFRRLRKGKACVRPEDEEDSSEERPSLGLLGLSIPVERA 360
           SRLP NHYN DEGEDEEEAEQL+RRLRKGKACV PEDEEDSSEERPSLGLLGLSIPVERA
Sbjct: 301 SRLPANHYNHDEGEDEEEAEQLYRRLRKGKACVMPEDEEDSSEERPSLGLLGLSIPVERA 360

Query: 361 SRPIIRPITEKLSTTSDMQHGQGVSIPPPPVKHAEREKFFKDKK 397
           +RPIIRPI EK+STT ++QHGQGV IPPPPVKHAEREKFFK+KK
Sbjct: 361 NRPIIRPIDEKVSTTLEIQHGQGVGIPPPPVKHAEREKFFKEKK 385

BLAST of Lsi01G020100 vs. TAIR 10
Match: AT5G63640.1 (ENTH/VHS/GAT family protein )

HSP 1 Score: 414.1 bits (1063), Expect = 1.4e-115
Identity = 253/469 (53.94%), Postives = 311/469 (66.31%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGSKNANTQLYAVL 60
           MAAELV+SATSEKLA+ DW KNI+ICEL A D+RQAK+VIKAIKKRLGSKN NTQLYAV 
Sbjct: 1   MAAELVSSATSEKLADVDWAKNIEICELAARDERQAKDVIKAIKKRLGSKNPNTQLYAVQ 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSNLPVRERIFLLLDATQTALGGASGKF 120
           LLEMLMNNIGE IHKQVID+GVLP LVKIVKKKS+LPVRERIFLLLDATQT+LGGASGKF
Sbjct: 61  LLEMLMNNIGENIHKQVIDTGVLPTLVKIVKKKSDLPVRERIFLLLDATQTSLGGASGKF 120

Query: 121 PQYYSAYYDLVVSFVSQSAGVQFPQRP---------PAVPSNNPTQQQSNTSQNGVIRLS 180
           PQYY+AYY+LV      +AGV+F QRP          AVP N   +Q ++    G     
Sbjct: 121 PQYYTAYYELV------NAGVKFTQRPNATPVVVTAQAVPRNTLNEQLASARNEGPATTQ 180

Query: 181 EQEDVARVEPQILPESSSVQNTVGLILFG-SRRFDMGARDEFTLDLVEQCSFQKQKLMHL 240
           ++E  +     IL ++S+    +  +L     +   GA+DEFTLDLVEQCSFQK+++MHL
Sbjct: 181 QRESQSVSPSSILQKASTALEILKEVLDAVDSQNPEGAKDEFTLDLVEQCSFQKERVMHL 240

Query: 241 VLSSRYGSIFACFFHTAQDEKIVCRAIELNEKLQKVLARHDALLSGQFV-----STQNQF 300
           V++SR             DEK V +AIELNE+LQ++L RH+ LLSG+       +T N +
Sbjct: 241 VMTSR-------------DEKAVSKAIELNEQLQRILNRHEDLLSGRITVPSRSTTSNGY 300

Query: 301 -----------NGEEAGMSRLPD------------NHYNQDEGEDEEEAEQLFRRLRKGK 360
                      NG++    +  +             H   +E ++EEE EQLFRRLRKGK
Sbjct: 301 HSNLEPVRPISNGDQKRELKASNANTESSSFISNRAHLKLEEEDEEEEPEQLFRRLRKGK 360

Query: 361 ACVRPEDEEDSSEERPSLGLLGLSIPVERASRPIIRPITEKLSTTSDMQHGQG--VSIPP 420
           A  RPEDEE+ S   P  GL G +I  ER +RP+IRP+  + ++     H Q   V IPP
Sbjct: 361 ARARPEDEEEPS---PPQGLPGSAIHNERLNRPLIRPLPSEEASRGGDSHSQSPPVVIPP 420

Query: 421 PPVKHAEREKFFKDKKMDVGV---GHMRGLSLHSRNASSSRSGSVDFNE 427
           PP KH EREKFFK+ K D  +   GHMRGLSLHSR+ SSSRSGSVDF++
Sbjct: 421 PPAKHVEREKFFKENKGDGALGLPGHMRGLSLHSRDGSSSRSGSVDFSD 447

BLAST of Lsi01G020100 vs. TAIR 10
Match: AT1G76970.1 (Target of Myb protein 1 )

HSP 1 Score: 185.3 bits (469), Expect = 1.0e-46
Identity = 139/429 (32.40%), Postives = 208/429 (48.48%), Query Frame = 0

Query: 2   AAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGSKNANTQLYAVLL 61
           AA     AT++ L   DW  NI++C+L+  D  QAKE +K +KKRLGSKN+  Q+ A+  
Sbjct: 5   AAACAERATNDMLIGPDWAINIELCDLINMDPSQAKEAVKVLKKRLGSKNSKVQILALYA 64

Query: 62  LEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSNLPVRERIFLLLDATQTALGGASGKFP 121
           LE L  N GE +++ +ID G+L  +VKIVKKK  L VRE+I  LLD  Q A GG  G++P
Sbjct: 65  LETLSKNCGENVYQLIIDRGLLNDMVKIVKKKPELNVREKILTLLDTWQEAFGGRGGRYP 124

Query: 122 QYYSAYYDLVVSFVSQSAGVQFPQRPPAVPSNNPTQQQSNTSQNGVIRLSEQEDVARVEP 181
           QYY+AY DL      +SAG++FP R  +  S   T  Q+   ++  I+ S Q D A    
Sbjct: 125 QYYNAYNDL------RSAGIEFPPRTESSLSFF-TPPQTQPDEDAAIQASLQGDDA--SS 184

Query: 182 QILPESSSVQNTVGLILFGSRRFDMG----ARDEFTLDLVEQCSFQKQKLMHLVLSSRYG 241
             L E  S + +V +++      D G     ++E  +DLVEQC   ++++M LV      
Sbjct: 185 LSLEEIQSAEGSVDVLMDMLGAHDPGNPESLKEEVIVDLVEQCRTYQRRVMTLV------ 244

Query: 242 SIFACFFHTAQDEKIVCRAIELNEKLQKVLARHDALLSGQFVSTQNQFNGEEAGMSRLPD 301
                  +T  DE+++C+ + LN+ LQ VL RHD + +   V +  +       +  +  
Sbjct: 245 -------NTTTDEELLCQGLALNDNLQHVLQRHDDIANVGSVPSNGRNTRAPPPVQIVDI 304

Query: 302 NHYNQDEGEDEEEAEQLFRRLR----------KGKACVRPEDEEDSSEERPSLGLLGLSI 361
           NH ++D+  D+E A    R              G   +   D         S G+     
Sbjct: 305 NHDDEDDESDDEFARLAHRSSTPTRRPVHGSDSGMVDILSGDVYKPQGNSSSQGVKKPPP 364

Query: 362 PVERASRPIIRPITEKLSTTSDMQHGQGVSIPPPPVKHAEREKFFKDKKMDVG-----VG 412
           P    S     P+ +  S           ++PPPP +H +R++FF+      G      G
Sbjct: 365 PPPHTSSSSSSPVFDDASPQQSKSSEVIRNLPPPPSRHNQRQQFFEHHHSSSGSDSSYEG 411

BLAST of Lsi01G020100 vs. TAIR 10
Match: AT1G21380.1 (Target of Myb protein 1 )

HSP 1 Score: 173.7 bits (439), Expect = 3.2e-43
Identity = 117/335 (34.93%), Postives = 180/335 (53.73%), Query Frame = 0

Query: 2   AAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGSKNANTQLYAVLL 61
           AA     AT++ L   DW  NI++C+++  +  QAKE +K +KKRLGSKN+  Q+ A+  
Sbjct: 5   AAACAERATNDMLIGPDWAINIELCDIINMEPSQAKEAVKVLKKRLGSKNSKVQILALYA 64

Query: 62  LEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSNLPVRERIFLLLDATQTALGGASGKFP 121
           LE L  N GE++++ ++D  +LP +VKIVKKK +L VRE+I  LLD  Q A GG+ G+FP
Sbjct: 65  LETLSKNCGESVYQLIVDRDILPDMVKIVKKKPDLTVREKILSLLDTWQEAFGGSGGRFP 124

Query: 122 QYYSAYYDLVVSFVSQSAGVQFPQR-PPAVPSNNPTQ------QQSNTSQNGVIRLSEQE 181
           QYY+AY +L      +SAG++FP R   +VP   P Q      Q + + ++  I+ S Q 
Sbjct: 125 QYYNAYNEL------RSAGIEFPPRTESSVPFFTPPQTQPIVAQATASDEDAAIQASLQS 184

Query: 182 DVARVEPQILPESSSVQNTVGLILFGSRRFD----MGARDEFTLDLVEQCSFQKQKLMHL 241
           D A      + E  S Q +V ++       D     G ++E  +DLVEQC   ++++M L
Sbjct: 185 DDA--SALSMEEIQSAQGSVDVLTDMLGALDPSHPEGLKEELIVDLVEQCRTYQRRVMAL 244

Query: 242 VLSSRYGSIFACFFHTAQDEKIVCRAIELNEKLQKVLARHDALLSGQFVSTQNQFNGEEA 301
           V             +T  DE+++C+ + LN+ LQ+VL  HD    G  V           
Sbjct: 245 V-------------NTTSDEELMCQGLALNDNLQRVLQHHDDKAKGNSVPA--------T 304

Query: 302 GMSRLPDNHYNQDEGEDEEEAE--QLFRRLRKGKA 324
             + +P    N D+ +DE + +  QL  R ++  A
Sbjct: 305 APTPIPLVSINHDDDDDESDDDFLQLAHRSKRESA 310

BLAST of Lsi01G020100 vs. TAIR 10
Match: AT3G08790.1 (ENTH/VHS/GAT family protein )

HSP 1 Score: 172.6 bits (436), Expect = 7.0e-43
Identity = 127/416 (30.53%), Postives = 202/416 (48.56%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGSKNANTQLYAVL 60
           M   LV+ ATS+ L   DW  N++IC+++ H+  Q +EV+  IKKRL S+ +  QL A+ 
Sbjct: 1   MVHPLVDRATSDMLIGPDWAMNLEICDMLNHEPGQTREVVSGIKKRLTSRTSKVQLLALT 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSNLPVRERIFLLLDATQTALGGASGKF 120
           LLE ++ N GE IH QV +  +L  +VK+ K+K N+ V+E+I +L+D  Q +  G  G+ 
Sbjct: 61  LLETIITNCGELIHMQVAEKDILHKMVKMAKRKPNIQVKEKILILIDTWQESFSGPQGRH 120

Query: 121 PQYYSAYYDLVVSFVSQSAGVQFPQRPPAVPSNNPTQQQSNTSQNGVIRLSEQEDVARVE 180
           PQYY+AY +L+       AG+ FPQRP   PS+      +   QN   R + QE +    
Sbjct: 121 PQYYAAYQELL------RAGIVFPQRPQITPSSGQNGPSTRYPQNS--RNARQEAIDTST 180

Query: 181 PQILPESS--SVQNTVGLI---------LFGSRRFDMGARDEFTLDLVEQCSFQKQKLMH 240
               P  S   +QN  G++         + G+ +   G + E  +DLV QC   KQ+++H
Sbjct: 181 ESEFPTLSLTEIQNARGIMDVLAEMMNAIDGNNK--EGLKQEVVVDLVSQCRTYKQRVVH 240

Query: 241 LVLSSRYGSIFACFFHTAQDEKIVCRAIELNEKLQKVLARHDALLSGQFVSTQNQFNGEE 300
           LV             ++  DE ++C+ + LN+ LQ++LA+H+A+ SG      +    EE
Sbjct: 241 LV-------------NSTSDESMLCQGLALNDDLQRLLAKHEAIASG-----NSMIKKEE 300

Query: 301 AGMSRLP-DNHYNQDEGEDEEEAEQLFRRLRKG-KACVRPEDEEDSSEERPSLGLLGLSI 360
                +P D     D G  E +   +      G K  +   D+ ++     SL L+ L  
Sbjct: 301 KSKKEVPKDTTQIIDVGSSETKNGSVVAYTTNGPKIDLLSGDDFETPNADNSLALVPLGP 360

Query: 361 PVERASRPIIRPITEKLSTTSDMQHGQGVSIPPPPVKHAEREKFFKDKKMDVGVGH 404
           P  + S P+ +P    +     +      S  P    HA  +K  ++     G GH
Sbjct: 361 P--QPSSPVAKP-DNSIVLIDMLSDNNCESSTPTSNPHANHQKVQQNYSNGFGPGH 385

BLAST of Lsi01G020100 vs. TAIR 10
Match: AT4G32760.1 (ENTH/VHS/GAT family protein )

HSP 1 Score: 162.5 bits (410), Expect = 7.3e-40
Identity = 115/328 (35.06%), Postives = 173/328 (52.74%), Query Frame = 0

Query: 1   MAAELVNSATSEKLAETDWMKNIQICELVAHDQRQAKEVIKAIKKRLGSKNANTQLYAVL 60
           M   +V  ATSE L   DW  N++IC+++  D  QAK+V+K IKKR+GS+N   QL A+ 
Sbjct: 1   MVNAMVERATSEMLIGPDWAMNLEICDMLNSDPAQAKDVVKGIKKRIGSRNPKAQLLALT 60

Query: 61  LLEMLMNNIGEAIHKQVIDSGVLPILVKIVKKKSNLPVRERIFLLLDATQTALGGASGKF 120
           LLE ++ N G+ +H  V + GV+  +V+IVKKK +  V+E+I +L+D  Q A GG   ++
Sbjct: 61  LLETIVKNCGDMVHMHVAEKGVIHEMVRIVKKKPDFHVKEKILVLIDTWQEAFGGPRARY 120

Query: 121 PQYYSAYYDLVVSFVSQSAGVQFPQRPP-AVPSNNPTQQQSNTSQNGVIR-LSEQEDVAR 180
           PQYY+ Y +L+       AG  FPQR   + P   P Q Q  TS    +R      DV  
Sbjct: 121 PQYYAGYQELL------RAGAVFPQRSERSAPVFTPPQTQPLTSYPPNLRNAGPGNDVP- 180

Query: 181 VEPQILPE-----SSSVQNTVGLI---------LFGSRRFDMGARDEFTLDLVEQCSFQK 240
            EP   PE      S +QN  G++         L    + D+  + E  +DLVEQC   K
Sbjct: 181 -EPSAEPEFPTLSLSEIQNAKGIMDVLAEMLSALEPGNKEDL--KQEVMVDLVEQCRTYK 240

Query: 241 QKLMHLVLSSRYGSIFACFFHTAQDEKIVCRAIELNEKLQKVLARHDALLSG-QFVSTQN 300
           Q+++HLV             ++  DE ++C+ + LN+ LQ+VL  ++A+ SG    S+Q 
Sbjct: 241 QRVVHLV-------------NSTSDESLLCQGLALNDDLQRVLTNYEAIASGLPGTSSQI 300

Query: 301 QFNGEEAGMSRLPDNHYNQDEGEDEEEA 312
           +    E G S +  +    D G+   +A
Sbjct: 301 EKPKSETGKSLVDVDGPLIDTGDSSNQA 305

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9FFQ02.0e-11453.94TOM1-like protein 5 OS=Arabidopsis thaliana OX=3702 GN=TOL5 PE=1 SV=1[more]
Q6NQK01.5e-4532.40TOM1-like protein 4 OS=Arabidopsis thaliana OX=3702 GN=TOL4 PE=1 SV=1[more]
Q9LPL64.4e-4234.93TOM1-like protein 3 OS=Arabidopsis thaliana OX=3702 GN=TOL3 PE=1 SV=1[more]
Q9C9Y19.9e-4230.53TOM1-like protein 8 OS=Arabidopsis thaliana OX=3702 GN=TOL8 PE=2 SV=1[more]
Q8L8601.0e-3835.06TOM1-like protein 9 OS=Arabidopsis thaliana OX=3702 GN=TOL9 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A6J1D6L63.6e-18882.49TOM1-like protein 5 OS=Momordica charantia OX=3673 GN=LOC111017804 PE=3 SV=1[more]
A0A1S3BMS21.7e-17483.17target of Myb protein 1 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103491258 PE=3 ... [more]
A0A6J1EH711.7e-17479.31TOM1-like protein 5 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111432500 PE=... [more]
A0A6J1HRT06.2e-17278.39TOM1-like protein 5 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111466022 PE=3 ... [more]
A0A1S4DXH72.0e-16278.96target of Myb protein 1 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103491258 PE=3 ... [more]
Match NameE-valueIdentityDescription
XP_038882891.11.6e-19385.75TOM1-like protein 5 [Benincasa hispida] >XP_038882892.1 TOM1-like protein 5 [Ben... [more]
XP_004142659.12.9e-19284.83TOM1-like protein 5 isoform X1 [Cucumis sativus][more]
XP_022149373.17.5e-18882.49TOM1-like protein 5 [Momordica charantia][more]
XP_011653749.12.6e-18080.92TOM1-like protein 5 isoform X2 [Cucumis sativus] >KAE8649605.1 hypothetical prot... [more]
XP_008449359.23.6e-17483.17PREDICTED: target of Myb protein 1 isoform X1 [Cucumis melo] >XP_016900684.1 PRE... [more]
Match NameE-valueIdentityDescription
AT5G63640.11.4e-11553.94ENTH/VHS/GAT family protein [more]
AT1G76970.11.0e-4632.40Target of Myb protein 1 [more]
AT1G21380.13.2e-4334.93Target of Myb protein 1 [more]
AT3G08790.17.0e-4330.53ENTH/VHS/GAT family protein [more]
AT4G32760.17.3e-4035.06ENTH/VHS/GAT family protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (USVL1VR-Ls) v1
Date Performed: 2021-10-18
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002014VHS domainSMARTSM00288VHS_2coord: 2..134
e-value: 9.1E-30
score: 114.9
IPR002014VHS domainPFAMPF00790VHScoord: 4..112
e-value: 1.2E-21
score: 77.1
IPR002014VHS domainPROSITEPS50179VHScoord: 9..130
score: 28.599323
IPR008942ENTH/VHSGENE3D1.25.40.90coord: 2..135
e-value: 1.7E-36
score: 127.1
IPR008942ENTH/VHSSUPERFAMILY48464ENTH/VHS domaincoord: 4..145
IPR038425GAT domain superfamilyGENE3D1.20.58.160coord: 157..282
e-value: 5.0E-6
score: 28.4
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 145..165
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 405..427
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 289..313
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 409..427
NoneNo IPR availablePANTHERPTHR45898:SF3TOM1-LIKE PROTEIN 5coord: 1..425
NoneNo IPR availableCDDcd03561VHScoord: 2..130
e-value: 6.25335E-41
score: 140.092
NoneNo IPR availableSUPERFAMILY89009GAT-like domaincoord: 207..277
IPR044836TOM1-like protein, plantPANTHERPTHR45898TOM1-LIKE PROTEINcoord: 1..425

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lsi01G020100.1Lsi01G020100.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0043328 protein transport to vacuole involved in ubiquitin-dependent protein catabolic process via the multivesicular body sorting pathway
cellular_component GO:0016020 membrane
molecular_function GO:0035091 phosphatidylinositol binding
molecular_function GO:0043130 ubiquitin binding