Spg002036 (gene) Sponge gourd (cylindrica) v1

Overview
NameSpg002036
Typegene
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Locationscaffold10: 200231 .. 209986 (+)
RNA-Seq ExpressionSpg002036
SyntenySpg002036
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAGTTATCAGCTGTTGTATTCAGAATAGGTATTTTTGTACTTGGAGGGAAGGAAATATCCATTTTGTTGAAGATACTTGCAACAAGCGTTTGATTCCATTGTCCATTTCCTTCTTACAGTGGTTTGAAAAAGTGTTAGTTGAGATTTTGCAAAATCCCGTTTCTTCATTCTTTCATGAGAAAATCAAGGAAGAATTTGGAGTCATTAGGTTGATTAAGTTCTTCTCAGATAATGAATGGTTCTTTGAATGTGCTGTTTGGCCTTCCACGGGTGGAAGAAGGATTATTCAAGTTCCTGCTGGCTTGAATAAGAAAGGATGGTATGTTTTTTGGGAAATGATTAGGGATTTCATCCTTAAAATTCATTCTAATGAGAATCAACCTATTCGGTCATTGTTAAGCAAAGAGGAGAGTCTTCCGGTTTTTGATAAAGTTTCAGCAGGTCATGCCTCTTCCAATTCATATGCTGAGGTGGTAAAGCGAGGTGGTTCTTTAAAAAGTTCAGTTTCTTTGAATGATTCAATAAGAAATGCCAAGGGTATTAACGAAGAAGCTTACTGGGTTCGCAAGAATTGTGATGTGCTGAAATTAGATTTGGAAAGATCAATTGTTGTTTCTAGATTGATGGCCCAATATTCTTGGAAGGATGTTAAGATTGCCCTTGAGAATTTCTTTAAAACTTTTGTCTTAGTTAACCCCTTCATGGATGATAAAGCTCTGATTCATGCAGCAGATGGTGGATTGGAATTTTCTGCAAATGGCAAGTGGAAGAAATTTGGAAACTTACATTTGAAATTGGATTTTTGGTCCTCTGAAATTCATTCACAGCCGAAGTCTATAAAAAGTTATGGAGGCTGGCTTGCAATTAGAAATATTCCATTGAATCTATGGCATCGTGATTCCTTTGAAGCTATCGGAAAGAACCTTGGAGGGTTGGTTAGTATTTCTTCCAATACGCTTAATTTGTTAGATTGTTCTGAAGCCTTCATTGAAGTAGAAAAGAATTTTTGTGGATTTATTCCTGCTGATATTAATGTTAAGATTGGTAATAAGTATGAATTTTCATTAAGATATGGTGATATTAATTCTTTGGAGAACAGAAATTTGAATTTTGATTCAAGAAAACAGCTAGATGCCAATGACTTTTCAAATTCCCTGGATTTAATTAGGGTAAGGCAGGTGATTTTGGATGAAGAATCTGATATTGTTAATAAAGAGGATAGGATGAATGAGTTGCCTGCTTTCTCTAGGCATGAGGAGGCATTTAATGAGGATTTGGATATTTCAAAGGATGTCTCGGCACAAGATAAATGTATTAAGTGCTGTGGCTGTATTATTCCTCCAACCAAGGTGATTAATGATGATAGCTGGTTTTTTGAATAATGCAGATTTGAATGGGGAATTGGTTCTTTCAATGGATACCTCGGTGCAAGATCAGAATTTAAAAGAGAGAGTCCAAGTTAATGAGATGTTGGGTTCTCCAAAAGGTGCTTCACTGCATGACAGGTGTATTAATAATGCTGGTTGTAAAGGTTTTAATGCCAGAATTAATGAGCCGACATTAGCTCTCTCTCCTTCATTAAATGACAATGAATTTAATGAGTCCGGTCCTCAGGAAGCCCAACAGTTTCAGGTTTTTGAACTTTCTTATAAGAATGATAATGCCGTTAATGGTATCTTAAATCATGATGTCCAGCAAGTAGCATTAAAGACCTATTCTCGGAAAAAATGTTCTCTCTCATCGGCTGTTATGACCAACTTTAAGACCAACTTTAATGCTGATCATTTAGAGTCTGACTGTACTCATTTAATTGCTGGAAATAAGGCTTCGGGATCTGCTATAATCAATGCTGGAAACGGGTTGAGTCAGGCCAAGGTATTTAAGGAATCTTCTATTCAAATTCCAGGGGGAAGTAATGTTTTTGTCAGAGGTATTGGTAGTTCCTTCAATCATAGTATTCATTCCCCGGTGGATTCAGATGATGAGTCTATGGTTAGTGTTAGCAGTGAAGATTCTGATCAATTGTTAGATAAAGAGGATAATGTGGAACAATTTTCAGATGATCAAATTGGTGAGTCTTTAGAATCTCTTTTTTGTGAGAAGGTTGATGGTTTAGGTTCTCAAATTATTCATGAGTCTTTATTATCACCTTCTCAAATTCCTAACCAATTCTCTTCGATAGTTGATACTTGTGGATTTCAGTTGTGTAAAATTTCGCCTCAGTCTTCTAAAGTGGCTGTTTGATTCCTGTTTAATTGATTATGAAGATTGTTTCATGGAATACCAGGGGTCTCGGTGATAAGTCTAAAAGGGTGTCTTTAAAGAAATTCCTACAGAACATTTGTCTAATTCAGGAGACTAAACAAGTAGCAATTGATTTGAAATTCATTAAATCCTTATGGAGTTCCAAGGAAATCGGCTGGTCGTTTGTGGAAGCTTATGGAAAATCAGGTGGACTTCTTATTATGTGGGATGAAAGTAAATTGTCAGTGCTGGAATTTTTAAAGGGTGGTTATTCTCTTTCAGTCAAATGTCTCACTCTTTGTAAAAAAGTTTGTTGGGTTTCAAATGTTTATGGTCCAAATGACTACAAAGAAAGGAGATTCCTTTGGTTTGAATTACGCTCTCTCTCTTATTATTGCACGGATCCTTGGTGTATTGGAGGAGACTTTAATATTACTCGATGGGTTCATGAACGATTTCCAGTAGGAAGGCAAACGAAAGGGATGCGTAGATTTAACAAATTCATTGAAGACTCGGGTCTTATGGAAATTCCTTTATCAAATGGTAAATTTACATGGTCTAGGGATGGAAACGCTTATTCTCACTCTCTTATTGATAGATTTTTGGTGACAAAAGAATGGGATGTGTTATTTGATAATTCCAGAGTATCAAGGAAGGCACGCATATTTTCTGATCATTTTCCTCTTTTATTAGAAGCTGGTTCTTTTATGTGGGGACCAAGTCCTTTCAGGTTTTATAATAGTTGGCTTTCTCAAGCGGAATGTGATAGGATTATTTTGGATTCTCTTTCCATTGATCGATCACAAGGATGGGCTGGTTTTGTTATTAGCTCCAAATTCAGAAATTTAAAAGTTGCCATTAAGAAGTGGTTTGCAGAATTTGAAGATAGCAGAAAAAGTAAAGAGAAAAATTTGCTTTTTGAACTTGAATTCTTTGATGCAAAGGCTGAAGAATCTCTTTTATCTGATGAAGAGTTGGATATTCTCTTGGCTATAAAAGGCGAAATTATGGGTTTATACATGTCTGATGAAAGAAATTTAATTAAAAAATGTAAGCTTAATTGGCTTAAGCTTGGTGATGAGAATACAAGTTTTTTCCATCGATTTTTAGCAGCCAAGAAGAGGAAGAACTTGATTACTGATTTAATTTCCAGCAATGGTGTTTCTTTAGTTTCCTTCAGGGAAATTGAACAAGAAATTCTGGATTTCTTTTCTTTATCAGAAAATTCCAGGTCATCGGTTTCTACCTTTTAACTTATCTTGGGATACTATTTCAGCTGGTCACAACACTGCCCTTTCGGTTCCTTTCTCTGTTGATGAAATCAGGGATGCTTTGCAATCTTTGGGTCGTAATAAGGCCCCGGGACCAGATGGTTTTACTTCGGAATTCTTATTAAAGTATTGGCTTCTGTTGAAACCAGATTTTATTCGACTTTTTGAGGAGTTTTTCCATAATGGTCACCTGAATGCGTGTATAAAGGAGAATTTTATATGTTTAATTCAGAAAAAGGAAGCTGCTGTCCGTATAAAAGACTTTAGGCCTATAAGCCTTACCACTTCAGTTTATAAAGTTATTGCTAAAGTCCTTGCTGAAAGAATGAGAAAGGTAATGGCCAATATAATTTCTCAATCTCAAAGCGCTTTTATTAGTGGCAGACAGATTCTTGATTCAGTTCTCATTGCTAATGAAGTTGTGGAGGAGTATCGAGCTAAGAAAAGAAAGGGGTGGATTTTGAAACTAGATCTTGAGAAAGCCTTTGATAGGGTGGATTGGGATTTCCTTGAGTATGTTCTCAAGTTGAAAGGATTTTGTGAAAAATGGATTGAATGGATAAATGGTTGTGTTAGGGATCCAAAATTTTCCATTTTCATTAATGGTCGGCCACGAGGGAGAATTTGTGCTGCTAGAGGTCTTAGGCAAGGAGATCCTCTTTCTCCTTTCCTATTTCTTTTAGTTAGCGAAGTATTGAATGCTTTTATCAACATCATCCATGAGAAGGGCGTTTATGAGGGTTTCACCGTTGGCAAGGATAAAGTTCATATTTCCATTCTTCAATTCGCAGATGATACTCTTTTATTCTGTAAGTATGATGATGCTATGCTTGATTCCCTCATTTCTACTATTGGTCTTTTTGAATGGTGCTCTGGGCAGAAAGTTAATTGGGAGAAATCTGCTTTGTGTGGAATTAATATCGATAATGTCAAGATTCTGGCTACTTCTTCCCGATTGAATTGCAAAGTGGAATCTCTTCCATTTTTGTATCTTGGTCTTCCATTAGGTGGTTATCCGAAAAAGATGTCATTTTGGCAGCCAGTGATTGATAAAATTCAGAAAAAGCTTGATAGATGGAGGCGTTTTAATTTATCTAGAGGCGGCCGATTGACTTTATGCAATTCAGTTTTATCAAGCATTCCGTTATATTATATGTCCTTGTTTCTCATGCCTACTAAAGTTATTTCAAAGGTGGAGCAGTTAATTAGGTCGTTTTTATGGGAAGGAAGTAATGGTTCAAAGTTAAATCACTTGGCTCGTTGGGAGCAAGCTTCAAAACCTCTTTTAAGTGGAGGTCTCGGTATTGGTGGCTTGAAAAACAGAAACTTGGCTCTTCTTGCTAAATGGGGGTGGCGATATATGCGTGAGCATGATTCTTTATGGTGTAAAGTTGTTAAAAGCATTCATGGGCAGGATTGTTATAATTGGCACACTTCTGGTAAGGCCGGCCTAAGTCTTCGAAGTCCTTGGATTAATATTTCTAAGGTTTGGAAGCAATTTGAGTATTTAGCTTCTTTTAAACTTGGTAATGGTTCTAGAATTGCCTTCTGGCTTGATTCTTGGGTTGATGATCTTCCTTTTTGTTCAAAGTATCCTAGTTTATTTCGGATTGCTTCTCTTCCTAATGCCTCCGTTTTGGATCATTGGGATGGGGAGACTCTCTCATGGAATATTTCCTTTCGTCGGCTTCTCAAAGAGGAAGAAATTTCTGATTTTCAGCAGTTGTTGGTCTGTTTAAATGATGCCATTGTATCTGAATTTTCAGATTCTCGTATTTGGTCTCTTGAGAATTCGGGACTATATTCGGTTAAGTCCCTTTTTTACTTCTTGGCAGCCTCCTCTTCGATTAACAAGGAAGTGTTTAAAGCTATTTGGAAAACGAAATGCCCTAAGAGAATTAATTTTCTTGTTTGGGTTATGATTTTTGGGACTCTAAATTGTTCAGAAGTTCTTCAAAGAAGGCTACCTTCTCATGCTCTATCTCCTTCTATTTGCCCTTTATGTTTGAAGCCAGTGAATCTTTGCAGCATTTGTTCTTTGATTGCGTTTATTCCTATCAGTGTTGGGGAAAGCTATTGTCTATCTTCAAGCTTCAATGGGTTCTGGATCAGTCATTCAAAGAAAATGTGCAGCAACTTTTAAGTGGTCCATCAGTTAAGCCGGTCTCTAAATTGCTTTGGTCTAATGGGGTCAAAGCTGTTTTATCAGAGCTTTGGTTTGAAAGAAATCAGAGAATTTTTCATGATATTTCACTTCCTTGGGTGGATCGTTTTTACTCTGCTCGGCTCAAAGCTTCTTCTTGGTGTTCTTTGTCCAAGCTTTTTTCAGGATTCTCCATTCAAGATATTTGCCTCAACTGGGAAGCTTTTATTTTTCCGTCTTAGTTCTGTTGTTTTCCTTCTTTTATGATGTTTCTTTATCTAATTCGGTCTTTTTGTATCTTTTCACCTTTGTACTTTGAGCATTAGTCTCTTTTCATCTCTTCAATGAAAAGTGTGGTTTCCTTTTCAAAAAAAAAAAAAGAAACCAAAGGATCTTTCACAACAAGCATCTTTTTTGGGTAGATCGATTTAGTACGACTCGTCTCAAAGCTTCATTCTGGCGTTCTCTTTCTAAATTTTATGCAAATTATTCTATTCAAGATTTATGTATTAACTGGAATGCTTTTATTTTCCATTGTAACTTTATTACTTGTATTTTTTTATTTTTTGTTTTTATTCCTTCCGTCTTTGTTTCCTATAAAAATAAAATGAAAAGTTTTGTTTCTTGTCAAAAAAAAAAAAAATAAATAAATAAAATTGAAAGTATTGGGAGTTGGATTTTTTGGGCTGCTGTTTTAGCCCATTTTATTTGGTTGGTTGGTTGGTTTTTTGTTTTGTTTTGTTTTGTTTTTTGTTTTTTGTTTTTTTTTTCCTCCTTTTATGGCCTGTTTGGATAATAAATATCTGTGTTTGGATAGAATTTTTTAATGTCCTTCCCCCTGTGTCAAAATGCTCTCTTCTCTCTCCTCTCTTTCTCTTTCCCTTTTCCTCCCCATCTTGTTTAGCCTTCTTCTGCCTCATCCTGTTTGGTGTTTTGGATTTGTTCTTGGTATGTTTGATACATTCCGTTTCCATCCCACCTCTACAAGTTATGAACAATTTTTTTTGTTGTTTCTTCGAGAAGGACATGTTCTATATAGAGTATATGTATTTGAAGAAGGTAATTCCTTTAGCCGTTTCACATTTTAGTTGGTTCGAAGCTTCTTTGGCGGAATTGTTGCAGCATCCTAGTCATTTTCCATTTTTTAAGCATGCTCGAGATGAATCAGCAGCAACACGTCTGGTAAAACTTAAACTTCAATCAGGTTGGTTTATTCAGTGTATTGTGTGGCCTTCTACGGGAGGAAGAAAGTCGATTCAAGTCCCGATGGACCAAAGAAAGAAGGATGGAATTATCTTTTGGGAATTGTTACAGGAATGTCTTAAGAGTTTTGAGTGTGTATCCTTCAGTCTATGAGTTTGCATCAAGAGGATAAGCTACACAGTATGGGAGGTGAAAAGTTTTATGCGGAAGTGGTAAAGATGAATCCTATGGAAAATCTCAGTACCAAAGACTCTTCAGTACAAAAAGTTGTTATAAAGAAGTCTTCTTCCATTAGCTCTTATTGGGTTCGTAATGATCATGAGGTGCTAAGTTTAGATTTTGATAATTTATGGGCAGTGACTAGGTTATTCGCCCATAATGATTGGAATAAGATTAAAGCTTCACTAGAAGATTATTTCCAATCAAAAGTAATGATTAATCCACTTTTTGATGATAAAGCCTCGATCAAATTTGGTGAAGATATCCAGGATAGTCCAAAGGTTTCATTTGGCAAGTGGAAATGTATTGGAAATTATCATCTATGTATTGAAAAATGGTCTAGAAGATTCCATAGTCATCCTAATTGGATCAAAGGGTATGGTGGATGGATTTCTATTAAAAATCTCCCCTTGGACTATTGGAAAATTGATTCTTTTTAAGCAATTGGAGCCAATTTTGGTGGTTTGGTAAGTGTTTCTTCAGATACTCTTAATTTGATCAATTGTCAAGAAGCAAAAATCCAGGTGCAACGAAACTTATGTGGATTTATACCCGTCTCAATTGAAGTCAAAGATAAAAAAAGAGGGAATATCCTTCTTCACTTTGGAGACATTGAAGCATTGGACCCTCCAAACATCATTGATAGAGAGCTTCATGTGAATGGTTTTCAGAATCCAATGGATCTTTTCCGGCTTAATAAGGTAATGGATGATGAAGGTTTTGGCGATTCTCAAGTTTGGAATTCAAAGGTAAAGTTTTATTCTTCAATTCAGGAATTACATTCGATTTCAAGAGATTCGTTGATGGCTTTGAAAACTCATGAATTAAATCCTATTTTTATGTCAAAAGGAATCATTGAGTTATGTAAAAATGGCTCGTGTGTAATGGTTGAATGAGTCATTTTTTGAACCGGTTTTGGAGGCTGGTGTAGTTTGTAATGAAGTACCAAAGAAAGTTAATGATGAGATTGCTTTAAACATGAAGGTCATGCAAGAAGAGGGCATTAATTTGGGGATAAATCACGATGTGAATATTGAAAAATTGAATGAAGTGGTGCCAACCAGATTTTATGGGTCCCAATGTGAAAAGTTGTCAACCCTTTCTCCTTCCAATCCTTGTACTTACCAAAAATCTTGGTCTCTGTGTGAGGATAATCCAGTTTCAGCAGAAGTGTTGTGTGAAAAGATTAATATCTGTAGCCAAAGAATCAAGACTTCTCAGATGCCTTCATCTCACAAATCTTCTTTAAGGAGCATGAATCATTATCCTCTTTATTATACCAGGAAGAAAGGTAACTCTCTCACTTTTAATATTTCAGTGTTGAATTCAGACACTTTAGAAGAATTTTGTACAAGACTCTTGGTTACCTCTTCCCTGGTTGAAAATGCTCAGGACAAAGATGTATGTCAGCAAATTAATAACATTCAATAAAATATATCAGGCAATCAGATATCCCTTTACTAGATTCGAATATCATTTTAACAAAAGGGATTATCAGTTCTTCTTGTACTAAGGAATTGCCATCCAATCTTGATGAATCTGATGGGGATCAGACATTAGCTTAAGCAGCGAGGAGATAGAAGATCAAGTAGTTGAAGCAGATAGTGATGGAATTGTTGCTGAAGAGTCTTTCACGGAAGCTTTTGAAACTCTGTTTATGGATGCTAATAATGAACAAGTCAATGACTCTTCTCTTGGTATAGTTTCAGAAGTAAATTATTCTTCAAGCCCTTCTAAATTTTCCTCTCTTATTGAGGTGTGTGGTATACAGTTACGTGAAATTCCCCCATTGTTACCTCAGGTTAAAGCCTTTTGAGCTTCATCATATTTTATAGCAATGAGAATAATCTCTTGGAACACTAGAGGTTTAGGGGACAGCTCTAAGCGTGTTGCTTTGAAAAAATTCGTTCAGAATCATTGTTCGGATTTAGTCTTAATTCAGGAGTCAAAAAGGGAGGATTTTGATATTTAATTGATCAAATCTTTATGGAGTTCAAGGGAGATTAGTTGGATTAATGTTGGTTCTATTGGAAGATCCGAAGGGCTTTTGATTATGTGGGATGAATCTAAATTGTCAGTTTCAGAATTTTTAAAAGGAGGCTATACCCTTTCAACTAAATTCTCTACCCAATATAAGAAAATTTGTTGGGTAACAAATGTATATGGGCCTAGTGATTACAGTGAAAGAAAACACCTATGTTCAGAATTGAGATAGCTATCATGTTATTGCATAGAACCTTGGTGTATTGGGGTGACTTCAATACTACTTGATGGGTTCATGAAAGGTCTCCACTGGGTAGACAAACAAAGGGAATGAAAGGTTTAATAAGTTGATTAAAGACCTTGATCTTATGGAAATTCTTTTGTCTAACGGAAAGTTCACTTGGTCACGAATTGGTAATGAGTCATCTTACTCTCTGATGGATAGGTTCCTTGTTTCAAAGGAATGTGATAATTTGTTTGATAATTCTAGAGTTTCAAGGCAAGCCTGTACACTCTCAGATCATTTCCTCTTATTGCTAGAAGCTGGAAATTTTATTTGGAGACCTTCTCCATTTCGGTTTTATAATAGTTGGTTACCTTTGCCAGATTGTGTGTCTATTATTGAGAATTCTGTTACTCAAGATCTTTCTTATGGATGGGCTGGGTTTGTAATTGCTTCTAAACTCTAG

mRNA sequence

ATGGAAGTTATCAGCTGTTGTATTCAGAATAGGTATTTTTGTACTTGGAGGGAAGGAAATATCCATTTTGTTGAAGATACTTGCAACAAGCGTTTGATTCCATTGTCCATTTCCTTCTTACAGTGGTTTGAAAAAGTGTTAGTTGAGATTTTGCAAAATCCCGTTTCTTCATTCTTTCATGAGAAAATCAAGGAAGAATTTGGAGTCATTAGGTTGATTAAGTTCTTCTCAGATAATGAATGGTTCTTTGAATGTGCTGTTTGGCCTTCCACGGGTGGAAGAAGGATTATTCAAGTTCCTGCTGGCTTGAATAAGAAAGGATGGTATGTTTTTTGGGAAATGATTAGGGATTTCATCCTTAAAATTCATTCTAATGAGAATCAACCTATTCGGTCATTGTTAAGCAAAGAGGAGAGTCTTCCGGTTTTTGATAAAGTTTCAGCAGGTCATGCCTCTTCCAATTCATATGCTGAGGTGGTAAAGCGAGGTGGTTCTTTAAAAAGTTCAGTTTCTTTGAATGATTCAATAAGAAATGCCAAGGGTATTAACGAAGAAGCTTACTGGGTTCGCAAGAATTGTGATGTGCTGAAATTAGATTTGGAAAGATCAATTGTTGTTTCTAGATTGATGGCCCAATATTCTTGGAAGGATGTTAAGATTGCCCTTGAGAATTTCTTTAAAACTTTTGTCTTAGTTAACCCCTTCATGGATGATAAAGCTCTGATTCATGCAGCAGATGGTGGATTGGAATTTTCTGCAAATGGCAAGTGGAAGAAATTTGGAAACTTACATTTGAAATTGGATTTTTGGTCCTCTGAAATTCATTCACAGCCGAAGTCTATAAAAAGTTATGGAGGCTGGCTTGCAATTAGAAATATTCCATTGAATCTATGGCATCGTGATTCCTTTGAAGCTATCGGAAAGAACCTTGGAGGGTTGGTTAGTATTTCTTCCAATACGCTTAATTTGTTAGATTGTTCTGAAGCCTTCATTGAAGTAGAAAAGAATTTTTGTGGATTTATTCCTGCTGATATTAATGTTAAGATTGGTAATAAGTATGAATTTTCATTAAGATATGGTGATATTAATTCTTTGGAGAACAGAAATTTGAATTTTGATTCAAGAAAACAGCTAGATGCCAATGACTTTTCAAATTCCCTGGATTTAATTAGGGTAAGGCAGGTGATTTTGGATGAAGAATCTGATATTGTTAATAAAGAGGATAGGATGAATGAGTTGCCTGCTTTCTCTAGGCATGAGGAGGCATTTAATGAGGATTTGGATATTTCAAAGGATGTCTCGGCACAAGATAAATATTTGAATGGGGAATTGGTTCTTTCAATGGATACCTCGGTGCAAGATCAGAATTTAAAAGAGAGAGTCCAAGTTAATGAGATGTTGGGTTCTCCAAAAGGTGCTTCACTGCATGACAGGTGTATTAATAATGCTGGTTGTAAAGGTTTTAATGCCAGAATTAATGAGCCGACATTAGCTCTCTCTCCTTCATTAAATGACAATGAATTTAATGAGTCCGGTCCTCAGGAAGCCCAACAGTTTCAGGTTTTTGAACTTTCTTATAAGAATGATAATGCCGTTAATGGTATCTTAAATCATGATGTCCAGCAAGTAGCATTAAAGACCTATTCTCGGAAAAAATGTTCTCTCTCATCGGCTGTTATGACCAACTTTAAGACCAACTTTAATGCTGATCATTTAGAGTCTGACTGTACTCATTTAATTGCTGGAAATAAGGCTTCGGGATCTGCTATAATCAATGCTGGAAACGGGTTGAGTCAGGCCAAGGTATTTAAGGAATCTTCTATTCAAATTCCAGGGGGAAGTAATGTTTTTGTCAGAGGTATTGGTAGTTCCTTCAATCATAGTATTCATTCCCCGGTGGATTCAGATGATGAGTCTATGGTTAGTGTTAGCAGTGAAGATTCTGATCAATTGTTAGATAAAGAGGATAATGTGGAACAATTTTCAGATGATCAAATTGGTGAGTCTTTAGAATCTCTTTTTTGTGAGAAGGTTGATGGTTTAGGTTCTCAAATTATTCATGAGTCTTTATTATCACCTTCTCAAATTCCTAACCAATTCTCTTCGATAGTTGATACTTGTGGATTTCAGTTGTGTAAAATTTCGCCTCAGTCTTCTAAAGTGGCTGAGACTAAACAAGTAGCAATTGATTTGAAATTCATTAAATCCTTATGGAGTTCCAAGGAAATCGGCTGGTCGTTTGTGGAAGCTTATGGAAAATCAGGTGGACTTCTTATTATGTGGGATGAAAGTAAATTGTCAGTGCTGGAATTTTTAAAGGGTGGTTATTCTCTTTCAGTCAAATGTCTCACTCTTTGTAAAAAAGTTTGTTGGGTTTCAAATGTTTATGGTCCAAATGACTACAAAGAAAGGAGATTCCTTTGGTTTGAATTACGCTCTCTCTCTTATTATTGCACGGATCCTTGGTGTATTGGAGGAGACTTTAATATTACTCGATGGGTTCATGAACGATTTCCAGTAGGAAGGCAAACGAAAGGGATGCGTAGATTTAACAAATTCATTGAAGACTCGGGTCTTATGGAAATTCCTTTATCAAATGGTAAATTTACATGGTCTAGGGATGGAAACGCTTATTCTCACTCTCTTATTGATAGATTTTTGGTGACAAAAGAATGGGATGTGTTATTTGATAATTCCAGAGTATCAAGGAAGGCACGCATATTTTCTGATCATTTTCCTCTTTTATTAGAAGCTGGTTCTTTTATGTGGGGACCAAGTCCTTTCAGGTTTTATAATAGTTGGCTTTCTCAAGCGGAATGTGATAGGATTATTTTGGATTCTCTTTCCATTGATCGATCACAAGGATGGGCTGGTTTTGTTATTAGCTCCAAATTCAGAAATTTAAAAGTTGCCATTAAGAAGTGGTTTGCAGAATTTGAAGATAGCAGAAAAAGTAAAGAGAAAAATTTGCTTTTTGAACTTGAATTCTTTGATGCAAAGGCTGAAGAATCTCTTTTATCTGATGAAGAGTTGGATATTCTCTTGGCTATAAAAGGCGAAATTATGGGTTTATACATGTCTGATGAAAGAAATTTAATTAAAAAATGTAAGCTTAATTGGCTTAAGCTTGGTGATGAGAATACAAGTTTTTTCCATCGATTTTTAGCAGCCAAGAAGAGGAAGAACTTGATTACTGATTTAATTTCCAGCAATGGTGTTTCTTTAGTTTCCTTCAGGGAAATTGAACAAGAAATTCTGGATTTCTTTTCTTTATCAGAAAATTCCAGAATTGCCTTCTGGCTTGATTCTTGGGTTGATGATCTTCCTTTTTGTTCAAAGTATCCTAGTTTATTTCGGATTGCTTCTCTTCCTAATGCCTCCGTTTTGGATCATTGGGATGGGGAGACTCTCTCATGGAATATTTCCTTTCGTCGGCTTCTCAAAGAGGAAGAAATTTCTGATTTTCAGCAGTTGTTGGTCTGTTTAAATGATGCCATTGTATCTGAATTTTCAGATTCTCGTATTTGGTCTCTTGAGAATTCGGGACTATATTCGTGTTGGGGAAAGCTATTGTCTATCTTCAAGCTTCAATGGGTTCTGGATCAGTCATTCAAAGAAAATGTGCAGCAACTTTTAAGTGGTCCATCAGTTAAGCCGCATCCTAGTCATTTTCCATTTTTTAAGCATGCTCGAGATGAATCAGCAGCAACACGTCTGTCTATGAGTTTGCATCAAGAGGATAAGCTACACAGTATGGGAGGTGAAAAGTTTTATGCGGAAGTGGTAAAGATGAATCCTATGGAAAATCTCAGTACCAAAGACTCTTCAGTACAAAAAGTTGTTATAAAGAAGTCTTCTTCCATTAGCTCTTATTGGGTTCGTAATGATCATGAGGTGCTAAGTTTAGATTTTGATAATTTATGGGCAGTGACTAGGTTATTCGCCCATAATGATTGGAATAAGATTAAAGCTTCACTAGAAGATTATTTCCAATCAAAAGTAATGATTAATCCACTTTTTGATGATAAAGCCTCGATCAAATTTGGTGAAGATATCCAGGATAGTCCAAAGGTGCAACGAAACTTATGTGGATTTATACCCGTCTCAATTGAAGTCAAAGATAAAAAAAGAGGGAATATCCTTCTTCACTTTGGAGACATTGAAGCATTGGACCCTCCAAACATCATTGATAGAGAGCTTCATGTGAATGGTTTTCAGAATCCAATGGATCTTTTCCGGCTTAATAAGGTAATGGATGATGAAGGTTTTGGCGATTCTCAAGTTTGGAATTCAAAGGCTGGTGTAGTTTGTAATGAAGTACCAAAGAAAGTTAATGATGAGATTGCTTTAAACATGAAGGTCATGCAAGAAGAGGGCATTAATTTGGGGATAAATCACGATGTGAATATTGAAAAATTGAATGAAGTGGTGCCAACCAGATTTTATGGGTCCCAATGTGAAAAGTTGTCAACCCTTTCTCCTTCCAATCCTTGTACTTACCAAAAATCTTGGTCTCTGTGTGAGGATAATCCAGTTTCAGCAGAAGTGTTGTGTGAAAAGATTAATATCTGTAGCCAAAGAATCAAGACTTCTCAGATGCCTTCATCTCACAAATCTTCTTTAAGGAGCATGAATCATTATCCTCTTTATTATACCAGGAAGAAAGACATTAGCTTAAGCAGCGAGGAGATAGAAGATCAAGTAGTTGAAGCAGATAGTGATGGAATTGTTGCTGAAGAGTCTTTCACGGAAGCTTTTGAAACTCTGTTTATGGATGCTAATAATGAACAAGTCAATGACTCTTCTCTTGGTATAGTTTCAGAAGTAAATTATTCTTCAAGCCCTTCTAAATTTTCCTCTCTTATTGAGGTGTGTGGTATACAGTTACGTGAAATTCCCCCATTGTTACCTCAGACAAACAAAGGGAATGAAAGGTTTAATAAGTTGATTAAAGACCTTGATCTTATGGAAATTCTTTTGTCTAACGGAAAGTTCACTTGGTCACGAATTGGTAATGAGTCATCTTACTCTCTGATGGATAGGTTCCTTGTTTCAAAGGAATGTGATAATTTGTTTGATAATTCTAGAGTTTCAAGGCAAGCCTGTACACTCTCAGATCATTTCCTCTTATTGCTAGAAGCTGGAAATTTTATTTGGAGACCTTCTCCATTTCGGTTTTATAATAGTTGGTTACCTTTGCCAGATTGTGTGTCTATTATTGAGAATTCTGTTACTCAAGATCTTTCTTATGGATGGGCTGGGTTTGTAATTGCTTCTAAACTCTAG

Coding sequence (CDS)

ATGGAAGTTATCAGCTGTTGTATTCAGAATAGGTATTTTTGTACTTGGAGGGAAGGAAATATCCATTTTGTTGAAGATACTTGCAACAAGCGTTTGATTCCATTGTCCATTTCCTTCTTACAGTGGTTTGAAAAAGTGTTAGTTGAGATTTTGCAAAATCCCGTTTCTTCATTCTTTCATGAGAAAATCAAGGAAGAATTTGGAGTCATTAGGTTGATTAAGTTCTTCTCAGATAATGAATGGTTCTTTGAATGTGCTGTTTGGCCTTCCACGGGTGGAAGAAGGATTATTCAAGTTCCTGCTGGCTTGAATAAGAAAGGATGGTATGTTTTTTGGGAAATGATTAGGGATTTCATCCTTAAAATTCATTCTAATGAGAATCAACCTATTCGGTCATTGTTAAGCAAAGAGGAGAGTCTTCCGGTTTTTGATAAAGTTTCAGCAGGTCATGCCTCTTCCAATTCATATGCTGAGGTGGTAAAGCGAGGTGGTTCTTTAAAAAGTTCAGTTTCTTTGAATGATTCAATAAGAAATGCCAAGGGTATTAACGAAGAAGCTTACTGGGTTCGCAAGAATTGTGATGTGCTGAAATTAGATTTGGAAAGATCAATTGTTGTTTCTAGATTGATGGCCCAATATTCTTGGAAGGATGTTAAGATTGCCCTTGAGAATTTCTTTAAAACTTTTGTCTTAGTTAACCCCTTCATGGATGATAAAGCTCTGATTCATGCAGCAGATGGTGGATTGGAATTTTCTGCAAATGGCAAGTGGAAGAAATTTGGAAACTTACATTTGAAATTGGATTTTTGGTCCTCTGAAATTCATTCACAGCCGAAGTCTATAAAAAGTTATGGAGGCTGGCTTGCAATTAGAAATATTCCATTGAATCTATGGCATCGTGATTCCTTTGAAGCTATCGGAAAGAACCTTGGAGGGTTGGTTAGTATTTCTTCCAATACGCTTAATTTGTTAGATTGTTCTGAAGCCTTCATTGAAGTAGAAAAGAATTTTTGTGGATTTATTCCTGCTGATATTAATGTTAAGATTGGTAATAAGTATGAATTTTCATTAAGATATGGTGATATTAATTCTTTGGAGAACAGAAATTTGAATTTTGATTCAAGAAAACAGCTAGATGCCAATGACTTTTCAAATTCCCTGGATTTAATTAGGGTAAGGCAGGTGATTTTGGATGAAGAATCTGATATTGTTAATAAAGAGGATAGGATGAATGAGTTGCCTGCTTTCTCTAGGCATGAGGAGGCATTTAATGAGGATTTGGATATTTCAAAGGATGTCTCGGCACAAGATAAATATTTGAATGGGGAATTGGTTCTTTCAATGGATACCTCGGTGCAAGATCAGAATTTAAAAGAGAGAGTCCAAGTTAATGAGATGTTGGGTTCTCCAAAAGGTGCTTCACTGCATGACAGGTGTATTAATAATGCTGGTTGTAAAGGTTTTAATGCCAGAATTAATGAGCCGACATTAGCTCTCTCTCCTTCATTAAATGACAATGAATTTAATGAGTCCGGTCCTCAGGAAGCCCAACAGTTTCAGGTTTTTGAACTTTCTTATAAGAATGATAATGCCGTTAATGGTATCTTAAATCATGATGTCCAGCAAGTAGCATTAAAGACCTATTCTCGGAAAAAATGTTCTCTCTCATCGGCTGTTATGACCAACTTTAAGACCAACTTTAATGCTGATCATTTAGAGTCTGACTGTACTCATTTAATTGCTGGAAATAAGGCTTCGGGATCTGCTATAATCAATGCTGGAAACGGGTTGAGTCAGGCCAAGGTATTTAAGGAATCTTCTATTCAAATTCCAGGGGGAAGTAATGTTTTTGTCAGAGGTATTGGTAGTTCCTTCAATCATAGTATTCATTCCCCGGTGGATTCAGATGATGAGTCTATGGTTAGTGTTAGCAGTGAAGATTCTGATCAATTGTTAGATAAAGAGGATAATGTGGAACAATTTTCAGATGATCAAATTGGTGAGTCTTTAGAATCTCTTTTTTGTGAGAAGGTTGATGGTTTAGGTTCTCAAATTATTCATGAGTCTTTATTATCACCTTCTCAAATTCCTAACCAATTCTCTTCGATAGTTGATACTTGTGGATTTCAGTTGTGTAAAATTTCGCCTCAGTCTTCTAAAGTGGCTGAGACTAAACAAGTAGCAATTGATTTGAAATTCATTAAATCCTTATGGAGTTCCAAGGAAATCGGCTGGTCGTTTGTGGAAGCTTATGGAAAATCAGGTGGACTTCTTATTATGTGGGATGAAAGTAAATTGTCAGTGCTGGAATTTTTAAAGGGTGGTTATTCTCTTTCAGTCAAATGTCTCACTCTTTGTAAAAAAGTTTGTTGGGTTTCAAATGTTTATGGTCCAAATGACTACAAAGAAAGGAGATTCCTTTGGTTTGAATTACGCTCTCTCTCTTATTATTGCACGGATCCTTGGTGTATTGGAGGAGACTTTAATATTACTCGATGGGTTCATGAACGATTTCCAGTAGGAAGGCAAACGAAAGGGATGCGTAGATTTAACAAATTCATTGAAGACTCGGGTCTTATGGAAATTCCTTTATCAAATGGTAAATTTACATGGTCTAGGGATGGAAACGCTTATTCTCACTCTCTTATTGATAGATTTTTGGTGACAAAAGAATGGGATGTGTTATTTGATAATTCCAGAGTATCAAGGAAGGCACGCATATTTTCTGATCATTTTCCTCTTTTATTAGAAGCTGGTTCTTTTATGTGGGGACCAAGTCCTTTCAGGTTTTATAATAGTTGGCTTTCTCAAGCGGAATGTGATAGGATTATTTTGGATTCTCTTTCCATTGATCGATCACAAGGATGGGCTGGTTTTGTTATTAGCTCCAAATTCAGAAATTTAAAAGTTGCCATTAAGAAGTGGTTTGCAGAATTTGAAGATAGCAGAAAAAGTAAAGAGAAAAATTTGCTTTTTGAACTTGAATTCTTTGATGCAAAGGCTGAAGAATCTCTTTTATCTGATGAAGAGTTGGATATTCTCTTGGCTATAAAAGGCGAAATTATGGGTTTATACATGTCTGATGAAAGAAATTTAATTAAAAAATGTAAGCTTAATTGGCTTAAGCTTGGTGATGAGAATACAAGTTTTTTCCATCGATTTTTAGCAGCCAAGAAGAGGAAGAACTTGATTACTGATTTAATTTCCAGCAATGGTGTTTCTTTAGTTTCCTTCAGGGAAATTGAACAAGAAATTCTGGATTTCTTTTCTTTATCAGAAAATTCCAGAATTGCCTTCTGGCTTGATTCTTGGGTTGATGATCTTCCTTTTTGTTCAAAGTATCCTAGTTTATTTCGGATTGCTTCTCTTCCTAATGCCTCCGTTTTGGATCATTGGGATGGGGAGACTCTCTCATGGAATATTTCCTTTCGTCGGCTTCTCAAAGAGGAAGAAATTTCTGATTTTCAGCAGTTGTTGGTCTGTTTAAATGATGCCATTGTATCTGAATTTTCAGATTCTCGTATTTGGTCTCTTGAGAATTCGGGACTATATTCGTGTTGGGGAAAGCTATTGTCTATCTTCAAGCTTCAATGGGTTCTGGATCAGTCATTCAAAGAAAATGTGCAGCAACTTTTAAGTGGTCCATCAGTTAAGCCGCATCCTAGTCATTTTCCATTTTTTAAGCATGCTCGAGATGAATCAGCAGCAACACGTCTGTCTATGAGTTTGCATCAAGAGGATAAGCTACACAGTATGGGAGGTGAAAAGTTTTATGCGGAAGTGGTAAAGATGAATCCTATGGAAAATCTCAGTACCAAAGACTCTTCAGTACAAAAAGTTGTTATAAAGAAGTCTTCTTCCATTAGCTCTTATTGGGTTCGTAATGATCATGAGGTGCTAAGTTTAGATTTTGATAATTTATGGGCAGTGACTAGGTTATTCGCCCATAATGATTGGAATAAGATTAAAGCTTCACTAGAAGATTATTTCCAATCAAAAGTAATGATTAATCCACTTTTTGATGATAAAGCCTCGATCAAATTTGGTGAAGATATCCAGGATAGTCCAAAGGTGCAACGAAACTTATGTGGATTTATACCCGTCTCAATTGAAGTCAAAGATAAAAAAAGAGGGAATATCCTTCTTCACTTTGGAGACATTGAAGCATTGGACCCTCCAAACATCATTGATAGAGAGCTTCATGTGAATGGTTTTCAGAATCCAATGGATCTTTTCCGGCTTAATAAGGTAATGGATGATGAAGGTTTTGGCGATTCTCAAGTTTGGAATTCAAAGGCTGGTGTAGTTTGTAATGAAGTACCAAAGAAAGTTAATGATGAGATTGCTTTAAACATGAAGGTCATGCAAGAAGAGGGCATTAATTTGGGGATAAATCACGATGTGAATATTGAAAAATTGAATGAAGTGGTGCCAACCAGATTTTATGGGTCCCAATGTGAAAAGTTGTCAACCCTTTCTCCTTCCAATCCTTGTACTTACCAAAAATCTTGGTCTCTGTGTGAGGATAATCCAGTTTCAGCAGAAGTGTTGTGTGAAAAGATTAATATCTGTAGCCAAAGAATCAAGACTTCTCAGATGCCTTCATCTCACAAATCTTCTTTAAGGAGCATGAATCATTATCCTCTTTATTATACCAGGAAGAAAGACATTAGCTTAAGCAGCGAGGAGATAGAAGATCAAGTAGTTGAAGCAGATAGTGATGGAATTGTTGCTGAAGAGTCTTTCACGGAAGCTTTTGAAACTCTGTTTATGGATGCTAATAATGAACAAGTCAATGACTCTTCTCTTGGTATAGTTTCAGAAGTAAATTATTCTTCAAGCCCTTCTAAATTTTCCTCTCTTATTGAGGTGTGTGGTATACAGTTACGTGAAATTCCCCCATTGTTACCTCAGACAAACAAAGGGAATGAAAGGTTTAATAAGTTGATTAAAGACCTTGATCTTATGGAAATTCTTTTGTCTAACGGAAAGTTCACTTGGTCACGAATTGGTAATGAGTCATCTTACTCTCTGATGGATAGGTTCCTTGTTTCAAAGGAATGTGATAATTTGTTTGATAATTCTAGAGTTTCAAGGCAAGCCTGTACACTCTCAGATCATTTCCTCTTATTGCTAGAAGCTGGAAATTTTATTTGGAGACCTTCTCCATTTCGGTTTTATAATAGTTGGTTACCTTTGCCAGATTGTGTGTCTATTATTGAGAATTCTGTTACTCAAGATCTTTCTTATGGATGGGCTGGGTTTGTAATTGCTTCTAAACTCTAG

Protein sequence

MEVISCCIQNRYFCTWREGNIHFVEDTCNKRLIPLSISFLQWFEKVLVEILQNPVSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTGGRRIIQVPAGLNKKGWYVFWEMIRDFILKIHSNENQPIRSLLSKEESLPVFDKVSAGHASSNSYAEVVKRGGSLKSSVSLNDSIRNAKGINEEAYWVRKNCDVLKLDLERSIVVSRLMAQYSWKDVKIALENFFKTFVLVNPFMDDKALIHAADGGLEFSANGKWKKFGNLHLKLDFWSSEIHSQPKSIKSYGGWLAIRNIPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKYEFSLRYGDINSLENRNLNFDSRKQLDANDFSNSLDLIRVRQVILDEESDIVNKEDRMNELPAFSRHEEAFNEDLDISKDVSAQDKYLNGELVLSMDTSVQDQNLKERVQVNEMLGSPKGASLHDRCINNAGCKGFNARINEPTLALSPSLNDNEFNESGPQEAQQFQVFELSYKNDNAVNGILNHDVQQVALKTYSRKKCSLSSAVMTNFKTNFNADHLESDCTHLIAGNKASGSAIINAGNGLSQAKVFKESSIQIPGGSNVFVRGIGSSFNHSIHSPVDSDDESMVSVSSEDSDQLLDKEDNVEQFSDDQIGESLESLFCEKVDGLGSQIIHESLLSPSQIPNQFSSIVDTCGFQLCKISPQSSKVAETKQVAIDLKFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKKVCWVSNVYGPNDYKERRFLWFELRSLSYYCTDPWCIGGDFNITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHSLIDRFLVTKEWDVLFDNSRVSRKARIFSDHFPLLLEAGSFMWGPSPFRFYNSWLSQAECDRIILDSLSIDRSQGWAGFVISSKFRNLKVAIKKWFAEFEDSRKSKEKNLLFELEFFDAKAEESLLSDEELDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDENTSFFHRFLAAKKRKNLITDLISSNGVSLVSFREIEQEILDFFSLSENSRIAFWLDSWVDDLPFCSKYPSLFRIASLPNASVLDHWDGETLSWNISFRRLLKEEEISDFQQLLVCLNDAIVSEFSDSRIWSLENSGLYSCWGKLLSIFKLQWVLDQSFKENVQQLLSGPSVKPHPSHFPFFKHARDESAATRLSMSLHQEDKLHSMGGEKFYAEVVKMNPMENLSTKDSSVQKVVIKKSSSISSYWVRNDHEVLSLDFDNLWAVTRLFAHNDWNKIKASLEDYFQSKVMINPLFDDKASIKFGEDIQDSPKVQRNLCGFIPVSIEVKDKKRGNILLHFGDIEALDPPNIIDRELHVNGFQNPMDLFRLNKVMDDEGFGDSQVWNSKAGVVCNEVPKKVNDEIALNMKVMQEEGINLGINHDVNIEKLNEVVPTRFYGSQCEKLSTLSPSNPCTYQKSWSLCEDNPVSAEVLCEKINICSQRIKTSQMPSSHKSSLRSMNHYPLYYTRKKDISLSSEEIEDQVVEADSDGIVAEESFTEAFETLFMDANNEQVNDSSLGIVSEVNYSSSPSKFSSLIEVCGIQLREIPPLLPQTNKGNERFNKLIKDLDLMEILLSNGKFTWSRIGNESSYSLMDRFLVSKECDNLFDNSRVSRQACTLSDHFLLLLEAGNFIWRPSPFRFYNSWLPLPDCVSIIENSVTQDLSYGWAGFVIASKL
Homology
BLAST of Spg002036 vs. NCBI nr
Match: TYJ98683.1 (hypothetical protein E5676_scaffold429G00120 [Cucumis melo var. makuwa])

HSP 1 Score: 325.1 bits (832), Expect = 3.8e-84
Identity = 142/233 (60.94%), Postives = 176/233 (75.54%), Query Frame = 0

Query: 729 IDLKFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKK 788
           ID+  IKSLWSSK+IGW  VE++G+ GG+L MWD SK+ V+E LKGGYSLS+  +T CKK
Sbjct: 82  IDIALIKSLWSSKDIGWELVESFGRFGGILTMWDMSKIKVVETLKGGYSLSINSITSCKK 141

Query: 789 VCWVSNVYGPNDYKERRFLWFELRSLSYYCTDPWCIGGDFNITRWVHERFPVGRQTKGMR 848
            CW++NVYGP DY+ERRF+W  L SLS YCT  WCIGG  NITRW HE FP+ +QT+GMR
Sbjct: 142 SCWITNVYGPYDYEERRFVWLVLVSLSGYCTGAWCIGGKCNITRWAHECFPLEKQTRGMR 201

Query: 849 RFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHSLIDRFLVTKEWDVLFDNSRVSRKARIF 908
           +FN  I+   + E+PL NG+ TWSR+G++ S SL+D F + KEWD + +NSRV RKA   
Sbjct: 202 QFNNPIDSLNIWELPLQNGRCTWSREGSSISRSLLDPFFIDKEWDEISENSRVGRKAHTI 261

Query: 909 SDHFPLLLEAGSFMWGPSPFRFYNSWLSQAECDRIILDSLSIDRSQGWAGFVI 962
           SDHFPLLLEAGS  WGPSPFRF NSWL  +EC+RII +  +I     WAGFV+
Sbjct: 262 SDHFPLLLEAGSIKWGPSPFRFSNSWLPFSECNRIIKEVWNITSITDWAGFVL 314

BLAST of Spg002036 vs. NCBI nr
Match: XP_038904301.1 (uncharacterized protein LOC120090656 [Benincasa hispida])

HSP 1 Score: 290.4 bits (742), Expect = 1.0e-73
Identity = 145/285 (50.88%), Postives = 189/285 (66.32%), Query Frame = 0

Query: 807  LWFELRSLSYYCTDPWCIGGDFNITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSN 866
            +W EL SL+    DPWCIG +FN  R  HERFPVGR T+ M  FNKFI  + L+E PLSN
Sbjct: 1    MWAELSSLAEKFDDPWCIGENFNSIRRRHERFPVGRATRDMNNFNKFIRLNNLLEFPLSN 60

Query: 867  GKFTWSRDGNAYSHSLIDRFLVTKEWDVLFDNSRVSRKARIFSDHFPLLLEAGSFMWGPS 926
            G+FTWSR+G+  S SL+D FLV+  W+ +FDNSRV+R+AR  SDHFPL LEAG+F WGPS
Sbjct: 61   GQFTWSREGDVASKSLLDHFLVSSTWEDVFDNSRVARQARTMSDHFPLTLEAGAFEWGPS 120

Query: 927  PFRFYNSWLSQAECDRIILDSLSIDRSQGWAGFVISSKFRNLKVAIKKWFAEFEDSRKSK 986
             FRF NSWL+  E  ++I  SL    +  WA   +S+  R  K A+KKWF EF    K K
Sbjct: 121  SFRFCNSWLNNKESCKLIEKSLKKKENHQWAA-TLSTNLRKTKSALKKWFHEFGKEMKLK 180

Query: 987  EKNLLFELEFFDAKAEESLLSDEELDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDE 1046
            E++LL EL+  D+   +        D   ++K +++ LY  +E++LI+KCKL WLK GDE
Sbjct: 181  EESLLNELQRKDSLTVDVSSQIRVDDASYSLKADLLALYQLEEKSLIQKCKLKWLKEGDE 240

Query: 1047 NTSFFHRFLAAKKRKNLITDLISSNGVSLVSFREIEQEILDFFSL 1092
            NTSFFHRFL+ +KRKNL   L++   +     R+IE  IL F+SL
Sbjct: 241  NTSFFHRFLSTRKRKNLFAKLLNDQDLPTRFTRDIEDIILGFYSL 284

BLAST of Spg002036 vs. NCBI nr
Match: TYJ99315.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])

HSP 1 Score: 271.9 bits (694), Expect = 3.8e-68
Identity = 272/1111 (24.48%), Postives = 452/1111 (40.68%), Query Frame = 0

Query: 33   IPLSISFLQWFEKVLVEILQNPVSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTG 92
            I +S   L W    L  ++  P ++ F  + ++    I + K  +      E        
Sbjct: 98   IEVSPRDLDWIRCTLKSLIATPNTNRFFLETRDSEQRIWIRKTRNSKGCTAEIFRVDQKN 157

Query: 93   GRRIIQVPAGLNKKGWYVFWEMIRDFI-LKIHSNENQPIRSLLSKEESLPVFDKVSAGHA 152
             +  I VP G +K GW  F  MI   + +K  +      R+      S P+         
Sbjct: 158  RKSCILVPEGPDKSGWVSFLSMITPKVEVKAKTRPTFLPRTSPDCRLSPPI-------DY 217

Query: 153  SSNSYAEVVKRGGSLKSSVSLNDSIRNAKGINEEAYWVRKNCDVLKLD-LERSIVVSRLM 212
               SYA+ V  G    +S S +DS  ++   +  +      CD    D LE ++V+ R  
Sbjct: 218  HKRSYAKAVTEGRPFATSDS-SDSYDSSDSSHSSS---NSFCDSPSSDLLENTVVIVRRF 277

Query: 213  AQYSWKDVKIALENFFKTFVLVNPFMDDKALIHAADG--GLEFSANGKWKKFGNLHLKLD 272
                W  +   L    +     N F  +KAL+H +          N  W   G   ++ +
Sbjct: 278  FHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKYSVRFE 337

Query: 273  FWSSEIHSQPKSIKSYGGWLAIRNIPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSE 332
             WS   H+ PK I SYGGW   R IPL+LW+  +F+ IGK   GL+ ++  T +  +  E
Sbjct: 338  KWSPVYHATPKLIPSYGGWTTFRGIPLHLWNMMTFQQIGKACEGLIKVAEETRSAKNLIE 397

Query: 333  AFIEVEKNFCGFIPADINV--KIGNKY--------------------------EFSLRYG 392
            A I+V  N+ GF+PA++ +    GNK+                          + +  + 
Sbjct: 398  ARIKVRYNYSGFLPANVRIFDNEGNKFFVQVVTHPEGKWLIERNVRLHGTFKRQAAASFD 457

Query: 393  DINSLENRNLNFDSRKQLDANDFSNSLDLIRVRQVILDEES---DIVNKEDRMNELPAFS 452
            D N  E+    F+  + +  +  S S D    +    D+ S    ++ K DR   LP+F 
Sbjct: 458  DFNP-ESEQFFFEGSEAISPDFLSTSSD--GRKSSTPDQPSALKSVIIKPDRNATLPSF- 517

Query: 453  RHEEAFNEDLDISKDVSAQDKYLNGELVLSMDTSVQDQNLKERVQVNEMLGSPKGASLHD 512
                  NE+L    ++ A       E++  +         K++V +     S        
Sbjct: 518  -----LNEELVNDSNLHATANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLDKSK 577

Query: 513  RCINNAGCKGFNARINEPTLALSPSLNDNEFN-ESGPQEAQQFQVFELSYKNDNAVNGIL 572
            R ++      FN          SPS   N FN +S P                N    + 
Sbjct: 578  RKVS------FN----------SPSNKTNIFNPDSAPA---------------NHSPSLN 637

Query: 573  NHDVQQVALKTYSRKKCSLSSAVMTNFKTNFNADHLESDCTHLIAGNKASGSAIINAGNG 632
            + + +Q   +  S KK   SS+   N K N N     +    ++A ++ +      A  G
Sbjct: 638  SPEKKQKVSRERSIKK--KSSSTQPNSKANQNKGVFITQPIQIVAHDRDA------AKKG 697

Query: 633  LSQAKVFKESSIQIPGGSNVFVRGIGSSFNHSIHSPVDSDDESMVSVSSEDSDQLLDKED 692
            LS      +     P              N S+    +SD+  +V ++   + +++ +  
Sbjct: 698  LSLTVDLGDLPALDP--------------NKSLEDHHNSDNAEVVDIT---NTEVVPETP 757

Query: 693  NVEQFSDDQIGESLESLFCEKVDGLGSQIIHESLLSPSQIPN------QFSSIVDTCGFQ 752
             ++   ++    S E+ + +       +  +       + P+      Q  S +   G +
Sbjct: 758  EMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKKNGLK 817

Query: 753  LCKISPQSSKVAETKQVAIDL---------KFIKSLWSSKEIGWSFVEAYGKSGGLLIMW 812
            L   +  S     T  +   +         + IKSLW S  I W    A G SGG+LI+W
Sbjct: 818  LSTDTDSSGATTSTNVLLNQMNSGLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILW 877

Query: 813  DESKLSVLEFLKGGYSLSVKCLTLCKKVCWVSNVYGPNDYKERRFLWFELRSLSYYCTDP 872
            D    S+L   +G +SLS   L       W++ +YGP   +ER   W EL +L +  + P
Sbjct: 878  DAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGPVKRRERIHFWAELHNLQHLNSFP 937

Query: 873  WCIGGDFNITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHS 932
            W +GGD N+ R   E   V   +   R  N FI ++ L++ PL+N +FTWS   N  + S
Sbjct: 938  WILGGDLNVIRMREESTSVLSSSHNSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFS 997

Query: 933  LIDRFLVTKEWDVLFDNSRVSRKARIFSDHFPLLLEAGS--FMWGPSPFRFYNSWLSQAE 992
             IDRFL    W+ LF         R  SDHFPL+ E  +    WGP PFR  +  LS  E
Sbjct: 998  RIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWGPIPFRLNSITLSDPE 1057

Query: 993  CDRIILDSLSIDRSQGWAGFVISSKFRNLKVAIKKWFAEFEDSRKSKEKNLLFELEFFDA 1052
              R +          G+ GF    + ++L   IK W  E   S    ++ ++ E++  D 
Sbjct: 1058 FKRNMGRWWENSIQAGYPGFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDK 1117

Query: 1053 KAEESLLSDEELDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDENTSFFHRFLAAKK 1091
            K  ++ L+ EE +  LA+K ++  L + + +   ++ K  WL+ GDEN+SFFHR  ++++
Sbjct: 1118 KELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQ 1132

BLAST of Spg002036 vs. NCBI nr
Match: XP_022158956.1 (uncharacterized protein LOC111025405 [Momordica charantia])

HSP 1 Score: 261.5 bits (667), Expect = 5.1e-65
Identity = 136/378 (35.98%), Postives = 208/378 (55.03%), Query Frame = 0

Query: 711  LCKISPQSSKVAETKQVAIDLKFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKLSVLE 770
            + +++P    + ETK   +D+  +KSLWS+  I WS ++A G + G+LI+W++  L   E
Sbjct: 24   ISRLNPNVVILQETKLSYMDILIVKSLWSAHGINWSALDASGMASGILILWNDPDLKAAE 83

Query: 771  FLKGGYSLSVKCLTLCKKVCWVSNVYGPNDYKERRFLWFELRSLSYYCTDPWCIGGDFNI 830
             ++G +SL++        + WVS +YGP+  +     W EL  LS  C + W + GDFN+
Sbjct: 84   MIEGVFSLTINFCLSDGFLFWVSGIYGPSTTEFHYLFWQELLDLSDLCENHWILAGDFNV 143

Query: 831  TRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHSLIDRFLVTK 890
            TRW  E+      TK M  FN FIEDS L+++PL+NG+ TWSR+    S SLID FL+T 
Sbjct: 144  TRWSWEKSNGRPLTKSMWLFNSFIEDSSLIDVPLTNGQHTWSRN---TSFSLIDCFLLTN 203

Query: 891  EWDVLFDNSRVSRKARIFSDHFPLLLEAGSFMWGPSPFRFYNSWLSQAECDRIILDSLSI 950
                        R  R  SDHFP+LL+ G   WG +PFRF N WLS       +      
Sbjct: 204  GCIDKLGMPIAKRMTRTTSDHFPILLDFGQNNWGLTPFRFENMWLSHKTFKPFLETWWGN 263

Query: 951  DRSQGWAGFVISSKFRNLKVAIKKWFAEFEDSRKSKEKNLLFELEFFDAKAEESLLSDEE 1010
                GW G  +  K ++LK AIK W  E      S++++L   +   D       ++ ++
Sbjct: 264  KPLHGWPGHGLMMKLKSLKYAIKLWITEHFRCIHSQKEDLTNLMNSLDDLEGSQPVTPDQ 323

Query: 1011 LDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDENTSFFHRFLAAKKRKNLITDLISS 1070
                +  K +++ +   +E    ++CK  WL  GDENT FFHRFLA K+R+++IT+++S 
Sbjct: 324  SRARIQAKEDLLSVVAKEEAFWRQRCKQKWLCEGDENTKFFHRFLANKRRRSIITEILSK 383

Query: 1071 NGVSLVSFREIEQEILDF 1089
             G+ L   ++IE+E +DF
Sbjct: 384  KGIGLTQIKDIEEEFIDF 398

BLAST of Spg002036 vs. NCBI nr
Match: KAA0056838.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] >TYJ99341.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])

HSP 1 Score: 259.2 bits (661), Expect = 2.5e-64
Identity = 271/1103 (24.57%), Postives = 441/1103 (39.98%), Query Frame = 0

Query: 23   FVEDTCNKRLIPLSIS--FLQWFEKVLVEILQNPVSS-FFHEKIKEEFGV-IRLIKFFSD 82
            ++ + C  +   + I+   L W      ++L    +  FF E+  E+  + +R  K  S 
Sbjct: 2    WLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSK 61

Query: 83   NEWFFECAVWPSTGGRRIIQVPAGLNKKGWYVFWEMIRDFILKIHSNENQPIRSLLSKEE 142
                 E     + G +  I VP G +  GW  F  +    I    S   + IRS + KE 
Sbjct: 62   TSITAEIFRIDNKGRKCSILVPEGPDSFGWKCFLAL----ITFRSSAPTKRIRSEIRKEP 121

Query: 143  SLPVFDKVSAGHASS-NSYAEVVKRGG-----SLKSSVSLNDSIRNAKGINEEAYWVRKN 202
                 D  S+   SS  SYA+V+             + S + S R +  I  + + +  N
Sbjct: 122  VSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGN 181

Query: 203  CDVLKLDLERSIVVSRLMAQYSWKDVKIALENFFKTFVLVNPFMDDKALI-----HAADG 262
                    E++++++R      W  +  +L    +      PF  DKA++     HA   
Sbjct: 182  ------SFEKTVIITRRCFHDDWNRIMFSLRKQSEIAFSYKPFQADKAILFLNSDHAKLL 241

Query: 263  GLEFSANGKWKKFGNLHLKLDFWSSEIHSQPKSIKSYGGWLAIRNIPLNLWHRDSFEAIG 322
                 ANG W   GN  +K + W S +HS    I SYGGWL  R IPL+LW+ ++F+ IG
Sbjct: 242  CSNKGANG-WSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIG 301

Query: 323  KNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKYEF---SLRYGDINS 382
               GG + ++  T+ +    +A I+V  N+ GF+PA I +       F   +++  +   
Sbjct: 302  SACGGFLDVAKETMQMDKLIDAKIKVRYNYTGFVPASILITDNQGENFIVTTVQPAEARW 361

Query: 383  LENRNL------------NFDSRKQLDANDFSNSLDLIRVRQVILDEESDIVNKEDRMNE 442
            L  RN+             FD    L      N    I         +  I N +     
Sbjct: 362  LVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYSIHNSDKH--- 421

Query: 443  LPAFSRHEEAFNEDLDISKDVSAQDKYLNGELVLSMDTSVQDQNLKERVQVNEMLGSPKG 502
              + S H +A        K+ S++ +Y              DQ L +R          KG
Sbjct: 422  --SISYHTQA-------KKNNSSESEY-----------DPFDQQLSDR-------RKEKG 481

Query: 503  ASLHDRCINNAGCKGFNAR----INEPTLALSPSLNDNEFNESGPQEAQQFQVFELSYKN 562
             ++    IN+     ++ R     N     LSP     + N S  +   + +  E+S  N
Sbjct: 482  KAI--LIINDQNHGHYSKRSKRISNRKVSFLSP--GGIQSNSSNTEINTKGKSLEISTIN 541

Query: 563  DNAVNGILNHDVQQVALKTYSRKKCSLSSAVMTNFKTNFNADHLESDCTHLIAGNKASGS 622
            D                     K+ S      T        D  ES   H +        
Sbjct: 542  DQ------------------FEKRWSPRQKTKTKLTYRIKKDPQESTEDHNL-------- 601

Query: 623  AIINAGNGLSQAKVFKESSIQIPGGSNVFVRGIGSSFNHSIHSPVDSDDESMVSVSSEDS 682
            ++   G G  Q  +    S+ + G  +     I S  NH + +  +   +   +  S DS
Sbjct: 602  SLKETGEGSKQMNL----SVDM-GPISPLESMIQSENNHGLDTLNNQTPDG--NSKSTDS 661

Query: 683  DQLLDKEDNVEQFSDDQIGESLESLFCEKVDG-LGSQIIHESLLSPSQIPNQF-SSIVDT 742
             +  +   +V++ +D     S  +      D   GS++         +I   F   +V  
Sbjct: 662  AEAKNLTVSVKEGADQNKSASRSTAEGNSKDAKTGSEM---------EIDRAFKEKLVIW 721

Query: 743  CGFQLCKISPQSSKVAETKQVAIDLKFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKL 802
                  K+SP+      T  V     F   + S + +  +     G  GG+L++WD++K 
Sbjct: 722  LKENELKLSPK-----YTNDVPSSSSF-PVIVSDQNMDIAGHGPLGDKGGILVLWDDTKF 781

Query: 803  SVLEFLKGGYSLSVKCLTLCKKVCWVSNVYGPNDYKERRFLWFELRSLSYYCTDPWCIGG 862
             V +   G YS+S+  L       W+++VYGP  Y +R  LW EL  L   C   W I G
Sbjct: 782  KVNDIKVGNYSISLNILN-TNGNWWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAG 841

Query: 863  DFNITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHSLIDRF 922
            DFNI RW  E        + M  FN FI  + L++ PL N  FTWS      ++S +DRF
Sbjct: 842  DFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPPLLNNNFTWSNLRVNPTYSRLDRF 901

Query: 923  LVTKEWDVLFDNSRVSRKARIFSDHFPLLLEAGSFMWGPSPFRFYNSWLSQAECDRIILD 982
            L++K W+  F         R  SDHFP+LLE+    WGP PFR  NS L   +  +  ++
Sbjct: 902  LLSKGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKDFQKNFIN 961

Query: 983  SLSIDRSQGWAGFVISSKFRNLKVAIKKWFAEFEDSRKSKEKNLLFELEFFDAKAEESLL 1042
              +  +  G+ G+       +L   IK+W     +   + +K LL E++  D    +  +
Sbjct: 962  WWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKKALLKEIDIIDKLEFQGEM 1010

Query: 1043 SDEELDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDENTSFFHRFLAAKKRKNLITD 1090
            S       +++K +++ +  +  +   ++ +  W  LGDEN S+FHR     +RKNLI  
Sbjct: 1022 STTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKS 1010

BLAST of Spg002036 vs. ExPASy TrEMBL
Match: A0A5D3BHE3 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold429G00120 PE=4 SV=1)

HSP 1 Score: 325.1 bits (832), Expect = 1.8e-84
Identity = 142/233 (60.94%), Postives = 176/233 (75.54%), Query Frame = 0

Query: 729 IDLKFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKLSVLEFLKGGYSLSVKCLTLCKK 788
           ID+  IKSLWSSK+IGW  VE++G+ GG+L MWD SK+ V+E LKGGYSLS+  +T CKK
Sbjct: 82  IDIALIKSLWSSKDIGWELVESFGRFGGILTMWDMSKIKVVETLKGGYSLSINSITSCKK 141

Query: 789 VCWVSNVYGPNDYKERRFLWFELRSLSYYCTDPWCIGGDFNITRWVHERFPVGRQTKGMR 848
            CW++NVYGP DY+ERRF+W  L SLS YCT  WCIGG  NITRW HE FP+ +QT+GMR
Sbjct: 142 SCWITNVYGPYDYEERRFVWLVLVSLSGYCTGAWCIGGKCNITRWAHECFPLEKQTRGMR 201

Query: 849 RFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHSLIDRFLVTKEWDVLFDNSRVSRKARIF 908
           +FN  I+   + E+PL NG+ TWSR+G++ S SL+D F + KEWD + +NSRV RKA   
Sbjct: 202 QFNNPIDSLNIWELPLQNGRCTWSREGSSISRSLLDPFFIDKEWDEISENSRVGRKAHTI 261

Query: 909 SDHFPLLLEAGSFMWGPSPFRFYNSWLSQAECDRIILDSLSIDRSQGWAGFVI 962
           SDHFPLLLEAGS  WGPSPFRF NSWL  +EC+RII +  +I     WAGFV+
Sbjct: 262 SDHFPLLLEAGSIKWGPSPFRFSNSWLPFSECNRIIKEVWNITSITDWAGFVL 314

BLAST of Spg002036 vs. ExPASy TrEMBL
Match: A0A5D3BLV7 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G005290 PE=4 SV=1)

HSP 1 Score: 271.9 bits (694), Expect = 1.8e-68
Identity = 272/1111 (24.48%), Postives = 452/1111 (40.68%), Query Frame = 0

Query: 33   IPLSISFLQWFEKVLVEILQNPVSSFFHEKIKEEFGVIRLIKFFSDNEWFFECAVWPSTG 92
            I +S   L W    L  ++  P ++ F  + ++    I + K  +      E        
Sbjct: 98   IEVSPRDLDWIRCTLKSLIATPNTNRFFLETRDSEQRIWIRKTRNSKGCTAEIFRVDQKN 157

Query: 93   GRRIIQVPAGLNKKGWYVFWEMIRDFI-LKIHSNENQPIRSLLSKEESLPVFDKVSAGHA 152
             +  I VP G +K GW  F  MI   + +K  +      R+      S P+         
Sbjct: 158  RKSCILVPEGPDKSGWVSFLSMITPKVEVKAKTRPTFLPRTSPDCRLSPPI-------DY 217

Query: 153  SSNSYAEVVKRGGSLKSSVSLNDSIRNAKGINEEAYWVRKNCDVLKLD-LERSIVVSRLM 212
               SYA+ V  G    +S S +DS  ++   +  +      CD    D LE ++V+ R  
Sbjct: 218  HKRSYAKAVTEGRPFATSDS-SDSYDSSDSSHSSS---NSFCDSPSSDLLENTVVIVRRF 277

Query: 213  AQYSWKDVKIALENFFKTFVLVNPFMDDKALIHAADG--GLEFSANGKWKKFGNLHLKLD 272
                W  +   L    +     N F  +KAL+H +          N  W   G   ++ +
Sbjct: 278  FHDDWHKILQNLRKQTEESFTYNAFHAEKALVHFSSNIPANLLCQNKGWSTVGKYSVRFE 337

Query: 273  FWSSEIHSQPKSIKSYGGWLAIRNIPLNLWHRDSFEAIGKNLGGLVSISSNTLNLLDCSE 332
             WS   H+ PK I SYGGW   R IPL+LW+  +F+ IGK   GL+ ++  T +  +  E
Sbjct: 338  KWSPVYHATPKLIPSYGGWTTFRGIPLHLWNMMTFQQIGKACEGLIKVAEETRSAKNLIE 397

Query: 333  AFIEVEKNFCGFIPADINV--KIGNKY--------------------------EFSLRYG 392
            A I+V  N+ GF+PA++ +    GNK+                          + +  + 
Sbjct: 398  ARIKVRYNYSGFLPANVRIFDNEGNKFFVQVVTHPEGKWLIERNVRLHGTFKRQAAASFD 457

Query: 393  DINSLENRNLNFDSRKQLDANDFSNSLDLIRVRQVILDEES---DIVNKEDRMNELPAFS 452
            D N  E+    F+  + +  +  S S D    +    D+ S    ++ K DR   LP+F 
Sbjct: 458  DFNP-ESEQFFFEGSEAISPDFLSTSSD--GRKSSTPDQPSALKSVIIKPDRNATLPSF- 517

Query: 453  RHEEAFNEDLDISKDVSAQDKYLNGELVLSMDTSVQDQNLKERVQVNEMLGSPKGASLHD 512
                  NE+L    ++ A       E++  +         K++V +     S        
Sbjct: 518  -----LNEELVNDSNLHATANKSKLEILSGISNDGVLDKGKQKVDIQLQPNSALNLDKSK 577

Query: 513  RCINNAGCKGFNARINEPTLALSPSLNDNEFN-ESGPQEAQQFQVFELSYKNDNAVNGIL 572
            R ++      FN          SPS   N FN +S P                N    + 
Sbjct: 578  RKVS------FN----------SPSNKTNIFNPDSAPA---------------NHSPSLN 637

Query: 573  NHDVQQVALKTYSRKKCSLSSAVMTNFKTNFNADHLESDCTHLIAGNKASGSAIINAGNG 632
            + + +Q   +  S KK   SS+   N K N N     +    ++A ++ +      A  G
Sbjct: 638  SPEKKQKVSRERSIKK--KSSSTQPNSKANQNKGVFITQPIQIVAHDRDA------AKKG 697

Query: 633  LSQAKVFKESSIQIPGGSNVFVRGIGSSFNHSIHSPVDSDDESMVSVSSEDSDQLLDKED 692
            LS      +     P              N S+    +SD+  +V ++   + +++ +  
Sbjct: 698  LSLTVDLGDLPALDP--------------NKSLEDHHNSDNAEVVDIT---NTEVVPETP 757

Query: 693  NVEQFSDDQIGESLESLFCEKVDGLGSQIIHESLLSPSQIPN------QFSSIVDTCGFQ 752
             ++   ++    S E+ + +       +  +       + P+      Q  S +   G +
Sbjct: 758  EMKMPVNENSNSSSEANYRKPKHVHKRKYYYRKKEEKEKDPDSEAFKKQLVSWLKKNGLK 817

Query: 753  LCKISPQSSKVAETKQVAIDL---------KFIKSLWSSKEIGWSFVEAYGKSGGLLIMW 812
            L   +  S     T  +   +         + IKSLW S  I W    A G SGG+LI+W
Sbjct: 818  LSTDTDSSGATTSTNVLLNQMNSGLKITNKRIIKSLWPSNSINWIAKNASGSSGGILILW 877

Query: 813  DESKLSVLEFLKGGYSLSVKCLTLCKKVCWVSNVYGPNDYKERRFLWFELRSLSYYCTDP 872
            D    S+L   +G +SLS   L       W++ +YGP   +ER   W EL +L +  + P
Sbjct: 878  DAQNHSLLSQEEGLFSLSANFLLNNNSSWWLTGLYGPVKRRERIHFWAELHNLQHLNSFP 937

Query: 873  WCIGGDFNITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHS 932
            W +GGD N+ R   E   V   +   R  N FI ++ L++ PL+N +FTWS   N  + S
Sbjct: 938  WILGGDLNVIRMREESTSVLSSSHNSRMLNNFISNNLLIDPPLTNNRFTWSNLRNPPTFS 997

Query: 933  LIDRFLVTKEWDVLFDNSRVSRKARIFSDHFPLLLEAGS--FMWGPSPFRFYNSWLSQAE 992
             IDRFL    W+ LF         R  SDHFPL+ E  +    WGP PFR  +  LS  E
Sbjct: 998  RIDRFLYNSSWENLFSPHTTRTLPRSTSDHFPLVCEDSNPKLSWGPIPFRLNSITLSDPE 1057

Query: 993  CDRIILDSLSIDRSQGWAGFVISSKFRNLKVAIKKWFAEFEDSRKSKEKNLLFELEFFDA 1052
              R +          G+ GF    + ++L   IK W  E   S    ++ ++ E++  D 
Sbjct: 1058 FKRNMGRWWENSIQAGYPGFSFIQRLKSLANFIKPWQKEKLHSLTYAKEAIIREVDSIDK 1117

Query: 1053 KAEESLLSDEELDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDENTSFFHRFLAAKK 1091
            K  ++ L+ EE +  LA+K ++  L + + +   ++ K  WL+ GDEN+SFFHR  ++++
Sbjct: 1118 KELDTPLTQEESNRRLALKADLSELSLKESQFWYQRAKKLWLREGDENSSFFHRICSSRQ 1132

BLAST of Spg002036 vs. ExPASy TrEMBL
Match: A0A6J1E2G6 (uncharacterized protein LOC111025405 OS=Momordica charantia OX=3673 GN=LOC111025405 PE=4 SV=1)

HSP 1 Score: 261.5 bits (667), Expect = 2.5e-65
Identity = 136/378 (35.98%), Postives = 208/378 (55.03%), Query Frame = 0

Query: 711  LCKISPQSSKVAETKQVAIDLKFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKLSVLE 770
            + +++P    + ETK   +D+  +KSLWS+  I WS ++A G + G+LI+W++  L   E
Sbjct: 24   ISRLNPNVVILQETKLSYMDILIVKSLWSAHGINWSALDASGMASGILILWNDPDLKAAE 83

Query: 771  FLKGGYSLSVKCLTLCKKVCWVSNVYGPNDYKERRFLWFELRSLSYYCTDPWCIGGDFNI 830
             ++G +SL++        + WVS +YGP+  +     W EL  LS  C + W + GDFN+
Sbjct: 84   MIEGVFSLTINFCLSDGFLFWVSGIYGPSTTEFHYLFWQELLDLSDLCENHWILAGDFNV 143

Query: 831  TRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHSLIDRFLVTK 890
            TRW  E+      TK M  FN FIEDS L+++PL+NG+ TWSR+    S SLID FL+T 
Sbjct: 144  TRWSWEKSNGRPLTKSMWLFNSFIEDSSLIDVPLTNGQHTWSRN---TSFSLIDCFLLTN 203

Query: 891  EWDVLFDNSRVSRKARIFSDHFPLLLEAGSFMWGPSPFRFYNSWLSQAECDRIILDSLSI 950
                        R  R  SDHFP+LL+ G   WG +PFRF N WLS       +      
Sbjct: 204  GCIDKLGMPIAKRMTRTTSDHFPILLDFGQNNWGLTPFRFENMWLSHKTFKPFLETWWGN 263

Query: 951  DRSQGWAGFVISSKFRNLKVAIKKWFAEFEDSRKSKEKNLLFELEFFDAKAEESLLSDEE 1010
                GW G  +  K ++LK AIK W  E      S++++L   +   D       ++ ++
Sbjct: 264  KPLHGWPGHGLMMKLKSLKYAIKLWITEHFRCIHSQKEDLTNLMNSLDDLEGSQPVTPDQ 323

Query: 1011 LDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDENTSFFHRFLAAKKRKNLITDLISS 1070
                +  K +++ +   +E    ++CK  WL  GDENT FFHRFLA K+R+++IT+++S 
Sbjct: 324  SRARIQAKEDLLSVVAKEEAFWRQRCKQKWLCEGDENTKFFHRFLANKRRRSIITEILSK 383

Query: 1071 NGVSLVSFREIEQEILDF 1089
             G+ L   ++IE+E +DF
Sbjct: 384  KGIGLTQIKDIEEEFIDF 398

BLAST of Spg002036 vs. ExPASy TrEMBL
Match: A0A5D3BKT8 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G005570 PE=4 SV=1)

HSP 1 Score: 259.2 bits (661), Expect = 1.2e-64
Identity = 271/1103 (24.57%), Postives = 441/1103 (39.98%), Query Frame = 0

Query: 23   FVEDTCNKRLIPLSIS--FLQWFEKVLVEILQNPVSS-FFHEKIKEEFGV-IRLIKFFSD 82
            ++ + C  +   + I+   L W      ++L    +  FF E+  E+  + +R  K  S 
Sbjct: 2    WLTEVCQHKSFSMDITPDTLAWIRNCFKDLLDTSTTKHFFAERRMEDNCMWVRKTKNKSK 61

Query: 83   NEWFFECAVWPSTGGRRIIQVPAGLNKKGWYVFWEMIRDFILKIHSNENQPIRSLLSKEE 142
                 E     + G +  I VP G +  GW  F  +    I    S   + IRS + KE 
Sbjct: 62   TSITAEIFRIDNKGRKCSILVPEGPDSFGWKCFLAL----ITFRSSAPTKRIRSEIRKEP 121

Query: 143  SLPVFDKVSAGHASS-NSYAEVVKRGG-----SLKSSVSLNDSIRNAKGINEEAYWVRKN 202
                 D  S+   SS  SYA+V+             + S + S R +  I  + + +  N
Sbjct: 122  VSTFSDSFSSDSDSSRKSYAKVLSDSSEDDNKKRYKATSDDSSSRRSSSIGFKPFTLSGN 181

Query: 203  CDVLKLDLERSIVVSRLMAQYSWKDVKIALENFFKTFVLVNPFMDDKALI-----HAADG 262
                    E++++++R      W  +  +L    +      PF  DKA++     HA   
Sbjct: 182  ------SFEKTVIITRRCFHDDWNRIMFSLRKQSEIAFSYKPFQADKAILFLNSDHAKLL 241

Query: 263  GLEFSANGKWKKFGNLHLKLDFWSSEIHSQPKSIKSYGGWLAIRNIPLNLWHRDSFEAIG 322
                 ANG W   GN  +K + W S +HS    I SYGGWL  R IPL+LW+ ++F+ IG
Sbjct: 242  CSNKGANG-WSTVGNYQVKFESWDSNLHSFHSVIPSYGGWLRFRGIPLHLWNYNTFQHIG 301

Query: 323  KNLGGLVSISSNTLNLLDCSEAFIEVEKNFCGFIPADINVKIGNKYEF---SLRYGDINS 382
               GG + ++  T+ +    +A I+V  N+ GF+PA I +       F   +++  +   
Sbjct: 302  SACGGFLDVAKETMQMDKLIDAKIKVRYNYTGFVPASILITDNQGENFIVTTVQPAEARW 361

Query: 383  LENRNL------------NFDSRKQLDANDFSNSLDLIRVRQVILDEESDIVNKEDRMNE 442
            L  RN+             FD    L      N    I         +  I N +     
Sbjct: 362  LVERNVRVHGSFRTKAADEFDQHNHLAETYTYNGFQAIPPEPTRTHGDYSIHNSDKH--- 421

Query: 443  LPAFSRHEEAFNEDLDISKDVSAQDKYLNGELVLSMDTSVQDQNLKERVQVNEMLGSPKG 502
              + S H +A        K+ S++ +Y              DQ L +R          KG
Sbjct: 422  --SISYHTQA-------KKNNSSESEY-----------DPFDQQLSDR-------RKEKG 481

Query: 503  ASLHDRCINNAGCKGFNAR----INEPTLALSPSLNDNEFNESGPQEAQQFQVFELSYKN 562
             ++    IN+     ++ R     N     LSP     + N S  +   + +  E+S  N
Sbjct: 482  KAI--LIINDQNHGHYSKRSKRISNRKVSFLSP--GGIQSNSSNTEINTKGKSLEISTIN 541

Query: 563  DNAVNGILNHDVQQVALKTYSRKKCSLSSAVMTNFKTNFNADHLESDCTHLIAGNKASGS 622
            D                     K+ S      T        D  ES   H +        
Sbjct: 542  DQ------------------FEKRWSPRQKTKTKLTYRIKKDPQESTEDHNL-------- 601

Query: 623  AIINAGNGLSQAKVFKESSIQIPGGSNVFVRGIGSSFNHSIHSPVDSDDESMVSVSSEDS 682
            ++   G G  Q  +    S+ + G  +     I S  NH + +  +   +   +  S DS
Sbjct: 602  SLKETGEGSKQMNL----SVDM-GPISPLESMIQSENNHGLDTLNNQTPDG--NSKSTDS 661

Query: 683  DQLLDKEDNVEQFSDDQIGESLESLFCEKVDG-LGSQIIHESLLSPSQIPNQF-SSIVDT 742
             +  +   +V++ +D     S  +      D   GS++         +I   F   +V  
Sbjct: 662  AEAKNLTVSVKEGADQNKSASRSTAEGNSKDAKTGSEM---------EIDRAFKEKLVIW 721

Query: 743  CGFQLCKISPQSSKVAETKQVAIDLKFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKL 802
                  K+SP+      T  V     F   + S + +  +     G  GG+L++WD++K 
Sbjct: 722  LKENELKLSPK-----YTNDVPSSSSF-PVIVSDQNMDIAGHGPLGDKGGILVLWDDTKF 781

Query: 803  SVLEFLKGGYSLSVKCLTLCKKVCWVSNVYGPNDYKERRFLWFELRSLSYYCTDPWCIGG 862
             V +   G YS+S+  L       W+++VYGP  Y +R  LW EL  L   C   W I G
Sbjct: 782  KVNDIKVGNYSISLNILN-TNGNWWLTSVYGPYKYNDRTKLWPELEILQSLCLPNWLIAG 841

Query: 863  DFNITRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHSLIDRF 922
            DFNI RW  E        + M  FN FI  + L++ PL N  FTWS      ++S +DRF
Sbjct: 842  DFNIVRWERETNAKSLDKRNMANFNNFISVNELIDPPLLNNNFTWSNLRVNPTYSRLDRF 901

Query: 923  LVTKEWDVLFDNSRVSRKARIFSDHFPLLLEAGSFMWGPSPFRFYNSWLSQAECDRIILD 982
            L++K W+  F         R  SDHFP+LLE+    WGP PFR  NS L   +  +  ++
Sbjct: 902  LLSKGWENAFGLHTSRTLERNISDHFPILLESPQIKWGPCPFRLNNSSLRDKDFQKNFIN 961

Query: 983  SLSIDRSQGWAGFVISSKFRNLKVAIKKWFAEFEDSRKSKEKNLLFELEFFDAKAEESLL 1042
              +  +  G+ G+       +L   IK+W     +   + +K LL E++  D    +  +
Sbjct: 962  WWNNSKQAGFPGYAFIQSLNSLSKFIKEWQHNKVNLYDANKKALLKEIDIIDKLEFQGEM 1010

Query: 1043 SDEELDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDENTSFFHRFLAAKKRKNLITD 1090
            S       +++K +++ +  +  +   ++ +  W  LGDEN S+FHR     +RKNLI  
Sbjct: 1022 STTHHQKRISLKSDLLSIENNQAQIWHQRARQRWNLLGDENNSYFHRICTINQRKNLIKS 1010

BLAST of Spg002036 vs. ExPASy TrEMBL
Match: A0A803QQM3 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 246.1 bits (627), Expect = 1.1e-60
Identity = 131/380 (34.47%), Postives = 195/380 (51.32%), Query Frame = 0

Query: 711  LCKISPQSSKVAETKQVAIDLKFIKSLWSSKEIGWSFVEAYGKSGGLLIMWDESKLSVLE 770
            +CK +P    + E K+  +D +FI S+W S+   W  + A G+SGG L++WD   +SVL+
Sbjct: 947  ICKANPDLVILQEVKRATVDRRFIGSIWRSRFKAWILLPALGRSGGTLLIWDTRTISVLD 1006

Query: 771  FLKGGYSLSVKCLTLCKKVCWVSNVYGPNDYKERRFLWFELRSLSYYCTDPWCIGGDFNI 830
             L G +S+SV      K+  W S VYGP  YK R   W EL  LS  C + WC+GGDFN+
Sbjct: 1007 SLVGEFSISVLINAEGKEPWWFSGVYGPCSYKLRPEFWDELAGLSSICGESWCVGGDFNV 1066

Query: 831  TRWVHERFPVGRQTKGMRRFNKFIEDSGLMEIPLSNGKFTWSRDGNAYSHSLIDRFLVTK 890
            TR V E+      T+ M+ F+  I +  L++  L NG FTWS    +   S +DRFL + 
Sbjct: 1067 TRRVGEKLNSSSCTRSMKLFDGLIRELQLIDPKLENGSFTWSNFRASPVCSRLDRFLFSN 1126

Query: 891  EWDVLFDNSRVSRKARIFSDHFPLLLEAGSFMWGPSPFRFYNSWLSQAECDRIILDSLSI 950
             W+V++   R     R+ SDH P+++++    WGP PFRF N WL      +        
Sbjct: 1127 NWNVIYPFVRQEMLVRLVSDHSPVVIDSNPPKWGPGPFRFDNHWLEHKSFSKCFESWWKE 1186

Query: 951  DRSQGWAGFVISSKFRNLKVAIKKWFAEFEDSRKSKEKNLLFELEFFDAKAEESLLSDEE 1010
            + + GW G     K + L+  +K+W        K+ +  L   L   D     S  +   
Sbjct: 1187 EINDGWPGTKFMKKLKLLQGKVKEWSKSTFGQNKATKIALEGRLGVLDRLEGTSSWNQSV 1246

Query: 1011 LDILLAIKGEIMGLYMSDERNLIKKCKLNWLKLGDENTSFFHRFLAAKKRKNLITDLISS 1070
            LD    +K E   L+  +ER +  K K  W + GD N+  FH  L A+K KN I+ +   
Sbjct: 1247 LDERRKLKEEWQQLHFEEERGIWLKSKCKWAREGDANSRLFHNLLNARKAKNTISRIERD 1306

Query: 1071 NGVSLVSFREIEQEILDFFS 1091
            NG  + + +EI +E++ FFS
Sbjct: 1307 NGDIIDNEKEIVEELIAFFS 1326

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
TYJ98683.13.8e-8460.94hypothetical protein E5676_scaffold429G00120 [Cucumis melo var. makuwa][more]
XP_038904301.11.0e-7350.88uncharacterized protein LOC120090656 [Benincasa hispida][more]
TYJ99315.13.8e-6824.48LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa][more]
XP_022158956.15.1e-6535.98uncharacterized protein LOC111025405 [Momordica charantia][more]
KAA0056838.12.5e-6424.57LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] >TYJ993... [more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A5D3BHE31.8e-8460.94Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A5D3BLV71.8e-6824.48LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119... [more]
A0A6J1E2G62.5e-6535.98uncharacterized protein LOC111025405 OS=Momordica charantia OX=3673 GN=LOC111025... [more]
A0A5D3BKT81.2e-6424.57LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119... [more]
A0A803QQM31.1e-6034.47Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (cylindrica) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR036691Endonuclease/exonuclease/phosphatase superfamilyGENE3D3.60.10.10Endonuclease/exonuclease/phosphatasecoord: 704..917
e-value: 3.3E-24
score: 88.0
IPR036691Endonuclease/exonuclease/phosphatase superfamilyGENE3D3.60.10.10Endonuclease/exonuclease/phosphatasecoord: 1606..1714
e-value: 2.9E-5
score: 25.6
IPR036691Endonuclease/exonuclease/phosphatase superfamilySUPERFAMILY56219DNase I-likecoord: 730..917
IPR025558Domain of unknown function DUF4283PFAMPF14111DUF4283coord: 259..323
e-value: 1.2E-8
score: 34.6
NoneNo IPR availablePANTHERPTHR33710BNAC02G09200D PROTEINcoord: 778..1042
NoneNo IPR availablePANTHERPTHR33710BNAC02G09200D PROTEINcoord: 1645..1731
NoneNo IPR availablePANTHERPTHR33710:SF32SUBFAMILY NOT NAMEDcoord: 778..1042
NoneNo IPR availablePANTHERPTHR33710:SF32SUBFAMILY NOT NAMEDcoord: 1645..1731

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Spg002036.1Spg002036.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0110165 cellular anatomical entity