Lag0035208 (gene) Sponge gourd (AG‐4) v1

Overview
Name: Lag0035208
Type: gene
Organism: Luffa acutangula (Sponge gourd (AG‐4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: chr3: 16664363 .. 16675367 (-)
RNA-Seq Expression: Lag0035208
Synteny: Lag0035208
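
The Location field can be read programmatically. The sketch below parses the location string from this record and computes the length of the genomic span. It assumes the coordinates are 1-based and inclusive, which the record does not state; parse_location is a hypothetical helper name, not part of any database API.

```python
def parse_location(text):
    """Split 'chr3: 16664363 .. 16675367 (-)' into (chrom, start, end, strand)."""
    chrom, rest = text.split(":", 1)
    coords, strand = rest.rsplit("(", 1)
    # int() tolerates the surrounding whitespace left by the split.
    start, end = (int(part) for part in coords.split(".."))
    return chrom.strip(), start, end, strand.rstrip(")").strip()


chrom, start, end, strand = parse_location("chr3: 16664363 .. 16675367 (-)")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, strand, span)
```

Under that assumption the feature spans 11005 bp on the minus strand of chr3.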
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAATTTTTGGGCGCGTTCCACCTCCTGCCCGTCTACGTAAGTGGCGCAGTATGAAAAAATAGAAAAAAAAGTTTCGTTGCGCTTTCTACTAAAATCTGAACGAAAAAAAAAGAAAAGAACCCTCATGTCTTGGTTTACATGGTGATTTAACAAATATTCTTCTAACTTACCACTCACCACCCCCACTCCCAACCCCACTTCTCCCTTTCCGTTCTCTGTTACTGCCCCTTTAGAGTTATCCTCCATTCAAAAAAGTGGGACATCATCCATTATTTTGCAATTGTTCATTTATAAAGATGTTTTTCCCCTGATATAATGGGAACATGCAAGGACATAGACAATCATAAATATTATTGGATTGCCAGTTTAAATGAAAACTTTGATTCATTACATGTTTCTTTACGAAATTGGATGCTTAATTATCACGTGTTAGATTTTTGATAACTTACAGAAGCAATGTTTTCTTTTTTCTTTTTCTTTTTTAAATATTTTTTAATCTTTTTTTGTTTGTGTAGGTTGATTTTGTTCCAACTTTTGATAATTCCCAAAAGGAACCTTCACTTCTACCAGCTCAGCTCCCAACTCTATTGTCGAACGGTTCCTCAGGGATTACGGTGATCCATCTCAGTTTTCTCTTTCCTTTTTCCCCTCACAGATGGTTTTTTGACGTTCTCATTTAAAGATAAATACTTTCAGATTAAAATGACAACTAATACTCCACCACATAATCTTGGCGAGTTAGTAGATGCACTCTGTTTTCTAATTCATAATCCAAAGGCTACCGTGAGTACTCTCTCTCTTTCTTCATGGATATTCATGGAAATTGACTTCTTGTTTCTGAATGTTTGTTATGAGTAGTGTGTGGTGAACTGCATGTTGTATTCCCTTTGCACAATAAGTTTTGGAAACTTTTTTCTGTTTGGTTATGGGAGGACCTTAATAGCAGTATACTTGAAAGCCAAATGTTATATTACAAAGTTTTAAAGATCTTGGTAATAGTCTTTCGTGTAAAAGATCTTTGTATCTTTGGTAAATATTGGATAATAAAATATTTTCCTTTTAACCTAGTTTAGTTCCTAAGCTTTCAATTGTGTCCTTTATATCGAATAAATCTTTGTATCGAATAAATCTTTGATATTATTAACTTCATTTAATTAAAATTTTCCTTTTTCTTACTAAATTTTGTTAACTTTTTCAAATTTCAAAATTTCCAAGGCGATGATATTATTGCTTGCATCATATTAGTAATTAGTACTAGATCTACTTGCAACATGAACTTTTTTCAAAAGAAGTCTTGACTACTCACATTTGAAAGAAACATTTGAGTCTTTTTATCTTTGAAAATGAAATGTTCAATGGGAAGATACAAATCTTTTTTGTTTTTTAGATGTTAAGTAAAATTACCAAATAAAGTTAGTTTAAAAGTGCTACATTTTGACACAAGTTTTGGGTACTCCTGAACACTCAGGACGTGTAAGAGGCATTGGTGAATTAATTGTACCTTCCTACTTCTTTCACAAGCCAATTCCATCAATTTCCAAGAATGTCGTAGTAGAGGATGAACCAAAAGGTTGGAAAAAAAATGCACAACTCCAGAGACAATACAATGAAATGGAGAAGAAATTGCGTGAAGTGCAGTCACAAATGGAGAGTGGTAGACTAACACCAATGTCAGATCAAGGTAGTTGCCCACAAGTACCAGACCCTCAGCCTTCTGATGAAAAACCTGACCAGGTAAACAATATTGGAATTATATTGAAATGTATTTTATGACAATACTTGATACATATAATATTAAATGAAAGTTCTAATATATAATTTTAATTTTAGGGGAGATCATGTAAACTTGTTGTTGAAGATGTAAAGAATATTGTGACGACAGAAATTGTATATAAGAGGAAAGACGACCATGAAATTGTTTATGGTGTCCCACTCATAGTATCTCATAGGAGCAAAGTGATTACAGATGTGGTACTAATCGATTTTTATTCAACTTA
TCTTTTTTATATTTATTTTAATTTTTGTTATGTAGTATGTTTACTTTGTTGGGTTTTATGCCCTAAAACTCGTAGATAGTAAATGTACATTTGGCCGAACATTAATAAACGTGATGTATTATTCAATGTTTGTATTATATCTTGTCTTAATAACCCTAATCCAATAAACTAAACATCCAAGGCTAATAGAATGAGGCTTGAACTAGTATGTAGGGACATACGGGGATCAATGTTCGAGTTTTAGCCTAAAGAGTATGTAGTATAGGGATAAGGCTGGGTACCTTATCCTGGTGACACTATGGATACGGCCCGCTTTGTATATTGATACAAACGTAGTGATCCAACGCGTTCATGTAGTTGACATGCAAGTGGGGGTATCCTGCGCAATGAGTTTGCACAAAGATCGGACCGCGAAATAGTTTACCACTAACTGTAACACCGTTAGTTAGCTCGGTTTCTATTTCACTAGGATGACCTAGGCAACTTGGCCTTAATCCTGAGTGGATTATGGACTCCTGTCCATGAGGGATTGTCCTTTGATTTGTACGGGTGAGAGTGGTCTGTTCGTCGACTCAATAAGCCTACCATTTTGGGGACAAGACCGAGTGGGGAGCTGGGAACGTAGTCTTACAAGATGGAATTCACTCCTTCCCGATATTAGGGTAAGTAGAGGTGTGTTCCCTTAAGTGGTGTCTCCGGGTCTTGAACAATGGGTCATACCCTCTTTATGGCACGAGAGAGATTTCTGTTTGTTGGTTGGACCTCAAACAGGTTGTTCATTGGAGGAGCACTGGTACTTAAGGACCAAGAGGTAGCCCAGGGGTAAAACGGTAATTTGACCTAACTGGGGTACGAACACACGTGAAGGGCTAATTTGCTGTTGACGGTCGATATCCGTGGACACAGAAATATATCTACAGTGAGAAGAGTGCAGCTGTGGTTCTTTAGTGGAGTGAACCACAGTTGACGAATATTGATTAACTTGGTTAATGAGTTTAGTCAACTAATCTCATATCATTGGAGCTTCTGATATGTAGGTCCATTAGGTCCCACCGGTAGCTCATTTAGGGCGTTGAGGCAAAATTCTAAGGAAGGAATGAATATTCGAAGGTGTTCGAATGTTAAGCCAAAGTATGAATTTGATTCATATTAAAACTATAGGTTATAATTAATGTGCATGGGATGTATATTAATACTATAGTTTTATGAGAGAGAATAATAATTGAATATGTGATATTCAAATTTATTATTTTAAATTAATTAATTTTGAATTAATTAATTGATTAAGTTTAATTAAATTTGATTTAATTAAACTATAGGTTAAAGAGGAATATTAATTTAAATAAGATTTAAATTAATGTTGTTTTAATTAATATTGATTAATTAAAATTTAATTAATTATAATTTATATTTTGAATTTAAAATTTATAATTAATATCAGGTAACTGATTGTGCAGTTACCTGATGATGCGTGAGGTGGCCCTAATTTTCCTCTCCTAATCGATAGGTAAAAGGGATGGGTCTTCAATTCAGTTTTATCGATCCCACATTCACGCAGACTCTCTCTCTCAAGCTTTCTCGATTTCTCCTTCTCTGTGATTGACAGAAAGAAGATCCTCCAGTTAATCTTTAGAGTTTTTGCTCCCACAAGCTCTCAACGCAAAATCCTGTTCGAGAATATTGGTGCGATTGCTTGGTGGTGTTCAAGGGTGATTTTCAGAAAGATAAAGGTTCTTTTGGTTGCTGGATTTTTTGCAAAAATCAAAGGAAAAGGCGAAACGTTCAAGAATTTCTACAAGGGTAAGTTGTTCTTGAACTAGGATCAATAGGATCCAATTTTAGATGAGATCTAATTTATTTTTATGGTTTAAAGCATGTATTATATCCGTTTTACAGCATGATAATTTTATTTTCTGCTCTAAATTTGTTTTGCCGTCGTATTGGTATCCCAATCGACGTTTTTATTCGAAAAAGTCTCTGACGTCTTCCGCTGCCTAGGG
TAATTTATCCCTACATACTTAACTTTTTTTTTTGTTTGTAGTTGACGAGAGGAACTGCATTTAGCCAATCAGAATTCGATGAGATAAGAGTTAAATTATGTGAATATGTAGTACAATACATGAGATGATTGTTTTCTAAGGTTATACTTAGTTTTTTTTATTCGTTTTTGACAACTTGTTACACACTGTGATATGTTCATAGTTTGATGAAATTAATATTATGACTTACTTAAGTTGTGTTCTTAGATAAATTTTTCGTTAATGAAATCTATTATGTGTTTGTTATGAGATCTGACTTGTACTATTTATACTTGAGTTCTGGTGTTACTTTCATATAACGTATTGAACATGTCATGTTATTATCAAGTGAAGTGAAGGTTGTGTTCTGGGTTTAGAAGTGTATGTTTATGTTCTTTCAGGCACCCAAAAATTTTTGGTTAGTTATAATTACGTTGTTATGCTGTCGAATTTTTGGTAAATCAAACTTCGTTACAGGGTAAAATTTAGTAATGTCATTTATATTATGTTGAATTTTGGAGGCATTACAATTGGTAGCTAGCTACTATTGTGTAAATATAGGTTGCAATGCAGGGGTAAAAACCAACGTGAGGTTTTTATTTTATTTTTTTCCAAATATGTATCGTTGACGGTTAAAAAACGTCATAACAAACACCATAATTGACATATATTAACCGTCAATGAAAAGACACAATTTGAAAGTTTAATACGGTAACGACTCATAATCGTCAATAAAATGATATATTGACAGCTCTTAAGCGTCAATATAAATATATATTTTGACATTTTTGATCTTATAATTGACATTTAAAAAATGTCAACATAAGTTTATATTAACGGTTTATAAATATCAATAAATATGCAACATTGATGCTTTATGGATATTATTGACGAATAAAATGTGTCAATAATACGATTTGTTGATGGCTAGTAAACGTCAACAAAGGTGCTTACGTTGACGCTTTATGAATATCATTGACGGATAAAACGTGTCAATAATGTCTTTGTTGATGACTTATAAACGTCAACAACGACATCTTCGTTGACGTCCTATAAGCGTCAATGAAAATAGTAATATTGACGCTTTAGTTTTATTGATATCACTTCAATGGACTTCTAAAAAGCGTCAATGAAAGTGAAAGTTGATACTTTTTAGCCGTCAACAAATTCCAAATTTGTAGTAGTGACGGTCTATAGGATAGGAATAAGACTGGGTACCTTATTCTCGTGACACTATAGATACGACCCACTTTGTATTTGATTCAAACACAATGATCCAACATGCTCGTGTAGGTGACATGCGAGTGGAGGTATCTTATGCAATGAGTTTGCATAAGACCAGATCGTGAAATATTTTGTCTGATACCTGTAAATAAACATACGTTTCCAAAATAAAGCACTCAAATCTTGACACAAAATAAATATCTACATAAATAACAACTTTTATATATCATTGAACCAAAATTGTCAAATTTTTACATATATGTATATACAAACTAAGTGTAATAGATTTTGTACACCACAACAATTCCTACACTGGACGTCAACCATGAGAAAATATATTATATGGTTTTTTGTCACTAACCTAACAAAAAAGGTCATACATCAATTGTCCACAATATGGTTTTTCTGGACTTCGTTGAGTTCTTGCAAAACATAACCAACCAAACAATTGTCATTTTAGAGTTGAATGCAATATTTCTTACATCAAGATGAGAGTAATGATATTCATTTTAATAATATGTATGAGTTATACCTTTAAAATGGTGACAATATTCCCAACTCAATACATTCGATCTTGCATGAATCTTCTTCTTGCTCAAAACATTGATCTCACAAACTAAAACATAAAAAACGAATACAAAAAAAAAAAAAAAAAACTTAATGCAAAATAAACAGTTGATAAAATAATTATCTTGAAATACATAATTTGATTTGTCTCATACAACAACAGTGAGCAATAAAATGAACTCCAAAATAGC
GTGAGCAACATAAAGGACTCCATCGATCATCCTACATTTCTAATTTATGATAGGAGTTTTGTGCCAATGAATAAAATACAAACTATTAAACTTACCAAAACATAGAAACATAACAACATTACTAATTTATGAAAAATTAAGAACGTAATAACGAAACTTATCTGGAACGTTCAACACTTACTAATACAACTATCAAGAACCCAACAACACAACAACTAAAACACTTACTAACAACACTCACTAATACAATTACTTACCAAGAACCCAACAACTAAACACAAAAACATGAACACAAACAACACTTATTAATACAATTACTTATTTAACTAATTATCAATATCTTCTTTATATATATAGTAACTAATTATTGTTTTTTGTTTGGGGGGAGGTGAGTTTATCTAAATAATGTAATATATCTATCTCATGAGCATCTCACATATGTAGTACATACATTCTAAGTACATATTCAGTTATACAAGTGAAACATGTAATTCTAAGTTATTATAGTGTACCTTTAAAACATGATAGATACTTCAAATTGAAAACATCTTCCTTGAATGTACTCCTTGTTCAAAACATTACTCTCATAAGCTTCTTCTGGTACAAAATTTGAATGCTCATATCAATATACTATCTAGAACAACTTAAAGGTAAAACGATAAGTTTATAAAATAATTAATATCTTACCAATAAGCCACTATTTCTCTCGATGACAAGATTTGATTCTGCCATAGAACAAAGGTGAGCAACAAGATGGACTCCATAACAGTTTGAACAACAACAAGGACTACAACATCCAATCATATGAGTTGTGATCTAACTCCAAAATAGTGTGAGCAACAAAAGTGTACATTATTCACACAATCTAATCTGTTAAGTTTATACCCAAAAAAAGATAACAGTATCTTATGCAAAGAATATGTGCGATCCTACGAACTTTTTCCACTCAGTTCTCACAATTTCGATATCATCTTGTGTGTAAGTAGATGGTGCATCTTTCATCTGTAAGCAAAAAAATAACAATTAACAGCTGAGTAATGCAATCTTAATTAAATGCATTTTAGATTGTGTGCATACTAGATCTATAATTGAGGTGTTCTTTTGATAGATTATCTCACACATAAATCACATTACATAAAATATCCGCATTTCACAACACATGTTTGTTTAGGGCATTGTGCATATAACATACAAACATCAAATCTTCAATAGATAAGATACAAGTTTAATAAAACTTTATAAGAACTATTAAGGTTGCTATTTATAAAATAATTTACCTTAAGGACTTCCAAATAGGCTTTTTCTTTTTCATAATCGTGAATGCCTTACATGTGCATGAAATATAAATGATATTTGTTATTAATAGAAACTAATCAATCATTAAATTATAATAGATAAGATGAGTTTAACATACATGCCAACTACATCAATTAGATCCTCATGAAGACGATTTTTGAGGGGTCCATAAAGAATGTAAGACCCTTAAAATGATCAATAAGAATCAAACTCGAATGATTTCTACAAATGAGAGTAATGCAACACATTAAATAATTCACACAAAACATGTTGATGTTTTGCCAGATATTCACAAGCTCACAGTCAGCCTTGAAGAGTCAAAGAACCTCTCACCAGCATGGACTGTCATAGTCACTTTCCAGCTCAGATTAAATTCTCCTAGTCAGCAAGCCAAGCGATCCATTCGGGTTCGGGTTAGAAATTTGGGTACTGACCTGCTTTGTTAGTTACCAAATTTCAGAGAGTTTGTTACAGGGGAATCTTGCGTGTGTTTTTGTACAGTCTCGCAAGGTTGATGATCGATTTATAAGGTCTCAATCTTCGAGTGGGAAAGAGAGTGAGTGAGAGAGAGTAATGTGTGTGTAATCACTGTGATTCTGAGTGATTGTGAGCAAGAGAGCTGAGGGATTGCTGAGGGTTATCTTTGGGTAGCAACTGGGTCTCTGTGATAGAGCGATTGCTGTAGAAGCCGTGGTGTGCTTGTT
CGAGAAGATAACGAATAATCCCTGTACTGAGCTGATGATTCTAGTGGATAGCCTCAAGTGGATGTAGATCACCTTTTTTTTGATCGAACCACTATAAGTTCTCTGTGTCCTTTTCTTCTCATCTCTTTACTCTGTTTTTATTGGGTTATGTTGTTACTTGTTCTTTGGATTGTTTCAATTGGGTCTGCTGCTTAATTTGTGGATTGATACATGTTGTATGGTAAATTTGCATAACAATTGGTATCAGAGCTAGAATTAAAATTGTTTGAATCCAGATTTGTTTTCGATTATGGCTGCTAAGTTTGAGGTAGAGCGTTTTGATGGTAGAGGTGATTTTTCCCTATGGAAAAAGAAAATGTGTGCCCTACTTGTTCAACAAAAAGTTGCTAAATTTTAGATGAATCAACTGACTGGACAACAGGCCTACCTGAATCTGAAATAAAAGAAACTAAGGAGATTGCCTTTAGCACCATAATCTTATACTTAGCTGATAATGTTCTTCGCCAAGTTCATGAAGCTAATACAGCTGAAGAAGTCTGGAAACAATTAGATAAAATTTACCTGACGAAATCCTTAACAAATAAGTTGTACATCAAAGAGAGATTCTTTGGTTTTAAAATGGATCCCAACAAAGACCTGGAACATAACCTTGATGAATATAATCGTATTGTGTTAGACCTTGCAAACATTGATGAAAAAATGTCAGATGAAAATAGGGCTATTATTTTGTTGAATTCACTCCCTGAATCCTACAATGAGGTGAAATCTGCAATAAGGTACGATAGAGACAGCCTTTCAATGGATATAGTGTTGAGTGCTCTTAGGTCCAGAGATTTAGAACTCAAGAAGGGGAAATCAAAAGAGAGTGAAGCTTTATTTACAAGGGGAAGAACAGAGAAGAAATCCTCTAGAAATAATAGCAGAAGTAGGTCTAAATCACAAGGGAACAACACTAAAAATGTTTTTACTGTCATAAAGAGGGACATATACGCAGAAACTGTTATGAATTAAAAAACAAAAGGAAAAATGAGTCAAATAAGGATGATGAACAACATAATGCTAATGTAACTGAAAATTACGAAACAGCAGAAGTCCTTACTGTGTTAGAAGGTAATTTTGACTCTGAATGGATCCTTGATTCTGGTTGTTCGTTCCATATGACACCTAATAAGCATTGGTTCCTGAATTTTGAAGAAATTGATGGTGGCAAAGTGCTACTAGGCAATCACCAAAATGTAAATTAAAGGAATAGGGACTGTCAAGTTAAAATGTTTGATAATCAAGTTAGAGAATTATCAAATGTTAGATATGTTCCTGACCTCAAGCGAAATCTCATTTCCTTAGGAGTTTTGACAAAGCTGGTTACCTTTGTAAACTTGAAAACAGAACATCAAAATTGTCAAAGGAGCTATTGTCAAAGCAAAAGGAATTTTACATAATGGTCTATATGTCCTAAGCGCGAATACAATGGTGGGGACAACTGCTGTTGCATCTGAAAGAGATCAAAAACAAACAAAGCTATGGCATGCTAGGCTTGGGCATATGAGTGAAAGGGGTTTGAGGGAACTGTCCAAACAGGGTCTGTTAGGTAAGGTTGTAGTTTCATCCATAGATGTTTGTGAACACTGTATATATGGCAAGTCCACGAGAGTATCCTTTGGTAGAGGGCAACATAACTCTAAGAAAATACTTGATTACGTTCATGCTGATTTGTGGGGTCCTGAGAAAACACCTACAATGGGAGGAGCAAGATATTTTTTAAGTATTGTTGATGATTACTCTAGAAAAGTGTGGACTTACCTATTGAAATCTAAAGATGAAACATATAAAACCTTTGTTCAGTGGAAGACCTTGTTGAGAAACAAACTGAAAGGAAGCTAAAATGTTTGAGAACTGATAACGGGTTGGAATTCTTAAGCAATGAATTTAAAGAATTTTGTAAATTAACAGAAGGTATAATTAGACATTTGACTGTGAGAGGCACACCACAACAAAATGGC
TTGGCTGAACGGATGAATAGAACTCTTCTTGAAAAAATAAGATGTTTGATGTCTAATGCGTGTTTGCCGAAAAAATTCTGGGGTGAAGCACTTATGACTGCAACCTACTTGGTCAATAGAAGTCCTTCAACAGCCATTGATTTTAAAACACCAATGGAGAAGTGGTCCAATCACCCTCCTGATTTAAGTAATCTAAGAACCTTTGGTTGCATTGCTTATGCACATTCTAAAGAAGGAAAATTAGATAATCGTGCCAAGAAATGTCTGTTTTTGGGGTATCAATCTGGTATAAAAGGTTATAGACTATGGTGCATTGAAAAAGGTGAAGAAAAATGCATAATTAGTAGGGATGTAACCTTTGATGAATCAGTAATTGCTTGGGAAAAGAATCAGAGTGAAACAGAAACAAATTCTGAAAAGAATAAATCTTTTGAAATGGAATTAGAGTTAGCATCCATACAAACACCAACTGAAAACCAACCTGCTGAAACTGATGTTCGTGTTGAAGAAGGTGCTGAAACACAAGCTGAAACACAAGCTGAAAACATTCCACCAGAACCTGATTCTTTGCAAAATTATAACCTAACCCGTGATAGACAAAGAAGAGAAATAAGAAGACCAGCAAGATATGCTAGTGCAGATATTGTTCACTATGCCTTATTCACTGAAATGAATTCAATTGATGAAGAACCTCTAACTTATCATGAAGCTATAAACTCCATAAACAGTGATAAGTGGAAAGAAGCAATGCAAGAAGAAATGAATTCTCTTCTGAAAAATAATACCTGGGAATTAGTAGATAGGCCATCAAACAAAATATTGGTTGGTTGTAAGTGGATTTATAAGGTAAAACAAAGTGTTGATCCTTCACAACCTAAAAGGTACAAAGCAAGGTTGGTTGCCAAGGGGTACACTCAAAAGGAAGGAGTGGATTATGGAGAGATTTTCTCCCCTGTAGTTAGGCATTCATCTATAAGAACCTTGCTGTCCCTTAACAATCTTTGA

mRNA sequence

ATGAAATTTTTGGGCGCGTTCCACCTCCTGCCCGTCTACGTTGATTTTGTTCCAACTTTTGATAATTCCCAAAAGGAACCTTCACTTCTACCAGCTCAGCTCCCAACTCTATTGTCGAACGGTTCCTCAGGGATTACGGTGATCCATCTCAGACGTGTAAGAGGCATTGGTGAATTAATTGTACCTTCCTACTTCTTTCACAAGCCAATTCCATCAATTTCCAAGAATGTCGTAGTAGAGGATGAACCAAAAGGTTGGAAAAAAAATGCACAACTCCAGAGACAATACAATGAAATGGAGAAGAAATTGCGTGAAGTGCAGTCACAAATGGAGAGTGGTAGACTAACACCAATGTCAGATCAAGGTAGTTGCCCACAAGTACCAGACCCTCAGCCTTCTGATGAAAAACCTGACCAGGTTGTTCATTGGAGGAGCACTGGTCCATTAGGTCCCACCGGTAGCTCATTTAGGGCGTTGAGGCAAAATTCTAAGGAAGGAATGAATATTCGAAGAAAGAAGATCCTCCAGTTAATCTTTAGAGTTTTTGCTCCCACAAGCTCTCAACGCAAAATCCTGTTCGAGAATATTGGTGCGATTGCTTGGTGGTGTTCAAGGGTGATTTTCAGAAAGATAAAGGTTCTTTTGGTTGCTGGATTTTTTGCAAAAATCAAAGGAAAAGGCGAAACGTTCAAGAATTTCTACAAGGATATTCACAAGCTCACAGTCAGCCTTGAAGAGTCAAAGAACCTCTCACCAGCATGGACTGTCATAGTCACTTTCCAGCTCAGATTAAATTCTCCTAGTCAGCAAGCCAAGCGATCCATTCGGCAACTGGGTCTCTGTGATAGAGCGATTGCTGTAGAAGCCGTGGTGTGCTTGTTCGAGAAGATAACGAATAATCCCTATTTGTTTTCGATTATGGCTGCTAAGTTTGAGGTAGAGCGTTTTGATGGTAGAGGCCTACCTGAATCTGAAATAAAAGAAACTAAGGAGATTGCCTTTAGCACCATAATCTTATACTTAGCTGATAATGTTCTTCGCCAAGTTCATGAAGCTAATACAGCTGAAGAAGTCTGGAAACAATTAGATAAAATTTACCTGACGAAATCCTTAACAAATAAGTTGTACATCAAAGAGAGATTCTTTGGTTTTAAAATGGATCCCAACAAAGACCTGGAACATAACCTTGATGAATATAATCGTATTGTGTTAGACCTTGCAAACATTGATGAAAAAATGTCAGATGAAAATAGGGCTATTATTTTGTTGAATTCACTCCCTGAATCCTACAATGAGGTGAAATCTGCAATAAGGTACGATAGAGACAGCCTTTCAATGGATATAGTGTTGAGTGCTCTTAGGTCCAGAGATTTAGAACTCAAGAAGGGGAAATCAAAAGAGAGTGAAGCTTTATTTACAAGGGGAAGAACAGAGAAGAAATCCTCTAGAAATAATAGCAGAACAGAAGTCCTTACTGTGTTAGAAGGTAATTTTGACTCTGAATGGATCCTTGATTCTGGTTGTTCGTTCCATATGACACCTAATAAGCATTGGTTCCTGAATTTTGAAGAAATTGATGGTGGCAAAGTGCTACTAGGCAATCACCAAAATAACATCAAAATTGTCAAAGGAGCTATTGTCAAAGCAAAAGGAATTTTACATAATGGTCTATATGTCCTAAGCGCGAATACAATGGTGGGGACAACTGCTGTTGCATCTGAAAGAGATCAAAAACAAACAAAGCTATGGCATGCTAGGCTTGGGCATATGAGTGAAAGGGGTTTGAGGGAACTGTCCAAACAGGGTCTGTTAGTGGAAGACCTTGTTGAGAAACAAACTGAAAGGAAGCTAAAATGTTTGAGAACTGATAACGGGTTGGAATTCTTAAGCAATGAATTTAAAGAATTTTGTAAATTAACAGAAGGTATAATTAGACATTTGACTGTGAGAGGCACACCACAACAAAATGGCTTGGCTGAACGGATGAATAGAACTCTTCT
TGAAAAAATAAGATGTTTGATGTCTAATGCGTGTTTGCCGAAAAAATTCTGGGGTGAAGCACTTATGACTGCAACCTACTTGGTCAATAGAAGTCCTTCAACAGCCATTGATTTTAAAACACCAATGGAGAAGTGGTCCAATCACCCTCCTGATTTAAGTAATCTAAGAACCTTTGGTTGCATTGCTTATGCACATTCTAAAGAAGGAAAATTAGATAATCGTGCCAAGAAATGTCTGTTTTTGGGGTATCAATCTGGTATAAAAGGTTATAGACTATGGTGCATTGAAAAAGGTGAAGAAAAATGCATAATTAGTAGGGATGTAACCTTTGATGAATCAGTAATTGCTTGGGAAAAGAATCAGAGTGAAACAGAAACAAATTCTGAAAAGAATAAATCTTTTGAAATGGAATTAGAGTTAGCATCCATACAAACACCAACTGAAAACCAACCTGCTGAAACTGATGTTCGTGTTGAAGAAGGTGCTGAAACACAAGCTGAAACACAAGCTGAAAACATTCCACCAGAACCTGATTCTTTGCAAAATTATAACCTAACCCGTGATAGACAAAGAAGAGAAATAAGAAGACCAGCAAGATATGCTAGTGCAGATATTGTTCACTATGCCTTATTCACTGAAATGAATTCAATTGATGAAGAACCTCTAACTTATCATGAAGCTATAAACTCCATAAACAGTGATAAGTGGAAAGAAGCAATGCAAGAAGAAATGAATTCTCTTCTGAAAAATAATACCTGGGAATTAGTAGATAGGCCATCAAACAAAATATTGGTTGGTTGTAAGTGGATTTATAAGGTAAAACAAAGTGTTGATCCTTCACAACCTAAAAGGTACAAAGCAAGGTTGGTTGCCAAGGGGTACACTCAAAAGGAAGGAGTGGATTATGGAGAGATTTTCTCCCCTGTAGTTAGGCATTCATCTATAAGAACCTTGCTGTCCCTTAACAATCTTTGA

Coding sequence (CDS)

ATGAAATTTTTGGGCGCGTTCCACCTCCTGCCCGTCTACGTTGATTTTGTTCCAACTTTTGATAATTCCCAAAAGGAACCTTCACTTCTACCAGCTCAGCTCCCAACTCTATTGTCGAACGGTTCCTCAGGGATTACGGTGATCCATCTCAGACGTGTAAGAGGCATTGGTGAATTAATTGTACCTTCCTACTTCTTTCACAAGCCAATTCCATCAATTTCCAAGAATGTCGTAGTAGAGGATGAACCAAAAGGTTGGAAAAAAAATGCACAACTCCAGAGACAATACAATGAAATGGAGAAGAAATTGCGTGAAGTGCAGTCACAAATGGAGAGTGGTAGACTAACACCAATGTCAGATCAAGGTAGTTGCCCACAAGTACCAGACCCTCAGCCTTCTGATGAAAAACCTGACCAGGTTGTTCATTGGAGGAGCACTGGTCCATTAGGTCCCACCGGTAGCTCATTTAGGGCGTTGAGGCAAAATTCTAAGGAAGGAATGAATATTCGAAGAAAGAAGATCCTCCAGTTAATCTTTAGAGTTTTTGCTCCCACAAGCTCTCAACGCAAAATCCTGTTCGAGAATATTGGTGCGATTGCTTGGTGGTGTTCAAGGGTGATTTTCAGAAAGATAAAGGTTCTTTTGGTTGCTGGATTTTTTGCAAAAATCAAAGGAAAAGGCGAAACGTTCAAGAATTTCTACAAGGATATTCACAAGCTCACAGTCAGCCTTGAAGAGTCAAAGAACCTCTCACCAGCATGGACTGTCATAGTCACTTTCCAGCTCAGATTAAATTCTCCTAGTCAGCAAGCCAAGCGATCCATTCGGCAACTGGGTCTCTGTGATAGAGCGATTGCTGTAGAAGCCGTGGTGTGCTTGTTCGAGAAGATAACGAATAATCCCTATTTGTTTTCGATTATGGCTGCTAAGTTTGAGGTAGAGCGTTTTGATGGTAGAGGCCTACCTGAATCTGAAATAAAAGAAACTAAGGAGATTGCCTTTAGCACCATAATCTTATACTTAGCTGATAATGTTCTTCGCCAAGTTCATGAAGCTAATACAGCTGAAGAAGTCTGGAAACAATTAGATAAAATTTACCTGACGAAATCCTTAACAAATAAGTTGTACATCAAAGAGAGATTCTTTGGTTTTAAAATGGATCCCAACAAAGACCTGGAACATAACCTTGATGAATATAATCGTATTGTGTTAGACCTTGCAAACATTGATGAAAAAATGTCAGATGAAAATAGGGCTATTATTTTGTTGAATTCACTCCCTGAATCCTACAATGAGGTGAAATCTGCAATAAGGTACGATAGAGACAGCCTTTCAATGGATATAGTGTTGAGTGCTCTTAGGTCCAGAGATTTAGAACTCAAGAAGGGGAAATCAAAAGAGAGTGAAGCTTTATTTACAAGGGGAAGAACAGAGAAGAAATCCTCTAGAAATAATAGCAGAACAGAAGTCCTTACTGTGTTAGAAGGTAATTTTGACTCTGAATGGATCCTTGATTCTGGTTGTTCGTTCCATATGACACCTAATAAGCATTGGTTCCTGAATTTTGAAGAAATTGATGGTGGCAAAGTGCTACTAGGCAATCACCAAAATAACATCAAAATTGTCAAAGGAGCTATTGTCAAAGCAAAAGGAATTTTACATAATGGTCTATATGTCCTAAGCGCGAATACAATGGTGGGGACAACTGCTGTTGCATCTGAAAGAGATCAAAAACAAACAAAGCTATGGCATGCTAGGCTTGGGCATATGAGTGAAAGGGGTTTGAGGGAACTGTCCAAACAGGGTCTGTTAGTGGAAGACCTTGTTGAGAAACAAACTGAAAGGAAGCTAAAATGTTTGAGAACTGATAACGGGTTGGAATTCTTAAGCAATGAATTTAAAGAATTTTGTAAATTAACAGAAGGTATAATTAGACATTTGACTGTGAGAGGCACACCACAACAAAATGGCTTGGCTGAACGGATGAATAGAACTCTTCT
TGAAAAAATAAGATGTTTGATGTCTAATGCGTGTTTGCCGAAAAAATTCTGGGGTGAAGCACTTATGACTGCAACCTACTTGGTCAATAGAAGTCCTTCAACAGCCATTGATTTTAAAACACCAATGGAGAAGTGGTCCAATCACCCTCCTGATTTAAGTAATCTAAGAACCTTTGGTTGCATTGCTTATGCACATTCTAAAGAAGGAAAATTAGATAATCGTGCCAAGAAATGTCTGTTTTTGGGGTATCAATCTGGTATAAAAGGTTATAGACTATGGTGCATTGAAAAAGGTGAAGAAAAATGCATAATTAGTAGGGATGTAACCTTTGATGAATCAGTAATTGCTTGGGAAAAGAATCAGAGTGAAACAGAAACAAATTCTGAAAAGAATAAATCTTTTGAAATGGAATTAGAGTTAGCATCCATACAAACACCAACTGAAAACCAACCTGCTGAAACTGATGTTCGTGTTGAAGAAGGTGCTGAAACACAAGCTGAAACACAAGCTGAAAACATTCCACCAGAACCTGATTCTTTGCAAAATTATAACCTAACCCGTGATAGACAAAGAAGAGAAATAAGAAGACCAGCAAGATATGCTAGTGCAGATATTGTTCACTATGCCTTATTCACTGAAATGAATTCAATTGATGAAGAACCTCTAACTTATCATGAAGCTATAAACTCCATAAACAGTGATAAGTGGAAAGAAGCAATGCAAGAAGAAATGAATTCTCTTCTGAAAAATAATACCTGGGAATTAGTAGATAGGCCATCAAACAAAATATTGGTTGGTTGTAAGTGGATTTATAAGGTAAAACAAAGTGTTGATCCTTCACAACCTAAAAGGTACAAAGCAAGGTTGGTTGCCAAGGGGTACACTCAAAAGGAAGGAGTGGATTATGGAGAGATTTTCTCCCCTGTAGTTAGGCATTCATCTATAAGAACCTTGCTGTCCCTTAACAATCTTTGA

Protein sequence

MKFLGAFHLLPVYVDFVPTFDNSQKEPSLLPAQLPTLLSNGSSGITVIHLRRVRGIGELIVPSYFFHKPIPSISKNVVVEDEPKGWKKNAQLQRQYNEMEKKLREVQSQMESGRLTPMSDQGSCPQVPDPQPSDEKPDQVVHWRSTGPLGPTGSSFRALRQNSKEGMNIRRKKILQLIFRVFAPTSSQRKILFENIGAIAWWCSRVIFRKIKVLLVAGFFAKIKGKGETFKNFYKDIHKLTVSLEESKNLSPAWTVIVTFQLRLNSPSQQAKRSIRQLGLCDRAIAVEAVVCLFEKITNNPYLFSIMAAKFEVERFDGRGLPESEIKETKEIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAIILLNSLPESYNEVKSAIRYDRDSLSMDIVLSALRSRDLELKKGKSKESEALFTRGRTEKKSSRNNSRTEVLTVLEGNFDSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNHQNNIKIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLLVEDLVEKQTERKLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLSNLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDESVIAWEKNQSETETNSEKNKSFEMELELASIQTPTENQPAETDVRVEEGAETQAETQAENIPPEPDSLQNYNLTRDRQRREIRRPARYASADIVHYALFTEMNSIDEEPLTYHEAINSINSDKWKEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSVDPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTLLSLNNL
Homology
BLAST of Lag0035208 vs. NCBI nr
Match: TYK25306.1 (putative gag-pol polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 638.6 bits (1646), Expect = 8.6e-179
Identity = 379/887 (42.73%), Positives = 499/887 (56.26%), Query Frame = 0
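
The percentages in an HSP summary line are plain ratios over the alignment length. As a minimal check, assuming the denominator (887) is the number of aligned columns including gaps, the reported values can be reproduced as follows; percent is a hypothetical helper name.

```python
def percent(count, aligned_length):
    """Return count / aligned_length as a percentage rounded to two decimals."""
    return round(100.0 * count / aligned_length, 2)


# Counts copied from the HSP summary line for this hit.
identity = percent(379, 887)
positives = percent(499, 887)
print(identity, positives)
```

The same helper applies to the summary lines of the other hits in this report.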

Query: 321 LPESEIKETKEIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKER 380
           + ESE ++  E+A+STI+LYL+D VLR V EA T  E+WK+L+ +YLTKSL NK+YIKE+
Sbjct: 46  ITESEKRDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKIYIKEK 105

Query: 381 FFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAIILLNSLPESYNEVKSAIRYD 440
           FFG+KMD +K LE NLDE+ +IV+DL NI EKMSDEN+A+ILLNSLPE+Y EVK+AI+Y 
Sbjct: 106 FFGYKMDQSKSLEENLDEFQKIVVDLNNIGEKMSDENQAVILLNSLPETYREVKAAIKYG 165

Query: 441 RDSLSMDIVLSALRSRDLELKKGKSKESEALFTRGRTEKKSSRNNSRT------------ 500
           RDSL+M IVL AL++R+LE+KK + K+ E L  RGR+EKKS +   R+            
Sbjct: 166 RDSLTMSIVLDALKTRNLEIKK-ERKDGELLMARGRSEKKSWKGKERSFRSKSKGKSRKC 225

Query: 501 ---------------------------------------------------EVLTVLEGN 560
                                                              EVL V   +
Sbjct: 226 FLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRD 285

Query: 561 FDSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNH---------------------- 620
               WI+DSGC+FHMTP++ +  NF+++DGGKVLLG++                      
Sbjct: 286 IQDAWIMDSGCTFHMTPHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRI 345

Query: 621 -----------------------------QNNI-KIVKGAIVKAKGILHNGLYVLSANTM 680
                                        +N + K+ KG++VK +G L +GLYVL   T+
Sbjct: 346 LTNVRYVPKLKRNLISLGELDRSGCTIKSENGVMKVTKGSLVKLRGTLRHGLYVLEGTTV 405

Query: 681 VGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL--------------------- 740
            G+ A+AS +    + LWH RL H+SERGL+ LS+QGLL                     
Sbjct: 406 SGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTR 465

Query: 741 ---------------------------------------VEDL----------------- 800
                                                  ++D                  
Sbjct: 466 VKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYFISIIDDFSRKVWIYPLKQKDEAFG 525

Query: 801 --------VEKQTERKLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAE 860
                   VE QT RK+K LRTDNGLEF++N+F +FCK +EGI RH TV  TPQQNGLAE
Sbjct: 526 KFLEWKKQVENQTGRKVKYLRTDNGLEFVNNKFNQFCK-SEGITRHFTVTYTPQQNGLAE 585

Query: 861 RMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLS 920
           R NRT++E+ RCL++NA LP KFWGEA  TA YL+NRSPSTA++ KTP E W+   P L 
Sbjct: 586 RFNRTIMERTRCLLTNASLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLE 645

Query: 921 NLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDES 980
           +LR FGC AYAH K+GKL+ RA KC+F+GY  G+KGY+LWCIEKG  KCIISRDVTF+E+
Sbjct: 646 HLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTFNET 705

Query: 981 VIAW-----EKNQ------SETETNSEKNKSFEMELELASIQTPTENQPAETD------- 989
            + +     +K Q      +E    SE   S +++ +   +    + Q +E D       
Sbjct: 706 EMPYCVKEQQKQQTGDHVVTEVRIASEVRPSIDLDNQPPLVSEIEDTQQSEFDGIQSQQE 765

BLAST of Lag0035208 vs. NCBI nr
Match: KAA0050719.1 (putative gag-pol polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 638.6 bits (1646), Expect = 8.6e-179
Identity = 379/887 (42.73%), Positives = 499/887 (56.26%), Query Frame = 0

Query: 321 LPESEIKETKEIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKER 380
           + ESE ++  E+A+STI+LYL+D VLR V EA T  E+WK+L+ +YLTKSL NK+YIKE+
Sbjct: 46  ITESEKRDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLLNKIYIKEK 105

Query: 381 FFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAIILLNSLPESYNEVKSAIRYD 440
           FFG+KMD +K LE NLDE+ +IV+DL NI EKMSDEN+A+ILLNSLPE+Y EVK+AI+Y 
Sbjct: 106 FFGYKMDQSKSLEENLDEFQKIVVDLNNIGEKMSDENQAVILLNSLPETYREVKAAIKYG 165

Query: 441 RDSLSMDIVLSALRSRDLELKKGKSKESEALFTRGRTEKKSSRNNSRT------------ 500
           RDSL+M IVL AL++R+LE+KK + K+ E L  RGR+EKKS +   R+            
Sbjct: 166 RDSLTMSIVLDALKTRNLEIKK-ERKDGELLMARGRSEKKSWKGKERSFRSKSKGKSRKC 225

Query: 501 ---------------------------------------------------EVLTVLEGN 560
                                                              EVL V   +
Sbjct: 226 FLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRD 285

Query: 561 FDSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNH---------------------- 620
               WI+DSGC+FHMTP++ +  NF+++DGGKVLLG++                      
Sbjct: 286 IQDAWIMDSGCTFHMTPHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRI 345

Query: 621 -----------------------------QNNI-KIVKGAIVKAKGILHNGLYVLSANTM 680
                                        +N + K+ KG++VK +G L +GLYVL   T+
Sbjct: 346 LTNVRYVPKLKRNLISLGELDRSGCTIKSENGVMKVTKGSLVKLRGTLRHGLYVLEGTTV 405

Query: 681 VGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL--------------------- 740
            G+ A+AS +    + LWH RL H+SERGL+ LS+QGLL                     
Sbjct: 406 SGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTR 465

Query: 741 ---------------------------------------VEDL----------------- 800
                                                  ++D                  
Sbjct: 466 VKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYFISIIDDFSRKVWIYPLKQKDEAFG 525

Query: 801 --------VEKQTERKLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAE 860
                   VE QT RK+K LRTDNGLEF++N+F +FCK +EGI RH TV  TPQQNGLAE
Sbjct: 526 KFLEWKKQVENQTGRKVKYLRTDNGLEFVNNKFNQFCK-SEGITRHFTVTYTPQQNGLAE 585

Query: 861 RMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLS 920
           R NRT++E+ RCL++NA LP KFWGEA  TA YL+NRSPSTA++ KTP E W+   P L 
Sbjct: 586 RFNRTIMERTRCLLTNASLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLE 645

Query: 921 NLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDES 980
           +LR FGC AYAH K+GKL+ RA KC+F+GY  G+KGY+LWCIEKG  KCIISRDVTF+E+
Sbjct: 646 HLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTFNET 705

Query: 981 VIAW-----EKNQ------SETETNSEKNKSFEMELELASIQTPTENQPAETD------- 989
            + +     +K Q      +E    SE   S +++ +   +    + Q +E D       
Sbjct: 706 EMPYCVKEQQKQQTGDHVVTEVRIASEVRPSIDLDNQPPLVSEIEDTQQSEFDGIQSQQE 765

BLAST of Lag0035208 vs. NCBI nr
Match: KAA0047995.1 (retrotransposon protein, putative, Ty1-copia sub-class [Cucumis melo var. makuwa])

HSP 1 Score: 576.6 bits (1485), Expect = 4.0e-160
Identity = 350/854 (40.98%), Positives = 467/854 (54.68%), Query Frame = 0

Query: 321 LPESEIKETKEIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKER 380
           + ESE ++  E+A+ TI+LYL+D VLR V EA T  E+WK+L+ +YLTKSL NK+YIKE+
Sbjct: 46  ITESEKRDMDEMAYWTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKIYIKEK 105

Query: 381 FFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAIILLNSLPESYNEVKSAIRYD 440
           FFG+KMD +K LE NLDE+ +IV+DL NI EKMSDEN+A+ILLNSLPE+Y EVK+AI+Y 
Sbjct: 106 FFGYKMDQSKILEENLDEFQKIVVDLNNIGEKMSDENQAVILLNSLPETYREVKAAIKYG 165

Query: 441 RDSLSMDIVLSALRSRDLELKKGKSKESEALFTRGRTEKKSSRNNSRT------------ 500
            DSL+M IVL AL++R+LE+KK + K+ E L  RGR+EKKS +   R+            
Sbjct: 166 WDSLTMSIVLDALKTRNLEIKK-ERKDGELLMARGRSEKKSWKGKERSFRSKSKGKSRKC 225

Query: 501 ---------------------------------------------------EVLTVLEGN 560
                                                              EVL V   +
Sbjct: 226 FLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGCDSAETGYESAEVLMVSHRD 285

Query: 561 FDSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNH---------------------- 620
               WI+DSGC+FHMTP++ +  NF+++DGGKVLLG++                      
Sbjct: 286 IQDAWIMDSGCTFHMTPHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRI 345

Query: 621 -----------------------------QNNI-KIVKGAIVKAKGILHNGLYVLSANTM 680
                                        +N + K+ KG++VK +G L +GLYVL   T+
Sbjct: 346 LTNVRYVPKLKRNLISLGELDRSGCTIKSENGVMKVTKGSLVKLRGTLRHGLYVLEGTTV 405

Query: 681 VGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL--------------------- 740
            G+ A+AS +    + LWH RL H+SERGL+ LS+QGLL                     
Sbjct: 406 SGSAAIASGKVTNMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTR 465

Query: 741 ---------------------------------------VEDL----------------- 800
                                                  ++D                  
Sbjct: 466 VKFGKGKHTTKGILDYIHSDLWGPTKEVSMGGSRYFISIIDDFSRKVWIYPLKQKDEAFG 525

Query: 801 --------VEKQTERKLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAE 860
                   VE QT RK+K LRTDNGLEF++N+F +FCK +EGI RH TV  TPQQNGLAE
Sbjct: 526 KFLEWKKQVENQTGRKVKYLRTDNGLEFVNNKFNQFCK-SEGITRHFTVTYTPQQNGLAE 585

Query: 861 RMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLS 920
           R NRT++E+ RCL++NA LP KFWGEA  TA YL+NRSPSTA++ KTP E W+   P L 
Sbjct: 586 RFNRTIMERTRCLLTNASLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLE 645

Query: 921 NLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDES 958
           +LR FGC AYAH K+GKL+ RA KC+F+GY  G+KGY+LWCIEKG  KCIISRDVTF+E+
Sbjct: 646 HLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTFNET 705

BLAST of Lag0035208 vs. NCBI nr
Match: PKU72844.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatum])

HSP 1 Score: 544.3 bits (1401), Expect = 2.2e-150
Identity = 324/844 (38.39%), Positives = 467/844 (55.33%), Query Frame = 0

Query: 321 LPESEIKET---------KEIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSL 380
           LPESE+  T         ++ AFS+IIL LAD VLR+V    T  E+WK+L+++Y  K+L
Sbjct: 17  LPESELPSTMSDQEKLSIQKKAFSSIILCLADQVLRKVSHVKTVSELWKKLEELYRQKTL 76

Query: 381 TNKLYIKERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAIILLNSLPESYN 440
            N++Y+KE+FFG+KMD  K ++ NLDE+N+++LDL N++ K+ DE++AIILLNSLP+S  
Sbjct: 77  PNRIYLKEKFFGYKMDEAKSIDDNLDEFNKLILDLENLEVKIEDEDKAIILLNSLPKSLR 136

Query: 441 EVKSAIRYDRDSLSMDIVLSALRSRDLELK-KGKSKESEALFTRGRTEKKSS-----RNN 500
             K  ++Y R+++++D V +AL S+ L++K   K+   E L  RGR++K+ +     ++ 
Sbjct: 137 NFKETLKYGRETITVDEVQNALSSKILDMKISEKNHSGEGLHVRGRSQKRGTSQKKWKSK 196

Query: 501 SRTEVLT--------------------------------------VLEGNFDSEWIL--- 560
           SR++  +                                      ++  N+DS  +L   
Sbjct: 197 SRSKSASKKDYKNVKCWQCNKTGHIRRFCPEKNPKDKSQSQGDAAIVGENYDSADVLNVS 256

Query: 561 ------DSGCSF----------------------HMTPNKHWFLNFEEIDGGKVLLGNHQ 620
                 +  C                        H+   K   ++   +D    +  + +
Sbjct: 257 DLLLGNNKACDVVGIGSIAVKMHDGHVRILKDVRHVPDLKRNLISLGTLDDSGYIFRSER 316

Query: 621 NNIKIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLR 680
             ++I KGA+V  KGI  NGLYVL   T+VG T V ++++  +TKLWH RLGH+S+RGL 
Sbjct: 317 GLLRISKGALVIMKGIKRNGLYVLQGATLVGETHVTAKQNLDKTKLWHQRLGHLSDRGLI 376

Query: 681 ELSKQGLLVED------------------------------------------------- 740
           EL KQGL   D                                                 
Sbjct: 377 ELQKQGLFGNDSIAKIDFCESCIIGKSHRLSFKLSTHRAEGILDYIHSDLWGPARVATHG 436

Query: 741 ------------------------------------LVEKQTERKLKCLRTDNGLEFLSN 800
                                               +VE Q  RKLK LRTDNGLEF + 
Sbjct: 437 GNRYFLSFIDDYSRKVWIFLLKSKDETFSKFLEWKSMVENQKNRKLKVLRTDNGLEFCNE 496

Query: 801 EFKEFCKLTEGIIRHLTVRGTPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTA 860
            F +FC    GI+RH TV  TPQQNGLAERMNRTLL+++RCL+ ++ L K FWGEAL TA
Sbjct: 497 SFNKFCS-DSGIVRHKTVSHTPQQNGLAERMNRTLLDRVRCLLFSSGLSKFFWGEALSTA 556

Query: 861 TYLVNRSPSTAIDFKTPMEKWSNHPPDLSNLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQ 920
            YLVNR+PS+AI+FKTP E W   PP L++LR FGC+AY H  +GKL+ R+ KC+FLGY 
Sbjct: 557 CYLVNRTPSSAINFKTPQELWKGKPPSLTHLRVFGCLAYPHQNKGKLEPRSIKCVFLGYP 616

Query: 921 SGIKGYRLWCIEKGEEKCIISRDVTFDESVIAWEKNQSETET-----NSEKNKS-FEMEL 980
           +G+KGYRLW +     K IISRDV F+E+ +   +++++  T     NS+ NK  +E E+
Sbjct: 617 TGVKGYRLWDLSSPGVKTIISRDVIFNENRLYISESENKDTTISSIENSDSNKDYYEFEV 676

Query: 981 ELASIQTPTENQPAETDVRVEEGAETQAETQAENIP-PEPDSLQNYNLTRDRQRREIRRP 989
           E  S     EN               Q  T+ EN P P  ++  +Y L+RDR+RR I+ P
Sbjct: 677 EPPSNTLSFEN--------------PQQSTEPENEPEPVSENNHDYLLSRDRERRNIKPP 736

BLAST of Lag0035208 vs. NCBI nr
Match: RVW99173.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 541.2 bits (1393), Expect = 1.9e-149
Identity = 331/824 (40.17%), Positives = 455/824 (55.22%), Query Frame = 0

Query: 309 AKFEVERFDGR-----------------GLPE---------SEIKETKEI-----AFSTI 368
           AKF+VERF G+                 GL +         S ++E K+I     A S I
Sbjct: 4   AKFDVERFTGKNDFGLWRLKMRALLVQQGLHDALLGEKNLPSTMQEKKKIKLLEKAHSAI 63

Query: 369 ILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKERFFGFKMDPNKDLEHNLD 428
           IL L D VLR+V +A +  EVW +L+ +Y+TKSL N+L+ K + + FKM P   +E +LD
Sbjct: 64  ILSLGDTVLREVAKAESTAEVWLKLESLYMTKSLANRLHKKIKLYTFKMTPGMSIEEHLD 123

Query: 429 EYNRIVLDLANIDEKMSDENRAIILLNSLPESYNEVKSAIRYDRDSLSMDIVLSALRSRD 488
            +N+I+LDL NID  +SDE++AI+LL SL   Y  +K AI Y RDSL+ D   ++     
Sbjct: 124 HFNKIILDLENIDITISDEDKAILLLTSLDAFYTNMKDAIMYGRDSLTFDEGKNSKSRSK 183

Query: 489 LELKKGK----SKE----SEALFTRGRTEKKSSR---------NNSRTEVLTVLEGNFDS 548
            + KK K     KE     +    R  T KK+                EVL V E +   
Sbjct: 184 SKTKKFKCFICHKEGHFKKDCPDRRQNTIKKTVNECDAAVILDGYDSAEVLNVAEVDSGK 243

Query: 549 EWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNHQ------------------------ 608
           EWILDSGCSFHM P K WF +F+E DGG VLLGN++                        
Sbjct: 244 EWILDSGCSFHMCPIKAWFEDFKEADGGYVLLGNNKHCKILGTGTVRIKHYDGIERVLED 303

Query: 609 ----------------------------NNIKIVKGAIVKAKGILHNGLYVLSANTMVGT 668
                                       N++++ +G++   K  + NGLY L   T++  
Sbjct: 304 VRYIPELKRNLISLGMLDKSGYTFKSEPNSLRVARGSLTVMKETIKNGLYTLIGQTVIDK 363

Query: 669 TAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL------------------------ 728
            +   + D   TKLWH RLGHMS +GL+EL KQG+L                        
Sbjct: 364 ASTVLKEDVGTTKLWHQRLGHMSHKGLQELEKQGVLGNYKLTDLPFCEHCVFGKATRVKF 423

Query: 729 VEDLVEKQTE---------------------RKLKCLRTDNGLEFLSNEFKEFCKLTEGI 788
            + + E Q +                     RK+K LRTDNGLEFLSN+F  FC+  EGI
Sbjct: 424 AKAIHETQNQLDYIHSDLWGPSRVPSIGGASRKVKKLRTDNGLEFLSNDFNSFCQ-KEGI 483

Query: 789 IRHLTVRGTPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAI 848
             H TVR TPQQNGLAERMNRT+LE++RC++S++ L K FW EA  T  +L+NRSPS+A+
Sbjct: 484 ATHRTVRYTPQQNGLAERMNRTILERMRCMLSSSGLSKVFWAEAADTVVHLINRSPSSAL 543

Query: 849 DFKTPMEKWSNHPPDLSNLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIE 908
            FKTP EKW+    +  +L+ FGC AY H+K  KL+ RA KC+FLGY  G+KGY+LW   
Sbjct: 544 QFKTPQEKWTGKATNYQHLKVFGCTAYVHTKTDKLEPRAVKCIFLGYPKGVKGYKLWIET 603

Query: 909 KGEEKCIISRDVTFDESVIAWEKNQSETETNSEKNKSFEMELELASIQTPTENQPAETDV 968
           +G+  CIISRDVTF+E  ++ +    + E + +    FE+E E          QP ++  
Sbjct: 604 QGKGNCIISRDVTFNEQDMSKQTPAKDVEGSDQ--LQFEVEHETL--------QPEKSKE 663

Query: 969 RVEEGAETQAETQAENIPPEPDSLQNYNLTRDRQRREIRRPARYASADIVHYALFTEMNS 988
              + A+ +   + +N P +   L++YNL RDRQ+R+++ P RY   ++  +AL      
Sbjct: 664 TSSKTAQEEIVHERQNEPTQ--GLKSYNLVRDRQKRQVKPPKRYGQVEMTTFALSVAEEI 723

BLAST of Lag0035208 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 344.0 bits (881), Expect = 5.8e-93
Identity = 267/878 (30.41%), Positives = 407/878 (46.36%), Query Frame = 0

Query: 331 EIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKERFFGFKMDPNK 390
           E A S I L+L+D+V+  + + +TA  +W +L+ +Y++K+LTNKLY+K++ +   M    
Sbjct: 57  ERAASAIRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGT 116

Query: 391 DLEHNLDEYNRIVLDLANIDEKMSDENRAIILLNSLPESYNEVKSAIRYDRDSLSMDIVL 450
           +   +L+ +N ++  LAN+  K+ +E++AI+LLNSLP SY+ + + I + + ++ +  V 
Sbjct: 117 NFLSHLNVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKTTIELKDVT 176

Query: 451 SALRSRDLELKKGKSKESEALFT--RGRTEKKS------------SRNNSRTEVLTVLE- 510
           SAL   + +++K    + +AL T  RGR+ ++S            S+N S++ V      
Sbjct: 177 SALLLNE-KMRKKPENQGQALITEGRGRSYQRSSNNYGRSGARGKSKNRSKSRVRNCYNC 236

Query: 511 ---GNF-------------------------------------------------DSEWI 570
              G+F                                                 +SEW+
Sbjct: 237 NQPGHFKRDCPNPRKGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSGPESEWV 296

Query: 571 LDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNH-------------QNNI----------- 630
           +D+  S H TP +  F  +   D G V +GN              + N+           
Sbjct: 297 VDTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRH 356

Query: 631 ----------------------------KIVKGAIVKAKGILHNGLYVLSANTMVGTTAV 690
                                       ++ KG++V AKG+    LY  +A    G    
Sbjct: 357 VPDLRMNLISGIALDRDGYESYFANQKWRLTKGSLVIAKGVARGTLYRTNAEICQGELNA 416

Query: 691 ASERDQKQTKLWHARLGHMSERGLRELSKQGLL--------------------------- 750
           A  +D+    LWH R+GHMSE+GL+ L+K+ L+                           
Sbjct: 417 A--QDEISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTS 476

Query: 751 ---------------------------------VED------------------------ 810
                                            ++D                        
Sbjct: 477 SERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFH 536

Query: 811 -LVEKQTERKLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAERMNRTL 870
            LVE++T RKLK LR+DNG E+ S EF+E+C  + GI    TV GTPQ NG+AERMNRT+
Sbjct: 537 ALVERETGRKLKRLRSDNGGEYTSREFEEYCS-SHGIRHEKTVPGTPQHNGVAERMNRTI 596

Query: 871 LEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLSNLRTFG 930
           +EK+R ++  A LPK FWGEA+ TA YL+NRSPS  + F+ P   W+N     S+L+ FG
Sbjct: 597 VEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFG 656

Query: 931 CIAYAH---SKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDESVIA 989
           C A+AH    +  KLD+++  C+F+GY     GYRLW  +  ++K I SRDV F ES + 
Sbjct: 657 CRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLW--DPVKKKVIRSRDVVFRESEVR 716

BLAST of Lag0035208 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 216.5 bits (550), Expect = 1.4e-54
Identity = 155/469 (33.05%), Positives = 239/469 (50.96%), Query Frame = 0

Query: 604 LVEDLVEKQTER---KLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAE 663
           + +D V K       K+  L  DNG E+LSNE ++FC + +GI  HLTV  TPQ NG++E
Sbjct: 528 MFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFC-VKKGISYHLTVPHTPQLNGVSE 587

Query: 664 RMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAI--DFKTPMEKWSNHPPD 723
           RM RT+ EK R ++S A L K FWGEA++TATYL+NR PS A+    KTP E W N  P 
Sbjct: 588 RMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPY 647

Query: 724 LSNLRTFGCIAYAH--SKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVT 783
           L +LR FG   Y H  +K+GK D+++ K +F+GY+    G++LW  +   EK I++RDV 
Sbjct: 648 LKHLRVFGATVYVHIKNKQGKFDDKSFKSIFVGYEP--NGFKLW--DAVNEKFIVARDVV 707

Query: 784 FDE-SVIAWEKNQSET----ETNSEKNKSFEME-LELASIQTPTENQPAETDVRVEEGAE 843
            DE +++     + ET    ++   +NK+F  +  ++   + P E++  +    +++  E
Sbjct: 708 VDETNMVNSRAVKFETVFLKDSKESENKNFPNDSRKIIQTEFPNESKECDNIQFLKDSKE 767

Query: 844 TQ-----------AETQAENIPPEPDSLQ------------------------------- 903
           ++            +T+  N   E D++Q                               
Sbjct: 768 SENKNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESNKYFLNESKKRKRDDHLNESKGS 827

Query: 904 -NYNLTRDRQRRE------IRRPARYASADIVH-----------YALFTEMNSIDEEPLT 963
            N N +R+ +  E      I  P +    +I++            +   E NS+++  L 
Sbjct: 828 GNPNESRESETAEHLKEIGIDNPTKNDGIEIINRRSERLKTKPQISYNEEDNSLNKVVLN 887

Query: 964 YHEAINSI-----------NSDKWKEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYK 989
            H   N +           +   W+EA+  E+N+   NNTW +  RP NK +V  +W++ 
Sbjct: 888 AHTIFNDVPNSFDEIQYRDDKSSWEEAINTELNAHKINNTWTITKRPENKNIVDSRWVFS 947

BLAST of Lag0035208 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 154.1 bits (388), Expect = 8.5e-36
Identity = 131/490 (26.73%), Positives = 214/490 (43.67%), Query Frame = 0

Query: 606  EDLVEKQTERKLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLT-VRGTPQQNGLAERMNR 665
            ++L+E + + ++    +DNG EF++    E+   ++  I HLT    TP+ NGL+ER +R
Sbjct: 575  KNLLENRFQTRIGTFYSDNGGEFVA--LWEY--FSQHGISHLTSPPHTPEHNGLSERKHR 634

Query: 666  TLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLSNLRT 725
             ++E    L+S+A +PK +W  A   A YL+NR P+  +  ++P +K     P+   LR 
Sbjct: 635  HIVETGLTLLSHASIPKTYWPYAFAVAVYLINRLPTPLLQLESPFQKLFGTSPNYDKLRV 694

Query: 726  FGCIAYAHSK---EGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFD--- 785
            FGC  Y   +   + KLD+++++C+FLGY      Y   C+     +  ISR V FD   
Sbjct: 695  FGCACYPWLRPYNQHKLDDKSRQCVFLGYSLTQSAY--LCLHLQTSRLYISRHVRFDENC 754

Query: 786  ------------------ESVIAW------------------------------------ 845
                              ES   W                                    
Sbjct: 755  FPFSNYLATLSPVQEQRRESSCVWSPHTTLPTRTPVLPAPSCSDPHHAATPPSSPSAPFR 814

Query: 846  -----------------------------------EKNQSETETNSEKNKS-----FEME 905
                                               +  Q++T+T+S +N S      E  
Sbjct: 815  NSQVSSSNLDSSFSSSFPSSPEPTAPRQNGPQPTTQPTQTQTQTHSSQNTSQNNPTNESP 874

Query: 906  LELA-SIQTPTENQPAETDVRVEEGAETQAETQAENIPPEPDSLQNYNLTRDRQRREIRR 965
             +LA S+ TP ++  +         + + + T    +   P  L       ++       
Sbjct: 875  SQLAQSLSTPAQSSSSSPSPTTSASSSSTSPTPPSILIHPPPPLAQIVNNNNQAPLNTHS 934

Query: 966  PARYASADIV----HYALFTEMNSIDEEPLTYHEAINSINSDKWKEAMQEEMNSLLKNNT 989
                A A I+     Y+L   + + + EP T   AI ++  ++W+ AM  E+N+ + N+T
Sbjct: 935  MGTRAKAGIIKPNPKYSLAVSL-AAESEPRT---AIQALKDERWRNAMGSEINAQIGNHT 994

BLAST of Lag0035208 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 150.6 bits (379), Expect = 9.3e-35
Identity = 129/503 (25.65%), Positives = 206/503 (40.95%), Query Frame = 0

Query: 603  LLVEDLVEKQTERKLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAERM 662
            ++ + LVE + + ++  L +DNG EF+    +++     GI    +   TP+ NGL+ER 
Sbjct: 551  IIFKSLVENRFQTRIGTLYSDNGGEFV--VLRDYLS-QHGISHFTSPPHTPEHNGLSERK 610

Query: 663  NRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLSNL 722
            +R ++E    L+S+A +PK +W  A   A YL+NR P+  +  ++P +K    PP+   L
Sbjct: 611  HRHIVEMGLTLLSHASVPKTYWPYAFSVAVYLINRLPTPLLQLQSPFQKLFGQPPNYEKL 670

Query: 723  RTFGCIAYAHSK---EGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDE 782
            + FGC  Y   +     KL++++K+C F+GY      Y   C+     +   SR V FDE
Sbjct: 671  KVFGCACYPWLRPYNRHKLEDKSKQCAFMGYSLTQSAY--LCLHIPTGRLYTSRHVQFDE 730

Query: 783  SVIAWEKNQSETETNSEKNKS--------------------------------------- 842
                +        T+ E+                                          
Sbjct: 731  RCFPFSTTNFGVSTSQEQRSDSAPNWPSHTTLPTTPLVLPAPPCLGPHLDTSPRPPSSPS 790

Query: 843  -------FEMELELASIQTPTENQPAETDVRVEEGAETQAETQAEN-------------- 902
                       L  +SI +P+ ++P        +      +TQ  N              
Sbjct: 791  PLCTTQVSSSNLPSSSISSPSSSEPTAPSHNGPQPTAQPHQTQNSNSNSPILNNPNPNSP 850

Query: 903  ----------------------------------------------IPPEPDSLQ----- 962
                                                          + P P  +Q     
Sbjct: 851  SPNSPNQNSPLPQSPISSPHIPTPSTSISEPNSPSSSSTSTPPLPPVLPAPPIIQVNAQA 910

Query: 963  --NYNLTRDRQRREIRRPARYASADIVHYALFTEMNSIDEEPLTYHEAINSINSDKWKEA 989
              N +    R +  IR+P +  S     YA     NS   EP T   AI ++  D+W++A
Sbjct: 911  PVNTHSMATRAKDGIRKPNQKYS-----YATSLAANS---EPRT---AIQAMKDDRWRQA 970

BLAST of Lag0035208 vs. ExPASy Swiss-Prot
Match: P92512 (Uncharacterized mitochondrial protein AtMg00710 OS=Arabidopsis thaliana OX=3702 GN=AtMg00710 PE=4 SV=1)

HSP 1 Score: 90.9 bits (224), Expect = 8.8e-17
Identity = 42/83 (50.60%), Positives = 54/83 (65.06%), Query Frame = 0

Query: 662 MNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLSN 721
           MNRT++EK+R ++    LPK F  +A  TA +++N+ PSTAI+F  P E W    P  S 
Sbjct: 1   MNRTIIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYSY 60

Query: 722 LRTFGCIAYAHSKEGKLDNRAKK 745
           LR FGC+AY H  EGKL  RAKK
Sbjct: 61  LRRFGCVAYIHCDEGKLKPRAKK 83

BLAST of Lag0035208 vs. ExPASy TrEMBL
Match: A0A5D3DNU1 (Putative gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G004440 PE=4 SV=1)

HSP 1 Score: 638.6 bits (1646), Expect = 4.2e-179
Identity = 379/887 (42.73%), Positives = 499/887 (56.26%), Query Frame = 0

Query: 321 LPESEIKETKEIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKER 380
           + ESE ++  E+A+STI+LYL+D VLR V EA T  E+WK+L+ +YLTKSL NK+YIKE+
Sbjct: 46  ITESEKRDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKIYIKEK 105

Query: 381 FFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAIILLNSLPESYNEVKSAIRYD 440
           FFG+KMD +K LE NLDE+ +IV+DL NI EKMSDEN+A+ILLNSLPE+Y EVK+AI+Y 
Sbjct: 106 FFGYKMDQSKSLEENLDEFQKIVVDLNNIGEKMSDENQAVILLNSLPETYREVKAAIKYG 165

Query: 441 RDSLSMDIVLSALRSRDLELKKGKSKESEALFTRGRTEKKSSRNNSRT------------ 500
           RDSL+M IVL AL++R+LE+KK + K+ E L  RGR+EKKS +   R+            
Sbjct: 166 RDSLTMSIVLDALKTRNLEIKK-ERKDGELLMARGRSEKKSWKGKERSFRSKSKGKSRKC 225

Query: 501 ---------------------------------------------------EVLTVLEGN 560
                                                              EVL V   +
Sbjct: 226 FLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRD 285

Query: 561 FDSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNH---------------------- 620
               WI+DSGC+FHMTP++ +  NF+++DGGKVLLG++                      
Sbjct: 286 IQDAWIMDSGCTFHMTPHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRI 345

Query: 621 -----------------------------QNNI-KIVKGAIVKAKGILHNGLYVLSANTM 680
                                        +N + K+ KG++VK +G L +GLYVL   T+
Sbjct: 346 LTNVRYVPKLKRNLISLGELDRSGCTIKSENGVMKVTKGSLVKLRGTLRHGLYVLEGTTV 405

Query: 681 VGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL--------------------- 740
            G+ A+AS +    + LWH RL H+SERGL+ LS+QGLL                     
Sbjct: 406 SGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTR 465

Query: 741 ---------------------------------------VEDL----------------- 800
                                                  ++D                  
Sbjct: 466 VKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYFISIIDDFSRKVWIYPLKQKDEAFG 525

Query: 801 --------VEKQTERKLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAE 860
                   VE QT RK+K LRTDNGLEF++N+F +FCK +EGI RH TV  TPQQNGLAE
Sbjct: 526 KFLEWKKQVENQTGRKVKYLRTDNGLEFVNNKFNQFCK-SEGITRHFTVTYTPQQNGLAE 585

Query: 861 RMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLS 920
           R NRT++E+ RCL++NA LP KFWGEA  TA YL+NRSPSTA++ KTP E W+   P L 
Sbjct: 586 RFNRTIMERTRCLLTNASLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLE 645

Query: 921 NLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDES 980
           +LR FGC AYAH K+GKL+ RA KC+F+GY  G+KGY+LWCIEKG  KCIISRDVTF+E+
Sbjct: 646 HLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTFNET 705

Query: 981 VIAW-----EKNQ------SETETNSEKNKSFEMELELASIQTPTENQPAETD------- 989
            + +     +K Q      +E    SE   S +++ +   +    + Q +E D       
Sbjct: 706 EMPYCVKEQQKQQTGDHVVTEVRIASEVRPSIDLDNQPPLVSEIEDTQQSEFDGIQSQQE 765

BLAST of Lag0035208 vs. ExPASy TrEMBL
Match: A0A5A7UB25 (Putative gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold560G00190 PE=4 SV=1)

HSP 1 Score: 638.6 bits (1646), Expect = 4.2e-179
Identity = 379/887 (42.73%), Positives = 499/887 (56.26%), Query Frame = 0

Query: 321 LPESEIKETKEIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKER 380
           + ESE ++  E+A+STI+LYL+D VLR V EA T  E+WK+L+ +YLTKSL NK+YIKE+
Sbjct: 46  ITESEKRDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLLNKIYIKEK 105

Query: 381 FFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAIILLNSLPESYNEVKSAIRYD 440
           FFG+KMD +K LE NLDE+ +IV+DL NI EKMSDEN+A+ILLNSLPE+Y EVK+AI+Y 
Sbjct: 106 FFGYKMDQSKSLEENLDEFQKIVVDLNNIGEKMSDENQAVILLNSLPETYREVKAAIKYG 165

Query: 441 RDSLSMDIVLSALRSRDLELKKGKSKESEALFTRGRTEKKSSRNNSRT------------ 500
           RDSL+M IVL AL++R+LE+KK + K+ E L  RGR+EKKS +   R+            
Sbjct: 166 RDSLTMSIVLDALKTRNLEIKK-ERKDGELLMARGRSEKKSWKGKERSFRSKSKGKSRKC 225

Query: 501 ---------------------------------------------------EVLTVLEGN 560
                                                              EVL V   +
Sbjct: 226 FLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRD 285

Query: 561 FDSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNH---------------------- 620
               WI+DSGC+FHMTP++ +  NF+++DGGKVLLG++                      
Sbjct: 286 IQDAWIMDSGCTFHMTPHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRI 345

Query: 621 -----------------------------QNNI-KIVKGAIVKAKGILHNGLYVLSANTM 680
                                        +N + K+ KG++VK +G L +GLYVL   T+
Sbjct: 346 LTNVRYVPKLKRNLISLGELDRSGCTIKSENGVMKVTKGSLVKLRGTLRHGLYVLEGTTV 405

Query: 681 VGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL--------------------- 740
            G+ A+AS +    + LWH RL H+SERGL+ LS+QGLL                     
Sbjct: 406 SGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTR 465

Query: 741 ---------------------------------------VEDL----------------- 800
                                                  ++D                  
Sbjct: 466 VKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYFISIIDDFSRKVWIYPLKQKDEAFG 525

Query: 801 --------VEKQTERKLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAE 860
                   VE QT RK+K LRTDNGLEF++N+F +FCK +EGI RH TV  TPQQNGLAE
Sbjct: 526 KFLEWKKQVENQTGRKVKYLRTDNGLEFVNNKFNQFCK-SEGITRHFTVTYTPQQNGLAE 585

Query: 861 RMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLS 920
           R NRT++E+ RCL++NA LP KFWGEA  TA YL+NRSPSTA++ KTP E W+   P L 
Sbjct: 586 RFNRTIMERTRCLLTNASLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLE 645

Query: 921 NLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDES 980
           +LR FGC AYAH K+GKL+ RA KC+F+GY  G+KGY+LWCIEKG  KCIISRDVTF+E+
Sbjct: 646 HLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTFNET 705

Query: 981 VIAW-----EKNQ------SETETNSEKNKSFEMELELASIQTPTENQPAETD------- 989
            + +     +K Q      +E    SE   S +++ +   +    + Q +E D       
Sbjct: 706 EMPYCVKEQQKQQTGDHVVTEVRIASEVRPSIDLDNQPPLVSEIEDTQQSEFDGIQSQQE 765

BLAST of Lag0035208 vs. ExPASy TrEMBL
Match: A0A5A7U2U7 (Retrotransposon protein, putative, Ty1-copia sub-class OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold385G00590 PE=4 SV=1)

HSP 1 Score: 576.6 bits (1485), Expect = 1.9e-160
Identity = 350/854 (40.98%), Positives = 467/854 (54.68%), Query Frame = 0

Query: 321 LPESEIKETKEIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKER 380
           + ESE ++  E+A+ TI+LYL+D VLR V EA T  E+WK+L+ +YLTKSL NK+YIKE+
Sbjct: 46  ITESEKRDMDEMAYWTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKIYIKEK 105

Query: 381 FFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAIILLNSLPESYNEVKSAIRYD 440
           FFG+KMD +K LE NLDE+ +IV+DL NI EKMSDEN+A+ILLNSLPE+Y EVK+AI+Y 
Sbjct: 106 FFGYKMDQSKILEENLDEFQKIVVDLNNIGEKMSDENQAVILLNSLPETYREVKAAIKYG 165

Query: 441 RDSLSMDIVLSALRSRDLELKKGKSKESEALFTRGRTEKKSSRNNSRT------------ 500
            DSL+M IVL AL++R+LE+KK + K+ E L  RGR+EKKS +   R+            
Sbjct: 166 WDSLTMSIVLDALKTRNLEIKK-ERKDGELLMARGRSEKKSWKGKERSFRSKSKGKSRKC 225

Query: 501 ---------------------------------------------------EVLTVLEGN 560
                                                              EVL V   +
Sbjct: 226 FLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGCDSAETGYESAEVLMVSHRD 285

Query: 561 FDSEWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNH---------------------- 620
               WI+DSGC+FHMTP++ +  NF+++DGGKVLLG++                      
Sbjct: 286 IQDAWIMDSGCTFHMTPHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRI 345

Query: 621 -----------------------------QNNI-KIVKGAIVKAKGILHNGLYVLSANTM 680
                                        +N + K+ KG++VK +G L +GLYVL   T+
Sbjct: 346 LTNVRYVPKLKRNLISLGELDRSGCTIKSENGVMKVTKGSLVKLRGTLRHGLYVLEGTTV 405

Query: 681 VGTTAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL--------------------- 740
            G+ A+AS +    + LWH RL H+SERGL+ LS+QGLL                     
Sbjct: 406 SGSAAIASGKVTNMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTR 465

Query: 741 ---------------------------------------VEDL----------------- 800
                                                  ++D                  
Sbjct: 466 VKFGKGKHTTKGILDYIHSDLWGPTKEVSMGGSRYFISIIDDFSRKVWIYPLKQKDEAFG 525

Query: 801 --------VEKQTERKLKCLRTDNGLEFLSNEFKEFCKLTEGIIRHLTVRGTPQQNGLAE 860
                   VE QT RK+K LRTDNGLEF++N+F +FCK +EGI RH TV  TPQQNGLAE
Sbjct: 526 KFLEWKKQVENQTGRKVKYLRTDNGLEFVNNKFNQFCK-SEGITRHFTVTYTPQQNGLAE 585

Query: 861 RMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLS 920
           R NRT++E+ RCL++NA LP KFWGEA  TA YL+NRSPSTA++ KTP E W+   P L 
Sbjct: 586 RFNRTIMERTRCLLTNASLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLE 645

Query: 921 NLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIEKGEEKCIISRDVTFDES 958
           +LR FGC AYAH K+GKL+ RA KC+F+GY  G+KGY+LWCIEKG  KCIISRDVTF+E+
Sbjct: 646 HLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTFNET 705

BLAST of Lag0035208 vs. ExPASy TrEMBL
Match: A0A2I0WB13 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Dendrobium catenatum OX=906689 GN=MA16_Dca013638 PE=4 SV=1)

HSP 1 Score: 544.3 bits (1401), Expect = 1.1e-150
Identity = 324/844 (38.39%), Positives = 467/844 (55.33%), Query Frame = 0

Query: 321 LPESEIKET---------KEIAFSTIILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSL 380
           LPESE+  T         ++ AFS+IIL LAD VLR+V    T  E+WK+L+++Y  K+L
Sbjct: 17  LPESELPSTMSDQEKLSIQKKAFSSIILCLADQVLRKVSHVKTVSELWKKLEELYRQKTL 76

Query: 381 TNKLYIKERFFGFKMDPNKDLEHNLDEYNRIVLDLANIDEKMSDENRAIILLNSLPESYN 440
            N++Y+KE+FFG+KMD  K ++ NLDE+N+++LDL N++ K+ DE++AIILLNSLP+S  
Sbjct: 77  PNRIYLKEKFFGYKMDEAKSIDDNLDEFNKLILDLENLEVKIEDEDKAIILLNSLPKSLR 136

Query: 441 EVKSAIRYDRDSLSMDIVLSALRSRDLELK-KGKSKESEALFTRGRTEKKSS-----RNN 500
             K  ++Y R+++++D V +AL S+ L++K   K+   E L  RGR++K+ +     ++ 
Sbjct: 137 NFKETLKYGRETITVDEVQNALSSKILDMKISEKNHSGEGLHVRGRSQKRGTSQKKWKSK 196

Query: 501 SRTEVLT--------------------------------------VLEGNFDSEWIL--- 560
           SR++  +                                      ++  N+DS  +L   
Sbjct: 197 SRSKSASKKDYKNVKCWQCNKTGHIRRFCPEKNPKDKSQSQGDAAIVGENYDSADVLNVS 256

Query: 561 ------DSGCSF----------------------HMTPNKHWFLNFEEIDGGKVLLGNHQ 620
                 +  C                        H+   K   ++   +D    +  + +
Sbjct: 257 DLLLGNNKACDVVGIGSIAVKMHDGHVRILKDVRHVPDLKRNLISLGTLDDSGYIFRSER 316

Query: 621 NNIKIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLR 680
             ++I KGA+V  KGI  NGLYVL   T+VG T V ++++  +TKLWH RLGH+S+RGL 
Sbjct: 317 GLLRISKGALVIMKGIKRNGLYVLQGATLVGETHVTAKQNLDKTKLWHQRLGHLSDRGLI 376

Query: 681 ELSKQGLLVED------------------------------------------------- 740
           EL KQGL   D                                                 
Sbjct: 377 ELQKQGLFGNDSIAKIDFCESCIIGKSHRLSFKLSTHRAEGILDYIHSDLWGPARVATHG 436

Query: 741 ------------------------------------LVEKQTERKLKCLRTDNGLEFLSN 800
                                               +VE Q  RKLK LRTDNGLEF + 
Sbjct: 437 GNRYFLSFIDDYSRKVWIFLLKSKDETFSKFLEWKSMVENQKNRKLKVLRTDNGLEFCNE 496

Query: 801 EFKEFCKLTEGIIRHLTVRGTPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTA 860
            F +FC    GI+RH TV  TPQQNGLAERMNRTLL+++RCL+ ++ L K FWGEAL TA
Sbjct: 497 SFNKFCS-DSGIVRHKTVSHTPQQNGLAERMNRTLLDRVRCLLFSSGLSKFFWGEALSTA 556

Query: 861 TYLVNRSPSTAIDFKTPMEKWSNHPPDLSNLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQ 920
            YLVNR+PS+AI+FKTP E W   PP L++LR FGC+AY H  +GKL+ R+ KC+FLGY 
Sbjct: 557 CYLVNRTPSSAINFKTPQELWKGKPPSLTHLRVFGCLAYPHQNKGKLEPRSIKCVFLGYP 616

Query: 921 SGIKGYRLWCIEKGEEKCIISRDVTFDESVIAWEKNQSETET-----NSEKNKS-FEMEL 980
           +G+KGYRLW +     K IISRDV F+E+ +   +++++  T     NS+ NK  +E E+
Sbjct: 617 TGVKGYRLWDLSSPGVKTIISRDVIFNENRLYISESENKDTTISSIENSDSNKDYYEFEV 676

Query: 981 ELASIQTPTENQPAETDVRVEEGAETQAETQAENIP-PEPDSLQNYNLTRDRQRREIRRP 989
           E  S     EN               Q  T+ EN P P  ++  +Y L+RDR+RR I+ P
Sbjct: 677 EPPSNTLSFEN--------------PQQSTEPENEPEPVSENNHDYLLSRDRERRNIKPP 736

BLAST of Lag0035208 vs. ExPASy TrEMBL
Match: A0A438IR25 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_2664 PE=4 SV=1)

HSP 1 Score: 541.2 bits (1393), Expect = 9.1e-150
Identity = 331/824 (40.17%), Positives = 455/824 (55.22%), Query Frame = 0

Query: 309 AKFEVERFDGR-----------------GLPE---------SEIKETKEI-----AFSTI 368
           AKF+VERF G+                 GL +         S ++E K+I     A S I
Sbjct: 4   AKFDVERFTGKNDFGLWRLKMRALLVQQGLHDALLGEKNLPSTMQEKKKIKLLEKAHSAI 63

Query: 369 ILYLADNVLRQVHEANTAEEVWKQLDKIYLTKSLTNKLYIKERFFGFKMDPNKDLEHNLD 428
           IL L D VLR+V +A +  EVW +L+ +Y+TKSL N+L+ K + + FKM P   +E +LD
Sbjct: 64  ILSLGDTVLREVAKAESTAEVWLKLESLYMTKSLANRLHKKIKLYTFKMTPGMSIEEHLD 123

Query: 429 EYNRIVLDLANIDEKMSDENRAIILLNSLPESYNEVKSAIRYDRDSLSMDIVLSALRSRD 488
            +N+I+LDL NID  +SDE++AI+LL SL   Y  +K AI Y RDSL+ D   ++     
Sbjct: 124 HFNKIILDLENIDITISDEDKAILLLTSLDAFYTNMKDAIMYGRDSLTFDEGKNSKSRSK 183

Query: 489 LELKKGK----SKE----SEALFTRGRTEKKSSR---------NNSRTEVLTVLEGNFDS 548
            + KK K     KE     +    R  T KK+                EVL V E +   
Sbjct: 184 SKTKKFKCFICHKEGHFKKDCPDRRQNTIKKTVNECDAAVILDGYDSAEVLNVAEVDSGK 243

Query: 549 EWILDSGCSFHMTPNKHWFLNFEEIDGGKVLLGNHQ------------------------ 608
           EWILDSGCSFHM P K WF +F+E DGG VLLGN++                        
Sbjct: 244 EWILDSGCSFHMCPIKAWFEDFKEADGGYVLLGNNKHCKILGTGTVRIKHYDGIERVLED 303

Query: 609 ----------------------------NNIKIVKGAIVKAKGILHNGLYVLSANTMVGT 668
                                       N++++ +G++   K  + NGLY L   T++  
Sbjct: 304 VRYIPELKRNLISLGMLDKSGYTFKSEPNSLRVARGSLTVMKETIKNGLYTLIGQTVIDK 363

Query: 669 TAVASERDQKQTKLWHARLGHMSERGLRELSKQGLL------------------------ 728
            +   + D   TKLWH RLGHMS +GL+EL KQG+L                        
Sbjct: 364 ASTVLKEDVGTTKLWHQRLGHMSHKGLQELEKQGVLGNYKLTDLPFCEHCVFGKATRVKF 423

Query: 729 VEDLVEKQTE---------------------RKLKCLRTDNGLEFLSNEFKEFCKLTEGI 788
            + + E Q +                     RK+K LRTDNGLEFLSN+F  FC+  EGI
Sbjct: 424 AKAIHETQNQLDYIHSDLWGPSRVPSIGGASRKVKKLRTDNGLEFLSNDFNSFCQ-KEGI 483

Query: 789 IRHLTVRGTPQQNGLAERMNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAI 848
             H TVR TPQQNGLAERMNRT+LE++RC++S++ L K FW EA  T  +L+NRSPS+A+
Sbjct: 484 ATHRTVRYTPQQNGLAERMNRTILERMRCMLSSSGLSKVFWAEAADTVVHLINRSPSSAL 543

Query: 849 DFKTPMEKWSNHPPDLSNLRTFGCIAYAHSKEGKLDNRAKKCLFLGYQSGIKGYRLWCIE 908
            FKTP EKW+    +  +L+ FGC AY H+K  KL+ RA KC+FLGY  G+KGY+LW   
Sbjct: 544 QFKTPQEKWTGKATNYQHLKVFGCTAYVHTKTDKLEPRAVKCIFLGYPKGVKGYKLWIET 603

Query: 909 KGEEKCIISRDVTFDESVIAWEKNQSETETNSEKNKSFEMELELASIQTPTENQPAETDV 968
           +G+  CIISRDVTF+E  ++ +    + E + +    FE+E E          QP ++  
Sbjct: 604 QGKGNCIISRDVTFNEQDMSKQTPAKDVEGSDQ--LQFEVEHETL--------QPEKSKE 663

Query: 969 RVEEGAETQAETQAENIPPEPDSLQNYNLTRDRQRREIRRPARYASADIVHYALFTEMNS 988
              + A+ +   + +N P +   L++YNL RDRQ+R+++ P RY   ++  +AL      
Sbjct: 664 TSSKTAQEEIVHERQNEPTQ--GLKSYNLVRDRQKRQVKPPKRYGQVEMTTFALSVAEEI 723

BLAST of Lag0035208 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 97.4 bits (241), Expect = 6.7e-20
Identity = 47/104 (45.19%), Positives = 72/104 (69.23%), Query Frame = 0

Query: 886 EEPLTYHEAINSINSDKWKEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIYKVKQSVD 945
           +EP TY+EA   +    W  AM +E+ ++   +TWE+   P NK  +GCKW+YK+K + D
Sbjct: 84  KEPSTYNEAKEFL---VWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSD 143

Query: 946 PSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTLLSLN 990
               +RYKARLVAKGYTQ+EG+D+ E FSPV + +S++ +L+++
Sbjct: 144 -GTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAIS 183

BLAST of Lag0035208 vs. TAIR 10
Match: ATMG00710.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein)

HSP 1 Score: 90.9 bits (224), Expect = 6.2e-18
Identity = 42/83 (50.60%), Positives = 54/83 (65.06%), Query Frame = 0

Query: 662 MNRTLLEKIRCLMSNACLPKKFWGEALMTATYLVNRSPSTAIDFKTPMEKWSNHPPDLSN 721
           MNRT++EK+R ++    LPK F  +A  TA +++N+ PSTAI+F  P E W    P  S 
Sbjct: 1   MNRTIIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYSY 60

Query: 722 LRTFGCIAYAHSKEGKLDNRAKK 745
           LR FGC+AY H  EGKL  RAKK
Sbjct: 61  LRRFGCVAYIHCDEGKLKPRAKK 83

BLAST of Lag0035208 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase))

HSP 1 Score: 90.9 bits (224), Expect = 6.2e-18
Identity = 48/110 (43.64%), Positives = 73/110 (66.36%), Query Frame = 0

Query: 879 TEMNSIDEEPLTYHEAINSINSDKWKEAMQEEMNSLLKNNTWELVDRPSNKILVGCKWIY 938
           T   +I +EP      I ++    W +AMQEE+++L +N TW LV  P N+ ++GCKW++
Sbjct: 19  TITTTIKKEP---KSVIFALKDPGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVF 78

Query: 939 KVKQSVDPSQPKRYKARLVAKGYTQKEGVDYGEIFSPVVRHSSIRTLLSL 989
           K K   D +   R KARLVAKG+ Q+EG+ + E +SPVVR ++IRT+L++
Sbjct: 79  KTKLHSDGTL-DRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNV 124

BLAST of Lag0035208 vs. TAIR 10
Match: AT3G10690.1 (DNA GYRASE A)

HSP 1 Score: 56.2 bits (134), Expect = 1.7e-07
Identity = 31/59 (52.54%), Positives = 38/59 (64.41%), Query Frame = 0

Query: 14  VDFVPTFDNSQKEPSLLPAQLPTLLSNGSSGITVIHLRRV--RGIGELI-VPSYFFHKP 70
           VDFV  FDNSQKEP++LPA+LP LL NG+SGI V     +    +GEL+ V     H P
Sbjct: 237 VDFVANFDNSQKEPAVLPARLPALLLNGASGIAVGMATNIPPHNLGELVDVLCALIHNP 295

BLAST of Lag0035208 vs. TAIR 10
Match: ATMG00300.1 (Gag-Pol-related retrotransposon family protein)

HSP 1 Score: 55.1 bits (131), Expect = 3.8e-07
Identity = 27/66 (40.91%), Positives = 42/66 (63.64%), Query Frame = 0

Query: 539 IKIVKGAIVKAKGILHNGLYVLSANTMVGTTAVASERDQKQTKLWHARLGHMSERGLREL 598
           +K++KG     KG  H+ LY+L  +   G + +A E  + +T+LWH+RL HMS+RG+  L
Sbjct: 29  LKVLKGCRTILKGNRHDSLYILQGSVETGESNLA-ETAKDETRLWHSRLAHMSQRGMELL 88

Query: 599 SKQGLL 605
            K+G L
Sbjct: 89  VKKGFL 93

The following BLAST results are available for this feature:
BLAST of Lag0035208 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
TYK25306.1 | 8.6e-179 | 42.73 | putative gag-pol polyprotein [Cucumis melo var. makuwa]
KAA0050719.1 | 8.6e-179 | 42.73 | putative gag-pol polyprotein [Cucumis melo var. makuwa]
KAA0047995.1 | 4.0e-160 | 40.98 | retrotransposon protein, putative, Ty1-copia sub-class [Cucumis melo var. makuwa...
PKU72844.1 | 2.2e-150 | 38.39 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Dendrobium catenatu...
RVW99173.1 | 1.9e-149 | 40.17 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]
BLAST of Lag0035208 vs. ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
P10978 | 5.8e-93 | 30.41 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146 | 1.4e-54 | 33.05 | Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
Q94HW2 | 8.5e-36 | 26.73 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94 | 9.3e-35 | 25.65 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
P92512 | 8.8e-17 | 50.60 | Uncharacterized mitochondrial protein AtMg00710 OS=Arabidopsis thaliana OX=3702 ...
BLAST of Lag0035208 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A5D3DNU1 | 4.2e-179 | 42.73 | Putative gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_sca...
A0A5A7UB25 | 4.2e-179 | 42.73 | Putative gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_sca...
A0A5A7U2U7 | 1.9e-160 | 40.98 | Retrotransposon protein, putative, Ty1-copia sub-class OS=Cucumis melo var. maku...
A0A2I0WB13 | 1.1e-150 | 38.39 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Dendrobium catena...
A0A438IR25 | 9.1e-150 | 40.17 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX...
BLAST of Lag0035208 vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT4G23160.1 | 6.7e-20 | 45.19 | cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00710.1 | 6.2e-18 | 50.60 | Polynucleotidyl transferase, ribonuclease H-like superfamily protein
ATMG00820.1 | 6.2e-18 | 43.64 | Reverse transcriptase (RNA-dependent DNA polymerase)
AT3G10690.1 | 1.7e-07 | 52.54 | DNA GYRASE A
ATMG00300.1 | 3.8e-07 | 40.91 | Gag-Pol-related retrotransposon family protein
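
The identity and positives percentages reported for each HSP are the matched counts divided by the alignment length, for example 331/824 for the Vitis vinifera hit. The following is a minimal Python sketch, not part of the database output, that recomputes those percentages from a header line. The helper name `recompute_hsp_percentages` and the regular expression are illustrative assumptions about the line layout shown on this page.

```python
import re

# Hypothetical pattern for header lines such as
# "Identity = 331/824 (40.17%), Positives = 455/824 (55.22%), Query Frame = 0".
# The optional "i" accepts either spelling of the positives label.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def recompute_hsp_percentages(line):
    """Return recomputed and reported percentages for one HSP header line."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP header line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        # Matched residues divided by alignment length, as a percentage.
        "identity": round(100.0 * int(ident) / int(ident_len), 2),
        "positives": round(100.0 * int(pos) / int(pos_len), 2),
        # Values printed in the report, kept for comparison.
        "reported_identity": float(ident_pct),
        "reported_positives": float(pos_pct),
    }
```

Comparing the recomputed and reported values is a quick consistency check when the alignment blocks are copied out of the page by hand.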
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 89..116
None | No IPR available | PFAM | PF14223 | Retrotran_gag_2 | coord: 327..457; e-value: 2.1E-31; score: 108.5
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 105..157
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 109..128
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 832..851
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 829..853
None | No IPR available | PANTHER | PTHR45895 | FAMILY NOT NAMED | coord: 609..987; coord: 324..457
IPR013103 | Reverse transcriptase, RNA-dependent DNA polymerase | PFAM | PF07727 | RVT_2 | coord: 917..988; e-value: 3.3E-18; score: 66.2
IPR013758 | DNA topoisomerase, type IIA, subunit A/ C-terminal, alpha-beta | GENE3D | 3.90.199.10 | Topoisomerase II, domain 5 | coord: 13..46; e-value: 1.5E-8; score: 35.6
IPR036397 | Ribonuclease H superfamily | GENE3D | 3.30.420.10 | | coord: 598..716; e-value: 1.5E-26; score: 95.2
IPR025724 | GAG-pre-integrase domain | PFAM | PF13976 | gag_pre-integrs | coord: 556..604; e-value: 2.0E-10; score: 40.3
IPR001584 | Integrase, catalytic core | PROSITE | PS50994 | INTEGRASE | coord: 510..716; score: 13.751912
IPR012337 | Ribonuclease H-like superfamily | SUPERFAMILY | 53098 | Ribonuclease H-like | coord: 613..725
IPR013760 | DNA topoisomerase, type IIA-like domain superfamily | SUPERFAMILY | 56719 | Type II DNA topoisomerase | coord: 13..47
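
The InterPro alignment coordinates are given as `coord: start..end` positions on the protein. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, which this page does not state, and the helper names `parse_coord`, `span_length` and `overlap` are hypothetical. It shows how two hits, such as the integrase catalytic core (510..716) and the Gene3D ribonuclease H hit (598..716), can be compared.

```python
import re

# Hypothetical pattern for the "coord: start..end" notation used in the table.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Extract the first (start, end) pair from a 'coord: a..b' string."""
    match = COORD_RE.search(text)
    if match is None:
        raise ValueError("no coordinate found in %r" % text)
    return int(match.group(1)), int(match.group(2))


def span_length(span):
    """Length of a span, assuming 1-based inclusive coordinates."""
    start, end = span
    return end - start + 1


def overlap(a, b):
    """Number of positions shared by two inclusive spans (0 if disjoint)."""
    low = max(a[0], b[0])
    high = min(a[1], b[1])
    return max(0, high - low + 1)


integrase = parse_coord("coord: 510..716")
rnase_h = parse_coord("coord: 598..716")
print(span_length(integrase), overlap(integrase, rnase_h))
```

If the coordinates turn out to be half-open or 0-based, only `span_length` and `overlap` would need adjusting.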

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lag0035208.1Lag0035208.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0015074 | DNA integration
biological_process | GO:0006265 | DNA topological change
biological_process | GO:0006259 | DNA metabolic process
molecular_function | GO:0005524 | ATP binding
molecular_function | GO:0003677 | DNA binding
molecular_function | GO:0003918 | DNA topoisomerase type II (double strand cut, ATP-hydrolyzing) activity
molecular_function | GO:0003676 | nucleic acid binding