Lag0005116 (gene) Sponge gourd (AG‐4) v1

Overview
NameLag0005116
Typegene
OrganismLuffa acutangula (Sponge gourd (AG‐4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Locationchr6: 10871265 .. 10878857 (+)
RNA-Seq ExpressionLag0005116
SyntenyLag0005116
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAACCTCTGAGAACAATTCTGAGATCTCAAGCGGCTCGCAAAATGGCCAAATCGTCAATCCTGGGAATAAGATCTCAACTGTGAAGCTGACCGATGAAAACTTCCTCATGTGGAAATTTCAGATCCTCACTGCTCTCGAAGGTCATGATCTTGATGACCATATCAGTGGCGATTCTCAACCACTGCCTGAGCTAATCCAGGTAAGTGAAAATGCGACGACGGTCAGTAAGCCTAACCCTGCCTATAAAGTTTGGAAAAAGCAAGATAAATTAGTGTCCTCGTGGATTGTTGGGTCTATGTCTGAATCCATCCTCGAGCAAGTCCTTCACTGTAAGTCGGCAAAAGAAATTTGGTCTTGCTTGCTTCAGATTTTTAATTCACGTCATCTTGCTCAAGTTATGAAGATTAAATCGAAATTACAAAATATTCAGAAAGGTGGACTGTCTATGAATGAATATATATCTAAAATCAAGAAGTGTATTGATGCTTTGTCTGCGATAGGAAAGGAAATAGATGTTCAGGATCACATTATGTATATTCTGTCTGGATTGGGGGCTGAATTGAAACTATGGTCTCTGTTATTACTGCTAAAACCGGTACTCAAACTGTTCAAGATGTTGTAGCCCTGTTATTGACTCATGAGAGTAGAATAGAGAGTAAGTCAGTGATTAATTCTGATAATGTTTTACCTTCTGCGAACCTAGCGGTTCAAAATGTGAGTCAAAACTCAGTTCCAAATCCTTCCCCTAACTCTCAACAGCAAAATTTTGGTAATGGTAGAGGTAGGAGTCGGTCTAATTTTGGTCAAAACAGAGGAGGAAGGTCCTGGAACAATCGTAATAGGCCTCAGTGTCAATTGTGTAATAAAATTGGCCATACTGCTATGAAGTGCTACTCTCGAGTTCAGATGCCAGGAGCCTATGCAACTCAATTTAATCCTCCTGGGCAAATGAATCCCTCAGGCCTGAATTTTAGTCCACAACAATTTAATGGTCAGTTTCCCCAATGCAAGCTATGCTAGCCTCTCCTAATTACAATCAGGATTGTAACTGGTATCCTGATTCCGGAGCCACGAATCACTTGACTAATAGTCTCAGTAACATGTCCGTGAGCTCGAATATCCTGGCAATAATCAGGTTCTCATTGGCAATGGTGCAGGTTTGGCTATTTCTCATCTTGGATATGCTTCTTTTACTTCTTCAAATAATCATATGTTTCATTTAAATAACCTTTTACATGTTCCCTCCATTACAAAAAATCTTATCAGTGTCAGTCAATTCGCTAAGGATAATGCTGTTTTCTTTGAATTTCATCCAACTTTCTGTGTTGTGAAGGATCTAGCAACTGGACGGGCACTCCTTCGAGGGACTCTACATGAAGGACTATATAGGTTCAACCTGCCGCAGCCTTTGCCATCGTTAAACACCTCTGTTGTTCGACCGGAAACTAATACTGCTGTTTTTTCTACTTTACCTTGTGTTTCCAATGATGTGTCTGCTTTATATTCCTCTGTTCAGTCTTCAAATAATTTGTCCATAGATGTTTGGCATCAACGTCTTGGACATCCATCTATTTCTATTGTTAAGCAAGTTGTTCGTTCCTGTAATCCAAAAGTTTCTACTAATGCCATTATGTCATTTTGTCATGCATGTGCAATAGGCAAACATCATGCCATGCCTTTCTCTCCCTCTACTACATCTTACTCTGCTCCTTTGCAACTTATAGTTACTGATTTATGGGGTCCAGCTTATAAACTGTCTACCCATGGATTTCAGTATTACATTAGCTTTGTGGATGCTTTTTCGAGATATACATGGATTTATTTTCTTCAAACCAAGTCTGAAGCATTTCAGGCTTTTCTCAAATTTAAAACTCATGTGGAAAAACAGTTTGGAACTCCTATTGTTTCCTTCAAACTGATGGAGGTACTGAATTTAAACCATTCATTCCATTTCTTCATACTCATGGCATAAATCATAGAGTTTCATGTCCCTATACATCTCAACAGAATGGCATTGTAGAACGTAAACATCGTCACATTGTTGATGTTGGTCTCACCTTGTTATCACATTCTTCTATACCTCTAACATTCTGGGATGATGCTTTTTCCACCAGTGTTTATCTTATCAACAGGTTACCCTCTATAGTTCTTGGTGGCATGAGTCCCTTGGAGAAGCTCTTTCGGAAGCAACCAGATTATTCCACACTTAAAGTCTTTGGTTGTAAGTGTTTTCCTTGCCTTCGCCCATATAATTCTCATAAGTTGAGTTTTCGGTCGAGTCCCTGTACATTCATTGGTTATAGTCATATTCATAAAGGCTATAAATGTTTGTCCTCTGACGGTAGACTTTATATCTCTAGACATGTATTGTTTGATGAAAATTCTTTTCCATTTGCTTCTCTTACTTCTCATTCTTCTGTTTCTCCCAATTGTGTAACTCAAAGTTTACCTACATTGTCTTCTGTTTCTTCCTCTACTACAGTTGAGTCTTCTTCTGATGCACACTTAAGTATCAGTGAGACATCCTCTATTCCATCAACTGCTGATCATCCTACTAATAATAGTCCTTCACCCTGTTTCCTGAACCGTGTGTCCAACCTAGTCAACCTCCTCCTACTTTACCGTCTACAACTAGTTTAGGTACTCACCACATGATTACACGAAGTAAAAGAGGCATATTCAAACCTAAGGCTTTTCTTGCTACCTTTGTTGATGTTGAACCGCCTAATGTTAAAGAGACCCTTAAATGTTCTCATTGGAAACAAGCAATGCAAGATGAGTATGATGCTCTTAACGTAAAGGCAGCCCACTGAAGGTTTTTATTATTTGTCCAGCCCATATTCTTTGTAAAAGGCATAAGCCCATATTATGAAGCCCAACCCATTTTCTCTTTATCAGATTAATAGGGCTATATGTAAAGGTTCTAGAATAGTTGTCTTCTAGAACTCTCTCTCACAGGTGTATTCTTTCGTGATAGCTCTCAGAGGTAAGTTTTTGTGATTTAGGGTTAGGGTTTGTTATATTTTCCAAGAGTTTTTTCATCTCTATCTAGCAGTATATATGTAGCTGCCTCTTCAATAGTTGAATTCAATTCAATGTGATAGAGATTTTTCTCTTGGAATCCTTCTGCGTTTTTCTCTTCTTTCGTCTCAATGTTTTTTCTATTTTCGCCTTTATCGCTTATGCTTTGATGAGCAAAGAGGTATTACCTCAACATTGATTGTGTATGGAAAATCGACTACAATCCCTTTGTGTTGAAATCCTTGCCATTTGGATTGAGTAAAGAATTTGCAAGAAGGCCTTGTTGGGGGAAATTACAAATAACAGAGGAAATGGAGAAAATATTGAGAAACAGACACGTAAACAATTAGGTTGAGGCGCGGGTATCTCGCTCTCTTTAAGGAGATTCAAGCCCTCTGCGGTGTATAATTCTTTATCAATCACTGGTGCAGCAGGTATTCTCAAGCCTGTCCCTTCCAGGATACAACAACCCAACTGAAACGTTGTGTAGCTCAAACCACTATGCACTAAAAGTTTGAGCTCAAAGCTCCACAACAAAGAACACCTCTCTTTTGGCCAATTGCTCTTAAAAGAGCAAATGAGAGAAAGATGAGAAAGTAAAGAAAAGAAGATAGGAATTGTTGTGTTTCTTGTTGTGTGTAGAAAGTGTTTGGTGTGAGAGGTATTTATAGGATTTTCAAGATCTTGAAATTTAATACAAAACAAAAGTAACCAAAGTGTTAAAACCAAAACTCCCCCACCAAATACGTTTTTCTCTTTTAAAATGAGAGAAAACTCTCCAAAATTAATTTCTAAAATGAAACTCGTATTTAAATTACAATTAAATATCAAAATGTTACAACAGGTCTCAATCTCACTCGCTTAATCGTGAAAATCGATTACCTTCAAGCTCGAACCTCCTCTTTAATGAAATTGAATCGCTAGCGGAAGAGAGAATTTAAGTTGGAGAAATTTTGTCACTGTCTAAAACACTAAAAATAGTGCGATTTTGCTCATGTTAAGAGAAAGTTGAATTAAGTGGATGATAAAATATGACAAACGTGACATACTCCTTTTAATTGACTGTTTTATGGAAGTTTGGTTTTTCTAGATGGATTTCAGACGTTGTTAAGGAGGATTTTCAAACGTGTAGCCACATGACACAACTTCTTTTTAATGAAATACTATCATCTTATTTTAAAAGAAAACATGTTCCTTAATTAATTAGAGTACCGGTTAAAATTAATCATGATTCAATAAAAGTTTTATTATAATTTTTGTGGAGTTGTATATATCTCTTTCAATTAAAATCTTTTAACAACCAAAATAGAATTTTAAGAACAAATTGTCTATTTTTAGGTGAATTTTTCAACTCTAATACATCAATAATAAAGTCAATAGACTTCACATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACAGAAATGAGTACTAAAAAAATCTCTACCAACCAGACAGATTCACTCACATATTTATATTATTTTAAATAAATTTATTAAATAATAATAAAATTTGATATGGGAATAAAATATTCCAGAAGAATTTATTTTTCCATTTTCCTAATGGACTTTAGAATATTTGAATAATAGACCAATAACAAATTGGGAAAGTGTTATTATCCAAAGCTTAGAGACAAAGTGGGGGATTGAAAGAACATAAAGATGGAATATTCAAATAGAACAATTAATTAGAATCCAAAACCTAAATTTTGTCATTTGATTATATTTTCAAATATTAAAGGTGTTGTGTTAACTTCAATTTCATATATCAAGTATATTATATACATTGGATCTTTATTTATTTATTTATTTATTTTAAAGTTATATATGTTAGATCAAATTCTAAAAAAAAAAAGTTGATGCAAAGACTGAAACTACATTTACCTACTTAATTTAGTTTTAATCTTAGATGCATACTGTTATAGGCAATTTTGGACCACCCCGATACACAAGGAGCTGACGAGGACAACCGGGGAGAAATCGGGCTGAAAGATGGACTAAGAAGGCAAAACCGGCAAATGGGACGGGCCAAGACCGAAGGGGTCGGGTTTTCGGCCCGACCCCCTGCTCGGCCCGCTCGTGCGGGCCGAGTCCGTTTGGTCCCGTCTGGTCTCCACCGCCTCTGGATGCCCCGGTTTCGCCTGGTTTGACCTAAAACGCCTCCGAAACCCTAAAAAGGCTAGGAGGATGAGCAGGTATTTATATCCCTCTTTGCCACTGAAGAGGGGATCCCGAATTCTATCCCTAAACTCTACTATTGACTCTCTACTTTCTGCTCTTGCTCTTACTTTTCCACGCCCTCCGTTCTGCTTTCTGACTTAAGCATCGGAGCCGGTGTGGCGAGCACTACACCGGTGTGCAGGTTTACTGTCTTGCAGGCCACGTCTTCCCCCCTCAACTACAAATTTACCGTTGGTGGCACGTGAAGGTCAGGCGAGATCCTCTGGCCAAAATCGACCATCAACAGTTGGCGCCGTCTGTGGGGAAGAAAGCCTGTTAATCTGCACATCGGTTATTCCATGAGTAAGGGTATGGAAAGAAAGACCAAGACGTAAACATAGAGAATTCGGATGGTGACCGCCACCAGCGGAGGTCACGTGACGAGGATAGCATCCGGGGGTCACCGAGACAAGCAGGCCGAGGCCGAGGCCGAGGCCGAGCAGAGGATGCCGACACCAAGATTGCCGCCCTTGAGGATGAGGTGAAGGGAATGAATCGGAGTTTATCCAAAATACTCCAAATCCTGGATAAACCCGGCCCTAGCACCAAAGTCCATGAGGGGAGCTTGATTAGAGACCCGAGGAAGGGGAAGGAGCCTATGGAGCACACTGCAGAGTCAGGGACGAGGTCAAGAGGAAAGAAGACTGACAGCATGACCAGCAAGGTCAGGGGGCTCAAACCCACTGATCACACGATTTTGAGGAGCCCAGAGTCAAGCACACTTAAAGGGCGTCACTACACAGTTTCTACCCCAAGCTTCGGTCATACTAAGACAGACCTGAGGAATCTGATCGTTGAGAAGCGCAGAAGTGCCAAAACTGCCGAGTCCGAAGCCAAAGCCGCCGAAGCTGAAGCCCGGGCTGCCGAGGCCGAGGCCAGAGCAGCCGAGGCCGAGGCTAGGTTGGCCGAGGCCGAGGCCAAGAAAGACGACCTCCCTTGGAAGACCGAGCTTCTTAATGCACTAAAGGAGCTCGGAAATCCTCAGGGAGACCAGCAGAGGTCAAAGAACTTTGGAGATCAAAACTTGGAGGAACTAGCCGACCAAGTCGATCCGCCCTTCACAGAAGAAGTCATGAAAGCCGAGGTGCCTCAAAAGTTTAAGGTACCCACGTTCAAACAGTACGATGGCAAGAAAGACCCCGTGCAACATCTAAATGCATACAGAAGTTGGATGGACTTCCACGGCGTCTCAGATGCAATCAGGTGCCGTGCATTCTTTTTCACCCTGACAGGATCAGCTAGGCACTGGTTTGAGAGGCTGAAAAGGAGATCCATCAGCTGTTTCAAAGAGTTAGCCCAAGCATTCCTTGCACAGTTCATGGGGCTAGAGAGCAGCGCAAGCCTCACATCAACCTCTTGACGGTCAAACAACAACCAGGTGAGAGCTTGCGTGATTACATAACTCGTTTTAATGATGAGGCACTACAGGTTGAGGGGTACAGCGAGGGAGCAGCCCTGGTAGCCATAACAGCCGGTCTGGAAGACGAAAGACTGCTCAATTCAATAGGTAAGAGCCAGCCTCGAACCTATGCGGAGTTCGTCTCCCGGGCACAGAAGTATATGAGCGCAGAGGAGTTACTGAAGTCAAAAAGGTCAGAACGAGAATACAAGAGGTTTTCTTCATCTAGCTATGACAGTAAAAAGGACAAAAGGCAGCGGACCGACGAAGGAGGCCGGGGCCGAGCAGACCATGGCCGAGGCCGACCAGACAATGGCCGAGGCCGACCAGACCAAGGAGCACCTCCTTTCGGTAAGTTTGAGAAATACACCCCAACTGCTGTTCCGCAGGAGCAAGTACTGATGGAGATCCGAAATACGGGCCTCCTGAAATTCCCAGGGAGGATGAAGTCGAGTGCCGATAGAAGAGACAAGAGCCAGTATTGTCTTTTCCACCGGGACCACGGGCATTCAACCAGGAATTGTATTCAGTTGAAGGATGAAATCGAAGCACTGATCCAGAATGGGTATTTGAAAGAGTTCGTCGGTGAGCCAAAGGCCGAGGCCGACCACGGATGGCCGAGGCCGAGCCTTACCAAAGATGGCCGAGACAAAGAAGAACCCCTACGGGAGATCAGAACCATCTTTGGAGGACCAGCAGGAGGAGGTTCGAGCAGGAAGAGGAAAGCTATGGTCAGGGAAGCAAGGTCCGAACCAGAATATCGAGGTATGTACTCTGTCCATCTATCAAAGGCACACCCCCCTTTGGAGTTCACTGAGGCTGAGGCAGCGAGCATTCATCAGCCACATAATGATGCTCTGGTGGTCACTCTAATCGTAGCCAATGTGAAAATCCATCGGATCCTAATTGATGGGGGAAGCTCGGCTGATGTCCTTTCTTAA

mRNA sequence

ATGGAAACCTCTGAGAACAATTCTGAGATCTCAAGCGGCTCGCAAAATGGCCAAATCGTCAATCCTGGGAATAAGATCTCAACTGTGAAGCTGACCGATGAAAACTTCCTCATGTGGAAATTTCAGATCCTCACTGCTCTCGAAGGTCATGATCTTGATGACCATATCAGTGGCGATTCTCAACCACTGCCTGAGCTAATCCAGGTAAGTGAAAATGCGACGACGGTCAGTAAGCCTAACCCTGCCTATAAAGTTTGGAAAAAGCAAGATAAATTAGTGTCCTCGTGGATTGTTGGGTCTATGTCTGAATCCATCCTCGAGCAAGTCCTTCACTATGTTGTAGCCCTGTTATTGACTCATGAGAGTAGAATAGAGAGTAAGTCAGTGATTAATTCTGATAATGTTTTACCTTCTGCGAACCTAGCGGTTCAAAATGTGAGTCAAAACTCAGTTCCAAATCCTTCCCCTAACTCTCAACAGCAAAATTTTGGTAATGGTAGAGGTAGGAGTCGGTCTAATTTTGGTCAAAACAGAGGAGGAAGGTCCTGGAACAATCGTAATAGGCCTCAGTGTCAATTGTGTAATAAAATTGGCCATACTGCTATGAAGTGCTACTCTCGAGTTCAGATGCCAGGAGCCTATGCAACTCAATTTAATCCTCCTGGGCAAATGAATCCCTCAGGCCTGAATTTTAGTCCACAACAATTTAATGGTTTGGCTATTTCTCATCTTGGATATGCTTCTTTTACTTCTTCAAATAATCATATGTTTCATTTAAATAACCTTTTACATGTTCCCTCCATTACAAAAAATCTTATCAGTGTCAGTCAATTCGCTAAGGATAATGCTGTTTTCTTTGAATTTCATCCAACTTTCTGTGTTGTGAAGGATCTAGCAACTGGACGGGCACTCCTTCGAGGGACTCTACATGAAGGACTATATAGGTTCAACCTGCCGCAGCCTTTGCCATCGTTAAACACCTCTGTTGTTCGACCGGAAACTAATACTGCTGTTTTTTCTACTTTACCTTGTGTTTCCAATGATGTGTCTGCTTTATATTCCTCTGTTCAGTCTTCAAATAATTTGTCCATAGATGTTTGGCATCAACGTCTTGGACATCCATCTATTTCTATTGTTAAGCAAGTTGTTCGTTCCTGTAATCCAAAAGTTTCTACTAATGCCATTATGTCATTTTGTCATGCATGTGCAATAGGCAAACATCATGCCATGCCTTTCTCTCCCTCTACTACATCTTACTCTGCTCCTTTGCAACTTATAGTTACTGATTTATGGGGTCCAGCTTATAAACTGTCTACCCATGGATTTCAGTATTACATTAGCTTTGTGGATGCTTTTTCGAGATATACATGGATTTATTTTCTTCAAACCAAGTCTGAAGCATTTCAGGCTTTTCTCAAATTTAAAACTCATGTGGAAAAACAGTTTGGAACTCCTATTGTTTCCTTCAAACTGATGGAGAATGGCATTGTAGAACGTAAACATCGTCACATTGTTGATGTTGGTCTCACCTTGTTATCACATTCTTCTATACCTCTAACATTCTGGGATGATGCTTTTTCCACCAGTGTTTATCTTATCAACAGGTTACCCTCTATAGTTCTTGGTGGCATGAGTCCCTTGGAGAAGCTCTTTCGGAAGCAACCAGATTATTCCACACTTAAAGTCTTTGGTTGTAAGTGTTTTCCTTGCCTTCGCCCATATAATTCTCATAAGTTGAGTTTTCGGTCGAGTCCCTGTACATTCATTGGTTATAGTCATATTCATAAAGGCTATAAATGTTTGTCCTCTGACGGTAGACTTTATATCTCTAGACATGTATTGTTTGATGAAAATTCTTTTCCATTTGCTTCTCTTACTTCTCATTCTTCTGTTTCTCCCAATTGTGTAACTCAAAGTTTACCTACATTGTCTTCTGTTTCTTCCTCTACTACAGTTGAGTCTTCTTCTGATGCACACTTAAGTATCAGTGAGACATCCTCTATTCCATCAACTGCTGATCATCCTACTAATAATAGTCCTTCACCCTGTTTCCTGAACCGTACTCACCACATGATTACACGAAGTAAAAGAGGCATATTCAAACCTAAGGCTTTTCTTGCTACCTTTGTTGATGTTGAACCGCCTAATGTTAAAGAGACCCTTAAATGTTCTCATTGGAAACAAGCAATGCAAGATGAGTATGATGCTCTTAACGCAATTTTGGACCACCCCGATACACAAGGAGCTGACGAGGACAACCGGGGAGAAATCGGGCTGAAAGATGGACTAAGAAGGCAAAACCGGCAAATGGGACGGGCCAAGACCGAAGGGGTCGGGTTTTCGGCCCGACCCCCTGCTCGGCCCGCTCGTGCGGGCCGAGTCCGTTTGGTCCCGTCTGGTCTCCACCGCCTCTGGATGCCCCGGCCACGTCTTCCCCCCTCAACTACAAATTTACCGTTGGTGGCACGTGAAGGGTATGGAAAGAAAGACCAAGACGTAAACATAGAGAATTCGGATGGTGACCGCCACCAGCGGAGGTCACGTGACGAGGATAGCATCCGGGGGTCACCGAGACAAGCAGGCCGAGGCCGAGGCCGAGGCCGAGCAGAGGATGCCGACACCAAGATTGCCGCCCTTGAGGATGAGGTGAAGGGAATGAATCGGAGTTTATCCAAAATACTCCAAATCCTGGATAAACCCGGCCCTAGCACCAAAGTCCATGAGGGGAGCTTGATTAGAGACCCGAGGAAGGGGAAGGAGCCTATGGAGCACACTGCAGAGTCAGGGACGAGGTCAAGAGGAAAGAAGACTGACAGCATGACCAGCAAGGTCAGGGGGCTCAAACCCACTGATCACACGATTTTGAGGAGCCCAGAGTCAAGCACACTTAAAGGGCGTCACTACACAGTTTCTACCCCAAGCTTCGGTCATACTAAGACAGACCTGAGGAATCTGATCGTTGAGAAGCGCAGAAGTGCCAAAACTGCCGAGTCCGAAGCCAAAGCCGCCGAAGCTGAAGCCCGGGCTGCCGAGGCCGAGGCCAGAGCAGCCGAGGCCGAGGCTAGGTTGGCCGAGGCCGAGGCCAAGAAAGACGACCTCCCTTGGAAGACCGAGCTTCTTAATGCACTAAAGGAGCTCGGAAATCCTCAGGGAGACCAGCAGAGGTCAAAGAACTTTGGAGATCAAAACTTGGAGGAACTAGCCGACCAAGTCGATCCGCCCTTCACAGAAGAAGTCATGAAAGCCGAGGTGCCTCAAAAGTTTAAGGTACCCACGTTCAAACAGTACGATGGCAAGAAAGACCCCGTGCAACATCTAAATGCATACAGAAGTTGGATGGACTTCCACGGCGTCTCAGATGCAATCAGAGTTAGCCCAAGCATTCCTTGCACAGTTCATGGGGCTAGAGAGCAGCGCAAGCCTCACATCAACCTCTTGACGGTCAAACAACAACCAGGTGAGAGCTTGCGTGATTACATAACTCGTTTTAATGATGAGGCACTACAGGTTGAGGGGTACAGCGAGGGAGCAGCCCTGGTAGCCATAACAGCCGGTCTGGAAGACGAAAGACTGCTCAATTCAATAGGTAAGAGCCAGCCTCGAACCTATGCGGAGTTCGTCTCCCGGGCACAGAAGTATATGAGCGCAGAGGAGTTACTGAAGTCAAAAAGGTCAGAACGAGAATACAAGAGGTTTTCTTCATCTAGCTATGACAGTAAAAAGGACAAAAGGCAGCGGACCGACGAAGGAGGCCGGGGCCGAGCAGACCATGGCCGAGGCCGACCAGACAATGGCCGAGGCCGACCAGACCAAGGAGCACCTCCTTTCGGTAAGTTTGAGAAATACACCCCAACTGCTGTTCCGCAGGAGCAAGTACTGATGGAGATCCGAAATACGGGCCTCCTGAAATTCCCAGGGAGGATGAAGTCGAGTGCCGATAGAAGAGACAAGAGCCAGTATTGTCTTTTCCACCGGGACCACGGGCATTCAACCAGGAATTGTATTCAGTTGAAGGATGAAATCGAAGCACTGATCCAGAATGGGTATTTGAAAGAGTTCGTCGGTGAGCCAAAGGCCGAGGCCGACCACGGATGGCCGAGGCCGAGCCTTACCAAAGATGGCCGAGACAAAGAAGAACCCCTACGGGAGATCAGAACCATCTTTGGAGGACCAGCAGGAGGAGGTTCGAGCAGGAAGAGGAAAGCTATGGTCAGGGAAGCAAGGTCCGAACCAGAATATCGAGGTATGTACTCTGTCCATCTATCAAAGGCACACCCCCCTTTGGAGTTCACTGAGGCTGAGGCAGCGAGCATTCATCAGCCACATAATGATGCTCTGGTGGTCACTCTAATCGTAGCCAATGTGAAAATCCATCGGATCCTAATTGATGGGGGAAGCTCGGCTGATGTCCTTTCTTAA

Coding sequence (CDS)

ATGGAAACCTCTGAGAACAATTCTGAGATCTCAAGCGGCTCGCAAAATGGCCAAATCGTCAATCCTGGGAATAAGATCTCAACTGTGAAGCTGACCGATGAAAACTTCCTCATGTGGAAATTTCAGATCCTCACTGCTCTCGAAGGTCATGATCTTGATGACCATATCAGTGGCGATTCTCAACCACTGCCTGAGCTAATCCAGGTAAGTGAAAATGCGACGACGGTCAGTAAGCCTAACCCTGCCTATAAAGTTTGGAAAAAGCAAGATAAATTAGTGTCCTCGTGGATTGTTGGGTCTATGTCTGAATCCATCCTCGAGCAAGTCCTTCACTATGTTGTAGCCCTGTTATTGACTCATGAGAGTAGAATAGAGAGTAAGTCAGTGATTAATTCTGATAATGTTTTACCTTCTGCGAACCTAGCGGTTCAAAATGTGAGTCAAAACTCAGTTCCAAATCCTTCCCCTAACTCTCAACAGCAAAATTTTGGTAATGGTAGAGGTAGGAGTCGGTCTAATTTTGGTCAAAACAGAGGAGGAAGGTCCTGGAACAATCGTAATAGGCCTCAGTGTCAATTGTGTAATAAAATTGGCCATACTGCTATGAAGTGCTACTCTCGAGTTCAGATGCCAGGAGCCTATGCAACTCAATTTAATCCTCCTGGGCAAATGAATCCCTCAGGCCTGAATTTTAGTCCACAACAATTTAATGGTTTGGCTATTTCTCATCTTGGATATGCTTCTTTTACTTCTTCAAATAATCATATGTTTCATTTAAATAACCTTTTACATGTTCCCTCCATTACAAAAAATCTTATCAGTGTCAGTCAATTCGCTAAGGATAATGCTGTTTTCTTTGAATTTCATCCAACTTTCTGTGTTGTGAAGGATCTAGCAACTGGACGGGCACTCCTTCGAGGGACTCTACATGAAGGACTATATAGGTTCAACCTGCCGCAGCCTTTGCCATCGTTAAACACCTCTGTTGTTCGACCGGAAACTAATACTGCTGTTTTTTCTACTTTACCTTGTGTTTCCAATGATGTGTCTGCTTTATATTCCTCTGTTCAGTCTTCAAATAATTTGTCCATAGATGTTTGGCATCAACGTCTTGGACATCCATCTATTTCTATTGTTAAGCAAGTTGTTCGTTCCTGTAATCCAAAAGTTTCTACTAATGCCATTATGTCATTTTGTCATGCATGTGCAATAGGCAAACATCATGCCATGCCTTTCTCTCCCTCTACTACATCTTACTCTGCTCCTTTGCAACTTATAGTTACTGATTTATGGGGTCCAGCTTATAAACTGTCTACCCATGGATTTCAGTATTACATTAGCTTTGTGGATGCTTTTTCGAGATATACATGGATTTATTTTCTTCAAACCAAGTCTGAAGCATTTCAGGCTTTTCTCAAATTTAAAACTCATGTGGAAAAACAGTTTGGAACTCCTATTGTTTCCTTCAAACTGATGGAGAATGGCATTGTAGAACGTAAACATCGTCACATTGTTGATGTTGGTCTCACCTTGTTATCACATTCTTCTATACCTCTAACATTCTGGGATGATGCTTTTTCCACCAGTGTTTATCTTATCAACAGGTTACCCTCTATAGTTCTTGGTGGCATGAGTCCCTTGGAGAAGCTCTTTCGGAAGCAACCAGATTATTCCACACTTAAAGTCTTTGGTTGTAAGTGTTTTCCTTGCCTTCGCCCATATAATTCTCATAAGTTGAGTTTTCGGTCGAGTCCCTGTACATTCATTGGTTATAGTCATATTCATAAAGGCTATAAATGTTTGTCCTCTGACGGTAGACTTTATATCTCTAGACATGTATTGTTTGATGAAAATTCTTTTCCATTTGCTTCTCTTACTTCTCATTCTTCTGTTTCTCCCAATTGTGTAACTCAAAGTTTACCTACATTGTCTTCTGTTTCTTCCTCTACTACAGTTGAGTCTTCTTCTGATGCACACTTAAGTATCAGTGAGACATCCTCTATTCCATCAACTGCTGATCATCCTACTAATAATAGTCCTTCACCCTGTTTCCTGAACCGTACTCACCACATGATTACACGAAGTAAAAGAGGCATATTCAAACCTAAGGCTTTTCTTGCTACCTTTGTTGATGTTGAACCGCCTAATGTTAAAGAGACCCTTAAATGTTCTCATTGGAAACAAGCAATGCAAGATGAGTATGATGCTCTTAACGCAATTTTGGACCACCCCGATACACAAGGAGCTGACGAGGACAACCGGGGAGAAATCGGGCTGAAAGATGGACTAAGAAGGCAAAACCGGCAAATGGGACGGGCCAAGACCGAAGGGGTCGGGTTTTCGGCCCGACCCCCTGCTCGGCCCGCTCGTGCGGGCCGAGTCCGTTTGGTCCCGTCTGGTCTCCACCGCCTCTGGATGCCCCGGCCACGTCTTCCCCCCTCAACTACAAATTTACCGTTGGTGGCACGTGAAGGGTATGGAAAGAAAGACCAAGACGTAAACATAGAGAATTCGGATGGTGACCGCCACCAGCGGAGGTCACGTGACGAGGATAGCATCCGGGGGTCACCGAGACAAGCAGGCCGAGGCCGAGGCCGAGGCCGAGCAGAGGATGCCGACACCAAGATTGCCGCCCTTGAGGATGAGGTGAAGGGAATGAATCGGAGTTTATCCAAAATACTCCAAATCCTGGATAAACCCGGCCCTAGCACCAAAGTCCATGAGGGGAGCTTGATTAGAGACCCGAGGAAGGGGAAGGAGCCTATGGAGCACACTGCAGAGTCAGGGACGAGGTCAAGAGGAAAGAAGACTGACAGCATGACCAGCAAGGTCAGGGGGCTCAAACCCACTGATCACACGATTTTGAGGAGCCCAGAGTCAAGCACACTTAAAGGGCGTCACTACACAGTTTCTACCCCAAGCTTCGGTCATACTAAGACAGACCTGAGGAATCTGATCGTTGAGAAGCGCAGAAGTGCCAAAACTGCCGAGTCCGAAGCCAAAGCCGCCGAAGCTGAAGCCCGGGCTGCCGAGGCCGAGGCCAGAGCAGCCGAGGCCGAGGCTAGGTTGGCCGAGGCCGAGGCCAAGAAAGACGACCTCCCTTGGAAGACCGAGCTTCTTAATGCACTAAAGGAGCTCGGAAATCCTCAGGGAGACCAGCAGAGGTCAAAGAACTTTGGAGATCAAAACTTGGAGGAACTAGCCGACCAAGTCGATCCGCCCTTCACAGAAGAAGTCATGAAAGCCGAGGTGCCTCAAAAGTTTAAGGTACCCACGTTCAAACAGTACGATGGCAAGAAAGACCCCGTGCAACATCTAAATGCATACAGAAGTTGGATGGACTTCCACGGCGTCTCAGATGCAATCAGAGTTAGCCCAAGCATTCCTTGCACAGTTCATGGGGCTAGAGAGCAGCGCAAGCCTCACATCAACCTCTTGACGGTCAAACAACAACCAGGTGAGAGCTTGCGTGATTACATAACTCGTTTTAATGATGAGGCACTACAGGTTGAGGGGTACAGCGAGGGAGCAGCCCTGGTAGCCATAACAGCCGGTCTGGAAGACGAAAGACTGCTCAATTCAATAGGTAAGAGCCAGCCTCGAACCTATGCGGAGTTCGTCTCCCGGGCACAGAAGTATATGAGCGCAGAGGAGTTACTGAAGTCAAAAAGGTCAGAACGAGAATACAAGAGGTTTTCTTCATCTAGCTATGACAGTAAAAAGGACAAAAGGCAGCGGACCGACGAAGGAGGCCGGGGCCGAGCAGACCATGGCCGAGGCCGACCAGACAATGGCCGAGGCCGACCAGACCAAGGAGCACCTCCTTTCGGTAAGTTTGAGAAATACACCCCAACTGCTGTTCCGCAGGAGCAAGTACTGATGGAGATCCGAAATACGGGCCTCCTGAAATTCCCAGGGAGGATGAAGTCGAGTGCCGATAGAAGAGACAAGAGCCAGTATTGTCTTTTCCACCGGGACCACGGGCATTCAACCAGGAATTGTATTCAGTTGAAGGATGAAATCGAAGCACTGATCCAGAATGGGTATTTGAAAGAGTTCGTCGGTGAGCCAAAGGCCGAGGCCGACCACGGATGGCCGAGGCCGAGCCTTACCAAAGATGGCCGAGACAAAGAAGAACCCCTACGGGAGATCAGAACCATCTTTGGAGGACCAGCAGGAGGAGGTTCGAGCAGGAAGAGGAAAGCTATGGTCAGGGAAGCAAGGTCCGAACCAGAATATCGAGGTATGTACTCTGTCCATCTATCAAAGGCACACCCCCCTTTGGAGTTCACTGAGGCTGAGGCAGCGAGCATTCATCAGCCACATAATGATGCTCTGGTGGTCACTCTAATCGTAGCCAATGTGAAAATCCATCGGATCCTAATTGATGGGGGAAGCTCGGCTGATGTCCTTTCTTAA

Protein sequence

METSENNSEISSGSQNGQIVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSESILEQVLHYVVALLLTHESRIESKSVINSDNVLPSANLAVQNVSQNSVPNPSPNSQQQNFGNGRGRSRSNFGQNRGGRSWNNRNRPQCQLCNKIGHTAMKCYSRVQMPGAYATQFNPPGQMNPSGLNFSPQQFNGLAISHLGYASFTSSNNHMFHLNNLLHVPSITKNLISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNTAVFSTLPCVSNDVSALYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAFSRYTWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIVSFKLMENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTSVYLINRLPSIVLGGMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFIGYSHIHKGYKCLSSDGRLYISRHVLFDENSFPFASLTSHSSVSPNCVTQSLPTLSSVSSSTTVESSSDAHLSISETSSIPSTADHPTNNSPSPCFLNRTHHMITRSKRGIFKPKAFLATFVDVEPPNVKETLKCSHWKQAMQDEYDALNAILDHPDTQGADEDNRGEIGLKDGLRRQNRQMGRAKTEGVGFSARPPARPARAGRVRLVPSGLHRLWMPRPRLPPSTTNLPLVAREGYGKKDQDVNIENSDGDRHQRRSRDEDSIRGSPRQAGRGRGRGRAEDADTKIAALEDEVKGMNRSLSKILQILDKPGPSTKVHEGSLIRDPRKGKEPMEHTAESGTRSRGKKTDSMTSKVRGLKPTDHTILRSPESSTLKGRHYTVSTPSFGHTKTDLRNLIVEKRRSAKTAESEAKAAEAEARAAEAEARAAEAEARLAEAEAKKDDLPWKTELLNALKELGNPQGDQQRSKNFGDQNLEELADQVDPPFTEEVMKAEVPQKFKVPTFKQYDGKKDPVQHLNAYRSWMDFHGVSDAIRVSPSIPCTVHGAREQRKPHINLLTVKQQPGESLRDYITRFNDEALQVEGYSEGAALVAITAGLEDERLLNSIGKSQPRTYAEFVSRAQKYMSAEELLKSKRSEREYKRFSSSSYDSKKDKRQRTDEGGRGRADHGRGRPDNGRGRPDQGAPPFGKFEKYTPTAVPQEQVLMEIRNTGLLKFPGRMKSSADRRDKSQYCLFHRDHGHSTRNCIQLKDEIEALIQNGYLKEFVGEPKAEADHGWPRPSLTKDGRDKEEPLREIRTIFGGPAGGGSSRKRKAMVREARSEPEYRGMYSVHLSKAHPPLEFTEAEAASIHQPHNDALVVTLIVANVKIHRILIDGGSSADVLS
Homology
BLAST of Lag0005116 vs. NCBI nr
Match: TYK10642.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 550.8 bits (1418), Expect = 3.5e-152
Identity = 353/851 (41.48%), Postives = 458/851 (53.82%), Query Frame = 0

Query: 7   NSEISSGSQNGQIVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPEL 66
           N+E S  S   QI   GNKIS VKL D+ FL+WKFQILTALE +DL++ +  +S+P  + 
Sbjct: 12  NTEAS--SPINQIFGSGNKISLVKLNDDTFLLWKFQILTALEAYDLENFLESESEPPSKY 71

Query: 67  I--QVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSESILEQVLH------------- 126
           +    S +A+    PNPAYKVWK+QD+L+SSW++GSMSE IL Q+LH             
Sbjct: 72  LISTESSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQG 131

Query: 127 ------------------------------------------------------------ 186
                                                                       
Sbjct: 132 IFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAG 191

Query: 187 ----------------------YVVALLLTHESRIESKSVINSDNVLPSANLAVQNVSQN 246
                                  V++LLLT ES+ ESK +  S+  LPS N+  Q   + 
Sbjct: 192 LGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLI--SETALPSVNIVTQTTEKG 251

Query: 247 SV------PNPSPNSQQQNFGNGRGRSRSNFGQNRGGRSWNNRNRPQCQLCNKIGHTAMK 306
           +        N   N+   N   GRG  RSN G+        NRN+PQCQ+C K+G++A +
Sbjct: 252 AESYIRTNQNNYHNNHSYNQRGGRGNGRSNRGRR------GNRNKPQCQICAKLGYSADR 311

Query: 307 CYSRVQMPGAYATQFNPPG-QMNPSGLNFSPQQF-------------------------- 366
           C+ R   P + ++ ++P     + + +N  PQ                            
Sbjct: 312 CFFR-YTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTH 371

Query: 367 ----------------------NGLAISHLGYASFTSSN--NHMFHLNNLLHVPSITKNL 426
                                 +GL I+H G  SF SS      F LNNLL VPSITKNL
Sbjct: 372 SLSNLSIGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNL 431

Query: 427 ISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRP 486
           ISVSQFAKDN VFFEFHPT C VKDL TG+ LL+G L++GLY+F +      L+ S    
Sbjct: 432 ISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHS---- 491

Query: 487 ETNT-AVFSTLPCVSNDVSALYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVS 546
            +NT  VF+T+             V  SN   +D+WH+RLGHP + IVK V+   +    
Sbjct: 492 NSNTKPVFNTV-------------VPKSNTPLLDLWHRRLGHPHLPIVKAVLNHIDNSSG 551

Query: 547 TNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDA 606
           T   ++FC ACA+GKHHA+PFS S T Y+ PLQLI  DLWGPA  +S +GF+YYISFVDA
Sbjct: 552 TINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDA 611

Query: 607 FSRYTWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIVS----------------------- 666
           +SRYTWIYFL +KS+AF AF KFKT VEK  G  I S                       
Sbjct: 612 YSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEH 671

Query: 667 -----FKLMENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTSVYLINRLPSIVLGG 675
                +   +N IVERKHR+I+++GLTLLS +++PL+FWD+AFSTSVYLINRLP+ VL  
Sbjct: 672 RITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDN 731

BLAST of Lag0005116 vs. NCBI nr
Match: KAA0048297.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 549.7 bits (1415), Expect = 7.8e-152
Identity = 353/851 (41.48%), Postives = 458/851 (53.82%), Query Frame = 0

Query: 7   NSEISSGSQNGQIVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPEL 66
           N+E S  S   QI   GNKIS VKL D+ FL+WKFQILTALE +DL++ +  +S+P  + 
Sbjct: 12  NTEAS--SPINQIFGSGNKISLVKLNDDTFLLWKFQILTALEAYDLENFLESESEPPSKY 71

Query: 67  I--QVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSESILEQVLH------------- 126
           +    S +A+    PNPAYKVWK+QD+L+SSW++GSMSE IL Q+LH             
Sbjct: 72  LISTESSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQG 131

Query: 127 ------------------------------------------------------------ 186
                                                                       
Sbjct: 132 IFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAG 191

Query: 187 ----------------------YVVALLLTHESRIESKSVINSDNVLPSANLAVQNVSQN 246
                                  V++LLLT ES+ ESK +  S+  LPS N+  Q   + 
Sbjct: 192 LGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLI--SETALPSVNIVTQTTEKG 251

Query: 247 SV------PNPSPNSQQQNFGNGRGRSRSNFGQNRGGRSWNNRNRPQCQLCNKIGHTAMK 306
           +        N   N+   N   GRG  RSN G+        NRN+PQCQ+C K+G++A +
Sbjct: 252 AESYIRTNQNNYHNNHSYNQRGGRGNGRSNRGRR------GNRNKPQCQICAKLGYSADR 311

Query: 307 CYSRVQMPGAYATQFNPPG-QMNPSGLNFSPQQF-------------------------- 366
           C+ R   P + ++ ++P     + + +N  PQ                            
Sbjct: 312 CFFR-YTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTH 371

Query: 367 ----------------------NGLAISHLGYASFTSSN--NHMFHLNNLLHVPSITKNL 426
                                 +GL I+H G  SF SS      F LNNLL VPSITKNL
Sbjct: 372 SLSNLSIGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNL 431

Query: 427 ISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRP 486
           ISVSQFAKDN VFFEFHPT C VKDL TG+ LL+G L++GLY+F +      L+ S    
Sbjct: 432 ISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHS---- 491

Query: 487 ETNT-AVFSTLPCVSNDVSALYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVS 546
            +NT  VF+T+             V  SN   +D+WH+RLGHP + IVK V+   +    
Sbjct: 492 NSNTKPVFNTV-------------VPKSNTPLLDLWHRRLGHPHLPIVKAVLNHIDNSSG 551

Query: 547 TNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDA 606
           T   ++FC ACA+GKHHA+PFS S T Y+ PLQLI  DLWGPA  +S +GF+YYISFVDA
Sbjct: 552 TINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDA 611

Query: 607 FSRYTWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIVS----------------------- 666
           +SRYTWIYFL +KS+AF AF KFKT VEK  G  I S                       
Sbjct: 612 YSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEH 671

Query: 667 -----FKLMENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTSVYLINRLPSIVLGG 675
                +   +N IVERKHR+I+++GLTLLS +++PL+FWD+AFSTSVYLINRLP+ VL  
Sbjct: 672 RITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDN 731

BLAST of Lag0005116 vs. NCBI nr
Match: RVW44519.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 486.5 bits (1251), Expect = 8.1e-133
Identity = 339/948 (35.76%), Postives = 464/948 (48.95%), Query Frame = 0

Query: 19  IVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATTVSK 78
           +++P +++ T++L D+NFLMWK+QI  A+ G+ L+  + G  Q  P+++    +   V  
Sbjct: 35  VISPLSQLITMRLEDDNFLMWKYQIENAVRGYGLEGFLFGTEQVPPKMV---TDKIGVLV 94

Query: 79  PNPAYKVWKKQDKLVSSWIVGSMSESILEQV----------------------------- 138
           PNP ++ +++QD L+ SW++ S+  + L QV                             
Sbjct: 95  PNPKFRDYQRQDHLLISWLLSSIGSAFLPQVVGCSSAFEDGLTMRDYLTKMKNYCDLLAT 154

Query: 139 -------------------------------------LHYVVALLLTHESRIESKSVINS 198
                                                L YV + L+ HE RI  K   N 
Sbjct: 155 AGHKISDTDHILAIMQGLGDEYESVIAVISSKKSSPSLQYVTSTLIAHEGRIAHKISSND 214

Query: 199 DNVLPSANLAVQNVSQ--NSVPNPSPNSQQQNFGNGRGRSRSNFGQNRG---GRSWNNRN 258
            +V  ++  + +  S   NS   PS   Q +N   G   +R +F  NRG   GR+     
Sbjct: 215 LSVNYTSQYSNRGPSSSWNSNGYPSSGFQNRNQFGGNQVTRGSFVHNRGRGRGRA-QGGI 274

Query: 259 RPQCQLCNKIGHTAMKCYSRVQ------------MPGA---------------------- 318
           +PQCQLCNK GHT  +C+ R               PG                       
Sbjct: 275 KPQCQLCNKFGHTVHRCFYRYDPNFHGNMPANGPTPGVLGSGARNGASGSISSAGNVNLT 334

Query: 319 -YATQFNPP---------------------------------GQMNPSGLNFSPQ----- 378
            Y  Q N                                   G +N SG  ++       
Sbjct: 335 EYDAQENQDYSEMEAMVATPEDLQNCCWFPDSGATNHVTHDLGNLN-SGAEYNGNSKIHM 394

Query: 379 -QFNGLAISHLGYASFTSSN--NHMFHLNNLLHVPSITKNLISVSQFAKDNAVFFEFHPT 438
               GL ISH+G + F SS+  N +  L N+L VP+I KNL+SVSQFA+DN V+FEFHP 
Sbjct: 395 GNGTGLKISHIGLSVFPSSSSPNKVLFLKNILRVPAIKKNLLSVSQFARDNNVYFEFHPK 454

Query: 439 FCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSV-VRPETNTAVFSTLPCVSNDVS 498
            C VKD +    LL+G LH+GLY+FNL + L    + + +  + N         V ND S
Sbjct: 455 VCFVKDKSNHSLLLQGNLHKGLYQFNLSKKLFGKASGLSLSNDKNELTCCNASLVHNDNS 514

Query: 499 ALYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAM 558
                  SS ++  D+WH+RLGHP+  IV QV+       ST +  S C AC +GK H +
Sbjct: 515 DFPEKTNSSFHV-FDLWHKRLGHPASKIVTQVLNDNKIPFSTKSGSSICSACQLGKSHNL 574

Query: 559 PFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAFSRYTWIYFLQTKSEAFQA 618
           PF  S T Y+ PLQL+V+DLWGPA   S++GF YY+SFVDA+SRYTW+YFL+TKS+  +A
Sbjct: 575 PFPISQTVYTKPLQLVVSDLWGPAPINSSYGFTYYVSFVDAYSRYTWVYFLKTKSQTREA 634

Query: 619 FLKFKTHVEKQFGTPIVSFKL----------------------------MENGIVERKHR 678
           FL FK   E QFG  + +F+                              +NGI+ERKHR
Sbjct: 635 FLMFKAQAELQFGCKLKTFQTDWGGEFRSLKTYFEQNGIIHRLSCPHTSKQNGIIERKHR 694

Query: 679 HIVDVGLTLLSHSSIPLTFWDDAFSTSVYLINRLPSIVLGGMSPLEKLFRKQPDYSTLKV 738
           HIV++GLTLL+ +S+PL +W DAFST+V+LINRLP+ VL    P E LF  +P+YS LKV
Sbjct: 695 HIVELGLTLLAQASLPLKYWPDAFSTAVFLINRLPTEVLKQKCPYEFLFNSKPNYSQLKV 754

BLAST of Lag0005116 vs. NCBI nr
Match: RVW80632.1 (Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera])

HSP 1 Score: 484.6 bits (1246), Expect = 3.1e-132
Identity = 346/909 (38.06%), Postives = 449/909 (49.39%), Query Frame = 0

Query: 30  KLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATTVSKPNPAYKVWKKQ 89
           KL + NFL+W+ QILT L GH L  H   ++  LP    +S +  T +  NP ++ W++Q
Sbjct: 22  KLDNHNFLVWRKQILTTLRGHKL-QHFLSETSVLPSEF-LSSDDETQNHVNPKFQDWEQQ 81

Query: 90  DKLVSSWIVGSMSESILEQVLH-------------------------------------- 149
           D+L+ SW++ S+++++L ++++                                      
Sbjct: 82  DQLIMSWLLASITDALLTRMVNCDTSAQVWKTLELYFATQVRAKVTQFKTQLHNTKKGDL 141

Query: 150 -----------------------------------------------------YVV---- 209
                                                                Y V    
Sbjct: 142 SISDYLLKIRNVVDLLALVGHKISVKDHIDAIFEGLPQDYETFIISVNSRLDPYTVEEIE 201

Query: 210 ALLLTHESRIESKSVINSDNVLPS-ANLAVQNVSQNSVPNPSPNSQQQNF------GNGR 269
            LLL  ESRIE K++  +D   PS A+L   N + +   N   +++  NF      GNG 
Sbjct: 202 VLLLAQESRIE-KNIKIADLSTPSLAHLITTNRNGSPHFNYRASTRNSNFRPPTHSGNGM 261

Query: 270 GRSRSNF---GQNRGGR-SWNNRNRPQCQLCNKIGHTAMKCYSRVQMPGAYATQF---NP 329
              R NF   G+ R GR SW   N+PQCQLC +IGH  M+CY R        +Q     P
Sbjct: 262 QHFRGNFTQQGRGRHGRGSWKGNNKPQCQLCGRIGHVVMQCYYRFDQSFTGPSQLQGNRP 321

Query: 330 PGQM----NPSGLNFSP--------------------------------------QQF-- 389
            G M         NF P                                       QF  
Sbjct: 322 QGNMAHLHQQLSENFFPGTSSVKPTTAEIIQDNNWYPDSGATHHLTPNLNNLLTKSQFPS 381

Query: 390 ---------NGLAISHLGYASFTSS--NNHMFHLNNLLHVPSITKNLISVSQFAKDNAVF 449
                     GL I H+G+ SF+SS   +    L  LLHVP ITKNL+SVS+FA DN VF
Sbjct: 382 SDEVFVGNGKGLPIHHIGHTSFSSSFIPSKTLALKQLLHVPEITKNLLSVSKFAADNHVF 441

Query: 450 FEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNTAVFSTLPCV 509
           FEFHPT C VKDL+T   L+ G L  GLY F+        NT +  P  N++ F++    
Sbjct: 442 FEFHPTSCFVKDLSTRTVLMHGQLKGGLYVFD--------NTQLKLPLHNSSCFASTALP 501

Query: 510 SNDVSALYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIG 569
           S +      +V +S+     +WH RLGHPS  IV  V+  CN           C AC +G
Sbjct: 502 SKE-----PTVPASSTSPFTLWHNRLGHPSSHIVSLVLNKCNLPHLNKIPSLICSACCMG 561

Query: 570 KHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAFSRYTWIYFLQTKS 629
           K H  PF  S +SY+ PL+LI TDLWGP    S+HG QYYI F+DA+SR+TWIY L+ KS
Sbjct: 562 KIHKSPFLHSKSSYTKPLELIHTDLWGPISTPSSHGHQYYIHFIDAYSRFTWIYMLKHKS 621

Query: 630 EAFQAFLKFKTHVEKQFGTPIVS----------------------------FKLMENGIV 689
           EAFQ FL FK+ VE Q G  I +                            +   +NG+ 
Sbjct: 622 EAFQVFLHFKSQVELQLGHKIKAVQSDWGGEYRSFTQYLTSNGIIHRISCPYTHEQNGLA 681

Query: 690 ERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTSVYLINRLPSIVLGGMSPLEKLFRKQPDY 738
           ERKHRHIV+ G+ LL+ +S+P  +WD+AF TSVYLINRLP+ VL   SPLE LF ++P Y
Sbjct: 682 ERKHRHIVEHGIALLAQASLPFKYWDEAFRTSVYLINRLPTPVLKNKSPLEVLFHQKPSY 741

BLAST of Lag0005116 vs. NCBI nr
Match: RVW60229.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 475.3 bits (1222), Expect = 1.9e-129
Identity = 339/977 (34.70%), Postives = 464/977 (47.49%), Query Frame = 0

Query: 19   IVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATTVSK 78
            +++P +++ T++L D+NFLMWK+QI  A+ G+ L+  + G  Q  P+++    +   V  
Sbjct: 142  VISPLSQLITMRLEDDNFLMWKYQIENAVRGYGLEGFLFGTEQVPPKMV---TDKIGVLV 201

Query: 79   PNPAYKVWKKQDKLVSSWIVGSMSESILEQV----------------------------- 138
            PNP ++ +++QD L+ SW++ S+  + L QV                             
Sbjct: 202  PNPKFRDYQRQDHLLISWLLSSIGSAFLPQVVGCSSAFEVWNTISQNFNSQSSAKVMFYK 261

Query: 139  ------------------------------------------------------------ 198
                                                                        
Sbjct: 262  SQMQMLKKDGLTMRDYLTKMKNYCDLLATAGHKISDTDHILAIMQGLGDEYESVIAVISS 321

Query: 199  ------LHYVVALLLTHESRIESKSVINSDNVLPSANLAVQNVSQ--NSVPNPSPNSQQQ 258
                  L YV + L+ HE RI  K   N  +V  ++  + +  S   NS   PS   Q +
Sbjct: 322  KKSSPSLQYVTSTLIAHEGRIAHKISSNDLSVNYTSQYSNRGPSSSWNSNGYPSSGFQNR 381

Query: 259  NFGNGRGRSRSNFGQNRG---GRSWNNRNRPQCQLCNKIGHTAMKCYSRVQ--------- 318
            N   G   +R +F  NRG   GR+     +PQCQLCNK GHT  +C+ R           
Sbjct: 382  NQFGGNQVTRGSFVHNRGRGRGRA-QGGIKPQCQLCNKFGHTVHRCFYRYDPNFHGNMPA 441

Query: 319  ---MPGA-----------------------YATQFNPP---------------------- 378
                PG                        Y  Q N                        
Sbjct: 442  NGPTPGVLGSGARNGASGSISSAGNVNLTEYDAQENQDYSEMEAMVATPEDLQNCCWFPD 501

Query: 379  -----------GQMNPSGLNFSPQ------QFNGLAISHLGYASFTSSN--NHMFHLNNL 438
                       G +N SG  ++           GL ISH+G + F SS+  N +  L N+
Sbjct: 502  SGATNHVTHDLGNLN-SGAEYNGNSKIHMGNGTGLKISHIGLSVFPSSSSPNKVLFLKNI 561

Query: 439  LHVPSITKNLISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPL 498
            L VP+I KNL+SVSQFA+DN V+FEFHP  C VKD +    LL+G LH+GLY+FNL + L
Sbjct: 562  LRVPAIKKNLLSVSQFARDNNVYFEFHPKVCFVKDKSNHSLLLQGNLHKGLYQFNLSKKL 621

Query: 499  PSLNTSV-VRPETNTAVFSTLPCVSNDVSALYSSVQSSNNLSIDVWHQRLGHPSISIVKQ 558
                + + +  + N         V ND S       SS ++  D+WH+RLGHP+  IV Q
Sbjct: 622  FGKASGLSLSNDKNELTCCNASLVHNDNSDFPEKTNSSFHV-FDLWHKRLGHPASKIVTQ 681

Query: 559  VVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHG 618
            V+       ST +  S C AC +GK H +PF  S T Y+ PLQL+V+DLWGPA   S++G
Sbjct: 682  VLNDNKIPFSTKSGSSICSACQLGKSHNLPFPISQTVYTKPLQLVVSDLWGPAPINSSYG 741

Query: 619  FQYYISFVDAFSRYTWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIVSFKL---------- 678
            F YY+SFVDA+SRYTW+YFL+TKS+  +AFL FK   E QFG  + +F+           
Sbjct: 742  FTYYVSFVDAYSRYTWVYFLKTKSQTREAFLMFKAQAELQFGCKLKTFQTDWGGEFRSLK 801

Query: 679  ------------------MENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTSVYLI 738
                               +NGI+ERKHRHIV++GLTLL+ +S+PL +W DAFST+V+LI
Sbjct: 802  TYFEQNGIIHRLSCPHTSKQNGIIERKHRHIVELGLTLLAQASLPLKYWPDAFSTAVFLI 861

BLAST of Lag0005116 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 306.2 bits (783), Expect = 2.0e-81
Identity = 264/846 (31.21%), Postives = 379/846 (44.80%), Query Frame = 0

Query: 16  NGQIVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATT 75
           N  I+N  N  +  KLT  N+LMW  Q+    +G++L   + G S P+P     +     
Sbjct: 12  NTNILNV-NMSNVTKLTSTNYLMWSRQVHALFDGYELAGFLDG-STPMP---PATIGTDA 71

Query: 76  VSKPNPAYKVWKKQDKLVSSWIVGSMSESI------------------------------ 135
           V + NP Y  W++QDKL+ S I+G++S S+                              
Sbjct: 72  VPRVNPDYTRWRRQDKLIYSAILGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVT 131

Query: 136 -----------------------LEQVLHYV-----------------VALLLTHESRIE 195
                                  +E+VL  +                  +L   HE  I 
Sbjct: 132 QLRFITRFDQLALLGKPMDHDEQVERVLENLPDDYKPVIDQIAAKDTPPSLTEIHERLIN 191

Query: 196 SKS---VINSDNVLPSANLAVQNVSQNSVPNPSPNSQQQNFGNGRGRSRSNFGQNRGGRS 255
            +S    +NS  V+P     V + + N+  N +     +N+ N   RS S    + G RS
Sbjct: 192 RESKLLALNSAEVVPITANVVTHRNTNTNRNQNNRGDNRNYNNNNNRSNSWQPSSSGSRS 251

Query: 256 WNNRNRP---QCQLCNKIGHTAMKCYSRVQMPG--------------------AYATQFN 315
            N + +P   +CQ+C+  GH+A +C    Q                       A  + +N
Sbjct: 252 DNRQPKPYLGRCQICSVQGHSAKRCPQLHQFQSTTNQQQSTSPFTPWQPRANLAVNSPYN 311

Query: 316 PPGQMNPSG-LNFSPQQFNGLA--------------------ISHLGYASFTSSNNHMFH 375
               +  SG  +     FN L+                    I+H G AS  +S+  +  
Sbjct: 312 ANNWLLDSGATHHITSDFNNLSFHQPYTGGDDVMIADGSTIPITHTGSASLPTSSRSL-D 371

Query: 376 LNNLLHVPSITKNLISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNL 435
           LN +L+VP+I KNLISV +    N V  EF P    VKDL TG  LL+G   + LY +  
Sbjct: 372 LNKVLYVPNIHKNLISVYRLCNTNRVSVEFFPASFQVKDLNTGVPLLQGKTKDELYEW-- 431

Query: 436 PQPLPSLNTSVVRPETNTAVFSTLPCVSNDVSALYSSVQSSNNLSIDVWHQRLGHPSISI 495
                                   P  S+   ++++S  S    S   WH RLGHPS++I
Sbjct: 432 ------------------------PIASSQAVSMFASPCSKATHS--SWHSRLGHPSLAI 491

Query: 496 VKQVVRSCN-PKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKL 555
           +  V+ + + P ++ +  +  C  C I K H +PFS ST + S PL+ I +D+W     L
Sbjct: 492 LNSVISNHSLPVLNPSHKLLSCSDCFINKSHKVPFSNSTITSSKPLEYIYSDVWSSPI-L 551

Query: 556 STHGFQYYISFVDAFSRYTWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIVS--------F 615
           S   ++YY+ FVD F+RYTW+Y L+ KS+    F+ FK+ VE +F T I +        F
Sbjct: 552 SIDNYRYYVIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENRFQTRIGTLYSDNGGEF 611

Query: 616 KLM--------------------ENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTS 675
            ++                     NG+ ERKHRHIV++GLTLLSH+S+P T+W  AFS +
Sbjct: 612 VVLRDYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASVPKTYWPYAFSVA 671

Query: 676 VYLINRLPSIVLGGMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCTFI 683
           VYLINRLP+ +L   SP +KLF + P+Y  LKVFGC C+P LRPYN HKL  +S  C F+
Sbjct: 672 VYLINRLPTPLLQLQSPFQKLFGQPPNYEKLKVFGCACYPWLRPYNRHKLEDKSKQCAFM 731

BLAST of Lag0005116 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 288.9 bits (738), Expect = 3.3e-76
Identity = 252/840 (30.00%), Postives = 364/840 (43.33%), Query Frame = 0

Query: 16  NGQIVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATT 75
           N  I+N  N  +  KLT  N+LMW  Q+    +G++L   + G +   P  I        
Sbjct: 12  NTSILNV-NMSNVTKLTSTNYLMWSRQVHALFDGYELAGFLDGSTTMPPATI----GTDA 71

Query: 76  VSKPNPAYKVWKKQDKLVSSWIVGSMSESILEQVLHYV---------------------- 135
             + NP Y  WK+QDKL+ S ++G++S S+   V                          
Sbjct: 72  APRVNPDYTRWKRQDKLIYSAVLGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVT 131

Query: 136 -----------------------------VALL---LTHESRIES---------KSVI-- 195
                                        +ALL   + H+ ++E          K VI  
Sbjct: 132 QLRTQLKQWTKGTKTIDDYMQGLVTRFDQLALLGKPMDHDEQVERVLENLPEEYKPVIDQ 191

Query: 196 -------------------NSDNVLPSANLAVQNVSQNSVPNPSPNSQQQNFGNGRGRSR 255
                              +   +L  ++  V  ++ N+V + +  +   N    R    
Sbjct: 192 IAAKDTPPTLTEIHERLLNHESKILAVSSATVIPITANAVSHRNTTTTNNNNNGNRNNRY 251

Query: 256 SNFGQNRGGRSW----------NNRNRP---QCQLCNKIGHTAMKCYSRVQMPGAYATQF 315
            N   N   + W          NN+++P   +CQ+C   GH+A +C S++Q   +     
Sbjct: 252 DNRNNNNNSKPWQQSSTNFHPNNNQSKPYLGKCQICGVQGHSAKRC-SQLQHFLSSVNSQ 311

Query: 316 NPPGQMNP----------------------SGLNFSPQQFNGLA---------------- 375
            PP    P                         +     FN L+                
Sbjct: 312 QPPSPFTPWQPRANLALGSPYSSNNWLLDSGATHHITSDFNNLSLHQPYTGGDDVMVADG 371

Query: 376 ----ISHLGYASFTSSNNHMFHLNNLLHVPSITKNLISVSQFAKDNAVFFEFHPTFCVVK 435
               ISH G  S  S+ +   +L+N+L+VP+I KNLISV +    N V  EF P    VK
Sbjct: 372 STIPISHTGSTSL-STKSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVK 431

Query: 436 DLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNTAVFSTLPCVSNDVSALYSSV 495
           DL TG  LL+G   + LY +                          P  S+   +L++S 
Sbjct: 432 DLNTGVPLLQGKTKDELYEW--------------------------PIASSQPVSLFAS- 491

Query: 496 QSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKV--STNAIMSFCHACAIGKHHAMPFSP 555
             S+  +   WH RLGHP+ SI+  V+ + +  V   ++  +S C  C I K + +PFS 
Sbjct: 492 -PSSKATHSSWHARLGHPAPSILNSVISNYSLSVLNPSHKFLS-CSDCLINKSNKVPFSQ 551

Query: 556 STTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAFSRYTWIYFLQTKSEAFQAFLKF 615
           ST + + PL+ I +D+W     LS   ++YY+ FVD F+RYTW+Y L+ KS+  + F+ F
Sbjct: 552 STINSTRPLEYIYSDVWSSPI-LSHDNYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITF 611

Query: 616 KTHVEKQFGTPIVSF---------KLME-------------------NGIVERKHRHIVD 675
           K  +E +F T I +F          L E                   NG+ ERKHRHIV+
Sbjct: 612 KNLLENRFQTRIGTFYSDNGGEFVALWEYFSQHGISHLTSPPHTPEHNGLSERKHRHIVE 671

Query: 676 VGLTLLSHSSIPLTFWDDAFSTSVYLINRLPSIVLGGMSPLEKLFRKQPDYSTLKVFGCK 680
            GLTLLSH+SIP T+W  AF+ +VYLINRLP+ +L   SP +KLF   P+Y  L+VFGC 
Sbjct: 672 TGLTLLSHASIPKTYWPYAFAVAVYLINRLPTPLLQLESPFQKLFGTSPNYDKLRVFGCA 731

BLAST of Lag0005116 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 159.5 bits (402), Expect = 3.0e-37
Identity = 142/531 (26.74%), Postives = 225/531 (42.37%), Query Frame = 0

Query: 259 LNNLLHVPSITKNLISVSQFAKDNAVFFEFHPTFCVVK-DLATGRALLRGTLHEGLYRFN 318
           L ++ HVP +  NLIS     +D    +  +  + + K  L   + + RGTL+       
Sbjct: 350 LKDVRHVPDLRMNLISGIALDRDGYESYFANQKWRLTKGSLVIAKGVARGTLYR------ 409

Query: 319 LPQPLPSLNTSVVRPETNTAVFSTLPCVSNDVSALYSSVQSSNNLSIDVWHQRLGHPSIS 378
                   N  + + E N A                      + +S+D+WH+R+GH S  
Sbjct: 410 -------TNAEICQGELNAA---------------------QDEISVDLWHKRMGHMSEK 469

Query: 379 IVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKL 438
            ++ + +      +    +  C  C  GK H + F  S+      L L+ +D+ GP    
Sbjct: 470 GLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTSSERKLNILDLVYSDVCGPMEIE 529

Query: 439 STHGFQYYISFVDAFSRYTWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIVSFKL------ 498
           S  G +Y+++F+D  SR  W+Y L+TK + FQ F KF   VE++ G  +   +       
Sbjct: 530 SMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVERETGRKLKRLRSDNGGEY 589

Query: 499 ------------------------MENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFS 558
                                     NG+ ER +R IV+   ++L  + +P +FW +A  
Sbjct: 590 TSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQ 649

Query: 559 TSVYLINRLPSIVLGGMSPLEKLFRKQPDYSTLKVFGCKCFPCLRPYNSHKLSFRSSPCT 618
           T+ YLINR PS+ L    P      K+  YS LKVFGC+ F  +      KL  +S PC 
Sbjct: 650 TACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCI 709

Query: 619 FIGYSHIHKGYKCLSSDGRLYI-SRHVLFDENSFPFASLTSH---SSVSPNCVTQSLPTL 678
           FIGY     GY+      +  I SR V+F E+    A+  S    + + PN VT  +P+ 
Sbjct: 710 FIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEVRTAADMSEKVKNGIIPNFVT--IPST 769

Query: 679 SSVSSSTTVESSSDAHLSISETSSIP--------------STADHPTNNSPSPCFLNRTH 738
           S  ++ T+ ES++D    +SE    P                 +HPT        L R+ 
Sbjct: 770 S--NNPTSAESTTD---EVSEQGEQPGEVIEQGEQLDEGVEEVEHPTQGEEQHQPLRRSE 829

BLAST of Lag0005116 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 88.2 bits (217), Expect = 8.5e-16
Identity = 120/597 (20.10%), Postives = 238/597 (39.87%), Query Frame = 0

Query: 90  DKLVSSWIVGSMSESILEQVLHYVVALLLTHESRIESKSVINSDNV---LPSANLAVQNV 149
           D+L+S  +        ++++ H ++ L   ++  I +   ++ +N+        L  Q +
Sbjct: 122 DELISELLAAGAKIEEMDKISHLLITLPSCYDGIITAIETLSEENLTLAFVKNRLLDQEI 181

Query: 150 S-QNSVPNPSPNSQQQNFGNGRGRSRSNFGQNRGGR-----SWNNRNRPQCQLCNKIGHT 209
             +N   + S         N     ++N  +NR  +       N++ + +C  C + GH 
Sbjct: 182 KIKNDHNDTSKKVMNAIVHNNNNTYKNNLFKNRVTKPKKIFKGNSKYKVKCHHCGREGHI 241

Query: 210 AMKCYSRVQMPGAYATQFNPPGQMNPS-GLNFSPQQFNGLAI-SHLGYASFTSSNNHMFH 269
              C+   ++      +     Q   S G+ F  ++ N  ++  + G+   + +++H+ +
Sbjct: 242 KKDCFHYKRILNNKNKENEKQVQTATSHGIAFMVKEVNNTSVMDNCGFVLDSGASDHLIN 301

Query: 270 LNNLLH-----VPSITKNLISVSQF-----------AKDNAVFFEFHPTFCVVKDLATGR 329
             +L       VP +   +    +F             D+ +  E    FC     A G 
Sbjct: 302 DESLYTDSVEVVPPLKIAVAKQGEFIYATKRGIVRLRNDHEITLE-DVLFC---KEAAGN 361

Query: 330 ALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNTAVFSTLPCVSNDVSALYSSVQSSNNL 389
            +    L E        +   +++ + +    N+ + + +P ++        S+ + +  
Sbjct: 362 LMSVKRLQEAGMSIEFDKSGVTISKNGLMVVKNSGMLNNVPVINFQA----YSINAKHKN 421

Query: 390 SIDVWHQRLGHPSISIVKQVVRS---CNPKVSTNAIMS--FCHACAIGKHHAMPFS--PS 449
           +  +WH+R GH S   + ++ R     +  +  N  +S   C  C  GK   +PF     
Sbjct: 422 NFRLWHERFGHISDGKLLEIKRKNMFSDQSLLNNLELSCEICEPCLNGKQARLPFKQLKD 481

Query: 450 TTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAFSRYTWIYFLQTKSEAFQAFLKFK 509
            T    PL ++ +D+ GP   ++     Y++ FVD F+ Y   Y ++ KS+ F  F  F 
Sbjct: 482 KTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFV 541

Query: 510 THVEKQFGTPIV------------------------SFKL------MENGIVERKHRHIV 569
              E  F   +V                        S+ L        NG+ ER  R I 
Sbjct: 542 AKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQLNGVSERMIRTIT 601

Query: 570 DVGLTLLSHSSIPLTFWDDAFSTSVYLINRLPS--IVLGGMSPLEKLFRKQPDYSTLKVF 620
           +   T++S + +  +FW +A  T+ YLINR+PS  +V    +P E    K+P    L+VF
Sbjct: 602 EKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPYLKHLRVF 661

BLAST of Lag0005116 vs. ExPASy Swiss-Prot
Match: Q12491 (Transposon Ty2-B Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY2B-B PE=3 SV=1)

HSP 1 Score: 59.7 bits (143), Expect = 3.2e-07
Identity = 77/358 (21.51%), Postives = 127/358 (35.47%), Query Frame = 0

Query: 227 SGLNFSPQQFNGLAISHLGYASFTSSNNHMFHLNNLLHVPSITKNLISVSQFAKDNAVFF 286
           S +N    Q   + I+ +G   F   N     +   LH P+I  +L+S+S+ A  N    
Sbjct: 478 SEINIVDAQKQDIPINAIGNLHFNFQNGTKTSI-KALHTPNIAYDLLSLSELANQNI--- 537

Query: 287 EFHPTFCVVK---DLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNTAVFSTLP 346
               T C  +   + + G  L     H   Y  +    +PS  + +     N +      
Sbjct: 538 ----TACFTRNTLERSDGTVLAPIVKHGDFYWLSKKYLIPSHISKLTINNVNKS------ 597

Query: 347 CVSNDVSALYSSVQSSNNLSIDVWHQRLGHPSI-SIVKQVVRSCNPKVS------TNAIM 406
                        +S N     + H+ LGH +  SI K + ++    +       +NA  
Sbjct: 598 -------------KSVNKYPYPLIHRMLGHANFRSIQKSLKKNAVTYLKESDIEWSNAST 657

Query: 407 SFCHACAIGK----HHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAF 466
             C  C IGK     H             P Q + TD++GP + L      Y+ISF D  
Sbjct: 658 YQCPDCLIGKSTKHRHVKGSRLKYQESYEPFQYLHTDIFGPVHHLPKSAPSYFISFTDEK 717

Query: 467 SRYTWIYFLQTKSE--AFQAFLKFKTHVEKQFGTPIVSFKL------------------- 526
           +R+ W+Y L  + E      F      ++ QF   ++  ++                   
Sbjct: 718 TRFQWVYPLHDRREESILNVFTSILAFIKNQFNARVLVIQMDRGSEYTNKTLHKFFTNRG 777

Query: 527 -----------MENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTSVYLINRLPS 539
                        +G+ ER +R +++   TLL  S +P   W  A   S  + N L S
Sbjct: 778 ITACYTTTADSRAHGVAERLNRTLLNDCRTLLHCSGLPNHLWFSAVEFSTIIRNSLVS 808

BLAST of Lag0005116 vs. ExPASy TrEMBL
Match: A0A5D3CH97 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold25G00040 PE=4 SV=1)

HSP 1 Score: 550.8 bits (1418), Expect = 1.7e-152
Identity = 353/851 (41.48%), Postives = 458/851 (53.82%), Query Frame = 0

Query: 7   NSEISSGSQNGQIVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPEL 66
           N+E S  S   QI   GNKIS VKL D+ FL+WKFQILTALE +DL++ +  +S+P  + 
Sbjct: 12  NTEAS--SPINQIFGSGNKISLVKLNDDTFLLWKFQILTALEAYDLENFLESESEPPSKY 71

Query: 67  I--QVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSESILEQVLH------------- 126
           +    S +A+    PNPAYKVWK+QD+L+SSW++GSMSE IL Q+LH             
Sbjct: 72  LISTESSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQG 131

Query: 127 ------------------------------------------------------------ 186
                                                                       
Sbjct: 132 IFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAG 191

Query: 187 ----------------------YVVALLLTHESRIESKSVINSDNVLPSANLAVQNVSQN 246
                                  V++LLLT ES+ ESK +  S+  LPS N+  Q   + 
Sbjct: 192 LGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLI--SETALPSVNIVTQTTEKG 251

Query: 247 SV------PNPSPNSQQQNFGNGRGRSRSNFGQNRGGRSWNNRNRPQCQLCNKIGHTAMK 306
           +        N   N+   N   GRG  RSN G+        NRN+PQCQ+C K+G++A +
Sbjct: 252 AESYIRTNQNNYHNNHSYNQRGGRGNGRSNRGRR------GNRNKPQCQICAKLGYSADR 311

Query: 307 CYSRVQMPGAYATQFNPPG-QMNPSGLNFSPQQF-------------------------- 366
           C+ R   P + ++ ++P     + + +N  PQ                            
Sbjct: 312 CFFR-YTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTH 371

Query: 367 ----------------------NGLAISHLGYASFTSSN--NHMFHLNNLLHVPSITKNL 426
                                 +GL I+H G  SF SS      F LNNLL VPSITKNL
Sbjct: 372 SLSNLSIGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNL 431

Query: 427 ISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRP 486
           ISVSQFAKDN VFFEFHPT C VKDL TG+ LL+G L++GLY+F +      L+ S    
Sbjct: 432 ISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHS---- 491

Query: 487 ETNT-AVFSTLPCVSNDVSALYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVS 546
            +NT  VF+T+             V  SN   +D+WH+RLGHP + IVK V+   +    
Sbjct: 492 NSNTKPVFNTV-------------VPKSNTPLLDLWHRRLGHPHLPIVKAVLNHIDNSSG 551

Query: 547 TNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDA 606
           T   ++FC ACA+GKHHA+PFS S T Y+ PLQLI  DLWGPA  +S +GF+YYISFVDA
Sbjct: 552 TINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDA 611

Query: 607 FSRYTWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIVS----------------------- 666
           +SRYTWIYFL +KS+AF AF KFKT VEK  G  I S                       
Sbjct: 612 YSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEH 671

Query: 667 -----FKLMENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTSVYLINRLPSIVLGG 675
                +   +N IVERKHR+I+++GLTLLS +++PL+FWD+AFSTSVYLINRLP+ VL  
Sbjct: 672 RITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDN 731

BLAST of Lag0005116 vs. ExPASy TrEMBL
Match: A0A5A7U233 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold264G00060 PE=4 SV=1)

HSP 1 Score: 549.7 bits (1415), Expect = 3.8e-152
Identity = 353/851 (41.48%), Postives = 458/851 (53.82%), Query Frame = 0

Query: 7   NSEISSGSQNGQIVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPEL 66
           N+E S  S   QI   GNKIS VKL D+ FL+WKFQILTALE +DL++ +  +S+P  + 
Sbjct: 12  NTEAS--SPINQIFGSGNKISLVKLNDDTFLLWKFQILTALEAYDLENFLESESEPPSKY 71

Query: 67  I--QVSENATTVSKPNPAYKVWKKQDKLVSSWIVGSMSESILEQVLH------------- 126
           +    S +A+    PNPAYKVWK+QD+L+SSW++GSMSE IL Q+LH             
Sbjct: 72  LISTESSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKEIWETLQG 131

Query: 127 ------------------------------------------------------------ 186
                                                                       
Sbjct: 132 IFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSDDHILYILAG 191

Query: 187 ----------------------YVVALLLTHESRIESKSVINSDNVLPSANLAVQNVSQN 246
                                  V++LLLT ES+ ESK +  S+  LPS N+  Q   + 
Sbjct: 192 LGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLI--SETALPSVNIVTQTTEKG 251

Query: 247 SV------PNPSPNSQQQNFGNGRGRSRSNFGQNRGGRSWNNRNRPQCQLCNKIGHTAMK 306
           +        N   N+   N   GRG  RSN G+        NRN+PQCQ+C K+G++A +
Sbjct: 252 AESYIRTNQNNYHNNHSYNQRGGRGNGRSNRGRR------GNRNKPQCQICAKLGYSADR 311

Query: 307 CYSRVQMPGAYATQFNPPG-QMNPSGLNFSPQQF-------------------------- 366
           C+ R   P + ++ ++P     + + +N  PQ                            
Sbjct: 312 CFFR-YTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTH 371

Query: 367 ----------------------NGLAISHLGYASFTSSN--NHMFHLNNLLHVPSITKNL 426
                                 +GL I+H G  SF SS      F LNNLL VPSITKNL
Sbjct: 372 SLSNLSIGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNL 431

Query: 427 ISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRP 486
           ISVSQFAKDN VFFEFHPT C VKDL TG+ LL+G L++GLY+F +      L+ S    
Sbjct: 432 ISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHS---- 491

Query: 487 ETNT-AVFSTLPCVSNDVSALYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVS 546
            +NT  VF+T+             V  SN   +D+WH+RLGHP + IVK V+   +    
Sbjct: 492 NSNTKPVFNTV-------------VPKSNTPLLDLWHRRLGHPHLPIVKAVLNHIDNSSG 551

Query: 547 TNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDA 606
           T   ++FC ACA+GKHHA+PFS S T Y+ PLQLI  DLWGPA  +S +GF+YYISFVDA
Sbjct: 552 TINKLNFCEACALGKHHALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDA 611

Query: 607 FSRYTWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIVS----------------------- 666
           +SRYTWIYFL +KS+AF AF KFKT VEK  G  I S                       
Sbjct: 612 YSRYTWIYFLNSKSDAFLAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEH 671

Query: 667 -----FKLMENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTSVYLINRLPSIVLGG 675
                +   +N IVERKHR+I+++GLTLLS +++PL+FWD+AFSTSVYLINRLP+ VL  
Sbjct: 672 RITCPYTSKQNDIVERKHRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDN 731

BLAST of Lag0005116 vs. ExPASy TrEMBL
Match: A0A438EA49 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_2917 PE=4 SV=1)

HSP 1 Score: 486.5 bits (1251), Expect = 3.9e-133
Identity = 339/948 (35.76%), Postives = 464/948 (48.95%), Query Frame = 0

Query: 19  IVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATTVSK 78
           +++P +++ T++L D+NFLMWK+QI  A+ G+ L+  + G  Q  P+++    +   V  
Sbjct: 35  VISPLSQLITMRLEDDNFLMWKYQIENAVRGYGLEGFLFGTEQVPPKMV---TDKIGVLV 94

Query: 79  PNPAYKVWKKQDKLVSSWIVGSMSESILEQV----------------------------- 138
           PNP ++ +++QD L+ SW++ S+  + L QV                             
Sbjct: 95  PNPKFRDYQRQDHLLISWLLSSIGSAFLPQVVGCSSAFEDGLTMRDYLTKMKNYCDLLAT 154

Query: 139 -------------------------------------LHYVVALLLTHESRIESKSVINS 198
                                                L YV + L+ HE RI  K   N 
Sbjct: 155 AGHKISDTDHILAIMQGLGDEYESVIAVISSKKSSPSLQYVTSTLIAHEGRIAHKISSND 214

Query: 199 DNVLPSANLAVQNVSQ--NSVPNPSPNSQQQNFGNGRGRSRSNFGQNRG---GRSWNNRN 258
            +V  ++  + +  S   NS   PS   Q +N   G   +R +F  NRG   GR+     
Sbjct: 215 LSVNYTSQYSNRGPSSSWNSNGYPSSGFQNRNQFGGNQVTRGSFVHNRGRGRGRA-QGGI 274

Query: 259 RPQCQLCNKIGHTAMKCYSRVQ------------MPGA---------------------- 318
           +PQCQLCNK GHT  +C+ R               PG                       
Sbjct: 275 KPQCQLCNKFGHTVHRCFYRYDPNFHGNMPANGPTPGVLGSGARNGASGSISSAGNVNLT 334

Query: 319 -YATQFNPP---------------------------------GQMNPSGLNFSPQ----- 378
            Y  Q N                                   G +N SG  ++       
Sbjct: 335 EYDAQENQDYSEMEAMVATPEDLQNCCWFPDSGATNHVTHDLGNLN-SGAEYNGNSKIHM 394

Query: 379 -QFNGLAISHLGYASFTSSN--NHMFHLNNLLHVPSITKNLISVSQFAKDNAVFFEFHPT 438
               GL ISH+G + F SS+  N +  L N+L VP+I KNL+SVSQFA+DN V+FEFHP 
Sbjct: 395 GNGTGLKISHIGLSVFPSSSSPNKVLFLKNILRVPAIKKNLLSVSQFARDNNVYFEFHPK 454

Query: 439 FCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSV-VRPETNTAVFSTLPCVSNDVS 498
            C VKD +    LL+G LH+GLY+FNL + L    + + +  + N         V ND S
Sbjct: 455 VCFVKDKSNHSLLLQGNLHKGLYQFNLSKKLFGKASGLSLSNDKNELTCCNASLVHNDNS 514

Query: 499 ALYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAM 558
                  SS ++  D+WH+RLGHP+  IV QV+       ST +  S C AC +GK H +
Sbjct: 515 DFPEKTNSSFHV-FDLWHKRLGHPASKIVTQVLNDNKIPFSTKSGSSICSACQLGKSHNL 574

Query: 559 PFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAFSRYTWIYFLQTKSEAFQA 618
           PF  S T Y+ PLQL+V+DLWGPA   S++GF YY+SFVDA+SRYTW+YFL+TKS+  +A
Sbjct: 575 PFPISQTVYTKPLQLVVSDLWGPAPINSSYGFTYYVSFVDAYSRYTWVYFLKTKSQTREA 634

Query: 619 FLKFKTHVEKQFGTPIVSFKL----------------------------MENGIVERKHR 678
           FL FK   E QFG  + +F+                              +NGI+ERKHR
Sbjct: 635 FLMFKAQAELQFGCKLKTFQTDWGGEFRSLKTYFEQNGIIHRLSCPHTSKQNGIIERKHR 694

Query: 679 HIVDVGLTLLSHSSIPLTFWDDAFSTSVYLINRLPSIVLGGMSPLEKLFRKQPDYSTLKV 738
           HIV++GLTLL+ +S+PL +W DAFST+V+LINRLP+ VL    P E LF  +P+YS LKV
Sbjct: 695 HIVELGLTLLAQASLPLKYWPDAFSTAVFLINRLPTEVLKQKCPYEFLFNSKPNYSQLKV 754

BLAST of Lag0005116 vs. ExPASy TrEMBL
Match: A0A438H844 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Vitis vinifera OX=29760 GN=RE1_3152 PE=4 SV=1)

HSP 1 Score: 484.6 bits (1246), Expect = 1.5e-132
Identity = 346/909 (38.06%), Postives = 449/909 (49.39%), Query Frame = 0

Query: 30  KLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATTVSKPNPAYKVWKKQ 89
           KL + NFL+W+ QILT L GH L  H   ++  LP    +S +  T +  NP ++ W++Q
Sbjct: 22  KLDNHNFLVWRKQILTTLRGHKL-QHFLSETSVLPSEF-LSSDDETQNHVNPKFQDWEQQ 81

Query: 90  DKLVSSWIVGSMSESILEQVLH-------------------------------------- 149
           D+L+ SW++ S+++++L ++++                                      
Sbjct: 82  DQLIMSWLLASITDALLTRMVNCDTSAQVWKTLELYFATQVRAKVTQFKTQLHNTKKGDL 141

Query: 150 -----------------------------------------------------YVV---- 209
                                                                Y V    
Sbjct: 142 SISDYLLKIRNVVDLLALVGHKISVKDHIDAIFEGLPQDYETFIISVNSRLDPYTVEEIE 201

Query: 210 ALLLTHESRIESKSVINSDNVLPS-ANLAVQNVSQNSVPNPSPNSQQQNF------GNGR 269
            LLL  ESRIE K++  +D   PS A+L   N + +   N   +++  NF      GNG 
Sbjct: 202 VLLLAQESRIE-KNIKIADLSTPSLAHLITTNRNGSPHFNYRASTRNSNFRPPTHSGNGM 261

Query: 270 GRSRSNF---GQNRGGR-SWNNRNRPQCQLCNKIGHTAMKCYSRVQMPGAYATQF---NP 329
              R NF   G+ R GR SW   N+PQCQLC +IGH  M+CY R        +Q     P
Sbjct: 262 QHFRGNFTQQGRGRHGRGSWKGNNKPQCQLCGRIGHVVMQCYYRFDQSFTGPSQLQGNRP 321

Query: 330 PGQM----NPSGLNFSP--------------------------------------QQF-- 389
            G M         NF P                                       QF  
Sbjct: 322 QGNMAHLHQQLSENFFPGTSSVKPTTAEIIQDNNWYPDSGATHHLTPNLNNLLTKSQFPS 381

Query: 390 ---------NGLAISHLGYASFTSS--NNHMFHLNNLLHVPSITKNLISVSQFAKDNAVF 449
                     GL I H+G+ SF+SS   +    L  LLHVP ITKNL+SVS+FA DN VF
Sbjct: 382 SDEVFVGNGKGLPIHHIGHTSFSSSFIPSKTLALKQLLHVPEITKNLLSVSKFAADNHVF 441

Query: 450 FEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNTAVFSTLPCV 509
           FEFHPT C VKDL+T   L+ G L  GLY F+        NT +  P  N++ F++    
Sbjct: 442 FEFHPTSCFVKDLSTRTVLMHGQLKGGLYVFD--------NTQLKLPLHNSSCFASTALP 501

Query: 510 SNDVSALYSSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIG 569
           S +      +V +S+     +WH RLGHPS  IV  V+  CN           C AC +G
Sbjct: 502 SKE-----PTVPASSTSPFTLWHNRLGHPSSHIVSLVLNKCNLPHLNKIPSLICSACCMG 561

Query: 570 KHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHGFQYYISFVDAFSRYTWIYFLQTKS 629
           K H  PF  S +SY+ PL+LI TDLWGP    S+HG QYYI F+DA+SR+TWIY L+ KS
Sbjct: 562 KIHKSPFLHSKSSYTKPLELIHTDLWGPISTPSSHGHQYYIHFIDAYSRFTWIYMLKHKS 621

Query: 630 EAFQAFLKFKTHVEKQFGTPIVS----------------------------FKLMENGIV 689
           EAFQ FL FK+ VE Q G  I +                            +   +NG+ 
Sbjct: 622 EAFQVFLHFKSQVELQLGHKIKAVQSDWGGEYRSFTQYLTSNGIIHRISCPYTHEQNGLA 681

Query: 690 ERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTSVYLINRLPSIVLGGMSPLEKLFRKQPDY 738
           ERKHRHIV+ G+ LL+ +S+P  +WD+AF TSVYLINRLP+ VL   SPLE LF ++P Y
Sbjct: 682 ERKHRHIVEHGIALLAQASLPFKYWDEAFRTSVYLINRLPTPVLKNKSPLEVLFHQKPSY 741

BLAST of Lag0005116 vs. ExPASy TrEMBL
Match: A0A438FJP6 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_1134 PE=4 SV=1)

HSP 1 Score: 475.3 bits (1222), Expect = 9.1e-130
Identity = 339/977 (34.70%), Postives = 464/977 (47.49%), Query Frame = 0

Query: 19   IVNPGNKISTVKLTDENFLMWKFQILTALEGHDLDDHISGDSQPLPELIQVSENATTVSK 78
            +++P +++ T++L D+NFLMWK+QI  A+ G+ L+  + G  Q  P+++    +   V  
Sbjct: 142  VISPLSQLITMRLEDDNFLMWKYQIENAVRGYGLEGFLFGTEQVPPKMV---TDKIGVLV 201

Query: 79   PNPAYKVWKKQDKLVSSWIVGSMSESILEQV----------------------------- 138
            PNP ++ +++QD L+ SW++ S+  + L QV                             
Sbjct: 202  PNPKFRDYQRQDHLLISWLLSSIGSAFLPQVVGCSSAFEVWNTISQNFNSQSSAKVMFYK 261

Query: 139  ------------------------------------------------------------ 198
                                                                        
Sbjct: 262  SQMQMLKKDGLTMRDYLTKMKNYCDLLATAGHKISDTDHILAIMQGLGDEYESVIAVISS 321

Query: 199  ------LHYVVALLLTHESRIESKSVINSDNVLPSANLAVQNVSQ--NSVPNPSPNSQQQ 258
                  L YV + L+ HE RI  K   N  +V  ++  + +  S   NS   PS   Q +
Sbjct: 322  KKSSPSLQYVTSTLIAHEGRIAHKISSNDLSVNYTSQYSNRGPSSSWNSNGYPSSGFQNR 381

Query: 259  NFGNGRGRSRSNFGQNRG---GRSWNNRNRPQCQLCNKIGHTAMKCYSRVQ--------- 318
            N   G   +R +F  NRG   GR+     +PQCQLCNK GHT  +C+ R           
Sbjct: 382  NQFGGNQVTRGSFVHNRGRGRGRA-QGGIKPQCQLCNKFGHTVHRCFYRYDPNFHGNMPA 441

Query: 319  ---MPGA-----------------------YATQFNPP---------------------- 378
                PG                        Y  Q N                        
Sbjct: 442  NGPTPGVLGSGARNGASGSISSAGNVNLTEYDAQENQDYSEMEAMVATPEDLQNCCWFPD 501

Query: 379  -----------GQMNPSGLNFSPQ------QFNGLAISHLGYASFTSSN--NHMFHLNNL 438
                       G +N SG  ++           GL ISH+G + F SS+  N +  L N+
Sbjct: 502  SGATNHVTHDLGNLN-SGAEYNGNSKIHMGNGTGLKISHIGLSVFPSSSSPNKVLFLKNI 561

Query: 439  LHVPSITKNLISVSQFAKDNAVFFEFHPTFCVVKDLATGRALLRGTLHEGLYRFNLPQPL 498
            L VP+I KNL+SVSQFA+DN V+FEFHP  C VKD +    LL+G LH+GLY+FNL + L
Sbjct: 562  LRVPAIKKNLLSVSQFARDNNVYFEFHPKVCFVKDKSNHSLLLQGNLHKGLYQFNLSKKL 621

Query: 499  PSLNTSV-VRPETNTAVFSTLPCVSNDVSALYSSVQSSNNLSIDVWHQRLGHPSISIVKQ 558
                + + +  + N         V ND S       SS ++  D+WH+RLGHP+  IV Q
Sbjct: 622  FGKASGLSLSNDKNELTCCNASLVHNDNSDFPEKTNSSFHV-FDLWHKRLGHPASKIVTQ 681

Query: 559  VVRSCNPKVSTNAIMSFCHACAIGKHHAMPFSPSTTSYSAPLQLIVTDLWGPAYKLSTHG 618
            V+       ST +  S C AC +GK H +PF  S T Y+ PLQL+V+DLWGPA   S++G
Sbjct: 682  VLNDNKIPFSTKSGSSICSACQLGKSHNLPFPISQTVYTKPLQLVVSDLWGPAPINSSYG 741

Query: 619  FQYYISFVDAFSRYTWIYFLQTKSEAFQAFLKFKTHVEKQFGTPIVSFKL---------- 678
            F YY+SFVDA+SRYTW+YFL+TKS+  +AFL FK   E QFG  + +F+           
Sbjct: 742  FTYYVSFVDAYSRYTWVYFLKTKSQTREAFLMFKAQAELQFGCKLKTFQTDWGGEFRSLK 801

Query: 679  ------------------MENGIVERKHRHIVDVGLTLLSHSSIPLTFWDDAFSTSVYLI 738
                               +NGI+ERKHRHIV++GLTLL+ +S+PL +W DAFST+V+LI
Sbjct: 802  TYFEQNGIIHRLSCPHTSKQNGIIERKHRHIVELGLTLLAQASLPLKYWPDAFSTAVFLI 861

BLAST of Lag0005116 vs. TAIR 10
Match: ATMG00300.1 (Gag-Pol-related retrotransposon family protein )

HSP 1 Score: 47.4 bits (111), Expect = 1.2e-04
Identity = 42/146 (28.77%), Postives = 60/146 (41.10%), Query Frame = 0

Query: 294 VVKDLATGRALLRGTLHEGLYRFNLPQPLPSLNTSVVRPETNTAVFSTLPCVSNDVSALY 353
           V+K L   R +L+G  H+ LY          L  SV   E+N A        + D + L 
Sbjct: 28  VLKVLKGCRTILKGNRHDSLY---------ILQGSVETGESNLAE------TAKDETRL- 87

Query: 354 SSVQSSNNLSIDVWHQRLGHPSISIVKQVVRSCNPKVSTNAIMSFCHACAIGKHHAMPFS 413
                        WH RL H S   ++ +V+      S  + + FC  C  GK H + FS
Sbjct: 88  -------------WHSRLAHMSQRGMELLVKKGFLDSSKVSSLKFCEDCIYGKTHRVNFS 144

Query: 414 PSTTSYSAPLQLIVTDLWG-PAYKLS 439
               +   PL  + +DLWG P+  LS
Sbjct: 148 TGQHTTKNPLDYVHSDLWGAPSVPLS 144

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
TYK10642.13.5e-15241.48Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m... [more]
KAA0048297.17.8e-15241.48Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m... [more]
RVW44519.18.1e-13335.76Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera][more]
RVW80632.13.1e-13238.06Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera][more]
RVW60229.11.9e-12934.70Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera][more]
Match NameE-valueIdentityDescription
Q9ZT942.0e-8131.21Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
Q94HW23.3e-7630.00Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
P109783.0e-3726.74Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
P041468.5e-1620.10Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
Q124913.2e-0721.51Transposon Ty2-B Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
Match NameE-valueIdentityDescription
A0A5D3CH971.7e-15241.48Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var.... [more]
A0A5A7U2333.8e-15241.48Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var.... [more]
A0A438EA493.9e-13335.76Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
A0A438H8441.5e-13238.06Retrovirus-related Pol polyprotein from transposon RE1 OS=Vitis vinifera OX=2976... [more]
A0A438FJP69.1e-13034.70Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
Match NameE-valueIdentityDescription
ATMG00300.11.2e-0428.77Gag-Pol-related retrotransposon family protein [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 869..889
NoneNo IPR availableCOILSCoilCoilcoord: 986..1029
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 147..188
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1225..1266
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 909..942
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 743..793
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..23
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 825..873
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 657..681
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 899..946
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 806..873
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 745..771
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1225..1285
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1360..1380
NoneNo IPR availablePANTHERPTHR34222:SF48SUBFAMILY NOT NAMEDcoord: 110..250
coord: 28..110
NoneNo IPR availablePANTHERPTHR34222FAMILY NOT NAMEDcoord: 110..250
coord: 28..110
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 414..489
e-value: 3.7E-9
score: 38.1
coord: 492..562
e-value: 5.3E-6
score: 27.8
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 354..407
e-value: 1.0E-11
score: 44.5
IPR036875Zinc finger, CCHC-type superfamilySUPERFAMILY57756Retrovirus zinc finger-like domainscoord: 172..212
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 420..563

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lag0005116.1Lag0005116.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding