Lag0038334 (gene) Sponge gourd (AG‐4) v1

Overview
Name: Lag0038334
Type: gene
Organism: Luffa acutangula (Sponge gourd (AG‐4) v1)
Description: Reverse transcriptase domain-containing protein
Location: chr2: 15576665 .. 15582328 (-)
RNA-Seq Expression: Lag0038334
Synteny: Lag0038334
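The Location field packs the chromosome, start, end, and strand into one string. The sketch below shows one way to split it and compute the span it covers. It assumes the coordinates are 1-based and inclusive, which is common for genome browsers but is not stated on this page. The function names `parse_location` and `span_length` are hypothetical.

```python
import re


def parse_location(text):
    """Split a string such as 'chr2: 15576665 .. 15582328 (-)' into its parts."""
    match = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
    return end - start + 1


seqid, start, end, strand = parse_location("chr2: 15576665 .. 15582328 (-)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the span is 15582328 - 15576665 + 1 = 5664 positions, and the `(-)` marks the feature as lying on the reverse strand.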
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCATCAGCGAAGCGAGCTTTGGGTTTCGAGAACGGTTTTTGTGTTGATAGCAAAGGTAAAAGTGGTGGTTTGGCTCTGTTGTGGGATGCGTCTGTCACCTTCAGCCTTTTGTCATTTTCGAATAACCACATTGATGGGTGGATCACGTGGGATGATTACCATTGGCGTCTCACTGGCTTCTATGGTTTTCCTGCGGCGGATATGCGAGATCAGACGTGGTCCCTTCTCTCTAAGTTAAGGGGTGGTTCTGATACTCCTTGGCTTATAGGAGGGGATTTCAATGCCCTGTTGTATCAGCATGAGAAGGAGGGTGGCAGAGATAAACCTCTTTCAGAGTTGGCGGCCTTTCAGAACGTGATTGACTCCTGTGCACTTCTTGACTTGGGTTTTGTGGGGAATAGGTTCACATGGTGCAACAGGCGGCCTGATGGAACGATCTACGAGCGTTTGGATAGGTGCTTTAGTTCCGCTACGTGGCATGATATCTATCCCAACTGTGTAGTTAATCACCTGGATTATCACCAGTCTGATCATCGTCCAATTGAGTTGGTCCTTTCCCCGCAACCTGGCTGTTGGAGAAACCCGAGTCAGCGAATCACTCGGTTCGATGAGACTTGGCTAAAGCGTGCAGATTTGCAGCAGTTGGTCAGAGACTCATGGGGGTTGAGTAGGGAGGACCCTGGTTTGTCAGCTCCCCAGATTTTGGCTCAGGTGTCCAAGAGATGCATGCGTTCGATGGCTGGTTGGGGTCGCTCAAGAATGGGGAACTTCCCTCAGCGCATCAGTGAGGCCAATCAGAAGGTACAGCTGGCCATTGAGGGGTTGAGAGGGGCTGGGTCCCGTGAACTACTTTCCCAGGCAGAAGCCCAGTTGGAAGATGTATTGCAGGAGGAGGAACTTTACTGGAAGCAGCGATCCAGAGAGGTGTGGTTGAAGGAAGGGGATCAAAATACTCGGTGGTTTCATCGTCAGGCTTCGTATAGACAACGACTCAATCGGATTAGGGGTCTCACAGATGACCAAGGAGAATGGCGCCAGGACAAAACTATGATTCTTCAGTTGGTGAATGATTATTTCCAGCAGCTCTTCTCGACATCAGAGCCGAGTGAACAGGATTTTGATATATCTCTTAGGGACATTCAACGCTCTGTAGATAATGAGATGAATGTGGAGTTGCTGCGCCCTTTTACGGAGAATGAGATTCTTCGGGCTCTGAAGCAGTCTCATCCTCACAAGGCCCCAGGTCCAGATGGGCTATCTGGCAGTTTCTATAAGAACCACTGGTCGATAGTGGGGCCTTCAGTGGTACAGAGTTGCCTGGCTGTTTTAAATCACGGATGTTCCCCGGTTTCAATTAATGATACTATGATTGTTCTCATTCCGAAGATCAAGGTCCCTCGTCGAGTTTCTGACTTTCGGCCCATCTCGCTATGCAATTTTAGCTATAAGTTGATTTCGAAGGCAGTGGTTAATAGGATGAAGCATATCCTTCCAAAACTTATTTCGCCCAACCAAAGTGCCTTTGTCGCTGGAAGGTGTGTGGTGGATAATGCCATCTTGGGGTTCGAATGCATTCATGAGTTAAGGCGACGGACTGGAGGAAAATCTAAATGGGCTGCTCTAAAACTTGACATGAGCAAAGCATATGACAGGATAGAGTGGTCGTTTCTACGGTCAGTTATGGATAGAATGGGTTTTGCTCAACAGTGGACTGATTTGATTCTTCGGTGCGTTAGCTCGGTCTCCTTCTCATTTAACCTGAATGGGGAGCGGTTGGGGAATGTGATCCCTTCCCGTGGGCTCAGGCAGGGAGACCCGCTGTCCCCGTACCTGTTTTTGCTTTGTGCGGAGGGTTTGTCGAGCTTGTTGCGGGGAGCTGAACGTCGAGCTTTGATATCTGGGTTTAGGGTTGCGCGGAGTAGCCCCCCGATTTCTCATCTATTTTTTGCGGATGATAGTCTCCTTTTCTTCAAAGCGAACGTTAATGAAGCAGTGAC
TATCCGGGACCTATTGATCTGTTATGAACGAGCCTCGGGTCAGGTGATTAATTATGAGAAGTCAGTGGTTGCGTTCAGTCCAAACACTGGTGAGGACTCACAACAGTATATCAGTCATGTGCTCTCGGTATCTCGGTGTCCGTGTCATCAACGATATCTTGGGCTCCCCTCATTTATGCCTAAGAATCGGTCGGGAACGTTGATGTTTATTAAGGATCGTGTATGGAAGCAGATCCAGGGTTGGAAGGGAAAGTTTTTTTCCTTGGGTGGTAAGGAAGTCCTTCTAAAATCTATCATTCAGGCCATACCTTGTTACACGATGAATTGCTTTCGTCTGCCTCGTTGCCTGATTAGAGAAATCCATCGGGCCATGGCCAGGTTCTGGTGGAATGAGTCTGAGGAGGGGAAGAGGATCCATTGGGTGAGTTGGGACCACATGTGTCGTCCTAAGTGTATGGGGGGTTTGGGTTTCCGTAATATGGAGCTTTTTAATCAAGCGCTTTTGGCTAAACAGTGCTGGCGAGTAATCCAGGATCCTGAATCCCTTTTGGGTGCCGTTCTGAAGGGTAGGTATTTTCCTCACTCTGAGTTTTGGGAGGCGTCTCTGGGTCATCGGCCTTCGTTCATCTGGCGCAGTCTGTTGTGGGGTCGGGAGCTGCTGGTTCGAGGGTGCAGGTGGAGGATTGGGAATGGTCGATCTATTCCCATATATGGTTCGAATTGGGTGCCGGATAATCCGTCTCTGCGTGTGCAGTCTGCTCCTTCGCTTCCTTTATCAAGTAGGGTCTGTGATTTGTTTTCTCCGTCAGGACAGTGGGACGAGGCTAAGGTGCGTGCCCATTTTTTGGGGCCTGAGTGTGAGGCCATTCTAAGGATTCCCTTGCGCTCTGGACTGCTTGAAGATCGACTTATTTGGCATTTTGAGAAGCATGGTGTGTTCTCTGTGAAGAGTGGGTATAGGTTGGCTTTCTCCTTGGCGTCCCAGGGTGTCCGTCTTCTTCTGAGTCTGAGCCCTGGCGGATTTGGTGGTCTAGTCTATGGAGACTTGGGATCCCGAATAAGCACAAGGTTTTCCTATGGCGTCTCGTCCTGGAACGGCTGCCCACTAAGGTAAATCTCCTTAAAAGAGGAGTTGAGGTGTCGCCTTTGTGTGTATTGTGTAGTTCTGTGGTGGAGGATGGTCTCCATCTTTTCTGGAAGTGCGCCGTGACTAGGGAAATGTGGCTCTGCTCGAAATTTTCTCAGCTATACCAGTCGTTATACCACCTGGATCTTGTTGATGTCATCTGGGCATTGAGGGAGAAGTTGGGCGCATTAGACTTTGAGCTTGTGACGGTGTTCTGGTGGTCAGTTTGGAATTTACGTAACAATTTGTGTTGGAGGGGAGAATCTGATGGTCGAGATTTGTGGTCATGGTCTGAAGAGTATCTGAGGGCGTACTATGATGTTGTCGGGCGGCGGGAGTCTCGTTGTAGTTTGCAGCCTTGTCCCAGGCGGCCGGCCGAGCAGTCTTCATGGACTCCCCCGGTGGGCGGCGGATTTAAGTTGAACACCGATGCCTCGGTTAGGCCTGATACGGGTGAAGCGGGGGGAGGTTGTGTTCTTCGGGATATGTCTGGTGCAGTGCTTTTAGCGGCATGTTTGGACCTGCCCAGGTGCTGGAGTGTGGATCTGGCGGAAGGTTGGGCATTGGTGAAGGGCGTGGAGCTAGCGTTACAGATGGGTTTCTTAAGTTTCTGTGTCGAGGTGGATTCATTAAGGCTGGTTCGAATTCTACATGGGGAGGTGATTGATTCTTCAGAAGTAGGCCTGTTGATGGATGATGTCCGACGTCTTCTCCATCCTTGTGGGAGGGGAAAGGTCCTTTTTACGCCACGGAATGGGAATAGAGTGGCTCATGCTCTAGCCTGTTTGACCTTCTCCTATTCGGGTTGCGTTTGGCTGGAGGAGTGGCCTATGGAAATCGCTGCGGTGCTGGCTGGGGATGTCGCGTTGTGTCA
TGAGGGGGCCCCTGTAGGGGAGAGTGAGCGCTCCATGCCGTCTCCAGTCAGTGTACATCCTCTATGCATGTAGTCGGGGAAGGCTTTCTGTTGCCCGTTACCTTGATTTTGGTTGTGCATGTCTGGGTTACTTTAACTGGTTTCTCTATGTGCTATTGTTTTGCAGGTGGCGGTTGAGTTTGAGCAGGTCGAGAGCGGGAGTGGTCCAAGGAATCAACTGTACTGAGATGCGATCAGACTGAGTAGCGGAAACCTCTACCCAATGCTGGGGTGCAGATAGCAGGGGGGTGTTGTTTCCCTGAATTTTGTGTTTCGTGGCAGTGCATGTTTAGGGTGCTATTGTGGGTGGGGTGTGTTTGTAGTTAAGGAGGGGTTGCATTGTCTGGTATGGATGTGGTTTATTTGTGGGACTTGATGGGTTACCTGGGTTGATTTACTTTGTGATTTCGTCCGGTTTGCATGTTTAGTGGATGCTAGGTTGTGGCGGGGAGCATCTGAGCATGTGTTTTCTCTTATGTGTGCCTTTTGTGCTGTCTGAGTATGTTAGGCTTGTGGGCCTTGTAATTGTGTTGTGGCTGTGGTGATGTGGTGAGGTGTTTAAGCTTGTCAGGGCTTATGTGTTTGTATGTATGCCCTTTGCCTGTGTTAAAATGAGGCGTGAGTAGCGGACTTCGAGTTGTTAAAGCTTCCCTTGAGGGTTCCTTGCATGTGTGAGGGTGGAGGCTCCCTTAGCTGTCTCGAAAGTCTAGGACGGTTTTATGGGGTCCATAAGTCTGTTGTCACTCCGATAGGATCAAGGCTGTGTGTGGAATATATCTCTTGCTTTCTAGTGTGTGAAGCCTGCCTAGGGTTGGTGGTAGGGGGTTTATATGTAGGGTTTTAGGGGTAACAAGAGCTACCTCCAGTGTGATCAGTTAGGCCGTCCAAGGATCTCTTTGGGTATGGGTCCCTGCACTGTAGTGTGAAAGCTAGTACCCTGTAAATAAAGCCTAGGTTGGGTGAGGAAAAAAAAAAAAAAAAAAAAAAAAAAAGGTTGAAGGATAAGGAAAAGAGGAAAGTGGGAGGAAATGGGAAAAGGAAAAGAAATGGTGAGGAAAGAAAGGGAGTATGGCCTTTTACCTGGCCCTATGTTAGGGGATAACATCCTAGTTTGGCTGTTTTAGTTTCTATATGGGGCTTTGGGTCCAGTGTAGGGGGAAGATGGGTGGAGGGTTCTATAAATTAGTTCTGGTCTAGTAGGGGTTGCTTTAGGTGAGGCTTTCTCTCCTGCGCGAGTTGTCCTTGGGAAGTTTTGACTTTTCTCTTTTATGTTTGCTTTCAATTGCTTCGCTTAGGTGTAAGGGCTAACATATGTAACTCTTGGCTTTGGGAATGTTAAATGCTAGTACGAGATGCTTGAACAGTTATTACCGGGGTAGTGTTTTAAGTTAAGGAGTAGGGTTGATTTGGAGGATGAGATTGAGTTTGTCTTGCGTTGGCAGTATCTTATCTCTGTTTCGTTTTATGTTAAGATGCTTTGATGGTCATTTTGGTGGCAGGTTCTGCATGGTGAAATAGGTCCGTCCTGGAGTGGTTTAGTCTGGCATCCTGTTAGGATGACTTGGGGCGGAAGCGAGGGAGGAGCTTGGTGGCTAACTGGATCGTGGATGAAGTCATATGGTTGA

mRNA sequence

ATGGCATCAGCGAAGCGAGCTTTGGGTTTCGAGAACGGTTTTTGTGTTGATAGCAAAGGTAAAAGTGGTGGTTTGGCTCTGTTGTGGGATGCGTCTGTCACCTTCAGCCTTTTGTCATTTTCGAATAACCACATTGATGGGTGGATCACGTGGGATGATTACCATTGGCGTCTCACTGGCTTCTATGGTTTTCCTGCGGCGGATATGCGAGATCAGACGTGGTCCCTTCTCTCTAAGTTAAGGGGTGGTTCTGATACTCCTTGGCTTATAGGAGGGGATTTCAATGCCCTGTTGTATCAGCATGAGAAGGAGGGTGGCAGAGATAAACCTCTTTCAGAGTTGGCGGCCTTTCAGAACGTGATTGACTCCTGTGCACTTCTTGACTTGGGTTTTGTGGGGAATAGGTTCACATGGTGCAACAGGCGGCCTGATGGAACGATCTACGAGCGTTTGGATAGGTGCTTTAGTTCCGCTACGTGGCATGATATCTATCCCAACTGTGTAGTTAATCACCTGGATTATCACCAGTCTGATCATCGTCCAATTGAGTTGGTCCTTTCCCCGCAACCTGGCTGTTGGAGAAACCCGAGTCAGCGAATCACTCGGTTCGATGAGACTTGGCTAAAGCGTGCAGATTTGCAGCAGTTGGTCAGAGACTCATGGGGGTTGAGTAGGGAGGACCCTGGTTTGTCAGCTCCCCAGATTTTGGCTCAGGTGTCCAAGAGATGCATGCGTTCGATGGCTGGTTGGGGTCGCTCAAGAATGGGGAACTTCCCTCAGCGCATCAGTGAGGCCAATCAGAAGGTACAGCTGGCCATTGAGGGGTTGAGAGGGGCTGGGTCCCGTGAACTACTTTCCCAGGCAGAAGCCCAGTTGGAAGATGTATTGCAGGAGGAGGAACTTTACTGGAAGCAGCGATCCAGAGAGGTGTGGTTGAAGGAAGGGGATCAAAATACTCGGTGGTTTCATCGTCAGGCTTCGTATAGACAACGACTCAATCGGATTAGGGGTCTCACAGATGACCAAGGAGAATGGCGCCAGGACAAAACTATGATTCTTCAGTTGGTGAATGATTATTTCCAGCAGCTCTTCTCGACATCAGAGCCGAGTGAACAGGATTTTGATATATCTCTTAGGGACATTCAACGCTCTGTAGATAATGAGATGAATGTGGAGTTGCTGCGCCCTTTTACGGAGAATGAGATTCTTCGGGCTCTGAAGCAGTCTCATCCTCACAAGGCCCCAGGTCCAGATGGGCTATCTGGCAGTTTCTATAAGAACCACTGGTCGATAGTGGGGCCTTCAGTGGTACAGAGTTGCCTGGCTGTTTTAAATCACGGATGTTCCCCGGTTTCAATTAATGATACTATGATTGTTCTCATTCCGAAGATCAAGGTCCCTCGTCGAGTTTCTGACTTTCGGCCCATCTCGCTATGCAATTTTAGCTATAAGTTGATTTCGAAGGCAGTGGTTAATAGGATGAAGCATATCCTTCCAAAACTTATTTCGCCCAACCAAAGTGCCTTTGTCGCTGGAAGGTGTGTGGTGGATAATGCCATCTTGGGGTTCGAATGCATTCATGAGTTAAGGCGACGGACTGGAGGAAAATCTAAATGGGCTGCTCTAAAACTTGACATGAGCAAAGCATATGACAGGATAGAGTGGTCGTTTCTACGGTCAGTTATGGATAGAATGGGTTTTGCTCAACAGTGGACTGATTTGATTCTTCGGTGCGTTAGCTCGGTCTCCTTCTCATTTAACCTGAATGGGGAGCGGTTGGGGAATGTGATCCCTTCCCGTGGGCTCAGGCAGGGAGACCCGCTGTCCCCGTACCTGTTTTTGCTTTGTGCGGAGGGTTTGTCGAGCTTGTTGCGGGGAGCTGAACGTCGAGCTTTGATATCTGGGTTTAGGGTTGCGCGGAGTAGCCCCCCGATTTCTCATCTATTTTTTGCGGATGATAGTCTCCTTTTCTTCAAAGCGAACGTTAATGAAGCAGTGAC
TATCCGGGACCTATTGATCTGTTATGAACGAGCCTCGGGTCAGGTGATTAATTATGAGAAGTCAGTGGTTGCGTTCAGTCCAAACACTGGTGAGGACTCACAACAGTATATCAGTCATGTGCTCTCGGTATCTCGGTGTCCGTGTCATCAACGATATCTTGGGCTCCCCTCATTTATGCCTAAGAATCGGTCGGGAACGTTGATGTTTATTAAGGATCGTGTATGGAAGCAGATCCAGGGTTGGAAGGGAAAGTTTTTTTCCTTGGGTGGTAAGGAAGTCCTTCTAAAATCTATCATTCAGGCCATACCTTGTTACACGATGAATTGCTTTCGTCTGCCTCGTTGCCTGATTAGAGAAATCCATCGGGCCATGGCCAGGTTCTGGTGGAATGAGTCTGAGGAGGGGAAGAGGATCCATTGGGTGAGTTGGGACCACATGTGTCGTCCTAAGTGTATGGGGGGTTTGGGTTTCCGTAATATGGAGCTTTTTAATCAAGCGCTTTTGGCTAAACAGTGCTGGCGAGTAATCCAGGATCCTGAATCCCTTTTGGGTGCCGTTCTGAAGGGTAGGTATTTTCCTCACTCTGAGTTTTGGGAGGCGTCTCTGGGTCATCGGCCTTCGTTCATCTGGCGCAGTCTGTTGTGGGGTCGGGAGCTGCTGGTTCGAGGGTGCAGGTGGAGGATTGGGAATGGTCGATCTATTCCCATATATGGTTCGAATTGGGTGCCGGATAATCCGTCTCTGCGTGTGCAGTCTGCTCCTTCGCTTCCTTTATCAAGTAGGGTCTGTGATTTGTTTTCTCCGTCAGGACAGTGGGACGAGGCTAAGGTGCGTGCCCATTTTTTGGGGCCTGAGTGTGAGGCCATTCTAAGGATTCCCTTGCGCTCTGGACTGCTTGAAGATCGACTTATTTGGCATTTTGAGAAGCATGGTGTGTTCTCTGTGAAGAGTGGGTATAGGTTGGCTTTCTCCTTGGCGTCCCAGGGTGTCCGTCTTCTTCTGAGTCTGAGCCCTGGCGGATTTGGTGGTCTAGTCTATGGAGACTTGGGATCCCGAATAAGCACAAGGTTTTCCTATGGCGTCTCGTCCTGGAACGGCTGCCCACTAAGTTCTGTGGTGGAGGATGGTCTCCATCTTTTCTGGAAGTGCGCCGTGACTAGGGAAATGTGGCTCTGCTCGAAATTTTCTCAGCTATACCAGTCGTTATACCACCTGGATCTTGTTGATGTCATCTGGGCATTGAGGGAGAAGTTGGGCGCATTAGACTTTGAGCTTGTGACGGTGTTCTGGTGGTCAGTTTGGAATTTACGTAACAATTTGTGTTGGAGGGGAGAATCTGATGGTCGAGATTTGTGGTCATGGTCTGAAGAGTATCTGAGGGCGTACTATGATGTTGTCGGGCGGCGGGAGTCTCGTTGTAGTTTGCAGCCTTGTCCCAGGCGGCCGGCCGAGCAGTCTTCATGGACTCCCCCGGTGGGCGGCGGATTTAAGTTGAACACCGATGCCTCGGTTAGGCCTGATACGGGTGAAGCGGGGGGAGGTTGTGTTCTTCGGGATATGTCTGGTGCAGTGCTTTTAGCGGCATGTTTGGACCTGCCCAGGTGCTGGAGTGTGGATCTGGCGGAAGGTTGGGCATTGGTGAAGGGCGTGGAGCTAGCGTTACAGATGGGTTTCTTAAGTTTCTGTGTCGAGGTGGATTCATTAAGGCTGGTTCGAATTCTACATGGGGAGGTGATTGATTCTTCAGAAGTAGGCCTGTTGATGGATGATGTCCGACGTCTTCTCCATCCTTGTGGGAGGGGAAAGGTCCTTTTTACGCCACGGAATGGGAATAGAGTGGCTCATGCTCTAGCCTGTTTGACCTTCTCCTATTCGGGTTGCGTTTGGCTGGAGGAGTGGCCTATGGAAATCGCTGCGGTGCTGGCTGGGGATGTCGCGTTGTGTCATGAGGGGGCCCCTGTAGGGGAGAGTGAGCGCTCCATGCCGTCTCCAGTGGCGG
TTGAGTTTGAGCAGGTCGAGAGCGGGAGTGGTCCAAGGAATCAACTTGCATGTTTAGGGTGCTATTGTGGGTGGGGTGTGTTTGTAGTTAAGGAGGGGTTGCATTGTCTGGTTCTGCATGGTGAAATAGGTCCGTCCTGGAGTGGTTTAGTCTGGCATCCTGTTAGGATGACTTGGGGCGGAAGCGAGGGAGGAGCTTGGTGGCTAACTGGATCGTGGATGAAGTCATATGGTTGA

Coding sequence (CDS)

ATGGCATCAGCGAAGCGAGCTTTGGGTTTCGAGAACGGTTTTTGTGTTGATAGCAAAGGTAAAAGTGGTGGTTTGGCTCTGTTGTGGGATGCGTCTGTCACCTTCAGCCTTTTGTCATTTTCGAATAACCACATTGATGGGTGGATCACGTGGGATGATTACCATTGGCGTCTCACTGGCTTCTATGGTTTTCCTGCGGCGGATATGCGAGATCAGACGTGGTCCCTTCTCTCTAAGTTAAGGGGTGGTTCTGATACTCCTTGGCTTATAGGAGGGGATTTCAATGCCCTGTTGTATCAGCATGAGAAGGAGGGTGGCAGAGATAAACCTCTTTCAGAGTTGGCGGCCTTTCAGAACGTGATTGACTCCTGTGCACTTCTTGACTTGGGTTTTGTGGGGAATAGGTTCACATGGTGCAACAGGCGGCCTGATGGAACGATCTACGAGCGTTTGGATAGGTGCTTTAGTTCCGCTACGTGGCATGATATCTATCCCAACTGTGTAGTTAATCACCTGGATTATCACCAGTCTGATCATCGTCCAATTGAGTTGGTCCTTTCCCCGCAACCTGGCTGTTGGAGAAACCCGAGTCAGCGAATCACTCGGTTCGATGAGACTTGGCTAAAGCGTGCAGATTTGCAGCAGTTGGTCAGAGACTCATGGGGGTTGAGTAGGGAGGACCCTGGTTTGTCAGCTCCCCAGATTTTGGCTCAGGTGTCCAAGAGATGCATGCGTTCGATGGCTGGTTGGGGTCGCTCAAGAATGGGGAACTTCCCTCAGCGCATCAGTGAGGCCAATCAGAAGGTACAGCTGGCCATTGAGGGGTTGAGAGGGGCTGGGTCCCGTGAACTACTTTCCCAGGCAGAAGCCCAGTTGGAAGATGTATTGCAGGAGGAGGAACTTTACTGGAAGCAGCGATCCAGAGAGGTGTGGTTGAAGGAAGGGGATCAAAATACTCGGTGGTTTCATCGTCAGGCTTCGTATAGACAACGACTCAATCGGATTAGGGGTCTCACAGATGACCAAGGAGAATGGCGCCAGGACAAAACTATGATTCTTCAGTTGGTGAATGATTATTTCCAGCAGCTCTTCTCGACATCAGAGCCGAGTGAACAGGATTTTGATATATCTCTTAGGGACATTCAACGCTCTGTAGATAATGAGATGAATGTGGAGTTGCTGCGCCCTTTTACGGAGAATGAGATTCTTCGGGCTCTGAAGCAGTCTCATCCTCACAAGGCCCCAGGTCCAGATGGGCTATCTGGCAGTTTCTATAAGAACCACTGGTCGATAGTGGGGCCTTCAGTGGTACAGAGTTGCCTGGCTGTTTTAAATCACGGATGTTCCCCGGTTTCAATTAATGATACTATGATTGTTCTCATTCCGAAGATCAAGGTCCCTCGTCGAGTTTCTGACTTTCGGCCCATCTCGCTATGCAATTTTAGCTATAAGTTGATTTCGAAGGCAGTGGTTAATAGGATGAAGCATATCCTTCCAAAACTTATTTCGCCCAACCAAAGTGCCTTTGTCGCTGGAAGGTGTGTGGTGGATAATGCCATCTTGGGGTTCGAATGCATTCATGAGTTAAGGCGACGGACTGGAGGAAAATCTAAATGGGCTGCTCTAAAACTTGACATGAGCAAAGCATATGACAGGATAGAGTGGTCGTTTCTACGGTCAGTTATGGATAGAATGGGTTTTGCTCAACAGTGGACTGATTTGATTCTTCGGTGCGTTAGCTCGGTCTCCTTCTCATTTAACCTGAATGGGGAGCGGTTGGGGAATGTGATCCCTTCCCGTGGGCTCAGGCAGGGAGACCCGCTGTCCCCGTACCTGTTTTTGCTTTGTGCGGAGGGTTTGTCGAGCTTGTTGCGGGGAGCTGAACGTCGAGCTTTGATATCTGGGTTTAGGGTTGCGCGGAGTAGCCCCCCGATTTCTCATCTATTTTTTGCGGATGATAGTCTCCTTTTCTTCAAAGCGAACGTTAATGAAGCAGTGAC
TATCCGGGACCTATTGATCTGTTATGAACGAGCCTCGGGTCAGGTGATTAATTATGAGAAGTCAGTGGTTGCGTTCAGTCCAAACACTGGTGAGGACTCACAACAGTATATCAGTCATGTGCTCTCGGTATCTCGGTGTCCGTGTCATCAACGATATCTTGGGCTCCCCTCATTTATGCCTAAGAATCGGTCGGGAACGTTGATGTTTATTAAGGATCGTGTATGGAAGCAGATCCAGGGTTGGAAGGGAAAGTTTTTTTCCTTGGGTGGTAAGGAAGTCCTTCTAAAATCTATCATTCAGGCCATACCTTGTTACACGATGAATTGCTTTCGTCTGCCTCGTTGCCTGATTAGAGAAATCCATCGGGCCATGGCCAGGTTCTGGTGGAATGAGTCTGAGGAGGGGAAGAGGATCCATTGGGTGAGTTGGGACCACATGTGTCGTCCTAAGTGTATGGGGGGTTTGGGTTTCCGTAATATGGAGCTTTTTAATCAAGCGCTTTTGGCTAAACAGTGCTGGCGAGTAATCCAGGATCCTGAATCCCTTTTGGGTGCCGTTCTGAAGGGTAGGTATTTTCCTCACTCTGAGTTTTGGGAGGCGTCTCTGGGTCATCGGCCTTCGTTCATCTGGCGCAGTCTGTTGTGGGGTCGGGAGCTGCTGGTTCGAGGGTGCAGGTGGAGGATTGGGAATGGTCGATCTATTCCCATATATGGTTCGAATTGGGTGCCGGATAATCCGTCTCTGCGTGTGCAGTCTGCTCCTTCGCTTCCTTTATCAAGTAGGGTCTGTGATTTGTTTTCTCCGTCAGGACAGTGGGACGAGGCTAAGGTGCGTGCCCATTTTTTGGGGCCTGAGTGTGAGGCCATTCTAAGGATTCCCTTGCGCTCTGGACTGCTTGAAGATCGACTTATTTGGCATTTTGAGAAGCATGGTGTGTTCTCTGTGAAGAGTGGGTATAGGTTGGCTTTCTCCTTGGCGTCCCAGGGTGTCCGTCTTCTTCTGAGTCTGAGCCCTGGCGGATTTGGTGGTCTAGTCTATGGAGACTTGGGATCCCGAATAAGCACAAGGTTTTCCTATGGCGTCTCGTCCTGGAACGGCTGCCCACTAAGTTCTGTGGTGGAGGATGGTCTCCATCTTTTCTGGAAGTGCGCCGTGACTAGGGAAATGTGGCTCTGCTCGAAATTTTCTCAGCTATACCAGTCGTTATACCACCTGGATCTTGTTGATGTCATCTGGGCATTGAGGGAGAAGTTGGGCGCATTAGACTTTGAGCTTGTGACGGTGTTCTGGTGGTCAGTTTGGAATTTACGTAACAATTTGTGTTGGAGGGGAGAATCTGATGGTCGAGATTTGTGGTCATGGTCTGAAGAGTATCTGAGGGCGTACTATGATGTTGTCGGGCGGCGGGAGTCTCGTTGTAGTTTGCAGCCTTGTCCCAGGCGGCCGGCCGAGCAGTCTTCATGGACTCCCCCGGTGGGCGGCGGATTTAAGTTGAACACCGATGCCTCGGTTAGGCCTGATACGGGTGAAGCGGGGGGAGGTTGTGTTCTTCGGGATATGTCTGGTGCAGTGCTTTTAGCGGCATGTTTGGACCTGCCCAGGTGCTGGAGTGTGGATCTGGCGGAAGGTTGGGCATTGGTGAAGGGCGTGGAGCTAGCGTTACAGATGGGTTTCTTAAGTTTCTGTGTCGAGGTGGATTCATTAAGGCTGGTTCGAATTCTACATGGGGAGGTGATTGATTCTTCAGAAGTAGGCCTGTTGATGGATGATGTCCGACGTCTTCTCCATCCTTGTGGGAGGGGAAAGGTCCTTTTTACGCCACGGAATGGGAATAGAGTGGCTCATGCTCTAGCCTGTTTGACCTTCTCCTATTCGGGTTGCGTTTGGCTGGAGGAGTGGCCTATGGAAATCGCTGCGGTGCTGGCTGGGGATGTCGCGTTGTGTCATGAGGGGGCCCCTGTAGGGGAGAGTGAGCGCTCCATGCCGTCTCCAGTGGCGG
TTGAGTTTGAGCAGGTCGAGAGCGGGAGTGGTCCAAGGAATCAACTTGCATGTTTAGGGTGCTATTGTGGGTGGGGTGTGTTTGTAGTTAAGGAGGGGTTGCATTGTCTGGTTCTGCATGGTGAAATAGGTCCGTCCTGGAGTGGTTTAGTCTGGCATCCTGTTAGGATGACTTGGGGCGGAAGCGAGGGAGGAGCTTGGTGGCTAACTGGATCGTGGATGAAGTCATATGGTTGA

Protein sequence

MASAKRALGFENGFCVDSKGKSGGLALLWDASVTFSLLSFSNNHIDGWITWDDYHWRLTGFYGFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCALLDLGFVGNRFTWCNRRPDGTIYERLDRCFSSATWHDIYPNCVVNHLDYHQSDHRPIELVLSPQPGCWRNPSQRITRFDETWLKRADLQQLVRDSWGLSREDPGLSAPQILAQVSKRCMRSMAGWGRSRMGNFPQRISEANQKVQLAIEGLRGAGSRELLSQAEAQLEDVLQEEELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIRGLTDDQGEWRQDKTMILQLVNDYFQQLFSTSEPSEQDFDISLRDIQRSVDNEMNVELLRPFTENEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVVQSCLAVLNHGCSPVSINDTMIVLIPKIKVPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISPNQSAFVAGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWSFLRSVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFKANVNEAVTIRDLLICYERASGQVINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHQRYLGLPSFMPKNRSGTLMFIKDRVWKQIQGWKGKFFSLGGKEVLLKSIIQAIPCYTMNCFRLPRCLIREIHRAMARFWWNESEEGKRIHWVSWDHMCRPKCMGGLGFRNMELFNQALLAKQCWRVIQDPESLLGAVLKGRYFPHSEFWEASLGHRPSFIWRSLLWGRELLVRGCRWRIGNGRSIPIYGSNWVPDNPSLRVQSAPSLPLSSRVCDLFSPSGQWDEAKVRAHFLGPECEAILRIPLRSGLLEDRLIWHFEKHGVFSVKSGYRLAFSLASQGVRLLLSLSPGGFGGLVYGDLGSRISTRFSYGVSSWNGCPLSSVVEDGLHLFWKCAVTREMWLCSKFSQLYQSLYHLDLVDVIWALREKLGALDFELVTVFWWSVWNLRNNLCWRGESDGRDLWSWSEEYLRAYYDVVGRRESRCSLQPCPRRPAEQSSWTPPVGGGFKLNTDASVRPDTGEAGGGCVLRDMSGAVLLAACLDLPRCWSVDLAEGWALVKGVELALQMGFLSFCVEVDSLRLVRILHGEVIDSSEVGLLMDDVRRLLHPCGRGKVLFTPRNGNRVAHALACLTFSYSGCVWLEEWPMEIAAVLAGDVALCHEGAPVGESERSMPSPVAVEFEQVESGSGPRNQLACLGCYCGWGVFVVKEGLHCLVLHGEIGPSWSGLVWHPVRMTWGGSEGGAWWLTGSWMKSYG
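The protein sequence corresponds to the coding sequence read three bases at a time. As a minimal sketch, assuming the standard nuclear genetic code applies, the first six codons of the CDS can be translated as follows. The helper `translate` and the table construction are illustrative and are not part of the database's own tooling.

```python
# Standard genetic code, with codons ordered by base in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing incomplete codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 18 bases of the CDS listed above.
print(translate("ATGGCATCAGCGAAGCGA"))  # MASAKR
```

The result matches the first six residues of the protein sequence above (MASAKR).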
Homology
BLAST of Lag0038334 vs. NCBI nr
Match: XP_024172304.2 (uncharacterized protein LOC112178381 [Rosa chinensis])

HSP 1 Score: 918.7 bits (2373), Expect = 6.1e-263
Identity = 518/1340 (38.66%), Positives = 744/1340 (55.52%), Query Frame = 0

Query: 8    LGFENGFCVDSK---------GKSGGLALLWDASVTFSLLSFSNNHIDGWI--TWDDYHW 67
            LG+ N F VD +          ++GGL LLW   +  +L +FS+NHID  I    D   W
Sbjct: 51   LGYRNAFAVDCQVVKNPNGRVSRAGGLCLLWKEGIDVALSTFSDNHIDVLIGGVGDKNRW 110

Query: 68   RLTGFYGFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAA 127
            R TG YG    ++R  TW+L++K+   +  PWLIGGDFN +L   EKEGG  +   ++ A
Sbjct: 111  RFTGVYGHSKVELRHLTWALITKIGYNNHWPWLIGGDFNEILKACEKEGGPPRCTRQMEA 170

Query: 128  FQNVIDSCALLDLGFVGNRFTWCNRRPDGTIYERLDRCFSSATWHDIYPNCVVNHLDYHQ 187
            F+  ++ C L DL FVG  FTW  +R    I  RLDR  ++ +W D++P   V HL   +
Sbjct: 171  FRRCVEGCCLNDLNFVGPCFTWRGKRGGEEIKVRLDRFMATRSWSDLFPTSRVTHLKPSK 230

Query: 188  SDHRPIEL-VLSPQPGCWRNPSQRITRFDETWLKRADLQQLVRDSW-GLSREDPGLSAPQ 247
            SDH PI + V S  P   +   +R  RF+E WL  A+   +V+D W  ++  DP     Q
Sbjct: 231  SDHLPILVEVRSTIPR--KRRRKRRFRFEEHWLHEAECANVVKDGWESVAGNDPF----Q 290

Query: 248  ILAQVSKRCMRSMAGWGRSRMGNFPQRISEANQKVQLAIEGLRGAGSRELLSQAEAQLED 307
             +    ++  +++  W   + G+    I     K+ +  +    A   E   + E +L D
Sbjct: 291  TICMRIEQTRKALWVWSDQKFGHLKAEIERIRAKLAVFYDKSLSAYPEEERLELETKLND 350

Query: 308  VLQEEELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIRGLTDDQGEWRQDKTMILQ 367
            +L  E  YW+QRSR +WL +GD NTR+FH +AS R++ N I GL ++ G W  + + +  
Sbjct: 351  LLYHEHNYWQQRSRVMWLTDGDLNTRFFHHRASNRKKRNAISGLFNNDGVWCTEDSDLEN 410

Query: 368  LVNDYFQQLFSTSEPSEQDFDISLRDIQRSVDNEMNVELLRPFTENEILRALKQSHPHKA 427
            +V DYF  LFSTS P   +   +L    + V   MN EL+R F E EIL+AL Q HP KA
Sbjct: 411  IVLDYFGTLFSTSSPKNMELFTNL--FPQVVTGAMNSELVREFGEEEILQALNQMHPLKA 470

Query: 428  PGPDGLSGSFYKNHWSIVGPSVVQSCLAVLNHGCSPVSINDTMIVLIPKIKVPRRVSDFR 487
            PGPDG S  FY+ +WS+VG  V+ +    +N       +N T + LIPK+K    +   R
Sbjct: 471  PGPDGFSPIFYQRYWSVVGRDVIAAVRCFMNSEDFLREVNGTYVTLIPKVKEVENMQQLR 530

Query: 488  PISLCNFSYKLISKAVVNRMKHILPKLISPNQSAFVAGRCVVDNAILGFECIHELRRRTG 547
            PISLCN  YKL SK + NR+K +L  +I+P QSAFV GR + DN++L FE  H L+RRTG
Sbjct: 531  PISLCNVIYKLGSKVLANRLKPLLQDIIAPTQSAFVPGRQISDNSLLAFELSHFLKRRTG 590

Query: 548  GKSKWAALKLDMSKAYDRIEWSFLRSVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLG 607
            G   + ALKLDMSKAYDR+EW F+ +VM  MGF Q W   I+ CV++VS+SF LNGE  G
Sbjct: 591  GSHGYGALKLDMSKAYDRVEWEFIEAVMRSMGFDQIWIKWIMGCVTTVSYSFLLNGEPRG 650

Query: 608  NVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDS 667
            ++IP+RGLRQGD +SPYLFLLCAEGLS +L   E +  + G  +A  +P I+HLFFADDS
Sbjct: 651  HLIPTRGLRQGDSISPYLFLLCAEGLSRMLSYEEEQHRLHGIAIAMGAPSINHLFFADDS 710

Query: 668  LLFFKANVNEAVTIRDLLICYERASGQVINYEKSVVAFSPNTGEDSQQYISHVLSVSRCP 727
             +F KA   E   ++++L  YE ASGQ +N++KS ++FS N     Q+ ++ V  V R  
Sbjct: 711  FVFMKAEREECARVKEILKWYEDASGQQVNFQKSKISFSKNVDIGCQEELAEVFGVERVD 770

Query: 728  CHQRYLGLPSFMPKNRSGTLMFIKDRVWKQIQGWKGKFFSLGGKEVLLKSIIQAIPCYTM 787
             H +YLGLP+ +  +++    FI ++   +++ WK K  S+ GKEV++KS++Q++P Y M
Sbjct: 771  KHDKYLGLPTEVSYSKTEAFQFIMEKTRNKMKNWKDKTLSVAGKEVMIKSVVQSVPTYVM 830

Query: 788  NCFRLPRCLIREIHRAMARFWWNESEEGKRIHWVSWDHMCRPKCMGGLGFRNMELFNQAL 847
            +CF LP+ L +E+HR MA FWW +SE+G++IHW++WD MC PK  GGLGFRNME FNQAL
Sbjct: 831  SCFELPKHLCQEMHRCMAEFWWGDSEKGRKIHWLAWDKMCVPKEKGGLGFRNMEYFNQAL 890

Query: 848  LAKQCWRVIQDPESLLGAVLKGRYFPHSEFWEASLGHRPSFIWRSLLWGRELLVRGCRWR 907
            LAKQ WR+++ P+SLLG  LK +YFP+++F  AS+    S+ WRSL+ G+ LL +G R++
Sbjct: 891  LAKQGWRILRHPDSLLGKTLKAKYFPNNDFIHASVNQGDSYTWRSLMKGKVLLEKGLRFQ 950

Query: 908  IGNGRSIPIYGSNWVPDNPSLRVQSAPSLPLSS-RVCDLFSP-SGQWDEAKVRAHFLGPE 967
            +G+G  I ++   W+P   S R  S     L    V DL  P S  W    +   F   E
Sbjct: 951  VGSGTRISVWFDPWIPRPYSFRPYSTVMEGLEDLTVADLIDPDSKDWMVDWLEELFFADE 1010

Query: 968  CEAILRIPLRSGLLEDRLIWHFEKHGVFSVKSGYRLAFSLASQGVRLLLSLSPGG----- 1027
             + I +IPL     EDRLIWHF+K G++SVKSGY +A  +AS    +  S S G      
Sbjct: 1011 VDLIRKIPLSLRNPEDRLIWHFDKRGLYSVKSGYHVARCVASLSSHVSTSNSQGDKDLWR 1070

Query: 1028 ----------FGGLVYGDLGSRISTRFSYGVS---SWNGCPLSSV-VEDGLHLFWKCAVT 1087
                          V+  + + + T+ + G         CP      E  LH+F +C V 
Sbjct: 1071 RVWHARVQPKVRNFVWRLVKNIVPTKVNLGRRVNLDERICPFCRCESETTLHVFMECNVI 1130

Query: 1088 REMWLCSKFSQLYQSLYHLDLVDVIWALREKLGALDFELVTVFWWSVWNLRNNLCWRGES 1147
              MWL S      ++     + + +  + + L     ++  +  W++W+ RN L W G +
Sbjct: 1131 ACMWLFSSLGLRAKNHTTNSVKEWVLDMLDVLNKSQVDIFFMLLWAIWSERNKLVWNGGT 1190

Query: 1148 -DGRDLWSWSEEYLRAYYDVVGRRESRCSLQPCPRRPAEQSSWTPPVGGGFKLNTDASVR 1207
             +     +WS   L  Y     R     S    PR  A  + W  P  G  K+N D + +
Sbjct: 1191 FNPMHTVTWSMHLLSEYQ----RCHPEKSTHKSPRGAA--TKWMFPPRGRLKINVDGAYK 1250

Query: 1208 PDTGEAGGGCVLRDMSGAVLLAACLDLPRCWSVDLAEGWALVKGVELALQMGFLSFCVEV 1267
             + G  G G V+RD  G    A    +P   S    E  A   G+ +AL  G+    +E 
Sbjct: 1251 SNEGCGGIGVVVRDEMGIFRGARSRKIPYMCSAFHGEAEACRAGLLMALHHGWKQVELET 1310

Query: 1268 DSLRLVRILHGEVIDSSEVGLLMDDVRRLLHPCGRGKVLFTPRNGNRVAHALA-CLTFSY 1312
            D   L   L+ ++ D+SEV  ++DD +  LH     +V    R  N VA+ LA   +  +
Sbjct: 1311 DCAILATALNQQMEDNSEVSRILDDCKNYLHGFDWIRVRHIYREANSVANRLAHFASLDH 1370
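The percentages in each HSP summary line are the match counts divided by the alignment length. The sketch below parses such a line and recomputes both values as a consistency check. The function name `parse_hsp_summary` is hypothetical, and the pattern also tolerates a misspelled "Postives" label.

```python
import re

HSP_PATTERN = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos(?:i)?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Return identity and positive counts, alignment length, and recomputed percentages."""
    match = HSP_PATTERN.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, length, _, positives, length2, _ = match.groups()
    if length != length2:
        raise ValueError("identity and positives report different alignment lengths")
    identical, positives, length = int(identical), int(positives), int(length)
    return {
        "identical": identical,
        "positives": positives,
        "length": length,
        "pct_identity": round(100 * identical / length, 2),
        "pct_positives": round(100 * positives / length, 2),
    }


summary = parse_hsp_summary(
    "Identity = 518/1340 (38.66%), Positives = 744/1340 (55.52%), Query Frame = 0"
)
print(summary["pct_identity"], summary["pct_positives"])  # 38.66 55.52
```

For the Rosa chinensis hit, 518/1340 and 744/1340 reproduce the reported 38.66% and 55.52%.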

BLAST of Lag0038334 vs. NCBI nr
Match: XP_030936391.1 (uncharacterized protein LOC115961572 [Quercus lobata])

HSP 1 Score: 916.8 bits (2368), Expect = 2.3e-262
Identity = 506/1338 (37.82%), Positives = 733/1338 (54.78%), Query Frame = 0

Query: 11   ENGFCVDSKGKSGGLALLWDASVTFSLLSFSNNHIDGWIT--WDDYHWRLTGFYGFPAAD 70
            ++G  V S G  GGLALLW   +T  + +++ +HID WI   WD   W  TGFYG P   
Sbjct: 54   KHGLTVSSDGSKGGLALLWKEGITVKINTYAQDHIDAWIEGGWDGVSWHFTGFYGNPDTA 113

Query: 71   MRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCALLD 130
             R ++W+ L  L+G +  PWL  GDFN +    EKEGGR +P  ++  F + I+ C   +
Sbjct: 114  QRPESWAKLKSLKGTTSVPWLAIGDFNEITGLTEKEGGRVRPRRQMENFVDAINYCGFRE 173

Query: 131  LGFVGNRFTWCNRRPDGT-IYERLDRCFSSATWHDIYPNCVVNHLDYHQSDHRPIELVLS 190
            + F+G ++TW   R DG  I ERLDR  ++  W D++P   + HL    SDH P+ L L 
Sbjct: 174  VDFIGPKYTWWYHRADGMHIRERLDRALANKEWMDLFPAAKLYHLSSSASDHSPLSLHLV 233

Query: 191  PQPGCWRNPSQRITRFDETWLKRADLQQLVRDSWGLSREDPGLSAPQILAQVSKRCMRSM 250
            P+    +   ++  RF+  WLK +  +++V+ +W +        A  IL    + C   +
Sbjct: 234  PKRK--KKKIRKSFRFESMWLKDSRCEEIVKAAWEIGEHS---GAEGILKSCLEHCRHDL 293

Query: 251  AGWGRSRMGNFPQRISEANQKVQLAIEGLRGAGSRELLSQAEAQLEDVLQEEELYWKQRS 310
              W +   G+  ++ISE  QK++        +G    L      L   L++E+  W+QRS
Sbjct: 294  EKWNKEEFGHVGRKISELQQKLEWLELQPSSSGILGELRTTRVNLNKWLEKEDEMWRQRS 353

Query: 311  REVWLKEGDQNTRWFHRQASYRQRLNRIRGLTDDQGEWRQDKTMILQLVNDYFQQLFSTS 370
            R  W + GD+NT +FH +AS R + N I G+ D+QG W++D+  I ++   YF++LF++S
Sbjct: 354  RLNWFQGGDRNTSFFHAKASARHQKNYIDGIVDEQGRWQEDELKIEEVAVAYFEKLFTSS 413

Query: 371  EPSEQDFDISLRDIQRSVDNEMNVELLRPFTENEILRALKQSHPHKAPGPDGLSGSFYKN 430
            +P E  F   L  +Q  V  +MNVEL R +T  E+  ALKQ +P KAPGPDG+   F+++
Sbjct: 414  KPEE--FSDILHAVQPKVTTDMNVELTREYTAQEVRLALKQMYPLKAPGPDGMPPLFFQH 473

Query: 431  HWSIVGPSVVQSCLAVLNHGCSPVSINDTMIVLIPKIKVPRRVSDFRPISLCNFSYKLIS 490
             W+  G  V  + L  LNHG SP + N+T IVLIPKI  P+ VSD+RPISLCN +YK+ S
Sbjct: 474  FWNTCGEVVTSTVLDFLNHGMSPPNFNETHIVLIPKINEPKHVSDYRPISLCNVTYKIAS 533

Query: 491  KAVVNRMKHILPKLISPNQSAFVAGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMS 550
            KA+ NR+K  LP +IS  QSAFV GR + DN ++ FE +H + R+ GGK    A+KLDMS
Sbjct: 534  KAIANRLKKFLPSIISDTQSAFVHGRLITDNVLVAFETMHHISRKKGGKVGEMAIKLDMS 593

Query: 551  KAYDRIEWSFLRSVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDP 610
            KAYDR+EW F+  +M+++GF      LI++C+++VS++  +NG   G +IPSRG+RQGDP
Sbjct: 594  KAYDRVEWVFVEKIMEKLGFDINLRSLIMQCITTVSYAIKINGRPRGRIIPSRGIRQGDP 653

Query: 611  LSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFKANVNEAVT 670
            LSPYLFLLCAEGLS+L++ +     + G  + R  P +SHLFFADDSL+F KA + E   
Sbjct: 654  LSPYLFLLCAEGLSALIKASVGNGSMEGIAICRGGPQLSHLFFADDSLIFCKATIAECDA 713

Query: 671  IRDLLICYERASGQVINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHQRYLGLPSFMP 730
            ++ +L  YE+ASGQ +N  K+ + FS NT ++ Q+ I           H++YLGLPS + 
Sbjct: 714  LQRVLGVYEQASGQQLNRAKTSLFFSSNTPKEIQEEIKGRFGAQVIKQHEKYLGLPSLVG 773

Query: 731  KNRSGTLMFIKDRVWKQIQGWKGKFFSLGGKEVLLKSIIQAIPCYTMNCFRLPRCLIREI 790
            KN+  T   IK+++ K++ GWK K  S  GKE+L+K++  A+P YTM+CF+LP  L  E+
Sbjct: 774  KNKRSTFNDIKEKLGKKLSGWKEKLLSKAGKEILIKAVALAVPTYTMSCFKLPDNLCDEL 833

Query: 791  HRAMARFWWNESEEGKRIHWVSWDHMCRPKCMGGLGFRNMELFNQALLAKQCWRVIQDPE 850
               + +FWW + +   RI W+SWD MC  K  GG+GF+N++LFN ALLAKQ WR+    +
Sbjct: 834  TAMIRKFWWGQVKNENRIPWLSWDKMCESKSNGGMGFKNLKLFNLALLAKQGWRLQVGQD 893

Query: 851  SLLGAVLKGRYFPHSEFWEASLGHRPSFIWRSLLWGRELLVRGCRWRIGNGRSIPIYGSN 910
            SL+  VLK +YFP  EF  ASLG+ PS+ WRS++  + L+  G +WR+GNG SI ++   
Sbjct: 894  SLVYRVLKAKYFPRCEFIHASLGNNPSYSWRSIMAAQSLVKEGLKWRVGNGASIRVWEDK 953

Query: 911  WVPDNPSLRVQSAPSLPLSS--RVCDLF-SPSGQWDEAKVRAHFLGPECEAILRIPLRSG 970
            W+P  PS +V   P L L S  RV DL  S  G+W    +   FL  E ++I  IP+ + 
Sbjct: 954  WLPTPPSHKV-ITPRLFLHSDTRVADLLDSEKGEWRTEVIDTVFLPHEADSIKSIPISAR 1013

Query: 971  LLEDRLIWHFEKHGVFSVKSGYRLAFSLASQGVRLLLS--LSPGGFGGLVYGDLGSRIST 1030
            L  D+LIW    +G+F+V+S Y+LA +L S   +   S       F   V+      I  
Sbjct: 1014 LPPDKLIWSETPNGLFTVRSAYKLAVNLLSMPNKGAPSDASKMRSFWRRVWS-----IPV 1073

Query: 1031 RFSYGVSSWNGCPLSSVVEDGL----------------------HLFWKCAVTREMWLCS 1090
                    W  C  +   +D L                      H+ W+C   RE   CS
Sbjct: 1074 PHKIRHFMWRACRNALPTKDNLLRRKIVQDDVCEDCKETPESVFHVLWECRKAREARECS 1133

Query: 1091 K--FSQLYQSLYHLDLVDVIWAL--REKLGALDFELVTVFWWSVWNLRNNL-CWRGESDG 1150
            K  F  L  S   L  VDV+W L  +E +G           W++W+ RN + C      G
Sbjct: 1134 KMVFPDLGGS--SLSFVDVMWKLIMQEDVGEEHVAQAATTAWAIWHNRNEVRCGGVRKTG 1193

Query: 1151 RDLWSWSEEYLRAYYDVVGRRESRCSLQPCPRRPAEQSSWTPPVGGGFKLNTDASVRPDT 1210
            R L+SW+ EYLR Y      R +    +P    P     WTPP  G FK+N D ++    
Sbjct: 1194 RQLFSWATEYLREY------RAANQLDRPVVPTPQRNVRWTPPRDGLFKINVDGAIFIKQ 1253

Query: 1211 GEAGGGCVLRDMSGAVLLAACLDLPRCWSVDLAEGWALVKGVELALQMGFLSFCVEVDSL 1270
               G G V+RD  G +  A    +         E  A+  G+  A  +G     +E DS 
Sbjct: 1254 RAVGVGVVIRDSEGRLEAALSRKIQMPLGAAEVEAKAVEVGLLFAKDVGVRDIVLEGDST 1313

Query: 1271 RLVRILHGEVIDSSEVGLLMDDVRRLLHPCGRGKVLFTPRNGNRVAHALACLTFSYSGCV 1313
             +   L       S +  +++ ++ +       +     R GN  AH LA    S +  V
Sbjct: 1314 VVYNALCNCSRAPSTIAAVINGIQDIGKEFRSIEYSHVRRQGNMPAHILAKNASSINDYV 1370

BLAST of Lag0038334 vs. NCBI nr
Match: XP_038718167.1 (uncharacterized protein LOC120011171 [Tripterygium wilfordii])

HSP 1 Score: 911.8 bits (2355), Expect = 7.5e-261
Identity = 506/1345 (37.62%), Positives = 748/1345 (55.61%), Query Frame = 0

Query: 1    MASAKRALGFENGFCVDSKGKSGGLALLWDASVTFSLLSFSNNHIDGWITWD--DYHWRL 60
            M    R LGF           + GLALLW    +  LLSFS NHIDG +  +  D+ W +
Sbjct: 119  MERIARKLGFAGCLATGCVDGNVGLALLWRGGASVHLLSFSKNHIDGQVVCEGLDHQWCI 178

Query: 61   TGFYGFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQ 120
            TGFYG P    R  +WSLL  L+G SD PWL+ GDFN +L   EK GGR + +  +  FQ
Sbjct: 179  TGFYGNPVQSQRCHSWSLLRHLKGCSDLPWLVLGDFNEILDSAEKLGGRARGVHAMRDFQ 238

Query: 121  NVIDSCALLDLGFVGNRFTWCNRRPDGTIYERLDRCFSSATWHDIYPNCVVNHLDYHQSD 180
            + +++C L DLG+VG ++TWCN R +G I+ERLDR      W  ++P+ +VNH     SD
Sbjct: 239  SCLEACCLEDLGYVGGKYTWCNNRREGVIFERLDRAVVDGNWRSLFPSAIVNHDVATVSD 298

Query: 181  HRPIELVLSPQPGCWRNPSQRITRFDETWLKRADLQQLVRDSWGLSREDPGLSAPQILAQ 240
            H PI +   P     R  + +  RF+E W +   L+ +V+  W   R D   +  +I+  
Sbjct: 299  HIPIVVDCLPYDVGGR--TNKRFRFEEMWTRHEGLEVVVKQGWEGGRGD---AVDKIVG- 358

Query: 241  VSKRCMRSMAGWGRSRMGNFPQRISEANQKVQLAIEGLRGAGSRELLSQAEAQLEDVLQE 300
                    +  W RS  G+  +R+   ++++    +       RE ++  ++++ ++L  
Sbjct: 359  ----LSFDLTSWDRSVFGSVNRRLKIKHRRLGWLQQQQSSEEIREEINYLKSEINELLGR 418

Query: 301  EELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIRGLTDDQGEWRQDKTMILQLVND 360
            EEL W+QRSR  WL++GDQNT++FHR+AS R + NRI G+ D  G+W Q +  +  ++ +
Sbjct: 419  EELMWRQRSRVEWLQQGDQNTKFFHRKASQRHKKNRIEGIEDVSGDWIQQEGHVNAIIVN 478

Query: 361  YFQQLFSTSEPSEQDFDISLRDIQRSVDNEMNVELLRPFTENEILRALKQSHPHKAPGPD 420
            +F+ LF +  P   D   +L  I   +  +MN  L+R   E+E+ +AL + HP KAPGPD
Sbjct: 479  FFETLFMSDSP--PDCGPALMGIPEVISQQMNEVLVRSPDEDEVKKALFEMHPTKAPGPD 538

Query: 421  GLSGSFYKNHWSIVGPSVVQSCLAVLNHGCSPVSINDTMIVLIPKIKVPRRVSDFRPISL 480
            G+   F++ +W +V   VV         G     +NDT I LIPK+  P+RV+DFRPISL
Sbjct: 539  GMPPLFFQKYWEMVKLDVVNLVQGFFRSGQMKEHVNDTHICLIPKVHNPKRVADFRPISL 598

Query: 481  CNFSYKLISKAVVNRMKHILPKLISPNQSAFVAGRCVVDNAILGFECIHELRRRTGGKSK 540
            CN  YK+ISK + NR+K ++P L+S +QSAFV GR + DN ++ +E +H L+ R  G++ 
Sbjct: 599  CNVLYKVISKVLANRLKPLMPLLVSSSQSAFVKGRLISDNILIAYEVMHYLKTRCSGQNS 658

Query: 541  WAALKLDMSKAYDRIEWSFLRSVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIP 600
            + ALKLDM+KAYDR+EW FL SVM  MGF  +W  LI+ CV +V +S  +NG   G+ IP
Sbjct: 659  YIALKLDMTKAYDRVEWGFLLSVMRAMGFNDKWVGLIMECVRTVHYSVIVNGSSCGSWIP 718

Query: 601  SRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFF 660
             RGLRQGDPLSPYLF++CAE LS +L  A    L+ G RVAR +P I+HLFFADDSLLF 
Sbjct: 719  GRGLRQGDPLSPYLFVICAEALSRMLVVAHNNQLVHGVRVARGAPSITHLFFADDSLLFC 778

Query: 661  KANVNEAVTIRDLLICYERASGQVINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHQR 720
            +AN +E   + +++  YE+ SGQ++N +KS   FS NT   +   I  +LSVS    H R
Sbjct: 779  RANKDECRRLLEVIHGYEKFSGQLVNIQKSNAFFSRNTSTQAAGEILRILSVSEAKLHDR 838

Query: 721  YLGLPSFMPKNRSGTLMFIKDRVWKQIQGWKGKFFSLGGKEVLLKSIIQAIPCYTMNCFR 780
            YLGLP+ + +++     +I+DR+ K++ GW  K  S GGKE+++KS++QAIP Y M+CF+
Sbjct: 839  YLGLPAMIGRSKREVFSYIRDRITKRLNGWNEKMLSRGGKEIMIKSVLQAIPSYAMSCFK 898

Query: 781  LPRCLIREIHRAMARFWWNESEEGKRIHWVSWDHMCRPKCMGGLGFRNMELFNQALLAKQ 840
            LPR + + I + MA FWW  +E   +IHW SWD + + K  GGLGFR++E FN ALL K 
Sbjct: 899  LPRSITQWIMKKMAGFWWGSTEGHNKIHWASWDLLTKSKRNGGLGFRDLECFNDALLGKN 958

Query: 841  CWRVIQDPESLLGAVLKGRYFPHSEFWEASLGHRPSFIWRSLLWGRELLVRGCRWRIGNG 900
             WR+++ P SL+  V K RYFP+ +  +A LG  PSF WRSL     L+ RG RWRIG+G
Sbjct: 959  IWRMLKFPNSLVARVFKSRYFPNCDVLQAGLGTNPSFTWRSLWQAIGLVKRGVRWRIGSG 1018

Query: 901  RSIPIYGSNWVPDNPSLRVQSAPSLPL--SSRVCDLFSPSGQWDEAKVRAHFLGPECEAI 960
              + I    W+P   + +  S  S  +  +S    +     QW+   VR+ FL  E + I
Sbjct: 1019 LGLQIGVDPWLPTPHNFKPFSPMSAVVHGASVASLIVDGPRQWNVELVRSIFLPFEADTI 1078

Query: 961  LRIPLRSGLLEDRLIWHFEKHGVFSVKSGYRLAF---------SLASQGVRL------LL 1020
            L IPL      D ++WH+E  G +SVKS Y  A          + +S G R+      L 
Sbjct: 1079 LSIPLTRERSMDVIMWHWESGGHYSVKSAYHQALIWKLQNCPTTSSSYGDRVGLKWHKLW 1138

Query: 1021 SLS-PGGFGGLVYGDLGSRISTRFSY---GVSSWNGC-PLSSVVEDGLHLFWKCAVTREM 1080
            S+  P     L++  + + + T  +     VSS+ GC    + +ED  H+F +C    ++
Sbjct: 1139 SIQVPSKIKHLLWSMINNALPTNSNLCKRHVSSFAGCLGCGATLEDAEHVFRRCNYASQI 1198

Query: 1081 WLCSKFSQLYQSLYHLDLVDVIWALREKLGALDFELVTVFWWSVWNLRNNLCWRGESDGR 1140
            WL        +    L+L D + A+     + +  L     W +W  RN+L   G   G+
Sbjct: 1199 WLLLSSGLFMKLGRDLNLYDWVEAVLNSCSSSELILFATTIWVLWFERNSLLHEGRRTGQ 1258

Query: 1141 DLWSWSEEYLRAYYDVVGRRESRCSLQP---CPRRPAEQSSWTPPVGGGFKLNTDASVRP 1200
            D       +++  Y      E+ CS QP     R     S+W+ P  G  KLN D +V  
Sbjct: 1259 DF------FMQKVYRFSTEFEA-CSGQPRKVMQRNEGINSAWSGPGPGMIKLNIDGAVFL 1318

Query: 1201 DTGEAGGGCVLRDMSGAVLLAACLDLPRCWSVDLAEGWALVKGVELALQMGFLSFCVEVD 1260
            +    G G +LRD  G+ LL    ++        AE  A+ + +    + G+  F +E+D
Sbjct: 1319 ENDAIGVGAILRDQHGSPLLCFSENVAGRVDPTFAELLAVNRTLTCIEERGYHDFILELD 1378

Query: 1261 SLRLVRILHGEVIDSSEVGLLMDDVRRLLHPCGRGKVLFTPRNGNRVAHALACLT-FSYS 1317
            S  +V+ L  +    S +G  + D + ++   G  K     R+GN VAH LA +  F + 
Sbjct: 1379 SSNIVQALGSDDWLDSRMGHFVSDTKAIMARLGVIKCQHVCRSGNEVAHTLARMAKFRHG 1438

BLAST of Lag0038334 vs. NCBI nr
Match: XP_030508852.1 (uncharacterized protein LOC115723496 [Cannabis sativa])

HSP 1 Score: 909.4 bits (2349), Expect = 3.7e-260
Identity = 503/1338 (37.59%), Positives = 735/1338 (54.93%), Query Frame = 0

Query: 5    KRALGFENGFCVDSKGKSGGLALLWDASVTFSLLSFSNNHIDGWITWDD-YHWRLTGFYG 64
            + +L F NG  V  +G  GGL LLW +SV+ S+ +FS NHID +I  +D   +  TGFYG
Sbjct: 48   RHSLRFPNGIEVPRQGLGGGLMLLWKSSVSVSINNFSTNHIDCFIEINDGPSFHFTGFYG 107

Query: 65   FPAADMRDQTWSLLSKLRGGSD-TPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVID 124
             P+   R  TW++L +    +  TPWL+ GDFN +L   +K GG  +  S++ AF+  + 
Sbjct: 108  HPSISQRHHTWTMLKRCYDIAPLTPWLVLGDFNEILSHEDKVGGTMRNFSQIEAFRATVT 167

Query: 125  SCALLDLGFVGNRFTWCNR-RPDGTIYERLDRCFSSATWHDIYPNCVVNHLDYHQSDHRP 184
             C L  L F G+R TW ++    G + ERLD  F +  W   +  C V HLDY++SDHR 
Sbjct: 168  RCCLNSLPFEGDRITWSSQSNGSGPLKERLDYGFVNDLWEATFQACTVQHLDYYKSDHRA 227

Query: 185  IELVLSPQPGCWRNPSQRI-----TRFDETWLKRADLQQLVRDSWGLSREDPGLSAPQIL 244
            I+++++         +Q I      RF++ WL+      L+ D+W  S E     A Q+ 
Sbjct: 228  IKVLVA----ALNEQAQNIHFKSRFRFEKIWLQEDQCASLIFDNW--STESSNCIA-QVT 287

Query: 245  AQVSKRCMRSMAGWGRSRMGNFPQRISEANQKVQLAIEGLRGAGSR-ELLSQAEAQLEDV 304
            A +S      +  W  S  G   Q+I +  + V       R      + +  +E  L+++
Sbjct: 288  ANIS-TISSKLQSWHHSTFGQLKQQIKDTQKHVSTLHNSRRNDSDHLQAVQTSEQILDEL 347

Query: 305  LQEEELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIRGLTDDQGEWRQDKTMILQL 364
            L +EE YW QRSR  WL+ GD NT++FH+ A+ R++ N+IR L D  G  + +   +L +
Sbjct: 348  LAKEEDYWHQRSRISWLQSGDSNTKFFHQHATSRRKNNQIRKLIDVNGNTQTNPPAVLHI 407

Query: 365  VNDYFQQLFSTSEPSEQDFDISLRDIQRSVDNEMNVELLRPFTENEILRALKQSHPHKAP 424
            ++DY+  LF++    ++  DI L  I  ++D+     +  PFT  ++  ALK     K+P
Sbjct: 408  ISDYYNDLFTSRGADQESLDIILDSIPSTLDDTARTFISAPFTAADVYDALKTMSDDKSP 467

Query: 425  GPDGLSGSFYKNHWSIVGPSVVQSCLAVLNHGCSPVSINDTMIVLIPKIKVPRRVSDFRP 484
            G DG+S  FY N+W IVGP V  + L VLN+G  P S N T++ LIPK+K P ++S +RP
Sbjct: 468  GIDGMSVMFYTNYWHIVGPLVTAAVLNVLNNGADPSSFNSTLVTLIPKVKKPSQISQYRP 527

Query: 485  ISLCNFSYKLISKAVVNRMKHILPKLISPNQSAFVAGRCVVDNAILGFECIHELRRRTGG 544
            ISLCN  YKL+SKA+V R+K  L ++IS  QSAF++ R + DN ++ FE +H L+ R  G
Sbjct: 528  ISLCNVLYKLVSKAIVMRLKPFLSQVISEYQSAFLSQRLITDNILVAFELLHSLKNRKRG 587

Query: 545  KSKWAALKLDMSKAYDRIEWSFLRSVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGN 604
               +AA+KLDMSKA+DR+EW F+  VM +MGF     +LILRC+ SVS+SF LNG   G 
Sbjct: 588  SKGFAAIKLDMSKAFDRVEWHFVAQVMIKMGFGTVMVELILRCLQSVSYSFLLNGTIQGQ 647

Query: 605  VIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSL 664
            VIPSRG+RQGDPLSPYLFL+CAEGLS LL+  E    + G +++RS+P +SHLFFADDS+
Sbjct: 648  VIPSRGIRQGDPLSPYLFLICAEGLSRLLQYEELAGSLEGLKISRSAPSVSHLFFADDSV 707

Query: 665  LFFKANVNEAVTIRDLLICYERASGQVINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPC 724
            LF +AN   A  I   LI Y RASGQVIN EK V++FS NT +  Q +   +L +   PC
Sbjct: 708  LFCRANQQSARAIHRCLITYSRASGQVINPEKCVLSFSENTRQHEQIFFKDLLGMPIQPC 767

Query: 725  HQRYLGLPSFMPKNRSGTLMFIKDRVWKQIQGWKGKFFSLGGKEVLLKSIIQAIPCYTMN 784
            H++YLGLPSF  KN+      I D++WK +  WK   FS GGKEVLLK+++QAIP Y M+
Sbjct: 768  HEQYLGLPSFSGKNKKQLFGGITDKIWKLLSSWKEHLFSAGGKEVLLKAVVQAIPTYAMS 827

Query: 785  CFRLPRCLIREIHRAMARFWWNESEEGKRIHWVSWDHMCRPKCMGGLGFRNMELFNQALL 844
            CFRLP  L  +I   MARFWW  +  GK IHW +W+ +C+ K  GGLGFRN   FNQALL
Sbjct: 828  CFRLPVTLCHQIESMMARFWWGSTATGKTIHWKNWNFLCKAKVQGGLGFRNFIHFNQALL 887

Query: 845  AKQCWRVIQDPESLLGAVLKGRYFPHSEFWEASLGHRPSFIWRSLLWGRELLVRGCRWRI 904
            AKQ WR+++ P SLL  +L+ RYF +  +  A LG  PS  WRSL+WG+ELL++G RWR+
Sbjct: 888  AKQAWRILEFPNSLLSNLLRHRYFSNGNYLIAGLGSNPSLTWRSLVWGKELLLKGLRWRV 947

Query: 905  GNGRSIPIYGSNWVPDNPSLRVQSAPSLPLSSRVCDLFSPSGQWDEAKVRAHFLGPECEA 964
            G+G  I     +W+P + + +         +  V DL +    WD   +  +F   +   
Sbjct: 948  GSGERINCKTDSWLPGHTTFKPYFFKGPDPNLLVADLITEHRTWDMISLETNFNQADINR 1007

Query: 965  ILRIPLRSGLLEDRLIWHFEKHGVFSVKSGYRLAFSLASQG---------------VRLL 1024
            +L IPL     +D LIW+    GV++VKSGY  A SLA Q                 +L 
Sbjct: 1008 VLSIPLSPYPHDDVLIWNQSFTGVYNVKSGYHFAVSLAEQDDSTCSNSIEHWWSNFWKLK 1067

Query: 1025 LSLSPGGFGGLVYGDLGSRISTRFSYGVSSWNGCPLSSVVEDGL-HLFWKCAVTREMWLC 1084
            L      F   V+       +  +   ++    C + +  E+ + H  + C   + +W  
Sbjct: 1068 LPPKVRIFVWKVFHTSLPVAAELYRRHIAFSPYCTICNSCEETVHHALFSCPRAKAVWEL 1127

Query: 1085 SKFSQLYQSLYHLDLVDVIWALREKLGALDFELVTVFWWSVWNLRNNLCWRGES--DGRD 1144
            S FS  +Q++      D +  L   L + + EL  V  WS+W+ RN + + G S      
Sbjct: 1128 SNFSIDFQTIERSSTADTLLLLSTSLSSSELELFLVLCWSIWHERNAI-YHGNSVRTPAA 1187

Query: 1145 LWSWSEEYLRAYYDVVGR--RESRCSLQPCPRRPAEQ----SSWTPPVGGGFKLNTDASV 1204
            + +++  YL  +     +  +    S    P RP+ +      WT P  G  KLNTDA++
Sbjct: 1188 VAAYAPSYLTEFQQARAKNAKPVTASGAATPSRPSSEFIHAPKWTTPPRGRLKLNTDAAI 1247

Query: 1205 RPDTGEAGGGCVLRDMSGAVLLAACLDLPRCWSVDLAEGWALVKGVELALQMGFLSFCVE 1264
              +    G G VLR+  G ++ A        +  +  E   L   +   L        +E
Sbjct: 1248 DKERNTIGIGAVLRNSDGIIVAALSKPFRGNFKAEEMEALGLALSLNWLLSHNLSVDFIE 1307

Query: 1265 VDSLRLVRILHGEVIDSSEVGLLMDDVRRLLHPCGRGKVLFTPRNGNRVAHALACLTFSY 1309
             DSL +V+ L       S    L++++  L+    R ++    R+ N  AH LA    + 
Sbjct: 1308 TDSLLVVQGLKTSHSFLSAFHALLNNINYLVSFFPRAQIDHVSRSANTYAHTLAKFALTV 1367

BLAST of Lag0038334 vs. NCBI nr
Match: CAB4263564.1 (unnamed protein product [Prunus armeniaca])

HSP 1 Score: 902.9 bits (2332), Expect = 3.5e-258
Identity = 519/1323 (39.23%), Positives = 727/1323 (54.95%), Query Frame = 0

Query: 1    MASAKRALGFENGFCVDSKGKSGGLALLWDASVTFSLLSFSNNHIDGWITWDDY-HWRLT 60
            M      LG     CV   G SGGL LLW   +   LLS S  HID  +T + Y  +R+T
Sbjct: 1    MGKLHTRLGLGGVVCVPRVGFSGGLCLLWQVGLQVDLLSSSPGHIDVRVTMNTYATFRVT 60

Query: 61   GFYGFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQN 120
            GFYG P    R  +W LL +L      PWL  GDFN ++  +EK G R +  +++  F+ 
Sbjct: 61   GFYGHPDQTQRHHSWELLRRLGRVDLGPWLCCGDFNEVMECNEKSGNRLRRDAQMEDFKM 120

Query: 121  VIDSCALLDLGFVGNRFTWCNRRPDGTIYE-RLDRCFSSATWHDIYPNCVVNHLDYHQSD 180
             I  C L    F G  FTW N+R D    E RLDR F +      + N   +HL    SD
Sbjct: 121  AITDCCLFQFEFTGYPFTWSNKRKDTAHVEARLDRGFGNLALLQHWGNFTSHHLVAFSSD 180

Query: 181  HRPIELVLS-PQPGCWRNP-SQRITRFDETWLKRADLQQLVRDSWGLSREDPGLSAPQIL 240
            H PI +    PQ    R+P  +R   F+E W    D +++VR SW         +A   L
Sbjct: 181  HHPILIASDRPQGDKARDPRGRRRFHFEEVWTTEVDCEEVVRQSW--------QNAVSPL 240

Query: 241  AQVSKRCMRSMAGWGRSRMGNFPQRISEANQKVQLAIEGLRGAGSRELLSQAEAQLEDVL 300
            + ++  C  +++ W   + G  P+++ E   ++           +    S  E +L+  L
Sbjct: 241  SNIA-NCASNLSRWCAEKGGQVPKKVKELRLRLASLQSDEPSTQTFHTRSLIETELDKCL 300

Query: 301  QEEELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIRGLTDDQGEWRQDKTMILQLV 360
            ++EE+YW QRSR  WL+ GD+NT +FH+QA+ R++ N + G+ D+   W+ +   I  + 
Sbjct: 301  EQEEIYWHQRSRVQWLQHGDRNTSFFHKQATSRRKKNALVGILDENDRWQSENDKIGGVF 360

Query: 361  NDYFQQLFSTSEPSEQDFDISLRDIQRSVDNEMNVELLRPFTENEILRALKQSHPHKAPG 420
             ++F  LF TS+    D ++    +Q  V +     LL P++ +EI  AL    P KAPG
Sbjct: 361  VEFFTNLF-TSDMGVADVEV-FSAVQARVSSRSYHNLLLPYSRDEIEVALNFIGPSKAPG 420

Query: 421  PDGLSGSFYKNHWSIVGPSVVQSCLAVLNHGCSPVSINDTMIVLIPKIKVPRRVSDFRPI 480
            PDG+   FY+ +WSIVGP V   CL VLN        N T++ LIPK+  P RVS++RPI
Sbjct: 421  PDGMPALFYQKYWSIVGPDVSDLCLRVLNGSDGVNDFNHTLVALIPKVNSPTRVSEYRPI 480

Query: 481  SLCNFSYKLISKAVVNRMKHILPKLISPNQSAFVAGRCVVDNAILGFECIHELRRRTGGK 540
            SLCN  YK+ISK + NR+K +LP++IS  QSAF+  R ++DN +  FE +H L+R     
Sbjct: 481  SLCNVLYKIISKTLANRLKKVLPEVISEFQSAFIPNRMILDNVLAAFETVHCLKRWGKTG 540

Query: 541  SKWAALKLDMSKAYDRIEWSFLRSVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNV 600
             K   LKLDM+KAYDR+E  FL  ++  MGF  ++  LI+ CV++VS+S  + G   G +
Sbjct: 541  KKKLILKLDMAKAYDRVERKFLEQMLRTMGFPIRFIQLIMGCVTTVSYSLLIQGRPFGRI 600

Query: 601  IPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLL 660
            IPSRGLRQGDP+SPYLFL+ AE  S+LL+ AER + + G  +A S+P I+HLFFADDSLL
Sbjct: 601  IPSRGLRQGDPISPYLFLIVAEAFSALLQQAERDSRLHGVSIAPSAPSINHLFFADDSLL 660

Query: 661  FFKANVNEAVTIRDLLICYERASGQVINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCH 720
            F  A   EA+ ++ +   YE ASGQ +N  KS + FSP+T    Q  I  +L+V+  PCH
Sbjct: 661  FCNAGTTEALELKRIFGVYESASGQKVNLGKSALCFSPSTPRVLQDDIRQLLNVTIVPCH 720

Query: 721  QRYLGLPSFMPKNRSGTLMFIKDRVWKQIQGWKGKFFSLGGKEVLLKSIIQAIPCYTMNC 780
            +RYLGLP+ + K++      +KDRVW ++ GW+GK  S  GKEVL+KS+ QAIP Y+M+ 
Sbjct: 721  ERYLGLPTIVGKDKKKLFRTVKDRVWNKVNGWQGKLLSKAGKEVLIKSVCQAIPSYSMSV 780

Query: 781  FRLPRCLIREIHRAMARFWWNESEEGKRIHWVSWDHMCRPKCMGGLGFRNMELFNQALLA 840
            FRLP  L REI   +A+FWW+++ +G+ IHW +W  MC+ K  GG+GFR +  FNQALL 
Sbjct: 781  FRLPVGLCREIESIIAKFWWSKN-DGRGIHWKTWRFMCQHKSDGGIGFRELTSFNQALLC 840

Query: 841  KQCWRVIQDPESLLGAVLKGRYFPHSEFWEASLGHRPSFIWRSLLWGRELLVRGCRWRIG 900
            KQ WR+++ P SL+  + K RYFPHS+F  AS G  PSF W+SLLWGR+LL  G RWRIG
Sbjct: 841  KQGWRLLEFPNSLIARMFKARYFPHSDFLAASSGSLPSFTWQSLLWGRDLLRLGLRWRIG 900

Query: 901  NGRSIPIYGSNWVPDNPSLRVQSAPSLPLSSRVCDLFSPSGQWDEAKVRAHFLGPECEAI 960
            +GR + IYG  WVP +    +QS P+LP++SRVCDLF+ SG WD  KV A F  PE EAI
Sbjct: 901  DGRLVNIYGDPWVPYDRFFTIQSIPTLPVTSRVCDLFTASGGWDVRKVFASFSFPEAEAI 960

Query: 961  LRIPLRSGLLEDRLIWHFEKHGVFSVKSGYRLAFSLASQGVRLLLSLSPGGFGGLVYGDL 1020
            L IPL    L DR IW+F K+G +SVKSGY  A        + L  LS GG  G      
Sbjct: 961  LSIPLMGDTL-DRRIWNFTKNGRYSVKSGYWAALE-----YKRLEELSTGGVAG------ 1020

Query: 1021 GSRISTRFSYGVSSWNGCPLSSVVEDGLHLFWKCAVTREMWLCSKFSQLYQSLYHLDLVD 1080
                    S  + SW       V +  LHL W+ A               Q +  L   +
Sbjct: 1021 ------PSSSSLKSWKHLWKLKVPQKILHLLWRVA---------------QDI--LPSKE 1080

Query: 1081 VIWALREKLGALDFELVTVFWWSVWNLRNNLCWRGESDGRDLWSWSEEYLRAYYDVV-GR 1140
            V++  R   G +         W   +  +N      +   D+ +W +    A + ++   
Sbjct: 1081 VLFRRRITQGEV---------WEALDFPSNFLLPTLA---DVGTWMD----AIWSIIPPD 1140

Query: 1141 RESRCSLQPCPRRPAEQSSWTPPVGGGFKLNTDASVRPDTGEAGGGCVLRDMSGAVLLAA 1200
            ++S  +       P     W PP G  FKLN D +   +TG  G G ++RD  G ++ A 
Sbjct: 1141 KQSLFAFTVSLSSPVCDIKWRPPTGNCFKLNVDGATDMETGARGAGAIVRDSHGKLVGAL 1200

Query: 1201 CLDLPRCWSVDLAEGWALVKGVELALQMGFLSFCVEVDSLRLVRILHGEVIDSSEVGLLM 1260
             +  P   SV   E +AL  G+  AL M  +   +E DSL+ V +++ E    +  G L+
Sbjct: 1201 AMRAPSRISVLATELYALKVGISFALDMSPVPLEIESDSLQAVSMVNSEEECLAAEGGLV 1257

Query: 1261 DDVRRLLHPCGRGKVLFTPRNGNRVAHALACLTF-SYSGCVWLEEWPMEIAAVLAGDVAL 1318
            D VRRLL       V   PR  N+ AH +A  +    S  +WL+  P+ +   +  D   
Sbjct: 1261 DGVRRLLVRSASTAVRHIPRQANKAAHRIARFSLRDQSLSLWLDVGPLWLMDAVYDD--- 1257

BLAST of Lag0038334 vs. ExPASy Swiss-Prot
Match: P0C2F6 (Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1g65750 PE=3 SV=1)

HSP 1 Score: 194.9 bits (494), Expect = 6.2e-48
Identity = 163/601 (27.12%), Positives = 253/601 (42.10%), Query Frame = 0

Query: 722  LPSFMPKNRSGTLMFIKDRVWKQIQGWKGKFFSLGGKEVLLKSIIQAIPCYTMNCFRLPR 781
            +P    +    T   I +RV  ++ GW+ K  S  G+  L K+++ ++P ++M+   LP+
Sbjct: 1    MPVLQKRINKDTFGEILERVSSRMSGWREKTLSFAGRLTLTKAVLSSMPVHSMSTILLPQ 60

Query: 782  CLIREIHRAMARFWWNESEEGKRIHWVSWDHMCRPKCMGGLGFRNMELFNQALLAKQCWR 841
             ++  + +    F W  + E K+ H V W  +C PK  GGLG R  +  N+AL++K  WR
Sbjct: 61   SILNRLDQLSRTFLWGSTAEKKKQHLVKWSKVCSPKKEGGLGVRAAKSMNRALISKVGWR 120

Query: 842  VIQDPESLLGAVLKGRYFP---HSEFWEASLGHRPSFIWRSLLWG-RELLVRGCRWRIGN 901
            ++Q+  SL   VL+ +Y         W    G   S  WRS+  G R+++  G  W  G+
Sbjct: 121  LLQEKNSLWTLVLQKKYHVGEIRDSRWLIPKGSWSS-TWRSIAIGLRDVVSHGVGWIPGD 180

Query: 902  GRSIPIYGSNWVPDNPSLRVQSAPSLPLSSRVC--DLFSPSGQWDEAKVRAHFLGPECEA 961
            G+ I  +   WV   P L + +         V   DL+ P   WD AK+      P    
Sbjct: 181  GQQIRFWTDRWVSGKPLLELDNGERPTDCDTVVAKDLWIPGRGWDFAKI-----DPYTTN 240

Query: 962  ILRIPLRSGLLE------DRLIWHFEKHGVFSVKSGYRL------------AFSLASQGV 1021
              R+ LR+ +L+      DRL W F + G FSV+S Y +            +F      V
Sbjct: 241  NTRLELRAVVLDLVTGARDRLSWKFSQDGQFSVRSAYEMLTVDEVPRPNMASFFNCLWKV 300

Query: 1022 RLLLSLSPGGFGGLVYGDLGSRISTRFSYGVSSWNGCPL-SSVVEDGLHLFWKCAVTREM 1081
            R+   +    F  LV          R    +S+ N C +    VE  LH+   C     +
Sbjct: 301  RVPERVKT--FLWLVGNQAVMTEEERHRRHLSASNVCQVCKGGVESMLHVLRDCPAQLGI 360

Query: 1082 WLCSKFSQLYQSLYHLDLVDVIW-ALREKLGALDFELVTVF----WWSVWNLRNNLCWRG 1141
            W+     +  Q  +   L + ++  L ++ G  D    T+F    WW  W  R    +  
Sbjct: 361  WVRVVPQRRQQGFFSKSLFEWLYDNLGDRSGCEDIPWSTIFAVIIWWG-WKWRCGNIFGE 420

Query: 1142 ESDGRDLWSWSEEYLRAYYDVVGRRESRCSLQPCPRRPAEQSSWTPPVGGGFKLNTDASV 1201
             +  RD   + +E+    Y            QP   R      W  P  G  K+NTD + 
Sbjct: 421  NTKCRDRVKFVKEWAVEVYRAHSGNVLVGITQP---RVERMIGWVSPCVGWVKVNTDGAS 480

Query: 1202 RPDTGEAGGGCVLRDMSGAVLLAACLDLPRCWSVDLAEGWALVKGVELALQMGFLSFCVE 1261
            R + G A  G VLRD +GA      L++ RC S   AE W +  G+  A +       +E
Sbjct: 481  RGNPGLASAGGVLRDCTGAWCGGFSLNIGRC-SAPQAELWGVYYGLYFAWEKKVPRVELE 540

Query: 1262 VDSLRLVRILHGEVIDSSEVGLLMDDVRRLLHPCGRGKVLFTPRNGNRVAHALACLTFSY 1293
            VDS  +V  L   + DS  +  L+      L      +++   R  NR+A  LA   FS 
Sbjct: 541  VDSEVIVGFLKTGISDSHPLSFLVRLCHGFLQKDWLVRIVHVYREANRLADGLANYAFSL 588

BLAST of Lag0038334 vs. ExPASy Swiss-Prot
Match: P14381 (Transposon TX1 uncharacterized 149 kDa protein OS=Xenopus laevis OX=8355 PE=4 SV=1)

HSP 1 Score: 186.8 bits (473), Expect = 1.7e-45
Identity = 194/766 (25.33%), Positives = 322/766 (42.04%), Query Frame = 0

Query: 84  SDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCALLDLGFVGN----RFTWC 143
           SD   +IGGDFN  L   ++   + +  SE +  + +I   +L+D+    N     FT+ 
Sbjct: 134 SDEALIIGGDFNYTLDARDRNVPKKRDSSE-SVLRELIAHFSLVDVWREQNPETVAFTYV 193

Query: 144 NRRPDGTIYERLDRCFSSATWHDIYPNCVVNHLDYHQSDHRPIELVLSPQPGCWRNPSQR 203
             R       R+DR + S+       +  +    +  SDH  + L +S  P     P   
Sbjct: 194 RVRDGHVSQSRIDRIYISSHLMSRAQSSTIRLAPF--SDHNCVSLRMSIAPSL---PKAA 253

Query: 204 ITRFDETWLKRADLQQLVRDSW-GLSREDPGLSAPQILAQVSKRCMRSMA-GWGRSRMGN 263
              F+ + L+     + VRD+W G        +       V K  ++ +   + +S  G 
Sbjct: 254 YWHFNNSLLEDEGFAKSVRDTWRGWRAFQDEFATLNQWWDVGKVHLKLLCQEYTKSVSGQ 313

Query: 264 FPQRISEANQKVQLAIEGLRGAGSR----ELLSQAEAQLEDVLQEEELYWKQRSREVWLK 323
               I   N +V    + L G+  +    E L + EA L ++ Q +      RSR   L 
Sbjct: 314 RNAEIEALNGEVLDLEQRLSGSEDQALQCEYLERKEA-LRNMEQRQARGAFVRSRMQLLC 373

Query: 324 EGDQNTRWFHRQASYRQRLNRIRGLTDDQGEWRQDKTMILQLVNDYFQQLFSTSEPSEQD 383
           + D+ +R+F+     +    +I  L  + G   +D   I      ++Q LFS  +P   D
Sbjct: 374 DMDRGSRFFYALEKKKGNRKQITCLFAEDGTPLEDPEAIRDRARSFYQNLFS-PDPISPD 433

Query: 384 FDISLRDIQRSVDNEMNVELLRPFTENEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVG 443
               L D    V       L  P T +E+ +AL+    +K+PG DGL+  F++  W  +G
Sbjct: 434 ACEELWDGLPVVSERRKERLETPITLDELSQALRLMPHNKSPGLDGLTIEFFQFFWDTLG 493

Query: 444 PSVVQSCLAVLNHGCSPVSINDTMIVLIPKIKVPRRVSDFRPISLCNFSYKLISKAVVNR 503
           P   +        G  P+S    ++ L+PK    R + ++RP+SL +  YK+++KA+  R
Sbjct: 494 PDFHRVLTEAFKKGELPLSCRRAVLSLLPKKGDLRLIKNWRPVSLLSTDYKIVAKAISLR 553

Query: 504 MKHILPKLISPNQSAFVAGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRI 563
           +K +L ++I P+QS  V GR + DN  L  + +H   RRTG     A L LD  KA+DR+
Sbjct: 554 LKSVLAEVIHPDQSYTVPGRTIFDNVFLIRDLLH-FARRTG--LSLAFLSLDQEKAFDRV 613

Query: 564 EWSFLRSVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLF 623
           +  +L   +    F  Q+   +    +S      +N      +   RG+RQG PLS  L+
Sbjct: 614 DHQYLIGTLQAYSFGPQFVGYLKTMYASAECLVKINWSLTAPLAFGRGVRQGCPLSGQLY 673

Query: 624 LLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFKANVNEAVTIRDLLI 683
            L  E    LL     R  ++G  +      +    +ADD +L  + ++ +    ++   
Sbjct: 674 SLAIEPFLCLL-----RKRLTGLVLKEPDMRVVLSAYADDVILVAQ-DLVDLERAQECQE 733

Query: 684 CYERASGQVINYEKS--------VVAFSPNTGEDSQ------QYISHVLSVSRCPCHQRY 743
            Y  AS   IN+ KS         V F P    D        +Y+   LS    P  Q +
Sbjct: 734 VYAAASSARINWSKSSGLLEGSLKVDFLPPAFRDISWESKIIKYLGVYLSAEEYPVSQNF 793

Query: 744 LGLPSFMPKNRSGTLMFIKDRVWKQIQGWKG--KFFSLGGKEVLLKSIIQAIPCYTMNCF 803
           + L               ++ V  ++  WKG  K  S+ G+ +++  ++ +   Y + C 
Sbjct: 794 IEL---------------EECVLTRLGKWKGFAKVLSMRGRALVINQLVASQIWYRLICL 853

Query: 804 RLPRCLIREIHRAMARFWWNESEEGKRIHWVSWDHMCRPKCMGGLG 824
              +  I +I R +  F W     GK  HWVS      P   GG G
Sbjct: 854 SPTQEFIAKIQRRLLDFLW----IGK--HWVSAGVSSLPLKEGGQG 861

BLAST of Lag0038334 vs. ExPASy Swiss-Prot
Match: P11369 (LINE-1 retrotransposable element ORF2 protein OS=Mus musculus OX=10090 GN=Pol PE=1 SV=2)

HSP 1 Score: 169.5 bits (428), Expect = 2.8e-40
Identity = 132/520 (25.38%), Positives = 236/520 (45.38%), Query Frame = 0

Query: 332 LNRIRGLTDDQGEWRQDKTMILQLVNDYFQQLFSTSEPSEQDFDISLRDIQRSVDNEMNV 391
           +N+IR   +++G+   D   I   +  ++++L+ST   +  + D  L   Q    N+  V
Sbjct: 397 INKIR---NEKGDITTDPEEIQNTIRSFYKRLYSTKLENLDEMDKFLDRYQVPKLNQDQV 456

Query: 392 ELLR-PFTENEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVVQSCLAVLNHGCSP 451
           + L  P +  EI   +      K+PGPDG S  FY+     + P + +    +   G  P
Sbjct: 457 DHLNSPISPKEIEAVINSLPTKKSPGPDGFSAEFYQTFKEDLIPILHKLFHKIEVEGTLP 516

Query: 452 VSINDTMIVLIPK-IKVPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISPNQSAF 511
            S  +  I LIPK  K P ++ +FRPISL N   K+++K + NR++  +  +I P+Q  F
Sbjct: 517 NSFYEATITLIPKPQKDPTKIENFRPISLMNIDAKILNKILANRIQEHIKAIIHPDQVGF 576

Query: 512 VAGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWSFLRSVMDRMGFAQ 571
           + G     N       IH + +          + LD  KA+D+I+  F+  V++R G   
Sbjct: 577 IPGMQGWFNIRKSINVIHYINKLK--DKNHMIISLDAEKAFDKIQHPFMIKVLERSGIQG 636

Query: 572 QWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAER 631
            + ++I    S    +  +NGE+L  +    G RQG PLSPYLF +  E L+  +R   +
Sbjct: 637 PYLNMIKAIYSKPVANIKVNGEKLEAIPLKSGTRQGCPLSPYLFNIVLEVLARAIR---Q 696

Query: 632 RALISGFRVARSSPPISHLFFADDSLLFFKANVNEAVTIRDLLICYERASGQVINYEKSV 691
           +  I G ++ +    IS L  ADD +++     N    + +L+  +    G  IN  KS 
Sbjct: 697 QKEIKGIQIGKEEVKISLL--ADDMIVYISDPKNSTRELLNLINSFGEVVGYKINSNKS- 756

Query: 692 VAFSPNTGEDSQQYISHVLSVSRCPCHQRYLG--LPSFMPKNRSGTLMFIKDRVWKQIQG 751
           +AF     + +++ I      S    + +YLG  L   +          +K  + + ++ 
Sbjct: 757 MAFLYTKNKQAEKEIRETTPFSIVTNNIKYLGVTLTKEVKDLYDKNFKSLKKEIKEDLRR 816

Query: 752 WKGKFFSLGGKEVLLKSIIQAIPCYTMNC--FRLPRCLIREIHRAMARFWWNESEEGKRI 811
           WK    S  G+  ++K  I     Y  N    ++P     E+  A+ +F WN  +     
Sbjct: 817 WKDLPCSWIGRINIVKMAILPKAIYRFNAIPIKIPTQFFNELEGAICKFVWNNKKPR--- 876

Query: 812 HWVSWDHMCRPKCMGGLGFRNMELFNQALLAKQCWRVIQD 846
             ++   +   +  GG+   +++L+ +A++ K  W   +D
Sbjct: 877 --IAKSLLKDKRTSGGITMPDLKLYYRAIVIKTAWYWYRD 900

BLAST of Lag0038334 vs. ExPASy Swiss-Prot
Match: P93295 (Uncharacterized mitochondrial protein AtMg00310 OS=Arabidopsis thaliana OX=3702 GN=AtMg00310 PE=4 SV=1)

HSP 1 Score: 157.9 bits (398), Expect = 8.3e-37
Identity = 70/149 (46.98%), Positives = 97/149 (65.10%), Query Frame = 0

Query: 768 AIPCYTMNCFRLPRCLIREIHRAMARFWWNESEEGKRIHWVSWDHMCRPK-CMGGLGFRN 827
           A+P Y M+CFRL + L +++  AM  FWW+  E  ++I WV+W  +C+ K   GGLGFR+
Sbjct: 2   ALPVYAMSCFRLSKLLCKKLTSAMTEFWWSSCENKRKISWVAWQKLCKSKEDDGGLGFRD 61

Query: 828 MELFNQALLAKQCWRVIQDPESLLGAVLKGRYFPHSEFWEASLGHRPSFIWRSLLWGREL 887
           +  FNQALLAKQ +R+I  P +LL  +L+ RYFPHS   E S+G RPS+ WRS++ GREL
Sbjct: 62  LGWFNQALLAKQSFRIIHQPHTLLSRLLRSRYFPHSSMMECSVGTRPSYAWRSIIHGREL 121

Query: 888 LVRGCRWRIGNGRSIPIYGSNWVPDNPSL 916
           L RG    IG+G    ++   W+ D   L
Sbjct: 122 LSRGLLRTIGDGIHTKVWLDRWIMDETPL 150

BLAST of Lag0038334 vs. ExPASy Swiss-Prot
Match: P92555 (Uncharacterized mitochondrial protein AtMg01250 OS=Arabidopsis thaliana OX=3702 GN=AtMg01250 PE=4 SV=1)

HSP 1 Score: 86.7 bits (213), Expect = 2.4e-15
Identity = 41/69 (59.42%), Positives = 50/69 (72.46%), Query Frame = 0

Query: 586 FNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPI 645
           F +NG   G V PSRGLRQGDPLSPYLF+LC E LS L R A+ +  + G RV+ +SP I
Sbjct: 12  FIINGAPQGLVTPSRGLRQGDPLSPYLFILCTEVLSGLCRRAQEQGRLPGIRVSNNSPRI 71

Query: 646 SHLFFADDS 655
           +HL FADD+
Sbjct: 72  NHLLFADDT 80

BLAST of Lag0038334 vs. ExPASy TrEMBL
Match: A0A7N2LIH6 (Uncharacterized protein OS=Quercus lobata OX=97700 PE=3 SV=1)

HSP 1 Score: 920.2 bits (2377), Expect = 1.0e-263
Identity = 511/1345 (37.99%), Positives = 746/1345 (55.46%), Query Frame = 0

Query: 1    MASAKRALGFENGFCVDSKGKSGGLALLWDASVTFSLLSFSNNHIDGWI--TWDDYHWRL 60
            M   +  LGF  G  V S G+SGGLALLW         S S++HID  +        WR 
Sbjct: 733  MKGFQNKLGFTQGIIVPSDGRSGGLALLWKEGTDIRFKSCSHSHIDVVVHGAGSGGPWRA 792

Query: 61   TGFYGFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQ 120
            TGFYG P    R  +W LL  L    + PWL+ GDFN +++  EK G +D+  +++ AF+
Sbjct: 793  TGFYGHPDTGKRYTSWKLLEILNTQCEMPWLVCGDFNEIVHPDEKMGWKDRDAAQMDAFR 852

Query: 121  NVIDSCALLDLGFVGNRFTWCNRR-PDGTIYERLDRCFSSATWHDIYPNCVVNHLDYHQS 180
             V+  C L+DLGFVG RFTWCN R  D     RLDR  ++  W  ++P   V+H+    S
Sbjct: 853  EVLSKCGLIDLGFVGPRFTWCNGRFGDQRTLIRLDRMVANEAWSLMFPEAKVHHVSMSAS 912

Query: 181  DHRPIELVLSPQPGCWRNPSQRITRFDETWLKRADLQQLVRDSWGLSREDPGLSAPQILA 240
            DH  + L L+      R   +    F+E W +  + +++V  +W   RED  +   + L 
Sbjct: 913  DHCLLALFLNKVNNQRRGKKRFF--FEEMWTRVEECKEIVELAWDPYREDSAMPVQERL- 972

Query: 241  QVSKRCMRSMAGWGRSRMGNFPQRISEANQKVQLAIEGLRGA-GSRELLSQAEAQLEDVL 300
               +RC + +  W ++  GN  + I +   ++Q  +E L     + E +   + ++ ++ 
Sbjct: 973  ---ERCQKMLQQWNQNSFGNVYKGIKQKKNRLQ-QLESLNLLHETAEEIQTLKKEINELH 1032

Query: 301  QEEELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIRGLTDDQGEWRQDKTMILQLV 360
              EE+ WKQRSR  WL+ GD+N+++FH  AS R++ NRI GL DD G W +D+    +L+
Sbjct: 1033 TREEVMWKQRSRVSWLQYGDKNSKFFHATASQRRQKNRIGGLMDDLGVWHEDQETTEKLI 1092

Query: 361  NDYFQQLFSTSEPSEQDFDISLRDIQRSVDNEMNVELLRPFTENEILRALKQSHPHKAPG 420
             DYF+ ++S+++P+   FD+SL  +   V  EMN EL + F   E+ +AL+Q HP KAPG
Sbjct: 1093 LDYFKDIYSSNQPT--SFDVSLEAMDERVTPEMNDELQKEFKAVEVWQALQQMHPTKAPG 1152

Query: 421  PDGLSGSFYKNHWSIVGPSVVQSCLAVLNHGCSPVSINDTMIVLIPKIKVPRRVSDFRPI 480
            PDG+S  FY+ +W IVG SV    L  LN G  P  IN T I LIPK K P+++++FRPI
Sbjct: 1153 PDGMSPIFYQKYWDIVGSSVTNCVLQALNSGVMPKDINKTYICLIPKTKNPQKITEFRPI 1212

Query: 481  SLCNFSYKLISKAVVNRMKHILPKLISPNQSAFVAGRCVVDNAILGFECIHELRRRTGGK 540
            SLCN  YK+ISK + NR+K +L  +I   QSAFV GR + DN I+ FE +H + +R  GK
Sbjct: 1213 SLCNVIYKIISKVLANRLKKVLHGVIDEAQSAFVPGRMITDNVIVAFESMHSINQRRKGK 1272

Query: 541  SKWAALKLDMSKAYDRIEWSFLRSVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNV 600
                A+KLDMSKAYDR+EW++L S+M +MGF  +W  LI+ CV+SVSFS  +NGE  G+ 
Sbjct: 1273 EGLMAIKLDMSKAYDRVEWAYLESMMKKMGFGDRWISLIMMCVTSVSFSVLINGEPKGSF 1332

Query: 601  IPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLL 660
             PSRGLRQGDP+SPYLFLLC EGLS++++  ER  LI G   AR +P ISHLFFADDS++
Sbjct: 1333 TPSRGLRQGDPISPYLFLLCGEGLSAMIKKKEREGLIRGVVAARQAPRISHLFFADDSII 1392

Query: 661  FFKANVNEAVTIRDLLICYERASGQVINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCH 720
            F +A V+E   +  +L  YE  SGQ +N +K+ + FS NT ++ +++   +        H
Sbjct: 1393 FCRATVDECEQVAKVLEVYEEESGQKLNRDKTSLFFSRNTKDEMKEFAKGIFGAQIIQHH 1452

Query: 721  QRYLGLPSFMPKNRSGTLMFIKDRVWKQIQGWKGKFFSLGGKEVLLKSIIQAIPCYTMNC 780
            ++YLGLP  + + +      IKD+V ++I GWKGK  S  G+EVL+K++ QA P YTMN 
Sbjct: 1453 EKYLGLPPLIGRAKKKAFNRIKDQVGRKIAGWKGKLLSNAGREVLIKAVAQATPTYTMNV 1512

Query: 781  FRLPRCLIREIHRAMARFWWNESEEGKRIHWVSWDHMCRPKCMGGLGFRNMELFNQALLA 840
            F+LP  L  E++  M  FWW +    K++ WVSW ++C+PK  GG+GF++++ FN ALLA
Sbjct: 1513 FKLPDSLCAELNSMMGSFWWGQRGREKKMAWVSWKNLCKPKVDGGMGFKDLKAFNLALLA 1572

Query: 841  KQCWRVIQDPESLLGAVLKGRYFPHSEFWEASLGHRPSFIWRSLLWGRELLVRGCRWRIG 900
            KQ WR+ Q+P SL   VLK +YF +S F EA LG +PS+IWRS++  + ++  G RW +G
Sbjct: 1573 KQGWRLHQNPNSLAHRVLKAKYFANSSFMEAQLGKKPSYIWRSIMAAKNIIKEGSRWVVG 1632

Query: 901  NGRSIPIYGSNWVPDNPSLRVQSAPSLPL-SSRVCDLFSPS-GQWDEAKVRAHFLGPECE 960
            +GRSI I+ + W+P   S +V +  S  +   RV  L S   G+W    V+  F+  E E
Sbjct: 1633 DGRSIEIWDARWLPSTASGKVMTTRSGSVQGERVASLISQERGEWKTTLVQQTFIPHEAE 1692

Query: 961  AILRIPLRSGLLEDRLIWHFEKHGVFSVKSGYRLAFSLASQ----------GVRLLLSLS 1020
             IL IPL S  L D L+W    +G F+VKS YR AF    +            +  +S  
Sbjct: 1693 EILSIPLSSMNLADSLVWAETPNGCFTVKSAYRTAFKCILEPREGEANPECSDKSRMSTI 1752

Query: 1021 PGGFGGLVYGDLGSRISTRFSYGVSSWNGCPLSSVV------------EDGLHLFWKCAV 1080
                 GL   +       R   G+     C +   +            E   H  W C V
Sbjct: 1753 WKTIWGLQCPNKIKHFLWRACRGILPTKKCLVHRKIMKDDCCDFCGESETSGHCLWNCIV 1812

Query: 1081 TREMWLCSKFSQLYQSLYHLDLVDVIWALREKLGALDFELVTVFWWSVWNLRNNLCWRGE 1140
             +E W    F+ +    + ++ +DV+W L E  G  D+E   +  WS+WN RNN+   G 
Sbjct: 1813 AKEAWKGLGFN-IDNPEHVVEFLDVVWLLLESQGDKDWEFFAIVAWSLWNNRNNVRHGGV 1872

Query: 1141 S-DGRDLWSWSEEYLRAYYDVVGRRESRCSLQ---PCPRRPAEQSSWTPPVGGGFKLNTD 1200
            S  G+ +   +  Y         R E R +L      P+   +   W+PP+   +K+N D
Sbjct: 1873 SKQGKSITEEARRY---------REEVRTALPAKGQVPKPMPKHKRWSPPLQDWYKVNVD 1932

Query: 1201 ASVRPDTGEAGGGCVLRDMSGAVLLAACLDLPRCWSVDLAEGWALVKGVELALQMGFLSF 1260
            A+V  + G  G G V+R+  G ++ A    +        AE  A   G+ LA  +G  + 
Sbjct: 1933 AAVFREQGTCGIGVVIRNNKGQIMGAMSKKMLFPLRALEAEAKAAEAGILLAWDLGLKNI 1992

Query: 1261 CVEVDSLRLVRILHGEVIDSSEVGLLMDDVRRLLHPCGRGKVLFTPRNGNRVAHALACLT 1313
             VE D+  +++ L G V   + +  +++  RR L      K + T R  N  AH LA  +
Sbjct: 1993 VVEGDAQLVIQALKG-VDAPTPIVKIIEGARRYLQMFCSWKAVHTNRRNNTAAHLLARES 2052

BLAST of Lag0038334 vs. ExPASy TrEMBL
Match: A0A2N9GB96 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS27778 PE=4 SV=1)

HSP 1 Score: 913.7 bits (2360), Expect = 9.6e-262
Identity = 508/1309 (38.81%), Positives = 739/1309 (56.46%), Query Frame = 0

Query: 30   DASVTF-SLLSFSNNHIDGWITWDDYHWRLTGFYGFPAADMRDQTWSLLSKLRGGSDTPW 89
            D S+ F S     +N IDG      + WRLT FYG P   +R+ +W+LL  L+     PW
Sbjct: 474  DPSIMFLSETWMDDNLIDG---NSAFPWRLTCFYGAPETHLRENSWNLLRALKNQFSLPW 533

Query: 90   LIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCALLDLGFVGNRFTWC-NRRPDGTI 149
               GDFN ++   E +G   +  +++  F+NVID C  +DLG+ G  FTWC NR+ D T 
Sbjct: 534  CCTGDFNEIVRSSEYKGRCSRNDNQMQGFRNVIDDCEFIDLGYRGLPFTWCNNRKGDATT 593

Query: 150  YERLDRCFSSATWHDIYPNCVVNHLDYHQSDHRPIELVLSPQPGCWRNPSQRITRFDETW 209
            + RLDR  ++  W   + + VV HLD  +SDH+PI L  +P     +   +++ RF++ W
Sbjct: 594  WLRLDRFMATNEWILHFHSAVVYHLDNTESDHKPIWLTTAPLQ--IQRTKRKLFRFEDMW 653

Query: 210  LKRADLQQLVRDSW-GLSREDPGLSAPQILAQVSKRCMRSMAGWGRSRMGNFPQRISEAN 269
               +  ++ +  +W    R  P +   ++L     RC R +  W R   G+  ++I E  
Sbjct: 654  RTESGCEETITKAWVPKVRGSPMVQVQEMLT----RCGRDLTAWSRVHFGSITRKIREKK 713

Query: 270  QKVQLAIE-GLRGAGSRELLSQAEAQLEDVLQEEELYWKQRSREVWLKEGDQNTRWFHRQ 329
            ++++ A E  + G G  ++LS  + +L  +L +EE  W+QRSR +WLK+GDQNT++FH +
Sbjct: 714  EELRKAEEQSISGRGHDQVLSLRQ-ELNTLLCKEEKMWQQRSRALWLKDGDQNTKYFHSR 773

Query: 330  ASYRQRLNRIRGLTDDQGEWRQDKTMILQLVNDYFQQLFSTSEPSEQDFDISLRDIQRSV 389
            A++R+R N +  L D  GE  +D   I      Y++ LF  + P E + D  L  I  SV
Sbjct: 774  ATHRKRRNSLVVLRDGTGELVEDPHEIGNRFIRYYEDLFQAA-PLE-EVDQVLAGINPSV 833

Query: 390  DNEMNVELLRPFTENEILRALKQSHPHKAPGPDGLSGSFYKNHWSIVGPSVVQSCLAVLN 449
              EMN +L RP+TE+E+  ALKQ  P KAPGPDG+  +FY+++W +VG  VVQ+ L+ +N
Sbjct: 834  TAEMNTKLTRPYTESEVAVALKQMAPLKAPGPDGMPPAFYQSYWKVVGKEVVQAVLSSIN 893

Query: 450  HGCSPVSINDTMIVLIPKIKVPRRVSDFRPISLCNFSYKLISKAVVNRMKHILPKLISPN 509
             G  P SIN T + LIPK+K P  V+++RPISLCN  YKLISK + NR+K +LP +I+  
Sbjct: 894  SGTLPPSINHTFVALIPKVKNPEHVTEYRPISLCNVIYKLISKVLANRLKEVLPTVIAET 953

Query: 510  QSAFVAGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKAYDRIEWSFLRSVMDRM 569
            QSAFV GR + DN ++ FE +H +  +  G+    ALKLDMSKAYDR+EWSFLR VM +M
Sbjct: 954  QSAFVPGRLITDNVLIAFETLHHMHNQRQGRVGSMALKLDMSKAYDRVEWSFLRQVMLKM 1013

Query: 570  GFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLR 629
            GF  QW  L++ C+++VS+S  +NGE  G++ PSRGLRQGDP+SPYLFLLCAEGL+ LL 
Sbjct: 1014 GFHSQWVSLMMECITTVSYSLLINGEPRGHITPSRGLRQGDPISPYLFLLCAEGLNGLLN 1073

Query: 630  GAERRALISGFRVARSSPPISHLFFADDSLLFFKANVNEAVTIRDLLICYERASGQVINY 689
             A  +  I G  + R  P ++HLFFADDSLLF +A   E   I+DLL  YE+ASGQ +N 
Sbjct: 1074 KAAAQGEIHGVSLCRRGPKLTHLFFADDSLLFCRATQAECHKIQDLLNIYEKASGQQLNR 1133

Query: 690  EKSVVAFSPNTGEDSQQYISHVLSVSRCPCHQRYLGLPSFMPKNRSGTLMFIKDRVWKQI 749
             K+ + FS NT + +Q  I ++L V     +++YLGLPS + K +      IKDRVW ++
Sbjct: 1134 SKTTLFFSHNTSQATQDDIKNILGVPSIRQYEKYLGLPSLVGKEKMACFSQIKDRVWSKV 1193

Query: 750  QGWKGKFFSLGGKEVLLKSIIQAIPCYTMNCFRLPRCLIREIHRAMARFWWNESEEGKRI 809
            +GWK K  S  G+E+L+K++IQAIP YTMNCF+LP  L ++I   M RFWW + ++ +++
Sbjct: 1194 KGWKEKLLSQAGREILIKAVIQAIPTYTMNCFKLPVKLCKDIEAIMRRFWWGQKDQERKV 1253

Query: 810  HWVSWDHMCRPKCMGGLGFRNMELFNQALLAKQCWRVIQDPESLLGAVLKGRYFPHSEFW 869
            HW+SW  +C+PK  GGLGFR ++ FN ALLAKQ WR +    SLL  V   ++FP+    
Sbjct: 1254 HWISWTKLCQPKGNGGLGFRELQKFNIALLAKQFWRFMNCKNSLLFKVFSPKFFPNGNIL 1313

Query: 870  EASLGHRPSFIWRSLLWGRELLVRGCRWRIGNGRSIPIYGSNWVPDNPSLRVQS-APSLP 929
            EASL  R SF WRS++  + L++ G  WR+G+G+ IPI  +NW+ D    RV S  P  P
Sbjct: 1314 EASLKTRGSFAWRSIMQAKSLILSGSSWRVGDGQKIPIKNANWLLDEGHRRVISPLPMFP 1373

Query: 930  LSSRVCDLFSPSG-QWDEAKVRAHFLGPECEAILRIPLRSGLLEDRLIWHFEKHGVFSVK 989
              S+V  L   S  +WD  K+RA FL  + EAIL+IP+ S    D+LIWH  + G +SV+
Sbjct: 1374 HGSKVALLMRGSPLEWDVEKIRASFLPYDAEAILQIPISSSSPPDKLIWHATRDGKYSVR 1433

Query: 990  SGYRLAF--------SLASQGVR------LLLSLSPGGFGGLV----YGDLGSRIS-TRF 1049
            SGY +            +  G R      +    +P      +    +  L S++  +R 
Sbjct: 1434 SGYHILLQEVQNTNPGSSRHGERDPLWKDIWSMCAPAKIRSFLWRACHESLPSKLGLSRR 1493

Query: 1050 SYGVSSW-NGCPLSSVVEDGLHLFWKCAVTREMWLCSKFSQLYQSLYHLDLVDVIWALRE 1109
                S W + C   + VED LH  WKC      W         +        D++  +  
Sbjct: 1494 QIVDSPWCDNC--GTGVEDCLHALWKCPAIECSWSTQHELAEIRKQEFGSFHDLVRQVGS 1553

Query: 1110 KLGALDFELVTVFWWSVWNLRNNLCWRGESDGRDLWSWSEEYLRAYYDVVGRRESRCSLQ 1169
               AL  E      W +W+ RN       SD         E L   +  +  +E   S  
Sbjct: 1554 HNRALLLEKFAAMCWLLWHKRNQTRLHLPSDDYTQICHRAETLIQEHARIHLKEHHQS-- 1613

Query: 1170 PCPRRPAEQSSWTPPVGGGFKLNTDASVRPDTGEAGGGCVLRDMSGAVLLAACLDLPRCW 1229
                 P  + SW PP    +K+N D ++  ++ E G G V+RD +G V+      +  C 
Sbjct: 1614 ----PPNPKVSWQPPTSYKYKVNFDGAIFRESKEGGIGVVIRDQNGLVIATLSQRVKTCP 1673

Query: 1230 SVDLAEGWALVKGVELALQMGFLSFCVEVDSLRLVRILHGEVIDSSEVGLLMDDVRRLLH 1289
            S ++ E  A  + ++ AL++G      E DS  ++R +       +  GL+++D + LLH
Sbjct: 1674 SAEMIEARAAKRAIQFALEIGIFDAIFEGDSDLIIREISSPEAMHNVYGLVLEDAKALLH 1733

Query: 1290 PCGRGKVLFTPRNGNRVAHALACLTFSYSG-CVWLEEWPMEIAAVLAGD 1312
               R +   T R+GN VAHALA    +    CVW+E+ P +I  VL  D
Sbjct: 1734 HFERYQFTHTRRSGNTVAHALARRALNIQNLCVWMEDVPPDIIPVLYSD 1762

BLAST of Lag0038334 vs. ExPASy TrEMBL
Match: A0A7N2R0C3 (Reverse transcriptase domain-containing protein OS=Quercus lobata OX=97700 PE=4 SV=1)

HSP 1 Score: 911.4 bits (2354), Expect = 4.8e-261
Identity = 492/1341 (36.69%), Positives = 739/1341 (55.11%), Query Frame = 0

Query: 5    KRALGFENGFCVDSKGKSGGLALLWDASVTFSLLSFSNNHIDGWITWDD--YHWRLTGFY 64
            +R LG   G  V S G+SGGLA+LW   V  SL S SN+HID  +   +    WR TGFY
Sbjct: 519  QRKLGLTQGIAVPSDGRSGGLAMLWREGVDVSLKSCSNSHIDVVVGGSNGAVPWRATGFY 578

Query: 65   GFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVID 124
            G P A MR  +W LL  L    + PW++ GDFN +L   EK G  ++   ++  F+  + 
Sbjct: 579  GHPDAGMRPISWKLLEVLSRQCNMPWVVFGDFNEILNSDEKLGWLERDARQMECFRECLS 638

Query: 125  SCALLDLGFVGNRFTWCNRR-PDGTIYERLDRCFSSATWHDIYPNCVVNHLDYHQSDHRP 184
            +C LLDLGFVG RFTWCN R  +     RLDR  ++  W +++P   V H     SDH  
Sbjct: 639  NCGLLDLGFVGQRFTWCNGRIGEQRTLVRLDRMVANEEWMNLFPEAKVVHRSMAASDHCL 698

Query: 185  IELVLSPQPGCWRNPSQRITRFDETWLKRADLQQLVRDSWGLSREDPGLSAPQILAQVSK 244
            + L +  +    R  ++R   F+E W +    ++++  +W     DP    P++  Q   
Sbjct: 699  LSLSIRRRE--TRKVARRRFMFEEMWTREEGCREVIERAW-----DPLGCNPELTIQNRL 758

Query: 245  RCMR-SMAGWGRSRMGNFPQRISEANQKVQLAIEGLRGAGSRELLSQAEAQLEDVLQEEE 304
            +C +  +  W R   GN  + + +   ++Q   E      S E + + + ++ +V+  EE
Sbjct: 759  KCCQCQLQNWNRRVFGNVNKILKQKQCRLQQLEELNLLHESAEEVQKLKKEINEVMLREE 818

Query: 305  LYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIRGLTDDQGEWRQDKTMILQLVNDYF 364
            + W QRSR +W+K GD+NTR+FH  A+ R+R N+I G+ D +G WR++   + +++ +YF
Sbjct: 819  IMWNQRSRALWIKYGDRNTRFFHATANNRRRKNKIEGILDSEGRWRENNEEVEEIILEYF 878

Query: 365  QQLFSTSEPSEQDFDISLRDIQRSVDNEMNVELLRPFTENEILRALKQSHPHKAPGPDGL 424
            ++++S++ P+E  F   L  + R V  +MN +LLR F E E+ +AL Q HP K+PGPDG+
Sbjct: 879  KEIYSSNFPTE--FGACLGAVGRRVTEDMNEDLLREFKEEEVWQALMQMHPTKSPGPDGM 938

Query: 425  SGSFYKNHWSIVGPSVVQSCLAVLNHGCSPVSINDTMIVLIPKIKVPRRVSDFRPISLCN 484
            S  F++ +W +VGP VVQS +  L  G  P+ +N+T I LIPK+K P++++++RPISLCN
Sbjct: 939  SPIFFQKYWDVVGPQVVQSVIHTLRTGVMPMGVNETYICLIPKVKCPQKITEYRPISLCN 998

Query: 485  FSYKLISKAVVNRMKHILPKLISPNQSAFVAGRCVVDNAILGFECIHELRRRTGGKSKWA 544
              YKL+SK + NR+K +LP ++   QSAFV GR + DN ++ FE +H + +R  GK    
Sbjct: 999  VIYKLVSKVLANRLKVVLPDVVDEAQSAFVPGRQITDNVLVAFEVMHCINQRRKGKEGLM 1058

Query: 545  ALKLDMSKAYDRIEWSFLRSVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSR 604
            A+KLDMSKAYDR+EW +L ++M RMGF ++W  L++ CV++VSFS  +NGE  G ++P+R
Sbjct: 1059 AIKLDMSKAYDRVEWGYLEAIMRRMGFRERWISLMMMCVTTVSFSVLINGEPRGRIVPTR 1118

Query: 605  GLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFKA 664
            GLRQGDP+SPYLFLLCAEGLS++LR  E    +SG ++ R +P ISHL FADD ++F KA
Sbjct: 1119 GLRQGDPISPYLFLLCAEGLSAMLRRNEIGEAVSGVQICRRAPRISHLLFADDCIVFGKA 1178

Query: 665  NVNEAVTIRDLLICYERASGQVINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHQRYL 724
            ++ E + +  +L  YER SGQ +N EK+ + FS NT  + ++ +  +        H+RYL
Sbjct: 1179 SMEEGLKVTKILEDYERESGQKLNKEKTSLFFSKNTAVEVKEGVKELFGAEIIHQHERYL 1238

Query: 725  GLPSFMPKNRSGTLMFIKDRVWKQIQGWKGKFFSLGGKEVLLKSIIQAIPCYTMNCFRLP 784
            GLP  + + +      IKD+V ++I  WKG+  S  G+E+L+K++ QA P YTMNCF LP
Sbjct: 1239 GLPPLVGRGKRKAFNRIKDQVGRKIASWKGRLLSTAGREILIKAVAQATPTYTMNCFLLP 1298

Query: 785  RCLIREIHRAMARFWWNESEEGKRIHWVSWDHMCRPKCMGGLGFRNMELFNQALLAKQCW 844
              L  E++  +  FWW + ++ K++ W++W  +C+PK  GG+GF++++ FN ALLAKQ W
Sbjct: 1299 DSLCSELNSLVRNFWWGQRDKEKKLAWIAWGKLCKPKAEGGMGFKDLKAFNLALLAKQGW 1358

Query: 845  RVIQDPESLLGAVLKGRYFPHSEFWEASLGHRPSFIWRSLLWGRELLVRGCRWRIGNGRS 904
            R+ Q+P SL   VLK RYFP S F EA LG+ PS+ WRSLL  RE++ RG RW IGNG+ 
Sbjct: 1359 RLSQNPCSLAYRVLKARYFPSSNFMEAQLGNLPSYTWRSLLAAREIIERGRRWNIGNGQQ 1418

Query: 905  IPIYGSNWVPDNPSLRVQSAPSLPLSSRVCD--LFSPSGQWDEAKVRAHFLGPECEAILR 964
            + I+   W+P   S +V S         + +  +    G WD+  VR  F+  E E+IL 
Sbjct: 1419 VRIWVDRWLPTPHSFKVVSPKPQEFEGEMVESLINQQEGGWDKNLVRRVFIPHEAESILS 1478

Query: 965  IPLRSGLLEDRLIWHFEKHGVFSVKSGYRLAFSLASQ--------GV----------RLL 1024
            IP+   L ED + W +  +G F+V S Y++A S   +        GV          + L
Sbjct: 1479 IPISLSLPEDAVSWAWTPNGRFTVSSAYKVACSWLCERRSKEEGCGVSDPGKGRQFWKFL 1538

Query: 1025 LSL-SPGGFGGLVYGDLGSRISTRFSY---GVSSWNGCPLSSVVEDGLHLFWKCAVTREM 1084
              L  P      ++    + + T +      V     C +   +E   H+ W C V   +
Sbjct: 1539 WQLHCPSKVKHFLWRACKNILPTNYCLKLRKVPIEEACGVCGRIESAGHVLWDCEVAGAV 1598

Query: 1085 WLCSKFSQLYQSLYHLDLVDVIWALREKLGALDFELVTVFWWSVWNLRNNLCWRGESDGR 1144
            W  SK         H D ++++W L E    LD+E      W +W  RN+L + G     
Sbjct: 1599 WRESKLMLPKLRNDHRDFMEIVWKLWEGRRELDWECFVTTAWCIWKNRNSLKFEGRGKAA 1658

Query: 1145 DLWSWSEEYLRAYYDVVGRRESRCSLQPCPRRPAEQ--SSWTPPVGGGFKLNTDASVRPD 1204
             +     E L   +     +E         + P E    +W PP  G +K N D +V  +
Sbjct: 1659 RVIVKEAELLVEEFRSGNIKE---------KLPVEVRIQAWRPPREGWYKANVDGAVFKE 1718

Query: 1205 TGEAGGGCVLRDMSGAVLLAACLDLPRCWSVDLAEGWALVKGVELALQMGFLSFCVEVDS 1264
            T   G G V+R+  G ++ A    L         E  A  +G+ LA  +G     +E D+
Sbjct: 1719 TNSCGIGVVIRNDQGQIMGAMSKRLNLPLGAVEVEAKAFEEGLRLAGDLGLQQVILEGDA 1778

Query: 1265 LRLVRILHGEVIDSSEVGLLMDDVRRLLHPCGRGKVLFTPRNGNRVAHALACLTFSYSGC 1315
            L +   L G+ +  S +  L+    R              R GNR AH +A      S C
Sbjct: 1779 LTVTNSLLGKCLPPSSIQRLIAGASRWKQWVQVWSASHVRRAGNRAAHVMAQNAKCISDC 1838

BLAST of Lag0038334 vs. ExPASy TrEMBL
Match: A0A803PV25 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 906.0 bits (2340), Expect = 2.0e-259
Identity = 490/1300 (37.69%), Positives = 726/1300 (55.85%), Query Frame = 0

Query: 8    LGFENGFCVDSKGKSGGLALLWDASVTFSLLSFSNNHIDGWITWDDY-HWRLTGFYGFPA 67
            L +EN + VD  G SGGL L+W A +   +LS S  HI   +    +  W  TGFYG P 
Sbjct: 594  LQYENLWTVDRIGLSGGLLLMWKADIQVQVLSSSPGHILATVAGCGFPPWSFTGFYGNPD 653

Query: 68   ADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQNVIDSCAL 127
            A  R  +W LL  LR     PWL  GDFN ++   EK GGRD+    +  F+ V+D C  
Sbjct: 654  AGQRRFSWQLLRDLRKEVQGPWLCIGDFNEIVSLSEKVGGRDRLPGVMDGFKEVLDDCQF 713

Query: 128  LDLGFVGNRFTWCNRRPDGTIYERLDRCFSSATWHDIYPNCVVNHLDYHQSDHRPIEL-- 187
            +D     +  TWCN   +  I ERLDR   +  W D +    ++ LD+ +SDHR + +  
Sbjct: 714  IDFSSTKHELTWCNEHSNSRIMERLDRGLCTEEWLDKFEGADISLLDWWESDHRALVVDI 773

Query: 188  -VLSPQPGCWRNPSQRITRFDETWLKRADLQQLVRDSWGLSREDPGLSAPQILAQVSKRC 247
             V      C +   +    F+E W +  +  ++V   W  S E+          +++K C
Sbjct: 774  PVRLDGDKCGKAKRKSRFHFEEAWCQEEECTEIVDRLW--SEENVSGRVGSFRCKINK-C 833

Query: 248  MRSMAGWGRSRMGNFPQRISEANQKVQLAIEGLRGAGSRELLSQAEAQLEDVLQEEELYW 307
             +++  W R +       + E  +K    +  ++  G  E + Q EA+L  +L+++E YW
Sbjct: 834  GKALQTWNRKKKSKLNSEV-EKLKKALHELTMMQQPGVWETIQQMEAKLNGLLEKDEQYW 893

Query: 308  KQRSREVWLKEGDQNTRWFHRQASYRQRLNRIRGLTDDQGEWRQDKTMILQLVNDYFQQL 367
            +QRSR +WL+ GD+NT++FH +AS R++ N I+GL D  G W+ DK ++ ++V DY+++L
Sbjct: 894  RQRSRALWLQWGDRNTKYFHHKASARRKKNEIKGLQDHMGVWQDDKVLVCRIVEDYYEKL 953

Query: 368  FSTSEPSEQDFDISLRDIQRSVDNEMNVELLRPFTENEILRALKQSHPHKAPGPDGLSGS 427
            F++S+ +E   +  L  +Q  V + MN +LL  F E E++RA+K+ +P KAPG DGL   
Sbjct: 954  FTSSDINESVLNEVLSVVQPKVSSVMNNDLLAEFGEEEVIRAVKEMNPTKAPGADGLPAL 1013

Query: 428  FYKNHWSIVGPSVVQSCLAVLNHGCSPVSINDTMIVLIPKIKVPRRVSDFRPISLCNFSY 487
            FY+  WS +   VV   L VLN+G     +NDT++ LIPK+  P+R+ +FRPISLCN  Y
Sbjct: 1014 FYQKFWSKLKVDVVAVSLNVLNNGADLQCLNDTVVALIPKVDKPQRIEEFRPISLCNVIY 1073

Query: 488  KLISKAVVNRMKHILPKLISPNQSAFVAGRCVVDNAILGFECIHELRRRTGGKSKWAALK 547
            K++SK + NRM+  L  ++S +QSAF+ GR + DNAI+G+E +H +R+         ALK
Sbjct: 1074 KIVSKCLANRMRVSLGSVVSDSQSAFLKGRLIHDNAIVGYESLHVMRKDRFRNGSKVALK 1133

Query: 548  LDMSKAYDRIEWSFLRSVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSRGLR 607
            LDM+KAYDR+EW FL ++M ++G++Q W   I+ C++SV FSF +NGE  G V+P RGLR
Sbjct: 1134 LDMAKAYDRVEWRFLEAMMVKLGYSQLWVAKIMNCLTSVQFSFIINGEIQGRVLPQRGLR 1193

Query: 608  QGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLLFFKANVN 667
            QGDPLSP+LFLLCAE  S L++ AE++  + G    R    +SHLFFADDSL+F  A  +
Sbjct: 1194 QGDPLSPFLFLLCAEAFSCLIQHAEQQGRLHGVVFGRQRLMVSHLFFADDSLVFLDATED 1253

Query: 668  EAVTIRDLLICYERASGQVINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCHQRYLGLP 727
            E    R+LL  Y  ASGQ++N+ KS + F  +     + +++  + V     + +YLGLP
Sbjct: 1254 ECRCFRELLEKYSIASGQLVNFHKSEMCFGRSVSAPVRTHLATFMGVKVVDNYGKYLGLP 1313

Query: 728  SFMPKNRSGTLMFIKDRVWKQIQGWKGKFFSLGGKEVLLKSIIQAIPCYTMNCFRLPRCL 787
            SF+ + +     FI ++VW +++GWKG FFS  GKEVL+K+I+QAIP YTM+CFRLP+  
Sbjct: 1314 SFVGRTKKQHFEFI-NKVWNKLKGWKGSFFSAAGKEVLIKAIVQAIPTYTMSCFRLPKKT 1373

Query: 788  IREIHRAMARFWWNESEEGKRIHWVSWDHMCRPKCMGGLGFRNMELFNQALLAKQCWRVI 847
            I  IH   ARFWW  SE+  +IHW  W  +C+ K  GGLGFR++ LFNQALLAKQ WR I
Sbjct: 1374 INSIHSMAARFWWGSSEKDAKIHWCKWSVLCKHKEQGGLGFRDLGLFNQALLAKQIWRCI 1433

Query: 848  QDPESLLGAVLKGRYFPHSEFWEASLGHRPSFIWRSLLWGRELLVRGCRWRIGNGRSIPI 907
            + P SL   VLK  Y+P+    EA  G+  SF+WRSL+WG++++  G RWRIGNG S+ +
Sbjct: 1434 RYPNSLCSKVLKASYYPNVGVLEAKCGNHASFVWRSLVWGKKIIQAGYRWRIGNGNSVRV 1493

Query: 908  YGSNWVPDNPSLRVQSAPSLPLSSRVCDLFSPSGQWDEAKVRAHFLGPECEAILRIPLRS 967
                W+P   + ++   P LP +  V DL   +G+WDE  VRA F   + E IL++    
Sbjct: 1494 LDDPWLPRPVTFKIYDKPPLPDNLHVIDLKKGNGEWDEEFVRAVFNPTDAELILQMATSE 1553

Query: 968  GLLEDRLIWHFEKHGVFSVKSGYRLAFSLASQGV-----------RLLLSLS-PGGFGGL 1027
              +ED+++WH+ K G +SV+SGYR+A +L  + +           R L  L  P      
Sbjct: 1554 CDIEDKILWHYSKDGEYSVRSGYRMAAALEVRDIQSNTEATNRWWRQLWKLKIPPKVKHF 1613

Query: 1028 VYGDLGSRISTRFSYG-----VSSWNGCPLSSVVEDGLHLFWKCAVTREMWLCSKFSQLY 1087
            V+    S I T  +       +  +     S   E+  H  W C V  ++W  S FS   
Sbjct: 1614 VWKMAHSWIPTNSALAHRKVQIEPYCTRCSSGAYENVFHALWSCRVNCDVWKISGFSSKI 1673

Query: 1088 QSLYHLDLVDVIWALREKLGALDFELVTVFWWSVWNLRNNLCWRGESD-GRDLWSWSEEY 1147
            +     D++  +  +   L   DFE   V  W++W +RN++   G       +  W  ++
Sbjct: 1674 KRQGKEDVLAFLMRMSSSLAKEDFEYFLVLTWNLWYIRNSVNHGGHKPVAAAIVEWCSKF 1733

Query: 1148 LRAYYDVVGRRESRCSLQPCPRRPAEQSSWTPPVGGGFKLNTDASVRPDTGEAGGGCVLR 1207
            L  +      R+S  S +    R A  + W  PV G + +N DA V+   G A    V+R
Sbjct: 1734 LAEF------RDSNVSQKAGAARAA--ARWVAPVRGSYTINVDAGVKVGEGLASVSSVMR 1793

Query: 1208 DMSGAVLLAACLDLPRCWSVDLAEGWALVKGVELALQMGFLSFCVEVDSLRLVRILHGEV 1267
            D  G V +AA   + +  S   AE  A+  G++  LQ    SF VE D L+ V ++  + 
Sbjct: 1794 DHEGRVKVAAVRVVEKELSPLHAELTAIADGIKAGLQQKLPSFHVETDCLQAVNLVLKDD 1853

Query: 1268 IDSSEVGLLMDDVRRLLHPCGRGKVLFTPRNGNRVAHALA 1286
                +V  L+  +R LL       + F  R  N+ AH LA
Sbjct: 1854 GGCRDVDGLVTQIRCLLQDVRVHGISFVYREANQFAHVLA 1880

BLAST of Lag0038334 vs. ExPASy TrEMBL
Match: A0A6J5TIF9 (Reverse transcriptase domain-containing protein OS=Prunus armeniaca OX=36596 GN=CURHAP_LOCUS4077 PE=4 SV=1)

HSP 1 Score: 902.9 bits (2332), Expect = 1.7e-258
Identity = 519/1323 (39.23%), Positives = 727/1323 (54.95%), Query Frame = 0

Query: 1    MASAKRALGFENGFCVDSKGKSGGLALLWDASVTFSLLSFSNNHIDGWITWDDY-HWRLT 60
            M      LG     CV   G SGGL LLW   +   LLS S  HID  +T + Y  +R+T
Sbjct: 1    MGKLHTRLGLGGVVCVPRVGFSGGLCLLWQVGLQVDLLSSSPGHIDVRVTMNTYATFRVT 60

Query: 61   GFYGFPAADMRDQTWSLLSKLRGGSDTPWLIGGDFNALLYQHEKEGGRDKPLSELAAFQN 120
            GFYG P    R  +W LL +L      PWL  GDFN ++  +EK G R +  +++  F+ 
Sbjct: 61   GFYGHPDQTQRHHSWELLRRLGRVDLGPWLCCGDFNEVMECNEKSGNRLRRDAQMEDFKM 120

Query: 121  VIDSCALLDLGFVGNRFTWCNRRPDGTIYE-RLDRCFSSATWHDIYPNCVVNHLDYHQSD 180
             I  C L    F G  FTW N+R D    E RLDR F +      + N   +HL    SD
Sbjct: 121  AITDCCLFQFEFTGYPFTWSNKRKDTAHVEARLDRGFGNLALLQHWGNFTSHHLVAFSSD 180

Query: 181  HRPIELVLS-PQPGCWRNP-SQRITRFDETWLKRADLQQLVRDSWGLSREDPGLSAPQIL 240
            H PI +    PQ    R+P  +R   F+E W    D +++VR SW         +A   L
Sbjct: 181  HHPILIASDRPQGDKARDPRGRRRFHFEEVWTTEVDCEEVVRQSW--------QNAVSPL 240

Query: 241  AQVSKRCMRSMAGWGRSRMGNFPQRISEANQKVQLAIEGLRGAGSRELLSQAEAQLEDVL 300
            + ++  C  +++ W   + G  P+++ E   ++           +    S  E +L+  L
Sbjct: 241  SNIA-NCASNLSRWCAEKGGQVPKKVKELRLRLASLQSDEPSTQTFHTRSLIETELDKCL 300

Query: 301  QEEELYWKQRSREVWLKEGDQNTRWFHRQASYRQRLNRIRGLTDDQGEWRQDKTMILQLV 360
            ++EE+YW QRSR  WL+ GD+NT +FH+QA+ R++ N + G+ D+   W+ +   I  + 
Sbjct: 301  EQEEIYWHQRSRVQWLQHGDRNTSFFHKQATSRRKKNALVGILDENDRWQSENDKIGGVF 360

Query: 361  NDYFQQLFSTSEPSEQDFDISLRDIQRSVDNEMNVELLRPFTENEILRALKQSHPHKAPG 420
             ++F  LF TS+    D ++    +Q  V +     LL P++ +EI  AL    P KAPG
Sbjct: 361  VEFFTNLF-TSDMGVADVEV-FSAVQARVSSRSYHNLLLPYSRDEIEVALNFIGPSKAPG 420

Query: 421  PDGLSGSFYKNHWSIVGPSVVQSCLAVLNHGCSPVSINDTMIVLIPKIKVPRRVSDFRPI 480
            PDG+   FY+ +WSIVGP V   CL VLN        N T++ LIPK+  P RVS++RPI
Sbjct: 421  PDGMPALFYQKYWSIVGPDVSDLCLRVLNGSDGVNDFNHTLVALIPKVNSPTRVSEYRPI 480

Query: 481  SLCNFSYKLISKAVVNRMKHILPKLISPNQSAFVAGRCVVDNAILGFECIHELRRRTGGK 540
            SLCN  YK+ISK + NR+K +LP++IS  QSAF+  R ++DN +  FE +H L+R     
Sbjct: 481  SLCNVLYKIISKTLANRLKKVLPEVISEFQSAFIPNRMILDNVLAAFETVHCLKRWGKTG 540

Query: 541  SKWAALKLDMSKAYDRIEWSFLRSVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNV 600
             K   LKLDM+KAYDR+E  FL  ++  MGF  ++  LI+ CV++VS+S  + G   G +
Sbjct: 541  KKKLILKLDMAKAYDRVERKFLEQMLRTMGFPIRFIQLIMGCVTTVSYSLLIQGRPFGRI 600

Query: 601  IPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPISHLFFADDSLL 660
            IPSRGLRQGDP+SPYLFL+ AE  S+LL+ AER + + G  +A S+P I+HLFFADDSLL
Sbjct: 601  IPSRGLRQGDPISPYLFLIVAEAFSALLQQAERDSRLHGVSIAPSAPSINHLFFADDSLL 660

Query: 661  FFKANVNEAVTIRDLLICYERASGQVINYEKSVVAFSPNTGEDSQQYISHVLSVSRCPCH 720
            F  A   EA+ ++ +   YE ASGQ +N  KS + FSP+T    Q  I  +L+V+  PCH
Sbjct: 661  FCNAGTTEALELKRIFGVYESASGQKVNLGKSALCFSPSTPRVLQDDIRQLLNVTIVPCH 720

Query: 721  QRYLGLPSFMPKNRSGTLMFIKDRVWKQIQGWKGKFFSLGGKEVLLKSIIQAIPCYTMNC 780
            +RYLGLP+ + K++      +KDRVW ++ GW+GK  S  GKEVL+KS+ QAIP Y+M+ 
Sbjct: 721  ERYLGLPTIVGKDKKKLFRTVKDRVWNKVNGWQGKLLSKAGKEVLIKSVCQAIPSYSMSV 780

Query: 781  FRLPRCLIREIHRAMARFWWNESEEGKRIHWVSWDHMCRPKCMGGLGFRNMELFNQALLA 840
            FRLP  L REI   +A+FWW+++ +G+ IHW +W  MC+ K  GG+GFR +  FNQALL 
Sbjct: 781  FRLPVGLCREIESIIAKFWWSKN-DGRGIHWKTWRFMCQHKSDGGIGFRELTSFNQALLC 840

Query: 841  KQCWRVIQDPESLLGAVLKGRYFPHSEFWEASLGHRPSFIWRSLLWGRELLVRGCRWRIG 900
            KQ WR+++ P SL+  + K RYFPHS+F  AS G  PSF W+SLLWGR+LL  G RWRIG
Sbjct: 841  KQGWRLLEFPNSLIARMFKARYFPHSDFLAASSGSLPSFTWQSLLWGRDLLRLGLRWRIG 900

Query: 901  NGRSIPIYGSNWVPDNPSLRVQSAPSLPLSSRVCDLFSPSGQWDEAKVRAHFLGPECEAI 960
            +GR + IYG  WVP +    +QS P+LP++SRVCDLF+ SG WD  KV A F  PE EAI
Sbjct: 901  DGRLVNIYGDPWVPYDRFFTIQSIPTLPVTSRVCDLFTASGGWDVRKVFASFSFPEAEAI 960

Query: 961  LRIPLRSGLLEDRLIWHFEKHGVFSVKSGYRLAFSLASQGVRLLLSLSPGGFGGLVYGDL 1020
            L IPL    L DR IW+F K+G +SVKSGY  A        + L  LS GG  G      
Sbjct: 961  LSIPLMGDTL-DRRIWNFTKNGRYSVKSGYWAALE-----YKRLEELSTGGVAG------ 1020

Query: 1021 GSRISTRFSYGVSSWNGCPLSSVVEDGLHLFWKCAVTREMWLCSKFSQLYQSLYHLDLVD 1080
                    S  + SW       V +  LHL W+ A               Q +  L   +
Sbjct: 1021 ------PSSSSLKSWKHLWKLKVPQKILHLLWRVA---------------QDI--LPSKE 1080

Query: 1081 VIWALREKLGALDFELVTVFWWSVWNLRNNLCWRGESDGRDLWSWSEEYLRAYYDVV-GR 1140
            V++  R   G +         W   +  +N      +   D+ +W +    A + ++   
Sbjct: 1081 VLFRRRITQGEV---------WEALDFPSNFLLPTLA---DVGTWMD----AIWSIIPPD 1140

Query: 1141 RESRCSLQPCPRRPAEQSSWTPPVGGGFKLNTDASVRPDTGEAGGGCVLRDMSGAVLLAA 1200
            ++S  +       P     W PP G  FKLN D +   +TG  G G ++RD  G ++ A 
Sbjct: 1141 KQSLFAFTVSLSSPVCDIKWRPPTGNCFKLNVDGATDMETGARGAGAIVRDSHGKLVGAL 1200

Query: 1201 CLDLPRCWSVDLAEGWALVKGVELALQMGFLSFCVEVDSLRLVRILHGEVIDSSEVGLLM 1260
             +  P   SV   E +AL  G+  AL M  +   +E DSL+ V +++ E    +  G L+
Sbjct: 1201 AMRAPSRISVLATELYALKVGISFALDMSPVPLEIESDSLQAVSMVNSEEECLAAEGGLV 1257

Query: 1261 DDVRRLLHPCGRGKVLFTPRNGNRVAHALACLTF-SYSGCVWLEEWPMEIAAVLAGDVAL 1318
            D VRRLL       V   PR  N+ AH +A  +    S  +WL+  P+ +   +  D   
Sbjct: 1261 DGVRRLLVRSASTAVRHIPRQANKAAHRIARFSLRDQSLSLWLDVGPLWLMDAVYDD--- 1257

BLAST of Lag0038334 vs. TAIR 10
Match: AT4G29090.1 (Ribonuclease H-like superfamily protein)

HSP 1 Score: 234.6 bits (597), Expect = 5.0e-61
Identity = 180/578 (31.14%), Positives = 255/578 (44.12%), Query Frame = 0

Query: 768  AIPCYTMNCFRLPRCLIREIHRAMARFWWNESEEGKRIHWVSWDHMCRPKCMGGLGFRNM 827
            A+P YTM CF LP+ + ++I   +A FWW   +E K +HW +WDH+   K  GG+GF+++
Sbjct: 2    ALPTYTMACFLLPKTVCKQIISVLADFWWRNKQEAKGMHWKAWDHLSCYKAEGGIGFKDI 61

Query: 828  ELFNQALLAKQCWRVIQDPESLLGAVLKGRYFPHSEFWEASLGHRPSFIWRSLLWGRELL 887
            E FN ALL KQ WR++  PESL+  V K RYF  S+   A LG RPSF+W+S+   +E+L
Sbjct: 62   EAFNLALLGKQMWRMLSRPESLMAKVFKSRYFHKSDPLNAPLGSRPSFVWKSIHASQEIL 121

Query: 888  VRGCRWRIGNGRSIPIYGSNWVPDNP---SLRVQSAPSLPLSS-----RVCDLFSPSG-Q 947
             +G R  +GNG  I I+   W+   P   +LR+Q  P    +S     +V DL   SG +
Sbjct: 122  RQGARAVVGNGEDIIIWRHKWLDSKPASAALRMQRVPPQEYASVSSILKVSDLIDESGRE 181

Query: 948  WDEAKVRAHFLGPECEAILRIPLRSG--LLEDRLIWHFEKHGVFSVKSGYRLAFSLASQG 1007
            W +  +   F  PE E  L   LR G   + D   W +   G ++VKSGY +   + ++ 
Sbjct: 182  WRKDVIEMLF--PEVERKLIGELRPGGRRILDSYTWDYTSSGDYTVKSGYWVLTQIINKR 241

Query: 1008 ------------------------------VRLLLSLSPGGFGGLVYGDLGSRISTRFSY 1067
                                          +   LS S    G L Y  L          
Sbjct: 242  SSPQEVSEPSLNPIYQKIWKSQTSPKIQHFLWKCLSNSLPVAGALAYRHLSKE------- 301

Query: 1068 GVSSWNGCPLSSVVEDGLHLFWKCAVTREMWLCSKF-----SQLYQSLYHLDLVDVIWAL 1127
              S+   CP  S  E   HL +KC   R  W  S        +   S+Y    V++ W  
Sbjct: 302  --SACIRCP--SCKETVNHLLFKCTFARLTWAISSIPIPLGGEWADSIY----VNLYWVF 361

Query: 1128 REKLGALDFE----LVTVFWWSVWNLRNNLCWRGESDGRDLWSWSEEYLRAYYDVVGRRE 1187
                G   +E    LV    W +W  RN L +RG          ++E LR   D +    
Sbjct: 362  NLGNGNPQWEKASQLVPWLLWRLWKNRNELVFRGREFN------AQEVLRRAEDDLEEWR 421

Query: 1188 SRCSLQPCPRRPAEQSS----WTPPVGGGFKLNTDASVRPDTGEAGGGCVLRDMSGAVLL 1247
             R   + C  +P    S    W PP     K NTDA+   D    G G VLR+  G V  
Sbjct: 422  IRTEAESCGTKPQVNRSSCGRWRPPPHQWVKCNTDATWNRDNERCGIGWVLRNEKGEVKW 481

Query: 1248 AACLDLPRCWSVDLAEGWALVKGVELALQMGFLSFCVEVDSLRLVRILHGEVIDSSEVGL 1292
                 LP+  SV  AE  A+   V    +  +     E DS  L+ IL+ + I  S +  
Sbjct: 482  MGARALPKLKSVLEAELEAMRWAVLSLSRFQYNYVIFESDSQVLIEILNNDEIWPS-LKP 541

BLAST of Lag0038334 vs. TAIR 10
Match: ATMG00310.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein)

HSP 1 Score: 157.9 bits (398), Expect = 5.9e-38
Identity = 70/149 (46.98%), Positives = 97/149 (65.10%), Query Frame = 0

Query: 768 AIPCYTMNCFRLPRCLIREIHRAMARFWWNESEEGKRIHWVSWDHMCRPK-CMGGLGFRN 827
           A+P Y M+CFRL + L +++  AM  FWW+  E  ++I WV+W  +C+ K   GGLGFR+
Sbjct: 2   ALPVYAMSCFRLSKLLCKKLTSAMTEFWWSSCENKRKISWVAWQKLCKSKEDDGGLGFRD 61

Query: 828 MELFNQALLAKQCWRVIQDPESLLGAVLKGRYFPHSEFWEASLGHRPSFIWRSLLWGREL 887
           +  FNQALLAKQ +R+I  P +LL  +L+ RYFPHS   E S+G RPS+ WRS++ GREL
Sbjct: 62  LGWFNQALLAKQSFRIIHQPHTLLSRLLRSRYFPHSSMMECSVGTRPSYAWRSIIHGREL 121

Query: 888 LVRGCRWRIGNGRSIPIYGSNWVPDNPSL 916
           L RG    IG+G    ++   W+ D   L
Sbjct: 122 LSRGLLRTIGDGIHTKVWLDRWIMDETPL 150

BLAST of Lag0038334 vs. TAIR 10
Match: AT1G43760.1 (DNAse I-like superfamily protein)

HSP 1 Score: 122.5 bits (306), Expect = 2.8e-27
Identity = 105/418 (25.12%), Positives = 179/418 (42.82%), Query Frame = 0

Query: 84  SDTPWLIGGDFN--ALLYQHEKEGGRDKPLSELAAFQNVIDSCALLDLGFVGNRFTWCNR 143
           +D   ++ GDF+  A    H        P+  L  FQN +    L+D+   G  +TW N 
Sbjct: 217 TDQLMILVGDFDQIAATSDHYSVLQTSIPMRGLEEFQNCLRDSDLVDIPSRGVHYTWSNH 276

Query: 144 RPDGTIYERLDRCFSSATWHDIYPNCVVNHLDYHQSDHRPIELVLSPQPGCWRNPSQRIT 203
           + D  I  +LDR  ++  W   +P+ +        SDH P  ++L   P      S++  
Sbjct: 277 QDDNPIIRKLDRAIANGDWFSSFPSAIAVFELSGVSDHSPCIIILENLP----KRSKKCF 336

Query: 204 RFDETWLKRADLQQLVRDSWGLSREDPGLSAPQILAQVSKRCMRSMAGWGRSRMGNFPQR 263
           R+             +  +W    + P  S    L +  K   +      R   GN   +
Sbjct: 337 RYFSFLSTHPTFLVSLTVAW--EEQIPVGSHMFSLGEHLKAAKKCCKLLNRQGFGNIQHK 396

Query: 264 ISEANQKVQLAIEGLRGAGSRELLSQAEA--QLEDVLQEE--------ELYWKQRSREVW 323
             E       A++ L    S+ L + +++  ++E V +++        E +++Q+SR  W
Sbjct: 397 TKE-------ALDSLESIQSQLLTNPSDSLFRVEHVARKKWNFFAAALESFYRQKSRIKW 456

Query: 324 LKEGDQNTRWFHRQASYRQRLNRIRGLTDDQGEWRQDKTMILQLVNDYFQQLF-STSEPS 383
           L++GD NTR+FH+     Q  N I+ L  D     ++ T + +++  Y+  L  S S+  
Sbjct: 457 LQDGDANTRFFHKVILANQAKNLIKFLRMDDDVRVENVTQVKEMIVAYYTHLLGSDSDIL 516

Query: 384 EQDFDISLRDIQRSVDNEMNVELLRPF-TENEILRALKQSHPHKAPGPDGLSGSFYKNHW 443
             D    ++DI     N+     L    ++ EI  A+     +KAPGPD  +  F+   W
Sbjct: 517 TPDSVQRIKDIHPFRCNDTLASRLSALPSDKEITAAVFAMPRNKAPGPDSFTAEFFWESW 576

Query: 444 SIVGPSVVQSCLAVLNHGCSPVSINDTMIVLIPKIKVPRRVSDFRPISLCNFSYKLIS 488
            +V  S + +       G      N T I LIPK+    ++S FRP+S C   YK+I+
Sbjct: 577 FVVKDSTIAAVKEFFRTGHLLKRFNATAITLIPKVTGVDQLSMFRPVSCCTVVYKIIT 621

BLAST of Lag0038334 vs. TAIR 10
Match: ATMG01250.1 (RNA-directed DNA polymerase (reverse transcriptase))

HSP 1 Score: 86.7 bits (213), Expect = 1.7e-16
Identity = 41/69 (59.42%), Positives = 50/69 (72.46%), Query Frame = 0

Query: 586 FNLNGERLGNVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRALISGFRVARSSPPI 645
           F +NG   G V PSRGLRQGDPLSPYLF+LC E LS L R A+ +  + G RV+ +SP I
Sbjct: 12  FIINGAPQGLVTPSRGLRQGDPLSPYLFILCTEVLSGLCRRAQEQGRLPGIRVSNNSPRI 71

Query: 646 SHLFFADDS 655
           +HL FADD+
Sbjct: 72  NHLLFADDT 80

BLAST of Lag0038334 vs. TAIR 10
Match: AT4G20520.1 (RNA binding;RNA-directed DNA polymerases)

HSP 1 Score: 83.2 bits (204), Expect = 1.9e-15
Identity = 49/149 (32.89%), Positives = 72/149 (48.32%), Query Frame = 0

Query: 490 VVNRMKHILPKLISPNQSAFVAGRCVVDNAILGFECIHELRRRTGGKSKWAALKLDMSKA 549
           +V R+K ++  LI P Q++F+ GR   DN +   E +H +RR+ G K  W  LKLD+ KA
Sbjct: 1   MVERLKPLMTNLIGPAQASFIPGRVSTDNIVFVQEAVHSMRRKKGVKG-WMLLKLDLEKA 60

Query: 550 YDRIEWSFLRSVMDRMGFAQQWTDLILRCVSSVSFSFNLNGERLGNVIPSR--------- 609
           YDRI W +L   +   GF + W   I R     +F        +G    S+         
Sbjct: 61  YDRIRWDYLEDTLISAGFPEVWLPEIARS----TFGARRVAPEVGRADASKRPRVSDHRW 120

Query: 610 GLRQGDPLSPYL--FLLCAEGLSSLLRGA 628
           G R  D  +P+    + CAE L  + RG+
Sbjct: 121 GFRYDDMAAPFTSNSVACAELLREIGRGS 144

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_024172304.2	6.1e-263	38.66	uncharacterized protein LOC112178381 [Rosa chinensis]
XP_030936391.1	2.3e-262	37.82	uncharacterized protein LOC115961572 [Quercus lobata]
XP_038718167.1	7.5e-261	37.62	uncharacterized protein LOC120011171 [Tripterygium wilfordii]
XP_030508852.1	3.7e-260	37.59	uncharacterized protein LOC115723496 [Cannabis sativa]
CAB4263564.1	3.5e-258	39.23	unnamed protein product [Prunus armeniaca]

Match Name	E-value	Identity	Description
P0C2F6	6.2e-48	27.12	Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1...
P14381	1.7e-45	25.33	Transposon TX1 uncharacterized 149 kDa protein OS=Xenopus laevis OX=8355 PE=4 SV...
P11369	2.8e-40	25.38	LINE-1 retrotransposable element ORF2 protein OS=Mus musculus OX=10090 GN=Pol PE...
P93295	8.3e-37	46.98	Uncharacterized mitochondrial protein AtMg00310 OS=Arabidopsis thaliana OX=3702 ...
P92555	2.4e-15	59.42	Uncharacterized mitochondrial protein AtMg01250 OS=Arabidopsis thaliana OX=3702 ...

Match Name	E-value	Identity	Description
A0A7N2LIH6	1.0e-263	37.99	Uncharacterized protein OS=Quercus lobata OX=97700 PE=3 SV=1
A0A2N9GB96	9.6e-262	38.81	Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS27778 PE=4 SV=1
A0A7N2R0C3	4.8e-261	36.69	Reverse transcriptase domain-containing protein OS=Quercus lobata OX=97700 PE=4 ...
A0A803PV25	2.0e-259	37.69	Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A6J5TIF9	1.7e-258	39.23	Reverse transcriptase domain-containing protein OS=Prunus armeniaca OX=36596 GN=...

Match Name	E-value	Identity	Description
AT4G29090.1	5.0e-61	31.14	Ribonuclease H-like superfamily protein
ATMG00310.1	5.9e-38	46.98	RNA-directed DNA polymerase (reverse transcriptase)-related family protein
AT1G43760.1	2.8e-27	25.12	DNAse I-like superfamily protein
ATMG01250.1	1.7e-16	59.42	RNA-directed DNA polymerase (reverse transcriptase)
AT4G20520.1	1.9e-15	32.89	RNA binding;RNA-directed DNA polymerases
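
The Identity column in these summary tables repeats the percentage printed in each hit's HSP summary line, which is the number of identical positions divided by the aligned length. The following is a minimal Python sketch, not part of the database's own tooling, that parses one such summary line and recomputes both percentages. The function name parse_hsp_summary and the returned field names are hypothetical, chosen only for illustration.

```python
import re

# Matches the HSP summary line as it appears on this page. "Posi?tives"
# accepts both the usual spelling and the "Postives" variant.
HSP_SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_summary(line):
    """Return counts and recomputed percentages for one HSP summary line."""
    match = HSP_SUMMARY.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: " + line)
    # Both fractions share the same denominator (the aligned length).
    identical, length, positive, _ = (int(g) for g in match.groups())
    return {
        "identical": identical,
        "positive": positive,
        "aligned_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positive_pct": round(100 * positive / length, 2),
    }


if __name__ == "__main__":
    # Summary line of the A0A7N2R0C3 hit, copied from the alignment above.
    line = ("Identity = 492/1341 (36.69%), "
            "Positives = 739/1341 (55.11%), Query Frame = 0")
    print(parse_hsp_summary(line))
```

For the A0A7N2R0C3 hit, 492/1341 recomputes to the 36.69 shown in the Identity column.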
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR000477	Reverse transcriptase domain	PFAM	PF00078	RVT_1	coord: 462..703; e-value: 2.3E-37; score: 128.6
IPR000477	Reverse transcriptase domain	PROSITE	PS50878	RT_POL	coord: 442..708; score: 16.296255
IPR036397	Ribonuclease H superfamily	GENE3D	3.30.420.10		coord: 1161..1289; e-value: 2.2E-9; score: 39.5
IPR002156	Ribonuclease H domain	PFAM	PF13456	RVT_3	coord: 1166..1285; e-value: 6.5E-23; score: 80.9
IPR036691	Endonuclease/exonuclease/phosphatase superfamily	GENE3D	3.60.10.10	Endonuclease/exonuclease/phosphatase	coord: 2..186; e-value: 8.9E-23; score: 83.3
IPR036691	Endonuclease/exonuclease/phosphatase superfamily	SUPERFAMILY	56219	DNase I-like	coord: 9..186
None	No IPR available	PANTHER	PTHR19446	REVERSE TRANSCRIPTASES	coord: 200..1120
None	No IPR available	PANTHER	PTHR19446:SF440	SUBFAMILY NOT NAMED	coord: 200..1120
None	No IPR available	CDD	cd01650	RT_nLTR_like	coord: 458..723; e-value: 7.5804E-53; score: 182.876
IPR044730	Ribonuclease H-like domain, plant type	CDD	cd06222	RNase_H_like	coord: 1165..1285; e-value: 6.4722E-22; score: 90.4512
IPR012337	Ribonuclease H-like superfamily	SUPERFAMILY	53098	Ribonuclease H-like	coord: 1163..1287
IPR043502	DNA/RNA polymerase superfamily	SUPERFAMILY	56672	DNA/RNA polymerases	coord: 401..691
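
Each InterPro match above carries a "coord: start..end" range on the protein sequence. The following is a minimal Python sketch, not the database's own code, that extracts these ranges and orders domains along the protein. The helper names parse_coord and span_length are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

# Matches the "coord: start..end" fragment used in the Alignment column.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Return (start, end) from the first 'coord: start..end' in text."""
    match = COORD.search(text)
    if match is None:
        raise ValueError("no coordinate range found in: " + text)
    return int(match.group(1)), int(match.group(2))


def span_length(start, end):
    """Length of a range, assuming 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    # Three matches copied from the InterPro table above.
    rows = [
        ("PF00078 RVT_1", "coord: 462..703"),
        ("PF13456 RVT_3", "coord: 1166..1285"),
        ("3.60.10.10 Endonuclease/exonuclease/phosphatase", "coord: 2..186"),
    ]
    # Sort by start position to list domains in N- to C-terminal order.
    domains = sorted(parse_coord(coord) + (name,) for name, coord in rows)
    for start, end, name in domains:
        print(name, start, end, span_length(start, end))
```

Sorted this way, the endonuclease/exonuclease/phosphatase match comes first, followed by the reverse transcriptase domain and then the ribonuclease H domain.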

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Lag0038334.1	Lag0038334.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
molecular_function	GO:0003676	nucleic acid binding
molecular_function	GO:0004523	RNA-DNA hybrid ribonuclease activity