Moc06g12620 (gene) Bitter gourd (OHB3-1) v2

Overview
NameMoc06g12620
Typegene
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase domain-containing protein
Locationchr6: 9706056 .. 9712683 (-)
RNA-Seq ExpressionMoc06g12620
SyntenyMoc06g12620
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACCGGACATCCTCCCTCAACCATGAGATCTGCCACCATTGAAAAGAAGAGATTTTCTTTATCTGTTGACCAACGCTTCAGGGGCTCCCGTGCAAAACTCACCGAATCCTCCAGAGACAAAACTTTCTCCATCTCTGTACACCGGTCCTCCCTCCCCTGGCTTTGCAACTGTTTCACTTCACTTTTACATGTTCCTATCAACCAAAAATTCTTTAGAGAGAGCAGGGTTGATGAACAAACCCTTTGGGTCGAGAAAACTACCAACAGAAAGGGCACTTTGGCAGAAATAGCTAAGCTGGATAACAATGGAAGTATCAACAAGCTATTTATTCCAGTTGGTGAAGAAAGAAAGGGATGGCTATCCTTTTATAATCTCATTAAAGACTTCCCAGCTAAATCTTCTCACCCCATTATTGCACCTCAACAACCACCACAGAAACCCTATCTATCGGTTGCTCTCCCCAAATCATCTAACTCGCCTACCTTTTGTGAGATCATAAAGAAAGGCTCCAACCCAGCAGGGTCACAAAAGAACCCGACAAAGAGAGCCCCCCCCCCCCCCCCCCCCCCTTCACAGTTACCCACATGCCAACACCAATCTACATACCTCCATCATCGTCTATAGGCAACACTTCCACGATGACTAGTCTAGAATTATGAGGGCCCTCCAAATACACGTTAGCCACTTCTGCACAGTTAATCCATTCGCTGCTGATAGAGCTCTCCTAAAGTGCGAGGATAAGGAACAAGCTAGAGTACTGGCAGCTATAAAGGATTGGTATAAAGTTGGGTCCTCCATGCTCAGATTTGCCCCTTGGAATAGTGAAGCCTCAAATTCTTTTCCAGTAGTACCATCCTATGGAGGTTGGATCAAAATTAGGAATCTTCCTTTGGACAGATGGGATGATGACACGTTCCATTTTATTGGGGATTCCTGCGGGGGTTTCATTGAAATCGCTAAGAAGACCCTCTCTCGATTAGACCTTTATGAAGCCACTATTAAACACGGGTTTTATTCCGGCGACAATACCCATTCCATCGGCTTCTCACGGACATACAACCGTTCAGATAGACCCATTTTTTGCTCCAGAAAACTTCATTGGATATTCAGCCGGAATCCATGGAAGCTTGATGTCCACGCGCCGGGAAACACCGATGGTAGAGATTGGTGATGGGATCCTCGGACCGTACCCTCGAGCTGCACAAACTGACAAGATTGGAAGCCGTGAAACAGAGAACTGCACCATTGAGACAGATCTGAATTGCTCTGTCTCTGAAACTGACCACATATCTGGCCTGCTGGAACTGGAAGTGTTTCAGGCTGCACAGAAAGACACTGCAGCCATAGAGAAGAGTGCCAATCACCAAAAAGGAAAAAGTGTAATAAATGAACCTTTTCTACCTTTTCCTTTTTCAGAAAACAATGAACCCCCACCCTCTGACCCACCCCTTTTAACCACTGATTCTACTCTGCCACAGTCCTTTCGCATCATCTTTTCCATCTGATACTCTCAAACCCATCCAAATTGCATCAAAAAAGACCTTCCTCGTTCCTGGCACCAAATTGACCACAAATCCTCTTCCATCTGAGGTTGATTCTGAAAACTTTCTCTCCAGCCCCTACCCTTCCTCACTAGCATCCCTCCCACCTCCTACTCCACCCGCCTCTCCTTCTCCTGCCCCTCTTTTCGCGCCAGAAACATTGCCCAACCTATTCCCGAACCCTCTTGCCATTCAAGTTGTGGATGCCCCTCCTATTACCCCTTTCCCCCTCATTGACTGTACGCAACACCCCTCGGCTTATCTACAAGTCGTAGTCCCTTGGCTCAACACTATTGGTCTTGGTATCCTCCCTCTCCCTACAAAGACCACAAAAAAGACCCAGGAACAGAAGAAAGAAGTTAAACTCGTTCGTGAATTAGCTGGTCTTTACTCATCCATCAACTATGATGGCCCCTCATCTTCGCACACAAGGAAAGGGTCTTTGGTTATCTCATGAATGTTATCTCCTGGAACGTGAGGGGTATAGGCTCTAAAGAAAAAAGAGCCCATGTTAAAAGTGTCATTCAGAAGCATCACCCAACCATAGTGATCCTTCAGGAGACAAAAGTCGCGGGAGTGGACCGTTTTTTCATCAAAACTCTCTGGAGCTCGAGGAACATTGCTTGGGCCATCCAAAATTCCATTGGCGCTTCTGGTGGCATCATCATCCTCTGGAATGACCCGGCCATTAAAGTCAATGACATAAAAATAGGTGCTTTCTCTCTCACCCTCCATATCACCCTTGTTGATGGCTTCCATTTTTGGCTAACAGGAATTTATGGCCCCCCAAGAACGAGAGATAGAGGCCTTTTCTGGGATGAATTAGCCAATCTCACCTTTCTCTGTGCTGAAAGATGGCTGCTGGGAGGTGACTTCAACGTCACAAGATGGGTTCATGAAAAATCTTCACATCGCCGCCCCACCCGTAGTATGAGATTATTCAACAACTTCATTGACACAGCCAACCTAAGGGACCTCCCTCTCACCAATGGTTTGTATACATGGTCTAATTTCCGAGAATCTCCCCATCTCTCATTATTGGACAGGTACCTATGCTCTGACTTGGTCCTTTCTAATTTTCCTAACGCTATGGTTAAGAGATTGAATAGGGAAACATCATATCACTTCCCTATCCAACTTGCACTTGGTGCCATCCGTTGGGGCCCTACACCCTCTCGATTCGATAATGAGTGGCTGCAGCAAGCAACTTTCCAGCCCCTGATTGAAGGCTGGTGGAACAACAATCCTCTTCATGGATGGCCAGGTCATGGATTCATCCAAAAACTCAAAGCCCTCAAAGTTGTGATAAAAGATTGGAAAGCAAACTTCATTGACAGTTCTTATCGACACAAGGAACAGCTACTGACAGAATTAAACATCCTAGACTCCCTTGAAGAGGAAGGTTCCATCCAAACAGTGCAAATGGCTCAAAGCATCTCACTAAAAGATCAGCTACATTCTTTAGCCATAGCAGAAGAAGCTCATTGGAGACAGAGATGCAAATTGAAATGGCTTAAAGAGGGAGACCTGAACACTGGATTCTTCCACAGAGTTGTTGCTGCCAGACGTAGAAAAAGCTCCATTATTGAGCTGCTATCCAGGGATGGTAAGAGTTTGGTTAATGATACAGAGATAGAAGCAGAGTTCCTCTCCTTTTATAAAAACCTATTTTCCAAAAAAGATGGCACCAGATTTCTACCTCACCCTATCAATTGGGACCCCATCTCACAGCAACAGTCAACCCTCCTTGAAGCCCCCTTTCTTGAGACAGAGGTTTGGCAAGCTGTTAAGGATCTTGGTACAAACAAAACCCCTGGACCTGATGGCTTCACAGCTGAATTCTATAAAAAATTTTGGAACATCCTCAAAACCGATATCATGAGAGTGTTCCAAGATTTTTTCAAGAATGGGATCATAAATGCTAGCCTTAACGAGACCTATATCTGCCTTATACCTAAGAAGATTGACGCAAGAACAGTCAGTGACTATCGGCCAATTAGTCTGATATCGTGTATGTATAAGATCATAGCTAGAGTCCTCTCGGAGAGGCTTAAGAGGGTCCTTCCACATACAATCTCAGCTACACAATCGGCTTTTGTGGCTAACAGACAGATTTTGGATGCCTCTCTGATTGCAAATGAGATCATTGACACATGGAATAGGAAAAAGAAAAAAGGGGTAGTCATCAAGCTTGATATTGAGAAAGCTTTCGATAAAGTTGATTGGGACTATCTAGACGGTATTCTCTCAGCTAAGGGCTTTGGCACAACATGGAGGAAATGGATTAGAGGCTGCATCTCATCGGCCAATTACTCTATCATCATAAACGGGCGTCCCAAAGGAAAAATTAGGGCTTCTAGGGGTCTTAGACAAGGTGATCCCCTCTCCCCGTTTTTATTTATTCTTGTGGTTGACTGTTTTAGTAGGCTCCTAACACAAGCTGCTGATTTTGACGCGATTGAGGGCTTTGCTGTTGGTAATCCTCCCACTCACCTACAATTTGCGGATGATACCCTCCTCTTCTCATCTTCGAAAGACAGTAAACTCCAGAATTTATTCAACTTTATAAAGGTTTTTGAAGAGGCATCAGGTTTAAACTCCAACCTACAGAAAACAGAGATGATGGGCATTAACTTGGAAGACAATGTCTTAGAATCATTGGCCGTCAGATTTGACTGCAGAAAGGGTTCTTGGCCCAACACATACCTCGGCCTTCCTCTAAATGGTAACCCTCGATCCCCCTCCTTTTGGGACCCTATCATGGAGAAAATCAAAAAAAGGCTTTCCTCTTGGGAGCATAATCATATATCCAAAGGAGGCAGACTCACCCTTATCAATGCCACACTCTCCAATCTCCCCATATACTTCCTCTCTTTATTCTCCATACCAACCAAGGTTGCCAACGAACTTGATAAAATTGTGAGGAACTTTCTGTGGAAGGGGTCTATGGACAAGAAGGGACAAAACCTGGTTAGATGGGATACTGTGCAGAAACCTATTGATTTTGGAGGTCTGGGAATCACTAGCATCAAAGCCAAAAATACAGCTCTATTAGCAAAGTGGAATTGGAGGTTCATAACAGAGGAATCGTCTCTTTGGAGACGAGTGATTCAAGCCAAATACTCCATAGTTGACTATCACAGTCCCCTCCGTACTCTCCCTGCTGCCAGAGGTCCATGGAGAGCCATTTCCAAGCTAAATTCTCTGATTTTGGACAGAATATCCTACCGCTTGGGAGAGGGCTCCCTCCCCTTGTTTTGGAAAGACACTTGGATCAACGATGAACCTTTGTGTCGCACTTACCCTCTCCTTTTCGCTTTGCATACCAGGAAACTTGGTCTTGTGAGGGATTTCTGGTCTACTGAAACAAATTCATGGGATCTTAATTTCCATAGAAACCTAAAGGATGTTGAGATTATTGAACTGGTAGCTCTCCTACACTGTCTTTCTTCACAGAGGCCTTCTCTAAATAGGGACTCTGACGCCTGGAGGTTGGATCCCCTTGGATCATTCACTACCAGTTCTCTCCTCAATGATCTCCAAAGAAACAACACTTCGTCTCCTCCAACAGATTTGTACAAAGCTATATGGAAGGATTCCTACCCAAAGAAGATTAAATTTTTCCTTTGGGAAACCAGCCTACAAGCTCTAAATACGCATGACAAACTACAGCGTAGGATGCCCTATATGGCCTTATCTCCTCATTGGTGTCCTCTTTGCAAGCTACAAAGCGAATCCATTGGGCATACTCTCCTTACATGCCCCTTCTCTACTGCCCTTTGGAACAGAATTTTATCCATCTTTGATTGGTCGGTTGCCCTCCCGACAGATATGTCTCAATTGCTCGCATTAACTCTTGTTGGACATCCCTTCAAAAAACGAAAAGCTAGCCTTTGGAGCCACTTCATACGGGCCCTTCTGTGGACTATTTGGACTGAAAGGAACCATCGAATTTTCCAAGCTATGGAGTGCACTTTACATGCCACTTTTGAATCCATTGTTTTTTTGGTCATTACTTGGTGTAAATGCTCCCCTCTCTTTCACTCCTATAGCTTTGCCTCCATGTTAGCTAATTGGAGAGCCTTTTTGTAACCCGCCTGTGGGTCTTCCTTCATTTTGTACTCTTCTCTCGCCTCTCTTTTTCATTTTATCAATGAAATCGTTTCAGTTACCAAAAAAAACAAACAAAAAAAAAGTTATCCTCACTGAACAATAATGTATCATCCAAAAACTAGGGTGCAAAATGAGACAAAAGAGAGGAAAAAACCCGATGAAACTAGTTAAATTGTGGCAGTTCAATGAGCTTGAGCATTATTTGTTGGATAGATTTTATCATAATAAATATATTCGGTTTTGATCTGGGCAGTACATTCACCTTTTTTTTTCTTATACTAAGAGGTCAATTTTGGATGTGAGTTTTGGCCGAAAATTGAGCTGAACCAACTCACTTACACCCCCCTACAGAGTGGTTACGGGTGTTAAATGTTAATAGAGCATTGAAACAACCAGGAGTGGGCTAATTAACTATTGGTAGAAATAAGGTGTGTCAAACTCTAACACCAACCAATACCGGTTGGCAAAAGAAAATAAATCCAAAGGGAGGAAGAATTCAAAAAAGTTGTAGAGAGTTTTGATTTCTGAGATCAACTTAATAAGAAAGAGCATGAAATTATCTTAGATCAGAAAGTCATGACAATAAAGAAAAGAAGTCCCTTTAAGTTGGGAGGAACAATAAGATTTTTAACCAGAAGGGAAAATCCGTAGATTATTTTTATTTCATAGAAACTTGTCTCTCCTTCAATAATTATAGTGTTGATACTCTTCGTGCTACTCGAAAGAGTTTTTTATTATCTACATTGGCCCGTGGTTTGGCTTTCATGATTTTGTTTCTTTAATTTCATATCATCTATAGAAACATTTCTAAAAAAAATTTTTACAAAAAAAGCCTAGTTAAAAGTAAGGAGGATTCATTACCTATGATGATTAGAATTTTGGTTCACTGTAGTCATAATCAAGGGGTGAAAGCCATCCAAGTAGAAGTATTAGCGTCTAGGCTCAGATCTGATGTGAAGTATCCTGCATGA

mRNA sequence

ATGACCGGACATCCTCCCTCAACCATGAGATCTGCCACCATTGAAAAGAAGAGATTTTCTTTATCTGTTGACCAACGCTTCAGGGGCTCCCGTGCAAAACTCACCGAATCCTCCAGAGACAAAACTTTCTCCATCTCTGTACACCGGTCCTCCCTCCCCTGGCTTTGCAACTGTTTCACTTCACTTTTACATGTTCCTATCAACCAAAAATTCTTTAGAGAGAGCAGGGTTGATGAACAAACCCTTTGGGTCGAGAAAACTACCAACAGAAAGGGCACTTTGGCAGAAATAGCTAAGCTGGATAACAATGGAAGTATCAACAAGCTATTTATTCCAGTTGGTGAAGAAAGAAAGGGATGGCTATCCTTTTATAATCTCATTAAAGACTTCCCAGCTAAATCTTCTCACCCCATTATTGCACCTCAACAACCACCACAGAAACCCTATCTATCGGTTGCTCTCCCCAAATCATCTAACTCGCCTACCTTTTTTAATCCATTCGCTGCTGATAGAGCTCTCCTAAAGTGCGAGGATAAGGAACAAGCTAGAGTACTGGCAGCTATAAAGGATTGGTATAAAGTTGGGTCCTCCATGCTCAGATTTGCCCCTTGGAATAGTGAAGCCTCAAATTCTTTTCCAGTAGTACCATCCTATGGAGCCGGAATCCATGGAAGCTTGATGTCCACGCGCCGGGAAACACCGATGGTAGAGATTGGTGATGGGATCCTCGGACCGTACCCTCGAGCTGCACAAACTGACAAGATTGGAAGCCGTGAAACAGAGAACTGCACCATTGAGACAGATCTGAATTGCTCTGTCTCTGAAACTGACCACATATCTGGCCTGCTGGAACTGGAAGTGTTTCAGGCTGCACAGAAAGACACTGCAGCCATAGAGAAGAGTGCCAATCACCAAAAAGGAAAAAGTACCTTCCTCGTTCCTGGCACCAAATTGACCACAAATCCTCTTCCATCTGAGGTTGATTCTGAAAACTTTCTCTCCAGCCCCTACCCTTCCTCACTAGCATCCCTCCCACCTCCTACTCCACCCGCCTCTCCTTCTCCTGCCCCTCTTTTCGCGCCAGAAACATTGCCCAACCTATTCCCGAACCCTCTTGCCATTCAAGTTGTGGATGCCCCTCCTATTACCCCTTTCCCCCTCATTGACTGTACGCAACACCCCTCGGCTTATCTACAAGTCGTAGTCCCTTGGCTCAACACTATTGGTCTTGGTATCCTCCCTCTCCCTACAAAGACCACAAAAAAGACCCAGGAACAGAAGAAAGAAGTTAAACTCGTTCGTGAATTAGCTGGCTCTAAAGAAAAAAGAGCCCATGTTAAAAGTGTCATTCAGAAGCATCACCCAACCATAGTGATCCTTCAGGAGACAAAAGTCGCGGGAGTGGACCGTTTTTTCATCAAAACTCTCTGGAGCTCGAGGAACATTGCTTGGGCCATCCAAAATTCCATTGGCGCTTCTGGTGGCATCATCATCCTCTGGAATGACCCGGCCATTAAAGTCAATGACATAAAAATAGGTGCTTTCTCTCTCACCCTCCATATCACCCTTGTTGATGGCTTCCATTTTTGGCTAACAGGAATTTATGGCCCCCCAAGAACGAGAGATAGAGGCCTTTTCTGGGATGAATTAGCCAATCTCACCTTTCTCTGTGCTGAAAGATGGCTGCTGGGAGGTGACTTCAACGTCACAAGATGGGTTCATGAAAAATCTTCACATCGCCGCCCCACCCGTAGTATGAGATTATTCAACAACTTCATTGACACAGCCAACCTAAGGGACCTCCCTCTCACCAATGGTTTGTATACATGGTCTAATTTCCGAGAATCTCCCCATCTCTCATTATTGGACAGGTACCTATGCTCTGACTTGGTCCTTTCTAATTTTCCTAACGCTATGGTTAAGAGATTGAATAGGGAAACATCATATCACTTCCCTATCCAACTTGCACTTGGTGCCATCCGTTGGGGCCCTACACCCTCTCGATTCGATAATGAGTGGCTGCAGCAAGCAACTTTCCAGCCCCTGATTGAAGGCTGGTGGAACAACAATCCTCTTCATGGATGGCCAGGTCATGGATTCATCCAAAAACTCAAAGCCCTCAAAGTTGTGATAAAAGATTGGAAAGCAAACTTCATTGACAGTTCTTATCGACACAAGGAACAGCTACTGACAGAATTAAACATCCTAGACTCCCTTGAAGAGGAAGGTTCCATCCAAACAGTGCAAATGGCTCAAAGCATCTCACTAAAAGATCAGCTACATTCTTTAGCCATAGCAGAAGAAGCTCATTGGAGACAGAGATGCAAATTGAAATGGCTTAAAGAGGGAGACCTGAACACTGGATTCTTCCACAGAGTTGTTGCTGCCAGACGTAGAAAAAGCTCCATTATTGAGCTGCTATCCAGGGATGATTTCTACCTCACCCTATCAATTGGGACCCCATCTCACAGCAACAGTCAACCCTCCTTGAAGCCCCCTTTCTTGAGACAGAGGCTCCTAACACAAGCTGCTGATTTTGACGCGATTGAGGGCTTTGCTGTTGGTAATCCTCCCACTCACCTACAATTTGCGGATGATACCCTCCTCTTCTCATCTTCGAAAGACAGTAAACTCCAGAATTTATTCAACTTTATAAAGGTTTTTGAAGAGGCATCAGGTTTAAACTCCAACCTACAGAAAACAGAGATGATGGGCATTAACTTGGAAGACAATGTCTTAGAATCATTGGCCGTCAGATTTGACTGCAGAAAGGGTTCTTGGCCCAACACATACCTCGGCCTTCCTCTAAATGGTAACCCTCGATCCCCCTCCTTTTGGGACCCTATCATGGAGAAAATCAAAAAAAGGCTTTCCTCTTGGGAGCATAATCATATATCCAAAGGAGGCAGACTCACCCTTATCAATGCCACACTCTCCAATCTCCCCATATACTTCCTCTCTTTATTCTCCATACCAACCAAGGTTGCCAACGAACTTGATAAAATTGTGAGGAACTTTCTGTGGAAGGGGTCTATGGACAAGAAGGGACAAAACCTGGTTAGATGGGATACTGTGCAGAAACCTATTGATTTTGGAGGTCTGGGAATCACTAGCATCAAAGCCAAAAATACAGCTCTATTAGCAAAGTGGAATTGGAGGTTCATAACAGAGGAATCGTCTCTTTGGAGACGAGTGATTCAAGCCAAATACTCCATAGTTGACTATCACAGTCCCCTCCGTACTCTCCCTGCTGCCAGAGGTCCATGGAGAGCCATTTCCAAGCTAAATTCTCTGATTTTGGACAGAATATCCTACCGCTTGGGAGAGGGCTCCCTCCCCTTGTTTTGGAAAGACACTTGGATCAACGATGAACCTTTGTGTCGCACTTACCCTCTCCTTTTCGCTTTGCATACCAGGAAACTTGGTCTTGTGAGGGATTTCTGGTCTACTGAAACAAATTCATGGGATCTTAATTTCCATAGAAACCTAAAGGATGTTGAGATTATTGAACTGGTAGCTCTCCTACACTGTCTTTCTTCACAGAGGCCTTCTCTAAATAGGGACTCTGACGCCTGGAGGTTGGATCCCCTTGGATCATTCACTACCAGTTCTCTCCTCAATGATCTCCAAAGAAACAACACTTCGTCTCCTCCAACAGATTTGTACAAAGCTATATGGAAGGATTCCTACCCAAAGAAGATTAAATTTTTCCTTTGGGAAACCAGCCTACAAGCTCTAAATACGCATGACAAACTACAGCGTAGGATGCCCTATATGGCCTTATCTCCTCATTGGTGTCCTCTTTGCAAGCTACAAAGCGAATCCATTGGGCATACTCTCCTTACATGCCCCTTCTCTACTGCCCTTTGGAACAGAATTTTATCCATCTTTGATTGGTCGGTTGCCCTCCCGACAGATATGTCTCAATTGCTCGCATTAACTCTTGTTGGACATCCCTTCAAAAAACGAAAAGCTAGCCTTTGGAGCCACTTCATACGGGCCCTTCTGTGGACTATTTGGACTGAAAGGAACCATCGAATTTTCCAAGCTATGGAGTGCACTTTACATGCCACTTTTGAATCCATTGTTTTTTTGGTCATTACTTGTCATAATCAAGGGGTGAAAGCCATCCAAGTAGAAGTATTAGCGTCTAGGCTCAGATCTGATGTGAAGTATCCTGCATGA

Coding sequence (CDS)

ATGACCGGACATCCTCCCTCAACCATGAGATCTGCCACCATTGAAAAGAAGAGATTTTCTTTATCTGTTGACCAACGCTTCAGGGGCTCCCGTGCAAAACTCACCGAATCCTCCAGAGACAAAACTTTCTCCATCTCTGTACACCGGTCCTCCCTCCCCTGGCTTTGCAACTGTTTCACTTCACTTTTACATGTTCCTATCAACCAAAAATTCTTTAGAGAGAGCAGGGTTGATGAACAAACCCTTTGGGTCGAGAAAACTACCAACAGAAAGGGCACTTTGGCAGAAATAGCTAAGCTGGATAACAATGGAAGTATCAACAAGCTATTTATTCCAGTTGGTGAAGAAAGAAAGGGATGGCTATCCTTTTATAATCTCATTAAAGACTTCCCAGCTAAATCTTCTCACCCCATTATTGCACCTCAACAACCACCACAGAAACCCTATCTATCGGTTGCTCTCCCCAAATCATCTAACTCGCCTACCTTTTTTAATCCATTCGCTGCTGATAGAGCTCTCCTAAAGTGCGAGGATAAGGAACAAGCTAGAGTACTGGCAGCTATAAAGGATTGGTATAAAGTTGGGTCCTCCATGCTCAGATTTGCCCCTTGGAATAGTGAAGCCTCAAATTCTTTTCCAGTAGTACCATCCTATGGAGCCGGAATCCATGGAAGCTTGATGTCCACGCGCCGGGAAACACCGATGGTAGAGATTGGTGATGGGATCCTCGGACCGTACCCTCGAGCTGCACAAACTGACAAGATTGGAAGCCGTGAAACAGAGAACTGCACCATTGAGACAGATCTGAATTGCTCTGTCTCTGAAACTGACCACATATCTGGCCTGCTGGAACTGGAAGTGTTTCAGGCTGCACAGAAAGACACTGCAGCCATAGAGAAGAGTGCCAATCACCAAAAAGGAAAAAGTACCTTCCTCGTTCCTGGCACCAAATTGACCACAAATCCTCTTCCATCTGAGGTTGATTCTGAAAACTTTCTCTCCAGCCCCTACCCTTCCTCACTAGCATCCCTCCCACCTCCTACTCCACCCGCCTCTCCTTCTCCTGCCCCTCTTTTCGCGCCAGAAACATTGCCCAACCTATTCCCGAACCCTCTTGCCATTCAAGTTGTGGATGCCCCTCCTATTACCCCTTTCCCCCTCATTGACTGTACGCAACACCCCTCGGCTTATCTACAAGTCGTAGTCCCTTGGCTCAACACTATTGGTCTTGGTATCCTCCCTCTCCCTACAAAGACCACAAAAAAGACCCAGGAACAGAAGAAAGAAGTTAAACTCGTTCGTGAATTAGCTGGCTCTAAAGAAAAAAGAGCCCATGTTAAAAGTGTCATTCAGAAGCATCACCCAACCATAGTGATCCTTCAGGAGACAAAAGTCGCGGGAGTGGACCGTTTTTTCATCAAAACTCTCTGGAGCTCGAGGAACATTGCTTGGGCCATCCAAAATTCCATTGGCGCTTCTGGTGGCATCATCATCCTCTGGAATGACCCGGCCATTAAAGTCAATGACATAAAAATAGGTGCTTTCTCTCTCACCCTCCATATCACCCTTGTTGATGGCTTCCATTTTTGGCTAACAGGAATTTATGGCCCCCCAAGAACGAGAGATAGAGGCCTTTTCTGGGATGAATTAGCCAATCTCACCTTTCTCTGTGCTGAAAGATGGCTGCTGGGAGGTGACTTCAACGTCACAAGATGGGTTCATGAAAAATCTTCACATCGCCGCCCCACCCGTAGTATGAGATTATTCAACAACTTCATTGACACAGCCAACCTAAGGGACCTCCCTCTCACCAATGGTTTGTATACATGGTCTAATTTCCGAGAATCTCCCCATCTCTCATTATTGGACAGGTACCTATGCTCTGACTTGGTCCTTTCTAATTTTCCTAACGCTATGGTTAAGAGATTGAATAGGGAAACATCATATCACTTCCCTATCCAACTTGCACTTGGTGCCATCCGTTGGGGCCCTACACCCTCTCGATTCGATAATGAGTGGCTGCAGCAAGCAACTTTCCAGCCCCTGATTGAAGGCTGGTGGAACAACAATCCTCTTCATGGATGGCCAGGTCATGGATTCATCCAAAAACTCAAAGCCCTCAAAGTTGTGATAAAAGATTGGAAAGCAAACTTCATTGACAGTTCTTATCGACACAAGGAACAGCTACTGACAGAATTAAACATCCTAGACTCCCTTGAAGAGGAAGGTTCCATCCAAACAGTGCAAATGGCTCAAAGCATCTCACTAAAAGATCAGCTACATTCTTTAGCCATAGCAGAAGAAGCTCATTGGAGACAGAGATGCAAATTGAAATGGCTTAAAGAGGGAGACCTGAACACTGGATTCTTCCACAGAGTTGTTGCTGCCAGACGTAGAAAAAGCTCCATTATTGAGCTGCTATCCAGGGATGATTTCTACCTCACCCTATCAATTGGGACCCCATCTCACAGCAACAGTCAACCCTCCTTGAAGCCCCCTTTCTTGAGACAGAGGCTCCTAACACAAGCTGCTGATTTTGACGCGATTGAGGGCTTTGCTGTTGGTAATCCTCCCACTCACCTACAATTTGCGGATGATACCCTCCTCTTCTCATCTTCGAAAGACAGTAAACTCCAGAATTTATTCAACTTTATAAAGGTTTTTGAAGAGGCATCAGGTTTAAACTCCAACCTACAGAAAACAGAGATGATGGGCATTAACTTGGAAGACAATGTCTTAGAATCATTGGCCGTCAGATTTGACTGCAGAAAGGGTTCTTGGCCCAACACATACCTCGGCCTTCCTCTAAATGGTAACCCTCGATCCCCCTCCTTTTGGGACCCTATCATGGAGAAAATCAAAAAAAGGCTTTCCTCTTGGGAGCATAATCATATATCCAAAGGAGGCAGACTCACCCTTATCAATGCCACACTCTCCAATCTCCCCATATACTTCCTCTCTTTATTCTCCATACCAACCAAGGTTGCCAACGAACTTGATAAAATTGTGAGGAACTTTCTGTGGAAGGGGTCTATGGACAAGAAGGGACAAAACCTGGTTAGATGGGATACTGTGCAGAAACCTATTGATTTTGGAGGTCTGGGAATCACTAGCATCAAAGCCAAAAATACAGCTCTATTAGCAAAGTGGAATTGGAGGTTCATAACAGAGGAATCGTCTCTTTGGAGACGAGTGATTCAAGCCAAATACTCCATAGTTGACTATCACAGTCCCCTCCGTACTCTCCCTGCTGCCAGAGGTCCATGGAGAGCCATTTCCAAGCTAAATTCTCTGATTTTGGACAGAATATCCTACCGCTTGGGAGAGGGCTCCCTCCCCTTGTTTTGGAAAGACACTTGGATCAACGATGAACCTTTGTGTCGCACTTACCCTCTCCTTTTCGCTTTGCATACCAGGAAACTTGGTCTTGTGAGGGATTTCTGGTCTACTGAAACAAATTCATGGGATCTTAATTTCCATAGAAACCTAAAGGATGTTGAGATTATTGAACTGGTAGCTCTCCTACACTGTCTTTCTTCACAGAGGCCTTCTCTAAATAGGGACTCTGACGCCTGGAGGTTGGATCCCCTTGGATCATTCACTACCAGTTCTCTCCTCAATGATCTCCAAAGAAACAACACTTCGTCTCCTCCAACAGATTTGTACAAAGCTATATGGAAGGATTCCTACCCAAAGAAGATTAAATTTTTCCTTTGGGAAACCAGCCTACAAGCTCTAAATACGCATGACAAACTACAGCGTAGGATGCCCTATATGGCCTTATCTCCTCATTGGTGTCCTCTTTGCAAGCTACAAAGCGAATCCATTGGGCATACTCTCCTTACATGCCCCTTCTCTACTGCCCTTTGGAACAGAATTTTATCCATCTTTGATTGGTCGGTTGCCCTCCCGACAGATATGTCTCAATTGCTCGCATTAACTCTTGTTGGACATCCCTTCAAAAAACGAAAAGCTAGCCTTTGGAGCCACTTCATACGGGCCCTTCTGTGGACTATTTGGACTGAAAGGAACCATCGAATTTTCCAAGCTATGGAGTGCACTTTACATGCCACTTTTGAATCCATTGTTTTTTTGGTCATTACTTGTCATAATCAAGGGGTGAAAGCCATCCAAGTAGAAGTATTAGCGTCTAGGCTCAGATCTGATGTGAAGTATCCTGCATGA

Protein sequence

MTGHPPSTMRSATIEKKRFSLSVDQRFRGSRAKLTESSRDKTFSISVHRSSLPWLCNCFTSLLHVPINQKFFRESRVDEQTLWVEKTTNRKGTLAEIAKLDNNGSINKLFIPVGEERKGWLSFYNLIKDFPAKSSHPIIAPQQPPQKPYLSVALPKSSNSPTFFNPFAADRALLKCEDKEQARVLAAIKDWYKVGSSMLRFAPWNSEASNSFPVVPSYGAGIHGSLMSTRRETPMVEIGDGILGPYPRAAQTDKIGSRETENCTIETDLNCSVSETDHISGLLELEVFQAAQKDTAAIEKSANHQKGKSTFLVPGTKLTTNPLPSEVDSENFLSSPYPSSLASLPPPTPPASPSPAPLFAPETLPNLFPNPLAIQVVDAPPITPFPLIDCTQHPSAYLQVVVPWLNTIGLGILPLPTKTTKKTQEQKKEVKLVRELAGSKEKRAHVKSVIQKHHPTIVILQETKVAGVDRFFIKTLWSSRNIAWAIQNSIGASGGIIILWNDPAIKVNDIKIGAFSLTLHITLVDGFHFWLTGIYGPPRTRDRGLFWDELANLTFLCAERWLLGGDFNVTRWVHEKSSHRRPTRSMRLFNNFIDTANLRDLPLTNGLYTWSNFRESPHLSLLDRYLCSDLVLSNFPNAMVKRLNRETSYHFPIQLALGAIRWGPTPSRFDNEWLQQATFQPLIEGWWNNNPLHGWPGHGFIQKLKALKVVIKDWKANFIDSSYRHKEQLLTELNILDSLEEEGSIQTVQMAQSISLKDQLHSLAIAEEAHWRQRCKLKWLKEGDLNTGFFHRVVAARRRKSSIIELLSRDDFYLTLSIGTPSHSNSQPSLKPPFLRQRLLTQAADFDAIEGFAVGNPPTHLQFADDTLLFSSSKDSKLQNLFNFIKVFEEASGLNSNLQKTEMMGINLEDNVLESLAVRFDCRKGSWPNTYLGLPLNGNPRSPSFWDPIMEKIKKRLSSWEHNHISKGGRLTLINATLSNLPIYFLSLFSIPTKVANELDKIVRNFLWKGSMDKKGQNLVRWDTVQKPIDFGGLGITSIKAKNTALLAKWNWRFITEESSLWRRVIQAKYSIVDYHSPLRTLPAARGPWRAISKLNSLILDRISYRLGEGSLPLFWKDTWINDEPLCRTYPLLFALHTRKLGLVRDFWSTETNSWDLNFHRNLKDVEIIELVALLHCLSSQRPSLNRDSDAWRLDPLGSFTTSSLLNDLQRNNTSSPPTDLYKAIWKDSYPKKIKFFLWETSLQALNTHDKLQRRMPYMALSPHWCPLCKLQSESIGHTLLTCPFSTALWNRILSIFDWSVALPTDMSQLLALTLVGHPFKKRKASLWSHFIRALLWTIWTERNHRIFQAMECTLHATFESIVFLVITCHNQGVKAIQVEVLASRLRSDVKYPA
Homology
BLAST of Moc06g12620 vs. NCBI nr
Match: RVW70235.1 (LINE-1 retrotransposable element ORF2 protein [Vitis vinifera])

HSP 1 Score: 594.0 bits (1530), Expect = 3.4e-165
Identity = 374/1184 (31.59%), Postives = 527/1184 (44.51%), Query Frame = 0

Query: 438  GSKEKRAHVKSVIQKHHPTIVILQETKVAGVDRFFIKTLWSSRNIAWAIQNSIGASGGII 497
            GSK+KR  VK  ++   P +V+ QETK    DR F+ ++W++RN  WA   + GASGGI+
Sbjct: 823  GSKKKRRVVKDFLRSEKPDVVMFQETKKEECDRRFVGSVWTARNKDWAALPACGASGGIL 882

Query: 498  ILWNDPAIKVNDIKIGAFSLTLHITLVDGFHFWLTGIYGPPRTRDRGLFWDELANLTFLC 557
            I+W+   +   ++ +G+FS+++  TL      WL+ +YGP  +  R   W EL+++  L 
Sbjct: 883  IIWDTKKLSREEVMLGSFSVSIKFTLNGCESLWLSAVYGPNNSALRKDLWVELSDIAGLA 942

Query: 558  AERWLLGGDFNVTRWVHEKSSHRRPTRSMRLFNNFIDTANLRDLPLTNGLYTWSNFRESP 617
            + RW +GGDFNV R   EK    R T SM+ F++FI    L DLPL +  +TWSN + +P
Sbjct: 943  SPRWCVGGDFNVIRRSSEKLGGSRLTPSMKDFDDFISDCELIDLPLRSASFTWSNMQVNP 1002

Query: 618  HLSLLDRYLCSDLVLSNFPNAMVKRLNRETSYHFPIQLALGAIRWGPTPSRFDNEWLQQA 677
                LDR+L S+     FP ++   L R TS H+PI L     +WGPTP RF+N WLQ  
Sbjct: 1003 VCKRLDRFLYSNEWEQTFPQSIQGVLPRWTSDHWPIVLETNPFKWGPTPFRFENMWLQHP 1062

Query: 678  TFQPLIEGWWNNNPLHGWPGHGFIQKLKALKVVIKDW-KANFIDSSYRHKEQLLTELNIL 737
            +F+     WW     +GW GH F++KL+ +K  +K W KA+F + S R KE +L+ L   
Sbjct: 1063 SFKENFGRWWREFQGNGWEGHKFMRKLQFVKAKLKVWNKASFGELSKR-KEDILSALVNF 1122

Query: 738  DSLEEEGSIQTVQMAQSISLKDQLHSLAIAEEAHWRQRCKLKWLKEGDLNTGFFHRVVAA 797
            DSLE+EG +    +AQ    K +L  L + EE HWRQ+ ++KW+KEGD N+ FFH+V   
Sbjct: 1123 DSLEQEGGLSHELLAQRAIKKGELEELILREEIHWRQKARVKWVKEGDCNSKFFHKVANG 1182

Query: 798  RRRKSSIIELLSRDDFYLTLSIG----------------------------TPSHSNSQP 857
            RR +  I EL + +   +  S                              +P    S  
Sbjct: 1183 RRNRKFIKELENENGQMMNNSESIKEEILRYFEKLYTSPSGESWRVEGLDWSPISGESAV 1242

Query: 858  SLKPPF------------------------------------------------------ 917
             L+ PF                                                      
Sbjct: 1243 RLESPFTEEEICKAIFQMDRDKAPGPDGFTIAVFQDCWEVIKEDLVKVFTEFHRSGIINQ 1302

Query: 918  ------------------------------------------------------------ 977
                                                                        
Sbjct: 1303 STNASFIVLLPKKSMSRRISDFRPISLITSLYKIIAKVLAGRIREVLHETIHSTQGAFVQ 1362

Query: 978  ------------------------------------------------------------ 1037
                                                                        
Sbjct: 1363 GRQILDAVLIANEIVDEKRRSGEEGVVFKIDFEKAYDHVSWDFLDHVMEMKGFGIRWRKW 1422

Query: 1038 ---------------------------LRQ-----------------RLLTQAADFDAIE 1097
                                       LRQ                 R+L +A + + +E
Sbjct: 1423 MRGCLSSVSFAVLVNGNAKGWVKASRGLRQGDPLSPFLFTIVADVLSRMLLKAEERNVLE 1482

Query: 1098 GFAVGNPPT---HLQFADDTLLFSSSKDSKLQNLFNFIKVFEEASGLNSNLQKTEMMGIN 1157
            GF VG   T   HLQFADDT+ FSSS++  +  L N + VF   SGL  NL K+ + GIN
Sbjct: 1483 GFKVGRNRTRVSHLQFADDTIFFSSSREEDMMTLKNVLLVFGHISGLKVNLDKSNIYGIN 1542

Query: 1158 LEDNVLESLAVRFDCRKGSWPNTYLGLPLNGNPRSPSFWDPIMEKIKKRLSSWEHNHISK 1217
            LE N L  LA   DC+   WP  YLGLPL GNP++  FWDP++E+I +RL  W+  ++S 
Sbjct: 1543 LEQNHLSRLAEMLDCKASGWPILYLGLPLGGNPKTSGFWDPVIERISRRLDGWQKAYLSF 1602

Query: 1218 GGRLTLINATLSNLPIYFLSLFSIPTKVANELDKIVRNFLWKGSMDKKGQNLVRWDTVQK 1277
            GGR+TLI + L+++P YFLSLF IP  VA +++++ R+FLW G  + K  +LV WD V K
Sbjct: 1603 GGRITLIQSCLTHMPCYFLSLFKIPASVAAKIERMQRDFLWSGVGEGKRDHLVNWDVVCK 1662

Query: 1278 PIDFGGLGITSIKAKNTALLAKWNWRFITEESSLWRRVIQAKYSI----VDYHSPLRTLP 1337
            P   GGLG   I  +N ALL KW WR+  E S+LW +VI + Y       D ++ +R   
Sbjct: 1663 PKSRGGLGFGKISIRNVALLGKWLWRYPREGSALWHQVILSIYGSHSNGWDVNNIVRW-- 1722

Query: 1338 AARGPWRAISKLNSLILDRISYRLGEGSLPLFWKDTWINDEPLCRTYPLLFALHTRKLGL 1366
            + R PW+AI+ +         + +G G    FW D W  ++PL   YP L  + T K   
Sbjct: 1723 SHRCPWKAIALVYQEFSKFTRFVVGNGDRIRFWDDLWWGEQPLGVQYPRLLRVVTDKNAP 1782

BLAST of Moc06g12620 vs. NCBI nr
Match: RVW64408.1 (LINE-1 retrotransposable element ORF2 protein [Vitis vinifera])

HSP 1 Score: 580.9 bits (1496), Expect = 3.0e-161
Identity = 366/1182 (30.96%), Postives = 523/1182 (44.25%), Query Frame = 0

Query: 438  GSKEKRAHVKSVIQKHHPTIVILQETKVAGVDRFFIKTLWSSRNIAWAIQNSIGASGGII 497
            GSK+KR  V+  +   +P IV+LQETK    DR F+ ++W  + + WA   + GASGGI+
Sbjct: 747  GSKKKRRIVRRFLSTQNPDIVMLQETKRETWDRRFVSSVWKGKRVEWAALPACGASGGIV 806

Query: 498  ILWNDPAIKVNDIKIGAFSLTLHITLVDGFHFWLTGIYGPPRTRDRGLFWDELANLTFLC 557
            ILW+   ++  +  +G+FS+T+     +   FWLT +YGP     R  FW EL +L  L 
Sbjct: 807  ILWDSSKLECTEKVLGSFSVTVKFNSGEEGSFWLTSVYGPINPLWRKDFWLELQDLYGLT 866

Query: 558  AERWLLGGDFNVTRWVHEKSSHRRPTRSMRLFNNFIDTANLRDLPLTNGLYTWSNFRESP 617
              RW +GGDFNV R + EK    R T +MR F+ FI  + L D PL N  +TWSN +  P
Sbjct: 867  FPRWCVGGDFNVIRRISEKLGETRLTLNMRCFDEFIRESGLIDPPLRNAAFTWSNMQADP 926

Query: 618  HLSLLDRYLCSDLVLSNFPNAMVKRLNRETSYHFPIQLALGAIRWGPTPSRFDNEWLQQA 677
                LDR+L S    + F  +  + L R TS H PI L    ++WGPTP RF+N WL   
Sbjct: 927  ICKRLDRFLFSSEWDTFFSQSFQEALPRWTSDHSPICLETNPLKWGPTPFRFENMWLLHP 986

Query: 678  TFQPLIEGWWNNNPLHGWPGHGFIQKLKALKVVIKDWKANFIDSSYRHKEQLLTELNILD 737
             F+     WW      GW GH F++KLK +K  +K+W           K+ +LT+L+ +D
Sbjct: 987  EFKEKFRVWWLECTGEGWEGHKFMRKLKFVKSKLKEWNIMTFGDLKERKKLILTDLSRID 1046

Query: 738  SLEEEGSIQTVQMAQSISLKDQLHSLAIAEEAHWRQRCKLKWLKEGDLNTGFFHRVVAAR 797
             +E+EG++ +  + +    + +L  + + EE  WRQ+ ++KW+KEGD N+ FFHRV   R
Sbjct: 1047 LIEQEGNLNSDLVLERTLKRRELEDVLLKEEVQWRQKSRVKWIKEGDCNSKFFHRVATGR 1106

Query: 798  RRKSSIIELLSR------------------------------------------------ 857
            R +  I  L+S                                                 
Sbjct: 1107 RSRKFIKSLISERGETLNNIEDISEEIVNFFGNLYSKPVGESWRVEGIDWVPISGESGGW 1166

Query: 858  -------------------------DDF-------------------------------- 917
                                     D F                                
Sbjct: 1167 LDRPFTEEEVRRAVFQLNKEKAPGPDGFTIAVYQECWDVIKEDLMRVFLEFHTNGVINQS 1226

Query: 918  --------------------YLTLSIGTPSH-------------------SNSQPSL--- 977
                                Y  +S+ T  +                   S+SQ +    
Sbjct: 1227 TNATFIALVPKKSQSVKISDYRPISLVTSLYKIIAKVLSGRLRKVLHETISDSQGAFVEG 1286

Query: 978  ------------------------------------------------------------ 1037
                                                                        
Sbjct: 1287 RHILDAVLIANEVVDEKRRSGEEGIVFKIDFEKAYDHVDWGFLDHVLQRKGFSQKWRLWI 1346

Query: 1038 --------------------------------KPPFL-------RQRLLTQAADFDAIEG 1097
                                              PFL         R+L +A +    EG
Sbjct: 1347 RGCLSSSSFAILVNGNAKGWVKASRGLRQGDPLSPFLFTLVADVLSRMLFRAEETGLTEG 1406

Query: 1098 FAVGNPPTH---LQFADDTLLFSSSKDSKLQNLFNFIKVFEEASGLNSNLQKTEMMGINL 1157
            F+VG   T    LQFADDT+ FS +    LQNL   + VF + SGL  NL+K+ + GIN 
Sbjct: 1407 FSVGRDRTRVSLLQFADDTIFFSKASMEHLQNLKIILLVFGQVSGLKINLEKSTISGINT 1466

Query: 1158 EDNVLESLAVRFDCRKGSWPNTYLGLPLNGNPRSPSFWDPIMEKIKKRLSSWEHNHISKG 1217
               +L SLA  FDCR   WP +YLGLPL GNP++  FWDP++E+I +RL  W+  ++S G
Sbjct: 1467 RQELLSSLASVFDCRVSEWPLSYLGLPLGGNPKTIGFWDPVVERISRRLDGWKKAYLSLG 1526

Query: 1218 GRLTLINATLSNLPIYFLSLFSIPTKVANELDKIVRNFLWKGSMDKKGQNLVRWDTVQKP 1277
            GR+TLI + LS++P YFLSLF IP  +A++++K+ RNFLW G+ + K  +LVRW+ V +P
Sbjct: 1527 GRITLIQSCLSHIPSYFLSLFKIPASIASKIEKMQRNFLWSGAGEGKKDHLVRWEVVSRP 1586

Query: 1278 IDFGGLGITSIKAKNTALLAKWNWRFITEESSLWRRVIQAKYSI--VDYHSPLRTLPAAR 1337
             + GGLG   I  +N ALL KW WRF  E S LW +VI + Y      + + +    + R
Sbjct: 1587 KELGGLGFGKISLRNIALLGKWLWRFPRERSGLWYKVIGSIYGTHPNGWDANMVVRWSHR 1646

Query: 1338 GPWRAISKLNSLILDRISYRLGEGSLPLFWKDTWINDEPLCRTYPLLF-ALHTRKLGLVR 1365
             PW+AI+++       +   +G G    FW+D W  ++ LC  +  L+  +  + L +  
Sbjct: 1647 CPWKAIAQVFQEFSPFVRLVVGNGERIRFWEDLWWGNQSLCSQFADLYRVISVKNLTVSN 1706

BLAST of Moc06g12620 vs. NCBI nr
Match: RVW12714.1 (Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera])

HSP 1 Score: 575.1 bits (1481), Expect = 1.6e-159
Identity = 366/1180 (31.02%), Postives = 519/1180 (43.98%), Query Frame = 0

Query: 416  PTKTTKKTQEQKKEVKLVRELAGSKEKRAHVKSVIQKHHPTIVILQETKVAGVDRFFIKT 475
            P +  K      ++ +L  +  GSK+KR  VK+ +    P +V++QETK    DR  + +
Sbjct: 692  PRRMAKGQSVSHEDNQLEYQGLGSKKKRRVVKNFLSSEKPDVVMIQETKKEECDRRLVGS 751

Query: 476  LWSSRNIAWAIQNSIGASGGIIILWNDPAIKVNDIKIGAFSLTLHITLVDGFHFWLTGIY 535
            +WS RN  WA   + GASGGI+I+W+   ++  ++ +                     +Y
Sbjct: 752  VWSVRNKDWAALPASGASGGILIIWDSKKLRREEVVL---------------------VY 811

Query: 536  GPPRTRDRGLFWDELANLTFLCAERWLLGGDFNVTRWVHEKSSHRRPTRSMRLFNNFIDT 595
            GP  +  R  FW EL+++  L   RW +GGDFNV R   EK    R T  M+ F+ FI  
Sbjct: 812  GPNNSALRKDFWVELSDIAGLSHPRWCVGGDFNVIRRSSEKLGGSRLTPRMKDFDEFIRD 871

Query: 596  ANLRDLPLTNGLYTWSNFRESPHLSLLDRYLCSDLVLSNFPNAMVKRLNRETSYHFPIQL 655
              L D PL +  YTWSN +E+P    LDR+L S+     FP ++   L R TS H+PI L
Sbjct: 872  CELIDSPLRSASYTWSNMQENPVCKRLDRFLYSNEWEQVFPQSLQGVLPRWTSDHWPIVL 931

Query: 656  ALGAIRWGPTPSRFDNEWLQQATFQPLIEGWWNNNPLHGWPGHGFIQKLKALKVVIKDWK 715
                 +WGPTP RF+N WLQ + F+     WW+    +GW GH F++KL+ +K  +K+W 
Sbjct: 932  ETNPFKWGPTPFRFENMWLQHSNFKENFGRWWSEFQGNGWEGHKFMRKLQFVKAKLKEWN 991

Query: 716  ANFIDSSYRHKEQLLTELNILDSLEEEGSIQTVQMAQSISLKDQLHSLAIAEEAHWRQRC 775
                    + K+ +L  L   DSLE+EG +   ++ Q    K +L  L + EE HWRQ+ 
Sbjct: 992  KTSFGELSKKKKDILAVLANFDSLEQEGGLSHERLVQRAFSKGELEELILREEIHWRQKA 1051

Query: 776  KLKWLKEGDLNTGFFHRVVAARRRKSSIIELLSRDDFYLTLSIG---------------- 835
            ++KW+KEGD N+ FFH+V   RR +  I EL +     L                     
Sbjct: 1052 RVKWVKEGDCNSKFFHKVANGRRNRKFIKELENESGLMLNNPESIKEEILKYFEKLYASP 1111

Query: 836  ------------TPSHSNSQPSLKPPF--------------------------------- 895
                        +P    S   L+ PF                                 
Sbjct: 1112 SGESWRVEGLDWSPIDGESASRLESPFTEEEIYKAIFQMDRDKAPGPDGFTIAVFQDCWD 1171

Query: 896  ------------------------------------------------------------ 955
                                                                        
Sbjct: 1172 VIKEDLVRVFTEFHRSGIINQSTNASFIVLIPKKSMSRRISDYRPISLITSLYKIIAKVL 1231

Query: 956  ------------------------------------------------------------ 1015
                                                                        
Sbjct: 1232 AGRLRGVLHETIHSTQGAFVQGRQILDAVLIANEIVDEKRRTGEEGVVFKIDFEKAYDHV 1291

Query: 1016 ----------------LRQ-----------------RLLTQAADFDAIEGFAVGNPPT-- 1075
                            LRQ                 R+L +A + + +EGF VG   T  
Sbjct: 1292 ILVNGNAKGWVKASRGLRQGDPLSPFLFTIVADVLSRMLLKAEERNVLEGFRVGRNRTRV 1351

Query: 1076 -HLQFADDTLLFSSSKDSKLQNLFNFIKVFEEASGLNSNLQKTEMMGINLEDNVLESLAV 1135
             HLQFADDT+ FSS+++  L  L + + VF   SGL  NL K+ + GINLE N L  LAV
Sbjct: 1352 SHLQFADDTIFFSSTREEDLMTLKSVLLVFGHISGLKVNLDKSNIYGINLEQNHLSRLAV 1411

Query: 1136 RFDCRKGSWPNTYLGLPLNGNPRSPSFWDPIMEKIKKRLSSWEHNHISKGGRLTLINATL 1195
              DC+   WP  YLGLPL GNP++  FWDP++E+I +RL  W+  ++S GGR+TLI + L
Sbjct: 1412 MLDCKASGWPILYLGLPLGGNPKASGFWDPVIERISRRLDGWQKAYLSFGGRITLIQSCL 1471

Query: 1196 SNLPIYFLSLFSIPTKVANELDKIVRNFLWKGSMDKKGQNLVRWDTVQKPIDFGGLGITS 1255
            +++P YFLSLF IP  VA +++++ R FLW G  + K  +LV WD V KP   GGLG   
Sbjct: 1472 THMPCYFLSLFKIPASVAAKIERMQREFLWSGVGEGKRDHLVNWDVVCKPKSRGGLGFGK 1531

Query: 1256 IKAKNTALLAKWNWRFITEESSLWRRVIQAKYSI----VDYHSPLRTLPAARGPWRAISK 1315
            I  +N ALL KW WR+  E S+LW +VI   Y       D ++ +R   + R PW+AI+ 
Sbjct: 1532 ISMRNVALLGKWLWRYPREGSALWHQVILNIYGSHSNGWDVNNNVRW--SHRCPWKAIAL 1591

Query: 1316 LNSLILDRISYRLGEGSLPLFWKDTWINDEPLCRTYPLLFALHTRK-------LGLVRDF 1366
            +         + +G+G    FW D W  D+ L   YP L ++ T K       LG  R F
Sbjct: 1592 VFQEFSKFTRFVVGDGDRIRFWDDLWWGDQTLGTQYPRLLSVVTDKNAPISSILGYSRPF 1651

BLAST of Moc06g12620 vs. NCBI nr
Match: CAN65484.1 (hypothetical protein VITISV_029474 [Vitis vinifera])

HSP 1 Score: 572.4 bits (1474), Expect = 1.1e-158
Identity = 369/1189 (31.03%), Postives = 525/1189 (44.15%), Query Frame = 0

Query: 439  SKEKRAHVKSVIQKHHPTIVILQETKVAGVDRFFIKTLWSSRNIAWAIQNSIGASGGIII 498
            S  K A V+ V++      + ++ETK    DR F+ ++W++RN  WA   + GASGGI+I
Sbjct: 681  SPRKMAKVREVLKN-----LDIKETKKEECDRRFVGSVWTARNKDWAALPACGASGGILI 740

Query: 499  LWNDPAIKVNDIKIGAFSLTLHITLVDGFHFWLTGIYGPPRTRDRGLFWDELANLTFLCA 558
            +W+   +   ++ +G+FS+++   L      WL+ +YGP  +  R  FW EL+++  L +
Sbjct: 741  IWDAKKLSREEVVLGSFSVSIKFALNGCESLWLSAVYGPNISALRKDFWVELSDIAGLAS 800

Query: 559  ERWLLGGDFNVTRWVHEKSSHRRPTRSMRLFNNFIDTANLRDLPLTNGLYTWSNFRESPH 618
             RW +GGDFNV R   EK    R T SM+ F++FI    L DLPL +  +TWSN + +  
Sbjct: 801  PRWCVGGDFNVIRRSSEKLGGSRXTPSMKXFDDFISDCELIDLPLRSASFTWSNMQVNXV 860

Query: 619  LSLLDRYLCSDLVLSNFPNAMVKRLNRETSYHFPIQLALGAIRWGPTPSRFDNEWLQQAT 678
               LDR+L S+     FP ++   L R TS H+PI L     +WGPTP RF+N WLQ  +
Sbjct: 861  CKRLDRFLYSNEWEQAFPQSIQGVLPRWTSDHWPIVLETNPFKWGPTPFRFENMWLQHPS 920

Query: 679  FQPLIEGWWNNNPLHGWPGHGFIQKLKALKVVIKDW-KANFIDSSYRHKEQLLTELNILD 738
            F+     WW     +GW GH F++KL+ +K  +K W KA+F + S R KE +L++L   D
Sbjct: 921  FKENFGRWWREFQGNGWEGHKFMRKLQFVKAKLKVWNKASFGELSKR-KEDILSDLVNFD 980

Query: 739  SLEEEGSIQTVQMAQSISLKDQLHSLAIAEEAHWRQRCKLKWLKEGDLNTGFFHRVVAAR 798
            SLE+EG +    +AQ    K +L  L + EE HWRQ+ ++KW+KEGD N+ FFH+V   R
Sbjct: 981  SLEQEGGLSHELLAQRALKKGELEELILREEIHWRQKARVKWVKEGDCNSRFFHKVANGR 1040

Query: 799  RRKSSIIELLSRDDFYLTLSIG----------------------------TPSHSNSQPS 858
            R +  I EL + +   +  S                              +P    S   
Sbjct: 1041 RNRKFIKELENENGLMMNNSESIKEEILRYFEKLYTSPSGESWRVEGLDWSPISGESAFR 1100

Query: 859  LKPPF------------------------------------------------------- 918
            L+ PF                                                       
Sbjct: 1101 LESPFTEEEIFKAIFQMDRDKAPGPDGFTIAVFQDCWEVIKEDLVKVFTEFHRSGIINQS 1160

Query: 919  ------------------------------------------------------------ 978
                                                                        
Sbjct: 1161 TNASFIVLLPKKSMSRRISDFRPISLITSLYKIIAKVLAGRIRGVLHETIHSTQGAFVQG 1220

Query: 979  ------------------------------------------------------------ 1038
                                                                        
Sbjct: 1221 RQILDAVLIANEIVDEKRRSGEEGVVFKIDFEKAYDHVSWDFLDHVLEMKGFGIRWRKWM 1280

Query: 1039 --------------------------LRQ-----------------RLLTQAADFDAIEG 1098
                                      LRQ                 R+L +A + + +EG
Sbjct: 1281 RGCLSSVSFAVLVNGNAKGWVKASRGLRQGDPLSPFLFTIVADVLSRMLLKAEERNVLEG 1340

Query: 1099 FAVGNPPT---HLQFADDTLLFSSSKDSKLQNLFNFIKVFEEASGLNSNLQKTEMMGINL 1158
            F VG   T   HLQFADDT+ FSSS++  +  L N + VF   SGL  NL K+ + GINL
Sbjct: 1341 FKVGRNRTRVSHLQFADDTIFFSSSREEDMMTLKNVLLVFGHISGLKVNLDKSNIYGINL 1400

Query: 1159 EDNVLESLAVRFDCRKGSWPNTYLGLPLNGNPRSPSFWDPIMEKIKKRLSSWEHNHISKG 1218
            E N L  LA   DC+   WP  YLGLPL GNP++  FWDP++E+I +RL  W+  ++S G
Sbjct: 1401 EQNHLSRLAEMLDCKASGWPILYLGLPLGGNPKTSGFWDPVIERISRRLDGWQKAYLSFG 1460

Query: 1219 GRLTLINATLSNLPIYFLSLFSIPTKVANELDKIVRNFLWKGSMDKKGQNLVRWDTVQKP 1278
            GR+TLI + L+++P YFLSLF IP  VA +++++ R+FLW G  + K  +LV WD V KP
Sbjct: 1461 GRITLIQSCLTHMPCYFLSLFKIPASVAAKIERMQRDFLWSGVGEGKRDHLVNWDVVCKP 1520

Query: 1279 IDFGGLGITSIKAKNTALLAKWNWRFITEESSLWRRVIQAKYSI----VDYHSPLRTLPA 1338
               GGLG   I  +N ALL KW WR+  E S+LW +VI + Y       D ++ +R   +
Sbjct: 1521 KSRGGLGFGKISIRNVALLGKWLWRYPREGSALWHQVILSIYGSHSNGWDVNNIVRW--S 1580

Query: 1339 ARGPWRAISKLNSLILDRISYRLGEGSLPLFWKDTWINDEPLCRTYPLLFALHTRK---- 1366
             R PW+AI+ +         + +G G    FW D W  ++PL   YP L  + T K    
Sbjct: 1581 HRCPWKAIALVYQEFSKFTRFVVGNGDRIRFWDDLWWGEQPLGVQYPRLLRVVTDKNAPI 1640

BLAST of Moc06g12620 vs. NCBI nr
Match: CAN68165.1 (hypothetical protein VITISV_008538 [Vitis vinifera])

HSP 1 Score: 562.4 bits (1448), Expect = 1.1e-155
Identity = 355/1184 (29.98%), Postives = 521/1184 (44.00%), Query Frame = 0

Query: 438  GSKEKRAHVKSVIQKHHPTIVILQETKVAGVDRFFIKTLWSSRNIAWAIQNSIGASGGII 497
            GS++KR  VK  ++   P IV++QETK A  DR F+ ++W++RN  WA+  + GASGGI+
Sbjct: 123  GSRKKRRVVKDFLRSEKPDIVMIQETKKAECDRRFVGSVWTARNKEWAVLPACGASGGIL 182

Query: 498  ILWNDPAIKVNDIKIGAFSLTLHITLVDGFHFWLTGIYGPPRTRDRGLFWDELANLTFLC 557
            ++W+   +   ++ +G+FS+++   +     FW++ +YGP  T  R  FW EL+++  L 
Sbjct: 183  VIWDSKKLHSEEVVLGSFSVSVKFAVDGSEQFWJSAVYGPNSTALRKDFWVELSDIFGLS 242

Query: 558  AERWLLGGDFNVTRWVHEKSSHRRPTRSMRLFNNFIDTANLRDLPLTNGLYTWSNFRESP 617
            +  W +GGDFNV R   EK    R T SM+  ++FI    L D PL +  +TWSN +E P
Sbjct: 243  SPCWCVGGDFNVIRRCSEKLGGGRLTPSMKDLDDFIRENELIDPPLRSASFTWSNMQEHP 302

Query: 618  HLSLLDRYLCSDLVLSNFPNAMVKRLNRETSYHFPIQLALGAIRWGPTPSRFDNEWLQQA 677
                LDR+L S+     FP ++ + L R TS H+PI L     +WGPTP RF+N WL   
Sbjct: 303  VCKRLDRFLYSNEWEQLFPQSLQEVLPRWTSDHWPIVLETNPFKWGPTPFRFENMWLHHP 362

Query: 678  TFQPLIEGWWNNNPLHGWPGHGFIQKLKALKVVIKDWKANFIDSSYRHKEQLLTELNILD 737
            +F+     WW      GW GH F++KL+ LK  +K+W  N        K+ +L ++   D
Sbjct: 363  SFKECFGRWWREFQGDGWEGHKFMRKLQFLKAKLKEWNKNAFGDLIERKKCILLDIANFD 422

Query: 738  SLEEEGSIQTVQMAQSISLKDQLHSLAIAEEAHWRQRCKLKWLKEGDLNTGFFHRVVAAR 797
            S+E+EG +    + Q    K +L  L + EE HWRQ+ ++KW+KEGD N+  FH+V   R
Sbjct: 423  SMEQEGGLSPELLIQRAVRKGELEELILREEIHWRQKARVKWVKEGDCNSKXFHKVANGR 482

Query: 798  RRK--------------------------------------------------------- 857
            R +                                                         
Sbjct: 483  RNRKFIKVLENERGLVLDNSDSIKEEILRYFEKLYASPSGESWRVEGLDWSPISRESASR 542

Query: 858  ------------------------------------------------------------ 917
                                                                        
Sbjct: 543  LESPFTEEEIYKAIFQMDRDXAPGPDGFTIAVFQDCWDVIKEDLVRVFDEFHRSGIINQS 602

Query: 918  --SSIIELLSRDDF------YLTLSIGTP------------------------------- 977
              +S I LL +         Y  +S+ T                                
Sbjct: 603  TNASFIVLLPKKSMAKKISNYRPISLITSLYKIIAKVLAGRLRGILHETIHSTQGAFVQG 662

Query: 978  -------------------------------------------SHSNSQPSLKP------ 1037
                                                        H   +    P      
Sbjct: 663  RQILDAVLIANEIVDEKKRSGEEGVVFKIDFEKAYDHVSWDFLDHVMEKKGFNPXXRKWI 722

Query: 1038 ----------------------------------PFL-------RQRLLTQAADFDAIEG 1097
                                              PFL          +L +A + +  EG
Sbjct: 723  RXCLSSVSFAILVNGNAKGWVKXXRGLRQGDPLSPFLFTIVADVXSXMLLRAEERNVFEG 782

Query: 1098 FAVGNPPT---HLQFADDTLLFSSSKDSKLQNLFNFIKVFEEASGLNSNLQKTEMMGINL 1157
            F VG   T   HLQFADDT+ FSS+++  L  L + + VF   SGL  NL K+ + GINL
Sbjct: 783  FRVGRNRTRVSHLQFADDTIFFSSTREEDLLTLKSVLXVFGHISGLKVNLDKSNIYGINL 842

Query: 1158 EDNVLESLAVRFDCRKGSWPNTYLGLPLNGNPRSPSFWDPIMEKIKKRLSSWEHNHISKG 1217
              + L  LA   DC+   WP  YLGLPL GNP+S SFWDP++E+I  RL  W+  ++S G
Sbjct: 843  GQDHLHRLAELLDCKASGWPILYLGLPLGGNPKSGSFWDPVIERISSRLDGWQKAYLSFG 902

Query: 1218 GRLTLINATLSNLPIYFLSLFSIPTKVANELDKIVRNFLWKGSMDKKGQNLVRWDTVQKP 1277
            GR+TLI + L+++P YFLSLF IP  VA  ++++ R+FLW G  + K  +LV W+ V K 
Sbjct: 903  GRITLIQSCLTHMPCYFLSLFKIPASVAGRIERLQRDFLWSGVGEGKRDHLVSWBVVCKS 962

Query: 1278 IDFGGLGITSIKAKNTALLAKWNWRFITEESSLWRRVIQAKYSI----VDYHSPLRTLPA 1337
               GGLG+  I  +N+ALL KW WR+  E S+LW +VI + Y       D ++ +R   +
Sbjct: 963  KMKGGLGLGRISLRNSALLGKWLWRYPREGSALWHQVILSIYGSHSNGWDANTXVRW--S 1022

Query: 1338 ARGPWRAISKLNSLILDRISYRLGEGSLPLFWKDTWINDEPLCRTYPLLFALHTRKLGLV 1366
             R PW+AI+++         + +G+G    FW+D W  D+ L   +P L  +   K  L+
Sbjct: 1023 HRCPWKAIAQVFQDFSKFTRFIVGDGDRIRFWEDLWWGDQSLGVRFPRLLRVVMDKNILI 1082

BLAST of Moc06g12620 vs. ExPASy Swiss-Prot
Match: P0C2F6 (Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1g65750 PE=3 SV=1)

HSP 1 Score: 177.6 bits (449), Expect = 1.0e-42
Identity = 118/427 (27.63%), Postives = 191/427 (44.73%), Query Frame = 0

Query: 949  IMEKIKKRLSSWEHNHISKGGRLTLINATLSNLPIYFLSLFSIPTKVANELDKIVRNFLW 1008
            I+E++  R+S W    +S  GRLTL  A LS++P++ +S   +P  + N LD++ R FLW
Sbjct: 16   ILERVSSRMSGWREKTLSFAGRLTLTKAVLSSMPVHSMSTILLPQSILNRLDQLSRTFLW 75

Query: 1009 KGSMDKKGQNLVRWDTVQKPIDFGGLGITSIKAKNTALLAKWNWRFITEESSLWRRVIQA 1068
              + +KK Q+LV+W  V  P   GGLG+ + K+ N AL++K  WR + E++SLW  V+Q 
Sbjct: 76   GSTAEKKKQHLVKWSKVCSPKKEGGLGVRAAKSMNRALISKVGWRLLQEKNSLWTLVLQK 135

Query: 1069 KYSIVDYHSPLRTLPAA--RGPWRAIS-KLNSLILDRISYRLGEGSLPLFWKDTWINDEP 1128
            KY + +       +P       WR+I+  L  ++   + +  G+G    FW D W++ +P
Sbjct: 136  KYHVGEIRDSRWLIPKGSWSSTWRSIAIGLRDVVSHGVGWIPGDGQQIRFWTDRWVSGKP 195

Query: 1129 L-----------CRTYPLLFALHTRKLGLVRDFWSTETNSWDL-NFHRNLKDVEIIELVA 1188
            L           C T             + +D W      WD         +   +EL A
Sbjct: 196  LLELDNGERPTDCDTV------------VAKDLW-IPGRGWDFAKIDPYTTNNTRLELRA 255

Query: 1189 LLHCLSSQRPSLNRDSDAWRLDPLGSFTTSSLLNDLQRNNTSSP-PTDLYKAIWKDSYPK 1248
            ++  L +      RD  +W+    G F+  S    L  +    P     +  +WK   P+
Sbjct: 256  VVLDLVTGA----RDRLSWKFSQDGQFSVRSAYEMLTVDEVPRPNMASFFNCLWKVRVPE 315

Query: 1249 KIKFFLWETSLQALNTHDKLQRRMPYMALSPHWCPLCKLQSESIGHTLLTCPFSTALWNR 1308
            ++K FLW    QA+ T ++  RR  +++ S + C +CK   ES+ H L  CP    +W R
Sbjct: 316  RVKTFLWLVGNQAVMTEEERHRR--HLSAS-NVCQVCKGGVESMLHVLRDCPAQLGIWVR 375

Query: 1309 IL-----------SIFDWSVALPTDMSQLLALTLVGHPFKKRKASLWSHFIRALLWTIWT 1349
            ++           S+F+W      D S    +              WS     ++W  W 
Sbjct: 376  VVPQRRQQGFFSKSLFEWLYDNLGDRSGCEDIP-------------WSTIFAVIIWWGWK 409

BLAST of Moc06g12620 vs. ExPASy Swiss-Prot
Match: P93295 (Uncharacterized mitochondrial protein AtMg00310 OS=Arabidopsis thaliana OX=3702 GN=AtMg00310 PE=4 SV=1)

HSP 1 Score: 68.2 bits (165), Expect = 8.6e-10
Identity = 43/148 (29.05%), Postives = 69/148 (46.62%), Query Frame = 0

Query: 981  LPIYFLSLFSIPTKVANELDKIVRNFLWKGSMDKKGQNLVRWDTVQK-PIDFGGLGITSI 1040
            LP+Y +S F +   +  +L   +  F W    +K+  + V W  + K   D GGLG   +
Sbjct: 3    LPVYAMSCFRLSKLLCKKLTSAMTEFWWSSCENKRKISWVAWQKLCKSKEDDGGLGFRDL 62

Query: 1041 KAKNTALLAKWNWRFITEESSLWRRVIQAKYSIVDYHSPLRTLPAARGP---WRAISKLN 1100
               N ALLAK ++R I +  +L  R+++++Y     HS +        P   WR+I    
Sbjct: 63   GWFNQALLAKQSFRIIHQPHTLLSRLLRSRYF---PHSSMMECSVGTRPSYAWRSIIHGR 122

Query: 1101 SLILDRISYRLGEGSLPLFWKDTWINDE 1125
             L+   +   +G+G     W D WI DE
Sbjct: 123  ELLSRGLLRTIGDGIHTKVWLDRWIMDE 147

BLAST of Moc06g12620 vs. ExPASy TrEMBL
Match: M5WJ76 (Reverse transcriptase domain-containing protein (Fragment) OS=Prunus persica OX=3760 GN=PRUPE_ppa015871mg PE=4 SV=1)

HSP 1 Score: 600.1 bits (1546), Expect = 2.3e-167
Identity = 375/1172 (32.00%), Postives = 540/1172 (46.08%), Query Frame = 0

Query: 405  LNTIGLGILPLPTKTTKKTQEQKKEVKLVRELA------GSKEKRAHVKSVIQKHHPTIV 464
            ++ + + +LPL      +  E+   + L++ ++      GS+ KR  VK  +++  P IV
Sbjct: 295  MHKMDISLLPLKKNLAHRRHEKGLFLSLMKIISWNIRGLGSRRKRLLVKEQLRRLKPDIV 354

Query: 465  ILQETKVAGVDRFFIKTLWSSRNIAWAIQNSIGASGGIIILWNDPAIKVNDIKIGAFSLT 524
            IL ETK   VDR  +  +W SR   W    S+G SGGI +LWN  ++ V D  +G FS++
Sbjct: 355  ILLETKKEIVDRQLVAGVWGSRFKEWVFSPSLGRSGGIAVLWNSQSVSVIDSMVGEFSVS 414

Query: 525  LHITLVDGFHFWLTGIYGPPRTRDRGLFWDELANLTFLCAERWLLGGDFNVTRWVHEKSS 584
            + I    G  +WL+GIYGP R R+R  FW+ELA+L   C + W LGGDFNV R+  EKS+
Sbjct: 415  IRIEENIGTDWWLSGIYGPCRQRERNSFWEELADLYGYCGDMWCLGGDFNVVRFSAEKSN 474

Query: 585  HRRPTRSMRLFNNFIDTANLRDLPLTNGLYTWSNFRESPHLSLLDRYLCSDLVLSNFPNA 644
              R T+SMR FN+FI   NLRD  L N  +TWSN RE+     LDR+L S     +FP+ 
Sbjct: 475  EGRVTKSMRDFNDFIQETNLRDPILLNASFTWSNLRENAVCRRLDRFLVSGSWEEHFPHY 534

Query: 645  MVKRLNRETSYHFPIQLALGAIRWGPTPSRFDNEWLQQATFQPLIEGWWNNNPLHGWPGH 704
              K L R TS H PI+L    ++WGP+P RF+N WL    F+  I+ WW  + + GW G+
Sbjct: 535  RHKALPRITSDHCPIELDTSRVKWGPSPFRFENMWLNHPDFKRKIKLWWGEDQIPGWEGY 594

Query: 705  GFIQKLKALKVVIKDWKANFIDSSYRHKEQLLTELNILDSLEEEGSIQTVQMAQSISLKD 764
             F+ +LK LK  +K W         R   +    L +LD  E    +  +  ++  +L  
Sbjct: 595  KFMTRLKMLKSKLKVWSKEEFGDVERDLREAEARLLVLDQREGTEGLDHLLRSERDNLLL 654

Query: 765  QLHSLAIAEEAHWRQRCKLKWLKEGDLNTGFFHRVVAARRRKSSIIELLSRD-------- 824
            ++  LA  EE  WRQR K+KW ++GD NT FFHRV    R+++ I +L   D        
Sbjct: 655  KIGDLAQKEEVKWRQRGKVKWARDGDGNTKFFHRVANGARKRNYIEKLEVEDLGVIEVDA 714

Query: 825  ---------------------------------------------------------DF- 884
                                                                     DF 
Sbjct: 715  NIEREVIRFFKGLYSSNKNKAVFDCGKDKSPGPDGFSMSFFQSCWEVVKGDLMKVMQDFF 774

Query: 885  ----------------------------YLTLSIGTPSHS-------------------- 944
                                        Y  +S+ T  +                     
Sbjct: 775  QSGIVNGVTNETFICLIPKKANSVKVTDYRPISLVTSLYKVISKVLASSLREVLGNTISQ 834

Query: 945  ------------------------------------------------------------ 1004
                                                                        
Sbjct: 835  SQGAFVQKRQILDAVLVANEVVEEVRKQKRKGLVFKIDFEKAYDHVEWNFVDDVMARKGF 894

Query: 1005 ---------------------NSQPSLK-------------PPFL-------RQRLLTQA 1064
                                 N +P  K              PFL         RL+ +A
Sbjct: 895  GVKWRGWIIGCLESVNFSIMINGKPRGKFRASRGLRQGDPLSPFLFTLVSDVLSRLIERA 954

Query: 1065 ADFDAIEGFAVGNPP---THLQFADDTLLFSSSKDSKLQNLFNFIKVFEEASGLNSNLQK 1124
             D + + G   G+     +HLQFADDT+     K+    NL   +K+F + SG+  N  K
Sbjct: 955  QDVNLVHGIVSGHDQVEVSHLQFADDTIFLLDGKEEYWLNLLQLLKLFCDVSGMKINKAK 1014

Query: 1125 TEMMGINLEDNVLESLAVRFDCRKGSWPNTYLGLPLNGNPRSPSFWDPIMEKIKKRLSSW 1184
            + ++GIN   +VL ++A  + C  G WP  YLGLPL GNPR+ +FW+P+MEK++KRL  W
Sbjct: 1015 SCILGINFSTDVLNNMAGSWGCEVGCWPMVYLGLPLGGNPRALNFWNPVMEKVEKRLQKW 1074

Query: 1185 EHNHISKGGRLTLINATLSNLPIYFLSLFSIPTKVANELDKIVRNFLWKGSMDKKGQNLV 1244
            +   +SKGGRLTLI A LS++P Y++SLF +P  VA ++++++RNFLW+G  + K  +LV
Sbjct: 1075 KRACLSKGGRLTLIQAVLSSIPSYYMSLFKMPIGVAAKVEQLMRNFLWEGLDEGKKCHLV 1134

Query: 1245 RWDTVQKPIDFGGLGITSIKAKNTALLAKWNWRFITEESSLWRRVIQAKYSIVDYHSPLR 1304
            RW+ V K  + GGLGI S++ +  AL AKW WRF  E +SLW R+I++KY I        
Sbjct: 1135 RWERVTKSKEEGGLGIGSLRERIEALRAKWLWRFPLETNSLWHRIIKSKYGI-------- 1194

Query: 1305 TLPAARGPWRAISKLNSLILDRISYRLGEGSLPLFWKDTWINDEPLCRTYPLLFALHTRK 1350
               +   PWR ISK  +  L    + +G G    FW+D W+ +  L   +P L +L  RK
Sbjct: 1195 --DSNGNPWREISKGYNSFLQCCRFSVGNGEKIRFWEDLWLKEGILKDLFPRLSSLSRRK 1254

BLAST of Moc06g12620 vs. ExPASy TrEMBL
Match: A0A438GDE7 (LINE-1 retrotransposable element ORF2 protein OS=Vitis vinifera OX=29760 GN=Pol_4 PE=4 SV=1)

HSP 1 Score: 594.0 bits (1530), Expect = 1.7e-165
Identity = 374/1184 (31.59%), Postives = 527/1184 (44.51%), Query Frame = 0

Query: 438  GSKEKRAHVKSVIQKHHPTIVILQETKVAGVDRFFIKTLWSSRNIAWAIQNSIGASGGII 497
            GSK+KR  VK  ++   P +V+ QETK    DR F+ ++W++RN  WA   + GASGGI+
Sbjct: 823  GSKKKRRVVKDFLRSEKPDVVMFQETKKEECDRRFVGSVWTARNKDWAALPACGASGGIL 882

Query: 498  ILWNDPAIKVNDIKIGAFSLTLHITLVDGFHFWLTGIYGPPRTRDRGLFWDELANLTFLC 557
            I+W+   +   ++ +G+FS+++  TL      WL+ +YGP  +  R   W EL+++  L 
Sbjct: 883  IIWDTKKLSREEVMLGSFSVSIKFTLNGCESLWLSAVYGPNNSALRKDLWVELSDIAGLA 942

Query: 558  AERWLLGGDFNVTRWVHEKSSHRRPTRSMRLFNNFIDTANLRDLPLTNGLYTWSNFRESP 617
            + RW +GGDFNV R   EK    R T SM+ F++FI    L DLPL +  +TWSN + +P
Sbjct: 943  SPRWCVGGDFNVIRRSSEKLGGSRLTPSMKDFDDFISDCELIDLPLRSASFTWSNMQVNP 1002

Query: 618  HLSLLDRYLCSDLVLSNFPNAMVKRLNRETSYHFPIQLALGAIRWGPTPSRFDNEWLQQA 677
                LDR+L S+     FP ++   L R TS H+PI L     +WGPTP RF+N WLQ  
Sbjct: 1003 VCKRLDRFLYSNEWEQTFPQSIQGVLPRWTSDHWPIVLETNPFKWGPTPFRFENMWLQHP 1062

Query: 678  TFQPLIEGWWNNNPLHGWPGHGFIQKLKALKVVIKDW-KANFIDSSYRHKEQLLTELNIL 737
            +F+     WW     +GW GH F++KL+ +K  +K W KA+F + S R KE +L+ L   
Sbjct: 1063 SFKENFGRWWREFQGNGWEGHKFMRKLQFVKAKLKVWNKASFGELSKR-KEDILSALVNF 1122

Query: 738  DSLEEEGSIQTVQMAQSISLKDQLHSLAIAEEAHWRQRCKLKWLKEGDLNTGFFHRVVAA 797
            DSLE+EG +    +AQ    K +L  L + EE HWRQ+ ++KW+KEGD N+ FFH+V   
Sbjct: 1123 DSLEQEGGLSHELLAQRAIKKGELEELILREEIHWRQKARVKWVKEGDCNSKFFHKVANG 1182

Query: 798  RRRKSSIIELLSRDDFYLTLSIG----------------------------TPSHSNSQP 857
            RR +  I EL + +   +  S                              +P    S  
Sbjct: 1183 RRNRKFIKELENENGQMMNNSESIKEEILRYFEKLYTSPSGESWRVEGLDWSPISGESAV 1242

Query: 858  SLKPPF------------------------------------------------------ 917
             L+ PF                                                      
Sbjct: 1243 RLESPFTEEEICKAIFQMDRDKAPGPDGFTIAVFQDCWEVIKEDLVKVFTEFHRSGIINQ 1302

Query: 918  ------------------------------------------------------------ 977
                                                                        
Sbjct: 1303 STNASFIVLLPKKSMSRRISDFRPISLITSLYKIIAKVLAGRIREVLHETIHSTQGAFVQ 1362

Query: 978  ------------------------------------------------------------ 1037
                                                                        
Sbjct: 1363 GRQILDAVLIANEIVDEKRRSGEEGVVFKIDFEKAYDHVSWDFLDHVMEMKGFGIRWRKW 1422

Query: 1038 ---------------------------LRQ-----------------RLLTQAADFDAIE 1097
                                       LRQ                 R+L +A + + +E
Sbjct: 1423 MRGCLSSVSFAVLVNGNAKGWVKASRGLRQGDPLSPFLFTIVADVLSRMLLKAEERNVLE 1482

Query: 1098 GFAVGNPPT---HLQFADDTLLFSSSKDSKLQNLFNFIKVFEEASGLNSNLQKTEMMGIN 1157
            GF VG   T   HLQFADDT+ FSSS++  +  L N + VF   SGL  NL K+ + GIN
Sbjct: 1483 GFKVGRNRTRVSHLQFADDTIFFSSSREEDMMTLKNVLLVFGHISGLKVNLDKSNIYGIN 1542

Query: 1158 LEDNVLESLAVRFDCRKGSWPNTYLGLPLNGNPRSPSFWDPIMEKIKKRLSSWEHNHISK 1217
            LE N L  LA   DC+   WP  YLGLPL GNP++  FWDP++E+I +RL  W+  ++S 
Sbjct: 1543 LEQNHLSRLAEMLDCKASGWPILYLGLPLGGNPKTSGFWDPVIERISRRLDGWQKAYLSF 1602

Query: 1218 GGRLTLINATLSNLPIYFLSLFSIPTKVANELDKIVRNFLWKGSMDKKGQNLVRWDTVQK 1277
            GGR+TLI + L+++P YFLSLF IP  VA +++++ R+FLW G  + K  +LV WD V K
Sbjct: 1603 GGRITLIQSCLTHMPCYFLSLFKIPASVAAKIERMQRDFLWSGVGEGKRDHLVNWDVVCK 1662

Query: 1278 PIDFGGLGITSIKAKNTALLAKWNWRFITEESSLWRRVIQAKYSI----VDYHSPLRTLP 1337
            P   GGLG   I  +N ALL KW WR+  E S+LW +VI + Y       D ++ +R   
Sbjct: 1663 PKSRGGLGFGKISIRNVALLGKWLWRYPREGSALWHQVILSIYGSHSNGWDVNNIVRW-- 1722

Query: 1338 AARGPWRAISKLNSLILDRISYRLGEGSLPLFWKDTWINDEPLCRTYPLLFALHTRKLGL 1366
            + R PW+AI+ +         + +G G    FW D W  ++PL   YP L  + T K   
Sbjct: 1723 SHRCPWKAIALVYQEFSKFTRFVVGNGDRIRFWDDLWWGEQPLGVQYPRLLRVVTDKNAP 1782

BLAST of Moc06g12620 vs. ExPASy TrEMBL
Match: A0A438FWU5 (LINE-1 retrotransposable element ORF2 protein OS=Vitis vinifera OX=29760 GN=LORF2_70 PE=4 SV=1)

HSP 1 Score: 580.9 bits (1496), Expect = 1.5e-161
Identity = 366/1182 (30.96%), Postives = 523/1182 (44.25%), Query Frame = 0

Query: 438  GSKEKRAHVKSVIQKHHPTIVILQETKVAGVDRFFIKTLWSSRNIAWAIQNSIGASGGII 497
            GSK+KR  V+  +   +P IV+LQETK    DR F+ ++W  + + WA   + GASGGI+
Sbjct: 747  GSKKKRRIVRRFLSTQNPDIVMLQETKRETWDRRFVSSVWKGKRVEWAALPACGASGGIV 806

Query: 498  ILWNDPAIKVNDIKIGAFSLTLHITLVDGFHFWLTGIYGPPRTRDRGLFWDELANLTFLC 557
            ILW+   ++  +  +G+FS+T+     +   FWLT +YGP     R  FW EL +L  L 
Sbjct: 807  ILWDSSKLECTEKVLGSFSVTVKFNSGEEGSFWLTSVYGPINPLWRKDFWLELQDLYGLT 866

Query: 558  AERWLLGGDFNVTRWVHEKSSHRRPTRSMRLFNNFIDTANLRDLPLTNGLYTWSNFRESP 617
              RW +GGDFNV R + EK    R T +MR F+ FI  + L D PL N  +TWSN +  P
Sbjct: 867  FPRWCVGGDFNVIRRISEKLGETRLTLNMRCFDEFIRESGLIDPPLRNAAFTWSNMQADP 926

Query: 618  HLSLLDRYLCSDLVLSNFPNAMVKRLNRETSYHFPIQLALGAIRWGPTPSRFDNEWLQQA 677
                LDR+L S    + F  +  + L R TS H PI L    ++WGPTP RF+N WL   
Sbjct: 927  ICKRLDRFLFSSEWDTFFSQSFQEALPRWTSDHSPICLETNPLKWGPTPFRFENMWLLHP 986

Query: 678  TFQPLIEGWWNNNPLHGWPGHGFIQKLKALKVVIKDWKANFIDSSYRHKEQLLTELNILD 737
             F+     WW      GW GH F++KLK +K  +K+W           K+ +LT+L+ +D
Sbjct: 987  EFKEKFRVWWLECTGEGWEGHKFMRKLKFVKSKLKEWNIMTFGDLKERKKLILTDLSRID 1046

Query: 738  SLEEEGSIQTVQMAQSISLKDQLHSLAIAEEAHWRQRCKLKWLKEGDLNTGFFHRVVAAR 797
             +E+EG++ +  + +    + +L  + + EE  WRQ+ ++KW+KEGD N+ FFHRV   R
Sbjct: 1047 LIEQEGNLNSDLVLERTLKRRELEDVLLKEEVQWRQKSRVKWIKEGDCNSKFFHRVATGR 1106

Query: 798  RRKSSIIELLSR------------------------------------------------ 857
            R +  I  L+S                                                 
Sbjct: 1107 RSRKFIKSLISERGETLNNIEDISEEIVNFFGNLYSKPVGESWRVEGIDWVPISGESGGW 1166

Query: 858  -------------------------DDF-------------------------------- 917
                                     D F                                
Sbjct: 1167 LDRPFTEEEVRRAVFQLNKEKAPGPDGFTIAVYQECWDVIKEDLMRVFLEFHTNGVINQS 1226

Query: 918  --------------------YLTLSIGTPSH-------------------SNSQPSL--- 977
                                Y  +S+ T  +                   S+SQ +    
Sbjct: 1227 TNATFIALVPKKSQSVKISDYRPISLVTSLYKIIAKVLSGRLRKVLHETISDSQGAFVEG 1286

Query: 978  ------------------------------------------------------------ 1037
                                                                        
Sbjct: 1287 RHILDAVLIANEVVDEKRRSGEEGIVFKIDFEKAYDHVDWGFLDHVLQRKGFSQKWRLWI 1346

Query: 1038 --------------------------------KPPFL-------RQRLLTQAADFDAIEG 1097
                                              PFL         R+L +A +    EG
Sbjct: 1347 RGCLSSSSFAILVNGNAKGWVKASRGLRQGDPLSPFLFTLVADVLSRMLFRAEETGLTEG 1406

Query: 1098 FAVGNPPTH---LQFADDTLLFSSSKDSKLQNLFNFIKVFEEASGLNSNLQKTEMMGINL 1157
            F+VG   T    LQFADDT+ FS +    LQNL   + VF + SGL  NL+K+ + GIN 
Sbjct: 1407 FSVGRDRTRVSLLQFADDTIFFSKASMEHLQNLKIILLVFGQVSGLKINLEKSTISGINT 1466

Query: 1158 EDNVLESLAVRFDCRKGSWPNTYLGLPLNGNPRSPSFWDPIMEKIKKRLSSWEHNHISKG 1217
               +L SLA  FDCR   WP +YLGLPL GNP++  FWDP++E+I +RL  W+  ++S G
Sbjct: 1467 RQELLSSLASVFDCRVSEWPLSYLGLPLGGNPKTIGFWDPVVERISRRLDGWKKAYLSLG 1526

Query: 1218 GRLTLINATLSNLPIYFLSLFSIPTKVANELDKIVRNFLWKGSMDKKGQNLVRWDTVQKP 1277
            GR+TLI + LS++P YFLSLF IP  +A++++K+ RNFLW G+ + K  +LVRW+ V +P
Sbjct: 1527 GRITLIQSCLSHIPSYFLSLFKIPASIASKIEKMQRNFLWSGAGEGKKDHLVRWEVVSRP 1586

Query: 1278 IDFGGLGITSIKAKNTALLAKWNWRFITEESSLWRRVIQAKYSI--VDYHSPLRTLPAAR 1337
             + GGLG   I  +N ALL KW WRF  E S LW +VI + Y      + + +    + R
Sbjct: 1587 KELGGLGFGKISLRNIALLGKWLWRFPRERSGLWYKVIGSIYGTHPNGWDANMVVRWSHR 1646

Query: 1338 GPWRAISKLNSLILDRISYRLGEGSLPLFWKDTWINDEPLCRTYPLLF-ALHTRKLGLVR 1365
             PW+AI+++       +   +G G    FW+D W  ++ LC  +  L+  +  + L +  
Sbjct: 1647 CPWKAIAQVFQEFSPFVRLVVGNGERIRFWEDLWWGNQSLCSQFADLYRVISVKNLTVSN 1706

BLAST of Moc06g12620 vs. ExPASy TrEMBL
Match: A0A438BP29 (Transposon TX1 uncharacterized 149 kDa protein OS=Vitis vinifera OX=29760 GN=YTX2_691 PE=4 SV=1)

HSP 1 Score: 575.1 bits (1481), Expect = 8.0e-160
Identity = 366/1180 (31.02%), Postives = 519/1180 (43.98%), Query Frame = 0

Query: 416  PTKTTKKTQEQKKEVKLVRELAGSKEKRAHVKSVIQKHHPTIVILQETKVAGVDRFFIKT 475
            P +  K      ++ +L  +  GSK+KR  VK+ +    P +V++QETK    DR  + +
Sbjct: 692  PRRMAKGQSVSHEDNQLEYQGLGSKKKRRVVKNFLSSEKPDVVMIQETKKEECDRRLVGS 751

Query: 476  LWSSRNIAWAIQNSIGASGGIIILWNDPAIKVNDIKIGAFSLTLHITLVDGFHFWLTGIY 535
            +WS RN  WA   + GASGGI+I+W+   ++  ++ +                     +Y
Sbjct: 752  VWSVRNKDWAALPASGASGGILIIWDSKKLRREEVVL---------------------VY 811

Query: 536  GPPRTRDRGLFWDELANLTFLCAERWLLGGDFNVTRWVHEKSSHRRPTRSMRLFNNFIDT 595
            GP  +  R  FW EL+++  L   RW +GGDFNV R   EK    R T  M+ F+ FI  
Sbjct: 812  GPNNSALRKDFWVELSDIAGLSHPRWCVGGDFNVIRRSSEKLGGSRLTPRMKDFDEFIRD 871

Query: 596  ANLRDLPLTNGLYTWSNFRESPHLSLLDRYLCSDLVLSNFPNAMVKRLNRETSYHFPIQL 655
              L D PL +  YTWSN +E+P    LDR+L S+     FP ++   L R TS H+PI L
Sbjct: 872  CELIDSPLRSASYTWSNMQENPVCKRLDRFLYSNEWEQVFPQSLQGVLPRWTSDHWPIVL 931

Query: 656  ALGAIRWGPTPSRFDNEWLQQATFQPLIEGWWNNNPLHGWPGHGFIQKLKALKVVIKDWK 715
                 +WGPTP RF+N WLQ + F+     WW+    +GW GH F++KL+ +K  +K+W 
Sbjct: 932  ETNPFKWGPTPFRFENMWLQHSNFKENFGRWWSEFQGNGWEGHKFMRKLQFVKAKLKEWN 991

Query: 716  ANFIDSSYRHKEQLLTELNILDSLEEEGSIQTVQMAQSISLKDQLHSLAIAEEAHWRQRC 775
                    + K+ +L  L   DSLE+EG +   ++ Q    K +L  L + EE HWRQ+ 
Sbjct: 992  KTSFGELSKKKKDILAVLANFDSLEQEGGLSHERLVQRAFSKGELEELILREEIHWRQKA 1051

Query: 776  KLKWLKEGDLNTGFFHRVVAARRRKSSIIELLSRDDFYLTLSIG---------------- 835
            ++KW+KEGD N+ FFH+V   RR +  I EL +     L                     
Sbjct: 1052 RVKWVKEGDCNSKFFHKVANGRRNRKFIKELENESGLMLNNPESIKEEILKYFEKLYASP 1111

Query: 836  ------------TPSHSNSQPSLKPPF--------------------------------- 895
                        +P    S   L+ PF                                 
Sbjct: 1112 SGESWRVEGLDWSPIDGESASRLESPFTEEEIYKAIFQMDRDKAPGPDGFTIAVFQDCWD 1171

Query: 896  ------------------------------------------------------------ 955
                                                                        
Sbjct: 1172 VIKEDLVRVFTEFHRSGIINQSTNASFIVLIPKKSMSRRISDYRPISLITSLYKIIAKVL 1231

Query: 956  ------------------------------------------------------------ 1015
                                                                        
Sbjct: 1232 AGRLRGVLHETIHSTQGAFVQGRQILDAVLIANEIVDEKRRTGEEGVVFKIDFEKAYDHV 1291

Query: 1016 ----------------LRQ-----------------RLLTQAADFDAIEGFAVGNPPT-- 1075
                            LRQ                 R+L +A + + +EGF VG   T  
Sbjct: 1292 ILVNGNAKGWVKASRGLRQGDPLSPFLFTIVADVLSRMLLKAEERNVLEGFRVGRNRTRV 1351

Query: 1076 -HLQFADDTLLFSSSKDSKLQNLFNFIKVFEEASGLNSNLQKTEMMGINLEDNVLESLAV 1135
             HLQFADDT+ FSS+++  L  L + + VF   SGL  NL K+ + GINLE N L  LAV
Sbjct: 1352 SHLQFADDTIFFSSTREEDLMTLKSVLLVFGHISGLKVNLDKSNIYGINLEQNHLSRLAV 1411

Query: 1136 RFDCRKGSWPNTYLGLPLNGNPRSPSFWDPIMEKIKKRLSSWEHNHISKGGRLTLINATL 1195
              DC+   WP  YLGLPL GNP++  FWDP++E+I +RL  W+  ++S GGR+TLI + L
Sbjct: 1412 MLDCKASGWPILYLGLPLGGNPKASGFWDPVIERISRRLDGWQKAYLSFGGRITLIQSCL 1471

Query: 1196 SNLPIYFLSLFSIPTKVANELDKIVRNFLWKGSMDKKGQNLVRWDTVQKPIDFGGLGITS 1255
            +++P YFLSLF IP  VA +++++ R FLW G  + K  +LV WD V KP   GGLG   
Sbjct: 1472 THMPCYFLSLFKIPASVAAKIERMQREFLWSGVGEGKRDHLVNWDVVCKPKSRGGLGFGK 1531

Query: 1256 IKAKNTALLAKWNWRFITEESSLWRRVIQAKYSI----VDYHSPLRTLPAARGPWRAISK 1315
            I  +N ALL KW WR+  E S+LW +VI   Y       D ++ +R   + R PW+AI+ 
Sbjct: 1532 ISMRNVALLGKWLWRYPREGSALWHQVILNIYGSHSNGWDVNNNVRW--SHRCPWKAIAL 1591

Query: 1316 LNSLILDRISYRLGEGSLPLFWKDTWINDEPLCRTYPLLFALHTRK-------LGLVRDF 1366
            +         + +G+G    FW D W  D+ L   YP L ++ T K       LG  R F
Sbjct: 1592 VFQEFSKFTRFVVGDGDRIRFWDDLWWGDQTLGTQYPRLLSVVTDKNAPISSILGYSRPF 1651

BLAST of Moc06g12620 vs. ExPASy TrEMBL
Match: A5BCI7 (Reverse transcriptase domain-containing protein OS=Vitis vinifera OX=29760 GN=VITISV_029474 PE=4 SV=1)

HSP 1 Score: 572.4 bits (1474), Expect = 5.2e-159
Identity = 369/1189 (31.03%), Postives = 525/1189 (44.15%), Query Frame = 0

Query: 439  SKEKRAHVKSVIQKHHPTIVILQETKVAGVDRFFIKTLWSSRNIAWAIQNSIGASGGIII 498
            S  K A V+ V++      + ++ETK    DR F+ ++W++RN  WA   + GASGGI+I
Sbjct: 681  SPRKMAKVREVLKN-----LDIKETKKEECDRRFVGSVWTARNKDWAALPACGASGGILI 740

Query: 499  LWNDPAIKVNDIKIGAFSLTLHITLVDGFHFWLTGIYGPPRTRDRGLFWDELANLTFLCA 558
            +W+   +   ++ +G+FS+++   L      WL+ +YGP  +  R  FW EL+++  L +
Sbjct: 741  IWDAKKLSREEVVLGSFSVSIKFALNGCESLWLSAVYGPNISALRKDFWVELSDIAGLAS 800

Query: 559  ERWLLGGDFNVTRWVHEKSSHRRPTRSMRLFNNFIDTANLRDLPLTNGLYTWSNFRESPH 618
             RW +GGDFNV R   EK    R T SM+ F++FI    L DLPL +  +TWSN + +  
Sbjct: 801  PRWCVGGDFNVIRRSSEKLGGSRXTPSMKXFDDFISDCELIDLPLRSASFTWSNMQVNXV 860

Query: 619  LSLLDRYLCSDLVLSNFPNAMVKRLNRETSYHFPIQLALGAIRWGPTPSRFDNEWLQQAT 678
               LDR+L S+     FP ++   L R TS H+PI L     +WGPTP RF+N WLQ  +
Sbjct: 861  CKRLDRFLYSNEWEQAFPQSIQGVLPRWTSDHWPIVLETNPFKWGPTPFRFENMWLQHPS 920

Query: 679  FQPLIEGWWNNNPLHGWPGHGFIQKLKALKVVIKDW-KANFIDSSYRHKEQLLTELNILD 738
            F+     WW     +GW GH F++KL+ +K  +K W KA+F + S R KE +L++L   D
Sbjct: 921  FKENFGRWWREFQGNGWEGHKFMRKLQFVKAKLKVWNKASFGELSKR-KEDILSDLVNFD 980

Query: 739  SLEEEGSIQTVQMAQSISLKDQLHSLAIAEEAHWRQRCKLKWLKEGDLNTGFFHRVVAAR 798
            SLE+EG +    +AQ    K +L  L + EE HWRQ+ ++KW+KEGD N+ FFH+V   R
Sbjct: 981  SLEQEGGLSHELLAQRALKKGELEELILREEIHWRQKARVKWVKEGDCNSRFFHKVANGR 1040

Query: 799  RRKSSIIELLSRDDFYLTLSIG----------------------------TPSHSNSQPS 858
            R +  I EL + +   +  S                              +P    S   
Sbjct: 1041 RNRKFIKELENENGLMMNNSESIKEEILRYFEKLYTSPSGESWRVEGLDWSPISGESAFR 1100

Query: 859  LKPPF------------------------------------------------------- 918
            L+ PF                                                       
Sbjct: 1101 LESPFTEEEIFKAIFQMDRDKAPGPDGFTIAVFQDCWEVIKEDLVKVFTEFHRSGIINQS 1160

Query: 919  ------------------------------------------------------------ 978
                                                                        
Sbjct: 1161 TNASFIVLLPKKSMSRRISDFRPISLITSLYKIIAKVLAGRIRGVLHETIHSTQGAFVQG 1220

Query: 979  ------------------------------------------------------------ 1038
                                                                        
Sbjct: 1221 RQILDAVLIANEIVDEKRRSGEEGVVFKIDFEKAYDHVSWDFLDHVLEMKGFGIRWRKWM 1280

Query: 1039 --------------------------LRQ-----------------RLLTQAADFDAIEG 1098
                                      LRQ                 R+L +A + + +EG
Sbjct: 1281 RGCLSSVSFAVLVNGNAKGWVKASRGLRQGDPLSPFLFTIVADVLSRMLLKAEERNVLEG 1340

Query: 1099 FAVGNPPT---HLQFADDTLLFSSSKDSKLQNLFNFIKVFEEASGLNSNLQKTEMMGINL 1158
            F VG   T   HLQFADDT+ FSSS++  +  L N + VF   SGL  NL K+ + GINL
Sbjct: 1341 FKVGRNRTRVSHLQFADDTIFFSSSREEDMMTLKNVLLVFGHISGLKVNLDKSNIYGINL 1400

Query: 1159 EDNVLESLAVRFDCRKGSWPNTYLGLPLNGNPRSPSFWDPIMEKIKKRLSSWEHNHISKG 1218
            E N L  LA   DC+   WP  YLGLPL GNP++  FWDP++E+I +RL  W+  ++S G
Sbjct: 1401 EQNHLSRLAEMLDCKASGWPILYLGLPLGGNPKTSGFWDPVIERISRRLDGWQKAYLSFG 1460

Query: 1219 GRLTLINATLSNLPIYFLSLFSIPTKVANELDKIVRNFLWKGSMDKKGQNLVRWDTVQKP 1278
            GR+TLI + L+++P YFLSLF IP  VA +++++ R+FLW G  + K  +LV WD V KP
Sbjct: 1461 GRITLIQSCLTHMPCYFLSLFKIPASVAAKIERMQRDFLWSGVGEGKRDHLVNWDVVCKP 1520

Query: 1279 IDFGGLGITSIKAKNTALLAKWNWRFITEESSLWRRVIQAKYSI----VDYHSPLRTLPA 1338
               GGLG   I  +N ALL KW WR+  E S+LW +VI + Y       D ++ +R   +
Sbjct: 1521 KSRGGLGFGKISIRNVALLGKWLWRYPREGSALWHQVILSIYGSHSNGWDVNNIVRW--S 1580

Query: 1339 ARGPWRAISKLNSLILDRISYRLGEGSLPLFWKDTWINDEPLCRTYPLLFALHTRK---- 1366
             R PW+AI+ +         + +G G    FW D W  ++PL   YP L  + T K    
Sbjct: 1581 HRCPWKAIALVYQEFSKFTRFVVGNGDRIRFWDDLWWGEQPLGVQYPRLLRVVTDKNAPI 1640

BLAST of Moc06g12620 vs. TAIR 10
Match: AT3G24255.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )

HSP 1 Score: 111.7 bits (278), Expect = 4.8e-24
Identity = 96/402 (23.88%), Postives = 150/402 (37.31%), Query Frame = 0

Query: 908  LEDNVLESLAVRFDCRKGSWPNTYLGLPLNGNPRSPSFWDPIMEKIKKRLSSWEHNHISK 967
            ++DN    +   F    G+ P  YLGLPL     + S + P++EKI+ R+  W   H+S 
Sbjct: 4    VKDNDKADILHSFPFASGALPVRYLGLPLLTKKMTTSDYGPLVEKIRVRIGKWTARHLSF 63

Query: 968  GGRLTLINATLSNLPIYFLSLFSIPTKVANELDKIVRNFLWKGSMDKKGQNLVRWDTVQK 1027
             GRL LI++ + +L  +++S F +P+    E+D I  +FLW G      +  V W  V  
Sbjct: 64   AGRLQLISSVIHSLTNFWMSAFRLPSACIKEIDSICSSFLWSGPELNTKKAKVAWSDVCT 123

Query: 1028 PIDFGGLGITSIKAKNTALLAKWNWRFITEESSLWRRVIQAKYSIVDYHSPLRTLPAARG 1087
            P D GGLGI S+K  N              + S W   I    ++  +            
Sbjct: 124  PKDEGGLGIRSLKEAN--------------KGSFWS--ISGNTTLGSW------------ 183

Query: 1088 PWRAISKLNSLILDRISYRLGEGSLPLFWKDTWINDEPLCRTYPLLFALHTRKLGLVRDF 1147
             W+ I K  +L    + + +  GS   FW D W                   K+G +   
Sbjct: 184  MWKKILKHRALASGFVKHDIHNGSNTSFWFDNW------------------SKIGRL--- 243

Query: 1148 WSTETNSWDLNFHRNLKDVEIIELVALLHCLSSQRPSLNRDSDAWRLDPL---------- 1207
                    D+  HR   D+ I    ++   + + RP  +R     R++ +          
Sbjct: 244  -------IDVTGHRGCIDMGITLHASVAEAVVNHRPRRHRHDTLLRIEDVIAEVRHQGLT 303

Query: 1208 -GSFTT---------SSLLNDLQRNNTSSPP---TDLYKAIWKDSYPKKIKFFLWETSLQ 1267
             G  T              N  +    +  P    + YK +W      K     W     
Sbjct: 304  SGEDTVRWKGNGDIFKPCFNTKETWAATREPKLKVNWYKGVWFSHATPKYSVLAWIAIKN 346

Query: 1268 ALNTHDKLQRRMPYMALSPHWCPLCKLQSESIGHTLLTCPFS 1287
             L T D   R + + A +   C LC    E+  H   TCP+S
Sbjct: 364  RLTTGD---RMLSWNAGADSSCVLCHHLVETRDHLFFTCPYS 346

BLAST of Moc06g12620 vs. TAIR 10
Match: AT1G43760.1 (DNAse I-like superfamily protein )

HSP 1 Score: 76.6 bits (187), Expect = 1.7e-13
Identity = 66/238 (27.73%), Postives = 109/238 (45.80%), Query Frame = 0

Query: 582 PTRSMRLFNNFIDTANLRDLPLTNGLYTWSNFR-ESPHLSLLDRYLCSDLVLSNFPNAMV 641
           P R +  F N +  ++L D+P     YTWSN + ++P +  LDR + +    S+FP+A+ 
Sbjct: 245 PMRGLEEFQNCLRDSDLVDIPSRGVHYTWSNHQDDNPIIRKLDRAIANGDWFSSFPSAIA 304

Query: 642 KRLNRETSYHFPIQLALGAI-RWGPTPSRFDNEWLQQATFQPLIEGWWNNNPLHGWPGHG 701
                  S H P  + L  + +      R+ +      TF   +   W      G     
Sbjct: 305 VFELSGVSDHSPCIIILENLPKRSKKCFRYFSFLSTHPTFLVSLTVAWEEQIPVGSHMFS 364

Query: 702 FIQKLKALKVVIKDW-KANFIDSSYRHKEQLLTELNILDSLEEEGSIQTVQMAQSI---- 761
             + LKA K   K   +  F +  ++ KE        LDSLE   S      + S+    
Sbjct: 365 LGEHLKAAKKCCKLLNRQGFGNIQHKTKE-------ALDSLESIQSQLLTNPSDSLFRVE 424

Query: 762 -SLKDQLHSLAIAEEAHWRQRCKLKWLKEGDLNTGFFHRVVAARRRKSSIIELLSRDD 812
              + + +  A A E+ +RQ+ ++KWL++GD NT FFH+V+ A + K ++I+ L  DD
Sbjct: 425 HVARKKWNFFAAALESFYRQKSRIKWLQDGDANTRFFHKVILANQAK-NLIKFLRMDD 474

BLAST of Moc06g12620 vs. TAIR 10
Match: ATMG00310.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )

HSP 1 Score: 68.2 bits (165), Expect = 6.1e-11
Identity = 43/148 (29.05%), Postives = 69/148 (46.62%), Query Frame = 0

Query: 981  LPIYFLSLFSIPTKVANELDKIVRNFLWKGSMDKKGQNLVRWDTVQK-PIDFGGLGITSI 1040
            LP+Y +S F +   +  +L   +  F W    +K+  + V W  + K   D GGLG   +
Sbjct: 3    LPVYAMSCFRLSKLLCKKLTSAMTEFWWSSCENKRKISWVAWQKLCKSKEDDGGLGFRDL 62

Query: 1041 KAKNTALLAKWNWRFITEESSLWRRVIQAKYSIVDYHSPLRTLPAARGP---WRAISKLN 1100
               N ALLAK ++R I +  +L  R+++++Y     HS +        P   WR+I    
Sbjct: 63   GWFNQALLAKQSFRIIHQPHTLLSRLLRSRYF---PHSSMMECSVGTRPSYAWRSIIHGR 122

Query: 1101 SLILDRISYRLGEGSLPLFWKDTWINDE 1125
             L+   +   +G+G     W D WI DE
Sbjct: 123  ELLSRGLLRTIGDGIHTKVWLDRWIMDE 147

BLAST of Moc06g12620 vs. TAIR 10
Match: AT1G60720.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )

HSP 1 Score: 63.2 bits (152), Expect = 2.0e-09
Identity = 57/248 (22.98%), Postives = 101/248 (40.73%), Query Frame = 0

Query: 1103 ISYRLGEGSLPLFWKDTWINDEPLCRTYPLLFALHTRKLGLVRDFWSTETNSWDLNFHRN 1162
            +   LG G +  FW D+W +  PL +      +   R     R   +   N W L   R+
Sbjct: 13   VKCNLGNGRIAHFWHDSWTSLGPLIKVMGDYGSRSLRIPLNARVVEALGVNGWKLPLSRS 72

Query: 1163 LKDVEIIELVALLHCLSSQRPSLNRDSDAWRLDPL--GSFTTSSLLNDLQRNNTSSPPTD 1222
                 I + ++    +++  P+   DS  W +  +    F+++   + ++     +P  D
Sbjct: 73   APAQAIHDHIS---TITTPSPATIEDSFDWVVGGVVCQGFSSARTWDAIR---PRAPELD 132

Query: 1223 LYKAIWKDSYPKKIKFFLWETSLQALNTHDKLQRRMPYMALSPHWCPLCKLQSESIGHTL 1282
              KA+W      K  F +W + L  L T    QR   +  +    C LC +++ES  H L
Sbjct: 133  WAKAVWFKGAVPKHAFNMWISQLDRLPTR---QRLASWGHIQSFDCCLCTIETESRDHLL 192

Query: 1283 LTCPFSTALWNRILSIFDWSVALPTDMSQLLA---LTLVGHPFKKRKASLWSHFIRALLW 1342
             +C F+  +W    S       L    ++LL+    +    P   RK S       A+++
Sbjct: 193  FSCEFAAQVWRLAFSRLCPRQRLFCSWAELLSWMRSSSSSAPSLLRKVS-----AHAIIY 246

Query: 1343 TIWTERNH 1346
             IW +RN+
Sbjct: 253  NIWRQRNN 246

BLAST of Moc06g12620 vs. TAIR 10
Match: AT1G40390.1 (DNAse I-like superfamily protein )

HSP 1 Score: 53.1 bits (126), Expect = 2.0e-06
Identity = 35/104 (33.65%), Postives = 52/104 (50.00%), Query Frame = 0

Query: 542 DRGLFWDELANLTF---LCAERWLLGGDFNVTRWVHEKSSHRRPTRSMRLFNNF---IDT 601
           +R   WD++  L+    LC   WL+ GDFN    V E  S      S++   +    +  
Sbjct: 100 ERRSLWDDITRLSASSPLCNSPWLVVGDFNQIASVTEHYSLMPSNISLQGLEDLQACMRD 159

Query: 602 ANLRDLPLTNGLYTWSNF-RESPHLSLLDRYLCSDLVLSNFPNA 639
           ++L DLP    LYTWSN  +++P L  LDR + +   L+ FP A
Sbjct: 160 SDLVDLPCRGVLYTWSNHQQDNPILRKLDRAIVNGCWLATFPTA 203

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
RVW70235.13.4e-16531.59LINE-1 retrotransposable element ORF2 protein [Vitis vinifera][more]
RVW64408.13.0e-16130.96LINE-1 retrotransposable element ORF2 protein [Vitis vinifera][more]
RVW12714.11.6e-15931.02Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera][more]
CAN65484.11.1e-15831.03hypothetical protein VITISV_029474 [Vitis vinifera][more]
CAN68165.11.1e-15529.98hypothetical protein VITISV_008538 [Vitis vinifera][more]
Match NameE-valueIdentityDescription
P0C2F61.0e-4227.63Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1... [more]
P932958.6e-1029.05Uncharacterized mitochondrial protein AtMg00310 OS=Arabidopsis thaliana OX=3702 ... [more]
Match NameE-valueIdentityDescription
M5WJ762.3e-16732.00Reverse transcriptase domain-containing protein (Fragment) OS=Prunus persica OX=... [more]
A0A438GDE71.7e-16531.59LINE-1 retrotransposable element ORF2 protein OS=Vitis vinifera OX=29760 GN=Pol_... [more]
A0A438FWU51.5e-16130.96LINE-1 retrotransposable element ORF2 protein OS=Vitis vinifera OX=29760 GN=LORF... [more]
A0A438BP298.0e-16031.02Transposon TX1 uncharacterized 149 kDa protein OS=Vitis vinifera OX=29760 GN=YTX... [more]
A5BCI75.2e-15931.03Reverse transcriptase domain-containing protein OS=Vitis vinifera OX=29760 GN=VI... [more]
Match NameE-valueIdentityDescription
AT3G24255.14.8e-2423.88RNA-directed DNA polymerase (reverse transcriptase)-related family protein [more]
AT1G43760.11.7e-1327.73DNAse I-like superfamily protein [more]
ATMG00310.16.1e-1129.05RNA-directed DNA polymerase (reverse transcriptase)-related family protein [more]
AT1G60720.12.0e-0922.98RNA-directed DNA polymerase (reverse transcriptase)-related family protein [more]
AT1G40390.12.0e-0633.65DNAse I-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR026960Reverse transcriptase zinc-binding domainPFAMPF13966zf-RVTcoord: 1200..1290
e-value: 1.1E-15
score: 58.1
IPR036691Endonuclease/exonuclease/phosphatase superfamilyGENE3D3.60.10.10Endonuclease/exonuclease/phosphatasecoord: 433..657
e-value: 1.5E-27
score: 98.9
IPR036691Endonuclease/exonuclease/phosphatase superfamilySUPERFAMILY56219DNase I-likecoord: 442..655
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 340..355
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 322..339
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 322..355
NoneNo IPR availablePANTHERPTHR33116REVERSE TRANSCRIPTASE ZINC-BINDING DOMAIN-CONTAINING PROTEIN-RELATED-RELATEDcoord: 609..808
NoneNo IPR availablePANTHERPTHR33116REVERSE TRANSCRIPTASE ZINC-BINDING DOMAIN-CONTAINING PROTEIN-RELATED-RELATEDcoord: 868..1307
NoneNo IPR availablePANTHERPTHR33116:SF38OS01G0158850 PROTEINcoord: 868..1307
coord: 609..808

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Moc06g12620.1Moc06g12620.1mRNA