Clc03G10110 (gene) Watermelon (cordophanus) v2

Overview
NameClc03G10110
Typegene
OrganismCitrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
DescriptionReverse transcriptase
LocationClcChr03: 12666269 .. 12668527 (+)
RNA-Seq ExpressionClc03G10110
SyntenyClc03G10110
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCATTCTGGCTATGTAACACACCAAGGACGTTCCAAAGGTGCATGATGGCAATATTCTCTGATTATTTGGAACAGTCAGTAGAGGTGTTTATGGACGACTTCTCAGTATTTGGGAAATCTTATGATGAATGCTTGACCAACCTAGAACAAGTGCTAAAGCGATGCGAAGACACAAATCTAGTCCTCAACTGGGAAAAATGCCACTTTATGGAGACTAAGGGTATTGTGTTGGGGTATAAAATCTCCAAGGTTGGATTGGAAGTGGATCAGGCAAAGATAGATGCGATAGCAAAACTCCCAGCCCCCACAAATGTTCAGACTTTGCGAAGTTTCCTAGGCCATGCGGGCTTCTATAGAAGATTCATTAAGGGATTCTCGCAAGTAGCAAAACCACTGAGCGAGTTGCTGGAAGTCAACAGGGAATTCAATTTCAACGGTAAATGCTTAAACGCATTTGAATCTTTAAGGCAAGCTCTGATTTCAGCACCTATTTTAGTTGCACCGGACTGGTCTCTCCCGTTTGAATTAATGTGCGACGCAAGTAACCATGCGGTGGGAGCAGTATTGGGGCAGAGAAAAGAGAAAATCATGCACCCGATCTATTATGCTAGTAAAACACTGAATTCATCTCAGGAGAACTACACTACTACTGAGAAGGAAATGTTAGCCATAGTCTTTGTGGTTGACAAATTTAGAGCATACTTGATAGGGTCAAAGGTGACCATCTATAGTGATCATTCTGCGATCAAGTATTTGATGGCGAAAAAGGACGCAAAGCCTAGACTTGTCCGTTGGGTCCTACTGTTGCAAGAATTCGATTTGGAGATTAAAGATAGAAGGGGAACCGAGAATCAGGTTGCGAACCACCTATCTCGGTTGGAAAATAAAGAGGTCCAGGAGAGTTGGAGTGATATAGAGGAACGATTCCCAGACAAGCATGTAATGAACGCAGAGAGTCAGGAACCGTGGTACGCAGACATAGTCAACTATCTGGTTTGCAACCAATGGCCTGAAGAATTTAACGCTCAACAAAAGAAAAAGCTCCGACATGAAAGTAAGTTCTACTGCTGGGATGAGCCATATCTATACAGACTTGGCCCTGACCACATACTGCGTCGATGCGTTCCAGAATATGAAACGCATAGCATCTTGAGAAGCTGTCATGAAGCACCTTACGGAGGACACTTTGGAGGGCAGAGAACAGTTACAAAGGTGTTGCAAAGTGGGTATTTCTGGCCCACATTATTCAAAGACCCAAGGGCATATGCGGTAGCTTGCGATGGTTGTCAGAGAACAGGCAACATTTCCAACCGGAATGAGATGCCTTTGAATTCAATGCTGGAAGTTGAGTTGTTCGACGTATGGGGAATTGACTTCATGGGACCATTCCCTCCTTCTTGTGGCAATCAATACATTTTAGTAGCGGTCGACTACGTATCAAAGTGGGTAGAAGCAGCAGCCTGTGCAAGGAATGACGCAAATGCAGTGTCCAAGTTCTTAAAGAAACAAATCTTCTCTCGATTTGGGACACCAAGGGCGATAATTAGTGATGAAGGTACGCATTTTATAAATCGCATAATCACTAATTTACTGACTAAGTTTAATGTCTCGCACAGGGTAGCAACTGCCTATCACCCATAGACAAACGGCCAAGTTGAAATAACAAACAGGGAGATCAAGTCCATACTGGAAAAGGTCGTGAGCACATCAAGGAAAGATTGGACAGAGAGATTAGATGAAGCTCTATGGGCCTACAGAACGGCATTCAAAACACTGATAGGCATGTCACCCTATGCGTTGGTGTTTGGTAAAGCATGCCATCTCCCACTTGAGCTGGAACACAAGGCCATCTGGGCCATGAAGAAGCTCAATTTAGACCAGGAGGCAAGCGGAGAAGCCAGAAAGCTTCAACTAAATGAACTCCTGGAGTGGAGGCACTCAGCTTATGAAAACGCAAAGCTGTACAAAGAAAGGACCAAGAAATGGCACGACAAGAACATTAGTAAGAAAATTCTATACGTTTGCCAGAAGGTCCTGTTATTTAATTCAAGGTTGCGTTTGTTCCCAGGTAAGTTGAAGTCTCGATGGTCGGGTCCATTCATAACCAAGGAAGTGTTCCCGCATGGTGCGGTCATGCTGACAAATGAAAATGGAACCACACCCTTCAAGGTCAATGGACAAAGGGTAAAACCCTACCACATTGGAGAGTTCGAAATTAACAAGACTTCCATTGACCTACGCGAGTGTAATGACTGA

mRNA sequence

ATGCCATTCTGGCTATGTAACACACCAAGGACGTTCCAAAGGTGCATGATGGCAATATTCTCTGATTATTTGGAACAGTCAGTAGAGGTGTTTATGGACGACTTCTCAGTATTTGGGAAATCTTATGATGAATGCTTGACCAACCTAGAACAAGTGCTAAAGCGATGCGAAGACACAAATCTAGTCCTCAACTGGGAAAAATGCCACTTTATGGAGACTAAGGGTATTGTGTTGGGGTATAAAATCTCCAAGGTTGGATTGGAAGTGGATCAGGCAAAGATAGATGCGATAGCAAAACTCCCAGCCCCCACAAATGTTCAGACTTTGCGAAGTTTCCTAGGCCATGCGGGCTTCTATAGAAGATTCATTAAGGGATTCTCGCAAGTAGCAAAACCACTGAGCGAGTTGCTGGAAGTCAACAGGGAATTCAATTTCAACGGTAAATGCTTAAACGCATTTGAATCTTTAAGGCAAGCTCTGATTTCAGCACCTATTTTAGTTGCACCGGACTGGTCTCTCCCGTTTGAATTAATGTGCGACGCAAGTAACCATGCGGTGGGAGCAGTATTGGGGCAGAGAAAAGAGAAAATCATGCACCCGATCTATTATGCTAGTAAAACACTGAATTCATCTCAGGAGAACTACACTACTACTGAGAAGGAAATGTTAGCCATAGTCTTTGTGGTTGACAAATTTAGAGCATACTTGATAGGGTCAAAGGTGACCATCTATAGTGATCATTCTGCGATCAAGTATTTGATGGCGAAAAAGGACGCAAAGCCTAGACTTGTCCGTTGGGTCCTACTGTTGCAAGAATTCGATTTGGAGATTAAAGATAGAAGGGGAACCGAGAATCAGGTTGCGAACCACCTATCTCGGTTGGAAAATAAAGAGGTCCAGGAGAGTTGGAGTGATATAGAGGAACGATTCCCAGACAAGCATGTAATGAACGCAGAGAGTCAGGAACCGTGGTACGCAGACATAGTCAACTATCTGGTTTGCAACCAATGGCCTGAAGAATTTAACGCTCAACAAAAGAAAAAGCTCCGACATGAAAGTAAGTTCTACTGCTGGGATGAGCCATATCTATACAGACTTGGCCCTGACCACATACTGCGTCGATGCGTTCCAGAATATGAAACGCATAGCATCTTGAGAAGCTGTCATGAAGCACCTTACGGAGGACACTTTGGAGGGCAGAGAACAGTTACAAAGGTGTTGCAAAGTGGGTATTTCTGGCCCACATTATTCAAAGACCCAAGGGCATATGCGGTAGCTTGCGATGGTTGTCAGAGAACAGGCAACATTTCCAACCGGAATGAGATGCCTTTGAATTCAATGCTGGAAGTTGAGTTGTTCGACGTATGGGGAATTGACTTCATGGGACCATTCCCTCCTTCTTGTGGCAATCAATACATTTTAGTAGCGGTCGACTACGTATCAAAGTGGGTAGAAGCAGCAGCCTGTGCAAGGAATGACGCAAATGCAGTGTCCAAGTTCTTAAAGAAACAAATCTTCTCTCGATTTGGGACACCAAGGGCGATAATTAGTGATGAAGGCATGTCACCCTATGCGTTGGTGTTTGGTAAAGCATGCCATCTCCCACTTGAGCTGGAACACAAGGCCATCTGGGCCATGAAGAAGCTCAATTTAGACCAGGAGGCAAGCGGAGAAGCCAGAAAGCTTCAACTAAATGAACTCCTGGAGTGGAGGCACTCAGCTTATGAAAACGCAAAGCTGTACAAAGAAAGGACCAAGAAATGGCACGACAAGAACATTAGTAAGAAAATTCTATACGTTTGCCAGAAGGTCCTGTTATTTAATTCAAGGTTGCGTTTGTTCCCAGGTAAGTTGAAGTCTCGATGGTCGGGTCCATTCATAACCAAGGAAGTGTTCCCGCATGGTGCGGTCATGCTGACAAATGAAAATGGAACCACACCCTTCAAGGTCAATGGACAAAGGGTAAAACCCTACCACATTGGAGAGTTCGAAATTAACAAGACTTCCATTGACCTACGCGAGTGTAATGACTGA

Coding sequence (CDS)

ATGCCATTCTGGCTATGTAACACACCAAGGACGTTCCAAAGGTGCATGATGGCAATATTCTCTGATTATTTGGAACAGTCAGTAGAGGTGTTTATGGACGACTTCTCAGTATTTGGGAAATCTTATGATGAATGCTTGACCAACCTAGAACAAGTGCTAAAGCGATGCGAAGACACAAATCTAGTCCTCAACTGGGAAAAATGCCACTTTATGGAGACTAAGGGTATTGTGTTGGGGTATAAAATCTCCAAGGTTGGATTGGAAGTGGATCAGGCAAAGATAGATGCGATAGCAAAACTCCCAGCCCCCACAAATGTTCAGACTTTGCGAAGTTTCCTAGGCCATGCGGGCTTCTATAGAAGATTCATTAAGGGATTCTCGCAAGTAGCAAAACCACTGAGCGAGTTGCTGGAAGTCAACAGGGAATTCAATTTCAACGGTAAATGCTTAAACGCATTTGAATCTTTAAGGCAAGCTCTGATTTCAGCACCTATTTTAGTTGCACCGGACTGGTCTCTCCCGTTTGAATTAATGTGCGACGCAAGTAACCATGCGGTGGGAGCAGTATTGGGGCAGAGAAAAGAGAAAATCATGCACCCGATCTATTATGCTAGTAAAACACTGAATTCATCTCAGGAGAACTACACTACTACTGAGAAGGAAATGTTAGCCATAGTCTTTGTGGTTGACAAATTTAGAGCATACTTGATAGGGTCAAAGGTGACCATCTATAGTGATCATTCTGCGATCAAGTATTTGATGGCGAAAAAGGACGCAAAGCCTAGACTTGTCCGTTGGGTCCTACTGTTGCAAGAATTCGATTTGGAGATTAAAGATAGAAGGGGAACCGAGAATCAGGTTGCGAACCACCTATCTCGGTTGGAAAATAAAGAGGTCCAGGAGAGTTGGAGTGATATAGAGGAACGATTCCCAGACAAGCATGTAATGAACGCAGAGAGTCAGGAACCGTGGTACGCAGACATAGTCAACTATCTGGTTTGCAACCAATGGCCTGAAGAATTTAACGCTCAACAAAAGAAAAAGCTCCGACATGAAAGTAAGTTCTACTGCTGGGATGAGCCATATCTATACAGACTTGGCCCTGACCACATACTGCGTCGATGCGTTCCAGAATATGAAACGCATAGCATCTTGAGAAGCTGTCATGAAGCACCTTACGGAGGACACTTTGGAGGGCAGAGAACAGTTACAAAGGTGTTGCAAAGTGGGTATTTCTGGCCCACATTATTCAAAGACCCAAGGGCATATGCGGTAGCTTGCGATGGTTGTCAGAGAACAGGCAACATTTCCAACCGGAATGAGATGCCTTTGAATTCAATGCTGGAAGTTGAGTTGTTCGACGTATGGGGAATTGACTTCATGGGACCATTCCCTCCTTCTTGTGGCAATCAATACATTTTAGTAGCGGTCGACTACGTATCAAAGTGGGTAGAAGCAGCAGCCTGTGCAAGGAATGACGCAAATGCAGTGTCCAAGTTCTTAAAGAAACAAATCTTCTCTCGATTTGGGACACCAAGGGCGATAATTAGTGATGAAGGCATGTCACCCTATGCGTTGGTGTTTGGTAAAGCATGCCATCTCCCACTTGAGCTGGAACACAAGGCCATCTGGGCCATGAAGAAGCTCAATTTAGACCAGGAGGCAAGCGGAGAAGCCAGAAAGCTTCAACTAAATGAACTCCTGGAGTGGAGGCACTCAGCTTATGAAAACGCAAAGCTGTACAAAGAAAGGACCAAGAAATGGCACGACAAGAACATTAGTAAGAAAATTCTATACGTTTGCCAGAAGGTCCTGTTATTTAATTCAAGGTTGCGTTTGTTCCCAGGTAAGTTGAAGTCTCGATGGTCGGGTCCATTCATAACCAAGGAAGTGTTCCCGCATGGTGCGGTCATGCTGACAAATGAAAATGGAACCACACCCTTCAAGGTCAATGGACAAAGGGTAAAACCCTACCACATTGGAGAGTTCGAAATTAACAAGACTTCCATTGACCTACGCGAGTGTAATGACTGA

Protein sequence

MPFWLCNTPRTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLEQVLKRCEDTNLVLNWEKCHFMETKGIVLGYKISKVGLEVDQAKIDAIAKLPAPTNVQTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFNGKCLNAFESLRQALISAPILVAPDWSLPFELMCDASNHAVGAVLGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFVVDKFRAYLIGSKVTIYSDHSAIKYLMAKKDAKPRLVRWVLLLQEFDLEIKDRRGTENQVANHLSRLENKEVQESWSDIEERFPDKHVMNAESQEPWYADIVNYLVCNQWPEEFNAQQKKKLRHESKFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEAPYGGHFGGQRTVTKVLQSGYFWPTLFKDPRAYAVACDGCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACARNDANAVSKFLKKQIFSRFGTPRAIISDEGMSPYALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKLYKERTKKWHDKNISKKILYVCQKVLLFNSRLRLFPGKLKSRWSGPFITKEVFPHGAVMLTNENGTTPFKVNGQRVKPYHIGEFEINKTSIDLRECND
Homology
BLAST of Clc03G10110 vs. NCBI nr
Match: PIM97577.1 (DNA-directed DNA polymerase [Handroanthus impetiginosus])

HSP 1 Score: 902.1 bits (2330), Expect = 2.9e-258
Identity = 441/734 (60.08%), Postives = 527/734 (71.80%), Query Frame = 0

Query: 1    MPFWLCNTPRTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLEQVLKRCEDTN 60
            MPF LCN P TFQRCMMAIF+D +E  +EVFMDDFSV+G S+DECL NL  VLKRCEDTN
Sbjct: 826  MPFGLCNAPATFQRCMMAIFTDMVENCLEVFMDDFSVYGNSFDECLNNLSCVLKRCEDTN 885

Query: 61   LVLNWEKCHFMETKGIVLGYKISKVGLEVDQAKIDAIAKLPAPTNVQTLRSFLGHAGFYR 120
            L+LNWEKCHFM  +GIVLG+K+S  G+EVD+AK++ I KLP PT+V+ +RSFLGHAGFYR
Sbjct: 886  LILNWEKCHFMVQEGIVLGHKVSNRGIEVDKAKLETIEKLPPPTSVKGVRSFLGHAGFYR 945

Query: 121  RFIKGFSQVAKPLSELLEVNREFNFNGKCLNAFESLRQALISAPILVAPDWSLPFELMCD 180
            RFIK FS+++KPL  LLE +  FNF+  C +AF  L+  LISAPI+  PDWS PFELMCD
Sbjct: 946  RFIKDFSKISKPLCNLLEKDIPFNFDDACRDAFNDLKGRLISAPIITVPDWSFPFELMCD 1005

Query: 181  ASNHAVGAVLGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFVVDKFRAYLIGSK 240
            AS+ AVGAVLGQRK+KI   IYYASKTLN +Q NYTTTEKE+LA+VF  DKFR+YL+G+K
Sbjct: 1006 ASDFAVGAVLGQRKDKIFRSIYYASKTLNDAQLNYTTTEKELLAVVFAFDKFRSYLVGTK 1065

Query: 241  VTIYSDHSAIKYLMAKKDAKPRLVRWVLLLQEFDLEIKDRRGTENQVANHLSRLENKEVQ 300
            V +Y+DH+AI+YL+ KKDAKPRL+RWVLLLQEFDLEI+DR+GTENQ+A+HLSRLE+    
Sbjct: 1066 VIVYTDHAAIRYLIEKKDAKPRLIRWVLLLQEFDLEIRDRKGTENQIADHLSRLESPAKT 1125

Query: 301  ESWSDIEERFPDKHVMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLRHESKFYCWD 360
            +  + I + FPD+ ++    S  PWYADIVNYL C   P + +AQQKKK   +++ Y WD
Sbjct: 1126 DEPNLINDNFPDEQLLAIVASDVPWYADIVNYLTCGIIPFDLSAQQKKKFLFDTRRYFWD 1185

Query: 361  EPYLYRLGPDHILRRCVPEYETHSILRSCHEAPYGGHFGGQRTVTKVLQSGYFWPTLFKD 420
            +P+L++ GPD+ILRRCVPE E + IL  CH +PYGGHF G RT  K+LQSG+FWP LFKD
Sbjct: 1186 DPFLFKQGPDNILRRCVPEIEMNDILEQCHASPYGGHFHGDRTAAKILQSGFFWPNLFKD 1245

Query: 421  PRAYAVACDGCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDY 480
              ++   CD CQRTGNIS R+EMPLN++LEVELFDVWGIDFMGPF PS GN YILVAVDY
Sbjct: 1246 AHSFVANCDRCQRTGNISRRHEMPLNTILEVELFDVWGIDFMGPFIPSFGNMYILVAVDY 1305

Query: 481  VSKWVEAAACARNDANAVSKFLKKQIFSRFGTPRAIISDE-------------------- 540
            VSKWVEAAA   ND+  V  F+KK IF+RFGTPRAIISD                     
Sbjct: 1306 VSKWVEAAAVPNNDSKVVVNFIKKNIFTRFGTPRAIISDGGTHFCNRSFEALLSKYGVKH 1365

Query: 541  -------------------------------------------------------GMSPY 600
                                                                   GMSPY
Sbjct: 1366 KISTPYHPQTSGQVEVSNREIKRILEKTVSSTRKDWSKRLDEALWAYRTAYKTPIGMSPY 1425

Query: 601  ALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKLYKER 659
             LVFGKACHLP+ELEH A WA++KLN D +A+GE R LQLNEL E+R  AYENAK+YKE+
Sbjct: 1426 RLVFGKACHLPVELEHNAYWAIRKLNFDMQAAGEKRLLQLNELDEFRLHAYENAKIYKEK 1485

BLAST of Clc03G10110 vs. NCBI nr
Match: BBH06778.1 (transposable element gene [Prunus dulcis])

HSP 1 Score: 883.2 bits (2281), Expect = 1.4e-252
Identity = 433/750 (57.73%), Postives = 528/750 (70.40%), Query Frame = 0

Query: 1   MPFWLCNTPRTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLEQVLKRCEDTN 60
           MPF LCN P TFQRCMM+IFSD +E+ +EVFMDDFSVFG S+D CL NL  VL RCE+TN
Sbjct: 148 MPFGLCNAPATFQRCMMSIFSDMVERFIEVFMDDFSVFGSSFDSCLDNLALVLARCEETN 207

Query: 61  LVLNWEKCHFMETKGIVLGYKISKVGLEVDQAKIDAIAKLPAPTNVQTLRSFLGHAGFYR 120
           LVLNWEKCHFM  +GIVLG+KIS  G+EVD+AKI+ I KLP P+ V+ +RSFLGHAGFYR
Sbjct: 208 LVLNWEKCHFMVQEGIVLGHKISARGIEVDRAKIETIEKLPPPSTVKGIRSFLGHAGFYR 267

Query: 121 RFIKGFSQVAKPLSELLEVNREFNFNGKCLNAFESLRQALISAPILVAPDWSLPFELMCD 180
           RFIK FS++ KPL +LL  + EFNF+  CL AF  L+  L +AP+++APDW LPFE+MCD
Sbjct: 268 RFIKDFSKITKPLCKLLLKDSEFNFDSDCLEAFNLLKTKLTTAPVIMAPDWELPFEIMCD 327

Query: 181 ASNHAVGAVLGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFVVDKFRAYLIGSK 240
           AS++A+GAVLGQRK K++H I+YAS+TLN +Q NY TTEKE+LA+VF +DKFR+YL+G+K
Sbjct: 328 ASDYAIGAVLGQRKNKLLHVIHYASRTLNDAQLNYATTEKELLAVVFALDKFRSYLLGAK 387

Query: 241 VTIYSDHSAIKYLMAKKDAKPRLVRWVLLLQEFDLEIKDRRGTENQVANHLSRL-ENKEV 300
           V +Y+DH+A+K+L+AKK+AKPRL+RWVLLLQEFD+EI+D++G+EN VA+HLSRL    EV
Sbjct: 388 VIVYTDHAALKFLLAKKEAKPRLIRWVLLLQEFDIEIRDKKGSENVVADHLSRLVREDEV 447

Query: 301 QESWSDIEERFPDKHVMNAESQE----PWYADIVNYLVCNQWPEEFNAQQKKKLRHESKF 360
            E    I E FPD+ + +  S +    PWYAD VNYL C   P + +  QKKK     K 
Sbjct: 448 IEDVGPILETFPDEQLYSIYSAKEFITPWYADFVNYLACGILPPDMSFYQKKKFLSLVKH 507

Query: 361 YCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEAPYGGHFGGQRTVTKVLQSGYFWPT 420
           Y WD+PYL++ GPD ++RRCVPE E   IL  CH    GGH+G  +T  KVLQSG+FWPT
Sbjct: 508 YYWDDPYLWKHGPDQVIRRCVPETEMADILLHCHTLACGGHYGASKTTAKVLQSGFFWPT 567

Query: 421 LFKDPRAYAVACDGCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILV 480
           LFKD + +   CD CQRTGNIS+RN+MPLN++LEVELFDVWGIDFMGPFP S GN YILV
Sbjct: 568 LFKDAQDFVARCDPCQRTGNISSRNQMPLNNILEVELFDVWGIDFMGPFPASYGNLYILV 627

Query: 481 AVDYVSKWVEAAACARNDANAVSKFLKKQIFSRFGTPRAIISDE---------------- 540
           AVDYVSKWVEAAA   NDA  V +FL+K IF+RFG PRAIISD                 
Sbjct: 628 AVDYVSKWVEAAALPTNDAKVVVRFLRKNIFTRFGVPRAIISDGGTHFCNRQFNSLLAKY 687

Query: 541 -----------------------------------------------------------G 600
                                                                      G
Sbjct: 688 GITHKVSTPYHPQTSGQVEVSNRELKKILEKTVSASRKDWSLKLDDALWAYRTAFKAPIG 747

Query: 601 MSPYALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKL 660
           MSPY LVFGKACHLP+ELEHKA WA+K LN D  ++GE RKLQLNEL E R+ +YENAK+
Sbjct: 748 MSPYRLVFGKACHLPVELEHKAFWAIKTLNFDMSSAGEKRKLQLNELEELRNESYENAKI 807

Query: 661 YKERTKKWHDKNISKKILYVCQKVLLFNSRLRLFPGKLKSRWSGPFITKEVFPHGAVMLT 671
           YK+RTKKWHDK+I KK  YV Q VLL+NSRL+LFPGKL+SRWSGPF    V+P+G V + 
Sbjct: 808 YKDRTKKWHDKHILKKEFYVGQSVLLYNSRLKLFPGKLRSRWSGPFTVLTVYPYGTVEIK 867

BLAST of Clc03G10110 vs. NCBI nr
Match: XP_028962178.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC114826270 [Malus domestica])

HSP 1 Score: 874.0 bits (2257), Expect = 8.3e-250
Identity = 424/749 (56.61%), Postives = 526/749 (70.23%), Query Frame = 0

Query: 1    MPFWLCNTPRTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLEQVLKRCEDTN 60
            MPF LCN P TFQRCMM+IFSD +E+ +EVFMDDFSVFG S+D CL NL  VL+RCE+TN
Sbjct: 830  MPFGLCNAPATFQRCMMSIFSDLVERCIEVFMDDFSVFGSSFDSCLDNLSSVLQRCEETN 889

Query: 61   LVLNWEKCHFMETKGIVLGYKISKVGLEVDQAKIDAIAKLPAPTNVQTLRSFLGHAGFYR 120
            LVLNWEKCHFM  +GI LG+K+S  G+EVD+AKI+ I+KLP PT+++ +RSFLGHAGFYR
Sbjct: 890  LVLNWEKCHFMVQEGIXLGHKVSVNGIEVDKAKIETISKLPPPTSIKGVRSFLGHAGFYR 949

Query: 121  RFIKGFSQVAKPLSELLEVNREFNFNGKCLNAFESLRQALISAPILVAPDWSLPFELMCD 180
            RFIK FS++ KPL  LL    EF F+  C  AF  L+  L +AP++VAPDW LPFE+MCD
Sbjct: 950  RFIKDFSKITKPLCNLLLKEAEFVFDSSCFEAFNVLKMKLTTAPVIVAPDWELPFEIMCD 1009

Query: 181  ASNHAVGAVLGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFVVDKFRAYLIGSK 240
            AS++A+GAVLGQR++K++H IYYAS+TLN +Q NY TTEKE+LA+VF +DKFR+YLIGSK
Sbjct: 1010 ASDYAIGAVLGQRRDKLLHVIYYASRTLNDAQLNYATTEKELLAVVFALDKFRSYLIGSK 1069

Query: 241  VTIYSDHSAIKYLMAKKDAKPRLVRWVLLLQEFDLEIKDRRGTENQVANHLSRLENKEVQ 300
            V +Y+DHSA+KYL++KKDAKPRL+RWVLLLQEFDLEI+D++G+EN VA+HLSR+ + E  
Sbjct: 1070 VIVYTDHSALKYLLSKKDAKPRLIRWVLLLQEFDLEIRDKKGSENVVADHLSRIMHHEGD 1129

Query: 301  ESWSDIEERFPDKHVMNAESQ-EPWYADIVNYLVCNQWPEEFNAQQKKKLRHESKFYCWD 360
                 I E FPD+ +   +S   PWYAD VNYLV +  P +   QQKK+     + Y WD
Sbjct: 1130 NDLVPISETFPDEQLFTIKSSVTPWYADYVNYLVSDIMPPDLTWQQKKRFISLVRHYYWD 1189

Query: 361  EPYLYRLGPDHILRRCVPEYETHSILRSCHEAPYGGHFGG-QRTVTKVLQSGYFWPTLFK 420
            EPYL++  PD  +RRCVP+ E   ILR CH    GGHFG  +R + ++LQSG++WPTLFK
Sbjct: 1190 EPYLWKHCPDQCIRRCVPDDEMEEILRHCHSLACGGHFGATKRLLPRLLQSGFWWPTLFK 1249

Query: 421  DPRAYAVACDGCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVD 480
            D   +   CD CQR GNIS+R++MPLN++LEVELFDVWGIDFMGPFP S G QYILVAVD
Sbjct: 1250 DAHTFVSTCDRCQRIGNISSRHQMPLNNILEVELFDVWGIDFMGPFPSSYGKQYILVAVD 1309

Query: 481  YVSKWVEAAACARNDANAVSKFLKKQIFSRFGTPRAIISDE------------------- 540
            YVSKWVEA A   NDA  V  FL+K IF+RFGTPRAIISD                    
Sbjct: 1310 YVSKWVEAIALPTNDAKVVVHFLRKHIFTRFGTPRAIISDGGKHFCNRHFNALLAKYGIT 1369

Query: 541  --------------------------------------------------------GMSP 600
                                                                    GMSP
Sbjct: 1370 HKVATPYHPQTSGQVEISNREIKNILEKTVGLSRKDWAAKLDDALWAYRTAFKTPIGMSP 1429

Query: 601  YALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKLYKE 660
            Y LVFGKACHLP+ELEHKA WA+KKLN + +A+GE R LQLNE+ E+R+ AYENAK+YKE
Sbjct: 1430 YRLVFGKACHLPVELEHKAFWAVKKLNFEMDAAGEKRALQLNEMEEFRNDAYENAKIYKE 1489

Query: 661  RTKKWHDKNISKKILYVCQKVLLFNSRLRLFPGKLKSRWSGPFITKEVFPHGAVMLTNEN 673
            RTKKWHD++I ++  Y+ Q+VLLFNSRL+LFPGKL++RWSGPF   +VFP+G + + +  
Sbjct: 1490 RTKKWHDQHILRREFYIGQQVLLFNSRLKLFPGKLRTRWSGPFTVVQVFPYGTIEIRDHT 1549

BLAST of Clc03G10110 vs. NCBI nr
Match: XP_031379021.1 (uncharacterized protein LOC116194359 [Punica granatum])

HSP 1 Score: 872.5 bits (2253), Expect = 2.4e-249
Identity = 429/749 (57.28%), Postives = 531/749 (70.89%), Query Frame = 0

Query: 1    MPFWLCNTPRTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLEQVLKRCEDTN 60
            MPF LCN P TFQRCMM+IFSD LE  +E+FMDDFSVFGKS++ CLTNL  VLKRC++TN
Sbjct: 927  MPFGLCNAPATFQRCMMSIFSDMLENFIEIFMDDFSVFGKSFESCLTNLGCVLKRCKETN 986

Query: 61   LVLNWEKCHFMETKGIVLGYKISKVGLEVDQAKIDAIAKLPAPTNVQTLRSFLGHAGFYR 120
            L+LNWEKCHFM  +GIVLG+K+SK G+EVD+AK++ I KLP PT+ + +RSFLGHAGFYR
Sbjct: 987  LLLNWEKCHFMVREGIVLGHKVSKKGIEVDRAKVEIIEKLPPPTSTKGVRSFLGHAGFYR 1046

Query: 121  RFIKGFSQVAKPLSELLEVNREFNFNGKCLNAFESLRQALISAPILVAPDWSLPFELMCD 180
            RFIK FS++++PL  LLE +  F FN  CL AF  L++ L SAP++VAP+W LPFELMCD
Sbjct: 1047 RFIKDFSKISRPLCNLLEKDSAFVFNDNCLQAFNLLKEKLTSAPVIVAPNWELPFELMCD 1106

Query: 181  ASNHAVGAVLGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFVVDKFRAYLIGSK 240
            AS++AVGAVLGQR+ K+ H IYYAS+TLN +Q+NY TTEKE+LA++F  DKFR YLIGSK
Sbjct: 1107 ASDYAVGAVLGQRRGKVFHAIYYASRTLNEAQKNYATTEKELLAVIFACDKFRPYLIGSK 1166

Query: 241  VTIYSDHSAIKYLMAKKDAKPRLVRWVLLLQEFDLEIKDRRGTENQVANHLSRLENKEVQ 300
            + +Y+DH+A+KYL AK DAKPRL+RW+LLLQEFDLEI+D +GTEN VA+HLSRLE+  + 
Sbjct: 1167 IIVYTDHAALKYLFAKADAKPRLIRWILLLQEFDLEIRDTKGTENVVADHLSRLESDCLD 1226

Query: 301  ESWSDIEERFPDKHVMNAESQE-PWYADIVNYLVCNQWPEEFNAQQKKKLRHESKFYCWD 360
               S I E+FPD+ +  AE Q  PWYADIVNY+V N  P   ++QQKKK  H+ K+Y WD
Sbjct: 1227 ---SPINEKFPDEQLHVAEIQGLPWYADIVNYMVSNITPYGLSSQQKKKFLHDVKYYFWD 1286

Query: 361  EPYLYRLGPDHILRRCVPEYETHSILRSCHEAPYGGHFGGQRTVTKVLQSGYFWPTLFKD 420
            EPYL++   D ++RRCVPE E  SI++ CH    GGHFG +RT TK+L  G++WP +F D
Sbjct: 1287 EPYLFKYCADQVIRRCVPETEQLSIIQHCHSKEAGGHFGVERTATKILSCGFYWPRVFHD 1346

Query: 421  PRAYAVACDGCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDY 480
             R Y ++C  CQRTGNIS R+E+P NS+L +ELFDVWGIDFMGPFP S  N+YILVAVDY
Sbjct: 1347 CRNYIMSCAPCQRTGNISRRHEVPQNSILVIELFDVWGIDFMGPFPSSFSNKYILVAVDY 1406

Query: 481  VSKWVEAAACARNDANAVSKFLKKQIFSRFGTPRAIISDE-------------------- 540
            VSKWVEA A   NDA  V +FLKK IFSRFG PRAIISD                     
Sbjct: 1407 VSKWVEAVALQSNDARVVIRFLKKNIFSRFGVPRAIISDGGSHFCNRQFEKLLSKYGVTH 1466

Query: 541  -------------------------------------------------------GMSPY 600
                                                                   GMSPY
Sbjct: 1467 KIATPYHPQTCGQVEVSNREIKRILEKTVNASRKDWSLKLDDALWAYRTAFKTPIGMSPY 1526

Query: 601  ALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKLYKER 660
             +V+GK+CHLP+ELEHKA WA+K LN D +A+GE R LQLN++ E R  AYENA++YKER
Sbjct: 1527 KIVYGKSCHLPVELEHKAYWAIKYLNFDLQAAGEKRLLQLNQMAEMREEAYENARIYKER 1586

Query: 661  TKKWHDKNISKKILYVCQKVLLFNSRLRLFPGKLKSRWSGPFITKEVFPHGAVMLTNENG 673
             K+WHD+NI K+     QKVLL+NSRL+LFPGKLKSRWSGPF+   VFP+GAV L +E+ 
Sbjct: 1587 AKRWHDRNILKREFLPGQKVLLYNSRLKLFPGKLKSRWSGPFVISNVFPYGAVELKSEDD 1646

BLAST of Clc03G10110 vs. NCBI nr
Match: XP_012858910.1 (PREDICTED: uncharacterized protein LOC105978045 [Erythranthe guttata])

HSP 1 Score: 872.5 bits (2253), Expect = 2.4e-249
Identity = 424/735 (57.69%), Postives = 519/735 (70.61%), Query Frame = 0

Query: 1    MPFWLCNTPRTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLEQVLKRCEDTN 60
            MPF LCN P TFQRCMM+IF D +E+ +EVFMDDFSVFG S+D C+ NLE VLKRC +TN
Sbjct: 958  MPFGLCNAPATFQRCMMSIFHDMVEEFLEVFMDDFSVFGSSFDHCVHNLELVLKRCTETN 1017

Query: 61   LVLNWEKCHFMETKGIVLGYKISKVGLEVDQAKIDAIAKLPAPTNVQTLRSFLGHAGFYR 120
            LVLNWEKCHFM  +GIVLG+K+SK GLEVD+AKI+ I KLP P +V+ +RSFLGHAGFYR
Sbjct: 1018 LVLNWEKCHFMVREGIVLGHKVSKKGLEVDRAKIETIEKLPPPKDVKGVRSFLGHAGFYR 1077

Query: 121  RFIKGFSQVAKPLSELLEVNREFNFNGKCLNAFESLRQALISAPILVAPDWSLPFELMCD 180
            RFIK FS++ KPL  LLE    F+F+  CL AF  L++ L  +PI++ P+W  PFE+MCD
Sbjct: 1078 RFIKDFSKIVKPLCHLLEKEAVFDFDSACLQAFTFLKEKLTQSPIMITPNWEEPFEIMCD 1137

Query: 181  ASNHAVGAVLGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFVVDKFRAYLIGSK 240
            AS++AVGAVLGQR++KI   IYY+S+TL+ +Q+NY+TTEKEMLA+V+ VDKFR Y++GS+
Sbjct: 1138 ASDYAVGAVLGQRRDKIFKAIYYSSRTLDQAQKNYSTTEKEMLAVVYAVDKFRPYILGSQ 1197

Query: 241  VTIYSDHSAIKYLMAKKDAKPRLVRWVLLLQEFDLEIKDRRGTENQVANHLSRLENKEVQ 300
            V IY+DH+AI+YL AKKDAKPRL+RWVLLLQEFDLEI+D++G+EN VA+HLSRL  +EV 
Sbjct: 1198 VIIYTDHAAIRYLFAKKDAKPRLIRWVLLLQEFDLEIRDKKGSENVVADHLSRLILEEVP 1257

Query: 301  ESWSDIEERFPDKHVMNAESQEPWYADIVNYLVCNQWPEEFNAQQKKKLRHESKFYCWDE 360
                +I+E FPD+ ++   +  PWYAD+ N+L     P++    QKKK  H+S+FY WDE
Sbjct: 1258 AE-GNIQESFPDEQLLAISTHTPWYADVANFLASGIIPDDLYYHQKKKFLHDSRFYLWDE 1317

Query: 361  PYLYRLGPDHILRRCVPEYETHSILRSCHEAPYGGHFGGQRTVTKVLQSGYFWPTLFKDP 420
            P L+R GPD ++RRCVPE E   IL  CH +P GGH G  RT  KVLQSG+FWPTLF+D 
Sbjct: 1318 PLLFRTGPDRVIRRCVPETEVREILTHCHSSPCGGHHGESRTAAKVLQSGFFWPTLFRDS 1377

Query: 421  RAYAVACDGCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYV 480
              +   CD CQRTGN+SN+++MPLN+M EVELFDVWGIDFMGPFP S G  YIL+AVDYV
Sbjct: 1378 YEFVKRCDRCQRTGNLSNKSQMPLNNMQEVELFDVWGIDFMGPFPSSNGKLYILLAVDYV 1437

Query: 481  SKWVEAAACARNDANAVSKFLKKQIFSRFGTPRAIISDE--------------------- 540
            SKWVEA A   NDA  V KF  K IFSRFGTPRAIISDE                     
Sbjct: 1438 SKWVEAIATTTNDARTVLKFFHKNIFSRFGTPRAIISDEGSHFCNKLLTNLTNKLGIRHK 1497

Query: 541  ------------------------------------------------------GMSPYA 600
                                                                  GMSPY 
Sbjct: 1498 IALAYHPQTNGLVELSNREIKQILEKTVSTNRKDWALKLDDALWAYRTAFKTPIGMSPYK 1557

Query: 601  LVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKLYKERT 660
            LVFGKACHLP+ELEH+A WA+KKLN DQ A+G+ R LQLNEL E+R+ AYENAK+YKE+T
Sbjct: 1558 LVFGKACHLPVELEHRAYWAVKKLNFDQTATGDRRLLQLNELEEFRNDAYENAKIYKEKT 1617

BLAST of Clc03G10110 vs. ExPASy Swiss-Prot
Match: P04323 (Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 224.6 bits (571), Expect = 3.5e-57
Identity = 123/299 (41.14%), Postives = 180/299 (60.20%), Query Frame = 0

Query: 1   MPFWLCNTPRTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLEQVLKRCEDTN 60
           MPF L N P TFQRCM  I    L +   V++DD  VF  S DE L +L  V ++    N
Sbjct: 334 MPFGLKNAPATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKAN 393

Query: 61  LVLNWEKCHFMETKGIVLGYKISKVGLEVDQAKIDAIAKLPAPTNVQTLRSFLGHAGFYR 120
           L L  +KC F++ +   LG+ ++  G++ +  KI+AI K P PT  + +++FLG  G+YR
Sbjct: 394 LKLQLDKCEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYR 453

Query: 121 RFIKGFSQVAKPLSELLEVNREFN-FNGKCLNAFESLRQALISAPILVAPDWSLPFELMC 180
           +FI  F+ +AKP+++ L+ N + +  N +  +AF+ L+  +   PIL  PD++  F L  
Sbjct: 454 KFIPNFADIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTT 513

Query: 181 DASNHAVGAVLGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFVVDKFRAYLIGS 240
           DAS+ A+GAVL Q      HP+ Y S+TLN  + NY+T EKE+LAIV+    FR YL+G 
Sbjct: 514 DASDVALGAVLSQDG----HPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGR 573

Query: 241 KVTIYSDHSAIKYLMAKKDAKPRLVRWVLLLQEFDLEIKDRRGTENQVANHLSRLENKE 299
              I SDH  + +L   KD   +L RW + L EFD +IK  +G EN VA+ LSR++ +E
Sbjct: 574 HFEISSDHQPLSWLYRMKDPNSKLTRWRVKLSEFDFDIKYIKGKENCVADALSRIKLEE 628

BLAST of Clc03G10110 vs. ExPASy Swiss-Prot
Match: Q8I7P9 (Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 214.5 bits (545), Expect = 3.6e-54
Identity = 116/306 (37.91%), Postives = 182/306 (59.48%), Query Frame = 0

Query: 1   MPFWLCNTPRTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLEQVLKRCEDTN 60
           +PF L N P  FQR +  I  +++ +   V++DD  VF + YD    NL  VL      N
Sbjct: 250 LPFGLKNAPAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLASLSKAN 309

Query: 61  LVLNWEKCHFMETKGIVLGYKISKVGLEVDQAKIDAIAKLPAPTNVQTLRSFLGHAGFYR 120
           L +N EK HF++T+   LGY ++  G++ D  K+ AI+++P PT+V+ L+ FLG   +YR
Sbjct: 310 LQVNLEKSHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYR 369

Query: 121 RFIKGFSQVAKPLSEL---LEVNREFNFNGK--------CLNAFESLRQALISAPILVAP 180
           +FI+ +++VAKPL+ L   L  N + + + K         L +F  L+  L S+ IL  P
Sbjct: 370 KFIQDYAKVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFP 429

Query: 181 DWSLPFELMCDASNHAVGAVLGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFVV 240
            ++ PF L  DASN A+GAVL Q  +    PI Y S++LN ++ENY T EKEMLAI++ +
Sbjct: 430 CFTKPFHLTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSL 489

Query: 241 DKFRAYLIGS-KVTIYSDHSAIKYLMAKKDAKPRLVRWVLLLQEFDLEIKDRRGTENQVA 295
           D  RAYL G+  + +Y+DH  + + +  ++   +L RW   ++E++ E+  + G  N VA
Sbjct: 490 DNLRAYLYGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYKPGKSNVVA 549

BLAST of Clc03G10110 vs. ExPASy Swiss-Prot
Match: P20825 (Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 212.6 bits (540), Expect = 1.4e-53
Identity = 116/303 (38.28%), Postives = 175/303 (57.76%), Query Frame = 0

Query: 1   MPFWLCNTPRTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLEQVLKRCEDTN 60
           MPF L N P TFQRCM  I    L +   V++DD  +F  S  E L +++ V  +  D N
Sbjct: 333 MPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVFTKLADAN 392

Query: 61  LVLNWEKCHFMETKGIVLGYKISKVGLEVDQAKIDAIAKLPAPTNVQTLRSFLGHAGFYR 120
           L L  +KC F++ +   LG+ ++  G++ +  K+ AI   P PT  + +R+FLG  G+YR
Sbjct: 393 LKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFLGLTGYYR 452

Query: 121 RFIKGFSQVAKPLSELLEVNREFNFNG-KCLNAFESLRQALISAPILVAPDWSLPFELMC 180
           +FI  ++ +AKP++  L+   + +    + + AFE L+  +I  PIL  PD+   F L  
Sbjct: 453 KFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLPDFEKKFVLTT 512

Query: 181 DASNHAVGAVLGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFVVDKFRAYLIGS 240
           DASN A+GAVL Q      HPI + S+TLN  + NY+  EKE+LAIV+    FR YL+G 
Sbjct: 513 DASNLALGAVLSQNG----HPISFISRTLNDHELNYSAIEKELLAIVWATKTFRHYLLGR 572

Query: 241 KVTIYSDHSAIKYLMAKKDAKPRLVRWVLLLQEFDLEIKDRRGTENQVANHLSRLENKEV 300
           +  I SDH  +++L   K+   +L RW + L E+  +I   +G EN VA+ LSR++ +E 
Sbjct: 573 QFLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFKIDYIKGKENSVADALSRIKIEEN 631

Query: 301 QES 303
             S
Sbjct: 633 HHS 631

BLAST of Clc03G10110 vs. ExPASy Swiss-Prot
Match: Q7LHG5 (Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-I PE=1 SV=2)

HSP 1 Score: 207.6 bits (527), Expect = 4.4e-52
Identity = 186/627 (29.67%), Postives = 287/627 (45.77%), Query Frame = 0

Query: 1    MPFWLCNTPRTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLEQVLKRCEDTN 60
            MPF L N P TF R M   F D   + V V++DD  +F +S +E   +L+ VL+R ++ N
Sbjct: 744  MPFGLVNAPSTFARYMADTFRDL--RFVNVYLDDILIFSESPEEHWKHLDTVLERLKNEN 803

Query: 61   LVLNWEKCHFMETKGIVLGYKISKVGLEVDQAKIDAIAKLPAPTNVQTLRSFLGHAGFYR 120
            L++  +KC F   +   LGY I    +   Q K  AI   P P  V+  + FLG   +YR
Sbjct: 804  LIVKKKKCKFASEETEFLGYSIGIQKIAPLQHKCAAIRDFPTPKTVKQAQRFLGMINYYR 863

Query: 121  RFIKGFSQVAKPLSELLEVNREFNFNGKCLNAFESLRQALISAPILVAPDWSLPFELMCD 180
            RFI   S++A+P+   L +  +  +  K   A E L+ AL ++P+LV  +    + L  D
Sbjct: 864  RFIPNCSKIAQPIQ--LFICDKSQWTEKQDKAIEKLKAALCNSPVLVPFNNKANYRLTTD 923

Query: 181  ASNHAVGAVLGQ--RKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFVVDKFRAYLIG 240
            AS   +GAVL +   K K++  + Y SK+L S+Q+NY   E E+L I+  +  FR  L G
Sbjct: 924  ASKDGIGAVLEEVDNKNKLVGVVGYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHG 983

Query: 241  SKVTIYSDHSAIKYLMAKKDAKPRLVRWVLLLQEFDLEIKDRRGTENQVANHLSRL---- 300
               T+ +DH ++  L  K +   R+ RW+  L  +D  ++   G +N VA+ +SR     
Sbjct: 984  KHFTLRTDHISLLSLQNKNEPARRVQRWLDDLATYDFTLEYLAGPKNVVADAISRAIYTI 1043

Query: 301  ---ENKEVQ-ESWSDIEERFPDKHVMNAESQEPWYADIVNYLVCNQWPEEFNAQQKKKLR 360
                ++ +  ESW    +  P    +    +E     +  + V  +    F + QKK   
Sbjct: 1044 TPETSRPIDTESWKSYYKSDPLCSAVLIHMKE-----LTQHNVTPEDMSAFRSYQKKLEL 1103

Query: 361  HES--KFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHE-APYGGHFGGQRTVTKVL 420
             E+  K Y  ++  +Y     +  R  VP  + ++++R  H+   +GGHFG   T+ K+ 
Sbjct: 1104 SETFRKNYSLEDEMIY-----YQDRLVVPIKQQNAVMRLYHDHTLFGGHFGVTVTLAKI- 1163

Query: 421  QSGYFWPTLFKDPRAYAVACDGCQRTGNISNRNEMPLNSM--LEVELFDVWGIDFMGPFP 480
               Y+WP L      Y   C  CQ   +   R    L  +   E    D+  +DF+   P
Sbjct: 1164 SPIYYWPKLQHSIIQYIRTCVQCQLIKSHRPRLHGLLQPLPIAEGRWLDI-SMDFVTGLP 1223

Query: 481  PSCGN-QYILVAVDYVSKWVEAAACARN-DANAVSKFLKKQIFSRFGTPRAIISDEGMSP 540
            P+  N   ILV VD  SK     A  +  DA  +   L + IFS  G PR I SD  +  
Sbjct: 1224 PTSNNLNMILVVVDRFSKRAHFIATRKTLDATQLIDLLFRYIFSYHGFPRTITSDRDV-- 1283

Query: 541  YALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQ-LNELLEWRHSAYENAKLYK 600
              +   K   L   L  K+   M   N  Q      R +Q LN LL     AY +  +  
Sbjct: 1284 -RMTADKYQELTKRLGIKS--TMSSANHPQTDGQSERTIQTLNRLLR----AYVSTNI-- 1332

Query: 601  ERTKKWHDKNISKKILYVCQKVLLFNS 610
               + WH        +Y+ Q   ++NS
Sbjct: 1344 ---QNWH--------VYLPQIEFVYNS 1332

BLAST of Clc03G10110 vs. ExPASy Swiss-Prot
Match: Q99315 (Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-G PE=1 SV=3)

HSP 1 Score: 207.2 bits (526), Expect = 5.7e-52
Identity = 185/627 (29.51%), Postives = 287/627 (45.77%), Query Frame = 0

Query: 1    MPFWLCNTPRTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLEQVLKRCEDTN 60
            MPF L N P TF R M   F D   + V V++DD  +F +S +E   +L+ VL+R ++ N
Sbjct: 718  MPFGLVNAPSTFARYMADTFRDL--RFVNVYLDDILIFSESPEEHWKHLDTVLERLKNEN 777

Query: 61   LVLNWEKCHFMETKGIVLGYKISKVGLEVDQAKIDAIAKLPAPTNVQTLRSFLGHAGFYR 120
            L++  +KC F   +   LGY I    +   Q K  AI   P P  V+  + FLG   +YR
Sbjct: 778  LIVKKKKCKFASEETEFLGYSIGIQKIAPLQHKCAAIRDFPTPKTVKQAQRFLGMINYYR 837

Query: 121  RFIKGFSQVAKPLSELLEVNREFNFNGKCLNAFESLRQALISAPILVAPDWSLPFELMCD 180
            RFI   S++A+P+   L +  +  +  K   A + L+ AL ++P+LV  +    + L  D
Sbjct: 838  RFIPNCSKIAQPIQ--LFICDKSQWTEKQDKAIDKLKDALCNSPVLVPFNNKANYRLTTD 897

Query: 181  ASNHAVGAVLGQ--RKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFVVDKFRAYLIG 240
            AS   +GAVL +   K K++  + Y SK+L S+Q+NY   E E+L I+  +  FR  L G
Sbjct: 898  ASKDGIGAVLEEVDNKNKLVGVVGYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLHG 957

Query: 241  SKVTIYSDHSAIKYLMAKKDAKPRLVRWVLLLQEFDLEIKDRRGTENQVANHLSRL---- 300
               T+ +DH ++  L  K +   R+ RW+  L  +D  ++   G +N VA+ +SR     
Sbjct: 958  KHFTLRTDHISLLSLQNKNEPARRVQRWLDDLATYDFTLEYLAGPKNVVADAISRAVYTI 1017

Query: 301  ---ENKEVQ-ESWSDIEERFPDKHVMNAESQEPWYADIVNYLVCNQWPEEFNAQQKKKLR 360
                ++ +  ESW    +  P    +    +E     +  + V  +    F + QKK   
Sbjct: 1018 TPETSRPIDTESWKSYYKSDPLCSAVLIHMKE-----LTQHNVTPEDMSAFRSYQKKLEL 1077

Query: 361  HES--KFYCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHE-APYGGHFGGQRTVTKVL 420
             E+  K Y  ++  +Y     +  R  VP  + ++++R  H+   +GGHFG   T+ K+ 
Sbjct: 1078 SETFRKNYSLEDEMIY-----YQDRLVVPIKQQNAVMRLYHDHTLFGGHFGVTVTLAKI- 1137

Query: 421  QSGYFWPTLFKDPRAYAVACDGCQRTGNISNRNEMPLNSM--LEVELFDVWGIDFMGPFP 480
               Y+WP L      Y   C  CQ   +   R    L  +   E    D+  +DF+   P
Sbjct: 1138 SPIYYWPKLQHSIIQYIRTCVQCQLIKSHRPRLHGLLQPLPIAEGRWLDI-SMDFVTGLP 1197

Query: 481  PSCGN-QYILVAVDYVSKWVEAAACARN-DANAVSKFLKKQIFSRFGTPRAIISDEGMSP 540
            P+  N   ILV VD  SK     A  +  DA  +   L + IFS  G PR I SD  +  
Sbjct: 1198 PTSNNLNMILVVVDRFSKRAHFIATRKTLDATQLIDLLFRYIFSYHGFPRTITSDRDV-- 1257

Query: 541  YALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQ-LNELLEWRHSAYENAKLYK 600
              +   K   L   L  K+   M   N  Q      R +Q LN LL     AY +  +  
Sbjct: 1258 -RMTADKYQELTKRLGIKS--TMSSANHPQTDGQSERTIQTLNRLLR----AYASTNI-- 1306

Query: 601  ERTKKWHDKNISKKILYVCQKVLLFNS 610
               + WH        +Y+ Q   ++NS
Sbjct: 1318 ---QNWH--------VYLPQIEFVYNS 1306

BLAST of Clc03G10110 vs. ExPASy TrEMBL
Match: A0A2G9FWY3 (Reverse transcriptase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_29952 PE=4 SV=1)

HSP 1 Score: 902.1 bits (2330), Expect = 1.4e-258
Identity = 441/734 (60.08%), Postives = 527/734 (71.80%), Query Frame = 0

Query: 1    MPFWLCNTPRTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLEQVLKRCEDTN 60
            MPF LCN P TFQRCMMAIF+D +E  +EVFMDDFSV+G S+DECL NL  VLKRCEDTN
Sbjct: 826  MPFGLCNAPATFQRCMMAIFTDMVENCLEVFMDDFSVYGNSFDECLNNLSCVLKRCEDTN 885

Query: 61   LVLNWEKCHFMETKGIVLGYKISKVGLEVDQAKIDAIAKLPAPTNVQTLRSFLGHAGFYR 120
            L+LNWEKCHFM  +GIVLG+K+S  G+EVD+AK++ I KLP PT+V+ +RSFLGHAGFYR
Sbjct: 886  LILNWEKCHFMVQEGIVLGHKVSNRGIEVDKAKLETIEKLPPPTSVKGVRSFLGHAGFYR 945

Query: 121  RFIKGFSQVAKPLSELLEVNREFNFNGKCLNAFESLRQALISAPILVAPDWSLPFELMCD 180
            RFIK FS+++KPL  LLE +  FNF+  C +AF  L+  LISAPI+  PDWS PFELMCD
Sbjct: 946  RFIKDFSKISKPLCNLLEKDIPFNFDDACRDAFNDLKGRLISAPIITVPDWSFPFELMCD 1005

Query: 181  ASNHAVGAVLGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFVVDKFRAYLIGSK 240
            AS+ AVGAVLGQRK+KI   IYYASKTLN +Q NYTTTEKE+LA+VF  DKFR+YL+G+K
Sbjct: 1006 ASDFAVGAVLGQRKDKIFRSIYYASKTLNDAQLNYTTTEKELLAVVFAFDKFRSYLVGTK 1065

Query: 241  VTIYSDHSAIKYLMAKKDAKPRLVRWVLLLQEFDLEIKDRRGTENQVANHLSRLENKEVQ 300
            V +Y+DH+AI+YL+ KKDAKPRL+RWVLLLQEFDLEI+DR+GTENQ+A+HLSRLE+    
Sbjct: 1066 VIVYTDHAAIRYLIEKKDAKPRLIRWVLLLQEFDLEIRDRKGTENQIADHLSRLESPAKT 1125

Query: 301  ESWSDIEERFPDKHVMN-AESQEPWYADIVNYLVCNQWPEEFNAQQKKKLRHESKFYCWD 360
            +  + I + FPD+ ++    S  PWYADIVNYL C   P + +AQQKKK   +++ Y WD
Sbjct: 1126 DEPNLINDNFPDEQLLAIVASDVPWYADIVNYLTCGIIPFDLSAQQKKKFLFDTRRYFWD 1185

Query: 361  EPYLYRLGPDHILRRCVPEYETHSILRSCHEAPYGGHFGGQRTVTKVLQSGYFWPTLFKD 420
            +P+L++ GPD+ILRRCVPE E + IL  CH +PYGGHF G RT  K+LQSG+FWP LFKD
Sbjct: 1186 DPFLFKQGPDNILRRCVPEIEMNDILEQCHASPYGGHFHGDRTAAKILQSGFFWPNLFKD 1245

Query: 421  PRAYAVACDGCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDY 480
              ++   CD CQRTGNIS R+EMPLN++LEVELFDVWGIDFMGPF PS GN YILVAVDY
Sbjct: 1246 AHSFVANCDRCQRTGNISRRHEMPLNTILEVELFDVWGIDFMGPFIPSFGNMYILVAVDY 1305

Query: 481  VSKWVEAAACARNDANAVSKFLKKQIFSRFGTPRAIISDE-------------------- 540
            VSKWVEAAA   ND+  V  F+KK IF+RFGTPRAIISD                     
Sbjct: 1306 VSKWVEAAAVPNNDSKVVVNFIKKNIFTRFGTPRAIISDGGTHFCNRSFEALLSKYGVKH 1365

Query: 541  -------------------------------------------------------GMSPY 600
                                                                   GMSPY
Sbjct: 1366 KISTPYHPQTSGQVEVSNREIKRILEKTVSSTRKDWSKRLDEALWAYRTAYKTPIGMSPY 1425

Query: 601  ALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKLYKER 659
             LVFGKACHLP+ELEH A WA++KLN D +A+GE R LQLNEL E+R  AYENAK+YKE+
Sbjct: 1426 RLVFGKACHLPVELEHNAYWAIRKLNFDMQAAGEKRLLQLNELDEFRLHAYENAKIYKEK 1485

BLAST of Clc03G10110 vs. ExPASy TrEMBL
Match: A0A4Y1RS99 (Transposable element protein OS=Prunus dulcis OX=3755 GN=Prudu_018514 PE=4 SV=1)

HSP 1 Score: 883.2 bits (2281), Expect = 6.7e-253
Identity = 433/750 (57.73%), Postives = 528/750 (70.40%), Query Frame = 0

Query: 1   MPFWLCNTPRTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLEQVLKRCEDTN 60
           MPF LCN P TFQRCMM+IFSD +E+ +EVFMDDFSVFG S+D CL NL  VL RCE+TN
Sbjct: 148 MPFGLCNAPATFQRCMMSIFSDMVERFIEVFMDDFSVFGSSFDSCLDNLALVLARCEETN 207

Query: 61  LVLNWEKCHFMETKGIVLGYKISKVGLEVDQAKIDAIAKLPAPTNVQTLRSFLGHAGFYR 120
           LVLNWEKCHFM  +GIVLG+KIS  G+EVD+AKI+ I KLP P+ V+ +RSFLGHAGFYR
Sbjct: 208 LVLNWEKCHFMVQEGIVLGHKISARGIEVDRAKIETIEKLPPPSTVKGIRSFLGHAGFYR 267

Query: 121 RFIKGFSQVAKPLSELLEVNREFNFNGKCLNAFESLRQALISAPILVAPDWSLPFELMCD 180
           RFIK FS++ KPL +LL  + EFNF+  CL AF  L+  L +AP+++APDW LPFE+MCD
Sbjct: 268 RFIKDFSKITKPLCKLLLKDSEFNFDSDCLEAFNLLKTKLTTAPVIMAPDWELPFEIMCD 327

Query: 181 ASNHAVGAVLGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFVVDKFRAYLIGSK 240
           AS++A+GAVLGQRK K++H I+YAS+TLN +Q NY TTEKE+LA+VF +DKFR+YL+G+K
Sbjct: 328 ASDYAIGAVLGQRKNKLLHVIHYASRTLNDAQLNYATTEKELLAVVFALDKFRSYLLGAK 387

Query: 241 VTIYSDHSAIKYLMAKKDAKPRLVRWVLLLQEFDLEIKDRRGTENQVANHLSRL-ENKEV 300
           V +Y+DH+A+K+L+AKK+AKPRL+RWVLLLQEFD+EI+D++G+EN VA+HLSRL    EV
Sbjct: 388 VIVYTDHAALKFLLAKKEAKPRLIRWVLLLQEFDIEIRDKKGSENVVADHLSRLVREDEV 447

Query: 301 QESWSDIEERFPDKHVMNAESQE----PWYADIVNYLVCNQWPEEFNAQQKKKLRHESKF 360
            E    I E FPD+ + +  S +    PWYAD VNYL C   P + +  QKKK     K 
Sbjct: 448 IEDVGPILETFPDEQLYSIYSAKEFITPWYADFVNYLACGILPPDMSFYQKKKFLSLVKH 507

Query: 361 YCWDEPYLYRLGPDHILRRCVPEYETHSILRSCHEAPYGGHFGGQRTVTKVLQSGYFWPT 420
           Y WD+PYL++ GPD ++RRCVPE E   IL  CH    GGH+G  +T  KVLQSG+FWPT
Sbjct: 508 YYWDDPYLWKHGPDQVIRRCVPETEMADILLHCHTLACGGHYGASKTTAKVLQSGFFWPT 567

Query: 421 LFKDPRAYAVACDGCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILV 480
           LFKD + +   CD CQRTGNIS+RN+MPLN++LEVELFDVWGIDFMGPFP S GN YILV
Sbjct: 568 LFKDAQDFVARCDPCQRTGNISSRNQMPLNNILEVELFDVWGIDFMGPFPASYGNLYILV 627

Query: 481 AVDYVSKWVEAAACARNDANAVSKFLKKQIFSRFGTPRAIISDE---------------- 540
           AVDYVSKWVEAAA   NDA  V +FL+K IF+RFG PRAIISD                 
Sbjct: 628 AVDYVSKWVEAAALPTNDAKVVVRFLRKNIFTRFGVPRAIISDGGTHFCNRQFNSLLAKY 687

Query: 541 -----------------------------------------------------------G 600
                                                                      G
Sbjct: 688 GITHKVSTPYHPQTSGQVEVSNRELKKILEKTVSASRKDWSLKLDDALWAYRTAFKAPIG 747

Query: 601 MSPYALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKL 660
           MSPY LVFGKACHLP+ELEHKA WA+K LN D  ++GE RKLQLNEL E R+ +YENAK+
Sbjct: 748 MSPYRLVFGKACHLPVELEHKAFWAIKTLNFDMSSAGEKRKLQLNELEELRNESYENAKI 807

Query: 661 YKERTKKWHDKNISKKILYVCQKVLLFNSRLRLFPGKLKSRWSGPFITKEVFPHGAVMLT 671
           YK+RTKKWHDK+I KK  YV Q VLL+NSRL+LFPGKL+SRWSGPF    V+P+G V + 
Sbjct: 808 YKDRTKKWHDKHILKKEFYVGQSVLLYNSRLKLFPGKLRSRWSGPFTVLTVYPYGTVEIK 867

BLAST of Clc03G10110 vs. ExPASy TrEMBL
Match: A0A6P8CBX2 (Reverse transcriptase OS=Punica granatum OX=22663 GN=LOC116194359 PE=4 SV=1)

HSP 1 Score: 872.5 bits (2253), Expect = 1.2e-249
Identity = 429/749 (57.28%), Postives = 531/749 (70.89%), Query Frame = 0

Query: 1    MPFWLCNTPRTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLEQVLKRCEDTN 60
            MPF LCN P TFQRCMM+IFSD LE  +E+FMDDFSVFGKS++ CLTNL  VLKRC++TN
Sbjct: 927  MPFGLCNAPATFQRCMMSIFSDMLENFIEIFMDDFSVFGKSFESCLTNLGCVLKRCKETN 986

Query: 61   LVLNWEKCHFMETKGIVLGYKISKVGLEVDQAKIDAIAKLPAPTNVQTLRSFLGHAGFYR 120
            L+LNWEKCHFM  +GIVLG+K+SK G+EVD+AK++ I KLP PT+ + +RSFLGHAGFYR
Sbjct: 987  LLLNWEKCHFMVREGIVLGHKVSKKGIEVDRAKVEIIEKLPPPTSTKGVRSFLGHAGFYR 1046

Query: 121  RFIKGFSQVAKPLSELLEVNREFNFNGKCLNAFESLRQALISAPILVAPDWSLPFELMCD 180
            RFIK FS++++PL  LLE +  F FN  CL AF  L++ L SAP++VAP+W LPFELMCD
Sbjct: 1047 RFIKDFSKISRPLCNLLEKDSAFVFNDNCLQAFNLLKEKLTSAPVIVAPNWELPFELMCD 1106

Query: 181  ASNHAVGAVLGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFVVDKFRAYLIGSK 240
            AS++AVGAVLGQR+ K+ H IYYAS+TLN +Q+NY TTEKE+LA++F  DKFR YLIGSK
Sbjct: 1107 ASDYAVGAVLGQRRGKVFHAIYYASRTLNEAQKNYATTEKELLAVIFACDKFRPYLIGSK 1166

Query: 241  VTIYSDHSAIKYLMAKKDAKPRLVRWVLLLQEFDLEIKDRRGTENQVANHLSRLENKEVQ 300
            + +Y+DH+A+KYL AK DAKPRL+RW+LLLQEFDLEI+D +GTEN VA+HLSRLE+  + 
Sbjct: 1167 IIVYTDHAALKYLFAKADAKPRLIRWILLLQEFDLEIRDTKGTENVVADHLSRLESDCLD 1226

Query: 301  ESWSDIEERFPDKHVMNAESQE-PWYADIVNYLVCNQWPEEFNAQQKKKLRHESKFYCWD 360
               S I E+FPD+ +  AE Q  PWYADIVNY+V N  P   ++QQKKK  H+ K+Y WD
Sbjct: 1227 ---SPINEKFPDEQLHVAEIQGLPWYADIVNYMVSNITPYGLSSQQKKKFLHDVKYYFWD 1286

Query: 361  EPYLYRLGPDHILRRCVPEYETHSILRSCHEAPYGGHFGGQRTVTKVLQSGYFWPTLFKD 420
            EPYL++   D ++RRCVPE E  SI++ CH    GGHFG +RT TK+L  G++WP +F D
Sbjct: 1287 EPYLFKYCADQVIRRCVPETEQLSIIQHCHSKEAGGHFGVERTATKILSCGFYWPRVFHD 1346

Query: 421  PRAYAVACDGCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDY 480
             R Y ++C  CQRTGNIS R+E+P NS+L +ELFDVWGIDFMGPFP S  N+YILVAVDY
Sbjct: 1347 CRNYIMSCAPCQRTGNISRRHEVPQNSILVIELFDVWGIDFMGPFPSSFSNKYILVAVDY 1406

Query: 481  VSKWVEAAACARNDANAVSKFLKKQIFSRFGTPRAIISDE-------------------- 540
            VSKWVEA A   NDA  V +FLKK IFSRFG PRAIISD                     
Sbjct: 1407 VSKWVEAVALQSNDARVVIRFLKKNIFSRFGVPRAIISDGGSHFCNRQFEKLLSKYGVTH 1466

Query: 541  -------------------------------------------------------GMSPY 600
                                                                   GMSPY
Sbjct: 1467 KIATPYHPQTCGQVEVSNREIKRILEKTVNASRKDWSLKLDDALWAYRTAFKTPIGMSPY 1526

Query: 601  ALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKLYKER 660
             +V+GK+CHLP+ELEHKA WA+K LN D +A+GE R LQLN++ E R  AYENA++YKER
Sbjct: 1527 KIVYGKSCHLPVELEHKAYWAIKYLNFDLQAAGEKRLLQLNQMAEMREEAYENARIYKER 1586

Query: 661  TKKWHDKNISKKILYVCQKVLLFNSRLRLFPGKLKSRWSGPFITKEVFPHGAVMLTNENG 673
             K+WHD+NI K+     QKVLL+NSRL+LFPGKLKSRWSGPF+   VFP+GAV L +E+ 
Sbjct: 1587 AKRWHDRNILKREFLPGQKVLLYNSRLKLFPGKLKSRWSGPFVISNVFPYGAVELKSEDD 1646

BLAST of Clc03G10110 vs. ExPASy TrEMBL
Match: A0A6P8DLJ8 (Reverse transcriptase OS=Punica granatum OX=22663 GN=LOC116205794 PE=4 SV=1)

HSP 1 Score: 867.1 bits (2239), Expect = 4.9e-248
Identity = 427/749 (57.01%), Postives = 529/749 (70.63%), Query Frame = 0

Query: 1    MPFWLCNTPRTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLEQVLKRCEDTN 60
            MPF LCN P TFQRCMM+IFSD LE  +E+FMDDFSVFGKS++ CLTNL  VLKRC++TN
Sbjct: 889  MPFGLCNAPATFQRCMMSIFSDMLENFIEIFMDDFSVFGKSFESCLTNLGCVLKRCKETN 948

Query: 61   LVLNWEKCHFMETKGIVLGYKISKVGLEVDQAKIDAIAKLPAPTNVQTLRSFLGHAGFYR 120
            L+LNWEKCHFM  +GIVLG+K+SK G+EVD+AK++ I KLP PT+ + +RSFLGHAGFYR
Sbjct: 949  LLLNWEKCHFMVREGIVLGHKVSKKGIEVDRAKVEIIEKLPPPTSTKGVRSFLGHAGFYR 1008

Query: 121  RFIKGFSQVAKPLSELLEVNREFNFNGKCLNAFESLRQALISAPILVAPDWSLPFELMCD 180
            RFIK FS++++PL  LLE +  F FN  CL AF  L++ L SAP++VAP+W LPFELMC 
Sbjct: 1009 RFIKDFSKISRPLCNLLEKDSAFVFNDNCLQAFNLLKEKLTSAPVIVAPNWELPFELMCG 1068

Query: 181  ASNHAVGAVLGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFVVDKFRAYLIGSK 240
            AS++AVGAVLGQR+ K+ H IYYAS+TLN +Q+NY TTEKE+LA++F  DKFR YLIGSK
Sbjct: 1069 ASDYAVGAVLGQRRGKVFHAIYYASRTLNEAQKNYATTEKELLAVIFACDKFRPYLIGSK 1128

Query: 241  VTIYSDHSAIKYLMAKKDAKPRLVRWVLLLQEFDLEIKDRRGTENQVANHLSRLENKEVQ 300
            + +Y+DH+A+KYL AK DAKPRL+RW+LLLQEFDLEI+D +GTEN VA+HLSRLE+  + 
Sbjct: 1129 IIVYTDHAALKYLFAKADAKPRLIRWILLLQEFDLEIRDTKGTENVVADHLSRLESDCLD 1188

Query: 301  ESWSDIEERFPDKHVMNAESQE-PWYADIVNYLVCNQWPEEFNAQQKKKLRHESKFYCWD 360
               S I E+FPD+ +  AE Q  PWYADIVNY+V N  P   ++QQKKK  H+ K+Y WD
Sbjct: 1189 ---SPINEKFPDEQLHVAEIQGLPWYADIVNYMVSNITPYGLSSQQKKKFLHDVKYYFWD 1248

Query: 361  EPYLYRLGPDHILRRCVPEYETHSILRSCHEAPYGGHFGGQRTVTKVLQSGYFWPTLFKD 420
            EPYL++   D ++RRCVPE E  SI++ CH    GGHFG +RT TK+L  G++WP +F D
Sbjct: 1249 EPYLFKYCADQVIRRCVPETEQLSIIQHCHSKEAGGHFGVERTATKILSCGFYWPRVFHD 1308

Query: 421  PRAYAVACDGCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDY 480
             R Y ++C  CQRTGNIS R+E+P NS+L +ELFDVWGIDFMGPFP S  N+YILVAVDY
Sbjct: 1309 CRNYIMSCAPCQRTGNISRRHEVPQNSILVIELFDVWGIDFMGPFPSSFSNKYILVAVDY 1368

Query: 481  VSKWVEAAACARNDANAVSKFLKKQIFSRFGTPRAIISDE-------------------- 540
            VSKWVEA A   NDA  V +FLKK IFSR G PRAIISD                     
Sbjct: 1369 VSKWVEAVALQSNDARVVIRFLKKNIFSRVGVPRAIISDGGSHFCNRQFEKLLSKYGVTH 1428

Query: 541  -------------------------------------------------------GMSPY 600
                                                                   GMSPY
Sbjct: 1429 KIATPYHPQTCGQVEVSNREIKRILEKTVNASRKDWSLKLDDALWAYRTAFKTPIGMSPY 1488

Query: 601  ALVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKLYKER 660
             +V+GK+CHLP+ELEHKA WA+K LN D +A+GE R LQLN++ E R  AYENA++YKER
Sbjct: 1489 KIVYGKSCHLPVELEHKAYWAIKYLNFDLQAAGEKRLLQLNQMAEMREEAYENARIYKER 1548

Query: 661  TKKWHDKNISKKILYVCQKVLLFNSRLRLFPGKLKSRWSGPFITKEVFPHGAVMLTNENG 673
             K+WHD+NI K+     QKVLL+NSRL+LFPGKLKSRWSGPF+   VFP+GAV L +E+ 
Sbjct: 1549 AKRWHDRNILKREFLPGQKVLLYNSRLKLFPGKLKSRWSGPFVISNVFPYGAVELKSEDD 1608

BLAST of Clc03G10110 vs. ExPASy TrEMBL
Match: A0A2K3NPD0 (Reverse transcriptase OS=Trifolium pratense OX=57577 GN=L195_g001324 PE=4 SV=1)

HSP 1 Score: 858.6 bits (2217), Expect = 1.8e-245
Identity = 415/749 (55.41%), Postives = 523/749 (69.83%), Query Frame = 0

Query: 1    MPFWLCNTPRTFQRCMMAIFSDYLEQSVEVFMDDFSVFGKSYDECLTNLEQVLKRCEDTN 60
            MPF LCN P TFQRCM AIFSD +E+ +EVFMDDFSVFG S+D CL NL+ VLKRC +TN
Sbjct: 633  MPFGLCNAPATFQRCMQAIFSDLIEKCIEVFMDDFSVFGSSFDCCLANLDTVLKRCVETN 692

Query: 61   LVLNWEKCHFMETKGIVLGYKISKVGLEVDQAKIDAIAKLPAPTNVQTLRSFLGHAGFYR 120
            LVLNWEKCHFM T+GIVLG+KIS  G+EVD+AK++ I KLP P N++ +RSFLGHAGFYR
Sbjct: 693  LVLNWEKCHFMVTEGIVLGHKISSKGIEVDKAKVEVIEKLPPPINIKGIRSFLGHAGFYR 752

Query: 121  RFIKGFSQVAKPLSELLEVNREFNFNGKCLNAFESLRQALISAPILVAPDWSLPFELMCD 180
            RFIK FS++AKPLS LL  ++ FNF+  CL AF  L++ L +API+ APDWSL FELMCD
Sbjct: 753  RFIKDFSKIAKPLSNLLNKDKPFNFDKSCLIAFNDLKERLTTAPIITAPDWSLDFELMCD 812

Query: 181  ASNHAVGAVLGQRKEKIMHPIYYASKTLNSSQENYTTTEKEMLAIVFVVDKFRAYLIGSK 240
            AS++AVGAVLGQRK K  H I+YASK LN +Q NY TTEKE+LAIV+ ++KFR+YLIGSK
Sbjct: 813  ASDYAVGAVLGQRKNKFFHAIHYASKVLNDAQINYATTEKELLAIVYALEKFRSYLIGSK 872

Query: 241  VTIYSDHSAIKYLMAKKDAKPRLVRWVLLLQEFDLEIKDRRGTENQVANHLSRLENKEVQ 300
            + +Y+DH+AIKYL+ K D+K RL+RW+LLLQEFDLEIKD++GTEN VA+HLSRL NK V 
Sbjct: 873  IIVYTDHAAIKYLITKSDSKQRLIRWMLLLQEFDLEIKDKKGTENLVADHLSRLVNKGVT 932

Query: 301  ESWSDIEERFPDKHVMNAESQEPWYADIVNYLVCNQWPEEFNAQQKKKLRHESKFYCWDE 360
            E   ++ E FPD+ ++  + + PW+AD+ NY      P++FN  QKK+    +  + WD+
Sbjct: 933  EQEREVLEEFPDEKLLMVQ-ERPWFADMANYKASGLIPDDFNWHQKKRFLRIANQFVWDD 992

Query: 361  PYLYRLGPDHILRRCVPEYETHSILRSCHEAPYGGHFGGQRTVTKVLQSGYFWPTLFKDP 420
            PYL++LG D++LRRCV + E  SIL  CH +PYGGH+ G+RT  K+LQ+G+FWPT+FKD 
Sbjct: 993  PYLFKLGADNLLRRCVTKEEATSILWHCHNSPYGGHYNGERTAAKILQAGFFWPTVFKDS 1052

Query: 421  RAYAVACDGCQRTGNISNRNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYV 480
              Y  +CD CQRTG IS RNEMPL S+LEVE+FD WGIDF+GPFP S  N+YILVAVDYV
Sbjct: 1053 YEYVQSCDNCQRTGGISRRNEMPLQSILEVEVFDCWGIDFVGPFPSSLSNEYILVAVDYV 1112

Query: 481  SKWVEAAACARNDANAVSKFLKKQIFSRFGTPRAIISDE--------------------- 540
            SKWVEA A  + D   V KFLK+ IF+RFGTPR +ISD                      
Sbjct: 1113 SKWVEAIASPKADGKTVIKFLKRNIFTRFGTPRVLISDGGSHFCNSQLAKALEHYGVKHK 1172

Query: 541  ------------------------------------------------------GMSPYA 600
                                                                  G++P+ 
Sbjct: 1173 IASPYHPQTNGQAEVSNREIKKILEKTVSTSRKDWSLKLDEALWAYRTAFKSPIGLTPFQ 1232

Query: 601  LVFGKACHLPLELEHKAIWAMKKLNLDQEASGEARKLQLNELLEWRHSAYENAKLYKERT 660
            +++GKACHLP+ELEHKA WA+K LN D+  +GE RK QL+EL E R  AYE++KLYK++ 
Sbjct: 1233 MIYGKACHLPVELEHKAFWALKFLNFDENQAGEKRKFQLHELEEMRFHAYESSKLYKQKV 1292

Query: 661  KKWHDKNISKKILYVCQKVLLFNSRLRLFPGKLKSRWSGPFITKEVFPHGAVMLTNENGT 675
            K +HDK I K+     QKVLLFNSRL+LFPGKLKS+WSGPFI KEV P+GAV + +   +
Sbjct: 1293 KSYHDKQIVKRDFQPGQKVLLFNSRLKLFPGKLKSKWSGPFIIKEVKPYGAVEIEDVEMS 1352

BLAST of Clc03G10110 vs. TAIR 10
Match: ATMG00860.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 92.0 bits (227), Expect = 1.9e-18
Identity = 48/132 (36.36%), Postives = 75/132 (56.82%), Query Frame = 0

Query: 46  LTNLEQVLKRCEDTNLVLNWEKCHFMETKGIVLGYK--ISKVGLEVDQAKIDAIAKLPAP 105
           + +L  VL+  E      N +KC F + +   LG++  IS  G+  D AK++A+   P P
Sbjct: 1   MNHLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEP 60

Query: 106 TNVQTLRSFLGHAGFYRRFIKGFSQVAKPLSELLEVNREFNFNGKCLNAFESLRQALISA 165
            N   LR FLG  G+YRRF+K + ++ +PL+ELL+ N    +      AF++L+ A+ + 
Sbjct: 61  KNTTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKKN-SLKWTEMAALAFKALKGAVTTL 120

Query: 166 PILVAPDWSLPF 176
           P+L  PD  LPF
Sbjct: 121 PVLALPDLKLPF 131

BLAST of Clc03G10110 vs. TAIR 10
Match: ATMG00750.1 (GAG/POL/ENV polyprotein )

HSP 1 Score: 85.5 bits (210), Expect = 1.8e-16
Identity = 34/56 (60.71%), Postives = 43/56 (76.79%), Query Frame = 0

Query: 406 VLQSGYFWPTLFKDPRAYAVACDGCQRTGNISNRNEMPLNSMLEVELFDVWGIDFM 462
           VLQ+G++WPT FKD   +  +CD CQR GN + RNEMP + +LEVE+FDVWGI FM
Sbjct: 35  VLQAGFYWPTTFKDAHGFVSSCDACQRKGNFTKRNEMPQHFILEVEVFDVWGIYFM 90

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PIM97577.12.9e-25860.08DNA-directed DNA polymerase [Handroanthus impetiginosus][more]
BBH06778.11.4e-25257.73transposable element gene [Prunus dulcis][more]
XP_028962178.18.3e-25056.61LOW QUALITY PROTEIN: uncharacterized protein LOC114826270 [Malus domestica][more]
XP_031379021.12.4e-24957.28uncharacterized protein LOC116194359 [Punica granatum][more]
XP_012858910.12.4e-24957.69PREDICTED: uncharacterized protein LOC105978045 [Erythranthe guttata][more]
Match NameE-valueIdentityDescription
P043233.5e-5741.14Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogast... [more]
Q8I7P93.6e-5437.91Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogast... [more]
P208251.4e-5338.28Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaste... [more]
Q7LHG54.4e-5229.67Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
Q993155.7e-5229.51Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
Match NameE-valueIdentityDescription
A0A2G9FWY31.4e-25860.08Reverse transcriptase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_29952 PE=... [more]
A0A4Y1RS996.7e-25357.73Transposable element protein OS=Prunus dulcis OX=3755 GN=Prudu_018514 PE=4 SV=1[more]
A0A6P8CBX21.2e-24957.28Reverse transcriptase OS=Punica granatum OX=22663 GN=LOC116194359 PE=4 SV=1[more]
A0A6P8DLJ84.9e-24857.01Reverse transcriptase OS=Punica granatum OX=22663 GN=LOC116205794 PE=4 SV=1[more]
A0A2K3NPD01.8e-24555.41Reverse transcriptase OS=Trifolium pratense OX=57577 GN=L195_g001324 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
ATMG00860.11.9e-1836.36DNA/RNA polymerases superfamily protein [more]
ATMG00750.11.8e-1660.71GAG/POL/ENV polyprotein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR041373Reverse transcriptase, RNase H-like domainPFAMPF17917RT_RNaseHcoord: 170..273
e-value: 5.9E-35
score: 119.7
IPR000477Reverse transcriptase domainPFAMPF00078RVT_1coord: 5..82
e-value: 1.8E-9
score: 37.5
IPR043128Reverse transcriptase/Diguanylate cyclase domainGENE3D3.30.70.270coord: 2..86
e-value: 1.2E-26
score: 95.2
IPR043128Reverse transcriptase/Diguanylate cyclase domainGENE3D3.30.70.270coord: 91..186
e-value: 9.7E-26
score: 91.6
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 449..521
e-value: 1.1E-19
score: 72.5
IPR041588Integrase zinc-binding domainPFAMPF17921Integrase_H2C2coord: 376..433
e-value: 4.9E-10
score: 39.3
NoneNo IPR availableGENE3D1.10.340.70coord: 339..433
e-value: 2.7E-15
score: 58.4
NoneNo IPR availablePANTHERPTHR24559:SF373REVERSE TRANSCRIPTASE DOMAIN, RIBONUCLEASE H-LIKE DOMAIN PROTEIN-RELATEDcoord: 24..520
NoneNo IPR availablePANTHERPTHR24559TRANSPOSON TY3-I GAG-POL POLYPROTEINcoord: 24..520
NoneNo IPR availableCDDcd09274RNase_HI_RT_Ty3coord: 176..294
e-value: 2.97728E-57
score: 187.7
NoneNo IPR availableCDDcd01647RT_LTRcoord: 1..82
e-value: 7.8918E-30
score: 114.23
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 439..520
score: 11.681875
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 1..277
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 452..532

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Clc03G10110.1Clc03G10110.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0071897 DNA biosynthetic process
biological_process GO:0015074 DNA integration
molecular_function GO:0034061 DNA polymerase activity
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0003676 nucleic acid binding