Clc03G10990 (gene) Watermelon (cordophanus) v2

Overview
NameClc03G10990
Typegene
OrganismCitrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
DescriptionIntegrase catalytic domain-containing protein
LocationClcChr03: 13945476 .. 13947167 (+)
RNA-Seq ExpressionClc03G10990
SyntenyClc03G10990
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTGTGATGCAAGCGACTATGCGATGGGCGCTGCGTTGGTTCAACGCAGAGACATAATGCTGCACCCTATAGCCTACGCCAGCAAAACTTTCAACACAGCTCAAGCAAATTATACAACTATAGAGAAAGAGTTGCTGGCGGTAGTATTCGCTGTGGAAAAATTCAGAGCATATCTTCTAGGGTCGAAAGTCATCATCCATACTGACCACTCGGCTATCCGATATTTGATGACTAAAAAAGATGCCAAGCCGAGGCTTATTCGATGGGTTTTACTTTTTCAAGAGTTCGATGCAAAGATAGTTGATCGAAAGGGAACAAAGAACAACGTGGCAGATCGTCTTTCAAGACTCGAGAATATGGAACACGATCGCAAACAGCTAGACGTCAACGTAAGCTTTCCAGATGAAGCTGTATTGAAAGTCACAGAGTTCGCGCCGTGGTATGCAGACATTGTCAACTTCTTGGTGTGCAAACAATTTCCTGAAGACTTCAACACACAGCAAAAGAAGAAGTTGATACATGACGCGAAGTTTTATTATTGGGACGAGCCCCAGCTCTACAAAAGGGGGCCGGATCACATCTTCAGACTTTGCGTCCCAGAGACTTCGTATCAACACATCCTATCTCAATGTCATGACTCCCCCTATGGAGGACATTTTGGAGGACAACGAACTGCACCCAAGGTATTGCAAAGCGGATACTTCTGGCCAACTCTTTTTAGAGACGCCAAGGACTATGCGATCAGATGCGACAGATGTCAACGCATCGGAAATATCTCAAGTCGGAACGAAATGCCACTTACCTCTATTTTAGAAGTTGAGCTCTTTGACGTTTGGGGTATTGATTTCATGGGCCCATTCCAGCCATCAAACGGCCACAACTACATACTGGTAGCGGTGGACTATGTATCAAAATGGGTTGAAGCAATCTCATGCGCTAGGAATGACGCGGCAACAGTCTCAAACTTTCTTCAGAAGAATATTAGCGATGAAGGTACACATTTCATTAATAAAATCATTTCCAATTTGCTAATAAAATACAATGTACATCATAGGGTGGCCACTGCATACCACCCACAGACAAATGGGCAGGCAGAAATTTCCAATAGAGAGTTGAAAACTATTTTGGAAAAAGTAGTTAACACCTCTCGAAAGGACTGGGCGTCCAAATTAAACGAGTCGTTATGGGCATACCGTACAGCATTCAAAACGCCTATCGGGATGTCACCGTATGCGCTGGTCTTTGGAAAGGCATGCCATCTGCCGCTGGAGCTAGAACATAAAGTATTGTGGGCTGCCAAGAGATTGAACATGGACCTGAAAGCAGCTGGAGAAGCGCGTCAACTTCAATTGAATGAACTGGAGGAATGGAGACTGCAAGCATATGAGAACGCAAAGATATACAAGGAGCGAACAAAGCGTTGGCACGACCAACACATCAGTAAGAAATCTTTGTATATAGGTCAAAAGGTCCTTCTTTTTAACTCAAGACTGCGTTTATTTCCAGGAAAATTAAAAACAAGATGGTCTGGTCCTTTTGTGATCAAGGAAATCTTTCCTCATGGTGCCGTAGAATTGATGAATGAAGACGGCACCAACGCATTCAAAGTTAATGGTCAGCGCGTGAAACCATATTTTGGAGACTGCCTTGAACGCGACAAAGTAACGGTTGACCTAGCAAAAATCGAATGA

mRNA sequence

ATGTGTGATGCAAGCGACTATGCGATGGGCGCTGCGTTGGTTCAACGCAGAGACATAATGCTGCACCCTATAGCCTACGCCAGCAAAACTTTCAACACAGCTCAAGCAAATTATACAACTATAGAGAAAGAGTTGCTGGCGGTAGTATTCGCTGTGGAAAAATTCAGAGCATATCTTCTAGGGTCGAAAGTCATCATCCATACTGACCACTCGGCTATCCGATATTTGATGACTAAAAAAGATGCCAAGCCGAGGCTTATTCGATGGGTTTTACTTTTTCAAGAGTTCGATGCAAAGATAGTTGATCGAAAGGGAACAAAGAACAACGTGGCAGATCGTCTTTCAAGACTCGAGAATATGGAACACGATCGCAAACAGCTAGACGTCAACGTAAGCTTTCCAGATGAAGCTGTATTGAAAGTCACAGAGTTCGCGCCGTGGTATGCAGACATTGTCAACTTCTTGGTGTGCAAACAATTTCCTGAAGACTTCAACACACAGCAAAAGAAGAAGTTGATACATGACGCGAAGTTTTATTATTGGGACGAGCCCCAGCTCTACAAAAGGGGGCCGGATCACATCTTCAGACTTTGCGTCCCAGAGACTTCGTATCAACACATCCTATCTCAATGTCATGACTCCCCCTATGGAGGACATTTTGGAGGACAACGAACTGCACCCAAGGTATTGCAAAGCGGATACTTCTGGCCAACTCTTTTTAGAGACGCCAAGGACTATGCGATCAGATGCGACAGATGTCAACGCATCGGAAATATCTCAAGTCGGAACGAAATGCCACTTACCTCTATTTTAGAAGTTGAGCTCTTTGACGTTTGGGGTATTGATTTCATGGGCCCATTCCAGCCATCAAACGGCCACAACTACATACTGGTAGCGGTGGACTATGTATCAAAATGGGTTGAAGCAATCTCATGCGCTAGGAATGACGCGGCAACAGTCTCAAACTTTCTTCAGAAGAATATTAGCGATGAAGGTACACATTTCATTAATAAAATCATTTCCAATTTGCTAATAAAATACAATGTACATCATAGGGTGGCCACTGCATACCACCCACAGACAAATGGGCAGGCAGAAATTTCCAATAGAGAGTTGAAAACTATTTTGGAAAAAGTAGTTAACACCTCTCGAAAGGACTGGGCGTCCAAATTAAACGAGTCGTTATGGGCATACCGTACAGCATTCAAAACGCCTATCGGGATGTCACCGTATGCGCTGGTCTTTGGAAAGGCATGCCATCTGCCGCTGGAGCTAGAACATAAAGTATTGTGGGCTGCCAAGAGATTGAACATGGACCTGAAAGCAGCTGGAGAAGCGCGTCAACTTCAATTGAATGAACTGGAGGAATGGAGACTGCAAGCATATGAGAACGCAAAGATATACAAGGAGCGAACAAAGCGTTGGCACGACCAACACATCAGTAAGAAATCTTTGTATATAGGTCAAAAGGTCCTTCTTTTTAACTCAAGACTGCGTTTATTTCCAGGAAAATTAAAAACAAGATGGTCTGGTCCTTTTGTGATCAAGGAAATCTTTCCTCATGGTGCCGTAGAATTGATGAATGAAGACGGCACCAACGCATTCAAAGTTAATGGTCAGCGCGTGAAACCATATTTTGGAGACTGCCTTGAACGCGACAAAGTAACGGTTGACCTAGCAAAAATCGAATGA

Coding sequence (CDS)

ATGTGTGATGCAAGCGACTATGCGATGGGCGCTGCGTTGGTTCAACGCAGAGACATAATGCTGCACCCTATAGCCTACGCCAGCAAAACTTTCAACACAGCTCAAGCAAATTATACAACTATAGAGAAAGAGTTGCTGGCGGTAGTATTCGCTGTGGAAAAATTCAGAGCATATCTTCTAGGGTCGAAAGTCATCATCCATACTGACCACTCGGCTATCCGATATTTGATGACTAAAAAAGATGCCAAGCCGAGGCTTATTCGATGGGTTTTACTTTTTCAAGAGTTCGATGCAAAGATAGTTGATCGAAAGGGAACAAAGAACAACGTGGCAGATCGTCTTTCAAGACTCGAGAATATGGAACACGATCGCAAACAGCTAGACGTCAACGTAAGCTTTCCAGATGAAGCTGTATTGAAAGTCACAGAGTTCGCGCCGTGGTATGCAGACATTGTCAACTTCTTGGTGTGCAAACAATTTCCTGAAGACTTCAACACACAGCAAAAGAAGAAGTTGATACATGACGCGAAGTTTTATTATTGGGACGAGCCCCAGCTCTACAAAAGGGGGCCGGATCACATCTTCAGACTTTGCGTCCCAGAGACTTCGTATCAACACATCCTATCTCAATGTCATGACTCCCCCTATGGAGGACATTTTGGAGGACAACGAACTGCACCCAAGGTATTGCAAAGCGGATACTTCTGGCCAACTCTTTTTAGAGACGCCAAGGACTATGCGATCAGATGCGACAGATGTCAACGCATCGGAAATATCTCAAGTCGGAACGAAATGCCACTTACCTCTATTTTAGAAGTTGAGCTCTTTGACGTTTGGGGTATTGATTTCATGGGCCCATTCCAGCCATCAAACGGCCACAACTACATACTGGTAGCGGTGGACTATGTATCAAAATGGGTTGAAGCAATCTCATGCGCTAGGAATGACGCGGCAACAGTCTCAAACTTTCTTCAGAAGAATATTAGCGATGAAGGTACACATTTCATTAATAAAATCATTTCCAATTTGCTAATAAAATACAATGTACATCATAGGGTGGCCACTGCATACCACCCACAGACAAATGGGCAGGCAGAAATTTCCAATAGAGAGTTGAAAACTATTTTGGAAAAAGTAGTTAACACCTCTCGAAAGGACTGGGCGTCCAAATTAAACGAGTCGTTATGGGCATACCGTACAGCATTCAAAACGCCTATCGGGATGTCACCGTATGCGCTGGTCTTTGGAAAGGCATGCCATCTGCCGCTGGAGCTAGAACATAAAGTATTGTGGGCTGCCAAGAGATTGAACATGGACCTGAAAGCAGCTGGAGAAGCGCGTCAACTTCAATTGAATGAACTGGAGGAATGGAGACTGCAAGCATATGAGAACGCAAAGATATACAAGGAGCGAACAAAGCGTTGGCACGACCAACACATCAGTAAGAAATCTTTGTATATAGGTCAAAAGGTCCTTCTTTTTAACTCAAGACTGCGTTTATTTCCAGGAAAATTAAAAACAAGATGGTCTGGTCCTTTTGTGATCAAGGAAATCTTTCCTCATGGTGCCGTAGAATTGATGAATGAAGACGGCACCAACGCATTCAAAGTTAATGGTCAGCGCGTGAAACCATATTTTGGAGACTGCCTTGAACGCGACAAAGTAACGGTTGACCTAGCAAAAATCGAATGA

Protein sequence

MCDASDYAMGAALVQRRDIMLHPIAYASKTFNTAQANYTTIEKELLAVVFAVEKFRAYLLGSKVIIHTDHSAIRYLMTKKDAKPRLIRWVLLFQEFDAKIVDRKGTKNNVADRLSRLENMEHDRKQLDVNVSFPDEAVLKVTEFAPWYADIVNFLVCKQFPEDFNTQQKKKLIHDAKFYYWDEPQLYKRGPDHIFRLCVPETSYQHILSQCHDSPYGGHFGGQRTAPKVLQSGYFWPTLFRDAKDYAIRCDRCQRIGNISSRNEMPLTSILEVELFDVWGIDFMGPFQPSNGHNYILVAVDYVSKWVEAISCARNDAATVSNFLQKNISDEGTHFINKIISNLLIKYNVHHRVATAYHPQTNGQAEISNRELKTILEKVVNTSRKDWASKLNESLWAYRTAFKTPIGMSPYALVFGKACHLPLELEHKVLWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIYKERTKRWHDQHISKKSLYIGQKVLLFNSRLRLFPGKLKTRWSGPFVIKEIFPHGAVELMNEDGTNAFKVNGQRVKPYFGDCLERDKVTVDLAKIE
Homology
BLAST of Clc03G10990 vs. NCBI nr
Match: PIM97577.1 (DNA-directed DNA polymerase [Handroanthus impetiginosus])

HSP 1 Score: 754.6 bits (1947), Expect = 6.1e-214
Identity = 360/571 (63.05%), Postives = 439/571 (76.88%), Query Frame = 0

Query: 1    MCDASDYAMGAALVQRRDIMLHPIAYASKTFNTAQANYTTIEKELLAVVFAVEKFRAYLL 60
            MCDASD+A+GA L QR+D +   I YASKT N AQ NYTT EKELLAVVFA +KFR+YL+
Sbjct: 1003 MCDASDFAVGAVLGQRKDKIFRSIYYASKTLNDAQLNYTTTEKELLAVVFAFDKFRSYLV 1062

Query: 61   GSKVIIHTDHSAIRYLMTKKDAKPRLIRWVLLFQEFDAKIVDRKGTKNNVADRLSRLENM 120
            G+KVI++TDH+AIRYL+ KKDAKPRLIRWVLL QEFD +I DRKGT+N +AD LSRLE+ 
Sbjct: 1063 GTKVIVYTDHAAIRYLIEKKDAKPRLIRWVLLLQEFDLEIRDRKGTENQIADHLSRLESP 1122

Query: 121  EHDRKQLDVNVSFPDEAVLK-VTEFAPWYADIVNFLVCKQFPEDFNTQQKKKLIHDAKFY 180
                +   +N +FPDE +L  V    PWYADIVN+L C   P D + QQKKK + D + Y
Sbjct: 1123 AKTDEPNLINDNFPDEQLLAIVASDVPWYADIVNYLTCGIIPFDLSAQQKKKFLFDTRRY 1182

Query: 181  YWDEPQLYKRGPDHIFRLCVPETSYQHILSQCHDSPYGGHFGGQRTAPKVLQSGYFWPTL 240
            +WD+P L+K+GPD+I R CVPE     IL QCH SPYGGHF G RTA K+LQSG+FWP L
Sbjct: 1183 FWDDPFLFKQGPDNILRRCVPEIEMNDILEQCHASPYGGHFHGDRTAAKILQSGFFWPNL 1242

Query: 241  FRDAKDYAIRCDRCQRIGNISSRNEMPLTSILEVELFDVWGIDFMGPFQPSNGHNYILVA 300
            F+DA  +   CDRCQR GNIS R+EMPL +ILEVELFDVWGIDFMGPF PS G+ YILVA
Sbjct: 1243 FKDAHSFVANCDRCQRTGNISRRHEMPLNTILEVELFDVWGIDFMGPFIPSFGNMYILVA 1302

Query: 301  VDYVSKWVEAISCARNDAATVSNFLQKN-----------ISDEGTHFINKIISNLLIKYN 360
            VDYVSKWVEA +   ND+  V NF++KN           ISD GTHF N+    LL KY 
Sbjct: 1303 VDYVSKWVEAAAVPNNDSKVVVNFIKKNIFTRFGTPRAIISDGGTHFCNRSFEALLSKYG 1362

Query: 361  VHHRVATAYHPQTNGQAEISNRELKTILEKVVNTSRKDWASKLNESLWAYRTAFKTPIGM 420
            V H+++T YHPQT+GQ E+SNRE+K ILEK V+++RKDW+ +L+E+LWAYRTA+KTPIGM
Sbjct: 1363 VKHKISTPYHPQTSGQVEVSNREIKRILEKTVSSTRKDWSKRLDEALWAYRTAYKTPIGM 1422

Query: 421  SPYALVFGKACHLPLELEHKVLWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIY 480
            SPY LVFGKACHLP+ELEH   WA ++LN D++AAGE R LQLNEL+E+RL AYENAKIY
Sbjct: 1423 SPYRLVFGKACHLPVELEHNAYWAIRKLNFDMQAAGEKRLLQLNELDEFRLHAYENAKIY 1482

Query: 481  KERTKRWHDQHISKKSLYIGQKVLLFNSRLRLFPGKLKTRWSGPFVIKEIFPHGAVELMN 540
            KE+ KRWH++ I ++    GQ VLLFNSRL+LFPGKLK+RWSGPF I E+FPHGAVEL N
Sbjct: 1483 KEKKKRWHEKKIVERHFEPGQYVLLFNSRLKLFPGKLKSRWSGPFRITEVFPHGAVELEN 1542

Query: 541  EDGTNAFKVNGQRVKPYFGDCLERDKVTVDL 560
            ++  N FKVN QR+K Y+G+ ++R   ++ L
Sbjct: 1543 KNSRNRFKVNAQRIKHYWGEIVDRQHASITL 1573

BLAST of Clc03G10990 vs. NCBI nr
Match: XP_023874613.1 (uncharacterized protein LOC111987139 [Quercus suber])

HSP 1 Score: 749.6 bits (1934), Expect = 2.0e-212
Identity = 361/570 (63.33%), Postives = 435/570 (76.32%), Query Frame = 0

Query: 1    MCDASDYAMGAALVQRRDIMLHPIAYASKTFNTAQANYTTIEKELLAVVFAVEKFRAYLL 60
            MCDASD+A+GA L QRRD +   I YAS+T N AQ NYTT EKE+LAVVFA +KFR+YL+
Sbjct: 1185 MCDASDFALGAVLGQRRDKLFRAIYYASRTLNEAQLNYTTTEKEMLAVVFACDKFRSYLI 1244

Query: 61   GSKVIIHTDHSAIRYLMTKKDAKPRLIRWVLLFQEFDAKIVDRKGTKNNVADRLSRLENM 120
             +KVI+ TDH+A+RYL +KKDAKPRLIRW+LL QEFD ++ D+KG++N+VAD LSRLE  
Sbjct: 1245 CTKVIVFTDHAALRYLFSKKDAKPRLIRWILLLQEFDLEVRDKKGSENSVADHLSRLE-Q 1304

Query: 121  EHDRKQLDVNVSFPDEAVLKVTEFAPWYADIVNFLVCKQFPEDFNTQQKKKLIHDAKFYY 180
            E  R  L +  +FPDE +       PWYADIVNFL CK  P D    Q+KK +HD K+Y 
Sbjct: 1305 EEVRPDLVIQEAFPDEQLFACEIKLPWYADIVNFLACKVLPPDLTYHQRKKFLHDVKYYL 1364

Query: 181  WDEPQLYKRGPDHIFRLCVPETSYQHILSQCHDSPYGGHFGGQRTAPKVLQSGYFWPTLF 240
            WDEP L+KR PD I R CVPE   Q IL  CH S YGGHFG  RTA KVLQSG+FWP++F
Sbjct: 1365 WDEPLLFKRCPDQIIRRCVPEEEMQAILHHCHSSSYGGHFGVTRTAAKVLQSGFFWPSIF 1424

Query: 241  RDAKDYAIRCDRCQRIGNISSRNEMPLTSILEVELFDVWGIDFMGPFQPSNGHNYILVAV 300
            RD+      CDRCQR+GNIS R E+PL +ILEVELFDVWGIDFMGPF PS G  YIL+AV
Sbjct: 1425 RDSYTLVKTCDRCQRMGNISRRQELPLKNILEVELFDVWGIDFMGPFPPSFGFVYILLAV 1484

Query: 301  DYVSKWVEAISCARNDAATVSNFLQKN-----------ISDEGTHFINKIISNLLIKYNV 360
            DYVSKWVEAI+   NDA  V  FL KN           ISDEGTHF NK+  NLL KY V
Sbjct: 1485 DYVSKWVEAIATTTNDAKVVLKFLHKNIFTRFGTPRAIISDEGTHFCNKLFDNLLSKYGV 1544

Query: 361  HHRVATAYHPQTNGQAEISNRELKTILEKVVNTSRKDWASKLNESLWAYRTAFKTPIGMS 420
             H++A AYHPQTNGQAEISNRE+K ILEK VNT+RKDWA KL+++LWAYRTAFKTPIGMS
Sbjct: 1545 KHKIALAYHPQTNGQAEISNREIKNILEKTVNTNRKDWAKKLDDALWAYRTAFKTPIGMS 1604

Query: 421  PYALVFGKACHLPLELEHKVLWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIYK 480
            PY LVFGKACHLP+ELEHK  WA K+ N+DLKAAGE R LQLNE++E+R  AYENAKIYK
Sbjct: 1605 PYRLVFGKACHLPVELEHKAYWAVKKFNLDLKAAGEKRLLQLNEMDEFRNDAYENAKIYK 1664

Query: 481  ERTKRWHDQHISKKSLYIGQKVLLFNSRLRLFPGKLKTRWSGPFVIKEIFPHGAVELMNE 540
            ERTK+WHD+ I ++    GQ+VLLFNSRL+LFPGKL++RW+GP+ I ++   GA++L ++
Sbjct: 1665 ERTKKWHDKQILRREFAPGQQVLLFNSRLKLFPGKLRSRWTGPYTIDKVSSFGAIDLKDK 1724

Query: 541  DGTNAFKVNGQRVKPYFGDCLERDKVTVDL 560
             G + F+VNGQR+K Y+G+ +ER+   + L
Sbjct: 1725 AG-HIFRVNGQRLKHYYGEQVERNCAFIPL 1752

BLAST of Clc03G10990 vs. NCBI nr
Match: PNX77934.1 (hypothetical protein L195_g033907 [Trifolium pratense])

HSP 1 Score: 746.9 bits (1927), Expect = 1.3e-211
Identity = 358/557 (64.27%), Postives = 427/557 (76.66%), Query Frame = 0

Query: 1   MCDASDYAMGAALVQRRDIMLHPIAYASKTFNTAQANYTTIEKELLAVVFAVEKFRAYLL 60
           MCDASD A+GA L QR+D +LH I YAS   N AQ NY T EKELLAVV+A +KFR+YLL
Sbjct: 24  MCDASDIAVGAVLGQRKDKLLHVIYYASHVLNPAQLNYATTEKELLAVVYAFDKFRSYLL 83

Query: 61  GSKVIIHTDHSAIRYLMTKKDAKPRLIRWVLLFQEFDAKIVDRKGTKNNVADRLSRLENM 120
           GSKVI++TDH+A+RYL  K+++KPRL+RW+LL QEFD +I D+KG++N VAD LSRLE +
Sbjct: 84  GSKVIVYTDHAALRYLFAKQESKPRLLRWILLLQEFDLEIRDKKGSENTVADHLSRLEKV 143

Query: 121 EHDRKQLDVNVSFPDEAVLKVTEFAPWYADIVNFLVCKQFPEDFNTQQKKKLIHDAKFYY 180
               ++  +   F DE +L VT  APW+AD  N++V +  P DF  QQ+KK +HD KFY 
Sbjct: 144 VETEEERAIQDLFADEHILAVT-VAPWFADFANYMVGRTIPSDFTPQQRKKFLHDCKFYV 203

Query: 181 WDEPQLYKRGPDHIFRLCVPETSYQHILSQCHDSPYGGHFGGQRTAPKVLQSGYFWPTLF 240
           WDEP LYKRG D + R CVPE   + +L  CHDS YGGHF G RTA KVLQSG FWPTLF
Sbjct: 204 WDEPFLYKRGVDGLLRRCVPEGEQEKVLWHCHDSSYGGHFSGDRTAAKVLQSGLFWPTLF 263

Query: 241 RDAKDYAIRCDRCQRIGNISSRNEMPLTSILEVELFDVWGIDFMGPFQPSNGHNYILVAV 300
           +DA  Y  RCDRCQR GNIS RNEMP   ILEVE+FDVWGIDFMGPF  S    YILVAV
Sbjct: 264 KDAFTYVKRCDRCQRTGNISKRNEMPQNPILEVEIFDVWGIDFMGPFPSSYSKTYILVAV 323

Query: 301 DYVSKWVEAISCARNDAATVSNFLQKN-----------ISDEGTHFINKIISNLLIKYNV 360
           DYVSKWVEAI+   NDA  V  FL++N           ISDEGTHF+N+ +  LL KYNV
Sbjct: 324 DYVSKWVEAIATHTNDAQVVVAFLKRNIFSRFGVPRALISDEGTHFLNRKMEALLKKYNV 383

Query: 361 HHRVATAYHPQTNGQAEISNRELKTILEKVVNTSRKDWASKLNESLWAYRTAFKTPIGMS 420
           HHR+AT YHPQT+GQ E+SNR++K ILEK VN+SRKDW+ KL+++LWAYRTAFKTPIGMS
Sbjct: 384 HHRIATPYHPQTSGQVEVSNRQIKQILEKTVNSSRKDWSVKLDDALWAYRTAFKTPIGMS 443

Query: 421 PYALVFGKACHLPLELEHKVLWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIYK 480
           P+ +V+GKACHLPLELEHK LWA K LN DL  AGE+R LQL+EL+E+R  AYENAKI+K
Sbjct: 444 PFQIVYGKACHLPLELEHKALWATKFLNFDLSKAGESRILQLHELDEFRNYAYENAKIFK 503

Query: 481 ERTKRWHDQHISKKSLYIGQKVLLFNSRLRLFPGKLKTRWSGPFVIKEIFPHGAVELMNE 540
           E+TK+WHD+ I KK    GQ VLLFNSRLRLFPGKLK+RWSGPF IK++ PHGA+EL + 
Sbjct: 504 EKTKKWHDRKIQKKEFREGQLVLLFNSRLRLFPGKLKSRWSGPFKIKKVLPHGAIELEDR 563

Query: 541 DGTNAFKVNGQRVKPYF 547
           +    FKVNGQR+KPYF
Sbjct: 564 ETDQTFKVNGQRLKPYF 579

BLAST of Clc03G10990 vs. NCBI nr
Match: XP_028962178.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC114826270 [Malus domestica])

HSP 1 Score: 740.3 bits (1910), Expect = 1.2e-209
Identity = 356/573 (62.13%), Postives = 433/573 (75.57%), Query Frame = 0

Query: 1    MCDASDYAMGAALVQRRDIMLHPIAYASKTFNTAQANYTTIEKELLAVVFAVEKFRAYLL 60
            MCDASDYA+GA L QRRD +LH I YAS+T N AQ NY T EKELLAVVFA++KFR+YL+
Sbjct: 1007 MCDASDYAIGAVLGQRRDKLLHVIYYASRTLNDAQLNYATTEKELLAVVFALDKFRSYLI 1066

Query: 61   GSKVIIHTDHSAIRYLMTKKDAKPRLIRWVLLFQEFDAKIVDRKGTKNNVADRLSRLENM 120
            GSKVI++TDHSA++YL++KKDAKPRLIRWVLL QEFD +I D+KG++N VAD LSR+ + 
Sbjct: 1067 GSKVIVYTDHSALKYLLSKKDAKPRLIRWVLLLQEFDLEIRDKKGSENVVADHLSRIMHH 1126

Query: 121  EHDRKQLDVNVSFPDEAVLKV-TEFAPWYADIVNFLVCKQFPEDFNTQQKKKLIHDAKFY 180
            E D   + ++ +FPDE +  + +   PWYAD VN+LV    P D   QQKK+ I   + Y
Sbjct: 1127 EGDNDLVPISETFPDEQLFTIKSSVTPWYADYVNYLVSDIMPPDLTWQQKKRFISLVRHY 1186

Query: 181  YWDEPQLYKRGPDHIFRLCVPETSYQHILSQCHDSPYGGHFGG-QRTAPKVLQSGYFWPT 240
            YWDEP L+K  PD   R CVP+   + IL  CH    GGHFG  +R  P++LQSG++WPT
Sbjct: 1187 YWDEPYLWKHCPDQCIRRCVPDDEMEEILRHCHSLACGGHFGATKRLLPRLLQSGFWWPT 1246

Query: 241  LFRDAKDYAIRCDRCQRIGNISSRNEMPLTSILEVELFDVWGIDFMGPFQPSNGHNYILV 300
            LF+DA  +   CDRCQRIGNISSR++MPL +ILEVELFDVWGIDFMGPF  S G  YILV
Sbjct: 1247 LFKDAHTFVSTCDRCQRIGNISSRHQMPLNNILEVELFDVWGIDFMGPFPSSYGKQYILV 1306

Query: 301  AVDYVSKWVEAISCARNDAATVSNFLQKN-----------ISDEGTHFINKIISNLLIKY 360
            AVDYVSKWVEAI+   NDA  V +FL+K+           ISD G HF N+  + LL KY
Sbjct: 1307 AVDYVSKWVEAIALPTNDAKVVVHFLRKHIFTRFGTPRAIISDGGKHFCNRHFNALLAKY 1366

Query: 361  NVHHRVATAYHPQTNGQAEISNRELKTILEKVVNTSRKDWASKLNESLWAYRTAFKTPIG 420
             + H+VAT YHPQT+GQ EISNRE+K ILEK V  SRKDWA+KL+++LWAYRTAFKTPIG
Sbjct: 1367 GITHKVATPYHPQTSGQVEISNREIKNILEKTVGLSRKDWAAKLDDALWAYRTAFKTPIG 1426

Query: 421  MSPYALVFGKACHLPLELEHKVLWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKI 480
            MSPY LVFGKACHLP+ELEHK  WA K+LN ++ AAGE R LQLNE+EE+R  AYENAKI
Sbjct: 1427 MSPYRLVFGKACHLPVELEHKAFWAVKKLNFEMDAAGEKRALQLNEMEEFRNDAYENAKI 1486

Query: 481  YKERTKRWHDQHISKKSLYIGQKVLLFNSRLRLFPGKLKTRWSGPFVIKEIFPHGAVELM 540
            YKERTK+WHDQHI ++  YIGQ+VLLFNSRL+LFPGKL+TRWSGPF + ++FP+G +E+ 
Sbjct: 1487 YKERTKKWHDQHILRREFYIGQQVLLFNSRLKLFPGKLRTRWSGPFTVVQVFPYGTIEIR 1546

Query: 541  NEDGTNAFKVNGQRVKPYFGDCLERDKVTVDLA 561
            +      FKVNGQR+KPY     ER+   V LA
Sbjct: 1547 DHTTDATFKVNGQRLKPYLAASFERNTNVVTLA 1579

BLAST of Clc03G10990 vs. NCBI nr
Match: XP_031120206.1 (uncharacterized protein LOC116023351 [Ipomoea triloba])

HSP 1 Score: 739.2 bits (1907), Expect = 2.7e-209
Identity = 349/571 (61.12%), Postives = 433/571 (75.83%), Query Frame = 0

Query: 1    MCDASDYAMGAALVQRRDIMLHPIAYASKTFNTAQANYTTIEKELLAVVFAVEKFRAYLL 60
            MCDASD+A+GA L Q+RD  LHPI YAS T N AQ NY+T EKE+LAV+FA+EKFR+YL+
Sbjct: 1093 MCDASDFAVGAVLCQKRDKSLHPIYYASHTLNDAQVNYSTTEKEMLAVIFAIEKFRSYLV 1152

Query: 61   GSKVIIHTDHSAIRYLMTKKDAKPRLIRWVLLFQEFDAKIVDRKGTKNNVADRLSRLENM 120
            GSKVIIHTDHSA++YL+ KKDAKPRLIRW LL QEFD +I D+KG++N VAD LSRLEN 
Sbjct: 1153 GSKVIIHTDHSALKYLINKKDAKPRLIRWTLLLQEFDIEIKDKKGSENLVADHLSRLENQ 1212

Query: 121  -EHDRKQLDVNVSFPDEAVLKVTEFAPWYADIVNFLVCKQFPEDFNTQQKKKLIHDAKFY 180
             E   ++L ++ +   E ++ +  F PWY+D+VN++VC++FP + +   KKKL +DAK+Y
Sbjct: 1213 KEVVGEELPIDDNLGGEQLMAIKAFLPWYSDLVNYVVCQKFPPNASRALKKKLKNDAKYY 1272

Query: 181  YWDEPQLYKRGPDHIFRLCVPETSYQHILSQCHDSPYGGHFGGQRTAPKVLQSGYFWPTL 240
             W EP LYKR  D + R C+PE   + +L+ CH SPYGGHFG  RTA KVLQSG++WP+L
Sbjct: 1273 LWKEPFLYKRCADGVMRRCLPEDEIESVLNHCHSSPYGGHFGASRTAAKVLQSGFYWPSL 1332

Query: 241  FRDAKDYAIRCDRCQRIGNISSRNEMPLTSILEVELFDVWGIDFMGPFQPSNGHNYILVA 300
            F+DA  Y   CD+CQR+GN+  RNEMP  SILEVELFDVWG+DFMGPF  S GH YILV 
Sbjct: 1333 FKDAHSYVKHCDKCQRVGNVGRRNEMPQQSILEVELFDVWGLDFMGPFPKSFGHEYILVV 1392

Query: 301  VDYVSKWVEAISCARNDAATVSNFLQKN-----------ISDEGTHFINKIISNLLIKYN 360
            VDYVSKWVEA+SC  NDA  V  F++KN           I+D G HF NK +  +L KY 
Sbjct: 1393 VDYVSKWVEAVSCPTNDAKVVIKFIKKNIFARFGVPRALITDNGKHFCNKALGKVLEKYG 1452

Query: 361  VHHRVATAYHPQTNGQAEISNRELKTILEKVVNTSRKDWASKLNESLWAYRTAFKTPIGM 420
            VHHR+AT YHPQT+GQ E+SNRE+K+ILEKVVN SR+DWA+KL+++LWAYRTA+K PIG 
Sbjct: 1453 VHHRLATPYHPQTSGQVELSNREIKSILEKVVNPSRRDWATKLDDALWAYRTAYKNPIGT 1512

Query: 421  SPYALVFGKACHLPLELEHKVLWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIY 480
            SP+ LV+GKACHLP+ELEHK  WA   LNMDLK AGE R LQLNELEE+RL AYENA+IY
Sbjct: 1513 SPFKLVYGKACHLPVELEHKAHWAVTTLNMDLKLAGEKRLLQLNELEEFRLDAYENAQIY 1572

Query: 481  KERTKRWHDQHISKKSLYIGQKVLLFNSRLRLFPGKLKTRWSGPFVIKEIFPHGAVELMN 540
            KE+TK+WHD+HI K+    G KVLLFNSRL+LFPGKL++RWSGP+ +   +P GAV +  
Sbjct: 1573 KEKTKKWHDKHILKREFVSGDKVLLFNSRLKLFPGKLRSRWSGPYEVVHAYPSGAVVIRG 1632

Query: 541  EDGTNAFKVNGQRVKPYFGDCLERDKVTVDL 560
            ++G   F  NGQR+K Y  D     KV + L
Sbjct: 1633 KNGD--FTANGQRLKHYHDDTKIEAKVAISL 1661

BLAST of Clc03G10990 vs. ExPASy Swiss-Prot
Match: P10394 (Retrovirus-related Pol polyprotein from transposon 412 OS=Drosophila melanogaster OX=7227 GN=POL PE=4 SV=1)

HSP 1 Score: 166.4 bits (420), Expect = 9.4e-40
Identity = 161/628 (25.64%), Postives = 264/628 (42.04%), Query Frame = 0

Query: 3    DASDYAMGAALVQRRDIMLHPIAYASKTFNTAQANYTTIEKELLAVVFAVEKFRAYLLGS 62
            DAS  A GA L Q  +    P+AYAS+ F   ++N +T E+EL A+ +A+  FR Y+ G 
Sbjct: 621  DASKQACGAVLTQNHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYIYGK 680

Query: 63   KVIIHTDHSAIRYLMTKKDAKPRLIRWVLLFQEFDAKIVDRKGTKNNVADRLSRLENME- 122
               + TDH  + YL +  +   +L R  L  +E++  +   KG  N+VAD LSR+   E 
Sbjct: 681  HFTVKTDHRPLTYLFSMVNPSSKLTRIRLELEEYNFTVEYLKGKDNHVADALSRITIKEL 740

Query: 123  ---------------------HDRKQLDVNVS-------------FPDEAVLKVTE---- 182
                                   ++QLD+                  ++ V KV      
Sbjct: 741  KDITGNILKVTTRFQSRQKSCAGKEQLDLQKQTKEIASEPNVYEVITNDEVRKVVTLQLN 800

Query: 183  ---------------------FAPWYADIVNFLVCKQFPEDFNTQQKKKLIHDAKFYYWD 242
                                 +     D+  FL   +         + K+    K +   
Sbjct: 801  DSICLFKHGKKIIARYDVGDLYTNGILDLDQFLQRLELQAGIYDISQIKMAPWKKIFEHV 860

Query: 243  EPQLYKRGPDHIFR-----LCVPETSYQH------ILSQCHDSP-YGGHFGGQRTAPKVL 302
                +K   + I +     L  P T   +      ILS  HD P  GGH G  +T  KV 
Sbjct: 861  SIDKFKNMGNKILKNLKVALLNPVTQINNEKEKEAILSTLHDDPIQGGHTGITKTLAKV- 920

Query: 303  QSGYFWPTLFRDAKDYAIRCDRCQRIGNIS-SRNEMPLTSILEVELFDVWGIDFMGPFQP 362
            +  Y+W  + +  K+Y  +C +CQ+      ++  M +T   E   FD   +D +GP   
Sbjct: 921  KRHYYWKNMSKYIKEYVRKCQKCQKAKTTKHTKTPMTITETPE-HAFDRVVVDTIGPLPK 980

Query: 363  S-NGHNYILVAVDYVSKWVEAISCARNDAATVS-----NFLQKN------ISDEGTHFIN 422
            S NG+ Y +  +  ++K++ AI  A   A TV+     +F+ K       I+D GT + N
Sbjct: 981  SENGNEYAVTLICDLTKYLVAIPIANKSAKTVAKAIFESFILKYGPMKTFITDMGTEYKN 1040

Query: 423  KIISNLLIKYNVHHRVATAYHPQTNGQAEISNRELKTILEKVVNTSRKDWASKLNESLWA 482
             II++L     + +  +TA+H QT G  E S+R L   +   ++T + DW   L   ++ 
Sbjct: 1041 SIITDLCKYLKIKNITSTAHHHQTVGVVERSHRTLNEYIRSYISTDKTDWDVWLQYFVYC 1100

Query: 483  YRTAFKTPIGMSPYALVFGKACHLPLELEHKVLWAAKRLNMDLKAAGEARQLQLNELEEW 542
            + T         PY LVFG+  +LP    +K+       N+D   A E++      LE  
Sbjct: 1101 FNTTQSMVHNYCPYELVFGRTSNLPKHF-NKLHSIEPIYNID-DYAKESKY----RLEVA 1160

Query: 543  RLQAYENAKIYKERTKRWHDQHISKKSLYIGQKVLLFNSRLRLFPGKLKTRWSGPFVIKE 546
              +A +  + +KE+ K  +D  +    L +G KVLL N        KL  +++GP+ I+ 
Sbjct: 1161 YARARKLLEAHKEKNKENYDLKVKDIELEVGDKVLLRNE----VGHKLDFKYTGPYKIES 1220

BLAST of Clc03G10990 vs. ExPASy Swiss-Prot
Match: Q99315 (Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-G PE=1 SV=3)

HSP 1 Score: 125.9 bits (315), Expect = 1.4e-27
Identity = 151/581 (25.99%), Postives = 239/581 (41.14%), Query Frame = 0

Query: 3    DASDYAMGAAL--VQRRDIMLHPIAYASKTFNTAQANYTTIEKELLAVVFAVEKFRAYLL 62
            DAS   +GA L  V  ++ ++  + Y SK+  +AQ NY   E ELL ++ A+  FR  L 
Sbjct: 893  DASKDGIGAVLEEVDNKNKLVGVVGYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLH 952

Query: 63   GSKVIIHTDHSAIRYLMTKKDAKPRLIRWVLLFQEFDAKIVDRKGTKNNVADRLSRLENM 122
            G    + TDH ++  L  K +   R+ RW+     +D  +    G KN VAD +SR    
Sbjct: 953  GKHFTLRTDHISLLSLQNKNEPARRVQRWLDDLATYDFTLEYLAGPKNVVADAISRA--- 1012

Query: 123  EHDRKQLDVNVSFPDEAVLKVTEFAPWY--------ADIVNFLVCKQF---PED---FNT 182
                    V    P+ +    TE    Y        A +++     Q    PED   F +
Sbjct: 1013 --------VYTITPETSRPIDTESWKSYYKSDPLCSAVLIHMKELTQHNVTPEDMSAFRS 1072

Query: 183  QQKKKLIHDA--KFYYWDEPQLYKRGPDHIFRLCVPETSYQHILSQCHD-SPYGGHFGGQ 242
             QKK  + +   K Y  ++  +Y +      RL VP      ++   HD + +GGHFG  
Sbjct: 1073 YQKKLELSETFRKNYSLEDEMIYYQD-----RLVVPIKQQNAVMRLYHDHTLFGGHFGVT 1132

Query: 243  RTAPKVLQSGYFWPTLFRDAKDYAIRCDRCQRIGNISSRNE---MPLTSILEVELFDVWG 302
             T  K+    Y+WP L      Y   C +CQ I +   R      PL  I E    D+  
Sbjct: 1133 VTLAKI-SPIYYWPKLQHSIIQYIRTCVQCQLIKSHRPRLHGLLQPL-PIAEGRWLDI-S 1192

Query: 303  IDFMGPFQP-SNGHNYILVAVDYVSKWVEAISCARN-DAATVSNFLQKNI---------- 362
            +DF+    P SN  N ILV VD  SK    I+  +  DA  + + L + I          
Sbjct: 1193 MDFVTGLPPTSNNLNMILVVVDRFSKRAHFIATRKTLDATQLIDLLFRYIFSYHGFPRTI 1252

Query: 363  -SDEGTHFINKIISNLLIKYNVHHRVATAYHPQTNGQAEISNRELKTILEKVVNTSRKDW 422
             SD            L  +  +   +++A HPQT+GQ+E + + L  +L    +T+ ++W
Sbjct: 1253 TSDRDVRMTADKYQELTKRLGIKSTMSSANHPQTDGQSERTIQTLNRLLRAYASTNIQNW 1312

Query: 423  ASKLNESLWAYRTAFKTPIGMSPYALVFGKACHLPLELEHKVLWAAKRLNMDLKAAGEAR 482
               L +  + Y +     +G SP+ +  G   + P       + A     ++L    +A 
Sbjct: 1313 HVYLPQIEFVYNSTPTRTLGKSPFEIDLGYLPNTPAIKSDDEVNARSFTAVELAKHLKAL 1372

Query: 483  QLQLNELEEWRLQAYENAKIYKERTKRWHDQHISKKSLYIGQKVLLFNSRLRLFPGKLKT 542
             +Q  E         E+A+I  E     ++Q      L IG  VL+           +K 
Sbjct: 1373 TIQTKE-------QLEHAQIEMETN---NNQRRKPLLLNIGDHVLVHRDAYFKKGAYMKV 1432

Query: 543  R--WSGPFVIKEIFPHGAVEL-MNEDGTNAFKVNGQRVKPY 546
            +  + GPF + +     A EL +N        +N Q +K +
Sbjct: 1433 QQIYVGPFRVVKKINDNAYELDLNSHKKKHRVINVQFLKKF 1444

BLAST of Clc03G10990 vs. ExPASy Swiss-Prot
Match: Q7LHG5 (Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-I PE=1 SV=2)

HSP 1 Score: 125.2 bits (313), Expect = 2.4e-27
Identity = 147/554 (26.53%), Postives = 232/554 (41.88%), Query Frame = 0

Query: 3    DASDYAMGAAL--VQRRDIMLHPIAYASKTFNTAQANYTTIEKELLAVVFAVEKFRAYLL 62
            DAS   +GA L  V  ++ ++  + Y SK+  +AQ NY   E ELL ++ A+  FR  L 
Sbjct: 919  DASKDGIGAVLEEVDNKNKLVGVVGYFSKSLESAQKNYPAGELELLGIIKALHHFRYMLH 978

Query: 63   GSKVIIHTDHSAIRYLMTKKDAKPRLIRWVLLFQEFDAKIVDRKGTKNNVADRLSRLENM 122
            G    + TDH ++  L  K +   R+ RW+     +D  +    G KN VAD +SR    
Sbjct: 979  GKHFTLRTDHISLLSLQNKNEPARRVQRWLDDLATYDFTLEYLAGPKNVVADAISRA--- 1038

Query: 123  EHDRKQLDVNVSFPDEAVLKVTEFAPWY--------ADIVNFLVCKQF---PED---FNT 182
                    +    P+ +    TE    Y        A +++     Q    PED   F +
Sbjct: 1039 --------IYTITPETSRPIDTESWKSYYKSDPLCSAVLIHMKELTQHNVTPEDMSAFRS 1098

Query: 183  QQKKKLIHDA--KFYYWDEPQLYKRGPDHIFRLCVPETSYQHILSQCHD-SPYGGHFGGQ 242
             QKK  + +   K Y  ++  +Y +      RL VP      ++   HD + +GGHFG  
Sbjct: 1099 YQKKLELSETFRKNYSLEDEMIYYQD-----RLVVPIKQQNAVMRLYHDHTLFGGHFGVT 1158

Query: 243  RTAPKVLQSGYFWPTLFRDAKDYAIRCDRCQRIGNISSRNE---MPLTSILEVELFDVWG 302
             T  K+    Y+WP L      Y   C +CQ I +   R      PL  I E    D+  
Sbjct: 1159 VTLAKI-SPIYYWPKLQHSIIQYIRTCVQCQLIKSHRPRLHGLLQPL-PIAEGRWLDI-S 1218

Query: 303  IDFMGPFQP-SNGHNYILVAVDYVSKWVEAISCARN-DAATVSNFLQKNI---------- 362
            +DF+    P SN  N ILV VD  SK    I+  +  DA  + + L + I          
Sbjct: 1219 MDFVTGLPPTSNNLNMILVVVDRFSKRAHFIATRKTLDATQLIDLLFRYIFSYHGFPRTI 1278

Query: 363  -SDEGTHFINKIISNLLIKYNVHHRVATAYHPQTNGQAEISNRELKTILEKVVNTSRKDW 422
             SD            L  +  +   +++A HPQT+GQ+E + + L  +L   V+T+ ++W
Sbjct: 1279 TSDRDVRMTADKYQELTKRLGIKSTMSSANHPQTDGQSERTIQTLNRLLRAYVSTNIQNW 1338

Query: 423  ASKLNESLWAYRTAFKTPIGMSPYALVFGKACHLPLELEHKVLWAAKRLNMDLKAAGEAR 482
               L +  + Y +     +G SP+ +  G   + P       + A     ++L    +A 
Sbjct: 1339 HVYLPQIEFVYNSTPTRTLGKSPFEIDLGYLPNTPAIKSDDEVNARSFTAVELAKHLKAL 1398

Query: 483  QLQLNELEEWRLQAYENAKIYKERTKRWHDQHISKKSLYIGQKVLLFNSRLRLFPGKLKT 519
             +Q  E         E+A+I  E     ++Q      L IG  VL+           +K 
Sbjct: 1399 TIQTKE-------QLEHAQIEMETN---NNQRRKPLLLNIGDHVLVHRDAYFKKGAYMKV 1443

BLAST of Clc03G10990 vs. ExPASy Swiss-Prot
Match: Q87040 (Pro-Pol polyprotein OS=Simian foamy virus (isolate chimpanzee) OX=298339 GN=pol PE=3 SV=1)

HSP 1 Score: 103.6 bits (257), Expect = 7.4e-21
Identity = 90/365 (24.66%), Postives = 158/365 (43.29%), Query Frame = 0

Query: 179  YYWDEPQLYKRGPDHIFRLCVPETSYQHILSQCHDSPYGGHFGGQRTAPKVLQSGYFWPT 238
            YY ++ ++    P+ + ++  P++  Q I+ Q H+     H G + T  K+    Y+WP 
Sbjct: 781  YYLEDGKVKVSRPEGV-KIIPPQSDRQKIVLQAHNL---AHTGREATLLKIANL-YWWPN 840

Query: 239  LFRDAKDYAIRCDRCQRIGNISSRNEMP-LTSILEVELFDVWGIDFMGPFQPSNGHNYIL 298
            + +D      RC +C  I N S++   P L      + FD + ID++GP  PS G+ Y+L
Sbjct: 841  MRKDVVKQLGRCKQC-LITNASNKTSGPILRPDRPQKPFDKFFIDYIGPLPPSQGYLYVL 900

Query: 299  VAVDYVS--KWVEAISCARNDAATVSNFLQKNI-------SDEGTHFINKIISNLLIKYN 358
            V VD ++   W+         A   S  +  +I       SD+G  F +   +    +  
Sbjct: 901  VIVDGMTGFTWLYPTKAPSTSATVKSLNVLTSIAIPKVIHSDQGAAFTSSTFAEWAKERG 960

Query: 359  VHHRVATAYHPQTNGQAEISNRELKTILEKVVNTSRKDWASKLNESLWAYRTAFKTPIGM 418
            +H   +T YHPQ++G+ E  N ++K +L K++      W   L     A    +   +  
Sbjct: 961  IHLEFSTPYHPQSSGKVERKNSDIKRLLTKLLVGRPTKWYDLLPVVQLALNNTYSPVLKY 1020

Query: 419  SPYALVFGKACHLPLELEHKVLWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIY 478
            +P+ L+FG   + P          A +  +DL      R+ +L+ L+E R   Y+     
Sbjct: 1021 TPHQLLFGIDSNTPF---------ANQDTLDL-----TREEELSLLQEIRASLYQ-PSTP 1080

Query: 479  KERTKRWHDQHISKKSLYIGQKVLLFNSRLRLFPGKLKTRWSGPFVIKEIFPHGAVELMN 534
               ++ W        S  +GQ V    +R    P  L+ RW  P  + E+     V +++
Sbjct: 1081 PASSRSW--------SPVVGQLVQERVAR----PASLRPRWHKPSTVLEVLNPRTVVILD 1112

BLAST of Clc03G10990 vs. ExPASy Swiss-Prot
Match: P04323 (Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 99.8 bits (247), Expect = 1.1e-19
Identity = 55/116 (47.41%), Postives = 72/116 (62.07%), Query Frame = 0

Query: 3   DASDYAMGAALVQRRDIMLHPIAYASKTFNTAQANYTTIEKELLAVVFAVEKFRAYLLGS 62
           DASD A+GA L Q      HP++Y S+T N  + NY+TIEKELLA+V+A + FR YLLG 
Sbjct: 514 DASDVALGAVLSQDG----HPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGR 573

Query: 63  KVIIHTDHSAIRYLMTKKDAKPRLIRWVLLFQEFDAKIVDRKGTKNNVADRLSRLE 119
              I +DH  + +L   KD   +L RW +   EFD  I   KG +N VAD LSR++
Sbjct: 574 HFEISSDHQPLSWLYRMKDPNSKLTRWRVKLSEFDFDIKYIKGKENCVADALSRIK 625

BLAST of Clc03G10990 vs. ExPASy TrEMBL
Match: A0A2G9FWY3 (Reverse transcriptase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_29952 PE=4 SV=1)

HSP 1 Score: 754.6 bits (1947), Expect = 3.0e-214
Identity = 360/571 (63.05%), Postives = 439/571 (76.88%), Query Frame = 0

Query: 1    MCDASDYAMGAALVQRRDIMLHPIAYASKTFNTAQANYTTIEKELLAVVFAVEKFRAYLL 60
            MCDASD+A+GA L QR+D +   I YASKT N AQ NYTT EKELLAVVFA +KFR+YL+
Sbjct: 1003 MCDASDFAVGAVLGQRKDKIFRSIYYASKTLNDAQLNYTTTEKELLAVVFAFDKFRSYLV 1062

Query: 61   GSKVIIHTDHSAIRYLMTKKDAKPRLIRWVLLFQEFDAKIVDRKGTKNNVADRLSRLENM 120
            G+KVI++TDH+AIRYL+ KKDAKPRLIRWVLL QEFD +I DRKGT+N +AD LSRLE+ 
Sbjct: 1063 GTKVIVYTDHAAIRYLIEKKDAKPRLIRWVLLLQEFDLEIRDRKGTENQIADHLSRLESP 1122

Query: 121  EHDRKQLDVNVSFPDEAVLK-VTEFAPWYADIVNFLVCKQFPEDFNTQQKKKLIHDAKFY 180
                +   +N +FPDE +L  V    PWYADIVN+L C   P D + QQKKK + D + Y
Sbjct: 1123 AKTDEPNLINDNFPDEQLLAIVASDVPWYADIVNYLTCGIIPFDLSAQQKKKFLFDTRRY 1182

Query: 181  YWDEPQLYKRGPDHIFRLCVPETSYQHILSQCHDSPYGGHFGGQRTAPKVLQSGYFWPTL 240
            +WD+P L+K+GPD+I R CVPE     IL QCH SPYGGHF G RTA K+LQSG+FWP L
Sbjct: 1183 FWDDPFLFKQGPDNILRRCVPEIEMNDILEQCHASPYGGHFHGDRTAAKILQSGFFWPNL 1242

Query: 241  FRDAKDYAIRCDRCQRIGNISSRNEMPLTSILEVELFDVWGIDFMGPFQPSNGHNYILVA 300
            F+DA  +   CDRCQR GNIS R+EMPL +ILEVELFDVWGIDFMGPF PS G+ YILVA
Sbjct: 1243 FKDAHSFVANCDRCQRTGNISRRHEMPLNTILEVELFDVWGIDFMGPFIPSFGNMYILVA 1302

Query: 301  VDYVSKWVEAISCARNDAATVSNFLQKN-----------ISDEGTHFINKIISNLLIKYN 360
            VDYVSKWVEA +   ND+  V NF++KN           ISD GTHF N+    LL KY 
Sbjct: 1303 VDYVSKWVEAAAVPNNDSKVVVNFIKKNIFTRFGTPRAIISDGGTHFCNRSFEALLSKYG 1362

Query: 361  VHHRVATAYHPQTNGQAEISNRELKTILEKVVNTSRKDWASKLNESLWAYRTAFKTPIGM 420
            V H+++T YHPQT+GQ E+SNRE+K ILEK V+++RKDW+ +L+E+LWAYRTA+KTPIGM
Sbjct: 1363 VKHKISTPYHPQTSGQVEVSNREIKRILEKTVSSTRKDWSKRLDEALWAYRTAYKTPIGM 1422

Query: 421  SPYALVFGKACHLPLELEHKVLWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIY 480
            SPY LVFGKACHLP+ELEH   WA ++LN D++AAGE R LQLNEL+E+RL AYENAKIY
Sbjct: 1423 SPYRLVFGKACHLPVELEHNAYWAIRKLNFDMQAAGEKRLLQLNELDEFRLHAYENAKIY 1482

Query: 481  KERTKRWHDQHISKKSLYIGQKVLLFNSRLRLFPGKLKTRWSGPFVIKEIFPHGAVELMN 540
            KE+ KRWH++ I ++    GQ VLLFNSRL+LFPGKLK+RWSGPF I E+FPHGAVEL N
Sbjct: 1483 KEKKKRWHEKKIVERHFEPGQYVLLFNSRLKLFPGKLKSRWSGPFRITEVFPHGAVELEN 1542

Query: 541  EDGTNAFKVNGQRVKPYFGDCLERDKVTVDL 560
            ++  N FKVN QR+K Y+G+ ++R   ++ L
Sbjct: 1543 KNSRNRFKVNAQRIKHYWGEIVDRQHASITL 1573

BLAST of Clc03G10990 vs. ExPASy TrEMBL
Match: A0A2K3LHD8 (Integrase catalytic domain-containing protein OS=Trifolium pratense OX=57577 GN=L195_g033907 PE=4 SV=1)

HSP 1 Score: 746.9 bits (1927), Expect = 6.2e-212
Identity = 358/557 (64.27%), Postives = 427/557 (76.66%), Query Frame = 0

Query: 1   MCDASDYAMGAALVQRRDIMLHPIAYASKTFNTAQANYTTIEKELLAVVFAVEKFRAYLL 60
           MCDASD A+GA L QR+D +LH I YAS   N AQ NY T EKELLAVV+A +KFR+YLL
Sbjct: 24  MCDASDIAVGAVLGQRKDKLLHVIYYASHVLNPAQLNYATTEKELLAVVYAFDKFRSYLL 83

Query: 61  GSKVIIHTDHSAIRYLMTKKDAKPRLIRWVLLFQEFDAKIVDRKGTKNNVADRLSRLENM 120
           GSKVI++TDH+A+RYL  K+++KPRL+RW+LL QEFD +I D+KG++N VAD LSRLE +
Sbjct: 84  GSKVIVYTDHAALRYLFAKQESKPRLLRWILLLQEFDLEIRDKKGSENTVADHLSRLEKV 143

Query: 121 EHDRKQLDVNVSFPDEAVLKVTEFAPWYADIVNFLVCKQFPEDFNTQQKKKLIHDAKFYY 180
               ++  +   F DE +L VT  APW+AD  N++V +  P DF  QQ+KK +HD KFY 
Sbjct: 144 VETEEERAIQDLFADEHILAVT-VAPWFADFANYMVGRTIPSDFTPQQRKKFLHDCKFYV 203

Query: 181 WDEPQLYKRGPDHIFRLCVPETSYQHILSQCHDSPYGGHFGGQRTAPKVLQSGYFWPTLF 240
           WDEP LYKRG D + R CVPE   + +L  CHDS YGGHF G RTA KVLQSG FWPTLF
Sbjct: 204 WDEPFLYKRGVDGLLRRCVPEGEQEKVLWHCHDSSYGGHFSGDRTAAKVLQSGLFWPTLF 263

Query: 241 RDAKDYAIRCDRCQRIGNISSRNEMPLTSILEVELFDVWGIDFMGPFQPSNGHNYILVAV 300
           +DA  Y  RCDRCQR GNIS RNEMP   ILEVE+FDVWGIDFMGPF  S    YILVAV
Sbjct: 264 KDAFTYVKRCDRCQRTGNISKRNEMPQNPILEVEIFDVWGIDFMGPFPSSYSKTYILVAV 323

Query: 301 DYVSKWVEAISCARNDAATVSNFLQKN-----------ISDEGTHFINKIISNLLIKYNV 360
           DYVSKWVEAI+   NDA  V  FL++N           ISDEGTHF+N+ +  LL KYNV
Sbjct: 324 DYVSKWVEAIATHTNDAQVVVAFLKRNIFSRFGVPRALISDEGTHFLNRKMEALLKKYNV 383

Query: 361 HHRVATAYHPQTNGQAEISNRELKTILEKVVNTSRKDWASKLNESLWAYRTAFKTPIGMS 420
           HHR+AT YHPQT+GQ E+SNR++K ILEK VN+SRKDW+ KL+++LWAYRTAFKTPIGMS
Sbjct: 384 HHRIATPYHPQTSGQVEVSNRQIKQILEKTVNSSRKDWSVKLDDALWAYRTAFKTPIGMS 443

Query: 421 PYALVFGKACHLPLELEHKVLWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIYK 480
           P+ +V+GKACHLPLELEHK LWA K LN DL  AGE+R LQL+EL+E+R  AYENAKI+K
Sbjct: 444 PFQIVYGKACHLPLELEHKALWATKFLNFDLSKAGESRILQLHELDEFRNYAYENAKIFK 503

Query: 481 ERTKRWHDQHISKKSLYIGQKVLLFNSRLRLFPGKLKTRWSGPFVIKEIFPHGAVELMNE 540
           E+TK+WHD+ I KK    GQ VLLFNSRLRLFPGKLK+RWSGPF IK++ PHGA+EL + 
Sbjct: 504 EKTKKWHDRKIQKKEFREGQLVLLFNSRLRLFPGKLKSRWSGPFKIKKVLPHGAIELEDR 563

Query: 541 DGTNAFKVNGQRVKPYF 547
           +    FKVNGQR+KPYF
Sbjct: 564 ETDQTFKVNGQRLKPYF 579

BLAST of Clc03G10990 vs. ExPASy TrEMBL
Match: A0A2G9HBV9 (DNA-directed DNA polymerase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_12579 PE=4 SV=1)

HSP 1 Score: 739.2 bits (1907), Expect = 1.3e-209
Identity = 355/571 (62.17%), Postives = 434/571 (76.01%), Query Frame = 0

Query: 1    MCDASDYAMGAALVQRRDIMLHPIAYASKTFNTAQANYTTIEKELLAVVFAVEKFRAYLL 60
            MCDASD+A+GA L QR+D +   I YASKT N AQ NYTT EKELLAVVFA +KFR+YL+
Sbjct: 702  MCDASDFAVGAVLGQRKDKIFRSIYYASKTLNDAQLNYTTTEKELLAVVFAFDKFRSYLV 761

Query: 61   GSKVIIHTDHSAIRYLMTKKDAKPRLIRWVLLFQEFDAKIVDRKGTKNNVADRLSRLENM 120
            G+KVI++TDH+AIRYL+ KKDA P LI WV L QEFD +I DRKGT+N +AD LSRLE+ 
Sbjct: 762  GTKVIVYTDHAAIRYLIEKKDANPWLILWVFLLQEFDLEIRDRKGTENQIADHLSRLESP 821

Query: 121  EHDRKQLDVNVSFPDEAVLK-VTEFAPWYADIVNFLVCKQFPEDFNTQQKKKLIHDAKFY 180
                +   +N +F DE +L  V    PWYADIVN+L C   P D + QQKKK++ D + Y
Sbjct: 822  AKIDESNLINDNFSDEQLLAIVASDVPWYADIVNYLTCGIIPFDLSAQQKKKILFDTRRY 881

Query: 181  YWDEPQLYKRGPDHIFRLCVPETSYQHILSQCHDSPYGGHFGGQRTAPKVLQSGYFWPTL 240
            +WD+  L+K+GPD+I R CVPE     IL QCH SPYGGHF G RTA K+LQSG+FWP L
Sbjct: 882  FWDDLFLFKQGPDNILRRCVPEMEMNDILEQCHASPYGGHFHGDRTAAKILQSGFFWPNL 941

Query: 241  FRDAKDYAIRCDRCQRIGNISSRNEMPLTSILEVELFDVWGIDFMGPFQPSNGHNYILVA 300
            F+DA  +   CDRCQR GNIS R+EMPL +ILEVELFDVWGIDFMG F PS G+ YILVA
Sbjct: 942  FKDANSFVANCDRCQRTGNISRRHEMPLNTILEVELFDVWGIDFMGLFVPSFGNMYILVA 1001

Query: 301  VDYVSKWVEAISCARNDAATVSNFLQKN-----------ISDEGTHFINKIISNLLIKYN 360
            VDYVSKWVEA++   ND+  V NF++KN           IS+ GTHF N+    LL KY 
Sbjct: 1002 VDYVSKWVEAVAVPNNDSKVVVNFIKKNIFTRFGTPRAIISNGGTHFCNRSFEALLSKYG 1061

Query: 361  VHHRVATAYHPQTNGQAEISNRELKTILEKVVNTSRKDWASKLNESLWAYRTAFKTPIGM 420
            V H+++T YHPQT+GQ E+SNRE+K ILEK V+++RKDW+ +L+E+LWAYRTAFKTPIGM
Sbjct: 1062 VKHKISTPYHPQTSGQVEVSNREIKRILEKTVSSTRKDWSKRLDEALWAYRTAFKTPIGM 1121

Query: 421  SPYALVFGKACHLPLELEHKVLWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYENAKIY 480
            SPY LVFGKACHLP+ELEH   WA ++LN D++AAGE R LQLNEL+E+RLQAYENAKIY
Sbjct: 1122 SPYKLVFGKACHLPVELEHNAYWAIRKLNFDMQAAGEKRLLQLNELDEFRLQAYENAKIY 1181

Query: 481  KERTKRWHDQHISKKSLYIGQKVLLFNSRLRLFPGKLKTRWSGPFVIKEIFPHGAVELMN 540
            KE+TKRWHD+ I ++    GQ VLLFNSRL+LFPGKLK+RWSGPF + E+F HGAVEL N
Sbjct: 1182 KEKTKRWHDKKIVERRFEPGQYVLLFNSRLKLFPGKLKSRWSGPFRVTEVFSHGAVELEN 1241

Query: 541  EDGTNAFKVNGQRVKPYFGDCLERDKVTVDL 560
            E+  N FKVN QR+K Y+G  ++R   ++ L
Sbjct: 1242 ENSKNRFKVNAQRIKHYWGGVIDRHHTSITL 1272

BLAST of Clc03G10990 vs. ExPASy TrEMBL
Match: A0A4Y1RSJ3 (Transposable element protein OS=Prunus dulcis OX=3755 GN=Prudu_018513 PE=4 SV=1)

HSP 1 Score: 727.2 bits (1876), Expect = 5.1e-206
Identity = 347/573 (60.56%), Postives = 429/573 (74.87%), Query Frame = 0

Query: 1   MCDASDYAMGAALVQRRDIMLHPIAYASKTFNTAQANYTTIEKELLAVVFAVEKFRAYLL 60
           MCDASDYA+GA L QR++ +LH I YAS+T N AQ NY T EKELLAVVFA++KFR+YLL
Sbjct: 12  MCDASDYAIGAVLGQRKNKLLHVIHYASRTLNDAQLNYATTEKELLAVVFALDKFRSYLL 71

Query: 61  GSKVIIHTDHSAIRYLMTKKDAKPRLIRWVLLFQEFDAKIVDRKGTKNNVADRLSRLENM 120
           G+KVI++TDH+A+++L+ KK+AKPRLIRWVLL QEFD +I D+KG++N VAD LSRL   
Sbjct: 72  GAKVIVYTDHAALKFLLAKKEAKPRLIRWVLLLQEFDIEIRDKKGSENVVADHLSRLVRE 131

Query: 121 EHDRKQL-DVNVSFPDE---AVLKVTEF-APWYADIVNFLVCKQFPEDFNTQQKKKLIHD 180
           +   + +  +  +FPDE   ++    EF  PWYAD VN+L C   P D +  QKKK +  
Sbjct: 132 DEVIEDIGPILETFPDEQLYSIYSAKEFITPWYADFVNYLACGILPPDMSFYQKKKFLSL 191

Query: 181 AKFYYWDEPQLYKRGPDHIFRLCVPETSYQHILSQCHDSPYGGHFGGQRTAPKVLQSGYF 240
            K YYWD+P L+K GPD + R CVPET    IL  CH    GGH+G  +T  KVLQSG+F
Sbjct: 192 VKHYYWDDPYLWKHGPDQVIRRCVPETEMADILLHCHTLACGGHYGASKTTAKVLQSGFF 251

Query: 241 WPTLFRDAKDYAIRCDRCQRIGNISSRNEMPLTSILEVELFDVWGIDFMGPFQPSNGHNY 300
           WPTLF+DA+D+  RCD CQR GNISSRN+MPL +ILEVELFDVWGIDFMGPF  S G+ Y
Sbjct: 252 WPTLFKDAQDFVARCDPCQRTGNISSRNQMPLNNILEVELFDVWGIDFMGPFPASYGNLY 311

Query: 301 ILVAVDYVSKWVEAISCARNDAATVSNFLQKN-----------ISDEGTHFINKIISNLL 360
           ILVAVDYVSKWVEA +   NDA  V  FL+KN           ISD GTHF N+  ++LL
Sbjct: 312 ILVAVDYVSKWVEAAALPTNDAKVVVRFLRKNIFTRFGVPRAIISDGGTHFCNRQFNSLL 371

Query: 361 IKYNVHHRVATAYHPQTNGQAEISNRELKTILEKVVNTSRKDWASKLNESLWAYRTAFKT 420
            KY + H+V+T YHPQT+GQ E+SNRELK ILEK V+ SRKDW+ KL+++LWAYRTAFK 
Sbjct: 372 AKYGITHKVSTPYHPQTSGQVEVSNRELKKILEKTVSASRKDWSLKLDDALWAYRTAFKA 431

Query: 421 PIGMSPYALVFGKACHLPLELEHKVLWAAKRLNMDLKAAGEARQLQLNELEEWRLQAYEN 480
           PIGMSPY LVFGKACHLP+ELEHK  WA K LN D+ +AGE R+LQLNELEE R ++YEN
Sbjct: 432 PIGMSPYRLVFGKACHLPVELEHKAFWAIKTLNFDMSSAGEKRKLQLNELEELRNESYEN 491

Query: 481 AKIYKERTKRWHDQHISKKSLYIGQKVLLFNSRLRLFPGKLKTRWSGPFVIKEIFPHGAV 540
           AKIYK+RTK+WHD+HI KK  Y+GQ VLL+NSRL+LFPGKL++RWSGPF +  ++P+G V
Sbjct: 492 AKIYKDRTKKWHDKHILKKEFYVGQSVLLYNSRLKLFPGKLRSRWSGPFTVLTVYPYGTV 551

Query: 541 ELMNEDGTNAFKVNGQRVKPYFGDCLERDKVTV 558
           E+ N+     FKVNG R+KPY       ++ T+
Sbjct: 552 EIKNDRDGTTFKVNGHRLKPYVAAAFLEEETTI 584

BLAST of Clc03G10990 vs. ExPASy TrEMBL
Match: A0A4Y1RS99 (Transposable element protein OS=Prunus dulcis OX=3755 GN=Prudu_018514 PE=4 SV=1)

HSP 1 Score: 727.2 bits (1876), Expect = 5.1e-206
Identity = 350/575 (60.87%), Postives = 429/575 (74.61%), Query Frame = 0

Query: 1   MCDASDYAMGAALVQRRDIMLHPIAYASKTFNTAQANYTTIEKELLAVVFAVEKFRAYLL 60
           MCDASDYA+GA L QR++ +LH I YAS+T N AQ NY T EKELLAVVFA++KFR+YLL
Sbjct: 325 MCDASDYAIGAVLGQRKNKLLHVIHYASRTLNDAQLNYATTEKELLAVVFALDKFRSYLL 384

Query: 61  GSKVIIHTDHSAIRYLMTKKDAKPRLIRWVLLFQEFDAKIVDRKGTKNNVADRLSRLENM 120
           G+KVI++TDH+A+++L+ KK+AKPRLIRWVLL QEFD +I D+KG++N VAD LSRL  +
Sbjct: 385 GAKVIVYTDHAALKFLLAKKEAKPRLIRWVLLLQEFDIEIRDKKGSENVVADHLSRL--V 444

Query: 121 EHDRKQLDVN---VSFPDE---AVLKVTEF-APWYADIVNFLVCKQFPEDFNTQQKKKLI 180
             D    DV     +FPDE   ++    EF  PWYAD VN+L C   P D +  QKKK +
Sbjct: 445 REDEVIEDVGPILETFPDEQLYSIYSAKEFITPWYADFVNYLACGILPPDMSFYQKKKFL 504

Query: 181 HDAKFYYWDEPQLYKRGPDHIFRLCVPETSYQHILSQCHDSPYGGHFGGQRTAPKVLQSG 240
              K YYWD+P L+K GPD + R CVPET    IL  CH    GGH+G  +T  KVLQSG
Sbjct: 505 SLVKHYYWDDPYLWKHGPDQVIRRCVPETEMADILLHCHTLACGGHYGASKTTAKVLQSG 564

Query: 241 YFWPTLFRDAKDYAIRCDRCQRIGNISSRNEMPLTSILEVELFDVWGIDFMGPFQPSNGH 300
           +FWPTLF+DA+D+  RCD CQR GNISSRN+MPL +ILEVELFDVWGIDFMGPF  S G+
Sbjct: 565 FFWPTLFKDAQDFVARCDPCQRTGNISSRNQMPLNNILEVELFDVWGIDFMGPFPASYGN 624

Query: 301 NYILVAVDYVSKWVEAISCARNDAATVSNFLQKN-----------ISDEGTHFINKIISN 360
            YILVAVDYVSKWVEA +   NDA  V  FL+KN           ISD GTHF N+  ++
Sbjct: 625 LYILVAVDYVSKWVEAAALPTNDAKVVVRFLRKNIFTRFGVPRAIISDGGTHFCNRQFNS 684

Query: 361 LLIKYNVHHRVATAYHPQTNGQAEISNRELKTILEKVVNTSRKDWASKLNESLWAYRTAF 420
           LL KY + H+V+T YHPQT+GQ E+SNRELK ILEK V+ SRKDW+ KL+++LWAYRTAF
Sbjct: 685 LLAKYGITHKVSTPYHPQTSGQVEVSNRELKKILEKTVSASRKDWSLKLDDALWAYRTAF 744

Query: 421 KTPIGMSPYALVFGKACHLPLELEHKVLWAAKRLNMDLKAAGEARQLQLNELEEWRLQAY 480
           K PIGMSPY LVFGKACHLP+ELEHK  WA K LN D+ +AGE R+LQLNELEE R ++Y
Sbjct: 745 KAPIGMSPYRLVFGKACHLPVELEHKAFWAIKTLNFDMSSAGEKRKLQLNELEELRNESY 804

Query: 481 ENAKIYKERTKRWHDQHISKKSLYIGQKVLLFNSRLRLFPGKLKTRWSGPFVIKEIFPHG 540
           ENAKIYK+RTK+WHD+HI KK  Y+GQ VLL+NSRL+LFPGKL++RWSGPF +  ++P+G
Sbjct: 805 ENAKIYKDRTKKWHDKHILKKEFYVGQSVLLYNSRLKLFPGKLRSRWSGPFTVLTVYPYG 864

Query: 541 AVELMNEDGTNAFKVNGQRVKPYFGDCLERDKVTV 558
            VE+ N+     FKVNG R+KPY       ++ T+
Sbjct: 865 TVEIKNDRDGTTFKVNGHRLKPYVAAAFLEEETTI 897

BLAST of Clc03G10990 vs. TAIR 10
Match: ATMG00750.1 (GAG/POL/ENV polyprotein )

HSP 1 Score: 85.1 bits (209), Expect = 1.9e-16
Identity = 38/71 (53.52%), Postives = 46/71 (64.79%), Query Frame = 0

Query: 229 VLQSGYFWPTLFRDAKDYAIRCDRCQRIGNISSRNEMPLTSILEVELFDVWGIDFM---- 288
           VLQ+G++WPT F+DA  +   CD CQR GN + RNEMP   ILEVE+FDVWGI FM    
Sbjct: 35  VLQAGFYWPTTFKDAHGFVSSCDACQRKGNFTKRNEMPQHFILEVEVFDVWGIYFMKKTI 94

Query: 289 ---GPFQPSNG 293
               P  P+ G
Sbjct: 95  FSWKPIHPNGG 105

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PIM97577.16.1e-21463.05DNA-directed DNA polymerase [Handroanthus impetiginosus][more]
XP_023874613.12.0e-21263.33uncharacterized protein LOC111987139 [Quercus suber][more]
PNX77934.11.3e-21164.27hypothetical protein L195_g033907 [Trifolium pratense][more]
XP_028962178.11.2e-20962.13LOW QUALITY PROTEIN: uncharacterized protein LOC114826270 [Malus domestica][more]
XP_031120206.12.7e-20961.12uncharacterized protein LOC116023351 [Ipomoea triloba][more]
Match NameE-valueIdentityDescription
P103949.4e-4025.64Retrovirus-related Pol polyprotein from transposon 412 OS=Drosophila melanogaste... [more]
Q993151.4e-2725.99Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
Q7LHG52.4e-2726.53Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
Q870407.4e-2124.66Pro-Pol polyprotein OS=Simian foamy virus (isolate chimpanzee) OX=298339 GN=pol ... [more]
P043231.1e-1947.41Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogast... [more]
Match NameE-valueIdentityDescription
A0A2G9FWY33.0e-21463.05Reverse transcriptase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_29952 PE=... [more]
A0A2K3LHD86.2e-21264.27Integrase catalytic domain-containing protein OS=Trifolium pratense OX=57577 GN=... [more]
A0A2G9HBV91.3e-20962.17DNA-directed DNA polymerase OS=Handroanthus impetiginosus OX=429701 GN=CDL12_125... [more]
A0A4Y1RSJ35.1e-20660.56Transposable element protein OS=Prunus dulcis OX=3755 GN=Prudu_018513 PE=4 SV=1[more]
A0A4Y1RS995.1e-20660.87Transposable element protein OS=Prunus dulcis OX=3755 GN=Prudu_018514 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
ATMG00750.11.9e-1653.52GAG/POL/ENV polyprotein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR041373Reverse transcriptase, RNase H-like domainPFAMPF17917RT_RNaseHcoord: 2..96
e-value: 1.7E-31
score: 108.6
NoneNo IPR availableGENE3D1.10.340.70coord: 162..256
e-value: 9.4E-18
score: 66.2
NoneNo IPR availableGENE3D3.10.20.370coord: 1..67
e-value: 6.3E-8
score: 34.5
NoneNo IPR availablePANTHERPTHR24559:SF373REVERSE TRANSCRIPTASE DOMAIN, RIBONUCLEASE H-LIKE DOMAIN PROTEIN-RELATEDcoord: 45..420
NoneNo IPR availablePANTHERPTHR24559TRANSPOSON TY3-I GAG-POL POLYPROTEINcoord: 45..420
NoneNo IPR availableCDDcd09274RNase_HI_RT_Ty3coord: 1..117
e-value: 6.31389E-58
score: 187.315
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 272..464
e-value: 1.3E-45
score: 157.1
IPR041588Integrase zinc-binding domainPFAMPF17921Integrase_H2C2coord: 199..255
e-value: 6.9E-12
score: 45.3
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 262..418
score: 13.655631
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 275..412
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 1..101

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Clc03G10990.1Clc03G10990.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0016779 nucleotidyltransferase activity