CmUC02G037100 (gene) Watermelon (USVL531) v1

Overview
Name: CmUC02G037100
Type: gene
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: CmU531Chr02: 23788434 .. 23789447 (-)
RNA-Seq Expression: CmUC02G037100
Synteny: CmUC02G037100
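The location string in the overview packs the sequence ID, coordinates and strand into one field. The following is a minimal sketch of how it could be split into its parts. The `Location` type and `parse_location` helper are hypothetical names introduced here for illustration, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Parsed genomic location (hypothetical helper type, not part of the database)."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings shaped like "CmU531Chr02: 23788434 .. 23789447 (-)".
_LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text: str) -> Location:
    """Split a location string into sequence ID, start, end and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("CmU531Chr02: 23788434 .. 23789447 (-)")
print(loc.seqid, loc.strand, loc.length)  # CmU531Chr02 - 1014
```

Under the inclusive-coordinate assumption the feature spans 1,014 bp on the minus strand of CmU531Chr02.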
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACAGAGAAGTCAGTTACCATTTCCACAGAGGAGTTTGCTAAATTTCAGGCGTACCAGCACTCATTGGAAGCATCATCTTCATCTAATCCTACTGCCACCATTGCTGACACAGGTAATACGAAATGTCTACTTACGTCATCTACCAAATGGGTCATAGACTCTGGTGTCATAGCTCATATGACAGGTAATTCTAACTTATTCTCCACACCATTGTCCCCTACCTCTTCCCCATCTGTCACTTTGGCATATGGCTCAACATCCTCTGTCCTTGGTTCTGGCACCATTAACCTTTCTCCATCCTTTTCTCTGTCTTTTGTGTTACATTTGCCTCAACTATCCTTTAATTTAATTTCTGTTAGCCAACTTACTCATGACCTTAACTGTGTTGTCTCGTCTTTTCCTGGTTATTGCTTGTTTTAGGATCGTATGATGAAGAAGATTATTGGTAGAGGATATGAGTCAGGGGGCCTTTATCTTTTTGATCACTAGATACCGAAAGTTGTGGTTTGCTCAGGAGTTACATCTCCGTTTAAAGTTCATTGTCTTTTGGGTCATCCATCTCTGTTCATGTTGAAGAAACTATATCCAGAATTTCGATCTTTGTCCTCTTTGAATTGTGATTTGTGTCAGTTCGCTAAATTTCATCTTCTTAGTTCTAGTCCTAGAGTCAATAAACGAGCAAGTGCTCCATTTGAATTAGTTCATTTTGATATTTGGGGTCCGTGCCCAGTCGTTTCCCAAACAGGTTTTCATTACTTTGTTACTTTTGTTGACAATTATTCTCGTTTGACTTGGTTATACTTCATGAAAAATCATTCTGAATTATTATCTCACTTTTGTGCTTTTCATGCTGAAATTCAAAATCAGTTCAATGTTTCTATCAAAACTTTGCGCACTGATAATGCTGGCGAATATTTTTCTAATGTGCTTGGATCGTACTTAAGTGAACATGGCATCATTCATCAATCATCTTGTGCAGACACTCCATCCCAAAATGGGGTTGCTAAATGA

mRNA sequence

ATGACAGAGAAGTCAGTTACCATTTCCACAGAGGAGTTTGCTAAATTTCAGGCGTACCAGCACTCATTGGAAGCATCATCTTCATCTAATCCTACTGCCACCATTGCTGACACAGGTAATACGAAATGTCTACTTACGTCATCTACCAAATGGGTCATAGACTCTGGTGTCATAGCTCATATGACAGGTAATTCTAACTTATTCTCCACACCATTGTCCCCTACCTCTTCCCCATCTGTCACTTTGGCATATGGCTCAACATCCTCTGTCCTTGGTTCTGGCACCATTAACCTTTCTCCATCCTTTTCTCTGTCTTTTGTGTTACATTTGCCTCAACTATCCTTTAATTTAATTTCTGTTAGCCAACTTACTCATGACCTTAACTGTGTTGTCTCGTCTTTTCCTGGAGTTACATCTCCGTTTAAAGTTCATTGTCTTTTGGGTCATCCATCTCTGTTCATGTTGAAGAAACTATATCCAGAATTTCGATCTTTGTCCTCTTTGAATTGTGATTTGTGTCAGTTCGCTAAATTTCATCTTCTTAGTTCTAGTCCTAGAGTCAATAAACGAGCAAGTGCTCCATTTGAATTAGTTCATTTTGATATTTGGGGTCCGTGCCCAGTCGTTTCCCAAACAGGTTTTCATTACTTTGTTACTTTTGTTGACAATTATTCTCGTTTGACTTGGTTATACTTCATGAAAAATCATTCTGAATTATTATCTCACTTTTGTGCTTTTCATGCTGAAATTCAAAATCAGTTCAATGTTTCTATCAAAACTTTGCGCACTGATAATGCTGGCGAATATTTTTCTAATGTGCTTGGATCGTACTTAAGTGAACATGGCATCATTCATCAATCATCTTGTGCAGACACTCCATCCCAAAATGGGGTTGCTAAATGA
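The gene sequence is longer than the mRNA sequence, which is what the "with intron" label suggests. The sketch below shows one way the removed segment could be located by comparing the two strings. It assumes exactly one contiguous segment was removed and prefers a candidate with the canonical GT...AG boundaries when several positions are equally valid; `locate_single_intron` is a hypothetical helper, and the short sequences in the example are toy data rather than the sequences of this record.

```python
def locate_single_intron(genomic: str, mrna: str) -> tuple[int, int]:
    """Return the 0-based, half-open (start, end) of the one genomic segment absent from mrna.

    Assumption: the mRNA equals the genomic sequence with a single contiguous
    segment removed. Raises ValueError when that does not hold.
    """
    gap = len(genomic) - len(mrna)
    if gap <= 0:
        raise ValueError("genomic sequence must be longer than the mRNA")

    # Every start position whose removal reproduces the mRNA exactly.
    candidates = [
        start
        for start in range(len(mrna) + 1)
        if genomic[:start] + genomic[start + gap:] == mrna
    ]
    if not candidates:
        raise ValueError("mRNA is not the genomic sequence minus one segment")

    # Boundaries can be ambiguous by a base or two; prefer canonical GT...AG.
    for start in candidates:
        segment = genomic[start:start + gap]
        if segment.startswith("GT") and segment.endswith("AG"):
            return start, start + gap
    return candidates[0], candidates[0] + gap


# Toy example: two exons separated by a 12-base intron.
toy_genomic = "ATGGCC" + "GTAAGTTTTCAG" + "GATTAA"
toy_mrna = "ATGGCCGATTAA"
start, end = locate_single_intron(toy_genomic, toy_mrna)
print(start, end, toy_genomic[start:end])  # 6 18 GTAAGTTTTCAG
```

The exhaustive candidate scan is quadratic in sequence length, which is acceptable for a gene of about one kilobase but would need a smarter alignment for long or multi-intron genes.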

Coding sequence (CDS)

ATGACAGAGAAGTCAGTTACCATTTCCACAGAGGAGTTTGCTAAATTTCAGGCGTACCAGCACTCATTGGAAGCATCATCTTCATCTAATCCTACTGCCACCATTGCTGACACAGGTAATACGAAATGTCTACTTACGTCATCTACCAAATGGGTCATAGACTCTGGTGTCATAGCTCATATGACAGGTAATTCTAACTTATTCTCCACACCATTGTCCCCTACCTCTTCCCCATCTGTCACTTTGGCATATGGCTCAACATCCTCTGTCCTTGGTTCTGGCACCATTAACCTTTCTCCATCCTTTTCTCTGTCTTTTGTGTTACATTTGCCTCAACTATCCTTTAATTTAATTTCTGTTAGCCAACTTACTCATGACCTTAACTGTGTTGTCTCGTCTTTTCCTGGAGTTACATCTCCGTTTAAAGTTCATTGTCTTTTGGGTCATCCATCTCTGTTCATGTTGAAGAAACTATATCCAGAATTTCGATCTTTGTCCTCTTTGAATTGTGATTTGTGTCAGTTCGCTAAATTTCATCTTCTTAGTTCTAGTCCTAGAGTCAATAAACGAGCAAGTGCTCCATTTGAATTAGTTCATTTTGATATTTGGGGTCCGTGCCCAGTCGTTTCCCAAACAGGTTTTCATTACTTTGTTACTTTTGTTGACAATTATTCTCGTTTGACTTGGTTATACTTCATGAAAAATCATTCTGAATTATTATCTCACTTTTGTGCTTTTCATGCTGAAATTCAAAATCAGTTCAATGTTTCTATCAAAACTTTGCGCACTGATAATGCTGGCGAATATTTTTCTAATGTGCTTGGATCGTACTTAAGTGAACATGGCATCATTCATCAATCATCTTGTGCAGACACTCCATCCCAAAATGGGGTTGCTAAATGA

Protein sequence

MTEKSVTISTEEFAKFQAYQHSLEASSSSNPTATIADTGNTKCLLTSSTKWVIDSGVIAHMTGNSNLFSTPLSPTSSPSVTLAYGSTSSVLGSGTINLSPSFSLSFVLHLPQLSFNLISVSQLTHDLNCVVSSFPGVTSPFKVHCLLGHPSLFMLKKLYPEFRSLSSLNCDLCQFAKFHLLSSSPRVNKRASAPFELVHFDIWGPCPVVSQTGFHYFVTFVDNYSRLTWLYFMKNHSELLSHFCAFHAEIQNQFNVSIKTLRTDNAGEYFSNVLGSYLSEHGIIHQSSCADTPSQNGVAK
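The protein sequence can be related to the coding sequence by translating codons in frame from the first base. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record does not name explicitly; `translate` is a hypothetical helper, and translation stops at the first stop codon.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases are reported as "X".
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the coding sequence shown above.
print(translate("ATGACAGAGAAGTCAGTTACCATTTCCACA"))  # MTEKSVTIST
```

The printed residues match the first ten residues of the protein sequence listed above.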
Homology
BLAST of CmUC02G037100 vs. NCBI nr
Match: XP_031744753.1 (uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus])

HSP 1 Score: 466.1 bits (1198), Expect = 2.3e-127
Identity = 247/335 (73.73%), Positives = 262/335 (78.21%), Query Frame = 0

Query: 3   EKSVTISTEEFAKFQAYQHSLEASSSSNPTATIADTGNTKCLLTSSTKWVIDSGVIAHMT 62
           E SVTIS +EFAKFQ YQ SL+ASSSS P A+    GN KCLLTSSTKWVIDSG  AHMT
Sbjct: 282 EASVTISADEFAKFQNYQESLQASSSSTPIASTVAPGNIKCLLTSSTKWVIDSGATAHMT 341

Query: 63  GNSNLFSTPLSPTSSPSVTLAYGSTSSVLGSGTINLSPSFSLSFVLHLPQLSFNLISVSQ 122
           GNS+LFS PLSP   PSVTLA GSTSSVLGSGTI+L+PSFSLS VLHLP LSFNLIS SQ
Sbjct: 342 GNSHLFSRPLSPAPFPSVTLADGSTSSVLGSGTIHLTPSFSLSSVLHLPNLSFNLISTSQ 401

Query: 123 LTHDLNCVVSSFPG-------------------------------------VTSPFKVHC 182
           LTHDLNCVV  F G                                     V SPF+VHC
Sbjct: 402 LTHDLNCVVMFFSGYCLFQDRVTKKIIGRGYESGGLYLFDHQVSQAVACPVVPSPFEVHC 461

Query: 183 LLGHPSLFMLKKLYPEFRSLSSLNCDLCQFAKFHLLSSSPRVNKRASAPFELVHFDIWGP 242
            LGHPSLF+LKKLYPEFRSLSSLNCD CQFAKFH LSSSPRV+KRA APFELVH DIWGP
Sbjct: 462 RLGHPSLFVLKKLYPEFRSLSSLNCDSCQFAKFHRLSSSPRVDKRAIAPFELVHSDIWGP 521

Query: 243 CPVVSQTGFHYFVTFVDNYSRLTWLYFMKNHSELLSHFCAFHAEIQNQFNVSIKTLRTDN 301
           CPVVSQTGF YFVTFVD++SRLTWLY MKN SELLSHFCAFH EI+NQFNVSIKTLRTDN
Sbjct: 522 CPVVSQTGFRYFVTFVDDHSRLTWLYLMKNRSELLSHFCAFHTEIKNQFNVSIKTLRTDN 581

BLAST of CmUC02G037100 vs. NCBI nr
Match: RVX08145.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 329.7 bits (844), Expect = 2.6e-86
Identity = 175/336 (52.08%), Positives = 226/336 (67.26%), Query Frame = 0

Query: 2   TEKSVTISTEEFAKFQAYQHSLEASSSSNPTATIADTGNTKCLLTSSTKWVIDSGVIAHM 61
           ++K VT++ EEF+K+  YQ +L+AS+   P + +A++G T CL++SS KW+IDSG   HM
Sbjct: 195 SDKIVTMTAEEFSKYSQYQDALKAST---PVSALAESGKT-CLVSSSNKWIIDSGATNHM 254

Query: 62  TGNSNLFSTPLSPTSSPSVTLAYGSTSSVLGSGTINLSPSFSLSFVLHLPQLSFNLISVS 121
           TGN   FST     S+P VT+A GST  + GSGT+  + S +LS VL+LP L+FNLISVS
Sbjct: 255 TGNHKTFST-FRTHSAPPVTIADGSTYEIKGSGTVKPTSSITLSSVLNLPNLAFNLISVS 314

Query: 122 QLTHDLNCVVSSFP-------------------------------------GVTSPFKVH 181
           +LT +LNC VS FP                                        SP + H
Sbjct: 315 KLTKNLNCSVSFFPDHCVFQDLMTKRTFGKGHVSDGLYILDEWVPRPVACVSTASPVEAH 374

Query: 182 CLLGHPSLFMLKKLYPEFRSLSSLNCDLCQFAKFHLLSSSPRVNKRASAPFELVHFDIWG 241
           C LGHPSL +LKKL P+F +L SL+C+ C FAK H  S  PR+NKR  + FELVH D+WG
Sbjct: 375 CRLGHPSLPVLKKLCPQFDTLPSLDCESCHFAKHHRSSLGPRLNKRVESLFELVHSDVWG 434

Query: 242 PCPVVSQTGFHYFVTFVDNYSRLTWLYFMKNHSELLSHFCAFHAEIQNQFNVSIKTLRTD 301
           PCPV SQTGF YFVTFVD++SR+TW+YFMKN SE+ SHFCAF AEI+ Q++VS+K LR+D
Sbjct: 435 PCPVTSQTGFRYFVTFVDDFSRMTWIYFMKNRSEVFSHFCAFSAEIKTQYDVSVKILRSD 494

BLAST of CmUC02G037100 vs. NCBI nr
Match: ABY49842.1 (hypothetical protein [Vitis hybrid cultivar])

HSP 1 Score: 328.6 bits (841), Expect = 5.8e-86
Identity = 175/336 (52.08%), Positives = 225/336 (66.96%), Query Frame = 0

Query: 2   TEKSVTISTEEFAKFQAYQHSLEASSSSNPTATIADTGNTKCLLTSSTKWVIDSGVIAHM 61
           + K VT++ EEF+K+  YQ +L+AS+   P + +A++G T CL++SS KW+IDSG   HM
Sbjct: 287 SNKIVTMTAEEFSKYSQYQDALKAST---PVSALAESGKT-CLVSSSNKWIIDSGATDHM 346

Query: 62  TGNSNLFSTPLSPTSSPSVTLAYGSTSSVLGSGTINLSPSFSLSFVLHLPQLSFNLISVS 121
           TGN   FST     S+P VT+A GST  + GSGT+  + S +LS VL+LP L+FNLISVS
Sbjct: 347 TGNHKTFST-FRTHSAPPVTVADGSTYEIKGSGTVKPTSSITLSSVLNLPNLAFNLISVS 406

Query: 122 QLTHDLNCVVSSFP-------------------------------------GVTSPFKVH 181
           +LT +LNC VS FP                                        SP + H
Sbjct: 407 KLTKNLNCSVSFFPDHCVFQDLMTKRTFGKGHVSDGLYILDEWVPRPVACVSTASPVEAH 466

Query: 182 CLLGHPSLFMLKKLYPEFRSLSSLNCDLCQFAKFHLLSSSPRVNKRASAPFELVHFDIWG 241
           C LGHPSL +LKKL P+F +L SL+C+ C FAK H  S  PR+NKRA + FELVH D+WG
Sbjct: 467 CRLGHPSLPVLKKLCPQFDTLPSLDCESCHFAKHHRSSLGPRLNKRAESLFELVHSDVWG 526

Query: 242 PCPVVSQTGFHYFVTFVDNYSRLTWLYFMKNHSELLSHFCAFHAEIQNQFNVSIKTLRTD 301
           PCPV SQTGF YFVTFVD++SR+TW+YFMKN SE+ SHFCAF AEI+ Q++VS+K LR+D
Sbjct: 527 PCPVTSQTGFRYFVTFVDDFSRMTWIYFMKNRSEVFSHFCAFSAEIKTQYDVSVKILRSD 586

BLAST of CmUC02G037100 vs. NCBI nr
Match: RVW69134.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 328.6 bits (841), Expect = 5.8e-86
Identity = 175/336 (52.08%), Positives = 225/336 (66.96%), Query Frame = 0

Query: 2   TEKSVTISTEEFAKFQAYQHSLEASSSSNPTATIADTGNTKCLLTSSTKWVIDSGVIAHM 61
           + K VT++ EEF+K+  YQ +L+AS+   P + +A++G T CL++SS KW+IDSG   HM
Sbjct: 226 SNKIVTMTAEEFSKYSQYQDALKAST---PVSALAESGKT-CLVSSSNKWIIDSGATDHM 285

Query: 62  TGNSNLFSTPLSPTSSPSVTLAYGSTSSVLGSGTINLSPSFSLSFVLHLPQLSFNLISVS 121
           TGN   FST     S+P VT+A GST  + GSGT+  + S +LS VL+LP L+FNLISVS
Sbjct: 286 TGNHKTFST-FRTHSAPPVTVADGSTYEIKGSGTVKPTSSITLSSVLNLPNLAFNLISVS 345

Query: 122 QLTHDLNCVVSSFP-------------------------------------GVTSPFKVH 181
           +LT +LNC VS FP                                        SP + H
Sbjct: 346 KLTKNLNCSVSFFPDHCVFQDLMTKRTFGKGHVSDGLYILDEWVPRPVACVSTASPVEAH 405

Query: 182 CLLGHPSLFMLKKLYPEFRSLSSLNCDLCQFAKFHLLSSSPRVNKRASAPFELVHFDIWG 241
           C LGHPSL +LKKL P+F +L SL+C+ C FAK H  S  PR+NKRA + FELVH D+WG
Sbjct: 406 CRLGHPSLPVLKKLCPQFDTLPSLDCESCHFAKHHRSSLGPRLNKRAESLFELVHSDVWG 465

Query: 242 PCPVVSQTGFHYFVTFVDNYSRLTWLYFMKNHSELLSHFCAFHAEIQNQFNVSIKTLRTD 301
           PCPV SQTGF YFVTFVD++SR+TW+YFMKN SE+ SHFCAF AEI+ Q++VS+K LR+D
Sbjct: 466 PCPVTSQTGFRYFVTFVDDFSRMTWIYFMKNRSEVFSHFCAFSAEIKTQYDVSVKILRSD 525

BLAST of CmUC02G037100 vs. NCBI nr
Match: RVW38649.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 327.8 bits (839), Expect = 9.9e-86
Identity = 175/336 (52.08%), Positives = 224/336 (66.67%), Query Frame = 0

Query: 2   TEKSVTISTEEFAKFQAYQHSLEASSSSNPTATIADTGNTKCLLTSSTKWVIDSGVIAHM 61
           + K VT++ EEF+K+  YQ +L+AS+   P + +A++G T CL++SS KW+IDSG   HM
Sbjct: 286 SNKIVTMTAEEFSKYSQYQDALKAST---PVSALAESGKT-CLVSSSNKWIIDSGATDHM 345

Query: 62  TGNSNLFSTPLSPTSSPSVTLAYGSTSSVLGSGTINLSPSFSLSFVLHLPQLSFNLISVS 121
           TGN   FST     S+P VT+A GST  + GSGT+  + S +LS VL+LP L+FNLISVS
Sbjct: 346 TGNHKTFST-FRTHSAPPVTVADGSTYEIKGSGTVKPTSSITLSSVLNLPNLAFNLISVS 405

Query: 122 QLTHDLNCVVSSFP-------------------------------------GVTSPFKVH 181
           +LT +LNC VS FP                                        SP + H
Sbjct: 406 KLTKNLNCSVSFFPNHCVFQDLMTKRTFGKGHVSDGLYILDEWVPRPVACVSTASPVEAH 465

Query: 182 CLLGHPSLFMLKKLYPEFRSLSSLNCDLCQFAKFHLLSSSPRVNKRASAPFELVHFDIWG 241
           C LGHPSL  LKKL P+F +L SL+C+ C FAK H  S  PR+NKRA + FELVH D+WG
Sbjct: 466 CRLGHPSLPALKKLCPQFDTLPSLDCESCHFAKHHRSSLGPRLNKRAESLFELVHSDVWG 525

Query: 242 PCPVVSQTGFHYFVTFVDNYSRLTWLYFMKNHSELLSHFCAFHAEIQNQFNVSIKTLRTD 301
           PCPV SQTGF YFVTFVD++SR+TW+YFMKN SE+ SHFCAF AEI+ Q++VS+K LR+D
Sbjct: 526 PCPVTSQTGFRYFVTFVDDFSRMTWIYFMKNRSEVFSHFCAFSAEIKTQYDVSVKILRSD 585

BLAST of CmUC02G037100 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 119.0 bits (297), Expect = 9.1e-26
Identity = 95/304 (31.25%), Positives = 141/304 (46.38%), Query Frame = 0

Query: 47  SSTKWVIDSGVIAHMTGNSNLFSTPLSPTSSPSVTLAYGSTSSVLGSGTINL---SPSFS 106
           ++  W++DSG   H+T + N  S     T    V +A GST  +  +G+ +L   S S  
Sbjct: 306 NANNWLLDSGATHHITSDFNNLSFHQPYTGGDDVMIADGSTIPITHTGSASLPTSSRSLD 365

Query: 107 LSFVLHLPQLSFNLISVSQLTH----------------DLNCVVSSFPGVT--------- 166
           L+ VL++P +  NLISV +L +                DLN  V    G T         
Sbjct: 366 LNKVLYVPNIHKNLISVYRLCNTNRVSVEFFPASFQVKDLNTGVPLLQGKTKDELYEWPI 425

Query: 167 ----------SPFKV------HCLLGHPSLFMLKKLYPEFR------SLSSLNCDLCQFA 226
                     SP         H  LGHPSL +L  +           S   L+C  C   
Sbjct: 426 ASSQAVSMFASPCSKATHSSWHSRLGHPSLAILNSVISNHSLPVLNPSHKLLSCSDCFIN 485

Query: 227 KFHLLSSSPRVNKRASAPFELVHFDIWGPCPVVSQTGFHYFVTFVDNYSRLTWLYFMKNH 286
           K H +  S      +S P E ++ D+W   P++S   + Y+V FVD+++R TWLY +K  
Sbjct: 486 KSHKVPFS-NSTITSSKPLEYIYSDVWS-SPILSIDNYRYYVIFVDHFTRYTWLYPLKQK 545

Query: 287 SELLSHFCAFHAEIQNQFNVSIKTLRTDNAGEYFSNVLGSYLSEHGIIHQSSCADTPSQN 301
           S++   F  F + ++N+F   I TL +DN GE+   VL  YLS+HGI H +S   TP  N
Sbjct: 546 SQVKDTFIIFKSLVENRFQTRIGTLYSDNGGEFV--VLRDYLSQHGISHFTSPPHTPEHN 605

BLAST of CmUC02G037100 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 111.7 bits (278), Expect = 1.5e-23
Identity = 91/305 (29.84%), Positives = 140/305 (45.90%), Query Frame = 0

Query: 47  SSTKWVIDSGVIAHMTGNSNLFSTPLSPTSSPSVTLAYGSTSSVLGSGTINLSPS---FS 106
           SS  W++DSG   H+T + N  S     T    V +A GST  +  +G+ +LS      +
Sbjct: 327 SSNNWLLDSGATHHITSDFNNLSLHQPYTGGDDVMVADGSTIPISHTGSTSLSTKSRPLN 386

Query: 107 LSFVLHLPQLSFNLISVSQLTH----------------DLNCVVSSFPGVT--------- 166
           L  +L++P +  NLISV +L +                DLN  V    G T         
Sbjct: 387 LHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNTGVPLLQGKTKDELYEWPI 446

Query: 167 ----------SPFKV------HCLLGHPSLFMLKKLYPEFRSLSSLN-------CDLCQF 226
                     SP         H  LGHP+  +L  +   + SLS LN       C  C  
Sbjct: 447 ASSQPVSLFASPSSKATHSSWHARLGHPAPSILNSVISNY-SLSVLNPSHKFLSCSDCLI 506

Query: 227 AKFHLLSSSPRVNKRASAPFELVHFDIWGPCPVVSQTGFHYFVTFVDNYSRLTWLYFMKN 286
            K + +  S +    ++ P E ++ D+W   P++S   + Y+V FVD+++R TWLY +K 
Sbjct: 507 NKSNKVPFS-QSTINSTRPLEYIYSDVWS-SPILSHDNYRYYVIFVDHFTRYTWLYPLKQ 566

Query: 287 HSELLSHFCAFHAEIQNQFNVSIKTLRTDNAGEYFSNVLGSYLSEHGIIHQSSCADTPSQ 301
            S++   F  F   ++N+F   I T  +DN GE+ +  L  Y S+HGI H +S   TP  
Sbjct: 567 KSQVKETFITFKNLLENRFQTRIGTFYSDNGGEFVA--LWEYFSQHGISHLTSPPHTPEH 626

BLAST of CmUC02G037100 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 101.7 bits (252), Expect = 1.5e-20
Identity = 82/297 (27.61%), Positives = 127/297 (42.76%), Query Frame = 0

Query: 49  TKWVIDSGVIAHMTGNSNLFSTPLSPTSSPSVTLAYGSTSSVLGSGTI----NLSPSFSL 108
           ++WV+D+    H T   +LF   ++     +V +   S S + G G I    N+  +  L
Sbjct: 292 SEWVVDTAASHHATPVRDLFCRYVAGDFG-TVKMGNTSYSKIAGIGDICIKTNVGCTLVL 351

Query: 109 SFVLHLPQLSFNLISVSQLTHD---------------LNCVVSSFPGVTSPFKV------ 168
             V H+P L  NLIS   L  D                + V++      + ++       
Sbjct: 352 KDVRHVPDLRMNLISGIALDRDGYESYFANQKWRLTKGSLVIAKGVARGTLYRTNAEICQ 411

Query: 169 ---------------HCLLGHPS-----LFMLKKLYPEFRSLSSLNCDLCQFAKFHLLSS 228
                          H  +GH S     +   K L    +  +   CD C F K H +S 
Sbjct: 412 GELNAAQDEISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSF 471

Query: 229 SPRVNKRASAPFELVHFDIWGPCPVVSQTGFHYFVTFVDNYSRLTWLYFMKNHSELLSHF 288
                ++ +   +LV+ D+ GP  + S  G  YFVTF+D+ SR  W+Y +K   ++   F
Sbjct: 472 QTSSERKLNI-LDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVF 531

Query: 289 CAFHAEIQNQFNVSIKTLRTDNAGEYFSNVLGSYLSEHGIIHQSSCADTPSQNGVAK 301
             FHA ++ +    +K LR+DN GEY S     Y S HGI H+ +   TP  NGVA+
Sbjct: 532 QKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAE 586

BLAST of CmUC02G037100 vs. ExPASy Swiss-Prot
Match: Q12491 (Transposon Ty2-B Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY2B-B PE=3 SV=1)

HSP 1 Score: 65.9 bits (159), Expect = 9.2e-10
Identity = 46/175 (26.29%), Positives = 81/175 (46.29%), Query Frame = 0

Query: 143 VHCLLGHP------------SLFMLKKLYPEFRSLSSLNCDLCQFAKF--HLLSSSPRVN 202
           +H +LGH             ++  LK+   E+ + S+  C  C   K   H      R+ 
Sbjct: 594 IHRMLGHANFRSIQKSLKKNAVTYLKESDIEWSNASTYQCPDCLIGKSTKHRHVKGSRLK 653

Query: 203 KRAS-APFELVHFDIWGPCPVVSQTGFHYFVTFVDNYSRLTWLYFMKNHSE--LLSHFCA 262
            + S  PF+ +H DI+GP   + ++   YF++F D  +R  W+Y + +  E  +L+ F +
Sbjct: 654 YQESYEPFQYLHTDIFGPVHHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESILNVFTS 713

Query: 263 FHAEIQNQFNVSIKTLRTDNAGEYFSNVLGSYLSEHGIIHQSSCADTPSQNGVAK 301
             A I+NQFN  +  ++ D   EY +  L  + +  GI    +       +GVA+
Sbjct: 714 ILAFIKNQFNARVLVIQMDRGSEYTNKTLHKFFTNRGITACYTTTADSRAHGVAE 768

BLAST of CmUC02G037100 vs. ExPASy Swiss-Prot
Match: P25384 (Transposon Ty2-C Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY2B-C PE=3 SV=2)

HSP 1 Score: 65.9 bits (159), Expect = 9.2e-10
Identity = 46/175 (26.29%), Positives = 81/175 (46.29%), Query Frame = 0

Query: 143 VHCLLGHP------------SLFMLKKLYPEFRSLSSLNCDLCQFAKF--HLLSSSPRVN 202
           +H +LGH             ++  LK+   E+ + S+  C  C   K   H      R+ 
Sbjct: 594 IHRMLGHANFRSIQKSLKKNAVTYLKESDIEWSNASTYQCPDCLIGKSTKHRHVKGSRLK 653

Query: 203 KRAS-APFELVHFDIWGPCPVVSQTGFHYFVTFVDNYSRLTWLYFMKNHSE--LLSHFCA 262
            + S  PF+ +H DI+GP   + ++   YF++F D  +R  W+Y + +  E  +L+ F +
Sbjct: 654 YQESYEPFQYLHTDIFGPVHHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESILNVFTS 713

Query: 263 FHAEIQNQFNVSIKTLRTDNAGEYFSNVLGSYLSEHGIIHQSSCADTPSQNGVAK 301
             A I+NQFN  +  ++ D   EY +  L  + +  GI    +       +GVA+
Sbjct: 714 ILAFIKNQFNARVLVIQMDRGSEYTNKTLHKFFTNRGITACYTTTADSRAHGVAE 768

BLAST of CmUC02G037100 vs. ExPASy TrEMBL
Match: A0A438JGR5 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_4093 PE=4 SV=1)

HSP 1 Score: 329.7 bits (844), Expect = 1.3e-86
Identity = 175/336 (52.08%), Positives = 226/336 (67.26%), Query Frame = 0

Query: 2   TEKSVTISTEEFAKFQAYQHSLEASSSSNPTATIADTGNTKCLLTSSTKWVIDSGVIAHM 61
           ++K VT++ EEF+K+  YQ +L+AS+   P + +A++G T CL++SS KW+IDSG   HM
Sbjct: 195 SDKIVTMTAEEFSKYSQYQDALKAST---PVSALAESGKT-CLVSSSNKWIIDSGATNHM 254

Query: 62  TGNSNLFSTPLSPTSSPSVTLAYGSTSSVLGSGTINLSPSFSLSFVLHLPQLSFNLISVS 121
           TGN   FST     S+P VT+A GST  + GSGT+  + S +LS VL+LP L+FNLISVS
Sbjct: 255 TGNHKTFST-FRTHSAPPVTIADGSTYEIKGSGTVKPTSSITLSSVLNLPNLAFNLISVS 314

Query: 122 QLTHDLNCVVSSFP-------------------------------------GVTSPFKVH 181
           +LT +LNC VS FP                                        SP + H
Sbjct: 315 KLTKNLNCSVSFFPDHCVFQDLMTKRTFGKGHVSDGLYILDEWVPRPVACVSTASPVEAH 374

Query: 182 CLLGHPSLFMLKKLYPEFRSLSSLNCDLCQFAKFHLLSSSPRVNKRASAPFELVHFDIWG 241
           C LGHPSL +LKKL P+F +L SL+C+ C FAK H  S  PR+NKR  + FELVH D+WG
Sbjct: 375 CRLGHPSLPVLKKLCPQFDTLPSLDCESCHFAKHHRSSLGPRLNKRVESLFELVHSDVWG 434

Query: 242 PCPVVSQTGFHYFVTFVDNYSRLTWLYFMKNHSELLSHFCAFHAEIQNQFNVSIKTLRTD 301
           PCPV SQTGF YFVTFVD++SR+TW+YFMKN SE+ SHFCAF AEI+ Q++VS+K LR+D
Sbjct: 435 PCPVTSQTGFRYFVTFVDDFSRMTWIYFMKNRSEVFSHFCAFSAEIKTQYDVSVKILRSD 494

BLAST of CmUC02G037100 vs. ExPASy TrEMBL
Match: A0A438GAA6 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_2601 PE=4 SV=1)

HSP 1 Score: 328.6 bits (841), Expect = 2.8e-86
Identity = 175/336 (52.08%), Positives = 225/336 (66.96%), Query Frame = 0

Query: 2   TEKSVTISTEEFAKFQAYQHSLEASSSSNPTATIADTGNTKCLLTSSTKWVIDSGVIAHM 61
           + K VT++ EEF+K+  YQ +L+AS+   P + +A++G T CL++SS KW+IDSG   HM
Sbjct: 226 SNKIVTMTAEEFSKYSQYQDALKAST---PVSALAESGKT-CLVSSSNKWIIDSGATDHM 285

Query: 62  TGNSNLFSTPLSPTSSPSVTLAYGSTSSVLGSGTINLSPSFSLSFVLHLPQLSFNLISVS 121
           TGN   FST     S+P VT+A GST  + GSGT+  + S +LS VL+LP L+FNLISVS
Sbjct: 286 TGNHKTFST-FRTHSAPPVTVADGSTYEIKGSGTVKPTSSITLSSVLNLPNLAFNLISVS 345

Query: 122 QLTHDLNCVVSSFP-------------------------------------GVTSPFKVH 181
           +LT +LNC VS FP                                        SP + H
Sbjct: 346 KLTKNLNCSVSFFPDHCVFQDLMTKRTFGKGHVSDGLYILDEWVPRPVACVSTASPVEAH 405

Query: 182 CLLGHPSLFMLKKLYPEFRSLSSLNCDLCQFAKFHLLSSSPRVNKRASAPFELVHFDIWG 241
           C LGHPSL +LKKL P+F +L SL+C+ C FAK H  S  PR+NKRA + FELVH D+WG
Sbjct: 406 CRLGHPSLPVLKKLCPQFDTLPSLDCESCHFAKHHRSSLGPRLNKRAESLFELVHSDVWG 465

Query: 242 PCPVVSQTGFHYFVTFVDNYSRLTWLYFMKNHSELLSHFCAFHAEIQNQFNVSIKTLRTD 301
           PCPV SQTGF YFVTFVD++SR+TW+YFMKN SE+ SHFCAF AEI+ Q++VS+K LR+D
Sbjct: 466 PCPVTSQTGFRYFVTFVDDFSRMTWIYFMKNRSEVFSHFCAFSAEIKTQYDVSVKILRSD 525

BLAST of CmUC02G037100 vs. ExPASy TrEMBL
Match: B0FBS2 (Uncharacterized protein OS=Vitis hybrid cultivar OX=241073 PE=4 SV=1)

HSP 1 Score: 328.6 bits (841), Expect = 2.8e-86
Identity = 175/336 (52.08%), Positives = 225/336 (66.96%), Query Frame = 0

Query: 2   TEKSVTISTEEFAKFQAYQHSLEASSSSNPTATIADTGNTKCLLTSSTKWVIDSGVIAHM 61
           + K VT++ EEF+K+  YQ +L+AS+   P + +A++G T CL++SS KW+IDSG   HM
Sbjct: 287 SNKIVTMTAEEFSKYSQYQDALKAST---PVSALAESGKT-CLVSSSNKWIIDSGATDHM 346

Query: 62  TGNSNLFSTPLSPTSSPSVTLAYGSTSSVLGSGTINLSPSFSLSFVLHLPQLSFNLISVS 121
           TGN   FST     S+P VT+A GST  + GSGT+  + S +LS VL+LP L+FNLISVS
Sbjct: 347 TGNHKTFST-FRTHSAPPVTVADGSTYEIKGSGTVKPTSSITLSSVLNLPNLAFNLISVS 406

Query: 122 QLTHDLNCVVSSFP-------------------------------------GVTSPFKVH 181
           +LT +LNC VS FP                                        SP + H
Sbjct: 407 KLTKNLNCSVSFFPDHCVFQDLMTKRTFGKGHVSDGLYILDEWVPRPVACVSTASPVEAH 466

Query: 182 CLLGHPSLFMLKKLYPEFRSLSSLNCDLCQFAKFHLLSSSPRVNKRASAPFELVHFDIWG 241
           C LGHPSL +LKKL P+F +L SL+C+ C FAK H  S  PR+NKRA + FELVH D+WG
Sbjct: 467 CRLGHPSLPVLKKLCPQFDTLPSLDCESCHFAKHHRSSLGPRLNKRAESLFELVHSDVWG 526

Query: 242 PCPVVSQTGFHYFVTFVDNYSRLTWLYFMKNHSELLSHFCAFHAEIQNQFNVSIKTLRTD 301
           PCPV SQTGF YFVTFVD++SR+TW+YFMKN SE+ SHFCAF AEI+ Q++VS+K LR+D
Sbjct: 527 PCPVTSQTGFRYFVTFVDDFSRMTWIYFMKNRSEVFSHFCAFSAEIKTQYDVSVKILRSD 586

BLAST of CmUC02G037100 vs. ExPASy TrEMBL
Match: A0A438DT29 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_3495 PE=4 SV=1)

HSP 1 Score: 327.8 bits (839), Expect = 4.8e-86
Identity = 175/336 (52.08%), Positives = 224/336 (66.67%), Query Frame = 0

Query: 2   TEKSVTISTEEFAKFQAYQHSLEASSSSNPTATIADTGNTKCLLTSSTKWVIDSGVIAHM 61
           + K VT++ EEF+K+  YQ +L+AS+   P + +A++G T CL++SS KW+IDSG   HM
Sbjct: 286 SNKIVTMTAEEFSKYSQYQDALKAST---PVSALAESGKT-CLVSSSNKWIIDSGATDHM 345

Query: 62  TGNSNLFSTPLSPTSSPSVTLAYGSTSSVLGSGTINLSPSFSLSFVLHLPQLSFNLISVS 121
           TGN   FST     S+P VT+A GST  + GSGT+  + S +LS VL+LP L+FNLISVS
Sbjct: 346 TGNHKTFST-FRTHSAPPVTVADGSTYEIKGSGTVKPTSSITLSSVLNLPNLAFNLISVS 405

Query: 122 QLTHDLNCVVSSFP-------------------------------------GVTSPFKVH 181
           +LT +LNC VS FP                                        SP + H
Sbjct: 406 KLTKNLNCSVSFFPNHCVFQDLMTKRTFGKGHVSDGLYILDEWVPRPVACVSTASPVEAH 465

Query: 182 CLLGHPSLFMLKKLYPEFRSLSSLNCDLCQFAKFHLLSSSPRVNKRASAPFELVHFDIWG 241
           C LGHPSL  LKKL P+F +L SL+C+ C FAK H  S  PR+NKRA + FELVH D+WG
Sbjct: 466 CRLGHPSLPALKKLCPQFDTLPSLDCESCHFAKHHRSSLGPRLNKRAESLFELVHSDVWG 525

Query: 242 PCPVVSQTGFHYFVTFVDNYSRLTWLYFMKNHSELLSHFCAFHAEIQNQFNVSIKTLRTD 301
           PCPV SQTGF YFVTFVD++SR+TW+YFMKN SE+ SHFCAF AEI+ Q++VS+K LR+D
Sbjct: 526 PCPVTSQTGFRYFVTFVDDFSRMTWIYFMKNRSEVFSHFCAFSAEIKTQYDVSVKILRSD 585

BLAST of CmUC02G037100 vs. ExPASy TrEMBL
Match: A0A438H537 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_4131 PE=4 SV=1)

HSP 1 Score: 327.0 bits (837), Expect = 8.1e-86
Identity = 175/336 (52.08%), Positives = 223/336 (66.37%), Query Frame = 0

Query: 2   TEKSVTISTEEFAKFQAYQHSLEASSSSNPTATIADTGNTKCLLTSSTKWVIDSGVIAHM 61
           + K VT++ EEF+K+  YQ +L+AS+   P   + ++G T CL++SS KW+IDSG   HM
Sbjct: 286 SNKIVTMTAEEFSKYSQYQDALKAST---PVNALVESGKT-CLVSSSNKWIIDSGATDHM 345

Query: 62  TGNSNLFSTPLSPTSSPSVTLAYGSTSSVLGSGTINLSPSFSLSFVLHLPQLSFNLISVS 121
           TGN   FST     S+P VT+A GST  + GSGT+  + S +LS VL+LP L+FNLISVS
Sbjct: 346 TGNHKTFST-FRTHSAPPVTVADGSTYEIKGSGTMKPTSSITLSSVLNLPNLAFNLISVS 405

Query: 122 QLTHDLNCVVSSFP-------------------------------------GVTSPFKVH 181
           +LT DLNC VS FP                                        SP + H
Sbjct: 406 KLTKDLNCSVSFFPDHCVFQDLMTKRTFGKGHVSDGLYILDEWVPRPVACVSTASPVEAH 465

Query: 182 CLLGHPSLFMLKKLYPEFRSLSSLNCDLCQFAKFHLLSSSPRVNKRASAPFELVHFDIWG 241
           C LGHPSL +LKKL P+F +L SL+C+ C FAK H  S  PR+NKRA + FELVH D+WG
Sbjct: 466 CRLGHPSLPVLKKLCPQFDTLPSLDCESCHFAKHHRSSLGPRLNKRAESLFELVHSDVWG 525

Query: 242 PCPVVSQTGFHYFVTFVDNYSRLTWLYFMKNHSELLSHFCAFHAEIQNQFNVSIKTLRTD 301
           PCPV SQTGF YFVTFVD++SR+TW+YFMKN SE+ SHFCAF AEI+ Q++VS+K LR+D
Sbjct: 526 PCPVTSQTGFRYFVTFVDDFSRMTWIYFMKNRSEVFSHFCAFSAEIKTQYDVSVKILRSD 585
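Each HSP header above reports identity and positives both as a fraction and as a percentage of the alignment length, which in BLAST output includes gap columns. The sketch below parses such a header line and recomputes the percentages so they can be checked against the reported values. `parse_hsp_stats` and `recompute_pct` are hypothetical helpers, and the pattern is written only against the header layout seen on this page, tolerating a missing "i" in "Positives".

```python
import re

# Header layout assumed from this page, e.g.
# "Identity = 247/335 (73.73%), Positives = 262/335 (78.21%), Query Frame = 0"
_HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict[str, tuple[int, int, float]]:
    """Extract (matches, alignment length, reported percent) for identity and positives."""
    match = _HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


def recompute_pct(matches: int, length: int) -> float:
    """Percentage of alignment columns that match, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


stats = parse_hsp_stats(
    "Identity = 247/335 (73.73%), Positives = 262/335 (78.21%), Query Frame = 0"
)
for name, (matches, length, reported) in stats.items():
    print(name, recompute_pct(matches, length), reported)
# identity 73.73 73.73
# positives 78.21 78.21
```

Because the denominator is the alignment length rather than the query length, a gapped alignment can have a denominator larger than the length of the query protein.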

The following BLAST results are available for this feature:
NCBI nr
Match Name      E-value   Identity (%)  Description
XP_031744753.1  2.3e-127  73.73         uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus]
RVX08145.1      2.6e-86   52.08         Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]
ABY49842.1      5.8e-86   52.08         hypothetical protein [Vitis hybrid cultivar]
RVW69134.1      5.8e-86   52.08         Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]
RVW38649.1      9.9e-86   52.08         Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]

ExPASy Swiss-Prot
Match Name      E-value   Identity (%)  Description
Q9ZT94          9.1e-26   31.25         Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
Q94HW2          1.5e-23   29.84         Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
P10978          1.5e-20   27.61         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
Q12491          9.2e-10   26.29         Transposon Ty2-B Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
P25384          9.2e-10   26.29         Transposon Ty2-C Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...

ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A438JGR5      1.3e-86   52.08         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX...
A0A438GAA6      2.8e-86   52.08         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX...
B0FBS2          2.8e-86   52.08         Uncharacterized protein OS=Vitis hybrid cultivar OX=241073 PE=4 SV=1
A0A438DT29      4.8e-86   52.08         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX...
A0A438H537      8.1e-86   52.08         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX...
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL531) v1
Date Performed: 2022-01-31
IPR Term   IPR Description                  Source       Source Term      Source Description               Alignment
IPR036397  Ribonuclease H superfamily       GENE3D       3.30.420.10      -                                coord: 186..300; e-value: 4.8E-24; score: 86.7
IPR001584  Integrase, catalytic core        PFAM         PF00665          rve                              coord: 192..289; e-value: 3.6E-14; score: 52.9
IPR001584  Integrase, catalytic core        PROSITE      PS50994          INTEGRASE                        coord: 190..300; score: 14.859141
None       No IPR available                 PANTHER      PTHR11439:SF324  RIBONUCLEASE H-LIKE DOMAIN, GAG-PRE-INTEGRASE DOMAIN, GAG-POLYPEPTIDE OF LTR COPIA-TYPE-RELATED  coord: 143..290
None       No IPR available                 PANTHER      PTHR11439        GAG-POL-RELATED RETROTRANSPOSON  coord: 143..290
IPR012337  Ribonuclease H-like superfamily  SUPERFAMILY  53098            Ribonuclease H-like              coord: 190..299

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CmUC02G037100.1  CmUC02G037100.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
cellular_component  GO:0005737      cytoplasm
cellular_component  GO:0043231      intracellular membrane-bounded organelle
molecular_function  GO:0003676      nucleic acid binding