Clc01G14030 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc01G14030
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: ClcChr01: 26889782 .. 26890803 (+)
RNA-Seq Expression: Clc01G14030
Synteny: Clc01G14030
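
The Location field can be split into sequence ID, start, end, and strand to work out the span of the gene on the chromosome. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention; the page itself does not state this). The function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Parse 'SeqId: start .. end (strand)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("ClcChr01: 26889782 .. 26890803 (+)")

# Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
length = end - start + 1
print(seqid, strand, length)  # ClcChr01 + 1022
```

Under that assumption the feature spans 1022 bp on the forward strand of ClcChr01.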
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATGTTTGTGGAGAATATCCAACTGTGGCTCATGTCTGCAAACACTTGGGTATTCAGATTCGTGTTTCCTGTCCATACACTTCAGCACAAAATGGTTGCGTTGAACGCAAACACCGGCATCTCATTGAGACTGGCTTTACCATGCTTGCTCAAGCCTCTATGCCTCTGTCATTTTGGTGGTATGCTTTTCAAACAATTGTTTTTCTTATTAATGGTCTTCCTTTTCCTTCACCTGTTTTACAGGACAAATCACCCATGGAGGTTCTTCTCTACTCAAAACTTGATGTTTCTTCTTTAAAAATCTTTGGGAGTGTTTGTTATCCTAATCTTCGACCATATCAGTCTCACAAATTTGATGTTCATAGCGTGCGATGTGTTTACCTTGGCCCGTCTCCAATCCACAAAGGCCATCGGTGTCTTACAACGGATGGTAAATTATTAATTTCTCGCCATGTTCGATTCAATGAAAATGATTATCCATTCAAGACAGGTTTTGGGTTGTGCAATAGGGCATCCACACTGGCCCAATCGGCCCCATCCATTCTTTCCTAGTTTCCAACCTACCTAACTTCTACCAATCCTACTTTACCCAATCCACCCGCACCAGCCCATAGTCCGGACCTTTTATTGAAATATTATAGTTAGTAAAAATAATTAGGAAAATATTGTAGTTTGGAAACTATTTAGGAAAATATGGTAGTTTGGAGGAGAGAGAAGATTGGATAGTGGAGAGAGGAAAAGGTCGAGACCTCAACCTTCGACCAAAGGCAAGGTCGAGATCTCAACCCTCGACCAAAGCAGTCCTTCGACCAATGCAAGGCATTCGACCAATGCATGACCTGCTTCGACCAACGCATGGCCTGCTTCGGCCAACGCATGGAGGCCTCAACCTTCGACCAACGCGAGGCCTACCTTCGACCAACACAAGGCCTGTTTCGACCAACGCAAGGAGGCCTCAAAGTTTGACCAACGCAAGGCCTACTTTGACCAACGCAAGGCTACCTTCGACCAACGCATGA

mRNA sequence

ATGGATGTTTGTGGAGAATATCCAACTGTGGCTCATGTCTGCAAACACTTGGGTATTCAGATTCGTGTTTCCTGTCCATACACTTCAGCACAAAATGGTTGCGTTGAACGCAAACACCGGCATCTCATTGAGACTGGCTTTACCATGCTTGCTCAAGCCTCTATGCCTCTGTCATTTTGGTGGTATGCTTTTCAAACAATTGTTTTTCTTATTAATGGTCTTCCTTTTCCTTCACCTGTTTTACAGGACAAATCACCCATGGAGGTTCTTCTCTACTCAAAACTTGATGTTTCTTCTTTAAAAATCTTTGGGAGTGTTTGTTATCCTAATCTTCGACCATATCAGTCTCACAAATTTGATGTTCATAGCGTGCGATGTGTTTACCTTGGCCCGTCTCCAATCCACAAAGGCCATCGGTGTCTTACAACGGATGGTAAATTATTAATTTCTCGCCATGTTCGATTCAATGAAAATGATTATCCATTCAAGACAGGAAAATATGGTAGTTTGGAGGAGAGAGAAGATTGGATAGTGGAGAGAGGAAAAGGTCGAGACCTCAACCTTCGACCAAAGGCAAGGTCGAGATCTCAACCCTCGACCAAAGCAGTCCTTCGACCAATGCAAGGCATTCGACCAATGCATGACCTGCTTCGACCAACGCATGGCCTGCTTCGGCCAACGCATGGAGGCCTCAACCTTCGACCAACGCGAGGCCTACCTTCGACCAACACAAGGCCTGTTTCGACCAACGCAAGGAGGCCTCAAAGTTTGACCAACGCAAGGCCTACTTTGACCAACGCAAGGCTACCTTCGACCAACGCATGA

Coding sequence (CDS)

ATGGATGTTTGTGGAGAATATCCAACTGTGGCTCATGTCTGCAAACACTTGGGTATTCAGATTCGTGTTTCCTGTCCATACACTTCAGCACAAAATGGTTGCGTTGAACGCAAACACCGGCATCTCATTGAGACTGGCTTTACCATGCTTGCTCAAGCCTCTATGCCTCTGTCATTTTGGTGGTATGCTTTTCAAACAATTGTTTTTCTTATTAATGGTCTTCCTTTTCCTTCACCTGTTTTACAGGACAAATCACCCATGGAGGTTCTTCTCTACTCAAAACTTGATGTTTCTTCTTTAAAAATCTTTGGGAGTGTTTGTTATCCTAATCTTCGACCATATCAGTCTCACAAATTTGATGTTCATAGCGTGCGATGTGTTTACCTTGGCCCGTCTCCAATCCACAAAGGCCATCGGTGTCTTACAACGGATGGTAAATTATTAATTTCTCGCCATGTTCGATTCAATGAAAATGATTATCCATTCAAGACAGGAAAATATGGTAGTTTGGAGGAGAGAGAAGATTGGATAGTGGAGAGAGGAAAAGGTCGAGACCTCAACCTTCGACCAAAGGCAAGGTCGAGATCTCAACCCTCGACCAAAGCAGTCCTTCGACCAATGCAAGGCATTCGACCAATGCATGACCTGCTTCGACCAACGCATGGCCTGCTTCGGCCAACGCATGGAGGCCTCAACCTTCGACCAACGCGAGGCCTACCTTCGACCAACACAAGGCCTGTTTCGACCAACGCAAGGAGGCCTCAAAGTTTGACCAACGCAAGGCCTACTTTGACCAACGCAAGGCTACCTTCGACCAACGCATGA

Protein sequence

MDVCGEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFGSVCYPNLRPYQSHKFDVHSVRCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTGKYGSLEEREDWIVERGKGRDLNLRPKARSRSQPSTKAVLRPMQGIRPMHDLLRPTHGLLRPTHGGLNLRPTRGLPSTNTRPVSTNARRPQSLTNARPTLTNARLPSTNA
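
The protein sequence corresponds to a translation of the CDS. As a rough check, the sketch below translates the first ten codons of the CDS with the standard genetic code and reproduces the first ten residues listed above. It assumes the standard nuclear code (NCBI translation table 1) applies; the helper name `translate` is hypothetical and not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS shown above.
cds_start = "ATGGATGTTTGTGGAGAATATCCAACTGTG"
print(translate(cds_start))  # MDVCGEYPTV
```

Passing the full CDS string to `translate` would give the complete polypeptide for comparison with the listed protein sequence.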
Homology
BLAST of Clc01G14030 vs. NCBI nr
Match: XP_030492909.1 (uncharacterized protein LOC115709020 isoform X1 [Cannabis sativa])

HSP 1 Score: 191.8 bits (486), Expect = 7.7e-45
Identity = 96/200 (48.00%), Positives = 127/200 (63.50%), Query Frame = 0

Query: 2   DVCGEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWW 61
           D  GEY     + +  GI  + SCP+TSAQNG  ERKHRH++E G T+LAQASMPL +W 
Sbjct: 192 DYGGEYQAFESLVQEHGIHFQHSCPHTSAQNGRAERKHRHIVEMGLTLLAQASMPLKYWV 251

Query: 62  YAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFGSVCYPNLRPYQSHKFDV 121
            AFQT V+LIN L  P+P+L DKSP EV+   K +   LK FG+ C+P LRPYQ+HKF  
Sbjct: 252 DAFQTSVYLINRL--PTPILHDKSPFEVVYKKKPNYDMLKTFGATCFPCLRPYQTHKFQF 311

Query: 122 HSVRCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTGKYGSLEEREDWIVERG 181
           HS++CV LG S   KG++CL++ G++ ISRHV FNE ++PFK G   +       I++  
Sbjct: 312 HSLKCVNLGYSESFKGYKCLSSTGRVYISRHVVFNEQEFPFKIGFLNNFAPEHSVIIKHS 371

Query: 182 KGRDL-NLRPKARSRSQPST 201
               L N+   A +  Q  T
Sbjct: 372 AWSQLPNMHSFAYNMPQERT 389
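
The percentages in each HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them from the summary line of this first hit; the function name `hsp_fractions` is hypothetical, and the regular expression is only an assumption about the line layout seen on this page.

```python
import re

def hsp_fractions(line):
    """Return {label: (matches, aligned_length, percent)} for each 'X = n/m' field."""
    out = {}
    for label, num, den in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        n, d = int(num), int(den)
        out[label] = (n, d, round(100 * n / d, 2))
    return out

stats = hsp_fractions(
    "Identity = 96/200 (48.00%), Positives = 127/200 (63.50%), Query Frame = 0"
)
print(stats["Identity"])   # (96, 200, 48.0)
print(stats["Positives"])  # (127, 200, 63.5)
```

The same calculation applies to every HSP on this page, since each reports both counts over a shared alignment length.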

BLAST of Clc01G14030 vs. NCBI nr
Match: KYP50444.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 191.0 bits (484), Expect = 1.3e-44
Identity = 85/161 (52.80%), Positives = 114/161 (70.81%), Query Frame = 0

Query: 5   GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAF 64
           GE+ +++ V    GIQ+R SCPYTSAQNG  ERKHRH++E+G T+LAQA MPL +WW AF
Sbjct: 495 GEFKSLSKVLIKTGIQLRESCPYTSAQNGRAERKHRHVVESGLTLLAQAKMPLHYWWEAF 554

Query: 65  QTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFGSVCYPNLRPYQSHKFDVHSV 124
            T VFLIN L  P+ V+++KSP + L     D +++K FG  CYP L+PY  HK   H+ 
Sbjct: 555 STAVFLINRL--PTQVIKNKSPYQQLFDKNPDYTAMKTFGCACYPCLKPYNQHKLQFHTT 614

Query: 125 RCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTG 166
           +CV+LG S  HKG++CL + G++ ISRHV FNE+ +PF  G
Sbjct: 615 KCVFLGYSGSHKGYKCLNSTGRIFISRHVVFNEHHFPFHDG 653

BLAST of Clc01G14030 vs. NCBI nr
Match: GAU19483.1 (hypothetical protein TSUD_77270 [Trifolium subterraneum])

HSP 1 Score: 189.9 bits (481), Expect = 2.9e-44
Identity = 87/161 (54.04%), Positives = 108/161 (67.08%), Query Frame = 0

Query: 5   GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAF 64
           GEY  V  +    GIQ R+SCPYTS QNG  ERKHRH+ E G T+LAQA MPL +WW AF
Sbjct: 564 GEYKPVQKLAVEAGIQFRMSCPYTSQQNGRAERKHRHITEFGLTLLAQAQMPLHYWWEAF 623

Query: 65  QTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFGSVCYPNLRPYQSHKFDVHSV 124
            T V+LIN L  PS V Q++SP  ++L  + D   LK FG  CYP L+PY  HK   H+ 
Sbjct: 624 STAVYLINRL--PSQVTQNESPYSLMLQKEPDYKLLKTFGCACYPCLKPYNQHKLQYHTT 683

Query: 125 RCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTG 166
           RCV+LG S  HKG++CL + G++ ISRHV FNE+ +PF  G
Sbjct: 684 RCVFLGYSNSHKGYKCLNSHGRIFISRHVIFNEDHFPFHDG 722

BLAST of Clc01G14030 vs. NCBI nr
Match: KYP75364.1 (Copia protein [Cajanus cajan])

HSP 1 Score: 189.9 bits (481), Expect = 2.9e-44
Identity = 83/161 (51.55%), Positives = 115/161 (71.43%), Query Frame = 0

Query: 5   GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAF 64
           GE+  +  + +  G Q+R+SCPYTS QNG  ERKHRH++E G T+LAQA MPL F W AF
Sbjct: 24  GEFKGLQKISQESGFQLRMSCPYTSQQNGKAERKHRHVVELGLTLLAQAKMPLYFLWEAF 83

Query: 65  QTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFGSVCYPNLRPYQSHKFDVHSV 124
            T VFLIN L  P+P++++KSP  VLL  + D ++LK FG  CYP ++PY +HK   H+ 
Sbjct: 84  STAVFLINRL--PTPIIRNKSPYSVLLNKEPDYNNLKSFGCACYPCIKPYNAHKLQYHTT 143

Query: 125 RCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTG 166
           +CV+LG S  HKG +C+ ++G++ ISRHV FNE+++PF  G
Sbjct: 144 KCVFLGYSSSHKGFKCMNSNGRIFISRHVIFNEHEFPFHDG 182

BLAST of Clc01G14030 vs. NCBI nr
Match: MCH94186.1 (retrovirus-related pol polyprotein from transposon tnt 1-94 [Trifolium medium])

HSP 1 Score: 189.1 bits (479), Expect = 5.0e-44
Identity = 87/161 (54.04%), Positives = 108/161 (67.08%), Query Frame = 0

Query: 5   GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAF 64
           GEY  V  +    GIQ R+SCPYTS QNG  ERKHRH+ E G T+LAQA MPL +WW AF
Sbjct: 163 GEYKPVQKLAVEAGIQFRMSCPYTSQQNGRAERKHRHITEFGLTLLAQAQMPLHYWWEAF 222

Query: 65  QTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFGSVCYPNLRPYQSHKFDVHSV 124
            T V+LIN L  PS V Q++SP  ++L  + D   LK FG  CYP L+PY  HK   H+ 
Sbjct: 223 STAVYLINRL--PSQVTQNESPYSLMLQKEPDYKLLKPFGCACYPCLKPYNQHKLQFHTT 282

Query: 125 RCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTG 166
           RCV+LG S  HKG++CL + G++ ISRHV FNE+ +PF  G
Sbjct: 283 RCVFLGYSNSHKGYKCLNSHGRIFISRHVIFNEDHFPFHDG 321

BLAST of Clc01G14030 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 143.7 bits (361), Expect = 3.2e-33
Identity = 72/159 (45.28%), Positives = 97/159 (61.01%), Query Frame = 0

Query: 5   GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAF 64
           GE+  +       GI    S P+T   NG  ERKHRH++ETG T+L+ AS+P ++W YAF
Sbjct: 595 GEFVALWEYFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHASIPKTYWPYAF 654

Query: 65  QTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFGSVCYPNLRPYQSHKFDVHSV 124
              V+LIN L  P+P+LQ +SP + L  +  +   L++FG  CYP LRPY  HK D  S 
Sbjct: 655 AVAVYLINRL--PTPLLQLESPFQKLFGTSPNYDKLRVFGCACYPWLRPYNQHKLDDKSR 714

Query: 125 RCVYLGPSPIHKGHRCL-TTDGKLLISRHVRFNENDYPF 163
           +CV+LG S     + CL     +L ISRHVRF+EN +PF
Sbjct: 715 QCVFLGYSLTQSAYLCLHLQTSRLYISRHVRFDENCFPF 751

BLAST of Clc01G14030 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 141.7 bits (356), Expect = 1.2e-32
Identity = 73/175 (41.71%), Positives = 101/175 (57.71%), Query Frame = 0

Query: 5   GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAF 64
           GE+  +       GI    S P+T   NG  ERKHRH++E G T+L+ AS+P ++W YAF
Sbjct: 574 GEFVVLRDYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASVPKTYWPYAF 633

Query: 65  QTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFGSVCYPNLRPYQSHKFDVHSV 124
              V+LIN L  P+P+LQ +SP + L     +   LK+FG  CYP LRPY  HK +  S 
Sbjct: 634 SVAVYLINRL--PTPLLQLQSPFQKLFGQPPNYEKLKVFGCACYPWLRPYNRHKLEDKSK 693

Query: 125 RCVYLGPSPIHKGHRCL-TTDGKLLISRHVRFNENDYPFKTGKYG---SLEERED 176
           +C ++G S     + CL    G+L  SRHV+F+E  +PF T  +G   S E+R D
Sbjct: 694 QCAFMGYSLTQSAYLCLHIPTGRLYTSRHVQFDERCFPFSTTNFGVSTSQEQRSD 746

BLAST of Clc01G14030 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 77.0 bits (188), Expect = 3.6e-13
Identity = 51/158 (32.28%), Positives = 76/158 (48.10%), Query Frame = 0

Query: 5   GEYPT--VAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWY 64
           GEY +      C   GI+   + P T   NG  ER +R ++E   +ML  A +P SFW  
Sbjct: 553 GEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRSMLRMAKLPKSFWGE 612

Query: 65  AFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFGSVCYPNLRPYQSHKFDVH 124
           A QT  +LIN    PS  L  + P  V    ++  S LK+FG   + ++   Q  K D  
Sbjct: 613 AVQTACYLIN--RSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAHVPKEQRTKLDDK 672

Query: 125 SVRCVYLGPSPIHKGHRCL-TTDGKLLISRHVRFNEND 160
           S+ C+++G      G+R       K++ SR V F E++
Sbjct: 673 SIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESE 708

BLAST of Clc01G14030 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 69.7 bits (169), Expect = 5.8e-11
Identity = 39/120 (32.50%), Positives = 62/120 (51.67%), Query Frame = 0

Query: 14  CKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAFQTIVFLING 73
           C   GI   ++ P+T   NG  ER  R + E   TM++ A +  SFW  A  T  +LIN 
Sbjct: 564 CVKKGISYHLTVPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINR 623

Query: 74  LPFPSPVLQDKSPMEVLLYSKLDVSSLKIFGSVCYPNLRPYQSHKFDVHSVRCVYLGPSP 133
           +P  + V   K+P E+    K  +  L++FG+  Y +++  Q  KFD  S + +++G  P
Sbjct: 624 IPSRALVDSSKTPYEMWHNKKPYLKHLRVFGATVYVHIKNKQG-KFDDKSFKSIFVGYEP 682

BLAST of Clc01G14030 vs. ExPASy TrEMBL
Match: A0A803Q615 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 195.3 bits (495), Expect = 3.4e-46
Identity = 92/161 (57.14%), Positives = 114/161 (70.81%), Query Frame = 0

Query: 5   GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAF 64
           GEY    +     GI  + SCP+TSAQNG  ERKHRH++E G T+LAQA +P  +WW AF
Sbjct: 59  GEYQAFNNHVAERGIDFQQSCPHTSAQNGRAERKHRHIVEMGLTLLAQAGIPQKYWWDAF 118

Query: 65  QTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFGSVCYPNLRPYQSHKFDVHSV 124
           QT V+LIN L  P+PVL+ KSP+EVL   K D   LK FG  CYP LRPYQSHKF  HS 
Sbjct: 119 QTSVYLINRL--PTPVLKGKSPLEVLFGKKPDYKFLKTFGCTCYPCLRPYQSHKFQYHST 178

Query: 125 RCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTG 166
           +CV LG S  HKG++CL++ G+L ISR+V FNE+++PF TG
Sbjct: 179 KCVNLGYSDRHKGYKCLSSTGRLYISRNVIFNEDEFPFLTG 217

BLAST of Clc01G14030 vs. ExPASy TrEMBL
Match: A0A803QD60 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 193.7 bits (491), Expect = 9.8e-46
Identity = 101/224 (45.09%), Positives = 138/224 (61.61%), Query Frame = 0

Query: 2   DVCGEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWW 61
           D+ GEY  + ++    GI    SCP+TSAQNG  +RKHRH++E G T+LAQA MPL +WW
Sbjct: 321 DLGGEYQALQNIVIENGIDFHHSCPHTSAQNGRAKRKHRHVVEMGLTLLAQAHMPLKYWW 380

Query: 62  YAFQTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFGSVCYPNLRPYQSHKFDV 121
             FQT V+LIN L  P+P+L++KSP E L   + D + LK+FG  C+P +RPYQ+HKF  
Sbjct: 381 ETFQTAVYLINRL--PTPILENKSPFETLYIKEPDYAFLKVFGFACFPCIRPYQAHKFQF 440

Query: 122 HSVRCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTGKYGSLEEREDWIVERG 181
           HS++ V LG S  HKG+RCLT  GK+ ISR+V FNE ++PFK G   + ++ +  I+   
Sbjct: 441 HSLKYVNLGYSESHKGYRCLTPTGKIYISRNVVFNELEFPFKLGFLNNYQQEKPVIIH-- 500

Query: 182 KGRDLNLRPKARSRSQPSTKAVLRPMQGIRPMHDLLRPTHGLLR 226
                ++ P     +  ST A L P  G  P      PTH  +R
Sbjct: 501 -SSSWSVLPSFTLPTGTSTTA-LTPSLG-TPEESPSTPTHSQVR 537

BLAST of Clc01G14030 vs. ExPASy TrEMBL
Match: A0A151S6M8 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=3821 GN=KK1_027809 PE=4 SV=1)

HSP 1 Score: 191.0 bits (484), Expect = 6.4e-45
Identity = 85/161 (52.80%), Positives = 114/161 (70.81%), Query Frame = 0

Query: 5   GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAF 64
           GE+ +++ V    GIQ+R SCPYTSAQNG  ERKHRH++E+G T+LAQA MPL +WW AF
Sbjct: 495 GEFKSLSKVLIKTGIQLRESCPYTSAQNGRAERKHRHVVESGLTLLAQAKMPLHYWWEAF 554

Query: 65  QTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFGSVCYPNLRPYQSHKFDVHSV 124
            T VFLIN L  P+ V+++KSP + L     D +++K FG  CYP L+PY  HK   H+ 
Sbjct: 555 STAVFLINRL--PTQVIKNKSPYQQLFDKNPDYTAMKTFGCACYPCLKPYNQHKLQFHTT 614

Query: 125 RCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTG 166
           +CV+LG S  HKG++CL + G++ ISRHV FNE+ +PF  G
Sbjct: 615 KCVFLGYSGSHKGYKCLNSTGRIFISRHVVFNEHHFPFHDG 653

BLAST of Clc01G14030 vs. ExPASy TrEMBL
Match: A0A151U7U2 (Copia protein OS=Cajanus cajan OX=3821 GN=KK1_008090 PE=4 SV=1)

HSP 1 Score: 189.9 bits (481), Expect = 1.4e-44
Identity = 83/161 (51.55%), Positives = 115/161 (71.43%), Query Frame = 0

Query: 5   GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAF 64
           GE+  +  + +  G Q+R+SCPYTS QNG  ERKHRH++E G T+LAQA MPL F W AF
Sbjct: 24  GEFKGLQKISQESGFQLRMSCPYTSQQNGKAERKHRHVVELGLTLLAQAKMPLYFLWEAF 83

Query: 65  QTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFGSVCYPNLRPYQSHKFDVHSV 124
            T VFLIN L  P+P++++KSP  VLL  + D ++LK FG  CYP ++PY +HK   H+ 
Sbjct: 84  STAVFLINRL--PTPIIRNKSPYSVLLNKEPDYNNLKSFGCACYPCIKPYNAHKLQYHTT 143

Query: 125 RCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTG 166
           +CV+LG S  HKG +C+ ++G++ ISRHV FNE+++PF  G
Sbjct: 144 KCVFLGYSSSHKGFKCMNSNGRIFISRHVIFNEHEFPFHDG 182

BLAST of Clc01G14030 vs. ExPASy TrEMBL
Match: A0A2Z6MBG6 (Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_77270 PE=4 SV=1)

HSP 1 Score: 189.9 bits (481), Expect = 1.4e-44
Identity = 87/161 (54.04%), Positives = 108/161 (67.08%), Query Frame = 0

Query: 5   GEYPTVAHVCKHLGIQIRVSCPYTSAQNGCVERKHRHLIETGFTMLAQASMPLSFWWYAF 64
           GEY  V  +    GIQ R+SCPYTS QNG  ERKHRH+ E G T+LAQA MPL +WW AF
Sbjct: 564 GEYKPVQKLAVEAGIQFRMSCPYTSQQNGRAERKHRHITEFGLTLLAQAQMPLHYWWEAF 623

Query: 65  QTIVFLINGLPFPSPVLQDKSPMEVLLYSKLDVSSLKIFGSVCYPNLRPYQSHKFDVHSV 124
            T V+LIN L  PS V Q++SP  ++L  + D   LK FG  CYP L+PY  HK   H+ 
Sbjct: 624 STAVYLINRL--PSQVTQNESPYSLMLQKEPDYKLLKTFGCACYPCLKPYNQHKLQYHTT 683

Query: 125 RCVYLGPSPIHKGHRCLTTDGKLLISRHVRFNENDYPFKTG 166
           RCV+LG S  HKG++CL + G++ ISRHV FNE+ +PF  G
Sbjct: 684 RCVFLGYSNSHKGYKCLNSHGRIFISRHVIFNEDHFPFHDG 722

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name      E-value  Identity (%)  Description
XP_030492909.1  7.7e-45  48.00         uncharacterized protein LOC115709020 isoform X1 [Cannabis sativa]
KYP50444.1      1.3e-44  52.80         Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]
GAU19483.1      2.9e-44  54.04         hypothetical protein TSUD_77270 [Trifolium subterraneum]
KYP75364.1      2.9e-44  51.55         Copia protein [Cajanus cajan]
MCH94186.1      5.0e-44  54.04         retrovirus-related pol polyprotein from transposon tnt 1-94 [Trifolium medium]

vs. ExPASy Swiss-Prot
Match Name      E-value  Identity (%)  Description
Q94HW2          3.2e-33  45.28         Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94          1.2e-32  41.71         Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
P10978          3.6e-13  32.28         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146          5.8e-11  32.50         Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3

vs. ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A803Q615      3.4e-46  57.14         Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A803QD60      9.8e-46  45.09         Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A151S6M8      6.4e-45  52.80         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=...
A0A151U7U2      1.4e-44  51.55         Copia protein OS=Cajanus cajan OX=3821 GN=KK1_008090 PE=4 SV=1
A0A2Z6MBG6      1.4e-44  54.04         Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 ...

InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term   IPR Description                  Source       Source Term  Source Description   Alignment
IPR036397  Ribonuclease H superfamily       GENE3D       3.30.420.10  -                    coord: 7..97; e-value: 1.5E-13; score: 52.8
None       No IPR available                 MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 224..274
None       No IPR available                 MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 236..274
IPR001584  Integrase, catalytic core        PROSITE      PS50994      INTEGRASE            coord: 1..94; score: 11.505361
IPR012337  Ribonuclease H-like superfamily  SUPERFAMILY  53098        Ribonuclease H-like  coord: 10..90

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Clc01G14030.1  Clc01G14030.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0016740      transferase activity