CcUC03G046750 (gene) Watermelon (PI 537277) v1

Overview
Name: CcUC03G046750
Type: gene
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Location: CicolChr03: 4729138 .. 4732531 (-)
RNA-Seq Expression: CcUC03G046750
Synteny: CcUC03G046750
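
The Location field above can be turned into a feature span with a few lines of code. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common genome-browser convention that this page does not state explicitly). The helper name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split 'Chrom: start .. end (strand)' into its parts (hypothetical helper)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("CicolChr03: 4729138 .. 4732531 (-)")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the gene spans 3394 bp on the minus strand of CicolChr03.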
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GAAAGAATTGGCCAATTGGGAGATCGATCGCAAGAGGCGAATTGTGATATGGAGGGAAGAGGAATTTTAAGTCCCCAAAGATCTCGATATTCTGTTCAAAAACTTAAGAAGGCTACTCCAGATCGAAAACCAGACAGGAAACTTGGAAACAGAAATGTTGATAATAATGGATCAAAACAAAAACTCGTCACTCCAAAGCTAAACCCAAGTCCTAATCCTAACCACAACAAAACTAAACCCACCACGCCTTCCCCAAAGACCAAAATTGCAAAACCATACTCCCATCCTGTTGCCAATCAGAACCACCCTTATTTGCGTGCAGCTTCCGATTCTGCTGTTCTTCAACCTCCTGATATTAGCGATCACTTGTTACAACGCCTTTCTTTCGACGGTCAGTTTTCATAATACTTCAAATATTTACTCTTTTTTTTTTTTTTTTTCATTGTTTACTCATGAATCTCCTCACCAGCCTAACAAAGTAACAAGTGTAATTTTTTAAGAATAAATCCTTGGAAAAGTATCTATTTATACTTCTTAGAAGTTGTATCAATAGACTTATATATAATTTTGAACTCTAAATATTTGATAAGTGAATCAATTAAGGCTTATAAAAATTGTTCATGTAGTAATTCTTCTTACATGCATGTAGAATTCTTCCAAAGTCCGAATAAAAAAATGGACAACGTAGGCCTCCTTGATTTGGGTGGAATCAAAATTTGGTTGAAATTTGATGAATTTTTATTTTAGAAATTTAAAGAAATAAGTTTAAAGCAAAAAGTACCTTTTTAGTCATTGAGTTTTGAGGAATATGTACGTTAGTCCCTAAGTTTTCAAAATATAACTTTTTAGTACTCGAGTTTTTAAGAATATGTATATTTGGTCCCTAAATGTACCTTGTAGCTCTTGAGCTCTTAGAAATAATAGGTTTAAAAGGTCCCTGCAATATTTTTATTATTATTATTTTAAGAAAAATTATTTTAAATAGCAAAACTGCTGAAAATATTTACAATTAATAGCAAAATATCACAGTCTATCTGCGATAGACCGCGATAGATCGCGATAGACTACTATCTGTGTCGATCGTGACACAAATCGTAGTCTACTGCGATTTATAGCAGATAGTGAAATTTTGCTATATTTGTAATTATTTTGGTTCCTTTTGCTATATTGGAAAAGAGACCTTATTTTAAAGGTAGTTGTTTTAAATCATAAAACTACTATAAATATTTTCAAATATAGCAAAATGGCATTGTGTGATAGACAATGATAGAAACTGGTAGATAGTGATATTTTGTTATATTTTTAAATATTTTGGTTCATTTTGCTATATTTGAAAACAACTCTGATTTTAACTTAGAAAATTAATGGTTGAAACTTATCTCTTCTCTCTAAAATTATTTTAAACTTATGTTTTCAATTCAAAATGTTAAATAGTTATTTAAAATAATAAAAATATTGCTTTGAGGCAATTTTTAAACATATCTAAAAAATTCAAAGCTTGAAAGGGGGTACATTTTGAAAATTCACAAACCAAATGCATCTCTTTTTAAAAATTGTTCGACTAAAAATATATTTTTTAAAAATCAGAAACCGAATGCACCTATTCTTCCAAATTGGAGGACTAAAAAAAGAGTAGCTTTCTCTCACTTTTAAGTAGAAGTTAAATTAGTCCATAAACTTCTCCTTATATGTTATAATTATAAATTTAGTCAGCAAACGTTTAAATTTTCTAAAATTAAAAAATTGAATGTTTGGTTTCATTAGACAGAGTTAACTTTGGAAATTTATGTAATTGTTGTACTAAAATAGGTTTCGTAGGAACAAAATTCAAATATCATAATTAAATAAAACAGAAAATTCTTGACATTTCTTGAGTCGTTGTACATCTGTTTTAAATCTTGTACTCTTGCAAAAAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAAAAAAAAAAAAAAAGGTAAAGATCTTGCAGAAATCCTCCAAGGAAACTCGTTATATGATTTAATCAGCCCAAATAAAAAGGAAGAATCTTCTTCTCAAAGCAATGATGATTCCAGATTATTTCAAATTTACAAAGAAATTGCATCTCATCGACAAGAAAACTTATCCGTTGAAGCTTACTTCACAAAGCTCGAGGCATTATGGGTTGAACTTGCATACTACACTGCTGATTTGGGTCAACGTTCCAGCAATAGTGCAATAGAAAATCTAAATGAGCTTACGGAGAGAGAAAAAGTTGTGCAATTTCTTGTGGGACTAAATGATTCTTATGCCACAATTTGCACCCAAATCCTTCTTATCAACCCATTTCCAACAGTGGAGAAAGCTTATGCTGAAATATCTCGAGAAGAAAAACGTAGAGAATTGGTTGTTGCATTAGAAGCTGTGGCTGCAAAAGTAATCCAAACCAATTGGCTTCTTAGGAATCAAAATGGTCGATCCAATAATAATGGTGATAATAATTATGGAAGTGATCAAGAAGTCGATGATAGTAACCTTCAATCTCCTAAGTGA

mRNA sequence

GAAAGAATTGGCCAATTGGGAGATCGATCGCAAGAGGCGAATTGTGATATGGAGGGAAGAGGAATTTTAAGTCCCCAAAGATCTCGATATTCTGTTCAAAAACTTAAGAAGGCTACTCCAGATCGAAAACCAGACAGGAAACTTGGAAACAGAAATGTTGATAATAATGGATCAAAACAAAAACTCGTCACTCCAAAGCTAAACCCAAGTCCTAATCCTAACCACAACAAAACTAAACCCACCACGCCTTCCCCAAAGACCAAAATTGCAAAACCATACTCCCATCCTGTTGCCAATCAGAACCACCCTTATTTGCGTGCAGCTTCCGATTCTGCTGTTCTTCAACCTCCTGATATTAGCGATCACTTGTTACAACGCCTTTCTTTCGACGGTAAAGATCTTGCAGAAATCCTCCAAGGAAACTCGTTATATGATTTAATCAGCCCAAATAAAAAGGAAGAATCTTCTTCTCAAAGCAATGATGATTCCAGATTATTTCAAATTTACAAAGAAATTGCATCTCATCGACAAGAAAACTTATCCGTTGAAGCTTACTTCACAAAGCTCGAGGCATTATGGGTTGAACTTGCATACTACACTGCTGATTTGGGTCAACGTTCCAGCAATAGTGCAATAGAAAATCTAAATGAGCTTACGGAGAGAGAAAAAGTTGTGCAATTTCTTGTGGGACTAAATGATTCTTATGCCACAATTTGCACCCAAATCCTTCTTATCAACCCATTTCCAACAGTGGAGAAAGCTTATGCTGAAATATCTCGAGAAGAAAAACGTAGAGAATTGGTTGTTGCATTAGAAGCTGTGGCTGCAAAAGTAATCCAAACCAATTGGCTTCTTAGGAATCAAAATGGTCGATCCAATAATAATGGTGATAATAATTATGGAAGTGATCAAGAAGTCGATGATAGTAACCTTCAATCTCCTAAGTGA

Coding sequence (CDS)

ATGGAGGGAAGAGGAATTTTAAGTCCCCAAAGATCTCGATATTCTGTTCAAAAACTTAAGAAGGCTACTCCAGATCGAAAACCAGACAGGAAACTTGGAAACAGAAATGTTGATAATAATGGATCAAAACAAAAACTCGTCACTCCAAAGCTAAACCCAAGTCCTAATCCTAACCACAACAAAACTAAACCCACCACGCCTTCCCCAAAGACCAAAATTGCAAAACCATACTCCCATCCTGTTGCCAATCAGAACCACCCTTATTTGCGTGCAGCTTCCGATTCTGCTGTTCTTCAACCTCCTGATATTAGCGATCACTTGTTACAACGCCTTTCTTTCGACGGTAAAGATCTTGCAGAAATCCTCCAAGGAAACTCGTTATATGATTTAATCAGCCCAAATAAAAAGGAAGAATCTTCTTCTCAAAGCAATGATGATTCCAGATTATTTCAAATTTACAAAGAAATTGCATCTCATCGACAAGAAAACTTATCCGTTGAAGCTTACTTCACAAAGCTCGAGGCATTATGGGTTGAACTTGCATACTACACTGCTGATTTGGGTCAACGTTCCAGCAATAGTGCAATAGAAAATCTAAATGAGCTTACGGAGAGAGAAAAAGTTGTGCAATTTCTTGTGGGACTAAATGATTCTTATGCCACAATTTGCACCCAAATCCTTCTTATCAACCCATTTCCAACAGTGGAGAAAGCTTATGCTGAAATATCTCGAGAAGAAAAACGTAGAGAATTGGTTGTTGCATTAGAAGCTGTGGCTGCAAAAGTAATCCAAACCAATTGGCTTCTTAGGAATCAAAATGGTCGATCCAATAATAATGGTGATAATAATTATGGAAGTGATCAAGAAGTCGATGATAGTAACCTTCAATCTCCTAAGTGA

Protein sequence

MEGRGILSPQRSRYSVQKLKKATPDRKPDRKLGNRNVDNNGSKQKLVTPKLNPSPNPNHNKTKPTTPSPKTKIAKPYSHPVANQNHPYLRAASDSAVLQPPDISDHLLQRLSFDGKDLAEILQGNSLYDLISPNKKEESSSQSNDDSRLFQIYKEIASHRQENLSVEAYFTKLEALWVELAYYTADLGQRSSNSAIENLNELTEREKVVQFLVGLNDSYATICTQILLINPFPTVEKAYAEISREEKRRELVVALEAVAAKVIQTNWLLRNQNGRSNNNGDNNYGSDQEVDDSNLQSPK
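
The protein sequence above corresponds to a translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code (the page does not name the translation table), the first and last codons of the CDS can be checked against the protein like this. The function `translate` and the variable names are illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with N or other symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 27 nt and last 15 nt of the CDS shown above.
cds_head = "ATGGAGGGAAGAGGAATTTTAAGTCCC"
cds_tail = "CAATCTCCTAAGTGA"

head_peptide = translate(cds_head)
tail_peptide = translate(cds_tail)
print(head_peptide, tail_peptide)
```

The two fragments translate to the start (MEGRGILSP) and end (QSPK) of the listed protein. Passing the full CDS string to `translate` would be the natural way to compare the whole sequence.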
Homology
BLAST of CcUC03G046750 vs. NCBI nr
Match: XP_038895148.1 (proline-rich receptor-like protein kinase PERK2 isoform X1 [Benincasa hispida])

HSP 1 Score: 280.4 bits (716), Expect = 1.8e-71
Identity = 188/358 (52.51%), Positives = 226/358 (63.13%), Query Frame = 0

Query: 1   MEGRGILSPQRSRYSVQKLKKATPD-RKPDRKLGNRNVDNNGSKQKLVTPKLNP------ 60
           MEGRGI+SP+RSR+S +  KK TP   KP +    RNV   GSK + VTP  NP      
Sbjct: 1   MEGRGIMSPKRSRFSPKMTKKGTPQPAKPAKNKHPRNV--TGSKIE-VTPIPNPTPPRAA 60

Query: 61  -SPNPNHNKTK----------PTTPSPKTK-------------------------IAKPY 120
            +P+PNH+KT+          P TP+P T+                         ++K +
Sbjct: 61  TNPSPNHSKTQSRSEPSSPIPPPTPTPTTRSQPTTPSSSNTNVVRQGGLSFDNPSVSKSF 120

Query: 121 SHP-----------------VANQNHPYLRAASDSAVLQPPDISDHLLQRLSFDGKDLAE 180
             P                 V     P   A+S +A     D+SD LLQRLSFDGKD+A+
Sbjct: 121 HSPKDYHKSSPGSWLGLQKDVNTSPDPSSPASSHTA----SDLSDRLLQRLSFDGKDVAD 180

Query: 181 ILQGNSLYDLISPNKKEESSSQSNDDSRLFQIYKEIASHRQENLSVEAYFTKLEALWVEL 240
           ILQG S+YD++  NKKEE +SQS D  R+ QIYKEIASHRQENL VE YF KL ALW EL
Sbjct: 181 ILQGKSIYDIMGSNKKEE-TSQSIDGLRVLQIYKEIASHRQENLFVEPYFRKLNALWDEL 240

Query: 241 AYYTADLGQRSSNSAIENLNELTEREKVVQFLVGLNDSYATICTQILLINPFPTVEKAYA 299
           ++Y  DL Q SS  AI+N++ELTER+KVVQFLVGLNDSYATIC QIL+  PFPTVE+AY+
Sbjct: 241 SFYITDLAQCSSGGAIQNVSELTERQKVVQFLVGLNDSYATICGQILVKRPFPTVEEAYS 300

BLAST of CcUC03G046750 vs. NCBI nr
Match: XP_038895149.1 (proline-rich receptor-like protein kinase PERK2 isoform X2 [Benincasa hispida])

HSP 1 Score: 271.2 bits (692), Expect = 1.1e-68
Identity = 186/358 (51.96%), Positives = 224/358 (62.57%), Query Frame = 0

Query: 1   MEGRGILSPQRSRYSVQKLKKATPD-RKPDRKLGNRNVDNNGSKQKLVTPKLNP------ 60
           MEGRGI+SP+RSR+S +  KK TP   KP +    RNV   GSK + VTP  NP      
Sbjct: 1   MEGRGIMSPKRSRFSPKMTKKGTPQPAKPAKNKHPRNV--TGSKIE-VTPIPNPTPPRAA 60

Query: 61  -SPNPNHNKTK----------PTTPSPKTK-------------------------IAKPY 120
            +P+PNH+KT+          P TP+P T+                         ++K +
Sbjct: 61  TNPSPNHSKTQSRSEPSSPIPPPTPTPTTRSQPTTPSSSNTNVVRQGGLSFDNPSVSKSF 120

Query: 121 SHP-----------------VANQNHPYLRAASDSAVLQPPDISDHLLQRLSFDGKDLAE 180
             P                 V     P   A+S +A     D+SD LLQRLSFD  D+A+
Sbjct: 121 HSPKDYHKSSPGSWLGLQKDVNTSPDPSSPASSHTA----SDLSDRLLQRLSFD--DVAD 180

Query: 181 ILQGNSLYDLISPNKKEESSSQSNDDSRLFQIYKEIASHRQENLSVEAYFTKLEALWVEL 240
           ILQG S+YD++  NKKEE +SQS D  R+ QIYKEIASHRQENL VE YF KL ALW EL
Sbjct: 181 ILQGKSIYDIMGSNKKEE-TSQSIDGLRVLQIYKEIASHRQENLFVEPYFRKLNALWDEL 240

Query: 241 AYYTADLGQRSSNSAIENLNELTEREKVVQFLVGLNDSYATICTQILLINPFPTVEKAYA 299
           ++Y  DL Q SS  AI+N++ELTER+KVVQFLVGLNDSYATIC QIL+  PFPTVE+AY+
Sbjct: 241 SFYITDLAQCSSGGAIQNVSELTERQKVVQFLVGLNDSYATICGQILVKRPFPTVEEAYS 300

BLAST of CcUC03G046750 vs. NCBI nr
Match: XP_038895286.1 (hybrid signal transduction histidine kinase L-like isoform X2 [Benincasa hispida])

HSP 1 Score: 221.9 bits (564), Expect = 7.6e-54
Identity = 153/305 (50.16%), Positives = 184/305 (60.33%), Query Frame = 0

Query: 15  SVQKLKKATPDRKPDRKLGNRNVDNNGSKQKLVTPKLNPSPNPNHNKTKPTTPSP---KT 74
           S  KL  + P        G + +D N + + +  P    +P P  ++T+PT  S    K 
Sbjct: 69  STSKLVSSRPKTATSNPNGQKKLDKNSTTKIISKPSSPRTPKPVTHQTRPTPNSKRNVKN 128

Query: 75  KIA--------------KPYSHP------VANQNHPYLRAASDSAVLQPPDISDHLLQRL 134
           +IA                Y H           NH Y  +A   A L    +SD  LQRL
Sbjct: 129 RIASCSGSRSELSSAGKNKYFHSRNGSPNDLGSNHRY--SAGHYATLHDLHVSDQ-LQRL 188

Query: 135 SFDGKDLA-EILQGNSLYDLISPNKKEESSSQSNDDSRLFQIYKEIASHRQENLSVEAYF 194
           S DGKDLA  +L  NS+Y+ +  + KE SSSQSN DSR+FQIYKEIA HRQEN S+ +YF
Sbjct: 189 SIDGKDLAGMVLHPNSIYESMGSDTKEASSSQSN-DSRMFQIYKEIAFHRQENSSITSYF 248

Query: 195 TKLEALWVELAYYTADLGQRSSNSAIENLNELTEREKVVQFLVGLNDSYATICTQILLIN 254
           TKLEALW ELA +  DL Q S   A E L+E  EREKV+QFLVGLNDSY+ IC QILL  
Sbjct: 249 TKLEALWDELATFKTDLLQCSCGGATEKLSEYMEREKVMQFLVGLNDSYSKICNQILLST 308

Query: 255 PFPTVEKAYAEISREEKRRELVVALEAVAAKVIQTNWL-LRNQNGRSNNNGDNNYGSDQE 295
           PFPT+EKAY+ + REEK RELVV LE+VA KVIQ NWL L+NQN  S+NNGDNN G  Q 
Sbjct: 309 PFPTMEKAYSAVIREEKHRELVVELESVAGKVIQNNWLDLQNQNAHSSNNGDNNDGVQQL 368

BLAST of CcUC03G046750 vs. NCBI nr
Match: XP_038895285.1 (GATA zinc finger domain-containing protein 11-like isoform X1 [Benincasa hispida])

HSP 1 Score: 216.1 bits (549), Expect = 4.2e-52
Identity = 153/309 (49.51%), Positives = 184/309 (59.55%), Query Frame = 0

Query: 15  SVQKLKKATPDRKPDRKLGNRNVDNNGSKQKLVTPKLNPSPNPNHNKTKPTTPSP---KT 74
           S  KL  + P        G + +D N + + +  P    +P P  ++T+PT  S    K 
Sbjct: 69  STSKLVSSRPKTATSNPNGQKKLDKNSTTKIISKPSSPRTPKPVTHQTRPTPNSKRNVKN 128

Query: 75  KIA--------------KPYSHP------VANQNHPYLRAASDSAVLQPPDISDHLLQRL 134
           +IA                Y H           NH Y  +A   A L    +SD  LQRL
Sbjct: 129 RIASCSGSRSELSSAGKNKYFHSRNGSPNDLGSNHRY--SAGHYATLHDLHVSDQ-LQRL 188

Query: 135 SFD----GKDLA-EILQGNSLYDLISPNKKEESSSQSNDDSRLFQIYKEIASHRQENLSV 194
           S D    GKDLA  +L  NS+Y+ +  + KE SSSQSN DSR+FQIYKEIA HRQEN S+
Sbjct: 189 SIDEFCVGKDLAGMVLHPNSIYESMGSDTKEASSSQSN-DSRMFQIYKEIAFHRQENSSI 248

Query: 195 EAYFTKLEALWVELAYYTADLGQRSSNSAIENLNELTEREKVVQFLVGLNDSYATICTQI 254
            +YFTKLEALW ELA +  DL Q S   A E L+E  EREKV+QFLVGLNDSY+ IC QI
Sbjct: 249 TSYFTKLEALWDELATFKTDLLQCSCGGATEKLSEYMEREKVMQFLVGLNDSYSKICNQI 308

Query: 255 LLINPFPTVEKAYAEISREEKRRELVVALEAVAAKVIQTNWL-LRNQNGRSNNNGDNNYG 295
           LL  PFPT+EKAY+ + REEK RELVV LE+VA KVIQ NWL L+NQN  S+NNGDNN G
Sbjct: 309 LLSTPFPTMEKAYSAVIREEKHRELVVELESVAGKVIQNNWLDLQNQNAHSSNNGDNNDG 368

BLAST of CcUC03G046750 vs. NCBI nr
Match: XP_038895287.1 (hybrid signal transduction histidine kinase L-like isoform X3 [Benincasa hispida])

HSP 1 Score: 212.2 bits (539), Expect = 6.0e-51
Identity = 151/305 (49.51%), Positives = 182/305 (59.67%), Query Frame = 0

Query: 15  SVQKLKKATPDRKPDRKLGNRNVDNNGSKQKLVTPKLNPSPNPNHNKTKPTTPSP---KT 74
           S  KL  + P        G + +D N + + +  P    +P P  ++T+PT  S    K 
Sbjct: 69  STSKLVSSRPKTATSNPNGQKKLDKNSTTKIISKPSSPRTPKPVTHQTRPTPNSKRNVKN 128

Query: 75  KIA--------------KPYSHP------VANQNHPYLRAASDSAVLQPPDISDHLLQRL 134
           +IA                Y H           NH Y  +A   A L    +SD  LQRL
Sbjct: 129 RIASCSGSRSELSSAGKNKYFHSRNGSPNDLGSNHRY--SAGHYATLHDLHVSDQ-LQRL 188

Query: 135 SFDGKDLA-EILQGNSLYDLISPNKKEESSSQSNDDSRLFQIYKEIASHRQENLSVEAYF 194
           S D  DLA  +L  NS+Y+ +  + KE SSSQSN DSR+FQIYKEIA HRQEN S+ +YF
Sbjct: 189 SID--DLAGMVLHPNSIYESMGSDTKEASSSQSN-DSRMFQIYKEIAFHRQENSSITSYF 248

Query: 195 TKLEALWVELAYYTADLGQRSSNSAIENLNELTEREKVVQFLVGLNDSYATICTQILLIN 254
           TKLEALW ELA +  DL Q S   A E L+E  EREKV+QFLVGLNDSY+ IC QILL  
Sbjct: 249 TKLEALWDELATFKTDLLQCSCGGATEKLSEYMEREKVMQFLVGLNDSYSKICNQILLST 308

Query: 255 PFPTVEKAYAEISREEKRRELVVALEAVAAKVIQTNWL-LRNQNGRSNNNGDNNYGSDQE 295
           PFPT+EKAY+ + REEK RELVV LE+VA KVIQ NWL L+NQN  S+NNGDNN G  Q 
Sbjct: 309 PFPTMEKAYSAVIREEKHRELVVELESVAGKVIQNNWLDLQNQNAHSSNNGDNNDGVQQL 367

BLAST of CcUC03G046750 vs. ExPASy TrEMBL
Match: A0A0A0LRE6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G058680 PE=4 SV=1)

HSP 1 Score: 192.6 bits (488), Expect = 2.4e-45
Identity = 147/368 (39.95%), Positives = 204/368 (55.43%), Query Frame = 0

Query: 1   MEGRGILSPQRSRYS-----------VQKLKKATPDR--------KPDRKLGNRNVDNNG 60
           M+ RGI+ P+RS++S            +K  K  P +        KP  +    N ++NG
Sbjct: 1   MDERGIMGPKRSQFSPKPNRKPIDPEAKKSAKKNPKKDPNFPSSSKPKPQAAATNPNSNG 60

Query: 61  SKQK----------------LVTPKLN---PSPNPNH-------------NKTKPTTPSP 120
           +K                  + TPK     P+P+ +H             +K++P TPS 
Sbjct: 61  NKTNPNSETPSSAPPPPPTPISTPKAKSQPPTPSSDHHPPLPHNLSPPRRSKSQPATPSS 120

Query: 121 KTKI-------------AKPYSHPVANQNH----------PYLRAASDSAVLQPPDISDH 180
            +KI             A P + P +  +H          P+    SDS+     DI D 
Sbjct: 121 ASKINFVRRIDNDNSTKAFPKTSPTSGSDHRQKHVKTTADPHSPGYSDSS----HDIGDR 180

Query: 181 LLQRLSFDGKDLAEILQGNSLYDLI-SPNKKEESSSQSNDDSRLFQIYKEIASHRQENLS 240
           LLQRLS +GKDL +IL+GN++ DL+ S N+KEESSS++     + QIY++IASHRQ NLS
Sbjct: 181 LLQRLSSEGKDLDDILKGNTIDDLMGSNNRKEESSSRNVSSLAILQIYQKIASHRQGNLS 240

Query: 241 VEAYFTKLEALWVELAYYTADLGQRSSNSAIENLNELTEREKVVQFLVGLNDSYATICTQ 294
           VE YF KL+ LW ++  Y+++     S   I   +ELTER+KV+QF +GLND Y+ IC+Q
Sbjct: 241 VERYFKKLKKLWNDIGIYSSE-----SVEGIAFWSELTERDKVIQFFIGLNDYYSIICSQ 300

BLAST of CcUC03G046750 vs. ExPASy TrEMBL
Match: A0A0A0LU31 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G058670 PE=4 SV=1)

HSP 1 Score: 187.6 bits (475), Expect = 7.7e-44
Identity = 129/276 (46.74%), Positives = 171/276 (61.96%), Query Frame = 0

Query: 48  TPKLNPSPNPNHNKTK-----PTTPSPKTK-------------IAKPYSHPVANQN---- 107
           +P L P+P+P  + T+     PT PS  +K              A P   P +  +    
Sbjct: 104 SPALPPTPSPTPSPTRKTKCQPTMPSSSSKNNVVRRIYNDNSPKASPKISPTSGSDCHEK 163

Query: 108 ------HPYLRAASDSAVLQPPDISDHLLQRLSFDGKDLAEILQGNSLYDLI-SPNKKEE 167
                 +P   A+SD +     DI   LLQ LSF+GKDL +IL+GNS+ DL+ S NKKEE
Sbjct: 164 AVKTTAYPNSPASSDPS----NDIGHRLLQGLSFEGKDLDDILKGNSIDDLMGSNNKKEE 223

Query: 168 SSSQSNDDSRLFQIYKEIASHRQENLSVEAYFTKLEALWVELAYYTADLGQ-RSSNSAIE 227
           SS ++     + QIY++IASHRQ NLSVE YF KL+ LW ++  Y++D  Q  SSN  I 
Sbjct: 224 SSPRNVSSLAILQIYQKIASHRQGNLSVERYFKKLKKLWNDIGTYSSDFAQGYSSNGTIA 283

Query: 228 NLNELTEREKVVQFLVGLNDSYATICTQILLINPFPTVEKAYAEISREEKRRELVVALEA 287
             +ELTER+KV+QF +GLND Y+ IC+QIL+  PFPTVE+AY+EI REEKRREL VAL  
Sbjct: 284 FWSELTERDKVMQFFIGLNDYYSIICSQILVNQPFPTVEEAYSEIIREEKRRELFVALGI 343

Query: 288 VAAKVIQTNWLLRNQNGRSNNNGDNNYGSDQEVDDS 294
           +AA+VIQ+++    Q G SNN  + N G DQE+D S
Sbjct: 344 MAAQVIQSSY----QYGSSNNGDNKNLGIDQEIDRS 371

BLAST of CcUC03G046750 vs. ExPASy TrEMBL
Match: A0A6J1C5Z8 (uncharacterized protein LOC111008588 OS=Momordica charantia OX=3673 GN=LOC111008588 PE=4 SV=1)

HSP 1 Score: 180.3 bits (456), Expect = 1.2e-41
Identity = 138/301 (45.85%), Positives = 177/301 (58.80%), Query Frame = 0

Query: 9   PQRSRYSVQK-------LKKATPDRKPDRKLGNRNVDNNGSKQKLVTPKLNPSPNPNHNK 68
           PQR R  V+         K  TP  KP   +         S  +  +P  N SP+  H  
Sbjct: 164 PQRLRNGVEPPRGPTPISKNTTPKPKPAIAI--------ASASRSDSPAANSSPSSKH-- 223

Query: 69  TKPTTPSPKTKIAKPYSHPVANQNHPYLRAASDSAVLQPPDISDHLLQRLSFDGKDLAE- 128
                PSP +       H     + PY  +      L  P +++H LQRLS DGKDLA  
Sbjct: 224 ----LPSPGSA-----QHHDMKNHSPY--SGGTYTPLADPQLNNH-LQRLSIDGKDLASI 283

Query: 129 ILQGNSLYDLISPNKKEESSSQSNDDSRLFQIYKEIASHRQENLSVEAYFTKLEALWVEL 188
           IL  NS+Y+ I  +  EE S QSN   R+FQIYK+IASHRQEN SV +YFTKL+ LW EL
Sbjct: 284 ILHANSIYESIGSDTMEE-SFQSN-APRIFQIYKDIASHRQENSSVTSYFTKLKILWDEL 343

Query: 189 AYYTADLGQRSSNSAIENLNELTEREKVVQFLVGLNDSYATICTQILLINPFPTVEKAYA 248
             Y+ D+ Q  S  A+E L+   EREKV+QFL+GLN+SY+TIC QILLI PFPT+EKAY+
Sbjct: 344 ETYSDDVPQCCSCGAMEKLSGHVEREKVMQFLMGLNNSYSTICPQILLIQPFPTMEKAYS 403

Query: 249 EISREEKRRELVVALEAVAAKVIQTNWLLRNQNGRSNNNGDNNYGSDQEVD---DSNLQS 299
            I REEKR ELV +LE VAAKV++  WLL  QN +S+N  D+  G  +EV+   + N++ 
Sbjct: 404 IIIREEKRMELVTSLEMVAAKVMENKWLL--QNDQSSNGYDD--GIHEEVNGNTEDNVEI 436

BLAST of CcUC03G046750 vs. ExPASy TrEMBL
Match: A0A6J1GTG4 (serine/arginine repetitive matrix protein 1-like OS=Cucurbita moschata OX=3662 GN=LOC111456959 PE=4 SV=1)

HSP 1 Score: 177.2 bits (448), Expect = 1.0e-40
Identity = 141/326 (43.25%), Positives = 172/326 (52.76%), Query Frame = 0

Query: 24  PDRKPDRKLGNRNVDNNGSKQKLVTPKLNPSPNPNHNK------------------TKPT 83
           P +  DR      +D +    KL  P    +P+PN  K                  TKP 
Sbjct: 66  PTKPTDRSSNPPRIDPSRPNSKL-APSRPAAPSPNERKLDTKTAPKTTTRFSSPRPTKPI 125

Query: 84  TPSPKT--------------KIAKPYSHPVA------------NQNHPYLRAASDSA--- 143
           TP  K+                AKP                   Q+   +R+ SD +   
Sbjct: 126 TPPSKSNGKGASGSGSRSDFSRAKPSDSQKGTPKNLRSGRLNDQQDEQIVRSYSDGSYGA 185

Query: 144 -VLQPPDISDHLLQRLSFDGKDLAEI-LQGNSLYD-LISPNKKEESSSQSNDDSRLFQIY 203
             L  PD+  H L +LS D KDLA I L  N +Y+ L S  K+EE SSQ N+ SR+FQIY
Sbjct: 186 RTLSDPDV--HKLHQLSLDDKDLANIVLHANLVYESLASETKEEECSSQGNNSSRMFQIY 245

Query: 204 KEIASHRQENLSVEAYFTKLEALWVELAYYTADLGQRSSNSAIENLNELTEREKVVQFLV 263
           KEIASH Q N S+ +Y TKL+ALW EL  Y      + S  + E  +E  EREKV+QFL+
Sbjct: 246 KEIASHHQGNSSITSYITKLKALWDELEAYIDT--PKCSCGSTEKQSEQIEREKVMQFLI 305

Query: 264 GLNDSYATICTQILLINPFPTVEKAYAEISREEKRRELVVALEAVAAKVIQTNWLLRNQN 300
           GLNDSY+TIC QIL + PFPTVEKA   I REEKRRELV++LE VAAKVIQ NWLL  QN
Sbjct: 306 GLNDSYSTICAQILHMKPFPTVEKACCAILREEKRRELVLSLEIVAAKVIQNNWLL--QN 365

BLAST of CcUC03G046750 vs. ExPASy TrEMBL
Match: A0A6J1C7L7 (uncharacterized protein LOC111008986 OS=Momordica charantia OX=3673 GN=LOC111008986 PE=4 SV=1)

HSP 1 Score: 142.9 bits (359), Expect = 2.2e-30
Identity = 104/229 (45.41%), Positives = 136/229 (59.39%), Query Frame = 0

Query: 26  RKPDRKLGNRNVDNNGSKQKLVTPKLNPSPNPNHNKTKPTTPSPK-TKIAKPYS-----H 85
           R+P     N N  N+ S  KL   K+  S N N N  +     P    +A   S     H
Sbjct: 88  RRPTWAPNNPNGHNDNSTTKLTIAKITTSRNSNVNGVQQQPRGPTLISVASGSSHGSGHH 147

Query: 86  PVANQNHPYLRAASDSAVLQPPDISDHLLQRLSFDGKDLAE-ILQGNSLYDLISPNKKEE 145
             AN N+        + V+  P + +  LQ+LS DGK  A+ + + NS+ + + P  KEE
Sbjct: 148 QDANNNNIEGEEEDSTTVIGHPHVINQ-LQQLSIDGKHHAKMVFRANSMDESVGPYTKEE 207

Query: 146 SSSQSNDDSRLFQIYKEIASHRQENLSVEAYFTKLEALWVELAYYTADLGQRSSNSAIEN 205
            S QSN + R+ +IYK+IASHRQ N S+ +YFTKLE LW EL  Y +DL Q  S SA + 
Sbjct: 208 CSPQSNAE-RILEIYKDIASHRQGNSSITSYFTKLETLWEELETY-SDLPQCCSYSATDQ 267

Query: 206 L-NELTEREKVVQFLVGLNDSYATICTQILLINPFPTVEKAYAEISREE 247
             ++L EREKV+QFLVGLNDSY+TIC+QILLI PFPTVEKAY+ I  +E
Sbjct: 268 KPSKLVEREKVMQFLVGLNDSYSTICSQILLIRPFPTVEKAYSIIIMQE 313

BLAST of CcUC03G046750 vs. TAIR 10
Match: AT1G21280.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 52.0 bits (123), Expect = 9.7e-07
Identity = 32/106 (30.19%), Positives = 56/106 (52.83%), Query Frame = 0

Query: 146 DSRLFQIYKEIASHRQENLSVEAYFTKLEALWVELAYYTADLGQRSSNSAIE---NLNEL 205
           D +++Q+ + +A+ RQ   SVE YF KL  +W+EL+ Y      +      E      E 
Sbjct: 124 DLKIYQLRRRLATLRQGGDSVEEYFGKLSKVWMELSEYAPIPECKCGGCNCECTKRAEEA 183

Query: 206 TEREKVVQFLVG--LNDSYATICTQILLINPFPTVEKAYAEISREE 247
            E+E+  +FL+G  LN  +  + T+I+   P P++ +A+A +   E
Sbjct: 184 REKEQRYEFLMGLKLNQGFEAVTTKIMFQKPPPSLHEAFAMVKDAE 229
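
The identity and positives percentages in the HSP headers above are ratios of matching columns to alignment length. A minimal sketch, using the counts from the first HSP (XP_038895148.1) and assuming the percentages are rounded to two decimals, as the reported figures suggest. The function name `percent` is illustrative.

```python
def percent(matches, aligned_length):
    """Share of alignment columns that match, as a percentage rounded to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the first HSP header: 188/358 identical, 226/358 positive.
identity = percent(188, 358)
positives = percent(226, 358)
print(identity, positives)
```

Note that the denominator is the alignment length including gaps (358 columns here), not the 299-residue query length.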

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038895148.1  1.8e-71   52.51     proline-rich receptor-like protein kinase PERK2 isoform X1 [Benincasa hispida]
XP_038895149.1  1.1e-68   51.96     proline-rich receptor-like protein kinase PERK2 isoform X2 [Benincasa hispida]
XP_038895286.1  7.6e-54   50.16     hybrid signal transduction histidine kinase L-like isoform X2 [Benincasa hispida...
XP_038895285.1  4.2e-52   49.51     GATA zinc finger domain-containing protein 11-like isoform X1 [Benincasa hispida...
XP_038895287.1  6.0e-51   49.51     hybrid signal transduction histidine kinase L-like isoform X3 [Benincasa hispida...

Match Name      E-value   Identity  Description
A0A0A0LRE6      2.4e-45   39.95     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G058680 PE=4 SV=1
A0A0A0LU31      7.7e-44   46.74     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G058670 PE=4 SV=1
A0A6J1C5Z8      1.2e-41   45.85     uncharacterized protein LOC111008588 OS=Momordica charantia OX=3673 GN=LOC111008...
A0A6J1GTG4      1.0e-40   43.25     serine/arginine repetitive matrix protein 1-like OS=Cucurbita moschata OX=3662 G...
A0A6J1C7L7      2.2e-30   45.41     uncharacterized protein LOC111008986 OS=Momordica charantia OX=3673 GN=LOC111008...

Match Name      E-value   Identity  Description
AT1G21280.1     9.7e-07   30.19     CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Ha...
InterPro
Analysis Name: InterPro Annotations of Watermelon (PI 537277) v1
Date Performed: 2022-01-31
IPR Term  IPR Description   Source       Source Term    Source Description    Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 272..299
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 18..36
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 37..72
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 1..84
None      No IPR available  PANTHER      PTHR34222:SF6  OS02G0671800 PROTEIN  coord: 136..270
None      No IPR available  PANTHER      PTHR34222      FAMILY NOT NAMED      coord: 136..270

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CcUC03G046750.1  CcUC03G046750.1  mRNA