Clc08G04840.1 (mRNA) Watermelon (cordophanus) v2

Overview
NameClc08G04840.1
TypemRNA
OrganismCitrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
DescriptionUnknown protein
LocationClcChr08: 14737048 .. 14738379 (+)
Sequence length1332
RNA-Seq ExpressionClc08G04840.1
SyntenyClc08G04840.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CGTGGATAAAAATAATATATTCAAACTCCCCAATCATTATCCGCCACGTCTCAATTTAAATCTTACATTTTCATCTTTGATATTCGCCCTCTATAACCTCCGGCTATTCCCACTCATAATTTCTATTTTCCCCTTTGCAAAATCTGTTTGGCAGCCGAGAAAATTACTTTGTTTTTTATTTTCTTTTCCTTGGAGAATTCAACGACGATGGAAGTTCCTGTTCTGAACAGGATTACTGAATTAGGAGCAGATCTGAGCTCGCTTCCAAATCCCAATTTTCTCTCTCGAATTTTCACTTCCTTTTCCCCATCAGAACATTTCTGGAAATGGAGTGCTCTGATTATTACTTTGTTGGCCACATTTACCGGAATAATCAATCGGGTCAAGATTTTGATCATCGTCATTCGCCGGAGAACTCGAACAACTTCGATTTCCGAACCTCTCTACCGATCTCCCCACAGTGGAGAGACCGGTGGTTTAGTTTCAGAGAATATCAAATCTCCCCTGTTTCCAAGCTCGGAATCGGAGGATGAGAATGAGGGGGACCAGAAACCGGACAACGGCTTGAATTTCCGGGTTAAAGGCTCAGGGTGGTTTCCTGGTGAATTTGACGATCGGTGTTGTTCTCGTCTCCGGCGGCGGCACTTCGACGGAAGAGACGACGGAGATTTGTTTTCGTGGCCGTGTTTTGGGTTGGAGAGGAGCGTAGTGAGGCAATGGGGTGATGTCAAATTGAAATGCGAGTTAGACAAGTTGAGTGGGAGTCTGATTTCGTTGTACGATGAGAATGAAGAGGCGGAGATCTGCTCCATTTTTAGCGGCGGAGCTCCGCTGCAAGCGGCGGCGGTATCGCCGAGAAGAATGGTGGTGGCCGCCAGTGAGGGCGCTTCGGCCAATGTATCGCTGAAGCTTTGGGACACGCGTGGTCGGAGCCGGAAGCCGGTGGTGGCGGCGGAGTGGGATTCGCCGTCGGGAAAGATTGTCGACGTCTATTACGACGATGTAGAAAATGTCTATCTTAGAGATAATGGAGCCGCCGGAATAATGGTCGGCGATGTTAGAAAAGTTAGTTCGGCGTCGGAGAATTCGCCGGCGGGCGGCGGTGACGGCTTGTGGGAGTCGGGTCACTAGCGCCGTTTATAGTTTTGTGAGAGAAATTTGGTGGTAACAACGGTGAATGGGACGGTGGCTCGATTGGCACTTTTGTAAATATTGAAATTGTTTGAATTTTCGAGCAATAAAGATTTTGATTTTAGTTTCGCTCGACAATGGATTGATGTGGTCGTCGTGATTATGTGAATATTATGCTAAATGTTTTTTGTTTTTTTTTTT

mRNA sequence

CGTGGATAAAAATAATATATTCAAACTCCCCAATCATTATCCGCCACGTCTCAATTTAAATCTTACATTTTCATCTTTGATATTCGCCCTCTATAACCTCCGGCTATTCCCACTCATAATTTCTATTTTCCCCTTTGCAAAATCTGTTTGGCAGCCGAGAAAATTACTTTGTTTTTTATTTTCTTTTCCTTGGAGAATTCAACGACGATGGAAGTTCCTGTTCTGAACAGGATTACTGAATTAGGAGCAGATCTGAGCTCGCTTCCAAATCCCAATTTTCTCTCTCGAATTTTCACTTCCTTTTCCCCATCAGAACATTTCTGGAAATGGAGTGCTCTGATTATTACTTTGTTGGCCACATTTACCGGAATAATCAATCGGGTCAAGATTTTGATCATCGTCATTCGCCGGAGAACTCGAACAACTTCGATTTCCGAACCTCTCTACCGATCTCCCCACAGTGGAGAGACCGGTGGTTTAGTTTCAGAGAATATCAAATCTCCCCTGTTTCCAAGCTCGGAATCGGAGGATGAGAATGAGGGGGACCAGAAACCGGACAACGGCTTGAATTTCCGGGTTAAAGGCTCAGGGTGGTTTCCTGGTGAATTTGACGATCGGTGTTGTTCTCGTCTCCGGCGGCGGCACTTCGACGGAAGAGACGACGGAGATTTGTTTTCGTGGCCGTGTTTTGGGTTGGAGAGGAGCGTAGTGAGGCAATGGGGTGATGTCAAATTGAAATGCGAGTTAGACAAGTTGAGTGGGAGTCTGATTTCGTTGTACGATGAGAATGAAGAGGCGGAGATCTGCTCCATTTTTAGCGGCGGAGCTCCGCTGCAAGCGGCGGCGGTATCGCCGAGAAGAATGGTGGTGGCCGCCAGTGAGGGCGCTTCGGCCAATGTATCGCTGAAGCTTTGGGACACGCGTGGTCGGAGCCGGAAGCCGGTGGTGGCGGCGGAGTGGGATTCGCCGTCGGGAAAGATTGTCGACGTCTATTACGACGATGTAGAAAATGTCTATCTTAGAGATAATGGAGCCGCCGGAATAATGGTCGGCGATGTTAGAAAAGTTAGTTCGGCGTCGGAGAATTCGCCGGCGGGCGGCGGTGACGGCTTGTGGGAGTCGGGTCACTAGCGCCGTTTATAGTTTTGTGAGAGAAATTTGGTGGTAACAACGGTGAATGGGACGGTGGCTCGATTGGCACTTTTGTAAATATTGAAATTGTTTGAATTTTCGAGCAATAAAGATTTTGATTTTAGTTTCGCTCGACAATGGATTGATGTGGTCGTCGTGATTATGTGAATATTATGCTAAATGTTTTTTGTTTTTTTTTTT

Coding sequence (CDS)

ATGGAAGTTCCTGTTCTGAACAGGATTACTGAATTAGGAGCAGATCTGAGCTCGCTTCCAAATCCCAATTTTCTCTCTCGAATTTTCACTTCCTTTTCCCCATCAGAACATTTCTGGAAATGGAGTGCTCTGATTATTACTTTGTTGGCCACATTTACCGGAATAATCAATCGGGTCAAGATTTTGATCATCGTCATTCGCCGGAGAACTCGAACAACTTCGATTTCCGAACCTCTCTACCGATCTCCCCACAGTGGAGAGACCGGTGGTTTAGTTTCAGAGAATATCAAATCTCCCCTGTTTCCAAGCTCGGAATCGGAGGATGAGAATGAGGGGGACCAGAAACCGGACAACGGCTTGAATTTCCGGGTTAAAGGCTCAGGGTGGTTTCCTGGTGAATTTGACGATCGGTGTTGTTCTCGTCTCCGGCGGCGGCACTTCGACGGAAGAGACGACGGAGATTTGTTTTCGTGGCCGTGTTTTGGGTTGGAGAGGAGCGTAGTGAGGCAATGGGGTGATGTCAAATTGAAATGCGAGTTAGACAAGTTGAGTGGGAGTCTGATTTCGTTGTACGATGAGAATGAAGAGGCGGAGATCTGCTCCATTTTTAGCGGCGGAGCTCCGCTGCAAGCGGCGGCGGTATCGCCGAGAAGAATGGTGGTGGCCGCCAGTGAGGGCGCTTCGGCCAATGTATCGCTGAAGCTTTGGGACACGCGTGGTCGGAGCCGGAAGCCGGTGGTGGCGGCGGAGTGGGATTCGCCGTCGGGAAAGATTGTCGACGTCTATTACGACGATGTAGAAAATGTCTATCTTAGAGATAATGGAGCCGCCGGAATAATGGTCGGCGATGTTAGAAAAGTTAGTTCGGCGTCGGAGAATTCGCCGGCGGGCGGCGGTGACGGCTTGTGGGAGTCGGGTCACTAG

Protein sequence

MEVPVLNRITELGADLSSLPNPNFLSRIFTSFSPSEHFWKWSALIITLLATFTGIINRVKILIIVIRRRTRTTSISEPLYRSPHSGETGGLVSENIKSPLFPSSESEDENEGDQKPDNGLNFRVKGSGWFPGEFDDRCCSRLRRRHFDGRDDGDLFSWPCFGLERSVVRQWGDVKLKCELDKLSGSLISLYDENEEAEICSIFSGGAPLQAAAVSPRRMVVAASEGASANVSLKLWDTRGRSRKPVVAAEWDSPSGKIVDVYYDDVENVYLRDNGAAGIMVGDVRKVSSASENSPAGGGDGLWESGH
Homology
BLAST of Clc08G04840.1 vs. NCBI nr
Match: XP_038885552.1 (uncharacterized protein LOC120075889 [Benincasa hispida])

HSP 1 Score: 531.6 bits (1368), Expect = 4.6e-147
Identity = 265/307 (86.32%), Postives = 280/307 (91.21%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLSSLPNPNFLSRIFTSFSPSEHFWKWSALIITLLATFTGIINRVK 60
           MEVPVLNRITE+GADLSSLPNP FLSRIFTSFSPS+HFWKW ALII LLATFTGIINRVK
Sbjct: 1   MEVPVLNRITEIGADLSSLPNPKFLSRIFTSFSPSQHFWKWGALIIALLATFTGIINRVK 60

Query: 61  ILIIVIRRRTRTTSISEPLYRSPHSGETGGLVSENIKSPLFPSSESEDENEGDQKPDNGL 120
           ILIIVIRRRTRTTSISEPLYRS H GETGGLVSEN+KSPL  SSESEDENEGD+K D+  
Sbjct: 61  ILIIVIRRRTRTTSISEPLYRSLHGGETGGLVSENLKSPLLSSSESEDENEGDRKQDDSS 120

Query: 121 NFRVKGSGWFPGEFDDRCCSRLRRRHFDGRDDGDLFSWPCFGLERSVVRQWGDVKLKCEL 180
           +FRVKGSG F GEFD RCCSRLRRRH DG  DGDLFSWPCFG + SVVRQWGDVKLKCE 
Sbjct: 121 DFRVKGSGRFSGEFDGRCCSRLRRRHCDGGGDGDLFSWPCFGSDGSVVRQWGDVKLKCEF 180

Query: 181 DKLSGSLISLYDENEEAEICSIFSGGAPLQAAAVSPRRMVVAASEGASANVSLKLWDTRG 240
           ++LSGS+ISLYDENEE EICSIF+GG PLQAAA+SPR+MVVAASEG SANVSLKLWD RG
Sbjct: 181 EELSGSVISLYDENEETEICSIFNGGTPLQAAALSPRKMVVAASEGVSANVSLKLWDMRG 240

Query: 241 RSRKPVVAAEWDSPSGKIVDVYYDDVENVYLRDNGAAGIMVGDVRKVSSASENSPAGGGD 300
           RSR+PVVAAEWDSPSGKIVDVYY+DVENVYLRD GAAGIMVGDVRKVSSAS NSP G GD
Sbjct: 241 RSRRPVVAAEWDSPSGKIVDVYYEDVENVYLRDKGAAGIMVGDVRKVSSASGNSPVGSGD 300

Query: 301 GLWESGH 308
           GLWESGH
Sbjct: 301 GLWESGH 307

BLAST of Clc08G04840.1 vs. NCBI nr
Match: XP_011656511.1 (uncharacterized protein LOC105435751 [Cucumis sativus] >KGN45941.1 hypothetical protein Csa_004792 [Cucumis sativus])

HSP 1 Score: 456.1 bits (1172), Expect = 2.5e-124
Identity = 231/306 (75.49%), Postives = 258/306 (84.31%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLSSLPNPNFLSRIFTSFSPSEHFWKWSALIITLLATFTGIINRVK 60
           MEVPVLNRITELG +L SLPNP+FLSRIFTS  PS+HFWKW+ALII  LATF GIINRVK
Sbjct: 1   MEVPVLNRITELGPNLGSLPNPSFLSRIFTSVFPSQHFWKWAALIIAFLATFPGIINRVK 60

Query: 61  ILIIVIRRRTRTTSISEPLYRSPHSGETGGLVSENIKSPLFPSSESEDENEGDQKPDNGL 120
           + IIV RRRT+TTSISEPLYRS H G++ GLVS+N+KSPL  SSESEDENE D++ +N  
Sbjct: 61  VFIIVCRRRTKTTSISEPLYRSLHFGDSRGLVSKNLKSPLLSSSESEDENERDREHNNDS 120

Query: 121 NFRVKGSGWFPGEFDDRCCSRLRRRHFDGRDDGDLFSWPCFGLERSVVRQWGDVKLKCEL 180
           +FRVKGS  F GEFD  C SR RRR  +G  +GDLFSWPCFGLERSVVRQWGDVKLKCE 
Sbjct: 121 DFRVKGSSLFSGEFDGGCRSRHRRRPCNGGGNGDLFSWPCFGLERSVVRQWGDVKLKCEF 180

Query: 181 DKLSGSLISLYDENEEAEICSIFSGGAPLQAAAVSPRRMVVAASEGASANVSLKLWDTRG 240
           ++LSGS+ISLYD NEEAEICSI SGG  L+AAAVSPRRMVVAA+EG SANVSLKLWDTRG
Sbjct: 181 EELSGSMISLYDVNEEAEICSILSGGGSLKAAAVSPRRMVVAANEGVSANVSLKLWDTRG 240

Query: 241 RSRKPVVAAEWDSPSGKIVDVYYDDVENVYLRDNGAAGIMVGDVRKVSSASENSPAGGGD 300
           RSR PVV  EWDSPSG IVDVYY+DV N+Y+RDN AAGIM+GDVR+ SS  E   AGGG+
Sbjct: 241 RSRTPVVGMEWDSPSGNIVDVYYEDVGNLYVRDNEAAGIMIGDVRRASSGWEKLTAGGGE 300

Query: 301 GLWESG 307
           GLWE G
Sbjct: 301 GLWEVG 306

BLAST of Clc08G04840.1 vs. NCBI nr
Match: XP_008445638.2 (PREDICTED: uncharacterized protein LOC103488596 [Cucumis melo] >KAA0036127.1 uncharacterized protein E6C27_scaffold338G00340 [Cucumis melo var. makuwa] >TYK18508.1 uncharacterized protein E5676_scaffold5066G00090 [Cucumis melo var. makuwa])

HSP 1 Score: 452.6 bits (1163), Expect = 2.7e-123
Identity = 232/307 (75.57%), Postives = 253/307 (82.41%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLSSLPNPNFLSRIFTSFSPSEHFWKWSALIITLLATFTGIINRVK 60
           MEVPVLNRITELGA L SLPNPNFLSRIFTSF PS+HFWKW ALII LLATFTGIINRVK
Sbjct: 1   MEVPVLNRITELGAHLGSLPNPNFLSRIFTSFFPSQHFWKWGALIIALLATFTGIINRVK 60

Query: 61  ILIIVIRRRTRTTSISEPLYRSPHSGETGGLVSENIKSPLFPSSESEDENEGDQKPDNGL 120
           + II+ RRRT+TTSISEPLYRS H  ++GGLVS+N+KSP   SSESEDENE  ++ +N  
Sbjct: 61  VFIIICRRRTKTTSISEPLYRSLHCRDSGGLVSKNLKSPPLSSSESEDENERGRERNNDS 120

Query: 121 NFRVKGSGWFPGEFDDRCCSRLRRRHFDGRDDGDLFSWPCFGLERSVVRQWGDVKLKCEL 180
           NFRVK S  F  E D  C SRLRRRH +G  +GDLF WPCFGL+RSVVRQWGDV    E 
Sbjct: 121 NFRVKVSSRFSCELDGGCHSRLRRRHCNGGSNGDLFPWPCFGLDRSVVRQWGDVISNSEF 180

Query: 181 DKLSGSLISLYDENEEAEICSIFSGGAPLQAAAVSPRRMVVAASEGASANVSLKLWDTRG 240
           ++LSGS+ISLYD+NEEAEICSIF+ G  L+A AVSPRRMVVAASEG SANVSLKLWDTRG
Sbjct: 181 EELSGSMISLYDDNEEAEICSIFNEGGSLKAVAVSPRRMVVAASEGVSANVSLKLWDTRG 240

Query: 241 RSRKPVVAAEWDSPSGKIVDVYYDDVENVYLRDNGAAGIMVGDVRKVSSASENSPAGGGD 300
           RSR PVVA EWDSPS  IVDVYY+DVENVYLRDNGAAGIMVGDVRK SS SE   AG G 
Sbjct: 241 RSRTPVVAVEWDSPSRNIVDVYYEDVENVYLRDNGAAGIMVGDVRKASSGSEKLTAGDGG 300

Query: 301 GLWESGH 308
           GLW  GH
Sbjct: 301 GLWNLGH 307

BLAST of Clc08G04840.1 vs. NCBI nr
Match: XP_023550996.1 (uncharacterized protein LOC111808968 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 393.7 bits (1010), Expect = 1.5e-105
Identity = 212/308 (68.83%), Postives = 239/308 (77.60%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLSSLPNPNFLSRIFTSFSPSEHFWKWSALIITLLATFTGIINRVK 60
           MEVPVLNRI ELG+DL        L RIFTS SPS HFWKW A++ITLLATF+GIINR+K
Sbjct: 50  MEVPVLNRIAELGSDLG-------LLRIFTSISPSPHFWKWGAVLITLLATFSGIINRLK 109

Query: 61  ILIIVIRRRTRTTSISEPLYRSPHSGETGGLVSENIKSPLFPSSESEDENEGDQKPDNGL 120
           ILIIVIRR +RTT ISEPL RS H GET GLVSEN++S L  SSESEDE E D++PD+G 
Sbjct: 110 ILIIVIRRSSRTTPISEPLSRSLHGGETAGLVSENLRSSLLSSSESEDEKEADREPDDGS 169

Query: 121 NFRVKGSGWFPGEFDDRCCSRLRRRHFDGRD-DGDLFSWPCFGLERSVVRQWGDVKLKCE 180
           +FRVKG+           CSRLRRR+  G D D   +SWPCFG ERSVVRQWGDVKLKCE
Sbjct: 170 DFRVKGTA---------RCSRLRRRYGGGSDGDSVPWSWPCFGSERSVVRQWGDVKLKCE 229

Query: 181 LDKLSGSLISLYDENEEAEICSIFSGGAPLQAAAVSPRRMVVAASEGASANVSLKLWDTR 240
            +KLSGS+ISLYDENEEAEICSIFS GAPLQ AA+SP RMVVAA E  S++VSLKLWDTR
Sbjct: 230 FEKLSGSVISLYDENEEAEICSIFSDGAPLQVAALSPGRMVVAAGERVSSSVSLKLWDTR 289

Query: 241 GRSRKPVVAAEWDSPSGKIVDVYYDDVENVYLRDNGAAGIMVGDVRKVSSASENSPAGGG 300
            RSR  V+AAEW+SPSGK+VDVY +DV+ +YLRDNG   I+VGDVRKV   SENS A   
Sbjct: 290 CRSRTSVLAAEWNSPSGKVVDVYSNDVDKIYLRDNGGDRIVVGDVRKVRPVSENSSAVDS 341

Query: 301 DGLWESGH 308
            G WES H
Sbjct: 350 GGWWESEH 341

BLAST of Clc08G04840.1 vs. NCBI nr
Match: XP_022971770.1 (uncharacterized protein LOC111470454 [Cucurbita maxima])

HSP 1 Score: 390.6 bits (1002), Expect = 1.3e-104
Identity = 204/307 (66.45%), Postives = 239/307 (77.85%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLSSLPNPNFLSRIFTSFSPSEHFWKWSALIITLLATFTGIINRVK 60
           MEVPVLN IT  G +L S+P+PNFLSRIFTSFS  + FWKW AL I LLATF+GIINR+K
Sbjct: 1   MEVPVLNMITGFGRELGSIPDPNFLSRIFTSFSIFQQFWKWGALFIALLATFSGIINRIK 60

Query: 61  ILIIVIRRRTRTTSISEPLYRSPHSGETGGLVSENIKSPLFPSSESEDENEGDQ--KPDN 120
             +IVI RRTRTT ISEPL  S H GE GGL+SEN +SP   SSESEDENEGDQ  +PD+
Sbjct: 61  TSVIVIHRRTRTTPISEPLSSSLHGGENGGLISENFRSPPLSSSESEDENEGDQDREPDD 120

Query: 121 GLNFRVKGSGWFPGEFDDRCCSRLRRRHFDGRDDGDLFSWPCFGLERSVVRQWGDVKLKC 180
            L+F VKGS  F GEFDDR  + LRRRH     +GD FSWPCF  ++SVV+QWGDVKLKC
Sbjct: 121 RLDFLVKGSVRFSGEFDDRRFTGLRRRHGSRGGNGDSFSWPCFVSDKSVVKQWGDVKLKC 180

Query: 181 ELDKLSGSLISLYDENEEAEICSIFSGGAPLQAAAVSPRRMVVAASEGASANVSLKLWDT 240
           E ++LSGS+I +YDENEEAEICSIFSGG PL+AAA+S  +MVVAA E    N+SLK+WDT
Sbjct: 181 EFEELSGSVILVYDENEEAEICSIFSGGDPLKAAALSAAKMVVAARESGLGNMSLKIWDT 240

Query: 241 RGRSRKPVVAAEWDSPSGKIVDVYYDDVENVYLRDNGAAGIMVGDVRKVSSASENSPAGG 300
           R RS+ PV+AAEW+SP  KIVDVY +++E V + D GAAG+MVGDVRK  SASE    GG
Sbjct: 241 RDRSQTPVIAAEWNSP--KIVDVYSEEIEKVDIGDKGAAGMMVGDVRKFWSASEKWRKGG 300

Query: 301 GDGLWES 306
           G+G WES
Sbjct: 301 GEGWWES 305

BLAST of Clc08G04840.1 vs. ExPASy TrEMBL
Match: A0A0A0KDT7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G028940 PE=4 SV=1)

HSP 1 Score: 456.1 bits (1172), Expect = 1.2e-124
Identity = 231/306 (75.49%), Postives = 258/306 (84.31%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLSSLPNPNFLSRIFTSFSPSEHFWKWSALIITLLATFTGIINRVK 60
           MEVPVLNRITELG +L SLPNP+FLSRIFTS  PS+HFWKW+ALII  LATF GIINRVK
Sbjct: 1   MEVPVLNRITELGPNLGSLPNPSFLSRIFTSVFPSQHFWKWAALIIAFLATFPGIINRVK 60

Query: 61  ILIIVIRRRTRTTSISEPLYRSPHSGETGGLVSENIKSPLFPSSESEDENEGDQKPDNGL 120
           + IIV RRRT+TTSISEPLYRS H G++ GLVS+N+KSPL  SSESEDENE D++ +N  
Sbjct: 61  VFIIVCRRRTKTTSISEPLYRSLHFGDSRGLVSKNLKSPLLSSSESEDENERDREHNNDS 120

Query: 121 NFRVKGSGWFPGEFDDRCCSRLRRRHFDGRDDGDLFSWPCFGLERSVVRQWGDVKLKCEL 180
           +FRVKGS  F GEFD  C SR RRR  +G  +GDLFSWPCFGLERSVVRQWGDVKLKCE 
Sbjct: 121 DFRVKGSSLFSGEFDGGCRSRHRRRPCNGGGNGDLFSWPCFGLERSVVRQWGDVKLKCEF 180

Query: 181 DKLSGSLISLYDENEEAEICSIFSGGAPLQAAAVSPRRMVVAASEGASANVSLKLWDTRG 240
           ++LSGS+ISLYD NEEAEICSI SGG  L+AAAVSPRRMVVAA+EG SANVSLKLWDTRG
Sbjct: 181 EELSGSMISLYDVNEEAEICSILSGGGSLKAAAVSPRRMVVAANEGVSANVSLKLWDTRG 240

Query: 241 RSRKPVVAAEWDSPSGKIVDVYYDDVENVYLRDNGAAGIMVGDVRKVSSASENSPAGGGD 300
           RSR PVV  EWDSPSG IVDVYY+DV N+Y+RDN AAGIM+GDVR+ SS  E   AGGG+
Sbjct: 241 RSRTPVVGMEWDSPSGNIVDVYYEDVGNLYVRDNEAAGIMIGDVRRASSGWEKLTAGGGE 300

Query: 301 GLWESG 307
           GLWE G
Sbjct: 301 GLWEVG 306

BLAST of Clc08G04840.1 vs. ExPASy TrEMBL
Match: A0A5A7T3F5 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold5066G00090 PE=4 SV=1)

HSP 1 Score: 452.6 bits (1163), Expect = 1.3e-123
Identity = 232/307 (75.57%), Postives = 253/307 (82.41%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLSSLPNPNFLSRIFTSFSPSEHFWKWSALIITLLATFTGIINRVK 60
           MEVPVLNRITELGA L SLPNPNFLSRIFTSF PS+HFWKW ALII LLATFTGIINRVK
Sbjct: 1   MEVPVLNRITELGAHLGSLPNPNFLSRIFTSFFPSQHFWKWGALIIALLATFTGIINRVK 60

Query: 61  ILIIVIRRRTRTTSISEPLYRSPHSGETGGLVSENIKSPLFPSSESEDENEGDQKPDNGL 120
           + II+ RRRT+TTSISEPLYRS H  ++GGLVS+N+KSP   SSESEDENE  ++ +N  
Sbjct: 61  VFIIICRRRTKTTSISEPLYRSLHCRDSGGLVSKNLKSPPLSSSESEDENERGRERNNDS 120

Query: 121 NFRVKGSGWFPGEFDDRCCSRLRRRHFDGRDDGDLFSWPCFGLERSVVRQWGDVKLKCEL 180
           NFRVK S  F  E D  C SRLRRRH +G  +GDLF WPCFGL+RSVVRQWGDV    E 
Sbjct: 121 NFRVKVSSRFSCELDGGCHSRLRRRHCNGGSNGDLFPWPCFGLDRSVVRQWGDVISNSEF 180

Query: 181 DKLSGSLISLYDENEEAEICSIFSGGAPLQAAAVSPRRMVVAASEGASANVSLKLWDTRG 240
           ++LSGS+ISLYD+NEEAEICSIF+ G  L+A AVSPRRMVVAASEG SANVSLKLWDTRG
Sbjct: 181 EELSGSMISLYDDNEEAEICSIFNEGGSLKAVAVSPRRMVVAASEGVSANVSLKLWDTRG 240

Query: 241 RSRKPVVAAEWDSPSGKIVDVYYDDVENVYLRDNGAAGIMVGDVRKVSSASENSPAGGGD 300
           RSR PVVA EWDSPS  IVDVYY+DVENVYLRDNGAAGIMVGDVRK SS SE   AG G 
Sbjct: 241 RSRTPVVAVEWDSPSRNIVDVYYEDVENVYLRDNGAAGIMVGDVRKASSGSEKLTAGDGG 300

Query: 301 GLWESGH 308
           GLW  GH
Sbjct: 301 GLWNLGH 307

BLAST of Clc08G04840.1 vs. ExPASy TrEMBL
Match: A0A1S3BCP7 (uncharacterized protein LOC103488596 OS=Cucumis melo OX=3656 GN=LOC103488596 PE=4 SV=1)

HSP 1 Score: 452.6 bits (1163), Expect = 1.3e-123
Identity = 232/307 (75.57%), Postives = 253/307 (82.41%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLSSLPNPNFLSRIFTSFSPSEHFWKWSALIITLLATFTGIINRVK 60
           MEVPVLNRITELGA L SLPNPNFLSRIFTSF PS+HFWKW ALII LLATFTGIINRVK
Sbjct: 1   MEVPVLNRITELGAHLGSLPNPNFLSRIFTSFFPSQHFWKWGALIIALLATFTGIINRVK 60

Query: 61  ILIIVIRRRTRTTSISEPLYRSPHSGETGGLVSENIKSPLFPSSESEDENEGDQKPDNGL 120
           + II+ RRRT+TTSISEPLYRS H  ++GGLVS+N+KSP   SSESEDENE  ++ +N  
Sbjct: 61  VFIIICRRRTKTTSISEPLYRSLHCRDSGGLVSKNLKSPPLSSSESEDENERGRERNNDS 120

Query: 121 NFRVKGSGWFPGEFDDRCCSRLRRRHFDGRDDGDLFSWPCFGLERSVVRQWGDVKLKCEL 180
           NFRVK S  F  E D  C SRLRRRH +G  +GDLF WPCFGL+RSVVRQWGDV    E 
Sbjct: 121 NFRVKVSSRFSCELDGGCHSRLRRRHCNGGSNGDLFPWPCFGLDRSVVRQWGDVISNSEF 180

Query: 181 DKLSGSLISLYDENEEAEICSIFSGGAPLQAAAVSPRRMVVAASEGASANVSLKLWDTRG 240
           ++LSGS+ISLYD+NEEAEICSIF+ G  L+A AVSPRRMVVAASEG SANVSLKLWDTRG
Sbjct: 181 EELSGSMISLYDDNEEAEICSIFNEGGSLKAVAVSPRRMVVAASEGVSANVSLKLWDTRG 240

Query: 241 RSRKPVVAAEWDSPSGKIVDVYYDDVENVYLRDNGAAGIMVGDVRKVSSASENSPAGGGD 300
           RSR PVVA EWDSPS  IVDVYY+DVENVYLRDNGAAGIMVGDVRK SS SE   AG G 
Sbjct: 241 RSRTPVVAVEWDSPSRNIVDVYYEDVENVYLRDNGAAGIMVGDVRKASSGSEKLTAGDGG 300

Query: 301 GLWESGH 308
           GLW  GH
Sbjct: 301 GLWNLGH 307

BLAST of Clc08G04840.1 vs. ExPASy TrEMBL
Match: A0A6J1I6N0 (uncharacterized protein LOC111470454 OS=Cucurbita maxima OX=3661 GN=LOC111470454 PE=4 SV=1)

HSP 1 Score: 390.6 bits (1002), Expect = 6.1e-105
Identity = 204/307 (66.45%), Postives = 239/307 (77.85%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLSSLPNPNFLSRIFTSFSPSEHFWKWSALIITLLATFTGIINRVK 60
           MEVPVLN IT  G +L S+P+PNFLSRIFTSFS  + FWKW AL I LLATF+GIINR+K
Sbjct: 1   MEVPVLNMITGFGRELGSIPDPNFLSRIFTSFSIFQQFWKWGALFIALLATFSGIINRIK 60

Query: 61  ILIIVIRRRTRTTSISEPLYRSPHSGETGGLVSENIKSPLFPSSESEDENEGDQ--KPDN 120
             +IVI RRTRTT ISEPL  S H GE GGL+SEN +SP   SSESEDENEGDQ  +PD+
Sbjct: 61  TSVIVIHRRTRTTPISEPLSSSLHGGENGGLISENFRSPPLSSSESEDENEGDQDREPDD 120

Query: 121 GLNFRVKGSGWFPGEFDDRCCSRLRRRHFDGRDDGDLFSWPCFGLERSVVRQWGDVKLKC 180
            L+F VKGS  F GEFDDR  + LRRRH     +GD FSWPCF  ++SVV+QWGDVKLKC
Sbjct: 121 RLDFLVKGSVRFSGEFDDRRFTGLRRRHGSRGGNGDSFSWPCFVSDKSVVKQWGDVKLKC 180

Query: 181 ELDKLSGSLISLYDENEEAEICSIFSGGAPLQAAAVSPRRMVVAASEGASANVSLKLWDT 240
           E ++LSGS+I +YDENEEAEICSIFSGG PL+AAA+S  +MVVAA E    N+SLK+WDT
Sbjct: 181 EFEELSGSVILVYDENEEAEICSIFSGGDPLKAAALSAAKMVVAARESGLGNMSLKIWDT 240

Query: 241 RGRSRKPVVAAEWDSPSGKIVDVYYDDVENVYLRDNGAAGIMVGDVRKVSSASENSPAGG 300
           R RS+ PV+AAEW+SP  KIVDVY +++E V + D GAAG+MVGDVRK  SASE    GG
Sbjct: 241 RDRSQTPVIAAEWNSP--KIVDVYSEEIEKVDIGDKGAAGMMVGDVRKFWSASEKWRKGG 300

Query: 301 GDGLWES 306
           G+G WES
Sbjct: 301 GEGWWES 305

BLAST of Clc08G04840.1 vs. ExPASy TrEMBL
Match: A0A6J1CEG4 (uncharacterized protein LOC111010475 OS=Momordica charantia OX=3673 GN=LOC111010475 PE=4 SV=1)

HSP 1 Score: 388.3 bits (996), Expect = 3.0e-104
Identity = 207/306 (67.65%), Postives = 240/306 (78.43%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLSSLPNPNFLSRIFTSFSPSEHFWKWSALIITLLATFTGIINRVK 60
           MEVPVLNRITELGADLSSLPN NFLSRI TSFSPS+HFWKW A++I LLATF+G+INRVK
Sbjct: 1   MEVPVLNRITELGADLSSLPNANFLSRILTSFSPSQHFWKWGAVVIALLATFSGLINRVK 60

Query: 61  ILIIVIRRRTRTTSISEPLYRSPHSGETGGLVSENIKSPLFPSSESEDENEGDQKPDNGL 120
           ILIIVIRRR RTT I EPL RS H GE GG VSEN+ SP F SSESEDENE    P++G 
Sbjct: 61  ILIIVIRRR-RTTPIYEPLSRSLHGGENGGFVSENLGSPPFFSSESEDENE-LSSPEDGS 120

Query: 121 NFRVKGSGWFPGEFD-DRCCSRLRRRHFDGRDDGDLFSWPCFGLERSVVRQWGDVKLKCE 180
           +F VKGSG    ++   R CS LRRRH+     GD  SW CFG ERSVVRQWG+V+LKC+
Sbjct: 121 DFGVKGSGRSSDDYSCGRRCSGLRRRHY----GGDSLSWSCFGSERSVVRQWGEVQLKCK 180

Query: 181 LDKLSGSLISLYDENEEAEICSIFSGGAPLQAAAVSPRRMVVAASEGASANVSLKLWDTR 240
            D+LSGS+ISLYDENEE EICSIFSGGAP++AAA+SP  MVV+A +    NVS+KLWDTR
Sbjct: 181 FDELSGSVISLYDENEEKEICSIFSGGAPVRAAAMSPAGMVVSAGQSVFGNVSVKLWDTR 240

Query: 241 GRSRKPVVAAEWDSPSGKIVDVYYDDVENVYLRDNGAAGIMVGDVRKVSSASENSPAGGG 300
            RS+ P+VAAEW+SP+ KIVDVYY++ E VYLR++  A + V DVRKV SA ENS  GG 
Sbjct: 241 SRSQTPIVAAEWNSPAAKIVDVYYEESEKVYLRNDSDAKLTVADVRKVCSALENSNLGGV 300

Query: 301 DGLWES 306
           D  W +
Sbjct: 301 DRWWHA 300

BLAST of Clc08G04840.1 vs. TAIR 10
Match: AT1G68440.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G25400.2); Has 86 Blast hits to 86 proteins in 29 species: Archae - 0; Bacteria - 6; Metazoa - 27; Fungi - 11; Plants - 24; Viruses - 0; Other Eukaryotes - 18 (source: NCBI BLink). )

HSP 1 Score: 101.3 bits (251), Expect = 1.4e-21
Identity = 95/327 (29.05%), Postives = 146/327 (44.65%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLSSLPNPNFLSRIFT-----SFSPSEHFWKWSALIITLLATFTGI 60
           MEVPV+NRI +    ++S+ +P+FLSR            +  FWKW ALII  LA FT  
Sbjct: 1   MEVPVINRIRDFEVGINSINDPSFLSRSVAVSGIGKLHQAYGFWKWGALIIAFLAYFTNF 60

Query: 61  INRVKILIIVIRRRTRTTSISEPLYRSPHSGETGGLVSENIKSPLFPSSESEDENEGDQK 120
           ++++  L  V+R R    S+S P     +  ++    S  + S      E ++E+E D +
Sbjct: 61  VSKLNSL--VVRLRKIDVSVSSPTLFDDYDSDSDVSCSSTVSS----DDEKDEEDEADDE 120

Query: 121 PD-----------NGLNFRVKGSGWF---PGEFDDRCCSRLRRRHFDGRDDGDLFSWPCF 180
            +           NG  FRV+GS ++     + D+  C+ + RR+      GDLFSWP  
Sbjct: 121 DEDVDSIFNRRRVNG-GFRVRGSDYYDDDDDQGDNGNCTWMGRRY--SGSFGDLFSWPDL 180

Query: 181 GLERSVVRQWGDVKLKCELDKLSGSLISLYDENEEAEICSIFSGGAPLQAAAVSPRRMVV 240
           G     +   G VKL   LD          D ++   + + F     L+    +      
Sbjct: 181 G----GIGSSGVVKLWDHLD---------IDGDDHENVVATF-----LKNYNSTSSPFFW 240

Query: 241 AASEGASANVSLKLWDTRGRSRKPVVAAEWDSPS---GKIVDVYYDDVENVYLRDNGAAG 300
           AA +     V +K  D R   R P + AEW  P    G I+ V    VE VY+RD+ +  
Sbjct: 241 AAEKKGVDAVKVKACDPRAGFRMPALLAEWRQPGRLLGNIIGVDTGGVEKVYVRDDVSGE 300

Query: 301 IMVGDVRKVSSASENSPAGGGDGLWES 306
           I VGD+RK +    +      +  W++
Sbjct: 301 IAVGDLRKFNGVLTDLTECEAETWWDA 300

BLAST of Clc08G04840.1 vs. TAIR 10
Match: AT1G25400.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G68440.1); Has 21 Blast hits to 21 proteins in 6 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 21; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 88.6 bits (218), Expect = 9.6e-18
Identity = 90/309 (29.13%), Postives = 139/309 (44.98%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLSSLPNPNFLSRIFT-----SFSPSEHFWKWSA-LIITLLATFTG 60
           MEVP++NRI +    ++S+ +P++LSR            +  FWKW A L++   A+FT 
Sbjct: 1   MEVPIINRIGDFDMGINSINDPSYLSRALAVSGVGKLHQAYSFWKWGALLLLAFFASFTS 60

Query: 61  IINRVKILIIVIRRRTRTTSISEPLYRSPHSGETGGLVSENIKSPLFPSSESEDE--NEG 120
           +  R+K L+     R R  ++S P            L + +  S    SS+S DE  +E 
Sbjct: 61  LTTRIKTLVF----RLRNVNVSLPSQTL--------LCNYDSDSDWSFSSDSSDEEKDED 120

Query: 121 DQKPDNGLN--FRVKGSGWFPGEFDDRCCSRLRRRHFDGRDDGDLFSWPCFGLERSVVRQ 180
           D K D+ +N   RV+  G++                 D  D G   S P     R     
Sbjct: 121 DNKEDDSVNGDSRVQRFGYY----------------HDDDDKGISGSVPWL---RRCSGS 180

Query: 181 WGDVKLKCELDKLSGSLISLYD----ENEEAEICSIFS--GGAPLQAAAVSPRRMVVAAS 240
           +GD+     LD  S  ++ L+D      E + + S FS  G   L ++AV     ++AA 
Sbjct: 181 FGDL-----LDLGSSGVVKLWDNLDFNGEGSPVASFFSKCGSYSLLSSAV-----LLAAE 240

Query: 241 EGASANVSLKLWDTRGRSRKPVVAAEWDSPS---GKIVDVYYDDVENVYLRDNGAAGIMV 291
           +  S  + +  WD R     P + AEW  P    GKI+ V   DV+ +Y+ D+    I V
Sbjct: 241 KKGSDGLEVSAWDARVGFGVPALLAEWKQPGRLLGKIIRVDVGDVDKIYVGDDVEGEITV 268

BLAST of Clc08G04840.1 vs. TAIR 10
Match: AT1G25400.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 20 plant structures; EXPRESSED DURING: 10 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G68440.1). )

HSP 1 Score: 88.6 bits (218), Expect = 9.6e-18
Identity = 90/309 (29.13%), Postives = 139/309 (44.98%), Query Frame = 0

Query: 1   MEVPVLNRITELGADLSSLPNPNFLSRIFT-----SFSPSEHFWKWSA-LIITLLATFTG 60
           MEVP++NRI +    ++S+ +P++LSR            +  FWKW A L++   A+FT 
Sbjct: 1   MEVPIINRIGDFDMGINSINDPSYLSRALAVSGVGKLHQAYSFWKWGALLLLAFFASFTS 60

Query: 61  IINRVKILIIVIRRRTRTTSISEPLYRSPHSGETGGLVSENIKSPLFPSSESEDE--NEG 120
           +  R+K L+     R R  ++S P            L + +  S    SS+S DE  +E 
Sbjct: 61  LTTRIKTLVF----RLRNVNVSLPSQTL--------LCNYDSDSDWSFSSDSSDEEKDED 120

Query: 121 DQKPDNGLN--FRVKGSGWFPGEFDDRCCSRLRRRHFDGRDDGDLFSWPCFGLERSVVRQ 180
           D K D+ +N   RV+  G++                 D  D G   S P     R     
Sbjct: 121 DNKEDDSVNGDSRVQRFGYY----------------HDDDDKGISGSVPWL---RRCSGS 180

Query: 181 WGDVKLKCELDKLSGSLISLYD----ENEEAEICSIFS--GGAPLQAAAVSPRRMVVAAS 240
           +GD+     LD  S  ++ L+D      E + + S FS  G   L ++AV     ++AA 
Sbjct: 181 FGDL-----LDLGSSGVVKLWDNLDFNGEGSPVASFFSKCGSYSLLSSAV-----LLAAE 240

Query: 241 EGASANVSLKLWDTRGRSRKPVVAAEWDSPS---GKIVDVYYDDVENVYLRDNGAAGIMV 291
           +  S  + +  WD R     P + AEW  P    GKI+ V   DV+ +Y+ D+    I V
Sbjct: 241 KKGSDGLEVSAWDARVGFGVPALLAEWKQPGRLLGKIIRVDVGDVDKIYVGDDVEGEITV 268

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038885552.14.6e-14786.32uncharacterized protein LOC120075889 [Benincasa hispida][more]
XP_011656511.12.5e-12475.49uncharacterized protein LOC105435751 [Cucumis sativus] >KGN45941.1 hypothetical ... [more]
XP_008445638.22.7e-12375.57PREDICTED: uncharacterized protein LOC103488596 [Cucumis melo] >KAA0036127.1 unc... [more]
XP_023550996.11.5e-10568.83uncharacterized protein LOC111808968 [Cucurbita pepo subsp. pepo][more]
XP_022971770.11.3e-10466.45uncharacterized protein LOC111470454 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0KDT71.2e-12475.49Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G028940 PE=4 SV=1[more]
A0A5A7T3F51.3e-12375.57Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A1S3BCP71.3e-12375.57uncharacterized protein LOC103488596 OS=Cucumis melo OX=3656 GN=LOC103488596 PE=... [more]
A0A6J1I6N06.1e-10566.45uncharacterized protein LOC111470454 OS=Cucurbita maxima OX=3661 GN=LOC111470454... [more]
A0A6J1CEG43.0e-10467.65uncharacterized protein LOC111010475 OS=Momordica charantia OX=3673 GN=LOC111010... [more]
Match NameE-valueIdentityDescription
AT1G68440.11.4e-2129.05unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G25400.19.6e-1829.13unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G25400.29.6e-1829.13unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 288..307
NoneNo IPR availablePANTHERPTHR36715BNAANNG41370D PROTEINcoord: 1..305
NoneNo IPR availablePANTHERPTHR36715:SF1BNAANNG41370D PROTEINcoord: 1..305

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Clc08G04840Clc08G04840gene


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
Clc08G04840.1-exonClc08G04840.1-exon-ClcChr08:14737048..14738379exon


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Clc08G04840.1-five_prime_utrClc08G04840.1-five_prime_utr-ClcChr08:14737048..14737254five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Clc08G04840.1-cdsClc08G04840.1-cds-ClcChr08:14737255..14738178CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Clc08G04840.1-three_prime_utrClc08G04840.1-three_prime_utr-ClcChr08:14738179..14738379three_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Clc08G04840.1Clc08G04840.1-proteinpolypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane