CcUC03G047560 (gene) Watermelon (PI 537277) v1

Overview
Name: CcUC03G047560
Type: gene
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: BEST Arabidopsis thaliana protein match is: glycine-rich protein.
Location: CicolChr03: 5387242 .. 5388553 (-)
RNA-Seq Expression: CcUC03G047560
Synteny: CcUC03G047560
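
The Location field combines a chromosome name, a coordinate range, and a strand in one string. The following is a minimal Python sketch for unpacking it. It assumes the coordinates are 1-based and inclusive (GFF-style), which this page does not state; `parse_location` and `Location` are hypothetical names used for illustration, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: 1-based, inclusive coordinates (GFF-style).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'CicolChr03: 5387242 .. 5388553 (-)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("CicolChr03: 5387242 .. 5388553 (-)")
print(loc.seqid, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 1312 bp on the minus strand.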
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTGGTTTAGCTTGACTTCTTTAATCAAGTTGTTAATCTTAATTAACTTTCGTAATCATAAAGATAAAACCATGACACCCCAAAAGAACAAAAGGACAGTCCTATAACACCCCCACTCACGACTATTGCGGCAAAAATGTTGCGTGGCATTCATTTTCGCTACCAAAATCCCAATTAGATAAGCGCTATGTGGTTACAAGAACTTGGGTTGCATTTGATAATGGCGTTGAAACAAATTTCCCAAATGAAGATACCCCAATTTTCTTCATCAGCTTAACATCAGATCTACAATGGCCTTCTACAATTCCCACTACGATTCTGCTCAAACAGAACCCCCAATTTCGCAATTCAGTAACGAACCCACCTTCTACAATCTCTTTGACTACCCACCTCCTTGTTATTTCGAACAGTCTTATGATTCCTGCACATCCAATTTCTATGGATTTCCCCAATTGATCGAACATGAATCCATTGACCATGGCGGTTATGGTTATCCAATTAGCTACTCGGCCAATGCTTGTTCTGCATCGAGTTTCACTTTGCCAAAAGTAATCGAATACGACCCTGATTTGTACAGAGAGGTGCCAACTCAATTTGTGATCTCTTACTCTGTTTCCGAATTCAACGAGACAGAATTTGAAGAGTACGATCCAACCCCTTACGGCGGTGGCTATGACATTTCTGAAACCTACGGTAAGCCCCTTCCACCTTCAACTGAAATTTGTTACCCACCGTCCTCTTCTTCACCGCCGAGTACTGCCGCCGCCATTCCCATCTCCACAATACCCAAGGTAGAGGAAGCACCAAAAGGAAAAATCGAAAAACAAACAAAGCCATCGAGTGAAATCAAACCGACCCAGATCGAAAAAGTTAACGACAGCTCTTCGAGTGAGAGCGACATGAATTCTGAATCTGAAGAAATTGAGGAAATTAAAGCGATTCAATTGGCAGATCTGGGAATTGGGTATGGAAATGGAAGGGAAGTGAATCAATTTCCAAGCGGGTACGGACTGGAAGCGATGGATCTTTGTGAAAGTTTATTTGGGTATTGGCCATGTCTCTCACGGATAAAAAAACAAACAGCTTGTAGGCAACCCAAGAACGGTTGTGGGCGTTGCCATGGCCATTGCTATTGCTATGGGAATTACGGCAACCAGTGGCAGACGGCGGCGGATTATCTATTTGGAAGCCATAATCCATATCCAGATGGAAATGCTATTTATGGCTATCAAAGACAGTTCCAAGGGGAGGCTGCTCATGGGTATGTTTGGTTGAATCAAAATGACTTCAACGGGTGTGAAGATGTTTGA

mRNA sequence

ATGGTGCTTAACATCAGATCTACAATGGCCTTCTACAATTCCCACTACGATTCTGCTCAAACAGAACCCCCAATTTCGCAATTCAGTAACGAACCCACCTTCTACAATCTCTTTGACTACCCACCTCCTTGTTATTTCGAACAGTCTTATGATTCCTGCACATCCAATTTCTATGGATTTCCCCAATTGATCGAACATGAATCCATTGACCATGGCGGTTATGGTTATCCAATTAGCTACTCGGCCAATGCTTGTTCTGCATCGAGTTTCACTTTGCCAAAAGTAATCGAATACGACCCTGATTTGTACAGAGAGGTGCCAACTCAATTTGTGATCTCTTACTCTGTTTCCGAATTCAACGAGACAGAATTTGAAGAGTACGATCCAACCCCTTACGGCGGTGGCTATGACATTTCTGAAACCTACGGTAAGCCCCTTCCACCTTCAACTGAAATTTGTTACCCACCGTCCTCTTCTTCACCGCCGAGTACTGCCGCCGCCATTCCCATCTCCACAATACCCAAGGTAGAGGAAGCACCAAAAGGAAAAATCGAAAAACAAACAAAGCCATCGAGTGAAATCAAACCGACCCAGATCGAAAAAGTTAACGACAGCTCTTCGAGTGAGAGCGACATGAATTCTGAATCTGAAGAAATTGAGGAAATTAAAGCGATTCAATTGGCAGATCTGGGAATTGGGTATGGAAATGGAAGGGAAGTGAATCAATTTCCAAGCGGGTACGGACTGGAAGCGATGGATCTTTGTGAAAGTTTATTTGGGTATTGGCCATGTCTCTCACGGATAAAAAAACAAACAGCTTGTAGGCAACCCAAGAACGGTTGTGGGCGTTGCCATGGCCATTGCTATTGCTATGGGAATTACGGCAACCAGTGGCAGACGGCGGCGGATTATCTATTTGGAAGCCATAATCCATATCCAGATGGAAATGCTATTTATGGCTATCAAAGACAGTTCCAAGGGGAGGCTGCTCATGGGTATGTTTGGTTGAATCAAAATGACTTCAACGGGTGTGAAGATGTTTGA

Coding sequence (CDS)

ATGGTGCTTAACATCAGATCTACAATGGCCTTCTACAATTCCCACTACGATTCTGCTCAAACAGAACCCCCAATTTCGCAATTCAGTAACGAACCCACCTTCTACAATCTCTTTGACTACCCACCTCCTTGTTATTTCGAACAGTCTTATGATTCCTGCACATCCAATTTCTATGGATTTCCCCAATTGATCGAACATGAATCCATTGACCATGGCGGTTATGGTTATCCAATTAGCTACTCGGCCAATGCTTGTTCTGCATCGAGTTTCACTTTGCCAAAAGTAATCGAATACGACCCTGATTTGTACAGAGAGGTGCCAACTCAATTTGTGATCTCTTACTCTGTTTCCGAATTCAACGAGACAGAATTTGAAGAGTACGATCCAACCCCTTACGGCGGTGGCTATGACATTTCTGAAACCTACGGTAAGCCCCTTCCACCTTCAACTGAAATTTGTTACCCACCGTCCTCTTCTTCACCGCCGAGTACTGCCGCCGCCATTCCCATCTCCACAATACCCAAGGTAGAGGAAGCACCAAAAGGAAAAATCGAAAAACAAACAAAGCCATCGAGTGAAATCAAACCGACCCAGATCGAAAAAGTTAACGACAGCTCTTCGAGTGAGAGCGACATGAATTCTGAATCTGAAGAAATTGAGGAAATTAAAGCGATTCAATTGGCAGATCTGGGAATTGGGTATGGAAATGGAAGGGAAGTGAATCAATTTCCAAGCGGGTACGGACTGGAAGCGATGGATCTTTGTGAAAGTTTATTTGGGTATTGGCCATGTCTCTCACGGATAAAAAAACAAACAGCTTGTAGGCAACCCAAGAACGGTTGTGGGCGTTGCCATGGCCATTGCTATTGCTATGGGAATTACGGCAACCAGTGGCAGACGGCGGCGGATTATCTATTTGGAAGCCATAATCCATATCCAGATGGAAATGCTATTTATGGCTATCAAAGACAGTTCCAAGGGGAGGCTGCTCATGGGTATGTTTGGTTGAATCAAAATGACTTCAACGGGTGTGAAGATGTTTGA

Protein sequence

MVLNIRSTMAFYNSHYDSAQTEPPISQFSNEPTFYNLFDYPPPCYFEQSYDSCTSNFYGFPQLIEHESIDHGGYGYPISYSANACSASSFTLPKVIEYDPDLYREVPTQFVISYSVSEFNETEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSSPPSTAAAIPISTIPKVEEAPKGKIEKQTKPSSEIKPTQIEKVNDSSSSESDMNSESEEIEEIKAIQLADLGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPKNGCGRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPDGNAIYGYQRQFQGEAAHGYVWLNQNDFNGCEDV
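
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code and uses only the first 36 and last 21 nucleotides of the CDS listed above; `translate` is an illustrative helper, not a tool used by the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


head = "ATGGTGCTTAACATCAGATCTACAATGGCCTTCTAC"  # first 12 codons of the CDS
tail = "AACGGGTGTGAAGATGTTTGA"                 # last 7 codons, including TGA
print(translate(head))
print(translate(tail))
```

The two fragments translate to the first twelve and last six residues of the protein sequence shown above.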
Homology
BLAST of CcUC03G047560 vs. NCBI nr
Match: XP_038895690.1 (uncharacterized protein LOC120083862 [Benincasa hispida])

HSP 1 Score: 591.3 bits (1523), Expect = 5.5e-165
Identity = 291/351 (82.91%), Positives = 306/351 (87.18%), Query Frame = 0

Query: 10  AFYNSHYDSAQTEPPISQFSNEPTFYNLFDYPPPCYFEQSYDS---------CTSNFYGF 69
           ++ +S+Y SAQ EPPISQ SNEPTFYNLFDYPPPCY EQ YDS           SNFY F
Sbjct: 9   SYNDSYYHSAQIEPPISQSSNEPTFYNLFDYPPPCYLEQVYDSEVGYFANAPYRSNFYEF 68

Query: 70  PQLIEHESIDHGGYGYPISYSANACSASSFTLPKVIEYDPDLYREVPTQFVISYSVSEFN 129
           PQLIE E+++HG YGY ISYSANACSA SFT+PKVIEYDPD Y EV TQFVISYSVSEFN
Sbjct: 69  PQLIERETVNHGAYGYAISYSANACSAPSFTVPKVIEYDPDFYSEVSTQFVISYSVSEFN 128

Query: 130 ETEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSSPPSTAAAIPISTIPKVEEAP 189
           ETEFEEYDPTPYGGGYDISETYGKPL PSTEICYPPSSSSPP TA AIPI TIPK EE P
Sbjct: 129 ETEFEEYDPTPYGGGYDISETYGKPLQPSTEICYPPSSSSPP-TATAIPIFTIPKEEEPP 188

Query: 190 KGKIEKQTKPSSEIKPTQIEKVNDSSSSESDMNSESEEIEEIKAIQLADLGIGYGNGREV 249
           KGKIE+QTKPSSEIKPTQIEKVN SSSSESD  SESEEIEE+KAIQLAD GI YGNGRE 
Sbjct: 189 KGKIEEQTKPSSEIKPTQIEKVNHSSSSESDTASESEEIEEVKAIQLADPGIEYGNGREA 248

Query: 250 NQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPKNGCGRCHGHCYCYGNYGNQWQT 309
           NQFPSGYGLEAMDLCESLFGYWPCLSR+KKQT CRQPKNGCGRCHGHCYCYGNYGNQWQT
Sbjct: 249 NQFPSGYGLEAMDLCESLFGYWPCLSRVKKQTPCRQPKNGCGRCHGHCYCYGNYGNQWQT 308

Query: 310 AADYLFGSHNPYPD----GNAIYGYQRQFQGEAAHGYVWLNQNDFNGCEDV 348
           AA+YLFGSHNPYPD    G+A+YGYQRQ QGE  +GYVWLNQNDFNGCEDV
Sbjct: 309 AAEYLFGSHNPYPDGRGEGDAVYGYQRQIQGEPVYGYVWLNQNDFNGCEDV 358
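
The percentages on each HSP statistics line are simple ratios: matched columns divided by the alignment length, where the alignment length includes gap columns. A minimal Python sketch that recomputes them from the counts for the first hit is shown here; `parse_hsp_stats` is a hypothetical helper name, and the regular expression assumes the `name = hits/total` layout used on this page.

```python
import re


def parse_hsp_stats(line: str) -> dict:
    """Return (hits, total, recomputed percent) for each 'name = hits/total' field."""
    stats = {}
    for name, hits, total in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        hits, total = int(hits), int(total)
        stats[name] = (hits, total, round(100 * hits / total, 2))
    return stats


line = "Identity = 291/351 (82.91%), Positives = 306/351 (87.18%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats)
```

The recomputed values agree with the percentages printed in the report for this hit.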

BLAST of CcUC03G047560 vs. NCBI nr
Match: XP_008441695.1 (PREDICTED: uncharacterized protein LOC103485767 [Cucumis melo])

HSP 1 Score: 576.2 bits (1484), Expect = 1.8e-160
Identity = 289/370 (78.11%), Positives = 307/370 (82.97%), Query Frame = 0

Query: 3   LNIRSTMAFY-------NSHYDSAQTEPPISQFSNEPTFYNLFDYPPPCYFEQSYDS--- 62
           LNIRS MAFY       +S+Y+SAQ EPPI Q SNEPTFYNLFDYPPPCYF Q+YDS   
Sbjct: 11  LNIRSPMAFYDSYDFYDDSYYNSAQIEPPILQSSNEPTFYNLFDYPPPCYFGQAYDSEVG 70

Query: 63  -------CTSNFYGFPQLIEHESIDHGGYGYPISYSANACSASSFTLPKVIEYDPDLYRE 122
                    SNF  FPQLIEHE +DHG YGY I YSANACSASSFTLPKV  YDPDLY E
Sbjct: 71  YFAINAAYGSNFSEFPQLIEHEPVDHGDYGYAIRYSANACSASSFTLPKVFGYDPDLYSE 130

Query: 123 VPTQFVISYSVSEFNETEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSS----P 182
           V TQFVISYSVSEFNET+FEEYDPTPY GGYDI ETYGKPL PSTEICYPPSSSS    P
Sbjct: 131 VSTQFVISYSVSEFNETDFEEYDPTPYDGGYDIYETYGKPLQPSTEICYPPSSSSPSKPP 190

Query: 183 PSTAAAIPISTIPKVEEAPKGKIEKQTKPSSEIKPTQIEKVNDSSSSESDMNSESEEIEE 242
           P TA AIPI+TIPK++EAPKGKIE+QTKPSSEIKP QIEK N+SSSS+SD  SES EIEE
Sbjct: 191 PPTATAIPITTIPKIDEAPKGKIEEQTKPSSEIKPIQIEKTNNSSSSDSDTTSESGEIEE 250

Query: 243 IKAIQLADLGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPKNGC 302
           +KAIQL D GIGYGNGREVN+FPSGYGLEAMDLCESLFGYWPCLSR K+QT CRQPKNGC
Sbjct: 251 VKAIQLGDPGIGYGNGREVNEFPSGYGLEAMDLCESLFGYWPCLSRAKRQTLCRQPKNGC 310

Query: 303 GRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPD----GNAIYGYQRQFQGEAAHGYVWLN 348
           GRCHGHCYCYGNYGNQWQTAA+YLFGSHNPY D    G+  YGYQRQFQ E  +GYVWLN
Sbjct: 311 GRCHGHCYCYGNYGNQWQTAAEYLFGSHNPYLDGRGEGDGFYGYQRQFQEEPVYGYVWLN 370

BLAST of CcUC03G047560 vs. NCBI nr
Match: KAA0056916.1 (uncharacterized protein E6C27_scaffold96G00880 [Cucumis melo var. makuwa] >TYK26343.1 uncharacterized protein E5676_scaffold861G00010 [Cucumis melo var. makuwa])

HSP 1 Score: 563.9 bits (1452), Expect = 9.5e-157
Identity = 282/364 (77.47%), Positives = 301/364 (82.69%), Query Frame = 0

Query: 9   MAFY-------NSHYDSAQTEPPISQFSNEPTFYNLFDYPPPCYFEQSYDS--------- 68
           MAFY       +S+Y+SAQ EPPI Q SNEPTFYNLFDYPPPCYF Q+YDS         
Sbjct: 1   MAFYDSYDFYDDSYYNSAQIEPPILQSSNEPTFYNLFDYPPPCYFGQAYDSEVGYFAINA 60

Query: 69  -CTSNFYGFPQLIEHESIDHGGYGYPISYSANACSASSFTLPKVIEYDPDLYREVPTQFV 128
              SNF  FPQLIEHE +DHG YGY I YSANACSASSFTLPKV  YDPDLY EV TQFV
Sbjct: 61  AYGSNFSEFPQLIEHEPVDHGDYGYAIRYSANACSASSFTLPKVFGYDPDLYSEVSTQFV 120

Query: 129 ISYSVSEFNETEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSS----PPSTAAA 188
           ISYSVSEFNET+FEEYDPTPY GGYDI ETYGKPL PSTEICYPPSSSS    PP TA A
Sbjct: 121 ISYSVSEFNETDFEEYDPTPYDGGYDIYETYGKPLQPSTEICYPPSSSSPSKPPPPTATA 180

Query: 189 IPISTIPKVEEAPKGKIEKQTKPSSEIKPTQIEKVNDSSSSESDMNSESEEIEEIKAIQL 248
           IPI+TIPK++EAPKGKIE+QTKPSSEIKP QIEK N+S SS+SD  SES EIEE+KAIQL
Sbjct: 181 IPITTIPKIDEAPKGKIEEQTKPSSEIKPIQIEKTNNSYSSDSDTTSESGEIEEVKAIQL 240

Query: 249 ADLGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPKNGCGRCHGH 308
            D GIGYGNGREVN+FPSGYGLEAMDLCESLFGYWPCLSR K+QT CRQPKNGCGRCHGH
Sbjct: 241 GDPGIGYGNGREVNEFPSGYGLEAMDLCESLFGYWPCLSRAKRQTLCRQPKNGCGRCHGH 300

Query: 309 CYCYGNYGNQWQTAADYLFGSHNPYPD----GNAIYGYQRQFQGEAAHGYVWLNQNDFNG 348
           CYCYGNYGNQWQTAA+YLFGSHNPY D    G+  YGYQR+FQ E  +GYVWLNQNDFN 
Sbjct: 301 CYCYGNYGNQWQTAAEYLFGSHNPYLDGRGEGDGFYGYQRRFQEEPVYGYVWLNQNDFNR 360

BLAST of CcUC03G047560 vs. NCBI nr
Match: XP_011652905.1 (uncharacterized protein At5g39570 [Cucumis sativus] >KGN64592.1 hypothetical protein Csa_013087 [Cucumis sativus])

HSP 1 Score: 543.5 bits (1399), Expect = 1.3e-150
Identity = 276/369 (74.80%), Positives = 297/369 (80.49%), Query Frame = 0

Query: 9   MAFYN-------SHYDSAQTEPPISQFSNEPTFYNLFDYPPPCYFEQSYD---------- 68
           MAFYN       S+Y+ AQ EPPI Q SNEP FYNLFDYPPPCYF Q+YD          
Sbjct: 1   MAFYNSYDFYDDSYYNYAQIEPPIPQSSNEPNFYNLFDYPPPCYFGQAYDYEVGYSANDA 60

Query: 69  SCTSNFYGFPQLIEHESIDHGGYGYPISYSANACSASSFTLPKVIEYDPDLYREVPTQFV 128
              SNF   PQLI+HE +DHG YGY I YSANACSASSFTLPK+ EY+PDLY EV TQFV
Sbjct: 61  PYRSNFNELPQLIDHEPVDHGDYGYAIRYSANACSASSFTLPKLCEYNPDLYSEVSTQFV 120

Query: 129 ISYSVSEFNETEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSS--------PPS 188
           ISYSVS+FNETEFEEYDPTPY GGYDISETYGKPL PS EICYPPSSSS        PP 
Sbjct: 121 ISYSVSQFNETEFEEYDPTPYDGGYDISETYGKPLQPSIEICYPPSSSSPSKSPPPPPPP 180

Query: 189 TAAAIP-ISTIPKVEEAPKGKIEKQTKPSSEIKPTQIEKVNDSSSSESDMNSESEEIEEI 248
           TA AIP I+TIPK++EAPKGKIE+QTKPSSEIKPTQIEK N+SSSS+SD  SES EIEE 
Sbjct: 181 TATAIPIITTIPKIDEAPKGKIEEQTKPSSEIKPTQIEKTNNSSSSDSDTTSESGEIEED 240

Query: 249 KAIQLADLGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPKNGCG 308
           KAIQL D GIGYGN REVN+FPSG GLEAMDLCESLFGYWPCLSR K+QTA RQPKNGCG
Sbjct: 241 KAIQLGDPGIGYGNAREVNEFPSGCGLEAMDLCESLFGYWPCLSRAKRQTAYRQPKNGCG 300

Query: 309 RCHGHCYCYGNYGNQWQTAADYLFGSHNPYPD----GNAIYGYQRQFQGEAAHGYVWLNQ 348
           RCHGHCYCYGNYGN+WQTAA+YLFGSHNPY D    G+ +YGYQRQFQ E  +GYVWLNQ
Sbjct: 301 RCHGHCYCYGNYGNEWQTAAEYLFGSHNPYLDGRREGDVVYGYQRQFQEEPVYGYVWLNQ 360

BLAST of CcUC03G047560 vs. NCBI nr
Match: XP_023001286.1 (uncharacterized protein LOC111495462 [Cucurbita maxima])

HSP 1 Score: 530.4 bits (1365), Expect = 1.2e-146
Identity = 268/351 (76.35%), Positives = 290/351 (82.62%), Query Frame = 0

Query: 9   MAF---YNSHYDSAQTEPPISQFSNEPTFYNLFDYPPPCYFEQSYDSCTSNFYGFPQLIE 68
           MAF   Y+S+YDSAQTEPPI Q S EPTFYNLFDYPPPCYF Q+Y   TSNF  FPQLIE
Sbjct: 1   MAFYDSYDSYYDSAQTEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIE 60

Query: 69  HESIDHGGYGYPISYSANACSASSFTLPKVIEYDPDLY----REVPTQFVISYSVSEFNE 128
           H+ +DHG YGY ISYSANACSAS+F++PKVIEYD DLY    ++V +QFVISYSVSEFNE
Sbjct: 61  HQPVDHGAYGYTISYSANACSASTFSVPKVIEYDSDLYSDGTQKVSSQFVISYSVSEFNE 120

Query: 129 TEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSSPPS-TAAAIPISTIPKVEEAP 188
           TEFEEYDPTPYGGGYDI ETYGKPL PST+ICY PSSSSPP     AIPIS I    EAP
Sbjct: 121 TEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIPISAI---HEAP 180

Query: 189 KGKIEKQTKPSSEIKPTQIEKVNDSSSSESDMNSESEEIEEIKAIQLADLGIGYGNGREV 248
           K KIE++T+PSSEIKPTQIEK N +        SESEEIEE+KAI  AD GIGYGNGREV
Sbjct: 181 KEKIEEKTEPSSEIKPTQIEKDNTA--------SESEEIEEVKAIPFADPGIGYGNGREV 240

Query: 249 NQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPKNGCGRCHGHCYCYGNYGNQWQT 308
           NQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQP NGCGRCHGHCYCYGNYGNQWQT
Sbjct: 241 NQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPNNGCGRCHGHCYCYGNYGNQWQT 300

Query: 309 AADYLFGSHNPYPD----GNAIYGYQRQFQGEAAHGYVWLNQNDFNGCEDV 348
           AADYLFGSHNPYPD    G+ +YGYQRQ+Q E  + YVWLNQNDF   +DV
Sbjct: 301 AADYLFGSHNPYPDGRSEGDGVYGYQRQYQAEPVYRYVWLNQNDFVRSDDV 340

BLAST of CcUC03G047560 vs. ExPASy Swiss-Prot
Match: Q9FKA5 (Uncharacterized protein At5g39570 OS=Arabidopsis thaliana OX=3702 GN=At5g39570 PE=1 SV=1)

HSP 1 Score: 50.1 bits (118), Expect = 6.0e-05
Identity = 50/157 (31.85%), Positives = 67/157 (42.68%), Query Frame = 0

Query: 114 YSVSEFNETEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSSPPSTAAAIPISTI 173
           Y+  + +  +F+E+DPTPY GGYDI+  YG+P+PPS E CYP SS          P  T 
Sbjct: 4   YTRDDNDVDDFDEFDPTPYSGGYDITVIYGRPIPPSDETCYPLSSGVDDDFEYERPEFT- 63

Query: 174 PKVEEAPKGKIEKQTKPSSEIKP---------------TQIEKVNDSSSSESDMNSESEE 233
              E +  G     T+ SS  +P                Q E+ N    SES    + E 
Sbjct: 64  QIHEPSAYGDEALNTEYSSYSRPKPRPAFRPDSGGGGHVQGERPNPGYGSESGYGRKPE- 123

Query: 234 IEEIKAIQLADLGIGYGNGREV-------NQFPSGYG 249
                    ++ G GYG   EV         + SGYG
Sbjct: 124 ---------SEYGSGYGGQTEVEYGRRPEQSYGSGYG 149

BLAST of CcUC03G047560 vs. ExPASy TrEMBL
Match: A0A1S3B404 (uncharacterized protein LOC103485767 OS=Cucumis melo OX=3656 GN=LOC103485767 PE=4 SV=1)

HSP 1 Score: 576.2 bits (1484), Expect = 8.9e-161
Identity = 289/370 (78.11%), Positives = 307/370 (82.97%), Query Frame = 0

Query: 3   LNIRSTMAFY-------NSHYDSAQTEPPISQFSNEPTFYNLFDYPPPCYFEQSYDS--- 62
           LNIRS MAFY       +S+Y+SAQ EPPI Q SNEPTFYNLFDYPPPCYF Q+YDS   
Sbjct: 11  LNIRSPMAFYDSYDFYDDSYYNSAQIEPPILQSSNEPTFYNLFDYPPPCYFGQAYDSEVG 70

Query: 63  -------CTSNFYGFPQLIEHESIDHGGYGYPISYSANACSASSFTLPKVIEYDPDLYRE 122
                    SNF  FPQLIEHE +DHG YGY I YSANACSASSFTLPKV  YDPDLY E
Sbjct: 71  YFAINAAYGSNFSEFPQLIEHEPVDHGDYGYAIRYSANACSASSFTLPKVFGYDPDLYSE 130

Query: 123 VPTQFVISYSVSEFNETEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSS----P 182
           V TQFVISYSVSEFNET+FEEYDPTPY GGYDI ETYGKPL PSTEICYPPSSSS    P
Sbjct: 131 VSTQFVISYSVSEFNETDFEEYDPTPYDGGYDIYETYGKPLQPSTEICYPPSSSSPSKPP 190

Query: 183 PSTAAAIPISTIPKVEEAPKGKIEKQTKPSSEIKPTQIEKVNDSSSSESDMNSESEEIEE 242
           P TA AIPI+TIPK++EAPKGKIE+QTKPSSEIKP QIEK N+SSSS+SD  SES EIEE
Sbjct: 191 PPTATAIPITTIPKIDEAPKGKIEEQTKPSSEIKPIQIEKTNNSSSSDSDTTSESGEIEE 250

Query: 243 IKAIQLADLGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPKNGC 302
           +KAIQL D GIGYGNGREVN+FPSGYGLEAMDLCESLFGYWPCLSR K+QT CRQPKNGC
Sbjct: 251 VKAIQLGDPGIGYGNGREVNEFPSGYGLEAMDLCESLFGYWPCLSRAKRQTLCRQPKNGC 310

Query: 303 GRCHGHCYCYGNYGNQWQTAADYLFGSHNPYPD----GNAIYGYQRQFQGEAAHGYVWLN 348
           GRCHGHCYCYGNYGNQWQTAA+YLFGSHNPY D    G+  YGYQRQFQ E  +GYVWLN
Sbjct: 311 GRCHGHCYCYGNYGNQWQTAAEYLFGSHNPYLDGRGEGDGFYGYQRQFQEEPVYGYVWLN 370

BLAST of CcUC03G047560 vs. ExPASy TrEMBL
Match: A0A5D3DRV2 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold861G00010 PE=4 SV=1)

HSP 1 Score: 563.9 bits (1452), Expect = 4.6e-157
Identity = 282/364 (77.47%), Positives = 301/364 (82.69%), Query Frame = 0

Query: 9   MAFY-------NSHYDSAQTEPPISQFSNEPTFYNLFDYPPPCYFEQSYDS--------- 68
           MAFY       +S+Y+SAQ EPPI Q SNEPTFYNLFDYPPPCYF Q+YDS         
Sbjct: 1   MAFYDSYDFYDDSYYNSAQIEPPILQSSNEPTFYNLFDYPPPCYFGQAYDSEVGYFAINA 60

Query: 69  -CTSNFYGFPQLIEHESIDHGGYGYPISYSANACSASSFTLPKVIEYDPDLYREVPTQFV 128
              SNF  FPQLIEHE +DHG YGY I YSANACSASSFTLPKV  YDPDLY EV TQFV
Sbjct: 61  AYGSNFSEFPQLIEHEPVDHGDYGYAIRYSANACSASSFTLPKVFGYDPDLYSEVSTQFV 120

Query: 129 ISYSVSEFNETEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSS----PPSTAAA 188
           ISYSVSEFNET+FEEYDPTPY GGYDI ETYGKPL PSTEICYPPSSSS    PP TA A
Sbjct: 121 ISYSVSEFNETDFEEYDPTPYDGGYDIYETYGKPLQPSTEICYPPSSSSPSKPPPPTATA 180

Query: 189 IPISTIPKVEEAPKGKIEKQTKPSSEIKPTQIEKVNDSSSSESDMNSESEEIEEIKAIQL 248
           IPI+TIPK++EAPKGKIE+QTKPSSEIKP QIEK N+S SS+SD  SES EIEE+KAIQL
Sbjct: 181 IPITTIPKIDEAPKGKIEEQTKPSSEIKPIQIEKTNNSYSSDSDTTSESGEIEEVKAIQL 240

Query: 249 ADLGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPKNGCGRCHGH 308
            D GIGYGNGREVN+FPSGYGLEAMDLCESLFGYWPCLSR K+QT CRQPKNGCGRCHGH
Sbjct: 241 GDPGIGYGNGREVNEFPSGYGLEAMDLCESLFGYWPCLSRAKRQTLCRQPKNGCGRCHGH 300

Query: 309 CYCYGNYGNQWQTAADYLFGSHNPYPD----GNAIYGYQRQFQGEAAHGYVWLNQNDFNG 348
           CYCYGNYGNQWQTAA+YLFGSHNPY D    G+  YGYQR+FQ E  +GYVWLNQNDFN 
Sbjct: 301 CYCYGNYGNQWQTAAEYLFGSHNPYLDGRGEGDGFYGYQRRFQEEPVYGYVWLNQNDFNR 360

BLAST of CcUC03G047560 vs. ExPASy TrEMBL
Match: A0A0A0LUY1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G070580 PE=4 SV=1)

HSP 1 Score: 543.5 bits (1399), Expect = 6.4e-151
Identity = 276/369 (74.80%), Positives = 297/369 (80.49%), Query Frame = 0

Query: 9   MAFYN-------SHYDSAQTEPPISQFSNEPTFYNLFDYPPPCYFEQSYD---------- 68
           MAFYN       S+Y+ AQ EPPI Q SNEP FYNLFDYPPPCYF Q+YD          
Sbjct: 1   MAFYNSYDFYDDSYYNYAQIEPPIPQSSNEPNFYNLFDYPPPCYFGQAYDYEVGYSANDA 60

Query: 69  SCTSNFYGFPQLIEHESIDHGGYGYPISYSANACSASSFTLPKVIEYDPDLYREVPTQFV 128
              SNF   PQLI+HE +DHG YGY I YSANACSASSFTLPK+ EY+PDLY EV TQFV
Sbjct: 61  PYRSNFNELPQLIDHEPVDHGDYGYAIRYSANACSASSFTLPKLCEYNPDLYSEVSTQFV 120

Query: 129 ISYSVSEFNETEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSS--------PPS 188
           ISYSVS+FNETEFEEYDPTPY GGYDISETYGKPL PS EICYPPSSSS        PP 
Sbjct: 121 ISYSVSQFNETEFEEYDPTPYDGGYDISETYGKPLQPSIEICYPPSSSSPSKSPPPPPPP 180

Query: 189 TAAAIP-ISTIPKVEEAPKGKIEKQTKPSSEIKPTQIEKVNDSSSSESDMNSESEEIEEI 248
           TA AIP I+TIPK++EAPKGKIE+QTKPSSEIKPTQIEK N+SSSS+SD  SES EIEE 
Sbjct: 181 TATAIPIITTIPKIDEAPKGKIEEQTKPSSEIKPTQIEKTNNSSSSDSDTTSESGEIEED 240

Query: 249 KAIQLADLGIGYGNGREVNQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPKNGCG 308
           KAIQL D GIGYGN REVN+FPSG GLEAMDLCESLFGYWPCLSR K+QTA RQPKNGCG
Sbjct: 241 KAIQLGDPGIGYGNAREVNEFPSGCGLEAMDLCESLFGYWPCLSRAKRQTAYRQPKNGCG 300

Query: 309 RCHGHCYCYGNYGNQWQTAADYLFGSHNPYPD----GNAIYGYQRQFQGEAAHGYVWLNQ 348
           RCHGHCYCYGNYGN+WQTAA+YLFGSHNPY D    G+ +YGYQRQFQ E  +GYVWLNQ
Sbjct: 301 RCHGHCYCYGNYGNEWQTAAEYLFGSHNPYLDGRREGDVVYGYQRQFQEEPVYGYVWLNQ 360

BLAST of CcUC03G047560 vs. ExPASy TrEMBL
Match: A0A6J1KI70 (uncharacterized protein LOC111495462 OS=Cucurbita maxima OX=3661 GN=LOC111495462 PE=4 SV=1)

HSP 1 Score: 530.4 bits (1365), Expect = 5.6e-147
Identity = 268/351 (76.35%), Positives = 290/351 (82.62%), Query Frame = 0

Query: 9   MAF---YNSHYDSAQTEPPISQFSNEPTFYNLFDYPPPCYFEQSYDSCTSNFYGFPQLIE 68
           MAF   Y+S+YDSAQTEPPI Q S EPTFYNLFDYPPPCYF Q+Y   TSNF  FPQLIE
Sbjct: 1   MAFYDSYDSYYDSAQTEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSNFNEFPQLIE 60

Query: 69  HESIDHGGYGYPISYSANACSASSFTLPKVIEYDPDLY----REVPTQFVISYSVSEFNE 128
           H+ +DHG YGY ISYSANACSAS+F++PKVIEYD DLY    ++V +QFVISYSVSEFNE
Sbjct: 61  HQPVDHGAYGYTISYSANACSASTFSVPKVIEYDSDLYSDGTQKVSSQFVISYSVSEFNE 120

Query: 129 TEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSSPPS-TAAAIPISTIPKVEEAP 188
           TEFEEYDPTPYGGGYDI ETYGKPL PST+ICY PSSSSPP     AIPIS I    EAP
Sbjct: 121 TEFEEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPKPPPTAIPISAI---HEAP 180

Query: 189 KGKIEKQTKPSSEIKPTQIEKVNDSSSSESDMNSESEEIEEIKAIQLADLGIGYGNGREV 248
           K KIE++T+PSSEIKPTQIEK N +        SESEEIEE+KAI  AD GIGYGNGREV
Sbjct: 181 KEKIEEKTEPSSEIKPTQIEKDNTA--------SESEEIEEVKAIPFADPGIGYGNGREV 240

Query: 249 NQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPKNGCGRCHGHCYCYGNYGNQWQT 308
           NQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQP NGCGRCHGHCYCYGNYGNQWQT
Sbjct: 241 NQFPSGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPNNGCGRCHGHCYCYGNYGNQWQT 300

Query: 309 AADYLFGSHNPYPD----GNAIYGYQRQFQGEAAHGYVWLNQNDFNGCEDV 348
           AADYLFGSHNPYPD    G+ +YGYQRQ+Q E  + YVWLNQNDF   +DV
Sbjct: 301 AADYLFGSHNPYPDGRSEGDGVYGYQRQYQAEPVYRYVWLNQNDFVRSDDV 340

BLAST of CcUC03G047560 vs. ExPASy TrEMBL
Match: A0A6J1EHF5 (uncharacterized protein LOC111434325 OS=Cucurbita moschata OX=3662 GN=LOC111434325 PE=4 SV=1)

HSP 1 Score: 518.8 bits (1335), Expect = 1.7e-143
Identity = 259/346 (74.86%), Positives = 284/346 (82.08%), Query Frame = 0

Query: 9   MAFYNSHYDSAQTEPPISQFSNEPTFYNLFDYPPPCYFEQSYDSCTSNFYGFPQLIEHES 68
           MAFY+S+YDSAQ EPPI Q S EPTFYNLFDYPPPCYF Q+Y   TS+   FPQLIE++ 
Sbjct: 1   MAFYDSYYDSAQIEPPIPQSSYEPTFYNLFDYPPPCYFGQAYAPYTSSSNEFPQLIEYQP 60

Query: 69  IDHGGYGYPISYSANACSASSFTLPKVIEYDPDL----YREVPTQFVISYSVSEFNETEF 128
           +DHG YGY ISYSANACSAS+F++PKVIEYDPD     Y++V +QFVISYSVSEFNETEF
Sbjct: 61  VDHGAYGYTISYSANACSASTFSVPKVIEYDPDFYSDGYQKVSSQFVISYSVSEFNETEF 120

Query: 129 EEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSSPPSTAAAIPISTIPKVEEAPKGKI 188
           EEYDPTPYGGGYDI ETYGKPL PST+ICY PSSSSPP      P  T   ++EAPK KI
Sbjct: 121 EEYDPTPYGGGYDIHETYGKPLQPSTDICYSPSSSSPPK-----PPPT--AIQEAPKEKI 180

Query: 189 EKQTKPSSEIKPTQIEKVNDSSSSESDMNSESEEIEEIKAIQLADLGIGYGNGREVNQFP 248
           E++TKPSSEIKPTQIEK N +        SESEEIEE+KAI  AD GIGYGNGREVNQFP
Sbjct: 181 EEKTKPSSEIKPTQIEKDNTA--------SESEEIEEVKAIPFADPGIGYGNGREVNQFP 240

Query: 249 SGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPKNGCGRCHGHCYCYGNYGNQWQTAADY 308
           SGYGLEAMDLCESLFGYWPCLSRIKKQTACRQP NGCGRCHGHCYCYGNYGNQWQTAADY
Sbjct: 241 SGYGLEAMDLCESLFGYWPCLSRIKKQTACRQPNNGCGRCHGHCYCYGNYGNQWQTAADY 300

Query: 309 LFGSHNPYPD----GNAIYGYQRQFQGEAAHGYVWLNQNDFNGCED 347
           LFGSHNPYPD    G+ +YGYQ Q+Q E  +GYVWLNQND    +D
Sbjct: 301 LFGSHNPYPDGRSEGDGVYGYQTQYQTEPVYGYVWLNQNDLVRSDD 331

BLAST of CcUC03G047560 vs. TAIR 10
Match: AT1G11440.1 (BEST Arabidopsis thaliana protein match is: glycine-rich protein (TAIR:AT3G29075.1); Has 19337 Blast hits to 8589 proteins in 488 species: Archae - 26; Bacteria - 641; Metazoa - 7852; Fungi - 2167; Plants - 955; Viruses - 616; Other Eukaryotes - 7080 (source: NCBI BLink). )

HSP 1 Score: 146.0 bits (367), Expect = 5.7e-35
Identity = 115/329 (34.95%), Positives = 158/329 (48.02%), Query Frame = 0

Query: 36  NLFDYPPPCYFEQ----SYDSCTSNFYGFPQL-IEHESIDHGGYGYPISYSAN------- 95
           NL+D     Y +Q     ++  + N+Y + +   E E + + GY  P+SY+         
Sbjct: 19  NLYDQNHYHYNQQQQQLGFEPMSYNYYNWNESESESEYVAYSGYDDPMSYNCYNWNGSES 78

Query: 96  -------ACSASSFTLPKVIEYDPDLYR--EVPTQFVISYSVS---EFNETEFEEYDPTP 155
                  A S S+ + PK + YDP+LY   E P QF I  SV+   +FNE EF+EYDPTP
Sbjct: 79  ETTSAYVAYSVSTMSEPKHLFYDPNLYTTYESPPQFSIYCSVASALDFNEPEFDEYDPTP 138

Query: 156 YGGGYDISETYGKPLPPSTEICYPPSSS------SPPSTAAAIPISTIPKVEEAPKGKIE 215
           YGGGYD+  TYGKPLPPS E CYP S++      SPP   A +P+      ++    K  
Sbjct: 139 YGGGYDVVATYGKPLPPSVETCYPCSTAPHAKAPSPPEIIAPVPLGIYDGGQKNVVKKRV 198

Query: 216 KQTKPSSEIKPTQIEKVNDSSSSE------------SDMNSESEEIEEIKAIQLADLGIG 275
              +P  E+KP +  K  +    E             D + E EE +E    +  D    
Sbjct: 199 SFAEPVEEVKPIETIKEQEQEQDEDYDEESEDEDDGDDDDEEEEEGDEEAKEEEKDHSSS 258

Query: 276 YGNGR-------EVNQF--PSGYGLEAMDLCESLF-GYWPCLSRIKKQTACRQPKNGCGR 313
           YGN         EV     PSGYGLEA DLCE +F GY+PC+ R K++    Q +     
Sbjct: 259 YGNEEYEVVDKGEVKALYVPSGYGLEATDLCEVIFGGYFPCVLRNKRRQEDEQDRGAAVS 318

BLAST of CcUC03G047560 vs. TAIR 10
Match: AT5G39570.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cytosol, nucleus; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 14 growth stages; BEST Arabidopsis thaliana protein match is: glycine-rich protein (TAIR:AT3G29075.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 50.1 bits (118), Expect = 4.3e-06
Identity = 50/157 (31.85%), Positives = 67/157 (42.68%), Query Frame = 0

Query: 114 YSVSEFNETEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSSPPSTAAAIPISTI 173
           Y+  + +  +F+E+DPTPY GGYDI+  YG+P+PPS E CYP SS          P  T 
Sbjct: 4   YTRDDNDVDDFDEFDPTPYSGGYDITVIYGRPIPPSDETCYPLSSGVDDDFEYERPEFT- 63

Query: 174 PKVEEAPKGKIEKQTKPSSEIKP---------------TQIEKVNDSSSSESDMNSESEE 233
              E +  G     T+ SS  +P                Q E+ N    SES    + E 
Sbjct: 64  QIHEPSAYGDEALNTEYSSYSRPKPRPAFRPDSGGGGHVQGERPNPGYGSESGYGRKPE- 123

Query: 234 IEEIKAIQLADLGIGYGNGREV-------NQFPSGYG 249
                    ++ G GYG   EV         + SGYG
Sbjct: 124 ---------SEYGSGYGGQTEVEYGRRPEQSYGSGYG 149

BLAST of CcUC03G047560 vs. TAIR 10
Match: AT3G29075.1 (glycine-rich protein )

HSP 1 Score: 48.9 bits (115), Expect = 9.5e-06
Identity = 43/135 (31.85%), Positives = 64/135 (47.41%), Query Frame = 0

Query: 114 YSVSEFNETEFEEYDPTPYGGGYDISETYGKPLPPSTEICYPPSSSSPPSTAAAIPISTI 173
           Y+  + +  +F EYDP PY GGYDI+ TYG+ +PPS E CYP SS S  +     P  + 
Sbjct: 4   YTNDDNDVDDFTEYDPMPYSGGYDITVTYGRSIPPSDETCYPLSSLSGDAFEYQRPNFSS 63

Query: 174 PKVEEAPKGKIEKQTKPSSEIKPTQIEKVNDSSSSESDMNSESEEIEEIKAIQLADLGIG 233
                A   +  K T+ SS  +P  +   +D     +       E+E  +  + ++ G G
Sbjct: 64  NHDSSAYDDQALK-TEYSSYARPGPVGSGSDFGRKPNSGYGGRTEVEYGRKTE-SEHGSG 123

Query: 234 YGNGREVNQFPSGYG 249
           YG   E +     YG
Sbjct: 124 YGGRIESDYVKPSYG 136

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value   Identity  Description
XP_038895690.1  5.5e-165  82.91     uncharacterized protein LOC120083862 [Benincasa hispida]
XP_008441695.1  1.8e-160  78.11     PREDICTED: uncharacterized protein LOC103485767 [Cucumis melo]
KAA0056916.1    9.5e-157  77.47     uncharacterized protein E6C27_scaffold96G00880 [Cucumis melo var. makuwa] >TYK26...
XP_011652905.1  1.3e-150  74.80     uncharacterized protein At5g39570 [Cucumis sativus] >KGN64592.1 hypothetical pro...
XP_023001286.1  1.2e-146  76.35     uncharacterized protein LOC111495462 [Cucurbita maxima]

ExPASy Swiss-Prot
Match Name      E-value   Identity  Description
Q9FKA5          6.0e-05   31.85     Uncharacterized protein At5g39570 OS=Arabidopsis thaliana OX=3702 GN=At5g39570 P...

ExPASy TrEMBL
Match Name      E-value   Identity  Description
A0A1S3B404      8.9e-161  78.11     uncharacterized protein LOC103485767 OS=Cucumis melo OX=3656 GN=LOC103485767 PE=...
A0A5D3DRV2      4.6e-157  77.47     Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A0A0LUY1      6.4e-151  74.80     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G070580 PE=4 SV=1
A0A6J1KI70      5.6e-147  76.35     uncharacterized protein LOC111495462 OS=Cucurbita maxima OX=3661 GN=LOC111495462...
A0A6J1EHF5      1.7e-143  74.86     uncharacterized protein LOC111434325 OS=Cucurbita moschata OX=3662 GN=LOC1114343...

TAIR 10
Match Name      E-value   Identity  Description
AT1G11440.1     5.7e-35   34.95     BEST Arabidopsis thaliana protein match is: glycine-rich protein (TAIR:AT3G29075...
AT5G39570.1     4.3e-06   31.85     FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
AT3G29075.1     9.5e-06   31.85     glycine-rich protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (PI 537277) v1
Date Performed: 2022-01-31
IPR Term   IPR Description              Source       Source Term    Source Description    Alignment
None       No IPR available             MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 177..216
None       No IPR available             MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 191..210
None       No IPR available             PANTHER      PTHR33971:SF3  OS02G0743600 PROTEIN  coord: 9..342
IPR038943  PLD-regulated protein1-like  PANTHER      PTHR33971      OS06G0232000 PROTEIN  coord: 9..342
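
The `coord` ranges give residue positions on the protein, so each annotation's span and coverage follow directly from them. The sketch below is illustrative only: it assumes inclusive 1-based residue coordinates and a protein length of 348 residues, a value taken from the last query coordinate in the BLAST alignments rather than from a stated field; `coverage` is a hypothetical helper.

```python
PROTEIN_LENGTH = 348  # assumption: last query coordinate in the BLAST alignments


def coverage(start: int, end: int, protein_length: int = PROTEIN_LENGTH):
    """Return (span in residues, percent of protein covered) for an inclusive range."""
    span = end - start + 1
    return span, round(100 * span / protein_length, 2)


features = {
    "mobidb-lite (disorder_prediction)": (177, 216),
    "PTHR33971 (PANTHER)": (9, 342),
}
for name, (start, end) in features.items():
    span, pct = coverage(start, end)
    print(f"{name}: {span} residues, {pct}% of the protein")
```

Under these assumptions the PANTHER family match covers nearly the whole protein, while the longer predicted disordered region covers 40 residues.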

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CcUC03G047560.1  CcUC03G047560.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0070300      phosphatidic acid binding