CmoCh01G008710 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh01G008710
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: Unknown protein
Location: Cmo_Chr01: 4640945 .. 4643764 (+)
RNA-Seq Expression: CmoCh01G008710
Synteny: CmoCh01G008710
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGACATGGCTTCTCCATTTTCAGCTTCCTTCCTCCTCCTCTTCCTCCTAATTCAACCATTTGTTCTAAGTTTCATCCATTGTTCATACATATCTGCCATCGGGGATCCCGGAATGAAGAACCCAAATGTCCGAGTGGCATTGGAGGCATGGAATTCCTGTAACGAAGTTGGTGCCGAAGCTCCCACCATGGGCAGCCCCAGATTGGCCGATTGCGCTGATTTGCAGCCCTCATCAGCTCCCACCATGGTTGCCCACAAAGTAAACGAATCAATTAACAAACTCAAAGCCGGTGAAAAATTCCCATCGGAGCGATTCAAGCCATACACAGACCCAAATTTATACGCCGCGGAGAAGGAGCGCTATCTGGGTTCATTATGCGAGGTCGATGGCTCTTCAAATCCATGGAATTTCTGGATGATTATGCTGAAAAATGGGAATTTGGACAAGAATTCTACGCTTTGTCCTGAGAATGGTAAGAAGGTTAGTAAAATTAAAACAGAGTTGAAGTTCCCCTGTTTCGGTGAAGGGTGTATGAACCCACCTCTTGTTTGCCATAACTATTCGAGATTGGTGTCTTTGGAGGACGGGATGGTGTCTTTAAGTGGTGGGTTCTATGGAACTTATGAAGTTGATGCTGATTTGAGTAATGGTATAGGGAAAAATTCTTACTTTTCGGTGTCTTGGCGGAAGAATGTTACAACAGGGAGTTGGGTGTTCTTGAATCGATTGACGACGTCTTCTAAGTATCCTTGGCTTATGTTGTACCTTCGATCTGATGCAACCAAGGGCTTCAATGGTGGGTATCACTATGATGGTCGTGGCATTATGAGCAAGGTTCTTCTTCATCCCCTTCTCTTTTCACTTGTTTTTCAAATTGAGTTCAGATATTGAAGTGAAATTTTGAATTGTTATTATCAGTTGCCTGAGTCCCCAAATTTCAAGGTGAGGTTGACACTTGATATCAAAAGTGGTGGTGGAAAGGGCAGCCAATTCTATCTGCTAGACATAGGAAGCTGTTGGAAGAACAATGGAGATCCTTGCAATGGCGACACTACTACCGATGTAACTCGATACAGTGAAATGATTATCAACCCCAAGACCGGTGGTACGTGCAAGCCGAGCAATCTCATGGCTTGTCCGCCCTATCATGTTAGTTCTTCGGGAGAGAAGATATACAGGAATGAGACCTCGAGGTTTCCATACTCAGCTTATCATCTGTATTGCGGTCCTGGAAATGCTATGCATTTGGAGAAACCATGTGAACTTTGTGATCCATATAGCAACCCGCAGTCTCAAGAGTTGGTACAGATTCTTCCACATCCTGAATGGGCTGTTCATGGCTATCCTCAGAAGCAAGGTGAAGGATGGGTTGGAGATCCTAGAACTTGGGAGCTCGACGTTGGAGCTTTGTCCAATCGCCTGTACTTCTACCAGGTAAATCTTCTTTCTTCAAGAACGGAGTAGGATAGGGAGATTATAAGTGTAACGTCTAAGTCCACCGTTAGCAGATATTGTCATCTCTTAGTTTTTTTAAACACATCTACTAGACAGAAGTTTACACACCTTATAAAGAATGCTTTGTTCTTTTTCTCAGTCAATGTGAGATCTTTTTCTCCATCAGGGCCAACGTCATCGCTGACACTCATTCCCTTCTTCAATTGACGTGGGATCTCACAATTCACCTTCTTCGAGCCTCAGCATTCTTGCTGACACTCGTTCCCTTCTCCAATCAATGCGAGACCCCCAATCCACCTCCTTCAGGGTTCACGCTAGCACACTGCCTCGTGACCACCCCATTACAGGGTTCAACCTCCTCACTGGCACATCGCTCGATGTCTGGCTCTAATATCATTTGTAACGGCCCAAGCTCACCACCCGCAGATATTGTCCTCTTTGGGCTTTTCCCTCAAGGTTTTTAAAATGTCTCCTCGAGACAAATTTTCACACCCTTATAAAAAAATGTTTTGCTCTCCTCTTCAACCGATATTGTGAGACCC
CACATCGATTGAGGAGAAGAATGAAACACCCTTATAAGGGTATGGAAACCTCTCCCTAGCATACGCATTTTAAAACCTTAAAGGTAAGCCCGAAAGGGAAAGCCCAAAGAAGAGAATAACCATTGGTGGTGGGCTTAGGTCGTTACAAGTGGTATCATGGCCAGACACCGAACGATGTGCCAGCGAGGAGGCTATTCCTCAAAGGGGTAGACACGAGGCGGTGTGCCAGTAAGGACGGGGTAGATTTGGTGGGGTCCCACATCGATTGGAGAAAGGAACGAGTGTTGGCGAGGACGCCAAGCCTTGAAGGGGGGTGGATTGTGAGATCTCACATTGGTTGGGAGTAAAACGAAACACCCTTTATAAGGGTGTAGAATCCTCCCCCTAACAAACGCATTTTAAAAACCACGAGAGTAAGCACAAAAGAGAAAGCCCAAAAAAAGACAATATCTGCTAGGGTGGACTTGGGCCGTTACAGACGTGGAATCTCAGAGTAAGAAGGTTAAAAGAGTAGACCTACATAATAATGTAATAGGATGCAGTAGACAAGTAGCAAAAAGAGGTGGAAAGCCAAATAATATGAACATAAAAGTGGTATGGTCCATAAAACAGATTTGGCAAGTTGGTTTAAAAAGAGGAAGTCTTTGGTTTCCTTAATTGATAAGAAAGTGAGTGTTTGTTGGCGATGTAGGATCCGGGAACGAAGGCAGCTAGGCGGATATGGAGTTCTATCAATGTTGGCACAGAAGTATACTTCAGCGAAGGGGAGACAGCAGAGTGGAGCGTAAGTGACTTTGATGTTTTGGTGCCTAAAGATTAG

mRNA sequence

ATGGACATGGCTTCTCCATTTTCAGCTTCCTTCCTCCTCCTCTTCCTCCTAATTCAACCATTTGTTCTAAGTTTCATCCATTGTTCATACATATCTGCCATCGGGGATCCCGGAATGAAGAACCCAAATGTCCGAGTGGCATTGGAGGCATGGAATTCCTGTAACGAAGTTGGTGCCGAAGCTCCCACCATGGGCAGCCCCAGATTGGCCGATTGCGCTGATTTGCAGCCCTCATCAGCTCCCACCATGGTTGCCCACAAAGTAAACGAATCAATTAACAAACTCAAAGCCGGTGAAAAATTCCCATCGGAGCGATTCAAGCCATACACAGACCCAAATTTATACGCCGCGGAGAAGGAGCGCTATCTGGGTTCATTATGCGAGGTCGATGGCTCTTCAAATCCATGGAATTTCTGGATGATTATGCTGAAAAATGGGAATTTGGACAAGAATTCTACGCTTTGTCCTGAGAATGGTAAGAAGGTTAGTAAAATTAAAACAGAGTTGAAGTTCCCCTGTTTCGGTGAAGGGTGTATGAACCCACCTCTTGTTTGCCATAACTATTCGAGATTGGTGTCTTTGGAGGACGGGATGGTGTCTTTAAGTGGTGGGTTCTATGGAACTTATGAAGTTGATGCTGATTTGAGTAATGGTATAGGGAAAAATTCTTACTTTTCGGTGTCTTGGCGGAAGAATGTTACAACAGGGAGTTGGGTGTTCTTGAATCGATTGACGACGTCTTCTAAGTATCCTTGGCTTATGTTGTACCTTCGATCTGATGCAACCAAGGGCTTCAATGGTGGGTATCACTATGATGGTCGTGGCATTATGAGCAAGGTTCTTCTTCATCCCCTTCTCTTTTCACTTTTGCCTGAGTCCCCAAATTTCAAGGTGAGGTTGACACTTGATATCAAAAGTGGTGGTGGAAAGGGCAGCCAATTCTATCTGCTAGACATAGGAAGCTGTTGGAAGAACAATGGAGATCCTTGCAATGGCGACACTACTACCGATGTAACTCGATACAGTGAAATGATTATCAACCCCAAGACCGGTGGTACGTGCAAGCCGAGCAATCTCATGGCTTGTCCGCCCTATCATGTTAGTTCTTCGGGAGAGAAGATATACAGGAATGAGACCTCGAGGTTTCCATACTCAGCTTATCATCTGTATTGCGGTCCTGGAAATGCTATGCATTTGGAGAAACCATGTGAACTTTGTGATCCATATAGCAACCCGCAGTCTCAAGAGTTGGTACAGATTCTTCCACATCCTGAATGGGCTGTTCATGGCTATCCTCAGAAGCAAGGTGAAGGATGGGTTGGAGATCCTAGAACTTGGGAGCTCGACGTTGGAGCTTTGTCCAATCGCCTGTACTTCTACCAGGATCCGGGAACGAAGGCAGCTAGGCGGATATGGAGTTCTATCAATGTTGGCACAGAAGTATACTTCAGCGAAGGGGAGACAGCAGAGTGGAGCGTAAGTGACTTTGATGTTTTGGTGCCTAAAGATTAG

Coding sequence (CDS)

ATGGACATGGCTTCTCCATTTTCAGCTTCCTTCCTCCTCCTCTTCCTCCTAATTCAACCATTTGTTCTAAGTTTCATCCATTGTTCATACATATCTGCCATCGGGGATCCCGGAATGAAGAACCCAAATGTCCGAGTGGCATTGGAGGCATGGAATTCCTGTAACGAAGTTGGTGCCGAAGCTCCCACCATGGGCAGCCCCAGATTGGCCGATTGCGCTGATTTGCAGCCCTCATCAGCTCCCACCATGGTTGCCCACAAAGTAAACGAATCAATTAACAAACTCAAAGCCGGTGAAAAATTCCCATCGGAGCGATTCAAGCCATACACAGACCCAAATTTATACGCCGCGGAGAAGGAGCGCTATCTGGGTTCATTATGCGAGGTCGATGGCTCTTCAAATCCATGGAATTTCTGGATGATTATGCTGAAAAATGGGAATTTGGACAAGAATTCTACGCTTTGTCCTGAGAATGGTAAGAAGGTTAGTAAAATTAAAACAGAGTTGAAGTTCCCCTGTTTCGGTGAAGGGTGTATGAACCCACCTCTTGTTTGCCATAACTATTCGAGATTGGTGTCTTTGGAGGACGGGATGGTGTCTTTAAGTGGTGGGTTCTATGGAACTTATGAAGTTGATGCTGATTTGAGTAATGGTATAGGGAAAAATTCTTACTTTTCGGTGTCTTGGCGGAAGAATGTTACAACAGGGAGTTGGGTGTTCTTGAATCGATTGACGACGTCTTCTAAGTATCCTTGGCTTATGTTGTACCTTCGATCTGATGCAACCAAGGGCTTCAATGGTGGGTATCACTATGATGGTCGTGGCATTATGAGCAAGGTTCTTCTTCATCCCCTTCTCTTTTCACTTTTGCCTGAGTCCCCAAATTTCAAGGTGAGGTTGACACTTGATATCAAAAGTGGTGGTGGAAAGGGCAGCCAATTCTATCTGCTAGACATAGGAAGCTGTTGGAAGAACAATGGAGATCCTTGCAATGGCGACACTACTACCGATGTAACTCGATACAGTGAAATGATTATCAACCCCAAGACCGGTGGTACGTGCAAGCCGAGCAATCTCATGGCTTGTCCGCCCTATCATGTTAGTTCTTCGGGAGAGAAGATATACAGGAATGAGACCTCGAGGTTTCCATACTCAGCTTATCATCTGTATTGCGGTCCTGGAAATGCTATGCATTTGGAGAAACCATGTGAACTTTGTGATCCATATAGCAACCCGCAGTCTCAAGAGTTGGTACAGATTCTTCCACATCCTGAATGGGCTGTTCATGGCTATCCTCAGAAGCAAGGTGAAGGATGGGTTGGAGATCCTAGAACTTGGGAGCTCGACGTTGGAGCTTTGTCCAATCGCCTGTACTTCTACCAGGATCCGGGAACGAAGGCAGCTAGGCGGATATGGAGTTCTATCAATGTTGGCACAGAAGTATACTTCAGCGAAGGGGAGACAGCAGAGTGGAGCGTAAGTGACTTTGATGTTTTGGTGCCTAAAGATTAG

Protein sequence

MDMASPFSASFLLLFLLIQPFVLSFIHCSYISAIGDPGMKNPNVRVALEAWNSCNEVGAEAPTMGSPRLADCADLQPSSAPTMVAHKVNESINKLKAGEKFPSERFKPYTDPNLYAAEKERYLGSLCEVDGSSNPWNFWMIMLKNGNLDKNSTLCPENGKKVSKIKTELKFPCFGEGCMNPPLVCHNYSRLVSLEDGMVSLSGGFYGTYEVDADLSNGIGKNSYFSVSWRKNVTTGSWVFLNRLTTSSKYPWLMLYLRSDATKGFNGGYHYDGRGIMSKVLLHPLLFSLLPESPNFKVRLTLDIKSGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPKTGGTCKPSNLMACPPYHVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKPCELCDPYSNPQSQELVQILPHPEWAVHGYPQKQGEGWVGDPRTWELDVGALSNRLYFYQDPGTKAARRIWSSINVGTEVYFSEGETAEWSVSDFDVLVPKD
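
The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code and reading frame 0; the `translate` helper and the codon table construction are illustrative and not part of the database record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed here;
# the record does not state which translation table was used).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 bases and last 12 bases of the CDS listed above.
print(translate("ATGGACATGGCTTCTCCATTTTCA"))
print(translate("CCTAAAGATTAG"))
```

Passing the full CDS string to `translate` should reproduce the listed protein if the assumptions hold; a mismatch would point to a different translation table or frame.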
Homology
BLAST of CmoCh01G008710 vs. ExPASy TrEMBL
Match: A0A6J1GC16 (uncharacterized protein LOC111452605 OS=Cucurbita moschata OX=3662 GN=LOC111452605 PE=4 SV=1)

HSP 1 Score: 1040.4 bits (2689), Expect = 2.4e-300
Identity = 493/503 (98.01%), Positives = 493/503 (98.01%), Query Frame = 0

Query: 1   MDMASPFSASFLLLFLLIQPFVLSFIHCSYISAIGDPGMKNPNVRVALEAWNSCNEVGAE 60
           MDMASPFSASFLLLFLLIQPFVLSFIHCSYISAIGDPGMKNPNVRVALEAWNSCNEVGAE
Sbjct: 1   MDMASPFSASFLLLFLLIQPFVLSFIHCSYISAIGDPGMKNPNVRVALEAWNSCNEVGAE 60

Query: 61  APTMGSPRLADCADLQPSSAPTMVAHKVNESINKLKAGEKFPSERFKPYTDPNLYAAEKE 120
           APTMGSPRLADCADLQPSSAPTMVAHKVNESINKLKAGEKFPSERFKPYTDPNLYAAEKE
Sbjct: 61  APTMGSPRLADCADLQPSSAPTMVAHKVNESINKLKAGEKFPSERFKPYTDPNLYAAEKE 120

Query: 121 RYLGSLCEVDGSSNPWNFWMIMLKNGNLDKNSTLCPENGKKVSKIKTELKFPCFGEGCMN 180
           RYLGSLCEVDGSSNPWNFWMIMLKNGNLDKNSTLCPENGKKVSKIKTELKFPCFGEGCMN
Sbjct: 121 RYLGSLCEVDGSSNPWNFWMIMLKNGNLDKNSTLCPENGKKVSKIKTELKFPCFGEGCMN 180

Query: 181 PPLVCHNYSRLVSLEDGMVSLSGGFYGTYEVDADLSNGIGKNSYFSVSWRKNVTTGSWVF 240
           PPLVCHNYSRLVSLEDGMVSLSGGFYGTYEVDADLSNGIGKNSYFSVSWRKNVTTGSWVF
Sbjct: 181 PPLVCHNYSRLVSLEDGMVSLSGGFYGTYEVDADLSNGIGKNSYFSVSWRKNVTTGSWVF 240

Query: 241 LNRLTTSSKYPWLMLYLRSDATKGFNGGYHYDGRGIMSKVLLHPLLFSLLPESPNFKVRL 300
           LNRLTTSSKYPWLMLYLRSDATKGFNGGYHYDGRGIMSK          LPESPNFKVRL
Sbjct: 241 LNRLTTSSKYPWLMLYLRSDATKGFNGGYHYDGRGIMSK----------LPESPNFKVRL 300

Query: 301 TLDIKSGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPKTGGTCKPSNLM 360
           TLDIKSGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPKTGGTCKPSNLM
Sbjct: 301 TLDIKSGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPKTGGTCKPSNLM 360

Query: 361 ACPPYHVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKPCELCDPYSNPQSQELVQI 420
           ACPPYHVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKPCELCDPYSNPQSQELVQI
Sbjct: 361 ACPPYHVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKPCELCDPYSNPQSQELVQI 420

Query: 421 LPHPEWAVHGYPQKQGEGWVGDPRTWELDVGALSNRLYFYQDPGTKAARRIWSSINVGTE 480
           LPHPEWAVHGYPQKQGEGWVGDPRTWELDVGALSNRLYFYQDPGTKAARRIWSSINVGTE
Sbjct: 421 LPHPEWAVHGYPQKQGEGWVGDPRTWELDVGALSNRLYFYQDPGTKAARRIWSSINVGTE 480

Query: 481 VYFSEGETAEWSVSDFDVLVPKD 504
           VYFSEGETAEWSVSDFDVLVPKD
Sbjct: 481 VYFSEGETAEWSVSDFDVLVPKD 493
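
The percentage in each HSP summary line is the number of identical positions divided by the alignment length. As a rough sketch, the value can be recomputed from the `Identity = matches/length` field; the `percent_identity` helper below is hypothetical and only assumes that the field keeps this layout.

```python
import re


def percent_identity(summary: str) -> float:
    """Recompute percent identity from an HSP summary line."""
    m = re.search(r"Identity\s*=\s*(\d+)/(\d+)", summary)
    if m is None:
        raise ValueError("no Identity field found")
    matches, length = int(m.group(1)), int(m.group(2))
    # Rounded to two decimals, as in the summary lines of this record.
    return round(100.0 * matches / length, 2)


print(percent_identity("Identity = 493/503 (98.01%)"))
```

The same pattern applies to the second ratio on each summary line if the field name in the regular expression is changed accordingly.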

BLAST of CmoCh01G008710 vs. ExPASy TrEMBL
Match: A0A6J1KAX7 (uncharacterized protein LOC111492655 OS=Cucurbita maxima OX=3661 GN=LOC111492655 PE=4 SV=1)

HSP 1 Score: 1001.1 bits (2587), Expect = 1.6e-288
Identity = 475/504 (94.25%), Positives = 483/504 (95.83%), Query Frame = 0

Query: 1   MDMASPFSAS-FLLLFLLIQPFVLSFIHCSYISAIGDPGMKNPNVRVALEAWNSCNEVGA 60
           MDMASPFS   FLLL LL+Q F  SFIHCSYISAIGDPGMKNPNVRVALEAWNSCNEVGA
Sbjct: 1   MDMASPFSPPFFLLLLLLVQLFFPSFIHCSYISAIGDPGMKNPNVRVALEAWNSCNEVGA 60

Query: 61  EAPTMGSPRLADCADLQPSSAPTMVAHKVNESINKLKAGEKFPSERFKPYTDPNLYAAEK 120
           EAPTMGSPRLADCADLQPSSAPTMVAHKVNESINKLKAGEKFPSERFKPY DPNLYA EK
Sbjct: 61  EAPTMGSPRLADCADLQPSSAPTMVAHKVNESINKLKAGEKFPSERFKPYKDPNLYAVEK 120

Query: 121 ERYLGSLCEVDGSSNPWNFWMIMLKNGNLDKNSTLCPENGKKVSKIKTELKFPCFGEGCM 180
           ERYLGSLCEVDGSSNPW+FWMIMLKNGNLDKNSTLCPENGKKVS+IKT+LKFPCFGEGCM
Sbjct: 121 ERYLGSLCEVDGSSNPWSFWMIMLKNGNLDKNSTLCPENGKKVSEIKTDLKFPCFGEGCM 180

Query: 181 NPPLVCHNYSRLVSLEDGMVSLSGGFYGTYEVDADLSNGIGKNSYFSVSWRKNVTTGSWV 240
           NPPLV HNYSRLVSLE+GMVSLSGGFYGTYEVDADLSNGIGKNSYFSVSWRKNVTTGSWV
Sbjct: 181 NPPLVYHNYSRLVSLEEGMVSLSGGFYGTYEVDADLSNGIGKNSYFSVSWRKNVTTGSWV 240

Query: 241 FLNRLTTSSKYPWLMLYLRSDATKGFNGGYHYDGRGIMSKVLLHPLLFSLLPESPNFKVR 300
           FLNRLTTSSKYPW+MLYLRSDATKGFNGGYHYDGRGIM K          LPESPNFKVR
Sbjct: 241 FLNRLTTSSKYPWIMLYLRSDATKGFNGGYHYDGRGIMRK----------LPESPNFKVR 300

Query: 301 LTLDIKSGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPKTGGTCKPSNL 360
           LTLDIKSGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPKTGGTCKPSN+
Sbjct: 301 LTLDIKSGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPKTGGTCKPSNV 360

Query: 361 MACPPYHVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKPCELCDPYSNPQSQELVQ 420
           MACPPYHVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKPCELCDPYSNPQSQELVQ
Sbjct: 361 MACPPYHVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKPCELCDPYSNPQSQELVQ 420

Query: 421 ILPHPEWAVHGYPQKQGEGWVGDPRTWELDVGALSNRLYFYQDPGTKAARRIWSSINVGT 480
           ILPHPEWAVHGYPQKQGEGWVGDPRTWELDVGALSNRLYFYQDPGTKAARRIWSSINVGT
Sbjct: 421 ILPHPEWAVHGYPQKQGEGWVGDPRTWELDVGALSNRLYFYQDPGTKAARRIWSSINVGT 480

Query: 481 EVYFSEGETAEWSVSDFDVLVPKD 504
           EVYFSEGETAEWSVSDFDVLVP+D
Sbjct: 481 EVYFSEGETAEWSVSDFDVLVPED 494

BLAST of CmoCh01G008710 vs. ExPASy TrEMBL
Match: A0A6J1ICQ8 (uncharacterized protein LOC111472586 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111472586 PE=4 SV=1)

HSP 1 Score: 853.6 bits (2204), Expect = 4.2e-244
Identity = 404/521 (77.54%), Positives = 439/521 (84.26%), Query Frame = 0

Query: 2   DMASPFSASFLLLFLLIQPFVLSFIHCS------YISAIGDPGMKNPNVRVALEAWNSCN 61
           +M S FS+SF   FLL+  F L+  HCS      +ISAIGDPGMK+PNVRVA EAWN CN
Sbjct: 29  EMGSLFSSSF---FLLLLQFFLNLTHCSSHESLEFISAIGDPGMKSPNVRVAFEAWNFCN 88

Query: 62  EVGAEAPTMGSPRLADCADLQP-------------SSAPTMVAHKVNESINKLKAGEKFP 121
           EVGAEAP MGSPRLADCADL+              S +  +V HKVNES NKL+AGEKFP
Sbjct: 89  EVGAEAPQMGSPRLADCADLRAPLASDKQDCFGHGSDSNCIVLHKVNESDNKLEAGEKFP 148

Query: 122 SERFKPYTDPNLYAAEKERYLGSLCEVDGSSNPWNFWMIMLKNGNLDKNSTLCPENGKKV 181
           S+RFKPY DP+LY  EKERYLGSLCEV  SSNPW+FWMIMLKNGN DKNSTLCPENGK  
Sbjct: 149 SDRFKPYVDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKNF 208

Query: 182 SKIKTELKFPCFGEGCMNPPLVCHNYSRLVSLEDGMVSLSGGFYGTYEVDADLSNGIGKN 241
            KI T+  FPCFGEGCMN PLV HNYSRLVS +  MVSL+GGFYGTYE+DADLSNGIGKN
Sbjct: 209 RKIITDRTFPCFGEGCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADLSNGIGKN 268

Query: 242 SYFSVSWRKNVTTGSWVFLNRLTTSSKYPWLMLYLRSDATKGFNGGYHYDGRGIMSKVLL 301
           SYFSVSW KNV++GSW+F NRLTTSSKYPWLMLYLRSDATKGFNGGYHYDGRGIM K   
Sbjct: 269 SYFSVSWHKNVSSGSWIFSNRLTTSSKYPWLMLYLRSDATKGFNGGYHYDGRGIMRK--- 328

Query: 302 HPLLFSLLPESPNFKVRLTLDIKSGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYS 361
                  LPESPNFKVRLTLD+KSGGGK SQFYL+DIGSCWKNNGD CNGDTTTDVTRYS
Sbjct: 329 -------LPESPNFKVRLTLDVKSGGGKNSQFYLIDIGSCWKNNGDACNGDTTTDVTRYS 388

Query: 362 EMIINPKTGGTCKPSNLMACPPYHVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKP 421
           EMIINP+T   C+PSNL++CPPYHV +SGEKIYRNETSRFPYSAYHLYC PGNAMHLEKP
Sbjct: 389 EMIINPETTSWCRPSNLVSCPPYHVQASGEKIYRNETSRFPYSAYHLYCSPGNAMHLEKP 448

Query: 422 CELCDPYSNPQSQELVQILPHPEWAVHGYPQKQGEGWVGDPRTWELDVGALSNRLYFYQD 481
            ++CDPYSNPQ+QEL+QILPHPEW VHGYP KQG+GWVGDPRTWELDVGALSNRLYFYQD
Sbjct: 449 YDVCDPYSNPQAQELIQILPHPEWGVHGYPMKQGDGWVGDPRTWELDVGALSNRLYFYQD 508

Query: 482 PGTKAARRIWSSINVGTEVYFSEGETAEWSVSDFDVLVPKD 504
           PGTK ARRIW+SINVGTE+Y SEG TAEWSVSDFDV+VP D
Sbjct: 509 PGTKPARRIWTSINVGTEIYISEGATAEWSVSDFDVIVPPD 536

BLAST of CmoCh01G008710 vs. ExPASy TrEMBL
Match: A0A6J1IG58 (uncharacterized protein LOC111472586 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111472586 PE=4 SV=1)

HSP 1 Score: 853.2 bits (2203), Expect = 5.5e-244
Identity = 404/522 (77.39%), Positives = 439/522 (84.10%), Query Frame = 0

Query: 2   DMASPFSASFLLLFLLIQPFVLSFIHCS------YISAIGDPGMKNPNVRVALEAWNSCN 61
           +M S FS+SF   FLL+  F L+  HCS      +ISAIGDPGMK+PNVRVA EAWN CN
Sbjct: 29  EMGSLFSSSF---FLLLLQFFLNLTHCSSHESLEFISAIGDPGMKSPNVRVAFEAWNFCN 88

Query: 62  EVGAEAPTMGSPRLADCADLQP--------------SSAPTMVAHKVNESINKLKAGEKF 121
           EVGAEAP MGSPRLADCADL+               S +  +V HKVNES NKL+AGEKF
Sbjct: 89  EVGAEAPQMGSPRLADCADLRAPLASADKQDCFGHGSDSNCIVLHKVNESDNKLEAGEKF 148

Query: 122 PSERFKPYTDPNLYAAEKERYLGSLCEVDGSSNPWNFWMIMLKNGNLDKNSTLCPENGKK 181
           PS+RFKPY DP+LY  EKERYLGSLCEV  SSNPW+FWMIMLKNGN DKNSTLCPENGK 
Sbjct: 149 PSDRFKPYVDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCPENGKN 208

Query: 182 VSKIKTELKFPCFGEGCMNPPLVCHNYSRLVSLEDGMVSLSGGFYGTYEVDADLSNGIGK 241
             KI T+  FPCFGEGCMN PLV HNYSRLVS +  MVSL+GGFYGTYE+DADLSNGIGK
Sbjct: 209 FRKIITDRTFPCFGEGCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADLSNGIGK 268

Query: 242 NSYFSVSWRKNVTTGSWVFLNRLTTSSKYPWLMLYLRSDATKGFNGGYHYDGRGIMSKVL 301
           NSYFSVSW KNV++GSW+F NRLTTSSKYPWLMLYLRSDATKGFNGGYHYDGRGIM K  
Sbjct: 269 NSYFSVSWHKNVSSGSWIFSNRLTTSSKYPWLMLYLRSDATKGFNGGYHYDGRGIMRK-- 328

Query: 302 LHPLLFSLLPESPNFKVRLTLDIKSGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRY 361
                   LPESPNFKVRLTLD+KSGGGK SQFYL+DIGSCWKNNGD CNGDTTTDVTRY
Sbjct: 329 --------LPESPNFKVRLTLDVKSGGGKNSQFYLIDIGSCWKNNGDACNGDTTTDVTRY 388

Query: 362 SEMIINPKTGGTCKPSNLMACPPYHVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEK 421
           SEMIINP+T   C+PSNL++CPPYHV +SGEKIYRNETSRFPYSAYHLYC PGNAMHLEK
Sbjct: 389 SEMIINPETTSWCRPSNLVSCPPYHVQASGEKIYRNETSRFPYSAYHLYCSPGNAMHLEK 448

Query: 422 PCELCDPYSNPQSQELVQILPHPEWAVHGYPQKQGEGWVGDPRTWELDVGALSNRLYFYQ 481
           P ++CDPYSNPQ+QEL+QILPHPEW VHGYP KQG+GWVGDPRTWELDVGALSNRLYFYQ
Sbjct: 449 PYDVCDPYSNPQAQELIQILPHPEWGVHGYPMKQGDGWVGDPRTWELDVGALSNRLYFYQ 508

Query: 482 DPGTKAARRIWSSINVGTEVYFSEGETAEWSVSDFDVLVPKD 504
           DPGTK ARRIW+SINVGTE+Y SEG TAEWSVSDFDV+VP D
Sbjct: 509 DPGTKPARRIWTSINVGTEIYISEGATAEWSVSDFDVIVPPD 537

BLAST of CmoCh01G008710 vs. ExPASy TrEMBL
Match: A0A6J1EFC7 (uncharacterized protein LOC111432779 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111432779 PE=4 SV=1)

HSP 1 Score: 850.5 bits (2196), Expect = 3.5e-243
Identity = 403/521 (77.35%), Positives = 436/521 (83.69%), Query Frame = 0

Query: 2   DMASPFSASFLLLFLLIQPFVLSFIHCS------YISAIGDPGMKNPNVRVALEAWNSCN 61
           +M S FS+SF   FLL+  F L+  HCS      +ISAIGDPGMKNPNVRVA EAWN CN
Sbjct: 29  EMGSLFSSSF---FLLLLQFFLNLTHCSSHEALEFISAIGDPGMKNPNVRVAFEAWNFCN 88

Query: 62  EVGAEAPTMGSPRLADCADLQP-------------SSAPTMVAHKVNESINKLKAGEKFP 121
           EVGAEAP MGSPRLADCADL+              S +  +V HKVNES NKL AGEKFP
Sbjct: 89  EVGAEAPQMGSPRLADCADLRAPLASDKQDCFGHGSDSNCIVLHKVNESDNKLGAGEKFP 148

Query: 122 SERFKPYTDPNLYAAEKERYLGSLCEVDGSSNPWNFWMIMLKNGNLDKNSTLCPENGKKV 181
           SERFKPY DP+LY  EKERYLGSLCEV  SSNPW+FWMIMLKNGN DKNSTLC ENGK V
Sbjct: 149 SERFKPYVDPDLYVVEKERYLGSLCEVHDSSNPWSFWMIMLKNGNFDKNSTLCSENGKNV 208

Query: 182 SKIKTELKFPCFGEGCMNPPLVCHNYSRLVSLEDGMVSLSGGFYGTYEVDADLSNGIGKN 241
            KI T+  FPCFGEGCMN PLV HNYSRLVS +  MVSL+GGFYGTYE+DADLSNGIGKN
Sbjct: 209 RKIITDRTFPCFGEGCMNQPLVYHNYSRLVSFDKRMVSLTGGFYGTYELDADLSNGIGKN 268

Query: 242 SYFSVSWRKNVTTGSWVFLNRLTTSSKYPWLMLYLRSDATKGFNGGYHYDGRGIMSKVLL 301
           SYFSVSW KNV++GSW+F NRLTTSSKYPWLMLYLRSDAT GFNGGYHYDGRGIM K   
Sbjct: 269 SYFSVSWHKNVSSGSWIFSNRLTTSSKYPWLMLYLRSDATMGFNGGYHYDGRGIMRK--- 328

Query: 302 HPLLFSLLPESPNFKVRLTLDIKSGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYS 361
                  LPESPNFKVRLTLD+KSGGGK SQFYL+DIGSCWKNNGD CNGDTTTDVTRYS
Sbjct: 329 -------LPESPNFKVRLTLDVKSGGGKNSQFYLIDIGSCWKNNGDACNGDTTTDVTRYS 388

Query: 362 EMIINPKTGGTCKPSNLMACPPYHVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKP 421
           EMIINP+T   C+PSNL++CPPYHV +SGEKIYRNETSRFPYSAYHLYC PGN MHLEKP
Sbjct: 389 EMIINPETTSWCRPSNLVSCPPYHVRASGEKIYRNETSRFPYSAYHLYCSPGNGMHLEKP 448

Query: 422 CELCDPYSNPQSQELVQILPHPEWAVHGYPQKQGEGWVGDPRTWELDVGALSNRLYFYQD 481
            ++CDPYSNPQ+QEL+QILPHPEW VHGYP KQG+GW+GDPRTWELDVGALSNRLYFYQD
Sbjct: 449 YDICDPYSNPQAQELIQILPHPEWGVHGYPMKQGDGWIGDPRTWELDVGALSNRLYFYQD 508

Query: 482 PGTKAARRIWSSINVGTEVYFSEGETAEWSVSDFDVLVPKD 504
           PGTK ARRIW+SINVGTE+Y SEG TAEWSVSDFDV+VP D
Sbjct: 509 PGTKPARRIWTSINVGTEIYISEGSTAEWSVSDFDVIVPPD 536

BLAST of CmoCh01G008710 vs. NCBI nr
Match: XP_022949175.1 (uncharacterized protein LOC111452605 [Cucurbita moschata])

HSP 1 Score: 1040.4 bits (2689), Expect = 5.0e-300
Identity = 493/503 (98.01%), Positives = 493/503 (98.01%), Query Frame = 0

Query: 1   MDMASPFSASFLLLFLLIQPFVLSFIHCSYISAIGDPGMKNPNVRVALEAWNSCNEVGAE 60
           MDMASPFSASFLLLFLLIQPFVLSFIHCSYISAIGDPGMKNPNVRVALEAWNSCNEVGAE
Sbjct: 1   MDMASPFSASFLLLFLLIQPFVLSFIHCSYISAIGDPGMKNPNVRVALEAWNSCNEVGAE 60

Query: 61  APTMGSPRLADCADLQPSSAPTMVAHKVNESINKLKAGEKFPSERFKPYTDPNLYAAEKE 120
           APTMGSPRLADCADLQPSSAPTMVAHKVNESINKLKAGEKFPSERFKPYTDPNLYAAEKE
Sbjct: 61  APTMGSPRLADCADLQPSSAPTMVAHKVNESINKLKAGEKFPSERFKPYTDPNLYAAEKE 120

Query: 121 RYLGSLCEVDGSSNPWNFWMIMLKNGNLDKNSTLCPENGKKVSKIKTELKFPCFGEGCMN 180
           RYLGSLCEVDGSSNPWNFWMIMLKNGNLDKNSTLCPENGKKVSKIKTELKFPCFGEGCMN
Sbjct: 121 RYLGSLCEVDGSSNPWNFWMIMLKNGNLDKNSTLCPENGKKVSKIKTELKFPCFGEGCMN 180

Query: 181 PPLVCHNYSRLVSLEDGMVSLSGGFYGTYEVDADLSNGIGKNSYFSVSWRKNVTTGSWVF 240
           PPLVCHNYSRLVSLEDGMVSLSGGFYGTYEVDADLSNGIGKNSYFSVSWRKNVTTGSWVF
Sbjct: 181 PPLVCHNYSRLVSLEDGMVSLSGGFYGTYEVDADLSNGIGKNSYFSVSWRKNVTTGSWVF 240

Query: 241 LNRLTTSSKYPWLMLYLRSDATKGFNGGYHYDGRGIMSKVLLHPLLFSLLPESPNFKVRL 300
           LNRLTTSSKYPWLMLYLRSDATKGFNGGYHYDGRGIMSK          LPESPNFKVRL
Sbjct: 241 LNRLTTSSKYPWLMLYLRSDATKGFNGGYHYDGRGIMSK----------LPESPNFKVRL 300

Query: 301 TLDIKSGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPKTGGTCKPSNLM 360
           TLDIKSGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPKTGGTCKPSNLM
Sbjct: 301 TLDIKSGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPKTGGTCKPSNLM 360

Query: 361 ACPPYHVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKPCELCDPYSNPQSQELVQI 420
           ACPPYHVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKPCELCDPYSNPQSQELVQI
Sbjct: 361 ACPPYHVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKPCELCDPYSNPQSQELVQI 420

Query: 421 LPHPEWAVHGYPQKQGEGWVGDPRTWELDVGALSNRLYFYQDPGTKAARRIWSSINVGTE 480
           LPHPEWAVHGYPQKQGEGWVGDPRTWELDVGALSNRLYFYQDPGTKAARRIWSSINVGTE
Sbjct: 421 LPHPEWAVHGYPQKQGEGWVGDPRTWELDVGALSNRLYFYQDPGTKAARRIWSSINVGTE 480

Query: 481 VYFSEGETAEWSVSDFDVLVPKD 504
           VYFSEGETAEWSVSDFDVLVPKD
Sbjct: 481 VYFSEGETAEWSVSDFDVLVPKD 493

BLAST of CmoCh01G008710 vs. NCBI nr
Match: KAG6607488.1 (hypothetical protein SDJN03_00830, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1031.2 bits (2665), Expect = 3.0e-297
Identity = 489/503 (97.22%), Positives = 490/503 (97.42%), Query Frame = 0

Query: 1   MDMASPFSASFLLLFLLIQPFVLSFIHCSYISAIGDPGMKNPNVRVALEAWNSCNEVGAE 60
           MDMASPFSASFLLLFLLIQPFVLSFIHCSYISAIGDPGMKNPNVRVALEAWNSCNEVGAE
Sbjct: 1   MDMASPFSASFLLLFLLIQPFVLSFIHCSYISAIGDPGMKNPNVRVALEAWNSCNEVGAE 60

Query: 61  APTMGSPRLADCADLQPSSAPTMVAHKVNESINKLKAGEKFPSERFKPYTDPNLYAAEKE 120
           APTMGSPRLADCADLQPSSAPTMVAHKVNESINKLKAGEKFPSERFKPYTDPNLYAAEKE
Sbjct: 61  APTMGSPRLADCADLQPSSAPTMVAHKVNESINKLKAGEKFPSERFKPYTDPNLYAAEKE 120

Query: 121 RYLGSLCEVDGSSNPWNFWMIMLKNGNLDKNSTLCPENGKKVSKIKTELKFPCFGEGCMN 180
           RYLGSLCEVDGSSNPWNFWMIMLKNGNLDKNSTLCPENGKKVSKIKTELKFPCFGEGCMN
Sbjct: 121 RYLGSLCEVDGSSNPWNFWMIMLKNGNLDKNSTLCPENGKKVSKIKTELKFPCFGEGCMN 180

Query: 181 PPLVCHNYSRLVSLEDGMVSLSGGFYGTYEVDADLSNGIGKNSYFSVSWRKNVTTGSWVF 240
           PPLV HNYSRLVSLEDGMVSLSGGFYGTYEVDADLSNGIGKNSYFSVSWRKNVTTGSWVF
Sbjct: 181 PPLVYHNYSRLVSLEDGMVSLSGGFYGTYEVDADLSNGIGKNSYFSVSWRKNVTTGSWVF 240

Query: 241 LNRLTTSSKYPWLMLYLRSDATKGFNGGYHYDGRGIMSKVLLHPLLFSLLPESPNFKVRL 300
           LNRLTTSSKYPWLMLYLRSDATKGFNGGYHYDGRGIMSK          LPESPNFKVRL
Sbjct: 241 LNRLTTSSKYPWLMLYLRSDATKGFNGGYHYDGRGIMSK----------LPESPNFKVRL 300

Query: 301 TLDIKSGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPKTGGTCKPSNLM 360
           TLDIKSGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPKTGGTCKPSNLM
Sbjct: 301 TLDIKSGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPKTGGTCKPSNLM 360

Query: 361 ACPPYHVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKPCELCDPYSNPQSQELVQI 420
           ACPPYHVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKPCELCDPYSNPQSQELVQI
Sbjct: 361 ACPPYHVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKPCELCDPYSNPQSQELVQI 420

Query: 421 LPHPEWAVHGYPQKQGEGWVGDPRTWELDVGALSNRLYFYQDPGTKAARRIWSSINVGTE 480
           LPHPEWAVHGYPQKQGEGWVGDPRTWELDVGALSNRLYFYQDPGTKAARRIW+SINVGTE
Sbjct: 421 LPHPEWAVHGYPQKQGEGWVGDPRTWELDVGALSNRLYFYQDPGTKAARRIWTSINVGTE 480

Query: 481 VYFSEGETAEWSVSDFDVLVPKD 504
           VYFSEG TAEWS SDFDVLVPKD
Sbjct: 481 VYFSEGATAEWSASDFDVLVPKD 493

BLAST of CmoCh01G008710 vs. NCBI nr
Match: XP_023525505.1 (uncharacterized protein LOC111789092 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1007.7 bits (2604), Expect = 3.6e-290
Identity = 478/503 (95.03%), Positives = 483/503 (96.02%), Query Frame = 0

Query: 1   MDMASPFSASFLLLFLLIQPFVLSFIHCSYISAIGDPGMKNPNVRVALEAWNSCNEVGAE 60
           MDMASPFS  F LL L+IQ F LSFIHCSYISAIGDPGMKNPNVRVALEAWNSCNEVGAE
Sbjct: 1   MDMASPFSPPFFLL-LIIQTFFLSFIHCSYISAIGDPGMKNPNVRVALEAWNSCNEVGAE 60

Query: 61  APTMGSPRLADCADLQPSSAPTMVAHKVNESINKLKAGEKFPSERFKPYTDPNLYAAEKE 120
           APTMGSPRLADCADLQPSSAPTMVAHKVNESINKLKAGEKFPSERFKPY DPNLYAAEKE
Sbjct: 61  APTMGSPRLADCADLQPSSAPTMVAHKVNESINKLKAGEKFPSERFKPYKDPNLYAAEKE 120

Query: 121 RYLGSLCEVDGSSNPWNFWMIMLKNGNLDKNSTLCPENGKKVSKIKTELKFPCFGEGCMN 180
           RYLGSLCEVDGSSNPWNFWMIMLKNGNLDKNSTLCPENGKKVS+IKTELKFPCFGEGCMN
Sbjct: 121 RYLGSLCEVDGSSNPWNFWMIMLKNGNLDKNSTLCPENGKKVSEIKTELKFPCFGEGCMN 180

Query: 181 PPLVCHNYSRLVSLEDGMVSLSGGFYGTYEVDADLSNGIGKNSYFSVSWRKNVTTGSWVF 240
           PPLV HNYSRLVSLEDGM SLSGGFYGTYEVDADLSNGIGKNSYFSVSWRKNVTTGSWVF
Sbjct: 181 PPLVYHNYSRLVSLEDGMASLSGGFYGTYEVDADLSNGIGKNSYFSVSWRKNVTTGSWVF 240

Query: 241 LNRLTTSSKYPWLMLYLRSDATKGFNGGYHYDGRGIMSKVLLHPLLFSLLPESPNFKVRL 300
           LNRLTTSSKYPWLMLYLRSDATKGFNGGYHYDGRGIMSK          LPESPNFKVRL
Sbjct: 241 LNRLTTSSKYPWLMLYLRSDATKGFNGGYHYDGRGIMSK----------LPESPNFKVRL 300

Query: 301 TLDIKSGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPKTGGTCKPSNLM 360
           TLDIKSGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPKTGGTCKPSN+M
Sbjct: 301 TLDIKSGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPKTGGTCKPSNVM 360

Query: 361 ACPPYHVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKPCELCDPYSNPQSQELVQI 420
           ACPPYHVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKPCELCDPYSNPQ+QELVQI
Sbjct: 361 ACPPYHVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKPCELCDPYSNPQAQELVQI 420

Query: 421 LPHPEWAVHGYPQKQGEGWVGDPRTWELDVGALSNRLYFYQDPGTKAARRIWSSINVGTE 480
           LPHPEWAVHGYPQKQGEGWVGDPRTWELDVGALSNRLYFYQDPGTKAARRIWSSINVGTE
Sbjct: 421 LPHPEWAVHGYPQKQGEGWVGDPRTWELDVGALSNRLYFYQDPGTKAARRIWSSINVGTE 480

Query: 481 VYFSEGETAEWSVSDFDVLVPKD 504
           VYFS+GET EWSVSDFDVLVPKD
Sbjct: 481 VYFSKGETVEWSVSDFDVLVPKD 492

BLAST of CmoCh01G008710 vs. NCBI nr
Match: XP_022997810.1 (uncharacterized protein LOC111492655 [Cucurbita maxima])

HSP 1 Score: 1001.1 bits (2587), Expect = 3.4e-288
Identity = 475/504 (94.25%), Positives = 483/504 (95.83%), Query Frame = 0

Query: 1   MDMASPFSAS-FLLLFLLIQPFVLSFIHCSYISAIGDPGMKNPNVRVALEAWNSCNEVGA 60
           MDMASPFS   FLLL LL+Q F  SFIHCSYISAIGDPGMKNPNVRVALEAWNSCNEVGA
Sbjct: 1   MDMASPFSPPFFLLLLLLVQLFFPSFIHCSYISAIGDPGMKNPNVRVALEAWNSCNEVGA 60

Query: 61  EAPTMGSPRLADCADLQPSSAPTMVAHKVNESINKLKAGEKFPSERFKPYTDPNLYAAEK 120
           EAPTMGSPRLADCADLQPSSAPTMVAHKVNESINKLKAGEKFPSERFKPY DPNLYA EK
Sbjct: 61  EAPTMGSPRLADCADLQPSSAPTMVAHKVNESINKLKAGEKFPSERFKPYKDPNLYAVEK 120

Query: 121 ERYLGSLCEVDGSSNPWNFWMIMLKNGNLDKNSTLCPENGKKVSKIKTELKFPCFGEGCM 180
           ERYLGSLCEVDGSSNPW+FWMIMLKNGNLDKNSTLCPENGKKVS+IKT+LKFPCFGEGCM
Sbjct: 121 ERYLGSLCEVDGSSNPWSFWMIMLKNGNLDKNSTLCPENGKKVSEIKTDLKFPCFGEGCM 180

Query: 181 NPPLVCHNYSRLVSLEDGMVSLSGGFYGTYEVDADLSNGIGKNSYFSVSWRKNVTTGSWV 240
           NPPLV HNYSRLVSLE+GMVSLSGGFYGTYEVDADLSNGIGKNSYFSVSWRKNVTTGSWV
Sbjct: 181 NPPLVYHNYSRLVSLEEGMVSLSGGFYGTYEVDADLSNGIGKNSYFSVSWRKNVTTGSWV 240

Query: 241 FLNRLTTSSKYPWLMLYLRSDATKGFNGGYHYDGRGIMSKVLLHPLLFSLLPESPNFKVR 300
           FLNRLTTSSKYPW+MLYLRSDATKGFNGGYHYDGRGIM K          LPESPNFKVR
Sbjct: 241 FLNRLTTSSKYPWIMLYLRSDATKGFNGGYHYDGRGIMRK----------LPESPNFKVR 300

Query: 301 LTLDIKSGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPKTGGTCKPSNL 360
           LTLDIKSGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPKTGGTCKPSN+
Sbjct: 301 LTLDIKSGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPKTGGTCKPSNV 360

Query: 361 MACPPYHVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKPCELCDPYSNPQSQELVQ 420
           MACPPYHVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKPCELCDPYSNPQSQELVQ
Sbjct: 361 MACPPYHVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKPCELCDPYSNPQSQELVQ 420

Query: 421 ILPHPEWAVHGYPQKQGEGWVGDPRTWELDVGALSNRLYFYQDPGTKAARRIWSSINVGT 480
           ILPHPEWAVHGYPQKQGEGWVGDPRTWELDVGALSNRLYFYQDPGTKAARRIWSSINVGT
Sbjct: 421 ILPHPEWAVHGYPQKQGEGWVGDPRTWELDVGALSNRLYFYQDPGTKAARRIWSSINVGT 480

Query: 481 EVYFSEGETAEWSVSDFDVLVPKD 504
           EVYFSEGETAEWSVSDFDVLVP+D
Sbjct: 481 EVYFSEGETAEWSVSDFDVLVPED 494

BLAST of CmoCh01G008710 vs. NCBI nr
Match: KAG7037147.1 (hypothetical protein SDJN02_00769, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 996.9 bits (2576), Expect = 6.3e-287
Identity = 477/503 (94.83%), Positives = 478/503 (95.03%), Query Frame = 0

Query: 1   MDMASPFSASFLLLFLLIQPFVLSFIHCSYISAIGDPGMKNPNVRVALEAWNSCNEVGAE 60
           MDMASPFSASFLLLFLLIQPFVLSFIHCSYISAIGDPGMKNPNVRVALEAWNSCNEVGAE
Sbjct: 1   MDMASPFSASFLLLFLLIQPFVLSFIHCSYISAIGDPGMKNPNVRVALEAWNSCNEVGAE 60

Query: 61  APTMGSPRLADCADLQPSSAPTMVAHKVNESINKLKAGEKFPSERFKPYTDPNLYAAEKE 120
           APTMGSPRLADCADLQPSSAPTMVAHKVNESINKLKAGEKFPSERFKPYTDPNLYAAEKE
Sbjct: 61  APTMGSPRLADCADLQPSSAPTMVAHKVNESINKLKAGEKFPSERFKPYTDPNLYAAEKE 120

Query: 121 RYLGSLCEVDGSSNPWNFWMIMLKNGNLDKNSTLCPENGKKVSKIKTELKFPCFGEGCMN 180
           RYLGSLCEVDGSSNPWNFWMIMLKNGNLDKNSTLCPENGKKVSKIKTELKFPCFGEGCMN
Sbjct: 121 RYLGSLCEVDGSSNPWNFWMIMLKNGNLDKNSTLCPENGKKVSKIKTELKFPCFGEGCMN 180

Query: 181 PPLVCHNYSRLVSLEDGMVSLSGGFYGTYEVDADLSNGIGKNSYFSVSWRKNVTTGSWVF 240
           PPLV HNYSRLVSLEDGMVSLSGGFYGTYEVDADLSNGIGKNSYFSVSWRKNVTTGSWVF
Sbjct: 181 PPLVYHNYSRLVSLEDGMVSLSGGFYGTYEVDADLSNGIGKNSYFSVSWRKNVTTGSWVF 240

Query: 241 LNRLTTSSKYPWLMLYLRSDATKGFNGGYHYDGRGIMSKVLLHPLLFSLLPESPNFKVRL 300
           LNRLTTSSKYPWLMLYLRSDATKGFNGGYHYDGRGIMSK          LPESPNFKVRL
Sbjct: 241 LNRLTTSSKYPWLMLYLRSDATKGFNGGYHYDGRGIMSK----------LPESPNFKVRL 300

Query: 301 TLDIKSGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPKTGGTCKPSNLM 360
           TLDIKSGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPKTGGTCKPSNLM
Sbjct: 301 TLDIKSGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPKTGGTCKPSNLM 360

Query: 361 ACPPYHVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKPCELCDPYSNPQSQELVQI 420
           ACPPYHVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKPCELCDPYSNPQSQELVQI
Sbjct: 361 ACPPYHVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKPCELCDPYSNPQSQELVQI 420

Query: 421 LPHPEWAVHGYPQKQGEGWVGDPRTWELDVGALSNRLYFYQDPGTKAARRIWSSINVGTE 480
           LPHPEWAVHGYPQKQGEGWVGDPRTWELD            DPGTKAARRIW+SINVGTE
Sbjct: 421 LPHPEWAVHGYPQKQGEGWVGDPRTWELD------------DPGTKAARRIWTSINVGTE 480

Query: 481 VYFSEGETAEWSVSDFDVLVPKD 504
           VYFSEG TAEWS SDFDVLVPKD
Sbjct: 481 VYFSEGATAEWSASDFDVLVPKD 481

BLAST of CmoCh01G008710 vs. TAIR 10
Match: AT2G47010.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 10 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G17030.1); Has 72 Blast hits to 72 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 71; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 592.8 bits (1527), Expect = 2.6e-169
Identity = 281/495 (56.77%), Positives = 343/495 (69.29%), Query Frame = 0

Query: 22  VLSFIHCSYISAIGDPGMKNPNVRVALEAWNSCNEVGAEAPTMGSPRLADCADL------ 81
           ++  +H    SA+GDPGMK   +RVA EAWN CNEVG EAP MGSPR ADC DL      
Sbjct: 13  LILLVHGDERSAVGDPGMKRDGLRVAFEAWNFCNEVGFEAPHMGSPRAADCFDLSSKCIK 72

Query: 82  -------QPSSAPTMVAHKVNESINKLKAGEKFP---SERFKPYTDPNLYAAEKERYLGS 141
                    +++ + + HKV++S N+L  G+  P   SE      +P+LYA EKE YLGS
Sbjct: 73  AYTEDQSNKTTSGSSLVHKVSDSDNELGIGKPKPGIISE--SALHNPDLYAVEKELYLGS 132

Query: 142 LCEVDGSSNPWNFWMIMLKNGNLDKNSTLCPENGKKVSKIKTELKFPCFGEGCMNPPLVC 201
           LC+V    NPW+FWM+MLKNGN D  S LCP+NGKK+        FPCFG GCMN P + 
Sbjct: 133 LCQVSDKPNPWSFWMVMLKNGNYDTKSALCPKNGKKIPPFNQPGLFPCFGSGCMNQPTLN 192

Query: 202 HNYSRLVSLEDGMVSLSGGFYGTYEVDADLSNGIGKNSYFSVSWRKNVTTGSWVFLNRLT 261
           H  + L    DG  ++ G F GTYE  AD  NG+   SY+ V W K V  G WVF ++L 
Sbjct: 193 HGKTEL--QRDGQ-TMKGWFNGTYEQGADFGNGLDGISYYEVVWEKRVGVGGWVFKHKLK 252

Query: 262 TSSKYPWLMLYLRSDATKGFNGGYHYDGRGIMSKVLLHPLLFSLLPESPNFKVRLTLDIK 321
           TS+KYPWLMLYLR+DATKGF+GGYHYD RG++            LPESPNFKVRLTL++K
Sbjct: 253 TSAKYPWLMLYLRADATKGFSGGYHYDTRGML----------KTLPESPNFKVRLTLNVK 312

Query: 322 SGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPKTGGTCKPSNLMACPPY 381
            GGG  SQFYLLDIGSCWKNNG PC+GD TTDVTRYSEMIINP+T   C P +L  CPPY
Sbjct: 313 QGGGAKSQFYLLDIGSCWKNNGKPCDGDVTTDVTRYSEMIINPETPLWCNPKSLHNCPPY 372

Query: 382 HVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKPCELCDPYSNPQSQELVQILPHPE 441
           H   +G +++R +   FPY AYH+YC PGNA HLE P   CD YSNPQ+QE++Q+LPHP 
Sbjct: 373 HTFRNGTRVHRTDHRSFPYEAYHVYCAPGNAEHLELPVGTCDAYSNPQAQEILQLLPHPV 432

Query: 442 WAVHGYPQKQGEGWVGDPRTWELDVGALSNRLYFYQDPGTKAARRIWSSINVGTEVYFSE 501
           W  +GYP + G+GWVGDPRTW+LDVG LS+RL+FYQDPGT  ARRIW+S++VGTE+Y  +
Sbjct: 433 WGEYGYPTRLGDGWVGDPRTWDLDVGGLSSRLFFYQDPGTIPARRIWTSVDVGTEIYKED 492

BLAST of CmoCh01G008710 vs. TAIR 10
Match: AT2G47010.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 10 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G17030.1); Has 72 Blast hits to 72 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 71; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 592.8 bits (1527), Expect = 2.6e-169
Identity = 281/495 (56.77%), Positives = 343/495 (69.29%), Query Frame = 0

Query: 22  VLSFIHCSYISAIGDPGMKNPNVRVALEAWNSCNEVGAEAPTMGSPRLADCADL------ 81
           ++  +H    SA+GDPGMK   +RVA EAWN CNEVG EAP MGSPR ADC DL      
Sbjct: 13  LILLVHGDERSAVGDPGMKRDGLRVAFEAWNFCNEVGFEAPHMGSPRAADCFDLSSKCIK 72

Query: 82  -------QPSSAPTMVAHKVNESINKLKAGEKFP---SERFKPYTDPNLYAAEKERYLGS 141
                    +++ + + HKV++S N+L  G+  P   SE      +P+LYA EKE YLGS
Sbjct: 73  AYTEDQSNKTTSGSSLVHKVSDSDNELGIGKPKPGIISE--SALHNPDLYAVEKELYLGS 132

Query: 142 LCEVDGSSNPWNFWMIMLKNGNLDKNSTLCPENGKKVSKIKTELKFPCFGEGCMNPPLVC 201
           LC+V    NPW+FWM+MLKNGN D  S LCP+NGKK+        FPCFG GCMN P + 
Sbjct: 133 LCQVSDKPNPWSFWMVMLKNGNYDTKSALCPKNGKKIPPFNQPGLFPCFGSGCMNQPTLN 192

Query: 202 HNYSRLVSLEDGMVSLSGGFYGTYEVDADLSNGIGKNSYFSVSWRKNVTTGSWVFLNRLT 261
           H  + L    DG  ++ G F GTYE  AD  NG+   SY+ V W K V  G WVF ++L 
Sbjct: 193 HGKTEL--QRDGQ-TMKGWFNGTYEQGADFGNGLDGISYYEVVWEKRVGVGGWVFKHKLK 252

Query: 262 TSSKYPWLMLYLRSDATKGFNGGYHYDGRGIMSKVLLHPLLFSLLPESPNFKVRLTLDIK 321
           TS+KYPWLMLYLR+DATKGF+GGYHYD RG++            LPESPNFKVRLTL++K
Sbjct: 253 TSAKYPWLMLYLRADATKGFSGGYHYDTRGML----------KTLPESPNFKVRLTLNVK 312

Query: 322 SGGGKGSQFYLLDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPKTGGTCKPSNLMACPPY 381
            GGG  SQFYLLDIGSCWKNNG PC+GD TTDVTRYSEMIINP+T   C P +L  CPPY
Sbjct: 313 QGGGAKSQFYLLDIGSCWKNNGKPCDGDVTTDVTRYSEMIINPETPLWCNPKSLHNCPPY 372

Query: 382 HVSSSGEKIYRNETSRFPYSAYHLYCGPGNAMHLEKPCELCDPYSNPQSQELVQILPHPE 441
           H   +G +++R +   FPY AYH+YC PGNA HLE P   CD YSNPQ+QE++Q+LPHP 
Sbjct: 373 HTFRNGTRVHRTDHRSFPYEAYHVYCAPGNAEHLELPVGTCDAYSNPQAQEILQLLPHPV 432

Query: 442 WAVHGYPQKQGEGWVGDPRTWELDVGALSNRLYFYQDPGTKAARRIWSSINVGTEVYFSE 501
           W  +GYP + G+GWVGDPRTW+LDVG LS+RL+FYQDPGT  ARRIW+S++VGTE+Y  +
Sbjct: 433 WGEYGYPTRLGDGWVGDPRTWDLDVGGLSSRLFFYQDPGTIPARRIWTSVDVGTEIYKED 492

BLAST of CmoCh01G008710 vs. TAIR 10
Match: AT1G17030.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G47010.2); Has 70 Blast hits to 70 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 69; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )

HSP 1 Score: 586.3 bits (1510), Expect = 2.4e-167
Identity = 266/474 (56.12%), Positives = 336/474 (70.89%), Query Frame = 0

Query: 29  SYISAIGDPGMKNPNVRVALEAWNSCNEVGAEAPTMGSPRLADCADLQPSSAPTMVAHKV 88
           +Y+SA+GDPGM+N N+RVA+EAWN CNEVG EA  MGSPR+ADC D+  SS P  + HKV
Sbjct: 38  NYVSAVGDPGMRNDNLRVAIEAWNQCNEVGEEATNMGSPRMADCFDIDNSSFPVKIIHKV 97

Query: 89  NESINKLKAGEKFPSERFKPYTDPNLYAAEKERYLGSLCEVDGSSNPWNFWMIMLKNGNL 148
           +E  N+L  G            + ++YAA+KE YLG+ C+V    NPW FWMIMLKNGN 
Sbjct: 98  DERDNRLGVGNG-TYGGISAGDNADIYAAQKEVYLGNKCQVVDKPNPWQFWMIMLKNGNT 157

Query: 149 DKNSTLCPENGKKVSKIKTELKFPCFGEGCMNPPLVCHNYSRLVSLEDGMVSLSGGFYGT 208
           D  + +CPENGKK        +FPCFG+GCMN P + H Y+ LV  E+G   +SG FYGT
Sbjct: 158 DTLAAICPENGKKAKPFPPTGRFPCFGKGCMNMPSMHHEYTSLVDNEEG--HMSGSFYGT 217

Query: 209 YEVDADLSNGIGKNSYFSVSWRKNV-TTGSWVFLNRLTTSSKYPWLMLYLRSDATKGFNG 268
           +++D D  + +G NSY+ V W K +    SWVF + L TSSKYPWLMLYLR+DA++GF+G
Sbjct: 218 WDLDNDQKDPVGNNSYYKVKWEKKIGGNESWVFHHLLKTSSKYPWLMLYLRADASRGFSG 277

Query: 269 GYHYDGRGIMSKVLLHPLLFSLLPESPNFKVRLTLDIKSGGGKGSQFYLLDIGSCWKNNG 328
           GYHYD RG+M   L          +SP+FKV+  L+I  GGG GSQFYL+D+GSCWKN+G
Sbjct: 278 GYHYDTRGMMKMTL----------KSPDFKVKFKLEIIKGGGSGSQFYLMDMGSCWKNDG 337

Query: 329 DPCNGDTTTDVTRYSEMIINPKTGGTCKPSNLMACPPYHVSSSGEKIYRNETSRFPYSAY 388
             C+GD TTDVTRYSEMIINP     C  + L ACPP H   +G K++R +  +FP+ AY
Sbjct: 338 RDCDGDVTTDVTRYSEMIINPGATAVCTRNRLGACPPEHTFPNGTKVHRTDKEKFPFEAY 397

Query: 389 HLYCGPGNAMHLEKPCELCDPYSNPQSQELVQILPHPEWAVHGYPQKQGEGWVGDPRTWE 448
           H YC PGNA   E P E+CDPYSNPQ QE++QILPHP W   GYP K+G+GW+GDPRTWE
Sbjct: 398 HYYCVPGNARFAESPYEVCDPYSNPQPQEILQILPHPVWEQFGYPTKKGQGWIGDPRTWE 457

Query: 449 LDVGALSNRLYFYQDPGTKAARRIWSSINVGTEVYFSEGETAEWSVSDFDVLVP 502
           LDVG LS  L+FYQDPGTK   R WSSI++GTE+Y S+ + AEW+V+DFD+++P
Sbjct: 458 LDVGKLSQSLFFYQDPGTKPVERHWSSIDLGTEIYMSKNQIAEWTVTDFDIVIP 498

BLAST of CmoCh01G008710 vs. TAIR 10
Match: AT4G09965.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G47010.2); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 100.1 bits (248), Expect = 5.2e-21
Identity = 48/69 (69.57%), Positives = 59/69 (85.51%), Query Frame = 0

Query: 434 KQGEGWVGDPRTWELDVGALSNRLYFYQD-PGTKAARRIWSSINVGTEVYFS-EGETAEW 493
           KQG GW+GD RTWE++ GALS+RLYFYQ+ PGTK A+R+W+SINV T++Y S   ETAEW
Sbjct: 148 KQGNGWIGDSRTWEVN-GALSSRLYFYQEYPGTKPAKRMWTSINVVTDIYVSNRQETAEW 207

Query: 494 SVSDFDVLV 501
           +VSDFDVLV
Sbjct: 208 TVSDFDVLV 215
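
The percentages on each HSP summary line above follow directly from the match counts (for example, 281 identical positions over an alignment length of 495). The sketch below is one way to parse such a line and recompute the percentages as a consistency check. It is an illustration only: `parse_hsp_stats` and `percent` are hypothetical helper names, not part of any BLAST toolkit, and the regular expression assumes the exact line layout used in this report.

```python
import re

# Assumed layout: "Identity = N/L (P%), Positives = N/L (P%), ...".
# The "i" in "Positives" is optional so a misspelled "Postives" label
# is accepted as well.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(matches, length):
    """Return matches/length as a percentage rounded to two decimals."""
    return round(100.0 * matches / length, 2)


def parse_hsp_stats(line):
    """Extract (count, alignment length, reported percent) for identity and positives."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    id_n, id_len, id_pct, pos_n, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(id_n), int(id_len), float(id_pct)),
        "positives": (int(pos_n), int(pos_len), float(pos_pct)),
    }


def is_consistent(line):
    """True if both reported percentages match the recomputed values."""
    stats = parse_hsp_stats(line)
    return all(percent(n, length) == pct for n, length, pct in stats.values())
```

Calling `is_consistent` on any of the three HSP summary lines in this section recomputes both percentages from the counts and compares them with the reported values.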

The following BLAST results are available for this feature:
Match Name        E-value     Identity (%)    Description
A0A6J1GC16        2.4e-300    98.01           uncharacterized protein LOC111452605 OS=Cucurbita moschata OX=3662 GN=LOC1114526...
A0A6J1KAX7        1.6e-288    94.25           uncharacterized protein LOC111492655 OS=Cucurbita maxima OX=3661 GN=LOC111492655...
A0A6J1ICQ8        4.2e-244    77.54           uncharacterized protein LOC111472586 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1IG58        5.5e-244    77.39           uncharacterized protein LOC111472586 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1EFC7        3.5e-243    77.35           uncharacterized protein LOC111432779 isoform X2 OS=Cucurbita moschata OX=3662 GN...

Match Name        E-value     Identity (%)    Description
XP_022949175.1    5.0e-300    98.01           uncharacterized protein LOC111452605 [Cucurbita moschata]
KAG6607488.1      3.0e-297    97.22           hypothetical protein SDJN03_00830, partial [Cucurbita argyrosperma subsp. sorori...
XP_023525505.1    3.6e-290    95.03           uncharacterized protein LOC111789092 [Cucurbita pepo subsp. pepo]
XP_022997810.1    3.4e-288    94.25           uncharacterized protein LOC111492655 [Cucurbita maxima]
KAG7037147.1      6.3e-287    94.83           hypothetical protein SDJN02_00769, partial [Cucurbita argyrosperma subsp. argyro...

Match Name        E-value     Identity (%)    Description
AT2G47010.1       2.6e-169    56.77           unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G47010.2       2.6e-169    56.77           unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G17030.1       2.4e-167    56.12           unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G09965.1       5.2e-21     69.57           unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term    IPR Description     Source     Source Term      Source Description     Alignment
None        No IPR available    PANTHER    PTHR33916:SF1    SUBFAMILY NOT NAMED    coord: 26..502
None        No IPR available    PANTHER    PTHR33916        FAMILY NOT NAMED       coord: 26..502

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
CmoCh01G008710.1    CmoCh01G008710.1    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
cellular_component    GO:0016021        integral component of membrane