CmoCh04G005670 (gene) Cucurbita moschata (Rifu) v1

Overview
NameCmoCh04G005670
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu) v1)
DescriptionUnknown protein
LocationCmo_Chr04: 2818778 .. 2819936 (+)
RNA-Seq ExpressionCmoCh04G005670
SyntenyCmoCh04G005670
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
TCTGAAATGGGTACCTCTTCTTGCACAGCCACCTCCCTCCATGGCTTCTACCACTTCCTCTCCCACCAGCTCGACGACCTTGATCATGCCTTCCTCTCCTCCGACTTCATGTCCCTTCACTTTCTCCACAAGGTCCTCTCTCTTCTCAGAGCTCTCCACTCCCACCTTATTCAGCTCGGCCACCGCCTCCACCTCCCCGTTGGAGGCAAGTGGCTCGACGAGTACATGGACGACAGCTCCCGTCTCTGGGATGCTTGTCAAGTCCTCAAATCCGGCATCTCAAGGATTGAGCTTTACCATTCTGAAGCTTCTGCCATAGCTTCTTCTTTGCAGGATCCTCACCTTCTTCGTTTCAATCATCGCGCTTCTCAAAGGGTGTGTTTCGATTTGAATTGGGTTCAGTTTGGTTCTAAAGTTTTGTTCTTTTTCATGTATATTGTTCTGGGTTTCCTCTGTTTTGCTCTGTTTTGCTCTGTTTTATTCTGTTTTACTCTGTTTTCCTCTGTTTTTGAAGGTTCTTCGTGCAATTTATGATTTGGAGAGGAATGGTTTTGTGTTAGAGGAGGAGAATAGAATCTTGATGAATACACGAATTCAGCCATTGTCACTGCTTTGTTTCAACGACAACAGTGCATTGACAGGAATGGGATCGACGTCGAAGTCAAATGCGTTCAATGGATTCAGAGGCGTTCTTCATGCAGTAAAGAGTATCAGTTCATTGCTTCTAATGATTTTACTCTGTTCTCTCGTGTATTGTTGGCCAGAATCCAGCTTCCATGCGAATGATGGGATTGAAAATGAAGATGATCACCATCAAAGAACCATGTTCAGCTCAAGCTTTGTAGCTTCAATGGCGAGATTGAGGCAGAGAGTGGCAAATGAGATAGACAGAGTAGAAGGGCAGCCAGTGGGGATCCTGCTGTTTGAGTTCAGAGAAGCAAAGGCAGCCATGGATGACCTGAAAACAGAGCTCGAGAAGGCATTGGAAGAAGAAGAAGAAGAAGAAGATGAAATTGAAGAGAAAGCAGAGAAGTTGAAGAGTTGGAATGGGGCGTTGAGAAGTGGGGTGGACGCCATTGTTGGAGAGCTTGATGATTTCTTCGATGAAATTGTTGAGGGAAGAAAGAAGCTTTTAGACATTTGCACTTATAACAGGTAG

mRNA sequence

TCTGAAATGGGTACCTCTTCTTGCACAGCCACCTCCCTCCATGGCTTCTACCACTTCCTCTCCCACCAGCTCGACGACCTTGATCATGCCTTCCTCTCCTCCGACTTCATGTCCCTTCACTTTCTCCACAAGGTCCTCTCTCTTCTCAGAGCTCTCCACTCCCACCTTATTCAGCTCGGCCACCGCCTCCACCTCCCCGTTGGAGGCAAGTGGCTCGACGAGTACATGGACGACAGCTCCCGTCTCTGGGATGCTTGTCAAGTCCTCAAATCCGGCATCTCAAGGATTGAGCTTTACCATTCTGAAGCTTCTGCCATAGCTTCTTCTTTGCAGGATCCTCACCTTCTTCGTTTCAATCATCGCGCTTCTCAAAGGGTTCTTCGTGCAATTTATGATTTGGAGAGGAATGGTTTTGTGTTAGAGGAGGAGAATAGAATCTTGATGAATACACGAATTCAGCCATTGTCACTGCTTTGTTTCAACGACAACAGTGCATTGACAGGAATGGGATCGACGTCGAAGTCAAATGCGTTCAATGGATTCAGAGGCGTTCTTCATGCAGTAAAGAGTATCAGTTCATTGCTTCTAATGATTTTACTCTGTTCTCTCGTGTATTGTTGGCCAGAATCCAGCTTCCATGCGAATGATGGGATTGAAAATGAAGATGATCACCATCAAAGAACCATGTTCAGCTCAAGCTTTGTAGCTTCAATGGCGAGATTGAGGCAGAGAGTGGCAAATGAGATAGACAGAGTAGAAGGGCAGCCAGTGGGGATCCTGCTGTTTGAGTTCAGAGAAGCAAAGGCAGCCATGGATGACCTGAAAACAGAGCTCGAGAAGGCATTGGAAGAAGAAGAAGAAGAAGAAGATGAAATTGAAGAGAAAGCAGAGAAGTTGAAGAGTTGGAATGGGGCGTTGAGAAGTGGGGTGGACGCCATTGTTGGAGAGCTTGATGATTTCTTCGATGAAATTGTTGAGGGAAGAAAGAAGCTTTTAGACATTTGCACTTATAACAGGTAG

Coding sequence (CDS)

ATGGGTACCTCTTCTTGCACAGCCACCTCCCTCCATGGCTTCTACCACTTCCTCTCCCACCAGCTCGACGACCTTGATCATGCCTTCCTCTCCTCCGACTTCATGTCCCTTCACTTTCTCCACAAGGTCCTCTCTCTTCTCAGAGCTCTCCACTCCCACCTTATTCAGCTCGGCCACCGCCTCCACCTCCCCGTTGGAGGCAAGTGGCTCGACGAGTACATGGACGACAGCTCCCGTCTCTGGGATGCTTGTCAAGTCCTCAAATCCGGCATCTCAAGGATTGAGCTTTACCATTCTGAAGCTTCTGCCATAGCTTCTTCTTTGCAGGATCCTCACCTTCTTCGTTTCAATCATCGCGCTTCTCAAAGGGTTCTTCGTGCAATTTATGATTTGGAGAGGAATGGTTTTGTGTTAGAGGAGGAGAATAGAATCTTGATGAATACACGAATTCAGCCATTGTCACTGCTTTGTTTCAACGACAACAGTGCATTGACAGGAATGGGATCGACGTCGAAGTCAAATGCGTTCAATGGATTCAGAGGCGTTCTTCATGCAGTAAAGAGTATCAGTTCATTGCTTCTAATGATTTTACTCTGTTCTCTCGTGTATTGTTGGCCAGAATCCAGCTTCCATGCGAATGATGGGATTGAAAATGAAGATGATCACCATCAAAGAACCATGTTCAGCTCAAGCTTTGTAGCTTCAATGGCGAGATTGAGGCAGAGAGTGGCAAATGAGATAGACAGAGTAGAAGGGCAGCCAGTGGGGATCCTGCTGTTTGAGTTCAGAGAAGCAAAGGCAGCCATGGATGACCTGAAAACAGAGCTCGAGAAGGCATTGGAAGAAGAAGAAGAAGAAGAAGATGAAATTGAAGAGAAAGCAGAGAAGTTGAAGAGTTGGAATGGGGCGTTGAGAAGTGGGGTGGACGCCATTGTTGGAGAGCTTGATGATTTCTTCGATGAAATTGTTGAGGGAAGAAAGAAGCTTTTAGACATTTGCACTTATAACAGGTAG

Protein sequence

MGTSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHSHLIQLGHRLHLPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASAIASSLQDPHLLRFNHRASQRVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFRGVLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTMFSSSFVASMARLRQRVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEKALEEEEEEEDEIEEKAEKLKSWNGALRSGVDAIVGELDDFFDEIVEGRKKLLDICTYNR
Homology
BLAST of CmoCh04G005670 vs. ExPASy TrEMBL
Match: A0A6J1FMG3 (uncharacterized protein LOC111447146 OS=Cucurbita moschata OX=3662 GN=LOC111447146 PE=4 SV=1)

HSP 1 Score: 662.1 bits (1707), Expect = 1.2e-186
Identity = 337/337 (100.00%), Postives = 337/337 (100.00%), Query Frame = 0

Query: 1   MGTSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHSHLIQLGHR 60
           MGTSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHSHLIQLGHR
Sbjct: 1   MGTSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHSHLIQLGHR 60

Query: 61  LHLPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASAIASSLQDPHLLRFNHRA 120
           LHLPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASAIASSLQDPHLLRFNHRA
Sbjct: 61  LHLPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASAIASSLQDPHLLRFNHRA 120

Query: 121 SQRVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFR 180
           SQRVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFR
Sbjct: 121 SQRVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFR 180

Query: 181 GVLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTMFSSSFVASMARLR 240
           GVLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTMFSSSFVASMARLR
Sbjct: 181 GVLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTMFSSSFVASMARLR 240

Query: 241 QRVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEKALEEEEEEEDEIEEKAEKLKSW 300
           QRVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEKALEEEEEEEDEIEEKAEKLKSW
Sbjct: 241 QRVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEKALEEEEEEEDEIEEKAEKLKSW 300

Query: 301 NGALRSGVDAIVGELDDFFDEIVEGRKKLLDICTYNR 338
           NGALRSGVDAIVGELDDFFDEIVEGRKKLLDICTYNR
Sbjct: 301 NGALRSGVDAIVGELDDFFDEIVEGRKKLLDICTYNR 337

BLAST of CmoCh04G005670 vs. ExPASy TrEMBL
Match: A0A6J1J2L7 (uncharacterized protein LOC111480769 OS=Cucurbita maxima OX=3661 GN=LOC111480769 PE=4 SV=1)

HSP 1 Score: 641.7 bits (1654), Expect = 1.7e-180
Identity = 330/349 (94.56%), Postives = 334/349 (95.70%), Query Frame = 0

Query: 1   MGTSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHSHLIQLGHR 60
           MGTSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALH+HLIQLGHR
Sbjct: 1   MGTSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHTHLIQLGHR 60

Query: 61  LHLPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASAIASSLQDPHLLRFNHRA 120
           LHLPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASAI+SSLQDPHLLRFNHRA
Sbjct: 61  LHLPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASAISSSLQDPHLLRFNHRA 120

Query: 121 SQRVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFR 180
           SQRVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFR
Sbjct: 121 SQRVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFR 180

Query: 181 GVLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTMFSSSFVASMARLR 240
           GVLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRT FSSSFVASMARLR
Sbjct: 181 GVLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTTFSSSFVASMARLR 240

Query: 241 QRVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEKALEEEEEEED------------ 300
           QRVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEK LEEEEEEE+            
Sbjct: 241 QRVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEKGLEEEEEEEEEEEEEEEEEEEV 300

Query: 301 EIEEKAEKLKSWNGALRSGVDAIVGELDDFFDEIVEGRKKLLDICTYNR 338
           EIEEK EKLKSWNGALRSGVDAIVGELDDFFDEIVEGRKKLLDICT+NR
Sbjct: 301 EIEEKLEKLKSWNGALRSGVDAIVGELDDFFDEIVEGRKKLLDICTHNR 349

BLAST of CmoCh04G005670 vs. ExPASy TrEMBL
Match: A0A5D3DG23 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold584G00090 PE=4 SV=1)

HSP 1 Score: 540.0 bits (1390), Expect = 6.9e-150
Identity = 281/342 (82.16%), Postives = 306/342 (89.47%), Query Frame = 0

Query: 3   TSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHSHLIQLGHRLH 62
           +SSCTATSLHGFYHFLSH+LDDLDHAFLSSDFMSLHFL KVLSLLR LHS LIQLG RLH
Sbjct: 10  SSSCTATSLHGFYHFLSHELDDLDHAFLSSDFMSLHFLQKVLSLLRTLHSQLIQLGQRLH 69

Query: 63  LPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASAIASSLQDPHLLRFNHRASQ 122
           LPVGGKWLDEYMD+SSRLW+A QVLKSGISR+E++H EASAIASSLQDPH LRFN RAS+
Sbjct: 70  LPVGGKWLDEYMDESSRLWEASQVLKSGISRMEVFHVEASAIASSLQDPHFLRFNPRASR 129

Query: 123 RVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSAL-TGMGSTSKSNAFNGFRG 182
           RVLRAI D ERN F LEEENR LMNTRI PLSLLCFN  S++ TGMGSTSK NAFNGFRG
Sbjct: 130 RVLRAITDFERNVFGLEEENRSLMNTRIPPLSLLCFNGGSSMSTGMGSTSKLNAFNGFRG 189

Query: 183 VLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTMFSSSFVASMARLRQ 242
           VLHAVK+ISSLLLMILLC LVYCWPESSFH ++GIENE+D HQRTMFSSSFVASM RL+Q
Sbjct: 190 VLHAVKNISSLLLMILLCGLVYCWPESSFHGSNGIENEEDQHQRTMFSSSFVASMERLKQ 249

Query: 243 RVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEKALEEEEEEED------EIEEKAE 302
           RVANEI+RV+ QPVGILLFEFREAKAAM+ LK ELEK LEEEEEEE+      EIEEK E
Sbjct: 250 RVANEIERVDVQPVGILLFEFREAKAAMEGLKVELEKGLEEEEEEEEEEEEKVEIEEKIE 309

Query: 303 KLKSWNGALRSGVDAIVGELDDFFDEIVEGRKKLLDICTYNR 338
           +L SW G+LR GVDAI+G+LDDFFDEIVEGRKKLLD+CT+NR
Sbjct: 310 RLNSWFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR 351

BLAST of CmoCh04G005670 vs. ExPASy TrEMBL
Match: A0A1S3C325 (uncharacterized protein LOC103496110 OS=Cucumis melo OX=3656 GN=LOC103496110 PE=4 SV=1)

HSP 1 Score: 538.5 bits (1386), Expect = 2.0e-149
Identity = 280/343 (81.63%), Postives = 306/343 (89.21%), Query Frame = 0

Query: 3   TSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHSHLIQLGHRLH 62
           +SSCTATSLHGFYHFLSH+LDDLDHAFLSSDFMSLHFL KVLSLLR LHS LIQLG RLH
Sbjct: 52  SSSCTATSLHGFYHFLSHELDDLDHAFLSSDFMSLHFLQKVLSLLRTLHSQLIQLGQRLH 111

Query: 63  LPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASAIASSLQDPHLLRFNHRASQ 122
           LPVGGKWLDEYMD+SSRLW+A QVLKSGISR+E++H EASAIASSLQDPH LRFN RAS+
Sbjct: 112 LPVGGKWLDEYMDESSRLWEASQVLKSGISRMEVFHVEASAIASSLQDPHFLRFNPRASR 171

Query: 123 RVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSAL-TGMGSTSKSNAFNGFRG 182
           RVLRAI D ERN F LEEENR LMNTRI PLSLLCFN  S++ TGMGSTSK NAFNGFRG
Sbjct: 172 RVLRAITDFERNVFGLEEENRSLMNTRIPPLSLLCFNGGSSMSTGMGSTSKLNAFNGFRG 231

Query: 183 VLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTMFSSSFVASMARLRQ 242
           VLHAVK+ISSLLLMILLC LVYCWPESSFH ++GIENE+D HQRTMFSSSFVASM RL+Q
Sbjct: 232 VLHAVKNISSLLLMILLCGLVYCWPESSFHGSNGIENEEDQHQRTMFSSSFVASMERLKQ 291

Query: 243 RVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEKALEEEEEEED-------EIEEKA 302
           RVANEI+RV+ QPVGILLFEFREAKAAM+ LK ELEK LEE+EEEE+       EIEEK 
Sbjct: 292 RVANEIERVDVQPVGILLFEFREAKAAMEGLKVELEKGLEEDEEEEEEEEEEKVEIEEKI 351

Query: 303 EKLKSWNGALRSGVDAIVGELDDFFDEIVEGRKKLLDICTYNR 338
           E+L SW G+LR GVDAI+G+LDDFFDEIVEGRKKLLD+CT+NR
Sbjct: 352 ERLNSWFGSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR 394

BLAST of CmoCh04G005670 vs. ExPASy TrEMBL
Match: A0A0A0LC30 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G221740 PE=4 SV=1)

HSP 1 Score: 534.3 bits (1375), Expect = 3.8e-148
Identity = 274/336 (81.55%), Postives = 304/336 (90.48%), Query Frame = 0

Query: 3   TSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHSHLIQLGHRLH 62
           +SSCTATSLHGFYHFLSH+LDDLDHAF+SSDFMSLHFL KVLSLLR LHS LIQLG RLH
Sbjct: 4   SSSCTATSLHGFYHFLSHELDDLDHAFVSSDFMSLHFLQKVLSLLRTLHSQLIQLGQRLH 63

Query: 63  LPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASAIASSLQDPHLLRFNHRASQ 122
           LPVGGKWLDEYMD+SSRLW+A QVLKSGISR+E++H EASAIASSLQDPH LRFN RAS+
Sbjct: 64  LPVGGKWLDEYMDESSRLWEASQVLKSGISRMEVFHVEASAIASSLQDPHFLRFNPRASR 123

Query: 123 RVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSAL-TGMGSTSKSNAFNGFRG 182
           RVLRAI D ERN F LEEENR LMNTRI PLSLLCFN +S++ +GMGSTSK NAFNGFRG
Sbjct: 124 RVLRAITDFERNVFGLEEENRSLMNTRIPPLSLLCFNGSSSVSSGMGSTSKLNAFNGFRG 183

Query: 183 VLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTMFSSSFVASMARLRQ 242
           VLHAVK+ISSLLLMILLC LVYCWPES FH ++GI NE+D HQRTMFSSSF+ASM RL+Q
Sbjct: 184 VLHAVKNISSLLLMILLCGLVYCWPESIFHGSNGIGNEEDQHQRTMFSSSFIASMERLKQ 243

Query: 243 RVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEKALEEEEEEEDEIEEKAEKLKSWN 302
           RVANEI+RV+ QPVGILLFEFREAKAAM+ LK ELEK LEE++EEE EIEEK E+L SW 
Sbjct: 244 RVANEIERVDVQPVGILLFEFREAKAAMEGLKVELEKGLEEDDEEEVEIEEKIERLNSWF 303

Query: 303 GALRSGVDAIVGELDDFFDEIVEGRKKLLDICTYNR 338
           G+LR GVDAI+G+LDDFFDEIVEGRKKLLD+CT+NR
Sbjct: 304 GSLRIGVDAIIGQLDDFFDEIVEGRKKLLDMCTHNR 339

BLAST of CmoCh04G005670 vs. NCBI nr
Match: XP_022941931.1 (uncharacterized protein LOC111447146 [Cucurbita moschata])

HSP 1 Score: 662.1 bits (1707), Expect = 2.5e-186
Identity = 337/337 (100.00%), Postives = 337/337 (100.00%), Query Frame = 0

Query: 1   MGTSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHSHLIQLGHR 60
           MGTSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHSHLIQLGHR
Sbjct: 1   MGTSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHSHLIQLGHR 60

Query: 61  LHLPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASAIASSLQDPHLLRFNHRA 120
           LHLPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASAIASSLQDPHLLRFNHRA
Sbjct: 61  LHLPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASAIASSLQDPHLLRFNHRA 120

Query: 121 SQRVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFR 180
           SQRVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFR
Sbjct: 121 SQRVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFR 180

Query: 181 GVLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTMFSSSFVASMARLR 240
           GVLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTMFSSSFVASMARLR
Sbjct: 181 GVLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTMFSSSFVASMARLR 240

Query: 241 QRVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEKALEEEEEEEDEIEEKAEKLKSW 300
           QRVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEKALEEEEEEEDEIEEKAEKLKSW
Sbjct: 241 QRVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEKALEEEEEEEDEIEEKAEKLKSW 300

Query: 301 NGALRSGVDAIVGELDDFFDEIVEGRKKLLDICTYNR 338
           NGALRSGVDAIVGELDDFFDEIVEGRKKLLDICTYNR
Sbjct: 301 NGALRSGVDAIVGELDDFFDEIVEGRKKLLDICTYNR 337

BLAST of CmoCh04G005670 vs. NCBI nr
Match: KAG6600364.1 (hypothetical protein SDJN03_05597, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 654.1 bits (1686), Expect = 6.7e-184
Identity = 335/338 (99.11%), Postives = 336/338 (99.41%), Query Frame = 0

Query: 1   MGTSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHSHLIQLGHR 60
           MGTSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHSHLIQLGHR
Sbjct: 1   MGTSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHSHLIQLGHR 60

Query: 61  LHLPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASAIASSLQDPHLLRFNHRA 120
           LHLPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEAS IASSLQDPHLLRFNHRA
Sbjct: 61  LHLPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASVIASSLQDPHLLRFNHRA 120

Query: 121 SQRVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFR 180
           SQRVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFR
Sbjct: 121 SQRVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFR 180

Query: 181 GVLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTMFSSSFVASMARLR 240
           GVLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTMFSSSFVASMARLR
Sbjct: 181 GVLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTMFSSSFVASMARLR 240

Query: 241 QRVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEKAL-EEEEEEEDEIEEKAEKLKS 300
           QRVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEKAL EEEEEEEDEIEEKAEKLKS
Sbjct: 241 QRVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEKALEEEEEEEEDEIEEKAEKLKS 300

Query: 301 WNGALRSGVDAIVGELDDFFDEIVEGRKKLLDICTYNR 338
           WNGALRSGVDAIVGELDDFFDEIVEGRKKLLDICT+NR
Sbjct: 301 WNGALRSGVDAIVGELDDFFDEIVEGRKKLLDICTHNR 338

BLAST of CmoCh04G005670 vs. NCBI nr
Match: KAG7031022.1 (hypothetical protein SDJN02_05061, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 651.7 bits (1680), Expect = 3.3e-183
Identity = 334/341 (97.95%), Postives = 336/341 (98.53%), Query Frame = 0

Query: 1   MGTSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHSHLIQLGHR 60
           MGTSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHSHLIQLGHR
Sbjct: 1   MGTSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHSHLIQLGHR 60

Query: 61  LHLPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASAIASSLQDPHLLRFNHRA 120
           LHLPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEAS IASSLQDPHLLRFNHRA
Sbjct: 61  LHLPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASVIASSLQDPHLLRFNHRA 120

Query: 121 SQRVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFR 180
           SQRVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFR
Sbjct: 121 SQRVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFR 180

Query: 181 GVLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTMFSSSFVASMARLR 240
           GVLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTMFSSSFVASMARLR
Sbjct: 181 GVLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTMFSSSFVASMARLR 240

Query: 241 QRVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEKALEEEEEEE----DEIEEKAEK 300
           QRVANEID+VEGQPVGILLFEFREAKAAMDDLKTELEKALEEEEEEE    DEIEEKAEK
Sbjct: 241 QRVANEIDKVEGQPVGILLFEFREAKAAMDDLKTELEKALEEEEEEEEEEDDEIEEKAEK 300

Query: 301 LKSWNGALRSGVDAIVGELDDFFDEIVEGRKKLLDICTYNR 338
           LKSWNGALRSGVDAIVGELDDFFDEIVEGRKKLLDICT+NR
Sbjct: 301 LKSWNGALRSGVDAIVGELDDFFDEIVEGRKKLLDICTHNR 341

BLAST of CmoCh04G005670 vs. NCBI nr
Match: XP_023547569.1 (uncharacterized protein LOC111806465 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 642.5 bits (1656), Expect = 2.0e-180
Identity = 329/340 (96.76%), Postives = 333/340 (97.94%), Query Frame = 0

Query: 1   MGTSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHSHLIQLGHR 60
           MGTSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHSHLIQLGHR
Sbjct: 1   MGTSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHSHLIQLGHR 60

Query: 61  LHLPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASAIASSLQDPHLLRFNHRA 120
           LHLPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASAIASSLQDPHLLRFNHRA
Sbjct: 61  LHLPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASAIASSLQDPHLLRFNHRA 120

Query: 121 SQRVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFR 180
           SQRVLRAIYDLERNGFVLEEENRILM+TRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFR
Sbjct: 121 SQRVLRAIYDLERNGFVLEEENRILMSTRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFR 180

Query: 181 GVLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTMFSSSFVASMARLR 240
           GVLHAVKSISSLLLMILLCSLVYCWPESSFH NDGIE EDDHHQR MFSSSFVASMARLR
Sbjct: 181 GVLHAVKSISSLLLMILLCSLVYCWPESSFHTNDGIETEDDHHQRPMFSSSFVASMARLR 240

Query: 241 QRVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEKALEEEEEEEDE---IEEKAEKL 300
           QRVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELE+ALEEEEEEE+E   IEEK EKL
Sbjct: 241 QRVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELERALEEEEEEEEEEDVIEEKVEKL 300

Query: 301 KSWNGALRSGVDAIVGELDDFFDEIVEGRKKLLDICTYNR 338
           KSWNGALRSGVDAIVGELDDFFDEIVEGRKKLLDICT+NR
Sbjct: 301 KSWNGALRSGVDAIVGELDDFFDEIVEGRKKLLDICTHNR 340

BLAST of CmoCh04G005670 vs. NCBI nr
Match: XP_022981709.1 (uncharacterized protein LOC111480769 [Cucurbita maxima])

HSP 1 Score: 641.7 bits (1654), Expect = 3.5e-180
Identity = 330/349 (94.56%), Postives = 334/349 (95.70%), Query Frame = 0

Query: 1   MGTSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHSHLIQLGHR 60
           MGTSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALH+HLIQLGHR
Sbjct: 1   MGTSSCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHTHLIQLGHR 60

Query: 61  LHLPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASAIASSLQDPHLLRFNHRA 120
           LHLPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASAI+SSLQDPHLLRFNHRA
Sbjct: 61  LHLPVGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASAISSSLQDPHLLRFNHRA 120

Query: 121 SQRVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFR 180
           SQRVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFR
Sbjct: 121 SQRVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFR 180

Query: 181 GVLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTMFSSSFVASMARLR 240
           GVLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRT FSSSFVASMARLR
Sbjct: 181 GVLHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTTFSSSFVASMARLR 240

Query: 241 QRVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEKALEEEEEEED------------ 300
           QRVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEK LEEEEEEE+            
Sbjct: 241 QRVANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEKGLEEEEEEEEEEEEEEEEEEEV 300

Query: 301 EIEEKAEKLKSWNGALRSGVDAIVGELDDFFDEIVEGRKKLLDICTYNR 338
           EIEEK EKLKSWNGALRSGVDAIVGELDDFFDEIVEGRKKLLDICT+NR
Sbjct: 301 EIEEKLEKLKSWNGALRSGVDAIVGELDDFFDEIVEGRKKLLDICTHNR 349

BLAST of CmoCh04G005670 vs. TAIR 10
Match: AT1G22030.1 (CONTAINS InterPro DOMAIN/s: Protein BYPASS related (InterPro:IPR008511); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G77855.1); Has 99 Blast hits to 99 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 3; Fungi - 0; Plants - 96; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 271.9 bits (694), Expect = 6.7e-73
Identity = 157/342 (45.91%), Postives = 225/342 (65.79%), Query Frame = 0

Query: 5   SCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHSHLIQLGHRLHLP 64
           SC A S++GFY FL+  ++DL+  +LS++FMS+HFL + L LLR  HSHL  L  +L LP
Sbjct: 4   SC-ANSVNGFYSFLNRSMEDLERVYLSNNFMSVHFLQRALCLLRTSHSHLTLLVQKLQLP 63

Query: 65  VGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASAIASSLQDPHLLRFNHRASQRV 124
           VG KWLDEYMD+SS+LW+AC V+KS +S +E + S   +IAS+L      R + + S++V
Sbjct: 64  VGDKWLDEYMDESSKLWEACLVIKSAVSSVENFSSAGISIASTLD----RRLSPQLSRQV 123

Query: 125 LRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSALTGMGSTSK-SNAFNGFRGVL 184
           +RAI    R    +EEENR LM  R+Q      +++ ++ T M S++K  N F+GFRGVL
Sbjct: 124 IRAISGCRREAIGIEEENRALMENRVQRFPF--WSEQTSATAMESSTKLQNGFSGFRGVL 183

Query: 185 HAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTMFSSSFVASMARLRQRV 244
           +A +++SSLLLM+L+  LVYC+P      +   + +    Q       F  +M RL+QRV
Sbjct: 184 YATRNMSSLLLMVLMNGLVYCFP-----GDAATQTQTQITQTQSQVGGFAGAMGRLQQRV 243

Query: 245 ANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEKAL----------EEEEEEEDEIEEK 304
           A E+ R+ G   GIL+ E+R +KAA+++LK ELE+            EEEEE+E E+ E+
Sbjct: 244 AAEVGRM-GIRKGILMHEYRRSKAALEELKAELERRFCGGGGGGGEREEEEEDERELRER 303

Query: 305 AEKLKSWNGALRSGVDAIVGELDDFFDEIVEGRKKLLDICTY 336
            E LK + G LR+G ++IV ++DDFFDEIVEGRKKLLD C++
Sbjct: 304 VENLKGYFGNLRNGTESIVAQIDDFFDEIVEGRKKLLDFCSH 332

BLAST of CmoCh04G005670 vs. TAIR 10
Match: AT1G77855.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G22030.1); Has 120 Blast hits to 120 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 120; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 241.9 bits (616), Expect = 7.4e-64
Identity = 144/337 (42.73%), Postives = 213/337 (63.20%), Query Frame = 0

Query: 5   SCTATSLHGFYHFLSHQLDDLDHAFLSSDFMSLHFLHKVLSLLRALHSHLIQLGHRLHLP 64
           SC + S++GFY FL+  ++DL+  ++S++FMSL FL +V+ LLR  HSHL  L  +L+LP
Sbjct: 4   SC-SNSVNGFYSFLNRSMEDLERVYISNNFMSLQFLQRVICLLRTSHSHLTLLVQKLNLP 63

Query: 65  VGGKWLDEYMDDSSRLWDACQVLKSGISRIELYHSEASAIASSLQD--PHLLRFNHRASQ 124
           VG KWLD+YMD++S+LWD C V+KS IS IE + S A +I S+L     H    + + S+
Sbjct: 64  VGDKWLDDYMDETSKLWDVCHVIKSAISTIESFCSSAISITSTLDGHYHHRRLLSPQISR 123

Query: 125 RVLRAIYDLERNGFVLEEENRILMNTRIQPLSLLCFNDNSALTGMGSTSKSNAFNGFRGV 184
           +V+RAI    R    +EEENR LM  RIQ      +++    TGM S+   N F+GFRGV
Sbjct: 124 QVIRAISGCRREAVGIEEENRALMENRIQRFPF--WSEQVTTTGMESSKIQNGFSGFRGV 183

Query: 185 LHAVKSISSLLLMILLCSLVYCWPESSFHANDGIENEDDHHQRTMFSSSFVASMARLRQR 244
           ++ +K+I+SLLL+IL+  LVY  P                      ++    +M RL+QR
Sbjct: 184 MNTMKNINSLLLVILMQGLVYYIPGD--------------------TTVPTGTMMRLKQR 243

Query: 245 VANEIDRVEGQPVGILLFEFREAKAAMDDLKTELEKAL----EEEEEEEDEIEEKAEKLK 304
           VA E++R+ G   G++++E+R +K AM++LK ELE+       EEE  E  + E+ E LK
Sbjct: 244 VAAEMERI-GVRKGMMMYEYRRSKTAMEELKVELERRCCGGGGEEEAVEKGLRERIENLK 303

Query: 305 SWNGALRSGVDAIVGELDDFFDEIVEGRKKLLDICTY 336
              G+LR+G ++IV ++DDFFD+IV+GRK LLD C++
Sbjct: 304 GSVGSLRNGTESIVAQIDDFFDDIVDGRKMLLDYCSH 316

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J1FMG31.2e-186100.00uncharacterized protein LOC111447146 OS=Cucurbita moschata OX=3662 GN=LOC1114471... [more]
A0A6J1J2L71.7e-18094.56uncharacterized protein LOC111480769 OS=Cucurbita maxima OX=3661 GN=LOC111480769... [more]
A0A5D3DG236.9e-15082.16Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... [more]
A0A1S3C3252.0e-14981.63uncharacterized protein LOC103496110 OS=Cucumis melo OX=3656 GN=LOC103496110 PE=... [more]
A0A0A0LC303.8e-14881.55Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G221740 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
XP_022941931.12.5e-186100.00uncharacterized protein LOC111447146 [Cucurbita moschata][more]
KAG6600364.16.7e-18499.11hypothetical protein SDJN03_05597, partial [Cucurbita argyrosperma subsp. sorori... [more]
KAG7031022.13.3e-18397.95hypothetical protein SDJN02_05061, partial [Cucurbita argyrosperma subsp. argyro... [more]
XP_023547569.12.0e-18096.76uncharacterized protein LOC111806465 [Cucurbita pepo subsp. pepo][more]
XP_022981709.13.5e-18094.56uncharacterized protein LOC111480769 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
AT1G22030.16.7e-7345.91CONTAINS InterPro DOMAIN/s: Protein BYPASS related (InterPro:IPR008511); BEST Ar... [more]
AT1G77855.17.4e-6442.73unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 262..303
NoneNo IPR availablePANTHERPTHR31509BPS1-LIKE PROTEINcoord: 4..335
NoneNo IPR availablePANTHERPTHR31509:SF92DUF793 FAMILY PROTEINcoord: 4..335

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh04G005670.1CmoCh04G005670.1mRNA