Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGATTCTCTCACCAACCTCCACCTCTGTTTCCTCGCTGCAGCTCTCCTCTCCGCCACTGCCCATGGCTACGCCGTCGTCACCGGCACCGTCTTTTGCGATCAGTGCAAGGATGGTCAAATTTCCATGTTCGATTACCCGATCAACGGTATGATTAAATTTATCGTAACCCATTATTTTAATATTAAATTTATCATAACCCGTCAACTTAATTTGGTCGTTGCAGGTGTTAAAGTGATGGTGGCTTGTGGCGATGGCAATGGGGGAGTGACGTATTTGAGGGAGGAGACTACGAATTTGTTTGGAAGCTTTACGATGAGATTTGATGGAATGCCGGATCTTAGCGGCTGTTATGCGACGGTTGGTGGAACTGGAGAGGGAGCGACGACGGACTGCGGCGGCGCTGGTAGCCCGCCGAGGAGTCTTAGGTTAATGTTTAGATTGTTTAGTATGGAAATGTATGTGGTTGATTCGTTGCTGTCTCAACCGGCTCAACCAATGTCGTTCTGTTCCGCATCGGTTCATCCTGTCCCGGTCCCGGTTGTTATGCCACCACCGCCGAATTTGCCTCCTCCATTCAAGATGCCTCCGCTTCCTCAATTGCCACCACTTCCGCCACTTCCTCAGCTGCCGCCAATGCCATTCTTGGAAGCATCTGCTTGTCAACATGAGTAAGTTACACGTTCAGAATTATTGAAAGGGAGTCCTACTCGACCAATTTAGAGAATAATCATAATATTATTATAAGTCAGAAATATATCTCTATTAACACATTTATTTATGTGTAGAAATTGGACGAATCCATACTACAAATGTTATTGGAGAGCGGTGAACCCGGACATGAAAGTGGGCGTTGTTTTTGGAGTACTAGCGGCGAACCGTTACGGGACCGACTTGACCCTATGGAACGGGCTACAGGGGCGAGGGGACCCTTATAGGACCCTTCTACGAGAAGCCATAACGGCGTTCCTCAATTCCTACAATTCCGTTCATTTCCCCTACCCTGCACTGTCTGTAGTGGAAGGCTTAAATTGGGCCTTATTGGGCTCTCAACGGGCCGTCCTCCACACGGCCCTTCGTTTCAAACGGGCCAATTCGGGTAATGGCCACGTCACCTGCAAATTCGATCCTTGCCAATAA
mRNA sequence
ATGGATTCTCTCACCAACCTCCACCTCTGTTTCCTCGCTGCAGCTCTCCTCTCCGCCACTGCCCATGGCTACGCCGTCGTCACCGGCACCGTCTTTTGCGATCAGTGCAAGGATGGTCAAATTTCCATGTTCGATTACCCGATCAACGGTGTTAAAGTGATGGTGGCTTGTGGCGATGGCAATGGGGGAGTGACGTATTTGAGGGAGGAGACTACGAATTTGTTTGGAAGCTTTACGATGAGATTTGATGGAATGCCGGATCTTAGCGGCTGTTATGCGACGGTTGGTGGAACTGGAGAGGGAGCGACGACGGACTGCGGCGGCGCTGGTAGCCCGCCGAGGAGTCTTAGGTTAATGTTTAGATTGTTTAGTATGGAAATGTATGTGGTTGATTCGTTGCTGTCTCAACCGGCTCAACCAATGTCGTTCTGTTCCGCATCGGTTCATCCTGTCCCGGTCCCGGTTGTTATGCCACCACCGCCGAATTTGCCTCCTCCATTCAAGATGCCTCCGCTTCCTCAATTGCCACCACTTCCGCCACTTCCTCAGCTGCCGCCAATGCCATTCTTGGAAGCATCTGCTTGTCAACATGAAAATTGGACGAATCCATACTACAAATGTTATTGGAGAGCGGTGAACCCGGACATGAAAGTGGGCGTTGTTTTTGGAGTACTAGCGGCGAACCGTTACGGGACCGACTTGACCCTATGGAACGGGCTACAGGGGCGAGGGGACCCTTATAGGACCCTTCTACGAGAAGCCATAACGGCGTTCCTCAATTCCTACAATTCCGTTCATTTCCCCTACCCTGCACTGTCTGTAGTGGAAGGCTTAAATTGGGCCTTATTGGGCTCTCAACGGGCCGTCCTCCACACGGCCCTTCGTTTCAAACGGGCCAATTCGGGTAATGGCCACGTCACCTGCAAATTCGATCCTTGCCAATAA
Coding sequence (CDS)
ATGGATTCTCTCACCAACCTCCACCTCTGTTTCCTCGCTGCAGCTCTCCTCTCCGCCACTGCCCATGGCTACGCCGTCGTCACCGGCACCGTCTTTTGCGATCAGTGCAAGGATGGTCAAATTTCCATGTTCGATTACCCGATCAACGGTGTTAAAGTGATGGTGGCTTGTGGCGATGGCAATGGGGGAGTGACGTATTTGAGGGAGGAGACTACGAATTTGTTTGGAAGCTTTACGATGAGATTTGATGGAATGCCGGATCTTAGCGGCTGTTATGCGACGGTTGGTGGAACTGGAGAGGGAGCGACGACGGACTGCGGCGGCGCTGGTAGCCCGCCGAGGAGTCTTAGGTTAATGTTTAGATTGTTTAGTATGGAAATGTATGTGGTTGATTCGTTGCTGTCTCAACCGGCTCAACCAATGTCGTTCTGTTCCGCATCGGTTCATCCTGTCCCGGTCCCGGTTGTTATGCCACCACCGCCGAATTTGCCTCCTCCATTCAAGATGCCTCCGCTTCCTCAATTGCCACCACTTCCGCCACTTCCTCAGCTGCCGCCAATGCCATTCTTGGAAGCATCTGCTTGTCAACATGAAAATTGGACGAATCCATACTACAAATGTTATTGGAGAGCGGTGAACCCGGACATGAAAGTGGGCGTTGTTTTTGGAGTACTAGCGGCGAACCGTTACGGGACCGACTTGACCCTATGGAACGGGCTACAGGGGCGAGGGGACCCTTATAGGACCCTTCTACGAGAAGCCATAACGGCGTTCCTCAATTCCTACAATTCCGTTCATTTCCCCTACCCTGCACTGTCTGTAGTGGAAGGCTTAAATTGGGCCTTATTGGGCTCTCAACGGGCCGTCCTCCACACGGCCCTTCGTTTCAAACGGGCCAATTCGGGTAATGGCCACGTCACCTGCAAATTCGATCCTTGCCAATAA
Protein sequence
MDSLTNLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPINGVKVMVACGDGNGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRLMFRLFSMEMYVVDSLLSQPAQPMSFCSASVHPVPVPVVMPPPPNLPPPFKMPPLPQLPPLPPLPQLPPMPFLEASACQHENWTNPYYKCYWRAVNPDMKVGVVFGVLAANRYGTDLTLWNGLQGRGDPYRTLLREAITAFLNSYNSVHFPYPALSVVEGLNWALLGSQRAVLHTALRFKRANSGNGHVTCKFDPCQ
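The protein sequence above corresponds to a translation of the coding sequence (CDS) listed before it. The following is a minimal sketch of that relationship, assuming the standard genetic code; the `translate` helper and the two short excerpts (the first 24 and last 15 nucleotides of the CDS) are illustrative choices, not part of the database record.

```python
from itertools import product

# Standard genetic code, codons enumerated with base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the CDS above (hypothetical variable names).
cds_start = "ATGGATTCTCTCACCAACCTCCAC"  # first 8 codons
cds_end = "GATCCTTGCCAATAA"             # last 4 codons plus the TAA stop

print(translate(cds_start))
print(translate(cds_end))
```

The two excerpts translate to MDSLTNLH and DPCQ, which match the first eight and last four residues of the protein sequence.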
Homology
BLAST of Csor.00g229810 vs. NCBI nr
Match:
KAG6571035.1 (hypothetical protein SDJN03_29950, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 644 bits (1662), Expect = 1.89e-233
Identity = 314/314 (100.00%), Positives = 314/314 (100.00%), Query Frame = 0
Query: 1 MDSLTNLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPINGVKVMVACGDG 60
MDSLTNLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPINGVKVMVACGDG
Sbjct: 1 MDSLTNLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPINGVKVMVACGDG 60
Query: 61 NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRLMF 120
NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRLMF
Sbjct: 61 NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRLMF 120
Query: 121 RLFSMEMYVVDSLLSQPAQPMSFCSASVHPVPVPVVMPPPPNLPPPFKMPPLPQLPPLPP 180
RLFSMEMYVVDSLLSQPAQPMSFCSASVHPVPVPVVMPPPPNLPPPFKMPPLPQLPPLPP
Sbjct: 121 RLFSMEMYVVDSLLSQPAQPMSFCSASVHPVPVPVVMPPPPNLPPPFKMPPLPQLPPLPP 180
Query: 181 LPQLPPMPFLEASACQHENWTNPYYKCYWRAVNPDMKVGVVFGVLAANRYGTDLTLWNGL 240
LPQLPPMPFLEASACQHENWTNPYYKCYWRAVNPDMKVGVVFGVLAANRYGTDLTLWNGL
Sbjct: 181 LPQLPPMPFLEASACQHENWTNPYYKCYWRAVNPDMKVGVVFGVLAANRYGTDLTLWNGL 240
Query: 241 QGRGDPYRTLLREAITAFLNSYNSVHFPYPALSVVEGLNWALLGSQRAVLHTALRFKRAN 300
QGRGDPYRTLLREAITAFLNSYNSVHFPYPALSVVEGLNWALLGSQRAVLHTALRFKRAN
Sbjct: 241 QGRGDPYRTLLREAITAFLNSYNSVHFPYPALSVVEGLNWALLGSQRAVLHTALRFKRAN 300
Query: 301 SGNGHVTCKFDPCQ 314
SGNGHVTCKFDPCQ
Sbjct: 301 SGNGHVTCKFDPCQ 314
BLAST of Csor.00g229810 vs. NCBI nr
Match:
XP_023513005.1 (uncharacterized protein LOC111777579 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 622 bits (1603), Expect = 2.01e-224
Identity = 305/316 (96.52%), Positives = 309/316 (97.78%), Query Frame = 0
Query: 1 MDSLTNLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPINGVKVMVACGDG 60
MDSL+ LHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPINGVKVMVACG+G
Sbjct: 1 MDSLSKLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPINGVKVMVACGNG 60
Query: 61 NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRLMF 120
NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGG GEGATTDCGGAGSPPRSLRLMF
Sbjct: 61 NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGIGEGATTDCGGAGSPPRSLRLMF 120
Query: 121 RLFSMEMYVVDSLLSQPAQPMSFCSASVHPVPVP--VVMPPPPNLPPPFKMPPLPQLPPL 180
RLFSMEMYVVDSLLSQPAQPMSFCSASVHPVPVP VVMPPPPN+PPPFKMPPLPQLPP+
Sbjct: 121 RLFSMEMYVVDSLLSQPAQPMSFCSASVHPVPVPAPVVMPPPPNVPPPFKMPPLPQLPPI 180
Query: 181 PPLPQLPPMPFLEASACQHENWTNPYYKCYWRAVNPDMKVGVVFGVLAANRYGTDLTLWN 240
PPLPQLPPMPFLEASACQHENWTNP YKCYWRAVNPDMKVGVVFGVLAANRYGTDLTLWN
Sbjct: 181 PPLPQLPPMPFLEASACQHENWTNPDYKCYWRAVNPDMKVGVVFGVLAANRYGTDLTLWN 240
Query: 241 GLQGRGDPYRTLLREAITAFLNSYNSVHFPYPALSVVEGLNWALLGSQRAVLHTALRFKR 300
GLQGRGDPYRTLLREAITAFLNSYNSVHFPYP LSVVEGLNWALLGSQRAVLHTALRFKR
Sbjct: 241 GLQGRGDPYRTLLREAITAFLNSYNSVHFPYPTLSVVEGLNWALLGSQRAVLHTALRFKR 300
Query: 301 ANSGNGHVTCKFDPCQ 314
ANSGN HVTCKFDPCQ
Sbjct: 301 ANSGNSHVTCKFDPCQ 316
BLAST of Csor.00g229810 vs. NCBI nr
Match:
XP_022985735.1 (uncharacterized protein LOC111483701 [Cucurbita maxima])
HSP 1 Score: 616 bits (1589), Expect = 2.74e-222
Identity = 302/316 (95.57%), Positives = 305/316 (96.52%), Query Frame = 0
Query: 1 MDSLTNLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPINGVKVMVACGDG 60
MDSL+ LHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYP+NGVKVMVACGDG
Sbjct: 1 MDSLSKLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPMNGVKVMVACGDG 60
Query: 61 NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRLMF 120
NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRLMF
Sbjct: 61 NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRLMF 120
Query: 121 RLFSMEMYVVDSLLSQPAQPMSFCSASVHPVPVP--VVMPPPPNLPPPFKMPPLPQLPPL 180
RLFSMEMYVVDSLLSQP QPMSFCSASVHPVP P VVMPPPPNLPPP KMPPLPQLPPL
Sbjct: 121 RLFSMEMYVVDSLLSQPTQPMSFCSASVHPVPAPAPVVMPPPPNLPPPLKMPPLPQLPPL 180
Query: 181 PPLPQLPPMPFLEASACQHENWTNPYYKCYWRAVNPDMKVGVVFGVLAANRYGTDLTLWN 240
PPLPQLPPMPFLE SACQHENWTNP YKCYWRAVNPD KVGVVFG+LAANRYGTDLTLWN
Sbjct: 181 PPLPQLPPMPFLEVSACQHENWTNPDYKCYWRAVNPDTKVGVVFGLLAANRYGTDLTLWN 240
Query: 241 GLQGRGDPYRTLLREAITAFLNSYNSVHFPYPALSVVEGLNWALLGSQRAVLHTALRFKR 300
GLQGRGDPYRTLLREAITAFLNSYNSVHFPYP LSVVEGLNWALLGSQRAVLHTALRFKR
Sbjct: 241 GLQGRGDPYRTLLREAITAFLNSYNSVHFPYPTLSVVEGLNWALLGSQRAVLHTALRFKR 300
Query: 301 ANSGNGHVTCKFDPCQ 314
ANSGN HVTCKFDPCQ
Sbjct: 301 ANSGNSHVTCKFDPCQ 316
BLAST of Csor.00g229810 vs. NCBI nr
Match:
XP_022944208.1 (uncharacterized protein LOC111448725 [Cucurbita moschata] >KAG7010866.1 hypothetical protein SDJN02_27664 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 612 bits (1578), Expect = 1.16e-220
Identity = 306/316 (96.84%), Positives = 306/316 (96.84%), Query Frame = 0
Query: 1 MDSLTNLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPINGVKVMVACGDG 60
MDSLTNLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPINGVKVMVACGDG
Sbjct: 1 MDSLTNLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPINGVKVMVACGDG 60
Query: 61 NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRLMF 120
NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRLMF
Sbjct: 61 NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRLMF 120
Query: 121 RLFSMEMYVVDSLLSQPAQPMSFCSASVHPVPVP--VVMPPPPNLPPPFKMPPLPQLPPL 180
RLFSMEMYVVDSLLSQPAQ MSFCSASVHPVPVP VVMP PPNLPPP KM PLPQLPPL
Sbjct: 121 RLFSMEMYVVDSLLSQPAQSMSFCSASVHPVPVPAPVVMPSPPNLPPPSKMLPLPQLPPL 180
Query: 181 PPLPQLPPMPFLEASACQHENWTNPYYKCYWRAVNPDMKVGVVFGVLAANRYGTDLTLWN 240
PPLP PMPFLEASACQHENWTNP YKCYWRAVNPDMKVGVVFGVLAANRYGTDLTLWN
Sbjct: 181 PPLP---PMPFLEASACQHENWTNPRYKCYWRAVNPDMKVGVVFGVLAANRYGTDLTLWN 240
Query: 241 GLQGRGDPYRTLLREAITAFLNSYNSVHFPYPALSVVEGLNWALLGSQRAVLHTALRFKR 300
GLQGRGDPYRTLLREAITAFLNSYNSVHFPYPALSVVEGLNWALLGSQRAVLHTALRFKR
Sbjct: 241 GLQGRGDPYRTLLREAITAFLNSYNSVHFPYPALSVVEGLNWALLGSQRAVLHTALRFKR 300
Query: 301 ANSGNGHVTCKFDPCQ 314
ANSGNGHVTCKFDPCQ
Sbjct: 301 ANSGNGHVTCKFDPCQ 313
BLAST of Csor.00g229810 vs. NCBI nr
Match:
XP_038900829.1 (uncharacterized protein LOC120087893 [Benincasa hispida])
HSP 1 Score: 543 bits (1400), Expect = 2.34e-193
Identity = 266/324 (82.10%), Positives = 289/324 (89.20%), Query Frame = 0
Query: 1 MDSLTNLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPINGVKVMVACGDG 60
MD L+ L LCFL A LL ATAHGYAVVTGTVFCDQCKDG ISMFDYP+NGVKV VACGDG
Sbjct: 1 MDPLSKLKLCFLTATLLLATAHGYAVVTGTVFCDQCKDGHISMFDYPMNGVKVKVACGDG 60
Query: 61 NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRLMF 120
NGGVTY+REETTNLFGSFTMRFDG PDLSGCYAT+GGT +G T DCGGAG PP+SLRLMF
Sbjct: 61 NGGVTYMREETTNLFGSFTMRFDGTPDLSGCYATIGGTEQGTTMDCGGAGGPPKSLRLMF 120
Query: 121 RLFSMEMYVVDSLLSQPAQPMSFCSASVHPVPVP---VVMPP-------PPNLPPPFKMP 180
RLF MEMYVVDSL++QPA+PMSFCSASV+PVPVP + PP PP LPP FK+P
Sbjct: 121 RLFDMEMYVVDSLVTQPAKPMSFCSASVNPVPVPAPVITKPPSPSLPLSPPPLPPSFKLP 180
Query: 181 PLPQLPPLPPLPQLPPMPFLEASACQHENWTNPYYKCYWRAVNPDMKVGVVFGVLAANRY 240
PLPQLPPLPPLP+LPP+PFLEASACQH+NWTNP Y+CYWRAVNPD KVGV+FG+LAAN+Y
Sbjct: 181 PLPQLPPLPPLPKLPPVPFLEASACQHDNWTNPDYRCYWRAVNPDTKVGVIFGLLAANQY 240
Query: 241 GTDLTLWNGLQGRGDPYRTLLREAITAFLNSYNSVHFPYPALSVVEGLNWALLGSQRAVL 300
GTDLTLWNGLQGRGDPYRTLLREAITAFLNSYNS++FPYP +SVV+ LNWALLGS RAVL
Sbjct: 241 GTDLTLWNGLQGRGDPYRTLLREAITAFLNSYNSLNFPYPTISVVQRLNWALLGSPRAVL 300
Query: 301 HTALRFKRANSGNGHVTCKFDPCQ 314
TALRFKRANSG GHVTCKFDPCQ
Sbjct: 301 LTALRFKRANSGYGHVTCKFDPCQ 324
BLAST of Csor.00g229810 vs. ExPASy TrEMBL
Match:
A0A6J1JC51 (uncharacterized protein LOC111483701 OS=Cucurbita maxima OX=3661 GN=LOC111483701 PE=4 SV=1)
HSP 1 Score: 616 bits (1589), Expect = 1.33e-222
Identity = 302/316 (95.57%), Positives = 305/316 (96.52%), Query Frame = 0
Query: 1 MDSLTNLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPINGVKVMVACGDG 60
MDSL+ LHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYP+NGVKVMVACGDG
Sbjct: 1 MDSLSKLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPMNGVKVMVACGDG 60
Query: 61 NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRLMF 120
NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRLMF
Sbjct: 61 NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRLMF 120
Query: 121 RLFSMEMYVVDSLLSQPAQPMSFCSASVHPVPVP--VVMPPPPNLPPPFKMPPLPQLPPL 180
RLFSMEMYVVDSLLSQP QPMSFCSASVHPVP P VVMPPPPNLPPP KMPPLPQLPPL
Sbjct: 121 RLFSMEMYVVDSLLSQPTQPMSFCSASVHPVPAPAPVVMPPPPNLPPPLKMPPLPQLPPL 180
Query: 181 PPLPQLPPMPFLEASACQHENWTNPYYKCYWRAVNPDMKVGVVFGVLAANRYGTDLTLWN 240
PPLPQLPPMPFLE SACQHENWTNP YKCYWRAVNPD KVGVVFG+LAANRYGTDLTLWN
Sbjct: 181 PPLPQLPPMPFLEVSACQHENWTNPDYKCYWRAVNPDTKVGVVFGLLAANRYGTDLTLWN 240
Query: 241 GLQGRGDPYRTLLREAITAFLNSYNSVHFPYPALSVVEGLNWALLGSQRAVLHTALRFKR 300
GLQGRGDPYRTLLREAITAFLNSYNSVHFPYP LSVVEGLNWALLGSQRAVLHTALRFKR
Sbjct: 241 GLQGRGDPYRTLLREAITAFLNSYNSVHFPYPTLSVVEGLNWALLGSQRAVLHTALRFKR 300
Query: 301 ANSGNGHVTCKFDPCQ 314
ANSGN HVTCKFDPCQ
Sbjct: 301 ANSGNSHVTCKFDPCQ 316
BLAST of Csor.00g229810 vs. ExPASy TrEMBL
Match:
A0A6J1FXU0 (uncharacterized protein LOC111448725 OS=Cucurbita moschata OX=3662 GN=LOC111448725 PE=4 SV=1)
HSP 1 Score: 612 bits (1578), Expect = 5.64e-221
Identity = 306/316 (96.84%), Positives = 306/316 (96.84%), Query Frame = 0
Query: 1 MDSLTNLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPINGVKVMVACGDG 60
MDSLTNLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPINGVKVMVACGDG
Sbjct: 1 MDSLTNLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPINGVKVMVACGDG 60
Query: 61 NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRLMF 120
NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRLMF
Sbjct: 61 NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRLMF 120
Query: 121 RLFSMEMYVVDSLLSQPAQPMSFCSASVHPVPVP--VVMPPPPNLPPPFKMPPLPQLPPL 180
RLFSMEMYVVDSLLSQPAQ MSFCSASVHPVPVP VVMP PPNLPPP KM PLPQLPPL
Sbjct: 121 RLFSMEMYVVDSLLSQPAQSMSFCSASVHPVPVPAPVVMPSPPNLPPPSKMLPLPQLPPL 180
Query: 181 PPLPQLPPMPFLEASACQHENWTNPYYKCYWRAVNPDMKVGVVFGVLAANRYGTDLTLWN 240
PPLP PMPFLEASACQHENWTNP YKCYWRAVNPDMKVGVVFGVLAANRYGTDLTLWN
Sbjct: 181 PPLP---PMPFLEASACQHENWTNPRYKCYWRAVNPDMKVGVVFGVLAANRYGTDLTLWN 240
Query: 241 GLQGRGDPYRTLLREAITAFLNSYNSVHFPYPALSVVEGLNWALLGSQRAVLHTALRFKR 300
GLQGRGDPYRTLLREAITAFLNSYNSVHFPYPALSVVEGLNWALLGSQRAVLHTALRFKR
Sbjct: 241 GLQGRGDPYRTLLREAITAFLNSYNSVHFPYPALSVVEGLNWALLGSQRAVLHTALRFKR 300
Query: 301 ANSGNGHVTCKFDPCQ 314
ANSGNGHVTCKFDPCQ
Sbjct: 301 ANSGNGHVTCKFDPCQ 313
BLAST of Csor.00g229810 vs. ExPASy TrEMBL
Match:
A0A6J1CWR4 (uncharacterized protein LOC111015504 OS=Momordica charantia OX=3673 GN=LOC111015504 PE=4 SV=1)
HSP 1 Score: 486 bits (1252), Expect = 1.76e-171
Identity = 240/314 (76.43%), Positives = 264/314 (84.08%), Query Frame = 0
Query: 1 MDSLTNLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPINGVKVMVACGDG 60
MD L L+LCFLAA+L A A GYAVVTGTVFCDQCKDGQIS+FDYPING KVMVACGDG
Sbjct: 1 MDPLKKLNLCFLAASLFFAAAQGYAVVTGTVFCDQCKDGQISLFDYPINGAKVMVACGDG 60
Query: 61 NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRLMF 120
NGGVTY REETTNLFGSFTMRFDG PDLSGCYA VGGT G CGGA P +SLRLMF
Sbjct: 61 NGGVTYSREETTNLFGSFTMRFDGTPDLSGCYAAVGGTATG----CGGAVGPAKSLRLMF 120
Query: 121 RLFSMEMYVVDSLLSQPAQPMSFCSASVHPVPVPVVMPPPPNLPPPFKMPPLPQLPPLPP 180
R+F MEMYVVDSL+SQPA PM FCS SV+PVP PV +PPP + PPP LP
Sbjct: 121 RMFDMEMYVVDSLISQPALPMPFCSPSVNPVPAPVTVPPPSSPPPPPLR--------LPL 180
Query: 181 LPQLPPMPFLEASACQHENWTNPYYKCYWRAVNPDMKVGVVFGVLAANRYGTDLTLWNGL 240
LP LPP+PFLEASACQHENWTNP Y+CYWRAVNP+ KVGV+FG +AANRYGT++TLWNGL
Sbjct: 181 LPPLPPVPFLEASACQHENWTNPDYRCYWRAVNPETKVGVIFGPVAANRYGTEVTLWNGL 240
Query: 241 QGRGDPYRTLLREAITAFLNSYNSVHFPYPALSVVEGLNWALLGSQRAVLHTALRFKRAN 300
QGRGDPYRTLLREAITAFLNSYNS+H+PYP +SV++ +NWALLGS RAVL TALRFKRAN
Sbjct: 241 QGRGDPYRTLLREAITAFLNSYNSLHYPYPTISVLQRMNWALLGSPRAVLITALRFKRAN 300
Query: 301 SGNGHVTCKFDPCQ 314
SG+ H+TCKF PCQ
Sbjct: 301 SGSPHLTCKFHPCQ 302
BLAST of Csor.00g229810 vs. ExPASy TrEMBL
Match:
A0A5D3BLG6 (Protodermal factor 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold123G001410 PE=4 SV=1)
HSP 1 Score: 446 bits (1146), Expect = 4.42e-155
Identity = 225/320 (70.31%), Positives = 257/320 (80.31%), Query Frame = 0
Query: 1 MDSLTNLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFD--YPINGVKVMVACG 60
MD L LCFLA +L SATA+GY VV+G VFCD+CKDGQ+S+FD YPINGVKV +ACG
Sbjct: 1 MDPLFKFKLCFLALSLFSATAYGYTVVSGFVFCDKCKDGQVSIFDFDYPINGVKVKIACG 60
Query: 61 DGNGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRL 120
DG GGVT LREE TN FG +TM F+G PDLSGC ATV G +GA T+C G PPRSL+L
Sbjct: 61 DGKGGVTDLREEPTNFFGGYTMSFEGTPDLSGCTATVTGPAQGAMTNCSAGGGPPRSLKL 120
Query: 121 MFRLFSMEMYVVDSLLSQPAQPMSFCSASVHPVPVPVVMPPPPNLPPPFKMPPLPQLPPL 180
+FRL +EMY VD L+SQP QPMSFCS+ PVP P P PP+LP K+PPLP+LPPL
Sbjct: 121 LFRLLDLEMYGVDPLVSQPDQPMSFCSSRSAPVPGPKP-PSPPSLPSLPKLPPLPKLPPL 180
Query: 181 PPLP---QLPPMPFLEASACQHENWTNPYYKCYWRAVNPDMKVGVVFGVLAANRYGTDLT 240
PP P Q+P PFLEASACQHENWTNP YKCYWRAVNPD KV V+FG +AA RYGTD+T
Sbjct: 181 PPFPPMRQMPHTPFLEASACQHENWTNPDYKCYWRAVNPDTKVAVIFGAIAAERYGTDMT 240
Query: 241 LWNGLQGRGDPYRTLLREAITAFLNSYNSVHFPYPALSVVEGLNWALL-GSQRAVLHTAL 300
LW GLQGRGDPY+TLLREAITAFLNSY+S+HFPY +LSVV+ N AL+ GS+R+VLH AL
Sbjct: 241 LWKGLQGRGDPYKTLLREAITAFLNSYSSLHFPYHSLSVVQHFNLALMDGSERSVLHAAL 300
Query: 301 RFKRANSGNGHVTCKFDPCQ 314
RFK ANSGNGHVTCKFDPC+
Sbjct: 301 RFKHANSGNGHVTCKFDPCK 319
BLAST of Csor.00g229810 vs. ExPASy TrEMBL
Match:
A0A1S3C647 (uncharacterized protein LOC103497304 OS=Cucumis melo OX=3656 GN=LOC103497304 PE=4 SV=1)
HSP 1 Score: 446 bits (1146), Expect = 4.42e-155
Identity = 225/320 (70.31%), Positives = 257/320 (80.31%), Query Frame = 0
Query: 1 MDSLTNLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFD--YPINGVKVMVACG 60
MD L LCFLA +L SATA+GY VV+G VFCD+CKDGQ+S+FD YPINGVKV +ACG
Sbjct: 1 MDPLFKFKLCFLALSLFSATAYGYTVVSGFVFCDKCKDGQVSIFDFDYPINGVKVKIACG 60
Query: 61 DGNGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRL 120
DG GGVT LREE TN FG +TM F+G PDLSGC ATV G +GA T+C G PPRSL+L
Sbjct: 61 DGKGGVTDLREEPTNFFGGYTMSFEGTPDLSGCTATVTGPAQGAMTNCSAGGGPPRSLKL 120
Query: 121 MFRLFSMEMYVVDSLLSQPAQPMSFCSASVHPVPVPVVMPPPPNLPPPFKMPPLPQLPPL 180
+FRL +EMY VD L+SQP QPMSFCS+ PVP P P PP+LP K+PPLP+LPPL
Sbjct: 121 LFRLLDLEMYGVDPLVSQPDQPMSFCSSRSAPVPGPKP-PSPPSLPSLPKLPPLPKLPPL 180
Query: 181 PPLP---QLPPMPFLEASACQHENWTNPYYKCYWRAVNPDMKVGVVFGVLAANRYGTDLT 240
PP P Q+P PFLEASACQHENWTNP YKCYWRAVNPD KV V+FG +AA RYGTD+T
Sbjct: 181 PPFPPMRQMPHTPFLEASACQHENWTNPDYKCYWRAVNPDTKVAVIFGAIAAERYGTDMT 240
Query: 241 LWNGLQGRGDPYRTLLREAITAFLNSYNSVHFPYPALSVVEGLNWALL-GSQRAVLHTAL 300
LW GLQGRGDPY+TLLREAITAFLNSY+S+HFPY +LSVV+ N AL+ GS+R+VLH AL
Sbjct: 241 LWKGLQGRGDPYKTLLREAITAFLNSYSSLHFPYHSLSVVQHFNLALMDGSERSVLHAAL 300
Query: 301 RFKRANSGNGHVTCKFDPCQ 314
RFK ANSGNGHVTCKFDPC+
Sbjct: 301 RFKHANSGNGHVTCKFDPCK 319
BLAST of Csor.00g229810 vs. TAIR 10
Match:
AT2G16630.1 (Pollen Ole e 1 allergen and extensin family protein)
HSP 1 Score: 297.4 bits (760), Expect = 1.4e-80
Identity = 167/353 (47.31%), Positives = 211/353 (59.77%), Query Frame = 0
Query: 7 LHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPINGVKVMVACGDGNGGVTY 66
+ + F L + GYA VTG+VFCDQCKDG+ S+FD+P++G+K+ V C D NG V
Sbjct: 10 IFVAFFVLCLATNGVTGYATVTGSVFCDQCKDGERSLFDFPVSGIKISVTCADENGQVYM 69
Query: 67 LREETTNLFGSFTMRFDGMPDLSGCYATVGGTG-EGATTDCGGAGSPPRSLRLMFRLFSM 126
REETTN G + MRFDG PDLS CYA V G + + C A P + L+LMF F +
Sbjct: 70 SREETTNWLGGYVMRFDGTPDLSNCYAQVSDNGVQQDPSSCSIASGPAQKLKLMFSFFGI 129
Query: 127 EMYVVDSLLSQPAQPMSFC----SASVHPVP-----------------VPVVMPPPPNLP 186
E + D+LL+QP QP SFC +A V P P VPV+ P PP
Sbjct: 130 ETFAADALLAQPVQPSSFCPKPPTAPVMPPPQVPVMPPPQVPVKPHPKVPVISPDPPATL 189
Query: 187 PPFKMP---------------PLPQLPP--------LPPLPQLPPMPFLEASACQHENWT 246
PP K+P P+ LPP LPPLPQ+PPMPF+E SAC H+ W
Sbjct: 190 PPPKVPVISPDPPTTLPPPLVPVINLPPVTSPPQFKLPPLPQIPPMPFVEPSACSHQLWM 249
Query: 247 NPYYKCYWRAVNPDMKVGVVFGVLAANRYGTDLTLWNGLQGRGDPYRTLLREAITAFLNS 306
P Y+CYWRA+ PD KV V FG++A YGTD+T+ L GRG+ Y+TLLREA TA LNS
Sbjct: 250 KPEYRCYWRAIGPDTKVAVAFGLVAGRIYGTDMTVREALDGRGEAYKTLLREATTALLNS 309
Query: 307 YNSVHFPYPALSVVEGLNWALLG-SQRAVLHTALRFKRANSGNGHVTCKFDPC 314
YNS+ FPY +++V+ N ALLG S+ VL TA+RF +ANSG TC+F C
Sbjct: 310 YNSLGFPYNSVAVITYTNLALLGNSEHDVLMTAIRFIKANSG----TCRFTVC 358
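The percentages in the HSP header lines above are the reported match counts divided by the alignment length, which includes gap columns (hence 316 columns for a 314-residue query aligned with two gaps). A minimal sketch of that arithmetic follows; the `percent` helper is a hypothetical name used only for illustration.

```python
def percent(matches: int, alignment_length: int) -> str:
    """Format matches / alignment_length as a percentage with two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100 * matches / alignment_length:.2f}"


# Counts taken from the HSP headers above.
print(percent(314, 314))  # KAG6571035.1 identity
print(percent(305, 316))  # XP_023513005.1 identity
print(percent(167, 353))  # AT2G16630.1 identity
```

These calls reproduce the 100.00, 96.52 and 47.31 figures shown for the corresponding matches.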
The following BLAST results are available for this feature:
NCBI nr
Match Name | E-value | Identity (%) | Description
KAG6571035.1 | 1.89e-233 | 100.00 | hypothetical protein SDJN03_29950, partial [Cucurbita argyrosperma subsp. sorori...
XP_023513005.1 | 2.01e-224 | 96.52 | uncharacterized protein LOC111777579 [Cucurbita pepo subsp. pepo]
XP_022985735.1 | 2.74e-222 | 95.57 | uncharacterized protein LOC111483701 [Cucurbita maxima]
XP_022944208.1 | 1.16e-220 | 96.84 | uncharacterized protein LOC111448725 [Cucurbita moschata] >KAG7010866.1 hypothet...
XP_038900829.1 | 2.34e-193 | 82.10 | uncharacterized protein LOC120087893 [Benincasa hispida]
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1JC51 | 1.33e-222 | 95.57 | uncharacterized protein LOC111483701 OS=Cucurbita maxima OX=3661 GN=LOC111483701...
A0A6J1FXU0 | 5.64e-221 | 96.84 | uncharacterized protein LOC111448725 OS=Cucurbita moschata OX=3662 GN=LOC1114487...
A0A6J1CWR4 | 1.76e-171 | 76.43 | uncharacterized protein LOC111015504 OS=Momordica charantia OX=3673 GN=LOC111015...
A0A5D3BLG6 | 4.42e-155 | 70.31 | Protodermal factor 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold123...
A0A1S3C647 | 4.42e-155 | 70.31 | uncharacterized protein LOC103497304 OS=Cucumis melo OX=3656 GN=LOC103497304 PE=...
TAIR 10
Match Name | E-value | Identity (%) | Description
AT2G16630.1 | 1.4e-80 | 47.31 | Pollen Ole e 1 allergen and extensin family protein