Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTATCCTCTGTCCAAGCTCAAGCTTTGTTTTTTAGCCGCATCTCTTCTTTGGGCCACCGCGAATGGATACGCCGTCGTCACCGGCACCGTCTTTTGCGATCAATGCAAAGATGGTCAAATATCTATGTTCGATTATCCCTTGAACGGTATGATTGAGTTTATCCTAACTCATAACTTAGACATCTTGTTGAAATGACTTGAATCTGTTATCTGATGCTTTGAATGATTAAGCACGAGATCTCGCCATATATGAACCCTCTAACCCTAGAAACACCTCTGCAATCTGTAGGCTGACACAACGTTAATGAACCATGTAAATTTCTATGTCGATCTTTATTGTTTACATTTTCTTGCTTTATTTATTAGTCGATTTCTTAACATTTATTAATTATTAAAAGACTTGTTTTATTGATGAGGTTTAGTTAGAGATTAAAATATTACGAATCGTTATAAAGATTCGTAAAAAAGCAAATACAATTTTTAAAAATCAATCGCGGTTAAATTATTTAGATTATACTTTATAGTAGTGGTTGAGATGTTGTACTTAGTAGAAAGTTTATTTGTTGCAGGAGTTAAAGTGATGGTGGCTTGTGGGGATGGCAATGGGGGAGTTACTTATTTGAGAGAGGAGACTACAAATTTGTTTGGAAGCTTTACGATGAGATTCGACGGAACACCGGATCTTAGCGGTTGTTATGCGGCAGTTGGTGGAACCGTACAGGGAACGACGACGGACTGTGGCGGAGCTGGCGGTCCGCCAAAAAGTCTTAGGTTGATGTTTAGGTTGTTTGACATGGAAATGTATGTGGTTGATTCGTTGGTTTCTCAACCTGCTAAACCAATGTCGTTCTGTTCGGCATCGGTTAATCCGGTCCCGGTCCCTGCACCCGTTATAACAACGCCACCATCTCCATCTCTGCCATCGTCTCCGCCTCCTCTGCCTCCTTCGTTTAAGTTTCCGCCGCTTCCTCAACTGCCGCCACTTCCGCCACTGCCTCATCTGCCGCCTGGGCCATTCTTGGAAGCTTCTGCTTGCCAACATGAGTAAGTGAAGTACATTGGCTGTTAATTTATTTTCTAATTATACTATTATTTGTTCGTTAAAGTATGTGACATTTTTGTTAACTTCATTTCCATTTTGATAGTTCTAAAAGTTTTGTTTTAAGTTTTAACTATATTTCGCCATAAATGAAGAACAATTAATCAGGTGTTAATTTTTAAGGCAAATGGTAAAATTCACCTCTAAATTGTAAGTTGCAATTATACCTTCAAATTTTTAATAGTAAAAATCGATCCCACAAACTTATACATTTGTTAAATTGGACCCTTAAATTTACATAATTTTAGAAATTATAAAAATTGTATAAATTTGAGAGTTCAATTTTTACGGGTGAAAGTTTGAAGGTGTAATTGCAACTTCCACCATGCTTCAAGGGTAAATTTTGTAATTTTTCCTAGTTTTTAAAATAAATATAAGATTTCATCTCAAAATCGATTGATATCTAACTAAGATGTCATGAATTGGAATCCCTACCCCAAATGATCGTATTAAAAGAAAATCGTCTGACAATTTTTATCCCATGGTTATTAAATTTTTACCATGTTTTTATTTCCAGGAATTGGACAAACCCAGACTACAGATGCTACTGGAGGGCAGTGAACCCAGACACAAAAGTAGCCGTTATTTTTGGGCTAGTAGCAGCCAACCGATACGGAACGGACCTGACCCTATGGAACGGCCTACAAGGCCGAGGGGACCCTCATAGGACCCTTTTAAGGGAAGCCATAACGGCGTTCCTTAATTCCTACAATTCCCTTAATTTCCCCTACCCTACACTTTCTGTGGTCCAACGCTTTAATTGGGCCTTATTGGGCTCTCCACGGGCCGTCCTCCTCACTGCCCTTCGTTTCAAACGGGCCAACTCTGGTTATGGCCACGTCACCTGCAAATTCGACCCTTGCCAATAA
mRNA sequence
ATGTATCCTCTGTCCAAGCTCAAGCTTTGTTTTTTAGCCGCATCTCTTCTTTGGGCCACCGCGAATGGATACGCCGTCGTCACCGGCACCGTCTTTTGCGATCAATGCAAAGATGGTCAAATATCTATGTTCGATTATCCCTTGAACGGAGTTAAAGTGATGGTGGCTTGTGGGGATGGCAATGGGGGAGTTACTTATTTGAGAGAGGAGACTACAAATTTGTTTGGAAGCTTTACGATGAGATTCGACGGAACACCGGATCTTAGCGGTTGTTATGCGGCAGTTGGTGGAACCGTACAGGGAACGACGACGGACTGTGGCGGAGCTGGCGGTCCGCCAAAAAGTCTTAGGTTGATGTTTAGGTTGTTTGACATGGAAATGTATGTGGTTGATTCGTTGGTTTCTCAACCTGCTAAACCAATGTCGTTCTGTTCGGCATCGGTTAATCCGGTCCCGGTCCCTGCACCCGTTATAACAACGCCACCATCTCCATCTCTGCCATCGTCTCCGCCTCCTCTGCCTCCTTCGTTTAAGTTTCCGCCGCTTCCTCAACTGCCGCCACTTCCGCCACTGCCTCATCTGCCGCCTGGGCCATTCTTGGAAGCTTCTGCTTGCCAACATGAGAATTGGACAAACCCAGACTACAGATGCTACTGGAGGGCAGTGAACCCAGACACAAAAGTAGCCGTTATTTTTGGGCTAGTAGCAGCCAACCGATACGGAACGGACCTGACCCTATGGAACGGCCTACAAGGCCGAGGGGACCCTCATAGGACCCTTTTAAGGGAAGCCATAACGGCGTTCCTTAATTCCTACAATTCCCTTAATTTCCCCTACCCTACACTTTCTGTGGTCCAACGCTTTAATTGGGCCTTATTGGGCTCTCCACGGGCCGTCCTCCTCACTGCCCTTCGTTTCAAACGGGCCAACTCTGGTTATGGCCACGTCACCTGCAAATTCGACCCTTGCCAATAA
Coding sequence (CDS)
ATGTATCCTCTGTCCAAGCTCAAGCTTTGTTTTTTAGCCGCATCTCTTCTTTGGGCCACCGCGAATGGATACGCCGTCGTCACCGGCACCGTCTTTTGCGATCAATGCAAAGATGGTCAAATATCTATGTTCGATTATCCCTTGAACGGAGTTAAAGTGATGGTGGCTTGTGGGGATGGCAATGGGGGAGTTACTTATTTGAGAGAGGAGACTACAAATTTGTTTGGAAGCTTTACGATGAGATTCGACGGAACACCGGATCTTAGCGGTTGTTATGCGGCAGTTGGTGGAACCGTACAGGGAACGACGACGGACTGTGGCGGAGCTGGCGGTCCGCCAAAAAGTCTTAGGTTGATGTTTAGGTTGTTTGACATGGAAATGTATGTGGTTGATTCGTTGGTTTCTCAACCTGCTAAACCAATGTCGTTCTGTTCGGCATCGGTTAATCCGGTCCCGGTCCCTGCACCCGTTATAACAACGCCACCATCTCCATCTCTGCCATCGTCTCCGCCTCCTCTGCCTCCTTCGTTTAAGTTTCCGCCGCTTCCTCAACTGCCGCCACTTCCGCCACTGCCTCATCTGCCGCCTGGGCCATTCTTGGAAGCTTCTGCTTGCCAACATGAGAATTGGACAAACCCAGACTACAGATGCTACTGGAGGGCAGTGAACCCAGACACAAAAGTAGCCGTTATTTTTGGGCTAGTAGCAGCCAACCGATACGGAACGGACCTGACCCTATGGAACGGCCTACAAGGCCGAGGGGACCCTCATAGGACCCTTTTAAGGGAAGCCATAACGGCGTTCCTTAATTCCTACAATTCCCTTAATTTCCCCTACCCTACACTTTCTGTGGTCCAACGCTTTAATTGGGCCTTATTGGGCTCTCCACGGGCCGTCCTCCTCACTGCCCTTCGTTTCAAACGGGCCAACTCTGGTTATGGCCACGTCACCTGCAAATTCGACCCTTGCCAATAA
Protein sequence
MYPLSKLKLCFLAASLLWATANGYAVVTGTVFCDQCKDGQISMFDYPLNGVKVMVACGDGNGGVTYLREETTNLFGSFTMRFDGTPDLSGCYAAVGGTVQGTTTDCGGAGGPPKSLRLMFRLFDMEMYVVDSLVSQPAKPMSFCSASVNPVPVPAPVITTPPSPSLPSSPPPLPPSFKFPPLPQLPPLPPLPHLPPGPFLEASACQHENWTNPDYRCYWRAVNPDTKVAVIFGLVAANRYGTDLTLWNGLQGRGDPHRTLLREAITAFLNSYNSLNFPYPTLSVVQRFNWALLGSPRAVLLTALRFKRANSGYGHVTCKFDPCQ
Homology
BLAST of HG10000562 vs. NCBI nr
Match:
XP_038900829.1 (uncharacterized protein LOC120087893 [Benincasa hispida])
HSP 1 Score: 617.5 bits (1591), Expect = 6.7e-173
Identity = 298/324 (91.98%), Postives = 309/324 (95.37%), Query Frame = 0
Query: 1 MYPLSKLKLCFLAASLLWATANGYAVVTGTVFCDQCKDGQISMFDYPLNGVKVMVACGDG 60
M PLSKLKLCFL A+LL ATA+GYAVVTGTVFCDQCKDG ISMFDYP+NGVKV VACGDG
Sbjct: 1 MDPLSKLKLCFLTATLLLATAHGYAVVTGTVFCDQCKDGHISMFDYPMNGVKVKVACGDG 60
Query: 61 NGGVTYLREETTNLFGSFTMRFDGTPDLSGCYAAVGGTVQGTTTDCGGAGGPPKSLRLMF 120
NGGVTY+REETTNLFGSFTMRFDGTPDLSGCYA +GGT QGTT DCGGAGGPPKSLRLMF
Sbjct: 61 NGGVTYMREETTNLFGSFTMRFDGTPDLSGCYATIGGTEQGTTMDCGGAGGPPKSLRLMF 120
Query: 121 RLFDMEMYVVDSLVSQPAKPMSFCSASVNPVPVPAPVITTPPSPSLPSSPPPLPPSFKFP 180
RLFDMEMYVVDSLV+QPAKPMSFCSASVNPVPVPAPVIT PPSPSLP SPPPLPPSFK P
Sbjct: 121 RLFDMEMYVVDSLVTQPAKPMSFCSASVNPVPVPAPVITKPPSPSLPLSPPPLPPSFKLP 180
Query: 181 PLPQLPPLPPLPHLPPGPFLEASACQHENWTNPDYRCYWRAVNPDTKVAVIFGLVAANRY 240
PLPQLPPLPPLP LPP PFLEASACQH+NWTNPDYRCYWRAVNPDTKV VIFGL+AAN+Y
Sbjct: 181 PLPQLPPLPPLPKLPPVPFLEASACQHDNWTNPDYRCYWRAVNPDTKVGVIFGLLAANQY 240
Query: 241 GTDLTLWNGLQGRGDPHRTLLREAITAFLNSYNSLNFPYPTLSVVQRFNWALLGSPRAVL 300
GTDLTLWNGLQGRGDP+RTLLREAITAFLNSYNSLNFPYPT+SVVQR NWALLGSPRAVL
Sbjct: 241 GTDLTLWNGLQGRGDPYRTLLREAITAFLNSYNSLNFPYPTISVVQRLNWALLGSPRAVL 300
Query: 301 LTALRFKRANSGYGHVTCKFDPCQ 325
LTALRFKRANSGYGHVTCKFDPCQ
Sbjct: 301 LTALRFKRANSGYGHVTCKFDPCQ 324
BLAST of HG10000562 vs. NCBI nr
Match:
XP_022985735.1 (uncharacterized protein LOC111483701 [Cucurbita maxima])
HSP 1 Score: 557.0 bits (1434), Expect = 1.1e-154
Identity = 272/324 (83.95%), Postives = 288/324 (88.89%), Query Frame = 0
Query: 1 MYPLSKLKLCFLAASLLWATANGYAVVTGTVFCDQCKDGQISMFDYPLNGVKVMVACGDG 60
M LSKL LCFLAA+LL ATA+GYAVVTGTVFCDQCKDGQISMFDYP+NGVKVMVACGDG
Sbjct: 1 MDSLSKLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPMNGVKVMVACGDG 60
Query: 61 NGGVTYLREETTNLFGSFTMRFDGTPDLSGCYAAVGGTVQGTTTDCGGAGGPPKSLRLMF 120
NGGVTYLREETTNLFGSFTMRFDG PDLSGCYA VGGT +G TTDCGGAG PP+SLRLMF
Sbjct: 61 NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRLMF 120
Query: 121 RLFDMEMYVVDSLVSQPAKPMSFCSASVNPVPVPAPVITTPPSPSLPSSPPPLPPSFKFP 180
RLF MEMYVVDSL+SQP +PMSFCSASV+PVP PAPV+ P PP LPP K P
Sbjct: 121 RLFSMEMYVVDSLLSQPTQPMSFCSASVHPVPAPAPVVMPP--------PPNLPPPLKMP 180
Query: 181 PLPQLPPLPPLPHLPPGPFLEASACQHENWTNPDYRCYWRAVNPDTKVAVIFGLVAANRY 240
PLPQLPPLPPLP LPP PFLE SACQHENWTNPDY+CYWRAVNPDTKV V+FGL+AANRY
Sbjct: 181 PLPQLPPLPPLPQLPPMPFLEVSACQHENWTNPDYKCYWRAVNPDTKVGVVFGLLAANRY 240
Query: 241 GTDLTLWNGLQGRGDPHRTLLREAITAFLNSYNSLNFPYPTLSVVQRFNWALLGSPRAVL 300
GTDLTLWNGLQGRGDP+RTLLREAITAFLNSYNS++FPYPTLSVV+ NWALLGS RAVL
Sbjct: 241 GTDLTLWNGLQGRGDPYRTLLREAITAFLNSYNSVHFPYPTLSVVEGLNWALLGSQRAVL 300
Query: 301 LTALRFKRANSGYGHVTCKFDPCQ 325
TALRFKRANSG HVTCKFDPCQ
Sbjct: 301 HTALRFKRANSGNSHVTCKFDPCQ 316
BLAST of HG10000562 vs. NCBI nr
Match:
XP_023513005.1 (uncharacterized protein LOC111777579 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 554.3 bits (1427), Expect = 7.0e-154
Identity = 270/324 (83.33%), Postives = 290/324 (89.51%), Query Frame = 0
Query: 1 MYPLSKLKLCFLAASLLWATANGYAVVTGTVFCDQCKDGQISMFDYPLNGVKVMVACGDG 60
M LSKL LCFLAA+LL ATA+GYAVVTGTVFCDQCKDGQISMFDYP+NGVKVMVACG+G
Sbjct: 1 MDSLSKLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPINGVKVMVACGNG 60
Query: 61 NGGVTYLREETTNLFGSFTMRFDGTPDLSGCYAAVGGTVQGTTTDCGGAGGPPKSLRLMF 120
NGGVTYLREETTNLFGSFTMRFDG PDLSGCYA VGG +G TTDCGGAG PP+SLRLMF
Sbjct: 61 NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGIGEGATTDCGGAGSPPRSLRLMF 120
Query: 121 RLFDMEMYVVDSLVSQPAKPMSFCSASVNPVPVPAPVITTPPSPSLPSSPPPLPPSFKFP 180
RLF MEMYVVDSL+SQPA+PMSFCSASV+PVPVPAPV+ P PP +PP FK P
Sbjct: 121 RLFSMEMYVVDSLLSQPAQPMSFCSASVHPVPVPAPVVMPP--------PPNVPPPFKMP 180
Query: 181 PLPQLPPLPPLPHLPPGPFLEASACQHENWTNPDYRCYWRAVNPDTKVAVIFGLVAANRY 240
PLPQLPP+PPLP LPP PFLEASACQHENWTNPDY+CYWRAVNPD KV V+FG++AANRY
Sbjct: 181 PLPQLPPIPPLPQLPPMPFLEASACQHENWTNPDYKCYWRAVNPDMKVGVVFGVLAANRY 240
Query: 241 GTDLTLWNGLQGRGDPHRTLLREAITAFLNSYNSLNFPYPTLSVVQRFNWALLGSPRAVL 300
GTDLTLWNGLQGRGDP+RTLLREAITAFLNSYNS++FPYPTLSVV+ NWALLGS RAVL
Sbjct: 241 GTDLTLWNGLQGRGDPYRTLLREAITAFLNSYNSVHFPYPTLSVVEGLNWALLGSQRAVL 300
Query: 301 LTALRFKRANSGYGHVTCKFDPCQ 325
TALRFKRANSG HVTCKFDPCQ
Sbjct: 301 HTALRFKRANSGNSHVTCKFDPCQ 316
BLAST of HG10000562 vs. NCBI nr
Match:
KAG6571035.1 (hypothetical protein SDJN03_29950, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 547.0 bits (1408), Expect = 1.1e-151
Identity = 269/324 (83.02%), Postives = 287/324 (88.58%), Query Frame = 0
Query: 1 MYPLSKLKLCFLAASLLWATANGYAVVTGTVFCDQCKDGQISMFDYPLNGVKVMVACGDG 60
M L+ L LCFLAA+LL ATA+GYAVVTGTVFCDQCKDGQISMFDYP+NGVKVMVACGDG
Sbjct: 1 MDSLTNLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPINGVKVMVACGDG 60
Query: 61 NGGVTYLREETTNLFGSFTMRFDGTPDLSGCYAAVGGTVQGTTTDCGGAGGPPKSLRLMF 120
NGGVTYLREETTNLFGSFTMRFDG PDLSGCYA VGGT +G TTDCGGAG PP+SLRLMF
Sbjct: 61 NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRLMF 120
Query: 121 RLFDMEMYVVDSLVSQPAKPMSFCSASVNPVPVPAPVITTPPSPSLPSSPPPLPPSFKFP 180
RLF MEMYVVDSL+SQPA+PMSFCSASV+PVPVP + PP PP LPP FK P
Sbjct: 121 RLFSMEMYVVDSLLSQPAQPMSFCSASVHPVPVP---VVMPP-------PPNLPPPFKMP 180
Query: 181 PLPQLPPLPPLPHLPPGPFLEASACQHENWTNPDYRCYWRAVNPDTKVAVIFGLVAANRY 240
PLPQLPPLPPLP LPP PFLEASACQHENWTNP Y+CYWRAVNPD KV V+FG++AANRY
Sbjct: 181 PLPQLPPLPPLPQLPPMPFLEASACQHENWTNPYYKCYWRAVNPDMKVGVVFGVLAANRY 240
Query: 241 GTDLTLWNGLQGRGDPHRTLLREAITAFLNSYNSLNFPYPTLSVVQRFNWALLGSPRAVL 300
GTDLTLWNGLQGRGDP+RTLLREAITAFLNSYNS++FPYP LSVV+ NWALLGS RAVL
Sbjct: 241 GTDLTLWNGLQGRGDPYRTLLREAITAFLNSYNSVHFPYPALSVVEGLNWALLGSQRAVL 300
Query: 301 LTALRFKRANSGYGHVTCKFDPCQ 325
TALRFKRANSG GHVTCKFDPCQ
Sbjct: 301 HTALRFKRANSGNGHVTCKFDPCQ 314
BLAST of HG10000562 vs. NCBI nr
Match:
XP_022944208.1 (uncharacterized protein LOC111448725 [Cucurbita moschata] >KAG7010866.1 hypothetical protein SDJN02_27664 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 535.8 bits (1379), Expect = 2.6e-148
Identity = 266/324 (82.10%), Postives = 284/324 (87.65%), Query Frame = 0
Query: 1 MYPLSKLKLCFLAASLLWATANGYAVVTGTVFCDQCKDGQISMFDYPLNGVKVMVACGDG 60
M L+ L LCFLAA+LL ATA+GYAVVTGTVFCDQCKDGQISMFDYP+NGVKVMVACGDG
Sbjct: 1 MDSLTNLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPINGVKVMVACGDG 60
Query: 61 NGGVTYLREETTNLFGSFTMRFDGTPDLSGCYAAVGGTVQGTTTDCGGAGGPPKSLRLMF 120
NGGVTYLREETTNLFGSFTMRFDG PDLSGCYA VGGT +G TTDCGGAG PP+SLRLMF
Sbjct: 61 NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRLMF 120
Query: 121 RLFDMEMYVVDSLVSQPAKPMSFCSASVNPVPVPAPVITTPPSPSLPSSPPPLPPSFKFP 180
RLF MEMYVVDSL+SQPA+ MSFCSASV+PVPVPAPV+ SPP LPP K
Sbjct: 121 RLFSMEMYVVDSLLSQPAQSMSFCSASVHPVPVPAPVVM--------PSPPNLPPPSKML 180
Query: 181 PLPQLPPLPPLPHLPPGPFLEASACQHENWTNPDYRCYWRAVNPDTKVAVIFGLVAANRY 240
PLPQLPPLPP LPP PFLEASACQHENWTNP Y+CYWRAVNPD KV V+FG++AANRY
Sbjct: 181 PLPQLPPLPP---LPPMPFLEASACQHENWTNPRYKCYWRAVNPDMKVGVVFGVLAANRY 240
Query: 241 GTDLTLWNGLQGRGDPHRTLLREAITAFLNSYNSLNFPYPTLSVVQRFNWALLGSPRAVL 300
GTDLTLWNGLQGRGDP+RTLLREAITAFLNSYNS++FPYP LSVV+ NWALLGS RAVL
Sbjct: 241 GTDLTLWNGLQGRGDPYRTLLREAITAFLNSYNSVHFPYPALSVVEGLNWALLGSQRAVL 300
Query: 301 LTALRFKRANSGYGHVTCKFDPCQ 325
TALRFKRANSG GHVTCKFDPCQ
Sbjct: 301 HTALRFKRANSGNGHVTCKFDPCQ 313
BLAST of HG10000562 vs. ExPASy TrEMBL
Match:
A0A6J1JC51 (uncharacterized protein LOC111483701 OS=Cucurbita maxima OX=3661 GN=LOC111483701 PE=4 SV=1)
HSP 1 Score: 557.0 bits (1434), Expect = 5.2e-155
Identity = 272/324 (83.95%), Postives = 288/324 (88.89%), Query Frame = 0
Query: 1 MYPLSKLKLCFLAASLLWATANGYAVVTGTVFCDQCKDGQISMFDYPLNGVKVMVACGDG 60
M LSKL LCFLAA+LL ATA+GYAVVTGTVFCDQCKDGQISMFDYP+NGVKVMVACGDG
Sbjct: 1 MDSLSKLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPMNGVKVMVACGDG 60
Query: 61 NGGVTYLREETTNLFGSFTMRFDGTPDLSGCYAAVGGTVQGTTTDCGGAGGPPKSLRLMF 120
NGGVTYLREETTNLFGSFTMRFDG PDLSGCYA VGGT +G TTDCGGAG PP+SLRLMF
Sbjct: 61 NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRLMF 120
Query: 121 RLFDMEMYVVDSLVSQPAKPMSFCSASVNPVPVPAPVITTPPSPSLPSSPPPLPPSFKFP 180
RLF MEMYVVDSL+SQP +PMSFCSASV+PVP PAPV+ P PP LPP K P
Sbjct: 121 RLFSMEMYVVDSLLSQPTQPMSFCSASVHPVPAPAPVVMPP--------PPNLPPPLKMP 180
Query: 181 PLPQLPPLPPLPHLPPGPFLEASACQHENWTNPDYRCYWRAVNPDTKVAVIFGLVAANRY 240
PLPQLPPLPPLP LPP PFLE SACQHENWTNPDY+CYWRAVNPDTKV V+FGL+AANRY
Sbjct: 181 PLPQLPPLPPLPQLPPMPFLEVSACQHENWTNPDYKCYWRAVNPDTKVGVVFGLLAANRY 240
Query: 241 GTDLTLWNGLQGRGDPHRTLLREAITAFLNSYNSLNFPYPTLSVVQRFNWALLGSPRAVL 300
GTDLTLWNGLQGRGDP+RTLLREAITAFLNSYNS++FPYPTLSVV+ NWALLGS RAVL
Sbjct: 241 GTDLTLWNGLQGRGDPYRTLLREAITAFLNSYNSVHFPYPTLSVVEGLNWALLGSQRAVL 300
Query: 301 LTALRFKRANSGYGHVTCKFDPCQ 325
TALRFKRANSG HVTCKFDPCQ
Sbjct: 301 HTALRFKRANSGNSHVTCKFDPCQ 316
BLAST of HG10000562 vs. ExPASy TrEMBL
Match:
A0A6J1FXU0 (uncharacterized protein LOC111448725 OS=Cucurbita moschata OX=3662 GN=LOC111448725 PE=4 SV=1)
HSP 1 Score: 535.8 bits (1379), Expect = 1.2e-148
Identity = 266/324 (82.10%), Postives = 284/324 (87.65%), Query Frame = 0
Query: 1 MYPLSKLKLCFLAASLLWATANGYAVVTGTVFCDQCKDGQISMFDYPLNGVKVMVACGDG 60
M L+ L LCFLAA+LL ATA+GYAVVTGTVFCDQCKDGQISMFDYP+NGVKVMVACGDG
Sbjct: 1 MDSLTNLHLCFLAAALLSATAHGYAVVTGTVFCDQCKDGQISMFDYPINGVKVMVACGDG 60
Query: 61 NGGVTYLREETTNLFGSFTMRFDGTPDLSGCYAAVGGTVQGTTTDCGGAGGPPKSLRLMF 120
NGGVTYLREETTNLFGSFTMRFDG PDLSGCYA VGGT +G TTDCGGAG PP+SLRLMF
Sbjct: 61 NGGVTYLREETTNLFGSFTMRFDGMPDLSGCYATVGGTGEGATTDCGGAGSPPRSLRLMF 120
Query: 121 RLFDMEMYVVDSLVSQPAKPMSFCSASVNPVPVPAPVITTPPSPSLPSSPPPLPPSFKFP 180
RLF MEMYVVDSL+SQPA+ MSFCSASV+PVPVPAPV+ SPP LPP K
Sbjct: 121 RLFSMEMYVVDSLLSQPAQSMSFCSASVHPVPVPAPVVM--------PSPPNLPPPSKML 180
Query: 181 PLPQLPPLPPLPHLPPGPFLEASACQHENWTNPDYRCYWRAVNPDTKVAVIFGLVAANRY 240
PLPQLPPLPP LPP PFLEASACQHENWTNP Y+CYWRAVNPD KV V+FG++AANRY
Sbjct: 181 PLPQLPPLPP---LPPMPFLEASACQHENWTNPRYKCYWRAVNPDMKVGVVFGVLAANRY 240
Query: 241 GTDLTLWNGLQGRGDPHRTLLREAITAFLNSYNSLNFPYPTLSVVQRFNWALLGSPRAVL 300
GTDLTLWNGLQGRGDP+RTLLREAITAFLNSYNS++FPYP LSVV+ NWALLGS RAVL
Sbjct: 241 GTDLTLWNGLQGRGDPYRTLLREAITAFLNSYNSVHFPYPALSVVEGLNWALLGSQRAVL 300
Query: 301 LTALRFKRANSGYGHVTCKFDPCQ 325
TALRFKRANSG GHVTCKFDPCQ
Sbjct: 301 HTALRFKRANSGNGHVTCKFDPCQ 313
BLAST of HG10000562 vs. ExPASy TrEMBL
Match:
A0A6J1CWR4 (uncharacterized protein LOC111015504 OS=Momordica charantia OX=3673 GN=LOC111015504 PE=4 SV=1)
HSP 1 Score: 511.9 bits (1317), Expect = 1.9e-141
Identity = 260/324 (80.25%), Postives = 277/324 (85.49%), Query Frame = 0
Query: 1 MYPLSKLKLCFLAASLLWATANGYAVVTGTVFCDQCKDGQISMFDYPLNGVKVMVACGDG 60
M PL KL LCFLAASL +A A GYAVVTGTVFCDQCKDGQIS+FDYP+NG KVMVACGDG
Sbjct: 1 MDPLKKLNLCFLAASLFFAAAQGYAVVTGTVFCDQCKDGQISLFDYPINGAKVMVACGDG 60
Query: 61 NGGVTYLREETTNLFGSFTMRFDGTPDLSGCYAAVGGTVQGTTTDCGGAGGPPKSLRLMF 120
NGGVTY REETTNLFGSFTMRFDGTPDLSGCYAAVGGT G CGGA GP KSLRLMF
Sbjct: 61 NGGVTYSREETTNLFGSFTMRFDGTPDLSGCYAAVGGTATG----CGGAVGPAKSLRLMF 120
Query: 121 RLFDMEMYVVDSLVSQPAKPMSFCSASVNPVPVPAPVITTPPSPSLPSSPPPLPPSFKFP 180
R+FDMEMYVVDSL+SQPA PM FCS SVNPVP P +T PP PSSPPP P
Sbjct: 121 RMFDMEMYVVDSLISQPALPMPFCSPSVNPVPAP---VTVPP----PSSPPP-------P 180
Query: 181 PLPQLPPLPPLPHLPPGPFLEASACQHENWTNPDYRCYWRAVNPDTKVAVIFGLVAANRY 240
PL +LP LPP LPP PFLEASACQHENWTNPDYRCYWRAVNP+TKV VIFG VAANRY
Sbjct: 181 PL-RLPLLPP---LPPVPFLEASACQHENWTNPDYRCYWRAVNPETKVGVIFGPVAANRY 240
Query: 241 GTDLTLWNGLQGRGDPHRTLLREAITAFLNSYNSLNFPYPTLSVVQRFNWALLGSPRAVL 300
GT++TLWNGLQGRGDP+RTLLREAITAFLNSYNSL++PYPT+SV+QR NWALLGSPRAVL
Sbjct: 241 GTEVTLWNGLQGRGDPYRTLLREAITAFLNSYNSLHYPYPTISVLQRMNWALLGSPRAVL 300
Query: 301 LTALRFKRANSGYGHVTCKFDPCQ 325
+TALRFKRANSG H+TCKF PCQ
Sbjct: 301 ITALRFKRANSGSPHLTCKFHPCQ 302
BLAST of HG10000562 vs. ExPASy TrEMBL
Match:
A0A5D3BLG6 (Protodermal factor 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold123G001410 PE=4 SV=1)
HSP 1 Score: 453.8 bits (1166), Expect = 6.2e-124
Identity = 231/330 (70.00%), Postives = 259/330 (78.48%), Query Frame = 0
Query: 1 MYPLSKLKLCFLAASLLWATANGYAVVTGTVFCDQCKDGQISM--FDYPLNGVKVMVACG 60
M PL K KLCFLA SL ATA GY VV+G VFCD+CKDGQ+S+ FDYP+NGVKV +ACG
Sbjct: 1 MDPLFKFKLCFLALSLFSATAYGYTVVSGFVFCDKCKDGQVSIFDFDYPINGVKVKIACG 60
Query: 61 DGNGGVTYLREETTNLFGSFTMRFDGTPDLSGCYAAVGGTVQGTTTDCGGAGGPPKSLRL 120
DG GGVT LREE TN FG +TM F+GTPDLSGC A V G QG T+C GGPP+SL+L
Sbjct: 61 DGKGGVTDLREEPTNFFGGYTMSFEGTPDLSGCTATVTGPAQGAMTNCSAGGGPPRSLKL 120
Query: 121 MFRLFDMEMYVVDSLVSQPAKPMSFCSASVNPVPVPAPVITTPPSPSLPSSPPPLPPSFK 180
+FRL D+EMY VD LVSQP +PMSFCS+ PVP P P P PSLPS P K
Sbjct: 121 LFRLLDLEMYGVDPLVSQPDQPMSFCSSRSAPVPGPKP----PSPPSLPSLP-------K 180
Query: 181 FPPLPQLPPLPPLP---HLPPGPFLEASACQHENWTNPDYRCYWRAVNPDTKVAVIFGLV 240
PPLP+LPPLPP P +P PFLEASACQHENWTNPDY+CYWRAVNPDTKVAVIFG +
Sbjct: 181 LPPLPKLPPLPPFPPMRQMPHTPFLEASACQHENWTNPDYKCYWRAVNPDTKVAVIFGAI 240
Query: 241 AANRYGTDLTLWNGLQGRGDPHRTLLREAITAFLNSYNSLNFPYPTLSVVQRFNWALL-G 300
AA RYGTD+TLW GLQGRGDP++TLLREAITAFLNSY+SL+FPY +LSVVQ FN AL+ G
Sbjct: 241 AAERYGTDMTLWKGLQGRGDPYKTLLREAITAFLNSYSSLHFPYHSLSVVQHFNLALMDG 300
Query: 301 SPRAVLLTALRFKRANSGYGHVTCKFDPCQ 325
S R+VL ALRFK ANSG GHVTCKFDPC+
Sbjct: 301 SERSVLHAALRFKHANSGNGHVTCKFDPCK 319
BLAST of HG10000562 vs. ExPASy TrEMBL
Match:
A0A1S3C647 (uncharacterized protein LOC103497304 OS=Cucumis melo OX=3656 GN=LOC103497304 PE=4 SV=1)
HSP 1 Score: 453.8 bits (1166), Expect = 6.2e-124
Identity = 231/330 (70.00%), Postives = 259/330 (78.48%), Query Frame = 0
Query: 1 MYPLSKLKLCFLAASLLWATANGYAVVTGTVFCDQCKDGQISM--FDYPLNGVKVMVACG 60
M PL K KLCFLA SL ATA GY VV+G VFCD+CKDGQ+S+ FDYP+NGVKV +ACG
Sbjct: 1 MDPLFKFKLCFLALSLFSATAYGYTVVSGFVFCDKCKDGQVSIFDFDYPINGVKVKIACG 60
Query: 61 DGNGGVTYLREETTNLFGSFTMRFDGTPDLSGCYAAVGGTVQGTTTDCGGAGGPPKSLRL 120
DG GGVT LREE TN FG +TM F+GTPDLSGC A V G QG T+C GGPP+SL+L
Sbjct: 61 DGKGGVTDLREEPTNFFGGYTMSFEGTPDLSGCTATVTGPAQGAMTNCSAGGGPPRSLKL 120
Query: 121 MFRLFDMEMYVVDSLVSQPAKPMSFCSASVNPVPVPAPVITTPPSPSLPSSPPPLPPSFK 180
+FRL D+EMY VD LVSQP +PMSFCS+ PVP P P P PSLPS P K
Sbjct: 121 LFRLLDLEMYGVDPLVSQPDQPMSFCSSRSAPVPGPKP----PSPPSLPSLP-------K 180
Query: 181 FPPLPQLPPLPPLP---HLPPGPFLEASACQHENWTNPDYRCYWRAVNPDTKVAVIFGLV 240
PPLP+LPPLPP P +P PFLEASACQHENWTNPDY+CYWRAVNPDTKVAVIFG +
Sbjct: 181 LPPLPKLPPLPPFPPMRQMPHTPFLEASACQHENWTNPDYKCYWRAVNPDTKVAVIFGAI 240
Query: 241 AANRYGTDLTLWNGLQGRGDPHRTLLREAITAFLNSYNSLNFPYPTLSVVQRFNWALL-G 300
AA RYGTD+TLW GLQGRGDP++TLLREAITAFLNSY+SL+FPY +LSVVQ FN AL+ G
Sbjct: 241 AAERYGTDMTLWKGLQGRGDPYKTLLREAITAFLNSYSSLHFPYHSLSVVQHFNLALMDG 300
Query: 301 SPRAVLLTALRFKRANSGYGHVTCKFDPCQ 325
S R+VL ALRFK ANSG GHVTCKFDPC+
Sbjct: 301 SERSVLHAALRFKHANSGNGHVTCKFDPCK 319
BLAST of HG10000562 vs. TAIR 10
Match:
AT2G16630.1 (Pollen Ole e 1 allergen and extensin family protein )
HSP 1 Score: 305.8 bits (782), Expect = 4.0e-83
Identity = 173/354 (48.87%), Postives = 217/354 (61.30%), Query Frame = 0
Query: 9 LCFLAASLLWATAN---GYAVVTGTVFCDQCKDGQISMFDYPLNGVKVMVACGDGNGGVT 68
L F+A +L N GYA VTG+VFCDQCKDG+ S+FD+P++G+K+ V C D NG V
Sbjct: 9 LIFVAFFVLCLATNGVTGYATVTGSVFCDQCKDGERSLFDFPVSGIKISVTCADENGQVY 68
Query: 69 YLREETTNLFGSFTMRFDGTPDLSGCYAAVGGT-VQGTTTDCGGAGGPPKSLRLMFRLFD 128
REETTN G + MRFDGTPDLS CYA V VQ + C A GP + L+LMF F
Sbjct: 69 MSREETTNWLGGYVMRFDGTPDLSNCYAQVSDNGVQQDPSSCSIASGPAQKLKLMFSFFG 128
Query: 129 MEMYVVDSLVSQPAKPMSFC----SASVNPVP----VPAPVITTPPSPSLP--------- 188
+E + D+L++QP +P SFC +A V P P +P P + P P +P
Sbjct: 129 IETFAADALLAQPVQPSSFCPKPPTAPVMPPPQVPVMPPPQVPVKPHPKVPVISPDPPAT 188
Query: 189 ---------------SSPPPLPPSFKFPPLPQLP--PLPPLPHLPPGPFLEASACQHENW 248
+ PPPL P PP+ P LPPLP +PP PF+E SAC H+ W
Sbjct: 189 LPPPKVPVISPDPPTTLPPPLVPVINLPPVTSPPQFKLPPLPQIPPMPFVEPSACSHQLW 248
Query: 249 TNPDYRCYWRAVNPDTKVAVIFGLVAANRYGTDLTLWNGLQGRGDPHRTLLREAITAFLN 308
P+YRCYWRA+ PDTKVAV FGLVA YGTD+T+ L GRG+ ++TLLREA TA LN
Sbjct: 249 MKPEYRCYWRAIGPDTKVAVAFGLVAGRIYGTDMTVREALDGRGEAYKTLLREATTALLN 308
Query: 309 SYNSLNFPYPTLSVVQRFNWALLG-SPRAVLLTALRFKRANSGYGHVTCKFDPC 324
SYNSL FPY +++V+ N ALLG S VL+TA+RF +ANSG TC+F C
Sbjct: 309 SYNSLGFPYNSVAVITYTNLALLGNSEHDVLMTAIRFIKANSG----TCRFTVC 358
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038900829.1 | 6.7e-173 | 91.98 | uncharacterized protein LOC120087893 [Benincasa hispida] | [more] |
XP_022985735.1 | 1.1e-154 | 83.95 | uncharacterized protein LOC111483701 [Cucurbita maxima] | [more] |
XP_023513005.1 | 7.0e-154 | 83.33 | uncharacterized protein LOC111777579 [Cucurbita pepo subsp. pepo] | [more] |
KAG6571035.1 | 1.1e-151 | 83.02 | hypothetical protein SDJN03_29950, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_022944208.1 | 2.6e-148 | 82.10 | uncharacterized protein LOC111448725 [Cucurbita moschata] >KAG7010866.1 hypothet... | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1JC51 | 5.2e-155 | 83.95 | uncharacterized protein LOC111483701 OS=Cucurbita maxima OX=3661 GN=LOC111483701... | [more] |
A0A6J1FXU0 | 1.2e-148 | 82.10 | uncharacterized protein LOC111448725 OS=Cucurbita moschata OX=3662 GN=LOC1114487... | [more] |
A0A6J1CWR4 | 1.9e-141 | 80.25 | uncharacterized protein LOC111015504 OS=Momordica charantia OX=3673 GN=LOC111015... | [more] |
A0A5D3BLG6 | 6.2e-124 | 70.00 | Protodermal factor 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold123... | [more] |
A0A1S3C647 | 6.2e-124 | 70.00 | uncharacterized protein LOC103497304 OS=Cucumis melo OX=3656 GN=LOC103497304 PE=... | [more] |
Match Name | E-value | Identity | Description | |
AT2G16630.1 | 4.0e-83 | 48.87 | Pollen Ole e 1 allergen and extensin family protein | [more] |