Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, CDS, polypeptide, three_prime_UTR
CGTTTGAAAATGTCCATGGAGGAAGGAATATTGTTTACTAATTAATTTATTTTTTGATAAATAATTTTAAAACGGAGCCTCACCGGACCACGCCGTCTCTCCGGACGTAACGAGGAAGAAGGAAAGAGACTGTGCTCGCGTTTCCATATTTCGTTGATTTGACTTTTCCATCGACGACTTCTCCTTCCACGTACTTTCCTCGTTGAATTCAAAATCTTCAACAATGTGGCTTGTTTCCGCCATCTCGAGTTCATAGCGGTTCTGCTTGATTCACCCGTCGTAGAAAGGCGGCAGATCTTTCGCGGCGATGAATCGTGATTTGGTGCTCGTATTCGTGTTCTTTGTTCTCATTCTCGTCTCTCCTGGATCCGATGCTTCTCTCTCTGACCGTATTTGGAATCTTCATCTCCGCTTCTCGCTTTCGAAGGATTCTCCCGAGGTCTGTTTGAATTGGATTTGGATCTCCTCTTTTCATCTATTGATTTTAGTGTATCTTTTGGATTGTACTTTGAAACTTATATTTCTCCTTTTCGTTTTTTTTAGCGTATAGCTCCAGCTCCTGGTCCTAGCTCTGTTATCAATGGCAAATTGATTGGGGGCGTCCCGATAAGTTCTCCAACTCCTGCAATTCCTCCATTTCCCAGTTCAACTGATGGTTTTACCTTGGAGAAGTGCGACAGGAACAAGACTTGCCACGACCTTAAGAAAATGACTGCATGCCTTCAGTTCGCAGAACAAGGTAATTGCGTTTTTCTTCCTTTTTAGCTTCTATCATGAGTTGCTTTTCTCTGCAAATAAATTAAGCCTGTGATTTGAGAAGATTGCACCTCTTTTTAATATAGAACGAACACATTTCCCATGATGTTCTATGCGTATTGGTTACCGAAGTGGGATTTTTTGTTTGCGAATTTCTCTTGCTATATATCCCTTTTTTTTTAAAATCTTTCTCTTGCGAACTTAATATGTAGTTGTTATTTTGCTGATAATAGCGCCTCTTACTTGCAACGATGTTACTTTTAGTAACCCTGCCAATGAAGTCAGCTTTGGTTAGAACTAGAAGCTAGGGGATGTGAGGGGTTATTTTCCAATTAGCTTAAGTTTATGAGGGAAACATAAATAGTTACTATAGGAAGCATTTCGTGATAGTAGCAGGATGTCTGTAATGTACTCAAATACATTGAAGAGTTCTATCAGCTCAATACCTGTAACAACCTTGCTGATTCACAGTTCCAACAAGTTGCTTATGATCATGTATTGTGAACCAACAATTTTTCATTACCTAATTGTTATTGATTTGTACGTGGTTATTAGGATCAAAAGTGGTGGGGAAATTTTCCTGAGAGGGAACTCTATCGTCCCATTGTCAGTTAGGAATGCTTTTAAAGGCCCAAGTGATTGAATTGTCCGTGATACGAATATTTGCCTTATTCTATTAGGCTGTCAATGCTTGTATCGTGCAAAGTGTTCATAGGGGAAAGGGGATTATTAGCGAGTTCATTGGTTGGGGTGATAACAGTGATGTGTTGGGTTCCCCCTTCGAACCTTCAAGTATCAATAAAACACACTCAAGCAGCAATAGACAAGAACACTGTTCTATTGAGGAATTCCTGCAAGAGTTCAACTTTCTCAACACCGAAATTACAACAAAACAGAAATATTAAGTTGAAAAGACAAAGATAACTCCTATTCGAGAGAGTTGGTTCCTCTCCTCCCAGGCGACAATTTTCCCCAAAGTTCCACACCGCTGACCACTTTCCCCTTCAAAAACCATGGAATGTCCTTTTCCGTGTCACTATATGCCTACGATAGGAGGGGTAAGAATGGAAATCCAGAAGGGATAGATAGTTAGCGGGCAGGGCCTTATTTACCAATATCATAGCTATGTTGTGAAGCATAGTTTCCTGCCAAGGATTTCTGAAATTTGGAACGAATATCTGCTTAGAATAGTGGTTAAAAACTTCAAACCTTATAGATTGTATTGGAGGTTGGCAGGATAAAAG
GAAACCTTGGTTGGAAGGAATATTGGAATGTCCTCGCTGGCACTCGTTCCTTTATCCAATCGATGTGGGACCGCCCCCAAATCCACTCCCTTTGGGGCCCAGCATCCTTACTGGCACACCGCCTCGTGTCTACCCCCTTCGGGGAACAGCAAGAAGGCCGGCACATCGTCCGGTGACTGGCTCTGATACCATTTGTAACGACCCAGATCCACCGCTAGCAGATATTGTCCTCTTTGGGCTTTCCCTTTGGGCTTCCCCTCAAGGCTTTAAAACGCGTCTGCTAGGGGAAGGTTTCCACACCCTTATAAAGGGTGATTTGTTCTCCTCCCCAACCAATGTGGGACATTACAGAAATTCTGTTAGTTTTTCAGTTTTGGTTGGTTGTTGGTTTTGACCTGGGTTTTTGACCCATCTATTTGGCCTAGCTAGAACTTGGTCAAGTGCCAACCTTTTCACTGCAAGAGGATGGTTTCTGGTATTTGTATGTGGGCAGCTGAGCAACAGACGAGCACCTACAATCAACTGTTGTATGGTCTAGGTTAGGCATTCAACGAAAGGGAAAGTTATTTAATGCAAAAGTAATTTAATCCCATTTGTCCAATGTTTTTCACCAAGATTCCAAAATTCCAAGGTCTAGGGACAGGCTTTCTTGGATATATTAGATTTGTGATCGAACATCAATCTGGAGCCAATAACTTGAAAATGCGGGAAATAATCTTATTGCTAACTTTGAAAGGAGAAGGCATTGCTCTTGATTTGTTTGAAGACTACTGTATGCTCATGAGTCACGATGATGATTTTTTAATCATTTAGAAGTCTTGTAAGCAGCTTGATAATGACATTTCCACATATTTGGGAGTTTTTCATTCAAAATAACCAACCATGCTTGCCAAGAACTTTGCTCAGAAGCAGTTCATTAGTGACCTTCATTTCAACAAGATTTTTCTCATCTAGCTGCAAATGGGGATGTGAGTAATTTTGTGAGATGATGTGCAGTCTCACAAAATTACTTAATTTAGGTGATGTCTTTCGTGCGTTCTAATAAGCTAAACGGCTGGGAAGTGGGAGGTCAAGCGTGAAGGAAAAAATACACCATGACCATGAAGTGTTGGAGACAGAAACAAGTAGTTGCTAGCTTTGTATCCTAAAAGAAGTACTAGTAGTCTTAATGTGCTAGTTTGATCCTTTTATCTTATACCACTTAATCCTTTTACTTTAAACTGATTGTTTTCGTAATCTTGTTTCTCCATACTCAGAAAACCTCCTGACACTAATACTAATGTTCTTTTATACCAAGGTTCGTGTTTTCATTAACTGGTGCTTTTCCCCCCTTTAGCCCTTTTTAACATCACCTCCATGCCATACCTGTCTTTCAAACGTTGATGAGATGACTGTTTCAAACAATTTCCCCCTCGGGGAGTTAAAATTCAGTGTAACTTCAGTTAATGTTATCCTTCCGAAGCCAGTATTGCTAAAAGATTGTTTTCTCACAGCGATGGTGGAACAATATCTTCTGATCCAAAATGATGGAGAGACTTCTCTGAAAGTGAATGTTATAGTTTCCGATGCTAAATACAAGGAGGTACAAGTTCCTGAGCATCATGCTAAAAAGGTACTCGTTAACCTTTGCTGGATGTTCACGAGAATAGTTAACTTCTCTTTTAGTTAGGATAACAGTCAGGAATTTGCTACCCAAGGCAACACAATGGTTTCTGTAAATCTTGGTTTTTCGAACTCTAGAGGGTGTAACTAGAGATTCATTTATGGCTCCATATTCTATTTCAGGTTAATGTTTCAGACATTCCAGAAACTTCAACGATTATATTAGATGCTGGAAATGGGAAGTGTGTAATTCACGTAGGATCACCAACAAAAAACGGCAGCATTGTTAAGCAGACCTCTTCCTATGTAACCCATTTAAACCTCATATCCGGATCCTACCTACTATTTTCAATTATTTTGATCATTGGAGGTGTCTGGGCATGCTGCAAAATGAGAAC
CAAGGAACGTCATGCCAATGGAATCCCATATCAGGAGCTTGAATTGGCAGAGAACGACTCTTCTCCAACCAACGATTTGGAAGCAGCAGAAGGTTGGGATCAAGGCTGGGACGATGACTGGGACGAGTCGAAGCCTGCAAATAAATCCAGTTCTGACATGAAGGAAAATGGATCAAATGGTATTAACTCAAGAACTTCCGAGAGAAATGGATGGGGAAATGATTGGGACGATTGAGGCAATAAACAGCTAAAAAATCTACGTGCTCCTCAATGTTGAAAGGTCAGTAAACTTGGTGAAAATAGAAATTTTTTAGCAACAGGATAGTGAAAAGAAGAAAATTCTATGTGATAAAAAGCTGTGAGCCAATTTTTTATTGTTCAATTACAATAGAGAAAATGTATGGTTATCAGAATAGATCTCAGTTTTCCTGTTAGTAGAGATGGAAATTGGAATTTGGAAAGAGCAATAATGTAGGTTTTTAAGAAAGAGCATTGCAATGATAAGACTATTTGCTTTGTAATCTTAAGTGCTAGCGTGCTTACATAGATCAAGAGGGTATCATTTGGCCCCTCAATAAAGTGCAAACCTAAAGAGAA
mRNA sequence
CGTTTGAAAATGTCCATGGAGGAAGGAATATTGTTTACTAATTAATTTATTTTTTGATAAATAATTTTAAAACGGAGCCTCACCGGACCACGCCGTCTCTCCGGACGTAACGAGGAAGAAGGAAAGAGACTGTGCTCGCGTTTCCATATTTCGTTGATTTGACTTTTCCATCGACGACTTCTCCTTCCACGTACTTTCCTCGTTGAATTCAAAATCTTCAACAATGTGGCTTGTTTCCGCCATCTCGAGTTCATAGCGGTTCTGCTTGATTCACCCGTCGTAGAAAGGCGGCAGATCTTTCGCGGCGATGAATCGTGATTTGGTGCTCGTATTCGTGTTCTTTGTTCTCATTCTCGTCTCTCCTGGATCCGATGCTTCTCTCTCTGACCGTATTTGGAATCTTCATCTCCGCTTCTCGCTTTCGAAGGATTCTCCCGAGCGTATAGCTCCAGCTCCTGGTCCTAGCTCTGTTATCAATGGCAAATTGATTGGGGGCGTCCCGATAAGTTCTCCAACTCCTGCAATTCCTCCATTTCCCAGTTCAACTGATGGTTTTACCTTGGAGAAGTGCGACAGGAACAAGACTTGCCACGACCTTAAGAAAATGACTGCATGCCTTCAGTTCGCAGAACAAGCGATGGTGGAACAATATCTTCTGATCCAAAATGATGGAGAGACTTCTCTGAAAGTGAATGTTATAGTTTCCGATGCTAAATACAAGGAGGTACAAGTTCCTGAGCATCATGCTAAAAAGGTTAATGTTTCAGACATTCCAGAAACTTCAACGATTATATTAGATGCTGGAAATGGGAAGTGTGTAATTCACGTAGGATCACCAACAAAAAACGGCAGCATTGTTAAGCAGACCTCTTCCTATGTAACCCATTTAAACCTCATATCCGGATCCTACCTACTATTTTCAATTATTTTGATCATTGGAGGTGTCTGGGCATGCTGCAAAATGAGAACCAAGGAACGTCATGCCAATGGAATCCCATATCAGGAGCTTGAATTGGCAGAGAACGACTCTTCTCCAACCAACGATTTGGAAGCAGCAGAAGGTTGGGATCAAGGCTGGGACGATGACTGGGACGAGTCGAAGCCTGCAAATAAATCCAGTTCTGACATGAAGGAAAATGGATCAAATGGTATTAACTCAAGAACTTCCGAGAGAAATGGATGGGGAAATGATTGGGACGATTGAGGCAATAAACAGCTAAAAAATCTACGTGCTCCTCAATGTTGAAAGGTCAGTAAACTTGGTGAAAATAGAAATTTTTTAGCAACAGGATAGTGAAAAGAAGAAAATTCTATGTGATAAAAAGCTGTGAGCCAATTTTTTATTGTTCAATTACAATAGAGAAAATGTATGGTTATCAGAATAGATCTCAGTTTTCCTGTTAGTAGAGATGGAAATTGGAATTTGGAAAGAGCAATAATGTAGGTTTTTAAGAAAGAGCATTGCAATGATAAGACTATTTGCTTTGTAATCTTAAGTGCTAGCGTGCTTACATAGATCAAGAGGGTATCATTTGGCCCCTCAATAAAGTGCAAACCTAAAGAGAA
Coding sequence (CDS)
ATGAATCGTGATTTGGTGCTCGTATTCGTGTTCTTTGTTCTCATTCTCGTCTCTCCTGGATCCGATGCTTCTCTCTCTGACCGTATTTGGAATCTTCATCTCCGCTTCTCGCTTTCGAAGGATTCTCCCGAGCGTATAGCTCCAGCTCCTGGTCCTAGCTCTGTTATCAATGGCAAATTGATTGGGGGCGTCCCGATAAGTTCTCCAACTCCTGCAATTCCTCCATTTCCCAGTTCAACTGATGGTTTTACCTTGGAGAAGTGCGACAGGAACAAGACTTGCCACGACCTTAAGAAAATGACTGCATGCCTTCAGTTCGCAGAACAAGCGATGGTGGAACAATATCTTCTGATCCAAAATGATGGAGAGACTTCTCTGAAAGTGAATGTTATAGTTTCCGATGCTAAATACAAGGAGGTACAAGTTCCTGAGCATCATGCTAAAAAGGTTAATGTTTCAGACATTCCAGAAACTTCAACGATTATATTAGATGCTGGAAATGGGAAGTGTGTAATTCACGTAGGATCACCAACAAAAAACGGCAGCATTGTTAAGCAGACCTCTTCCTATGTAACCCATTTAAACCTCATATCCGGATCCTACCTACTATTTTCAATTATTTTGATCATTGGAGGTGTCTGGGCATGCTGCAAAATGAGAACCAAGGAACGTCATGCCAATGGAATCCCATATCAGGAGCTTGAATTGGCAGAGAACGACTCTTCTCCAACCAACGATTTGGAAGCAGCAGAAGGTTGGGATCAAGGCTGGGACGATGACTGGGACGAGTCGAAGCCTGCAAATAAATCCAGTTCTGACATGAAGGAAAATGGATCAAATGGTATTAACTCAAGAACTTCCGAGAGAAATGGATGGGGAAATGATTGGGACGATTGA
Protein sequence
MNRDLVLVFVFFVLILVSPGSDASLSDRIWNLHLRFSLSKDSPERIAPAPGPSSVINGKLIGGVPISSPTPAIPPFPSSTDGFTLEKCDRNKTCHDLKKMTACLQFAEQAMVEQYLLIQNDGETSLKVNVIVSDAKYKEVQVPEHHAKKVNVSDIPETSTIILDAGNGKCVIHVGSPTKNGSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAENDSSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMKENGSNGINSRTSERNGWGNDWDD
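The protein sequence above should correspond to a frame-0 translation of the coding sequence. The following is a minimal Python sketch of that check, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The helper name `translate` is illustrative and not part of the database; only the first 30 and last 15 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
# Assumption: NCBI translation table 1 is the right table for this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Raises KeyError on ambiguous bases such as N (not handled here).
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 15 nucleotides of the coding sequence listed above.
cds_start = "ATGAATCGTGATTTGGTGCTCGTATTCGTG"
cds_end = "GATTGGGACGATTGA"

print(translate(cds_start))  # MNRDLVLVFV
print(translate(cds_end))    # DWDD
```

Both fragments agree with the first ten and last four residues of the protein sequence. Passing the full CDS string to the same function would be the complete check.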
Homology
BLAST of Carg02159 vs. NCBI nr
Match:
KAG6578483.1 (hypothetical protein SDJN03_22931, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 595.9 bits (1535), Expect = 1.9e-166
Identity = 298/298 (100.00%), Positives = 298/298 (100.00%), Query Frame = 0
Query: 1 MNRDLVLVFVFFVLILVSPGSDASLSDRIWNLHLRFSLSKDSPERIAPAPGPSSVINGKL 60
MNRDLVLVFVFFVLILVSPGSDASLSDRIWNLHLRFSLSKDSPERIAPAPGPSSVINGKL
Sbjct: 295 MNRDLVLVFVFFVLILVSPGSDASLSDRIWNLHLRFSLSKDSPERIAPAPGPSSVINGKL 354
Query: 61 IGGVPISSPTPAIPPFPSSTDGFTLEKCDRNKTCHDLKKMTACLQFAEQAMVEQYLLIQN 120
IGGVPISSPTPAIPPFPSSTDGFTLEKCDRNKTCHDLKKMTACLQFAEQAMVEQYLLIQN
Sbjct: 355 IGGVPISSPTPAIPPFPSSTDGFTLEKCDRNKTCHDLKKMTACLQFAEQAMVEQYLLIQN 414
Query: 121 DGETSLKVNVIVSDAKYKEVQVPEHHAKKVNVSDIPETSTIILDAGNGKCVIHVGSPTKN 180
DGETSLKVNVIVSDAKYKEVQVPEHHAKKVNVSDIPETSTIILDAGNGKCVIHVGSPTKN
Sbjct: 415 DGETSLKVNVIVSDAKYKEVQVPEHHAKKVNVSDIPETSTIILDAGNGKCVIHVGSPTKN 474
Query: 181 GSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAEND 240
GSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAEND
Sbjct: 475 GSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAEND 534
Query: 241 SSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMKENGSNGINSRTSERNGWGNDWDD 299
SSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMKENGSNGINSRTSERNGWGNDWDD
Sbjct: 535 SSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMKENGSNGINSRTSERNGWGNDWDD 592
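Each hit in this report opens with an HSP score line and an identity line. The sketch below shows one way to pull the numbers out of those two lines with regular expressions and recompute the percent identity from the counts. The function name `parse_hsp` and the returned field names are illustrative assumptions, and the patterns are written only against the line layout seen on this page.

```python
import re

# Patterns match the two header lines as they appear in this report.
HSP_SCORE = re.compile(r"Score: ([\d.]+) bits \((\d+)\), Expect = (\S+)")
HSP_IDENTITY = re.compile(r"Identity = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp(score_line: str, identity_line: str) -> dict:
    """Extract bit score, raw score, E-value and identity counts for one HSP."""
    score = HSP_SCORE.search(score_line)
    identity = HSP_IDENTITY.search(identity_line)
    if score is None or identity is None:
        raise ValueError("line does not look like an HSP header")
    identical = int(identity.group(1))
    aligned = int(identity.group(2))
    return {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "expect": float(score.group(3)),
        "identical": identical,
        "aligned": aligned,
        # Recomputed from the counts rather than copied from the line.
        "percent_identity": round(100 * identical / aligned, 2),
    }


hsp = parse_hsp(
    "HSP 1 Score: 595.9 bits (1535), Expect = 1.9e-166",
    "Identity = 298/298 (100.00%)",
)
print(hsp["raw_score"], hsp["expect"], hsp["percent_identity"])
```

Recomputing the percentage from the identical/aligned counts gives a quick consistency check on the reported value.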
BLAST of Carg02159 vs. NCBI nr
Match:
XP_022938773.1 (uncharacterized protein LOC111444889 [Cucurbita moschata] >KAG7016047.1 hypothetical protein SDJN02_21151 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 595.9 bits (1535), Expect = 1.9e-166
Identity = 298/298 (100.00%), Positives = 298/298 (100.00%), Query Frame = 0
Query: 1 MNRDLVLVFVFFVLILVSPGSDASLSDRIWNLHLRFSLSKDSPERIAPAPGPSSVINGKL 60
MNRDLVLVFVFFVLILVSPGSDASLSDRIWNLHLRFSLSKDSPERIAPAPGPSSVINGKL
Sbjct: 1 MNRDLVLVFVFFVLILVSPGSDASLSDRIWNLHLRFSLSKDSPERIAPAPGPSSVINGKL 60
Query: 61 IGGVPISSPTPAIPPFPSSTDGFTLEKCDRNKTCHDLKKMTACLQFAEQAMVEQYLLIQN 120
IGGVPISSPTPAIPPFPSSTDGFTLEKCDRNKTCHDLKKMTACLQFAEQAMVEQYLLIQN
Sbjct: 61 IGGVPISSPTPAIPPFPSSTDGFTLEKCDRNKTCHDLKKMTACLQFAEQAMVEQYLLIQN 120
Query: 121 DGETSLKVNVIVSDAKYKEVQVPEHHAKKVNVSDIPETSTIILDAGNGKCVIHVGSPTKN 180
DGETSLKVNVIVSDAKYKEVQVPEHHAKKVNVSDIPETSTIILDAGNGKCVIHVGSPTKN
Sbjct: 121 DGETSLKVNVIVSDAKYKEVQVPEHHAKKVNVSDIPETSTIILDAGNGKCVIHVGSPTKN 180
Query: 181 GSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAEND 240
GSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAEND
Sbjct: 181 GSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAEND 240
Query: 241 SSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMKENGSNGINSRTSERNGWGNDWDD 299
SSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMKENGSNGINSRTSERNGWGNDWDD
Sbjct: 241 SSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMKENGSNGINSRTSERNGWGNDWDD 298
BLAST of Carg02159 vs. NCBI nr
Match:
XP_023549795.1 (uncharacterized protein LOC111808188 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 580.1 bits (1494), Expect = 1.1e-161
Identity = 289/298 (96.98%), Positives = 294/298 (98.66%), Query Frame = 0
Query: 1 MNRDLVLVFVFFVLILVSPGSDASLSDRIWNLHLRFSLSKDSPERIAPAPGPSSVINGKL 60
MNRD VLV VFFVLILVSPGSDASL+DRIWNLHLRFSL KDSPERIAPAPGPSSVINGKL
Sbjct: 1 MNRDFVLVIVFFVLILVSPGSDASLTDRIWNLHLRFSLLKDSPERIAPAPGPSSVINGKL 60
Query: 61 IGGVPISSPTPAIPPFPSSTDGFTLEKCDRNKTCHDLKKMTACLQFAEQAMVEQYLLIQN 120
IGGVPISSPTPAIPPFPSSTDGFTLEKCDRNKTCHDLKKMTACLQFAEQAMVEQYLLIQN
Sbjct: 61 IGGVPISSPTPAIPPFPSSTDGFTLEKCDRNKTCHDLKKMTACLQFAEQAMVEQYLLIQN 120
Query: 121 DGETSLKVNVIVSDAKYKEVQVPEHHAKKVNVSDIPETSTIILDAGNGKCVIHVGSPTKN 180
DGETSLKVNVIVSDAKYK+VQVPEHHAKKVNVSDIPETSTIILDAGNGKCVIHVGSPTKN
Sbjct: 121 DGETSLKVNVIVSDAKYKDVQVPEHHAKKVNVSDIPETSTIILDAGNGKCVIHVGSPTKN 180
Query: 181 GSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAEND 240
GSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAE+D
Sbjct: 181 GSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAEHD 240
Query: 241 SSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMKENGSNGINSRTSERNGWGNDWDD 299
SSPTNDLEAAEGWDQGWDDDWDESKPANKS+SD+K NGSNGINSRTSERNGWGNDWDD
Sbjct: 241 SSPTNDLEAAEGWDQGWDDDWDESKPANKSNSDIKGNGSNGINSRTSERNGWGNDWDD 298
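The identity count for a hit is the number of alignment columns in which query and subject carry the same residue. As an illustration, the sketch below counts identical columns for the first 60-column block of the Cucurbita pepo alignment above. The function name `compare_block` is hypothetical. The "Positives" figure cannot be reproduced this way, because it also depends on a substitution matrix that this page does not state.

```python
def compare_block(query: str, subject: str):
    """Return (identical columns, total columns, 1-based mismatch positions)."""
    if len(query) != len(subject):
        raise ValueError("aligned strings must have equal length")
    mismatches = [
        column
        for column, (q, s) in enumerate(zip(query, subject), start=1)
        # A gap character never counts as an identity.
        if q != s or q == "-"
    ]
    return len(query) - len(mismatches), len(query), mismatches


# Query and Sbjct rows for columns 1-60 of the XP_023549795.1 alignment.
query = "MNRDLVLVFVFFVLILVSPGSDASLSDRIWNLHLRFSLSKDSPERIAPAPGPSSVINGKL"
sbjct = "MNRDFVLVIVFFVLILVSPGSDASLTDRIWNLHLRFSLLKDSPERIAPAPGPSSVINGKL"

identical, columns, mismatches = compare_block(query, sbjct)
print(identical, columns, mismatches)  # 56 60 [5, 9, 26, 39]
```

Summing the identical and total columns over all blocks of a hit should reproduce the counts on its Identity line.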
BLAST of Carg02159 vs. NCBI nr
Match:
XP_022992931.1 (uncharacterized protein LOC111489115 [Cucurbita maxima])
HSP 1 Score: 578.9 bits (1491), Expect = 2.4e-161
Identity = 291/298 (97.65%), Positives = 293/298 (98.32%), Query Frame = 0
Query: 1 MNRDLVLVFVFFVLILVSPGSDASLSDRIWNLHLRFSLSKDSPERIAPAPGPSSVINGKL 60
MNRDLVLVFVFFVLILVSPGS ASLSDRIWNL LRFSLSKDSPERIAPAPGPSSVINGKL
Sbjct: 1 MNRDLVLVFVFFVLILVSPGSGASLSDRIWNLRLRFSLSKDSPERIAPAPGPSSVINGKL 60
Query: 61 IGGVPISSPTPAIPPFPSSTDGFTLEKCDRNKTCHDLKKMTACLQFAEQAMVEQYLLIQN 120
IGGVPISSPTPAIPPFPSSTDGFT EKCDRNKTCHDLKKMTACLQFAEQAMVE+YLLIQN
Sbjct: 61 IGGVPISSPTPAIPPFPSSTDGFTSEKCDRNKTCHDLKKMTACLQFAEQAMVEKYLLIQN 120
Query: 121 DGETSLKVNVIVSDAKYKEVQVPEHHAKKVNVSDIPETSTIILDAGNGKCVIHVGSPTKN 180
DGETSLKVNVIVSDAKYKEVQVPEH AKKVNVSDIPETSTIILDAGNGKCVIHVGSPTKN
Sbjct: 121 DGETSLKVNVIVSDAKYKEVQVPEHRAKKVNVSDIPETSTIILDAGNGKCVIHVGSPTKN 180
Query: 181 GSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAEND 240
GSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAE+D
Sbjct: 181 GSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAEHD 240
Query: 241 SSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMKENGSNGINSRTSERNGWGNDWDD 299
SSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMK NGSNGINSRTSERNGWGNDWDD
Sbjct: 241 SSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMKANGSNGINSRTSERNGWGNDWDD 298
BLAST of Carg02159 vs. NCBI nr
Match:
XP_038886197.1 (uncharacterized protein LOC120076442 [Benincasa hispida])
HSP 1 Score: 495.0 bits (1273), Expect = 4.6e-136
Identity = 248/301 (82.39%), Positives = 269/301 (89.37%), Query Frame = 0
Query: 1 MNRDLVLVFVFFVLILVSPGSDA-SLSDRIWNLHLRFSLSKDSPERIAPAPGPSSVINGK 60
MNRDL LVF+FF+ IL+SPGSDA S RIWNLH RF+LSKD P+ +APAPGPSSVINGK
Sbjct: 1 MNRDLALVFIFFLFILLSPGSDASSFPYRIWNLHRRFALSKDPPQSVAPAPGPSSVINGK 60
Query: 61 LIGGVPISSPTPAIPPFPSSTDGFTLEKCD-RNKTCHDLKKMTACLQFAEQAMVEQYLLI 120
L G P SSPTP IPPFPSSTDGFT EKCD +KTCHDLK MTACL AEQA++EQYLLI
Sbjct: 61 LSRGAPKSSPTPVIPPFPSSTDGFTTEKCDSSSKTCHDLKNMTACLLLAEQAVMEQYLLI 120
Query: 121 QNDGETSLKVNVIVSDAKYKEVQVPEHHAKKVNVSDIPETSTIILDAGNGKCVIHVGSPT 180
QNDGETSLKVNVIVSDAKYKE+QVPEHHAKKVN+SD P S IILDAGNGKC++HVG T
Sbjct: 121 QNDGETSLKVNVIVSDAKYKEIQVPEHHAKKVNISDFPGNSMIILDAGNGKCIVHVGLLT 180
Query: 181 KNGSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAE 240
K+GSI KQ SSYVTHLN++SGSYLLFSI+LIIGGVWACCKMRTKERHA+GIPYQELELAE
Sbjct: 181 KSGSIFKQISSYVTHLNIVSGSYLLFSIVLIIGGVWACCKMRTKERHADGIPYQELELAE 240
Query: 241 NDSSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMKENG-SNGINSRTSERNGWGNDWD 299
+DSSPTNDLEAAEGWDQGWDDDWDESKPANKS SDMK NG SNGINSRTS+RNGW NDWD
Sbjct: 241 HDSSPTNDLEAAEGWDQGWDDDWDESKPANKSHSDMKANGSSNGINSRTSDRNGWENDWD 300
BLAST of Carg02159 vs. ExPASy TrEMBL
Match:
A0A6J1FJV4 (uncharacterized protein LOC111444889 OS=Cucurbita moschata OX=3662 GN=LOC111444889 PE=4 SV=1)
HSP 1 Score: 595.9 bits (1535), Expect = 9.3e-167
Identity = 298/298 (100.00%), Positives = 298/298 (100.00%), Query Frame = 0
Query: 1 MNRDLVLVFVFFVLILVSPGSDASLSDRIWNLHLRFSLSKDSPERIAPAPGPSSVINGKL 60
MNRDLVLVFVFFVLILVSPGSDASLSDRIWNLHLRFSLSKDSPERIAPAPGPSSVINGKL
Sbjct: 1 MNRDLVLVFVFFVLILVSPGSDASLSDRIWNLHLRFSLSKDSPERIAPAPGPSSVINGKL 60
Query: 61 IGGVPISSPTPAIPPFPSSTDGFTLEKCDRNKTCHDLKKMTACLQFAEQAMVEQYLLIQN 120
IGGVPISSPTPAIPPFPSSTDGFTLEKCDRNKTCHDLKKMTACLQFAEQAMVEQYLLIQN
Sbjct: 61 IGGVPISSPTPAIPPFPSSTDGFTLEKCDRNKTCHDLKKMTACLQFAEQAMVEQYLLIQN 120
Query: 121 DGETSLKVNVIVSDAKYKEVQVPEHHAKKVNVSDIPETSTIILDAGNGKCVIHVGSPTKN 180
DGETSLKVNVIVSDAKYKEVQVPEHHAKKVNVSDIPETSTIILDAGNGKCVIHVGSPTKN
Sbjct: 121 DGETSLKVNVIVSDAKYKEVQVPEHHAKKVNVSDIPETSTIILDAGNGKCVIHVGSPTKN 180
Query: 181 GSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAEND 240
GSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAEND
Sbjct: 181 GSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAEND 240
Query: 241 SSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMKENGSNGINSRTSERNGWGNDWDD 299
SSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMKENGSNGINSRTSERNGWGNDWDD
Sbjct: 241 SSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMKENGSNGINSRTSERNGWGNDWDD 298
BLAST of Carg02159 vs. ExPASy TrEMBL
Match:
A0A6J1JUX9 (uncharacterized protein LOC111489115 OS=Cucurbita maxima OX=3661 GN=LOC111489115 PE=4 SV=1)
HSP 1 Score: 578.9 bits (1491), Expect = 1.2e-161
Identity = 291/298 (97.65%), Positives = 293/298 (98.32%), Query Frame = 0
Query: 1 MNRDLVLVFVFFVLILVSPGSDASLSDRIWNLHLRFSLSKDSPERIAPAPGPSSVINGKL 60
MNRDLVLVFVFFVLILVSPGS ASLSDRIWNL LRFSLSKDSPERIAPAPGPSSVINGKL
Sbjct: 1 MNRDLVLVFVFFVLILVSPGSGASLSDRIWNLRLRFSLSKDSPERIAPAPGPSSVINGKL 60
Query: 61 IGGVPISSPTPAIPPFPSSTDGFTLEKCDRNKTCHDLKKMTACLQFAEQAMVEQYLLIQN 120
IGGVPISSPTPAIPPFPSSTDGFT EKCDRNKTCHDLKKMTACLQFAEQAMVE+YLLIQN
Sbjct: 61 IGGVPISSPTPAIPPFPSSTDGFTSEKCDRNKTCHDLKKMTACLQFAEQAMVEKYLLIQN 120
Query: 121 DGETSLKVNVIVSDAKYKEVQVPEHHAKKVNVSDIPETSTIILDAGNGKCVIHVGSPTKN 180
DGETSLKVNVIVSDAKYKEVQVPEH AKKVNVSDIPETSTIILDAGNGKCVIHVGSPTKN
Sbjct: 121 DGETSLKVNVIVSDAKYKEVQVPEHRAKKVNVSDIPETSTIILDAGNGKCVIHVGSPTKN 180
Query: 181 GSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAEND 240
GSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAE+D
Sbjct: 181 GSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAEHD 240
Query: 241 SSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMKENGSNGINSRTSERNGWGNDWDD 299
SSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMK NGSNGINSRTSERNGWGNDWDD
Sbjct: 241 SSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMKANGSNGINSRTSERNGWGNDWDD 298
BLAST of Carg02159 vs. ExPASy TrEMBL
Match:
A0A1S3C4Q3 (uncharacterized protein LOC103496622 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103496622 PE=4 SV=1)
HSP 1 Score: 471.9 bits (1213), Expect = 2.0e-129
Identity = 233/299 (77.93%), Positives = 260/299 (86.96%), Query Frame = 0
Query: 1 MNRDLVLVFVFFVLILVSPGSDASLSDRIWNLHLRFSLSKDSPERIAPAPGPSSVINGKL 60
MNRDL +F+F +LIL SPGSDAS + WNLHLRF++SKDS + +AP PGP+SV+NGKL
Sbjct: 1 MNRDLAFLFLFSLLILFSPGSDASFPNHFWNLHLRFAVSKDSLQSVAPTPGPNSVVNGKL 60
Query: 61 IGGVPISSPTPAIPPFPSSTDGFTLEKCDRN-KTCHDLKKMTACLQFAEQAMVEQYLLIQ 120
G SS TPAIPP P+STDGFT EKCD + KTCHDLK ++ACL AEQA VEQYLLIQ
Sbjct: 61 SRGATTSSATPAIPPSPNSTDGFTTEKCDSSYKTCHDLKDLSACLLSAEQAEVEQYLLIQ 120
Query: 121 NDGETSLKVNVIVSDAKYKEVQVPEHHAKKVNVSDIPETSTIILDAGNGKCVIHVGSPTK 180
NDGETSLKVNVIVSD KYKE+QVPEHHAKKVN+SD P S IILDAGNGKC++HV S TK
Sbjct: 121 NDGETSLKVNVIVSDTKYKEIQVPEHHAKKVNISDFPGNSMIILDAGNGKCIVHVRSLTK 180
Query: 181 NGSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAEN 240
NGSI KQ SSYVTHLNL+SGSYLLFSI+ IIGG+WACCKM+TKERHANGIPYQELELAE+
Sbjct: 181 NGSIFKQISSYVTHLNLVSGSYLLFSIVFIIGGIWACCKMKTKERHANGIPYQELELAEH 240
Query: 241 DSSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMKENGSNGINSRTSERNGWGNDWDD 299
DSSPTNDLEAAEGWDQGWDDDWDESKPAN+SSSDMK +NGINS+TS+RNGW NDWDD
Sbjct: 241 DSSPTNDLEAAEGWDQGWDDDWDESKPANRSSSDMK---ANGINSKTSDRNGWENDWDD 296
BLAST of Carg02159 vs. ExPASy TrEMBL
Match:
A0A6J1C206 (uncharacterized protein LOC111006692 OS=Momordica charantia OX=3673 GN=LOC111006692 PE=4 SV=1)
HSP 1 Score: 418.3 bits (1074), Expect = 2.7e-113
Identity = 214/302 (70.86%), Positives = 249/302 (82.45%), Query Frame = 0
Query: 1 MNRDLVLVFVFFVLILVSPGSDASLSDRIWNLHLRFSLSKDSPERIAPA--PGPSSVING 60
MN L L+F FF+LI GSDAS D H RF+LS+ SP+ APA PGPSSV N
Sbjct: 1 MNPHLPLLFAFFLLI---RGSDASFPDD----HFRFALSEVSPQSKAPAPGPGPSSVTNR 60
Query: 61 KLIGGVPISSPTPAIPPFPSSTDGFTLEKCDRN-KTCHDLKKMTACLQFAEQAMVEQYLL 120
K GG P SSPTPAIPPF S DGFT EKC+ + TCHDL+ MTACL FAE A+VEQYLL
Sbjct: 61 KFSGGAPKSSPTPAIPPFTSLVDGFTTEKCNSSYNTCHDLENMTACLLFAEHAVVEQYLL 120
Query: 121 IQNDGETSLKVNVIVSDAKYKEVQVPEHHAKKVNVSDIPETSTIILDAGNGKCVIHVGSP 180
IQNDGETS+KVN+I+S+AKYKE+++PEHHAKKVN+SD+P S I L+AGNGKC+IHVG
Sbjct: 121 IQNDGETSMKVNIIISNAKYKEIKIPEHHAKKVNISDVPGNSMITLEAGNGKCMIHVGLL 180
Query: 181 TKNGSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELA 240
TK+GSI+K+ S Y+ HLNL+SGSYLLF+I+LIIGGVWACC M TKERHA+G+PYQELELA
Sbjct: 181 TKSGSILKKISFYLNHLNLVSGSYLLFAIVLIIGGVWACCNMGTKERHADGVPYQELELA 240
Query: 241 ENDSSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMKENG-SNGINSRTSERNGWGNDW 299
E+DSSPTNDLEAAEGWDQGWDDDWDESK NKSS+ MK NG SNG+NS+TS+R+GWGNDW
Sbjct: 241 EHDSSPTNDLEAAEGWDQGWDDDWDESKSTNKSSAQMKANGSSNGLNSKTSDRDGWGNDW 295
BLAST of Carg02159 vs. ExPASy TrEMBL
Match:
A0A0A0K8R5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G014580 PE=4 SV=1)
HSP 1 Score: 394.0 bits (1011), Expect = 5.4e-106
Identity = 193/240 (80.42%), Positives = 212/240 (88.33%), Query Frame = 0
Query: 60 LIGGVPISSPTPAIPPFPSSTDGFTLEKCDRN-KTCHDLKKMTACLQFAEQAMVEQYLLI 119
L G P +SPTPAIPPFP STDGFT EKCD + KTCHDLK + ACL AEQA VEQYLLI
Sbjct: 21 LFRGAPTNSPTPAIPPFPKSTDGFTTEKCDSSYKTCHDLKDLIACLLSAEQAEVEQYLLI 80
Query: 120 QNDGETSLKVNVIVSDAKYKEVQVPEHHAKKVNVSDIPETSTIILDAGNGKCVIHVGSPT 179
QN+GETSLKVNV VSD KYKE+QVPEHHAKKVN+SD P S IILDAGNGKC++H+GS T
Sbjct: 81 QNNGETSLKVNVTVSDTKYKEIQVPEHHAKKVNISDFPGNSMIILDAGNGKCIVHLGSLT 140
Query: 180 KNGSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTKERHANGIPYQELELAE 239
KNGSI KQ SSYVTHLNL+SGSYLL SI+ I+GG+WACCKM+TKERHANGIPYQELELAE
Sbjct: 141 KNGSIFKQISSYVTHLNLVSGSYLLLSIVFIVGGIWACCKMKTKERHANGIPYQELELAE 200
Query: 240 NDSSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMKENGSNGINSRTSERNGWGNDWDD 299
+D+SPTNDLEAAEGWDQGWDDDWDESKP+NKSSSDMK +NGINSRTS+RNGW NDWDD
Sbjct: 201 HDTSPTNDLEAAEGWDQGWDDDWDESKPSNKSSSDMK---ANGINSRTSDRNGWENDWDD 257
BLAST of Carg02159 vs. TAIR 10
Match:
AT3G51580.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 15 growth stages; Has 1768 Blast hits to 1607 proteins in 294 species: Archae - 2; Bacteria - 552; Metazoa - 381; Fungi - 236; Plants - 306; Viruses - 38; Other Eukaryotes - 253 (source: NCBI BLink). )
HSP 1 Score: 148.3 bits (373), Expect = 9.9e-36
Identity = 90/263 (34.22%), Positives = 145/263 (55.13%), Query Frame = 0
Query: 49 APGPSSVINGKLIGGVPISSPTPAIPPF-----PSSTDGFTLEKC-DRNKTCHDLKKMTA 108
AP P S+ +GK SP A P S++ +++ C ++ C + A
Sbjct: 131 APPPKSLESGKNETEPGKESPPLAKDPAKGKDDKGSSESASVDTCVGKSNICRTENSLVA 190
Query: 109 CLQFAEQAMVEQYLLIQNDGETSLKVNVIVSDAKYKEVQVPEHHAKKVNVSDIPETSTII 168
C ++ +L+QN+GETSLK +++ +E+ +P+H ++KVN+S +T+ II
Sbjct: 191 CTLSIDKGAANWLILVQNEGETSLKAKIVLPVNALQELTLPKHQSQKVNISISGDTNKII 250
Query: 169 LDAGNGKCVIHVGSPTKNGSIVKQTSSYVTHLNLISGSYLLFSIILIIGGVWACCKMRTK 228
LD G G+C +H+ P++ ++ SY + I+G+Y L ++I GG+WA C R
Sbjct: 251 LDTGKGQCALHM-YPSEESTLPFHFPSYEKLVTPINGAYFLIVSVIIFGGIWAFCLCRKN 310
Query: 229 ERHANGIPYQELELA-----ENDSSPTNDLEAAEGWDQGWDDDWDESKPANKSSSDMK-- 288
R +G+PY+ELEL+ EN+S +D+E A+ WD+GWDDDWDE+ S K
Sbjct: 311 RRAGSGVPYRELELSGGPGLENESG-VHDVETAD-WDEGWDDDWDENNAVKSPGSAAKSV 370
Query: 289 ENGSNGINSRTSERNGWGNDWDD 299
+NG+ +R R+GW +DWDD
Sbjct: 371 SISANGLTARAPNRDGWDHDWDD 390
BLAST of Carg02159 vs. TAIR 10
Match:
AT3G51580.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages. )
HSP 1 Score: 139.4 bits (350), Expect = 4.6e-33
Identity = 93/283 (32.86%), Positives = 147/283 (51.94%), Query Frame = 0
Query: 49 APGPSSVINGKLIGGVPISSPTPAIPPF-----PSSTDGFTLEKC-DRNKTCHDLKKMTA 108
AP P S+ +GK SP A P S++ +++ C ++ C + A
Sbjct: 131 APPPKSLESGKNETEPGKESPPLAKDPAKGKDDKGSSESASVDTCVGKSNICRTENSLVA 190
Query: 109 CL------------------QFAEQAM--VEQYLLIQNDGETSLKVNVIVSDAKYKEVQV 168
C QFA + +L+QN+GETSLK +++ +E+ +
Sbjct: 191 CTLSIDKGYETFLDIIVIPQQFARSLLCAANWLILVQNEGETSLKAKIVLPVNALQELTL 250
Query: 169 PEHHAKKVNVSDIPETSTIILDAGNGKCVIHVGSPTKNGSIVKQTSSYVTHLNLISGSYL 228
P+H ++KVN+S +T+ IILD G G+C +H+ P++ ++ SY + I+G+Y
Sbjct: 251 PKHQSQKVNISISGDTNKIILDTGKGQCALHM-YPSEESTLPFHFPSYEKLVTPINGAYF 310
Query: 229 LFSIILIIGGVWACCKMRTKERHANGIPYQELELA-----ENDSSPTNDLEAAEGWDQGW 288
L ++I GG+WA C R R +G+PY+ELEL+ EN+S +D+E A+ WD+GW
Sbjct: 311 LIVSVIIFGGIWAFCLCRKNRRAGSGVPYRELELSGGPGLENESG-VHDVETAD-WDEGW 370
Query: 289 DDDWDESKPANKSSSDMK--ENGSNGINSRTSERNGWGNDWDD 299
DDDWDE+ S K +NG+ +R R+GW +DWDD
Sbjct: 371 DDDWDENNAVKSPGSAAKSVSISANGLTARAPNRDGWDHDWDD 410
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
KAG6578483.1 | 1.9e-166 | 100.00 | hypothetical protein SDJN03_22931, partial [Cucurbita argyrosperma subsp. sorori...
XP_022938773.1 | 1.9e-166 | 100.00 | uncharacterized protein LOC111444889 [Cucurbita moschata] >KAG7016047.1 hypothet...
XP_023549795.1 | 1.1e-161 | 96.98 | uncharacterized protein LOC111808188 [Cucurbita pepo subsp. pepo]
XP_022992931.1 | 2.4e-161 | 97.65 | uncharacterized protein LOC111489115 [Cucurbita maxima]
XP_038886197.1 | 4.6e-136 | 82.39 | uncharacterized protein LOC120076442 [Benincasa hispida]
Match Name | E-value | Identity (%) | Description
A0A6J1FJV4 | 9.3e-167 | 100.00 | uncharacterized protein LOC111444889 OS=Cucurbita moschata OX=3662 GN=LOC1114448...
A0A6J1JUX9 | 1.2e-161 | 97.65 | uncharacterized protein LOC111489115 OS=Cucurbita maxima OX=3661 GN=LOC111489115...
A0A1S3C4Q3 | 2.0e-129 | 77.93 | uncharacterized protein LOC103496622 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1C206 | 2.7e-113 | 70.86 | uncharacterized protein LOC111006692 OS=Momordica charantia OX=3673 GN=LOC111006...
A0A0A0K8R5 | 5.4e-106 | 80.42 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G014580 PE=4 SV=1
Match Name | E-value | Identity (%) | Description
AT3G51580.1 | 9.9e-36 | 34.22 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G51580.2 | 4.6e-33 | 32.86 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
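The pipe-delimited summary rows above can be loaded into records for filtering or sorting. The following sketch assumes that descriptions never contain a `|` character, which holds for the rows on this page but is not guaranteed in general. The function name `parse_summary` and the 1e-100 cutoff are illustrative choices, not values taken from the database. Header rows and any trailing columns after the description are ignored.

```python
def parse_summary(lines):
    """Turn 'name | e-value | identity | description' rows into dicts."""
    hits = []
    for line in lines:
        fields = [field.strip() for field in line.split("|")]
        # Skip header rows and anything too short to be a hit row.
        if len(fields) < 4 or fields[0] == "Match Name":
            continue
        hits.append({
            "name": fields[0],
            "evalue": float(fields[1]),
            "identity": float(fields[2]),
            "description": fields[3],
        })
    return hits


rows = [
    "Match Name | E-value | Identity | Description",
    "XP_023549795.1 | 1.1e-161 | 96.98 | uncharacterized protein LOC111808188 [Cucurbita pepo subsp. pepo]",
    "A0A0A0K8R5 | 5.4e-106 | 80.42 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G014580 PE=4 SV=1",
    "AT3G51580.1 | 9.9e-36 | 34.22 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...",
]

hits = parse_summary(rows)
# Arbitrary illustrative threshold separating the cucurbit hits from the Arabidopsis hit.
strong = [hit["name"] for hit in hits if hit["evalue"] <= 1e-100]
print(strong)  # ['XP_023549795.1', 'A0A0A0K8R5']
```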