Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CCGTGTACATCTAAAGTAGTAAAATCCAGAGGTGTAAGGCACCCATTTTCCCCATAATTTCCTGGAAAAAGCTATGTAATGTAACTCTTAAAAGAAAAACCTAATTTTAATTTTAATTCCTAAAATCTTTGACCATTTCAAGGGCTTTTTCGTAACAACACACGTTTTTAAACCCTACTTATATAAAAACCACATTTCCATTAGTTTCTCAATAACCAAATTAACCAAATAACAAATCATACACACAGCTCTCTGTTTTCAGCCGCCATTGATGGACTTCGACCATCTTCTCAATCTCTTCGACTCATTCTGGTTCCAGCGTCAAGTCCTCAACAATCATCCTTTTCCATCAAACCCACAAATCCTACAACCTCAAATTCAAGATCCCGATCCATTACCCAAGGAATCATTCCTCATTCCTCGCCTTCGAACGAGATCCATAAGCGAAGATTTAAGCTCTAAATTAAGCTTCATGTCCAATTCTAATTCCCCCGATTCAGTTCTCCTTTCTCCAAAGCTTCAAACGATCTTTTCCAGCAAGGACATCGCCGGAGCGGAGTCGCCGGAGACAAACCATAAGGTGGAAATTGAGCGGAGGCCAAAAACAGAGTACAGGAGGAGGCTTAGAGGAAGAAGGACGAGACGGTCGGAAAGTCGGAGTCTTTCAGAGCTGGAATTCGAGGAGTTAAAAGGGTTTATGGATTTGGGATTTGTTTTCTCGGAAGAGGATAAAGGTTCGAGCTTGGCGTCGATAGTTCCGGGATTGAACAGGCTAGGGAAAAGGGAAGAGAAAGGAAATAAAGAAGGAGAAGAAGAAGAAGAAGAAAAAGAAGAAGAAAGAAAATTGGGTGGTGAAATTTCGAGGCCTTATCTTTCAGAAGCTTGGGAAGCTATTGCGGAAGAAGAAGAGAAAGAGGAATTGTTGAAGAGGCCATTGATGATGAAATGGAGGTTTCCTTCTAATCAGATTGATATGAAAGATAATCTAAAATGGTGGGCTCATGCTGTTGCTTCTACTGTCAGATGACATTACTCTAATGCATTCTTTTTCTTTTTGTAATTGTAAATTTTTGGAGTTTCTTAATAATTTTGATTTGAGATCAATGTTTGACATATTCAAAGCTAATGTGTAAATATATATCCAGAAATTTCTTGGGAATTATTCTTGCTTCTTTTCCAAATATATAGATTGAGAATCAACATTTTGGTTATATCATAAATTGGTGTTTCTTTTTCCAATTACACTTCTTTTCTTTTAGAAGTTGCAATTTCATCTTAGTTTTCTTGTAATCATTTGATATTAAAACAAGAACAATCTAGCAAGTTACCTTAATTTTTGTGAGCCAATCAATCATTAATGTATCAAAGATACTAAATTTTCATCTATATTTTTTCTGTCCATGAATTTGTATAAAAAACAAGGATGGA
mRNA sequence
CCGTGTACATCTAAAGTAGTAAAATCCAGAGGTGTAAGGCACCCATTTTCCCCATAATTTCCTGGAAAAAGCTATGTAATGTAACTCTTAAAAGAAAAACCTAATTTTAATTTTAATTCCTAAAATCTTTGACCATTTCAAGGGCTTTTTCGTAACAACACACGTTTTTAAACCCTACTTATATAAAAACCACATTTCCATTAGTTTCTCAATAACCAAATTAACCAAATAACAAATCATACACACAGCTCTCTGTTTTCAGCCGCCATTGATGGACTTCGACCATCTTCTCAATCTCTTCGACTCATTCTGGTTCCAGCGTCAAGTCCTCAACAATCATCCTTTTCCATCAAACCCACAAATCCTACAACCTCAAATTCAAGATCCCGATCCATTACCCAAGGAATCATTCCTCATTCCTCGCCTTCGAACGAGATCCATAAGCGAAGATTTAAGCTCTAAATTAAGCTTCATGTCCAATTCTAATTCCCCCGATTCAGTTCTCCTTTCTCCAAAGCTTCAAACGATCTTTTCCAGCAAGGACATCGCCGGAGCGGAGTCGCCGGAGACAAACCATAAGGTGGAAATTGAGCGGAGGCCAAAAACAGAGTACAGGAGGAGGCTTAGAGGAAGAAGGACGAGACGGTCGGAAAGTCGGAGTCTTTCAGAGCTGGAATTCGAGGAGTTAAAAGGGTTTATGGATTTGGGATTTGTTTTCTCGGAAGAGGATAAAGGTTCGAGCTTGGCGTCGATAGTTCCGGGATTGAACAGGCTAGGGAAAAGGGAAGAGAAAGGAAATAAAGAAGGAGAAGAAGAAGAAGAAGAAAAAGAAGAAGAAAGAAAATTGGGTGGTGAAATTTCGAGGCCTTATCTTTCAGAAGCTTGGGAAGCTATTGCGGAAGAAGAAGAGAAAGAGGAATTGTTGAAGAGGCCATTGATGATGAAATGGAGGTTTCCTTCTAATCAGATTGATATGAAAGATAATCTAAAATGGTGGGCTCATGCTGTTGCTTCTACTGTCAGATGACATTACTCTAATGCATTCTTTTTCTTTTTGTAATTGTAAATTTTTGGAGTTTCTTAATAATTTTGATTTGAGATCAATGTTTGACATATTCAAAGCTAATGTGTAAATATATATCCAGAAATTTCTTGGGAATTATTCTTGCTTCTTTTCCAAATATATAGATTGAGAATCAACATTTTGGTTATATCATAAATTGGTGTTTCTTTTTCCAATTACACTTCTTTTCTTTTAGAAGTTGCAATTTCATCTTAGTTTTCTTGTAATCATTTGATATTAAAACAAGAACAATCTAGCAAGTTACCTTAATTTTTGTGAGCCAATCAATCATTAATGTATCAAAGATACTAAATTTTCATCTATATTTTTTCTGTCCATGAATTTGTATAAAAAACAAGGATGGA
Coding sequence (CDS)
ATGGACTTCGACCATCTTCTCAATCTCTTCGACTCATTCTGGTTCCAGCGTCAAGTCCTCAACAATCATCCTTTTCCATCAAACCCACAAATCCTACAACCTCAAATTCAAGATCCCGATCCATTACCCAAGGAATCATTCCTCATTCCTCGCCTTCGAACGAGATCCATAAGCGAAGATTTAAGCTCTAAATTAAGCTTCATGTCCAATTCTAATTCCCCCGATTCAGTTCTCCTTTCTCCAAAGCTTCAAACGATCTTTTCCAGCAAGGACATCGCCGGAGCGGAGTCGCCGGAGACAAACCATAAGGTGGAAATTGAGCGGAGGCCAAAAACAGAGTACAGGAGGAGGCTTAGAGGAAGAAGGACGAGACGGTCGGAAAGTCGGAGTCTTTCAGAGCTGGAATTCGAGGAGTTAAAAGGGTTTATGGATTTGGGATTTGTTTTCTCGGAAGAGGATAAAGGTTCGAGCTTGGCGTCGATAGTTCCGGGATTGAACAGGCTAGGGAAAAGGGAAGAGAAAGGAAATAAAGAAGGAGAAGAAGAAGAAGAAGAAAAAGAAGAAGAAAGAAAATTGGGTGGTGAAATTTCGAGGCCTTATCTTTCAGAAGCTTGGGAAGCTATTGCGGAAGAAGAAGAGAAAGAGGAATTGTTGAAGAGGCCATTGATGATGAAATGGAGGTTTCCTTCTAATCAGATTGATATGAAAGATAATCTAAAATGGTGGGCTCATGCTGTTGCTTCTACTGTCAGATGA
Protein sequence
MDFDHLLNLFDSFWFQRQVLNNHPFPSNPQILQPQIQDPDPLPKESFLIPRLRTRSISEDLSSKLSFMSNSNSPDSVLLSPKLQTIFSSKDIAGAESPETNHKVEIERRPKTEYRRRLRGRRTRRSESRSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEKGNKEGEEEEEEKEEERKLGGEISRPYLSEAWEAIAEEEEKEELLKRPLMMKWRFPSNQIDMKDNLKWWAHAVASTVR*
Homology
BLAST of CsGy1G021600 vs. NCBI nr
Match:
XP_031737252.1 (uncharacterized protein LOC105434498 [Cucumis sativus] >KGN65608.1 hypothetical protein Csa_019529 [Cucumis sativus])
HSP 1 Score: 475 bits (1223), Expect = 1.26e-168
Identity = 250/251 (99.60%), Postives = 251/251 (100.00%), Query Frame = 0
Query: 1 MDFDHLLNLFDSFWFQRQVLNNHPFPSNPQILQPQIQDPDPLPKESFLIPRLRTRSISED 60
MDFDHLLNLFDSFWFQRQVLNNHPFPSNPQILQPQIQDPDPLPKESFLIPRLRTRSISED
Sbjct: 1 MDFDHLLNLFDSFWFQRQVLNNHPFPSNPQILQPQIQDPDPLPKESFLIPRLRTRSISED 60
Query: 61 LSSKLSFMSNSNSPDSVLLSPKLQTIFSSKDIAGAESPETNHKVEIERRPKTEYRRRLRG 120
LSSKLSFMSNSNSPDSVLLSPKLQTIFSSKDIAGAESPET+HKVEIERRPKTEYRRRLRG
Sbjct: 61 LSSKLSFMSNSNSPDSVLLSPKLQTIFSSKDIAGAESPETSHKVEIERRPKTEYRRRLRG 120
Query: 121 RRTRRSESRSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEKGNKEGE 180
RRTRRSESRSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEKGNKEGE
Sbjct: 121 RRTRRSESRSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEKGNKEGE 180
Query: 181 EEEEEKEEERKLGGEISRPYLSEAWEAIAEEEEKEELLKRPLMMKWRFPSNQIDMKDNLK 240
EEEEEKEEERKLGGEISRPYLSEAWEAIAEEEEKEELLKRPLMMKWRFPSNQIDMKDNLK
Sbjct: 181 EEEEEKEEERKLGGEISRPYLSEAWEAIAEEEEKEELLKRPLMMKWRFPSNQIDMKDNLK 240
Query: 241 WWAHAVASTVR 251
WWAHAVASTVR
Sbjct: 241 WWAHAVASTVR 251
BLAST of CsGy1G021600 vs. NCBI nr
Match:
KAA0057058.1 (DUF1685 domain-containing protein [Cucumis melo var. makuwa])
HSP 1 Score: 409 bits (1052), Expect = 1.20e-142
Identity = 222/251 (88.45%), Postives = 233/251 (92.83%), Query Frame = 0
Query: 1 MDFDHLLNLFDSFWFQRQVLNNHPFPSNPQILQPQIQDPDPLPKESFLIPRLRTRSISED 60
MDF+ LLNLFDSFWF+R V N HPF SN LQPQIQDPD LPKESF+IPRL TRSISED
Sbjct: 1 MDFEQLLNLFDSFWFERGVFNKHPFLSN---LQPQIQDPDSLPKESFIIPRLPTRSISED 60
Query: 61 LSSKLSFMSNSNSPDSVLLSPKLQTIFSSKDIAGAESPETNHKVEIERRPKTEYRRRLRG 120
LSSKLSFMS+SNSPDSVL SPKLQTIFSSKDIAGAESPET+ K+EIERRPKTEYRRR RG
Sbjct: 61 LSSKLSFMSSSNSPDSVLFSPKLQTIFSSKDIAGAESPETSRKLEIERRPKTEYRRRFRG 120
Query: 121 RRTRRSESRSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEKGNKEGE 180
RRTRRSESRSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGK+EEK +KE
Sbjct: 121 RRTRRSESRSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKKEEKESKE-- 180
Query: 181 EEEEEKEEERKLGGEISRPYLSEAWEAIAEEEEKEELLKRPLMMKWRFPSNQIDMKDNLK 240
EEEEE+EEERKLGGEISRPYLSEAWEA+ EEEEKEEL K+PLMMKWRFPSNQIDMKDNLK
Sbjct: 181 EEEEEEEEERKLGGEISRPYLSEAWEAMEEEEEKEELAKKPLMMKWRFPSNQIDMKDNLK 240
Query: 241 WWAHAVASTVR 251
WWAHAVASTVR
Sbjct: 241 WWAHAVASTVR 246
BLAST of CsGy1G021600 vs. NCBI nr
Match:
XP_038895996.1 (uncharacterized protein LOC120084174 [Benincasa hispida])
HSP 1 Score: 372 bits (956), Expect = 3.91e-128
Identity = 204/251 (81.27%), Postives = 219/251 (87.25%), Query Frame = 0
Query: 1 MDFDHLLNLFDSFWFQRQVLNNHPFPSNPQILQPQIQDPDPLPKESFLIPRLRTRSISED 60
MDF+ LLNLFDSFWF+ ++ N HPFPSNPQ QP+ QD + LPKE F++PRLRTRSISED
Sbjct: 1 MDFEQLLNLFDSFWFEHEIFNKHPFPSNPQNPQPENQD-NSLPKEPFIVPRLRTRSISED 60
Query: 61 LSSKLSFMSNSNSPDSVLLSPKLQTIFSSKDIAGAESPETNHKVEIERRPKTEYRRRLRG 120
LSSKLSFMSNSNSPDSVL SPKLQTIFSSKDIAGAESPE + KV IERRPKTE RR+LRG
Sbjct: 61 LSSKLSFMSNSNSPDSVLFSPKLQTIFSSKDIAGAESPENSRKVGIERRPKTESRRKLRG 120
Query: 121 RRTRRSESRSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEKGNKEGE 180
RR RRSESRSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGK E
Sbjct: 121 RRMRRSESRSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKT--------E 180
Query: 181 EEEEEKEEERKLGGEISRPYLSEAWEAIAEEEEKEELLKRPLMMKWRFPSNQIDMKDNLK 240
EE+ E+EEERKLGGEISRPYLSEAWEA+ EEEE+ LK PL MKW+FPSNQIDMKDNLK
Sbjct: 181 EEDIEEEEERKLGGEISRPYLSEAWEAMEEEEEE---LKNPLTMKWKFPSNQIDMKDNLK 239
Query: 241 WWAHAVASTVR 251
WWAHAVASTVR
Sbjct: 241 WWAHAVASTVR 239
BLAST of CsGy1G021600 vs. NCBI nr
Match:
XP_016898936.1 (PREDICTED: uncharacterized protein LOC103489359 [Cucumis melo])
HSP 1 Score: 360 bits (925), Expect = 1.08e-123
Identity = 198/251 (78.88%), Postives = 208/251 (82.87%), Query Frame = 0
Query: 1 MDFDHLLNLFDSFWFQRQVLNNHPFPSNPQILQPQIQDPDPLPKESFLIPRLRTRSISED 60
MDF+ LLNLFDSFWF+R V N HPF SN LQPQIQDPD LPKESF+IPRL TRSISED
Sbjct: 1 MDFEQLLNLFDSFWFERGVFNKHPFLSN---LQPQIQDPDSLPKESFIIPRLPTRSISED 60
Query: 61 LSSKLSFMSNSNSPDSVLLSPKLQTIFSSKDIAGAESPETNHKVEIERRPKTEYRRRLRG 120
LSSKLSFMS+SNSPDSVL SPKLQTIFSSKDIAGAESPET+ K+EIERRPKTEYRRR RG
Sbjct: 61 LSSKLSFMSSSNSPDSVLFSPKLQTIFSSKDIAGAESPETSRKLEIERRPKTEYRRRFRG 120
Query: 121 RRTRRSESRSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEKGNKEGE 180
RRTRRSESRSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGK+EEK
Sbjct: 121 RRTRRSESRSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKKEEK------ 180
Query: 181 EEEEEKEEERKLGGEISRPYLSEAWEAIAEEEEKEELLKRPLMMKWRFPSNQIDMKDNLK 240
+AWEA+ EEEEKEEL K+PLMMKWRFPSNQIDMKDNLK
Sbjct: 181 ---------------------EKAWEAMEEEEEKEELAKKPLMMKWRFPSNQIDMKDNLK 221
Query: 241 WWAHAVASTVR 251
WWAHAVASTVR
Sbjct: 241 WWAHAVASTVR 221
BLAST of CsGy1G021600 vs. NCBI nr
Match:
XP_023001638.1 (uncharacterized protein LOC111495710 [Cucurbita maxima])
HSP 1 Score: 306 bits (784), Expect = 6.18e-102
Identity = 174/254 (68.50%), Postives = 201/254 (79.13%), Query Frame = 0
Query: 1 MDFDHLLNLFDSFWFQRQVLNNHPFPSNPQILQPQIQDPDPL---PKESFLIPRLRTRSI 60
MD + +LNLFDSFWF+R++ N HPFP+NPQ +P+ QD DPL P E +PR+ RSI
Sbjct: 1 MDVEQVLNLFDSFWFEREIFNKHPFPTNPQNPRPENQDRDPLKNSPPEEPFVPRICPRSI 60
Query: 61 SEDLSSKLSFMSNSNSPDSVLLSPKLQTIFSSKDIAGAESPETNHKVEIERRPKTEYRRR 120
SEDLSSKL+FMS+S+SPDSVL SPKLQTI SSKDIAG E PE + +V I R K + RRR
Sbjct: 61 SEDLSSKLTFMSDSSSPDSVLFSPKLQTILSSKDIAGEEPPEKSRRVVI--RQKRKQRRR 120
Query: 121 LRGRRTRRSESRSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEKGNK 180
+ GR R SESRSLSELEFEELKGFMDLGFVFSE DK SSLA IVPGLNRLGKR+E
Sbjct: 121 IGGRSIRGSESRSLSELEFEELKGFMDLGFVFSEGDKSSSLAWIVPGLNRLGKRDE---- 180
Query: 181 EGEEEEEEKEEERKLGGEISRPYLSEAWEAIAEEEEKEELLKRPLMMKWRFPSNQIDMKD 240
EEEEE+EEE +LGG ISRPYLSEAW A+ +EEE +K+ L+MKWR P+N+IDMKD
Sbjct: 181 ---EEEEEEEEEEELGGGISRPYLSEAWAAMEQEEE----VKKALVMKWRLPANEIDMKD 240
Query: 241 NLKWWAHAVASTVR 251
NLKWWAHAVASTVR
Sbjct: 241 NLKWWAHAVASTVR 241
BLAST of CsGy1G021600 vs. ExPASy TrEMBL
Match:
A0A0A0LV26 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G470240 PE=4 SV=1)
HSP 1 Score: 475 bits (1223), Expect = 6.09e-169
Identity = 250/251 (99.60%), Postives = 251/251 (100.00%), Query Frame = 0
Query: 1 MDFDHLLNLFDSFWFQRQVLNNHPFPSNPQILQPQIQDPDPLPKESFLIPRLRTRSISED 60
MDFDHLLNLFDSFWFQRQVLNNHPFPSNPQILQPQIQDPDPLPKESFLIPRLRTRSISED
Sbjct: 1 MDFDHLLNLFDSFWFQRQVLNNHPFPSNPQILQPQIQDPDPLPKESFLIPRLRTRSISED 60
Query: 61 LSSKLSFMSNSNSPDSVLLSPKLQTIFSSKDIAGAESPETNHKVEIERRPKTEYRRRLRG 120
LSSKLSFMSNSNSPDSVLLSPKLQTIFSSKDIAGAESPET+HKVEIERRPKTEYRRRLRG
Sbjct: 61 LSSKLSFMSNSNSPDSVLLSPKLQTIFSSKDIAGAESPETSHKVEIERRPKTEYRRRLRG 120
Query: 121 RRTRRSESRSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEKGNKEGE 180
RRTRRSESRSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEKGNKEGE
Sbjct: 121 RRTRRSESRSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEKGNKEGE 180
Query: 181 EEEEEKEEERKLGGEISRPYLSEAWEAIAEEEEKEELLKRPLMMKWRFPSNQIDMKDNLK 240
EEEEEKEEERKLGGEISRPYLSEAWEAIAEEEEKEELLKRPLMMKWRFPSNQIDMKDNLK
Sbjct: 181 EEEEEKEEERKLGGEISRPYLSEAWEAIAEEEEKEELLKRPLMMKWRFPSNQIDMKDNLK 240
Query: 241 WWAHAVASTVR 251
WWAHAVASTVR
Sbjct: 241 WWAHAVASTVR 251
BLAST of CsGy1G021600 vs. ExPASy TrEMBL
Match:
A0A5A7UPL7 (DUF1685 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold96G002460 PE=4 SV=1)
HSP 1 Score: 409 bits (1052), Expect = 5.81e-143
Identity = 222/251 (88.45%), Postives = 233/251 (92.83%), Query Frame = 0
Query: 1 MDFDHLLNLFDSFWFQRQVLNNHPFPSNPQILQPQIQDPDPLPKESFLIPRLRTRSISED 60
MDF+ LLNLFDSFWF+R V N HPF SN LQPQIQDPD LPKESF+IPRL TRSISED
Sbjct: 1 MDFEQLLNLFDSFWFERGVFNKHPFLSN---LQPQIQDPDSLPKESFIIPRLPTRSISED 60
Query: 61 LSSKLSFMSNSNSPDSVLLSPKLQTIFSSKDIAGAESPETNHKVEIERRPKTEYRRRLRG 120
LSSKLSFMS+SNSPDSVL SPKLQTIFSSKDIAGAESPET+ K+EIERRPKTEYRRR RG
Sbjct: 61 LSSKLSFMSSSNSPDSVLFSPKLQTIFSSKDIAGAESPETSRKLEIERRPKTEYRRRFRG 120
Query: 121 RRTRRSESRSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEKGNKEGE 180
RRTRRSESRSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGK+EEK +KE
Sbjct: 121 RRTRRSESRSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKKEEKESKE-- 180
Query: 181 EEEEEKEEERKLGGEISRPYLSEAWEAIAEEEEKEELLKRPLMMKWRFPSNQIDMKDNLK 240
EEEEE+EEERKLGGEISRPYLSEAWEA+ EEEEKEEL K+PLMMKWRFPSNQIDMKDNLK
Sbjct: 181 EEEEEEEEERKLGGEISRPYLSEAWEAMEEEEEKEELAKKPLMMKWRFPSNQIDMKDNLK 240
Query: 241 WWAHAVASTVR 251
WWAHAVASTVR
Sbjct: 241 WWAHAVASTVR 246
BLAST of CsGy1G021600 vs. ExPASy TrEMBL
Match:
A0A1S4DSH1 (uncharacterized protein LOC103489359 OS=Cucumis melo OX=3656 GN=LOC103489359 PE=4 SV=1)
HSP 1 Score: 360 bits (925), Expect = 5.25e-124
Identity = 198/251 (78.88%), Postives = 208/251 (82.87%), Query Frame = 0
Query: 1 MDFDHLLNLFDSFWFQRQVLNNHPFPSNPQILQPQIQDPDPLPKESFLIPRLRTRSISED 60
MDF+ LLNLFDSFWF+R V N HPF SN LQPQIQDPD LPKESF+IPRL TRSISED
Sbjct: 1 MDFEQLLNLFDSFWFERGVFNKHPFLSN---LQPQIQDPDSLPKESFIIPRLPTRSISED 60
Query: 61 LSSKLSFMSNSNSPDSVLLSPKLQTIFSSKDIAGAESPETNHKVEIERRPKTEYRRRLRG 120
LSSKLSFMS+SNSPDSVL SPKLQTIFSSKDIAGAESPET+ K+EIERRPKTEYRRR RG
Sbjct: 61 LSSKLSFMSSSNSPDSVLFSPKLQTIFSSKDIAGAESPETSRKLEIERRPKTEYRRRFRG 120
Query: 121 RRTRRSESRSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEKGNKEGE 180
RRTRRSESRSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGK+EEK
Sbjct: 121 RRTRRSESRSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKKEEK------ 180
Query: 181 EEEEEKEEERKLGGEISRPYLSEAWEAIAEEEEKEELLKRPLMMKWRFPSNQIDMKDNLK 240
+AWEA+ EEEEKEEL K+PLMMKWRFPSNQIDMKDNLK
Sbjct: 181 ---------------------EKAWEAMEEEEEKEELAKKPLMMKWRFPSNQIDMKDNLK 221
Query: 241 WWAHAVASTVR 251
WWAHAVASTVR
Sbjct: 241 WWAHAVASTVR 221
BLAST of CsGy1G021600 vs. ExPASy TrEMBL
Match:
A0A6J1KR35 (uncharacterized protein LOC111495710 OS=Cucurbita maxima OX=3661 GN=LOC111495710 PE=4 SV=1)
HSP 1 Score: 306 bits (784), Expect = 2.99e-102
Identity = 174/254 (68.50%), Postives = 201/254 (79.13%), Query Frame = 0
Query: 1 MDFDHLLNLFDSFWFQRQVLNNHPFPSNPQILQPQIQDPDPL---PKESFLIPRLRTRSI 60
MD + +LNLFDSFWF+R++ N HPFP+NPQ +P+ QD DPL P E +PR+ RSI
Sbjct: 1 MDVEQVLNLFDSFWFEREIFNKHPFPTNPQNPRPENQDRDPLKNSPPEEPFVPRICPRSI 60
Query: 61 SEDLSSKLSFMSNSNSPDSVLLSPKLQTIFSSKDIAGAESPETNHKVEIERRPKTEYRRR 120
SEDLSSKL+FMS+S+SPDSVL SPKLQTI SSKDIAG E PE + +V I R K + RRR
Sbjct: 61 SEDLSSKLTFMSDSSSPDSVLFSPKLQTILSSKDIAGEEPPEKSRRVVI--RQKRKQRRR 120
Query: 121 LRGRRTRRSESRSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEKGNK 180
+ GR R SESRSLSELEFEELKGFMDLGFVFSE DK SSLA IVPGLNRLGKR+E
Sbjct: 121 IGGRSIRGSESRSLSELEFEELKGFMDLGFVFSEGDKSSSLAWIVPGLNRLGKRDE---- 180
Query: 181 EGEEEEEEKEEERKLGGEISRPYLSEAWEAIAEEEEKEELLKRPLMMKWRFPSNQIDMKD 240
EEEEE+EEE +LGG ISRPYLSEAW A+ +EEE +K+ L+MKWR P+N+IDMKD
Sbjct: 181 ---EEEEEEEEEEELGGGISRPYLSEAWAAMEQEEE----VKKALVMKWRLPANEIDMKD 240
Query: 241 NLKWWAHAVASTVR 251
NLKWWAHAVASTVR
Sbjct: 241 NLKWWAHAVASTVR 241
BLAST of CsGy1G021600 vs. ExPASy TrEMBL
Match:
A0A6J1EH41 (uncharacterized protein LOC111434058 OS=Cucurbita moschata OX=3662 GN=LOC111434058 PE=4 SV=1)
HSP 1 Score: 291 bits (745), Expect = 1.99e-96
Identity = 166/254 (65.35%), Postives = 193/254 (75.98%), Query Frame = 0
Query: 1 MDFDHLLNLFDSFWFQRQVLNNHPFPSNPQILQPQIQDPDPL---PKESFLIPRLRTRSI 60
MD + +L+LFDS WF+R++ N HPFP+NPQ +P+ QD DPL P E +PR+ RSI
Sbjct: 1 MDVEQVLDLFDSLWFEREIFNKHPFPTNPQNPRPENQDRDPLKNSPPEEPFVPRICPRSI 60
Query: 61 SEDLSSKLSFMSNSNSPDSVLLSPKLQTIFSSKDIAGAESPETNHKVEIERRPKTEYRRR 120
SEDLSSKL+FMS+S+SPDSVL SPKLQTI SSK+IAG E PE + +V I R K + RRR
Sbjct: 61 SEDLSSKLTFMSDSSSPDSVLFSPKLQTILSSKEIAGEEPPEKSRRVVI--RQKRKQRRR 120
Query: 121 LRGRRTRRSESRSLSELEFEELKGFMDLGFVFSEEDKGSSLASIVPGLNRLGKREEKGNK 180
+ GR R SESRSLSELEFEE+KGFMDLGFVFSE DK SSLA IVPGLNRLGKR+E
Sbjct: 121 IGGRSIRGSESRSLSELEFEEVKGFMDLGFVFSEGDKSSSLAWIVPGLNRLGKRDE---- 180
Query: 181 EGEEEEEEKEEERKLGGEISRPYLSEAWEAIAEEEEKEELLKRPLMMKWRFPSNQIDMKD 240
EE +LGG ISRPYLSEAW A+ EEEE LK+ L+MKWR P+N+IDMKD
Sbjct: 181 ----------EEEELGGGISRPYLSEAWAAMEEEEE----LKKALVMKWRLPANEIDMKD 234
Query: 241 NLKWWAHAVASTVR 251
NLKWWAHAVASTVR
Sbjct: 241 NLKWWAHAVASTVR 234
BLAST of CsGy1G021600 vs. TAIR 10
Match:
AT2G42760.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881); Has 170 Blast hits to 164 proteins in 34 species: Archae - 0; Bacteria - 1; Metazoa - 26; Fungi - 10; Plants - 107; Viruses - 0; Other Eukaryotes - 26 (source: NCBI BLink). )
HSP 1 Score: 128.3 bits (321), Expect = 9.0e-30
Identity = 107/274 (39.05%), Postives = 159/274 (58.03%), Query Frame = 0
Query: 4 DHLLNLFDSFWFQRQVLNNHPFPSN---------PQILQPQIQDPDPLPK--ESFLIPRL 63
+ LL LF+ W +R + N +IL+ + ++ + L SFL+ R
Sbjct: 4 EELLKLFEQNWSERPIFKKDKENLNGKSREKRGEKEILEER-REEEALKNFPVSFLVERA 63
Query: 64 RTRSISEDLSSKLSFMSNSN-----SPDSVL-LSP---KLQTIFSSKDIAGAESPETNHK 123
+ SSK S S+S+ SP SVL + P KLQTI S K++ A + +
Sbjct: 64 MSDETMMTTSSKTSLFSSSSDDLFLSPRSVLPVKPTPMKLQTILSGKEV-NAFTIAERER 123
Query: 124 VEIERRPKTEYRRRLRGRRTRRSESRSLSELEFEELKGFMDLGFVFSEED-KGSSLASIV 183
+ E+ + + +++ RTR+ +S+S+LE+EELKGFMDLGFVFSE+D K S L SI+
Sbjct: 124 LLSEKEEQRKKKKKKSNVRTRK--GKSMSDLEYEELKGFMDLGFVFSEDDHKDSDLVSIL 183
Query: 184 PGLNRLGKREEKGNKEGEEEEEEKEEERKLGG-EISRPYLSEAWEAIAEEEEKEELLKRP 243
PGL RL K+++ K EEEE+EEE K+GG +RPYLSEAW+ + K+++
Sbjct: 184 PGLQRLVKKDDGVTK----EEEEEEEEDKIGGNRAARPYLSEAWDHCGGRKGKKQITPE- 243
Query: 244 LMMKWRFP----SNQIDMKDNLKWWAHAVASTVR 252
+KWR P ++++D+KDNL+ WAHAVAST+R
Sbjct: 244 --IKWRVPAPAAASEVDLKDNLRLWAHAVASTIR 266
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_031737252.1 | 1.26e-168 | 99.60 | uncharacterized protein LOC105434498 [Cucumis sativus] >KGN65608.1 hypothetical ... | [more] |
KAA0057058.1 | 1.20e-142 | 88.45 | DUF1685 domain-containing protein [Cucumis melo var. makuwa] | [more] |
XP_038895996.1 | 3.91e-128 | 81.27 | uncharacterized protein LOC120084174 [Benincasa hispida] | [more] |
XP_016898936.1 | 1.08e-123 | 78.88 | PREDICTED: uncharacterized protein LOC103489359 [Cucumis melo] | [more] |
XP_023001638.1 | 6.18e-102 | 68.50 | uncharacterized protein LOC111495710 [Cucurbita maxima] | [more] |
Match Name | E-value | Identity | Description | |
A0A0A0LV26 | 6.09e-169 | 99.60 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G470240 PE=4 SV=1 | [more] |
A0A5A7UPL7 | 5.81e-143 | 88.45 | DUF1685 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2... | [more] |
A0A1S4DSH1 | 5.25e-124 | 78.88 | uncharacterized protein LOC103489359 OS=Cucumis melo OX=3656 GN=LOC103489359 PE=... | [more] |
A0A6J1KR35 | 2.99e-102 | 68.50 | uncharacterized protein LOC111495710 OS=Cucurbita maxima OX=3661 GN=LOC111495710... | [more] |
A0A6J1EH41 | 1.99e-96 | 65.35 | uncharacterized protein LOC111434058 OS=Cucurbita moschata OX=3662 GN=LOC1114340... | [more] |
Match Name | E-value | Identity | Description | |
AT2G42760.1 | 9.0e-30 | 39.05 | unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685... | [more] |