Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAGACCGTCACCGGAACTGTCGTTTCTTCGAAGCCGATCTCTATCTCCAAAGCGGCGTCCACCCTCTCCTCCTTCCTCTCCGTCGACAATGGCGCTTCGCAAGCACTCTGTGCCTATCTGAGGCGCGCCTCCGCCTCTTTCAACGAGTTAAAGCAGCTCCACAAGGAGCTGAAGTCTTCTCGTTCCGTTCGGAAGCACCTGCATCAAGGGTCCGAGGTTTCAAACGAGTTAGAGGCTGCCTTAGATAATCCATGTCGAGTCGAAGACGGCGAGAAGAAAAAATCTTCAGCCTCTGCGAGGATGAAGCGGCCAGACAGTAGGGACGAAACTAGGGATAAACCGAGTCTTAGAGTTCAATCTGATGATGTGCAGATTGGGAAAACAGTAATGGAAAACGGTGGGAGTGGTAAATTTGTGGATGTATCAGGGGAAGATGGAAAGAGAAAGGGCGGCGACTTGAAGATTGAAATTGAAGATAAACCTAGCGGAAAAGTTGAGATGGATGTGGAATCAAGTGATAGAGATAAGAGCGTTGTAGCAGTTGAGAAAAAGAGAAAAAAGCACAAGAAAAAGAGCGAGGATAAACATGGTAACATTGAAGACGATGAACGTGATTCTGGAGCTAGGCTAAGTCATAGTAAATCGCAAAATAGTGATAATAATGGCGATATTGAAGCTTCTGGGGAGTTCGTTCAGAACAATGTAGCAAAGGGGAAAGTTAGAAAGAAGCGTGAGGACAAGAGTTTGGGTGATGAGAAGGATCAAGTAAAGGATGAAGGTCAGAGAAGAAGAGACATGGAGGAGGAAAAAAACACAGATAAGGATAATGATGACGGAACAGATCTTGTGGATCTATCGACCAAGAAGAAGAAGAAGAAGAAGAAGAAAAGGGAAGAAGATGTTGATGATTTTCAAAATAACCGTGGAGGAGCTATGGTGAAGGAGGAAGTGCCAGTTCCGGATAGCAAAGAGTCGAAGAGGAAAGAGAGGAAAAAGAGGAAGAATCGAGAGTTAGGAGAGGAAGGGGGTGATGATGGGTCAGAGGAGCAACAGGGTACGAAGAGAAGAAAAGGATGA
mRNA sequence
ATGAAGACCGTCACCGGAACTGTCGTTTCTTCGAAGCCGATCTCTATCTCCAAAGCGGCGTCCACCCTCTCCTCCTTCCTCTCCGTCGACAATGGCGCTTCGCAAGCACTCTGTGCCTATCTGAGGCGCGCCTCCGCCTCTTTCAACGAGTTAAAGCAGCTCCACAAGGAGCTGAAGTCTTCTCGTTCCGTTCGGAAGCACCTGCATCAAGGGTCCGAGGTTTCAAACGAGTTAGAGGCTGCCTTAGATAATCCATGTCGAGTCGAAGACGGCGAGAAGAAAAAATCTTCAGCCTCTGCGAGGATGAAGCGGCCAGACAGTAGGGACGAAACTAGGGATAAACCGAGTCTTAGAGTTCAATCTGATGATGTGCAGATTGGGAAAACAGTAATGGAAAACGGTGGGAGTGGTAAATTTGTGGATGTATCAGGGGAAGATGGAAAGAGAAAGGGCGGCGACTTGAAGATTGAAATTGAAGATAAACCTAGCGGAAAAGTTGAGATGGATGTGGAATCAAGTGATAGAGATAAGAGCGTTGTAGCAGTTGAGAAAAAGAGAAAAAAGCACAAGAAAAAGAGCGAGGATAAACATGGTAACATTGAAGACGATGAACGTGATTCTGGAGCTAGGCTAAGTCATAGTAAATCGCAAAATAGTGATAATAATGGCGATATTGAAGCTTCTGGGGAGTTCGTTCAGAACAATGTAGCAAAGGGGAAAGTTAGAAAGAAGCGTGAGGACAAGAGTTTGGGTGATGAGAAGGATCAAGTAAAGGATGAAGGTCAGAGAAGAAGAGACATGGAGGAGGAAAAAAACACAGATAAGGATAATGATGACGGAACAGATCTTGTGGATCTATCGACCAAGAAGAAGAAGAAGAAGAAGAAGAAAAGGGAAGAAGATGTTGATGATTTTCAAAATAACCGTGGAGGAGCTATGGTGAAGGAGGAAGTGCCAGTTCCGGATAGCAAAGAGTCGAAGAGGAAAGAGAGGAAAAAGAGGAAGAATCGAGAGTTAGGAGAGGAAGGGGGTGATGATGGGTCAGAGGAGCAACAGGGTACGAAGAGAAGAAAAGGATGA
Coding sequence (CDS)
ATGAAGACCGTCACCGGAACTGTCGTTTCTTCGAAGCCGATCTCTATCTCCAAAGCGGCGTCCACCCTCTCCTCCTTCCTCTCCGTCGACAATGGCGCTTCGCAAGCACTCTGTGCCTATCTGAGGCGCGCCTCCGCCTCTTTCAACGAGTTAAAGCAGCTCCACAAGGAGCTGAAGTCTTCTCGTTCCGTTCGGAAGCACCTGCATCAAGGGTCCGAGGTTTCAAACGAGTTAGAGGCTGCCTTAGATAATCCATGTCGAGTCGAAGACGGCGAGAAGAAAAAATCTTCAGCCTCTGCGAGGATGAAGCGGCCAGACAGTAGGGACGAAACTAGGGATAAACCGAGTCTTAGAGTTCAATCTGATGATGTGCAGATTGGGAAAACAGTAATGGAAAACGGTGGGAGTGGTAAATTTGTGGATGTATCAGGGGAAGATGGAAAGAGAAAGGGCGGCGACTTGAAGATTGAAATTGAAGATAAACCTAGCGGAAAAGTTGAGATGGATGTGGAATCAAGTGATAGAGATAAGAGCGTTGTAGCAGTTGAGAAAAAGAGAAAAAAGCACAAGAAAAAGAGCGAGGATAAACATGGTAACATTGAAGACGATGAACGTGATTCTGGAGCTAGGCTAAGTCATAGTAAATCGCAAAATAGTGATAATAATGGCGATATTGAAGCTTCTGGGGAGTTCGTTCAGAACAATGTAGCAAAGGGGAAAGTTAGAAAGAAGCGTGAGGACAAGAGTTTGGGTGATGAGAAGGATCAAGTAAAGGATGAAGGTCAGAGAAGAAGAGACATGGAGGAGGAAAAAAACACAGATAAGGATAATGATGACGGAACAGATCTTGTGGATCTATCGACCAAGAAGAAGAAGAAGAAGAAGAAGAAAAGGGAAGAAGATGTTGATGATTTTCAAAATAACCGTGGAGGAGCTATGGTGAAGGAGGAAGTGCCAGTTCCGGATAGCAAAGAGTCGAAGAGGAAAGAGAGGAAAAAGAGGAAGAATCGAGAGTTAGGAGAGGAAGGGGGTGATGATGGGTCAGAGGAGCAACAGGGTACGAAGAGAAGAAAAGGATGA
Protein sequence
MKTVTGTVVSSKPISISKAASTLSSFLSVDNGASQALCAYLRRASASFNELKQLHKELKSSRSVRKHLHQGSEVSNELEAALDNPCRVEDGEKKKSSASARMKRPDSRDETRDKPSLRVQSDDVQIGKTVMENGGSGKFVDVSGEDGKRKGGDLKIEIEDKPSGKVEMDVESSDRDKSVVAVEKKRKKHKKKSEDKHGNIEDDERDSGARLSHSKSQNSDNNGDIEASGEFVQNNVAKGKVRKKREDKSLGDEKDQVKDEGQRRRDMEEEKNTDKDNDDGTDLVDLSTKKKKKKKKKREEDVDDFQNNRGGAMVKEEVPVPDSKESKRKERKKRKNRELGEEGGDDGSEEQQGTKRRKG
Homology
BLAST of Cla97C02G046710 vs. NCBI nr
Match:
XP_038902882.1 (probable xyloglucan galactosyltransferase GT11 [Benincasa hispida])
HSP 1 Score: 520.0 bits (1338), Expect = 1.6e-143
Identity = 307/361 (85.04%), Postives = 328/361 (90.86%), Query Frame = 0
Query: 1 MKTVTGTVVSSKPISISKAASTLSSFLSVDNGASQALCAYLRRASASFNELKQLHKELKS 60
MKTVTG+VVSSKPISISKAASTLSSFLSVDNGASQALCAYLRRASASFNELKQLHKELKS
Sbjct: 1 MKTVTGSVVSSKPISISKAASTLSSFLSVDNGASQALCAYLRRASASFNELKQLHKELKS 60
Query: 61 SRSVRKHLHQGSEVSNELEAALDNPCRVEDGEKKKSSASARMKRPDSRDETRDKPSLRVQ 120
SRSVRKHLH GSEVSNELEAALDN RVEDGEKKKSS S R KRP E+R+KPS RVQ
Sbjct: 61 SRSVRKHLHHGSEVSNELEAALDNSYRVEDGEKKKSSVSERKKRP----ESRNKPSARVQ 120
Query: 121 SDDVQIGKTVMENGGSGKFVDVSGEDGKRKGGDLKIEIEDKPSGKVEMDVESSDRDKSVV 180
S+D +I KT MENGG+GK DV GEDGKRKGG+LKIEIEDKP+ KVEMDVESSDRDK VV
Sbjct: 121 SEDERIWKTTMENGGNGKLEDVLGEDGKRKGGELKIEIEDKPNRKVEMDVESSDRDKGVV 180
Query: 181 AVEKKRKKHKKKSEDKHGNIEDDERDSGARLSHSKSQNSDNNGDIEASGEFVQNNVAKGK 240
AVEKKRKKHKKK+EDKHGNIEDDERDSGARLSH+KSQNSDNNG+IEASGEFV+NNVA+ K
Sbjct: 181 AVEKKRKKHKKKNEDKHGNIEDDERDSGARLSHNKSQNSDNNGNIEASGEFVENNVAREK 240
Query: 241 VRKKRED-KSLGDEKDQVKDEGQRRRDMEEEKNTDKDNDDGTDLVDLST-KKKKKKKKKR 300
V KK ED KSLGDEKDQVK E QRRRD+EEEK +KDNDDGTD+VDLST KKKKKKKKKR
Sbjct: 241 VEKKHEDKKSLGDEKDQVKTEVQRRRDIEEEKGINKDNDDGTDIVDLSTKKKKKKKKKKR 300
Query: 301 EEDVDDFQNNRGGAMVKEEVPVPDSKESKRKERKKRKNRELGEEGGDDGSEEQQGTKRRK 360
EEDVDDFQNN GGAMV +E+PV +SKE KRK+RKKRKNRELGEEGGDD SEE+QGTKRRK
Sbjct: 301 EEDVDDFQNNSGGAMVNDEMPVSNSKELKRKDRKKRKNRELGEEGGDDVSEEKQGTKRRK 357
BLAST of Cla97C02G046710 vs. NCBI nr
Match:
KAA0035280.1 (glutamic acid-rich protein [Cucumis melo var. makuwa])
HSP 1 Score: 459.5 bits (1181), Expect = 2.6e-125
Identity = 285/361 (78.95%), Postives = 306/361 (84.76%), Query Frame = 0
Query: 1 MKTVTGTVVSSKPISISKAASTLSSFLSVDNGASQALCAYLRRASASFNELKQLHKELKS 60
MKTVTG+VVSSKPISISKAASTLSSFLS DNGAS+ALCAYLRRAS SFNELK LHKELKS
Sbjct: 1 MKTVTGSVVSSKPISISKAASTLSSFLSADNGASKALCAYLRRASDSFNELKHLHKELKS 60
Query: 61 SRSVRKHLHQGSEVSNELEAALDNPCRVEDGEKKKSSASARMKRPDSRDETRDKPSLRVQ 120
S SVRKHLH GS+VSNE EAA+DN RVEDG+KK SS S + KRPDS+ T DK SLRVQ
Sbjct: 61 SPSVRKHLHHGSKVSNEFEAAMDNEYRVEDGDKKNSSVSEKKKRPDSKYRTTDKTSLRVQ 120
Query: 121 SDDVQIGKTVMENGGSGKFVDVSGEDGKRKGGDLKIEIEDKPSGKVEMDVESSDRDKSVV 180
SDD Q GKT MENGG+G DVS GKRKGG LKIEIEDKPSGKVEMDVESSD VV
Sbjct: 121 SDDEQSGKTAMENGGNGNLEDVS---GKRKGGGLKIEIEDKPSGKVEMDVESSD----VV 180
Query: 181 AVEKKRKKHKKKSEDKHGNIEDDERDSGARLSHSKSQNSDNNGD-IEASGEFVQNNVAKG 240
AVEKKRKKHKKKSED+HG+IEDDER+SGARL H KSQN+DNN D EASGEFV+NNVAKG
Sbjct: 181 AVEKKRKKHKKKSEDRHGDIEDDERESGARLKHGKSQNTDNNCDNAEASGEFVENNVAKG 240
Query: 241 KVRKKREDK-SLGDEKDQVKDEGQRRRDMEEEKNTDKDNDDGTDLVDLSTKKKKKKKKKR 300
K RKK EDK SLGD KDQVK E QRR D++EE++TD DN +GTDLVDLST KKKKK+K+R
Sbjct: 241 KSRKKLEDKRSLGDVKDQVKSEDQRRGDIKEERSTDNDNGNGTDLVDLST-KKKKKRKQR 300
Query: 301 EEDVDDFQNNRGGAMVKEEVPVPDSKESKRKERKKRKNRELGEEGGDDGSEEQQGTKRRK 360
EED DDFQ N GGAMVKEEVPV DSKE KRKE+KK KNRELGEEG DDGSEEQ KRRK
Sbjct: 301 EED-DDFQKNSGGAMVKEEVPVLDSKELKRKEKKKSKNRELGEEGHDDGSEEQHSRKRRK 352
BLAST of Cla97C02G046710 vs. NCBI nr
Match:
XP_008463862.1 (PREDICTED: glutamic acid-rich protein [Cucumis melo] >TYK14356.1 glutamic acid-rich protein [Cucumis melo var. makuwa])
HSP 1 Score: 456.4 bits (1173), Expect = 2.2e-124
Identity = 284/361 (78.67%), Postives = 305/361 (84.49%), Query Frame = 0
Query: 1 MKTVTGTVVSSKPISISKAASTLSSFLSVDNGASQALCAYLRRASASFNELKQLHKELKS 60
MKTVTG+VVSSKPISISKAASTLSSFLS DNGAS+ALCAYLRRAS SFNELK LHKELKS
Sbjct: 1 MKTVTGSVVSSKPISISKAASTLSSFLSADNGASKALCAYLRRASDSFNELKHLHKELKS 60
Query: 61 SRSVRKHLHQGSEVSNELEAALDNPCRVEDGEKKKSSASARMKRPDSRDETRDKPSLRVQ 120
S SVRKHLH GS+VSNE EAA+DN RVEDG+KK SS S + KRPDS+ T DK SLRVQ
Sbjct: 61 SPSVRKHLHHGSKVSNEFEAAMDNEYRVEDGDKKNSSVSEKKKRPDSKYRTTDKTSLRVQ 120
Query: 121 SDDVQIGKTVMENGGSGKFVDVSGEDGKRKGGDLKIEIEDKPSGKVEMDVESSDRDKSVV 180
SDD Q GKT MENGG+G DVS GKRKGG LKIEIEDKPSGKVEMDVESSD VV
Sbjct: 121 SDDEQSGKTAMENGGNGNLEDVS---GKRKGGGLKIEIEDKPSGKVEMDVESSD----VV 180
Query: 181 AVEKKRKKHKKKSEDKHGNIEDDERDSGARLSHSKSQNSDNNGD-IEASGEFVQNNVAKG 240
AVEKKRKKHKKKSED+HG+IEDDER+SGARL H KSQN+DNN D EASGEFV+NNVAKG
Sbjct: 181 AVEKKRKKHKKKSEDRHGDIEDDERESGARLKHGKSQNTDNNCDNAEASGEFVENNVAKG 240
Query: 241 KVRKKREDK-SLGDEKDQVKDEGQRRRDMEEEKNTDKDNDDGTDLVDLSTKKKKKKKKKR 300
K RKK EDK SLGD KDQVK E QRR D++EE++TD DN +GTDLVDLST KKKKK+K+R
Sbjct: 241 KSRKKLEDKRSLGDVKDQVKSEDQRRGDIKEERSTDNDNGNGTDLVDLST-KKKKKRKQR 300
Query: 301 EEDVDDFQNNRGGAMVKEEVPVPDSKESKRKERKKRKNRELGEEGGDDGSEEQQGTKRRK 360
EED DDFQ N G AMVKEEVPV DSKE KRKE+KK KNRELGEEG DDGSEEQ KRRK
Sbjct: 301 EED-DDFQKNSGEAMVKEEVPVLDSKELKRKEKKKSKNRELGEEGHDDGSEEQHSRKRRK 352
BLAST of Cla97C02G046710 vs. NCBI nr
Match:
XP_004148227.1 (uncharacterized protein DDB_G0283697 [Cucumis sativus] >KGN47333.1 hypothetical protein Csa_022954 [Cucumis sativus])
HSP 1 Score: 446.8 bits (1148), Expect = 1.7e-121
Identity = 275/360 (76.39%), Postives = 301/360 (83.61%), Query Frame = 0
Query: 1 MKTVTGTVVSSKPISISKAASTLSSFLSVDNGASQALCAYLRRASASFNELKQLHKELKS 60
MKTV G+VVSSKPISISKAASTLSSFLS DNGAS+ALCAYLRRAS SFNELKQLHKELKS
Sbjct: 1 MKTVNGSVVSSKPISISKAASTLSSFLSADNGASKALCAYLRRASDSFNELKQLHKELKS 60
Query: 61 SRSVRKHLHQGSEVSNELEAALDNPCRVEDGEKKKSSASARMKRPDSRDETRDKPSLRVQ 120
S SVRKHLH GSEVSNE EAA+ + RVEDG+K SS S + KRPD +D T DK SLRVQ
Sbjct: 61 SCSVRKHLHHGSEVSNEFEAAIHDQYRVEDGDKNNSSVSEKKKRPDRKDRTTDKTSLRVQ 120
Query: 121 SDDVQIGKTVMENGGSGKFVDVSGEDGKRKGGDLKIEIEDKPSGKVEMDVESSDRDKSVV 180
S + QIGKT MENGG+G DV+ GK+KG +LKIEIEDKPSGKVEMDVESSDRDKSVV
Sbjct: 121 SYNEQIGKTPMENGGNGNLEDVT---GKKKGSELKIEIEDKPSGKVEMDVESSDRDKSVV 180
Query: 181 AVEKKRKKHKKKSEDKHGNIEDDERDSGARLSHSKSQNSDNNGDIEASGEFVQNNVAKGK 240
AVEKKRK+HKKKSED+H +IEDDER+SGARL H KSQN+DNN D EASGEFV+NNVA GK
Sbjct: 181 AVEKKRKRHKKKSEDRHDDIEDDERESGARLKHGKSQNTDNNCDAEASGEFVENNVANGK 240
Query: 241 VRKKREDKS-LGDEKDQVKDEGQRRRDMEEEKNTDKDNDDGTDLVDLSTKKKKKKKKKRE 300
RKK EDK L D KDQVK E QRR D++E K+T+ DND+GTD VDLS KKKKK++RE
Sbjct: 241 SRKKLEDKKRLDDVKDQVKSEDQRRGDVKEGKSTNNDNDNGTDHVDLS--PKKKKKRRRE 300
Query: 301 EDVDDFQNNRGGAMVKEEVPVPDSKESKRKERKKRKNRELGEEGGDDGSEEQQGTKRRKG 360
ED DDFQ N G AMVKEEVPV DSKE KRKE+KK KNRELGEEG DDGSEEQ TKRRKG
Sbjct: 301 ED-DDFQKNSGEAMVKEEVPVLDSKELKRKEKKKSKNRELGEEGRDDGSEEQHSTKRRKG 354
BLAST of Cla97C02G046710 vs. NCBI nr
Match:
XP_022943393.1 (glutamic acid-rich protein-like [Cucurbita moschata] >XP_022943394.1 glutamic acid-rich protein-like [Cucurbita moschata] >XP_022943395.1 glutamic acid-rich protein-like [Cucurbita moschata] >XP_022943397.1 glutamic acid-rich protein-like [Cucurbita moschata] >XP_022943398.1 glutamic acid-rich protein-like [Cucurbita moschata])
HSP 1 Score: 428.7 bits (1101), Expect = 4.9e-116
Identity = 269/362 (74.31%), Postives = 298/362 (82.32%), Query Frame = 0
Query: 1 MKTVTGTVVSSKPISISKAASTLSSFLSVDNGASQALCAYLRRASASFNELKQLHKELKS 60
MKTVTG++VSSKPISISKAASTLSSFLSVDNGAS+A+CAYLRRASASFNELKQLHKELKS
Sbjct: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
Query: 61 SRSVRKHLHQGSEVSNELEAALDNPCRVEDGEKKKSSASARMKRPDSRDETRDKPSLRVQ 120
SRS RKH H GSE SN+ EA+ NP +ED EKK ++ D R KPS VQ
Sbjct: 61 SRSDRKHRHHGSEASNDPEASRGNPHWIEDDEKKN---PLYLRAKDGRS---GKPSFNVQ 120
Query: 121 SDDVQIGKTVMENGGSGKFVDVSGEDGKRKGGDLKIEIEDKPSGKVEMDVESSDRDKSVV 180
S+D + GKT E+GGSG F D SGE KRK GDLK EIEDKP+ KVEMDVESSD+DKSVV
Sbjct: 121 SEDGKDGKTEKESGGSGDFEDASGEYRKRKVGDLKTEIEDKPNRKVEMDVESSDKDKSVV 180
Query: 181 AVEKKRKKHKKKSEDKHGNIEDDERDSGARLSHSKSQNSDNNGDIEASGEFVQNNVAKGK 240
AVEKK KKHKKKSED+H IEDDER+ GAR S+SKS+NSDNNG+IEASG+FV+NN+A GK
Sbjct: 181 AVEKKGKKHKKKSEDRHAKIEDDEREDGARRSYSKSRNSDNNGEIEASGKFVENNIASGK 240
Query: 241 VRKKRED-KSLGDEKDQVKDEGQRRRDMEEEKNTDKDNDDGTDLVDLSTKKKKKKKKK-- 300
RKK ED KSLGD+KDQVK EGQRRRD EEEK+T+KDNDDGT+ STKKKKKKKKK
Sbjct: 241 DRKKHEDKKSLGDDKDQVKSEGQRRRDAEEEKSTNKDNDDGTE----STKKKKKKKKKKN 300
Query: 301 REEDVDDFQNNRGGAMVKEEVPVPDSKESKRKERKKRKNRELGEEGGDDGSEEQQGTKRR 360
REE+ DDFQNN GGAMVKEE+PV D KE KRKE+KKRKNR L EEGGDDGSEEQQ TKRR
Sbjct: 301 REEEDDDFQNNSGGAMVKEEIPVSDDKELKRKEKKKRKNRGL-EEGGDDGSEEQQRTKRR 351
BLAST of Cla97C02G046710 vs. ExPASy TrEMBL
Match:
A0A5A7SW64 (Glutamic acid-rich protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold228G00720 PE=4 SV=1)
HSP 1 Score: 459.5 bits (1181), Expect = 1.3e-125
Identity = 285/361 (78.95%), Postives = 306/361 (84.76%), Query Frame = 0
Query: 1 MKTVTGTVVSSKPISISKAASTLSSFLSVDNGASQALCAYLRRASASFNELKQLHKELKS 60
MKTVTG+VVSSKPISISKAASTLSSFLS DNGAS+ALCAYLRRAS SFNELK LHKELKS
Sbjct: 1 MKTVTGSVVSSKPISISKAASTLSSFLSADNGASKALCAYLRRASDSFNELKHLHKELKS 60
Query: 61 SRSVRKHLHQGSEVSNELEAALDNPCRVEDGEKKKSSASARMKRPDSRDETRDKPSLRVQ 120
S SVRKHLH GS+VSNE EAA+DN RVEDG+KK SS S + KRPDS+ T DK SLRVQ
Sbjct: 61 SPSVRKHLHHGSKVSNEFEAAMDNEYRVEDGDKKNSSVSEKKKRPDSKYRTTDKTSLRVQ 120
Query: 121 SDDVQIGKTVMENGGSGKFVDVSGEDGKRKGGDLKIEIEDKPSGKVEMDVESSDRDKSVV 180
SDD Q GKT MENGG+G DVS GKRKGG LKIEIEDKPSGKVEMDVESSD VV
Sbjct: 121 SDDEQSGKTAMENGGNGNLEDVS---GKRKGGGLKIEIEDKPSGKVEMDVESSD----VV 180
Query: 181 AVEKKRKKHKKKSEDKHGNIEDDERDSGARLSHSKSQNSDNNGD-IEASGEFVQNNVAKG 240
AVEKKRKKHKKKSED+HG+IEDDER+SGARL H KSQN+DNN D EASGEFV+NNVAKG
Sbjct: 181 AVEKKRKKHKKKSEDRHGDIEDDERESGARLKHGKSQNTDNNCDNAEASGEFVENNVAKG 240
Query: 241 KVRKKREDK-SLGDEKDQVKDEGQRRRDMEEEKNTDKDNDDGTDLVDLSTKKKKKKKKKR 300
K RKK EDK SLGD KDQVK E QRR D++EE++TD DN +GTDLVDLST KKKKK+K+R
Sbjct: 241 KSRKKLEDKRSLGDVKDQVKSEDQRRGDIKEERSTDNDNGNGTDLVDLST-KKKKKRKQR 300
Query: 301 EEDVDDFQNNRGGAMVKEEVPVPDSKESKRKERKKRKNRELGEEGGDDGSEEQQGTKRRK 360
EED DDFQ N GGAMVKEEVPV DSKE KRKE+KK KNRELGEEG DDGSEEQ KRRK
Sbjct: 301 EED-DDFQKNSGGAMVKEEVPVLDSKELKRKEKKKSKNRELGEEGHDDGSEEQHSRKRRK 352
BLAST of Cla97C02G046710 vs. ExPASy TrEMBL
Match:
A0A5D3CVE3 (Glutamic acid-rich protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold84G00960 PE=4 SV=1)
HSP 1 Score: 456.4 bits (1173), Expect = 1.1e-124
Identity = 284/361 (78.67%), Postives = 305/361 (84.49%), Query Frame = 0
Query: 1 MKTVTGTVVSSKPISISKAASTLSSFLSVDNGASQALCAYLRRASASFNELKQLHKELKS 60
MKTVTG+VVSSKPISISKAASTLSSFLS DNGAS+ALCAYLRRAS SFNELK LHKELKS
Sbjct: 1 MKTVTGSVVSSKPISISKAASTLSSFLSADNGASKALCAYLRRASDSFNELKHLHKELKS 60
Query: 61 SRSVRKHLHQGSEVSNELEAALDNPCRVEDGEKKKSSASARMKRPDSRDETRDKPSLRVQ 120
S SVRKHLH GS+VSNE EAA+DN RVEDG+KK SS S + KRPDS+ T DK SLRVQ
Sbjct: 61 SPSVRKHLHHGSKVSNEFEAAMDNEYRVEDGDKKNSSVSEKKKRPDSKYRTTDKTSLRVQ 120
Query: 121 SDDVQIGKTVMENGGSGKFVDVSGEDGKRKGGDLKIEIEDKPSGKVEMDVESSDRDKSVV 180
SDD Q GKT MENGG+G DVS GKRKGG LKIEIEDKPSGKVEMDVESSD VV
Sbjct: 121 SDDEQSGKTAMENGGNGNLEDVS---GKRKGGGLKIEIEDKPSGKVEMDVESSD----VV 180
Query: 181 AVEKKRKKHKKKSEDKHGNIEDDERDSGARLSHSKSQNSDNNGD-IEASGEFVQNNVAKG 240
AVEKKRKKHKKKSED+HG+IEDDER+SGARL H KSQN+DNN D EASGEFV+NNVAKG
Sbjct: 181 AVEKKRKKHKKKSEDRHGDIEDDERESGARLKHGKSQNTDNNCDNAEASGEFVENNVAKG 240
Query: 241 KVRKKREDK-SLGDEKDQVKDEGQRRRDMEEEKNTDKDNDDGTDLVDLSTKKKKKKKKKR 300
K RKK EDK SLGD KDQVK E QRR D++EE++TD DN +GTDLVDLST KKKKK+K+R
Sbjct: 241 KSRKKLEDKRSLGDVKDQVKSEDQRRGDIKEERSTDNDNGNGTDLVDLST-KKKKKRKQR 300
Query: 301 EEDVDDFQNNRGGAMVKEEVPVPDSKESKRKERKKRKNRELGEEGGDDGSEEQQGTKRRK 360
EED DDFQ N G AMVKEEVPV DSKE KRKE+KK KNRELGEEG DDGSEEQ KRRK
Sbjct: 301 EED-DDFQKNSGEAMVKEEVPVLDSKELKRKEKKKSKNRELGEEGHDDGSEEQHSRKRRK 352
BLAST of Cla97C02G046710 vs. ExPASy TrEMBL
Match:
A0A1S3CK97 (glutamic acid-rich protein OS=Cucumis melo OX=3656 GN=LOC103501895 PE=4 SV=1)
HSP 1 Score: 456.4 bits (1173), Expect = 1.1e-124
Identity = 284/361 (78.67%), Postives = 305/361 (84.49%), Query Frame = 0
Query: 1 MKTVTGTVVSSKPISISKAASTLSSFLSVDNGASQALCAYLRRASASFNELKQLHKELKS 60
MKTVTG+VVSSKPISISKAASTLSSFLS DNGAS+ALCAYLRRAS SFNELK LHKELKS
Sbjct: 1 MKTVTGSVVSSKPISISKAASTLSSFLSADNGASKALCAYLRRASDSFNELKHLHKELKS 60
Query: 61 SRSVRKHLHQGSEVSNELEAALDNPCRVEDGEKKKSSASARMKRPDSRDETRDKPSLRVQ 120
S SVRKHLH GS+VSNE EAA+DN RVEDG+KK SS S + KRPDS+ T DK SLRVQ
Sbjct: 61 SPSVRKHLHHGSKVSNEFEAAMDNEYRVEDGDKKNSSVSEKKKRPDSKYRTTDKTSLRVQ 120
Query: 121 SDDVQIGKTVMENGGSGKFVDVSGEDGKRKGGDLKIEIEDKPSGKVEMDVESSDRDKSVV 180
SDD Q GKT MENGG+G DVS GKRKGG LKIEIEDKPSGKVEMDVESSD VV
Sbjct: 121 SDDEQSGKTAMENGGNGNLEDVS---GKRKGGGLKIEIEDKPSGKVEMDVESSD----VV 180
Query: 181 AVEKKRKKHKKKSEDKHGNIEDDERDSGARLSHSKSQNSDNNGD-IEASGEFVQNNVAKG 240
AVEKKRKKHKKKSED+HG+IEDDER+SGARL H KSQN+DNN D EASGEFV+NNVAKG
Sbjct: 181 AVEKKRKKHKKKSEDRHGDIEDDERESGARLKHGKSQNTDNNCDNAEASGEFVENNVAKG 240
Query: 241 KVRKKREDK-SLGDEKDQVKDEGQRRRDMEEEKNTDKDNDDGTDLVDLSTKKKKKKKKKR 300
K RKK EDK SLGD KDQVK E QRR D++EE++TD DN +GTDLVDLST KKKKK+K+R
Sbjct: 241 KSRKKLEDKRSLGDVKDQVKSEDQRRGDIKEERSTDNDNGNGTDLVDLST-KKKKKRKQR 300
Query: 301 EEDVDDFQNNRGGAMVKEEVPVPDSKESKRKERKKRKNRELGEEGGDDGSEEQQGTKRRK 360
EED DDFQ N G AMVKEEVPV DSKE KRKE+KK KNRELGEEG DDGSEEQ KRRK
Sbjct: 301 EED-DDFQKNSGEAMVKEEVPVLDSKELKRKEKKKSKNRELGEEGHDDGSEEQHSRKRRK 352
BLAST of Cla97C02G046710 vs. ExPASy TrEMBL
Match:
A0A0A0KCS1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G301030 PE=4 SV=1)
HSP 1 Score: 446.8 bits (1148), Expect = 8.4e-122
Identity = 275/360 (76.39%), Postives = 301/360 (83.61%), Query Frame = 0
Query: 1 MKTVTGTVVSSKPISISKAASTLSSFLSVDNGASQALCAYLRRASASFNELKQLHKELKS 60
MKTV G+VVSSKPISISKAASTLSSFLS DNGAS+ALCAYLRRAS SFNELKQLHKELKS
Sbjct: 1 MKTVNGSVVSSKPISISKAASTLSSFLSADNGASKALCAYLRRASDSFNELKQLHKELKS 60
Query: 61 SRSVRKHLHQGSEVSNELEAALDNPCRVEDGEKKKSSASARMKRPDSRDETRDKPSLRVQ 120
S SVRKHLH GSEVSNE EAA+ + RVEDG+K SS S + KRPD +D T DK SLRVQ
Sbjct: 61 SCSVRKHLHHGSEVSNEFEAAIHDQYRVEDGDKNNSSVSEKKKRPDRKDRTTDKTSLRVQ 120
Query: 121 SDDVQIGKTVMENGGSGKFVDVSGEDGKRKGGDLKIEIEDKPSGKVEMDVESSDRDKSVV 180
S + QIGKT MENGG+G DV+ GK+KG +LKIEIEDKPSGKVEMDVESSDRDKSVV
Sbjct: 121 SYNEQIGKTPMENGGNGNLEDVT---GKKKGSELKIEIEDKPSGKVEMDVESSDRDKSVV 180
Query: 181 AVEKKRKKHKKKSEDKHGNIEDDERDSGARLSHSKSQNSDNNGDIEASGEFVQNNVAKGK 240
AVEKKRK+HKKKSED+H +IEDDER+SGARL H KSQN+DNN D EASGEFV+NNVA GK
Sbjct: 181 AVEKKRKRHKKKSEDRHDDIEDDERESGARLKHGKSQNTDNNCDAEASGEFVENNVANGK 240
Query: 241 VRKKREDKS-LGDEKDQVKDEGQRRRDMEEEKNTDKDNDDGTDLVDLSTKKKKKKKKKRE 300
RKK EDK L D KDQVK E QRR D++E K+T+ DND+GTD VDLS KKKKK++RE
Sbjct: 241 SRKKLEDKKRLDDVKDQVKSEDQRRGDVKEGKSTNNDNDNGTDHVDLS--PKKKKKRRRE 300
Query: 301 EDVDDFQNNRGGAMVKEEVPVPDSKESKRKERKKRKNRELGEEGGDDGSEEQQGTKRRKG 360
ED DDFQ N G AMVKEEVPV DSKE KRKE+KK KNRELGEEG DDGSEEQ TKRRKG
Sbjct: 301 ED-DDFQKNSGEAMVKEEVPVLDSKELKRKEKKKSKNRELGEEGRDDGSEEQHSTKRRKG 354
BLAST of Cla97C02G046710 vs. ExPASy TrEMBL
Match:
A0A6J1FSX8 (glutamic acid-rich protein-like OS=Cucurbita moschata OX=3662 GN=LOC111448174 PE=4 SV=1)
HSP 1 Score: 428.7 bits (1101), Expect = 2.4e-116
Identity = 269/362 (74.31%), Postives = 298/362 (82.32%), Query Frame = 0
Query: 1 MKTVTGTVVSSKPISISKAASTLSSFLSVDNGASQALCAYLRRASASFNELKQLHKELKS 60
MKTVTG++VSSKPISISKAASTLSSFLSVDNGAS+A+CAYLRRASASFNELKQLHKELKS
Sbjct: 1 MKTVTGSIVSSKPISISKAASTLSSFLSVDNGASKAICAYLRRASASFNELKQLHKELKS 60
Query: 61 SRSVRKHLHQGSEVSNELEAALDNPCRVEDGEKKKSSASARMKRPDSRDETRDKPSLRVQ 120
SRS RKH H GSE SN+ EA+ NP +ED EKK ++ D R KPS VQ
Sbjct: 61 SRSDRKHRHHGSEASNDPEASRGNPHWIEDDEKKN---PLYLRAKDGRS---GKPSFNVQ 120
Query: 121 SDDVQIGKTVMENGGSGKFVDVSGEDGKRKGGDLKIEIEDKPSGKVEMDVESSDRDKSVV 180
S+D + GKT E+GGSG F D SGE KRK GDLK EIEDKP+ KVEMDVESSD+DKSVV
Sbjct: 121 SEDGKDGKTEKESGGSGDFEDASGEYRKRKVGDLKTEIEDKPNRKVEMDVESSDKDKSVV 180
Query: 181 AVEKKRKKHKKKSEDKHGNIEDDERDSGARLSHSKSQNSDNNGDIEASGEFVQNNVAKGK 240
AVEKK KKHKKKSED+H IEDDER+ GAR S+SKS+NSDNNG+IEASG+FV+NN+A GK
Sbjct: 181 AVEKKGKKHKKKSEDRHAKIEDDEREDGARRSYSKSRNSDNNGEIEASGKFVENNIASGK 240
Query: 241 VRKKRED-KSLGDEKDQVKDEGQRRRDMEEEKNTDKDNDDGTDLVDLSTKKKKKKKKK-- 300
RKK ED KSLGD+KDQVK EGQRRRD EEEK+T+KDNDDGT+ STKKKKKKKKK
Sbjct: 241 DRKKHEDKKSLGDDKDQVKSEGQRRRDAEEEKSTNKDNDDGTE----STKKKKKKKKKKN 300
Query: 301 REEDVDDFQNNRGGAMVKEEVPVPDSKESKRKERKKRKNRELGEEGGDDGSEEQQGTKRR 360
REE+ DDFQNN GGAMVKEE+PV D KE KRKE+KKRKNR L EEGGDDGSEEQQ TKRR
Sbjct: 301 REEEDDDFQNNSGGAMVKEEIPVSDDKELKRKEKKKRKNRGL-EEGGDDGSEEQQRTKRR 351
BLAST of Cla97C02G046710 vs. TAIR 10
Match:
AT1G75335.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G60030.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 73.2 bits (178), Expect = 4.9e-13
Identity = 47/80 (58.75%), Postives = 58/80 (72.50%), Query Frame = 0
Query: 1 MKTVTGTVVSSKPISISKAASTLSSFLSVDNGASQALCAYLRRASASFNELKQLHKELK- 60
MKTVTG V S+KPIS+SKAA+ LS F+S +NGASQ + AYLRRAS +F ELK +H+E+K
Sbjct: 1 MKTVTGRVNSAKPISLSKAATLLSGFVSSENGASQDVSAYLRRASGAFIELKSIHREIKS 60
Query: 61 -----SSRSVRK-HLHQGSE 74
SS+ RK H GSE
Sbjct: 61 KETKLSSKKKRKSHREMGSE 80
BLAST of Cla97C02G046710 vs. TAIR 10
Match:
AT5G60030.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G75335.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 72.4 bits (176), Expect = 8.3e-13
Identity = 116/367 (31.61%), Postives = 169/367 (46.05%), Query Frame = 0
Query: 1 MKTVTGTVVSSKPISISKAASTLSSFLSVDNGASQALCAYLRRASASFNELKQLHKELKS 60
MKTVTG VVS++PIS+SKAA LS F S DNGASQ + AYLRRASA+F ELK H+E+KS
Sbjct: 1 MKTVTGRVVSAEPISLSKAAKLLSGFASSDNGASQDVSAYLRRASAAFTELKSFHREIKS 60
Query: 61 SRSVRKHLHQGSEVSNELEAALDNPCRVEDGEKKKSSASARMKRPDSRDETRDKPSLRVQ 120
K+ +S R + ++ D S R
Sbjct: 61 --------------------------------KETKPSSDRETKSTETKQSSDAKSERNV 120
Query: 121 SDDVQIGKTVMENGGSGKFVDVSG----EDGKRKGGDLKIEIEDKPSGKVEMDVESSDRD 180
D+ K N + V G E +K D + +++K + K+E + S +R
Sbjct: 121 IDEFDGRKIRYRNSEAVSVESVYGRERDEKKMKKSKDADV-VDEKVNEKLEAEQRSEER- 180
Query: 181 KSVVAVEKKRKKHKKKSEDKHGNIEDDERDSGARLSHSKSQNSDNNGDIEASGEFVQNNV 240
++RKK KKK K N ++D D + Q S + +
Sbjct: 181 -------RERKKEKKK---KKNNKDEDVVDEKVKEKLEDEQKSADRKE------------ 240
Query: 241 AKGKVRKKREDKSLGDEKDQVKDEGQRRRDMEEEKNTDKDNDDGTDLVDL-----STKKK 300
K K KK D+ + DEK++++DE + E++KN D+D D + L S ++K
Sbjct: 241 RKKKKSKKNNDEDVVDEKEKLEDEQKSAEIKEKKKNKDEDVVDEKEKEKLEDEQRSGERK 286
Query: 301 KKKKKKREEDVDDFQNNRGGAMVKEEVPVPDSKESKRKERKKRKNRELGEEGGDDGSEEQ 359
K+KKKKR+ D + +V EE RK +KKRK+ E + GSEE+
Sbjct: 301 KEKKKKRKSDEE---------IVSEE----------RKSKKKRKSDE------EMGSEER 286
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038902882.1 | 1.6e-143 | 85.04 | probable xyloglucan galactosyltransferase GT11 [Benincasa hispida] | [more] |
KAA0035280.1 | 2.6e-125 | 78.95 | glutamic acid-rich protein [Cucumis melo var. makuwa] | [more] |
XP_008463862.1 | 2.2e-124 | 78.67 | PREDICTED: glutamic acid-rich protein [Cucumis melo] >TYK14356.1 glutamic acid-r... | [more] |
XP_004148227.1 | 1.7e-121 | 76.39 | uncharacterized protein DDB_G0283697 [Cucumis sativus] >KGN47333.1 hypothetical ... | [more] |
XP_022943393.1 | 4.9e-116 | 74.31 | glutamic acid-rich protein-like [Cucurbita moschata] >XP_022943394.1 glutamic ac... | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A5A7SW64 | 1.3e-125 | 78.95 | Glutamic acid-rich protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaff... | [more] |
A0A5D3CVE3 | 1.1e-124 | 78.67 | Glutamic acid-rich protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaff... | [more] |
A0A1S3CK97 | 1.1e-124 | 78.67 | glutamic acid-rich protein OS=Cucumis melo OX=3656 GN=LOC103501895 PE=4 SV=1 | [more] |
A0A0A0KCS1 | 8.4e-122 | 76.39 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G301030 PE=4 SV=1 | [more] |
A0A6J1FSX8 | 2.4e-116 | 74.31 | glutamic acid-rich protein-like OS=Cucurbita moschata OX=3662 GN=LOC111448174 PE... | [more] |
Match Name | E-value | Identity | Description | |
AT1G75335.1 | 4.9e-13 | 58.75 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |
AT5G60030.1 | 8.3e-13 | 31.61 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |