Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS
GAGAGTACTCAAAGCCGAGTTCATTTCTACGTTTTTCTTACCCGCCTCCATGCTGTACCTCGCCCGCAACTTTATTTCTTGCCGGAAATTATGGAGTTTCTGAGGGTCGCTTGCGTTGCAGATACGGCCTGTGCACTCTCCCCGTATCTGGTTTCCGCAAACGAAAGCGACTTTACTTGGCCGTGTTTTGAAGTTATTCCATATTCTGTTTAGAAAACTAGAAAAGGAACTACAATATAATCATAGGGATTGGGATTTTGATTTTGAGCTTATCTCAAATTCGTCTGTGTACATATAGTTTTGGGGGCAAAACTCTGGATTTCTGTGTCAATTTTATTTCCTGATATCCGTATTGAGATACGCTTCAGAGATGTCGGTGATGATTCTAATCCAACTCTAGGCTGTGGAACTCACATTTCCCGTCTTCCTAGTATAACATCGTTTTTCTAATTGGTTCTGCAAACACAATTTTCTGTTCTCGAATCCTGAGTTAAATGAAGGAATTGAAGAATACGAGCTTCAATTTTGGAAAATGATGAAGATAATTGTTGTTGGGATCTGACTATCTGTGGAGGTGCTTAAATTCGCTCGAGAGAGATTACCGGTCGCTGGTTGCTCTCAACATGCCTTTTCAAATGAAAATCCAGCCAATCGATTTTGACACAACCGACGAAGCAGCTCGCTTCGAGTTGGTGAAGCCGGCCGCGAAGTCGAAATTGAAGCGGCTATTTGAGAGACAGTTCCCAAACGTACTGAGAAATTCCGCAGAGAAACTCAATTTTGAAGAATTAAGCGTCAACAAGGAAAGCTCCGACGAATTCTCTGAGTTAGAGCCTAGCTCTATATGCTTGGCAAATATGGTCCAGAATTTCTTAGAGGACAACAATGAGAAACAATTCAGCGGGTCTAGGTGCGGCCGCAGTCGCTGCAACTGCTTTAATGGAAACTATACGGACAGCTCCGACGAGGAATTTGATCCACAAAGTGGTTTTGGCGATTCCAAATTTTCTTCCGGTGGTGAGGCATGGGAACTTCTGAAGGTACAACAGATCATTTTTGTTTGTTTTTGTTTGAATTGTGGATTTTCATTCTTTTATTTTTCCCTTCATGGTAAATGTTTCTTGTTTCTATCATTTTGAAAAACATTTTCATATTCGAACAGAGCTTAATTCCCTGTTCGAGCGTTCATGAGAGGAACATATTAGCGGATACTGCCAGGATCGTAGAGAAGAACAAGGTCTGCAAACGTAAAGACAACTTAGCAAGGGAGATTGTTACTAATGGATTGCTGGCTCTTGGATACGACGCTTCAATCTGCAAATCTCACTGGGAAAAGTCGCCCTCATATCCAGCCGGTATATTTCTTACTTACTATTCTTGTTGATTGTGGGAAAATATTAGGTTTTTGAATTTGAATTCGAGCGGTTCATCCATCATTTTCCTTTTTCTCCTGAAGTTTATATAATTTGGTTAATCTGGAGAAAACCTAGGATTCTTGGACCGGGCCGCGTCTATTCTTCTCTTTAAATTTGAATTTCATTAGGCAACTGAACTCTGATTGTTTCTGATTATAGTTGATTGAAATCCGTTTTTTCTGATGTACCAGGTGATTATGAATACGTTGACGTAATTATCGAAGGAGAACGCTTATTGATAGACATCGACTTCCGATCAGAATTCGAAATTGCGCGGTCAACCAAAAGCTACAAATCGATCCTCCAACTTATTCCTTACATCTTCGTCGGCAAGGCTTGTCGGCTTCAGAGGATTGTATCAATCGTATCAGAGGCTGCAAAACAGAGCTTAAAAAAGAAGGGGATGCCAGTTCCGCCATGGAGAAAAGCCGAGTATGTCAAAGCGAAGTGGCTCTCTCCTCATAGTCGTGCATCATCCTTGGCGATTTTGGGTTCTAATCCTGAGTCGAAGAATCCTCTTGAAAACATCCAAATTGATCCATCGCACAGTCACGACGTTGAGAAATCGACGGATCATAACGAATT
CGTGGAGAATGCTGCCGTTGTGAAAGAGTGGAAGCCTCCAGAGCTTAAGCCGAAAAGCTCATCGGTTGGGGTTAGAAATCTTAAGATTGTAACCGGTTTGGCATCGGTTATTGAAGACTAG
mRNA sequence
GAGAGTACTCAAAGCCGAGTTCATTTCTACGTTTTTCTTACCCGCCTCCATGCTGTACCTCGCCCGCAACTTTATTTCTTGCCGGAAATTATGGAGTTTCTGAGGGTCGCTTGCGTTGCAGATACGGCCTGTGCACTCTCCCCGTATCTGGTTTCCGCAAACGAAAGCGACTTTACTTGGCCGTGTTTTGAAGTTATTCCATATTCTGTTTAGAAAACTAGAAAAGGAACTACAATATAATCATAGGGATTGGGATTTTGATTTTGAGCTTATCTCAAATTCGTCTGTGTACATATAGTTTTGGGGGCAAAACTCTGGATTTCTGTGTCAATTTTATTTCCTGATATCCGTATTGAGATACGCTTCAGAGATGTCGGTGATGATTCTAATCCAACTCTAGGCTGTGGAACTCACATTTCCCGTCTTCCTAGTATAACATCGTTTTTCTAATTGGTTCTGCAAACACAATTTTCTGTTCTCGAATCCTGAGTTAAATGAAGGAATTGAAGAATACGAGCTTCAATTTTGGAAAATGATGAAGATAATTGTTGTTGGGATCTGACTATCTGTGGAGGTGCTTAAATTCGCTCGAGAGAGATTACCGGTCGCTGGTTGCTCTCAACATGCCTTTTCAAATGAAAATCCAGCCAATCGATTTTGACACAACCGACGAAGCAGCTCGCTTCGAGTTGGTGAAGCCGGCCGCGAAGTCGAAATTGAAGCGGCTATTTGAGAGACAGTTCCCAAACGTACTGAGAAATTCCGCAGAGAAACTCAATTTTGAAGAATTAAGCGTCAACAAGGAAAGCTCCGACGAATTCTCTGAGTTAGAGCCTAGCTCTATATGCTTGGCAAATATGGTCCAGAATTTCTTAGAGGACAACAATGAGAAACAATTCAGCGGGTCTAGGTGCGGCCGCAGTCGCTGCAACTGCTTTAATGGAAACTATACGGACAGCTCCGACGAGGAATTTGATCCACAAAGTGGTTTTGGCGATTCCAAATTTTCTTCCGGTGGTGAGGCATGGGAACTTCTGAAGAGCTTAATTCCCTGTTCGAGCGTTCATGAGAGGAACATATTAGCGGATACTGCCAGGATCGTAGAGAAGAACAAGGTCTGCAAACGTAAAGACAACTTAGCAAGGGAGATTGTTACTAATGGATTGCTGGCTCTTGGATACGACGCTTCAATCTGCAAATCTCACTGGGAAAAGTCGCCCTCATATCCAGCCGGTGATTATGAATACGTTGACGTAATTATCGAAGGAGAACGCTTATTGATAGACATCGACTTCCGATCAGAATTCGAAATTGCGCGGTCAACCAAAAGCTACAAATCGATCCTCCAACTTATTCCTTACATCTTCGTCGGCAAGGCTTGTCGGCTTCAGAGGATTGTATCAATCGTATCAGAGGCTGCAAAACAGAGCTTAAAAAAGAAGGGGATGCCAGTTCCGCCATGGAGAAAAGCCGAGTATGTCAAAGCGAAGTGGCTCTCTCCTCATAGTCGTGCATCATCCTTGGCGATTTTGGGTTCTAATCCTGAGTCGAAGAATCCTCTTGAAAACATCCAAATTGATCCATCGCACAGTCACGACGTTGAGAAATCGACGGATCATAACGAATTCGTGGAGAATGCTGCCGTTGTGAAAGAGTGGAAGCCTCCAGAGCTTAAGCCGAAAAGCTCATCGGTTGGGGTTAGAAATCTTAAGATTGTAACCGGTTTGGCATCGGTTATTGAAGACTAG
Coding sequence (CDS)
ATGCCTTTTCAAATGAAAATCCAGCCAATCGATTTTGACACAACCGACGAAGCAGCTCGCTTCGAGTTGGTGAAGCCGGCCGCGAAGTCGAAATTGAAGCGGCTATTTGAGAGACAGTTCCCAAACGTACTGAGAAATTCCGCAGAGAAACTCAATTTTGAAGAATTAAGCGTCAACAAGGAAAGCTCCGACGAATTCTCTGAGTTAGAGCCTAGCTCTATATGCTTGGCAAATATGGTCCAGAATTTCTTAGAGGACAACAATGAGAAACAATTCAGCGGGTCTAGGTGCGGCCGCAGTCGCTGCAACTGCTTTAATGGAAACTATACGGACAGCTCCGACGAGGAATTTGATCCACAAAGTGGTTTTGGCGATTCCAAATTTTCTTCCGGTGGTGAGGCATGGGAACTTCTGAAGAGCTTAATTCCCTGTTCGAGCGTTCATGAGAGGAACATATTAGCGGATACTGCCAGGATCGTAGAGAAGAACAAGGTCTGCAAACGTAAAGACAACTTAGCAAGGGAGATTGTTACTAATGGATTGCTGGCTCTTGGATACGACGCTTCAATCTGCAAATCTCACTGGGAAAAGTCGCCCTCATATCCAGCCGGTGATTATGAATACGTTGACGTAATTATCGAAGGAGAACGCTTATTGATAGACATCGACTTCCGATCAGAATTCGAAATTGCGCGGTCAACCAAAAGCTACAAATCGATCCTCCAACTTATTCCTTACATCTTCGTCGGCAAGGCTTGTCGGCTTCAGAGGATTGTATCAATCGTATCAGAGGCTGCAAAACAGAGCTTAAAAAAGAAGGGGATGCCAGTTCCGCCATGGAGAAAAGCCGAGTATGTCAAAGCGAAGTGGCTCTCTCCTCATAGTCGTGCATCATCCTTGGCGATTTTGGGTTCTAATCCTGAGTCGAAGAATCCTCTTGAAAACATCCAAATTGATCCATCGCACAGTCACGACGTTGAGAAATCGACGGATCATAACGAATTCGTGGAGAATGCTGCCGTTGTGAAAGAGTGGAAGCCTCCAGAGCTTAAGCCGAAAAGCTCATCGGTTGGGGTTAGAAATCTTAAGATTGTAACCGGTTTGGCATCGGTTATTGAAGACTAG
Protein sequence
MPFQMKIQPIDFDTTDEAARFELVKPAAKSKLKRLFERQFPNVLRNSAEKLNFEELSVNKESSDEFSELEPSSICLANMVQNFLEDNNEKQFSGSRCGRSRCNCFNGNYTDSSDEEFDPQSGFGDSKFSSGGEAWELLKSLIPCSSVHERNILADTARIVEKNKVCKRKDNLAREIVTNGLLALGYDASICKSHWEKSPSYPAGDYEYVDVIIEGERLLIDIDFRSEFEIARSTKSYKSILQLIPYIFVGKACRLQRIVSIVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHSRASSLAILGSNPESKNPLENIQIDPSHSHDVEKSTDHNEFVENAAVVKEWKPPELKPKSSSVGVRNLKIVTGLASVIED
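As a consistency check on the sequences above, the protein sequence should be the conceptual translation of the CDS. The following is a minimal Python sketch, assuming the standard genetic code (NCBI translation table 1). The function name `translate` is illustrative and not part of the database, and the example uses only the first 33 nucleotides of the CDS.

```python
# Standard genetic code (NCBI translation table 1), built in TCAG order.
# Assumption: the CDS is nuclear-encoded and uses the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon: end of the polypeptide
            break
        protein.append(residue)
    return "".join(protein)


# First 11 codons of the CDS listed above.
print(translate("ATGCCTTTTCAAATGAAAATCCAGCCAATCGAT"))  # MPFQMKIQPID
```

Passing the full CDS string to `translate` and comparing the result with the protein sequence is a quick way to confirm that the two records agree.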
Homology
BLAST of CmoCh02G018200 vs. ExPASy TrEMBL
Match:
A0A6J1ESR4 (uncharacterized protein LOC111437420 OS=Cucurbita moschata OX=3662 GN=LOC111437420 PE=4 SV=1)
HSP 1 Score: 739.6 bits (1908), Expect = 6.6e-210
Identity = 374/374 (100.00%), Positives = 374/374 (100.00%), Query Frame = 0
Query: 1 MPFQMKIQPIDFDTTDEAARFELVKPAAKSKLKRLFERQFPNVLRNSAEKLNFEELSVNK 60
MPFQMKIQPIDFDTTDEAARFELVKPAAKSKLKRLFERQFPNVLRNSAEKLNFEELSVNK
Sbjct: 1 MPFQMKIQPIDFDTTDEAARFELVKPAAKSKLKRLFERQFPNVLRNSAEKLNFEELSVNK 60
Query: 61 ESSDEFSELEPSSICLANMVQNFLEDNNEKQFSGSRCGRSRCNCFNGNYTDSSDEEFDPQ 120
ESSDEFSELEPSSICLANMVQNFLEDNNEKQFSGSRCGRSRCNCFNGNYTDSSDEEFDPQ
Sbjct: 61 ESSDEFSELEPSSICLANMVQNFLEDNNEKQFSGSRCGRSRCNCFNGNYTDSSDEEFDPQ 120
Query: 121 SGFGDSKFSSGGEAWELLKSLIPCSSVHERNILADTARIVEKNKVCKRKDNLAREIVTNG 180
SGFGDSKFSSGGEAWELLKSLIPCSSVHERNILADTARIVEKNKVCKRKDNLAREIVTNG
Sbjct: 121 SGFGDSKFSSGGEAWELLKSLIPCSSVHERNILADTARIVEKNKVCKRKDNLAREIVTNG 180
Query: 181 LLALGYDASICKSHWEKSPSYPAGDYEYVDVIIEGERLLIDIDFRSEFEIARSTKSYKSI 240
LLALGYDASICKSHWEKSPSYPAGDYEYVDVIIEGERLLIDIDFRSEFEIARSTKSYKSI
Sbjct: 181 LLALGYDASICKSHWEKSPSYPAGDYEYVDVIIEGERLLIDIDFRSEFEIARSTKSYKSI 240
Query: 241 LQLIPYIFVGKACRLQRIVSIVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHSRASSL 300
LQLIPYIFVGKACRLQRIVSIVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHSRASSL
Sbjct: 241 LQLIPYIFVGKACRLQRIVSIVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHSRASSL 300
Query: 301 AILGSNPESKNPLENIQIDPSHSHDVEKSTDHNEFVENAAVVKEWKPPELKPKSSSVGVR 360
AILGSNPESKNPLENIQIDPSHSHDVEKSTDHNEFVENAAVVKEWKPPELKPKSSSVGVR
Sbjct: 301 AILGSNPESKNPLENIQIDPSHSHDVEKSTDHNEFVENAAVVKEWKPPELKPKSSSVGVR 360
Query: 361 NLKIVTGLASVIED 375
NLKIVTGLASVIED
Sbjct: 361 NLKIVTGLASVIED 374
BLAST of CmoCh02G018200 vs. ExPASy TrEMBL
Match:
A0A6J1K2U7 (uncharacterized protein LOC111491206 OS=Cucurbita maxima OX=3661 GN=LOC111491206 PE=4 SV=1)
HSP 1 Score: 721.1 bits (1860), Expect = 2.4e-204
Identity = 366/375 (97.60%), Positives = 371/375 (98.93%), Query Frame = 0
Query: 1 MPFQMKIQPIDFDTTDEAARFELVKPAAKSKLKRLFERQFPNVLRNSAEKLNFEELSVNK 60
MPFQMKIQPIDFDTTDEAARFELVKPAAKSKLKRLFERQFPNVLRNSAEKLNFE+LSVNK
Sbjct: 1 MPFQMKIQPIDFDTTDEAARFELVKPAAKSKLKRLFERQFPNVLRNSAEKLNFEDLSVNK 60
Query: 61 ESSDEFSELEPSSICLANMVQNFLEDNNEKQFSGSRCGRSRCNCFNGNYTDSSDEEFDPQ 120
ESSDEFSELEPSSICLANMVQNF+EDNNEKQF GSRCGR+RCNCFNGNYTDSSDEEFDPQ
Sbjct: 61 ESSDEFSELEPSSICLANMVQNFIEDNNEKQFGGSRCGRNRCNCFNGNYTDSSDEEFDPQ 120
Query: 121 SGFGDSKFSSGGEAWELLKSLIPCSSVHERNILADTARIVEKNKVCKRKDNLAREIVTNG 180
SGFGDSKFSSGGEAWELLKSLIPCSSVHERNILADTARIVEKNKVCKRKDNLAR+IVTNG
Sbjct: 121 SGFGDSKFSSGGEAWELLKSLIPCSSVHERNILADTARIVEKNKVCKRKDNLARQIVTNG 180
Query: 181 LLALGYDASICKSHWEKSPSYPAGDYEYVDVIIEGERLLIDIDFRSEFEIARSTKSYKSI 240
LLALGYDASICKSHWEKSPSYPAGDYEYVDVIIEGERLLIDIDFRSEFEIARSTKSYKSI
Sbjct: 181 LLALGYDASICKSHWEKSPSYPAGDYEYVDVIIEGERLLIDIDFRSEFEIARSTKSYKSI 240
Query: 241 LQLIPYIFVGKACRLQRIVSIVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHSRASSL 300
LQLIPYIFVGKACRLQRIVSIVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHSRASSL
Sbjct: 241 LQLIPYIFVGKACRLQRIVSIVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHSRASSL 300
Query: 301 AILGS-NPESKNPLENIQIDPSHSHDVEKSTDHNEFVENAAVVKEWKPPELKPKSSSVGV 360
AILGS NPESKNPLENIQIDPS+SHDVEKS DHNEFVE AAVVKEWKPPELKPKSSSVGV
Sbjct: 301 AILGSTNPESKNPLENIQIDPSYSHDVEKSMDHNEFVEIAAVVKEWKPPELKPKSSSVGV 360
Query: 361 RNLKIVTGLASVIED 375
RNLKIVTGLASVIED
Sbjct: 361 RNLKIVTGLASVIED 375
BLAST of CmoCh02G018200 vs. ExPASy TrEMBL
Match:
A0A6J1HG87 (uncharacterized protein LOC111464003 OS=Cucurbita moschata OX=3662 GN=LOC111464003 PE=4 SV=1)
HSP 1 Score: 619.8 bits (1597), Expect = 7.6e-174
Identity = 318/375 (84.80%), Positives = 339/375 (90.40%), Query Frame = 0
Query: 3 FQMKIQPIDFDTTDEAARFELVKPAAKSKLKRLFERQFPNVLRNSAEKLNFEELSVNKES 62
FQMKIQPIDFDT +EAARFELVKP KSKLKRLFERQF NVLRNSAEK NFEE +VNK+S
Sbjct: 14 FQMKIQPIDFDTAEEAARFELVKPVVKSKLKRLFERQFSNVLRNSAEKANFEESNVNKDS 73
Query: 63 SDEF-SELEPSSICLANMVQNFLEDNNEKQFSGSRCGRSRCNCFNGNYTDSSDEEFDPQS 122
SD SELEPSS+CLANMVQNF+EDNNEKQFS SRCGRSRCNCFNGNYTDSS+EE DP S
Sbjct: 74 SDGVSSELEPSSLCLANMVQNFIEDNNEKQFSASRCGRSRCNCFNGNYTDSSEEELDPHS 133
Query: 123 GFGDSKFSSGGEAWELLKSLIPCSSVHERNILADTARIVEKNKVCKRKDNLAREIVTNGL 182
GFGDS FSSGGEAWELLKSLIPC++VHERN+LADTARIVEKNKVCKRKDNLAR++VTNGL
Sbjct: 134 GFGDSNFSSGGEAWELLKSLIPCTNVHERNLLADTARIVEKNKVCKRKDNLARQVVTNGL 193
Query: 183 LALGYDASICKSHWEKSPSYPAGDYEYVDVIIEGERLLIDIDFRSEFEIARSTKSYKSIL 242
LALGYDA ICKSHWEKSP++PAGDYEY+DVII GERLLIDIDFRSEFEIARSTKSYK+IL
Sbjct: 194 LALGYDAFICKSHWEKSPTHPAGDYEYIDVIIGGERLLIDIDFRSEFEIARSTKSYKAIL 253
Query: 243 QLIPYIFVGKACRLQRIVSIVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHSRASSLA 302
QLIPYIFVG CRLQRIVSI+SEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPH RASSL+
Sbjct: 254 QLIPYIFVGNPCRLQRIVSIISEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASSLS 313
Query: 303 ILGSNPESKNPLENIQIDPSHSHDVEKSTDHNEF--VENAAVVKEWKPPELKPKSSSVGV 362
LG +PESK+ L NIQI+ EKS D NE VE AAVVKEWKPPELKPKSSS+G
Sbjct: 314 SLGPDPESKHTLGNIQINLRPG--AEKSVDPNELGGVETAAVVKEWKPPELKPKSSSIGA 373
Query: 363 RNLKIVTGLASVIED 375
RNLKIVTGLASVIED
Sbjct: 374 RNLKIVTGLASVIED 386
BLAST of CmoCh02G018200 vs. ExPASy TrEMBL
Match:
A0A6J1CY78 (uncharacterized protein LOC111015299 OS=Momordica charantia OX=3673 GN=LOC111015299 PE=4 SV=1)
HSP 1 Score: 617.1 bits (1590), Expect = 4.9e-173
Identity = 317/386 (82.12%), Positives = 340/386 (88.08%), Query Frame = 0
Query: 1 MPFQMKIQPIDFDTTDEAARFELVKPAAK-SKLKRLFERQFPNVLRNSAEKLNFEELSVN 60
MPFQMKIQPIDFD+ +EAAR ELVKPA K SKLKRLFE+QF NVLRNSAEK NFEEL+VN
Sbjct: 1 MPFQMKIQPIDFDSAEEAARLELVKPAVKSSKLKRLFEKQFHNVLRNSAEKANFEELNVN 60
Query: 61 KESSDEFSELEPSSICLANMVQNFLEDNNEKQFSGSRCGRSRCNCFNGNYTDSSDEEFDP 120
K+SSD FS LEPSSICLA MVQNF+EDNNEKQFS SRCGRSRCNCFNGNYTDSS+EE D
Sbjct: 61 KDSSDCFSVLEPSSICLAKMVQNFIEDNNEKQFSASRCGRSRCNCFNGNYTDSSEEELDS 120
Query: 121 QSGFGDSKFSSGGEAWELLKSLIPCSSVHERNILADTARIVEKNKVCKRKDNLAREIVTN 180
GFGD+KFSSGGEAWELLKSLIPC+SVHERN+LADTARIVEKNKVCKRKDNLAR+IVTN
Sbjct: 121 HGGFGDAKFSSGGEAWELLKSLIPCTSVHERNLLADTARIVEKNKVCKRKDNLARQIVTN 180
Query: 181 GLLALGYDASICKSHWEKSPSYPAGDYEYVDVIIEGERLLIDIDFRSEFEIARSTKSYKS 240
GLLALGYDASICKSHWEKSPSYPAGDYEYVDVIIEGERLLID+DFRSEFEIARSTKSY++
Sbjct: 181 GLLALGYDASICKSHWEKSPSYPAGDYEYVDVIIEGERLLIDVDFRSEFEIARSTKSYRT 240
Query: 241 ILQLIPYIFVGKACRLQRIVSIVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHSRASS 300
ILQL+P+I+VGK RLQRIVS+VSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPH RASS
Sbjct: 241 ILQLVPFIYVGKPGRLQRIVSVVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASS 300
Query: 301 LAILGSNPESKNPLENIQIDPSHSHDVEKSTDHNEFVE-----------NAAVVKEWKPP 360
L+ILG N ESK PLEN Q++P EKS D NEF E VKEWKPP
Sbjct: 301 LSILGPNSESKQPLENFQLEPRQV--AEKSVDRNEFGEVDLGPSVTIGVEEDTVKEWKPP 360
Query: 361 ELKPKSSSVGVRNLKIVTGLASVIED 375
E+KPKSSS+G RNLKIVTGLASVIED
Sbjct: 361 EVKPKSSSLGARNLKIVTGLASVIED 384
BLAST of CmoCh02G018200 vs. ExPASy TrEMBL
Match:
A0A6J1HV35 (uncharacterized protein LOC111466982 OS=Cucurbita maxima OX=3661 GN=LOC111466982 PE=4 SV=1)
HSP 1 Score: 613.2 bits (1580), Expect = 7.1e-172
Identity = 314/373 (84.18%), Positives = 336/373 (90.08%), Query Frame = 0
Query: 5 MKIQPIDFDTTDEAARFELVKPAAKSKLKRLFERQFPNVLRNSAEKLNFEELSVNKESSD 64
MKIQPI+FDT +EAARFELVKP KSKLKRLFERQF NVLRNSAEK NFEE +VNK+SSD
Sbjct: 1 MKIQPIEFDTAEEAARFELVKPVVKSKLKRLFERQFSNVLRNSAEKANFEESNVNKDSSD 60
Query: 65 EF-SELEPSSICLANMVQNFLEDNNEKQFSGSRCGRSRCNCFNGNYTDSSDEEFDPQSGF 124
SELEPSS+CLANMVQNF+EDNNEK FS SRCGRSRCNCFNGNYTDSS+EE DP SGF
Sbjct: 61 GVSSELEPSSLCLANMVQNFIEDNNEKHFSASRCGRSRCNCFNGNYTDSSEEELDPHSGF 120
Query: 125 GDSKFSSGGEAWELLKSLIPCSSVHERNILADTARIVEKNKVCKRKDNLAREIVTNGLLA 184
GDS FSSGGEAWELLKSLIPC++VHERN+LA+TARIVEKNKVCKRKDNLAR++VTNGLLA
Sbjct: 121 GDSNFSSGGEAWELLKSLIPCTNVHERNLLAETARIVEKNKVCKRKDNLARQVVTNGLLA 180
Query: 185 LGYDASICKSHWEKSPSYPAGDYEYVDVIIEGERLLIDIDFRSEFEIARSTKSYKSILQL 244
LGYDASICKSHWEKSP++PAGDYEY DVII+GERLLIDIDFRSEFEIARSTKSYK+ILQL
Sbjct: 181 LGYDASICKSHWEKSPTHPAGDYEYTDVIIDGERLLIDIDFRSEFEIARSTKSYKAILQL 240
Query: 245 IPYIFVGKACRLQRIVSIVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHSRASSLAIL 304
IPYIFVG CRLQRIVSI+SEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPH RASSL+ L
Sbjct: 241 IPYIFVGNPCRLQRIVSIISEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHIRASSLSSL 300
Query: 305 GSNPESKNPLENIQIDPSHSHDVEKSTDHNEF--VENAAVVKEWKPPELKPKSSSVGVRN 364
G +PESK+ L NIQID EKS D NE VE AAVVKEWKPPELKPKS S+G RN
Sbjct: 301 GPDPESKHTLGNIQIDLRPG--AEKSVDPNELGGVETAAVVKEWKPPELKPKSPSIGARN 360
Query: 365 LKIVTGLASVIED 375
LKIVTGLASVIED
Sbjct: 361 LKIVTGLASVIED 371
BLAST of CmoCh02G018200 vs. TAIR 10
Match:
AT3G22970.1 (Protein of unknown function (DUF506))
HSP 1 Score: 340.9 bits (873), Expect = 1.3e-93
Identity = 203/386 (52.59%), Positives = 261/386 (67.62%), Query Frame = 0
Query: 1 MPFQMKIQPIDFDTTDEAARFEL-VKPAAKSKLKRLFERQFPNVLRNSAEKLNFEELSV- 60
MPF MKIQPID D++ AR E KP KS+LKRLF+R F NVLRNS + V
Sbjct: 1 MPFTMKIQPIDIDSSPTVARAESGNKPVLKSRLKRLFDRPFTNVLRNSTTTTTEKPFVVT 60
Query: 61 --NKESSDEFSELEPSSICLANMVQNFLEDNNEKQFSGSRCGRSRCNCFNGNYTDSSDEE 120
+ +E EPSS+CLA MVQNF+E+NNEKQ ++CGR+RCNCFNGN SSD+E
Sbjct: 61 GGEVQCGGVVTEFEPSSVCLAKMVQNFIEENNEKQ---AKCGRNRCNCFNGNNDGSSDDE 120
Query: 121 FDPQSGFGDSKFSSGGEAWELLKSLIPCSSVHERNILADTARIVEKNKVCKRKDNLAREI 180
D G D G +A + LKSLIPC++V ERN+LAD A+IV+KNK KRKD++ ++I
Sbjct: 121 SDLFGGSID-----GCDASDHLKSLIPCTTVGERNLLADAAKIVDKNKSVKRKDDM-KKI 180
Query: 181 VTNGLLALGYDASICKSHWEKSPSYPAGDYEYVDVIIEGERLLIDIDFRSEFEIARSTKS 240
V GLL+L Y++SICKS W+KSPS+PAG+YEY+DVII ERL+ID+DFRSEF+IAR T
Sbjct: 181 VNEGLLSLNYNSSICKSKWDKSPSFPAGEYEYIDVIIGEERLIIDVDFRSEFDIARQTSG 240
Query: 241 YKSILQLIPYIFVGKACRLQRIVSIVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHSR 300
YK +LQ +P+IFVGK+ RL +IV ++SEAAKQSLKKKGMP PPWRKAEY+++KWLS ++R
Sbjct: 241 YKVLLQSLPFIFVGKSDRLSQIVFLISEAAKQSLKKKGMPFPPWRKAEYMRSKWLSSYTR 300
Query: 301 ASSLAILGSNPESKNPLENIQIDPSHSHDVEKSTDHNEFVENAAVVKEWKPPELKPKSSS 360
AS + + E+ + D + VEK D E +E K P + SSS
Sbjct: 301 ASVVVV----DETVTVTDVTAADAA----VEKEVDSVE-IELVFEEKCLSPRVIVNSSSS 360
Query: 361 -------VGV-RNLKIVTGLASVIED 375
V V R +K VTGLAS+ ++
Sbjct: 361 PTDGDDDVAVEREVKAVTGLASLFKE 368
BLAST of CmoCh02G018200 vs. TAIR 10
Match:
AT2G38820.2 (Protein of unknown function (DUF506))
HSP 1 Score: 322.4 bits (825), Expect = 4.8e-88
Identity = 180/323 (55.73%), Positives = 231/323 (71.52%), Query Frame = 0
Query: 1 MPFQMKIQPID-FDTTDEAARFELVKPAAKSKLKRLFERQFPNVLRNSAEKLNFE--ELS 60
MP MKIQPID D ++E E ++ KS+LKRLFERQF N +N +EK E
Sbjct: 1 MPLHMKIQPIDESDVSEEVPYPETMRQMPKSRLKRLFERQFTN--KNVSEKFTGSDVEAP 60
Query: 61 VNKESSDEFSELEPSSICLANMVQNFLEDNN--EKQFSGSRCGRSRCNCFNGNYTDSSDE 120
+++ +S +F EPSS+CLA MV NF+EDNN EKQ RCGRSRCNCF+G+ T+SSD+
Sbjct: 61 LSRGNSGDF---EPSSVCLAKMVLNFMEDNNGGEKQ----RCGRSRCNCFSGSGTESSDD 120
Query: 121 EFDPQSGFGDSKFSSGGEAWELLKSLIPCSSVHERNILADTARIVEKNKVCKRKDNLARE 180
E + S GEA E+LKSL+ C S+ RN+L D +I E +K CK KD +
Sbjct: 121 ETE----------CSSGEACEILKSLVLCKSIRVRNLLTDVTKIAETSKNCKLKDGSCLK 180
Query: 181 IVTNGLLALGYDASICKSHWEKSPSYPAGDYEYVDVIIEGERLLIDIDFRSEFEIARSTK 240
V NGL++LGYDA++CKS WEKSPS PAG+YEYVDVI++GERLLIDIDF+S+FEIAR+TK
Sbjct: 181 SVANGLVSLGYDAALCKSRWEKSPSCPAGEYEYVDVIMKGERLLIDIDFKSKFEIARATK 240
Query: 241 SYKSILQLIPYIFVGKACRLQRIVSIVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHS 300
+YKS+LQ +PYIFVGKA RLQ+I+ ++ +AAKQSLKKKG+ VPPWR+AEYVK+KWLS H
Sbjct: 241 TYKSMLQTLPYIFVGKADRLQKIIVLICKAAKQSLKKKGLHVPPWRRAEYVKSKWLSSHV 298
Query: 301 RASSLAILGSNPESKNPLENIQI 319
R SN E K E++++
Sbjct: 301 RVDQ----NSNGEVKQ--ESVEV 298
BLAST of CmoCh02G018200 vs. TAIR 10
Match:
AT2G38820.1 (Protein of unknown function (DUF506))
HSP 1 Score: 289.3 bits (739), Expect = 4.5e-78
Identity = 169/323 (52.32%), Positives = 217/323 (67.18%), Query Frame = 0
Query: 1 MPFQMKIQPID-FDTTDEAARFELVKPAAKSKLKRLFERQFPNVLRNSAEKLNFE--ELS 60
MP MKIQPID D ++E E ++ KS+LKRLFERQF N +N +EK E
Sbjct: 1 MPLHMKIQPIDESDVSEEVPYPETMRQMPKSRLKRLFERQFTN--KNVSEKFTGSDVEAP 60
Query: 61 VNKESSDEFSELEPSSICLANMVQNFLEDNN--EKQFSGSRCGRSRCNCFNGNYTDSSDE 120
+++ +S +F EPSS+CLA MV NF+EDNN EKQ RCGRSRCNCF+G+ T+SSD+
Sbjct: 61 LSRGNSGDF---EPSSVCLAKMVLNFMEDNNGGEKQ----RCGRSRCNCFSGSGTESSDD 120
Query: 121 EFDPQSGFGDSKFSSGGEAWELLKSLIPCSSVHERNILADTARIVEKNKVCKRKDNLARE 180
E + S GEA E+LKSL+ C S+ RN+L D +I E +
Sbjct: 121 ETE----------CSSGEACEILKSLVLCKSIRVRNLLTDVTKIAETS------------ 180
Query: 181 IVTNGLLALGYDASICKSHWEKSPSYPAGDYEYVDVIIEGERLLIDIDFRSEFEIARSTK 240
YDA++CKS WEKSPS PAG+YEYVDVI++GERLLIDIDF+S+FEIAR+TK
Sbjct: 181 ----------YDAALCKSRWEKSPSCPAGEYEYVDVIMKGERLLIDIDFKSKFEIARATK 240
Query: 241 SYKSILQLIPYIFVGKACRLQRIVSIVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHS 300
+YKS+LQ +PYIFVGKA RLQ+I+ ++ +AAKQSLKKKG+ VPPWR+AEYVK+KWLS H
Sbjct: 241 TYKSMLQTLPYIFVGKADRLQKIIVLICKAAKQSLKKKGLHVPPWRRAEYVKSKWLSSHV 276
Query: 301 RASSLAILGSNPESKNPLENIQI 319
R SN E K E++++
Sbjct: 301 RVDQ----NSNGEVKQ--ESVEV 276
BLAST of CmoCh02G018200 vs. TAIR 10
Match:
AT4G14620.1 (Protein of unknown function (DUF506))
HSP 1 Score: 268.1 bits (684), Expect = 1.1e-71
Identity = 174/381 (45.67%), Positives = 231/381 (60.63%), Query Frame = 0
Query: 5 MKIQPIDFDTTDEAARFE-LVKPAAKSKLKRLFERQFPNVLRNSAEKLNFEELSVNKESS 64
MKIQPI+ D A R E KP KS+LKRL +R F + N E+L ++ +
Sbjct: 2 MKIQPINNDL--PANRVESSTKPVLKSRLKRLLDRPFTRI-------SNSEKLLISGDGV 61
Query: 65 DEFSELEPSSICLANMVQNFLEDNNEKQFSGSRCGRSRCNCFNGNYTDSSDEEFDPQSGF 124
+E EPS LA MVQN++E+NN+KQ R RCNCFNGN D SD+E D F
Sbjct: 62 VAGTEFEPS---LAKMVQNYMEENNDKQTKNGR-NTHRCNCFNGN-NDISDDELD----F 121
Query: 125 GDSKFSSGGEAWELLKSLIPCSSVHERNILADTARIVEKNKVCKRKDNLAREIVTNGLLA 184
D ++ KSLI C S E+++L + +I+EKNK KRKD L R+IV + L +
Sbjct: 122 FD---------YDNFKSLIQCGSFVEKSLLVEATKIIEKNKSVKRKDEL-RKIVVDELSS 181
Query: 185 LGYDASICKSHWEKSPSYPAGDYEYVDVIIEGERLLIDIDFRSEFEIARSTKSYKSILQL 244
LGYD+SICKS W+K+ S PAG+YEY+DVI+ GERL+IDIDFRSEFEIAR T YK +LQ
Sbjct: 182 LGYDSSICKSKWDKTRSIPAGEYEYIDVIVNGERLIIDIDFRSEFEIARQTSGYKELLQS 241
Query: 245 IPYIFVGKACRLQRIVSIVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPHSRASSLAIL 304
+P IFVGK+ R+++IVSIVSEA+KQSLKKKGM PPWRKA+Y++AKWLS ++R S
Sbjct: 242 LPLIFVGKSDRIRQIVSIVSEASKQSLKKKGMHFPPWRKADYMRAKWLSSYTRNS----- 301
Query: 305 GSNPESKNPLENIQIDPSHSHDVEKSTDHNEFVENAAVVKEWKPPELKPKSSSVG----- 364
G + + +P +++ S F E + P LK +SVG
Sbjct: 302 GEKKPTVTSAAKVVAEP----ELDSSEIELIFEEKVLL------PPLKSPITSVGRDDDD 339
Query: 365 -----VRNLKIVTGLASVIED 375
+ K+VTGLA + ++
Sbjct: 362 VAESVKKEAKVVTGLALLFKE 339
BLAST of CmoCh02G018200 vs. TAIR 10
Match:
AT3G54550.1 (Protein of unknown function (DUF506))
HSP 1 Score: 237.3 bits (604), Expect = 2.0e-62
Identity = 140/302 (46.36%), Positives = 203/302 (67.22%), Query Frame = 0
Query: 1 MPFQMKIQPIDFDTTDEAARFELVKPAAKSKLKRLFERQFPNVLRNSAEKLNFEELSVNK 60
MPF+ K+QPI+ + ++ A +S+LKRLFER F S + L + S+++
Sbjct: 1 MPFRSKVQPININGVG-------MRQAPRSRLKRLFERPF------SLKNLAGVDSSLSR 60
Query: 61 ESSDEFSELEPSSICLANMVQNFLED-NNEKQFSGSRC-GRSRCNCFNGNYTDSSDEEFD 120
E+S+ E+EPSS+CL MVQN++ED ++EKQ S+C R+RCNCF+G+ TDSSDE+
Sbjct: 61 ENSE---EIEPSSVCLRRMVQNYIEDPDSEKQ---SKCIVRNRCNCFSGSGTDSSDED-- 120
Query: 121 PQSGFGDSKFSSGGEAWELLKSLIPCSSVHERNILADTARIVEKNKVCKRKDNLAREIVT 180
D + SS LKSL+ C++V ER++ T IVE+ + +D + V
Sbjct: 121 ------DEESSSSRRVLRSLKSLLLCANVSERDLETKTTEIVER----EVEDKSRLKNVV 180
Query: 181 NGLLALGYDASICKSHWEKSPS----YPAGDYEYVDVIIEGERLLIDIDFRSEFEIARST 240
+ L+ALGYDA+ICKS WEKS + PAGDYEY+DV I GER+LID DF+S+FEIARS+
Sbjct: 181 DELVALGYDAAICKSRWEKSKTKSYCVPAGDYEYLDVNIGGERVLIDFDFQSKFEIARSS 240
Query: 241 KSYKSILQLIPYIFVGKACRLQRIVSIVSEAAKQSLKKKGMPVPPWRKAEYVKAKWLSPH 297
++Y+SI + +PY+FVG+ RL ++V +S+AAK S +KKG+ +PPWR+AEY+ KW+S +
Sbjct: 241 ETYESISKTLPYVFVGQVDRLTKVVVFLSKAAKTSFRKKGLFMPPWRRAEYLLTKWVSQY 271
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
A0A6J1ESR4 | 6.6e-210 | 100.00 | uncharacterized protein LOC111437420 OS=Cucurbita moschata OX=3662 GN=LOC1114374...
A0A6J1K2U7 | 2.4e-204 | 97.60 | uncharacterized protein LOC111491206 OS=Cucurbita maxima OX=3661 GN=LOC111491206...
A0A6J1HG87 | 7.6e-174 | 84.80 | uncharacterized protein LOC111464003 OS=Cucurbita moschata OX=3662 GN=LOC1114640...
A0A6J1CY78 | 4.9e-173 | 82.12 | uncharacterized protein LOC111015299 OS=Momordica charantia OX=3673 GN=LOC111015...
A0A6J1HV35 | 7.1e-172 | 84.18 | uncharacterized protein LOC111466982 OS=Cucurbita maxima OX=3661 GN=LOC111466982...
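The identity percentages in the table can be reproduced from the counts on the "Identity = ..." lines of the HSPs above: identical residues divided by alignment length (including gap columns). A minimal Python sketch follows; the helper name `percent_identity` is hypothetical and not part of any BLAST tool.

```python
def percent_identity(identical: int, alignment_length: int) -> str:
    """Return identical/alignment_length as a percentage with two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100 * identical / alignment_length:.2f}"


# Counts taken from the HSP lines above.
print(percent_identity(374, 374))  # 100.00 (A0A6J1ESR4)
print(percent_identity(366, 375))  # 97.60  (A0A6J1K2U7)
print(percent_identity(317, 386))  # 82.12  (A0A6J1CY78)
```

Because the denominator is the alignment length rather than the query length, hits with more gaps (for example the 386-column alignment to A0A6J1CY78) can show a lower percentage than hits with a similar number of identical residues.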