Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS, three_prime_UTR
TCTCTTCTGTGTTTCAGAATTTCTAGGTTTTCTCTCTTTCTTTCTCCTTACTGTAACGATTTGGTGTTGTAATTGGTCGGTGGTCATGGCTGAGAGTTTAGATGACGGCGAGTTTTGGCTTCCTCCTCAGTTTCTTTCTGACGACGATGACGACCAAGAGAGTTCTGCTACTAATAAGAACAATGGCCGGAATTCGTTCGGTTCGACTGCTTTTCAGTCTGGACGGCCGTCGTTTCCGCTTGAGTTTGGGACTTTTGGTGGATTTTCGGATTTTAGTTCTACTGCTGAATCGTTGAAGGGTTCTAGTGAGACGGAGAGTGATCTGGAGGATCTTGTCGCTGGATTGACTCTTCATATGGCTCGCTCCACCGTTGATGATGGTCTTGATTCTGATAATGCTAAGGTTTCAACTCGAAGCTGATTTGATTTGATTTTTTATGGTTGCTCTGTTTCTTTTATGTCTAATCTTTAGATTTTTTTTTTTAGGAAAGAGTTTTGGCTGGATCTCCGCAGTCGACTCTTTGCGATATGGGGAGTGGTAGTGGCTGTAGTCAAGGTTCTAGTCGAGGAAGTCCTAAAGGTAATTGTAAGGTTCCATCTCCGCCGGCTACATGGGATCTGCTGCATGCGGCTGCAGGACAAGTAGCGAGGATGCGAATGACGGAGAATCATGGAGTTGTTCATCAAAATAGAGGAGCATCTCAAGTTTCTGTTCCGGTGAAGAACTCGAGCTCCGATACCGGGTTCTACCAGCAGTTGCAGGCTATGCAAGTAAATCTCTCAACTCCTTTATGCGATTTAAATTAATTTTTCAATTGAGATCTGTGTTTCTGTTGAATTAGCCGTCTATTTCGGTTCATTATTGGGTTCATTATTCTTTCGCAGTTTCAGCATCTACAACAGAAGGAAATTATGCAGCGACAAAACCCGACCGTCGCTGAGCAGATAAATTATCATGCAGGGTATCAACAACAACAGATCCATCAAATAGTTCAGAACGGAATGAGGAGCGGTCGGGGACTGTCGTCATCTCCATGGCGTCAACCGCCGCAGGGCGCCGGTACGAGGGCTCTGTTTGTTGGAACATCAGGCGGTGGCAAAAGGGAATGCGCTGGAACTGGTGTCTTTTTGCCTCGCCATACCGCCACTCATTCTCAACCGAGAAGAAAACCAGGTCCGTCCTCTACTTTCATCTTCCTAAAAGCTTAATTATGCTCATATGGGAACAGTTTGGTAAAGGAAGAGGGCATTAAATTCTTCAACCTCACCAATTTTACACGAATTTATGTTTGGTAATGAATTAAACTAAATCACGAGTTCCCTTTTCGACCCACAGATTGTCCTACGGTGTTGGTTCCTGCCAGAGTGATGCAAGCCCTGAATCTTAACCTCGAAGACATTTGCAGTCAACCCCATCTTCAACCCGTCGCCGCCGCACGATTCAATTCAGATAACGGTAAAATAGAATCCACCCTCTAATTGTTATGAACACAGGAATCAATCCCCGTCTGATTAGTTCTGTGTTTTGCTGTTGTAGATGTTCTGCTGAGGCTGCAATTCAACCGCAGTGCAAATTATCAAAAACGCAACAGCCGTCGGCAGGCAGCAACAGAGCATGAAATCAAGCTTCCCCAAGAATGGACTTACTGAAACTAAACCGTCACTACAGAAAGTTAGTTCGGGATCGTCACAGTACAGAATTGGTAAAGATGAAGATATTATGATATTTGTGGTTAGATTTAGAGAAGGAATAGAAAAAAATAAGTTAGAACAAGTTGTTTAGTTTATTTTTTGTTTGGTTGAGGGACTAAGGTCACTCAGTCAAAGTGAAGAACAAGGGTGGATCCAGAAAACGGAGTGTATTTTCCACCTCTTTTTGTGTCAATAAAAGAAATTCTAGATTCCATAAAAGCCCATGTGCTGCTGCATTGGGGCCTTGGGGGGCGCCTTTTTCAGACTCTCACAAGTCCACCCTCCCTCCCCATTTGCTGTATTCATTA
ATTTTGTATTACTATATGTTAATCATAGTGTTATTAGTTAAAGCATGGAATTTCGTTGGTTAATTGAAATTATGTGGATGGATAAGGAGTCTGATTGGGAGAGAATAATACTTTGCTTAAGATGGAA
mRNA sequence
TCTCTTCTGTGTTTCAGAATTTCTAGGTTTTCTCTCTTTCTTTCTCCTTACTGTAACGATTTGGTGTTGTAATTGGTCGGTGGTCATGGCTGAGAGTTTAGATGACGGCGAGTTTTGGCTTCCTCCTCAGTTTCTTTCTGACGACGATGACGACCAAGAGAGTTCTGCTACTAATAAGAACAATGGCCGGAATTCGTTCGGTTCGACTGCTTTTCAGTCTGGACGGCCGTCGTTTCCGCTTGAGTTTGGGACTTTTGGTGGATTTTCGGATTTTAGTTCTACTGCTGAATCGTTGAAGGGTTCTAGTGAGACGGAGAGTGATCTGGAGGATCTTGTCGCTGGATTGACTCTTCATATGGCTCGCTCCACCGTTGATGATGGTCTTGATTCTGATAATGCTAAGGAAAGAGTTTTGGCTGGATCTCCGCAGTCGACTCTTTGCGATATGGGGAGTGGTAGTGGCTGTAGTCAAGGTTCTAGTCGAGGAAGTCCTAAAGGTAATTGTAAGGTTCCATCTCCGCCGGCTACATGGGATCTGCTGCATGCGGCTGCAGGACAAGTAGCGAGGATGCGAATGACGGAGAATCATGGAGTTGTTCATCAAAATAGAGGAGCATCTCAAGTTTCTGTTCCGGTGAAGAACTCGAGCTCCGATACCGGGTTCTACCAGCAGTTGCAGGCTATGCAATTTCAGCATCTACAACAGAAGGAAATTATGCAGCGACAAAACCCGACCGTCGCTGAGCAGATAAATTATCATGCAGGGTATCAACAACAACAGATCCATCAAATAGTTCAGAACGGAATGAGGAGCGGTCGGGGACTGTCGTCATCTCCATGGCGTCAACCGCCGCAGGGCGCCGGTACGAGGGCTCTGTTTGTTGGAACATCAGGCGGTGGCAAAAGGGAATGCGCTGGAACTGGTGTCTTTTTGCCTCGCCATACCGCCACTCATTCTCAACCGAGAAGAAAACCAGATTGTCCTACGGTGTTGGTTCCTGCCAGAGTGATGCAAGCCCTGAATCTTAACCTCGAAGACATTTGCAGTCAACCCCATCTTCAACCCGTCGCCGCCGCACGATTCAATTCAGATAACGATGTTCTGCTGAGGCTGCAATTCAACCGCAGTGCAAATTATCAAAAACGCAACAGCCGTCGGCAGGCAGCAACAGAGCATGAAATCAAGCTTCCCCAAGAATGGACTTACTGAAACTAAACCGTCACTACAGAAAGTTAGTTCGGGATCGTCACAGTACAGAATTGGTAAAGATGAAGATATTATGATATTTGTGGTTAGATTTAGAGAAGGAATAGAAAAAAATAAGTTAGAACAAGTTGTTTAGTTTATTTTTTGTTTGGTTGAGGGACTAAGGTCACTCAGTCAAAGTGAAGAACAAGGGTGGATCCAGAAAACGGAGTGTATTTTCCACCTCTTTTTGTGTCAATAAAAGAAATTCTAGATTCCATAAAAGCCCATGTGCTGCTGCATTGGGGCCTTGGGGGGCGCCTTTTTCAGACTCTCACAAGTCCACCCTCCCTCCCCATTTGCTGTATTCATTAATTTTGTATTACTATATGTTAATCATAGTGTTATTAGTTAAAGCATGGAATTTCGTTGGTTAATTGAAATTATGTGGATGGATAAGGAGTCTGATTGGGAGAGAATAATACTTTGCTTAAGATGGAA
Coding sequence (CDS)
ATGGCTGAGAGTTTAGATGACGGCGAGTTTTGGCTTCCTCCTCAGTTTCTTTCTGACGACGATGACGACCAAGAGAGTTCTGCTACTAATAAGAACAATGGCCGGAATTCGTTCGGTTCGACTGCTTTTCAGTCTGGACGGCCGTCGTTTCCGCTTGAGTTTGGGACTTTTGGTGGATTTTCGGATTTTAGTTCTACTGCTGAATCGTTGAAGGGTTCTAGTGAGACGGAGAGTGATCTGGAGGATCTTGTCGCTGGATTGACTCTTCATATGGCTCGCTCCACCGTTGATGATGGTCTTGATTCTGATAATGCTAAGGAAAGAGTTTTGGCTGGATCTCCGCAGTCGACTCTTTGCGATATGGGGAGTGGTAGTGGCTGTAGTCAAGGTTCTAGTCGAGGAAGTCCTAAAGGTAATTGTAAGGTTCCATCTCCGCCGGCTACATGGGATCTGCTGCATGCGGCTGCAGGACAAGTAGCGAGGATGCGAATGACGGAGAATCATGGAGTTGTTCATCAAAATAGAGGAGCATCTCAAGTTTCTGTTCCGGTGAAGAACTCGAGCTCCGATACCGGGTTCTACCAGCAGTTGCAGGCTATGCAATTTCAGCATCTACAACAGAAGGAAATTATGCAGCGACAAAACCCGACCGTCGCTGAGCAGATAAATTATCATGCAGGGTATCAACAACAACAGATCCATCAAATAGTTCAGAACGGAATGAGGAGCGGTCGGGGACTGTCGTCATCTCCATGGCGTCAACCGCCGCAGGGCGCCGGTACGAGGGCTCTGTTTGTTGGAACATCAGGCGGTGGCAAAAGGGAATGCGCTGGAACTGGTGTCTTTTTGCCTCGCCATACCGCCACTCATTCTCAACCGAGAAGAAAACCAGATTGTCCTACGGTGTTGGTTCCTGCCAGAGTGATGCAAGCCCTGAATCTTAACCTCGAAGACATTTGCAGTCAACCCCATCTTCAACCCGTCGCCGCCGCACGATTCAATTCAGATAACGATGTTCTGCTGAGGCTGCAATTCAACCGCAGTGCAAATTATCAAAAACGCAACAGCCGTCGGCAGGCAGCAACAGAGCATGAAATCAAGCTTCCCCAAGAATGGACTTACTGA
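The CDS listed above appears as one contiguous stretch inside the mRNA sequence, so the two untranslated regions named in the legend can be recovered by locating it. The following is a minimal sketch of that idea; the function name `split_utrs` and the short toy sequences are illustrative assumptions, not part of the database record.

```python
from typing import Tuple


def split_utrs(mrna: str, cds: str) -> Tuple[str, str]:
    """Return (five_prime_UTR, three_prime_UTR) flanking the CDS in the mRNA.

    Assumes the CDS occurs as one contiguous substring of the mRNA and that
    its first occurrence is the real one.
    """
    start = mrna.find(cds)
    if start < 0:
        raise ValueError("CDS not found in mRNA")
    end = start + len(cds)
    return mrna[:start], mrna[end:]


# Toy data (hypothetical): 4 nt of 5' UTR, a 9 nt CDS, 4 nt of 3' UTR.
toy_mrna = "GGTC" + "ATGGCTTGA" + "AACT"
five_utr, three_utr = split_utrs(toy_mrna, "ATGGCTTGA")
print(len(five_utr), len(three_utr))
```

To apply it to this feature, pass the full mRNA and CDS strings from this page in place of the toy data.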
Protein sequence
MAESLDDGEFWLPPQFLSDDDDDQESSATNKNNGRNSFGSTAFQSGRPSFPLEFGTFGGFSDFSSTAESLKGSSETESDLEDLVAGLTLHMARSTVDDGLDSDNAKERVLAGSPQSTLCDMGSGSGCSQGSSRGSPKGNCKVPSPPATWDLLHAAAGQVARMRMTENHGVVHQNRGASQVSVPVKNSSSDTGFYQQLQAMQFQHLQQKEIMQRQNPTVAEQINYHAGYQQQQIHQIVQNGMRSGRGLSSSPWRQPPQGAGTRALFVGTSGGGKRECAGTGVFLPRHTATHSQPRRKPDCPTVLVPARVMQALNLNLEDICSQPHLQPVAAARFNSDNDVLLRLQFNRSANYQKRNSRRQAATEHEIKLPQEWTY
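The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which the page does not state explicitly; `translate` and `CODON_TABLE` are names chosen for illustration.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS shown above.
print(translate("ATGGCTGAGAGTTTAGATGAC"))  # MAESLDD
```

Running `translate` on the full CDS string should reproduce the protein sequence listed here if the standard-code assumption holds.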
Homology
BLAST of CmoCh06G002660 vs. ExPASy TrEMBL
Match:
A0A6J1FT27 (uncharacterized protein LOC111447045 OS=Cucurbita moschata OX=3662 GN=LOC111447045 PE=4 SV=1)
HSP 1 Score: 739.2 bits (1907), Expect = 8.6e-210
Identity = 374/374 (100.00%), Positives = 374/374 (100.00%), Query Frame = 0
Query: 1 MAESLDDGEFWLPPQFLSDDDDDQESSATNKNNGRNSFGSTAFQSGRPSFPLEFGTFGGF 60
MAESLDDGEFWLPPQFLSDDDDDQESSATNKNNGRNSFGSTAFQSGRPSFPLEFGTFGGF
Sbjct: 1 MAESLDDGEFWLPPQFLSDDDDDQESSATNKNNGRNSFGSTAFQSGRPSFPLEFGTFGGF 60
Query: 61 SDFSSTAESLKGSSETESDLEDLVAGLTLHMARSTVDDGLDSDNAKERVLAGSPQSTLCD 120
SDFSSTAESLKGSSETESDLEDLVAGLTLHMARSTVDDGLDSDNAKERVLAGSPQSTLCD
Sbjct: 61 SDFSSTAESLKGSSETESDLEDLVAGLTLHMARSTVDDGLDSDNAKERVLAGSPQSTLCD 120
Query: 121 MGSGSGCSQGSSRGSPKGNCKVPSPPATWDLLHAAAGQVARMRMTENHGVVHQNRGASQV 180
MGSGSGCSQGSSRGSPKGNCKVPSPPATWDLLHAAAGQVARMRMTENHGVVHQNRGASQV
Sbjct: 121 MGSGSGCSQGSSRGSPKGNCKVPSPPATWDLLHAAAGQVARMRMTENHGVVHQNRGASQV 180
Query: 181 SVPVKNSSSDTGFYQQLQAMQFQHLQQKEIMQRQNPTVAEQINYHAGYQQQQIHQIVQNG 240
SVPVKNSSSDTGFYQQLQAMQFQHLQQKEIMQRQNPTVAEQINYHAGYQQQQIHQIVQNG
Sbjct: 181 SVPVKNSSSDTGFYQQLQAMQFQHLQQKEIMQRQNPTVAEQINYHAGYQQQQIHQIVQNG 240
Query: 241 MRSGRGLSSSPWRQPPQGAGTRALFVGTSGGGKRECAGTGVFLPRHTATHSQPRRKPDCP 300
MRSGRGLSSSPWRQPPQGAGTRALFVGTSGGGKRECAGTGVFLPRHTATHSQPRRKPDCP
Sbjct: 241 MRSGRGLSSSPWRQPPQGAGTRALFVGTSGGGKRECAGTGVFLPRHTATHSQPRRKPDCP 300
Query: 301 TVLVPARVMQALNLNLEDICSQPHLQPVAAARFNSDNDVLLRLQFNRSANYQKRNSRRQA 360
TVLVPARVMQALNLNLEDICSQPHLQPVAAARFNSDNDVLLRLQFNRSANYQKRNSRRQA
Sbjct: 301 TVLVPARVMQALNLNLEDICSQPHLQPVAAARFNSDNDVLLRLQFNRSANYQKRNSRRQA 360
Query: 361 ATEHEIKLPQEWTY 375
ATEHEIKLPQEWTY
Sbjct: 361 ATEHEIKLPQEWTY 374
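Each HSP line reports a bit score followed by the raw score in parentheses, for example 739.2 bits (1907). The two are related by the Karlin-Altschul normalisation S' = (λS − ln K) / ln 2. The sketch below assumes the commonly used gapped BLOSUM62 parameters (gap open 11, gap extend 1: λ = 0.267, K = 0.041); the page does not state which matrix or gap costs were used, so these values are an assumption that merely happens to reproduce the reported numbers.

```python
import math

# Assumed Karlin-Altschul parameters (gapped BLOSUM62, gap costs 11/1).
LAMBDA = 0.267
K = 0.041


def bit_score(raw_score: int, lam: float = LAMBDA, k: float = K) -> float:
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores taken from the HSP lines on this page.
for raw in (1907, 1790, 274):
    print(raw, round(bit_score(raw), 1))
```

Under these assumed parameters, raw scores 1907 and 1790 give 739.2 and 694.1 bits, matching the first two TrEMBL hits.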
BLAST of CmoCh06G002660 vs. ExPASy TrEMBL
Match:
A0A6J1I5F7 (uncharacterized protein LOC111470084 OS=Cucurbita maxima OX=3661 GN=LOC111470084 PE=4 SV=1)
HSP 1 Score: 694.1 bits (1790), Expect = 3.2e-196
Identity = 355/374 (94.92%), Positives = 358/374 (95.72%), Query Frame = 0
Query: 1 MAESLDDGEFWLPPQFLSDDDDDQESSATNKNNGRNSFGSTAFQSGRPSFPLEFGTFGGF 60
MAESLDDGEFWLPPQFLSDD DDQES ATNKNNGRNS STAFQSGRPSFPLEFGTFGGF
Sbjct: 1 MAESLDDGEFWLPPQFLSDDHDDQESCATNKNNGRNSLCSTAFQSGRPSFPLEFGTFGGF 60
Query: 61 SDFSSTAESLKGSSETESDLEDLVAGLTLHMARSTVDDGLDSDNAKERVLAGSPQSTLCD 120
SDFSST ESLKGSSETESDLEDLVAGLTLHMARSTVDDGLDSDNAKE VLAGSPQSTLCD
Sbjct: 61 SDFSSTTESLKGSSETESDLEDLVAGLTLHMARSTVDDGLDSDNAKEIVLAGSPQSTLCD 120
Query: 121 MGSGSGCSQGSSRGSPKGNCKVPSPPATWDLLHAAAGQVARMRMTENHGVVHQNRGASQV 180
MGSGSGCSQGSSRGSPKGNCKVPSPPATWDLLHAAAGQVARMRMTENHGVVHQNRGA QV
Sbjct: 121 MGSGSGCSQGSSRGSPKGNCKVPSPPATWDLLHAAAGQVARMRMTENHGVVHQNRGAPQV 180
Query: 181 SVPVKNSSSDTGFYQQLQAMQFQHLQQKEIMQRQNPTVAEQINYHAGYQQQQIHQIVQNG 240
S+PVKNSSSD GFYQQLQAMQFQHLQQKEIMQRQN TVAEQINYHAGYQQQQIHQIV NG
Sbjct: 181 SLPVKNSSSDNGFYQQLQAMQFQHLQQKEIMQRQNLTVAEQINYHAGYQQQQIHQIVHNG 240
Query: 241 MRSGRGLSSSPWRQPPQGAGTRALFVGTSGGGKRECAGTGVFLPRHTATHSQPRRKPDCP 300
M SGRGLSSSPWRQPPQG+GTR+LFVGT GGKRECAGTGVFLPRHTATHSQ RRKPDCP
Sbjct: 241 MMSGRGLSSSPWRQPPQGSGTRSLFVGTP-GGKRECAGTGVFLPRHTATHSQQRRKPDCP 300
Query: 301 TVLVPARVMQALNLNLEDICSQPHLQPVAAARFNSDNDVLLRLQFNRSANYQKRNSRRQA 360
TVLVPARVMQALNLNLEDICSQPHLQPVAA RFNSDNDVLLRLQFNR ANYQKRNSRRQA
Sbjct: 301 TVLVPARVMQALNLNLEDICSQPHLQPVAAGRFNSDNDVLLRLQFNRGANYQKRNSRRQA 360
Query: 361 ATEHEIKLPQEWTY 375
ATEHEIKLPQEWTY
Sbjct: 361 ATEHEIKLPQEWTY 373
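The Identity figure in each HSP header is the number of alignment columns where query and subject carry the same residue, divided by the alignment length (gap columns included). A minimal sketch of that calculation follows; `identity_stats` and `percent` are illustrative names, and the two strings are the first alignment block of the hit above.

```python
from typing import Tuple


def identity_stats(query: str, sbjct: str) -> Tuple[int, int]:
    """Return (identical columns, alignment length) for two aligned rows."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have equal length")
    identical = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    return identical, len(query)


def percent(count: int, total: int) -> float:
    """Percentage rounded to two decimals, as printed in the HSP header."""
    return round(100 * count / total, 2)


# First 60 columns of the alignment against A0A6J1I5F7.
query = "MAESLDDGEFWLPPQFLSDDDDDQESSATNKNNGRNSFGSTAFQSGRPSFPLEFGTFGGF"
sbjct = "MAESLDDGEFWLPPQFLSDDHDDQESCATNKNNGRNSLCSTAFQSGRPSFPLEFGTFGGF"
print(identity_stats(query, sbjct))

# Whole-HSP totals as reported in the header line.
print(percent(355, 374), percent(358, 374))
```

Positives are counted the same way, except that a column also counts when the substitution scores above zero in the scoring matrix, which this sketch does not attempt to reproduce.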
BLAST of CmoCh06G002660 vs. ExPASy TrEMBL
Match:
A0A6J1F0R5 (uncharacterized protein LOC111441333 OS=Cucurbita moschata OX=3662 GN=LOC111441333 PE=4 SV=1)
HSP 1 Score: 535.8 bits (1379), Expect = 1.4e-148
Identity = 280/377 (74.27%), Positives = 311/377 (82.49%), Query Frame = 0
Query: 1 MAESLDDGEFWLPPQFLSDDDD--DQESSATNKNNGRNSFGSTAFQSGRPSFPLEFGTFG 60
MA+SLDDGEFWLPPQFL+DDD+ + ATNKN+ N ST F RPSFP EFGTFG
Sbjct: 1 MADSLDDGEFWLPPQFLADDDNMPSPNTCATNKNSALNCLASTPFPPPRPSFPFEFGTFG 60
Query: 61 GFSDFSSTAESLKGSSETESDLEDLVAGLTLHMARSTVDDGLDSDNAKERVLAGSPQSTL 120
GFSDFSS ESLKGSSETESD ED +AG TL MA+ST+DDG DS N+K +L+GSPQSTL
Sbjct: 61 GFSDFSSPGESLKGSSETESDEEDCLAGFTLRMAQSTIDDGFDSYNSKTSLLSGSPQSTL 120
Query: 121 CDMGSGSGCSQGSSRGSPKGNCKVPSPPATWDLLHAAAGQVARMRMTENHGVVHQNRGAS 180
CDMGSGSGCSQ SSRGSPK NCKVPSPPAT DLLHAAAG+VARMRM E+ GV+ QNRG S
Sbjct: 121 CDMGSGSGCSQVSSRGSPKSNCKVPSPPATCDLLHAAAGEVARMRMNESRGVISQNRGTS 180
Query: 181 QVSVPVKNSSSDTGFYQQLQAMQFQHLQQKEIMQRQNPTVAEQINYHAGYQQQQIHQIVQ 240
QVSVPVKNSS+ TGFYQQLQAMQ HL QKEIMQRQN TV EQ+N AGYQ+QQ+H +VQ
Sbjct: 181 QVSVPVKNSSTGTGFYQQLQAMQVTHLNQKEIMQRQNLTVGEQMNSPAGYQRQQVHVMVQ 240
Query: 241 NGMRSGRGLSSSPWRQPPQGAGTRALFVGTSGGGKRECAGTGVFLPRHTATHS-QPRRKP 300
NG+R RGLSSS W PPQG+GTRALF+GT G KRECAGTGVFLPRH++T S + RRKP
Sbjct: 241 NGVRGCRGLSSSAWIPPPQGSGTRALFLGTQ-GSKRECAGTGVFLPRHSSTQSDEQRRKP 300
Query: 301 DCPTVLVPARVMQALNLNLEDICSQPHLQPVAAARFNSDNDVLLRLQFNRSANYQKRNSR 360
C TVLVPARVMQALNLNLEDICSQPH+QP+ R NS+NDV+LRLQ NR NY+KRN R
Sbjct: 301 ACSTVLVPARVMQALNLNLEDICSQPHVQPMGGGRLNSENDVVLRLQINRGGNYEKRNGR 360
Query: 361 RQAATEHEIKLPQEWTY 375
Q T+HEIKLPQEWTY
Sbjct: 361 GQLPTDHEIKLPQEWTY 376
BLAST of CmoCh06G002660 vs. ExPASy TrEMBL
Match:
A0A6J1J2V3 (uncharacterized protein LOC111482180 OS=Cucurbita maxima OX=3661 GN=LOC111482180 PE=4 SV=1)
HSP 1 Score: 529.6 bits (1363), Expect = 1.0e-146
Identity = 276/376 (73.40%), Positives = 310/376 (82.45%), Query Frame = 0
Query: 1 MAESLDDGEFWLPPQFLSDDDD--DQESSATNKNNGRNSFGSTAFQSGRPSFPLEFGTFG 60
MA+SLDDGEFWLPPQFL+DDD+ Q + ATNKN+ N GST F RP FP EFGT G
Sbjct: 1 MADSLDDGEFWLPPQFLADDDNMPSQNTCATNKNSHLNCLGSTPFLPQRPFFPFEFGTSG 60
Query: 61 GFSDFSSTAESLKGSSETESDLEDLVAGLTLHMARSTVDDGLDSDNAKERVLAGSPQSTL 120
GFSDFSS ESLKGSSETESD ED +A +TL MA+ST+DDG DS N+K +L+GSPQSTL
Sbjct: 61 GFSDFSSPGESLKGSSETESDEEDCLAEITLRMAQSTIDDGFDSYNSKASLLSGSPQSTL 120
Query: 121 CDMGSGSGCSQGSSRGSPKGNCKVPSPPATWDLLHAAAGQVARMRMTENHGVVHQNRGAS 180
CDMGSGSGCSQ SSRGSPK NCKVPSPPAT DLLHAAAG+VARM+M E+ GV+ QNRG S
Sbjct: 121 CDMGSGSGCSQVSSRGSPKSNCKVPSPPATCDLLHAAAGEVARMQMNESRGVISQNRGTS 180
Query: 181 QVSVPVKNSSSDTGFYQQLQAMQFQHLQQKEIMQRQNPTVAEQINYHAGYQQQQIHQIVQ 240
QVSVPVKNSS+ TGFYQQLQA+Q HL QKEIMQRQN TV EQ+N A YQ+QQ+H +VQ
Sbjct: 181 QVSVPVKNSSTGTGFYQQLQAVQVSHLNQKEIMQRQNLTVGEQMNSPAAYQRQQVHVMVQ 240
Query: 241 NGMRSGRGLSSSPWRQPPQGAGTRALFVGTSGGGKRECAGTGVFLPRHTATHSQPRRKPD 300
NG+R RGLSSS W PPQG+GTRALF+G + GGKRECAGTGVFLPRH++T S+ RRKP
Sbjct: 241 NGVRGCRGLSSSAWIPPPQGSGTRALFLG-ARGGKRECAGTGVFLPRHSSTQSEQRRKPA 300
Query: 301 CPTVLVPARVMQALNLNLEDICSQPHLQPVAAARFNSDNDVLLRLQFNRSANYQKRNSRR 360
C TVLVPARVMQALNLNLEDICSQPH+QP+ R NS+ DVLLRLQ NR NY+KRN R
Sbjct: 301 CSTVLVPARVMQALNLNLEDICSQPHVQPMGGGRLNSEKDVLLRLQINRGGNYEKRNGRG 360
Query: 361 QAATEHEIKLPQEWTY 375
Q T+HEIKLPQEWTY
Sbjct: 361 QLPTDHEIKLPQEWTY 375
BLAST of CmoCh06G002660 vs. ExPASy TrEMBL
Match:
A0A6J1CWM5 (uncharacterized protein LOC111015081 OS=Momordica charantia OX=3673 GN=LOC111015081 PE=4 SV=1)
HSP 1 Score: 527.3 bits (1357), Expect = 5.1e-146
Identity = 284/390 (72.82%), Positives = 311/390 (79.74%), Query Frame = 0
Query: 1 MAESLDDGEFWLPPQFLSDDDD--DQESSATNKNNGRNSFGSTAFQSGRPSFPLEFGTFG 60
MAESLDDGEFWLPPQFL+DDD+ D +S T K++ RN Q GR SFPLEFGTFG
Sbjct: 2 MAESLDDGEFWLPPQFLADDDNMLDHKSCNTRKSSDRNCLS----QFGRASFPLEFGTFG 61
Query: 61 GFSDFSSTAESLKGSSETESDLEDLVAGLTLHMARSTVDDGLDSDNAKERVLAGSPQSTL 120
GFSDFSS ESLKGSSETESD ED +AGLTL MA S+VDD D DNAK RV++GSPQSTL
Sbjct: 62 GFSDFSSPGESLKGSSETESDEEDYMAGLTLRMAHSSVDDAFDPDNAKGRVVSGSPQSTL 121
Query: 121 CDMGSGSGCSQGSSRGSPKGNCKVPSPPATWDLLHAAAGQVARMRMTE-NHGVVHQNRGA 180
C MGSGSGCSQGSSRGSPKGNCKV SPPATWDLLHAAAG+VARMR + HGV H NRG
Sbjct: 122 CGMGSGSGCSQGSSRGSPKGNCKVSSPPATWDLLHAAAGEVARMRTNDCAHGVHHHNRGT 181
Query: 181 SQVSVPVKNSSSDTGFYQQLQAMQFQHLQQKEIMQRQNPTVAEQINYHAGYQQQQIHQIV 240
QVSVPV NS++ TGFYQQL AMQF Q+EIMQRQN EQ+N HAGYQQQQIHQ+V
Sbjct: 182 PQVSVPVNNSNAGTGFYQQLHAMQF----QQEIMQRQNLAAGEQMNSHAGYQQQQIHQMV 241
Query: 241 QNGMRSG----------RGLSSSPW---RQPPQGAGTRALFVGTSGGGKRECAGTGVFLP 300
NG+R G RGLSSS W +QPPQG+G RALF+GT GKRECAGTGVFLP
Sbjct: 242 HNGVRGGEFVGNRDHRSRGLSSSAWLPPQQPPQGSGMRALFLGTPPAGKRECAGTGVFLP 301
Query: 301 RHTATHSQPRRKPDCPTVLVPARVMQALNLNLEDICSQPHLQPVAAARFNSDNDVLLRLQ 360
RHT T S+PR+KP C TVLVPARVMQALNLNL+DICSQPHLQPVAA RFN++NDVLLRLQ
Sbjct: 302 RHTGTQSEPRKKPGCSTVLVPARVMQALNLNLDDICSQPHLQPVAAGRFNAENDVLLRLQ 361
Query: 361 FNRSANYQKRNSRRQAATEHEIKLPQEWTY 375
NR AN QKRNSRRQ T+ E+KLPQEWTY
Sbjct: 362 INRGANQQKRNSRRQPPTDRELKLPQEWTY 383
BLAST of CmoCh06G002660 vs. TAIR 10
Match:
AT3G54000.1 (CONTAINS InterPro DOMAIN/s: Uncharacterised conserved protein UCP022260 (InterPro:IPR016802); Has 94 Blast hits to 94 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 94; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 110.2 bits (274), Expect = 3.8e-24
Identity = 128/395 (32.41%), Positives = 178/395 (45.06%), Query Frame = 0
Query: 5 LDDGEFWLPPQFLSDDDDDQESSATNKNNGRNSFGSTAFQSGRPSFPLE-FGTFGGFSDF 64
+DD EFWLP +FL+DDD K N + F P P FGTFG
Sbjct: 8 VDDAEFWLPTEFLTDDD-----FLVEKENNSVGIDDSLF----PYEPRHGFGTFG----- 67
Query: 65 SSTAESLKGSSETESDLEDLVAGLTLHMARSTVDDGLDSDNAKERVLAGSPQSTLCDMG- 124
++K ++ E D E +AGLT M S++ D +M
Sbjct: 68 ----STVKPNTVKEDDEESFLAGLTRQMVMSSLKDDFSGGVCGNHAFPAGNDHKAWEMNR 127
Query: 125 -----SGSGCSQGSSRGSPKGNCKVPSPPATWDLLHAAAGQVARMRMTENHGVVHQNRG- 184
+G+GC + R + N +V S WDL AA RM+ N H RG
Sbjct: 128 SPPCVAGTGCCCLNQRFNQNLNSRVSS----WDLYCAAE------RMSINDEPYHSGRGL 187
Query: 185 ---ASQVSVPVKN-SSSDTGF--------YQQLQAMQFQHLQQKEIMQRQNPTVAEQINY 244
+++S VKN S++ TG+ YQ+LQA+QFQ L+Q+++M+ + V
Sbjct: 188 LGSPAKLSATVKNHSNNGTGYYNNHQSLQYQKLQAIQFQQLKQQQLMKHRRQLV------ 247
Query: 245 HAGYQQQQIHQIVQNGMRSGRGLSSSPW-RQPPQGAGTRALFVGTSGGGKRECAGTGVFL 304
+Q + ++ N LSSS W Q P+ RA+F+G GKR GTGVFL
Sbjct: 248 ----RQNRGVRVNGNKNVGPVDLSSSAWSNQFPRRDVMRAVFIG-DHTGKRGSTGTGVFL 307
Query: 305 PR---HTATHSQPRRKPDCPTVLVPARVMQALNLNLEDICSQPHLQPVAAARFNSDNDVL 364
PR HT + ++ R KP TVLVPAR+ Q LNLNL +PV + + NDV
Sbjct: 308 PRSVNHT-SRTETREKPTISTVLVPARLAQVLNLNLG--------EPVRSTA--TLNDVS 352
Query: 365 LRLQFNRSA-NYQKRNSRRQAATEHEIKLPQEWTY 375
R + N + Q R + E +LP EW Y
Sbjct: 368 WRQRSNNGGFSSQMVGGVRAEQSVQEPRLPSEWAY 352
BLAST of CmoCh06G002660 vs. TAIR 10
Match:
AT3G54000.2 (unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )
HSP 1 Score: 82.0 bits (201), Expect = 1.1e-15
Identity = 100/317 (31.55%), Positives = 142/317 (44.79%), Query Frame = 0
Query: 5 LDDGEFWLPPQFLSDDDDDQESSATNKNNGRNSFGSTAFQSGRPSFPLE-FGTFGGFSDF 64
+DD EFWLP +FL+DDD K N + F P P FGTFG
Sbjct: 8 VDDAEFWLPTEFLTDDD-----FLVEKENNSVGIDDSLF----PYEPRHGFGTFG----- 67
Query: 65 SSTAESLKGSSETESDLEDLVAGLTLHMARSTVDDGLDSDNAKERVLAGSPQSTLCDMG- 124
++K ++ E D E +AGLT M S++ D +M
Sbjct: 68 ----STVKPNTVKEDDEESFLAGLTRQMVMSSLKDDFSGGVCGNHAFPAGNDHKAWEMNR 127
Query: 125 -----SGSGCSQGSSRGSPKGNCKVPSPPATWDLLHAAAGQVARMRMTENHGVVHQNRG- 184
+G+GC + R + N +V S WDL AA RM+ N H RG
Sbjct: 128 SPPCVAGTGCCCLNQRFNQNLNSRVSS----WDLYCAAE------RMSINDEPYHSGRGL 187
Query: 185 ---ASQVSVPVKN-SSSDTGF--------YQQLQAMQFQHLQQKEIMQRQNPTVAEQINY 244
+++S VKN S++ TG+ YQ+LQA+QFQ L+Q+++M+ + V
Sbjct: 188 LGSPAKLSATVKNHSNNGTGYYNNHQSLQYQKLQAIQFQQLKQQQLMKHRRQLV------ 247
Query: 245 HAGYQQQQIHQIVQNGMRSGRGLSSSPW-RQPPQGAGTRALFVGTSGGGKRECAGTGVFL 298
+Q + ++ N LSSS W Q P+ RA+F+G GKR GTGVFL
Sbjct: 248 ----RQNRGVRVNGNKNVGPVDLSSSAWSNQFPRRDVMRAVFIG-DHTGKRGSTGTGVFL 284
BLAST of CmoCh06G002660 vs. TAIR 10
Match:
AT3G54000.3 (unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )
HSP 1 Score: 82.0 bits (201), Expect = 1.1e-15
Identity = 100/317 (31.55%), Positives = 142/317 (44.79%), Query Frame = 0
Query: 5 LDDGEFWLPPQFLSDDDDDQESSATNKNNGRNSFGSTAFQSGRPSFPLE-FGTFGGFSDF 64
+DD EFWLP +FL+DDD K N + F P P FGTFG
Sbjct: 8 VDDAEFWLPTEFLTDDD-----FLVEKENNSVGIDDSLF----PYEPRHGFGTFG----- 67
Query: 65 SSTAESLKGSSETESDLEDLVAGLTLHMARSTVDDGLDSDNAKERVLAGSPQSTLCDMG- 124
++K ++ E D E +AGLT M S++ D +M
Sbjct: 68 ----STVKPNTVKEDDEESFLAGLTRQMVMSSLKDDFSGGVCGNHAFPAGNDHKAWEMNR 127
Query: 125 -----SGSGCSQGSSRGSPKGNCKVPSPPATWDLLHAAAGQVARMRMTENHGVVHQNRG- 184
+G+GC + R + N +V S WDL AA RM+ N H RG
Sbjct: 128 SPPCVAGTGCCCLNQRFNQNLNSRVSS----WDLYCAAE------RMSINDEPYHSGRGL 187
Query: 185 ---ASQVSVPVKN-SSSDTGF--------YQQLQAMQFQHLQQKEIMQRQNPTVAEQINY 244
+++S VKN S++ TG+ YQ+LQA+QFQ L+Q+++M+ + V
Sbjct: 188 LGSPAKLSATVKNHSNNGTGYYNNHQSLQYQKLQAIQFQQLKQQQLMKHRRQLV------ 247
Query: 245 HAGYQQQQIHQIVQNGMRSGRGLSSSPW-RQPPQGAGTRALFVGTSGGGKRECAGTGVFL 298
+Q + ++ N LSSS W Q P+ RA+F+G GKR GTGVFL
Sbjct: 248 ----RQNRGVRVNGNKNVGPVDLSSSAWSNQFPRRDVMRAVFIG-DHTGKRGSTGTGVFL 284
BLAST of CmoCh06G002660 vs. TAIR 10
Match:
AT5G59050.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G54000.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )
HSP 1 Score: 60.5 bits (145), Expect = 3.4e-09
Identity = 92/332 (27.71%), Positives = 144/332 (43.37%), Query Frame = 0
Query: 64 SSTAESLKGSSETESDLEDLVAGLTLHMARSTVDDGLDSDNAKE-RVLAGSPQSTLCDMG 123
S +S K +E E D + + LT M + D D + K +GSPQSTL
Sbjct: 38 SDEPDSPKAKNEDEED--EYITELTRQMTNYMLQD--DEKHQKSCGSGSGSPQSTLWS-- 97
Query: 124 SGSGCSQGSSRGSPKGNCKVPSPPATWDLLHAAAGQVARMRMTENHGVVHQNRGASQVSV 183
S SP G + PSPP T A V ++ MT+ V++
Sbjct: 98 -----PFASGLSSPIGPSREPSPPLT-----PATVPVEKI-MTK--------IDTKPVTI 157
Query: 184 PVKNSSSDTGFYQQLQAMQ--FQHLQQKEIMQRQNPTVAEQINYHAGYQQQQIHQIVQNG 243
P + S Q++++Q FQ +++++ +RQ G++ + H + QN
Sbjct: 158 PFQ--SKQALIDDQIRSIQANFQKIKKEKEKERQRNADV------LGHKARNYHHLHQN- 217
Query: 244 MRSGRGLSSSPWRQPPQGAGTRALFVGTSGGGKRECAGTGVFLPRHTATHSQPRRKPDCP 303
Q P+ +G +A+FV S G + GTGVFLPR T + R+K C
Sbjct: 218 -------------QRPR-SGVKAVFVDGS-GSRTGSGGTGVFLPRGHGTVVESRKKSGCS 277
Query: 304 TVLVPARVMQALNLNLEDICSQPHLQPVAAARFNSD----NDVLLRLQFNRSANYQKRN- 363
TV++PARV++AL ++ + + + F+SD +D LL N+ K
Sbjct: 278 TVIIPARVVEALKVHFDKL--------GVPSTFSSDIPPFHDALLVSMNNKKIKSNKNTS 312
Query: 364 -SRRQAATEHEIK------------LPQEWTY 375
SR Q+ + +E++ LPQEWTY
Sbjct: 338 LSRVQSGSPYEMEMSAESHQEPPADLPQEWTY 312
BLAST of CmoCh06G002660 vs. TAIR 10
Match:
AT2G39870.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G55690.1); Has 73 Blast hits to 71 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 2; Fungi - 2; Plants - 69; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 56.6 bits (135), Expect = 4.9e-08
Identity = 95/343 (27.70%), Positives = 144/343 (41.98%), Query Frame = 0
Query: 50 FPLEFGTFGGFSDFSSTAESLKGSSETESDLEDLVAGLTLHMARST--VDDGLDSDNAKE 109
FP EF + F+S +S + E+ D ED +AGLT +A ST + L K
Sbjct: 35 FPYEFDSPSFSPGFTSPGDSTETEDESSDDEEDFLAGLTRRLAPSTQRLPSPLFKSEEKR 94
Query: 110 RVLAGSPQSTLCDMGSGSGCSQGSSRGSPKGNCKVPSPPA---------TWDLLHAAAGQ 169
+V A SPQSTL SG G S SP +PSPPA WD++ AAAG+
Sbjct: 95 QVAATSPQSTL----SGLGSFSNSGSRSP----ILPSPPAPTSSFRRDNAWDVISAAAGE 154
Query: 170 VARMRM--TENHGVVHQNRGASQVSVPVKNSSSDTGFYQQLQAMQFQHLQQKEIMQRQNP 229
VAR+++ E H +P++ S ++ A LQ + +++ Q
Sbjct: 155 VARLKLGSYEPH------------HLPLQTPES---LLRRQNAAIHAELQHQRLIE-QMW 214
Query: 230 TVAEQINYHAGYQQQQIHQIVQNGMRSGRGL--SSSPWRQPPQGAGTRALFVGTSGGGKR 289
+ Q + + + + G+ ++P PPQ A KR
Sbjct: 215 LCSAQSRFKLSENRIPRRVVNEEGLFENPRYVRRNNPTWLPPQQAAAPL---------KR 274
Query: 290 ECAGTGVFLPRH--TATHSQPRRKP-DCPTVLVPARVMQALNLNLEDICSQPHLQPVAAA 349
AGTGVFLPR +A S + P + P +L P ++ NLN ++ + V
Sbjct: 275 PSAGTGVFLPRRYPSAAPSDSLKTPVNTPAMLQPK--VKPQNLNFDEFTN-----IVGPR 330
Query: 350 RFNSDNDVLLRLQFNRSANYQKRNSRRQAATEHEIKLPQEWTY 375
R D + +L RS ++ + R + LPQ+W Y
Sbjct: 335 RSQFDYECMLA----RSTVLARQGNFRAVSGG---GLPQDWMY 330
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
A0A6J1FT27 | 8.6e-210 | 100.00 | uncharacterized protein LOC111447045 OS=Cucurbita moschata OX=3662 GN=LOC1114470...
A0A6J1I5F7 | 3.2e-196 | 94.92 | uncharacterized protein LOC111470084 OS=Cucurbita maxima OX=3661 GN=LOC111470084...
A0A6J1F0R5 | 1.4e-148 | 74.27 | uncharacterized protein LOC111441333 OS=Cucurbita moschata OX=3662 GN=LOC1114413...
A0A6J1J2V3 | 1.0e-146 | 73.40 | uncharacterized protein LOC111482180 OS=Cucurbita maxima OX=3661 GN=LOC111482180...
A0A6J1CWM5 | 5.1e-146 | 72.82 | uncharacterized protein LOC111015081 OS=Momordica charantia OX=3673 GN=LOC111015...
Match Name | E-value | Identity (%) | Description
AT3G54000.1 | 3.8e-24 | 32.41 | CONTAINS InterPro DOMAIN/s: Uncharacterised conserved protein UCP022260 (InterPr...
AT3G54000.2 | 1.1e-15 | 31.55 | unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae ...
AT3G54000.3 | 1.1e-15 | 31.55 | unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae ...
AT5G59050.1 | 3.4e-09 | 27.71 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G39870.1 | 4.9e-08 | 27.70 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
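The summary rows above are pipe-delimited, so they can be loaded into records and filtered by E-value. The sketch below is one way to do that; the `Hit` type, `parse_row`, and the 1e-10 cutoff are illustrative assumptions rather than anything the database prescribes. Empty cells and "[more]" link cells are ignored if present.

```python
from typing import List, NamedTuple


class Hit(NamedTuple):
    name: str
    evalue: float
    identity: float  # percent identity
    description: str


def parse_row(row: str) -> Hit:
    """Parse one 'name | e-value | identity | description' row."""
    cells = [cell.strip() for cell in row.split("|")]
    cells = [cell for cell in cells if cell and cell != "[more]"]
    if len(cells) < 4:
        raise ValueError("expected at least four cells: " + row)
    return Hit(cells[0], float(cells[1]), float(cells[2]), " | ".join(cells[3:]))


def significant(hits: List[Hit], max_evalue: float = 1e-10) -> List[Hit]:
    """Keep hits at or below the (assumed) E-value cutoff."""
    return [hit for hit in hits if hit.evalue <= max_evalue]


rows = [
    "A0A6J1FT27 | 8.6e-210 | 100.00 | uncharacterized protein LOC111447045 OS=Cucurbita moschata OX=3662 GN=LOC1114470...",
    "AT3G54000.1 | 3.8e-24 | 32.41 | CONTAINS InterPro DOMAIN/s: Uncharacterised conserved protein UCP022260 (InterPr...",
    "AT5G59050.1 | 3.4e-09 | 27.71 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...",
]
hits = [parse_row(row) for row in rows]
print([hit.name for hit in significant(hits)])
```

The cutoff is a judgment call; a stricter or looser value changes which of the weaker TAIR 10 matches are retained.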