Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, polypeptide, CDS, three_prime_UTR
CACAGATCGAGTAGCTTATCTTCTTCTCCGCCACTCAAAGCCGCTCTCTGAAAACCCACACTGACCCTCTCTGCTTATACAAACCACACAACAATGTCACTTCAAGCTCACCTGGTTCATTTCCTACTCAGCCATGGCCGCCGCAATTTCCTTCACAATCTCTAGCTCTTCTCTCTGGTCTCGACTTTCTAAACCCGCTTCTAAATTTCGCGATCGCCGTCTGAGAATCATGGCCGCGGCTACCGGAACTAAGGTGGTACCCGCTGTGATAGTCGGAAGTGGGCGCGTTGGGAGGGCGCTGCTGGACATGGGAAATGGTGAAGATTTTTTGGTGAAGAGAGGCGAGTCCGTGCCCCTTGATTTTCCGGGTCCTATCCTTGTTTGTACCAGAAATGATGATCTTGAAGCTGTTCTTGAATCCACTCCTCGATCGAGATGGAACGGTATGATCATCGAATCGTGTTATTGTCTTTATTTTCTCGCTTATTTTGAATCGAGACGCATTTTATTGGCGACGATATGCTTCACTCCCATCTCGTTTTATGGGGTTTGAATTCGAAGTTTTGGCATTATGGGCTTTCTTGTTTTCATTGCCTATCCCATAAACTCAGCTTCAGCTCAAGTTTTTTGGTACAAGTTTACCAAATTTCATTTCATTTTCTTTCCAATTGGTATTATAGTTCACGTCTGGACTTCGTGACTCAATTTAATATCCAAGCTTGCATACCATATGCAATCAACTCACATTTCCTGTGAGATTCCATATTTCATTTCCTTTCCAAGTGTGTGGAAACCTCCCTAGCAAACACTTTTTAAAATCTTGAGGAAAAACTGGAAAGGAAAAGTTCAAAGAGAATGATATTTGCGAGCGGTGGGTCTGTTACAAAGGGTTTCAAAGCTAGACACCGAGCACACCAACGAGGATGTTTCCCGAGGGGTGGATTGTGAGATCCCACATTGGTTGGAGAGGAAATGAAAGATTCTTTACAAGGGTGCGGAAACCTCTCCCTAACACTCTTCTAGCGTTTTAAAGCCTTCAAGGAGAATCCGGAAGTGAAAGCTCAAAGAGGATTGCATTTCTAATGCAACATTTGAATTTAGGAAGAAATTAGGGTATTCCCATTTGGAATATCTCATCATATAATGGCTTGTGCTGTATTGCTTTGGCAATGGATATTGACAATCTAGCAGATTTGGTTTTTTTCCAGAATGGAATGCTGGACCCTTGGTATGAAAGCAAAGGTCTAAATGATGTGAATCAAGTGTTAGCATATTTCGCTATCTCAAAGCTGGGAGAGGCTCCTGTGGATGGAATAACGGATACCAATCCTGAAGGACTGACAGCAGCGTATGGAAAATGGGCATCTGCCGTAGCTGGAAGGCTCAATGCTGCAGGCCTCTCCTGCAAGGTAAATCCACAAAAGAAATGTGCCTCTTGTGTGTAAACTGGGAAGTAAACAATGAGTTGTTTCATGAATGTTCTTTAGAGAAATATGTGTCATTTATGATATTCTTTTATTCCATACACACATTAACTTTAAAGGTTATGTTTGTAAGCTAGGTTCTCGATAAGGAAGCATTTGAGAAACAAATGTTGGAGAAGCTGATTTGGATTTCTGCATTTATGCTCGTTGGAGCACGTCATCCAGGTGCGACCGTGGGTGCTGTGGAAAAGGATTATCGCTCTGAAGTAAGATACCTTCTTCTTCATGCAATAAGTTTACTGCATAATTCTTCAACTATGTAGGTACATTATGGAATCATTCTTTATAAGGGTGTAGAAACCTCTCCCTATCAGATGCGTTTTAAAAACATTGAGGGGAAAGCCTAAAGAGGACAATATATGCTAGTGGTGGACTTGGGCCGTTACATAGACAATCTAATCCGAATTTATAATTGCAGGTTTCTAGCCTCATTGCAGAACTTGCATCTGCAGCCGCAGCTGAAAGGCAGTTGGTGTTTGAAGAAGGTATAGAGGAAAGATTATGTGCATATTCTCG
GGCTGTGGCTCACTTCCCCACGGCAGTAAAAGAGGTAGATTGAAATCAAAAGGGAATTCTGGGAGCTCCCTCAATACCTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNATTACTTTTTTTTTTTTTTTTTTTTTTTTCCTTGTGCTAACAATCCCTTTACTTACACAGTTCAAATGGCGAAACGGGTGGTTCTATTCTCTCTCCGAGAAAGCCATTGCTGCTGGAAAACCCGACCCCTGCCCTCTGCATACCGCTTGGCTTGCGGAGCTGAAAGTTATCTAGGAATGACCCTGTATTGCTTGCTATCAAATGGATTTCCTGTTGCACAACGGTTGGCTTTAATCCAAATGAAGAACTCTCTTTTTGTTTTAAAAAGTAGATTTTGTACCTGTGTTCGTCATCTACAAATCACCTCCAGTTACTTGTTGAAACTGGGCCCAGCCTCTTTCAGTTGGCGATAGAGAAGGATTCATCATCTGTACCATCCTGAACTTCTGAACAGAAAAATCCATCAATTATGTTAAGAGAGATTTAATCTAACTTCTACAGAAGAAGACTCCTTGCGAACAGAAAAAAAAAAACATGAAATTTAACAATGCCAACATTAGAGCAATGTGGAAGGAAATTGCAACACTGTGGTATGGAGATTGTTCAATAGAGATATCACAAATGAAGACATTATAACATGCTTCCTCCTGTTCAAAGACATCACGATTGAAGTTCCTTTATATTCGAGACCAAGATTGAGAGTCCTATAGTTTTGGTTTTTTTTGGTTTACTGACTGATTTATCTTCCCTAGAAGGCAC
mRNA sequence
CACAGATCGAGTAGCTTATCTTCTTCTCCGCCACTCAAAGCCGCTCTCTGAAAACCCACACTGACCCTCTCTGCTTATACAAACCACACAACAATGTCACTTCAAGCTCACCTGGTTCATTTCCTACTCAGCCATGGCCGCCGCAATTTCCTTCACAATCTCTAGCTCTTCTCTCTGGTCTCGACTTTCTAAACCCGCTTCTAAATTTCGCGATCGCCGTCTGAGAATCATGGCCGCGGCTACCGGAACTAAGGTGGTACCCGCTGTGATAGTCGGAAGTGGGCGCGTTGGGAGGGCGCTGCTGGACATGGGAAATGGTGAAGATTTTTTGGTGAAGAGAGGCGAGTCCGTGCCCCTTGATTTTCCGGGTCCTATCCTTGTTTGTACCAGAAATGATGATCTTGAAGCTGTTCTTGAATCCACTCCTCGATCGAGATGGAACGATTTGGTTTTTTTCCAGAATGGAATGCTGGACCCTTGGTATGAAAGCAAAGGTCTAAATGATGTGAATCAAGTGTTAGCATATTTCGCTATCTCAAAGCTGGGAGAGGCTCCTGTGGATGGAATAACGGATACCAATCCTGAAGGACTGACAGCAGCGTATGGAAAATGGGCATCTGCCGTAGCTGGAAGGCTCAATGCTGCAGGCCTCTCCTGCAAGGTTCTCGATAAGGAAGCATTTGAGAAACAAATGTTGGAGAAGCTGATTTGGATTTCTGCATTTATGCTCGTTGGAGCACGTCATCCAGAACTTGCATCTGCAGCCGCAGCTGAAAGGCAGTTGGTGTTTGAAGAAGGTATAGAGGAAAGATTATGTGCATATTCTCGGGCTGTGGCTCACTTCCCCACGGCAGTAAAAGAGTTCAAATGGCGAAACGGGTGGTTCTATTCTCTCTCCGAGAAAGCCATTGCTGCTGGAAAACCCGACCCCTGCCCTCTGCATACCGCTTGGCTTGCGGAGCTGAAAGTTATCTAGGAATGACCCTGTATTGCTTGCTATCAAATGGATTTCCTGTTGCACAACGGTTGGCTTTAATCCAAATGAAGAACTCTCTTTTTGTTTTAAAAAGTAGATTTTGTACCTGTGTTCGTCATCTACAAATCACCTCCAGTTACTTGTTGAAACTGGGCCCAGCCTCTTTCAGTTGGCGATAGAGAAGGATTCATCATCTGTACCATCCTGAACTTCTGAACAGAAAAATCCATCAATTATGTTAAGAGAGATTTAATCTAACTTCTACAGAAGAAGACTCCTTGCGAACAGAAAAAAAAAAACATGAAATTTAACAATGCCAACATTAGAGCAATGTGGAAGGAAATTGCAACACTGTGGTATGGAGATTGTTCAATAGAGATATCACAAATGAAGACATTATAACATGCTTCCTCCTGTTCAAAGACATCACGATTGAAGTTCCTTTATATTCGAGACCAAGATTGAGAGTCCTATAGTTTTGGTTTTTTTTGGTTTACTGACTGATTTATCTTCCCTAGAAGGCAC
Coding sequence (CDS)
ATGGCCGCCGCAATTTCCTTCACAATCTCTAGCTCTTCTCTCTGGTCTCGACTTTCTAAACCCGCTTCTAAATTTCGCGATCGCCGTCTGAGAATCATGGCCGCGGCTACCGGAACTAAGGTGGTACCCGCTGTGATAGTCGGAAGTGGGCGCGTTGGGAGGGCGCTGCTGGACATGGGAAATGGTGAAGATTTTTTGGTGAAGAGAGGCGAGTCCGTGCCCCTTGATTTTCCGGGTCCTATCCTTGTTTGTACCAGAAATGATGATCTTGAAGCTGTTCTTGAATCCACTCCTCGATCGAGATGGAACGATTTGGTTTTTTTCCAGAATGGAATGCTGGACCCTTGGTATGAAAGCAAAGGTCTAAATGATGTGAATCAAGTGTTAGCATATTTCGCTATCTCAAAGCTGGGAGAGGCTCCTGTGGATGGAATAACGGATACCAATCCTGAAGGACTGACAGCAGCGTATGGAAAATGGGCATCTGCCGTAGCTGGAAGGCTCAATGCTGCAGGCCTCTCCTGCAAGGTTCTCGATAAGGAAGCATTTGAGAAACAAATGTTGGAGAAGCTGATTTGGATTTCTGCATTTATGCTCGTTGGAGCACGTCATCCAGAACTTGCATCTGCAGCCGCAGCTGAAAGGCAGTTGGTGTTTGAAGAAGGTATAGAGGAAAGATTATGTGCATATTCTCGGGCTGTGGCTCACTTCCCCACGGCAGTAAAAGAGTTCAAATGGCGAAACGGGTGGTTCTATTCTCTCTCCGAGAAAGCCATTGCTGCTGGAAAACCCGACCCCTGCCCTCTGCATACCGCTTGGCTTGCGGAGCTGAAAGTTATCTAG
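The relationship between the mRNA and CDS listed above can be sketched in Python: finding where the CDS begins inside the mRNA gives the length of the 5' UTR. This is a minimal illustration, not part of the database record. Only the leading portion of each sequence is copied here for brevity, and the names `MRNA_PREFIX`, `CDS_PREFIX`, and `five_prime_utr_length` are hypothetical.

```python
# Leading portion of the mRNA sequence above (truncated for brevity).
MRNA_PREFIX = (
    "CACAGATCGAGTAGCTTATCTTCTTCTCCGCCACTCAAAGCCGCTCTCTGAAAACCCACACTGACCCTCT"
    "CTGCTTATACAAACCACACAACAATGTCACTTCAAGCTCACCTGGTTCATTTCCTACTCAGCC"
    "ATGGCCGCCGCAATTTCCTTCACAATCTCT"
)

# First 30 nucleotides of the CDS above.
CDS_PREFIX = "ATGGCCGCCGCAATTTCCTTCACAATCTCT"


def five_prime_utr_length(mrna: str, cds: str) -> int:
    """Return the number of mRNA bases that precede the first occurrence of the CDS."""
    offset = mrna.find(cds)
    if offset < 0:
        raise ValueError("CDS not found in mRNA")
    return offset


utr_len = five_prime_utr_length(MRNA_PREFIX, CDS_PREFIX)
print("5' UTR length (nt):", utr_len)
print("CDS starts with:", MRNA_PREFIX[utr_len:utr_len + 3])
```

With the full-length sequences in place of the prefixes, the same search would also give the 3' UTR length as the number of mRNA bases remaining after the end of the CDS.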
Protein sequence
MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRSRWNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLVGARHPELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI
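As a quick consistency check between the CDS and the protein sequence above, the following Python sketch translates nucleotide triplets with the standard genetic code. It is a minimal illustration that assumes the standard code applies to this nuclear gene. The function name `translate` and the two short test fragments (the first 30 and the last 18 nucleotides of the CDS) were chosen for this example only.

```python
# Standard genetic code, built in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Codons containing ambiguous bases (for example N) are reported as X.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


CDS_START = "ATGGCCGCCGCAATTTCCTTCACAATCTCT"  # first 30 nt of the CDS
CDS_END = "GAGCTGAAAGTTATCTAG"                # last 18 nt of the CDS

print(translate(CDS_START))  # MAAAISFTIS
print(translate(CDS_END))    # ELKVI
```

Both fragments agree with the start (MAAAISFTIS) and end (ELKVI) of the protein sequence shown above; the final TAG codon is the stop codon and is not translated.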
Homology
BLAST of CmoCh17G008800 vs. ExPASy TrEMBL
Match:
A0A6J1GRW1 (uncharacterized protein LOC111456547 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111456547 PE=4 SV=1)
HSP 1 Score: 551.6 bits (1420), Expect = 1.9e-153
Identity = 280/300 (93.33%), Positives = 280/300 (93.33%), Query Frame = 0
Query: 1 MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG 60
MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG
Sbjct: 1 MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG 60
Query: 61 NGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRSRWNDLVFFQNGMLDPWYESK 120
NGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRSRWNDLVFFQNGMLDPWYESK
Sbjct: 61 NGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRSRWNDLVFFQNGMLDPWYESK 120
Query: 121 GLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDK 180
GLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDK
Sbjct: 121 GLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDK 180
Query: 181 EAFEKQMLEKLIWISAFMLVGARHP--------------------ELASAAAAERQLVFE 240
EAFEKQMLEKLIWISAFMLVGARHP ELASAAAAERQLVFE
Sbjct: 181 EAFEKQMLEKLIWISAFMLVGARHPGATVGAVEKDYRSEVSSLIAELASAAAAERQLVFE 240
Query: 241 EGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI 281
EGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI
Sbjct: 241 EGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI 300
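The identity figures reported for each HSP are a count of identical aligned positions divided by the alignment length, gaps included. The Python sketch below illustrates that arithmetic on the one gapped row of the alignment above (query positions 181 to 240). The function names `alignment_stats` and `percent` are hypothetical, and the gap run is written as `"-" * 20` to make its length explicit.

```python
def alignment_stats(query: str, sbjct: str):
    """Return (identities, alignment_length, gap_columns) for two aligned rows."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have equal length")
    identities = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    gaps = sum(1 for q, s in zip(query, sbjct) if q == "-" or s == "-")
    return identities, len(query), gaps


def percent(count: int, total: int) -> float:
    """Percentage rounded to two decimals, as shown in the BLAST summary lines."""
    return round(100.0 * count / total, 2)


# The gapped row of the alignment: the query lacks 20 residues present in the subject.
QUERY_ROW = "EAFEKQMLEKLIWISAFMLVGARHP" + "-" * 20 + "ELASAAAAERQLVFE"
SBJCT_ROW = "EAFEKQMLEKLIWISAFMLVGARHP" + "GATVGAVEKDYRSEVSSLIA" + "ELASAAAAERQLVFE"

print(alignment_stats(QUERY_ROW, SBJCT_ROW))  # (40, 60, 20)
print(percent(280, 300))                      # 93.33
```

For this hit the 20 gap columns account for the entire difference between the 280 identical positions and the alignment length of 300, which is why the identity is 93.33% even though every aligned residue pair matches.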
BLAST of CmoCh17G008800 vs. ExPASy TrEMBL
Match:
A0A6J1JW05 (uncharacterized protein LOC111488021 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111488021 PE=4 SV=1)
HSP 1 Score: 548.5 bits (1412), Expect = 1.6e-152
Identity = 278/300 (92.67%), Positives = 279/300 (93.00%), Query Frame = 0
Query: 1 MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG 60
MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG
Sbjct: 1 MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG 60
Query: 61 NGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRSRWNDLVFFQNGMLDPWYESK 120
NGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTP+SRWNDLVFFQNGMLDPWYESK
Sbjct: 61 NGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPQSRWNDLVFFQNGMLDPWYESK 120
Query: 121 GLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDK 180
GLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDK
Sbjct: 121 GLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDK 180
Query: 181 EAFEKQMLEKLIWISAFMLVGARHP--------------------ELASAAAAERQLVFE 240
EAFEKQMLEKLIWISAFMLVGARHP ELASAAAAERQLVFE
Sbjct: 181 EAFEKQMLEKLIWISAFMLVGARHPGATVGAVEKDYRSEVSSLIAELASAAAAERQLVFE 240
Query: 241 EGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI 281
EGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWL ELKVI
Sbjct: 241 EGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLGELKVI 300
BLAST of CmoCh17G008800 vs. ExPASy TrEMBL
Match:
A0A6J1DKB2 (uncharacterized protein LOC111021267 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111021267 PE=4 SV=1)
HSP 1 Score: 517.7 bits (1332), Expect = 3.0e-143
Identity = 264/300 (88.00%), Positives = 269/300 (89.67%), Query Frame = 0
Query: 1 MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG 60
MAAAISFTISS SL +LSKP SKFR RRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG
Sbjct: 1 MAAAISFTISSPSLRFQLSKPTSKFRVRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG 60
Query: 61 NGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRSRWNDLVFFQNGMLDPWYESK 120
NGEDFLVKRGESVPLDF GPILVCTRNDDLEAVLE+TPRSRWNDLVFFQNGMLDPWYESK
Sbjct: 61 NGEDFLVKRGESVPLDFSGPILVCTRNDDLEAVLEATPRSRWNDLVFFQNGMLDPWYESK 120
Query: 121 GLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDK 180
GL D NQ+LAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVA RLNAAGLSCKVLDK
Sbjct: 121 GLEDANQLLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVATRLNAAGLSCKVLDK 180
Query: 181 EAFEKQMLEKLIWISAFMLVGARHP--------------------ELASAAAAERQLVFE 240
EAFEKQMLEKLIWISAFMLVGARHP ELA AAAAERQLVFE
Sbjct: 181 EAFEKQMLEKLIWISAFMLVGARHPGATVGAVEKDYRSEVSSLIAELACAAAAERQLVFE 240
Query: 241 EGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI 281
+GIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWL ELKV+
Sbjct: 241 DGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLKELKVV 300
BLAST of CmoCh17G008800 vs. ExPASy TrEMBL
Match:
A0A5D3CGL0 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold459G001500 PE=4 SV=1)
HSP 1 Score: 515.8 bits (1327), Expect = 1.2e-142
Identity = 263/300 (87.67%), Positives = 268/300 (89.33%), Query Frame = 0
Query: 1 MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG 60
MAAAISFTIS+ SL S+LSKP S+F RRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG
Sbjct: 1 MAAAISFTISNPSLCSQLSKPVSRFGARRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG 60
Query: 61 NGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRSRWNDLVFFQNGMLDPWYESK 120
NGED LVKRGESVPLDF GPILVCTRNDDLEAVLE+TPRSRWNDLVFFQNGMLDPWYESK
Sbjct: 61 NGEDVLVKRGESVPLDFSGPILVCTRNDDLEAVLEATPRSRWNDLVFFQNGMLDPWYESK 120
Query: 121 GLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDK 180
GL D NQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVL K
Sbjct: 121 GLKDANQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLGK 180
Query: 181 EAFEKQMLEKLIWISAFMLVGARHP--------------------ELASAAAAERQLVFE 240
EAFEKQMLEKLIWISAFMLVGARHP ELA AAAAERQLVFE
Sbjct: 181 EAFEKQMLEKLIWISAFMLVGARHPGATVGAVEKDYRSEVSSLIAELACAAAAERQLVFE 240
Query: 241 EGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI 281
EGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWL ELKV+
Sbjct: 241 EGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLKELKVV 300
BLAST of CmoCh17G008800 vs. ExPASy TrEMBL
Match:
A0A1S3CDP5 (uncharacterized protein LOC103499856 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103499856 PE=4 SV=1)
HSP 1 Score: 515.8 bits (1327), Expect = 1.2e-142
Identity = 263/300 (87.67%), Positives = 268/300 (89.33%), Query Frame = 0
Query: 1 MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG 60
MAAAISFTIS+ SL S+LSKP S+F RRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG
Sbjct: 1 MAAAISFTISNPSLCSQLSKPVSRFGARRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG 60
Query: 61 NGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRSRWNDLVFFQNGMLDPWYESK 120
NGED LVKRGESVPLDF GPILVCTRNDDLEAVLE+TPRSRWNDLVFFQNGMLDPWYESK
Sbjct: 61 NGEDVLVKRGESVPLDFSGPILVCTRNDDLEAVLEATPRSRWNDLVFFQNGMLDPWYESK 120
Query: 121 GLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDK 180
GL D NQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVL K
Sbjct: 121 GLKDANQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLGK 180
Query: 181 EAFEKQMLEKLIWISAFMLVGARHP--------------------ELASAAAAERQLVFE 240
EAFEKQMLEKLIWISAFMLVGARHP ELA AAAAERQLVFE
Sbjct: 181 EAFEKQMLEKLIWISAFMLVGARHPGATVGAVEKDYRSEVSSLIAELACAAAAERQLVFE 240
Query: 241 EGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI 281
EGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWL ELKV+
Sbjct: 241 EGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLKELKVV 300
BLAST of CmoCh17G008800 vs. NCBI nr
Match:
KAG6575639.1 (hypothetical protein SDJN03_26278, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 563.5 bits (1451), Expect = 1.0e-156
Identity = 280/280 (100.00%), Positives = 280/280 (100.00%), Query Frame = 0
Query: 1 MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG 60
MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG
Sbjct: 1 MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG 60
Query: 61 NGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRSRWNDLVFFQNGMLDPWYESK 120
NGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRSRWNDLVFFQNGMLDPWYESK
Sbjct: 61 NGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRSRWNDLVFFQNGMLDPWYESK 120
Query: 121 GLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDK 180
GLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDK
Sbjct: 121 GLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDK 180
Query: 181 EAFEKQMLEKLIWISAFMLVGARHPELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTA 240
EAFEKQMLEKLIWISAFMLVGARHPELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTA
Sbjct: 181 EAFEKQMLEKLIWISAFMLVGARHPELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTA 240
Query: 241 VKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI 281
VKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI
Sbjct: 241 VKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI 280
BLAST of CmoCh17G008800 vs. NCBI nr
Match:
XP_022954230.1 (uncharacterized protein LOC111456547 isoform X1 [Cucurbita moschata])
HSP 1 Score: 551.6 bits (1420), Expect = 3.9e-153
Identity = 280/300 (93.33%), Positives = 280/300 (93.33%), Query Frame = 0
Query: 1 MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG 60
MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG
Sbjct: 1 MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG 60
Query: 61 NGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRSRWNDLVFFQNGMLDPWYESK 120
NGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRSRWNDLVFFQNGMLDPWYESK
Sbjct: 61 NGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRSRWNDLVFFQNGMLDPWYESK 120
Query: 121 GLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDK 180
GLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDK
Sbjct: 121 GLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDK 180
Query: 181 EAFEKQMLEKLIWISAFMLVGARHP--------------------ELASAAAAERQLVFE 240
EAFEKQMLEKLIWISAFMLVGARHP ELASAAAAERQLVFE
Sbjct: 181 EAFEKQMLEKLIWISAFMLVGARHPGATVGAVEKDYRSEVSSLIAELASAAAAERQLVFE 240
Query: 241 EGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI 281
EGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI
Sbjct: 241 EGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI 300
BLAST of CmoCh17G008800 vs. NCBI nr
Match:
XP_022991363.1 (uncharacterized protein LOC111488021 isoform X1 [Cucurbita maxima])
HSP 1 Score: 548.5 bits (1412), Expect = 3.3e-152
Identity = 278/300 (92.67%), Positives = 279/300 (93.00%), Query Frame = 0
Query: 1 MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG 60
MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG
Sbjct: 1 MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG 60
Query: 61 NGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRSRWNDLVFFQNGMLDPWYESK 120
NGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTP+SRWNDLVFFQNGMLDPWYESK
Sbjct: 61 NGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPQSRWNDLVFFQNGMLDPWYESK 120
Query: 121 GLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDK 180
GLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDK
Sbjct: 121 GLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDK 180
Query: 181 EAFEKQMLEKLIWISAFMLVGARHP--------------------ELASAAAAERQLVFE 240
EAFEKQMLEKLIWISAFMLVGARHP ELASAAAAERQLVFE
Sbjct: 181 EAFEKQMLEKLIWISAFMLVGARHPGATVGAVEKDYRSEVSSLIAELASAAAAERQLVFE 240
Query: 241 EGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI 281
EGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWL ELKVI
Sbjct: 241 EGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLGELKVI 300
BLAST of CmoCh17G008800 vs. NCBI nr
Match:
XP_023547746.1 (uncharacterized protein LOC111806603 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 548.1 bits (1411), Expect = 4.3e-152
Identity = 278/300 (92.67%), Positives = 278/300 (92.67%), Query Frame = 0
Query: 1 MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG 60
MAAAISFTISSSSLWSRLSKPASKFRD RLRIMAAATGTKVVPAVIVGSGRVGRALLDMG
Sbjct: 47 MAAAISFTISSSSLWSRLSKPASKFRDHRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG 106
Query: 61 NGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRSRWNDLVFFQNGMLDPWYESK 120
NGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRSRWNDLVFFQNGMLDPWYESK
Sbjct: 107 NGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRSRWNDLVFFQNGMLDPWYESK 166
Query: 121 GLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDK 180
GLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDK
Sbjct: 167 GLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDK 226
Query: 181 EAFEKQMLEKLIWISAFMLVGARHP--------------------ELASAAAAERQLVFE 240
EAFEKQMLEKLIWISAFMLVGARHP ELASAAAAERQLVFE
Sbjct: 227 EAFEKQMLEKLIWISAFMLVGARHPGATVGAVEKDYRSEVSSLIAELASAAAAERQLVFE 286
Query: 241 EGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI 281
EGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIA GKPDPCPLHTAWLAELKVI
Sbjct: 287 EGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAGGKPDPCPLHTAWLAELKVI 346
BLAST of CmoCh17G008800 vs. NCBI nr
Match:
KAG7014193.1 (hypothetical protein SDJN02_24367 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 547.0 bits (1408), Expect = 9.7e-152
Identity = 280/301 (93.02%), Positives = 280/301 (93.02%), Query Frame = 0
Query: 1 MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG 60
MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG
Sbjct: 1 MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMG 60
Query: 61 NGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRSRWN-DLVFFQNGMLDPWYES 120
NGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRSRWN DLVFFQNGMLDPWYES
Sbjct: 61 NGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRSRWNADLVFFQNGMLDPWYES 120
Query: 121 KGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLD 180
KGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLD
Sbjct: 121 KGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLD 180
Query: 181 KEAFEKQMLEKLIWISAFMLVGARHP--------------------ELASAAAAERQLVF 240
KEAFEKQMLEKLIWISAFMLVGARHP ELASAAAAERQLVF
Sbjct: 181 KEAFEKQMLEKLIWISAFMLVGARHPGATVGAVEKDYRSEVSSLIAELASAAAAERQLVF 240
Query: 241 EEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKV 281
EEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKV
Sbjct: 241 EEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKV 300
BLAST of CmoCh17G008800 vs. TAIR 10
Match:
AT1G16080.1 (unknown protein; LOCATED IN: apoplast, chloroplast stroma, chloroplast, chloroplast envelope; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 14 growth stages; Has 81 Blast hits to 81 proteins in 28 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants - 62; Viruses - 0; Other Eukaryotes - 17 (source: NCBI BLink). )
HSP 1 Score: 414.8 bits (1065), Expect = 5.3e-116
Identity = 210/299 (70.23%), Positives = 235/299 (78.60%), Query Frame = 0
Query: 2 AAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGN 61
+A SF S SL +S+ A + +AAT K+ PAVIVG GRVGRAL +MGN
Sbjct: 16 SARFSFLRRSESLKPSVSR-ARFAVPMAMAAASAATAKKLAPAVIVGGGRVGRALQEMGN 75
Query: 62 GEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRSRWNDLVFFQNGMLDPWYESKG 121
GED LVKRGE+VP+DF GPILVCTRNDDL+AVLE+TP+SRW DLVFFQNGM++PW+ESKG
Sbjct: 76 GEDLLVKRGEAVPVDFEGPILVCTRNDDLDAVLEATPQSRWKDLVFFQNGMMEPWFESKG 135
Query: 122 LNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKE 181
L D +QVLAYFA+SKLGE PVDG TDTNPEGLTAAYGKWAS +A RL + GLSCKVLDKE
Sbjct: 136 LGDTDQVLAYFAVSKLGEPPVDGKTDTNPEGLTAAYGKWASEIAARLQSGGLSCKVLDKE 195
Query: 182 AFEKQMLEKLIWISAFMLVGARHP--------------------ELASAAAAERQLVFEE 241
AF+KQMLEKLIWI AFMLVGARHP ELA+AAAAE+ L FEE
Sbjct: 196 AFQKQMLEKLIWICAFMLVGARHPGASVGTVEKEYRDEVSRLIQELAAAAAAEKGLTFEE 255
Query: 242 GIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI 281
+ ERLCAYSRAV+HFPTAVKEFKWRNGWFYSLSEKAIA G+PDPCPLHT WL ELKVI
Sbjct: 256 NMVERLCAYSRAVSHFPTAVKEFKWRNGWFYSLSEKAIAEGQPDPCPLHTEWLKELKVI 313
The following BLAST results are available for this feature:
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1GRW1 | 1.9e-153 | 93.33 | uncharacterized protein LOC111456547 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1JW05 | 1.6e-152 | 92.67 | uncharacterized protein LOC111488021 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1DKB2 | 3.0e-143 | 88.00 | uncharacterized protein LOC111021267 isoform X1 OS=Momordica charantia OX=3673 G...
A0A5D3CGL0 | 1.2e-142 | 87.67 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3CDP5 | 1.2e-142 | 87.67 | uncharacterized protein LOC103499856 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
NCBI nr
Match Name | E-value | Identity (%) | Description
KAG6575639.1 | 1.0e-156 | 100.00 | hypothetical protein SDJN03_26278, partial [Cucurbita argyrosperma subsp. sorori...
XP_022954230.1 | 3.9e-153 | 93.33 | uncharacterized protein LOC111456547 isoform X1 [Cucurbita moschata]
XP_022991363.1 | 3.3e-152 | 92.67 | uncharacterized protein LOC111488021 isoform X1 [Cucurbita maxima]
XP_023547746.1 | 4.3e-152 | 92.67 | uncharacterized protein LOC111806603 isoform X1 [Cucurbita pepo subsp. pepo]
KAG7014193.1 | 9.7e-152 | 93.02 | hypothetical protein SDJN02_24367 [Cucurbita argyrosperma subsp. argyrosperma]
TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G16080.1 | 5.3e-116 | 70.23 | unknown protein; LOCATED IN: apoplast, chloroplast stroma, chloroplast, chloropl...