Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, CDS, polypeptide, three_prime_UTR
CACCAAAATCCTCCCACCCTCCCCCCTCCTTAAATATTTTCGCCGACTTTCCATTCACGATGCACATTTGAATCTCTCAGATTTCGAGATTTCTCTCTCCCTTCTTTTCAAAATCTTCACCCAATCTTTTCCATGGCGCCGCCGTCACACAGGTCGTCTTCTCCGTCGATGGTCGCCGGGAGAGCAAGCCCTAATTCCAGAAATTCCGAAATCGTTAACCCTACCCGCCGGAGCTTCTCCAGCGAACCGAGGAGCTTGAACTTTAACACTCCGACGAACAGTCCCTCTGGTTTGTTTGAGTTTCTTTACCTGATTAATTTCTGAGTTTTAGATGGGTTTGAATCTGTTTAGTTGCTGAGACAGTAACAGAGAAAAGTTTTTAGGGTTTATAAGTCGAATAGTCTCTGTTTTTTTTTTTTTTTTTTTTTTTGCCGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTGCCGAGAAAATGGCAGAGAAAATTTGTTTGGGGTACAAAATTGAAAGAATCTCTTCTAATAATTAGCATCTTAGCTTTTATTTTCGCTCTGATTTTGATGGTTATTTTGATTCTTCCACCGATTGTTTTCCTTCCTGGGATGATTTTGAACATTGGGAATGTTTTTGAGTGGTTCGAATCTGTTTGGTTGTCGAGAAAGTAGCAGAGAAAATTTAAGCGTAAACAAAAGGTCTACTTATTAGCTTCTTTCATTTCTTTGCTACGATTTCAATGGCTATTTTGTTTCTTCCGCTGATTATTTTCCGTTCTGTTCCTTCTAAATCGTAAATTGCTTCCTTTCTGAACGTTTCTTTGAATGGTTTTTCAGATTATCCACGAAGGAATTCTACCAGCAGAGAAAATTTGTTTTATTCTCGTGACAATGAGGAGAAAGAAAATGGGAAAAATCAGAGTCCAAAACCTGTTCGAATCCGTTCGCCGGCGGCCGGAAAATCGACGAAGCAC
TTTATGTCTCCGACGATCTCTGCCGCCTCCAAGATTTCTGTCTCTCCGAAAAAGAAAATTCTGGGTGATCGGAACGAGCTAGTTCGGTCGTCTCTTTCATTTTCCGGCTTGAAAAGCTCTTCACTCAACTCGGTGAATCCAAATCCGGAGGCATCAGCGGCACTTGAATCCGATACGAATCAGGAAATTGCTCCGATTTCAAATCCCAAGAAATCCACATACCGATACGATACAGAAGTAGCACCAGTGGCAGTTGAAACCGATACGAAGTCGGAAACCGCCCCGATTTCAAAATCCACCATTGCAGCAGCACCTCTCAGAGCTTCCAAAACAGTGAAATCCGGTGGTTTGGATGTCATTTCTGATTCGCATTCAAATTCTGAGGTGGTAACAATGGCAGTTGAAACCGACGCGAAGCTTGAAATTACTCGGATTTCAAATTCTGCCATTGCTGCATTACCTCCCAAAGCTTCAGAGGCCGTGGAATTTGCTGATGTCGAGGTAAGCTCTGACCCAAACAACGACTCAGAGTCTCCGGCTAAGACTGTTGATCTCAACTCAAGTTTTAAGGACAGTCTTGTTTCTTCTTCAATGGAAATAGCACCGCTCGATGCCGATCCATTAATGCCGCGTCCTTATGATCCCAAAACCAATTACCTATCCCCAAGGCCACAGTTCCTCCATTACAAACCAAGAAGAATCAATCAACTCGAACTAGACGGCAAACTTGAGGAACTGTTTTCCTCTGAATCTGAATTCAGTGAGGGAACTGACTCGGAAGATCCACAGATGGAATCTGATGAAGCTTCTTCCAATGAATCGCATATGAAAGAAGAAGAAAGGGAAGAGGGGGAGGAGGAGGAAGAAGTGATTGTTAACGTTTCTGAACAAAGCCCCGTTGAAGCTAAGACTTCATCTAAGCTGCACTTTTCCAGGATATTCAAGATCAGTTCTCTGCTTTTGATTCTGTTGACTGCTTGCTTTTCGATTTGTGTTGTTAATGTTCATGATCTCGAAAGAGCGAGCTTGTGGTTACCAATGGAGGATTCAACAGAAGTTTTCGAGTTTGCAAAAACGAATTTCAATGTGCTGGTGCGGAAATTTGAGGTTTGGCATGCTAGTTCCAGACCTTACATTTCTGATATGGTTTTCAACATCGGAGGGCGGCGGCCATTGATTTATCTTAACCAAACTGGATTCTTACACAAAGATGTCAATTCGGAAGAGCAGTGCCTTGTATTATCTCATCAGACCTCGTGGGAAGAAGAAAATGATCTGAACGTAATGGAAGAAGCGAGGAAGGAGGGAGAAATAGACATTGTTGAAGAACATATCGTGAGAGGAGATCATAATGAAGAAGAAGAAGAACTATTATTGGAAGAGATTGAAGCCATGAAGGAGAGAGAAATTGTCATCGAACACGTTGAAGGAGAAGTTCAGAATGAAGAAGAATCGTTTCAAGAGATTGAAGCCGATGCAAATGATTCAAAAGACGGTGAAGAAGAGAACGGCCAGGCTTCAGCAAATTCAGCTTCTGAAGAACCATTGCAAGAGACTGAAGAAGGATCATTGCAAGAGATTATTGAAGAGACCTCTACAAAATCAGCTTCTGATAAACTCAATGAAGAAGACGAAATCCAGGAGAAGCAAACTGAAGAGAATTACGAAGATTCCTCATCACCAGATTTTATTCACGATCAAATTGAACAAGAAGCTGCAACAGGAGGCGAGACAAAAGAAGAACAACAAAACGATTCAATTCAACAAAGAAACGCAGAAATTCAACGGCAATCACCTCCAGTTTCTCCTCCTTCTGCACCTCAATCTGACGCTGAAGACGAAAATGACAGCAACATCGATCTCGTTGGAACTGCAACCAATAACAGAATCTCAAGAGATTTCTCACAGAACACTGCAGTTATAGCATCTGCAATACTGCTGGGTTTATCTATGATCATACCTGCAGGACTGATTTATGCGAGAAAATCAGGCTCAAAAAC
ATCATCCATGGCGGCCATTGCTGAAGCGCAAGAGGAACCGCCATTGCTGAAAGAGAAGAAGACATACCAGAGTCCGGCGGTACCAGAAGAAGAAGAAGCGATTAATGATGATGATCGTGAGGATATCGCCAGAAAAGGATTTTGCTCTTCTGAAACGAGCAGTTTCCTGCAATACAGCAGCATGAAAGAAGAAGAAAAAGCGATTAATGATGACGATCGCGAGGATATCGCCATAAAAGGATTTTGCTCTTCTGAAATGAGCAGTTTCCTGCAATACAGCAGCATGAAAGAAGAAGACGAAACAGAAACATCTAAGATATTGATAGAAGCTCAAAACCATAGCCATGGGAGGAAGACGAGGAAGAATTCAAGAACTCCAATGGCTTCTTCTTCTTTGGATGAATTTTCAGTGTCCACTTCGTCGGCTTCTCCATCGTATGGGAGCTTCACAACTTATGAGAAGATCCCAATCAAGCATGTGAGTATCCCTATTTTCAAATATTATATAATTTCGTTTTTTTCACTTAATAAACACTTTTTAAAGCCCTTTAAATTATTTTAAACAGCTTTTATTCATTTTTACTCCTGATTTTAAATTAAATTTGATTGCTTTTTAAAGCCCTTTAAATTATTTTAAACAGCTTTTATTCATTTTTACTCCTGATTTTAAATTAAATTTGATTTGTAAAAATGTTAAATTATATATATATGAAAAAAGTGTTAAATTTTAAAAATAAATAAAAACCCAGAAATAAAAAAAAATTTAACGTTAAAATTTCGTTTTTTTTAAGATCAGTTTATTTTCATTCAAAATATATATACTTTATTTTCTTTTTTTTTTTAAACAAATTAAAATAACAAATAAAAAAAATTAGAACTATTTCATGACCATTTTCGAACCTTTTACAGGGAAACGAAGAGGAAGAGATTGTGACTCCAGTAAGACGTTCAAGTAGAATTAGAAAGACCGCAGCACATCGACAGTCGTTTGCTGAATGAAATGAGCTCACATTTTATGTGTGTGGAATGAAATGAGCTCCAGTAGGCTTAGATGAACGACATCCAAGTAGAATTAGAAAGATGACACATCAATATCCAATTTATGTGTGTATCGAATGAAACGAGTTCCTAGTAGAATTAGAAAAATGACGATCAATAGCCAGTTTACTAGAAAAATAGCACATTAATAGCGGGTTTATGTGTACTGAACAAAATGTGATTCTAGTTGATGAAGCTCAC
mRNA sequence
CACCAAAATCCTCCCACCCTCCCCCCTCCTTAAATATTTTCGCCGACTTTCCATTCACGATGCACATTTGAATCTCTCAGATTTCGAGATTTCTCTCTCCCTTCTTTTCAAAATCTTCACCCAATCTTTTCCATGGCGCCGCCGTCACACAGGTCGTCTTCTCCGTCGATGGTCGCCGGGAGAGCAAGCCCTAATTCCAGAAATTCCGAAATCGTTAACCCTACCCGCCGGAGCTTCTCCAGCGAACCGAGGAGCTTGAACTTTAACACTCCGACGAACAGTCCCTCTGATTATCCACGAAGGAATTCTACCAGCAGAGAAAATTTGTTTTATTCTCGTGACAATGAGGAGAAAGAAAATGGGAAAAATCAGAGTCCAAAACCTGTTCGAATCCGTTCGCCGGCGGCCGGAAAATCGACGAAGCACTTTATGTCTCCGACGATCTCTGCCGCCTCCAAGATTTCTGTCTCTCCGAAAAAGAAAATTCTGGGTGATCGGAACGAGCTAGTTCGGTCGTCTCTTTCATTTTCCGGCTTGAAAAGCTCTTCACTCAACTCGGTGAATCCAAATCCGGAGGCATCAGCGGCACTTGAATCCGATACGAATCAGGAAATTGCTCCGATTTCAAATCCCAAGAAATCCACATACCGATACGATACAGAAGTAGCACCAGTGGCAGTTGAAACCGATACGAAGTCGGAAACCGCCCCGATTTCAAAATCCACCATTGCAGCAGCACCTCTCAGAGCTTCCAAAACAGTGAAATCCGGTGGTTTGGATGTCATTTCTGATTCGCATTCAAATTCTGAGGTGGTAACAATGGCAGTTGAAACCGACGCGAAGCTTGAAATTACTCGGATTTCAAATTCTGCCATTGCTGCATTACCTCCCAAAGCTTCAGAGGCCGTGGAATTTGCTGATGTCGAGGTAAGCTCTGACCCAAACAACGACTCAGAGTCTCCGGCTAAGACTGTTGATCTCAACTCAAGTTTTAAGGACAGTCTTGTTTCTTCTTCAATGGAAATAGCACCGCTCGATGCCGATCCATTAATGCCGCGTCCTTATGATCCCAAAACCAATTACCTATCCCCAAGGCCACAGTTCCTCCATTACAAACCAAGAAGAATCAATCAACTCGAACTAGACGGCAAACTTGAGGAACTGTTTTCCTCTGAATCTGAATTCAGTGAGGGAACTGACTCGGAAGATCCACAGATGGAATCTGATGAAGCTTCTTCCAATGAATCGCATATGAAAGAAGAAGAAAGGGAAGAGGGGGAGGAGGAGGAAGAAGTGATTGTTAACGTTTCTGAACAAAGCCCCGTTGAAGCTAAGACTTCATCTAAGCTGCACTTTTCCAGGATATTCAAGATCAGTTCTCTGCTTTTGATTCTGTTGACTGCTTGCTTTTCGATTTGTGTTGTTAATGTTCATGATCTCGAAAGAGCGAGCTTGTGGTTACCAATGGAGGATTCAACAGAAGTTTTCGAGTTTGCAAAAACGAATTTCAATGTGCTGGTGCGGAAATTTGAGGTTTGGCATGCTAGTTCCAGACCTTACATTTCTGATATGGTTTTCAACATCGGAGGGCGGCGGCCATTGATTTATCTTAACCAAACTGGATTCTTACACAAAGATGTCAATTCGGAAGAGCAGTGCCTTGTATTATCTCATCAGACCTCGTGGGAAGAAGAAAATGATCTGAACGTAATGGAAGAAGCGAGGAAGGAGGGAGAAATAGACATTGTTGAAGAACATATCGTGAGAGGAGATCATAATGAAGAAGAAGAAGAACTATTATTGGAAGAGATTGAAGCCATGAAGGAGAGAGAAATTGTCATCGAACACGTTGAAGGAGAAGTTCAGAATGAAGAAGAATCGTTTCAAGAGATTGAAGCCGATGCAAATGATTCAAAAGACGGTGAAGAAGAGAACGGCCAGGCTTCAGCAAATTCAGCTTCTGAAGAACCATTGCAAGAGACTGAAGAAGGATCATTGCA
AGAGATTATTGAAGAGACCTCTACAAAATCAGCTTCTGATAAACTCAATGAAGAAGACGAAATCCAGGAGAAGCAAACTGAAGAGAATTACGAAGATTCCTCATCACCAGATTTTATTCACGATCAAATTGAACAAGAAGCTGCAACAGGAGGCGAGACAAAAGAAGAACAACAAAACGATTCAATTCAACAAAGAAACGCAGAAATTCAACGGCAATCACCTCCAGTTTCTCCTCCTTCTGCACCTCAATCTGACGCTGAAGACGAAAATGACAGCAACATCGATCTCGTTGGAACTGCAACCAATAACAGAATCTCAAGAGATTTCTCACAGAACACTGCAGTTATAGCATCTGCAATACTGCTGGGTTTATCTATGATCATACCTGCAGGACTGATTTATGCGAGAAAATCAGGCTCAAAAACATCATCCATGGCGGCCATTGCTGAAGCGCAAGAGGAACCGCCATTGCTGAAAGAGAAGAAGACATACCAGAGTCCGGCGGTACCAGAAGAAGAAGAAGCGATTAATGATGATGATCGTGAGGATATCGCCAGAAAAGGATTTTGCTCTTCTGAAACGAGCAGTTTCCTGCAATACAGCAGCATGAAAGAAGAAGAAAAAGCGATTAATGATGACGATCGCGAGGATATCGCCATAAAAGGATTTTGCTCTTCTGAAATGAGCAGTTTCCTGCAATACAGCAGCATGAAAGAAGAAGACGAAACAGAAACATCTAAGATATTGATAGAAGCTCAAAACCATAGCCATGGGAGGAAGACGAGGAAGAATTCAAGAACTCCAATGGCTTCTTCTTCTTTGGATGAATTTTCAGTGTCCACTTCGTCGGCTTCTCCATCGTATGGGAGCTTCACAACTTATGAGAAGATCCCAATCAAGCATGGAAACGAAGAGGAAGAGATTGTGACTCCAGTAAGACGTTCAAGTAGAATTAGAAAGACCGCAGCACATCGACAGTCGTTTGCTGAATGAAATGAGCTCACATTTTATGTGTGTGGAATGAAATGAGCTCCAGTAGGCTTAGATGAACGACATCCAAGTAGAATTAGAAAGATGACACATCAATATCCAATTTATGTGTGTATCGAATGAAACGAGTTCCTAGTAGAATTAGAAAAATGACGATCAATAGCCAGTTTACTAGAAAAATAGCACATTAATAGCGGGTTTATGTGTACTGAACAAAATGTGATTCTAGTTGATGAAGCTCAC
Coding sequence (CDS)
ATGGCGCCGCCGTCACACAGGTCGTCTTCTCCGTCGATGGTCGCCGGGAGAGCAAGCCCTAATTCCAGAAATTCCGAAATCGTTAACCCTACCCGCCGGAGCTTCTCCAGCGAACCGAGGAGCTTGAACTTTAACACTCCGACGAACAGTCCCTCTGATTATCCACGAAGGAATTCTACCAGCAGAGAAAATTTGTTTTATTCTCGTGACAATGAGGAGAAAGAAAATGGGAAAAATCAGAGTCCAAAACCTGTTCGAATCCGTTCGCCGGCGGCCGGAAAATCGACGAAGCACTTTATGTCTCCGACGATCTCTGCCGCCTCCAAGATTTCTGTCTCTCCGAAAAAGAAAATTCTGGGTGATCGGAACGAGCTAGTTCGGTCGTCTCTTTCATTTTCCGGCTTGAAAAGCTCTTCACTCAACTCGGTGAATCCAAATCCGGAGGCATCAGCGGCACTTGAATCCGATACGAATCAGGAAATTGCTCCGATTTCAAATCCCAAGAAATCCACATACCGATACGATACAGAAGTAGCACCAGTGGCAGTTGAAACCGATACGAAGTCGGAAACCGCCCCGATTTCAAAATCCACCATTGCAGCAGCACCTCTCAGAGCTTCCAAAACAGTGAAATCCGGTGGTTTGGATGTCATTTCTGATTCGCATTCAAATTCTGAGGTGGTAACAATGGCAGTTGAAACCGACGCGAAGCTTGAAATTACTCGGATTTCAAATTCTGCCATTGCTGCATTACCTCCCAAAGCTTCAGAGGCCGTGGAATTTGCTGATGTCGAGGTAAGCTCTGACCCAAACAACGACTCAGAGTCTCCGGCTAAGACTGTTGATCTCAACTCAAGTTTTAAGGACAGTCTTGTTTCTTCTTCAATGGAAATAGCACCGCTCGATGCCGATCCATTAATGCCGCGTCCTTATGATCCCAAAACCAATTACCTATCCCCAAGGCCACAGTTCCTCCATTACAAACCAAGAAGAATCAATCAACTCGAACTAGACGGCAAACTTGAGGAACTGTTTTCCTCTGAATCTGAATTCAGTGAGGGAACTGACTCGGAAGATCCACAGATGGAATCTGATGAAGCTTCTTCCAATGAATCGCATATGAAAGAAGAAGAAAGGGAAGAGGGGGAGGAGGAGGAAGAAGTGATTGTTAACGTTTCTGAACAAAGCCCCGTTGAAGCTAAGACTTCATCTAAGCTGCACTTTTCCAGGATATTCAAGATCAGTTCTCTGCTTTTGATTCTGTTGACTGCTTGCTTTTCGATTTGTGTTGTTAATGTTCATGATCTCGAAAGAGCGAGCTTGTGGTTACCAATGGAGGATTCAACAGAAGTTTTCGAGTTTGCAAAAACGAATTTCAATGTGCTGGTGCGGAAATTTGAGGTTTGGCATGCTAGTTCCAGACCTTACATTTCTGATATGGTTTTCAACATCGGAGGGCGGCGGCCATTGATTTATCTTAACCAAACTGGATTCTTACACAAAGATGTCAATTCGGAAGAGCAGTGCCTTGTATTATCTCATCAGACCTCGTGGGAAGAAGAAAATGATCTGAACGTAATGGAAGAAGCGAGGAAGGAGGGAGAAATAGACATTGTTGAAGAACATATCGTGAGAGGAGATCATAATGAAGAAGAAGAAGAACTATTATTGGAAGAGATTGAAGCCATGAAGGAGAGAGAAATTGTCATCGAACACGTTGAAGGAGAAGTTCAGAATGAAGAAGAATCGTTTCAAGAGATTGAAGCCGATGCAAATGATTCAAAAGACGGTGAAGAAGAGAACGGCCAGGCTTCAGCAAATTCAGCTTCTGAAGAACCATTGCAAGAGACTGAAGAAGGATCATTGCAAGAGATTATTGAAGAGACCTCTACAAAATCAGCTTCTGATAAACTCAATGAAGAAGACGAAATCCAGGAGAAGCAAACTGAAGAGAATTACGAAGATTCCTCATCACCAGATTTTATTCACGATCAAATTGA
ACAAGAAGCTGCAACAGGAGGCGAGACAAAAGAAGAACAACAAAACGATTCAATTCAACAAAGAAACGCAGAAATTCAACGGCAATCACCTCCAGTTTCTCCTCCTTCTGCACCTCAATCTGACGCTGAAGACGAAAATGACAGCAACATCGATCTCGTTGGAACTGCAACCAATAACAGAATCTCAAGAGATTTCTCACAGAACACTGCAGTTATAGCATCTGCAATACTGCTGGGTTTATCTATGATCATACCTGCAGGACTGATTTATGCGAGAAAATCAGGCTCAAAAACATCATCCATGGCGGCCATTGCTGAAGCGCAAGAGGAACCGCCATTGCTGAAAGAGAAGAAGACATACCAGAGTCCGGCGGTACCAGAAGAAGAAGAAGCGATTAATGATGATGATCGTGAGGATATCGCCAGAAAAGGATTTTGCTCTTCTGAAACGAGCAGTTTCCTGCAATACAGCAGCATGAAAGAAGAAGAAAAAGCGATTAATGATGACGATCGCGAGGATATCGCCATAAAAGGATTTTGCTCTTCTGAAATGAGCAGTTTCCTGCAATACAGCAGCATGAAAGAAGAAGACGAAACAGAAACATCTAAGATATTGATAGAAGCTCAAAACCATAGCCATGGGAGGAAGACGAGGAAGAATTCAAGAACTCCAATGGCTTCTTCTTCTTTGGATGAATTTTCAGTGTCCACTTCGTCGGCTTCTCCATCGTATGGGAGCTTCACAACTTATGAGAAGATCCCAATCAAGCATGGAAACGAAGAGGAAGAGATTGTGACTCCAGTAAGACGTTCAAGTAGAATTAGAAAGACCGCAGCACATCGACAGTCGTTTGCTGAATGA
Protein sequence
MAPPSHRSSSPSMVAGRASPNSRNSEIVNPTRRSFSSEPRSLNFNTPTNSPSDYPRRNSTSRENLFYSRDNEEKENGKNQSPKPVRIRSPAAGKSTKHFMSPTISAASKISVSPKKKILGDRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNPKKSTYRYDTEVAPVAVETDTKSETAPISKSTIAAAPLRASKTVKSGGLDVISDSHSNSEVVTMAVETDAKLEITRISNSAIAALPPKASEAVEFADVEVSSDPNNDSESPAKTVDLNSSFKDSLVSSSMEIAPLDADPLMPRPYDPKTNYLSPRPQFLHYKPRRINQLELDGKLEELFSSESEFSEGTDSEDPQMESDEASSNESHMKEEEREEGEEEEEVIVNVSEQSPVEAKTSSKLHFSRIFKISSLLLILLTACFSICVVNVHDLERASLWLPMEDSTEVFEFAKTNFNVLVRKFEVWHASSRPYISDMVFNIGGRRPLIYLNQTGFLHKDVNSEEQCLVLSHQTSWEEENDLNVMEEARKEGEIDIVEEHIVRGDHNEEEEELLLEEIEAMKEREIVIEHVEGEVQNEEESFQEIEADANDSKDGEEENGQASANSASEEPLQETEEGSLQEIIEETSTKSASDKLNEEDEIQEKQTEENYEDSSSPDFIHDQIEQEAATGGETKEEQQNDSIQQRNAEIQRQSPPVSPPSAPQSDAEDENDSNIDLVGTATNNRISRDFSQNTAVIASAILLGLSMIIPAGLIYARKSGSKTSSMAAIAEAQEEPPLLKEKKTYQSPAVPEEEEAINDDDREDIARKGFCSSETSSFLQYSSMKEEEKAINDDDREDIAIKGFCSSEMSSFLQYSSMKEEDETETSKILIEAQNHSHGRKTRKNSRTPMASSSLDEFSVSTSSASPSYGSFTTYEKIPIKHGNEEEEIVTPVRRSSRIRKTAAHRQSFAE
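The protein sequence above corresponds to the CDS read codon by codon. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1); the function name `translate` is illustrative and not part of the database's own tooling.

```python
# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Codons containing ambiguous bases such as N are rendered as X.
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons and last four codons of the CDS listed above.
print(translate("ATGGCGCCGCCGTCACACAGG"))
print(translate("TTTGCTGAATGA"))
```

The two calls use the first and last codons of the CDS above; they should reproduce the start (MAPPSHR) and end (FAE) of the listed protein sequence.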
Homology
BLAST of CmoCh04G004590 vs. ExPASy TrEMBL
Match:
A0A6J1FNZ9 (uncharacterized protein LOC111447507 OS=Cucurbita moschata OX=3662 GN=LOC111447507 PE=4 SV=1)
HSP 1 Score: 1728.0 bits (4474), Expect = 0.0e+00
Identity = 953/953 (100.00%), Positives = 953/953 (100.00%), Query Frame = 0
Query: 1 MAPPSHRSSSPSMVAGRASPNSRNSEIVNPTRRSFSSEPRSLNFNTPTNSPSDYPRRNST 60
MAPPSHRSSSPSMVAGRASPNSRNSEIVNPTRRSFSSEPRSLNFNTPTNSPSDYPRRNST
Sbjct: 1 MAPPSHRSSSPSMVAGRASPNSRNSEIVNPTRRSFSSEPRSLNFNTPTNSPSDYPRRNST 60
Query: 61 SRENLFYSRDNEEKENGKNQSPKPVRIRSPAAGKSTKHFMSPTISAASKISVSPKKKILG 120
SRENLFYSRDNEEKENGKNQSPKPVRIRSPAAGKSTKHFMSPTISAASKISVSPKKKILG
Sbjct: 61 SRENLFYSRDNEEKENGKNQSPKPVRIRSPAAGKSTKHFMSPTISAASKISVSPKKKILG 120
Query: 121 DRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNPKKSTYRYDTEVAP 180
DRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNPKKSTYRYDTEVAP
Sbjct: 121 DRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNPKKSTYRYDTEVAP 180
Query: 181 VAVETDTKSETAPISKSTIAAAPLRASKTVKSGGLDVISDSHSNSEVVTMAVETDAKLEI 240
VAVETDTKSETAPISKSTIAAAPLRASKTVKSGGLDVISDSHSNSEVVTMAVETDAKLEI
Sbjct: 181 VAVETDTKSETAPISKSTIAAAPLRASKTVKSGGLDVISDSHSNSEVVTMAVETDAKLEI 240
Query: 241 TRISNSAIAALPPKASEAVEFADVEVSSDPNNDSESPAKTVDLNSSFKDSLVSSSMEIAP 300
TRISNSAIAALPPKASEAVEFADVEVSSDPNNDSESPAKTVDLNSSFKDSLVSSSMEIAP
Sbjct: 241 TRISNSAIAALPPKASEAVEFADVEVSSDPNNDSESPAKTVDLNSSFKDSLVSSSMEIAP 300
Query: 301 LDADPLMPRPYDPKTNYLSPRPQFLHYKPRRINQLELDGKLEELFSSESEFSEGTDSEDP 360
LDADPLMPRPYDPKTNYLSPRPQFLHYKPRRINQLELDGKLEELFSSESEFSEGTDSEDP
Sbjct: 301 LDADPLMPRPYDPKTNYLSPRPQFLHYKPRRINQLELDGKLEELFSSESEFSEGTDSEDP 360
Query: 361 QMESDEASSNESHMKEEEREEGEEEEEVIVNVSEQSPVEAKTSSKLHFSRIFKISSLLLI 420
QMESDEASSNESHMKEEEREEGEEEEEVIVNVSEQSPVEAKTSSKLHFSRIFKISSLLLI
Sbjct: 361 QMESDEASSNESHMKEEEREEGEEEEEVIVNVSEQSPVEAKTSSKLHFSRIFKISSLLLI 420
Query: 421 LLTACFSICVVNVHDLERASLWLPMEDSTEVFEFAKTNFNVLVRKFEVWHASSRPYISDM 480
LLTACFSICVVNVHDLERASLWLPMEDSTEVFEFAKTNFNVLVRKFEVWHASSRPYISDM
Sbjct: 421 LLTACFSICVVNVHDLERASLWLPMEDSTEVFEFAKTNFNVLVRKFEVWHASSRPYISDM 480
Query: 481 VFNIGGRRPLIYLNQTGFLHKDVNSEEQCLVLSHQTSWEEENDLNVMEEARKEGEIDIVE 540
VFNIGGRRPLIYLNQTGFLHKDVNSEEQCLVLSHQTSWEEENDLNVMEEARKEGEIDIVE
Sbjct: 481 VFNIGGRRPLIYLNQTGFLHKDVNSEEQCLVLSHQTSWEEENDLNVMEEARKEGEIDIVE 540
Query: 541 EHIVRGDHNEEEEELLLEEIEAMKEREIVIEHVEGEVQNEEESFQEIEADANDSKDGEEE 600
EHIVRGDHNEEEEELLLEEIEAMKEREIVIEHVEGEVQNEEESFQEIEADANDSKDGEEE
Sbjct: 541 EHIVRGDHNEEEEELLLEEIEAMKEREIVIEHVEGEVQNEEESFQEIEADANDSKDGEEE 600
Query: 601 NGQASANSASEEPLQETEEGSLQEIIEETSTKSASDKLNEEDEIQEKQTEENYEDSSSPD 660
NGQASANSASEEPLQETEEGSLQEIIEETSTKSASDKLNEEDEIQEKQTEENYEDSSSPD
Sbjct: 601 NGQASANSASEEPLQETEEGSLQEIIEETSTKSASDKLNEEDEIQEKQTEENYEDSSSPD 660
Query: 661 FIHDQIEQEAATGGETKEEQQNDSIQQRNAEIQRQSPPVSPPSAPQSDAEDENDSNIDLV 720
FIHDQIEQEAATGGETKEEQQNDSIQQRNAEIQRQSPPVSPPSAPQSDAEDENDSNIDLV
Sbjct: 661 FIHDQIEQEAATGGETKEEQQNDSIQQRNAEIQRQSPPVSPPSAPQSDAEDENDSNIDLV 720
Query: 721 GTATNNRISRDFSQNTAVIASAILLGLSMIIPAGLIYARKSGSKTSSMAAIAEAQEEPPL 780
GTATNNRISRDFSQNTAVIASAILLGLSMIIPAGLIYARKSGSKTSSMAAIAEAQEEPPL
Sbjct: 721 GTATNNRISRDFSQNTAVIASAILLGLSMIIPAGLIYARKSGSKTSSMAAIAEAQEEPPL 780
Query: 781 LKEKKTYQSPAVPEEEEAINDDDREDIARKGFCSSETSSFLQYSSMKEEEKAINDDDRED 840
LKEKKTYQSPAVPEEEEAINDDDREDIARKGFCSSETSSFLQYSSMKEEEKAINDDDRED
Sbjct: 781 LKEKKTYQSPAVPEEEEAINDDDREDIARKGFCSSETSSFLQYSSMKEEEKAINDDDRED 840
Query: 841 IAIKGFCSSEMSSFLQYSSMKEEDETETSKILIEAQNHSHGRKTRKNSRTPMASSSLDEF 900
IAIKGFCSSEMSSFLQYSSMKEEDETETSKILIEAQNHSHGRKTRKNSRTPMASSSLDEF
Sbjct: 841 IAIKGFCSSEMSSFLQYSSMKEEDETETSKILIEAQNHSHGRKTRKNSRTPMASSSLDEF 900
Query: 901 SVSTSSASPSYGSFTTYEKIPIKHGNEEEEIVTPVRRSSRIRKTAAHRQSFAE 954
SVSTSSASPSYGSFTTYEKIPIKHGNEEEEIVTPVRRSSRIRKTAAHRQSFAE
Sbjct: 901 SVSTSSASPSYGSFTTYEKIPIKHGNEEEEIVTPVRRSSRIRKTAAHRQSFAE 953
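The Identity and Positives percentages in each HSP header are the reported match counts divided by the alignment length (gaps included). A small sketch of that arithmetic, using counts copied from the HSP headers in this section; the helper name `percent` is hypothetical.

```python
def percent(matches: int, alignment_length: int) -> str:
    """Format a match count as a percentage of the alignment length."""
    return f"{100.0 * matches / alignment_length:.2f}"


# Counts taken from the HSP headers for A0A6J1FNZ9 and A0A6J1IJW9.
print(percent(953, 953))  # identity, A0A6J1FNZ9
print(percent(885, 976))  # identity, A0A6J1IJW9
print(percent(903, 976))  # positives, A0A6J1IJW9
```

An alignment length larger than the 953-residue query, such as 976 for the second hit, indicates that gaps were introduced into the alignment.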
BLAST of CmoCh04G004590 vs. ExPASy TrEMBL
Match:
A0A6J1IJW9 (uncharacterized protein LOC111475455 OS=Cucurbita maxima OX=3661 GN=LOC111475455 PE=4 SV=1)
HSP 1 Score: 1525.0 bits (3947), Expect = 0.0e+00
Identity = 885/976 (90.68%), Positives = 903/976 (92.52%), Query Frame = 0
Query: 1 MAPPSHRSSSPSMVAGRASPNSRNSEIVNPTRRSFSSEPRSLNFNTPTNSPSDYPRRNST 60
MAPPSHRSSSPSMVAGR SPNSRNSEIVNPTRRSFSSE RSLNFNTP NSPSDYPRRNST
Sbjct: 1 MAPPSHRSSSPSMVAGRTSPNSRNSEIVNPTRRSFSSEQRSLNFNTPMNSPSDYPRRNST 60
Query: 61 SRENLFYSRDNEEKENGKNQSPKPVRIRSPAAGKSTKHFMSPTISAASKISVSPKKKILG 120
SRENLF SRDNEEKENGKNQSPKPVRIRSPAAGKSTK+FMSPTISAASKISVSPKKKILG
Sbjct: 61 SRENLFNSRDNEEKENGKNQSPKPVRIRSPAAGKSTKNFMSPTISAASKISVSPKKKILG 120
Query: 121 DRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNPKK----------- 180
DRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNPKK
Sbjct: 121 DRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNPKKSKTVKLGGFEV 180
Query: 181 -----STYRYDTEVAPVAVETDTKSETAPISKSTIAAAPLRASKTVKSGGLDVISDSHSN 240
STYR+D EVAPVAVETDTKSE APISKSTIAAAPLRASKTVKSGGLDVISDSHSN
Sbjct: 181 ITGSESTYRHDPEVAPVAVETDTKSEIAPISKSTIAAAPLRASKTVKSGGLDVISDSHSN 240
Query: 241 SEVVTMAVETDAKLEITRISNSAIAALPPKASEAVEFADVEVSSDPNNDSESPAK----- 300
SEVVTMAVETDAKLE NSAIAALPPKASE VEFADV VSSD NDSESPAK
Sbjct: 241 SEVVTMAVETDAKLE-----NSAIAALPPKASETVEFADVVVSSDSINDSESPAKNSSAE 300
Query: 301 ---TVDLNSSFKDSLVSSSMEIAPLDADPLMPRPYDPKTNYLSPRPQFLHYKPRRINQLE 360
TV LNSSFKDSLVSSSMEIAPLDADPLMPRPYDPKTNYLSPRPQFLHYKPRRINQLE
Sbjct: 301 ELDTVGLNSSFKDSLVSSSMEIAPLDADPLMPRPYDPKTNYLSPRPQFLHYKPRRINQLE 360
Query: 361 LDGKLEELFSSESEFSEGTDSEDPQMESDEASSNESHMKEEEREEGEEEEEVIVNVSEQS 420
LDGKLEELFSSESEF+EGTDSEDPQM+SDE SSNESHMKEEEREE EEEEE+IVNVSEQS
Sbjct: 361 LDGKLEELFSSESEFTEGTDSEDPQMKSDEVSSNESHMKEEEREEEEEEEEMIVNVSEQS 420
Query: 421 PVEAKTSSKLHFSRIFKISSLLLILLTACFSICVVNVHDLERASLWLPMEDSTEVFEFAK 480
PVEAKTSSKLHFSRIFKISSLLLILLTACFSI VVNVHDLERASL LPMEDSTEVFEFAK
Sbjct: 421 PVEAKTSSKLHFSRIFKISSLLLILLTACFSISVVNVHDLERASLLLPMEDSTEVFEFAK 480
Query: 481 TNFNVLVRKFEVWHASSRPYISDMVFNIGGRRPLIYLNQTGFLHKDVNSEEQCLVLSHQT 540
TNFNVL+RKFEVWHA+SR YISDMVFNIGGRRPLIYLNQTGFLHKDVNSE QCLVLSHQT
Sbjct: 481 TNFNVLMRKFEVWHANSRSYISDMVFNIGGRRPLIYLNQTGFLHKDVNSELQCLVLSHQT 540
Query: 541 SWEEENDLNVMEEARKEGEIDIVEEHIVRGDHNEEEEELLLEEIEAMKEREIVIEHVEGE 600
SWEEENDLNVMEEARKEGEIDIVEE IVRGD N EE ELL EEIEAMKEREIVIEHV+GE
Sbjct: 541 SWEEENDLNVMEEARKEGEIDIVEEPIVRGDQN-EEVELLSEEIEAMKEREIVIEHVKGE 600
Query: 601 VQNEEESFQEIEADANDSKDGEEENGQASANSASEEPLQETEEGSLQEIIEETSTKSASD 660
VQNEEESFQEIEA+AND KDGEEENGQASA SASEEPLQE EEGSLQEIIEETSTKSASD
Sbjct: 601 VQNEEESFQEIEANANDPKDGEEENGQASAKSASEEPLQENEEGSLQEIIEETSTKSASD 660
Query: 661 KLNEEDEIQEKQTEENYEDSSSPDFIHDQIEQEAATGGETKEEQQNDSIQQRNAEIQRQS 720
LNEED+IQEKQTEENYEDSS+PDFIHDQIEQEAATGGETKEEQQNDSIQQ NAEIQ QS
Sbjct: 661 ILNEEDKIQEKQTEENYEDSSTPDFIHDQIEQEAATGGETKEEQQNDSIQQSNAEIQHQS 720
Query: 721 PPVS-PPSAPQSDAEDENDSNIDLVGTATNNRISRDFSQNTAVIASAILLGLS-MIIPAG 780
PPVS PPSAPQS+AEDEN SNI VGT TNN+ISRDFSQNTAVIASAILLGLS +IIPAG
Sbjct: 721 PPVSPPPSAPQSEAEDENGSNI--VGTETNNKISRDFSQNTAVIASAILLGLSIIIIPAG 780
Query: 781 LIYARKSGSKTSSMAAIAEAQEEPPLLKEKKTYQSPAVPEEEEAINDDDREDIARKGFCS 840
LIYARKSGSK SSMAAIAEAQEEPPLLKEKKTYQSPAVPEEEEAINDDDREDIARKG S
Sbjct: 781 LIYARKSGSKRSSMAAIAEAQEEPPLLKEKKTYQSPAVPEEEEAINDDDREDIARKGLFS 840
Query: 841 SETSSFLQYSSMKEEEKAIN-DDDREDIAIKGFCSSEMSSFLQYSSMKEEDETETS-KIL 900
SETSSFLQYSSMKE E+A N DDDRE+IA KGFCSSE SSFLQYSSMKEEDETET+ K+L
Sbjct: 841 SETSSFLQYSSMKEVEEAFNGDDDREEIARKGFCSSETSSFLQYSSMKEEDETETAKKLL 900
Query: 901 IEAQNHSHGRKTRKNSRTPMASSSLDEFSVSTSSASPSYGSFTTYEKIPIKHGNEEEEIV 949
IEAQNHSHGRKTRKNSRTPMASSSLDEFSVSTSSASPSYGSFTTYEKIPIKHGNEEEEIV
Sbjct: 901 IEAQNHSHGRKTRKNSRTPMASSSLDEFSVSTSSASPSYGSFTTYEKIPIKHGNEEEEIV 960
BLAST of CmoCh04G004590 vs. ExPASy TrEMBL
Match:
A0A6J1E9M3 (uncharacterized protein LOC111430638 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111430638 PE=4 SV=1)
HSP 1 Score: 767.7 bits (1981), Expect = 5.7e-218
Identity = 579/1057 (54.78%), Positives = 676/1057 (63.95%), Query Frame = 0
Query: 1 MAPPSHRSSSPSMVAGRASPNSRNSEIVNPTRRSFSSEP----------RSLNFNTPTNS 60
MA PS+RSSSPSMV GR SP SRNSEI NP RSFSS P RSLN TP NS
Sbjct: 1 MALPSNRSSSPSMVTGRTSPISRNSEIGNPVYRSFSSNPFSKPSIATSLRSLNPITPANS 60
Query: 61 PSDY-PRRNSTSRENLFYSRDNEEKENGKNQSPKPVRIRSPAAGKSTKHFMSPTISAASK 120
PSDY P+RNS SRE LF SRDNE+KENGK+QSPK R+RSP GKS K+FMS TISAASK
Sbjct: 61 PSDYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAASK 120
Query: 121 ISVSPKKKILGDRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNPK- 180
I+VSPKKKILGDRNE VRSSLSFSG+KSSSLNS+NP PEAS A ESDTN + ISNPK
Sbjct: 121 IAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSMNPTPEASMAFESDTNPPMPLISNPKS 180
Query: 181 -------------------KSTYRYD--TEVAPVAVETDTKSETAPISKSTIAAAPLRAS 240
+STYRYD E+ +A TDTKS PI KS IAAA ++S
Sbjct: 181 TKTVRFGGVEVISGSYEDSESTYRYDLNPELVTMAAVTDTKSGIGPIPKSAIAAASSKSS 240
Query: 241 KTVKSGGLDVISD------------SHSNSEVVTMAVETDAKLEITRISNSAIAALPPKA 300
KTV GG +VISD N E VT+AVE +A+ EI IS+S IAA+ P+A
Sbjct: 241 KTVTFGGFEVISDFCDDSEYTYRHGHDPNPEAVTVAVEANAEPEIGPISDSDIAAVTPEA 300
Query: 301 SEAVEFADVEV------SSDPNNDSESPAKTVDLNSSFKDSLVSSSMEIAPLDADPLMPR 360
S+ + F+D EV S N++ +V+L+ SF S VSS M IAP+DADP++
Sbjct: 301 SKIMRFSDFEVVSNNALESSVNSNLTEEVDSVNLDPSFNISPVSSPM-IAPMDADPII-T 360
Query: 361 PYDPKTNYLSPRPQFLHYKP-RRINQLELDGKLEELFSSESEFSEGTDSEDPQMESDEAS 420
PYDPKTNYLSPRPQFLHY P RRIN+ DG+ EELFSS SE D EDPQ ESDE S
Sbjct: 361 PYDPKTNYLSPRPQFLHYNPNRRINR--PDGRFEELFSS----SEEIDCEDPQKESDEVS 420
Query: 421 SNESHMKEEEREEGEEEEEVIVNVSEQSPVEAKTSSKLHFSRIFKISSLLLILLTACFSI 480
SNES MKEEE+EE E VNVSEQ P E K SSKL SRI KISSLLLIL TAC SI
Sbjct: 421 SNESQMKEEEKEEEE------VNVSEQGPTEVKKSSKL--SRILKISSLLLILFTACLSI 480
Query: 481 CVVNVHD--LERASLWLPMEDSTEVFEFAKTNFNVLVRKFEVWHASSRPYISDMVFNIGG 540
CVVNVHD + + S L D +E+FE AKTNFNVLV K E+WHA+S +ISD+VFN G
Sbjct: 481 CVVNVHDPTIFQRSTMLTTGDQSEIFESAKTNFNVLVEKLEIWHANSISFISDVVFNFRG 540
Query: 541 RRPLIYLNQTGFLHKDVNSEEQCLVLSHQTSWEEENDLNVMEEARKEGEIDIVEEHIVRG 600
PLI+LNQT + DVN +EQCLVLS+Q WEEEN+L EA K+ E G
Sbjct: 541 GPPLIFLNQTEY--GDVNKDEQCLVLSNQNVWEEENNLMNAMEAMKDRE----------G 600
Query: 601 DHNE-EEEELLLEEIEAMKEREIVIEHVEGEVQN---EEESFQEIEADANDSKDGEEENG 660
+ E +E+E +E+EA+K REI I+ VE E QN EEESFQEIEA NDS D EEEN
Sbjct: 601 QNKERQEQEEDAQEVEAIKVREIGIQTVEIESQNEEAEEESFQEIEARTNDSADIEEEND 660
Query: 661 QASANSASE------------------EPLQETEEGSLQEI-------------IEETST 720
+AS S E E Q+TE ++EI +EE
Sbjct: 661 EASEESLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMKVREIGIETVERESQNEEVEEEPF 720
Query: 721 KSASDKLNE--------------------EDEIQEKQTEENYEDSSSPDF-IHDQIEQEA 780
+ A K N+ E+E +++T EN++ SSS DF +HDQIEQ A
Sbjct: 721 QKAEAKANDQKDSEEENDEASEESLLEIVEEEFIQEKTVENFKASSSSDFKLHDQIEQAA 780
Query: 781 ATGGETKEEQQNDSIQQRNAEIQRQSPPV-SPPSAPQSDAEDENDSNI-DLVGTATNNRI 840
AT GET+EE N E Q QSPPV SPPS QSD E+EN I DL+ TAT I
Sbjct: 781 AT-GETQEE--------TNTEFQYQSPPVSSPPSEHQSDVEEENGGKIVDLIRTATG--I 840
Query: 841 SRDFSQNTAVIASAILLGLSMIIPAGLIYARKSGS-KTSSMAAIAEAQEEPPLLKEKKTY 900
SRDF+QNTA I SAILLGL +IIPAGLIYARKSGS +T+S AAIAE Q+E PLLK+KKT
Sbjct: 841 SRDFTQNTAAIISAILLGLFLIIPAGLIYARKSGSRRTTSTAAIAEEQQEEPLLKDKKTN 900
Query: 901 QSPAVPEEEEAINDDDREDIARKGFCSSETSSFLQYSSMKEEEKAINDDDREDIAIKGFC 943
QS EEEE DDD +D+A + FCSSETSSF QYSS++E E E
Sbjct: 901 QSLVEEEEEEEAPDDDDDDMAGE-FCSSETSSFFQYSSVREGETETAKRSNE-------- 960
BLAST of CmoCh04G004590 vs. ExPASy TrEMBL
Match:
A0A6J1E496 (uncharacterized protein LOC111430638 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111430638 PE=4 SV=1)
HSP 1 Score: 762.7 bits (1968), Expect = 1.8e-216
Identity = 579/1059 (54.67%), Positives = 676/1059 (63.83%), Query Frame = 0
Query: 1 MAPPSHRSSSPSMVAGRASPNSRNSEIVNPTRRSFSSEP----------RSLNFNTPTNS 60
MA PS+RSSSPSMV GR SP SRNSEI NP RSFSS P RSLN TP NS
Sbjct: 1 MALPSNRSSSPSMVTGRTSPISRNSEIGNPVYRSFSSNPFSKPSIATSLRSLNPITPANS 60
Query: 61 PS--DY-PRRNSTSRENLFYSRDNEEKENGKNQSPKPVRIRSPAAGKSTKHFMSPTISAA 120
PS DY P+RNS SRE LF SRDNE+KENGK+QSPK R+RSP GKS K+FMS TISAA
Sbjct: 61 PSVADYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAA 120
Query: 121 SKISVSPKKKILGDRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNP 180
SKI+VSPKKKILGDRNE VRSSLSFSG+KSSSLNS+NP PEAS A ESDTN + ISNP
Sbjct: 121 SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSMNPTPEASMAFESDTNPPMPLISNP 180
Query: 181 K--------------------KSTYRYD--TEVAPVAVETDTKSETAPISKSTIAAAPLR 240
K +STYRYD E+ +A TDTKS PI KS IAAA +
Sbjct: 181 KSTKTVRFGGVEVISGSYEDSESTYRYDLNPELVTMAAVTDTKSGIGPIPKSAIAAASSK 240
Query: 241 ASKTVKSGGLDVISD------------SHSNSEVVTMAVETDAKLEITRISNSAIAALPP 300
+SKTV GG +VISD N E VT+AVE +A+ EI IS+S IAA+ P
Sbjct: 241 SSKTVTFGGFEVISDFCDDSEYTYRHGHDPNPEAVTVAVEANAEPEIGPISDSDIAAVTP 300
Query: 301 KASEAVEFADVEV------SSDPNNDSESPAKTVDLNSSFKDSLVSSSMEIAPLDADPLM 360
+AS+ + F+D EV S N++ +V+L+ SF S VSS M IAP+DADP++
Sbjct: 301 EASKIMRFSDFEVVSNNALESSVNSNLTEEVDSVNLDPSFNISPVSSPM-IAPMDADPII 360
Query: 361 PRPYDPKTNYLSPRPQFLHYKP-RRINQLELDGKLEELFSSESEFSEGTDSEDPQMESDE 420
PYDPKTNYLSPRPQFLHY P RRIN+ DG+ EELFSS SE D EDPQ ESDE
Sbjct: 361 -TPYDPKTNYLSPRPQFLHYNPNRRINR--PDGRFEELFSS----SEEIDCEDPQKESDE 420
Query: 421 ASSNESHMKEEEREEGEEEEEVIVNVSEQSPVEAKTSSKLHFSRIFKISSLLLILLTACF 480
SSNES MKEEE+EE E VNVSEQ P E K SSKL SRI KISSLLLIL TAC
Sbjct: 421 VSSNESQMKEEEKEEEE------VNVSEQGPTEVKKSSKL--SRILKISSLLLILFTACL 480
Query: 481 SICVVNVHD--LERASLWLPMEDSTEVFEFAKTNFNVLVRKFEVWHASSRPYISDMVFNI 540
SICVVNVHD + + S L D +E+FE AKTNFNVLV K E+WHA+S +ISD+VFN
Sbjct: 481 SICVVNVHDPTIFQRSTMLTTGDQSEIFESAKTNFNVLVEKLEIWHANSISFISDVVFNF 540
Query: 541 GGRRPLIYLNQTGFLHKDVNSEEQCLVLSHQTSWEEENDLNVMEEARKEGEIDIVEEHIV 600
G PLI+LNQT + DVN +EQCLVLS+Q WEEEN+L EA K+ E
Sbjct: 541 RGGPPLIFLNQTEY--GDVNKDEQCLVLSNQNVWEEENNLMNAMEAMKDRE--------- 600
Query: 601 RGDHNE-EEEELLLEEIEAMKEREIVIEHVEGEVQN---EEESFQEIEADANDSKDGEEE 660
G + E +E+E +E+EA+K REI I+ VE E QN EEESFQEIEA NDS D EEE
Sbjct: 601 -GQNKERQEQEEDAQEVEAIKVREIGIQTVEIESQNEEAEEESFQEIEARTNDSADIEEE 660
Query: 661 NGQASANSASE------------------EPLQETEEGSLQEI-------------IEET 720
N +AS S E E Q+TE ++EI +EE
Sbjct: 661 NDEASEESLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMKVREIGIETVERESQNEEVEEE 720
Query: 721 STKSASDKLNE--------------------EDEIQEKQTEENYEDSSSPDF-IHDQIEQ 780
+ A K N+ E+E +++T EN++ SSS DF +HDQIEQ
Sbjct: 721 PFQKAEAKANDQKDSEEENDEASEESLLEIVEEEFIQEKTVENFKASSSSDFKLHDQIEQ 780
Query: 781 EAATGGETKEEQQNDSIQQRNAEIQRQSPPV-SPPSAPQSDAEDENDSNI-DLVGTATNN 840
AAT GET+EE N E Q QSPPV SPPS QSD E+EN I DL+ TAT
Sbjct: 781 AAAT-GETQEE--------TNTEFQYQSPPVSSPPSEHQSDVEEENGGKIVDLIRTATG- 840
Query: 841 RISRDFSQNTAVIASAILLGLSMIIPAGLIYARKSGS-KTSSMAAIAEAQEEPPLLKEKK 900
ISRDF+QNTA I SAILLGL +IIPAGLIYARKSGS +T+S AAIAE Q+E PLLK+KK
Sbjct: 841 -ISRDFTQNTAAIISAILLGLFLIIPAGLIYARKSGSRRTTSTAAIAEEQQEEPLLKDKK 900
Query: 901 TYQSPAVPEEEEAINDDDREDIARKGFCSSETSSFLQYSSMKEEEKAINDDDREDIAIKG 943
T QS EEEE DDD +D+A + FCSSETSSF QYSS++E E E
Sbjct: 901 TNQSLVEEEEEEEAPDDDDDDMAGE-FCSSETSSFFQYSSVREGETETAKRSNE------ 960
BLAST of CmoCh04G004590 vs. ExPASy TrEMBL
Match:
A0A6J1J980 (uncharacterized protein LOC111482876 isoform X5 OS=Cucurbita maxima OX=3661 GN=LOC111482876 PE=4 SV=1)
HSP 1 Score: 761.9 bits (1966), Expect = 3.1e-216
Identity = 571/1061 (53.82%), Positives = 675/1061 (63.62%), Query Frame = 0
Query: 1 MAPPSHRSSSPSMVAGRASPNSRNSEIVNPTRRSFSSEP----------RSLNFNTPTNS 60
MA PS+RSSSPSMV GR SP SRNSEI NP RSFSS P +SLN TP N+
Sbjct: 1 MALPSNRSSSPSMVTGRTSPISRNSEISNPVYRSFSSNPFSKPSIATSLKSLNPITPANN 60
Query: 61 PS--DY-PRRNSTSRENLFYSRDNEEKENGKNQSPKPVRIRSPAAGKSTKHFMSPTISAA 120
PS DY P+RNS SRE LF SRDNE+KENGK+QSPK R+RSP GKS K+FMS TISAA
Sbjct: 61 PSVADYPPQRNSVSREILFTSRDNEDKENGKDQSPKLTRVRSPTVGKSMKNFMSSTISAA 120
Query: 121 SKISVSPKKKILGDRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNP 180
SKI+VSPKKKILGDRNE VRSSLSFSG+KSSSLNSVNP PEAS A ESDTN + ISNP
Sbjct: 121 SKIAVSPKKKILGDRNEPVRSSLSFSGMKSSSLNSVNPTPEASMAFESDTNPPMPLISNP 180
Query: 181 K--------------------KSTYRY--DTEVAPVAVETDTKSETAPISKSTIAAAPLR 240
K +S YRY + E+ +A TD+KS PI+KS IAAA +
Sbjct: 181 KSTKTVRFGGVEVISGSYEDSESAYRYNLNPELVTIAAVTDSKSGIVPIAKSAIAAASSK 240
Query: 241 ASKTVKSGGLDVISDSHS------------NSEVVTMAVETDAKLEITRISNSAIAALPP 300
+SKTV GG +VISDS+ N E VT+AVE DA+ EI IS+S IAA+ P
Sbjct: 241 SSKTVTFGGFEVISDSYDDSESTYRHGHDPNPEAVTVAVEADAEPEIGPISDSDIAAVTP 300
Query: 301 KASEAVEFADVE------VSSDPNNDSESPAKTVDLNSSFKDSLVSSSMEIAPLDADPLM 360
+AS+ + F+D+E + S N++ V+L+ SF S VSS M IAP+DADP++
Sbjct: 301 EASKIMRFSDLEAVSNNALESSVNSNFTEEVDCVNLDPSFNISPVSSPM-IAPMDADPII 360
Query: 361 PRPYDPKTNYLSPRPQFLHYKP-RRINQLELDGKLEELFSSESEFSEGTDSEDPQMESDE 420
PYDPKTNYLSPRPQFLHY P RRIN+ DG+ EELFS+ SE TD EDPQ ESDE
Sbjct: 361 -TPYDPKTNYLSPRPQFLHYNPNRRINR--PDGRFEELFST----SEETDCEDPQKESDE 420
Query: 421 ASSNESHMKEEEREEGEEEEEVIVNVSEQSPVEAKTSSKLHFSRIFKISSLLLILLTACF 480
SSNES MKEEE+EE V+VSEQ P E K SSK SRIFKISSLLLIL TAC
Sbjct: 421 VSSNESQMKEEEKEEE-------VDVSEQGPTEVKKSSKPLLSRIFKISSLLLILFTACL 480
Query: 481 SICVVNVHD---LERASLWLPMEDSTEVFEFAKTNFNVLVRKFEVWHASSRPYISDMVFN 540
SICVVNVHD ER++L L M D +E+F AKTNFNVLV K E+WHA+S +ISD+VFN
Sbjct: 481 SICVVNVHDPTIFERSTL-LTMGDQSEIFASAKTNFNVLVGKLEIWHANSISFISDVVFN 540
Query: 541 IGGRRPLIYLNQTGFLHKDVNSEEQCLVLSHQTSWEEENDLNVMEEARKEGEIDIVEEHI 600
G PLI+LNQT F + DVN +EQCLVLSHQ WEEEN+L EA K+ E
Sbjct: 541 FRGGPPLIHLNQTEFFYGDVNKDEQCLVLSHQNVWEEENNLMNAMEAMKDRE-------- 600
Query: 601 VRGDHNEEEEELLLEEIEAMKEREIVIEHVEGEVQN---EEESFQEIEADANDSKDGEEE 660
G + E +E+ + EA+K +EI I+ VE E QN EE+SFQEIEA NDS++ E+E
Sbjct: 601 --GQNKEGQEQEEDAQEEAIKVKEIGIQTVERESQNEEVEEQSFQEIEARTNDSENSEKE 660
Query: 661 NGQASANS------------------------------------------------ASEE 720
N +AS S EE
Sbjct: 661 NDEASEESLQEIIEHIEGEGQNIEGQEQQEEAQDTEAMKEREIGIETVERESQNEEVEEE 720
Query: 721 PLQETEEGSLQEIIEETSTKSASD----KLNEEDEIQEKQTEENYEDSSSPDF-IHDQIE 780
P Q+TE + + E AS+ ++ EE+ +QEK T EN++ SSS DF +H QIE
Sbjct: 721 PFQKTEAKANDQKDREEENDEASEESLLEIVEEESVQEK-TVENFKASSSSDFKLHGQIE 780
Query: 781 QEAATGGETKEEQQNDSIQQRNAEIQRQSPPV-SPPSAPQSDAEDENDSNI-DLVGTATN 840
Q AAT GET+EE N E Q QSPPV SPPS QSD E+EN I DL+ TAT
Sbjct: 781 QAAAT-GETQEE--------TNTEFQYQSPPVSSPPSEHQSDVEEENGGKIVDLIRTATG 840
Query: 841 NRISRDFSQNTAVIASAILLGLSMIIPAGLIYARKSGS-KTSSMAAIAEAQEEPPLLKEK 900
ISRDF+QNTA I SAILLGL +IIPAGLIYARKSGS +T+S AAIAE Q+E PLLK+K
Sbjct: 841 --ISRDFTQNTAAIISAILLGLFLIIPAGLIYARKSGSRRTTSTAAIAEEQQEEPLLKDK 900
Query: 901 KTYQSPAVPEEEEAINDDDREDIARKGFCSSETSSFLQYSSMKEEEKAINDDDREDIAIK 943
KT QS EEEE DDD +D+A + FCSSETSSF QYSS++E E A K
Sbjct: 901 KTNQSLVEEEEEEDALDDDDDDMAGE-FCSSETSSFFQYSSVREGETE---------AAK 960
BLAST of CmoCh04G004590 vs. NCBI nr
Match:
XP_022942486.1 (uncharacterized protein LOC111447507 [Cucurbita moschata])
HSP 1 Score: 1728.0 bits (4474), Expect = 0.0e+00
Identity = 953/953 (100.00%), Positives = 953/953 (100.00%), Query Frame = 0
Query: 1 MAPPSHRSSSPSMVAGRASPNSRNSEIVNPTRRSFSSEPRSLNFNTPTNSPSDYPRRNST 60
MAPPSHRSSSPSMVAGRASPNSRNSEIVNPTRRSFSSEPRSLNFNTPTNSPSDYPRRNST
Sbjct: 1 MAPPSHRSSSPSMVAGRASPNSRNSEIVNPTRRSFSSEPRSLNFNTPTNSPSDYPRRNST 60
Query: 61 SRENLFYSRDNEEKENGKNQSPKPVRIRSPAAGKSTKHFMSPTISAASKISVSPKKKILG 120
SRENLFYSRDNEEKENGKNQSPKPVRIRSPAAGKSTKHFMSPTISAASKISVSPKKKILG
Sbjct: 61 SRENLFYSRDNEEKENGKNQSPKPVRIRSPAAGKSTKHFMSPTISAASKISVSPKKKILG 120
Query: 121 DRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNPKKSTYRYDTEVAP 180
DRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNPKKSTYRYDTEVAP
Sbjct: 121 DRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNPKKSTYRYDTEVAP 180
Query: 181 VAVETDTKSETAPISKSTIAAAPLRASKTVKSGGLDVISDSHSNSEVVTMAVETDAKLEI 240
VAVETDTKSETAPISKSTIAAAPLRASKTVKSGGLDVISDSHSNSEVVTMAVETDAKLEI
Sbjct: 181 VAVETDTKSETAPISKSTIAAAPLRASKTVKSGGLDVISDSHSNSEVVTMAVETDAKLEI 240
Query: 241 TRISNSAIAALPPKASEAVEFADVEVSSDPNNDSESPAKTVDLNSSFKDSLVSSSMEIAP 300
TRISNSAIAALPPKASEAVEFADVEVSSDPNNDSESPAKTVDLNSSFKDSLVSSSMEIAP
Sbjct: 241 TRISNSAIAALPPKASEAVEFADVEVSSDPNNDSESPAKTVDLNSSFKDSLVSSSMEIAP 300
Query: 301 LDADPLMPRPYDPKTNYLSPRPQFLHYKPRRINQLELDGKLEELFSSESEFSEGTDSEDP 360
LDADPLMPRPYDPKTNYLSPRPQFLHYKPRRINQLELDGKLEELFSSESEFSEGTDSEDP
Sbjct: 301 LDADPLMPRPYDPKTNYLSPRPQFLHYKPRRINQLELDGKLEELFSSESEFSEGTDSEDP 360
Query: 361 QMESDEASSNESHMKEEEREEGEEEEEVIVNVSEQSPVEAKTSSKLHFSRIFKISSLLLI 420
QMESDEASSNESHMKEEEREEGEEEEEVIVNVSEQSPVEAKTSSKLHFSRIFKISSLLLI
Sbjct: 361 QMESDEASSNESHMKEEEREEGEEEEEVIVNVSEQSPVEAKTSSKLHFSRIFKISSLLLI 420
Query: 421 LLTACFSICVVNVHDLERASLWLPMEDSTEVFEFAKTNFNVLVRKFEVWHASSRPYISDM 480
LLTACFSICVVNVHDLERASLWLPMEDSTEVFEFAKTNFNVLVRKFEVWHASSRPYISDM
Sbjct: 421 LLTACFSICVVNVHDLERASLWLPMEDSTEVFEFAKTNFNVLVRKFEVWHASSRPYISDM 480
Query: 481 VFNIGGRRPLIYLNQTGFLHKDVNSEEQCLVLSHQTSWEEENDLNVMEEARKEGEIDIVE 540
VFNIGGRRPLIYLNQTGFLHKDVNSEEQCLVLSHQTSWEEENDLNVMEEARKEGEIDIVE
Sbjct: 481 VFNIGGRRPLIYLNQTGFLHKDVNSEEQCLVLSHQTSWEEENDLNVMEEARKEGEIDIVE 540
Query: 541 EHIVRGDHNEEEEELLLEEIEAMKEREIVIEHVEGEVQNEEESFQEIEADANDSKDGEEE 600
EHIVRGDHNEEEEELLLEEIEAMKEREIVIEHVEGEVQNEEESFQEIEADANDSKDGEEE
Sbjct: 541 EHIVRGDHNEEEEELLLEEIEAMKEREIVIEHVEGEVQNEEESFQEIEADANDSKDGEEE 600
Query: 601 NGQASANSASEEPLQETEEGSLQEIIEETSTKSASDKLNEEDEIQEKQTEENYEDSSSPD 660
NGQASANSASEEPLQETEEGSLQEIIEETSTKSASDKLNEEDEIQEKQTEENYEDSSSPD
Sbjct: 601 NGQASANSASEEPLQETEEGSLQEIIEETSTKSASDKLNEEDEIQEKQTEENYEDSSSPD 660
Query: 661 FIHDQIEQEAATGGETKEEQQNDSIQQRNAEIQRQSPPVSPPSAPQSDAEDENDSNIDLV 720
FIHDQIEQEAATGGETKEEQQNDSIQQRNAEIQRQSPPVSPPSAPQSDAEDENDSNIDLV
Sbjct: 661 FIHDQIEQEAATGGETKEEQQNDSIQQRNAEIQRQSPPVSPPSAPQSDAEDENDSNIDLV 720
Query: 721 GTATNNRISRDFSQNTAVIASAILLGLSMIIPAGLIYARKSGSKTSSMAAIAEAQEEPPL 780
GTATNNRISRDFSQNTAVIASAILLGLSMIIPAGLIYARKSGSKTSSMAAIAEAQEEPPL
Sbjct: 721 GTATNNRISRDFSQNTAVIASAILLGLSMIIPAGLIYARKSGSKTSSMAAIAEAQEEPPL 780
Query: 781 LKEKKTYQSPAVPEEEEAINDDDREDIARKGFCSSETSSFLQYSSMKEEEKAINDDDRED 840
LKEKKTYQSPAVPEEEEAINDDDREDIARKGFCSSETSSFLQYSSMKEEEKAINDDDRED
Sbjct: 781 LKEKKTYQSPAVPEEEEAINDDDREDIARKGFCSSETSSFLQYSSMKEEEKAINDDDRED 840
Query: 841 IAIKGFCSSEMSSFLQYSSMKEEDETETSKILIEAQNHSHGRKTRKNSRTPMASSSLDEF 900
IAIKGFCSSEMSSFLQYSSMKEEDETETSKILIEAQNHSHGRKTRKNSRTPMASSSLDEF
Sbjct: 841 IAIKGFCSSEMSSFLQYSSMKEEDETETSKILIEAQNHSHGRKTRKNSRTPMASSSLDEF 900
Query: 901 SVSTSSASPSYGSFTTYEKIPIKHGNEEEEIVTPVRRSSRIRKTAAHRQSFAE 954
SVSTSSASPSYGSFTTYEKIPIKHGNEEEEIVTPVRRSSRIRKTAAHRQSFAE
Sbjct: 901 SVSTSSASPSYGSFTTYEKIPIKHGNEEEEIVTPVRRSSRIRKTAAHRQSFAE 953
BLAST of CmoCh04G004590 vs. NCBI nr
Match:
KAG7030926.1 (hypothetical protein SDJN02_04963 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1664.8 bits (4310), Expect = 0.0e+00
Identity = 931/950 (98.00%), Positives = 937/950 (98.63%), Query Frame = 0
Query: 1 MAPPSHRSSSPSMVAGRASPNSRNSEIVNPTRRSFSSEPRSLNFNTPTNSPSDYPRRNST 60
M PPSHRSSSPSMVAGRASPNSRNSEIVNPTRRSFSSEPRSLNFNTPTNSPSDYPRRNST
Sbjct: 1 MVPPSHRSSSPSMVAGRASPNSRNSEIVNPTRRSFSSEPRSLNFNTPTNSPSDYPRRNST 60
Query: 61 SRENLFYSRDNEEKENGKNQSPKPVRIRSPAAGKSTKHFMSPTISAASKISVSPKKKILG 120
SRE LF SRDNEEKENGKNQSPKPVRIRSPAAGKSTKHFMSPTISAASKISVSPKKKILG
Sbjct: 61 SREILFNSRDNEEKENGKNQSPKPVRIRSPAAGKSTKHFMSPTISAASKISVSPKKKILG 120
Query: 121 DRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNPKKSTYRYDTEVAP 180
DRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNPKKSTYRYDTEVAP
Sbjct: 121 DRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNPKKSTYRYDTEVAP 180
Query: 181 VAVETDTKSETAPISKSTIAAAPLRASKTVKSGGLDVISDSHSNSEVVTMAVETDAKLEI 240
VAVETDTKSETAPISKSTIAAAPLRASKTVKSGGLDVISDS+SNSEVVTMAVETDAKLEI
Sbjct: 181 VAVETDTKSETAPISKSTIAAAPLRASKTVKSGGLDVISDSYSNSEVVTMAVETDAKLEI 240
Query: 241 TRISNSAIAALPPKASEAVEFADVEVSSDPNNDSESPAKTVDLNSSFKDSLVSSSMEIAP 300
T IS+SAIAALPPKASEAVEFADVEVSSDPNNDSESPAKTVDLNSSFKDSLVSSSMEIAP
Sbjct: 241 TPISDSAIAALPPKASEAVEFADVEVSSDPNNDSESPAKTVDLNSSFKDSLVSSSMEIAP 300
Query: 301 LDADPLMPRPYDPKTNYLSPRPQFLHYKPRRINQLELDGKLEELFSSESEFSEGTDSEDP 360
LDADPLMPRPYDPKTNYLSPRPQFLHYKPRRINQLELDGKLEELFSSESEFSEGTDSEDP
Sbjct: 301 LDADPLMPRPYDPKTNYLSPRPQFLHYKPRRINQLELDGKLEELFSSESEFSEGTDSEDP 360
Query: 361 QMESDEASSNESHMKEEEREEGEEEEEVIVNVSEQSPVEAKTSSKLHFSRIFKISSLLLI 420
QMESDEASSNESHMKEEEREE EEEEEVIVNVSEQSPVEAKTSSKLHFSRIFKISSLLLI
Sbjct: 361 QMESDEASSNESHMKEEEREEEEEEEEVIVNVSEQSPVEAKTSSKLHFSRIFKISSLLLI 420
Query: 421 LLTACFSICVVNVHDLERASLWLPMEDSTEVFEFAKTNFNVLVRKFEVWHASSRPYISDM 480
LLTACFSICVVNVHDLERASL LPMEDSTEVFEFAKTNFNVLVRKFEVWHA+SR YISDM
Sbjct: 421 LLTACFSICVVNVHDLERASLLLPMEDSTEVFEFAKTNFNVLVRKFEVWHANSRSYISDM 480
Query: 481 VFNIGGRRPLIYLNQTGFLHKDVNSEEQCLVLSHQTSWEEENDLNVMEEARKEGEIDIVE 540
VFNIGGRRPLIYLNQTGFLHKDVNSEEQCLVLSHQTSWEEENDLNVM+EARKEGEIDIVE
Sbjct: 481 VFNIGGRRPLIYLNQTGFLHKDVNSEEQCLVLSHQTSWEEENDLNVMKEARKEGEIDIVE 540
Query: 541 EHIVRGDHNEEEEELLLEEIEAMKEREIVIEHVEGEVQNEEESFQEIEADANDSKDGEEE 600
EHIVRGDHNEEEEELLLEEIEAMKEREIVIEHVEGEVQNEEESFQEIEADANDSKDGEEE
Sbjct: 541 EHIVRGDHNEEEEELLLEEIEAMKEREIVIEHVEGEVQNEEESFQEIEADANDSKDGEEE 600
Query: 601 NGQASANSASEEPLQETEEGSLQEIIEETSTKSASDKLNEEDEIQEKQTEENYEDSSSPD 660
NGQASANSASEEPLQETEEGSLQEIIEETSTKSASDKLNEEDEIQEKQTEENYEDSSSPD
Sbjct: 601 NGQASANSASEEPLQETEEGSLQEIIEETSTKSASDKLNEEDEIQEKQTEENYEDSSSPD 660
Query: 661 FIHDQIEQEAATGGETKEEQQNDSIQQRNAEIQRQSPPVSPPSAPQSDAEDENDSNIDLV 720
FIHDQIEQEAATGGETKEEQQNDSIQQRNAEIQRQSPPVSPPSAPQSDAEDEN SNIDLV
Sbjct: 661 FIHDQIEQEAATGGETKEEQQNDSIQQRNAEIQRQSPPVSPPSAPQSDAEDENGSNIDLV 720
Query: 721 GTATNNRISRDFSQNTAVIASAILLGLSMIIPAGLIYARKSGSKTSSMAAIAEAQEEPPL 780
GTATNNRISRDFSQNTAVIASAILLGLS+IIPAGLIYARKSGSKTSSMAAIAEAQEEPPL
Sbjct: 721 GTATNNRISRDFSQNTAVIASAILLGLSIIIPAGLIYARKSGSKTSSMAAIAEAQEEPPL 780
Query: 781 LKEKKTYQSPAVPEEEEAINDDDREDIARKGFCSSETSSFLQY-SSMKEEEKAINDDDRE 840
LKEKKTYQSPAVPEEEEAINDDDREDIARKGFCSSETSSFLQY SSMKE+EKAIN DDRE
Sbjct: 781 LKEKKTYQSPAVPEEEEAINDDDREDIARKGFCSSETSSFLQYSSSMKEDEKAINHDDRE 840
Query: 841 -DIAIKGFCSSEMSSFLQYSSMKEEDETETSKILIEAQNHSHGRKTRKNSRTPMASSSLD 900
DIAIKGFCSSEMSSFLQYSSMKEEDETETSKILIEAQNHSHGRKTRKNSRTPMASSSLD
Sbjct: 841 DDIAIKGFCSSEMSSFLQYSSMKEEDETETSKILIEAQNHSHGRKTRKNSRTPMASSSLD 900
Query: 901 EFSVSTSSASPSYGSFTTYEKIPIKHGNEEEEIVTPVRRSSRIRKTAAHR 949
EFSVSTSSASPSYGSFTTYEKIPIKHGNEEEEIVTPVRRSSRIRKT HR
Sbjct: 901 EFSVSTSSASPSYGSFTTYEKIPIKHGNEEEEIVTPVRRSSRIRKT-THR 949
BLAST of CmoCh04G004590 vs. NCBI nr
Match:
KAG6600267.1 (hypothetical protein SDJN03_05500, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 1649.4 bits (4270), Expect = 0.0e+00
Identity = 919/939 (97.87%), Positives = 923/939 (98.30%), Query Frame = 0
Query: 1 MAPPSHRSSSPSMVAGRASPNSRNSEIVNPTRRSFSSEPRSLNFNTPTNSPSDYPRRNST 60
M PPSHRSSSPSMVAGRASPNSRNSEIVNPTRRSFSSEPRSLNFNTPTNSPSDYPRRNST
Sbjct: 1 MVPPSHRSSSPSMVAGRASPNSRNSEIVNPTRRSFSSEPRSLNFNTPTNSPSDYPRRNST 60
Query: 61 SRENLFYSRDNEEKENGKNQSPKPVRIRSPAAGKSTKHFMSPTISAASKISVSPKKKILG 120
SRE LF SRDNEEKENGKNQSPKPVRIRSPAAGKSTKHFMSPTISAASKISVS KKKILG
Sbjct: 61 SREILFNSRDNEEKENGKNQSPKPVRIRSPAAGKSTKHFMSPTISAASKISVSLKKKILG 120
Query: 121 DRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNPKKSTYRYDTEVAP 180
DRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNPKKSTYRYDTEVAP
Sbjct: 121 DRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNPKKSTYRYDTEVAP 180
Query: 181 VAVETDTKSETAPISKSTIAAAPLRASKTVKSGGLDVISDSHSNSEVVTMAVETDAKLEI 240
VAVETDTKSETAPISKSTIAAAPLRASKTVKSGGLDVISDSHSNSEVVTMAVETDAKLEI
Sbjct: 181 VAVETDTKSETAPISKSTIAAAPLRASKTVKSGGLDVISDSHSNSEVVTMAVETDAKLEI 240
Query: 241 TRISNSAIAALPPKASEAVEFADVEVSSDPNNDSESPAKTVDLNSSFKDSLVSSSMEIAP 300
T ISNSAIAALPPKASEAVEFADVEVSSDPNNDSESPAKTVDLNSSFKDSLVSSSMEIAP
Sbjct: 241 TPISNSAIAALPPKASEAVEFADVEVSSDPNNDSESPAKTVDLNSSFKDSLVSSSMEIAP 300
Query: 301 LDADPLMPRPYDPKTNYLSPRPQFLHYKPRRINQLELDGKLEELFSSESEFSEGTDSEDP 360
LDADPLMPRPYDPKTNYLSPRPQFLHYKPRRINQLELDGKLEELFSSESEFSEGTDSEDP
Sbjct: 301 LDADPLMPRPYDPKTNYLSPRPQFLHYKPRRINQLELDGKLEELFSSESEFSEGTDSEDP 360
Query: 361 QMESDEASSNESHMKEEEREEGEEEEEVIVNVSEQSPVEAKTSSKLHFSRIFKISSLLLI 420
QMESDEASSNESHMKEEEREE EEEEEVIVNVSEQSPVEAKTSSKLHFSRIFKISSLLLI
Sbjct: 361 QMESDEASSNESHMKEEEREEEEEEEEVIVNVSEQSPVEAKTSSKLHFSRIFKISSLLLI 420
Query: 421 LLTACFSICVVNVHDLERASLWLPMEDSTEVFEFAKTNFNVLVRKFEVWHASSRPYISDM 480
LLTACFSICVVNVHDLERASL LPMEDSTEVFEFAKTNFNVLVRKFEVWHA+SR YISDM
Sbjct: 421 LLTACFSICVVNVHDLERASLLLPMEDSTEVFEFAKTNFNVLVRKFEVWHANSRSYISDM 480
Query: 481 VFNIGGRRPLIYLNQTGFLHKDVNSEEQCLVLSHQTSWEEENDLNVMEEARKEGEIDIVE 540
VFNIGGRRPLIYLNQTGFLHKDVNSEEQCLVLSHQT WEEENDLNVM+EARKEGEIDIVE
Sbjct: 481 VFNIGGRRPLIYLNQTGFLHKDVNSEEQCLVLSHQTKWEEENDLNVMKEARKEGEIDIVE 540
Query: 541 EHIVRGDHNEEEEELLLEEIEAMKEREIVIEHVEGEVQNEEESFQEIEADANDSKDGEEE 600
EHIVRGDHNEEEEELLLEEIEAMKEREIVIEHVEGEVQNEEESFQEIEADANDSKDGEEE
Sbjct: 541 EHIVRGDHNEEEEELLLEEIEAMKEREIVIEHVEGEVQNEEESFQEIEADANDSKDGEEE 600
Query: 601 NGQASANSASEEPLQETEEGSLQEIIEETSTKSASDKLNEEDEIQEKQTEENYEDSSSPD 660
NGQASANSASEEPLQETEEGS QEIIEETSTKSASDKLNEEDEIQEKQTEENYEDSSSPD
Sbjct: 601 NGQASANSASEEPLQETEEGSFQEIIEETSTKSASDKLNEEDEIQEKQTEENYEDSSSPD 660
Query: 661 FIHDQIEQEAATGGETKEEQQNDSIQQRNAEIQRQSPPVSPPSAPQSDAEDENDSNIDLV 720
FIHDQIEQEAATGGETKEEQQNDSIQQRNAEIQ+QSPPVSPP APQSDAEDEN SNIDLV
Sbjct: 661 FIHDQIEQEAATGGETKEEQQNDSIQQRNAEIQQQSPPVSPP-APQSDAEDENGSNIDLV 720
Query: 721 GTATNNRISRDFSQNTAVIASAILLGLSMIIPAGLIYARKSGSKTSSMAAIAEAQEEPPL 780
GTATNNRISRDFSQNTAVIASAILLGLS+IIPAGLIYARKSGSKTSSMAAIAEAQEEPPL
Sbjct: 721 GTATNNRISRDFSQNTAVIASAILLGLSIIIPAGLIYARKSGSKTSSMAAIAEAQEEPPL 780
Query: 781 LKEKKTYQSPAVPEEEEAINDDDREDIARKGFCSSETSSFLQYSSMKEEEKAINDDDRED 840
LKEKKTYQSPAVPEEEEAINDDD EDIARKGFCSSETSSFLQYSSMKEEEKAINDDDRED
Sbjct: 781 LKEKKTYQSPAVPEEEEAINDDDSEDIARKGFCSSETSSFLQYSSMKEEEKAINDDDRED 840
Query: 841 IAIKGFCSSEMSSFLQYSSMKEEDETETSKILIEAQNHSHGRKTRKNSRTPMASSSLDEF 900
IAIKGFCSSEMSSFLQYSSMKEEDETETSKILIEAQNHSHGRKTRKNSRTPMASSSLDEF
Sbjct: 841 IAIKGFCSSEMSSFLQYSSMKEEDETETSKILIEAQNHSHGRKTRKNSRTPMASSSLDEF 900
Query: 901 SVSTSSASPSYGSFTTYEKIPIKHGNEEEEIVTPVRRSS 940
SVSTSSASPSYGSFTTYEKIPIKHGNEEEEIVTP SS
Sbjct: 901 SVSTSSASPSYGSFTTYEKIPIKHGNEEEEIVTPHSSSS 938
BLAST of CmoCh04G004590 vs. NCBI nr
Match:
XP_023524378.1 (uncharacterized protein LOC111788283 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1628.6 bits (4216), Expect = 0.0e+00
Identity = 913/951 (96.00%), Positives = 923/951 (97.06%), Query Frame = 0
Query: 1 MAPPSHRSSSPSMVAGRASPNSRNSEIVNPTRRSFSSEPRSLNFNTPTNSPSDYPRRNST 60
MAPPSHRSSSPSMVAGR SPNSRNSEIVNPTRRSFSSEPRSLNFNTPTNSPSDYPRRNST
Sbjct: 1 MAPPSHRSSSPSMVAGRTSPNSRNSEIVNPTRRSFSSEPRSLNFNTPTNSPSDYPRRNST 60
Query: 61 SRENLFYSRDNEEKENGKNQSPKPVRIRSPAAGKSTKHFMSPTISAASKISVSPKKKILG 120
SRENLF SRDNEEKENGKNQSPKPVR RSPAAGKSTKHFMSPTISAASKISVSPKKKILG
Sbjct: 61 SRENLFNSRDNEEKENGKNQSPKPVRTRSPAAGKSTKHFMSPTISAASKISVSPKKKILG 120
Query: 121 DRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNPKKSTYRYDTEVAP 180
DRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNPKKSTYRYDTEVAP
Sbjct: 121 DRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNPKKSTYRYDTEVAP 180
Query: 181 VAVETDTKSETAPISKSTIAAAPLRASKTVKSGGLDVISDSHSNSEVVTMAVETDAKLEI 240
VAVETDTKSETAPISKST AAAPLRASKTVKSGG DVISDSHSNSEVVT+AVETDAKLEI
Sbjct: 181 VAVETDTKSETAPISKSTTAAAPLRASKTVKSGGFDVISDSHSNSEVVTVAVETDAKLEI 240
Query: 241 TRISNSAIAALPPKASEAVEFADVEVSSDPNNDSESPAKTVDLNSSFKDSLVSSSMEIAP 300
T ISNSAIAALPPKASE VEFADVEVSSD NNDSESPAKTVDL+SSFKDSLVSSSMEIAP
Sbjct: 241 TPISNSAIAALPPKASETVEFADVEVSSDSNNDSESPAKTVDLDSSFKDSLVSSSMEIAP 300
Query: 301 LDADPLMPRPYDPKTNYLSPRPQFLHYKPRRINQLELDGKLEELFSSESEFSEGTDSEDP 360
LDADPLMPRPYDPKTNYLSPRPQFLHYKPRRINQLELDGKLEELFSSESEF+EGTDSEDP
Sbjct: 301 LDADPLMPRPYDPKTNYLSPRPQFLHYKPRRINQLELDGKLEELFSSESEFTEGTDSEDP 360
Query: 361 QMESDEASSNESHMKEEEREEGEEEEEVIVNVSEQSPVEAKTSSKLHFSRIFKISSLLLI 420
QMESDEASSNESHMKEEEREE EEEEEVIVNVSEQSPVEAK SSKLHFSR FKISSLLLI
Sbjct: 361 QMESDEASSNESHMKEEEREEEEEEEEVIVNVSEQSPVEAKNSSKLHFSRTFKISSLLLI 420
Query: 421 LLTACFSICVVNVHDLERASLWLPMEDSTEVFEFAKTNFNVLVRKFEVWHASSRPYISDM 480
LLTACFSICVVNVHDLERASL LPME+STEVFEFAKTNFNVLVRKFEVWHA+SR YISDM
Sbjct: 421 LLTACFSICVVNVHDLERASLLLPMENSTEVFEFAKTNFNVLVRKFEVWHANSRSYISDM 480
Query: 481 VFNIGGRRPLIYLNQTGFLHKDVNSEEQCLVLSHQTSWEEENDLNVMEEARKEGEIDIVE 540
VFNIGGRRPLIY NQTGFLHKDVNSEEQCLVLSHQTSWEEENDLNVMEEARKEGEIDIVE
Sbjct: 481 VFNIGGRRPLIYPNQTGFLHKDVNSEEQCLVLSHQTSWEEENDLNVMEEARKEGEIDIVE 540
Query: 541 EHIVRGDHN--EEEEELLLEEIEAMKEREIVIEHVEGEVQNEEESFQEIEADANDSKDGE 600
EHIVRGD N EEEEELLLEEIEAMKEREI IEHVEGEVQNEEESFQEIEADANDSKDGE
Sbjct: 541 EHIVRGDQNEEEEEEELLLEEIEAMKEREIDIEHVEGEVQNEEESFQEIEADANDSKDGE 600
Query: 601 EENGQASANSASEEPLQETEEGSLQEIIEETSTKSASDKLNEEDEIQEKQTEENYEDSSS 660
EENGQASA SASEEPLQETEEGSLQEIIEETSTKSASDKLNEED+IQEKQTEENYEDSSS
Sbjct: 601 EENGQASAKSASEEPLQETEEGSLQEIIEETSTKSASDKLNEEDKIQEKQTEENYEDSSS 660
Query: 661 PDFIHDQIEQEAATGGETKEEQQNDSIQQRNAEIQRQSPPVSPPS-APQSDAEDENDSNI 720
PDFIHDQIEQEAATGGETKEEQQNDSIQQRNAEIQ QSPPVSPP APQSDAEDEN SNI
Sbjct: 661 PDFIHDQIEQEAATGGETKEEQQNDSIQQRNAEIQHQSPPVSPPPFAPQSDAEDENGSNI 720
Query: 721 DLVGTATNNRISRDFSQNTAVIASAILLGLSMIIPAGLIYARKSGSKTSSMAAIAEAQEE 780
DLVGTAT NRISRDFSQNTAVIASAILLGLS+IIPAGLIYARKSGSKTSSMAAIAEAQEE
Sbjct: 721 DLVGTATKNRISRDFSQNTAVIASAILLGLSIIIPAGLIYARKSGSKTSSMAAIAEAQEE 780
Query: 781 PPLLKEKKTYQSPAVPEEEEAINDDDREDIARKGFCSSETSSFLQYSSMKEEEKAINDDD 840
PPLLKEKKTYQSPAVPEEEEAINDDDREDIARKGFCSSETSSFLQ+SSMKEEEKAINDDD
Sbjct: 781 PPLLKEKKTYQSPAVPEEEEAINDDDREDIARKGFCSSETSSFLQHSSMKEEEKAINDDD 840
Query: 841 REDIAIKGFCSSEMSSFLQYSSMKEEDETETSKILIEAQNHSHGRKTRKNSRTPMASSSL 900
RED+A KGFCSSEMSSFLQYSSMKEEDETET+K L EAQNHSHGRKTRKNSRTPMASSSL
Sbjct: 841 REDLAGKGFCSSEMSSFLQYSSMKEEDETETAKKLTEAQNHSHGRKTRKNSRTPMASSSL 900
Query: 901 DEFSVSTSSASPSYGSFTTYEKIPIKHGNEEEEIVTPVRRSSRIRKTAAHR 949
DEFSVSTSSASPSYGSFTTYEKIPIKHGNEEEEIVTPVRRSSRIRKT AHR
Sbjct: 901 DEFSVSTSSASPSYGSFTTYEKIPIKHGNEEEEIVTPVRRSSRIRKT-AHR 950
BLAST of CmoCh04G004590 vs. NCBI nr
Match:
XP_022975663.1 (uncharacterized protein LOC111475455 [Cucurbita maxima])
HSP 1 Score: 1525.0 bits (3947), Expect = 0.0e+00
Identity = 885/976 (90.68%), Positives = 903/976 (92.52%), Query Frame = 0
Query: 1 MAPPSHRSSSPSMVAGRASPNSRNSEIVNPTRRSFSSEPRSLNFNTPTNSPSDYPRRNST 60
MAPPSHRSSSPSMVAGR SPNSRNSEIVNPTRRSFSSE RSLNFNTP NSPSDYPRRNST
Sbjct: 1 MAPPSHRSSSPSMVAGRTSPNSRNSEIVNPTRRSFSSEQRSLNFNTPMNSPSDYPRRNST 60
Query: 61 SRENLFYSRDNEEKENGKNQSPKPVRIRSPAAGKSTKHFMSPTISAASKISVSPKKKILG 120
SRENLF SRDNEEKENGKNQSPKPVRIRSPAAGKSTK+FMSPTISAASKISVSPKKKILG
Sbjct: 61 SRENLFNSRDNEEKENGKNQSPKPVRIRSPAAGKSTKNFMSPTISAASKISVSPKKKILG 120
Query: 121 DRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNPKK----------- 180
DRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNPKK
Sbjct: 121 DRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNPKKSKTVKLGGFEV 180
Query: 181 -----STYRYDTEVAPVAVETDTKSETAPISKSTIAAAPLRASKTVKSGGLDVISDSHSN 240
STYR+D EVAPVAVETDTKSE APISKSTIAAAPLRASKTVKSGGLDVISDSHSN
Sbjct: 181 ITGSESTYRHDPEVAPVAVETDTKSEIAPISKSTIAAAPLRASKTVKSGGLDVISDSHSN 240
Query: 241 SEVVTMAVETDAKLEITRISNSAIAALPPKASEAVEFADVEVSSDPNNDSESPAK----- 300
SEVVTMAVETDAKLE NSAIAALPPKASE VEFADV VSSD NDSESPAK
Sbjct: 241 SEVVTMAVETDAKLE-----NSAIAALPPKASETVEFADVVVSSDSINDSESPAKNSSAE 300
Query: 301 ---TVDLNSSFKDSLVSSSMEIAPLDADPLMPRPYDPKTNYLSPRPQFLHYKPRRINQLE 360
TV LNSSFKDSLVSSSMEIAPLDADPLMPRPYDPKTNYLSPRPQFLHYKPRRINQLE
Sbjct: 301 ELDTVGLNSSFKDSLVSSSMEIAPLDADPLMPRPYDPKTNYLSPRPQFLHYKPRRINQLE 360
Query: 361 LDGKLEELFSSESEFSEGTDSEDPQMESDEASSNESHMKEEEREEGEEEEEVIVNVSEQS 420
LDGKLEELFSSESEF+EGTDSEDPQM+SDE SSNESHMKEEEREE EEEEE+IVNVSEQS
Sbjct: 361 LDGKLEELFSSESEFTEGTDSEDPQMKSDEVSSNESHMKEEEREEEEEEEEMIVNVSEQS 420
Query: 421 PVEAKTSSKLHFSRIFKISSLLLILLTACFSICVVNVHDLERASLWLPMEDSTEVFEFAK 480
PVEAKTSSKLHFSRIFKISSLLLILLTACFSI VVNVHDLERASL LPMEDSTEVFEFAK
Sbjct: 421 PVEAKTSSKLHFSRIFKISSLLLILLTACFSISVVNVHDLERASLLLPMEDSTEVFEFAK 480
Query: 481 TNFNVLVRKFEVWHASSRPYISDMVFNIGGRRPLIYLNQTGFLHKDVNSEEQCLVLSHQT 540
TNFNVL+RKFEVWHA+SR YISDMVFNIGGRRPLIYLNQTGFLHKDVNSE QCLVLSHQT
Sbjct: 481 TNFNVLMRKFEVWHANSRSYISDMVFNIGGRRPLIYLNQTGFLHKDVNSELQCLVLSHQT 540
Query: 541 SWEEENDLNVMEEARKEGEIDIVEEHIVRGDHNEEEEELLLEEIEAMKEREIVIEHVEGE 600
SWEEENDLNVMEEARKEGEIDIVEE IVRGD N EE ELL EEIEAMKEREIVIEHV+GE
Sbjct: 541 SWEEENDLNVMEEARKEGEIDIVEEPIVRGDQN-EEVELLSEEIEAMKEREIVIEHVKGE 600
Query: 601 VQNEEESFQEIEADANDSKDGEEENGQASANSASEEPLQETEEGSLQEIIEETSTKSASD 660
VQNEEESFQEIEA+AND KDGEEENGQASA SASEEPLQE EEGSLQEIIEETSTKSASD
Sbjct: 601 VQNEEESFQEIEANANDPKDGEEENGQASAKSASEEPLQENEEGSLQEIIEETSTKSASD 660
Query: 661 KLNEEDEIQEKQTEENYEDSSSPDFIHDQIEQEAATGGETKEEQQNDSIQQRNAEIQRQS 720
LNEED+IQEKQTEENYEDSS+PDFIHDQIEQEAATGGETKEEQQNDSIQQ NAEIQ QS
Sbjct: 661 ILNEEDKIQEKQTEENYEDSSTPDFIHDQIEQEAATGGETKEEQQNDSIQQSNAEIQHQS 720
Query: 721 PPVS-PPSAPQSDAEDENDSNIDLVGTATNNRISRDFSQNTAVIASAILLGLS-MIIPAG 780
PPVS PPSAPQS+AEDEN SNI VGT TNN+ISRDFSQNTAVIASAILLGLS +IIPAG
Sbjct: 721 PPVSPPPSAPQSEAEDENGSNI--VGTETNNKISRDFSQNTAVIASAILLGLSIIIIPAG 780
Query: 781 LIYARKSGSKTSSMAAIAEAQEEPPLLKEKKTYQSPAVPEEEEAINDDDREDIARKGFCS 840
LIYARKSGSK SSMAAIAEAQEEPPLLKEKKTYQSPAVPEEEEAINDDDREDIARKG S
Sbjct: 781 LIYARKSGSKRSSMAAIAEAQEEPPLLKEKKTYQSPAVPEEEEAINDDDREDIARKGLFS 840
Query: 841 SETSSFLQYSSMKEEEKAIN-DDDREDIAIKGFCSSEMSSFLQYSSMKEEDETETS-KIL 900
SETSSFLQYSSMKE E+A N DDDRE+IA KGFCSSE SSFLQYSSMKEEDETET+ K+L
Sbjct: 841 SETSSFLQYSSMKEVEEAFNGDDDREEIARKGFCSSETSSFLQYSSMKEEDETETAKKLL 900
Query: 901 IEAQNHSHGRKTRKNSRTPMASSSLDEFSVSTSSASPSYGSFTTYEKIPIKHGNEEEEIV 949
IEAQNHSHGRKTRKNSRTPMASSSLDEFSVSTSSASPSYGSFTTYEKIPIKHGNEEEEIV
Sbjct: 901 IEAQNHSHGRKTRKNSRTPMASSSLDEFSVSTSSASPSYGSFTTYEKIPIKHGNEEEEIV 960
BLAST of CmoCh04G004590 vs. TAIR 10
Match:
AT1G16630.1 (unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G16270.1); Has 10587 Blast hits to 5736 proteins in 617 species: Archae - 88; Bacteria - 963; Metazoa - 3686; Fungi - 820; Plants - 541; Viruses - 438; Other Eukaryotes - 4051 (source: NCBI BLink). )
HSP 1 Score: 102.8 bits (255), Expect = 1.5e-21
Identity = 261/992 (26.31%), Positives = 410/992 (41.33%), Query Frame = 0
Query: 1 MAPPSHRSSSPSMVAGRASPNSRNSEIVNPTRRSFSSEPRSLNFNTPTNSPSDYPRRNST 60
++P S SSS + R +P RNSE + RRSF P S +D RRNS
Sbjct: 12 VSPSSLFSSSSPSMPSRPNPKQRNSETGDLMRRSFRGNPFS----------ADPSRRNSI 71
Query: 61 SRE-NLFYSRDNEEKENGKNQSPKPVRIRSPAAGKSTKHFMSPTISAASKISVSPKKKIL 120
RE + ++E +N K+Q V+ K +KHFMSPTISA SKI+ SP+KKIL
Sbjct: 72 GRECSNRVEIGDKENQNDKDQIANVVK----GPTKGSKHFMSPTISAVSKINPSPRKKIL 131
Query: 121 GDRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNPKKSTYRYDTEVA 180
D+NE+ RS +D
Sbjct: 132 SDKNEVSRS---------------------------------------------FDKSHH 191
Query: 181 PVAVETDTKSETAPISKSTIAAAPLRASKTVKSGGLDVISDSHSNSEVVTMAVETDAKLE 240
V V KS+++ + DVIS + +V + ++ +L
Sbjct: 192 QVQV------------KSSVSFS-------------DVISIIGEDKDVDQICIDETKQLR 251
Query: 241 ITRISNSAIAALPPKASEAVEFADV-EVSSDPNNDSESPAKTVDLNSSFKDSLVSSSMEI 300
+ S + +D E+ +ND NSSFK S + +
Sbjct: 252 -------------EEESHDITVSDFDEILERKSND----------NSSFKISPLPPYVPC 311
Query: 301 A-PL----DADPLMPRPYDPKTNYLSPRPQFLHYKPRRINQLELD--GKLEELFSSESEF 360
P+ + DP++ PYDPK NYLSPRPQFLHYKP + D +LEELF SES
Sbjct: 312 TFPVFESHEVDPVV-APYDPKKNYLSPRPQFLHYKPNPKIEHRSDECKQLEELFISESSS 371
Query: 361 SE---GTDSEDPQMESDEASSNESHMKEEEREE--------------------------G 420
S+ + E+ + +E +S E + EE+E+
Sbjct: 372 SDTDLSAEREEEGQQEEEVASQEGVVAVEEQEDDGEERLEAAEEILDVDGEERLEAVESD 431
Query: 421 EEEEEVIVNVSEQSPVEAKTSSKLHFSRIFKISSLLLILLTACFSICVVNVHDLERA--S 480
+EEEEV+V S + + S + FS+ + +L L A + + S
Sbjct: 432 DEEEEVVVGESIEEEETHQISKQSRFSKTSMLLGWILALGVAYLLLVSSTTFSQQTITDS 491
Query: 481 LWLPMEDSTEVFEFAKTNFNVLVRKFEVWHASSRPYISDMVFNI---GGRRPLIYLNQTG 540
+ S E+ A NF L K +W SS Y+ +V ++ G P + N T
Sbjct: 492 PFYQFNISPEIIMSASENFEQLGAKLRMWAESSFVYLDKLVSSLREEEGSVPFQFHNLTV 551
Query: 541 FLHKDVNSEEQCLVLSHQTSWEEENDLNVMEEARKEGEIDIVEEHIVRGDHNEEEE---E 600
L S+ + TS E D +++ E+DI E ++ + EE E E
Sbjct: 552 LLEDKRLSD----AVFQSTSVEIIVDGFIVDSL----EVDIEEVNVGHQEPEEESENSGE 611
Query: 601 LLLEEIEAMKEREIVIEHVEGEVQNEEESFQEIEADANDSKDGEEENGQASANSASEEPL 660
+ LE + + E+ E+ EG+V E + +A+ + D E G+ + S SEE
Sbjct: 612 ISLEAVYEEDDNEVEQENEEGKVNLEIVDECDEQAEIKIATDTEVNGGERYSESLSEEGH 671
Query: 661 --QETEEGSLQEIIEETSTKSASDKLNEEDEIQEKQTE-ENYEDSSSPDFIHDQIEQEAA 720
QET+ QE EE N+++ ++E +++ + +D S +Q EQ
Sbjct: 672 GGQETDVVEGQEEYEE----------NDQNNMEEAESDAQLLDDVQSAAISSNQQEQTGV 731
Query: 721 TGGETKEEQQN-DSIQQRNAEIQRQSPPVSPPSAPQSDAEDENDSNIDLVGTATNNRISR 780
ET +E++ I + + ++ V ++ E+E ++V A + I
Sbjct: 732 ANVETVQEEEGVGEIAGGSLSVSEEATDVEHDG---NEVEEEESGFGEVVNDAGSEDILL 791
Query: 781 DFSQNTAVIASAILLGLSMIIPAGLIYARKSGSKTSSMAAIAEAQEEPPLLKEKKTYQSP 840
+ V+ S +++ L+ + AG + A+K + + EP + K +
Sbjct: 792 SGQKKVLVLFSTMMVILA-AVAAGFLLAKKK----TKPVMLQHEDGEPTAISATKVVEHV 838
Query: 841 AVPEEEEAINDDDREDIARKGFCSSETSSFLQYSSMKEEEKAINDDDREDIAIKGFCSSE 900
V E++ R+ S + KEEE+ + DD + +++ SE
Sbjct: 852 PV------------ENLIRERLSS---------LNFKEEEEEVGDDRKREVS---SFPSE 838
Query: 901 MSSFLQYSSMKEEDETETSKILIEAQNHSHGRKTRKNSRTPMASSSLDEFSVSTSSASPS 943
MS +S K K ++ G K +S MASS+ E+S+ S S
Sbjct: 912 MS--FSFSKNKPLHSCSNKKDDLKEHQSGGGGKKSNDSGESMASSA-SEYSI----GSVS 838
BLAST of CmoCh04G004590 vs. TAIR 10
Match:
AT2G16270.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 9 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G16630.1); Has 1844 Blast hits to 1256 proteins in 271 species: Archae - 6; Bacteria - 283; Metazoa - 434; Fungi - 153; Plants - 91; Viruses - 52; Other Eukaryotes - 825 (source: NCBI BLink). )
HSP 1 Score: 99.8 bits (247), Expect = 1.3e-20
Identity = 253/965 (26.22%), Positives = 395/965 (40.93%), Query Frame = 0
Query: 1 MAPPSHRSSSPS-MVAGRASPNSRNSEIVNPTRRSFSSEPRSLNFNTPTNSPSDYPRRNS 60
MA P++++ S S + R +P RNSE +P RRSF P N+ N PSD RRNS
Sbjct: 1 MASPTNKNPSFSPPIPNRPNPKPRNSEAGDPLRRSFGGNP--FPANSKVNIPSDLTRRNS 60
Query: 61 TSRENLFYSRDNEEKENGKNQSPKPVRIRSPAAGKSTKHFMSPTISAASKISVSPKKKIL 120
+ K KPV++ K +K+FMSPTISA SKI+ SP+K++L
Sbjct: 61 FGGD--------------KENETKPVQL----TPKGSKNFMSPTISAVSKINASPRKRVL 120
Query: 121 GDRNELVRSSLSFSGLKSSSLNSVNPNPEASAALESDTNQEIAPISNPKKSTYRYDTEVA 180
D+NE+ RS GL N N + S SD I I + KK +D V
Sbjct: 121 SDKNEMSRSFSDVKGLILEDDNKRNHHRAKSCVSFSDVLHTIC-IDDEKKFVESHDMTV- 180
Query: 181 PVAVETDTKSETAPISKSTIAAAPLRASKTVKSGGLDVISDSHSNSEVVTMAVETDAKLE 240
Sbjct: 181 ------------------------------------------------------------ 240
Query: 241 ITRISNSAIAALPPKASEAVEFADVEVSSDPNNDSESPAKTVDLNSSFKDSLVSSSMEIA 300
+F + EV + P + S + S+ +S E A
Sbjct: 241 -------------------TDFDEKEVYENKGITYSDPRFRI----SPRPSVPYTSPEFA 300
Query: 301 PLDADPLMPRPYDPKTNYLSPRPQFLHYKPRRINQLELD--GKLEELFSSESEFSEGTDS 360
+ D L+P PYDPK N+LSPRPQFLHYKP + D +LEELF SES S+ T+
Sbjct: 301 ACEVDTLLP-PYDPKKNFLSPRPQFLHYKPNPRIEKRFDECKQLEELFISESS-SDDTEL 360
Query: 361 EDPQMESDEASSNESHMKEEERE-----EGEEEEEVIVNVSEQSPVEAKTSS---KLHFS 420
+ E E E + EEE E E E +EE++ E++ + S K F
Sbjct: 361 SVEESEEQEKDGAEEVVVEEETEDVEQSEAESDEEMVCESVEETTSQVPKQSGSRKFKFL 420
Query: 421 RIFKISSLLLILLTACFSICVVNVHDLERASLWLPMEDSTEVFEFAK-TNFNVLVRKFEV 480
F +L +L++A FS + S + E+ EFAK N + L K
Sbjct: 421 GWFLALALGYLLVSATFSPLM--------KSSFNEFHIPKEITEFAKANNLDQLSDKLWT 480
Query: 481 WHASSRPYISDMVFNIGGRRPLIYLNQTGFLHKDVNSEEQCLVLSHQTSWEEENDLNVME 540
SS Y+ ++ +G +EE + H ++ E+
Sbjct: 481 LTESSLVYMDKLISRLGR-----------------GNEEYSQLQFHNLTYTLED-----S 540
Query: 541 EARKEGEIDIVEEHI---VRGDHNEEEEELLLEEIEAMKEREIVIEHVEGEVQNEEESFQ 600
K ++I++E + R +++ E+ + EE A + E+V + F
Sbjct: 541 TVFKPTCVEIIQEPLQENSRSENSLEDGSVNEEESGAEENSEVVCQ------------FD 600
Query: 601 EIEADANDSKDGEEENGQASANSASEEPLQETEEGSLQEIIEETSTKSASDKLNEEDEIQ 660
E+ A+ S D E +G+ + + E+ L E +++E+ E S S +KL E +++
Sbjct: 601 EL-AEVKPSTDIESNDGERNLKALFEDGL----ELNIEELRE--SEMSPEEKLETEKKLE 660
Query: 661 EKQTEENYEDSSSPDF----IHDQIEQE---AATGGETKEEQQNDSIQQRNAEIQRQSPP 720
E ++E Y + +F +H IE E A +G E + D + +
Sbjct: 661 ETESEAIYINQPDVEFAAINVHQHIESEILVAESGSEESFGEIGDLLHLEVGSYNDLAKG 720
Query: 721 VSPPSAPQSDAEDENDSNIDL-VGTATNNRISRDFSQNTAVIASAILLGLSMIIPAGLIY 780
+ + + E +++ DL + ++N+ D ++ V++S +L+ L++ A ++
Sbjct: 721 DAESGSEEGFGEIAAETSDDLHLKVRSSNKAYNDSTKLMIVLSSTVLVLLAV---ASFVF 750
Query: 781 ARKSGSKTSSMAAIAEAQEEPPLLKEKKTYQSPAVPEEEEAINDDDREDIARKGFCSSET 840
A+ KT +AA A E L VPEE
Sbjct: 781 AK----KTKLVAATKPAPESNMELNLSH------VPEE---------------------- 750
Query: 841 SSFLQYSSMKEEEKAINDDDREDIAIKGFCSSEMSSFLQYSSMKEEDETETSKILIEAQN 900
+ +KE+ ++N ++ D +SF + SS +E +++ K + N
Sbjct: 841 ------NLVKEKLFSLNFEEEVD-------DKMSNSFQKKSSCHKEPQSKGGK---KNNN 750
Query: 901 HSHGRKTRKNSRTPMASSSLDEFSVSTSSASPSYGSFTTYEKIPIKHGNEEEEIVTPVRR 943
+S K R+ S +SS E+S+ S SYGSFTTYEKIPIK G EEEE++TPVRR
Sbjct: 901 NSSSSKLRRES----MASSASEYSI----GSFSYGSFTTYEKIPIKSGREEEEMITPVRR 750
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
A0A6J1FNZ9 | 0.0e+00 | 100.00 | uncharacterized protein LOC111447507 OS=Cucurbita moschata OX=3662 GN=LOC1114475...
A0A6J1IJW9 | 0.0e+00 | 90.68 | uncharacterized protein LOC111475455 OS=Cucurbita maxima OX=3661 GN=LOC111475455...
A0A6J1E9M3 | 5.7e-218 | 54.78 | uncharacterized protein LOC111430638 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1E496 | 1.8e-216 | 54.67 | uncharacterized protein LOC111430638 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1J980 | 3.1e-216 | 53.82 | uncharacterized protein LOC111482876 isoform X5 OS=Cucurbita maxima OX=3661 GN=L...
Match Name | E-value | Identity (%) | Description
XP_022942486.1 | 0.0e+00 | 100.00 | uncharacterized protein LOC111447507 [Cucurbita moschata]
KAG7030926.1 | 0.0e+00 | 98.00 | hypothetical protein SDJN02_04963 [Cucurbita argyrosperma subsp. argyrosperma]
KAG6600267.1 | 0.0e+00 | 97.87 | hypothetical protein SDJN03_05500, partial [Cucurbita argyrosperma subsp. sorori...
XP_023524378.1 | 0.0e+00 | 96.00 | uncharacterized protein LOC111788283 [Cucurbita pepo subsp. pepo]
XP_022975663.1 | 0.0e+00 | 90.68 | uncharacterized protein LOC111475455 [Cucurbita maxima]
Match Name | E-value | Identity (%) | Description
AT1G16630.1 | 1.5e-21 | 26.31 | unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein matc...
AT2G16270.1 | 1.3e-20 | 26.22 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
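The Identity column in the tables above is the percentage printed on each HSP statistics line (identical residues divided by alignment length). As a minimal sketch, assuming the statistics line keeps the layout shown on this page, the percentages can be recomputed from the raw counts. The function name `parse_hsp_stats` and the returned keys are hypothetical and not part of any BLAST tool; the pattern accepts both "Positives" and the "Postives" spelling that some exports print.

```python
import re

# Hypothetical helper, not part of BLAST: matches a statistics line such as
# "Identity = 931/950 (98.00%), Positives = 937/950 (98.63%), Query Frame = 0".
# "Posi?tives" tolerates the "Postives" spelling seen in some exports.
STAT_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return raw counts plus recomputed and reported percentages for one HSP."""
    match = STAT_RE.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: " + repr(line))
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        # Percentages recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as printed on the line, for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 931/950 (98.00%), Positives = 937/950 (98.63%), Query Frame = 0"
    )
    print(stats["identity_pct"], stats["positive_pct"])
```

Comparing the recomputed and reported values is a quick consistency check when these lines are copied by hand into a summary table.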