Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: utr5polypeptideCDSutr3
Hold the cursor over a type above to highlight its positions in the sequence below.
AAGGAATCGAACCGATCTAAATAACCGAATTGAACCGAGTCGGTCGAGTCGGTTCGGTTCGGTTTGCTTTTACACATTCTGGCGTTCTTAACCTTTAAAACCTATATGCGCCGCCCCCCAGGCCAACCTCTCGACTCACGACCCACGCAACAACCCACCCCCATCTCTCTGTGAAGTGTGAGATTCCGATCTATCCGGAAGCGGCGAGTGACCCTCTCCAGCGGCGGCGACGATCACGGCCGGCGACCACGACTCTTGGACGGCCAGACTCTCTCTCTTTCTCTTTCGCTGTAGAAGAAACGACCCACGATAGCTTTTTCTCGGAGTTCATCAGAACCAGAAGGTGTGGACGAACGAGAATCCCTAAAAGCAGCTGGTAATCTTAATTTCGAATTTTGTTTTTGGATCTGTATGATGTCTTAATATCGTTTATTGATCTTGATATTGATTAGTTTTTATCATCTCGAATGAATGGCCAAGATTTTGAAAGAAACTTCAGGAACTCTGCTTCCGATTTGATTTATTTTCCTCTCGATAGAATGATTTAATTTCGTATTTGAAATAGATTTTGACAAGGTTGAATTGTTTTAGGCAGAAATTGTTGGTGAAGAAGAGAGATGTATAACGGTATTGGGTTACAGACGCCAAGAGGGTCTGGTACAAATGGTTACATTCAGACCAACAAGTTCTTTGTGAGGCCGAAGACCGGAAAGGTTGCTGAAAGCACCAGAGGATTCGAAGAAGACCAAGGTACTGCAGGTGTTTCCAAAAAACCTAACAAGGATATCCTTGAACATGATCGCAAGCGTCAGATTGAACTCAAGCTCGTCATACTCGAGGACAAGCTCATCGACCAAGGTTATACAGATGAGGAGGTTTCTGAAAAGTTGAAGGAGGCTCGAAAAACATTGGAAGCTGCATCAGATCAGGAAGAAAAAGGTGGACCTTCGGCCATAGTACTTTTAGATAAGAGGTATGAGCCGCACGCACTTGTTTTCGAATTTGGTAGTTTTATGACATGGGTCACGTAAAAGCAAGATATGGGGGGCATCCTCTAATGAAGACTCCTGGTATTCGGGGAGTTCCTTTTTCTGCCTAGTGCTGTTTTCTTATTTAGTGTACATGATGATGGTTATATAACATGGTGTTTTTCTTTATTGAATAGGATCTCAGATACGCAGACCCACCAAATTGCTGCGAGAAAGGAGGAGCAGATGAAAACATTGAGAGCTGCTCTTGGGTTGGGCTCATTGGATGATAGTGAGCAACTTAAAGAAGGGATTTCTGATCCATTAGAAAATAGCAGAGAAGGTGAAAATTCTGATATTAAGCGTCGTGAGAAGTCTGAACATGCTTTCTTGGATAGAGAATTGAACTGGAAAAAGCATGCAAAGGAAGCACACAATGATGATAAGGATAAAAAAATAAGGGTTTCAAAGGAGTCCAAAGGTCATAAGAAAGATAGGAAAAGAAGGCCCAAGGATGATTCTTCTGATACTGATTCTGGTGGAGAACATAAGGGAACCAAGAAGAACTTGAGAGATAATCGAAGAGACGATTCTGAAAGTGACATTGACAGCGATGTTGACAAGAAATACATCACCTCACGGAGGTATAAGAAAAACAGGAGGCATGATAGTGATGATTCTTCTGATACTGATTCTGGCGGAGAGTATAAGAAAGCCAAGAAGAACTTGAGAGATAATAGAAGAGATGATCCTGAAAGTGACCCTGACAGCGATGTTGATAAGAAATACATCACCTCAAGGAAGCATAAGAAAAACAGAAGGCATGATAGTGATGATTCTTCTACTGATTCTGGTAGAGACCATAAAGGAACCAAGAAAAACTTGAGAGAATATCAAAGAGATGATCATGAAAGTGATCCTGACAGTGATGTTGACAAGAAATACTTCACCTCAAAGAAGCAGGGGAAAAGCAAAAGACATGATAGTGATGATTCTGATTCAGTTACAGATGATGATGATTTCGGGAAGGGCAGACATAAGAAAGGATCTGGTAGACCTAAAAGTCAAAAGGTGAAGAAGAAGCTAGGAAGCCGGAAACAGGAGTCGACTGATGAATCCAATTCTGACGTTGGGACTGATGATAAAGGCAGGCTATCGAGGCACAAGAACCATCAGGGTAAAAGACGCAGGGCAGATAGTGATAGCTCTGACCATGACGGTTCTGATTCAGATGTAGGTCGCAACAAGAGTAAGCATAGGTATCATAGCAGAAGTGTGGGGAAGGACAAGGTAGATAGTGAATTTGATACCGAAAAGTCAAGAAAGCATCCTAAAGAAGATGTTGGGAGACACAGGCATGATACCGATGACAAAGAAAGTGGTGATTTTAGCCATAGCAGTGATGAAAAAGTGGAGAGGCGCAAAAGTAAGAGGTATGATACTGATGATGAATCTAATGGAGGAGGTGAACGTTTTGATAGGAAAAGTGGTAAGATAGCCACAAAGGGGAAAATAGCCGCTAAAAAGCAATATGATGACAGTGATAGTTCTGATGATAGTCGTGCAGTTGATAGAAAAGGCCGTAATAAGCATGAGAGAGCTAAGAAACATACGATAGGTGATGGTTCTGGTCTAGAGAAGGGATTCAAATCAAGTGGTGGAGCTCGTGAAGGAGGAAAAGGGAATTTAAATCATGCAGATGGTTTGGATGAGCCGGTGACTGCAGATGATAACAATTCGTACAAGTCTAGGAAAGATGCTATCGATGAGTTCAACCATGCAAATCAACATACAATGAAAAGCAAGAGAAAGTTTGATGAGGGTGGTGAAAATGAGCAGCGAGAAGCAAAATCTAGAAACCGAAATTCTACAAGAGAGTTGGGTTTCTATGGAGACCTCAAGAAGGATTCCAAAATTGATTCTGGATCAAACAGTAGAGCAGGCAATAATAGGTATGATGAGATGAGGGATGGATGGCACAGGGAGGACCCAAAAGTTGATTCCGAATCAAATACTAGAGCACGCTATAGTAGTATGCATGATGAGGATGACCGCAGCAAGTTGGATCGAACAGGAAGCAAATATAATGAAGAAACAGAGCATGGAAGTAGACATTATCGTAAGGCTAATGAGTCTCACCGTCACGGTAGTGCCAGTCCAGATATTGAAGAGGGAAAAAGGCATATCAGATATGAGGAGCATAGAGGGAGAAAGCATGAAAGAGATGAGGGTCTAAAATCTAGCAGGGAAGTTGAAAGGGGAGAGTATCAACCAAGCAGCAAGCTGAGAAGATCTGATAAAGATTATGAAACTAGAGAGAGAAAGCATGAAAGAGATGAGGATCTAAAATCCAGCAGGGAAGTTGAACGGGGAGAGTATCAACCAAGCAGCAAGCTGGGAAGATCTGATAAAGATTATGAAAGTAGAGAATCTAGGGAAGTTGAGAGTGGGGAGTATAAGCCAAGCAGCAAGCTGAGAAGATCTGAGAAAGACTACGAAACTAGAGAATCTAGGAGAGATAGGGAGACGGATTACAGAAAGAGGGCCAAATATGACGATTCTCGATCAAGCAGACGTGATGATTATTAGGGTTGGGTCCCAAATTCGTACCATAACTTATCTCTATCAACTCTAGAATTTGAATCATATGCCTTGTTAGCATGCATCCGAAGGTTGAATCGTCTGTTTTTTTTTTTTTTTGAAATGGATGACGGATCTTCGTAAAAGAATGAAATGATGTTTGATTTCAGTCAGCGTCCTTTATTCACTTCAAAACTTGGTAAAGAAAAATTTCCCTAGCGCCTGTTTGGTACTTATCTTCTTACTGTAGTTGGATGCTTGAAGCTGTATTTTGATTCATCAAGTGTTGCATTGTTTTCTGTAAATGGTTCAAATTTTAGTTTGACCATTAATTTTCATTTCATCCAGCACTTGGGGATGATTGTTTTTGAAGCTCTTGTTGATTCTTCGTCATGTGTTTTATCTCTTGAAATGAACCTTTTCTTGCCGGTTGGAATGAAATTGAAGATGCAGAATTGATTTCTTTTGCACTAAAATTGCTGTTCTTGAATCAACTCCTTTATTTAGCCTGGCAATAGGAGGGTGAGTCGACCTACATTGAAGGTCGGTGCTCCACGGAAACTGAGTTCCCAATCTCGACCCATCCAAGAATCTGGGAGCCTTTGTGGAGATGATAGGGTAGGTCTATAGACTTCTATTCTAAGCTGTAAAACGTATGGGTTCTCTTGACTTTCTGGCTTTCGTTCATTCTATCGGTATGCTCTCACAAAAAATATGTTGGATGGATTCAGTGAAGACAAATTAGAGCTCTAATGTTAGGAATAGGGGATTTAGCTCTAATGTTAGGAATAGGGGATTGATCTTTGATGTTAAAAGCAATGATATAATTGATTGAGATTGACTACACATGATGGGTAAGTTAACCGTATCAATTAAAATGATATGAAAAAGTTAATTAGGCTACTAAAATCATTTTTCCTTTGCATTTTTTTAATAAAACATGTGAACGTAGAGAATCGAACTCATGATTTTTTGGTCGAAAATAAGTCGATTATGGAACAAGCTTATTTACCTGTTATAAATCAGGTTTGTGTAAAGTCGGTAGACTATGTTATGAGTGAAGTTGTTATAATAGACTGGGGAAGGTACAATATCACCCACTATTTTTGTTGTATTTTAATATATTACTAAACATAATTTCAACAAAAGCTCTTAAATTGATATTTAATTTATGAATCAACTACAAGTTAAAAAATTCTAAAAATATAGAATACAACATTATTTCTCACATTAAATCACTTGTTTCTTTAAATTTGAGAACCATACAGAAATACTTATTATATCTCCATTCAACCTCAACCCTATTACTAAAGGGAGGATTCCCAGAAAAAAAGTGAAGTCTAGTAGTAACATATAAACTTTTCACTATATTGTACTTGAACTAATGGTTTTTTCCTTTTGAAGCCATGGGAGATGGATCCCAAGCTTACCAAGCATCTCTTATCTTAATCTAGTGTAATGTGCTGACAGAATCTGCTCAAACTTCCTTTGCGTTTGGCTTCGGTACGCTTTCAACCGTGGTCTATCATCTGTAATCGAACGAAATAGAAGGCCTTCCTTGACATAACAAAACAGATTATCGCAAATCTGGAAATGGCGGGCTATGTTTAACTTCCAGTTGCATCAAGAGATGGAGACATAATTGAATGATCCCGCTGCTGACGGATCCATTTTTCATAATCTTGCACAATTTGCCGCAGCAGCAGAGGTACAAGGCTGTCCACGAGAGCTTGTAACATCCTAAGAATAATAGAATAGCCACAGTGAAAAAAAAAATTAGTTTCTTGTTTGAGGGTTGTAATATGTTTCTTATGCTGTTCGACTTCTGTATTACTCGAGAGTTGAAACTGGACTCGAGGATTCGTTGCAAAATTTGGGTGGAACTAGGGAGATTCATCAGAATCCAGAATCACAGTTTTAA
mRNA sequence
AAGGAATCGAACCGATCTAAATAACCGAATTGAACCGAGTCGGTCGAGTCGGTTCGGTTCGGTTTGCTTTTACACATTCTGGCGTTCTTAACCTTTAAAACCTATATGCGCCGCCCCCCAGGCCAACCTCTCGACTCACGACCCACGCAACAACCCACCCCCATCTCTCTGTGAAGTGTGAGATTCCGATCTATCCGGAAGCGGCGAGTGACCCTCTCCAGCGGCGGCGACGATCACGGCCGGCGACCACGACTCTTGGACGGCCAGACTCTCTCTCTTTCTCTTTCGCTGTAGAAGAAACGACCCACGATAGCTTTTTCTCGGAGTTCATCAGAACCAGAAGGTGTGGACGAACGAGAATCCCTAAAAGCAGCTGAAATTGTTGGTGAAGAAGAGAGATGTATAACGGTATTGGGTTACAGACGCCAAGAGGGTCTGGTACAAATGGTTACATTCAGACCAACAAGTTCTTTGTGAGGCCGAAGACCGGAAAGGTTGCTGAAAGCACCAGAGGATTCGAAGAAGACCAAGGTACTGCAGGTGTTTCCAAAAAACCTAACAAGGATATCCTTGAACATGATCGCAAGCGTCAGATTGAACTCAAGCTCGTCATACTCGAGGACAAGCTCATCGACCAAGGTTATACAGATGAGGAGGTTTCTGAAAAGTTGAAGGAGGCTCGAAAAACATTGGAAGCTGCATCAGATCAGGAAGAAAAAGGTGGACCTTCGGCCATAGTACTTTTAGATAAGAGGATCTCAGATACGCAGACCCACCAAATTGCTGCGAGAAAGGAGGAGCAGATGAAAACATTGAGAGCTGCTCTTGGGTTGGGCTCATTGGATGATAGTGAGCAACTTAAAGAAGGGATTTCTGATCCATTAGAAAATAGCAGAGAAGGTGAAAATTCTGATATTAAGCGTCGTGAGAAGTCTGAACATGCTTTCTTGGATAGAGAATTGAACTGGAAAAAGCATGCAAAGGAAGCACACAATGATGATAAGGATAAAAAAATAAGGGTTTCAAAGGAGTCCAAAGGTCATAAGAAAGATAGGAAAAGAAGGCCCAAGGATGATTCTTCTGATACTGATTCTGGTGGAGAACATAAGGGAACCAAGAAGAACTTGAGAGATAATCGAAGAGACGATTCTGAAAGTGACATTGACAGCGATGTTGACAAGAAATACATCACCTCACGGAGGTATAAGAAAAACAGGAGGCATGATAGTGATGATTCTTCTGATACTGATTCTGGCGGAGAGTATAAGAAAGCCAAGAAGAACTTGAGAGATAATAGAAGAGATGATCCTGAAAGTGACCCTGACAGCGATGTTGATAAGAAATACATCACCTCAAGGAAGCATAAGAAAAACAGAAGGCATGATAGTGATGATTCTTCTACTGATTCTGGTAGAGACCATAAAGGAACCAAGAAAAACTTGAGAGAATATCAAAGAGATGATCATGAAAGTGATCCTGACAGTGATGTTGACAAGAAATACTTCACCTCAAAGAAGCAGGGGAAAAGCAAAAGACATGATAGTGATGATTCTGATTCAGTTACAGATGATGATGATTTCGGGAAGGGCAGACATAAGAAAGGATCTGGTAGACCTAAAAGTCAAAAGGTGAAGAAGAAGCTAGGAAGCCGGAAACAGGAGTCGACTGATGAATCCAATTCTGACGTTGGGACTGATGATAAAGGCAGGCTATCGAGGCACAAGAACCATCAGGGTAAAAGACGCAGGGCAGATAGTGATAGCTCTGACCATGACGGTTCTGATTCAGATGTAGGTCGCAACAAGAGTAAGCATAGGTATCATAGCAGAAGTGTGGGGAAGGACAAGGTAGATAGTGAATTTGATACCGAAAAGTCAAGAAAGCATCCTAAAGAAGATGTTGGGAGACACAGGCATGATACCGATGACAAAGAAAGTGGTGATTTTAGCCATAGCAGTGATGAAAAAGTGGAGAGGCGCAAAAGTAAGAGGTATGATACTGATGATGAATCTAATGGAGGAGGTGAACGTTTTGATAGGAAAAGTGGTAAGATAGCCACAAAGGGGAAAATAGCCGCTAAAAAGCAATATGATGACAGTGATAGTTCTGATGATAGTCGTGCAGTTGATAGAAAAGGCCGTAATAAGCATGAGAGAGCTAAGAAACATACGATAGGTGATGGTTCTGGTCTAGAGAAGGGATTCAAATCAAGTGGTGGAGCTCGTGAAGGAGGAAAAGGGAATTTAAATCATGCAGATGGTTTGGATGAGCCGGTGACTGCAGATGATAACAATTCGTACAAGTCTAGGAAAGATGCTATCGATGAGTTCAACCATGCAAATCAACATACAATGAAAAGCAAGAGAAAGTTTGATGAGGGTGGTGAAAATGAGCAGCGAGAAGCAAAATCTAGAAACCGAAATTCTACAAGAGAGTTGGGTTTCTATGGAGACCTCAAGAAGGATTCCAAAATTGATTCTGGATCAAACAGTAGAGCAGGCAATAATAGGTATGATGAGATGAGGGATGGATGGCACAGGGAGGACCCAAAAGTTGATTCCGAATCAAATACTAGAGCACGCTATAGTAGTATGCATGATGAGGATGACCGCAGCAAGTTGGATCGAACAGGAAGCAAATATAATGAAGAAACAGAGCATGGAAGTAGACATTATCGTAAGGCTAATGAGTCTCACCGTCACGGTAGTGCCAGTCCAGATATTGAAGAGGGAAAAAGGCATATCAGATATGAGGAGCATAGAGGGAGAAAGCATGAAAGAGATGAGGGTCTAAAATCTAGCAGGGAAGTTGAAAGGGGAGAGTATCAACCAAGCAGCAAGCTGAGAAGATCTGATAAAGATTATGAAACTAGAGAGAGAAAGCATGAAAGAGATGAGGATCTAAAATCCAGCAGGGAAGTTGAACGGGGAGAGTATCAACCAAGCAGCAAGCTGGGAAGATCTGATAAAGATTATGAAAGTAGAGAATCTAGGGAAGTTGAGAGTGGGGAGTATAAGCCAAGCAGCAAGCTGAGAAGATCTGAGAAAGACTACGAAACTAGAGAATCTAGGAGAGATAGGGAGACGGATTACAGAAAGAGGGCCAAATATGACGATTCTCGATCAAGCAGACGTGATGATTATTAGGCCTGGCAATAGGAGGGTGAGTCGACCTACATTGAAGGTCGGTGCTCCACGGAAACTGAGTTCCCAATCTCGACCCATCCAAGAATCTGGGAGCCTTTGTGGAGATGATAGGAATCTGCTCAAACTTCCTTTGCGTTTGGCTTCGGTACGCTTTCAACCGTGGTCTATCATCTGTAATCGAACGAAATAGAAGGCCTTCCTTGACATAACAAAACAGATTATCGCAAATCTGGAAATGGCGGGCTATGTTTAACTTCCAGTTGCATCAAGAGATGGAGACATAATTGAATGATCCCGCTGCTGACGGATCCATTTTTCATAATCTTGCACAATTTGCCGCAGCAGCAGAGGTACAAGGCTGTCCACGAGAGCTTGTAACATCCTAAGAATAATAGAATAGCCACAGTGAAAAAAAAAATTAGTTTCTTGTTTGAGGGTTGTAATATGTTTCTTATGCTGTTCGACTTCTGTATTACTCGAGAGTTGAAACTGGACTCGAGGATTCGTTGCAAAATTTGGGTGGAACTAGGGAGATTCATCAGAATCCAGAATCACAGTTTTAA
Coding sequence (CDS)
ATGTATAACGGTATTGGGTTACAGACGCCAAGAGGGTCTGGTACAAATGGTTACATTCAGACCAACAAGTTCTTTGTGAGGCCGAAGACCGGAAAGGTTGCTGAAAGCACCAGAGGATTCGAAGAAGACCAAGGTACTGCAGGTGTTTCCAAAAAACCTAACAAGGATATCCTTGAACATGATCGCAAGCGTCAGATTGAACTCAAGCTCGTCATACTCGAGGACAAGCTCATCGACCAAGGTTATACAGATGAGGAGGTTTCTGAAAAGTTGAAGGAGGCTCGAAAAACATTGGAAGCTGCATCAGATCAGGAAGAAAAAGGTGGACCTTCGGCCATAGTACTTTTAGATAAGAGGATCTCAGATACGCAGACCCACCAAATTGCTGCGAGAAAGGAGGAGCAGATGAAAACATTGAGAGCTGCTCTTGGGTTGGGCTCATTGGATGATAGTGAGCAACTTAAAGAAGGGATTTCTGATCCATTAGAAAATAGCAGAGAAGGTGAAAATTCTGATATTAAGCGTCGTGAGAAGTCTGAACATGCTTTCTTGGATAGAGAATTGAACTGGAAAAAGCATGCAAAGGAAGCACACAATGATGATAAGGATAAAAAAATAAGGGTTTCAAAGGAGTCCAAAGGTCATAAGAAAGATAGGAAAAGAAGGCCCAAGGATGATTCTTCTGATACTGATTCTGGTGGAGAACATAAGGGAACCAAGAAGAACTTGAGAGATAATCGAAGAGACGATTCTGAAAGTGACATTGACAGCGATGTTGACAAGAAATACATCACCTCACGGAGGTATAAGAAAAACAGGAGGCATGATAGTGATGATTCTTCTGATACTGATTCTGGCGGAGAGTATAAGAAAGCCAAGAAGAACTTGAGAGATAATAGAAGAGATGATCCTGAAAGTGACCCTGACAGCGATGTTGATAAGAAATACATCACCTCAAGGAAGCATAAGAAAAACAGAAGGCATGATAGTGATGATTCTTCTACTGATTCTGGTAGAGACCATAAAGGAACCAAGAAAAACTTGAGAGAATATCAAAGAGATGATCATGAAAGTGATCCTGACAGTGATGTTGACAAGAAATACTTCACCTCAAAGAAGCAGGGGAAAAGCAAAAGACATGATAGTGATGATTCTGATTCAGTTACAGATGATGATGATTTCGGGAAGGGCAGACATAAGAAAGGATCTGGTAGACCTAAAAGTCAAAAGGTGAAGAAGAAGCTAGGAAGCCGGAAACAGGAGTCGACTGATGAATCCAATTCTGACGTTGGGACTGATGATAAAGGCAGGCTATCGAGGCACAAGAACCATCAGGGTAAAAGACGCAGGGCAGATAGTGATAGCTCTGACCATGACGGTTCTGATTCAGATGTAGGTCGCAACAAGAGTAAGCATAGGTATCATAGCAGAAGTGTGGGGAAGGACAAGGTAGATAGTGAATTTGATACCGAAAAGTCAAGAAAGCATCCTAAAGAAGATGTTGGGAGACACAGGCATGATACCGATGACAAAGAAAGTGGTGATTTTAGCCATAGCAGTGATGAAAAAGTGGAGAGGCGCAAAAGTAAGAGGTATGATACTGATGATGAATCTAATGGAGGAGGTGAACGTTTTGATAGGAAAAGTGGTAAGATAGCCACAAAGGGGAAAATAGCCGCTAAAAAGCAATATGATGACAGTGATAGTTCTGATGATAGTCGTGCAGTTGATAGAAAAGGCCGTAATAAGCATGAGAGAGCTAAGAAACATACGATAGGTGATGGTTCTGGTCTAGAGAAGGGATTCAAATCAAGTGGTGGAGCTCGTGAAGGAGGAAAAGGGAATTTAAATCATGCAGATGGTTTGGATGAGCCGGTGACTGCAGATGATAACAATTCGTACAAGTCTAGGAAAGATGCTATCGATGAGTTCAACCATGCAAATCAACATACAATGAAAAGCAAGAGAAAGTTTGATGAGGGTGGTGAAAATGAGCAGCGAGAAGCAAAATCTAGAAACCGAAATTCTACAAGAGAGTTGGGTTTCTATGGAGACCTCAAGAAGGATTCCAAAATTGATTCTGGATCAAACAGTAGAGCAGGCAATAATAGGTATGATGAGATGAGGGATGGATGGCACAGGGAGGACCCAAAAGTTGATTCCGAATCAAATACTAGAGCACGCTATAGTAGTATGCATGATGAGGATGACCGCAGCAAGTTGGATCGAACAGGAAGCAAATATAATGAAGAAACAGAGCATGGAAGTAGACATTATCGTAAGGCTAATGAGTCTCACCGTCACGGTAGTGCCAGTCCAGATATTGAAGAGGGAAAAAGGCATATCAGATATGAGGAGCATAGAGGGAGAAAGCATGAAAGAGATGAGGGTCTAAAATCTAGCAGGGAAGTTGAAAGGGGAGAGTATCAACCAAGCAGCAAGCTGAGAAGATCTGATAAAGATTATGAAACTAGAGAGAGAAAGCATGAAAGAGATGAGGATCTAAAATCCAGCAGGGAAGTTGAACGGGGAGAGTATCAACCAAGCAGCAAGCTGGGAAGATCTGATAAAGATTATGAAAGTAGAGAATCTAGGGAAGTTGAGAGTGGGGAGTATAAGCCAAGCAGCAAGCTGAGAAGATCTGAGAAAGACTACGAAACTAGAGAATCTAGGAGAGATAGGGAGACGGATTACAGAAAGAGGGCCAAATATGACGATTCTCGATCAAGCAGACGTGATGATTATTAG
Protein sequence
MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEHDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEAASDQEEKGGPSAIVLLDKRISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENSDIKRREKSEHAFLDRELNWKKHAKEAHNDDKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEHKGTKKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEYKKAKKNLRDNRRDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSSTDSGRDHKGTKKNLREYQRDDHESDPDSDVDKKYFTSKKQGKSKRHDSDDSDSVTDDDDFGKGRHKKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKSRKHPKEDVGRHRHDTDDKESGDFSHSSDEKVERRKSKRYDTDDESNGGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGLEKGFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDEGGENEQREAKSRNRNSTRELGFYGDLKKDSKIDSGSNSRAGNNRYDEMRDGWHREDPKVDSESNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANESHRHGSASPDIEEGKRHIRYEEHRGRKHERDEGLKSSREVERGEYQPSSKLRRSDKDYETRERKHERDEDLKSSREVERGEYQPSSKLGRSDKDYESRESREVESGEYKPSSKLRRSEKDYETRESRRDRETDYRKRAKYDDSRSSRRDDY
Homology
BLAST of MC08g0362 vs. NCBI nr
Match:
XP_022131365.1 (protein starmaker [Momordica charantia] >XP_022131366.1 protein starmaker [Momordica charantia])
HSP 1 Score: 1599 bits (4141), Expect = 0.0
Identity = 878/915 (95.96%), Postives = 878/915 (95.96%), Query Frame = 0
Query: 1 MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEH 60
MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEH
Sbjct: 1 MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEH 60
Query: 61 DRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEAASDQEEKGGPSAIVLLDKRI 120
DRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEAASDQEEKGGPSAIVLLDKRI
Sbjct: 61 DRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEAASDQEEKGGPSAIVLLDKRI 120
Query: 121 SDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENSDIKRREKSE 180
SDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENSDIKRREKSE
Sbjct: 121 SDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENSDIKRREKSE 180
Query: 181 HAFLDRELNWKKHAKEAHNDDKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEHKGTK 240
HAFLDRELNWKKHAKEAHNDDKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEHKGTK
Sbjct: 181 HAFLDRELNWKKHAKEAHNDDKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEHKGTK 240
Query: 241 KNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEYKKAKKNLRDNR 300
KNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEYKKAKKNLRDNR
Sbjct: 241 KNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEYKKAKKNLRDNR 300
Query: 301 RDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSSTDSGRDHKGTKKNLREYQRDDHESDP 360
RDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSSTDSGRDHKGTKKNLREYQRDDHESDP
Sbjct: 301 RDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSSTDSGRDHKGTKKNLREYQRDDHESDP 360
Query: 361 DSDVDKKYFTSKKQGKSKRHDSDDSDSVTDDDDFGKGRHKKGSGRPKSQKVKKKLGSRKQ 420
DSDVDKKYFTSKKQGKSKRHDSDDSDSVTDDDDFGKGRHKKGSGRPKSQKVKKKLGSRKQ
Sbjct: 361 DSDVDKKYFTSKKQGKSKRHDSDDSDSVTDDDDFGKGRHKKGSGRPKSQKVKKKLGSRKQ 420
Query: 421 ESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVG 480
ESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVG
Sbjct: 421 ESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVG 480
Query: 481 KDKVDSEFDTEKSRKHPKEDVGRHRHDTDDKESGDFSHSSDEKVERRKSKRYDTDDESNG 540
KDKVDSEFDTEKSRKHPKEDVGRHRHDTDDKESGDFSHSSDEKVERRKSKRYDTDDESNG
Sbjct: 481 KDKVDSEFDTEKSRKHPKEDVGRHRHDTDDKESGDFSHSSDEKVERRKSKRYDTDDESNG 540
Query: 541 GGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGLEK 600
GGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGLEK
Sbjct: 541 GGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGLEK 600
Query: 601 GFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDE 660
GFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDE
Sbjct: 601 GFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDE 660
Query: 661 GGENEQREAKSRNRNSTRELGFYGDLKKDSKIDSGSNSRAGNNRYDEMRDGWHREDPKVD 720
GGENEQREAKSRNRNSTRELGFYGDLKKDSKIDSGSNSRAGNNRYDEMRDGWHREDPKVD
Sbjct: 661 GGENEQREAKSRNRNSTRELGFYGDLKKDSKIDSGSNSRAGNNRYDEMRDGWHREDPKVD 720
Query: 721 SESNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANESHRHGSASPDIEEGKR 780
SESNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANESHRHGSASPDIEEGKR
Sbjct: 721 SESNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANESHRHGSASPDIEEGKR 780
Query: 781 HIRYEEHRGRKHERDEGLKSSREVERGEYQPSSKLRRSDKDYETRERKHERDEDLKSSRE 840
HIRYEEHRGRKHERDE DLKSSRE
Sbjct: 781 HIRYEEHRGRKHERDE-------------------------------------DLKSSRE 840
Query: 841 VERGEYQPSSKLGRSDKDYESRESREVESGEYKPSSKLRRSEKDYETRESRRDRETDYRK 900
VERGEYQPSSKLGRSDKDYESRESREVESGEYKPSSKLRRSEKDYETRESRRDRETDYRK
Sbjct: 841 VERGEYQPSSKLGRSDKDYESRESREVESGEYKPSSKLRRSEKDYETRESRRDRETDYRK 878
Query: 901 RAKYDDSRSSRRDDY 915
RAKYDDSRSSRRDDY
Sbjct: 901 RAKYDDSRSSRRDDY 878
BLAST of MC08g0362 vs. NCBI nr
Match:
XP_023545728.1 (protein starmaker-like [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1011 bits (2613), Expect = 0.0
Identity = 621/949 (65.44%), Postives = 700/949 (73.76%), Query Frame = 0
Query: 1 MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEH 60
MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAE+TRGF+EDQGTAGVSKKPNKDILEH
Sbjct: 1 MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAENTRGFDEDQGTAGVSKKPNKDILEH 60
Query: 61 DRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEAASDQEEKGGPSAIVLLDKRI 120
DRKRQIELKLVILEDKL DQGYT++E+S+KLKEAR+TLEAAS EEK GPSAIVL DK++
Sbjct: 61 DRKRQIELKLVILEDKLTDQGYTEDEISQKLKEARETLEAASGSEEKDGPSAIVLADKKV 120
Query: 121 SDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENSDIKRREKSE 180
SDTQ+HQIAARKEEQMKTLRAALGL S +DSEQ+ EGISDP N REG+N+DIKR+EKSE
Sbjct: 121 SDTQSHQIAARKEEQMKTLRAALGLSSSNDSEQVTEGISDPTRNRREGQNADIKRQEKSE 180
Query: 181 HAFLDRELNWKKHAKEAHNDDKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEH-KGT 240
H+FLDRELNWKKH E HNDDK K RVSKE KGH KDR RRPKDDSSD DS GEH KGT
Sbjct: 181 HSFLDRELNWKKHGSEDHNDDKGDKKRVSKELKGHLKDR-RRPKDDSSDNDSVGEHHKGT 240
Query: 241 KKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEYKKAKKNLRDN 300
KKNLRDNRR+DSESD +SD D+KY TSR+ KKNRRHDSD SSDTDSGGE K KK+LRDN
Sbjct: 241 KKNLRDNRRNDSESDFESDDDEKYKTSRKSKKNRRHDSDVSSDTDSGGERKGTKKHLRDN 300
Query: 301 RRDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSS------------------------- 360
RRD P+ DPDS+ D+KY TSRKHKKNRRHDSDDSS
Sbjct: 301 RRDAPKRDPDSNFDQKYATSRKHKKNRRHDSDDSSDTASGEERKGTMKHLRDSRRDAPER 360
Query: 361 -----------------------------TDSGRDHKGTKKNLREYQRD----------- 420
DSG + KGT K+LR+ +RD
Sbjct: 361 DPGSNVDQKHLTSRKHKKNRRHDSDDSSDADSGEERKGTTKHLRDSRRDAPERELDSNFD 420
Query: 421 -----------------------------------------DHESDPDSDVDKKYFTSKK 480
D ESD DSDVDKKY TSKK
Sbjct: 421 QKHITSRKHKKNRRHDSDASSDTDSGGEHKETKKSLKNNRRDLESDTDSDVDKKYTTSKK 480
Query: 481 QGKSKRHDSDDSDSVTDDDDFGKGRHKKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTD 540
Q K+K DSDDSDS D +FG G H+KGSGRPKSQKV KK SRKQESTDESNSD G D
Sbjct: 481 QEKNKSRDSDDSDS--DSGEFGMGSHRKGSGRPKSQKVMKKQRSRKQESTDESNSDSGID 540
Query: 541 DKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKS 600
DKGR +HKN GKR DSDSSD D SDSDVGRNKSKHRYHS+ GK +VDSE D+EK
Sbjct: 541 DKGRQLKHKNQHGKRYGVDSDSSDRDSSDSDVGRNKSKHRYHSKRAGKSRVDSESDSEKL 600
Query: 601 RKHPKEDVGRHRHDTDDKESGDFSHSSDEKVERRKSKRYDTDDESNGGGERFDRKSGKIA 660
RKHPK+DVGR RHDTD+ ESGD S SSDE V+RR+ +RY++DD+S G + KSGK A
Sbjct: 601 RKHPKKDVGRRRHDTDNDESGDNSSSSDEIVKRRRDRRYNSDDKSEEG--EYFGKSGKTA 660
Query: 661 TKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGLEKGFKSSGGAREGGK 720
TKG IAAK+++DDSD SDDS+A+DR+G +K +RAKKH+ GDGS +KG KSSGGARE GK
Sbjct: 661 TKGTIAAKRKHDDSDKSDDSQAIDRRGNDKQKRAKKHSSGDGSDADKGVKSSGGARERGK 720
Query: 721 GNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDEGGENEQR-EAKSR 780
G+ NHADGLDE VTA N SYKSR D++DEFN ANQ TMKSKRK DEGGE+EQ+ EAKSR
Sbjct: 721 GSSNHADGLDESVTAAKNTSYKSRNDSLDEFNRANQQTMKSKRKLDEGGEDEQQPEAKSR 780
Query: 781 NRNSTRELGFYGDLKKDSKIDSGSNSRAGNNRYDEMRDGWHREDPKVDSESNTRARYSSM 839
+R STRE F+GD KKD K DS S+ RA + RY+E RDG +RE+PK+DSESNTR+RYS+
Sbjct: 781 SRISTRESDFHGDPKKDFKNDSESSRRARSGRYEEARDGRYREEPKIDSESNTRSRYSA- 840
BLAST of MC08g0362 vs. NCBI nr
Match:
XP_022929608.1 (dentin sialophosphoprotein-like [Cucurbita moschata])
HSP 1 Score: 1001 bits (2588), Expect = 0.0
Identity = 619/949 (65.23%), Postives = 694/949 (73.13%), Query Frame = 0
Query: 1 MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEH 60
MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAE+TRGF+EDQGTAGVSKKPNKDILEH
Sbjct: 1 MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAENTRGFDEDQGTAGVSKKPNKDILEH 60
Query: 61 DRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEAASDQEEKGGPSAIVLLDKRI 120
DRKRQIELKLVILEDKL DQGYT++E+S+KLKEAR+TLEAAS EEK GPSAIVL DK++
Sbjct: 61 DRKRQIELKLVILEDKLTDQGYTEDEISQKLKEARETLEAASGSEEKDGPSAIVLADKKV 120
Query: 121 SDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENSDIKRREKSE 180
SDTQ+HQIAARKEEQMKTLRAALGL S +DSEQ+ EGISDP N REG+N+DIKR EKSE
Sbjct: 121 SDTQSHQIAARKEEQMKTLRAALGLSSSNDSEQVTEGISDPTRNRREGQNADIKRHEKSE 180
Query: 181 HAFLDRELNWKKHAKEAHNDDKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEH-KGT 240
H+FLDRELNWKKH E HNDDK K RVSKE KGH KDR RRPKDDSSD DS GEH KGT
Sbjct: 181 HSFLDRELNWKKHGSEDHNDDKGDKKRVSKELKGHPKDR-RRPKDDSSDNDSVGEHHKGT 240
Query: 241 KKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEYKKAKKNLRDN 300
KKNLRDNRR+DSESD +SD D KY TSR+ KKNRRHDSD SSDTDSGGE K KK+LRDN
Sbjct: 241 KKNLRDNRRNDSESDFESDDDDKYKTSRKSKKNRRHDSDVSSDTDSGGERKGTKKHLRDN 300
Query: 301 RRDDPESDPDSDVDKKYITSRKHKKNRRHDSDDS-------------------------- 360
RRD P+ DPDS+ D+KY TSRKHKKNRRHDSDDS
Sbjct: 301 RRDAPKRDPDSNFDQKYATSRKHKKNRRHDSDDSLDTASGEERKGTMKHLRDSRRDAPER 360
Query: 361 ----------------------------STDSGRDHKGTKKNLREYQRD----------- 420
TDSG + KGT K+LR+ +RD
Sbjct: 361 DPGSNFDQKHLTSRKHKKNRRHDSDDSSDTDSGEERKGTTKHLRDSRRDAPERELDSNFD 420
Query: 421 -----------------------------------------DHESDPDSDVDKKYFTSKK 480
D ESD DSD+DKKY TSKK
Sbjct: 421 QKHITSRKHKKNRRHDSDASSDTDSGGEHKETKKSLKNNRRDLESDTDSDIDKKYTTSKK 480
Query: 481 QGKSKRHDSDDSDSVTDDDDFGKGRHKKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTD 540
Q K+K SDDSDS D +FG G H+KGSGR KSQKV KK RKQESTDESNSD G D
Sbjct: 481 QEKNKSRGSDDSDS--DSGEFGMGSHRKGSGRAKSQKVMKKQRGRKQESTDESNSDSGID 540
Query: 541 DKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKS 600
DKGR +HKN GKR DSDSSD D SDSDVGRNKSKHRY S+ GK +VDSE D+EK
Sbjct: 541 DKGRQLKHKNQHGKRYGVDSDSSDRDSSDSDVGRNKSKHRYQSKRAGKSRVDSESDSEKL 600
Query: 601 RKHPKEDVGRHRHDTDDKESGDFSHSSDEKVERRKSKRYDTDDESNGGGERFDRKSGKIA 660
RKHPK+DVGR RHDTD+ ESGD S SSDE V+ R+ +R+++DD+S GE F KSGKIA
Sbjct: 601 RKHPKKDVGRRRHDTDNDESGDNSSSSDEIVKWRRDRRHNSDDKSEEEGEYFG-KSGKIA 660
Query: 661 TKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGLEKGFKSSGGAREGGK 720
TKG IAAK+++DDSD SDDS+AVDRKG +K +RAKKH+ GDGS +KG KSSGGARE GK
Sbjct: 661 TKGTIAAKRKHDDSDKSDDSQAVDRKGNDKQKRAKKHSSGDGSDADKGVKSSGGARERGK 720
Query: 721 GNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDEGGENEQR-EAKSR 780
G+ NHADGLDE VTA N SYKSR D +DEFN ANQ TMKSKRK DEGGE+EQ+ EAKSR
Sbjct: 721 GSSNHADGLDESVTAAKNTSYKSRNDPLDEFNRANQQTMKSKRKLDEGGEDEQQPEAKSR 780
Query: 781 NRNSTRELGFYGDLKKDSKIDSGSNSRAGNNRYDEMRDGWHREDPKVDSESNTRARYSSM 839
+R STRE F+GD KKD K DS S+ RA + RY+E RDG +REDPK+DSESN R+RYS+
Sbjct: 781 SRISTRESDFHGDPKKDFKNDSESSRRARSGRYEETRDGRYREDPKIDSESNARSRYSA- 840
BLAST of MC08g0362 vs. NCBI nr
Match:
XP_022997381.1 (dentin sialophosphoprotein-like [Cucurbita maxima])
HSP 1 Score: 999 bits (2582), Expect = 0.0
Identity = 616/949 (64.91%), Postives = 700/949 (73.76%), Query Frame = 0
Query: 1 MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEH 60
MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAE+TRGF+EDQGTAGVSKKPNKDILEH
Sbjct: 1 MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAENTRGFDEDQGTAGVSKKPNKDILEH 60
Query: 61 DRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEAASDQEEKGGPSAIVLLDKRI 120
DRKRQIELKLVILEDKL DQGYT++E+S+KLKEAR+TLEAAS EEK GPSAIVL DK++
Sbjct: 61 DRKRQIELKLVILEDKLTDQGYTEDEISQKLKEARETLEAASGSEEKDGPSAIVLADKKV 120
Query: 121 SDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENSDIKRREKSE 180
SDTQ+HQIAARKEEQMKTLRAALGL S +DSEQ+ EGISDP N REG+N+DIKR+EKSE
Sbjct: 121 SDTQSHQIAARKEEQMKTLRAALGLSSSNDSEQVTEGISDPTRNRREGQNADIKRQEKSE 180
Query: 181 HAFLDRELNWKKHAKEAHNDDKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEH-KGT 240
H+FLDRELNWK+H E HNDDK K RVSKE KGH KDR RRPKDDSSD DS GEH KGT
Sbjct: 181 HSFLDRELNWKRHGSEDHNDDKGDKKRVSKELKGHLKDR-RRPKDDSSDNDSVGEHHKGT 240
Query: 241 KKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEYKKAKKNLRDN 300
KKNLRDNRR DSESD +SD D KY TSR+ KKNRRHDSD SSDTDSGGE K KK+LRDN
Sbjct: 241 KKNLRDNRRKDSESDFESDDDDKYKTSRKSKKNRRHDSDASSDTDSGGERKGTKKHLRDN 300
Query: 301 RRDDPESDPDSDVDKKY------------------------------------------- 360
RRD P+ DPDS+ D+KY
Sbjct: 301 RRDAPKRDPDSNFDQKYATSRKHKKNRRHDRDNSSDTDFGEERKGTMKHLRDSRRDAPER 360
Query: 361 ----------ITSRKHKKNRRHDSDDSS-TDSGRDHKGTKKNLREYQRD----------- 420
ITSRKHKKNRRHDSDDSS TDSG + KGT K+LR+ +RD
Sbjct: 361 DPGSNFDHKHITSRKHKKNRRHDSDDSSDTDSGEERKGTTKHLRDSRRDAPEREPDSNFD 420
Query: 421 -----------------------------------------DHESDPDSDVDKKYFTSKK 480
D ESD DSD+DKKY TSKK
Sbjct: 421 QKHITSMKHKKNRRHDSDASSDTDSGGEHKETKKSLKNNRRDLESDTDSDIDKKYTTSKK 480
Query: 481 QGKSKRHDSDDSDSVTDDDDFGKGRHKKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTD 540
Q K+K DSDDSDS D +FG G H+KGSGRPKSQKV KK SRKQESTDESNSD G D
Sbjct: 481 QEKNKSRDSDDSDS--DSGEFGMGSHRKGSGRPKSQKVMKKQRSRKQESTDESNSDSGID 540
Query: 541 DKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKS 600
DKGR ++KN GKR DSDSSD D SDSDVGRNKSKHRYHS+ GK +VDSE D+EK
Sbjct: 541 DKGRQLKNKNQHGKRYGVDSDSSDRDSSDSDVGRNKSKHRYHSKRTGKSRVDSESDSEKL 600
Query: 601 RKHPKEDVGRHRHDTDDKESGDFSHSSDEKVERRKSKRYDTDDESNGGGERFDRKSGKIA 660
RKHPK+DVGR RHDTD+ ESGD S SSDE V+RR+ +R+++DD+S G + KSGKIA
Sbjct: 601 RKHPKKDVGRRRHDTDNDESGDNSSSSDEIVKRRRDRRHNSDDKSEEG--EYFGKSGKIA 660
Query: 661 TKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGLEKGFKSSGGAREGGK 720
TKG IAAK++++DSD SDDS+AVDR+G +K +RAKKH+ GDGS +KG KSSGGARE GK
Sbjct: 661 TKGTIAAKRKHEDSDKSDDSQAVDRRGNDKQKRAKKHSYGDGSDADKGVKSSGGARERGK 720
Query: 721 GNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDEGGENEQR-EAKSR 780
G+ NHADGLDE VTA N SYKSR D++DEFN ANQ TMKSKRK DEGGE+EQ+ EAKS+
Sbjct: 721 GSSNHADGLDESVTAAKNTSYKSRNDSLDEFNRANQQTMKSKRKLDEGGEDEQQPEAKSQ 780
Query: 781 NRNSTRELGFYGDLKKDSKIDSGSNSRAGNNRYDEMRDGWHREDPKVDSESNTRARYSSM 839
+R STRE F+GD KKD K DS S+ RA + R+ E RDG +REDPK+DSESN R+RYS+
Sbjct: 781 SRISTRESDFHGDPKKDFKNDSESSRRARSGRHKETRDGRYREDPKIDSESNARSRYSAH 840
BLAST of MC08g0362 vs. NCBI nr
Match:
XP_038884695.1 (dentin sialophosphoprotein-like [Benincasa hispida] >XP_038884696.1 dentin sialophosphoprotein-like [Benincasa hispida])
HSP 1 Score: 965 bits (2494), Expect = 0.0
Identity = 597/885 (67.46%), Postives = 678/885 (76.61%), Query Frame = 0
Query: 1 MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEH 60
MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEH
Sbjct: 1 MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEH 60
Query: 61 DRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEAASDQEEKGGPSAIVLLDKRI 120
DRKRQIELKLVILEDKLIDQGYT +E+SEKL+EAR+TLEAAS EEK GPSAIVL DKR+
Sbjct: 61 DRKRQIELKLVILEDKLIDQGYTSDEISEKLREARETLEAASGSEEKDGPSAIVLADKRV 120
Query: 121 SDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENSDIKRREKSE 180
SDTQTHQIAARKEEQMKTLRAALGLGS D+EQ+KE ISDP REG+N+DIKR EKSE
Sbjct: 121 SDTQTHQIAARKEEQMKTLRAALGLGSSGDTEQVKEEISDPSRERREGQNADIKRHEKSE 180
Query: 181 HAFLDRELNWKKHAKEAHNDDKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEHKGTK 240
H+FLDRELNWKKH E DDKD K R+SKE KGH+K RKRRPKDDSSDTDS
Sbjct: 181 HSFLDRELNWKKHGPEDQYDDKDDKKRISKELKGHQKGRKRRPKDDSSDTDS------VD 240
Query: 241 KNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEYKKAKKNLRDNR 300
+NLRD+RR+DSESD+DSDV KY+ SR KNRRHDSDDSSDTDSGGE K KK+LRD R
Sbjct: 241 RNLRDSRRNDSESDLDSDVGHKYVASR---KNRRHDSDDSSDTDSGGERKGTKKHLRDKR 300
Query: 301 RDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSS-TDSGRDHKGTKKNLREYQRDDHESD 360
RDDPESDPDSD D+KYITSRKHKKNRRHD D+SS TDSG +HK TKKN+R +R H SD
Sbjct: 301 RDDPESDPDSDFDQKYITSRKHKKNRRHDRDNSSDTDSGGEHKKTKKNMRNNRRG-HGSD 360
Query: 361 PDSDVDKKYFTSKKQGKSKRHDSDDSDSVTDDDDFGKG-RHKKGSGRPKSQKVKKKLGSR 420
P SD+DKKY SKK K++RHDSDDSDS+TD D+FG G HKKGS R KSQKVK + SR
Sbjct: 361 PSSDIDKKYTFSKKPEKNRRHDSDDSDSITDGDEFGMGGSHKKGSSRHKSQKVKNQR-SR 420
Query: 421 KQESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRS 480
KQESTDESNSD G D+K R +H+N GKR +SDSSDHD SDSDVG KSKHRY S+
Sbjct: 421 KQESTDESNSDSGIDNKRRQLKHRNQHGKRYGVESDSSDHDSSDSDVGCKKSKHRYDSKR 480
Query: 481 VGKDKVDSEFDTEKSRKHPKEDVGRHRHDTDDKESGDFSHSSDEKVERRKSKRYDTDDES 540
GK +VDSE ++EKSRKH K+D GRHRHD D+++SGD S S E V+RR+ + Y+ DD S
Sbjct: 481 AGKSRVDSESNSEKSRKHRKKDGGRHRHDIDNEKSGDNSSSGVEIVKRRRGRSYNADDNS 540
Query: 541 NGGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGL 600
GE R SGKIATKGKI AK+Q+DD+++SDDS AV RKG +KH+RAKK + GD S L
Sbjct: 541 EEEGEYLGR-SGKIATKGKIDAKRQHDDNENSDDSLAVGRKGNDKHKRAKKCSSGDDSDL 600
Query: 601 EKGFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHT--MKSKR 660
EKG K+SGGARE GKG+LNHADGL+ K +KD+I+EFNHA+Q T M SKR
Sbjct: 601 EKGVKASGGARERGKGSLNHADGLE-----------KFKKDSINEFNHASQQTDTMNSKR 660
Query: 661 KFDEGGENEQR-EAKSRNRNSTRELGFYGDLKK--------------------------- 720
KFDEGG+NEQ+ E+KSRNRNSTR F+GD KK
Sbjct: 661 KFDEGGKNEQQLESKSRNRNSTRGSDFHGDPKKGFENDSESSRRARSGRYDEKRDGRYRE 720
Query: 721 DSKIDSG--------------SNSRAGNNRYDEMRDGWHREDPKVDSESNTRARYSSMHD 780
D KIDS S+ RA + RYDE RDG +REDPK+DSESN R+RYS + D
Sbjct: 721 DPKIDSDFHGNPKKGFENDSESSRRARSGRYDETRDGRYREDPKIDSESNIRSRYS-VQD 780
Query: 781 EDDRSKLDRTGSKYNEETEHGSRHYRKANESHRHGSASPDIEEGKRHIRYEEHRGRKHER 839
EDD K +TGS++ EETEHGSRH+RKANESH D EE KRH RYEE RGRKHER
Sbjct: 781 EDDDRKATQTGSRFTEETEHGSRHHRKANESHHRSRTGKDTEEEKRHSRYEEPRGRKHER 840
BLAST of MC08g0362 vs. ExPASy TrEMBL
Match:
A0A6J1BPI2 (protein starmaker OS=Momordica charantia OX=3673 GN=LOC111004608 PE=4 SV=1)
HSP 1 Score: 1599 bits (4141), Expect = 0.0
Identity = 878/915 (95.96%), Postives = 878/915 (95.96%), Query Frame = 0
Query: 1 MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEH 60
MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEH
Sbjct: 1 MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEH 60
Query: 61 DRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEAASDQEEKGGPSAIVLLDKRI 120
DRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEAASDQEEKGGPSAIVLLDKRI
Sbjct: 61 DRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEAASDQEEKGGPSAIVLLDKRI 120
Query: 121 SDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENSDIKRREKSE 180
SDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENSDIKRREKSE
Sbjct: 121 SDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENSDIKRREKSE 180
Query: 181 HAFLDRELNWKKHAKEAHNDDKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEHKGTK 240
HAFLDRELNWKKHAKEAHNDDKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEHKGTK
Sbjct: 181 HAFLDRELNWKKHAKEAHNDDKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEHKGTK 240
Query: 241 KNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEYKKAKKNLRDNR 300
KNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEYKKAKKNLRDNR
Sbjct: 241 KNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEYKKAKKNLRDNR 300
Query: 301 RDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSSTDSGRDHKGTKKNLREYQRDDHESDP 360
RDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSSTDSGRDHKGTKKNLREYQRDDHESDP
Sbjct: 301 RDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSSTDSGRDHKGTKKNLREYQRDDHESDP 360
Query: 361 DSDVDKKYFTSKKQGKSKRHDSDDSDSVTDDDDFGKGRHKKGSGRPKSQKVKKKLGSRKQ 420
DSDVDKKYFTSKKQGKSKRHDSDDSDSVTDDDDFGKGRHKKGSGRPKSQKVKKKLGSRKQ
Sbjct: 361 DSDVDKKYFTSKKQGKSKRHDSDDSDSVTDDDDFGKGRHKKGSGRPKSQKVKKKLGSRKQ 420
Query: 421 ESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVG 480
ESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVG
Sbjct: 421 ESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVG 480
Query: 481 KDKVDSEFDTEKSRKHPKEDVGRHRHDTDDKESGDFSHSSDEKVERRKSKRYDTDDESNG 540
KDKVDSEFDTEKSRKHPKEDVGRHRHDTDDKESGDFSHSSDEKVERRKSKRYDTDDESNG
Sbjct: 481 KDKVDSEFDTEKSRKHPKEDVGRHRHDTDDKESGDFSHSSDEKVERRKSKRYDTDDESNG 540
Query: 541 GGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGLEK 600
GGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGLEK
Sbjct: 541 GGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGLEK 600
Query: 601 GFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDE 660
GFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDE
Sbjct: 601 GFKSSGGAREGGKGNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDE 660
Query: 661 GGENEQREAKSRNRNSTRELGFYGDLKKDSKIDSGSNSRAGNNRYDEMRDGWHREDPKVD 720
GGENEQREAKSRNRNSTRELGFYGDLKKDSKIDSGSNSRAGNNRYDEMRDGWHREDPKVD
Sbjct: 661 GGENEQREAKSRNRNSTRELGFYGDLKKDSKIDSGSNSRAGNNRYDEMRDGWHREDPKVD 720
Query: 721 SESNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANESHRHGSASPDIEEGKR 780
SESNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANESHRHGSASPDIEEGKR
Sbjct: 721 SESNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANESHRHGSASPDIEEGKR 780
Query: 781 HIRYEEHRGRKHERDEGLKSSREVERGEYQPSSKLRRSDKDYETRERKHERDEDLKSSRE 840
HIRYEEHRGRKHERDE DLKSSRE
Sbjct: 781 HIRYEEHRGRKHERDE-------------------------------------DLKSSRE 840
Query: 841 VERGEYQPSSKLGRSDKDYESRESREVESGEYKPSSKLRRSEKDYETRESRRDRETDYRK 900
VERGEYQPSSKLGRSDKDYESRESREVESGEYKPSSKLRRSEKDYETRESRRDRETDYRK
Sbjct: 841 VERGEYQPSSKLGRSDKDYESRESREVESGEYKPSSKLRRSEKDYETRESRRDRETDYRK 878
Query: 901 RAKYDDSRSSRRDDY 915
RAKYDDSRSSRRDDY
Sbjct: 901 RAKYDDSRSSRRDDY 878
BLAST of MC08g0362 vs. ExPASy TrEMBL
Match:
A0A6J1ESM6 (dentin sialophosphoprotein-like OS=Cucurbita moschata OX=3662 GN=LOC111436144 PE=4 SV=1)
HSP 1 Score: 1001 bits (2588), Expect = 0.0
Identity = 619/949 (65.23%), Postives = 694/949 (73.13%), Query Frame = 0
Query: 1 MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEH 60
MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAE+TRGF+EDQGTAGVSKKPNKDILEH
Sbjct: 1 MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAENTRGFDEDQGTAGVSKKPNKDILEH 60
Query: 61 DRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEAASDQEEKGGPSAIVLLDKRI 120
DRKRQIELKLVILEDKL DQGYT++E+S+KLKEAR+TLEAAS EEK GPSAIVL DK++
Sbjct: 61 DRKRQIELKLVILEDKLTDQGYTEDEISQKLKEARETLEAASGSEEKDGPSAIVLADKKV 120
Query: 121 SDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENSDIKRREKSE 180
SDTQ+HQIAARKEEQMKTLRAALGL S +DSEQ+ EGISDP N REG+N+DIKR EKSE
Sbjct: 121 SDTQSHQIAARKEEQMKTLRAALGLSSSNDSEQVTEGISDPTRNRREGQNADIKRHEKSE 180
Query: 181 HAFLDRELNWKKHAKEAHNDDKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEH-KGT 240
H+FLDRELNWKKH E HNDDK K RVSKE KGH KDR RRPKDDSSD DS GEH KGT
Sbjct: 181 HSFLDRELNWKKHGSEDHNDDKGDKKRVSKELKGHPKDR-RRPKDDSSDNDSVGEHHKGT 240
Query: 241 KKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEYKKAKKNLRDN 300
KKNLRDNRR+DSESD +SD D KY TSR+ KKNRRHDSD SSDTDSGGE K KK+LRDN
Sbjct: 241 KKNLRDNRRNDSESDFESDDDDKYKTSRKSKKNRRHDSDVSSDTDSGGERKGTKKHLRDN 300
Query: 301 RRDDPESDPDSDVDKKYITSRKHKKNRRHDSDDS-------------------------- 360
RRD P+ DPDS+ D+KY TSRKHKKNRRHDSDDS
Sbjct: 301 RRDAPKRDPDSNFDQKYATSRKHKKNRRHDSDDSLDTASGEERKGTMKHLRDSRRDAPER 360
Query: 361 ----------------------------STDSGRDHKGTKKNLREYQRD----------- 420
TDSG + KGT K+LR+ +RD
Sbjct: 361 DPGSNFDQKHLTSRKHKKNRRHDSDDSSDTDSGEERKGTTKHLRDSRRDAPERELDSNFD 420
Query: 421 -----------------------------------------DHESDPDSDVDKKYFTSKK 480
D ESD DSD+DKKY TSKK
Sbjct: 421 QKHITSRKHKKNRRHDSDASSDTDSGGEHKETKKSLKNNRRDLESDTDSDIDKKYTTSKK 480
Query: 481 QGKSKRHDSDDSDSVTDDDDFGKGRHKKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTD 540
Q K+K SDDSDS D +FG G H+KGSGR KSQKV KK RKQESTDESNSD G D
Sbjct: 481 QEKNKSRGSDDSDS--DSGEFGMGSHRKGSGRAKSQKVMKKQRGRKQESTDESNSDSGID 540
Query: 541 DKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKS 600
DKGR +HKN GKR DSDSSD D SDSDVGRNKSKHRY S+ GK +VDSE D+EK
Sbjct: 541 DKGRQLKHKNQHGKRYGVDSDSSDRDSSDSDVGRNKSKHRYQSKRAGKSRVDSESDSEKL 600
Query: 601 RKHPKEDVGRHRHDTDDKESGDFSHSSDEKVERRKSKRYDTDDESNGGGERFDRKSGKIA 660
RKHPK+DVGR RHDTD+ ESGD S SSDE V+ R+ +R+++DD+S GE F KSGKIA
Sbjct: 601 RKHPKKDVGRRRHDTDNDESGDNSSSSDEIVKWRRDRRHNSDDKSEEEGEYFG-KSGKIA 660
Query: 661 TKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGLEKGFKSSGGAREGGK 720
TKG IAAK+++DDSD SDDS+AVDRKG +K +RAKKH+ GDGS +KG KSSGGARE GK
Sbjct: 661 TKGTIAAKRKHDDSDKSDDSQAVDRKGNDKQKRAKKHSSGDGSDADKGVKSSGGARERGK 720
Query: 721 GNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDEGGENEQR-EAKSR 780
G+ NHADGLDE VTA N SYKSR D +DEFN ANQ TMKSKRK DEGGE+EQ+ EAKSR
Sbjct: 721 GSSNHADGLDESVTAAKNTSYKSRNDPLDEFNRANQQTMKSKRKLDEGGEDEQQPEAKSR 780
Query: 781 NRNSTRELGFYGDLKKDSKIDSGSNSRAGNNRYDEMRDGWHREDPKVDSESNTRARYSSM 839
+R STRE F+GD KKD K DS S+ RA + RY+E RDG +REDPK+DSESN R+RYS+
Sbjct: 781 SRISTRESDFHGDPKKDFKNDSESSRRARSGRYEETRDGRYREDPKIDSESNARSRYSA- 840
BLAST of MC08g0362 vs. ExPASy TrEMBL
Match:
A0A6J1K7B6 (dentin sialophosphoprotein-like OS=Cucurbita maxima OX=3661 GN=LOC111492317 PE=4 SV=1)
HSP 1 Score: 999 bits (2582), Expect = 0.0
Identity = 616/949 (64.91%), Postives = 700/949 (73.76%), Query Frame = 0
Query: 1 MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEH 60
MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAE+TRGF+EDQGTAGVSKKPNKDILEH
Sbjct: 1 MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAENTRGFDEDQGTAGVSKKPNKDILEH 60
Query: 61 DRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEAASDQEEKGGPSAIVLLDKRI 120
DRKRQIELKLVILEDKL DQGYT++E+S+KLKEAR+TLEAAS EEK GPSAIVL DK++
Sbjct: 61 DRKRQIELKLVILEDKLTDQGYTEDEISQKLKEARETLEAASGSEEKDGPSAIVLADKKV 120
Query: 121 SDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENSDIKRREKSE 180
SDTQ+HQIAARKEEQMKTLRAALGL S +DSEQ+ EGISDP N REG+N+DIKR+EKSE
Sbjct: 121 SDTQSHQIAARKEEQMKTLRAALGLSSSNDSEQVTEGISDPTRNRREGQNADIKRQEKSE 180
Query: 181 HAFLDRELNWKKHAKEAHNDDKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEH-KGT 240
H+FLDRELNWK+H E HNDDK K RVSKE KGH KDR RRPKDDSSD DS GEH KGT
Sbjct: 181 HSFLDRELNWKRHGSEDHNDDKGDKKRVSKELKGHLKDR-RRPKDDSSDNDSVGEHHKGT 240
Query: 241 KKNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEYKKAKKNLRDN 300
KKNLRDNRR DSESD +SD D KY TSR+ KKNRRHDSD SSDTDSGGE K KK+LRDN
Sbjct: 241 KKNLRDNRRKDSESDFESDDDDKYKTSRKSKKNRRHDSDASSDTDSGGERKGTKKHLRDN 300
Query: 301 RRDDPESDPDSDVDKKY------------------------------------------- 360
RRD P+ DPDS+ D+KY
Sbjct: 301 RRDAPKRDPDSNFDQKYATSRKHKKNRRHDRDNSSDTDFGEERKGTMKHLRDSRRDAPER 360
Query: 361 ----------ITSRKHKKNRRHDSDDSS-TDSGRDHKGTKKNLREYQRD----------- 420
ITSRKHKKNRRHDSDDSS TDSG + KGT K+LR+ +RD
Sbjct: 361 DPGSNFDHKHITSRKHKKNRRHDSDDSSDTDSGEERKGTTKHLRDSRRDAPEREPDSNFD 420
Query: 421 -----------------------------------------DHESDPDSDVDKKYFTSKK 480
D ESD DSD+DKKY TSKK
Sbjct: 421 QKHITSMKHKKNRRHDSDASSDTDSGGEHKETKKSLKNNRRDLESDTDSDIDKKYTTSKK 480
Query: 481 QGKSKRHDSDDSDSVTDDDDFGKGRHKKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTD 540
Q K+K DSDDSDS D +FG G H+KGSGRPKSQKV KK SRKQESTDESNSD G D
Sbjct: 481 QEKNKSRDSDDSDS--DSGEFGMGSHRKGSGRPKSQKVMKKQRSRKQESTDESNSDSGID 540
Query: 541 DKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSVGKDKVDSEFDTEKS 600
DKGR ++KN GKR DSDSSD D SDSDVGRNKSKHRYHS+ GK +VDSE D+EK
Sbjct: 541 DKGRQLKNKNQHGKRYGVDSDSSDRDSSDSDVGRNKSKHRYHSKRTGKSRVDSESDSEKL 600
Query: 601 RKHPKEDVGRHRHDTDDKESGDFSHSSDEKVERRKSKRYDTDDESNGGGERFDRKSGKIA 660
RKHPK+DVGR RHDTD+ ESGD S SSDE V+RR+ +R+++DD+S G + KSGKIA
Sbjct: 601 RKHPKKDVGRRRHDTDNDESGDNSSSSDEIVKRRRDRRHNSDDKSEEG--EYFGKSGKIA 660
Query: 661 TKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGLEKGFKSSGGAREGGK 720
TKG IAAK++++DSD SDDS+AVDR+G +K +RAKKH+ GDGS +KG KSSGGARE GK
Sbjct: 661 TKGTIAAKRKHEDSDKSDDSQAVDRRGNDKQKRAKKHSYGDGSDADKGVKSSGGARERGK 720
Query: 721 GNLNHADGLDEPVTADDNNSYKSRKDAIDEFNHANQHTMKSKRKFDEGGENEQR-EAKSR 780
G+ NHADGLDE VTA N SYKSR D++DEFN ANQ TMKSKRK DEGGE+EQ+ EAKS+
Sbjct: 721 GSSNHADGLDESVTAAKNTSYKSRNDSLDEFNRANQQTMKSKRKLDEGGEDEQQPEAKSQ 780
Query: 781 NRNSTRELGFYGDLKKDSKIDSGSNSRAGNNRYDEMRDGWHREDPKVDSESNTRARYSSM 839
+R STRE F+GD KKD K DS S+ RA + R+ E RDG +REDPK+DSESN R+RYS+
Sbjct: 781 SRISTRESDFHGDPKKDFKNDSESSRRARSGRHKETRDGRYREDPKIDSESNARSRYSAH 840
BLAST of MC08g0362 vs. ExPASy TrEMBL
Match:
A0A5A7VCH8 (Dentin sialophosphoprotein-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G003050 PE=4 SV=1)
HSP 1 Score: 932 bits (2410), Expect = 0.0
Identity = 591/907 (65.16%), Postives = 676/907 (74.53%), Query Frame = 0
Query: 1 MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEH 60
MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEH
Sbjct: 290 MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEH 349
Query: 61 DRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEAASDQEEKGGPSAIVLLDKRI 120
DRKRQIELKLVILEDKL DQGYT++E+SEKL+EAR+ LEAAS EEK G SAIVL DKR+
Sbjct: 350 DRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEAASGSEEKDGSSAIVLADKRV 409
Query: 121 SDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENSDIKRREKSE 180
SDTQTHQIAARKEEQMKTLRAALGLGSLDD EQ+KE ISDP + REG+N+DIKR EKSE
Sbjct: 410 SDTQTHQIAARKEEQMKTLRAALGLGSLDDGEQVKEEISDPSRSRREGQNADIKRHEKSE 469
Query: 181 HAFLDRELNWKKHAKEAHNDDKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEHKGTK 240
H+FLDRELNWK+ E DDKD K SKE KGH+KD+KRRPKDDSSDTDSG EHKGTK
Sbjct: 470 HSFLDRELNWKRRGTEDQFDDKDVKKGASKELKGHQKDKKRRPKDDSSDTDSG-EHKGTK 529
Query: 241 KNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEYKKAKKNLRDNR 300
KNLRD+RR DSES++D DV+ KY+ SR+ KKNRRHDSDDSS TDSGGE+K KK+ R+ R
Sbjct: 530 KNLRDSRRIDSESELDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNKR 589
Query: 301 RDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSS-TDSGRDHKGTKKNLREYQRDDHESD 360
+DDPESD DSD+D+KY+TSRKHKKNRRHDSDDSS +DSG +HK TK+++R QR H SD
Sbjct: 590 KDDPESDSDSDLDQKYLTSRKHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRG-HGSD 649
Query: 361 PDSDVDKKYFTSKKQGKSKRHDSDDSDSVTDDDDFGKGRHKKGSGRPKSQKVKKKLGSRK 420
PDSDVDKK+ TSKKQ KS RHDSDDSDS TD D G H+KGSGR +SQKVKK+ S+K
Sbjct: 650 PDSDVDKKH-TSKKQKKSTRHDSDDSDSFTDGDKIGMDSHQKGSGRHESQKVKKQR-SQK 709
Query: 421 QESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSV 480
Q+STDE+NSD +DK R +HKN GKR +SDSSDHD SDSDVGR KS HR+HS+
Sbjct: 710 QDSTDETNSDSVVEDKHRQLKHKNQHGKRY-GESDSSDHDSSDSDVGRKKSTHRFHSKRT 769
Query: 481 GKDKVDSEFDTEKSRKHPKEDVGRHRHDTDDKESGDFSHSSDEKVERRKSKRYDTDDESN 540
GK +VDSE D EKSRK+PK+DV R RHD DD++SGD S SSDE V+RR+ +R+ TDD S
Sbjct: 770 GKSRVDSESDFEKSRKYPKKDVRRRRHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSE 829
Query: 541 GGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGLE 600
GE F R SGKI TKGKI AK+Q D S++SD S AVDRKG ++H+RAKK++ GDG LE
Sbjct: 830 EEGEYFGR-SGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGDGFNLE 889
Query: 601 KGFKSSGGAREGGKGNLNHADGL---------------------------------DEPV 660
KG K S GARE GKGNLNH +G D+
Sbjct: 890 KGRKLSSGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGKIATKRKMDAKRQHDDSE 949
Query: 661 TADDNNSYKSR--------------------------------------KDAIDEFNHAN 720
+DD+ + K + KD+I EFNHA+
Sbjct: 950 NSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGARERGKNRADGLDKFKKDSIHEFNHAS 1009
Query: 721 QHT--MKSKRKFDEGGENEQR-EAKSRNRNSTRELGFYGDLKKDSKIDSGSNSRAGNNRY 780
Q T M SKRK DEG ENEQ E+KSRNRNS D KKD K DS S+ R+ + RY
Sbjct: 1010 QRTDKMNSKRKLDEGRENEQEPESKSRNRNS--------DPKKDFKHDSESSRRSRSGRY 1069
Query: 781 DEMRDGWHREDPKVDSESNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANES 831
DE RDG +RED K+DSESNTR+RYS+ H+EDD K RTGS+Y EETEHGSRH+RKANES
Sbjct: 1070 DETRDGRYREDSKIDSESNTRSRYSA-HNEDDDRKSTRTGSRYTEETEHGSRHHRKANES 1129
BLAST of MC08g0362 vs. ExPASy TrEMBL
Match:
A0A1S3BBX0 (dentin sialophosphoprotein-like OS=Cucumis melo OX=3656 GN=LOC103488247 PE=4 SV=1)
HSP 1 Score: 924 bits (2389), Expect = 0.0
Identity = 588/907 (64.83%), Postives = 672/907 (74.09%), Query Frame = 0
Query: 1 MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEH 60
MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEH
Sbjct: 1 MYNGIGLQTPRGSGTNGYIQTNKFFVRPKTGKVAESTRGFEEDQGTAGVSKKPNKDILEH 60
Query: 61 DRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEAASDQEEKGGPSAIVLLDKRI 120
DRKRQIELKLVILEDKL DQGYT++E+SEKL+EAR+ LEAAS EEK G SAIVL DKR+
Sbjct: 61 DRKRQIELKLVILEDKLNDQGYTEKEISEKLREARENLEAASGSEEKDGSSAIVLADKRV 120
Query: 121 SDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENSDIKRREKSE 180
SDTQTHQIAARKEEQMKTLRAALGLGSL D EQ+KE ISDP + REG+N+DIKR EKSE
Sbjct: 121 SDTQTHQIAARKEEQMKTLRAALGLGSLGDGEQVKEEISDPSRSRREGQNADIKRHEKSE 180
Query: 181 HAFLDRELNWKKHAKEAHNDDKDKKIRVSKESKGHKKDRKRRPKDDSSDTDSGGEHKGTK 240
H+FLDRELNWK+ E DDKD K SKE KGH+KD+KRRPKDD SD DSG EHKGTK
Sbjct: 181 HSFLDRELNWKRRGTEDQFDDKDVKKGASKELKGHQKDKKRRPKDDFSDADSG-EHKGTK 240
Query: 241 KNLRDNRRDDSESDIDSDVDKKYITSRRYKKNRRHDSDDSSDTDSGGEYKKAKKNLRDNR 300
KNLRD+RR DSESD+D DV+ KY+ SR+ KKNRRHDSDDSS TDSGGE+K KK+ R+ R
Sbjct: 241 KNLRDSRRIDSESDLDIDVNNKYVASRKSKKNRRHDSDDSSGTDSGGEHKVTKKHSRNKR 300
Query: 301 RDDPESDPDSDVDKKYITSRKHKKNRRHDSDDSS-TDSGRDHKGTKKNLREYQRDDHESD 360
+DDPESD DSD+D+KY+TSRKHKKNRRHDSDDSS +DSG +HK TK+++R QR H SD
Sbjct: 301 KDDPESDSDSDLDQKYLTSRKHKKNRRHDSDDSSDSDSGGEHKKTKRSVRSNQRG-HGSD 360
Query: 361 PDSDVDKKYFTSKKQGKSKRHDSDDSDSVTDDDDFGKGRHKKGSGRPKSQKVKKKLGSRK 420
PDSDVDKK+ TSKKQ KS RHDSDDSDS TD D G H+KGSGR +SQKVKK+ S+K
Sbjct: 361 PDSDVDKKH-TSKKQKKSTRHDSDDSDSFTDGDKIGMDSHQKGSGRHESQKVKKQR-SQK 420
Query: 421 QESTDESNSDVGTDDKGRLSRHKNHQGKRRRADSDSSDHDGSDSDVGRNKSKHRYHSRSV 480
Q+STDE+NSD +DK R +HKN GKR +SDSSDHD SDSDVGR KS HR+HS+
Sbjct: 421 QDSTDETNSDSVVEDKHRQLKHKNQHGKRY-GESDSSDHDSSDSDVGRKKSTHRFHSKRT 480
Query: 481 GKDKVDSEFDTEKSRKHPKEDVGRHRHDTDDKESGDFSHSSDEKVERRKSKRYDTDDESN 540
GK +VDSE D EKSRK+PK+D R RHD DD++SGD S SSDE V+RR+ +R+ TDD S
Sbjct: 481 GKSRVDSESDFEKSRKYPKKDDRRRRHDIDDEKSGDNSSSSDELVKRRRGRRHSTDDSSE 540
Query: 541 GGGERFDRKSGKIATKGKIAAKKQYDDSDSSDDSRAVDRKGRNKHERAKKHTIGDGSGLE 600
GE F R SGKI TKGKI AK+Q D S++SD S AVDRKG ++H+RAKK++ GDG LE
Sbjct: 541 EEGEYFGR-SGKITTKGKIDAKRQDDYSNNSDGSLAVDRKGDDEHKRAKKYSSGDGFNLE 600
Query: 601 KGFKSSGGAREGGKGNLNHADGL---------------------------------DEPV 660
KG K S GARE GKGNLNH +G D+
Sbjct: 601 KGRKLSSGARERGKGNLNHPEGRRHNTDDKSEEEGEYLGRSGKMATKRKMDAKRQHDDSE 660
Query: 661 TADDNNSYKSR--------------------------------------KDAIDEFNHAN 720
+DD+ + K + KD+I EFNHA+
Sbjct: 661 NSDDSLAVKHKRAKKYSSSDDSDLEKGVKSTDGARERGKNCADGLDKFKKDSIHEFNHAS 720
Query: 721 QHT--MKSKRKFDEGGENEQR-EAKSRNRNSTRELGFYGDLKKDSKIDSGSNSRAGNNRY 780
Q T M SKRK DEG ENEQ E+KSRNRNS D KKD K DS S+ R+ + RY
Sbjct: 721 QRTDKMNSKRKLDEGRENEQEPESKSRNRNS--------DPKKDFKHDSESSRRSRSGRY 780
Query: 781 DEMRDGWHREDPKVDSESNTRARYSSMHDEDDRSKLDRTGSKYNEETEHGSRHYRKANES 831
DE RDG +RED K+DSESNTR+RYS+ H+EDD K RTGS+Y EETEHGSRH+RKANES
Sbjct: 781 DETRDGRYREDSKIDSESNTRSRYSA-HNEDDDRKSTRTGSRYTEETEHGSRHHRKANES 840
BLAST of MC08g0362 vs. TAIR 10
Match:
AT3G49601.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; CONTAINS InterPro DOMAIN/s: mRNA splicing factor, Cwf21 (InterPro:IPR013170); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 203.8 bits (517), Expect = 6.1e-52
Identity = 222/622 (35.69%), Postives = 313/622 (50.32%), Query Frame = 0
Query: 1 MYNGIGLQTPRGSGTNGYIQTNKFFVRPKT-GKVAESTRGFEEDQGTAGVSKKPNKDILE 60
MYNGIGLQT RGSGTNGY+QTNKFFVRP+ GK + +GFE+D+GTAG+SKKPNK ILE
Sbjct: 1 MYNGIGLQTARGSGTNGYVQTNKFFVRPRNGGKPVKGGKGFEDDEGTAGLSKKPNKAILE 60
Query: 61 HDRKRQIELKLVILEDKLIDQGYTDEEVSEKLKEARKTLEAASDQEEKGGPSAIVLLDKR 120
HDRKRQI LKL ILEDKL DQGY+D E+++KL+EAR +LEAA+ E+ D +
Sbjct: 61 HDRKRQIHLKLAILEDKLADQGYSDIEIAQKLEEARVSLEAAAAANEEES-------DSK 120
Query: 121 ISDTQTHQIAARKEEQMKTLRAALGLGSLDDSEQLKEGISDPLENSREGENSDIKRREKS 180
+S+TQTHQ+AARKE+QM+ RAALGL D + +EGI D E REG +K E+
Sbjct: 121 VSNTQTHQVAARKEKQMEAFRAALGLP--DQQQVAEEGIIDD-EPMREGFEGRLK--ERR 180
Query: 181 EHAFLDRELNWKKHAKEAHNDDKDKKIRVSKESKG------------HKKDRKRRPKDDS 240
EH+FLDR+ KK ++ D+KD K++ SK+ +G KK+ K+R DDS
Sbjct: 181 EHSFLDRDSGRKKVDEDV--DEKDAKVKESKKQRGGDDDDVDVVKRHKKKESKKRRHDDS 240
Query: 241 SDTDSGG---------EHKGTKKNLR-DNRRDDSESDIDSDVDKK-------YITSRRYK 300
S++D G + KG K+ D+ DSESD DSD KK T +R +
Sbjct: 241 SESDEHGRDRRRRSKKKAKGRKQESESDSSSSDSESDSDSDDGKKRGRKKPTKTTKKRSR 300
Query: 301 KNR--RHDSDDSSDTDSGGEYKKAKKNLRDNRRDDPESDPDSDVDKKYITSRKHKKNRRH 360
+ R +S++ DS K KK+L NR E DK SR +K RH
Sbjct: 301 RKRSVSSESEEVESDDSKKLRKSHKKSLPSNRSGSKELR-----DKHDEQSRAGRK--RH 360
Query: 361 DSDDSSTDSGRDHKGTKKNLREYQRDDHESDPDSDVDKKYFTSKKQGKSKRHDSDDSDSV 420
DSD S +S + + +K Y+ + D DV+ + R+ DD +
Sbjct: 361 DSDVSEPESEDNKQPLRKKEEAYRGGQKQKRDDEDVEADHL-------KDRYTRDDKKAA 420
Query: 421 TDDDDFGKGRHKKGSGRPKSQKVKKKLGSRKQESTDESNSDVGTDDKGRLSRHKNHQGKR 480
D DD K R K + + +++E D + G+ + +GK
Sbjct: 421 RDSDDSEIEYQNKKQLRSKVEVYSAGMSQKRKEEEDVTK-------HGKDKYRSDSRGKE 480
Query: 481 RRADSDSSDHDGSDSDVGRNKS--KHRYHSRSVGKDKVDSEFDTEKSRKHPKEDVGRHRH 540
DSD S+ + + +N+S + R H R +D + D + K G +
Sbjct: 481 VARDSDDSEAEYENRKKLKNESYQRGRKHKREEDEDNDNHGRDRYRGDDAVKR-YGTIKE 540
Query: 541 DTDDKESGDFSHSSDEKVERRKSKRYDTDDESNGGGERFDRKSGKIATKGKIAAKKQYDD 589
D D D+ R + +R D+ DR G G+ A K+ DD
Sbjct: 541 DDDRYRGRAIEEEGDDDRGRYRPRRESVKDDEEEYKHGRDRYRG----DGRRATGKEDDD 582
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_022131365.1 | 0.0 | 95.96 | protein starmaker [Momordica charantia] >XP_022131366.1 protein starmaker [Momor... | [more] |
XP_023545728.1 | 0.0 | 65.44 | protein starmaker-like [Cucurbita pepo subsp. pepo] | [more] |
XP_022929608.1 | 0.0 | 65.23 | dentin sialophosphoprotein-like [Cucurbita moschata] | [more] |
XP_022997381.1 | 0.0 | 64.91 | dentin sialophosphoprotein-like [Cucurbita maxima] | [more] |
XP_038884695.1 | 0.0 | 67.46 | dentin sialophosphoprotein-like [Benincasa hispida] >XP_038884696.1 dentin sialo... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1BPI2 | 0.0 | 95.96 | protein starmaker OS=Momordica charantia OX=3673 GN=LOC111004608 PE=4 SV=1 | [more] |
A0A6J1ESM6 | 0.0 | 65.23 | dentin sialophosphoprotein-like OS=Cucurbita moschata OX=3662 GN=LOC111436144 PE... | [more] |
A0A6J1K7B6 | 0.0 | 64.91 | dentin sialophosphoprotein-like OS=Cucurbita maxima OX=3661 GN=LOC111492317 PE=4... | [more] |
A0A5A7VCH8 | 0.0 | 65.16 | Dentin sialophosphoprotein-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_... | [more] |
A0A1S3BBX0 | 0.0 | 64.83 | dentin sialophosphoprotein-like OS=Cucumis melo OX=3656 GN=LOC103488247 PE=4 SV=... | [more] |
Match Name | E-value | Identity | Description | |
AT3G49601.1 | 6.1e-52 | 35.69 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow... | [more] |