Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, CDS, polypeptide, three_prime_UTR
CCGAATGGAAGATTATATTTAATTTTCATAAAATAAAAATAGGAAATGCCACGAAAACCGTTTAGGGAAGGGGGACTGAATGCTCGCCGGATCTCTTTACAGTTTCCCACCGTCGCCGATTTCCCGATCTGTTACCGCCAAACTCAAAACAAAACACTTACAAATAACCCAAAAATCTCCCGCCTTTCAACAAATTTTCAAAAATTCTTCACCAATTTCCCAATTCCCATCCTCTTTCTAGCCCTTCCCCTCCAAATTCCCCCTCCTTTCCTCTTCTTTCTTCTTCTTCTCTAACCCTTGCATGGCGGCTCCGTCGCCAACCCTTAGCAACAACAGCTCTGACACTGCCACCACCACTGCCGGCGCGGTTGCGGCTAACGTCGCTGTTTCCCCCAATCACTTGGCCAACCGTACAGGAACGCCGCCCAAGACTCTCCGCGGCCTTAACAAACCCAAGTGTAGGGTTTGTGGGAATGTTGCTCGTTCCAGGTTTAATAATCTCTTCACTTGCTGTTTCCTCTCCTTTTGACTTGAATTTCATCTCTCATGCGCTTGAAACCCTAGATTTGTTCTGTTTGTGGATTGTTTTCTTCAGCTTAGTATGTATGAGACGGTACTAGATTTGGTTATATTATTTGGCCCTTGCATTAGCTGATTAGCTGATTAGCTGATTTGATATTTGATTTGCAAGTCATTGTTTAGAATCTCCAACAATATTATCCTAAACAGAAATGTGCGTGCAGGTGTCCTTATGAATCATGCAAGAGCTGTTGTGCGAGAAACCAGAACCCATGCTATATTCACGGTAAGAAACCGATCTATCTATTGGCTACACTGTTACTCTGTTTTTGTTATTTATTTCTTGCCATGGGATGTCAGTTTTAGCGAATGTCAATTTGTCTATCATGGATTATTCTTGTTTGTGTGTGTATGTGTGGTGTGTTGAATCTTCACTTCCATCTCTGCCCTTTCTGAGTATAGGGATGGGATTATGCGGTTGTGGTGGGGATACATCCCTTATAACCTTGGAACACTCTAGCTAGAAAAGGAATGAATGATTCTCAGAACACAATATTCACTTAATGTTTCGAAAGGAACTCATATTAGGGACTGAAAGTTGCTATTTTTTTGGTTTGAAAGAGGCTTATGTGTATATGAGAGTGGTGATGTTAAGATTTAAGAAAACTCGAGATGATATTTTCTTCTCTCTATGTAATCACTGTGATCAGTAATTTTTGGTCCACTTAAAGATTTTTATAAATTCTTTGTAGTTAATTGAGGCTTATGTATTGCACCACTCTTAGGATAAAAAATCCTTTACATTGCTTTAAGCCATTTAAAACATTCAAGTTTCATATTTGGATTGATTTCTAGAGGGGTTGAGACTGCCTGGAAGATATATGGTTCCTCGATAGGTTCTATAGTCTCTCTTTGGATTTGATGTACAATGATTTTGTAATTACTCTTTAGGTCTTATTTTGTGAGATCTGATTCCCTTTTTTTGTCTTAGTTTGGCTTTTTTGCTAAACTATTTTGAGGAATGCCTCTTGTATTCTTCTATTTCTTTCTATAGAAGCTAGAATATATATCATCTTTCTTAGTGAAAGATGGATTCACATTCAAAAGCCATCAAGTGCACCTAGGGCCTTTTGCTTTGCTCATCCTTATCGTCTAAGAAGCACATATACTAAGTTTGGCTAGCACGTCTGAGTCCAACATGTGTTGGACGCTTAAGCACTCCGACATTAGTTGGACACATATTGGACACTTGTTGGTGCAATGAATGTGTTAGACACTAATTGTACACAGTGTTTCAAGGCGTGCGCCTTGGCTCGCTTAAGGCGAGAGGTGAGGCAATCTTTGGTTGTTGCGGTTTGAATGCAGCTAAGGCAAGCTCCTTCAACCAGACGCTCGCCTAACTGTTCCCTGTTGTTGCCTTGATATGCCTTTTCCCTAAGATAAGATGAACACCTGTCCTCGTTTCCCTTTTTTTTTCTTTT
GATGTATCAAAACATGAGAAATAAAAAAGCCTAGTGAAAGAGACAAGAAAAAAAGAAGTATAGAAAAAAACAAAGGAATAGAAAGAAGTTCCCTCTTTAATAAAAGAAAAGACAAAGGAATAGAAAGAAAGAAGAAGAGAGTGGTTGGAAGGAGAACAAAAGAACGTAAGTAGTTACAGAAAAGTTGATAATAAATAGAAAGAAGGAAAGGCAGTAGAAGGAGAGTCTAATTCGAGACACAAGTTGGCTGCATGAAACCGCCATATGACATTATTACTAAGGACTAAGGAGAATTTGTGATAGCTTGAAATTTCCATTTGGCAATTTCCTCACATAAGTAATAAATACAGGGCTTTATATTCAGGAGGCTTATAGGGCAAGAAGTTGGTTATCGTAGTCCTAGGTTATTATAGTTGGTTTATAATAGTTTGTGTTTTAGGTGTAGATTATTTTTGTTTAAGTTATAATAGTATGTATGTGAGGTGTAAACTATTTTAAGAAAAAAAAATAATGAATGTAAAATAGTAAAAACGGTAGCAAATAGGAGAGGAATGGGGTTTGAAATATAGGAAAGTATAAAATAGTATTCACTGTAGCAAGAGGTGGTTATTATAGTCTTGAATATGCTTGCCCAAACGGCTCCTTAAAGGTTGCATGGTTAATGGGTAATATTAGGCTTGCCCCAAGGGGTGGTTATTAGCAAATAGGAGGGGAAGGGGGTTTGAAATAATTGCGTAGTTTTCCTTAGAAATGTTTAAACTCATTATATAACTGATTGGGTGGAGTATTTTATGCTTCTCTTTCTCTCTTTTTATTATAAACCGAAGTTTCATTAAAGATTAATTAAACAATACAAATGTTTACTGCAAGAAAAAAGCGAGAAACGAAATAAAAGAAATTACAACAAAAGGGCTGTAATAATTTAGAAAAGTGATGATAGGTAATCACTAAACTCCTTACTCATTGACATCTAAAGGTAGGCATTAGGCTTAATCAAGATTTGCTTCTCTTCAAAAGGGTTTTGTGCCCCTTTTAAAATTCTCCACCTTGCGAATTGGACTTGAAAATTGATCTCTATAAGACCTGCGATCTCTGAGTAACACTAACACCTGCCTAGAGGTTGCATGTTCAAATCTTTGAGTGAGCTTAAATAAGGACAAATCCTCAATATCTCCAAAGTCAGAGTCCTGGGGTGGGTATGCACATTATTTGATTTTATCTATCTATTTATTTTTAATTTCAAAATTGACATATCTATAAAGCGTGCTTCAGTTCAGAAGCACCCTTACATAAGAGGCACAAATGAGGTCCAAGAGATGCAAGAGTTCAATCTTTAGGTAAGCTCAATTATACTGAATCTGTCGTGTATGTATTGCCAGGCAAAAAACTTGACATTATCTGAAATTCTAACCTTCCAAACTGAGGATAAATGTAGGGTGGGGGCTGGGAATTACATAGGGGAGAGGAAAAAGAACTATGAGAAATCTGTGGGATTATGTGGTCCATATACAAACATCTCTAGTCTTGGGTGATGGAGGAAGATGTCATGACCTCACTTGTACAAGATCTGTTCTATTCTATAGATTGTACAACTATTGCCTTGAATTCTAAGAAGATGTTGATCTTCCATGAAAATAAGATTGTACAACTATGCCCTTTAGTTCTAGTTTTGATTTAGAGGAAGTAAAATTCCTCGTGACTTTGATTTTGGTTTCATTCAAAAAACCTCATACCAATGGAGATATTTGGAGGGTTGTGTTAGGATACCATCTAACAAGAAAGTTTCAAGAGCACAAAATTCTCAAAGATAGAAATGTATTGAAATATAAGAGAAGTTACAATATATCATAGCCATTTGAGAGGCTACACTCTCCCAGACTCCTACAAGGGAAATCCCTAACCAAACTCACTACCTCTATTTTTAACTAAAACCCTCAATAAAGTCCTTAGCTAATTACTACGATGCCCTTACTCATAAACATATTAATATTCCTAATATA
ACCATAACTAGGGCCCTTACACGTTGTTGTCTCTAACAAGTGGTATCGGGGTCATGCTTTTGACCTAGCCATGCTGGTAAAATTCTCGAGTGCTAAATAAAGAAGTGTGTCTTGAAAGATATCAATGGTGGAGATTTGTGGTAGAGCTCTACAGGAATAGTGGAATACTAATGACAAGAGGAGTTGACTTGAAGAGCTTGATGGATTGATGGCCATTTGAGGGGAGGTTTGATGGTCATTTGTTCAAGGAGATGATTGTTGGGAATGTGTAAGACCCCTAGTTACGGAGATAGTTTGTAGGAGGAGCATATTAGTAATTAGACAACAATTTGTTAGGGTTTAGTTATAAATAGTAATTTCCTTCGGGAATGTTGGAAAAGTCTAGGGCCTCTTGAAAGGCTGGTGATTATTTCAATATATTTTCATATTTCAGAGTTTTTGAGTAGTTTTGTTCTCGAGTATTTTCTTGTTAGATGGTATCCTAATAGAATGCAATAAAGTCCCACACTGGCTAAATACGATCATTGACTTTTGATGATTTCCTAAGAAGTCTCATAAACTTTTGTGCCTCCTATATTTTATATGATTTATCAATGGTACGTGGAATATTTGTTTGGACAAAATTTTTATTGATGGTCCATCAAGACCTATTAACTTTTAAGATAAGAATTACAACAATTTTCAAAGGTCAAGTTAAATTTGATTCATTGAAAGTTTGTGGTAAAAATTTGATGCGTCTTGTCAAGTTAGTGGTAAAGATTGATCTTTCCCCTTGAAAATTTATCTCATTCATCAAATATAATCTGGGCATTTAGAATAGGAATATCGGCTTATGTACATATTTTGTTGTTGATCTGTCTGCAGACAGTTTTGCCTGGTAGGTTAGGGAAGACTTGTGGTTTCGTTAGTTTGAGCACTTATTATATGATTCTGATATTTTTGATTTTATGCTGCAGTGTTAAAAGCAAATGCGACTTTTCCAGACAAGACACCTTCTTCTAGCTCTCCTCTATTTGACAAGCAATCACCAGATCCATCATCATCTGGGTATGGGAGTTCTAGACACCGTTTGATTTCACTTTATGCCTTTATTTATTTTGTCTTCTCAGCTGCCATGAATTTTGTTCCTGATATTGGTTTTGACAATCTTGCTTATTTTATTTTACTTGAAGTATTTTCATCTGGCAACTGAACAATATACTCGTCATTATATTATTGGTTTAAGATCATGGAAGATTTTACAGCCGTTGTACATTTGTATTTGTGAGTTCTAAAGATCGTGGAGATTTTTTCACTCATTTTTAGGGTCTCGTGAACTTCTTTAAAGTTAGTTTGATAATATTCTTCCCTTCCTGAACCTGATATCAGATTACTTCTCTGGGATCCAACCCCCTCTACAGCCAACTGTTCAACATGTGCTAATTCCATCATGACCATGCTTAATTAGGCTACCATTACCATAGTCTGAATAGCAATATTTAGTCTTTTCTTCTGGCTCTGGGCATTGAGATTTTCATTATTGTTGGTTTTATGTTTCAGGACTTCAAATAGAGTAGCTTCACTTCGACAACTTTCTAGCAACTTCTCCCAATTCAACAATGTGCGACTCCCTATCCGTTCACCAAAGCCATTGACCAGAAAGGTGTTCATAATTTGTCAATTTATCTTCTGCTTTGTAGTTATTGAACACAAGTAAATTTCCTTATTCTAATTTGAGAAGGATTTTAGGATGCCGCCACAATCAATGAATGGAGATTTTCCAAGTTAAGGGAATTTCGGGAGAGGCATATTGAAGCAGAAAATGAAGCTTTTGATCGATACATGAAGAATATTAATCTATTGGAAGAGGTCTTTTCTACGAAATCTATGATTGATGATAGGCCTCTTAAGGACAGACCCCCTGTAAACTCTGGTACAGAAGCCAACCCCGAGGAAATGACTCCTGGGTTGAAGTTAAAGTTAGGATCGACATCAGACAATTCTAGAAAGAGGATACGCAA
AATTGTTGAAGATGGACTAAGAAAAATTAAAATAGTTGAGACATTTGATAACGTCGATGAGGTCACTGACCATGCCCAAGCTGATAGGGGTGAGGACGAGACAAATCTTAATGATGGATGCAAAATGCTAGAAGGTTGGCATGCTAAAAGAACTCGAGCTTTAGGTGATCTCATCGACAAGCTAAACAAAGCCCGAAATGAGGAGGATTTAAAATCTTGCTTGGCAATGAAACATCAGCTTTCTGACCAACACAAAACAACATCCAGTGAAGCAGAATCTGAAGAGACTGACACGTCCAAGGAGCAGCAAGTTATAAAGAAAGATTTAGACTCAAGGAAAGAATTGGGTTTTTCATTGCCAAAGTTGGTTAACAAAACTAACATTGATCAACAAACTCTGAACCAAATTGATGCTCACTTTTCTTCTCTCAAGCAGATAGGCAATCTATGATTTGATGCAGATCTTGTTATTTTTTTTTACCCCTCTATTGACATTAGGTTAAAAATCTCCAAACATGATGCTGTAATGTCCTCATGTAATTTATTCTCTCAACAGCCATTGATGTTTATATGGCAACAGCATTAAATGTGCATTGTCTCTTCACAAAATTCCACACCTCATTTTTCTGATCTTTCCTTATTTGTTATGAAATGATCAGTTGGATCGTGAACCTTGAACTTGTGTTTGAGATTGAAAATTTTCAAAGAAAGATCAGGATATGTGCTAGATCTTTGTGAAAAGGATGTTGAAGAAAAGGAAAAGTTTGTAACATGGGCTGTTTTTTTGGGCCATAAGATTGCAATAAATGGGCC
mRNA sequence
CCGAATGGAAGATTATATTTAATTTTCATAAAATAAAAATAGGAAATGCCACGAAAACCGTTTAGGGAAGGGGGACTGAATGCTCGCCGGATCTCTTTACAGTTTCCCACCGTCGCCGATTTCCCGATCTGTTACCGCCAAACTCAAAACAAAACACTTACAAATAACCCAAAAATCTCCCGCCTTTCAACAAATTTTCAAAAATTCTTCACCAATTTCCCAATTCCCATCCTCTTTCTAGCCCTTCCCCTCCAAATTCCCCCTCCTTTCCTCTTCTTTCTTCTTCTTCTCTAACCCTTGCATGGCGGCTCCGTCGCCAACCCTTAGCAACAACAGCTCTGACACTGCCACCACCACTGCCGGCGCGGTTGCGGCTAACGTCGCTGTTTCCCCCAATCACTTGGCCAACCGTACAGGAACGCCGCCCAAGACTCTCCGCGGCCTTAACAAACCCAAGTGTAGGGTTTGTGGGAATGTTGCTCGTTCCAGGTGTCCTTATGAATCATGCAAGAGCTGTTGTGCGAGAAACCAGAACCCATGCTATATTCACGTGTTAAAAGCAAATGCGACTTTTCCAGACAAGACACCTTCTTCTAGCTCTCCTCTATTTGACAAGCAATCACCAGATCCATCATCATCTGGGACTTCAAATAGAGTAGCTTCACTTCGACAACTTTCTAGCAACTTCTCCCAATTCAACAATGTGCGACTCCCTATCCGTTCACCAAAGCCATTGACCAGAAAGGATGCCGCCACAATCAATGAATGGAGATTTTCCAAGTTAAGGGAATTTCGGGAGAGGCATATTGAAGCAGAAAATGAAGCTTTTGATCGATACATGAAGAATATTAATCTATTGGAAGAGGTCTTTTCTACGAAATCTATGATTGATGATAGGCCTCTTAAGGACAGACCCCCTGTAAACTCTGGTACAGAAGCCAACCCCGAGGAAATGACTCCTGGGTTGAAGTTAAAGTTAGGATCGACATCAGACAATTCTAGAAAGAGGATACGCAAAATTGTTGAAGATGGACTAAGAAAAATTAAAATAGTTGAGACATTTGATAACGTCGATGAGGTCACTGACCATGCCCAAGCTGATAGGGGTGAGGACGAGACAAATCTTAATGATGGATGCAAAATGCTAGAAGGTTGGCATGCTAAAAGAACTCGAGCTTTAGGTGATCTCATCGACAAGCTAAACAAAGCCCGAAATGAGGAGGATTTAAAATCTTGCTTGGCAATGAAACATCAGCTTTCTGACCAACACAAAACAACATCCAGTGAAGCAGAATCTGAAGAGACTGACACGTCCAAGGAGCAGCAAGTTATAAAGAAAGATTTAGACTCAAGGAAAGAATTGGGTTTTTCATTGCCAAAGTTGGTTAACAAAACTAACATTGATCAACAAACTCTGAACCAAATTGATGCTCACTTTTCTTCTCTCAAGCAGATAGGCAATCTATGATTTGATGCAGATCTTGTTATTTTTTTTTACCCCTCTATTGACATTAGGTTAAAAATCTCCAAACATGATGCTGTAATGTCCTCATGTAATTTATTCTCTCAACAGCCATTGATGTTTATATGGCAACAGCATTAAATGTGCATTGTCTCTTCACAAAATTCCACACCTCATTTTTCTGATCTTTCCTTATTTGTTATGAAATGATCAGTTGGATCGTGAACCTTGAACTTGTGTTTGAGATTGAAAATTTTCAAAGAAAGATCAGGATATGTGCTAGATCTTTGTGAAAAGGATGTTGAAGAAAAGGAAAAGTTTGTAACATGGGCTGTTTTTTTGGGCCATAAGATTGCAATAAATGGGCC
Coding sequence (CDS)
ATGGCGGCTCCGTCGCCAACCCTTAGCAACAACAGCTCTGACACTGCCACCACCACTGCCGGCGCGGTTGCGGCTAACGTCGCTGTTTCCCCCAATCACTTGGCCAACCGTACAGGAACGCCGCCCAAGACTCTCCGCGGCCTTAACAAACCCAAGTGTAGGGTTTGTGGGAATGTTGCTCGTTCCAGGTGTCCTTATGAATCATGCAAGAGCTGTTGTGCGAGAAACCAGAACCCATGCTATATTCACGTGTTAAAAGCAAATGCGACTTTTCCAGACAAGACACCTTCTTCTAGCTCTCCTCTATTTGACAAGCAATCACCAGATCCATCATCATCTGGGACTTCAAATAGAGTAGCTTCACTTCGACAACTTTCTAGCAACTTCTCCCAATTCAACAATGTGCGACTCCCTATCCGTTCACCAAAGCCATTGACCAGAAAGGATGCCGCCACAATCAATGAATGGAGATTTTCCAAGTTAAGGGAATTTCGGGAGAGGCATATTGAAGCAGAAAATGAAGCTTTTGATCGATACATGAAGAATATTAATCTATTGGAAGAGGTCTTTTCTACGAAATCTATGATTGATGATAGGCCTCTTAAGGACAGACCCCCTGTAAACTCTGGTACAGAAGCCAACCCCGAGGAAATGACTCCTGGGTTGAAGTTAAAGTTAGGATCGACATCAGACAATTCTAGAAAGAGGATACGCAAAATTGTTGAAGATGGACTAAGAAAAATTAAAATAGTTGAGACATTTGATAACGTCGATGAGGTCACTGACCATGCCCAAGCTGATAGGGGTGAGGACGAGACAAATCTTAATGATGGATGCAAAATGCTAGAAGGTTGGCATGCTAAAAGAACTCGAGCTTTAGGTGATCTCATCGACAAGCTAAACAAAGCCCGAAATGAGGAGGATTTAAAATCTTGCTTGGCAATGAAACATCAGCTTTCTGACCAACACAAAACAACATCCAGTGAAGCAGAATCTGAAGAGACTGACACGTCCAAGGAGCAGCAAGTTATAAAGAAAGATTTAGACTCAAGGAAAGAATTGGGTTTTTCATTGCCAAAGTTGGTTAACAAAACTAACATTGATCAACAAACTCTGAACCAAATTGATGCTCACTTTTCTTCTCTCAAGCAGATAGGCAATCTATGA
Protein sequence
MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVARSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVASLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYMKNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKIVEDGLRKIKIVETFDNVDEVTDHAQADRGEDETNLNDGCKMLEGWHAKRTRALGDLIDKLNKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPKLVNKTNIDQQTLNQIDAHFSSLKQIGNL*
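The protein sequence above appears to be the conceptual translation of the CDS, with the trailing `*` marking the stop codon. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1); the function name `translate` is illustrative and is not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), assumed here.
# Codons are enumerated with bases in the order T, C, A, G,
# first position varying slowest, matching the amino-acid string below.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (length divisible by 3) codon by codon; '*' marks a stop."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    # A KeyError here means the sequence contains a non-ACGT character.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First seven codons of the CDS listed above.
print(translate("ATGGCGGCTCCGTCGCCAACC"))  # MAAPSPT
```

Passing the full CDS string from this record to `translate` should give a result that can be compared directly with the listed protein sequence.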
Homology
BLAST of CSPI03G02070 vs. ExPASy TrEMBL
Match:
A0A0A0L209 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G016970 PE=4 SV=1)
HSP 1 Score: 747.7 bits (1929), Expect = 2.5e-212
Identity = 386/388 (99.48%), Positives = 386/388 (99.48%), Query Frame = 0
Query: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA
Sbjct: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
Query: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA
Sbjct: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
Query: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM
Sbjct: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
Query: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKI 240
KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKI
Sbjct: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKI 240
Query: 241 VEDGLRKIKIVETFDNVDEVTDHAQADRGEDETNLNDGCKMLEGWHAKRTRALGDLIDKL 300
VEDGLRKIKIVET DNVDEVTDHAQADRGEDETN NDGCKMLEGWHAKRTRALGDLIDKL
Sbjct: 241 VEDGLRKIKIVETIDNVDEVTDHAQADRGEDETNPNDGCKMLEGWHAKRTRALGDLIDKL 300
Query: 301 NKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPK 360
NKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPK
Sbjct: 301 NKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPK 360
Query: 361 LVNKTNIDQQTLNQIDAHFSSLKQIGNL 389
LVNKTNIDQQTLNQIDAHFSSLKQIGNL
Sbjct: 361 LVNKTNIDQQTLNQIDAHFSSLKQIGNL 388
BLAST of CSPI03G02070 vs. ExPASy TrEMBL
Match:
A0A5A7UFS7 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold139G001970 PE=4 SV=1)
HSP 1 Score: 686.4 bits (1770), Expect = 6.9e-194
Identity = 357/388 (92.01%), Positives = 372/388 (95.88%), Query Frame = 0
Query: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA
Sbjct: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
Query: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPD SSSGTSNRVA
Sbjct: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDLSSSGTSNRVA 120
Query: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
SLRQLSSNFSQFNNVRLPIRSPKP+TRKDAA INEWRFSKLREFRERHIEAENEAFDRYM
Sbjct: 121 SLRQLSSNFSQFNNVRLPIRSPKPVTRKDAAAINEWRFSKLREFRERHIEAENEAFDRYM 180
Query: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKI 240
KN+NLLEEVFS +SMIDD+ LKD P VNSG EAN EEMTPGLKLKLGSTSDNSRKRIRKI
Sbjct: 181 KNVNLLEEVFSMRSMIDDKSLKDGPSVNSGIEANLEEMTPGLKLKLGSTSDNSRKRIRKI 240
Query: 241 VEDGLRKIKIVETFDNVDEVTDHAQADRGEDETNLNDGCKMLEGWHAKRTRALGDLIDKL 300
VEDGLRKIKIVET DNV+EVTDHAQADR D+ NLNDGCK ++G HAKRTRALGDLI+KL
Sbjct: 241 VEDGLRKIKIVETTDNVNEVTDHAQADRDADQMNLNDGCKTIKGSHAKRTRALGDLIEKL 300
Query: 301 NKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPK 360
NKARNEEDLKSCLAMKHQL ++ +TTSS+AESEETDTSKEQQVIKKDLDSRKELG+SLPK
Sbjct: 301 NKARNEEDLKSCLAMKHQLFNRLQTTSSQAESEETDTSKEQQVIKKDLDSRKELGYSLPK 360
Query: 361 LVNKTNIDQQTLNQIDAHFSSLKQIGNL 389
L+NKTNIDQQTLNQIDAHFSSLKQIGNL
Sbjct: 361 LINKTNIDQQTLNQIDAHFSSLKQIGNL 388
BLAST of CSPI03G02070 vs. ExPASy TrEMBL
Match:
A0A1S3BKF8 (uncharacterized protein LOC103490821 OS=Cucumis melo OX=3656 GN=LOC103490821 PE=4 SV=1)
HSP 1 Score: 686.4 bits (1770), Expect = 6.9e-194
Identity = 357/388 (92.01%), Positives = 372/388 (95.88%), Query Frame = 0
Query: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA
Sbjct: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
Query: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPD SSSGTSNRVA
Sbjct: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDLSSSGTSNRVA 120
Query: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
SLRQLSSNFSQFNNVRLPIRSPKP+TRKDAA INEWRFSKLREFRERHIEAENEAFDRYM
Sbjct: 121 SLRQLSSNFSQFNNVRLPIRSPKPVTRKDAAAINEWRFSKLREFRERHIEAENEAFDRYM 180
Query: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKI 240
KN+NLLEEVFS +SMIDD+ LKD P VNSG EAN EEMTPGLKLKLGSTSDNSRKRIRKI
Sbjct: 181 KNVNLLEEVFSMRSMIDDKSLKDGPSVNSGIEANLEEMTPGLKLKLGSTSDNSRKRIRKI 240
Query: 241 VEDGLRKIKIVETFDNVDEVTDHAQADRGEDETNLNDGCKMLEGWHAKRTRALGDLIDKL 300
VEDGLRKIKIVET DNV+EVTDHAQADR D+ NLNDGCK ++G HAKRTRALGDLI+KL
Sbjct: 241 VEDGLRKIKIVETTDNVNEVTDHAQADRDADQMNLNDGCKTIKGSHAKRTRALGDLIEKL 300
Query: 301 NKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPK 360
NKARNEEDLKSCLAMKHQL ++ +TTSS+AESEETDTSKEQQVIKKDLDSRKELG+SLPK
Sbjct: 301 NKARNEEDLKSCLAMKHQLFNRLQTTSSQAESEETDTSKEQQVIKKDLDSRKELGYSLPK 360
Query: 361 LVNKTNIDQQTLNQIDAHFSSLKQIGNL 389
L+NKTNIDQQTLNQIDAHFSSLKQIGNL
Sbjct: 361 LINKTNIDQQTLNQIDAHFSSLKQIGNL 388
BLAST of CSPI03G02070 vs. ExPASy TrEMBL
Match:
A0A6J1E768 (uncharacterized protein LOC111431225 OS=Cucurbita moschata OX=3662 GN=LOC111431225 PE=4 SV=1)
HSP 1 Score: 584.3 bits (1505), Expect = 3.7e-163
Identity = 314/388 (80.93%), Positives = 340/388 (87.63%), Query Frame = 0
Query: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
MAAPSPT+SNNSSDTA TTA AV N AVS NHLANRT TPPKTLRGLNKPKCRVCGNVA
Sbjct: 1 MAAPSPTVSNNSSDTANTTASAVTVNAAVSSNHLANRTATPPKTLRGLNKPKCRVCGNVA 60
Query: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
RSRCPYESCKSCCARNQNPC+IHVLKANATF DKTPSSSSPLFDKQS DPSSSGTS+RVA
Sbjct: 61 RSRCPYESCKSCCARNQNPCHIHVLKANATFTDKTPSSSSPLFDKQSADPSSSGTSHRVA 120
Query: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
SLRQLSSNFSQFNNV++P+RS KPLTRKDAA INEWRFSKLREFRERHIEAENEAFDRYM
Sbjct: 121 SLRQLSSNFSQFNNVQIPLRSRKPLTRKDAAAINEWRFSKLREFRERHIEAENEAFDRYM 180
Query: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKI 240
KN++LLEEVFS KSM+D LKD P N TE EE+ PGLKL LGS SD+SRKRIR+I
Sbjct: 181 KNVDLLEEVFSMKSMVDG-SLKDGPS-NFSTEGT-EEVIPGLKLSLGSGSDDSRKRIREI 240
Query: 241 VEDGLRKIKIVETFDNVDEVTDHAQADRGEDETNLNDGCKMLEGWHAKRTRALGDLIDKL 300
VEDGL K++ VE DNVDEVTD ++ + D+TN NDGC ++GWHAKR RALGDLIDKL
Sbjct: 241 VEDGLMKLQKVEATDNVDEVTDQVESGKVADQTNRNDGCTTVKGWHAKRARALGDLIDKL 300
Query: 301 NKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPK 360
NKARNEEDLKSCLAMKHQL +QH TSS+ ESEETD KE QVIKKDLDSRKELG+SLPK
Sbjct: 301 NKARNEEDLKSCLAMKHQLFNQH-ITSSQTESEETDVPKE-QVIKKDLDSRKELGYSLPK 360
Query: 361 LVNKTNIDQQTLNQIDAHFSSLKQIGNL 389
L+NKT IDQ+TLN+IDAHFSSLKQI NL
Sbjct: 361 LINKTIIDQETLNRIDAHFSSLKQIDNL 383
BLAST of CSPI03G02070 vs. ExPASy TrEMBL
Match:
A0A6J1DRY6 (uncharacterized protein LOC111022656 OS=Momordica charantia OX=3673 GN=LOC111022656 PE=4 SV=1)
HSP 1 Score: 582.8 bits (1501), Expect = 1.1e-162
Identity = 314/391 (80.31%), Positives = 343/391 (87.72%), Query Frame = 0
Query: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
MAAPSP+ SNNSSDT TTA A ANVAVS NHLANRTGTPPKTLRGLNKPKCRVCGNVA
Sbjct: 1 MAAPSPSASNNSSDTG-TTATAATANVAVSSNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
Query: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTP+SSSPLF+KQS DPSSSGTS RVA
Sbjct: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPTSSSPLFEKQSADPSSSGTSLRVA 120
Query: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
SLRQLS+NFSQFN+V++P+RS KPLTRKDAA INEWRFSKLREFRERHIEA NEAFDRYM
Sbjct: 121 SLRQLSNNFSQFNSVQIPLRSRKPLTRKDAAAINEWRFSKLREFRERHIEAGNEAFDRYM 180
Query: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGS---TSDNSRKRI 240
+N+NLLEEVFS KSMID +KD P VNS EAN EEM GLKLK+GS SDNSRKRI
Sbjct: 181 QNVNLLEEVFSMKSMIDG-SIKDGPSVNSSAEANTEEMVSGLKLKIGSDPIRSDNSRKRI 240
Query: 241 RKIVEDGLRKIKIVETFDNVDEVTDHAQADRGEDETNLNDGCKMLEGWHAKRTRALGDLI 300
++IVEDGLRK+K V+ D VDEVTD A+ D+ D+T+LNDGCK +GW AKR ALGDLI
Sbjct: 241 QQIVEDGLRKLKKVKVTDKVDEVTDQAEPDKVADQTDLNDGCKTAKGWPAKRAIALGDLI 300
Query: 301 DKLNKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFS 360
DKLNKARNEEDLKSCLAMKHQL + H TSS+AESEE D SKE QV+KKDL+SRKELG+S
Sbjct: 301 DKLNKARNEEDLKSCLAMKHQLFNPH-LTSSQAESEEIDMSKE-QVVKKDLESRKELGYS 360
Query: 361 LPKLVNKTNIDQQTLNQIDAHFSSLKQIGNL 389
LPKL+NKTNIDQ+TLN+IDAHFSSLKQI NL
Sbjct: 361 LPKLINKTNIDQETLNRIDAHFSSLKQIDNL 387
BLAST of CSPI03G02070 vs. NCBI nr
Match:
XP_004150175.1 (uncharacterized protein LOC101215791 [Cucumis sativus] >KGN55803.1 hypothetical protein Csa_009752 [Cucumis sativus])
HSP 1 Score: 747.7 bits (1929), Expect = 5.2e-212
Identity = 386/388 (99.48%), Positives = 386/388 (99.48%), Query Frame = 0
Query: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA
Sbjct: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
Query: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA
Sbjct: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
Query: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM
Sbjct: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
Query: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKI 240
KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKI
Sbjct: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKI 240
Query: 241 VEDGLRKIKIVETFDNVDEVTDHAQADRGEDETNLNDGCKMLEGWHAKRTRALGDLIDKL 300
VEDGLRKIKIVET DNVDEVTDHAQADRGEDETN NDGCKMLEGWHAKRTRALGDLIDKL
Sbjct: 241 VEDGLRKIKIVETIDNVDEVTDHAQADRGEDETNPNDGCKMLEGWHAKRTRALGDLIDKL 300
Query: 301 NKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPK 360
NKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPK
Sbjct: 301 NKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPK 360
Query: 361 LVNKTNIDQQTLNQIDAHFSSLKQIGNL 389
LVNKTNIDQQTLNQIDAHFSSLKQIGNL
Sbjct: 361 LVNKTNIDQQTLNQIDAHFSSLKQIGNL 388
BLAST of CSPI03G02070 vs. NCBI nr
Match:
XP_008448750.1 (PREDICTED: uncharacterized protein LOC103490821 [Cucumis melo] >KAA0053076.1 uncharacterized protein E6C27_scaffold344G001940 [Cucumis melo var. makuwa] >TYK11531.1 uncharacterized protein E5676_scaffold139G001970 [Cucumis melo var. makuwa])
HSP 1 Score: 686.4 bits (1770), Expect = 1.4e-193
Identity = 357/388 (92.01%), Positives = 372/388 (95.88%), Query Frame = 0
Query: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA
Sbjct: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
Query: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPD SSSGTSNRVA
Sbjct: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDLSSSGTSNRVA 120
Query: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
SLRQLSSNFSQFNNVRLPIRSPKP+TRKDAA INEWRFSKLREFRERHIEAENEAFDRYM
Sbjct: 121 SLRQLSSNFSQFNNVRLPIRSPKPVTRKDAAAINEWRFSKLREFRERHIEAENEAFDRYM 180
Query: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKI 240
KN+NLLEEVFS +SMIDD+ LKD P VNSG EAN EEMTPGLKLKLGSTSDNSRKRIRKI
Sbjct: 181 KNVNLLEEVFSMRSMIDDKSLKDGPSVNSGIEANLEEMTPGLKLKLGSTSDNSRKRIRKI 240
Query: 241 VEDGLRKIKIVETFDNVDEVTDHAQADRGEDETNLNDGCKMLEGWHAKRTRALGDLIDKL 300
VEDGLRKIKIVET DNV+EVTDHAQADR D+ NLNDGCK ++G HAKRTRALGDLI+KL
Sbjct: 241 VEDGLRKIKIVETTDNVNEVTDHAQADRDADQMNLNDGCKTIKGSHAKRTRALGDLIEKL 300
Query: 301 NKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPK 360
NKARNEEDLKSCLAMKHQL ++ +TTSS+AESEETDTSKEQQVIKKDLDSRKELG+SLPK
Sbjct: 301 NKARNEEDLKSCLAMKHQLFNRLQTTSSQAESEETDTSKEQQVIKKDLDSRKELGYSLPK 360
Query: 361 LVNKTNIDQQTLNQIDAHFSSLKQIGNL 389
L+NKTNIDQQTLNQIDAHFSSLKQIGNL
Sbjct: 361 LINKTNIDQQTLNQIDAHFSSLKQIGNL 388
BLAST of CSPI03G02070 vs. NCBI nr
Match:
XP_038906244.1 (uncharacterized protein LOC120092108 [Benincasa hispida])
HSP 1 Score: 613.2 bits (1580), Expect = 1.5e-171
Identity = 326/391 (83.38%), Positives = 351/391 (89.77%), Query Frame = 0
Query: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
MAAPSPT+SNNSSDTA+ AV AN AVS NHLANRTGTPPKTLRGLNKPKCRVCGNVA
Sbjct: 1 MAAPSPTVSNNSSDTAS----AVTANAAVSSNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
Query: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTP+SSSPLFDKQS DPSSSGT +RVA
Sbjct: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPTSSSPLFDKQSSDPSSSGTLHRVA 120
Query: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
SLRQLS+NFSQFNNV++P+RS KPLTRKDAA INEWRFSKLREFRER+IEAENEAFDRYM
Sbjct: 121 SLRQLSNNFSQFNNVQIPLRSRKPLTRKDAAAINEWRFSKLREFRERNIEAENEAFDRYM 180
Query: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGS---TSDNSRKRI 240
+N+NLLEEVFS +SMID LKD P +NS TEANPEEM PGLKLKLGS TSD SRKRI
Sbjct: 181 QNVNLLEEVFSMRSMIDG-SLKDGPSINSTTEANPEEMIPGLKLKLGSNPVTSDISRKRI 240
Query: 241 RKIVEDGLRKIKIVETFDNVDEVTDHAQADRGEDETNLNDGCKMLEGWHAKRTRALGDLI 300
++IVED LRK K VE DN+DEVTDHA+AD D+TNLNDG K ++GWHAKR RALGDLI
Sbjct: 241 QEIVEDRLRKFKKVEATDNIDEVTDHAEADETADQTNLNDGFKTVKGWHAKRARALGDLI 300
Query: 301 DKLNKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFS 360
DKLNKARNEEDLKSCLAMKHQL + HKTTSS+ ESEETD SKE QVIKKDLDSRKELG+S
Sbjct: 301 DKLNKARNEEDLKSCLAMKHQLFNPHKTTSSQTESEETDISKE-QVIKKDLDSRKELGYS 360
Query: 361 LPKLVNKTNIDQQTLNQIDAHFSSLKQIGNL 389
LPKL+NKTNIDQ+TLNQIDAHFSSLKQIG L
Sbjct: 361 LPKLINKTNIDQETLNQIDAHFSSLKQIGTL 385
BLAST of CSPI03G02070 vs. NCBI nr
Match:
KAG6577645.1 (hypothetical protein SDJN03_25219, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 585.1 bits (1507), Expect = 4.4e-163
Identity = 314/388 (80.93%), Positives = 341/388 (87.89%), Query Frame = 0
Query: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
MAAPSPT+SNNSSDTATTTA AV N AVS NHLANRT TPPKTLRGLNKPKCRVCGNVA
Sbjct: 1 MAAPSPTVSNNSSDTATTTASAVTVNAAVSSNHLANRTATPPKTLRGLNKPKCRVCGNVA 60
Query: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
RSRCPYESCKSCCARNQNPC+IHVLKANATF DKTP+SSSPLFDKQS DPSSSGTS+RVA
Sbjct: 61 RSRCPYESCKSCCARNQNPCHIHVLKANATFTDKTPTSSSPLFDKQSSDPSSSGTSHRVA 120
Query: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
SLRQLSSNFSQFNNV++P+RS KPLTRKDAA INEWRFSKLREFRERH+EAENEAFDRYM
Sbjct: 121 SLRQLSSNFSQFNNVQIPLRSRKPLTRKDAAAINEWRFSKLREFRERHVEAENEAFDRYM 180
Query: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKI 240
KN+NLLEEVFS KSM+D LKD P N TE EE+ PGLKL LGS SD+SRKRIR+I
Sbjct: 181 KNVNLLEEVFSMKSMVDG-SLKDGPS-NFCTEGT-EEVIPGLKLSLGSGSDDSRKRIREI 240
Query: 241 VEDGLRKIKIVETFDNVDEVTDHAQADRGEDETNLNDGCKMLEGWHAKRTRALGDLIDKL 300
VEDGL K++ VE DNVDEVTD ++ + D+TN NDGC ++GWHAKR RALGDLIDKL
Sbjct: 241 VEDGLMKLQKVEATDNVDEVTDQVESGKVADQTNRNDGCTTVKGWHAKRARALGDLIDKL 300
Query: 301 NKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPK 360
NKARNEEDLKSCLAMKHQL +QH TSS+ ESEETD KE QVIKKDLDSRKELG+SLPK
Sbjct: 301 NKARNEEDLKSCLAMKHQLFNQH-ITSSQTESEETDMPKE-QVIKKDLDSRKELGYSLPK 360
Query: 361 LVNKTNIDQQTLNQIDAHFSSLKQIGNL 389
L+NKT IDQ+TLN+IDAHFSSLKQI NL
Sbjct: 361 LINKTIIDQETLNRIDAHFSSLKQIDNL 383
BLAST of CSPI03G02070 vs. NCBI nr
Match:
XP_022923586.1 (uncharacterized protein LOC111431225 [Cucurbita moschata])
HSP 1 Score: 584.3 bits (1505), Expect = 7.6e-163
Identity = 314/388 (80.93%), Positives = 340/388 (87.63%), Query Frame = 0
Query: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
MAAPSPT+SNNSSDTA TTA AV N AVS NHLANRT TPPKTLRGLNKPKCRVCGNVA
Sbjct: 1 MAAPSPTVSNNSSDTANTTASAVTVNAAVSSNHLANRTATPPKTLRGLNKPKCRVCGNVA 60
Query: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
RSRCPYESCKSCCARNQNPC+IHVLKANATF DKTPSSSSPLFDKQS DPSSSGTS+RVA
Sbjct: 61 RSRCPYESCKSCCARNQNPCHIHVLKANATFTDKTPSSSSPLFDKQSADPSSSGTSHRVA 120
Query: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
SLRQLSSNFSQFNNV++P+RS KPLTRKDAA INEWRFSKLREFRERHIEAENEAFDRYM
Sbjct: 121 SLRQLSSNFSQFNNVQIPLRSRKPLTRKDAAAINEWRFSKLREFRERHIEAENEAFDRYM 180
Query: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKI 240
KN++LLEEVFS KSM+D LKD P N TE EE+ PGLKL LGS SD+SRKRIR+I
Sbjct: 181 KNVDLLEEVFSMKSMVDG-SLKDGPS-NFSTEGT-EEVIPGLKLSLGSGSDDSRKRIREI 240
Query: 241 VEDGLRKIKIVETFDNVDEVTDHAQADRGEDETNLNDGCKMLEGWHAKRTRALGDLIDKL 300
VEDGL K++ VE DNVDEVTD ++ + D+TN NDGC ++GWHAKR RALGDLIDKL
Sbjct: 241 VEDGLMKLQKVEATDNVDEVTDQVESGKVADQTNRNDGCTTVKGWHAKRARALGDLIDKL 300
Query: 301 NKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPK 360
NKARNEEDLKSCLAMKHQL +QH TSS+ ESEETD KE QVIKKDLDSRKELG+SLPK
Sbjct: 301 NKARNEEDLKSCLAMKHQLFNQH-ITSSQTESEETDVPKE-QVIKKDLDSRKELGYSLPK 360
Query: 361 LVNKTNIDQQTLNQIDAHFSSLKQIGNL 389
L+NKT IDQ+TLN+IDAHFSSLKQI NL
Sbjct: 361 LINKTIIDQETLNRIDAHFSSLKQIDNL 383
BLAST of CSPI03G02070 vs. TAIR 10
Match:
AT1G32730.1 (unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 12 plant structures; EXPRESSED DURING: 6 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF702 (InterPro:IPR007818); Has 120 Blast hits to 118 proteins in 39 species: Archae - 0; Bacteria - 8; Metazoa - 63; Fungi - 4; Plants - 33; Viruses - 0; Other Eukaryotes - 12 (source: NCBI BLink). )
HSP 1 Score: 240.7 bits (613), Expect = 1.9e-63
Identity = 159/392 (40.56%), Positives = 222/392 (56.63%), Query Frame = 0
Query: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
MA SP+LSNN + TPPKTLRGLNKPKC CGNVA
Sbjct: 1 MATSSPSLSNNGLSSVV----------------------TPPKTLRGLNKPKCIQCGNVA 60
Query: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
RSRCP++SCK CC+R +NPC IHVLK +T +KT + S+P ++++ + + G++ RV+
Sbjct: 61 RSRCPFQSCKGCCSRAENPCPIHVLKVASTSGEKTQAPSTPSSEQKATE-GTPGSTTRVS 120
Query: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
S+RQLSSNF+QFNN+ R KPLT KDA +NEWRF+KL+E+R+R+IE ENEAFDRYM
Sbjct: 121 SIRQLSSNFAQFNNLNASSRQRKPLTIKDAQALNEWRFTKLKEYRDRNIEVENEAFDRYM 180
Query: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEE-MTPGLKLKLGSTS---DNSRKR 240
N+NLLEE FS S+ D+ P E N EE + LKL+L S S ++ +KR
Sbjct: 181 SNVNLLEEAFSFTSVPDEESHGTAAP-----EQNKEENIVSELKLRLRSNSARTESFKKR 240
Query: 241 IRKIVEDGLRKIKIVETFDNVDEVTDHAQADRGEDETNLNDGCKMLEGWHAKRTRALGDL 300
I + V+ GL K+K ++ + D+ D + + + W K + AL ++
Sbjct: 241 IAETVKAGLVKLKRLDLGSSSDDQDDIKRRVKRKK-------------WEEKGS-ALNEI 300
Query: 301 IDKLNKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGF 360
IDKLNKAR EEDLKSCL MK +L Q T++ +++
Sbjct: 301 IDKLNKARTEEDLKSCLEMKSKLCGQVSPTAASEKNK----------------------- 327
Query: 361 SLPKLVNKTNIDQQTLNQIDAHFSSLKQIGNL 389
P +V K + ++ L +I + S ++G L
Sbjct: 361 IFPGVVRKVEMSEEALQKIAENLQSFDKVGML 327
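The percentages in the HSP header lines above are the matched-position count divided by the alignment length, shown as `matched/aligned (pct%)`. The following is a rough sketch, assuming that header layout, which recomputes each percentage from its fraction; the helper name `parse_hsp_stats` is hypothetical and not part of BLAST or of this database.

```python
import re

# Matches "<label> = <matched>/<aligned> (<pct>%)" for any label word,
# so fields without a fraction (such as the query frame) are skipped.
HSP_FIELD = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def parse_hsp_stats(line: str) -> dict:
    """Return reported and recomputed percentages for each fraction field."""
    stats = {}
    for label, matched, aligned, reported in HSP_FIELD.findall(line):
        stats[label] = {
            "matched": int(matched),
            "aligned": int(aligned),
            "reported_pct": float(reported),
            "computed_pct": round(100 * int(matched) / int(aligned), 2),
        }
    return stats


header = "Identity = 357/388 (92.01%), Positives = 372/388 (95.88%), Query Frame = 0"
for label, values in parse_hsp_stats(header).items():
    print(label, values["computed_pct"])
```

Because the aligned length includes gap columns, hits with gaps (for example the 391-column and 392-column alignments above) are divided by a length greater than the 388-residue query.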
The following BLAST results are available for this feature:
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A0A0L209 | 2.5e-212 | 99.48 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G016970 PE=4 SV=1
A0A5A7UFS7 | 6.9e-194 | 92.01 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3BKF8 | 6.9e-194 | 92.01 | uncharacterized protein LOC103490821 OS=Cucumis melo OX=3656 GN=LOC103490821 PE=...
A0A6J1E768 | 3.7e-163 | 80.93 | uncharacterized protein LOC111431225 OS=Cucurbita moschata OX=3662 GN=LOC1114312...
A0A6J1DRY6 | 1.1e-162 | 80.31 | uncharacterized protein LOC111022656 OS=Momordica charantia OX=3673 GN=LOC111022...
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_004150175.1 | 5.2e-212 | 99.48 | uncharacterized protein LOC101215791 [Cucumis sativus] >KGN55803.1 hypothetical ...
XP_008448750.1 | 1.4e-193 | 92.01 | PREDICTED: uncharacterized protein LOC103490821 [Cucumis melo] >KAA0053076.1 unc...
XP_038906244.1 | 1.5e-171 | 83.38 | uncharacterized protein LOC120092108 [Benincasa hispida]
KAG6577645.1 | 4.4e-163 | 80.93 | hypothetical protein SDJN03_25219, partial [Cucurbita argyrosperma subsp. sorori...
XP_022923586.1 | 7.6e-163 | 80.93 | uncharacterized protein LOC111431225 [Cucurbita moschata]
TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G32730.1 | 1.9e-63 | 40.56 | unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 12 plant structures; EXP...