Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
CGAAAACCGTTTAGGGAAGGGGGACTGAATGCTCGCCGGATCTCTTTACAGTTTCCCACCGTCGCCGATTTCCCGATCTGTTACCGCCAAACTCAAAACAAAACACTAACAAATAACCCAAAAATCTCCCGCCTTTCAACAAATTTTCAAAAATTCTTCACCAATTTCCCAATTCCCATCCTCTTTCTAGCCCTTCCCCTCCAAATCCCCCCTCCTTTCCTCTTCTTTCTTCTTCTTCTCTAACCCTTGCATGGCGGCTCCGTCGCCAACCCTTAGCAACAACAGCTCTGACACTGCCACCACCACTGCCGGCGCGGTTGCGGCTAACGTCGCTGTTTCCCCCAATCACTTGGCCAACCGTACAGGAACGCCGCCCAAGACTCTCCGCGGCCTTAACAAACCCAAGTGTAGGGTTTGTGGGAATGTTGCTCGTTCCAGGTTTAATAATCTCTTCACTTGCTGCTTCCTCTCCTTTTGACTTGAATTTCATCTCTCATGCGCTTGAAACCCTAGATTTGTTTTGTTTGTGGATTGTTTTCTTCAGCTTAGTATGTATAGGACGGTACTTGGGAAATTTTCGAATTTAACGTCTACTTGTCTTACATTTTCTCATAAGTAGATTTGGTTATATTATTTGGCCCTTGCATTAGCTGATTAGCTGATTTGATATTTGATTTGCAAGTCATTGTTTAGAATCTCCAACAATATTATCCTAAACAGAAATGTGCGTGCAGGTGTCCTTATGAATCATGCAAGAGCTGTTGTGCGAGAAACCAGAACCCATGCTATATTCACGGTAAGAAACTGATCTATCTATTGGCTACACTGTTACTCTGTTTTTGTTATTTATTTCTTGCCATGGGATATCAGTTTTAGCGAATGTCAATTTGTCTATCATGGATTATTCGTGTTTGTGTGTGTATGTGTGGTGTGTTGAATCTTCACTTCCATCTCTGCCCTTTCTGAGTATAGGGATGGGATTATGCGGTTGTGGTGGGGATACATCCCTTATAACCTTGGAACACTCTAGCTAGAAAAGGAACGAATGATTCTCAGAACACAATATTCACTTAATGTTTCGAAAGGAACTCATATTAGGGACTGAAAGTTGCTATTTTTTTGGTTTGAAAGAGGCTTATGTGTATATGAGAGTGGTGATGTTAAGATTTAAGAAAACTCGAGATGATATTTTCTTCTCTCTATGTAATCACTGTGATCAGTAATTTTTGGTCCACTTAAAGATTTTTATAAATTCTTTGTAGTTAAATTGAGGCTTATGTATTGCACCACTCTTAGGATAAAAAATCCTTTACATTGCTTTAAGCCATTTAAAACATTCAAATTTTATATTTGGATTGATTTCTAGAGGGGTTGAGACTGCCTGGAAGATATATGGTTCCTCGATAGGTTCTATAGTCTCTCTTTGGATTTGATGTACAATGATTTTGTAATTACTCTTTAGGTCTTATTTTGTGAGATCTGATTCCCTTTTTTTGTTTTAGTTTGGCTTTTTTGCTAAACTATTTTGAGGAATGCCTCTTGTATGCTTCTATTTCTTTCTATAGAAGCTAGAATATATATCATCTTTCTTGTGAAAGATGGATTTACATTCAAAAGCCATCAAGTGCACCTAGGGCCTTTTGCTTTGCTCATCCTTATCGTCTAAGAAGCACATATACTAAGTTTGGCTAGCGCGTCTGAGTCCAACATGTGTTGGACGCTTGAGCACTCCGACATTAGTTGGACACATATTGGACACTTGTTGGTGCAATGAATGTGTTAGACACTAATTGTACACAGTGTTTCAAGGCGTGCGCCTTGGGTCGCTTAAGGCGAGAGGTGAGGCAATCTTTGGTTGTTGCGGTTTGAATGCAGCTAAGGCAAGCTCCTTCAACCAGACGCTCGCCTAACTGTTCCCTGTTGTTGCCTTGATATGCCTTTTCCCTAAGATAAGATCAACACCTGTCCTCGTTTCCCTTTTTTTTTTCTTTTGATGTATC
AAAACATGAGAAATAAAAAAGCCTAGTGAAAGAGACAAGAAAAAAAGAAGTATAGAAAAAAACAAAGGAATAGAAAGAAGTTCCCTCTTTAATAAAAGAAAAGACAAAGGAATAGAAAGAAAGAAGAAGAGAATGGTTGGAAGGAGAACAAAAGAACGTAAATAGTTACAGAAAAGTTGATAATAAATAGAAAGAAGAAGGAAAGGCAGTAGAAGGAGAGTCTAATTCGAGACACAAGTTGGCTGCATGAAACCGCCATATGACATTATTACTAAGAACTAAGGAGAATTTGTGATAGCTTGAAATTTCCATTTGGCAATTTCCTCACATAAGTAATAAATACAGGGCTTTATATTCAGGAGGCTTATAGGGCAAGAAGTTGGTTATCGTAGTCCTAGGTTATTATAGTTGGTTTATAATAGTTTGTGTTTTAGGTGTAGATTATTTTTGTTTAAGTTATAATAGTATGTATGTGAGGTGTAAACTATTTTAAGAAAAAGAAATAATGAATGTAAAATAGTAAAAACGGTAGCAAATAGGAGAGGAATGGGGTTTGAAATATAGGAAAGTATAAAATAGTATTCACTGTAGCAAGAGATGGTTATTATAGTCTTGAATATGCTTGCCCAAACGGCTCCTTAAAGGTTGCATGGTTAATGGGTAATATTAGGCTTGCCCCAAGGGGTGGTTATTAGCAAATAGGAGGGAAAGGGGGTTTGAAATAATTGCGTAGTTTTCCTTAGAAATGTTTAAACTCATTATATAACTGATTGGGTGGAGTATTTTATGCTTCTCTTTCTCTCTTTTTATTATAAACCGAAGTTTCATTAAAGATTAATTAAAAAATACAAATGTTTACTGCAAGAAAAAAGCGAGAAACGAAATAAAAGAAATTACAACAAAAGGGCTGTAATAATTTAGAAAAGTGATGATAGGTAATCACTAAACTCCTTACTCATTGACATCTAAAGGTAGGCATTAGGCTTAATCAAGATTTGCTTCTCTTCAAAAGGGCTTTGTGCCCCTTTTAAAATTCCCCACCTTGCAAATTGGACTTGAAAATTGATCTCTATAAGACCTGCGATCTCTGAGTAACACTAACACCTGCCTAGAGGTTGCATGTTCAAAGCTTTGAGTGAGCTTAAATAAGGACAAATCCTCAATATCTCCAAAGTCAGAGTCCTGGGGTGGGTATGCACATTATTTTATTTTATCTATCTATTTATTTTTAATTTCAAAATTGACATCTCTATAAAGCGTGCTTCAGTTCAGAAGCACCCTTACATAAGAGGCACAAATGAGGTCCAAGAGATGCAAGAGTTCAATCTTTAGGTAAGCTCAATTATACTGAATCTGTCGTGTATGTATTGCCAGGCAAAAAACTTGACATTATCTGAAATTCTAACCTTCCAAACTGAGGATAAATGTAGGGTGGGGGCTGGGAATTACATAGGGGAGAGGAAAAAGAACTATGAGAAATCTGTGGGATTAAGTGGTCCATATACAAACATCTCTAGTCTTGGGTGATGAAGGAAGATGTCATGACCTCACTTGTACAAGATCTGTTCTATTCTATAGATTGTACAACTATTGCCTTGAATTCTAAGAAGATGTTGATCTTCCATGAAAATAAGATTGTACAACTATGCCCTTTAGTTCTAGTTTTGATTTAGAGGAAGTAAAATTCCTCGTGACTTTGATTTTGGTTTCATTCAAAAAACCTCATACCAATGGAGATATTTGGAGGGTTGTGTTAGGATACCATCTAACAAGAAAGTTTCAAGAGCACAAAATTCTCAAAGATAAAAATGTATTGAAATATAAGAGAAGTTACAATATATCATAGCCATTTGAGAGGCTACACTCTCCCAGACTCCTACAAGGGAAATCCCTAACCAAACTCACTACCTCTATTTATAACTAAAACCGTCAATAAAGTCCTTAGCTAATTACTAATATGCCCTTACTCATAAACATATTAATATTCCTAATATAACCAT
AACTAGGGCCCTTACACGTTGTTGTCTCTAACAAGTGGTATCGGGGTCATGCTTTTGACCTAGCCATGCTGGTAAAATTCTCGAGTGCTAAATAAAGAAGTGTGTCTTGAAAGATAGCTATGGTGGAGATTTGTGATAGAGCTCTACAGGAATAGTGGAATACTAATGACAACAGGAGTTGACTTGAAGAGCTTGATGGATTGATGGCCATTTGAGGGGAGGTTTGATGGTCATTTGTTCAAGGAGATGATTGTTGGGAATGTGTAAGACCCCTAGTTACGGAGATAGTTTGTAGGAGGAGCATATTAGTAATTAGACAACAATTTGTTAGGGTTTAGTTATAAATAGTAATTTCCTTCGGGAATGTTGGAAAAGTCTAGGGCCTCTTGAAAGGCTGGTGATTATTTCAATATATTTTCATATTTCAGAGTTTTTGAGTAGTTTTGTTCTCGAGTATTTTCTTGTTAGATGGTATCCTAATAGAATGTAATAAAGTTCCACACTGGCTAAATACGATCATTGACTTTTGATGATTTCCTAAGAAGTCTCATCAACTTTTGTGCCTCCTATATTTTATATGATTTATCAATGGTACATGGAATATTTGTTTGGACAAAATTTTTATTGATGGTCCATCAAGACCTATTAACTTTTAAGATAAGAATTACAACAATTTTCAAAGGTCAAGTTAAATTTGATTCATTGAAAGTTTGTGGTAAAAATTTGATGCGTCTTGTCAAGTTAGTGGTAAAGATTGATCTTTCCCCTTGAAAATTTATCTCATTCATCAAATATAATCTGGGCATTTAGAATAGGAATATCGGCTTATGTACATATTTTGTTGTTGATCTGTCTGCAGACAGTTTTGCCTGGTAGGTTAGGGAAGACTTGTGGTTTTGTTAGTTTGAGGACTTATTATATGATTCTGATATTTTTGATTTTATGCTGCAGTGTTAAAAGCAAATGCGACTTTTCCAGACAAGACACCTTCTTCTAGCTCTCCTCTATTTGACAAGCAATCACCAGATCCATCATCATCTGGGTATGGGAGTTCTAGACACCGTTTGATTTCACTTTATGCCTTTATTTATTTTGTCTTCTCAGCTGCCATGAATTTTGTTCCTGATATTGGTTTTGACAATCTTGCTTATTTTATTTAACTTGAAGTCTTTTCATCTGGCAACTGAACAATATACTCGTCATTATATTATTGGTTTAAGATCAAGGAAGATTTTACAGCCGTTGTACATTTGTATTTGTGAGTTCTAAAGATCGTGGAGATTTTTTCACTCATTTTTAGGGTCTTGTGAACTCCTTTAAAGTTAGTTTGATAATATTCTTCCCTTCCTGAACCTGATATCAGATTACTTCTCTGGGATCCAACCCCCTACAGCCAACTGTTCAACATGTGCTAATCATGACCAGGCTTAATTAGGCTACCATTACCATAGTCTGAATAGCAATATTTAGGCTTTTCTTCTGGCTCTGGGCATTGAGATTTTCATTTTTGTTGGTTTTATGTTTCAGGACTTCAAATAGAGTAGCTTCACTTCGACAACTTTCTAGCAACTTCTCCCAATTCAACAATGTGCGACTCCCTATCCGTTCACCAAAGCCATTGACCAGAAAGGTGTTTCATAATTTGTCAATTTATCTTCTGCTTTGTAGTTATCGAACACAAGTAAATTTCCTTATTCTAATTTGAGAAGGATTTTAGGATGCCGCCACAATCAATGAATGGAGATTTTCCAAGTTAAGGGAATTTCGGGAGAGGCATATTGAAGCAGAAAATGAAGCTTTTGATCGATACATGAAGAATATTAATCTATTGGAAGAGGTCTTTTCTACAAAATCTATGATTGATGATAGGCCTCTTAAGGACAGACCCCCTGTAAACTCTGGTACAGAAGCCAACCCCGAGGAAATGACACCTGGGTTGAAGTTAAAGTTAGGATCGACATCAGACAATTCTAGAAAGAGGATACGCAAAATTGTTGAAG
ATGGACTAAGAAAAATTAAAATAGTCGAGACAATTGATAACGTCGATGAGGTCACTGACCATGCCCAAGCTGATAGGGGTGAGGACGAGACAAATCCTAATGATGGATGCAAAATGCTAGAAGGTTGGCATGCTAAAAGAACTCGAGCTTTAGGTGATCTCATCGACAAGCTAAACAAAGCCCGAAATGAGGAGGATTTAAAATCTTGCTTGGCAATGAAACATCAGCTTTCTGACCAACACAAAACAACATCCAGTGAAGCAGAATCTGAAGAGACTGACACGTCCAAGGAGCAGCAAGTTATAAAGAAAGATTTAGACTCAAGGAAAGAATTGGGTTTTTCATTGCCAAAGTTGGTTAACAAAACTAACATTGATCAACAAACTCTGAACCAAATTGATGCTCACTTTTCTTCTCTCAAGCAGATAGGCAATCTATGATTTGATGCAGATCTTGTTATTTTTTTTTACCCCTCTATTGACATTAGGTTAAAAATCTCCAAACATGATGCTGTAATGTCCTCATGTAATTTATTCTCTCAACAGCCATTGATGTTTATATGGCAACAGCATTAAATGTGCATTGTCTCTTCACAAAATTCCACACCTCATTTTTCTGATCTTTCCTTATTTGTTATGAAATGATCAGTTGGATCGTGAACCTTGAACTTGTGTTTGAGATTGAAAATTTTCAAAGAAAGATCAGGATATGTGCTAGATCTTTGTGAAAAGGATGTTGAAGAAAAGGAAAAGTTTGTAACATGGGCTGTTTTTTTGGGCCATAAGATTGCAATAAATGGGCCTAAAAT
mRNA sequence
CGAAAACCGTTTAGGGAAGGGGGACTGAATGCTCGCCGGATCTCTTTACAGTTTCCCACCGTCGCCGATTTCCCGATCTGTTACCGCCAAACTCAAAACAAAACACTAACAAATAACCCAAAAATCTCCCGCCTTTCAACAAATTTTCAAAAATTCTTCACCAATTTCCCAATTCCCATCCTCTTTCTAGCCCTTCCCCTCCAAATCCCCCCTCCTTTCCTCTTCTTTCTTCTTCTTCTCTAACCCTTGCATGGCGGCTCCGTCGCCAACCCTTAGCAACAACAGCTCTGACACTGCCACCACCACTGCCGGCGCGGTTGCGGCTAACGTCGCTGTTTCCCCCAATCACTTGGCCAACCGTACAGGAACGCCGCCCAAGACTCTCCGCGGCCTTAACAAACCCAAGTGTAGGGTTTGTGGGAATGTTGCTCGTTCCAGGTGTCCTTATGAATCATGCAAGAGCTGTTGTGCGAGAAACCAGAACCCATGCTATATTCACGTGTTAAAAGCAAATGCGACTTTTCCAGACAAGACACCTTCTTCTAGCTCTCCTCTATTTGACAAGCAATCACCAGATCCATCATCATCTGGGACTTCAAATAGAGTAGCTTCACTTCGACAACTTTCTAGCAACTTCTCCCAATTCAACAATGTGCGACTCCCTATCCGTTCACCAAAGCCATTGACCAGAAAGGATGCCGCCACAATCAATGAATGGAGATTTTCCAAGTTAAGGGAATTTCGGGAGAGGCATATTGAAGCAGAAAATGAAGCTTTTGATCGATACATGAAGAATATTAATCTATTGGAAGAGGTCTTTTCTACAAAATCTATGATTGATGATAGGCCTCTTAAGGACAGACCCCCTGTAAACTCTGGTACAGAAGCCAACCCCGAGGAAATGACACCTGGGTTGAAGTTAAAGTTAGGATCGACATCAGACAATTCTAGAAAGAGGATACGCAAAATTGTTGAAGATGGACTAAGAAAAATTAAAATAGTCGAGACAATTGATAACGTCGATGAGGTCACTGACCATGCCCAAGCTGATAGGGGTGAGGACGAGACAAATCCTAATGATGGATGCAAAATGCTAGAAGGTTGGCATGCTAAAAGAACTCGAGCTTTAGGTGATCTCATCGACAAGCTAAACAAAGCCCGAAATGAGGAGGATTTAAAATCTTGCTTGGCAATGAAACATCAGCTTTCTGACCAACACAAAACAACATCCAGTGAAGCAGAATCTGAAGAGACTGACACGTCCAAGGAGCAGCAAGTTATAAAGAAAGATTTAGACTCAAGGAAAGAATTGGGTTTTTCATTGCCAAAGTTGGTTAACAAAACTAACATTGATCAACAAACTCTGAACCAAATTGATGCTCACTTTTCTTCTCTCAAGCAGATAGGCAATCTATGATTTGATGCAGATCTTGTTATTTTTTTTTACCCCTCTATTGACATTAGGTTAAAAATCTCCAAACATGATGCTGTAATGTCCTCATGTAATTTATTCTCTCAACAGCCATTGATGTTTATATGGCAACAGCATTAAATGTGCATTGTCTCTTCACAAAATTCCACACCTCATTTTTCTGATCTTTCCTTATTTGTTATGAAATGATCAGTTGGATCGTGAACCTTGAACTTGTGTTTGAGATTGAAAATTTTCAAAGAAAGATCAGGATATGTGCTAGATCTTTGTGAAAAGGATGTTGAAGAAAAGGAAAAGTTTGTAACATGGGCTGTTTTTTTGGGCCATAAGATTGCAATAAATGGGCCTAAAAT
Coding sequence (CDS)
ATGGCGGCTCCGTCGCCAACCCTTAGCAACAACAGCTCTGACACTGCCACCACCACTGCCGGCGCGGTTGCGGCTAACGTCGCTGTTTCCCCCAATCACTTGGCCAACCGTACAGGAACGCCGCCCAAGACTCTCCGCGGCCTTAACAAACCCAAGTGTAGGGTTTGTGGGAATGTTGCTCGTTCCAGGTGTCCTTATGAATCATGCAAGAGCTGTTGTGCGAGAAACCAGAACCCATGCTATATTCACGTGTTAAAAGCAAATGCGACTTTTCCAGACAAGACACCTTCTTCTAGCTCTCCTCTATTTGACAAGCAATCACCAGATCCATCATCATCTGGGACTTCAAATAGAGTAGCTTCACTTCGACAACTTTCTAGCAACTTCTCCCAATTCAACAATGTGCGACTCCCTATCCGTTCACCAAAGCCATTGACCAGAAAGGATGCCGCCACAATCAATGAATGGAGATTTTCCAAGTTAAGGGAATTTCGGGAGAGGCATATTGAAGCAGAAAATGAAGCTTTTGATCGATACATGAAGAATATTAATCTATTGGAAGAGGTCTTTTCTACAAAATCTATGATTGATGATAGGCCTCTTAAGGACAGACCCCCTGTAAACTCTGGTACAGAAGCCAACCCCGAGGAAATGACACCTGGGTTGAAGTTAAAGTTAGGATCGACATCAGACAATTCTAGAAAGAGGATACGCAAAATTGTTGAAGATGGACTAAGAAAAATTAAAATAGTCGAGACAATTGATAACGTCGATGAGGTCACTGACCATGCCCAAGCTGATAGGGGTGAGGACGAGACAAATCCTAATGATGGATGCAAAATGCTAGAAGGTTGGCATGCTAAAAGAACTCGAGCTTTAGGTGATCTCATCGACAAGCTAAACAAAGCCCGAAATGAGGAGGATTTAAAATCTTGCTTGGCAATGAAACATCAGCTTTCTGACCAACACAAAACAACATCCAGTGAAGCAGAATCTGAAGAGACTGACACGTCCAAGGAGCAGCAAGTTATAAAGAAAGATTTAGACTCAAGGAAAGAATTGGGTTTTTCATTGCCAAAGTTGGTTAACAAAACTAACATTGATCAACAAACTCTGAACCAAATTGATGCTCACTTTTCTTCTCTCAAGCAGATAGGCAATCTATGA
Protein sequence
MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVARSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVASLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYMKNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKIVEDGLRKIKIVETIDNVDEVTDHAQADRGEDETNPNDGCKMLEGWHAKRTRALGDLIDKLNKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPKLVNKTNIDQQTLNQIDAHFSSLKQIGNL*
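The protein sequence above corresponds to the CDS read codon by codon. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene, the relationship can be checked on short stretches taken from the start and end of the CDS. The names `CODON_TABLE` and `translate` are illustrative and not part of any database tool.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in T, C, A, G order so they line up with the amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First seven codons of the CDS listed above.
print(translate("ATGGCGGCTCCGTCGCCAACC"))  # MAAPSPT
# Last five codons of the CDS, ending in the TGA stop codon.
print(translate("ATAGGCAATCTATGA"))        # IGNL*
```

Passing the full CDS string to `translate` should reproduce the protein sequence listed above, including the trailing `*`.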
Homology
BLAST of CsGy3G001950 vs. NCBI nr
Match:
XP_004150175.1 (uncharacterized protein LOC101215791 [Cucumis sativus] >KGN55803.1 hypothetical protein Csa_009752 [Cucumis sativus])
HSP 1 Score: 752 bits (1942), Expect = 1.32e-273
Identity = 388/388 (100.00%), Positives = 388/388 (100.00%), Query Frame = 0
Query: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA
Sbjct: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
Query: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA
Sbjct: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
Query: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM
Sbjct: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
Query: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKI 240
KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKI
Sbjct: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKI 240
Query: 241 VEDGLRKIKIVETIDNVDEVTDHAQADRGEDETNPNDGCKMLEGWHAKRTRALGDLIDKL 300
VEDGLRKIKIVETIDNVDEVTDHAQADRGEDETNPNDGCKMLEGWHAKRTRALGDLIDKL
Sbjct: 241 VEDGLRKIKIVETIDNVDEVTDHAQADRGEDETNPNDGCKMLEGWHAKRTRALGDLIDKL 300
Query: 301 NKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPK 360
NKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPK
Sbjct: 301 NKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPK 360
Query: 361 LVNKTNIDQQTLNQIDAHFSSLKQIGNL 388
LVNKTNIDQQTLNQIDAHFSSLKQIGNL
Sbjct: 361 LVNKTNIDQQTLNQIDAHFSSLKQIGNL 388
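The percentages in each HSP summary line are the matched-position count divided by the alignment length. The sketch below parses such a line and recomputes the values; it uses the figures reported for the Cucumis melo hit (356/388 and 371/388) as sample input. The helper names `parse_hsp_stats` and `percent` are hypothetical, and the regular expression only assumes the `Name = a/b (p%)` layout seen on this page.

```python
import re

# Matches fields such as "Identity = 356/388 (91.75%)".
STAT = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_stats(line: str) -> dict:
    """Return {field: (matched, alignment_length, reported_percent)}."""
    return {
        name: (int(matched), int(length), float(reported))
        for name, matched, length, reported in STAT.findall(line)
    }


def percent(matched: int, length: int) -> float:
    """Percentage of aligned positions, rounded to two decimals."""
    return round(100.0 * matched / length, 2)


line = "Identity = 356/388 (91.75%), Positives = 371/388 (95.62%), Query Frame = 0"
for name, (matched, length, reported) in parse_hsp_stats(line).items():
    print(name, percent(matched, length), reported)
```

The denominator is the alignment length, not the query length, which is why hits containing gaps report values such as 325/391.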
BLAST of CsGy3G001950 vs. NCBI nr
Match:
XP_008448750.1 (PREDICTED: uncharacterized protein LOC103490821 [Cucumis melo] >KAA0053076.1 uncharacterized protein E6C27_scaffold344G001940 [Cucumis melo var. makuwa] >TYK11531.1 uncharacterized protein E5676_scaffold139G001970 [Cucumis melo var. makuwa])
HSP 1 Score: 683 bits (1763), Expect = 2.49e-246
Identity = 356/388 (91.75%), Positives = 371/388 (95.62%), Query Frame = 0
Query: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA
Sbjct: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
Query: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPD SSSGTSNRVA
Sbjct: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDLSSSGTSNRVA 120
Query: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
SLRQLSSNFSQFNNVRLPIRSPKP+TRKDAA INEWRFSKLREFRERHIEAENEAFDRYM
Sbjct: 121 SLRQLSSNFSQFNNVRLPIRSPKPVTRKDAAAINEWRFSKLREFRERHIEAENEAFDRYM 180
Query: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKI 240
KN+NLLEEVFS +SMIDD+ LKD P VNSG EAN EEMTPGLKLKLGSTSDNSRKRIRKI
Sbjct: 181 KNVNLLEEVFSMRSMIDDKSLKDGPSVNSGIEANLEEMTPGLKLKLGSTSDNSRKRIRKI 240
Query: 241 VEDGLRKIKIVETIDNVDEVTDHAQADRGEDETNPNDGCKMLEGWHAKRTRALGDLIDKL 300
VEDGLRKIKIVET DNV+EVTDHAQADR D+ N NDGCK ++G HAKRTRALGDLI+KL
Sbjct: 241 VEDGLRKIKIVETTDNVNEVTDHAQADRDADQMNLNDGCKTIKGSHAKRTRALGDLIEKL 300
Query: 301 NKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPK 360
NKARNEEDLKSCLAMKHQL ++ +TTSS+AESEETDTSKEQQVIKKDLDSRKELG+SLPK
Sbjct: 301 NKARNEEDLKSCLAMKHQLFNRLQTTSSQAESEETDTSKEQQVIKKDLDSRKELGYSLPK 360
Query: 361 LVNKTNIDQQTLNQIDAHFSSLKQIGNL 388
L+NKTNIDQQTLNQIDAHFSSLKQIGNL
Sbjct: 361 LINKTNIDQQTLNQIDAHFSSLKQIGNL 388
BLAST of CsGy3G001950 vs. NCBI nr
Match:
XP_038906244.1 (uncharacterized protein LOC120092108 [Benincasa hispida])
HSP 1 Score: 610 bits (1574), Expect = 1.36e-217
Identity = 325/391 (83.12%), Positives = 350/391 (89.51%), Query Frame = 0
Query: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
MAAPSPT+SNNSSDTA+ AV AN AVS NHLANRTGTPPKTLRGLNKPKCRVCGNVA
Sbjct: 1 MAAPSPTVSNNSSDTAS----AVTANAAVSSNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
Query: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTP+SSSPLFDKQS DPSSSGT +RVA
Sbjct: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPTSSSPLFDKQSSDPSSSGTLHRVA 120
Query: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
SLRQLS+NFSQFNNV++P+RS KPLTRKDAA INEWRFSKLREFRER+IEAENEAFDRYM
Sbjct: 121 SLRQLSNNFSQFNNVQIPLRSRKPLTRKDAAAINEWRFSKLREFRERNIEAENEAFDRYM 180
Query: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGS---TSDNSRKRI 240
+N+NLLEEVFS +SMID LKD P +NS TEANPEEM PGLKLKLGS TSD SRKRI
Sbjct: 181 QNVNLLEEVFSMRSMIDGS-LKDGPSINSTTEANPEEMIPGLKLKLGSNPVTSDISRKRI 240
Query: 241 RKIVEDGLRKIKIVETIDNVDEVTDHAQADRGEDETNPNDGCKMLEGWHAKRTRALGDLI 300
++IVED LRK K VE DN+DEVTDHA+AD D+TN NDG K ++GWHAKR RALGDLI
Sbjct: 241 QEIVEDRLRKFKKVEATDNIDEVTDHAEADETADQTNLNDGFKTVKGWHAKRARALGDLI 300
Query: 301 DKLNKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFS 360
DKLNKARNEEDLKSCLAMKHQL + HKTTSS+ ESEETD SKEQ VIKKDLDSRKELG+S
Sbjct: 301 DKLNKARNEEDLKSCLAMKHQLFNPHKTTSSQTESEETDISKEQ-VIKKDLDSRKELGYS 360
Query: 361 LPKLVNKTNIDQQTLNQIDAHFSSLKQIGNL 388
LPKL+NKTNIDQ+TLNQIDAHFSSLKQIG L
Sbjct: 361 LPKLINKTNIDQETLNQIDAHFSSLKQIGTL 385
BLAST of CsGy3G001950 vs. NCBI nr
Match:
KAG6577645.1 (hypothetical protein SDJN03_25219, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 585 bits (1509), Expect = 9.89e-208
Identity = 314/388 (80.93%), Positives = 341/388 (87.89%), Query Frame = 0
Query: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
MAAPSPT+SNNSSDTATTTA AV N AVS NHLANRT TPPKTLRGLNKPKCRVCGNVA
Sbjct: 1 MAAPSPTVSNNSSDTATTTASAVTVNAAVSSNHLANRTATPPKTLRGLNKPKCRVCGNVA 60
Query: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
RSRCPYESCKSCCARNQNPC+IHVLKANATF DKTP+SSSPLFDKQS DPSSSGTS+RVA
Sbjct: 61 RSRCPYESCKSCCARNQNPCHIHVLKANATFTDKTPTSSSPLFDKQSSDPSSSGTSHRVA 120
Query: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
SLRQLSSNFSQFNNV++P+RS KPLTRKDAA INEWRFSKLREFRERH+EAENEAFDRYM
Sbjct: 121 SLRQLSSNFSQFNNVQIPLRSRKPLTRKDAAAINEWRFSKLREFRERHVEAENEAFDRYM 180
Query: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKI 240
KN+NLLEEVFS KSM+D LKD P N TE EE+ PGLKL LGS SD+SRKRIR+I
Sbjct: 181 KNVNLLEEVFSMKSMVDGS-LKDGPS-NFCTEGT-EEVIPGLKLSLGSGSDDSRKRIREI 240
Query: 241 VEDGLRKIKIVETIDNVDEVTDHAQADRGEDETNPNDGCKMLEGWHAKRTRALGDLIDKL 300
VEDGL K++ VE DNVDEVTD ++ + D+TN NDGC ++GWHAKR RALGDLIDKL
Sbjct: 241 VEDGLMKLQKVEATDNVDEVTDQVESGKVADQTNRNDGCTTVKGWHAKRARALGDLIDKL 300
Query: 301 NKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPK 360
NKARNEEDLKSCLAMKHQL +QH TSS+ ESEETD KEQ VIKKDLDSRKELG+SLPK
Sbjct: 301 NKARNEEDLKSCLAMKHQLFNQH-ITSSQTESEETDMPKEQ-VIKKDLDSRKELGYSLPK 360
Query: 361 LVNKTNIDQQTLNQIDAHFSSLKQIGNL 388
L+NKT IDQ+TLN+IDAHFSSLKQI NL
Sbjct: 361 LINKTIIDQETLNRIDAHFSSLKQIDNL 383
BLAST of CsGy3G001950 vs. NCBI nr
Match:
XP_022923586.1 (uncharacterized protein LOC111431225 [Cucurbita moschata])
HSP 1 Score: 585 bits (1507), Expect = 1.99e-207
Identity = 314/388 (80.93%), Positives = 340/388 (87.63%), Query Frame = 0
Query: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
MAAPSPT+SNNSSDTA TTA AV N AVS NHLANRT TPPKTLRGLNKPKCRVCGNVA
Sbjct: 1 MAAPSPTVSNNSSDTANTTASAVTVNAAVSSNHLANRTATPPKTLRGLNKPKCRVCGNVA 60
Query: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
RSRCPYESCKSCCARNQNPC+IHVLKANATF DKTPSSSSPLFDKQS DPSSSGTS+RVA
Sbjct: 61 RSRCPYESCKSCCARNQNPCHIHVLKANATFTDKTPSSSSPLFDKQSADPSSSGTSHRVA 120
Query: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
SLRQLSSNFSQFNNV++P+RS KPLTRKDAA INEWRFSKLREFRERHIEAENEAFDRYM
Sbjct: 121 SLRQLSSNFSQFNNVQIPLRSRKPLTRKDAAAINEWRFSKLREFRERHIEAENEAFDRYM 180
Query: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKI 240
KN++LLEEVFS KSM+D LKD P N TE EE+ PGLKL LGS SD+SRKRIR+I
Sbjct: 181 KNVDLLEEVFSMKSMVDGS-LKDGPS-NFSTEGT-EEVIPGLKLSLGSGSDDSRKRIREI 240
Query: 241 VEDGLRKIKIVETIDNVDEVTDHAQADRGEDETNPNDGCKMLEGWHAKRTRALGDLIDKL 300
VEDGL K++ VE DNVDEVTD ++ + D+TN NDGC ++GWHAKR RALGDLIDKL
Sbjct: 241 VEDGLMKLQKVEATDNVDEVTDQVESGKVADQTNRNDGCTTVKGWHAKRARALGDLIDKL 300
Query: 301 NKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPK 360
NKARNEEDLKSCLAMKHQL +QH TSS+ ESEETD KEQ VIKKDLDSRKELG+SLPK
Sbjct: 301 NKARNEEDLKSCLAMKHQLFNQH-ITSSQTESEETDVPKEQ-VIKKDLDSRKELGYSLPK 360
Query: 361 LVNKTNIDQQTLNQIDAHFSSLKQIGNL 388
L+NKT IDQ+TLN+IDAHFSSLKQI NL
Sbjct: 361 LINKTIIDQETLNRIDAHFSSLKQIDNL 383
BLAST of CsGy3G001950 vs. ExPASy TrEMBL
Match:
A0A0A0L209 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G016970 PE=4 SV=1)
HSP 1 Score: 752 bits (1942), Expect = 6.38e-274
Identity = 388/388 (100.00%), Positives = 388/388 (100.00%), Query Frame = 0
Query: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA
Sbjct: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
Query: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA
Sbjct: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
Query: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM
Sbjct: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
Query: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKI 240
KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKI
Sbjct: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKI 240
Query: 241 VEDGLRKIKIVETIDNVDEVTDHAQADRGEDETNPNDGCKMLEGWHAKRTRALGDLIDKL 300
VEDGLRKIKIVETIDNVDEVTDHAQADRGEDETNPNDGCKMLEGWHAKRTRALGDLIDKL
Sbjct: 241 VEDGLRKIKIVETIDNVDEVTDHAQADRGEDETNPNDGCKMLEGWHAKRTRALGDLIDKL 300
Query: 301 NKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPK 360
NKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPK
Sbjct: 301 NKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPK 360
Query: 361 LVNKTNIDQQTLNQIDAHFSSLKQIGNL 388
LVNKTNIDQQTLNQIDAHFSSLKQIGNL
Sbjct: 361 LVNKTNIDQQTLNQIDAHFSSLKQIGNL 388
BLAST of CsGy3G001950 vs. ExPASy TrEMBL
Match:
A0A5A7UFS7 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold139G001970 PE=4 SV=1)
HSP 1 Score: 683 bits (1763), Expect = 1.21e-246
Identity = 356/388 (91.75%), Positives = 371/388 (95.62%), Query Frame = 0
Query: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA
Sbjct: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
Query: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPD SSSGTSNRVA
Sbjct: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDLSSSGTSNRVA 120
Query: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
SLRQLSSNFSQFNNVRLPIRSPKP+TRKDAA INEWRFSKLREFRERHIEAENEAFDRYM
Sbjct: 121 SLRQLSSNFSQFNNVRLPIRSPKPVTRKDAAAINEWRFSKLREFRERHIEAENEAFDRYM 180
Query: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKI 240
KN+NLLEEVFS +SMIDD+ LKD P VNSG EAN EEMTPGLKLKLGSTSDNSRKRIRKI
Sbjct: 181 KNVNLLEEVFSMRSMIDDKSLKDGPSVNSGIEANLEEMTPGLKLKLGSTSDNSRKRIRKI 240
Query: 241 VEDGLRKIKIVETIDNVDEVTDHAQADRGEDETNPNDGCKMLEGWHAKRTRALGDLIDKL 300
VEDGLRKIKIVET DNV+EVTDHAQADR D+ N NDGCK ++G HAKRTRALGDLI+KL
Sbjct: 241 VEDGLRKIKIVETTDNVNEVTDHAQADRDADQMNLNDGCKTIKGSHAKRTRALGDLIEKL 300
Query: 301 NKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPK 360
NKARNEEDLKSCLAMKHQL ++ +TTSS+AESEETDTSKEQQVIKKDLDSRKELG+SLPK
Sbjct: 301 NKARNEEDLKSCLAMKHQLFNRLQTTSSQAESEETDTSKEQQVIKKDLDSRKELGYSLPK 360
Query: 361 LVNKTNIDQQTLNQIDAHFSSLKQIGNL 388
L+NKTNIDQQTLNQIDAHFSSLKQIGNL
Sbjct: 361 LINKTNIDQQTLNQIDAHFSSLKQIGNL 388
BLAST of CsGy3G001950 vs. ExPASy TrEMBL
Match:
A0A1S3BKF8 (uncharacterized protein LOC103490821 OS=Cucumis melo OX=3656 GN=LOC103490821 PE=4 SV=1)
HSP 1 Score: 683 bits (1763), Expect = 1.21e-246
Identity = 356/388 (91.75%), Positives = 371/388 (95.62%), Query Frame = 0
Query: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA
Sbjct: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
Query: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPD SSSGTSNRVA
Sbjct: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDLSSSGTSNRVA 120
Query: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
SLRQLSSNFSQFNNVRLPIRSPKP+TRKDAA INEWRFSKLREFRERHIEAENEAFDRYM
Sbjct: 121 SLRQLSSNFSQFNNVRLPIRSPKPVTRKDAAAINEWRFSKLREFRERHIEAENEAFDRYM 180
Query: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKI 240
KN+NLLEEVFS +SMIDD+ LKD P VNSG EAN EEMTPGLKLKLGSTSDNSRKRIRKI
Sbjct: 181 KNVNLLEEVFSMRSMIDDKSLKDGPSVNSGIEANLEEMTPGLKLKLGSTSDNSRKRIRKI 240
Query: 241 VEDGLRKIKIVETIDNVDEVTDHAQADRGEDETNPNDGCKMLEGWHAKRTRALGDLIDKL 300
VEDGLRKIKIVET DNV+EVTDHAQADR D+ N NDGCK ++G HAKRTRALGDLI+KL
Sbjct: 241 VEDGLRKIKIVETTDNVNEVTDHAQADRDADQMNLNDGCKTIKGSHAKRTRALGDLIEKL 300
Query: 301 NKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPK 360
NKARNEEDLKSCLAMKHQL ++ +TTSS+AESEETDTSKEQQVIKKDLDSRKELG+SLPK
Sbjct: 301 NKARNEEDLKSCLAMKHQLFNRLQTTSSQAESEETDTSKEQQVIKKDLDSRKELGYSLPK 360
Query: 361 LVNKTNIDQQTLNQIDAHFSSLKQIGNL 388
L+NKTNIDQQTLNQIDAHFSSLKQIGNL
Sbjct: 361 LINKTNIDQQTLNQIDAHFSSLKQIGNL 388
BLAST of CsGy3G001950 vs. ExPASy TrEMBL
Match:
A0A6J1E768 (uncharacterized protein LOC111431225 OS=Cucurbita moschata OX=3662 GN=LOC111431225 PE=4 SV=1)
HSP 1 Score: 585 bits (1507), Expect = 9.65e-208
Identity = 314/388 (80.93%), Positives = 340/388 (87.63%), Query Frame = 0
Query: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
MAAPSPT+SNNSSDTA TTA AV N AVS NHLANRT TPPKTLRGLNKPKCRVCGNVA
Sbjct: 1 MAAPSPTVSNNSSDTANTTASAVTVNAAVSSNHLANRTATPPKTLRGLNKPKCRVCGNVA 60
Query: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
RSRCPYESCKSCCARNQNPC+IHVLKANATF DKTPSSSSPLFDKQS DPSSSGTS+RVA
Sbjct: 61 RSRCPYESCKSCCARNQNPCHIHVLKANATFTDKTPSSSSPLFDKQSADPSSSGTSHRVA 120
Query: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
SLRQLSSNFSQFNNV++P+RS KPLTRKDAA INEWRFSKLREFRERHIEAENEAFDRYM
Sbjct: 121 SLRQLSSNFSQFNNVQIPLRSRKPLTRKDAAAINEWRFSKLREFRERHIEAENEAFDRYM 180
Query: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGSTSDNSRKRIRKI 240
KN++LLEEVFS KSM+D LKD P N TE EE+ PGLKL LGS SD+SRKRIR+I
Sbjct: 181 KNVDLLEEVFSMKSMVDGS-LKDGPS-NFSTEGT-EEVIPGLKLSLGSGSDDSRKRIREI 240
Query: 241 VEDGLRKIKIVETIDNVDEVTDHAQADRGEDETNPNDGCKMLEGWHAKRTRALGDLIDKL 300
VEDGL K++ VE DNVDEVTD ++ + D+TN NDGC ++GWHAKR RALGDLIDKL
Sbjct: 241 VEDGLMKLQKVEATDNVDEVTDQVESGKVADQTNRNDGCTTVKGWHAKRARALGDLIDKL 300
Query: 301 NKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFSLPK 360
NKARNEEDLKSCLAMKHQL +QH TSS+ ESEETD KEQ VIKKDLDSRKELG+SLPK
Sbjct: 301 NKARNEEDLKSCLAMKHQLFNQH-ITSSQTESEETDVPKEQ-VIKKDLDSRKELGYSLPK 360
Query: 361 LVNKTNIDQQTLNQIDAHFSSLKQIGNL 388
L+NKT IDQ+TLN+IDAHFSSLKQI NL
Sbjct: 361 LINKTIIDQETLNRIDAHFSSLKQIDNL 383
BLAST of CsGy3G001950 vs. ExPASy TrEMBL
Match:
A0A6J1DRY6 (uncharacterized protein LOC111022656 OS=Momordica charantia OX=3673 GN=LOC111022656 PE=4 SV=1)
HSP 1 Score: 580 bits (1495), Expect = 7.49e-206
Identity = 313/391 (80.05%), Positives = 342/391 (87.47%), Query Frame = 0
Query: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
MAAPSP+ SNNSSDT TT A A ANVAVS NHLANRTGTPPKTLRGLNKPKCRVCGNVA
Sbjct: 1 MAAPSPSASNNSSDTGTT-ATAATANVAVSSNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
Query: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTP+SSSPLF+KQS DPSSSGTS RVA
Sbjct: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPTSSSPLFEKQSADPSSSGTSLRVA 120
Query: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
SLRQLS+NFSQFN+V++P+RS KPLTRKDAA INEWRFSKLREFRERHIEA NEAFDRYM
Sbjct: 121 SLRQLSNNFSQFNSVQIPLRSRKPLTRKDAAAINEWRFSKLREFRERHIEAGNEAFDRYM 180
Query: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEEMTPGLKLKLGST---SDNSRKRI 240
+N+NLLEEVFS KSMID +KD P VNS EAN EEM GLKLK+GS SDNSRKRI
Sbjct: 181 QNVNLLEEVFSMKSMIDGS-IKDGPSVNSSAEANTEEMVSGLKLKIGSDPIRSDNSRKRI 240
Query: 241 RKIVEDGLRKIKIVETIDNVDEVTDHAQADRGEDETNPNDGCKMLEGWHAKRTRALGDLI 300
++IVEDGLRK+K V+ D VDEVTD A+ D+ D+T+ NDGCK +GW AKR ALGDLI
Sbjct: 241 QQIVEDGLRKLKKVKVTDKVDEVTDQAEPDKVADQTDLNDGCKTAKGWPAKRAIALGDLI 300
Query: 301 DKLNKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGFS 360
DKLNKARNEEDLKSCLAMKHQL + H TSS+AESEE D SKEQ V+KKDL+SRKELG+S
Sbjct: 301 DKLNKARNEEDLKSCLAMKHQLFNPH-LTSSQAESEEIDMSKEQ-VVKKDLESRKELGYS 360
Query: 361 LPKLVNKTNIDQQTLNQIDAHFSSLKQIGNL 388
LPKL+NKTNIDQ+TLN+IDAHFSSLKQI NL
Sbjct: 361 LPKLINKTNIDQETLNRIDAHFSSLKQIDNL 387
BLAST of CsGy3G001950 vs. TAIR 10
Match:
AT1G32730.1 (unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 12 plant structures; EXPRESSED DURING: 6 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF702 (InterPro:IPR007818); Has 120 Blast hits to 118 proteins in 39 species: Archae - 0; Bacteria - 8; Metazoa - 63; Fungi - 4; Plants - 33; Viruses - 0; Other Eukaryotes - 12 (source: NCBI BLink). )
HSP 1 Score: 239.6 bits (610), Expect = 4.3e-63
Identity = 159/392 (40.56%), Positives = 222/392 (56.63%), Query Frame = 0
Query: 1 MAAPSPTLSNNSSDTATTTAGAVAANVAVSPNHLANRTGTPPKTLRGLNKPKCRVCGNVA 60
MA SP+LSNN + TPPKTLRGLNKPKC CGNVA
Sbjct: 1 MATSSPSLSNNGLSSVV----------------------TPPKTLRGLNKPKCIQCGNVA 60
Query: 61 RSRCPYESCKSCCARNQNPCYIHVLKANATFPDKTPSSSSPLFDKQSPDPSSSGTSNRVA 120
RSRCP++SCK CC+R +NPC IHVLK +T +KT + S+P ++++ + + G++ RV+
Sbjct: 61 RSRCPFQSCKGCCSRAENPCPIHVLKVASTSGEKTQAPSTPSSEQKATE-GTPGSTTRVS 120
Query: 121 SLRQLSSNFSQFNNVRLPIRSPKPLTRKDAATINEWRFSKLREFRERHIEAENEAFDRYM 180
S+RQLSSNF+QFNN+ R KPLT KDA +NEWRF+KL+E+R+R+IE ENEAFDRYM
Sbjct: 121 SIRQLSSNFAQFNNLNASSRQRKPLTIKDAQALNEWRFTKLKEYRDRNIEVENEAFDRYM 180
Query: 181 KNINLLEEVFSTKSMIDDRPLKDRPPVNSGTEANPEE-MTPGLKLKLGSTS---DNSRKR 240
N+NLLEE FS S+ D+ P E N EE + LKL+L S S ++ +KR
Sbjct: 181 SNVNLLEEAFSFTSVPDEESHGTAAP-----EQNKEENIVSELKLRLRSNSARTESFKKR 240
Query: 241 IRKIVEDGLRKIKIVETIDNVDEVTDHAQADRGEDETNPNDGCKMLEGWHAKRTRALGDL 300
I + V+ GL K+K ++ + D+ D + + + W K + AL ++
Sbjct: 241 IAETVKAGLVKLKRLDLGSSSDDQDDIKRRVKRKK-------------WEEKGS-ALNEI 300
Query: 301 IDKLNKARNEEDLKSCLAMKHQLSDQHKTTSSEAESEETDTSKEQQVIKKDLDSRKELGF 360
IDKLNKAR EEDLKSCL MK +L Q T++ +++
Sbjct: 301 IDKLNKARTEEDLKSCLEMKSKLCGQVSPTAASEKNK----------------------- 327
Query: 361 SLPKLVNKTNIDQQTLNQIDAHFSSLKQIGNL 389
P +V K + ++ L +I + S ++G L
Sbjct: 361 IFPGVVRKVEMSEEALQKIAENLQSFDKVGML 327
The following BLAST results are available for this feature:
BLAST of CsGy3G001950 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_004150175.1 | 1.32e-273 | 100.00 | uncharacterized protein LOC101215791 [Cucumis sativus] >KGN55803.1 hypothetical ...
XP_008448750.1 | 2.49e-246 | 91.75 | PREDICTED: uncharacterized protein LOC103490821 [Cucumis melo] >KAA0053076.1 unc...
XP_038906244.1 | 1.36e-217 | 83.12 | uncharacterized protein LOC120092108 [Benincasa hispida]
KAG6577645.1 | 9.89e-208 | 80.93 | hypothetical protein SDJN03_25219, partial [Cucurbita argyrosperma subsp. sorori...
XP_022923586.1 | 1.99e-207 | 80.93 | uncharacterized protein LOC111431225 [Cucurbita moschata]
BLAST of CsGy3G001950 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A0A0L209 | 6.38e-274 | 100.00 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G016970 PE=4 SV=1
A0A5A7UFS7 | 1.21e-246 | 91.75 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3BKF8 | 1.21e-246 | 91.75 | uncharacterized protein LOC103490821 OS=Cucumis melo OX=3656 GN=LOC103490821 PE=...
A0A6J1E768 | 9.65e-208 | 80.93 | uncharacterized protein LOC111431225 OS=Cucurbita moschata OX=3662 GN=LOC1114312...
A0A6J1DRY6 | 7.49e-206 | 80.05 | uncharacterized protein LOC111022656 OS=Momordica charantia OX=3673 GN=LOC111022...
BLAST of CsGy3G001950 vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G32730.1 | 4.3e-63 | 40.56 | unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 12 plant structures; EXP...