CSPI03G06640 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI03G06640
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Unknown protein
Location: Chr3: 5834066 .. 5843239 (+)
RNA-Seq Expression: CSPI03G06640
Synteny: CSPI03G06640
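The Location field above can be turned into a gene span with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention; the record itself does not state this). The function names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re

def parse_location(location):
    """Parse a string like 'Chr3: 5834066 .. 5843239 (+)'.

    Returns (chromosome, start, end, strand). Hypothetical helper.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, location)
    if m is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1

chrom, start, end, strand = parse_location("Chr3: 5834066 .. 5843239 (+)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the gene spans 9174 bp on the plus strand of Chr3.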
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AGTGGACCTAAATTCATTTCCACCTCTAAATAAGGGCCATTCCTACAAATTTTTCGAGAGAAAATGATGTGTGGTTGATTTAGCCACACCCAACCCATCGTCTTCCTTTACGAATAACCGTGTAAGAGCCGACGGAGCAATTGCTCGCCGACTTCGATCGGACTAAATTTCCGTTACTATGTAAATCGCTGTGACCGGAACGAACCCTTTTGCGAAGTAAGCGTCTCTCCTTGTCATGGACATCTTTTCTTCAACTTGAATTCATGGAGGTGGTCGGTTCAGTCAAACTCTTTTTAGCTTCACCGGCGAACACGGTTTCAGGCTTAGTTTACTCTCCCTCCTCTTATTCTTCCTTATTTTCTCCCAAGAAGAAGAAGAAGAAGAAGAAGCTGCTTGTTGTGTCAAAGAGCAAAAAGCAGCCTCAAACTTCATCGGTGATTATGCTTCCATCTACATTTTTTTTATTGCTCATTCACAGCAGTTCGCTAGTTGCTAGGTCTTCAGCCTTGCCGGACCAGAGAGATGTCCTCTTTCTGTAATCTGTACCTGTTTTATTTTCTTGGTTTTAATGTGCAGTCTAAGTTTCTGATATTGTTGCTTAGTTAAACACAACGTACCGCCACTTTTATTTTCTGATAGCAATGCGCGAAAATTCGATAATGTTGACGGACAGATCCACAATTTTGTGCAAAATCTAACTGGCTTCTTGCTGCAGGCTGGTTCGGATCCTCCTCCGAGAATTACGTCAAATCTGAAGCAGAATTTGCAGTTTCTGAGATTGTGGAAGGTATGTTATGTGGTTCTGCTCAAACCTTTTTAACTAATTCGGGTATTTGTAGCGCTCTCTCATATAACAAAGACTTGTAGGATTTTCAAAAGAGAAAATCTGGCGTACCTAAGCCTGCTACTAGTTACCGGAGGAAGAAGGTTGAGAAGGAGGACCTCCCAGAAGATACGGAGCTTTATCGTGATCCCACATTGGCTCTTTACCAGTACATCTCTGATTGACAAACTTGTTTTCGTAACATTTTGTCTCTCAACCATGACATAACACAAAAGTTGCTGGGAGAAATGCTGCTCTTAATATCTTTTATTTTTCTGGCAGTACAAACCAGGGCATAGACAATGTATTCCCTGTCTTGCTGGTTGATGGATATAATGTGTGTGGCTACTGGGTGAAACTGAAGAAACATTTTATGAATGGGAGACTTGATGTAGCTCGCCAAAAGCTAATTGACGAGCTTATTACGTTCAGTATGCTGAGAGGTTTGCAGTGTGCTCCTTCTTCATTGTCTAAAAATGAGACCTGATCTTTGGGTTATAGACTTGTAGGCTTATAGTGTAGCTGTTGCTTGATGTTCAGGAATTTTTTTGGAACTTGATTTTTAGTTGCTATTGTCCCTGATTACATGCTTACCACTTCTACAGAGGTCAAAGTGGTAGTTGTATTTGATGCGATGCTGTCTGGACTCCCAACACACAAGGAAAACTTTGCCGGGTAATTCTTATGATCTAAAATATAGCCTCTTTAATCTATATAATCTTCAATTTGCTGCTAATCTTCTGCCAGTAGATGTGCATTTGGGATTAGACGTTCCCATGGTTGCTACTTGCAAGCTTAATGACTTTAAAATTAATATATTTGAAGATGCAGAACAAATGTGGAATGATGAATTAGTGGCATAAGAAAAGAGATGTGTATACGAATGATATAGTTTGTGTAGATTGTTGATGAAACATAGCCCAACTTTTCTTATTTTTGAAAAGGAGACAGCCTCTTTATTAATATAATATAATAAGACAAAAGCTCATAGAACAAGAGAATTATACAATGAGCATGTAACCATGGATCAGGAGGTGCACCTGGACATTTTAACTAGGTTGCCATTCCCATAGCACCTTCGTCATATCCAAACAAGCTAAAAACCAAAACAAGGGAGTAATGTCCAAAGGCAAAACAAAACCAAAAGAAGTATAAATAAATACAAATACAAATAAAA
AGACTAAAAGATTATACCAGACTAGATCTATCAACTCGCACTGAGACAAACTTGAGATCCTTGCAGAAGGAACACCAGGAGGAGGCCCAACTTTTCTGTGGCATTAGAAACAGACTTTGACCAATTTCATCTAGAAGTAAGGGCGAAAAGCCCCTTGTCTTCTTGCTTAGGCCATTTTTCTTAAAATAAATTCAATGGTGCACAATGACAGAAGATGGTTCATATTCATTTGGTATGTTATGGATAAATATATTGATGATTTCAGACAAATTACAATTGAGTACCTTAAAAGAATGACTTGAAGAAAGATGATTTGTACTATGGCTTGAAGTTTTCATTGCTCATTTCAGTTCTGAGCTGGAAGTTTAAACAAGAAACAATTTCAGTTTTAAGACTTCAGACCTCTTGGGACCAAGAGGATGAGGATATAGCATACACACTCCAAGGCTATGAAACTTTTCCTGGAATGTAATATTCATAGTCTCATATCAATGGGGCCTGAAGTCTTTTTGAGGAACTTCTTGATGTGCATTTCCAGTTTCGTGTTTGGGCAAGTCCTATTGAGGGTTCATGAGTCATCTAATGCTGTTCAAAACCATGGAAACTTTCAACTCATGAAAAGAAATGTATTTTTAAGGTTTTATTTTTTACTTTATATTATGAGAAATTTTCAGTGCCTTTCTTGCTGATTATTTTGAGTTTGATATTTCAATTTGCGTAGTTTATAAGAACATAGAATTTTCCTTTCTATATTTAATTAAAAAATATTACTCTGAGGATGTTCCTTAACATGTGCTGAAAATGTTTACACTTGGAAAATAAGCATGTATTATGAGAATGAATTACCCTATACCTCATGCTATGTTAGGTATCTTTTATGTCCAAGACACTCTTTCAGACCTTTTTGACATGGCTTGTTACGCTGCAGTCTGCATACTTAGCAATCAAACTATTCAATAGAAAACAGATGGATTTAAATTGGCGGAACTGTTCTTGCAGAATTGACGTGGTTTATTCAGGTGAATCGTGTGCAGATACATGGATTGAAACAGAGGTAAGATTAGGCTTCATGAATAAAATCCTGTGACAGTGGTAAAAAACGGTCCATGTGTCACTCCTGGACTTATAGGTCAAGCACATGTCCCAATAAAAAAATGTATCACCAAGAAGCATTTTACTAGGAAGCAAGGGTTTCGGTCATATCGTGCTTTTCATGAACGTAGATGCAGATTTCAATTTGTAATATATTCAGATGTTTTCTGGTCCTTCACTTTTGGCAATGTCAGATTATTGATTCCATGGTATCACTTTTCATTTTGGAAAGCAACTTGAACGATAAACAATTTGTATCAATTTATATTAAATCATTACTATGGTTTGTCGTTATCTGGTTGGTTTACTACCCTTTTTGGTTATTGAGTTTTCTCTGAACCACATACTTGATGTCGTTGATCTTAAATTTTGAAACTCCTGCAGTTTTTCATTATAATTAGCTACATCCTTTTTGAGTTGCAGGTTGTAGCCCTGAAGGAGGATGGATGTCCCAAAGTTTGGGTAGTGACTTCTGATGTCTGTCATCAGCATGCAGCCCATGGAGCAGTATGATCTCCTCATTGTTTGGGCATCACTGTATTATGCATTAACTTTTGCAGATATGGTTAGAGTATATTATTTTTATCTTTAATCTTCTAATGAAATTAATTCATAACATGGAAGAATTTGTGGTCTAATCTGTTGAGGATTTCTTCTCATCAGTTACTTAGGTTGCCATCTTCTGTATGTGTTAATTTTTATCTTTATTTTGATGTTGTCGCCGTTGAATTATTTTTTCTTTTCTTTTTCTTTCCACGTGAGGTTGTTTTCTTAAGTGAAAACTTCTGATTTAGAAATTTTAGTTGTTCTTCAATGTAAACCATCGAGGATTGTCCTTACAAGGAAGAGAGATAGAATGAATAAATAAAGGCTAATCATTGATTTTTGTCGTGGTCTAGAGTGGCAAG
CTTTCAATTTAGATAACTTTTAGGTCATATATCCAAACCATGCTAGTCACCTAGTTGAGATATTAATTAAACACTCATAATTTTTTAGGCACCCATCATAAGTTTGTCTGATACTCGTGGTTATTAGTATTTATTGCTCTCCAATATTGAAACAGAATAGACTGCTTCAATATGCTTTCTTTTCATCATATGGTTTTGATTTACTTGTATTTCATGTGTTTTTGCCATTGTTTTTATTATTATCATCATTACTTTTTTTTTCCCCTTTTGAAGTGTTTAGCTGGGACGGTTAATTCATGTAAGTTTGCATATATAGCTCTTGGTTGTTTAGAACTATACTCTGTTTATGGCTGAAAAGGCTGTTTTCTGTTACTTAATTTTCAGGGAGCCTTTATTTGGAGTTGCAAGGCATTAGTTTCCGAGGTGAGAAAGGTCTAATTTTTTTGGTGCATATTGATAACATGGTAAAAAACATATATTTGTTTTTCTGTGTACATTAATTCACTATATTATAGGTATTATGCTTCAACAATTTTTTCTTTTCCGTGTTTTCTCTATTAAGATGAAAGGAATTTGGAATTCACCCCATGCCATTTTATCCTGTCTTTTTTATTACCTATTGATGACCTTAATTCATATGTACATACACATGCAATCTTAATCTCTGTGGAAAGAAAAGAGTTCTATTTGTGAAGTTGTTATCAATTCTCTCAATAAGTACAGAAGAAGAAAGAACAAGGGATACCCCAACATGAAGCCCTAAATAATAGAATAGGCTAAGTCCAATTTTGCAGCCAAAATAAAAAGACAGGGTTTTCATTAATTAGTGTAAAGGCTTGAAGCTCCTAAATTTTTCTTTGACCAACGTTCAATAGATTACTAATGTTGAATAGATTGTCCAACATTGTTGTTAATATGAGGCGCACTGCAAGTGCTGGCTCACGCTTAAAAAGGTGTTTTCAAAGGTTTGGACTCAACCTGTTCTCCTTGTAATATTGCAACGAGGCTGACATTACCTATTTGATGTTTAGGTCATACATAAGTTTTGTACATATGTCTAATGGTGGATGCTATCTTAAATCCTATATTAGGAATATTGCATGAACATTGGCTTTAGACTACAGTCCGTTCCAGCACAACCTACGCTTGTTAACCCTGTTAATGAACGACGGTAAATGGATGCTTTACCAATTATCTTTTCTGAAGAATATGGCTCACATATTATTTGAACCATTATGTGCCAAACTTACTCTTTAAATCATTCCATTTGATTAAAAGTAAAGGAAGTCATCTATGGAGAGGATTTTTGTGGCGGGAAGACAAATTCTTGATCATGACCTTCTTACTAATGAGACAATGGAGGATTATGGTGCCAATATGAAAGGAGTGATTTTCAAAAAGCCTACATTGTGTGGATTTGTATGTTTGATAAGGTTTTGGAGGACTTTGGTTATAAATGGTGGATTTGGATGCGAAGTTGTATGAGGAACACAACACTAATTCCTCTCCTCATTAATGGGAGTTTTAGGGGCGAAGTTTTGGTCTGAAGAATTCTAAGACAGGGAAACCTTTTATCCCCCTTCTTTTTCTTCTCATCTGTAAGAGTGTTGTTGGTAAAATCCTTATTATTTTGAGGTTTGGAAAGGAAGTTATGACTCTGGCCCACCTTCAAGTCGCTGATAATTCTATTTTCTTCTAGTAAATGTAAGGTTATGAGATTCAATTGTGAAGAGGAGAAGATTGGAGGTGGACGATATTAGTAGGTTGCAAGTTGGTTCCTTTCCTCTTCTTACCTTAGTCTTCCCTTTGGGAGTAGTCTCAAATCCATTTATTTTGGAACCTAGTGGTGGAGAAAGTTCAGAAAAGATTGGCCTCATGGAAAATAGACTTTTTCTCCAAAGCTGGTAGATTGGCTTTGGTTCGCTCTATGTTGAGTGGTATCCCAATTTATTACTTTTTCCTTTTTAGAGCTTTGAGGGAGGTATGTGAGAGTATCAGGA
AGCTTATGCATAATTTCCTTGGGGAAGGGGTAGATGTAGGAAAAGGAAAAGAATTACACTTGGTTGGATGGGAGGCCATTGAGAGGTTAGTTTCCTTGGGGGCTAGAAATTGGAGAATTTAAGGATTTTTAAGGGAGCTCTGCTGGGCAAATGGCTTTGCTTTTCTCCCTTAAGCCCAACTCTTTATGGCATAGGACCATATTGCTAGTGCGTTCAATCACCCCTTTATATGGATGTTGCTTGGGGCTAAAGGCACTTATCTGAATTTATGGAATGATTTTTCTGAAAAGCTCTTTTCTTTCTGTCATTTGACTCATTGAGTGAGTGGTGGCAGAGGGGAAGGATATGTACTTTTGTCAAGATCAGTAGATTCGGTTTTTCCACATTTCTTTCATTTGTCTTCCTTTAAATGACTTGTTGTCCAATTTTTTAGTTTGCTCAGAGAGTTCCATTTTCTTTTCGTTCCAGTTTCGTTGTTTTTTGTTCAACAAAGAAGCAACTGTGGTTGCTTCTCTTTTGTCTATGCTTGAGGGCTTTGTTTTTAGACTTTTGTGTGTGGGTCCCCCAACCCCTTGAAGGTATTCTCGTGCAAAACTTTTTTTTGTAATTTGTCTGACCGTTCTCCCTTAGGTGAGTTGTATTTTTCTGGATTTTGGAGAATTAAGGTTCCTGGAAAATAAGTTCTTTACCTGGAAAGTTCTACGTGAAAGAGCAAACACTGTGGATCAACTTCTGAGGAAGTTGCCCTTGTTGGTTGGCCTGCTTAGTTACATTCTCTGTCGATAGGGGAAGAAGACCTGGATCACATTCTTTGGAGACGTTTGCGAGACCTTTGTGGAATTATTTCTTTCAGATGTTTGGACTCTTGCTAGCTCCACTCAAGGATACTTGTGATATGATTGGGGGAGTTCCTCTTTCATTCATTCGCTCTTTTGGGATTGCTTTTTATGGGTTGCTAGGGTGTGTGCGTTATTATGGGATCTATGGGGATGAGCAAAATAATCGCCACATATGGTCCCTCATCATTGTATTCATTTTGGGTTTCCATTTTGAAGCCTTTTGTAATTTTTCCATAGGCACTGCTTTGTATAGTTCGAGCCCCTTCCTTTAGTTGGGCTCCCTTTTTGTAGGCTTGGTTTTTTTGTATTTTTTATTTTTTCCTCATGAAGTTATAGAAAGAGAAAAAGGCATAATACTCCTAAAAACTAGGCCTTCAGTTCTTGCACAATGAGAAGATGTGTATGGCTTCAATATTTTCTGAATGAACATGATAATAGTAGGGATATCAAGCACGTAGTTTACATTTTATTTTGTCATTGTAGATTAATGCATCAAAAAAGGAGGTCGAAATGATGTTGCAAGAACACAGGTATGACTATTCCAGATGCAAATTTTACAGTATGTGTTGAGATAGATTATGATAGGTTTTAGTCTCAGCCCCTTAAATCTGCTTCAGTTTCTAATTTGGTCTCAAGTTTAAAAGTCGAGTGCAATAGTTGGTGTTGATTTTTAATTTGGTCCTTAAAGTTTAAATAATTTATACAAGTCCCTCAAGTTTGCATCAGTTATTGCAGAAGATAAAATGACTTAGTGGACATGTTATAAATCACTAGCATTGAAATCGAGGAAAAAAATGTAGTGTCCACTCTTTAGAGAACCAAAAAAGAATTTTATGATTATGAGATATATTATTAATTTTTTTAAACATGTTGGTCACGTTTTCACTACAGAACTCAATAGAACCTTGTGATTAAAATATAACTTTTTGAACGTCATTGACCAAATTAGAAACAAGGTCAAAACTCAAGGTACCAAAACTATGTAAGTAGGTTTCTATATCAACTGTGAGATGCTTTTGAGTTTTGCCATTCCAATTTTCTGAATCTCTCTCGAGTTATTAGTGATCTTTCTCCTTACATCCGATAAATTTTTGTTTATTTTATTCCTATCCCAAAGTGGCAAAAGCCCAACTCTTCCAACCTGCAGGATAAACCTTGCCC
TTGACACTTAATTGTCTTTCCAGATTGTTTAAATTGCACAAGACCGCCTTGAATCTTATGCATTCAAACCTTCCAAATTTCCCTTACTCGAGGGATCAACAAAACTGTTATTGTACAGTAGTGTAAAATCGACCTTGCTTCAACCCATTTACAGACTTGGAATATTTTTCTGCATGCTTCAGCATATTATTTTATAAGAAGAGTGGCAGCATCTACTTTAAATCTCCTGCCTTATTTATTAAATTTTTTTCCAGATCAACATCTTTCCAAGGGAAATTGTTAAAGCATAATCTCGATTCAGAAGTTGTCAATGCTCTCAATGATCTGAAAAGGAAGCTAAACGAAAGTGAACCAAAGTAGATGGGGAAGATCGTGTTTGTACAAATACTCTTTCTCTAATTTCATTGCTTCTTCATTAATTTAGGCTGAATCTCATGTTCATAGGCATGTCGCAGTCTGTTTCAAGCTGATTAATTGAAGACAAACCACATATTTTTTCTTTTCTTTTTCTGTGTACAAATCCCCTAGACGTCTGACAGAAAATAACATTAAGTTTGTTGAGAATGGGGACGAAGATGTAGGTTCCCTAGCCTTTCTGAATTCCAGATGTAGGGTCTCTATGTTCCAACAGTGAAGAGAAGCATATAATATTTGTGCATGACGTGTTTTGGCATATTTAAAAATATAGGAGATGATATGGGAGAAGGAAATAAAAGAAGAGAAGCCAGCTTTCTCATTTGTCAGCATTACCGTTTGTAGAATAGCATTTTTTAGTTCAGCTTTGGCCCCATTTCATTAAATGAAATAGAGCTACCACCACCGCATTTGAAAGTCATGAGTTAATAGAGGGACTACTGCTCATATTTAAGTGGTCGGGCTTATGAGATTTTATGTAAATGTTGTAACCATCGATTTTAGGAAGAAAAATCCTCAAACAATGACTTGGTTTTTGTTGTTTTATAATGTTGGTGAGGTTATAAATGTTGAGGATCACCCCCTGCACCGTAGTTGAAAGTCATGAGACTATCTCATAATACTCACAATTTAAGTGGTTTGGGTTTATAAGATTTTATGTAAATATTGTTAACGATAAGAACATTTCTCAAACAATGAGGTAGTTTTTGTTCTATTATGTTCTACATACCAAGTGTATGTTGATGATTGATTATTCAAG

mRNA sequence

AGTGGACCTAAATTCATTTCCACCTCTAAATAAGGGCCATTCCTACAAATTTTTCGAGAGAAAATGATGTGTGGTTGATTTAGCCACACCCAACCCATCGTCTTCCTTTACGAATAACCGTGTAAGAGCCGACGGAGCAATTGCTCGCCGACTTCGATCGGACTAAATTTCCGTTACTATGTAAATCGCTGTGACCGGAACGAACCCTTTTGCGAAGTAAGCGTCTCTCCTTGTCATGGACATCTTTTCTTCAACTTGAATTCATGGAGGTGGTCGGTTCAGTCAAACTCTTTTTAGCTTCACCGGCGAACACGGTTTCAGGCTTAGTTTACTCTCCCTCCTCTTATTCTTCCTTATTTTCTCCCAAGAAGAAGAAGAAGAAGAAGAAGCTGCTTGTTGTGTCAAAGAGCAAAAAGCAGCCTCAAACTTCATCGGCTGGTTCGGATCCTCCTCCGAGAATTACGTCAAATCTGAAGCAGAATTTGCAGTTTCTGAGATTGTGGAAGGATTTTCAAAAGAGAAAATCTGGCGTACCTAAGCCTGCTACTAGTTACCGGAGGAAGAAGGTTGAGAAGGAGGACCTCCCAGAAGATACGGAGCTTTATCGTGATCCCACATTGGCTCTTTACCATACAAACCAGGGCATAGACAATGTATTCCCTGTCTTGCTGGTTGATGGATATAATGTGTGTGGCTACTGGGTGAAACTGAAGAAACATTTTATGAATGGGAGACTTGATGTAGCTCGCCAAAAGCTAATTGACGAGCTTATTACGTTCAGTATGCTGAGAGAGGTCAAAGTGGTAGTTGTATTTGATGCGATGCTGTCTGGACTCCCAACACACAAGGAAAACTTTGCCGGAATTGACGTGGTTTATTCAGGTGAATCGTGTGCAGATACATGGATTGAAACAGAGGTTGTAGCCCTGAAGGAGGATGGATGTCCCAAAGTTTGGGTAGTGACTTCTGATGTCTGTCATCAGCATGCAGCCCATGGAGCAGGAGCCTTTATTTGGAGTTGCAAGGCATTAGTTTCCGAGATTAATGCATCAAAAAAGGAGGTCGAAATGATGTTGCAAGAACACAGATCAACATCTTTCCAAGGGAAATTGTTAAAGCATAATCTCGATTCAGAAGTTGTCAATGCTCTCAATGATCTGAAAAGGAAGCTAAACGAAAGTGAACCAAAGTAGATGGGGAAGATCGTGTTTGTACAAATACTCTTTCTCTAATTTCATTGCTTCTTCATTAATTTAGGCTGAATCTCATGTTCATAGGCATGTCGCAGTCTGTTTCAAGCTGATTAATTGAAGACAAACCACATATTTTTTCTTTTCTTTTTCTGTGTACAAATCCCCTAGACGTCTGACAGAAAATAACATTAAGTTTGTTGAGAATGGGGACGAAGATGTAGGTTCCCTAGCCTTTCTGAATTCCAGATGTAGGGTCTCTATGTTCCAACAGTGAAGAGAAGCATATAATATTTGTGCATGACGTGTTTTGGCATATTTAAAAATATAGGAGATGATATGGGAGAAGGAAATAAAAGAAGAGAAGCCAGCTTTCTCATTTGTCAGCATTACCGTTTGTAGAATAGCATTTTTTAGTTCAGCTTTGGCCCCATTTCATTAAATGAAATAGAGCTACCACCACCGCATTTGAAAGTCATGAGTTAATAGAGGGACTACTGCTCATATTTAAGTGGTCGGGCTTATGAGATTTTATGTAAATGTTGTAACCATCGATTTTAGGAAGAAAAATCCTCAAACAATGACTTGGTTTTTGTTGTTTTATAATGTTGGTGAGGTTATAAATGTTGAGGATCACCCCCTGCACCGTAGTTGAAAGTCATGAGACTATCTCATAATACTCACAATTTAAGTGGTTTGGGTTTATAAGATTTTATGTAAATATTGTTAACGATAAGAACATTTCTCAAACAATGAGGTAGTTTTTGTTCTATTATGTTCTACATACCAAGTGTATGTTGATGATTGATT
ATTCAAG

Coding sequence (CDS)

ATGGAGGTGGTCGGTTCAGTCAAACTCTTTTTAGCTTCACCGGCGAACACGGTTTCAGGCTTAGTTTACTCTCCCTCCTCTTATTCTTCCTTATTTTCTCCCAAGAAGAAGAAGAAGAAGAAGAAGCTGCTTGTTGTGTCAAAGAGCAAAAAGCAGCCTCAAACTTCATCGGCTGGTTCGGATCCTCCTCCGAGAATTACGTCAAATCTGAAGCAGAATTTGCAGTTTCTGAGATTGTGGAAGGATTTTCAAAAGAGAAAATCTGGCGTACCTAAGCCTGCTACTAGTTACCGGAGGAAGAAGGTTGAGAAGGAGGACCTCCCAGAAGATACGGAGCTTTATCGTGATCCCACATTGGCTCTTTACCATACAAACCAGGGCATAGACAATGTATTCCCTGTCTTGCTGGTTGATGGATATAATGTGTGTGGCTACTGGGTGAAACTGAAGAAACATTTTATGAATGGGAGACTTGATGTAGCTCGCCAAAAGCTAATTGACGAGCTTATTACGTTCAGTATGCTGAGAGAGGTCAAAGTGGTAGTTGTATTTGATGCGATGCTGTCTGGACTCCCAACACACAAGGAAAACTTTGCCGGAATTGACGTGGTTTATTCAGGTGAATCGTGTGCAGATACATGGATTGAAACAGAGGTTGTAGCCCTGAAGGAGGATGGATGTCCCAAAGTTTGGGTAGTGACTTCTGATGTCTGTCATCAGCATGCAGCCCATGGAGCAGGAGCCTTTATTTGGAGTTGCAAGGCATTAGTTTCCGAGATTAATGCATCAAAAAAGGAGGTCGAAATGATGTTGCAAGAACACAGATCAACATCTTTCCAAGGGAAATTGTTAAAGCATAATCTCGATTCAGAAGTTGTCAATGCTCTCAATGATCTGAAAAGGAAGCTAAACGAAAGTGAACCAAAGTAG

Protein sequence

MEVVGSVKLFLASPANTVSGLVYSPSSYSSLFSPKKKKKKKKLLVVSKSKKQPQTSSAGSDPPPRITSNLKQNLQFLRLWKDFQKRKSGVPKPATSYRRKKVEKEDLPEDTELYRDPTLALYHTNQGIDNVFPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLREVKVVVVFDAMLSGLPTHKENFAGIDVVYSGESCADTWIETEVVALKEDGCPKVWVVTSDVCHQHAAHGAGAFIWSCKALVSEINASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALNDLKRKLNESEPK*
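The protein sequence above corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates this, assuming the standard genetic code (NCBI translation table 1); the record does not say which table was used, and `translate` is a hypothetical helper name. Only the first and last few codons of the CDS are shown as inputs.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a CDS in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First seven codons and last four codons of the CDS listed above.
print(translate("ATGGAGGTGGTCGGTTCAGTC"))  # MEVVGSV
print(translate("GAACCAAAGTAG"))           # EPK*
```

The two outputs match the start (MEVVGSV) and the end (EPK*) of the protein sequence listed above.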
Homology
BLAST of CSPI03G06640 vs. ExPASy Swiss-Prot
Match: P37574 (Uncharacterized protein YacP OS=Bacillus subtilis (strain 168) OX=224308 GN=yacP PE=1 SV=1)

HSP 1 Score: 65.9 bits (159), Expect = 9.5e-10
Identity = 53/170 (31.18%), Positives = 86/170 (50.59%), Query Frame = 0

Query: 134 VLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLREVKVVVVFDA-MLSGLP 193
           +LLVDGYN+ G W +LK    N   + AR  LI ++  +      +V+VVFDA ++ GL 
Sbjct: 3   ILLVDGYNMIGAWPQLKDLKANS-FEEARDVLIQKMAEYQSYTGNRVIVVFDAHLVKGLE 62

Query: 194 THKENFAGIDVVYSGES-CADTWIETEVVALKEDGCPKVWVVTSDVCHQHAAHGAGAFIW 253
             + N   ++V+++ E+  AD  IE    AL  +   ++ V TSD   Q A  G GA   
Sbjct: 63  KKQTNHR-VEVIFTKENETADERIEKLAQAL-NNIATQIHVATSDYTEQWAIFGQGALRK 122

Query: 254 SCKALVSEINASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALNDLKR 302
           S + L+ E+   ++ +E  +++  S    GK+    L  EV+      +R
Sbjct: 123 SARELLREVETIERRIERRVRKITSEKPAGKIA---LSEEVLKTFEKWRR 166
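The percentages on each HSP statistics line are the matched positions divided by the alignment length. As a minimal sketch (the helper name `hsp_percentages` is hypothetical, and the parsing relies only on the `n/m` fractions in the line), the figures for the Swiss-Prot hit above can be recomputed like this:

```python
import re

def hsp_percentages(stats_line):
    """Recompute percentages from every 'n/m' fraction in an HSP stats line."""
    pairs = re.findall(r"(\d+)/(\d+)", stats_line)
    return [round(100 * int(n) / int(m), 2) for n, m in pairs]

line = "Identity = 53/170 (31.18%), Positives = 86/170 (50.59%), Query Frame = 0"
print(hsp_percentages(line))  # [31.18, 50.59]
```

Here 53 identical and 86 similar positions over an alignment length of 170 give 31.18% and 50.59%, in agreement with the reported values.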

BLAST of CSPI03G06640 vs. ExPASy TrEMBL
Match: A0A1S3BUE5 (uncharacterized protein YacP isoform X1 OS=Cucumis melo OX=3656 GN=LOC103493288 PE=4 SV=1)

HSP 1 Score: 569.7 bits (1467), Expect = 7.5e-159
Identity = 297/311 (95.50%), Positives = 300/311 (96.46%), Query Frame = 0

Query: 1   MEVVGSVKLFLASPANTVSGLVYSPSSYSS-LFSP-KKKKKKKKLLVVSKSKKQPQTSSA 60
           MEVVGSVKLF ASPAN+VSGL YSPSSYSS LFSP KKKKKKKKLLVVSKSKKQPQ SSA
Sbjct: 1   MEVVGSVKLFSASPANSVSGLSYSPSSYSSTLFSPKKKKKKKKKLLVVSKSKKQPQISSA 60

Query: 61  GSDPPPRITSNLKQNLQFLRLWKDFQKRKSGVPKPATSYRRKKVEKEDLPEDTELYRDPT 120
           GSD PPRITSNLKQNLQFLRLWK+FQKRKSGVPKPATSYRRKKVEKEDLP DTELYRDPT
Sbjct: 61  GSD-PPRITSNLKQNLQFLRLWKEFQKRKSGVPKPATSYRRKKVEKEDLPGDTELYRDPT 120

Query: 121 LALYHTNQGIDNVFPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLREV 180
           LALY+TNQGIDN  PVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLREV
Sbjct: 121 LALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLREV 180

Query: 181 KVVVVFDAMLSGLPTHKENFAGIDVVYSGESCADTWIETEVVALKEDGCPKVWVVTSDVC 240
           KVVVVFDAMLSGLPTHKENFAGIDVVYSGESCADTWIETEVVALKEDGCPKVWVVTSDVC
Sbjct: 181 KVVVVFDAMLSGLPTHKENFAGIDVVYSGESCADTWIETEVVALKEDGCPKVWVVTSDVC 240

Query: 241 HQHAAHGAGAFIWSCKALVSEINASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALND 300
            QHAAHGAGAFIWSCKALVSEI ASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALND
Sbjct: 241 QQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALND 300

Query: 301 LKRKLNESEPK 310
           LKRKLNESEPK
Sbjct: 301 LKRKLNESEPK 310

BLAST of CSPI03G06640 vs. ExPASy TrEMBL
Match: A0A6J1C7Z2 (uncharacterized protein LOC111008254 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111008254 PE=4 SV=1)

HSP 1 Score: 562.0 bits (1447), Expect = 1.6e-156
Identity = 286/308 (92.86%), Positives = 294/308 (95.45%), Query Frame = 0

Query: 1   MEVVGSVKLFLASPANTVSGLVYSPSSYSSLFS-PKKKKKKKKLLVVSKSKKQPQTSSAG 60
           MEVVG VKLF +SPANTVSGL YS SSY+S FS PKKKKKKKKLLVVSKSKKQPQTSS G
Sbjct: 1   MEVVGPVKLFSSSPANTVSGLCYSASSYTSSFSPPKKKKKKKKLLVVSKSKKQPQTSSGG 60

Query: 61  SDPPPRITSNLKQNLQFLRLWKDFQKRKSGVPKPATSYRRKKVEKEDLPEDTELYRDPTL 120
           SDPPPRITSNLKQNLQFLRLWK+FQKRKSG PKPATSYRRKKVEKEDLP DT+LYRDPTL
Sbjct: 61  SDPPPRITSNLKQNLQFLRLWKEFQKRKSGTPKPATSYRRKKVEKEDLPGDTDLYRDPTL 120

Query: 121 ALYHTNQGIDNVFPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLREVK 180
           ALY+TNQGIDN  PVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLREVK
Sbjct: 121 ALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLREVK 180

Query: 181 VVVVFDAMLSGLPTHKENFAGIDVVYSGESCADTWIETEVVALKEDGCPKVWVVTSDVCH 240
           VVVVFDAMLSGLPTHKENFAGIDVV+SGESCADTWIETEVVALKEDGCPKVWVVTSD+C 
Sbjct: 181 VVVVFDAMLSGLPTHKENFAGIDVVFSGESCADTWIETEVVALKEDGCPKVWVVTSDICQ 240

Query: 241 QHAAHGAGAFIWSCKALVSEINASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALNDL 300
           QHAAHGAGAFIWSCKALVSEI ASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALNDL
Sbjct: 241 QHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALNDL 300

Query: 301 KRKLNESE 308
           KRKL E+E
Sbjct: 301 KRKLTENE 308

BLAST of CSPI03G06640 vs. ExPASy TrEMBL
Match: A0A6J1C4B6 (uncharacterized protein LOC111008254 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111008254 PE=4 SV=1)

HSP 1 Score: 557.0 bits (1434), Expect = 5.0e-155
Identity = 286/310 (92.26%), Positives = 294/310 (94.84%), Query Frame = 0

Query: 1   MEVVGSVKLFLASPANTVSGLVYSPSSYSSLFS-PKKKKKKKKLLVVSKSKKQPQTSS-- 60
           MEVVG VKLF +SPANTVSGL YS SSY+S FS PKKKKKKKKLLVVSKSKKQPQTSS  
Sbjct: 1   MEVVGPVKLFSSSPANTVSGLCYSASSYTSSFSPPKKKKKKKKLLVVSKSKKQPQTSSLQ 60

Query: 61  AGSDPPPRITSNLKQNLQFLRLWKDFQKRKSGVPKPATSYRRKKVEKEDLPEDTELYRDP 120
            GSDPPPRITSNLKQNLQFLRLWK+FQKRKSG PKPATSYRRKKVEKEDLP DT+LYRDP
Sbjct: 61  GGSDPPPRITSNLKQNLQFLRLWKEFQKRKSGTPKPATSYRRKKVEKEDLPGDTDLYRDP 120

Query: 121 TLALYHTNQGIDNVFPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLRE 180
           TLALY+TNQGIDN  PVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLRE
Sbjct: 121 TLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLRE 180

Query: 181 VKVVVVFDAMLSGLPTHKENFAGIDVVYSGESCADTWIETEVVALKEDGCPKVWVVTSDV 240
           VKVVVVFDAMLSGLPTHKENFAGIDVV+SGESCADTWIETEVVALKEDGCPKVWVVTSD+
Sbjct: 181 VKVVVVFDAMLSGLPTHKENFAGIDVVFSGESCADTWIETEVVALKEDGCPKVWVVTSDI 240

Query: 241 CHQHAAHGAGAFIWSCKALVSEINASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALN 300
           C QHAAHGAGAFIWSCKALVSEI ASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALN
Sbjct: 241 CQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALN 300

Query: 301 DLKRKLNESE 308
           DLKRKL E+E
Sbjct: 301 DLKRKLTENE 310

BLAST of CSPI03G06640 vs. ExPASy TrEMBL
Match: A0A6J1FQV5 (uncharacterized protein LOC111447625 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111447625 PE=4 SV=1)

HSP 1 Score: 551.2 bits (1419), Expect = 2.7e-153
Identity = 279/309 (90.29%), Positives = 291/309 (94.17%), Query Frame = 0

Query: 1   MEVVGSVKLFLASPANTVSGLVYSPSSYSSLFSPKKKKKKKKLLVVSKSKKQPQTSSAGS 60
           MEVVGSVKLF +SP NTVSGL YSPSSY+SL SPKKKKKKK  LVVSKSKKQPQ+ S  S
Sbjct: 1   MEVVGSVKLFSSSPTNTVSGLCYSPSSYTSLLSPKKKKKKKNFLVVSKSKKQPQSPSGDS 60

Query: 61  DPPPRITSNLKQNLQFLRLWKDFQKRKSGVPKPATSYRRKKVEKEDLPEDTELYRDPTLA 120
           D PPRITSNLKQNLQFL+LWKDFQKRKS VPKPATSYR+KKVEKEDLP DTELYRDPTL 
Sbjct: 61  D-PPRITSNLKQNLQFLKLWKDFQKRKSSVPKPATSYRKKKVEKEDLPGDTELYRDPTLT 120

Query: 121 LYHTNQGIDNVFPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLREVKV 180
           LY+TNQGIDN  PVLLVDGYNVCGYWVKLKKHFM+GRLDVARQKLIDELITFSMLREVKV
Sbjct: 121 LYYTNQGIDNTVPVLLVDGYNVCGYWVKLKKHFMSGRLDVARQKLIDELITFSMLREVKV 180

Query: 181 VVVFDAMLSGLPTHKENFAGIDVVYSGESCADTWIETEVVALKEDGCPKVWVVTSDVCHQ 240
           VVVFDAMLSGLPTHKE+FAGIDVVYSGESCADTWIE EVVAL+EDGCPKVWVVTSD+CHQ
Sbjct: 181 VVVFDAMLSGLPTHKEDFAGIDVVYSGESCADTWIEKEVVALREDGCPKVWVVTSDICHQ 240

Query: 241 HAAHGAGAFIWSCKALVSEINASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALNDLK 300
           HAAHGAGAFIWSCKALV+EI ASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALNDLK
Sbjct: 241 HAAHGAGAFIWSCKALVTEIKASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALNDLK 300

Query: 301 RKLNESEPK 310
           RKLNE+E K
Sbjct: 301 RKLNENESK 308

BLAST of CSPI03G06640 vs. ExPASy TrEMBL
Match: A0A6J1JLX1 (uncharacterized protein LOC111483584 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111483584 PE=4 SV=1)

HSP 1 Score: 548.9 bits (1413), Expect = 1.4e-152
Identity = 280/310 (90.32%), Positives = 291/310 (93.87%), Query Frame = 0

Query: 1   MEVVGSVKLFLASPANTVSGLVYSPSSYSSLFSPKKKKKKKK-LLVVSKSKKQPQTSSAG 60
           MEVVGSVKLF +SP N VSGL YSPSSY+SLFSPKKKKKKKK  LVVSKSKKQPQ+ S  
Sbjct: 1   MEVVGSVKLFSSSPTNIVSGLCYSPSSYTSLFSPKKKKKKKKNFLVVSKSKKQPQSPSGD 60

Query: 61  SDPPPRITSNLKQNLQFLRLWKDFQKRKSGVPKPATSYRRKKVEKEDLPEDTELYRDPTL 120
           SD PPRITSNLKQNLQFL+LWKDFQKRKS  PKPATSYR+KKVEKEDLP DTELYRDPTL
Sbjct: 61  SD-PPRITSNLKQNLQFLKLWKDFQKRKSSAPKPATSYRKKKVEKEDLPGDTELYRDPTL 120

Query: 121 ALYHTNQGIDNVFPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLREVK 180
            LY+TNQGIDN  PVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLREVK
Sbjct: 121 TLYYTNQGIDNTVPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLREVK 180

Query: 181 VVVVFDAMLSGLPTHKENFAGIDVVYSGESCADTWIETEVVALKEDGCPKVWVVTSDVCH 240
           VVVVFDAMLSGLPTHKE+FAGIDVVYSGESCADTWIE EVVAL+EDGCPKVWVVTSD+CH
Sbjct: 181 VVVVFDAMLSGLPTHKEDFAGIDVVYSGESCADTWIEKEVVALREDGCPKVWVVTSDICH 240

Query: 241 QHAAHGAGAFIWSCKALVSEINASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALNDL 300
           QHAAHGAGAFIWSCKALV+EI ASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALNDL
Sbjct: 241 QHAAHGAGAFIWSCKALVTEIKASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALNDL 300

Query: 301 KRKLNESEPK 310
           KRKLNE+E K
Sbjct: 301 KRKLNENESK 309

BLAST of CSPI03G06640 vs. NCBI nr
Match: XP_004133742.2 (uncharacterized protein LOC101212837 isoform X1 [Cucumis sativus] >KAE8650197.1 hypothetical protein Csa_010763 [Cucumis sativus])

HSP 1 Score: 610.9 bits (1574), Expect = 6.0e-171
Identity = 309/309 (100.00%), Positives = 309/309 (100.00%), Query Frame = 0

Query: 1   MEVVGSVKLFLASPANTVSGLVYSPSSYSSLFSPKKKKKKKKLLVVSKSKKQPQTSSAGS 60
           MEVVGSVKLFLASPANTVSGLVYSPSSYSSLFSPKKKKKKKKLLVVSKSKKQPQTSSAGS
Sbjct: 1   MEVVGSVKLFLASPANTVSGLVYSPSSYSSLFSPKKKKKKKKLLVVSKSKKQPQTSSAGS 60

Query: 61  DPPPRITSNLKQNLQFLRLWKDFQKRKSGVPKPATSYRRKKVEKEDLPEDTELYRDPTLA 120
           DPPPRITSNLKQNLQFLRLWKDFQKRKSGVPKPATSYRRKKVEKEDLPEDTELYRDPTLA
Sbjct: 61  DPPPRITSNLKQNLQFLRLWKDFQKRKSGVPKPATSYRRKKVEKEDLPEDTELYRDPTLA 120

Query: 121 LYHTNQGIDNVFPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLREVKV 180
           LYHTNQGIDNVFPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLREVKV
Sbjct: 121 LYHTNQGIDNVFPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLREVKV 180

Query: 181 VVVFDAMLSGLPTHKENFAGIDVVYSGESCADTWIETEVVALKEDGCPKVWVVTSDVCHQ 240
           VVVFDAMLSGLPTHKENFAGIDVVYSGESCADTWIETEVVALKEDGCPKVWVVTSDVCHQ
Sbjct: 181 VVVFDAMLSGLPTHKENFAGIDVVYSGESCADTWIETEVVALKEDGCPKVWVVTSDVCHQ 240

Query: 241 HAAHGAGAFIWSCKALVSEINASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALNDLK 300
           HAAHGAGAFIWSCKALVSEINASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALNDLK
Sbjct: 241 HAAHGAGAFIWSCKALVSEINASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALNDLK 300

Query: 301 RKLNESEPK 310
           RKLNESEPK
Sbjct: 301 RKLNESEPK 309

BLAST of CSPI03G06640 vs. NCBI nr
Match: XP_038904183.1 (uncharacterized protein YacP [Benincasa hispida])

HSP 1 Score: 577.4 bits (1487), Expect = 7.4e-161
Identity = 291/309 (94.17%), Positives = 298/309 (96.44%), Query Frame = 0

Query: 1   MEVVGSVKLFLASPANTVSGLVYSPSSYSSLFSPKKKKKKKKLLVVSKSKKQPQTSSAGS 60
           MEV+GSVKLF +SPANTVSGL YSPSSYSS FSPKKKKKKKKLLVVSKSK+QPQTSS  S
Sbjct: 1   MEVIGSVKLFSSSPANTVSGLSYSPSSYSSSFSPKKKKKKKKLLVVSKSKRQPQTSSGAS 60

Query: 61  DPPPRITSNLKQNLQFLRLWKDFQKRKSGVPKPATSYRRKKVEKEDLPEDTELYRDPTLA 120
           DPPPRITSNLKQNLQFL+LWK+FQKRKSG PKPATSYR+KKVEKEDLP DTELYRDPTLA
Sbjct: 61  DPPPRITSNLKQNLQFLKLWKEFQKRKSGAPKPATSYRKKKVEKEDLPGDTELYRDPTLA 120

Query: 121 LYHTNQGIDNVFPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLREVKV 180
           LY+TNQGIDN  PVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLREVKV
Sbjct: 121 LYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLREVKV 180

Query: 181 VVVFDAMLSGLPTHKENFAGIDVVYSGESCADTWIETEVVALKEDGCPKVWVVTSDVCHQ 240
           VVVFDAMLSGLPTHKENFAGIDVVYSGESCADTWIETEVVALKEDGCPKVWVVTSDVC Q
Sbjct: 181 VVVFDAMLSGLPTHKENFAGIDVVYSGESCADTWIETEVVALKEDGCPKVWVVTSDVCQQ 240

Query: 241 HAAHGAGAFIWSCKALVSEINASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALNDLK 300
           HAAHGAGAFIWSCKALVSEI ASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALNDLK
Sbjct: 241 HAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALNDLK 300

Query: 301 RKLNESEPK 310
           RKLNESEPK
Sbjct: 301 RKLNESEPK 309

BLAST of CSPI03G06640 vs. NCBI nr
Match: XP_008452195.1 (PREDICTED: uncharacterized protein YacP isoform X1 [Cucumis melo])

HSP 1 Score: 569.7 bits (1467), Expect = 1.5e-158
Identity = 297/311 (95.50%), Positives = 300/311 (96.46%), Query Frame = 0

Query: 1   MEVVGSVKLFLASPANTVSGLVYSPSSYSS-LFSP-KKKKKKKKLLVVSKSKKQPQTSSA 60
           MEVVGSVKLF ASPAN+VSGL YSPSSYSS LFSP KKKKKKKKLLVVSKSKKQPQ SSA
Sbjct: 1   MEVVGSVKLFSASPANSVSGLSYSPSSYSSTLFSPKKKKKKKKKLLVVSKSKKQPQISSA 60

Query: 61  GSDPPPRITSNLKQNLQFLRLWKDFQKRKSGVPKPATSYRRKKVEKEDLPEDTELYRDPT 120
           GSD PPRITSNLKQNLQFLRLWK+FQKRKSGVPKPATSYRRKKVEKEDLP DTELYRDPT
Sbjct: 61  GSD-PPRITSNLKQNLQFLRLWKEFQKRKSGVPKPATSYRRKKVEKEDLPGDTELYRDPT 120

Query: 121 LALYHTNQGIDNVFPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLREV 180
           LALY+TNQGIDN  PVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLREV
Sbjct: 121 LALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLREV 180

Query: 181 KVVVVFDAMLSGLPTHKENFAGIDVVYSGESCADTWIETEVVALKEDGCPKVWVVTSDVC 240
           KVVVVFDAMLSGLPTHKENFAGIDVVYSGESCADTWIETEVVALKEDGCPKVWVVTSDVC
Sbjct: 181 KVVVVFDAMLSGLPTHKENFAGIDVVYSGESCADTWIETEVVALKEDGCPKVWVVTSDVC 240

Query: 241 HQHAAHGAGAFIWSCKALVSEINASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALND 300
            QHAAHGAGAFIWSCKALVSEI ASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALND
Sbjct: 241 QQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALND 300

Query: 301 LKRKLNESEPK 310
           LKRKLNESEPK
Sbjct: 301 LKRKLNESEPK 310

BLAST of CSPI03G06640 vs. NCBI nr
Match: XP_022136583.1 (uncharacterized protein LOC111008254 isoform X2 [Momordica charantia])

HSP 1 Score: 562.0 bits (1447), Expect = 3.2e-156
Identity = 286/308 (92.86%), Positives = 294/308 (95.45%), Query Frame = 0

Query: 1   MEVVGSVKLFLASPANTVSGLVYSPSSYSSLFS-PKKKKKKKKLLVVSKSKKQPQTSSAG 60
           MEVVG VKLF +SPANTVSGL YS SSY+S FS PKKKKKKKKLLVVSKSKKQPQTSS G
Sbjct: 1   MEVVGPVKLFSSSPANTVSGLCYSASSYTSSFSPPKKKKKKKKLLVVSKSKKQPQTSSGG 60

Query: 61  SDPPPRITSNLKQNLQFLRLWKDFQKRKSGVPKPATSYRRKKVEKEDLPEDTELYRDPTL 120
           SDPPPRITSNLKQNLQFLRLWK+FQKRKSG PKPATSYRRKKVEKEDLP DT+LYRDPTL
Sbjct: 61  SDPPPRITSNLKQNLQFLRLWKEFQKRKSGTPKPATSYRRKKVEKEDLPGDTDLYRDPTL 120

Query: 121 ALYHTNQGIDNVFPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLREVK 180
           ALY+TNQGIDN  PVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLREVK
Sbjct: 121 ALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLREVK 180

Query: 181 VVVVFDAMLSGLPTHKENFAGIDVVYSGESCADTWIETEVVALKEDGCPKVWVVTSDVCH 240
           VVVVFDAMLSGLPTHKENFAGIDVV+SGESCADTWIETEVVALKEDGCPKVWVVTSD+C 
Sbjct: 181 VVVVFDAMLSGLPTHKENFAGIDVVFSGESCADTWIETEVVALKEDGCPKVWVVTSDICQ 240

Query: 241 QHAAHGAGAFIWSCKALVSEINASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALNDL 300
           QHAAHGAGAFIWSCKALVSEI ASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALNDL
Sbjct: 241 QHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALNDL 300

Query: 301 KRKLNESE 308
           KRKL E+E
Sbjct: 301 KRKLTENE 308

BLAST of CSPI03G06640 vs. NCBI nr
Match: XP_022136580.1 (uncharacterized protein LOC111008254 isoform X1 [Momordica charantia] >XP_022136582.1 uncharacterized protein LOC111008254 isoform X1 [Momordica charantia])

HSP 1 Score: 557.0 bits (1434), Expect = 1.0e-154
Identity = 286/310 (92.26%), Positives = 294/310 (94.84%), Query Frame = 0

Query: 1   MEVVGSVKLFLASPANTVSGLVYSPSSYSSLFS-PKKKKKKKKLLVVSKSKKQPQTSS-- 60
           MEVVG VKLF +SPANTVSGL YS SSY+S FS PKKKKKKKKLLVVSKSKKQPQTSS  
Sbjct: 1   MEVVGPVKLFSSSPANTVSGLCYSASSYTSSFSPPKKKKKKKKLLVVSKSKKQPQTSSLQ 60

Query: 61  AGSDPPPRITSNLKQNLQFLRLWKDFQKRKSGVPKPATSYRRKKVEKEDLPEDTELYRDP 120
            GSDPPPRITSNLKQNLQFLRLWK+FQKRKSG PKPATSYRRKKVEKEDLP DT+LYRDP
Sbjct: 61  GGSDPPPRITSNLKQNLQFLRLWKEFQKRKSGTPKPATSYRRKKVEKEDLPGDTDLYRDP 120

Query: 121 TLALYHTNQGIDNVFPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLRE 180
           TLALY+TNQGIDN  PVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLRE
Sbjct: 121 TLALYYTNQGIDNAVPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLRE 180

Query: 181 VKVVVVFDAMLSGLPTHKENFAGIDVVYSGESCADTWIETEVVALKEDGCPKVWVVTSDV 240
           VKVVVVFDAMLSGLPTHKENFAGIDVV+SGESCADTWIETEVVALKEDGCPKVWVVTSD+
Sbjct: 181 VKVVVVFDAMLSGLPTHKENFAGIDVVFSGESCADTWIETEVVALKEDGCPKVWVVTSDI 240

Query: 241 CHQHAAHGAGAFIWSCKALVSEINASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALN 300
           C QHAAHGAGAFIWSCKALVSEI ASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALN
Sbjct: 241 CQQHAAHGAGAFIWSCKALVSEIKASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALN 300

Query: 301 DLKRKLNESE 308
           DLKRKL E+E
Sbjct: 301 DLKRKLTENE 310

BLAST of CSPI03G06640 vs. TAIR 10
Match: AT2G02410.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF901 (InterPro:IPR010298); Has 1151 Blast hits to 1151 proteins in 597 species: Archaea - 0; Bacteria - 1105; Metazoa - 0; Fungi - 0; Plants - 42; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink).)

HSP 1 Score: 401.0 bits (1029), Expect = 8.8e-112
Identity = 201/274 (73.36%), Positives = 239/274 (87.23%), Query Frame = 0

Query: 42  KLLVV---SKSKKQPQTSS-AGSDP-PPRITSNLKQNLQFLRLWKDFQKRKSGVPKPATS 101
           ++LVV    KSKK  Q+SS   S+P PPRI SN+K NLQ L+LWK+FQ R SG+ KPATS
Sbjct: 34  RVLVVKMGGKSKKPHQSSSFKESEPEPPRIKSNVKHNLQLLKLWKEFQSRGSGMAKPATS 93

Query: 102 YRRKKVEKEDLPEDTELYRDPTLALYHTNQG-IDNVFPVLLVDGYNVCGYWVKLKKHFMN 161
           YR+KKVEK++LP+D+ELYRDPT  LY+TNQG +D+  PVLLVDGYNVCGYW+KLKKHFM 
Sbjct: 94  YRKKKVEKDELPDDSELYRDPTNTLYYTNQGLLDDAVPVLLVDGYNVCGYWMKLKKHFMK 153

Query: 162 GRLDVARQKLIDELITFSMLREVKVVVVFDAMLSGLPTHKENFAGIDVVYSGESCADTWI 221
           GRLDVARQKL+DEL++FSM++EVKVVVVFDA++SGLPTHKE+FAG+DV++SGE+CAD WI
Sbjct: 154 GRLDVARQKLVDELVSFSMVKEVKVVVVFDALMSGLPTHKEDFAGVDVIFSGETCADAWI 213

Query: 222 ETEVVALKEDGCPKVWVVTSDVCHQHAAHGAGAFIWSCKALVSEINASKKEVEMMLQEHR 281
           E EVVAL+EDGCPKVWVVTSDVC Q AAHGAGA+IWS KALVSEI +  KEVE M+QE R
Sbjct: 214 EKEVVALREDGCPKVWVVTSDVCQQQAAHGAGAYIWSSKALVSEIKSMHKEVEKMMQETR 273

Query: 282 STSFQGKLLKHNLDSEVVNALNDLKRKLNESEPK 310
           STSFQG+LLKHNLDSEVV+AL DL+ KL+E+E K
Sbjct: 274 STSFQGRLLKHNLDSEVVDALKDLRDKLSENETK 307

BLAST of CSPI03G06640 vs. TAIR 10
Match: AT2G02410.3 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF901 (InterPro:IPR010298). )

HSP 1 Score: 397.1 bits (1019), Expect = 1.3e-110
Identity = 191/252 (75.79%), Positives = 226/252 (89.68%), Query Frame = 0

Query: 60  SDP-PPRITSNLKQNLQFLRLWKDFQKRKSGVPKPATSYRRKKVEKEDLPEDTELYRDPT 119
           S+P PPRI SN+K NLQ L+LWK+FQ R SG+ KPATSYR+KKVEK++LP+D+ELYRDPT
Sbjct: 8   SEPEPPRIKSNVKHNLQLLKLWKEFQSRGSGMAKPATSYRKKKVEKDELPDDSELYRDPT 67

Query: 120 LALYHTNQG-IDNVFPVLLVDGYNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLRE 179
             LY+TNQG +D+  PVLLVDGYNVCGYW+KLKKHFM GRLDVARQKL+DEL++FSM++E
Sbjct: 68  NTLYYTNQGLLDDAVPVLLVDGYNVCGYWMKLKKHFMKGRLDVARQKLVDELVSFSMVKE 127

Query: 180 VKVVVVFDAMLSGLPTHKENFAGIDVVYSGESCADTWIETEVVALKEDGCPKVWVVTSDV 239
           VKVVVVFDA++SGLPTHKE+FAG+DV++SGE+CAD WIE EVVAL+EDGCPKVWVVTSDV
Sbjct: 128 VKVVVVFDALMSGLPTHKEDFAGVDVIFSGETCADAWIEKEVVALREDGCPKVWVVTSDV 187

Query: 240 CHQHAAHGAGAFIWSCKALVSEINASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALN 299
           C Q AAHGAGA+IWS KALVSEI +  KEVE M+QE RSTSFQG+LLKHNLDSEVV+AL 
Sbjct: 188 CQQQAAHGAGAYIWSSKALVSEIKSMHKEVEKMMQETRSTSFQGRLLKHNLDSEVVDALK 247

Query: 300 DLKRKLNESEPK 310
           DL+ KL+E+E K
Sbjct: 248 DLRDKLSENETK 259

BLAST of CSPI03G06640 vs. TAIR 10
Match: AT2G02410.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF901 (InterPro:IPR010298). )

HSP 1 Score: 365.9 bits (938), Expect = 3.1e-101
Identity = 175/230 (76.09%), Positives = 207/230 (90.00%), Query Frame = 0

Query: 81  KDFQKRKSGVPKPATSYRRKKVEKEDLPEDTELYRDPTLALYHTNQG-IDNVFPVLLVDG 140
           + FQ R SG+ KPATSYR+KKVEK++LP+D+ELYRDPT  LY+TNQG +D+  PVLLVDG
Sbjct: 13  RHFQSRGSGMAKPATSYRKKKVEKDELPDDSELYRDPTNTLYYTNQGLLDDAVPVLLVDG 72

Query: 141 YNVCGYWVKLKKHFMNGRLDVARQKLIDELITFSMLREVKVVVVFDAMLSGLPTHKENFA 200
           YNVCGYW+KLKKHFM GRLDVARQKL+DEL++FSM++EVKVVVVFDA++SGLPTHKE+FA
Sbjct: 73  YNVCGYWMKLKKHFMKGRLDVARQKLVDELVSFSMVKEVKVVVVFDALMSGLPTHKEDFA 132

Query: 201 GIDVVYSGESCADTWIETEVVALKEDGCPKVWVVTSDVCHQHAAHGAGAFIWSCKALVSE 260
           G+DV++SGE+CAD WIE EVVAL+EDGCPKVWVVTSDVC Q AAHGAGA+IWS KALVSE
Sbjct: 133 GVDVIFSGETCADAWIEKEVVALREDGCPKVWVVTSDVCQQQAAHGAGAYIWSSKALVSE 192

Query: 261 INASKKEVEMMLQEHRSTSFQGKLLKHNLDSEVVNALNDLKRKLNESEPK 310
           I +  KEVE M+QE RSTSFQG+LLKHNLDSEVV+AL DL+ KL+E+E K
Sbjct: 193 IKSMHKEVEKMMQETRSTSFQGRLLKHNLDSEVVDALKDLRDKLSENETK 242

The following BLAST results are available for this feature:

ExPASy Swiss-Prot
Match Name      E-value   Identity  Description
P37574          9.5e-10   31.18     Uncharacterized protein YacP OS=Bacillus subtilis (strain 168) OX=224308 GN=yacP...

ExPASy TrEMBL
Match Name      E-value   Identity  Description
A0A1S3BUE5      7.5e-159  95.50     uncharacterized protein YacP isoform X1 OS=Cucumis melo OX=3656 GN=LOC103493288 ...
A0A6J1C7Z2      1.6e-156  92.86     uncharacterized protein LOC111008254 isoform X2 OS=Momordica charantia OX=3673 G...
A0A6J1C4B6      5.0e-155  92.26     uncharacterized protein LOC111008254 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1FQV5      2.7e-153  90.29     uncharacterized protein LOC111447625 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1JLX1      1.4e-152  90.32     uncharacterized protein LOC111483584 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...

NCBI nr
Match Name      E-value   Identity  Description
XP_004133742.2  6.0e-171  100.00    uncharacterized protein LOC101212837 isoform X1 [Cucumis sativus] >KAE8650197.1 ...
XP_038904183.1  7.4e-161  94.17     uncharacterized protein YacP [Benincasa hispida]
XP_008452195.1  1.5e-158  95.50     PREDICTED: uncharacterized protein YacP isoform X1 [Cucumis melo]
XP_022136583.1  3.2e-156  92.86     uncharacterized protein LOC111008254 isoform X2 [Momordica charantia]
XP_022136580.1  1.0e-154  92.26     uncharacterized protein LOC111008254 isoform X1 [Momordica charantia] >XP_022136...

TAIR 10
Match Name      E-value   Identity  Description
AT2G02410.1     8.8e-112  73.36     unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF901 ...
AT2G02410.3     1.3e-110  75.79     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G02410.2     3.1e-101  76.09     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                     Source       Source Term  Source Description            Alignment
IPR010298  Protein of unknown function DUF901  PFAM         PF05991      NYN_YacP                      coord: 135..245; e-value: 3.5E-29; score: 101.9
IPR010298  Protein of unknown function DUF901  PANTHER      PTHR34547    YACP-LIKE NYN DOMAIN PROTEIN  coord: 37..246
None       No IPR available                    MOBIDB_LITE  mobidb-lite  disorder_prediction           coord: 28..67
None       No IPR available                    MOBIDB_LITE  mobidb-lite  disorder_prediction           coord: 33..47
None       No IPR available                    MOBIDB_LITE  mobidb-lite  disorder_prediction           coord: 48..67
None       No IPR available                    CDD          cd10912      PIN_YacP-like                 coord: 134..246; e-value: 6.54415E-42; score: 138.076

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI03G06640.1  CSPI03G06640.1  mRNA
CSPI03G06640.2  CSPI03G06640.2  mRNA