CSPI02G16650 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI02G16650
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: ARID domain-containing protein
Location: Chr2: 16005914 .. 16007285 (+)
RNA-Seq Expression: CSPI02G16650
Synteny: CSPI02G16650
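The Location field can be turned into a feature length with a few lines of code. The sketch below assumes the coordinates are 1-based and inclusive at both ends (a common genome-browser convention, but not stated on this page); the function names `parse_location` and `span_length` are illustrative and not part of any database API.

```python
import re

def parse_location(location: str):
    """Split a string such as 'Chr2: 16005914 .. 16007285 (+)' into its parts."""
    m = re.match(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", location)
    if m is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(location: str) -> int:
    """Length of the feature in bp, assuming 1-based inclusive coordinates."""
    _, start, end, _ = parse_location(location)
    return end - start + 1

print(parse_location("Chr2: 16005914 .. 16007285 (+)"))
print(span_length("Chr2: 16005914 .. 16007285 (+)"))  # 1372
```

Under that assumption the gene spans 1372 bp on the plus strand of Chr2.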
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

AAAAGAGAGAGAGAGAGAGATGTGATGATAAAAGAAGAAAAAGTGAAGCATAAGCTGTCCTTTAATATTTCTTTCTTTTCTTTTCTTTTCTTTTTAAATCTCTTAAACCCCTCATTCATTCATTCATTCATTCTGTCTATCACAAATTCACAAATTATGAGATCCATGAAGAGAAGATATCACTCCAAATCAAAACCCACTGCTCCACCGGCCGATCTTCTCGTCTGTTTTCCTGCACGTGCTCATCTCACTCTTCTCCCAAGTCCCGCCAGACCTCCCGCCGAACCCCATCGTCGTCACTACCGGAAAGCTCAACCAAGTCCTCTTCCATGGGCTAAGGAGATGTCGTCGGAGCCCACTTCCCCCAAGGTCACCTGCGCCGGCCAGATCAAAATCAGACCTCATTCTCATCGATCAACCAAGAATTGGCAATCGGTTATGGAAGAAATCGAGCGAATTCACAACAAAAAGAAAAATCCAATTCGAAATCAAAATCCTTTCGGTTTCAAGAGAGAAATTGTTAATTTCCTCTCGTGTTTACGGGGATTCCGTTTTGATTTCCGTTGTTTCAGAGGCTTCCCTCAATCCGACATTACAACAGATGATGAGGATGATGAAGAAGAAGTATTCGAACAACACGAACCCGAATCCGAATCCCAATCAGAATCAGAAGAAGAACCCACAAGAGGAAGAACGATGTTCTCAGAATGGTTCATGGTTTTACAGGAAGGCGAAGAAGAAACCAAACCCATAAACAACGACGCCGTTTCTTCACTCGAATTCCAACCTTCCTCTGTTTCTGTTCCTCCCCCAAATGCTCTCTTACTTATGCGTTGCAGGTCAGCCCCAAATTCTCTTTTACGAAAACCAAAACAACAAGAAGAAAAAGAATTAGAAGAAGAAGAAGAAGAAGAGAAATCGAAAATCAGCTTGAAGTTGTTAATGGAGGAAGAGAAAGAAATGGTGACTGCGAAGAAGAAAAGCTTGATGGTAATGGATTATGATGCTGATTTTTACAAACTTTCATCAGACATTGCTAAAGAAACATGGGTTGTGAGTGGATCAAGTACCAGTAGTAGTAGTAGTAGATGTAATGACGATCCATTGTTGAGAAGTCGAAGCTGGAAGAGATGATGAAGATCGAACAACTCAAATCAATAAAAAATTAAAAGAAAAAACTCTTGTGCTCTGCTTCCCTTGTTTTCTAATTTCCATGAAATTTCTGCCAAGAACTTGAAAACCCCACATGAGATTTTTGAAGATCTGATATGGGTTTTGTTTACTTTCTTTGAATCTGTAATAGATTAATATGAACATGATGTACAGTGAAATTTTACGATTATCTTAATTTTCAGCTTCAAAGCTTTCATTG

mRNA sequence

AAAAGAGAGAGAGAGAGAGATGTGATGATAAAAGAAGAAAAAGTGAAGCATAAGCTGTCCTTTAATATTTCTTTCTTTTCTTTTCTTTTCTTTTTAAATCTCTTAAACCCCTCATTCATTCATTCATTCATTCTGTCTATCACAAATTCACAAATTATGAGATCCATGAAGAGAAGATATCACTCCAAATCAAAACCCACTGCTCCACCGGCCGATCTTCTCGTCTGTTTTCCTGCACGTGCTCATCTCACTCTTCTCCCAAGTCCCGCCAGACCTCCCGCCGAACCCCATCGTCGTCACTACCGGAAAGCTCAACCAAGTCCTCTTCCATGGGCTAAGGAGATGTCGTCGGAGCCCACTTCCCCCAAGGTCACCTGCGCCGGCCAGATCAAAATCAGACCTCATTCTCATCGATCAACCAAGAATTGGCAATCGGTTATGGAAGAAATCGAGCGAATTCACAACAAAAAGAAAAATCCAATTCGAAATCAAAATCCTTTCGGTTTCAAGAGAGAAATTGTTAATTTCCTCTCGTGTTTACGGGGATTCCGTTTTGATTTCCGTTGTTTCAGAGGCTTCCCTCAATCCGACATTACAACAGATGATGAGGATGATGAAGAAGAAGTATTCGAACAACACGAACCCGAATCCGAATCCCAATCAGAATCAGAAGAAGAACCCACAAGAGGAAGAACGATGTTCTCAGAATGGTTCATGGTTTTACAGGAAGGCGAAGAAGAAACCAAACCCATAAACAACGACGCCGTTTCTTCACTCGAATTCCAACCTTCCTCTGTTTCTGTTCCTCCCCCAAATGCTCTCTTACTTATGCGTTGCAGGTCAGCCCCAAATTCTCTTTTACGAAAACCAAAACAACAAGAAGAAAAAGAATTAGAAGAAGAAGAAGAAGAAGAGAAATCGAAAATCAGCTTGAAGTTGTTAATGGAGGAAGAGAAAGAAATGGTGACTGCGAAGAAGAAAAGCTTGATGGTAATGGATTATGATGCTGATTTTTACAAACTTTCATCAGACATTGCTAAAGAAACATGGGTTGTGAGTGGATCAAGTACCAGTAGTAGTAGTAGTAGATGTAATGACGATCCATTGTTGAGAAGTCGAAGCTGGAAGAGATGATGAAGATCGAACAACTCAAATCAATAAAAAATTAAAAGAAAAAACTCTTGTGCTCTGCTTCCCTTGTTTTCTAATTTCCATGAAATTTCTGCCAAGAACTTGAAAACCCCACATGAGATTTTTGAAGATCTGATATGGGTTTTGTTTACTTTCTTTGAATCTGTAATAGATTAATATGAACATGATGTACAGTGAAATTTTACGATTATCTTAATTTTCAGCTTCAAAGCTTTCATTG

Coding sequence (CDS)

ATGATAAAAGAAGAAAAAGTGAAGCATAAGCTGTCCTTTAATATTTCTTTCTTTTCTTTTCTTTTCTTTTTAAATCTCTTAAACCCCTCATTCATTCATTCATTCATTCTGTCTATCACAAATTCACAAATTATGAGATCCATGAAGAGAAGATATCACTCCAAATCAAAACCCACTGCTCCACCGGCCGATCTTCTCGTCTGTTTTCCTGCACGTGCTCATCTCACTCTTCTCCCAAGTCCCGCCAGACCTCCCGCCGAACCCCATCGTCGTCACTACCGGAAAGCTCAACCAAGTCCTCTTCCATGGGCTAAGGAGATGTCGTCGGAGCCCACTTCCCCCAAGGTCACCTGCGCCGGCCAGATCAAAATCAGACCTCATTCTCATCGATCAACCAAGAATTGGCAATCGGTTATGGAAGAAATCGAGCGAATTCACAACAAAAAGAAAAATCCAATTCGAAATCAAAATCCTTTCGGTTTCAAGAGAGAAATTGTTAATTTCCTCTCGTGTTTACGGGGATTCCGTTTTGATTTCCGTTGTTTCAGAGGCTTCCCTCAATCCGACATTACAACAGATGATGAGGATGATGAAGAAGAAGTATTCGAACAACACGAACCCGAATCCGAATCCCAATCAGAATCAGAAGAAGAACCCACAAGAGGAAGAACGATGTTCTCAGAATGGTTCATGGTTTTACAGGAAGGCGAAGAAGAAACCAAACCCATAAACAACGACGCCGTTTCTTCACTCGAATTCCAACCTTCCTCTGTTTCTGTTCCTCCCCCAAATGCTCTCTTACTTATGCGTTGCAGGTCAGCCCCAAATTCTCTTTTACGAAAACCAAAACAACAAGAAGAAAAAGAATTAGAAGAAGAAGAAGAAGAAGAGAAATCGAAAATCAGCTTGAAGTTGTTAATGGAGGAAGAGAAAGAAATGGTGACTGCGAAGAAGAAAAGCTTGATGGTAATGGATTATGATGCTGATTTTTACAAACTTTCATCAGACATTGCTAAAGAAACATGGGTTGTGAGTGGATCAAGTACCAGTAGTAGTAGTAGTAGATGTAATGACGATCCATTGTTGAGAAGTCGAAGCTGGAAGAGATGA

Protein sequence

MIKEEKVKHKLSFNISFFSFLFFLNLLNPSFIHSFILSITNSQIMRSMKRRYHSKSKPTAPPADLLVCFPARAHLTLLPSPARPPAEPHRRHYRKAQPSPLPWAKEMSSEPTSPKVTCAGQIKIRPHSHRSTKNWQSVMEEIERIHNKKKNPIRNQNPFGFKREIVNFLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEEVFEQHEPESESQSESEEEPTRGRTMFSEWFMVLQEGEEETKPINNDAVSSLEFQPSSVSVPPPNALLLMRCRSAPNSLLRKPKQQEEKELEEEEEEEKSKISLKLLMEEEKEMVTAKKKSLMVMDYDADFYKLSSDIAKETWVVSGSSTSSSSSRCNDDPLLRSRSWKR*
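The protein sequence is the conceptual translation of the CDS. A minimal sketch of that translation follows, assuming the standard nuclear genetic code (NCBI translation table 1); the helper name `translate` is illustrative. Only the first and last few codons of the CDS above are used as input so the example stays short.

```python
# Build the standard genetic code (NCBI table 1) in TCAG order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"   # first base T
         "LLLLPPPPHHQQRRRR"   # first base C
         "IIIMTTTTNNKKSSRR"   # first base A
         "VVVVAAAADDEEGGGG")  # first base G
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES), AMINO))

def translate(cds: str) -> str:
    """Translate a CDS in frame 0; stop codons become '*', unknown codons 'X'."""
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3          # ignore an incomplete trailing codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))

print(translate("ATGATAAAAGAAGAAAAAGTGAAGCATAAG"))  # MIKEEKVKHK (start of the protein)
print(translate("TGGAAGAGATGA"))                    # WKR* (end of the protein)
```

Running `translate` on the full CDS string should reproduce the protein sequence shown above, including the trailing `*` for the stop codon.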
Homology
BLAST of CSPI02G16650 vs. ExPASy TrEMBL
Match: A0A0A0LMI6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G345900 PE=4 SV=1)

HSP 1 Score: 614.4 bits (1583), Expect = 3.1e-172
Identity = 323/330 (97.88%), Positives = 323/330 (97.88%), Query Frame = 0

Query: 45  MRSMKRRYHSKSKPTAPPADLLVCFPARAHLTLLPSPARPPAEPHRRHYRKAQPSPLPWA 104
           MRSMKRR HSKSKPTAPPADLLVCFPARAHLTLLPSPARPPAEPHRRHYRKAQPSPLPWA
Sbjct: 1   MRSMKRRNHSKSKPTAPPADLLVCFPARAHLTLLPSPARPPAEPHRRHYRKAQPSPLPWA 60

Query: 105 KEMSSEPTSPKVTCAGQIKIRPHSHRSTKNWQSVMEEIERIHNKKKNPIRNQNPFGFKRE 164
           KEMSSEPTSPKVTCAGQIKIRPHSHRSTKNWQSVMEEIERIHNKKKNPIRNQNPFGFKRE
Sbjct: 61  KEMSSEPTSPKVTCAGQIKIRPHSHRSTKNWQSVMEEIERIHNKKKNPIRNQNPFGFKRE 120

Query: 165 IVNFLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEEVFEQHEPESESQSESEEEPTRGRT 224
           IVNFLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEEVFEQHEPESESQSESEEEPTRGRT
Sbjct: 121 IVNFLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEEVFEQHEPESESQSESEEEPTRGRT 180

Query: 225 MFSEWFMVLQEGEEETKPINNDAVSSLEFQPSSVSVPPPNALLLMRCRSAPNSLLRKPKQ 284
           MFSEWFMVLQEGEEETKPINNDAVSSLEFQPSSVSVPPPNALLLMRCRSAPNSLLRKPKQ
Sbjct: 181 MFSEWFMVLQEGEEETKPINNDAVSSLEFQPSSVSVPPPNALLLMRCRSAPNSLLRKPKQ 240

Query: 285 QEEK-----ELEEEEEEEKSKISLKLLMEEEKEMVTAKKKSLMVMDYDADFYKLSSDIAK 344
           QEEK     E EEEEEEEKSKISLKLLMEEEKEMVTAKKKSLMVMDYDADFYKLSSDIAK
Sbjct: 241 QEEKEEEEEEEEEEEEEEKSKISLKLLMEEEKEMVTAKKKSLMVMDYDADFYKLSSDIAK 300

Query: 345 ETWVVSGSSTSSSSSRCNDDPLLRSRSWKR 370
           ETWVVSGSSTSSSSSRCNDDPLLRSRSWKR
Sbjct: 301 ETWVVSGSSTSSSSSRCNDDPLLRSRSWKR 330
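The Identity and Positives percentages in each HSP header are the match counts divided by the alignment length (gap columns included). The sketch below recomputes them from a header line; the function names are illustrative, and the regular expression is an assumption about this page's header layout rather than a general BLAST parser.

```python
import re

def parse_hsp_fractions(header: str):
    """Return {label: (count, alignment_length)} for every 'Label = n/m' field."""
    return {label: (int(n), int(m))
            for label, n, m in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", header)}

def percent(count: int, length: int) -> float:
    """Percentage rounded to two decimals, as printed in the HSP header."""
    return round(100.0 * count / length, 2)

header = "Identity = 323/330 (97.88%), Positives = 323/330 (97.88%), Query Frame = 0"
for label, (count, length) in parse_hsp_fractions(header).items():
    print(label, percent(count, length))
```

For this HSP, 323 identical positions over a 330-column alignment give 97.88%; the 7 remaining columns include the 5-column gap in the query.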

BLAST of CSPI02G16650 vs. ExPASy TrEMBL
Match: A0A5A7V8Z9 (Myotubularin-related protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G001440 PE=4 SV=1)

HSP 1 Score: 595.5 bits (1534), Expect = 1.5e-166
Identity = 311/325 (95.69%), Positives = 317/325 (97.54%), Query Frame = 0

Query: 45  MRSMKRRYHSKSKPTAPPADLLVCFPARAHLTLLPSPARPPAEPHRRHYRKAQPSPLPWA 104
           MRSMKRR HSKSKPTAPPADLLVCFPARAHLTLLPSPAR PAEPHRRHYRKAQPSPLPWA
Sbjct: 1   MRSMKRRNHSKSKPTAPPADLLVCFPARAHLTLLPSPARAPAEPHRRHYRKAQPSPLPWA 60

Query: 105 KEMSSEPTSPKVTCAGQIKIRPHSHRSTKNWQSVMEEIERIHNKKKNPIRNQNPFGFKRE 164
           KEMSSEPTSPKVTCAGQIKIRPH+HRSTKNWQSVMEEIERIHNKKKNPIRNQNP GFKRE
Sbjct: 61  KEMSSEPTSPKVTCAGQIKIRPHAHRSTKNWQSVMEEIERIHNKKKNPIRNQNPLGFKRE 120

Query: 165 IVNFLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEEVFEQHEPESESQSESEEEPTRGRT 224
           IVNFLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEE FEQHEPESES+SESEEEPTRGRT
Sbjct: 121 IVNFLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEEEFEQHEPESESESESEEEPTRGRT 180

Query: 225 MFSEWFMVLQEGEEETKPINNDAVSSLEFQPSSVSVPPPNALLLMRCRSAPNSLLRKPKQ 284
           MFSEWFMVLQE EEETKPINNDAVSSLEFQPSSVSVPPPNALLLMRCRSAPNSLLRKPKQ
Sbjct: 181 MFSEWFMVLQEEEEETKPINNDAVSSLEFQPSSVSVPPPNALLLMRCRSAPNSLLRKPKQ 240

Query: 285 QEEKELEEEEEEEKSKISLKLLMEEEKEMVTAKKKSLMVMDYDADFYKLSSDIAKETWVV 344
           ++E   +E+EEEEKSKISLKLLMEEEKEMVTAKKKSLMVMDYDADFYKLSSDIAKETWVV
Sbjct: 241 EQE---QEQEEEEKSKISLKLLMEEEKEMVTAKKKSLMVMDYDADFYKLSSDIAKETWVV 300

Query: 345 SGSSTSSSSSRCNDDPLLRSRSWKR 370
           SGSSTSSSSSRCNDDPLLRSRSWKR
Sbjct: 301 SGSSTSSSSSRCNDDPLLRSRSWKR 322

BLAST of CSPI02G16650 vs. ExPASy TrEMBL
Match: A0A1S3BDB9 (uncharacterized protein LOC103488403 OS=Cucumis melo OX=3656 GN=LOC103488403 PE=4 SV=1)

HSP 1 Score: 590.5 bits (1521), Expect = 4.9e-165
Identity = 308/322 (95.65%), Positives = 315/322 (97.83%), Query Frame = 0

Query: 48  MKRRYHSKSKPTAPPADLLVCFPARAHLTLLPSPARPPAEPHRRHYRKAQPSPLPWAKEM 107
           MKRR HSKSKPTAPPADLLVCFPARAHLTLLPSPAR PAEPHRRHYRKAQPSPLPWAKEM
Sbjct: 1   MKRRNHSKSKPTAPPADLLVCFPARAHLTLLPSPARAPAEPHRRHYRKAQPSPLPWAKEM 60

Query: 108 SSEPTSPKVTCAGQIKIRPHSHRSTKNWQSVMEEIERIHNKKKNPIRNQNPFGFKREIVN 167
           SSEPTSPKVTCAGQIKIRPH+HRSTKNWQSVMEEIERIHNKKKNPIRNQNP GFKREIVN
Sbjct: 61  SSEPTSPKVTCAGQIKIRPHAHRSTKNWQSVMEEIERIHNKKKNPIRNQNPLGFKREIVN 120

Query: 168 FLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEEVFEQHEPESESQSESEEEPTRGRTMFS 227
           FLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEE FEQHEPESES+SESEEEPTRGRTMFS
Sbjct: 121 FLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEEEFEQHEPESESESESEEEPTRGRTMFS 180

Query: 228 EWFMVLQEGEEETKPINNDAVSSLEFQPSSVSVPPPNALLLMRCRSAPNSLLRKPKQQEE 287
           EWFMVLQE EEETKPINNDAVSSLEFQPSSVSVPPPNALLLMRCRSAPNSLLRKPKQ++E
Sbjct: 181 EWFMVLQEEEEETKPINNDAVSSLEFQPSSVSVPPPNALLLMRCRSAPNSLLRKPKQEQE 240

Query: 288 KELEEEEEEEKSKISLKLLMEEEKEMVTAKKKSLMVMDYDADFYKLSSDIAKETWVVSGS 347
           +  E+E+EEEKSKISLKLLMEEEKEMVTAKKKSLMVMDYDADFYKLSSDIAKETWVVSGS
Sbjct: 241 Q--EQEQEEEKSKISLKLLMEEEKEMVTAKKKSLMVMDYDADFYKLSSDIAKETWVVSGS 300

Query: 348 STSSSSSRCNDDPLLRSRSWKR 370
           STSSSSSRCNDDPLLRSRSWKR
Sbjct: 301 STSSSSSRCNDDPLLRSRSWKR 320

BLAST of CSPI02G16650 vs. ExPASy TrEMBL
Match: A0A6J1HD24 (uncharacterized protein LOC111462985 OS=Cucurbita moschata OX=3662 GN=LOC111462985 PE=4 SV=1)

HSP 1 Score: 391.0 bits (1003), Expect = 5.7e-105
Identity = 233/331 (70.39%), Positives = 262/331 (79.15%), Query Frame = 0

Query: 60  APPADLLVCFPARAHLTLLP----SPARPPAEPHRRHYRKAQP------SPLPWAKEMSS 119
           APPADLLVCFPARA LTLLP    SPAR  AEPHRRH +KA P      SPL WAKEM+S
Sbjct: 2   APPADLLVCFPARARLTLLPKPTCSPARASAEPHRRHQKKAPPPSQSQASPLLWAKEMAS 61

Query: 120 EPTSPKVTCAGQIKIRPHSHRSTKNWQSVMEEIERIHNKKK-----NPIRNQNPFGFKRE 179
           EPTSPKVTCAGQIKI+PH+ RSTKNWQSVMEEIERIH KKK     N IR+QNPFGFKRE
Sbjct: 62  EPTSPKVTCAGQIKIKPHARRSTKNWQSVMEEIERIHKKKKIPNWGNQIRDQNPFGFKRE 121

Query: 180 IVNFLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEEVFEQHEPESESQSESEEEPTRGRT 239
           IVNFLSCLRGFRFDFRCFRGFP+SD  T +E+DEEE    +E E+E + E  E  ++ RT
Sbjct: 122 IVNFLSCLRGFRFDFRCFRGFPESDDITTEEEDEEE----YESEAEYEEEPTEATSKRRT 181

Query: 240 MFSEWFMVLQ----EGEEETKPINNDAVSSLEFQPSSVSVPPPNALLLMRCRSAPNS--L 299
           MFS+WFMVLQ    + EEETKP N+      E QP   SVPPPNALLLMRCRSAP++  +
Sbjct: 182 MFSKWFMVLQDEEEDEEEETKPRNDTG----ELQP--CSVPPPNALLLMRCRSAPSTGWI 241

Query: 300 LRKPKQQEEKELEEEEEEEKSKISLKLLMEEEKEMVTAKKKSLMVMDYDADFYKLSSDIA 359
            RKPKQ+++   E+E+EE++SKISLKLLMEEEK +  AKK+SL+VMDYDADFYKLSSDIA
Sbjct: 242 ERKPKQEQQ---EQEQEEKQSKISLKLLMEEEK-VAVAKKESLVVMDYDADFYKLSSDIA 301

Query: 360 KETWVVSGSSTSSSSSRCNDDPLLRSRSWKR 370
           KETWVVSG    SSSSRCNDDP LRSRSWKR
Sbjct: 302 KETWVVSG----SSSSRCNDDPFLRSRSWKR 314

BLAST of CSPI02G16650 vs. ExPASy TrEMBL
Match: A0A6J1KE65 (uncharacterized protein LOC111492421 OS=Cucurbita maxima OX=3661 GN=LOC111492421 PE=4 SV=1)

HSP 1 Score: 378.3 bits (970), Expect = 3.8e-101
Identity = 228/350 (65.14%), Positives = 253/350 (72.29%), Query Frame = 0

Query: 45  MRSMKRRYHSKSKPTAPPADLLVCFPARAHLTLLP----SPARPPAEPHRRHYRKAQP-- 104
           M+SMK R + K K  APPADLLVCFPARA LTLLP    SPAR  AEPHRRH +KA P  
Sbjct: 1   MKSMKTRNNPKPKSIAPPADLLVCFPARARLTLLPKPTCSPARASAEPHRRHQKKAPPLS 60

Query: 105 ----SPLPWAKEMSSEPTSPKVTCAGQIKIRPHSHRSTKNWQSVMEEIERIHNKKK---- 164
               SPL WAKE++SEPTSPKVTCAGQIKI+PH+ RSTKNWQSVMEEIERIH KKK    
Sbjct: 61  QSQASPLLWAKELASEPTSPKVTCAGQIKIKPHARRSTKNWQSVMEEIERIHKKKKIPNW 120

Query: 165 -NPIRNQNPFGFKREIVNFLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEEVFEQHEPES 224
            N I++QNPFGFKREIVNFLSCLRGFRFDFRCF+GFP+SD  T +E++EEE  E++E ++
Sbjct: 121 GNQIQDQNPFGFKREIVNFLSCLRGFRFDFRCFKGFPESDDITTEEEEEEEEEEEYESDA 180

Query: 225 ESQSESEEEPT----RGRTMFSEWFMVLQ------EGEEETKPINNDAVSSLEFQPSSVS 284
           ES++E EEEPT    + RTMFS+WFMVLQ      E EEETKP N+      E QP   S
Sbjct: 181 ESEAEYEEEPTEATSKRRTMFSKWFMVLQNEEEEEEEEEETKPRNDTG----ELQP--CS 240

Query: 285 VPPPNALLLMRCRSAPNSLLRKPKQQEEKELEEEEEEEKSKISLKLLMEEEKEMVTAKKK 344
           VPPPNALLLMRC                              SLKLLMEEEKE V AKK+
Sbjct: 241 VPPPNALLLMRC------------------------------SLKLLMEEEKEAV-AKKE 300

Query: 345 SLMVMDYDADFYKLSSDIAKETWVVSGSSTSSSSSRCNDDPLLRSRSWKR 370
           SL+VMDYDADFYKLSSDIAKETWVVSGSST    SRCNDDP LRSRSWKR
Sbjct: 301 SLVVMDYDADFYKLSSDIAKETWVVSGSST----SRCNDDPFLRSRSWKR 309

BLAST of CSPI02G16650 vs. NCBI nr
Match: KAE8652025.1 (hypothetical protein Csa_018606 [Cucumis sativus])

HSP 1 Score: 695.7 bits (1794), Expect = 2.2e-196
Identity = 367/372 (98.66%), Positives = 367/372 (98.66%), Query Frame = 0

Query: 1   MIKEEKVKHKLSFNISFFSFLFFLNLLNPSFIHSFILSITNSQIMRSMKRRYHSKSKPTA 60
           MIKEEKVKHKLSFNISFFSFLFFLNLLNP FIHSFILSITNSQIMRSMKRR HSKSKPTA
Sbjct: 1   MIKEEKVKHKLSFNISFFSFLFFLNLLNPKFIHSFILSITNSQIMRSMKRRNHSKSKPTA 60

Query: 61  PPADLLVCFPARAHLTLLPSPARPPAEPHRRHYRKAQPSPLPWAKEMSSEPTSPKVTCAG 120
           PPADLLVCFPARAHLTLLPSPARPPAEPHRRHYRKAQPSPLPWAKEMSSEPTSPKVTCAG
Sbjct: 61  PPADLLVCFPARAHLTLLPSPARPPAEPHRRHYRKAQPSPLPWAKEMSSEPTSPKVTCAG 120

Query: 121 QIKIRPHSHRSTKNWQSVMEEIERIHNKKKNPIRNQNPFGFKREIVNFLSCLRGFRFDFR 180
           QIKIRPHSHRSTKNWQSVMEEIERIHNKKKNPIRNQNPFGFKREIVNFLSCLRGFRFDFR
Sbjct: 121 QIKIRPHSHRSTKNWQSVMEEIERIHNKKKNPIRNQNPFGFKREIVNFLSCLRGFRFDFR 180

Query: 181 CFRGFPQSDITTDDEDDEEEVFEQHEPESESQSESEEEPTRGRTMFSEWFMVLQEGEEET 240
           CFRGFPQSDITTDDEDDEEEVFEQHEPESESQSESEEEPTRGRTMFSEWFMVLQEGEEET
Sbjct: 181 CFRGFPQSDITTDDEDDEEEVFEQHEPESESQSESEEEPTRGRTMFSEWFMVLQEGEEET 240

Query: 241 KPINNDAVSSLEFQPSSVSVPPPNALLLMRCRSAPNSLLRKPKQQEEKEL---EEEEEEE 300
           KPINNDAVSSLEFQPSSVSVPPPNALLLMRCRSAPNSLLRKPKQQEEKEL   EEEEEEE
Sbjct: 241 KPINNDAVSSLEFQPSSVSVPPPNALLLMRCRSAPNSLLRKPKQQEEKELEEEEEEEEEE 300

Query: 301 KSKISLKLLMEEEKEMVTAKKKSLMVMDYDADFYKLSSDIAKETWVVSGSSTSSSSSRCN 360
           KSKISLKLLMEEEKEMVTAKKKSLMVMDYDADFYKLSSDIAKETWVVSGSSTSSSSSRCN
Sbjct: 301 KSKISLKLLMEEEKEMVTAKKKSLMVMDYDADFYKLSSDIAKETWVVSGSSTSSSSSRCN 360

Query: 361 DDPLLRSRSWKR 370
           DDPLLRSRSWKR
Sbjct: 361 DDPLLRSRSWKR 372

BLAST of CSPI02G16650 vs. NCBI nr
Match: KAA0064832.1 (myotubularin-related protein [Cucumis melo var. makuwa])

HSP 1 Score: 595.5 bits (1534), Expect = 3.1e-166
Identity = 311/325 (95.69%), Positives = 317/325 (97.54%), Query Frame = 0

Query: 45  MRSMKRRYHSKSKPTAPPADLLVCFPARAHLTLLPSPARPPAEPHRRHYRKAQPSPLPWA 104
           MRSMKRR HSKSKPTAPPADLLVCFPARAHLTLLPSPAR PAEPHRRHYRKAQPSPLPWA
Sbjct: 1   MRSMKRRNHSKSKPTAPPADLLVCFPARAHLTLLPSPARAPAEPHRRHYRKAQPSPLPWA 60

Query: 105 KEMSSEPTSPKVTCAGQIKIRPHSHRSTKNWQSVMEEIERIHNKKKNPIRNQNPFGFKRE 164
           KEMSSEPTSPKVTCAGQIKIRPH+HRSTKNWQSVMEEIERIHNKKKNPIRNQNP GFKRE
Sbjct: 61  KEMSSEPTSPKVTCAGQIKIRPHAHRSTKNWQSVMEEIERIHNKKKNPIRNQNPLGFKRE 120

Query: 165 IVNFLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEEVFEQHEPESESQSESEEEPTRGRT 224
           IVNFLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEE FEQHEPESES+SESEEEPTRGRT
Sbjct: 121 IVNFLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEEEFEQHEPESESESESEEEPTRGRT 180

Query: 225 MFSEWFMVLQEGEEETKPINNDAVSSLEFQPSSVSVPPPNALLLMRCRSAPNSLLRKPKQ 284
           MFSEWFMVLQE EEETKPINNDAVSSLEFQPSSVSVPPPNALLLMRCRSAPNSLLRKPKQ
Sbjct: 181 MFSEWFMVLQEEEEETKPINNDAVSSLEFQPSSVSVPPPNALLLMRCRSAPNSLLRKPKQ 240

Query: 285 QEEKELEEEEEEEKSKISLKLLMEEEKEMVTAKKKSLMVMDYDADFYKLSSDIAKETWVV 344
           ++E   +E+EEEEKSKISLKLLMEEEKEMVTAKKKSLMVMDYDADFYKLSSDIAKETWVV
Sbjct: 241 EQE---QEQEEEEKSKISLKLLMEEEKEMVTAKKKSLMVMDYDADFYKLSSDIAKETWVV 300

Query: 345 SGSSTSSSSSRCNDDPLLRSRSWKR 370
           SGSSTSSSSSRCNDDPLLRSRSWKR
Sbjct: 301 SGSSTSSSSSRCNDDPLLRSRSWKR 322

BLAST of CSPI02G16650 vs. NCBI nr
Match: XP_008445342.2 (PREDICTED: uncharacterized protein LOC103488403 [Cucumis melo])

HSP 1 Score: 590.5 bits (1521), Expect = 1.0e-164
Identity = 308/322 (95.65%), Positives = 315/322 (97.83%), Query Frame = 0

Query: 48  MKRRYHSKSKPTAPPADLLVCFPARAHLTLLPSPARPPAEPHRRHYRKAQPSPLPWAKEM 107
           MKRR HSKSKPTAPPADLLVCFPARAHLTLLPSPAR PAEPHRRHYRKAQPSPLPWAKEM
Sbjct: 1   MKRRNHSKSKPTAPPADLLVCFPARAHLTLLPSPARAPAEPHRRHYRKAQPSPLPWAKEM 60

Query: 108 SSEPTSPKVTCAGQIKIRPHSHRSTKNWQSVMEEIERIHNKKKNPIRNQNPFGFKREIVN 167
           SSEPTSPKVTCAGQIKIRPH+HRSTKNWQSVMEEIERIHNKKKNPIRNQNP GFKREIVN
Sbjct: 61  SSEPTSPKVTCAGQIKIRPHAHRSTKNWQSVMEEIERIHNKKKNPIRNQNPLGFKREIVN 120

Query: 168 FLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEEVFEQHEPESESQSESEEEPTRGRTMFS 227
           FLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEE FEQHEPESES+SESEEEPTRGRTMFS
Sbjct: 121 FLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEEEFEQHEPESESESESEEEPTRGRTMFS 180

Query: 228 EWFMVLQEGEEETKPINNDAVSSLEFQPSSVSVPPPNALLLMRCRSAPNSLLRKPKQQEE 287
           EWFMVLQE EEETKPINNDAVSSLEFQPSSVSVPPPNALLLMRCRSAPNSLLRKPKQ++E
Sbjct: 181 EWFMVLQEEEEETKPINNDAVSSLEFQPSSVSVPPPNALLLMRCRSAPNSLLRKPKQEQE 240

Query: 288 KELEEEEEEEKSKISLKLLMEEEKEMVTAKKKSLMVMDYDADFYKLSSDIAKETWVVSGS 347
           +  E+E+EEEKSKISLKLLMEEEKEMVTAKKKSLMVMDYDADFYKLSSDIAKETWVVSGS
Sbjct: 241 Q--EQEQEEEKSKISLKLLMEEEKEMVTAKKKSLMVMDYDADFYKLSSDIAKETWVVSGS 300

Query: 348 STSSSSSRCNDDPLLRSRSWKR 370
           STSSSSSRCNDDPLLRSRSWKR
Sbjct: 301 STSSSSSRCNDDPLLRSRSWKR 320

BLAST of CSPI02G16650 vs. NCBI nr
Match: XP_004143011.3 (uncharacterized protein LOC101218308 [Cucumis sativus])

HSP 1 Score: 446.8 bits (1148), Expect = 1.8e-121
Identity = 223/224 (99.55%), Positives = 223/224 (99.55%), Query Frame = 0

Query: 48  MKRRYHSKSKPTAPPADLLVCFPARAHLTLLPSPARPPAEPHRRHYRKAQPSPLPWAKEM 107
           MKRR HSKSKPTAPPADLLVCFPARAHLTLLPSPARPPAEPHRRHYRKAQPSPLPWAKEM
Sbjct: 1   MKRRNHSKSKPTAPPADLLVCFPARAHLTLLPSPARPPAEPHRRHYRKAQPSPLPWAKEM 60

Query: 108 SSEPTSPKVTCAGQIKIRPHSHRSTKNWQSVMEEIERIHNKKKNPIRNQNPFGFKREIVN 167
           SSEPTSPKVTCAGQIKIRPHSHRSTKNWQSVMEEIERIHNKKKNPIRNQNPFGFKREIVN
Sbjct: 61  SSEPTSPKVTCAGQIKIRPHSHRSTKNWQSVMEEIERIHNKKKNPIRNQNPFGFKREIVN 120

Query: 168 FLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEEVFEQHEPESESQSESEEEPTRGRTMFS 227
           FLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEEVFEQHEPESESQSESEEEPTRGRTMFS
Sbjct: 121 FLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEEVFEQHEPESESQSESEEEPTRGRTMFS 180

Query: 228 EWFMVLQEGEEETKPINNDAVSSLEFQPSSVSVPPPNALLLMRC 272
           EWFMVLQEGEEETKPINNDAVSSLEFQPSSVSVPPPNALLLMRC
Sbjct: 181 EWFMVLQEGEEETKPINNDAVSSLEFQPSSVSVPPPNALLLMRC 224

BLAST of CSPI02G16650 vs. NCBI nr
Match: XP_023546364.1 (uncharacterized protein LOC111805493 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 402.1 bits (1032), Expect = 5.1e-108
Identity = 239/342 (69.88%), Positives = 269/342 (78.65%), Query Frame = 0

Query: 48  MKRRYHSKSKPTAPPADLLVCFPARAHLTLLP----SPARPPAEPHRRHYRKAQP----- 107
           MK R ++K K  APPADLLVCFPARA LTLLP    SPAR  AEPHRRH +KA P     
Sbjct: 1   MKSRNNTKPKSIAPPADLLVCFPARARLTLLPKPTCSPARASAEPHRRHQKKAPPPSQSQ 60

Query: 108 -SPLPWAKEMSSEPTSPKVTCAGQIKIRPHSHRSTKNWQSVMEEIERIHNKKK-----NP 167
            SPL WAKEM+SEPTSPKVTCAGQIKI+PH+ RSTKNWQSVMEEIERIH KKK     N 
Sbjct: 61  ASPLLWAKEMASEPTSPKVTCAGQIKIKPHARRSTKNWQSVMEEIERIHKKKKIPNWGNQ 120

Query: 168 IRNQNPFGFKREIVNFLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEEVFEQHEPESESQ 227
           I++QNPFGFKREIVNFLSCLRGFRFDFRCFRGFP+SD  T +E+DEEE   + E E+E +
Sbjct: 121 IQDQNPFGFKREIVNFLSCLRGFRFDFRCFRGFPESDDITTEEEDEEEYESEPESEAEYE 180

Query: 228 SESEEEPTRGRTMFSEWFMVLQ---EGEEETKPINNDAVSSLEFQPSSVSVPPPNALLLM 287
               E  ++ RTMFS+WFMVLQ   E EEETKP N+      E QP   SVPPPNALLLM
Sbjct: 181 EAHTEATSKRRTMFSKWFMVLQDEEEEEEETKPRNDTG----ELQP--CSVPPPNALLLM 240

Query: 288 RCRSAPNS--LLRKPKQQEEKELEEEEEEEKSKISLKLLMEEEKEMVTAKKKSLMVMDYD 347
           RCRSAP++  + RKPKQ+++   E+E+EE++SKISLKLLMEEEKE V AKK+SL+VMDYD
Sbjct: 241 RCRSAPSTGWIERKPKQEQQ---EQEQEEKQSKISLKLLMEEEKEAV-AKKESLVVMDYD 300

Query: 348 ADFYKLSSDIAKETWVVSGSSTSSSSSRCNDDPLLRSRSWKR 370
           ADFYKLSSDIAKETWVVSGSST    SRCNDDP LRSRSWKR
Sbjct: 301 ADFYKLSSDIAKETWVVSGSST----SRCNDDPFLRSRSWKR 328

BLAST of CSPI02G16650 vs. TAIR 10
Match: AT1G78110.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 7 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G22230.1); Has 5452 Blast hits to 3541 proteins in 289 species: Archae - 4; Bacteria - 165; Metazoa - 1756; Fungi - 532; Plants - 205; Viruses - 141; Other Eukaryotes - 2649 (source: NCBI BLink). )

HSP 1 Score: 221.9 bits (564), Expect = 8.8e-58
Identity = 151/343 (44.02%), Positives = 198/343 (57.73%), Query Frame = 0

Query: 63  ADLLVCFPARAHLTLLPSPARPPAEP----------HRRHYRK-------AQPSPLPWAK 122
           ADLLVCFP+R HL L P P   P+ P          HRR   K          SP+ WAK
Sbjct: 18  ADLLVCFPSRTHLALTPKPICSPSRPSDSSTNRRPHHRRQLSKLSGGGGGGHGSPVLWAK 77

Query: 123 EMSS---------EPTSPKVTCAGQIKIRPHSHRST-KNWQSVMEEIERIHNKKKNPIRN 182
           + SS         EPTSPKVTCAGQIK+RP       KNWQSVMEEIERIH+ +      
Sbjct: 78  QASSKNMGGDEIAEPTSPKVTCAGQIKVRPSKCGGRGKNWQSVMEEIERIHDNRS----Q 137

Query: 183 QNPFGFKREIVNFLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEEVFEQHEPESESQSES 242
              FG K++++ FL+CLR  +FDFRCF  F  +D+T+DD+++E++     + E E   E 
Sbjct: 138 SKFFGLKKDVMGFLTCLRNIKFDFRCFGDFRHADVTSDDDEEEDD----DDDEEEEVVEG 197

Query: 243 EEEPTRGRTMFSEWFMVLQEGEEETKPINN----DAVSSLEFQPSSVSVPPPNALLLMRC 302
           EEE    +T+FS+WFMVLQE +       N    D    LE   +  +VPPPNALLLMRC
Sbjct: 198 EEE-ENSKTVFSKWFMVLQEEQNNKDDDKNNNKCDEKRDLEDTETEPAVPPPNALLLMRC 257

Query: 303 RSAP------NSLLRKPKQQEEKELEEEEEEEKSKISLKLLMEEEKEMVTAKKKSLMVMD 362
           RSAP        +  K +Q++ +E +EE+E E  + S+K   ++ + ++  +K  L++M 
Sbjct: 258 RSAPAKSWLEERMKVKTEQEKREEQKEEKETEDQETSMKTKKKDLRSLMEEEKMELVLMR 317

Query: 363 YDADFYKLSSDIAKETWVVSGSSTSSSSSRCNDDPLLRSRSWK 369
           YD +FY+LSSDIAKETWVV G            DPL RSRSWK
Sbjct: 318 YDTEFYRLSSDIAKETWVVGGI----------QDPLSRSRSWK 341

BLAST of CSPI02G16650 vs. TAIR 10
Match: AT1G22230.1 (unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G78110.1); Has 2358 Blast hits to 1759 proteins in 159 species: Archae - 2; Bacteria - 36; Metazoa - 1046; Fungi - 203; Plants - 157; Viruses - 72; Other Eukaryotes - 842 (source: NCBI BLink). )

HSP 1 Score: 205.3 bits (521), Expect = 8.5e-53
Identity = 154/342 (45.03%), Positives = 195/342 (57.02%), Query Frame = 0

Query: 57  KPTAPPADLLVCFPARAHLTL----LPSPA-----RPPAEPHRRHYRKAQPS------PL 116
           K +   ADL+VCFP+RAHL+L    + SP+     R  A  HRR   K   S        
Sbjct: 8   KSSGYSADLMVCFPSRAHLSLPSKSISSPSSSFNRRQNAPHHRRSISKLSSSGGGVRQNR 67

Query: 117 PWAKEMSSEPTSPKVTCAGQIKIRPHSH-RSTKNWQSVMEEIERIHNKKKNPIRNQNPFG 176
              +E+  EPTSPKVTCAGQIK+R        KNWQS+M EIE+IH  K         FG
Sbjct: 68  GGGREVVEEPTSPKVTCAGQIKVRSSKRDGGGKNWQSLMAEIEKIHRSKS----ESKFFG 127

Query: 177 FKREIVNFLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEEVFEQHEPESESQSESEEEPT 236
            KR+++ FL+CLR   FDFRCF  FP  DI +DDE+++EE  E+ E E E +S       
Sbjct: 128 IKRDVMGFLTCLRD--FDFRCFGAFPPVDIISDDEEEDEEEEEEDEEEDEDESSG----- 187

Query: 237 RGRTMFSEWFMVLQEGEEETKPINNDAVSSLE--FQPSSVSVPPPNALLLMRCRSAPNSL 296
              T+FS+W MVL E     K  N + V   E  F     +VPPPNALLLMRCRSAP   
Sbjct: 188 ---TVFSKWLMVLHE-----KQNNEECVDGKENVFSDVETAVPPPNALLLMRCRSAPVKN 247

Query: 297 LRKPKQQEEKELE-------EEEEEEKSKI----SLKLLMEEEKEMVTAKKKSLMVMDYD 356
             + K++E +E +       EEEEEEK ++     L+ LMEEEK+M      +L+VM+YD
Sbjct: 248 WSEEKKEETEEGDNRVKQSGEEEEEEKDRVGNKKDLRSLMEEEKKM------NLVVMNYD 307

Query: 357 ADFYKLSSDIAKETWVVSGSSTSSSSSRCNDDPLLRSRSWKR 370
            ++YKLS+DIAKETWVV G            DPL RSRSWK+
Sbjct: 308 TNYYKLSNDIAKETWVVGGI----------QDPLFRSRSWKK 314

The following BLAST results are available for this feature:

ExPASy TrEMBL
Match Name        E-value     Identity (%)   Description
A0A0A0LMI6        3.1e-172    97.88          Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G345900 PE=4 SV=1
A0A5A7V8Z9        1.5e-166    95.69          Myotubularin-related protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_sca...
A0A1S3BDB9        4.9e-165    95.65          uncharacterized protein LOC103488403 OS=Cucumis melo OX=3656 GN=LOC103488403 PE=...
A0A6J1HD24        5.7e-105    70.39          uncharacterized protein LOC111462985 OS=Cucurbita moschata OX=3662 GN=LOC1114629...
A0A6J1KE65        3.8e-101    65.14          uncharacterized protein LOC111492421 OS=Cucurbita maxima OX=3661 GN=LOC111492421...

NCBI nr
Match Name        E-value     Identity (%)   Description
KAE8652025.1      2.2e-196    98.66          hypothetical protein Csa_018606 [Cucumis sativus]
KAA0064832.1      3.1e-166    95.69          myotubularin-related protein [Cucumis melo var. makuwa]
XP_008445342.2    1.0e-164    95.65          PREDICTED: uncharacterized protein LOC103488403 [Cucumis melo]
XP_004143011.3    1.8e-121    99.55          uncharacterized protein LOC101218308 [Cucumis sativus]
XP_023546364.1    5.1e-108    69.88          uncharacterized protein LOC111805493 [Cucurbita pepo subsp. pepo]

TAIR 10
Match Name        E-value     Identity (%)   Description
AT1G78110.1       8.8e-58     44.02          unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G22230.1       8.5e-53     45.03          unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein matc...
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term   IPR Description    Source        Source Term     Source Description                   Alignment
None       No IPR available   COILS         Coil            Coil                                 coord: 276..303
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction                  coord: 343..360
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction                  coord: 277..299
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction                  coord: 191..205
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction                  coord: 188..221
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction                  coord: 206..221
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction                  coord: 79..115
None       No IPR available   MOBIDB_LITE   mobidb-lite     disorder_prediction                  coord: 343..369
None       No IPR available   PANTHER       PTHR33448       CHLOROPLAST PROTEIN HCF243-RELATED   coord: 55..369
None       No IPR available   PANTHER       PTHR33448:SF3   OS09G0370000 PROTEIN                 coord: 55..369
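
Several of the MobiDB-lite disorder predictions above overlap or nest inside one another. One way to summarize them is to merge the coordinate ranges into non-redundant regions. The sketch below does that, assuming the coordinates are 1-based inclusive residue positions; the function name `merge_intervals` is illustrative, and treating directly adjacent ranges as one region is a design choice, not something the annotation itself specifies.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive (start, end) ranges."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous region: extend it if needed.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(region) for region in merged]

# MobiDB-lite coordinates as listed in the InterPro table above.
mobidb = [(343, 360), (277, 299), (191, 205), (188, 221),
          (206, 221), (79, 115), (343, 369)]

regions = merge_intervals(mobidb)
print(regions)                                  # [(79, 115), (188, 221), (277, 299), (343, 369)]
print(sum(end - start + 1 for start, end in regions))  # 121
```

Under these assumptions the seven predictions collapse to four regions covering 121 residues.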

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CSPI02G16650.1    CSPI02G16650.1    mRNA