CSPI03G01790 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI03G01790
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: CAAX amino terminal protease
Location: Chr3: 1262140 .. 1265117 (-)
RNA-Seq Expression: CSPI03G01790
Synteny: CSPI03G01790
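
The Location field can be turned into a genomic span with a few lines of Python. The sketch below is illustrative only: the function name `parse_location` and the regular expression are assumptions made for this example, and the span calculation assumes the coordinates are 1-based and inclusive (as in GFF3), which this page does not state explicitly.

```python
import re

def parse_location(text):
    """Parse a string like 'Chr3: 1262140 .. 1265117 (-)' into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("Chr3: 1262140 .. 1265117 (-)")
span = end - start + 1  # assumes inclusive, 1-based coordinates
print(chrom, strand, span)  # Chr3 - 2978
```

The "(-)" strand means the sequences shown for this feature read in the reverse-complement direction relative to the chromosome reference.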
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GAGTCAACAAAATTCTGAGGTTCGTAACAAGATTACCATTTTGAGTTCAATGAAGGAGACGAAGTTTTCTCCGTTTCGGACTCCCGAAAGACCCGAAATCTCTGTCTGAAAAACCCGATTTCCGACCCGTTTAGAAGATAACCCAAAAGAAAAGAAAGAAAGAAAGAAAGAATGGAGCTTTCGATTCTCTCTGTATCTTCAAACACTTCGACCATGTCCTTTGGTGCTAGAATTGGGATCTGTTCTACTTCAAGCTCCAGGTTTTTACATTTTTCGATGAGGAAACGTGCCGGCGGGAGAGTTCCAGTGCCGGTTAGCGTTAGGGCGTCGGCGGAGCCGAGGAGTGAGAGATTGGATGAGGGGCAGACACGTAGCCGGTTCACTGCTCCGGCTATGGAGGTGACGACACTTGATACTAGTTTCAGAGAAACAGAGTTTCCTGTTTGGGAAAAGATTGGTGCTGTTGTCAGACTCAGCTATGGAGTTGGTGAGTCAAAATGTTTTCCTTTTGAGATTTAGCTTTGTGGGTGTTTTTTTTCCTTCACTTTCTTTCAATTTTTTTTTATGGGATTTGGAATTAGAATTGAAATCTGTTTTTTGAGTTTGTTTGTTTGTTTTTTTTTTGTTCTGTTCTTGTAGGAATTTATGGTGCAATGGCACTGGCCGGAAAGTTTATATGTTCAATATCTGGGACTGATTGGATGGGAGGATTTCATCCATCTTTGGATGCTATTTTGGAAGGGCTTGGCTATGCCGTTCCTCCAATTATGGCTCTTCTCTTCATTCTTGATGTGAGTTGATTTCCAAGTTAACAAATGCTTTTGCTCGACTGGATTTAGACAAGATGTAGGTCTTGAGTTCTTTGATGAAATGATTTTCGTTTTTGCAATAGTTGTTCGGTTGGTTCTAACTTTGAGGTGAAAACTATGAACAATGTAGGGTATGTTCATTCTTTTAGATTTTGAAAAGTAATTTCCAAAACTGGAAAGAAGCTGTGGATTAGATTTTGTGTTTTGATGCACCTGCCACCTAAGTTTAAGTCTTTAGTTTCCGAATGATTAATCACATGAGGCTTGGGTTTGTTTGTAGGATGAAGTTGTGAAGTTATCGCCCCATGCTCGAGCGATTAGAGATGTCGAAGACGAGGAGCTTCGAAGCTTCTTTTACGGAATGTCTCCATGGCAGGTAAAATGAACTCCTAGTTATGTTGATGAATATCTGATAAAAAAAAAGCTGTTAGTTTGGTGTTTGCCCAAGATGAGAGAGTTTTTGTTTATTTTGTGTCCCAACTTTTGCAGTTCATTCTTATCGTGGCTGCAAGCTCAGTCGGGGAGGAGCTCTTTTACCGGGCAGCCGTTCAGGTTTGTGCTTCCGATATTTAGACATTTTGTCCCCTTGACTTCAACTCCGACCACAGTTACTCTTGCTCAAAATTTTTGCTTTGATTCACCAGTTTCTATACGTACGATTGTTGAATGTTGATTGAATTTCAGTCTCAAGGAGTACTAAATAACATTTCTATTCCCTTAACTTTCTAGGGAGCATTGGCTGATATATTCTTAAGAAGTCCTGATATTGGAGCTGATGTTCAAGGAATGGCATCTTTGGTATGCAAAATTATTCCTTTTGTGGATTATGACGAATTCTATCTGTTCACTTGTCTCGCAGTCGTCGTTTTTCTCTTCATCTTATAACTTTGTACCGTGCTCTCTTGATTATAGACCGGAGTGCTGCCTCCATTCGTACCATTTGCACAAGGATTTGCAGCTTTTATCACAGCTGCGCTTACCGGTTCACTCTATTACGTTGCTGCATCACCAAAAGGTCGATATTCCATCAAATCATTTGCACATGTTCAATTTTCAGTTTGGTTGACTGTCTCCAGTTTAATGAATACGTTTTTTTTTTTTTTTCTCCAGATCCTACTTATGTAGTTGCTCCAGTTTTGCAATCTCGATCAGGTCGCAAAGATCTTAGAAAGCTTTTCGCAGGTTCGATTA
TAATCCTCATTTCTACTGTAAGTTGCACTTGTAAAGCAACGTTTGAATGTATCTGATTCTCCCTGGCTGAATTCTTGCTTCACTTTTCTCGAATGAGATAGATTAGTACTGAAGCTTGAAAAATGAGAGGGTGGCCAAAGTGAAAATAGAAAATCCATCACGTGCTGATTGATTACATTCTACTTTCACTTCATCGATAACGACGGTCTCGCTCACCCTCAACATCTTTTGTGTAAACATTTGCAGCATGGTACGAGAGACGACAAATGAAAAAGATCTACTCTCCCCTCCTTGAAGGACTCCTTGCTCTCTACCTCGGTTTCGAATGGATCCAGGTAAGACCTTTCCTAGCCAAGCAAGACCTTACCTTTCTAAGTAAACTACACAATGTACAAATTTCAAAAAACAGGAAACTCACATATTACTATGTCTTTTGGCATGATATTTTCAGACCGATAACATTCTTGCTCCAATCATCACACACGGTATATACTCCGCTGTGATACTAGGCCATGGACTTTGGAAGATCCATGACCACCGGAGAAGGCTACGGCAGAGAATTCAGCAGGTTAAGATGGAAGGTAAAAGCTCAGATAGTTTGTGAAAGGGAAGGAACAAAAAGTACTTTACATAGAGAGCAAGAAGAAGTTGTAAATTATATTTTTTTGGCAGAGCCAAAATTTTCCATCTTGACCATAAAAATGTCCATACAAGAATGGAAGTATATATTATATAGATCAAGAAATAAACCTATAGGTGTGCATATGAAGACAAATTGTAGTGATACTTATAGATAGTGAGGACATTTTTGGATATTTGGTTTTTATCCTAAGAAGGTGATGGAAAATATAAGAGAATTATAACTCAATGTAAATCATCTTGAAATGTAGTAATTTGAATTCATTAATGTATATAGAAAAGCCATAAAATGTCCAATAAAATAGTAGTTTTATTTATTACTATTTTTTGAGCATGGGA

mRNA sequence

GAGTCAACAAAATTCTGAGGTTCGTAACAAGATTACCATTTTGAGTTCAATGAAGGAGACGAAGTTTTCTCCGTTTCGGACTCCCGAAAGACCCGAAATCTCTGTCTGAAAAACCCGATTTCCGACCCGTTTAGAAGATAACCCAAAAGAAAAGAAAGAAAGAAAGAAAGAATGGAGCTTTCGATTCTCTCTGTATCTTCAAACACTTCGACCATGTCCTTTGGTGCTAGAATTGGGATCTGTTCTACTTCAAGCTCCAGGTTTTTACATTTTTCGATGAGGAAACGTGCCGGCGGGAGAGTTCCAGTGCCGGTTAGCGTTAGGGCGTCGGCGGAGCCGAGGAGTGAGAGATTGGATGAGGGGCAGACACGTAGCCGGTTCACTGCTCCGGCTATGGAGGTGACGACACTTGATACTAGTTTCAGAGAAACAGAGTTTCCTGTTTGGGAAAAGATTGGTGCTGTTGTCAGACTCAGCTATGGAGTTGGAATTTATGGTGCAATGGCACTGGCCGGAAAGTTTATATGTTCAATATCTGGGACTGATTGGATGGGAGGATTTCATCCATCTTTGGATGCTATTTTGGAAGGGCTTGGCTATGCCGTTCCTCCAATTATGGCTCTTCTCTTCATTCTTGATGATGAAGTTGTGAAGTTATCGCCCCATGCTCGAGCGATTAGAGATGTCGAAGACGAGGAGCTTCGAAGCTTCTTTTACGGAATGTCTCCATGGCAGTTCATTCTTATCGTGGCTGCAAGCTCAGTCGGGGAGGAGCTCTTTTACCGGGCAGCCGTTCAGGGAGCATTGGCTGATATATTCTTAAGAAGTCCTGATATTGGAGCTGATGTTCAAGGAATGGCATCTTTGACCGGAGTGCTGCCTCCATTCGTACCATTTGCACAAGGATTTGCAGCTTTTATCACAGCTGCGCTTACCGGTTCACTCTATTACGTTGCTGCATCACCAAAAGATCCTACTTATGTAGTTGCTCCAGTTTTGCAATCTCGATCAGGTCGCAAAGATCTTAGAAAGCTTTTCGCAGCATGGTACGAGAGACGACAAATGAAAAAGATCTACTCTCCCCTCCTTGAAGGACTCCTTGCTCTCTACCTCGGTTTCGAATGGATCCAGACCGATAACATTCTTGCTCCAATCATCACACACGGTATATACTCCGCTGTGATACTAGGCCATGGACTTTGGAAGATCCATGACCACCGGAGAAGGCTACGGCAGAGAATTCAGCAGGTTAAGATGGAAGGTAAAAGCTCAGATAGTTTGTGAAAGGGAAGGAACAAAAAGTACTTTACATAGAGAGCAAGAAGAAGTTGTAAATTATATTTTTTTGGCAGAGCCAAAATTTTCCATCTTGACCATAAAAATGTCCATACAAGAATGGAAGTATATATTATATAGATCAAGAAATAAACCTATAGGTGTGCATATGAAGACAAATTGTAGTGATACTTATAGATAGTGAGGACATTTTTGGATATTTGGTTTTTATCCTAAGAAGGTGATGGAAAATATAAGAGAATTATAACTCAATGTAAATCATCTTGAAATGTAGTAATTTGAATTCATTAATGTATATAGAAAAGCCATAAAATGTCCAATAAAATAGTAGTTTTATTTATTACTATTTTTTGAGCATGGGA

Coding sequence (CDS)

ATGGAGCTTTCGATTCTCTCTGTATCTTCAAACACTTCGACCATGTCCTTTGGTGCTAGAATTGGGATCTGTTCTACTTCAAGCTCCAGGTTTTTACATTTTTCGATGAGGAAACGTGCCGGCGGGAGAGTTCCAGTGCCGGTTAGCGTTAGGGCGTCGGCGGAGCCGAGGAGTGAGAGATTGGATGAGGGGCAGACACGTAGCCGGTTCACTGCTCCGGCTATGGAGGTGACGACACTTGATACTAGTTTCAGAGAAACAGAGTTTCCTGTTTGGGAAAAGATTGGTGCTGTTGTCAGACTCAGCTATGGAGTTGGAATTTATGGTGCAATGGCACTGGCCGGAAAGTTTATATGTTCAATATCTGGGACTGATTGGATGGGAGGATTTCATCCATCTTTGGATGCTATTTTGGAAGGGCTTGGCTATGCCGTTCCTCCAATTATGGCTCTTCTCTTCATTCTTGATGATGAAGTTGTGAAGTTATCGCCCCATGCTCGAGCGATTAGAGATGTCGAAGACGAGGAGCTTCGAAGCTTCTTTTACGGAATGTCTCCATGGCAGTTCATTCTTATCGTGGCTGCAAGCTCAGTCGGGGAGGAGCTCTTTTACCGGGCAGCCGTTCAGGGAGCATTGGCTGATATATTCTTAAGAAGTCCTGATATTGGAGCTGATGTTCAAGGAATGGCATCTTTGACCGGAGTGCTGCCTCCATTCGTACCATTTGCACAAGGATTTGCAGCTTTTATCACAGCTGCGCTTACCGGTTCACTCTATTACGTTGCTGCATCACCAAAAGATCCTACTTATGTAGTTGCTCCAGTTTTGCAATCTCGATCAGGTCGCAAAGATCTTAGAAAGCTTTTCGCAGCATGGTACGAGAGACGACAAATGAAAAAGATCTACTCTCCCCTCCTTGAAGGACTCCTTGCTCTCTACCTCGGTTTCGAATGGATCCAGACCGATAACATTCTTGCTCCAATCATCACACACGGTATATACTCCGCTGTGATACTAGGCCATGGACTTTGGAAGATCCATGACCACCGGAGAAGGCTACGGCAGAGAATTCAGCAGGTTAAGATGGAAGGTAAAAGCTCAGATAGTTTGTGA

Protein sequence

MELSILSVSSNTSTMSFGARIGICSTSSSRFLHFSMRKRAGGRVPVPVSVRASAEPRSERLDEGQTRSRFTAPAMEVTTLDTSFRETEFPVWEKIGAVVRLSYGVGIYGAMALAGKFICSISGTDWMGGFHPSLDAILEGLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSFFYGMSPWQFILIVAASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFVPFAQGFAAFITAALTGSLYYVAASPKDPTYVVAPVLQSRSGRKDLRKLFAAWYERRQMKKIYSPLLEGLLALYLGFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRLRQRIQQVKMEGKSSDSL*
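
The protein sequence above corresponds to a codon-by-codon translation of the CDS. As a minimal sketch, assuming the standard nuclear genetic code, the relationship can be checked with a small self-contained translator. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

print(translate("ATGGAGCTTTCGATTCTC"))  # first six codons of the CDS: MELSIL
print(translate("AGTTTGTGA"))           # last three codons of the CDS: SL*
```

Passing the full coding sequence listed above to `translate` should reproduce the protein, including the trailing `*` for the TGA stop codon.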
Homology
BLAST of CSPI03G01790 vs. ExPASy TrEMBL
Match: A0A0A0L406 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G011760 PE=4 SV=1)

HSP 1 Score: 722.6 bits (1864), Expect = 8.2e-205
Identity = 369/370 (99.73%), Positives = 370/370 (100.00%), Query Frame = 0

Query: 1   MELSILSVSSNTSTMSFGARIGICSTSSSRFLHFSMRKRAGGRVPVPVSVRASAEPRSER 60
           MELSILSVSSNTSTMSFGARIGICSTSSSRFLHF+MRKRAGGRVPVPVSVRASAEPRSER
Sbjct: 1   MELSILSVSSNTSTMSFGARIGICSTSSSRFLHFAMRKRAGGRVPVPVSVRASAEPRSER 60

Query: 61  LDEGQTRSRFTAPAMEVTTLDTSFRETEFPVWEKIGAVVRLSYGVGIYGAMALAGKFICS 120
           LDEGQTRSRFTAPAMEVTTLDTSFRETEFPVWEKIGAVVRLSYGVGIYGAMALAGKFICS
Sbjct: 61  LDEGQTRSRFTAPAMEVTTLDTSFRETEFPVWEKIGAVVRLSYGVGIYGAMALAGKFICS 120

Query: 121 ISGTDWMGGFHPSLDAILEGLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSF 180
           ISGTDWMGGFHPSLDAILEGLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSF
Sbjct: 121 ISGTDWMGGFHPSLDAILEGLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSF 180

Query: 181 FYGMSPWQFILIVAASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFV 240
           FYGMSPWQFILIVAASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFV
Sbjct: 181 FYGMSPWQFILIVAASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFV 240

Query: 241 PFAQGFAAFITAALTGSLYYVAASPKDPTYVVAPVLQSRSGRKDLRKLFAAWYERRQMKK 300
           PFAQGFAAFITAALTGSLYYVAASPKDPTYVVAPVLQSRSGRKDLRKLFAAWYERRQMKK
Sbjct: 241 PFAQGFAAFITAALTGSLYYVAASPKDPTYVVAPVLQSRSGRKDLRKLFAAWYERRQMKK 300

Query: 301 IYSPLLEGLLALYLGFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRLRQRIQQV 360
           IYSPLLEGLLALYLGFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRLRQRIQQV
Sbjct: 301 IYSPLLEGLLALYLGFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRLRQRIQQV 360

Query: 361 KMEGKSSDSL 371
           KMEGKSSDSL
Sbjct: 361 KMEGKSSDSL 370
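
The percentages in each HSP header follow directly from the reported ratios: the count of matching columns divided by the alignment length. A minimal sketch of recomputing them is shown below; `parse_ratio` and `percent` are hypothetical helper names, and the input string is copied from the header of this hit.

```python
import re

def parse_ratio(line, label):
    """Extract the 'matches/length' pair that follows a label such as 'Identity'."""
    m = re.search(rf"{re.escape(label)}\s*=\s*(\d+)/(\d+)", line)
    if m is None:
        raise ValueError(f"{label!r} not found in line")
    return int(m.group(1)), int(m.group(2))

def percent(matches, length):
    """Percentage of alignment columns that match, rounded to two decimals."""
    return round(100.0 * matches / length, 2)

header = "Identity = 369/370 (99.73%)"
matches, length = parse_ratio(header, "Identity")
print(percent(matches, length))  # 99.73
```

The denominator is the alignment length, which counts gap columns as well; this is why the TAIR hits further down report denominators (384, 420) larger than the 370-residue query.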

BLAST of CSPI03G01790 vs. ExPASy TrEMBL
Match: A0A1S3BL73 (uncharacterized protein LOC103490793 OS=Cucumis melo OX=3656 GN=LOC103490793 PE=4 SV=1)

HSP 1 Score: 708.0 bits (1826), Expect = 2.1e-200
Identity = 363/370 (98.11%), Positives = 365/370 (98.65%), Query Frame = 0

Query: 1   MELSILSVSSNTSTMSFGARIGICSTSSSRFLHFSMRKRAGGRVPVPVSVRASAEPRSER 60
           MELSILSVSSNTSTMSFGARIGICSTSSSRF +F MRKRAGGRV VP SVRASAEPRSER
Sbjct: 1   MELSILSVSSNTSTMSFGARIGICSTSSSRFSYFPMRKRAGGRVLVPASVRASAEPRSER 60

Query: 61  LDEGQTRSRFTAPAMEVTTLDTSFRETEFPVWEKIGAVVRLSYGVGIYGAMALAGKFICS 120
           LDEGQTRSRFTAPAMEVTTLDTSFRETEFPVWEKIGAVVRLSYGVGIYGAMALAG+FICS
Sbjct: 61  LDEGQTRSRFTAPAMEVTTLDTSFRETEFPVWEKIGAVVRLSYGVGIYGAMALAGRFICS 120

Query: 121 ISGTDWMGGFHPSLDAILEGLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSF 180
           ISGTDWMGGFHPSLDAIL GLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSF
Sbjct: 121 ISGTDWMGGFHPSLDAILGGLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSF 180

Query: 181 FYGMSPWQFILIVAASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFV 240
           FYGMSPWQFILIVAASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFV
Sbjct: 181 FYGMSPWQFILIVAASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFV 240

Query: 241 PFAQGFAAFITAALTGSLYYVAASPKDPTYVVAPVLQSRSGRKDLRKLFAAWYERRQMKK 300
           PFAQGFAAFITAALTGSLYYVAASPKDPTYVVAPVLQSRSGRKDLRKLFAAWYERRQMKK
Sbjct: 241 PFAQGFAAFITAALTGSLYYVAASPKDPTYVVAPVLQSRSGRKDLRKLFAAWYERRQMKK 300

Query: 301 IYSPLLEGLLALYLGFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRLRQRIQQV 360
           IYSPLLEGLLALYLGFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRLRQRIQQV
Sbjct: 301 IYSPLLEGLLALYLGFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRLRQRIQQV 360

Query: 361 KMEGKSSDSL 371
           KMEGKSSDSL
Sbjct: 361 KMEGKSSDSL 370

BLAST of CSPI03G01790 vs. ExPASy TrEMBL
Match: A0A5A7UHN8 (CAAX amino terminal protease OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold139G001700 PE=4 SV=1)

HSP 1 Score: 686.4 bits (1770), Expect = 6.5e-194
Identity = 349/356 (98.03%), Positives = 351/356 (98.60%), Query Frame = 0

Query: 15  MSFGARIGICSTSSSRFLHFSMRKRAGGRVPVPVSVRASAEPRSERLDEGQTRSRFTAPA 74
           MSFGARIGICSTSSSRF +F MRKRAGGRV VP SVRASAEPRSERLDEGQTRSRFTAPA
Sbjct: 1   MSFGARIGICSTSSSRFSYFPMRKRAGGRVLVPASVRASAEPRSERLDEGQTRSRFTAPA 60

Query: 75  MEVTTLDTSFRETEFPVWEKIGAVVRLSYGVGIYGAMALAGKFICSISGTDWMGGFHPSL 134
           MEVTTLDTSFRETEFPVWEKIGAVVRLSYGVGIYGAMALAG+FICSISGTDWMGGFHPSL
Sbjct: 61  MEVTTLDTSFRETEFPVWEKIGAVVRLSYGVGIYGAMALAGRFICSISGTDWMGGFHPSL 120

Query: 135 DAILEGLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSFFYGMSPWQFILIVA 194
           DAIL GLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSFFYGMSPWQFILIVA
Sbjct: 121 DAILGGLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSFFYGMSPWQFILIVA 180

Query: 195 ASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFVPFAQGFAAFITAAL 254
           ASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFVPFAQGFAAFITAAL
Sbjct: 181 ASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFVPFAQGFAAFITAAL 240

Query: 255 TGSLYYVAASPKDPTYVVAPVLQSRSGRKDLRKLFAAWYERRQMKKIYSPLLEGLLALYL 314
           TGSLYYVAASPKDPTYVVAPVLQSRSGRKDLRKLFAAWYERRQMKKIYSPLLEGLLALYL
Sbjct: 241 TGSLYYVAASPKDPTYVVAPVLQSRSGRKDLRKLFAAWYERRQMKKIYSPLLEGLLALYL 300

Query: 315 GFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRLRQRIQQVKMEGKSSDSL 371
           GFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRLRQRIQQVKMEGKSSDSL
Sbjct: 301 GFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRLRQRIQQVKMEGKSSDSL 356

BLAST of CSPI03G01790 vs. ExPASy TrEMBL
Match: A0A6J1E5Y3 (uncharacterized protein LOC111431096 OS=Cucurbita moschata OX=3662 GN=LOC111431096 PE=4 SV=1)

HSP 1 Score: 664.5 bits (1713), Expect = 2.7e-187
Identity = 340/370 (91.89%), Positives = 349/370 (94.32%), Query Frame = 0

Query: 1   MELSILSVSSNTSTMSFGARIGICSTSSSRFLHFSMRKRAGGRVPVPVSVRASAEPRSER 60
           MEL I SVSSNTSTMSFGARIGICSTS+SR  HF +RKRA GRV +P  VRASAEPRSER
Sbjct: 1   MELPIFSVSSNTSTMSFGARIGICSTSTSRISHFPVRKRAVGRVRLPSGVRASAEPRSER 60

Query: 61  LDEGQTRSRFTAPAMEVTTLDTSFRETEFPVWEKIGAVVRLSYGVGIYGAMALAGKFICS 120
           L+EGQTR RFT PAME+TTLD SFRETEFPVWEKIGAVVRLSYGVGIYGAMALAG+FICS
Sbjct: 61  LEEGQTRGRFTGPAMEMTTLDASFRETEFPVWEKIGAVVRLSYGVGIYGAMALAGRFICS 120

Query: 121 ISGTDWMGGFHPSLDAILEGLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSF 180
           ISG DWMGGF PSLDAIL GLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSF
Sbjct: 121 ISGIDWMGGFQPSLDAILGGLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSF 180

Query: 181 FYGMSPWQFILIVAASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFV 240
           FYGMSPWQFILIVAASSVGEELFYRAAVQGALADIFLRS DIGADVQGMASLTGVLPPFV
Sbjct: 181 FYGMSPWQFILIVAASSVGEELFYRAAVQGALADIFLRSTDIGADVQGMASLTGVLPPFV 240

Query: 241 PFAQGFAAFITAALTGSLYYVAASPKDPTYVVAPVLQSRSGRKDLRKLFAAWYERRQMKK 300
           PFAQ FAA ITAALTGSLYYVAASPKDPTYVVAPVLQSRS R DL+KLFAAWYERRQMKK
Sbjct: 241 PFAQAFAAVITAALTGSLYYVAASPKDPTYVVAPVLQSRSSRDDLKKLFAAWYERRQMKK 300

Query: 301 IYSPLLEGLLALYLGFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRLRQRIQQV 360
           IYSPLLEGLLALYLGFEWIQTDNILAP+ITHGIYSAVILGHGLWKIHDHRRRLRQRIQQ+
Sbjct: 301 IYSPLLEGLLALYLGFEWIQTDNILAPMITHGIYSAVILGHGLWKIHDHRRRLRQRIQQL 360

Query: 361 KMEGKSSDSL 371
           KMEGK SDSL
Sbjct: 361 KMEGKGSDSL 370

BLAST of CSPI03G01790 vs. ExPASy TrEMBL
Match: A0A6J1HQ13 (uncharacterized protein LOC111465053 OS=Cucurbita maxima OX=3661 GN=LOC111465053 PE=4 SV=1)

HSP 1 Score: 661.4 bits (1705), Expect = 2.3e-186
Identity = 337/370 (91.08%), Positives = 348/370 (94.05%), Query Frame = 0

Query: 1   MELSILSVSSNTSTMSFGARIGICSTSSSRFLHFSMRKRAGGRVPVPVSVRASAEPRSER 60
           MELSI SVSSNTSTMSFGARIGICSTS+SR  HF +RKRA GRV +P  VRASAEPRSER
Sbjct: 1   MELSIFSVSSNTSTMSFGARIGICSTSTSRISHFPVRKRAVGRVRLPSGVRASAEPRSER 60

Query: 61  LDEGQTRSRFTAPAMEVTTLDTSFRETEFPVWEKIGAVVRLSYGVGIYGAMALAGKFICS 120
           L+EGQTR RFT PAME+TTLD +FRETEFPVWEKIGAVVRLSYGVGIYGAMALAG+FICS
Sbjct: 61  LEEGQTRGRFTGPAMEMTTLDANFRETEFPVWEKIGAVVRLSYGVGIYGAMALAGRFICS 120

Query: 121 ISGTDWMGGFHPSLDAILEGLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSF 180
           ISG DWMGGF PSLDAIL GLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSF
Sbjct: 121 ISGIDWMGGFQPSLDAILGGLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSF 180

Query: 181 FYGMSPWQFILIVAASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFV 240
           FYGMSPWQFILIVAASSVGEELFYRAA+QGALADIFLRS DIGADVQGMASLTGVLPPFV
Sbjct: 181 FYGMSPWQFILIVAASSVGEELFYRAAIQGALADIFLRSTDIGADVQGMASLTGVLPPFV 240

Query: 241 PFAQGFAAFITAALTGSLYYVAASPKDPTYVVAPVLQSRSGRKDLRKLFAAWYERRQMKK 300
           PFAQ FAA ITAALTGSLYYVAASPKDPTYVVAPVLQSRS R DL+KLFAAWYERRQMKK
Sbjct: 241 PFAQAFAAVITAALTGSLYYVAASPKDPTYVVAPVLQSRSSRNDLKKLFAAWYERRQMKK 300

Query: 301 IYSPLLEGLLALYLGFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRLRQRIQQV 360
           IYSPLLEGLLALYLG EWIQTDNILAP+ITHGIYSAVILGHGLWKIHDHRRRLRQRIQQ+
Sbjct: 301 IYSPLLEGLLALYLGIEWIQTDNILAPMITHGIYSAVILGHGLWKIHDHRRRLRQRIQQL 360

Query: 361 KMEGKSSDSL 371
           KMEGK SD L
Sbjct: 361 KMEGKGSDGL 370

BLAST of CSPI03G01790 vs. NCBI nr
Match: XP_031738781.1 (uncharacterized protein LOC101203999 [Cucumis sativus] >KGN55774.1 hypothetical protein Csa_009518 [Cucumis sativus])

HSP 1 Score: 722.6 bits (1864), Expect = 1.7e-204
Identity = 369/370 (99.73%), Positives = 370/370 (100.00%), Query Frame = 0

Query: 1   MELSILSVSSNTSTMSFGARIGICSTSSSRFLHFSMRKRAGGRVPVPVSVRASAEPRSER 60
           MELSILSVSSNTSTMSFGARIGICSTSSSRFLHF+MRKRAGGRVPVPVSVRASAEPRSER
Sbjct: 1   MELSILSVSSNTSTMSFGARIGICSTSSSRFLHFAMRKRAGGRVPVPVSVRASAEPRSER 60

Query: 61  LDEGQTRSRFTAPAMEVTTLDTSFRETEFPVWEKIGAVVRLSYGVGIYGAMALAGKFICS 120
           LDEGQTRSRFTAPAMEVTTLDTSFRETEFPVWEKIGAVVRLSYGVGIYGAMALAGKFICS
Sbjct: 61  LDEGQTRSRFTAPAMEVTTLDTSFRETEFPVWEKIGAVVRLSYGVGIYGAMALAGKFICS 120

Query: 121 ISGTDWMGGFHPSLDAILEGLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSF 180
           ISGTDWMGGFHPSLDAILEGLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSF
Sbjct: 121 ISGTDWMGGFHPSLDAILEGLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSF 180

Query: 181 FYGMSPWQFILIVAASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFV 240
           FYGMSPWQFILIVAASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFV
Sbjct: 181 FYGMSPWQFILIVAASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFV 240

Query: 241 PFAQGFAAFITAALTGSLYYVAASPKDPTYVVAPVLQSRSGRKDLRKLFAAWYERRQMKK 300
           PFAQGFAAFITAALTGSLYYVAASPKDPTYVVAPVLQSRSGRKDLRKLFAAWYERRQMKK
Sbjct: 241 PFAQGFAAFITAALTGSLYYVAASPKDPTYVVAPVLQSRSGRKDLRKLFAAWYERRQMKK 300

Query: 301 IYSPLLEGLLALYLGFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRLRQRIQQV 360
           IYSPLLEGLLALYLGFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRLRQRIQQV
Sbjct: 301 IYSPLLEGLLALYLGFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRLRQRIQQV 360

Query: 361 KMEGKSSDSL 371
           KMEGKSSDSL
Sbjct: 361 KMEGKSSDSL 370

BLAST of CSPI03G01790 vs. NCBI nr
Match: XP_008448701.1 (PREDICTED: uncharacterized protein LOC103490793 [Cucumis melo])

HSP 1 Score: 708.0 bits (1826), Expect = 4.3e-200
Identity = 363/370 (98.11%), Positives = 365/370 (98.65%), Query Frame = 0

Query: 1   MELSILSVSSNTSTMSFGARIGICSTSSSRFLHFSMRKRAGGRVPVPVSVRASAEPRSER 60
           MELSILSVSSNTSTMSFGARIGICSTSSSRF +F MRKRAGGRV VP SVRASAEPRSER
Sbjct: 1   MELSILSVSSNTSTMSFGARIGICSTSSSRFSYFPMRKRAGGRVLVPASVRASAEPRSER 60

Query: 61  LDEGQTRSRFTAPAMEVTTLDTSFRETEFPVWEKIGAVVRLSYGVGIYGAMALAGKFICS 120
           LDEGQTRSRFTAPAMEVTTLDTSFRETEFPVWEKIGAVVRLSYGVGIYGAMALAG+FICS
Sbjct: 61  LDEGQTRSRFTAPAMEVTTLDTSFRETEFPVWEKIGAVVRLSYGVGIYGAMALAGRFICS 120

Query: 121 ISGTDWMGGFHPSLDAILEGLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSF 180
           ISGTDWMGGFHPSLDAIL GLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSF
Sbjct: 121 ISGTDWMGGFHPSLDAILGGLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSF 180

Query: 181 FYGMSPWQFILIVAASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFV 240
           FYGMSPWQFILIVAASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFV
Sbjct: 181 FYGMSPWQFILIVAASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFV 240

Query: 241 PFAQGFAAFITAALTGSLYYVAASPKDPTYVVAPVLQSRSGRKDLRKLFAAWYERRQMKK 300
           PFAQGFAAFITAALTGSLYYVAASPKDPTYVVAPVLQSRSGRKDLRKLFAAWYERRQMKK
Sbjct: 241 PFAQGFAAFITAALTGSLYYVAASPKDPTYVVAPVLQSRSGRKDLRKLFAAWYERRQMKK 300

Query: 301 IYSPLLEGLLALYLGFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRLRQRIQQV 360
           IYSPLLEGLLALYLGFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRLRQRIQQV
Sbjct: 301 IYSPLLEGLLALYLGFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRLRQRIQQV 360

Query: 361 KMEGKSSDSL 371
           KMEGKSSDSL
Sbjct: 361 KMEGKSSDSL 370

BLAST of CSPI03G01790 vs. NCBI nr
Match: KAA0053051.1 (CAAX amino terminal protease [Cucumis melo var. makuwa] >TYK11506.1 CAAX amino terminal protease [Cucumis melo var. makuwa])

HSP 1 Score: 686.4 bits (1770), Expect = 1.4e-193
Identity = 349/356 (98.03%), Positives = 351/356 (98.60%), Query Frame = 0

Query: 15  MSFGARIGICSTSSSRFLHFSMRKRAGGRVPVPVSVRASAEPRSERLDEGQTRSRFTAPA 74
           MSFGARIGICSTSSSRF +F MRKRAGGRV VP SVRASAEPRSERLDEGQTRSRFTAPA
Sbjct: 1   MSFGARIGICSTSSSRFSYFPMRKRAGGRVLVPASVRASAEPRSERLDEGQTRSRFTAPA 60

Query: 75  MEVTTLDTSFRETEFPVWEKIGAVVRLSYGVGIYGAMALAGKFICSISGTDWMGGFHPSL 134
           MEVTTLDTSFRETEFPVWEKIGAVVRLSYGVGIYGAMALAG+FICSISGTDWMGGFHPSL
Sbjct: 61  MEVTTLDTSFRETEFPVWEKIGAVVRLSYGVGIYGAMALAGRFICSISGTDWMGGFHPSL 120

Query: 135 DAILEGLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSFFYGMSPWQFILIVA 194
           DAIL GLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSFFYGMSPWQFILIVA
Sbjct: 121 DAILGGLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSFFYGMSPWQFILIVA 180

Query: 195 ASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFVPFAQGFAAFITAAL 254
           ASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFVPFAQGFAAFITAAL
Sbjct: 181 ASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFVPFAQGFAAFITAAL 240

Query: 255 TGSLYYVAASPKDPTYVVAPVLQSRSGRKDLRKLFAAWYERRQMKKIYSPLLEGLLALYL 314
           TGSLYYVAASPKDPTYVVAPVLQSRSGRKDLRKLFAAWYERRQMKKIYSPLLEGLLALYL
Sbjct: 241 TGSLYYVAASPKDPTYVVAPVLQSRSGRKDLRKLFAAWYERRQMKKIYSPLLEGLLALYL 300

Query: 315 GFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRLRQRIQQVKMEGKSSDSL 371
           GFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRLRQRIQQVKMEGKSSDSL
Sbjct: 301 GFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRLRQRIQQVKMEGKSSDSL 356

BLAST of CSPI03G01790 vs. NCBI nr
Match: XP_038906084.1 (uncharacterized protein LOC120091972 [Benincasa hispida])

HSP 1 Score: 682.2 bits (1759), Expect = 2.5e-192
Identity = 348/370 (94.05%), Positives = 354/370 (95.68%), Query Frame = 0

Query: 1   MELSILSVSSNTSTMSFGARIGICSTSSSRFLHFSMRKRAGGRVPVPVSVRASAEPRSER 60
           MELSILSVSSNTSTMSFG RIGICSTSSSR  HF +RKRAGGRV VP  VRASAEPRSER
Sbjct: 1   MELSILSVSSNTSTMSFGGRIGICSTSSSRLSHFPLRKRAGGRVSVPAGVRASAEPRSER 60

Query: 61  LDEGQTRSRFTAPAMEVTTLDTSFRETEFPVWEKIGAVVRLSYGVGIYGAMALAGKFICS 120
           L+EGQTR RF A AMEVTTLD+SFRETEFPVWEKIGAVVRLSYGVGIYGAMALAG+FICS
Sbjct: 61  LEEGQTRGRFNARAMEVTTLDSSFRETEFPVWEKIGAVVRLSYGVGIYGAMALAGRFICS 120

Query: 121 ISGTDWMGGFHPSLDAILEGLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSF 180
           ISGTDWMGGFHPSLDAIL GLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSF
Sbjct: 121 ISGTDWMGGFHPSLDAILGGLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSF 180

Query: 181 FYGMSPWQFILIVAASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFV 240
           FYGMSPWQFILIVAASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFV
Sbjct: 181 FYGMSPWQFILIVAASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFV 240

Query: 241 PFAQGFAAFITAALTGSLYYVAASPKDPTYVVAPVLQSRSGRKDLRKLFAAWYERRQMKK 300
           PFAQ FAA ITA LTGSLYYVAASPKDPTYVVAPVLQSRSGRKDL+KLFAAWYERRQMKK
Sbjct: 241 PFAQAFAAVITAVLTGSLYYVAASPKDPTYVVAPVLQSRSGRKDLKKLFAAWYERRQMKK 300

Query: 301 IYSPLLEGLLALYLGFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRLRQRIQQV 360
           IYSPLLEGLLALYLGFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRL QRIQQ+
Sbjct: 301 IYSPLLEGLLALYLGFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRLHQRIQQL 360

Query: 361 KMEGKSSDSL 371
           KMEGK SDSL
Sbjct: 361 KMEGKGSDSL 370

BLAST of CSPI03G01790 vs. NCBI nr
Match: XP_023553576.1 (uncharacterized protein LOC111810947 [Cucurbita pepo subsp. pepo] >KAG6577697.1 hypothetical protein SDJN03_25271, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 664.8 bits (1714), Expect = 4.2e-187
Identity = 340/370 (91.89%), Positives = 349/370 (94.32%), Query Frame = 0

Query: 1   MELSILSVSSNTSTMSFGARIGICSTSSSRFLHFSMRKRAGGRVPVPVSVRASAEPRSER 60
           MELSI SVSSNTSTMSFGARIGICSTS+SR  HF +RKRA GRV +P  VRASAEPRSER
Sbjct: 1   MELSIFSVSSNTSTMSFGARIGICSTSTSRISHFPVRKRAVGRVRLPSGVRASAEPRSER 60

Query: 61  LDEGQTRSRFTAPAMEVTTLDTSFRETEFPVWEKIGAVVRLSYGVGIYGAMALAGKFICS 120
           L+EGQTR RFT PAME+TTLD SFRETEFPVWEKIGAVVRLSYGVGIYGAMALAG+FICS
Sbjct: 61  LEEGQTRGRFTGPAMEMTTLDASFRETEFPVWEKIGAVVRLSYGVGIYGAMALAGRFICS 120

Query: 121 ISGTDWMGGFHPSLDAILEGLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSF 180
           ISG DWMGGF PSLDAIL GLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSF
Sbjct: 121 ISGIDWMGGFQPSLDAILGGLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSF 180

Query: 181 FYGMSPWQFILIVAASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFV 240
           FYGMSPWQFILIVAASSVGEELFYRAAVQGALADIFLRS DIGADVQGMASLTGVLPPFV
Sbjct: 181 FYGMSPWQFILIVAASSVGEELFYRAAVQGALADIFLRSTDIGADVQGMASLTGVLPPFV 240

Query: 241 PFAQGFAAFITAALTGSLYYVAASPKDPTYVVAPVLQSRSGRKDLRKLFAAWYERRQMKK 300
           PFAQ FAA ITAALTGSLYYVAASPKDPTYVVAPVLQSRS R DL+KLFAAWYERRQMKK
Sbjct: 241 PFAQAFAAVITAALTGSLYYVAASPKDPTYVVAPVLQSRSSRDDLKKLFAAWYERRQMKK 300

Query: 301 IYSPLLEGLLALYLGFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRLRQRIQQV 360
           IYSPLLEGLLALYLGFEWIQTDNILAP+ITHGIYSAVILGHGLWKIHDHRRRLRQRIQQ+
Sbjct: 301 IYSPLLEGLLALYLGFEWIQTDNILAPMITHGIYSAVILGHGLWKIHDHRRRLRQRIQQL 360

Query: 361 KMEGKSSDSL 371
           KMEGK SD L
Sbjct: 361 KMEGKGSDGL 370

BLAST of CSPI03G01790 vs. TAIR 10
Match: AT2G35260.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G17840.1); Has 42 Blast hits to 42 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 42; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 481.5 bits (1238), Expect = 6.1e-136
Identity = 259/384 (67.45%), Positives = 302/384 (78.65%), Query Frame = 0

Query: 1   MELSILSVSSNTSTMSFGARIGICSTSSS---RFLHFSMRKRA------GGRVPVPVSVR 60
           MEL +LS +S+ S     +R G+CS+SSS       F  R+R+      GG      SV 
Sbjct: 1   MELPLLSYASSASF----SRTGLCSSSSSSSTSIYEFPERRRSLKLRFNGGE--RSRSVI 60

Query: 61  ASAEPRSERLDE---------GQTRSRFTAPAMEVTTLDTSFRET---EFPVWEKIGAVV 120
           ASAE  SE +++         G    RF   AMEVTTLD  F  +   +FP+W+KIGAVV
Sbjct: 61  ASAERSSEGIEKTTDTVGGGGGGGAGRFAGTAMEVTTLDRGFANSTTVDFPIWDKIGAVV 120

Query: 121 RLSYGVGIYGAMALAGKFICSISGTDWMGGFHPSLDAILEGLGYAVPPIMALLFILDDEV 180
           RL+YG+GIYGAMA+AG+FICS++G D  GGF PSLDA+L GLGYA PPIMALLFILDDEV
Sbjct: 121 RLTYGIGIYGAMAVAGRFICSVTGIDSSGGFDPSLDALLAGLGYATPPIMALLFILDDEV 180

Query: 181 VKLSPHARAIRDVEDEELRSFFYGMSPWQFILIVAASSVGEELFYRAAVQGALADIFLRS 240
           VKLSPHARAIRDVEDEELRSFF+GMSPWQFILIVAASS+GEELFYR AVQGAL+DIFL+ 
Sbjct: 181 VKLSPHARAIRDVEDEELRSFFFGMSPWQFILIVAASSIGEELFYRVAVQGALSDIFLKG 240

Query: 241 PDIGADVQGMASLTGVLPPFVPFAQGFAAFITAALTGSLYYVAASPKDPTYVVAPVLQSR 300
             +  D +GMASLTGV PPFVPFA+ FAA ITA LTGSLY++AASPKDPTY+VAPVL+SR
Sbjct: 241 TQLMTDSRGMASLTGVFPPFVPFAEVFAAVITATLTGSLYFLAASPKDPTYIVAPVLRSR 300

Query: 301 SGRKDLRKLFAAWYERRQMKKIYSPLLEGLLALYLGFEWIQTDNILAPIITHGIYSAVIL 360
             R D +KL +AWYE+RQMKKIYSPLLEGLLALYLG EW+QTDNILAP++THGIYSAVIL
Sbjct: 301 --RDDFKKLLSAWYEKRQMKKIYSPLLEGLLALYLGIEWVQTDNILAPMMTHGIYSAVIL 360

Query: 361 GHGLWKIHDHRRRLRQRIQQVKME 364
           GHGLWKIHDHRRRLR+RI+ ++ E
Sbjct: 361 GHGLWKIHDHRRRLRRRIEHIRSE 376

BLAST of CSPI03G01790 vs. TAIR 10
Match: AT4G17840.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast, membrane; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Abortive infection protein (InterPro:IPR003675); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G35260.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 441.0 bits (1133), Expect = 9.2e-124
Identity = 239/420 (56.90%), Positives = 295/420 (70.24%), Query Frame = 0

Query: 1   MELSILSVSSNTSTMSFGARIGICSTSSSRFLHFS------------MRKRAGGR----- 60
           M L +LS SS   T+S  +    CS+ S  F   S            ++KR+G R     
Sbjct: 1   MGLPLLSCSSTRVTLSSSSSSSWCSSGSGGFRSSSKLFDSPACSRSDLKKRSGKRNSRLN 60

Query: 61  -------VPVPVSVRASAEPRSERLDEGQTRSR--------------------------- 120
                    +  S  ++ +  SE +D+G   +R                           
Sbjct: 61  GLSLEKLRSIKASSSSAGQSSSEVIDDGDAAARGLAVTSGDVTSVGSFSSGEFVGAGSGG 120

Query: 121 FTAPAMEVTTLDTSFRET--EFPVWEKIGAVVRLSYGVGIYGAMALAGKFICSISGTDWM 180
              P+ EVT++      +  +F  W+KIGA+VRLSYG+GIY  MA+AG+FIC ++G D+ 
Sbjct: 121 LAGPSGEVTSVGEFVGGSGGDFKDWDKIGAIVRLSYGIGIYCGMAVAGRFICEVAGIDYT 180

Query: 181 GGFHPSLDAILEGLGYAVPPIMALLFILDDEVVKLSPHARAIRDVEDEELRSFFYGMSPW 240
           GGF+ SLD I+ GLGYA PPIMALLFILDDEVVKLSPHARAIRDVED+ELR FF GMS W
Sbjct: 181 GGFNASLDTIIAGLGYASPPIMALLFILDDEVVKLSPHARAIRDVEDDELRGFFQGMSAW 240

Query: 241 QFILIVAASSVGEELFYRAAVQGALADIFLRSPDIGADVQGMASLTGVLPPFVPFAQGFA 300
           QFIL+V ASSVGEELFYRAA QGALADIFLR  D+ +D +GM +LTG+LPPFVPFAQ FA
Sbjct: 241 QFILVVTASSVGEELFYRAAFQGALADIFLRGTDLISDSRGMVALTGLLPPFVPFAQVFA 300

Query: 301 AFITAALTGSLYYVAASPKDPTYVVAPVLQSRSGRKDLRKLFAAWYERRQMKKIYSPLLE 360
           A ITAALTGSLYY+AASPKDPTY++APVL++RS R +L+KLFAAWYERRQMKKIYSPLLE
Sbjct: 301 ATITAALTGSLYYIAASPKDPTYIMAPVLKTRSARDELKKLFAAWYERRQMKKIYSPLLE 360

Query: 361 GLLALYLGFEWIQTDNILAPIITHGIYSAVILGHGLWKIHDHRRRLRQRIQQVKMEGKSS 368
           GLL LYLGFEWIQT+N+LAPIITHGIYSAV+LG+GLWK+H H++RLR R+Q+++ EG ++
Sbjct: 361 GLLGLYLGFEWIQTNNLLAPIITHGIYSAVVLGNGLWKLHHHQQRLRLRVQKLETEGDNN 420

The following BLAST results are available for this feature:

ExPASy TrEMBL
Match Name      E-value   Identity  Description
A0A0A0L406      8.2e-205  99.73     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G011760 PE=4 SV=1
A0A1S3BL73      2.1e-200  98.11     uncharacterized protein LOC103490793 OS=Cucumis melo OX=3656 GN=LOC103490793 PE=...
A0A5A7UHN8      6.5e-194  98.03     CAAX amino terminal protease OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_sca...
A0A6J1E5Y3      2.7e-187  91.89     uncharacterized protein LOC111431096 OS=Cucurbita moschata OX=3662 GN=LOC1114310...
A0A6J1HQ13      2.3e-186  91.08     uncharacterized protein LOC111465053 OS=Cucurbita maxima OX=3661 GN=LOC111465053...

NCBI nr
Match Name      E-value   Identity  Description
XP_031738781.1  1.7e-204  99.73     uncharacterized protein LOC101203999 [Cucumis sativus] >KGN55774.1 hypothetical ...
XP_008448701.1  4.3e-200  98.11     PREDICTED: uncharacterized protein LOC103490793 [Cucumis melo]
KAA0053051.1    1.4e-193  98.03     CAAX amino terminal protease [Cucumis melo var. makuwa] >TYK11506.1 CAAX amino t...
XP_038906084.1  2.5e-192  94.05     uncharacterized protein LOC120091972 [Benincasa hispida]
XP_023553576.1  4.2e-187  91.89     uncharacterized protein LOC111810947 [Cucurbita pepo subsp. pepo] >KAG6577697.1 ...

TAIR 10
Match Name      E-value   Identity  Description
AT2G35260.1     6.1e-136  67.45     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G17840.1     9.2e-124  56.90     FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...

InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                              Source   Source Term  Source Description    Alignment
IPR003675  Type II CAAX prenyl endopeptidase Rce1-like  PFAM     PF02517      CPBP                  coord: 186..336; e-value: 6.0E-8; score: 32.9
None       No IPR available                             PANTHER  PTHR36736    OS03G0100030 PROTEIN  coord: 1..367

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI03G01790.1  CSPI03G01790.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0071586      CAAX-box protein processing
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0016020      membrane
molecular_function  GO:0004222      metalloendopeptidase activity