Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATCTTTGAAATGGCTGAAAATGGAGGAAAATCAGCCAATACCCAGAAGAAATCGAGGCGAATTTGAGGTAGAAATGGGTTGTTTTTGAAATTTGAAGATAGATTCAAGAACCCCTTTGAGGGTTGTCGCTGTTTTGGGCGGTGGTGGCCCATGGCGCAAGGTAATCGATGTGCTAATAGCTGAAATGTTGTGGCAGGGAGGTTTTACAGGCACTTTTTGAAGCCCACCCCCACCCCCTCCACTTTATTTTCCCCAATAGGTGACACTCCTTATACACAAAAACAAGTAGATTGAATTGAACATCAAACTTTGAGGGTAAACTATCTTGAATTATTGAAGAAGACAAAGAATGCGGAGAACCCATCAAAAACTGTCCCATTGCTGATTGGGGAACGAAAATGGAGGATTAAACGAGTGAGTAAATGTGAAATCATCCAACCCCATCTCCACTACGAACGATTTTCGGGTTCTTAAGGGTTTCGAATCGTTTTTTTTGGTTGATTTTTCCTCGTTTCGTCGAGGATTTTGTGCTTCCGACTGGCTGAGATGCCGCGGCCAGGCCCGAGACCGTATGAGTGCGTGCGGCGAGCTTGGCATAGTGATAGACATCAGCCGATGAGAGGTTCCATTATTCAGCAAATTTTCAGGTTCTGGGTCGTTTTAAACTTGGCTGATTCTCTTCGATTTGATTGTGTTTTTCTTGTTCTTGTTTTTGTGCATTTTGGTGAATTGCACACCATGTGTTTGATCAGAGTTGTCAATGAGAATCATAGCCCTGCTACTAAAAAGAACAAGGAATGGCAGGAGAAATTGCCGATTGTGGTTTTGAAAGCTGAAGAAATCATGTATTCTAAAGCTAATTCTGAGGTATTTCCCCCTTTTCCTCTTCTGGGTTTTGCTCCAATTTGGATCTTGTTCTTGTTTGAAGCTTTGGATTTCGTCCATTTAATTTTGCTTAGGTTGAGTACATGAATCTTGAGACAGTTTGGGAGCGTTTGAATGATGCTGTTAATACCATAATTCGAAGAGATGAACACAGTGAAACTGGTGAGCTTTTGCCTCCCTGTGTTGAAGGTATAAACCAAATGGTTTCATTTCAAGTTCTTGGATCTAATCATTAAACTCTGCTCTCAAATGGTTTGGTTTGTTGTTTGTTGTTTGTCATTTGTTTTGCTTCTTCTCTGTGGTATGCAGCTGCACTAAACCTCGGCTGCGTTCCGGTACGAGCTTCTCGGAGCCAACGCCATACGAATCCCAGGACATATCTTACTCCAAGAACACAAGAACCATCTACTCTTGCTACCACATTGGATAAAGCCAGTGATGAAAGACCCCTACCGACGTCATTGTTGCGCCTGAGCAACCAGTTGAGTTTTCCACGAGCCACGGCTATGAACTCGAGTATGTTTGGTTCTGAGCATAACAGCCCTACCATTCCAAGTAACCCTGCTTTCTTGATTGAGAATGTTCACAATTACAACTATTCAATGACTGACCTGGGATCTGTTTATCCATTGTATTATGGAATTCGTTTTCGAACTGAAGAGCCGAACGTAGGCTCCCGGTGTTCTGTAGATGCCAACCAACAGACGATATTTTTGGGTAGACCGGTCGTGTCGAGTGCAGAGCCTGCTGAGCTTAGTCTTTGCGCTCGTAAAACTGGAAATGCTATGAGCAGATTCCCATCAGAAGTTATCACAGACACAGAATGTGATTTATCTTTGAGGTTGGGAGTACCTTCCCAGCCATGTGTGAGTACTGGGAAGCCTTGGGCTTCTGAAACCGGAGACATTGGTTCAAGCAGTTCCCATGAGCGGAACAAGTTCCACGATCGACCCATTTACGCAACTAAAGAGTTCACTTTTTTCCCTAACAGAACTTCGTTCGATCCGTCTGGTTCCTGCTCGAATATGTGGAGCTCAGATTGGAGGGGTCAGAATCCAGAATCTTTCACGAAAAAGCGCAAAGAGCCCATTCGTAGCGACGAGGAGGACGAACCATTTTGTCTTCCACCCGAGGCTCCATCTAACTGGTTTGGCAGTCGAACGAAAAGGCCAGGTTTGTAG
mRNA sequence
ATCTTTGAAATGGCTGAAAATGGAGGAAAATCAGCCAATACCCAGAAGAAATCGAGGCGAATTTGAGGTAGAAATGGGTTGTTTTTGAAATTTGAAGATAGATTCAAGAACCCCTTTGAGGGTTGTCGCTGTTTTGGGCGGTGGTGGCCCATGGCGCAAGGTAATCGATGTGCTAATAGCTGAAATGTTGTGGCAGGGAGGTTTTACAGGCACTTTTTGAAGCCCACCCCCACCCCCTCCACTTTATTTTCCCCAATAGGTGACACTCCTTATACACAAAAACAAGTAGATTGAATTGAACATCAAACTTTGAGGGTAAACTATCTTGAATTATTGAAGAAGACAAAGAATGCGGAGAACCCATCAAAAACTGTCCCATTGCTGATTGGGGAACGAAAATGGAGGATTAAACGAGTGAGTAAATGTGAAATCATCCAACCCCATCTCCACTACGAACGATTTTCGGGTTCTTAAGGGTTTCGAATCGTTTTTTTTGGTTGATTTTTCCTCGTTTCGTCGAGGATTTTGTGCTTCCGACTGGCTGAGATGCCGCGGCCAGGCCCGAGACCGTATGAGTGCGTGCGGCGAGCTTGGCATAGTGATAGACATCAGCCGATGAGAGGTTCCATTATTCAGCAAATTTTCAGGTTCTGGGTCGTTTTAAACTTGGCTGATTCTCTTCGATTTGATTGTGTTTTTCTTGTTCTTGTTTTTGTGCATTTTGGTGAATTGCACACCATGTGTTTGATCAGAGTTGTCAATGAGAATCATAGCCCTGCTACTAAAAAGAACAAGGAATGGCAGGAGAAATTGCCGATTGTGGTTTTGAAAGCTGAAGAAATCATGTATTCTAAAGCTAATTCTGAGGTTGAGTACATGAATCTTGAGACAGTTTGGGAGCGTTTGAATGATGCTGTTAATACCATAATTCGAAGAGATGAACACAGTGAAACTGGTGAGCTTTTGCCTCCCTGTGTTGAAGCTGCACTAAACCTCGGCTGCGTTCCGGTACGAGCTTCTCGGAGCCAACGCCATACGAATCCCAGGACATATCTTACTCCAAGAACACAAGAACCATCTACTCTTGCTACCACATTGGATAAAGCCAGTGATGAAAGACCCCTACCGACGTCATTGTTGCGCCTGAGCAACCAGTTGAGTTTTCCACGAGCCACGGCTATGAACTCGAGTATGTTTGGTTCTGAGCATAACAGCCCTACCATTCCAAGTAACCCTGCTTTCTTGATTGAGAATGTTCACAATTACAACTATTCAATGACTGACCTGGGATCTGTTTATCCATTGTATTATGGAATTCGTTTTCGAACTGAAGAGCCGAACGTAGGCTCCCGGTGTTCTGTAGATGCCAACCAACAGACGATATTTTTGGGTAGACCGGTCGTGTCGAGTGCAGAGCCTGCTGAGCTTAGTCTTTGCGCTCGTAAAACTGGAAATGCTATGAGCAGATTCCCATCAGAAGTTATCACAGACACAGAATGTGATTTATCTTTGAGGTTGGGAGTACCTTCCCAGCCATGTGTGAGTACTGGGAAGCCTTGGGCTTCTGAAACCGGAGACATTGGTTCAAGCAGTTCCCATGAGCGGAACAAGTTCCACGATCGACCCATTTACGCAACTAAAGAGTTCACTTTTTTCCCTAACAGAACTTCGTTCGATCCGTCTGGTTCCTGCTCGAATATGTGGAGCTCAGATTGGAGGGGTCAGAATCCAGAATCTTTCACGAAAAAGCGCAAAGAGCCCATTCGTAGCGACGAGGAGGACGAACCATTTTGTCTTCCACCCGAGGCTCCATCTAACTGGTTTGGCAGTCGAACGAAAAGGCCAGGTTTGTAG
Coding sequence (CDS)
ATGCCGCGGCCAGGCCCGAGACCGTATGAGTGCGTGCGGCGAGCTTGGCATAGTGATAGACATCAGCCGATGAGAGGTTCCATTATTCAGCAAATTTTCAGGTTCTGGGTCGTTTTAAACTTGGCTGATTCTCTTCGATTTGATTGTGTTTTTCTTGTTCTTGTTTTTGTGCATTTTGGTGAATTGCACACCATGTGTTTGATCAGAGTTGTCAATGAGAATCATAGCCCTGCTACTAAAAAGAACAAGGAATGGCAGGAGAAATTGCCGATTGTGGTTTTGAAAGCTGAAGAAATCATGTATTCTAAAGCTAATTCTGAGGTTGAGTACATGAATCTTGAGACAGTTTGGGAGCGTTTGAATGATGCTGTTAATACCATAATTCGAAGAGATGAACACAGTGAAACTGGTGAGCTTTTGCCTCCCTGTGTTGAAGCTGCACTAAACCTCGGCTGCGTTCCGGTACGAGCTTCTCGGAGCCAACGCCATACGAATCCCAGGACATATCTTACTCCAAGAACACAAGAACCATCTACTCTTGCTACCACATTGGATAAAGCCAGTGATGAAAGACCCCTACCGACGTCATTGTTGCGCCTGAGCAACCAGTTGAGTTTTCCACGAGCCACGGCTATGAACTCGAGTATGTTTGGTTCTGAGCATAACAGCCCTACCATTCCAAGTAACCCTGCTTTCTTGATTGAGAATGTTCACAATTACAACTATTCAATGACTGACCTGGGATCTGTTTATCCATTGTATTATGGAATTCGTTTTCGAACTGAAGAGCCGAACGTAGGCTCCCGGTGTTCTGTAGATGCCAACCAACAGACGATATTTTTGGGTAGACCGGTCGTGTCGAGTGCAGAGCCTGCTGAGCTTAGTCTTTGCGCTCGTAAAACTGGAAATGCTATGAGCAGATTCCCATCAGAAGTTATCACAGACACAGAATGTGATTTATCTTTGAGGTTGGGAGTACCTTCCCAGCCATGTGTGAGTACTGGGAAGCCTTGGGCTTCTGAAACCGGAGACATTGGTTCAAGCAGTTCCCATGAGCGGAACAAGTTCCACGATCGACCCATTTACGCAACTAAAGAGTTCACTTTTTTCCCTAACAGAACTTCGTTCGATCCGTCTGGTTCCTGCTCGAATATGTGGAGCTCAGATTGGAGGGGTCAGAATCCAGAATCTTTCACGAAAAAGCGCAAAGAGCCCATTCGTAGCGACGAGGAGGACGAACCATTTTGTCTTCCACCCGAGGCTCCATCTAACTGGTTTGGCAGTCGAACGAAAAGGCCAGGTTTGTAG
Protein sequence
MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQIFRFWVVLNLADSLRFDCVFLVLVFVHFGELHTMCLIRVVNENHSPATKKNKEWQEKLPIVVLKAEEIMYSKANSEVEYMNLETVWERLNDAVNTIIRRDEHSETGELLPPCVEAALNLGCVPVRASRSQRHTNPRTYLTPRTQEPSTLATTLDKASDERPLPTSLLRLSNQLSFPRATAMNSSMFGSEHNSPTIPSNPAFLIENVHNYNYSMTDLGSVYPLYYGIRFRTEEPNVGSRCSVDANQQTIFLGRPVVSSAEPAELSLCARKTGNAMSRFPSEVITDTECDLSLRLGVPSQPCVSTGKPWASETGDIGSSSSHERNKFHDRPIYATKEFTFFPNRTSFDPSGSCSNMWSSDWRGQNPESFTKKRKEPIRSDEEDEPFCLPPEAPSNWFGSRTKRPGL
Homology
BLAST of CmoCh12G012250 vs. ExPASy TrEMBL
Match:
A0A6J1FCV2 (uncharacterized protein LOC111444465 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111444465 PE=4 SV=1)
HSP 1 Score: 891.3 bits (2302), Expect = 1.6e-255
Identity = 435/435 (100.00%), Postives = 435/435 (100.00%), Query Frame = 0
Query: 1 MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQIFRFWVVLNLADSLRFDCVFLVLVFVHFG 60
MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQIFRFWVVLNLADSLRFDCVFLVLVFVHFG
Sbjct: 1 MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQIFRFWVVLNLADSLRFDCVFLVLVFVHFG 60
Query: 61 ELHTMCLIRVVNENHSPATKKNKEWQEKLPIVVLKAEEIMYSKANSEVEYMNLETVWERL 120
ELHTMCLIRVVNENHSPATKKNKEWQEKLPIVVLKAEEIMYSKANSEVEYMNLETVWERL
Sbjct: 61 ELHTMCLIRVVNENHSPATKKNKEWQEKLPIVVLKAEEIMYSKANSEVEYMNLETVWERL 120
Query: 121 NDAVNTIIRRDEHSETGELLPPCVEAALNLGCVPVRASRSQRHTNPRTYLTPRTQEPSTL 180
NDAVNTIIRRDEHSETGELLPPCVEAALNLGCVPVRASRSQRHTNPRTYLTPRTQEPSTL
Sbjct: 121 NDAVNTIIRRDEHSETGELLPPCVEAALNLGCVPVRASRSQRHTNPRTYLTPRTQEPSTL 180
Query: 181 ATTLDKASDERPLPTSLLRLSNQLSFPRATAMNSSMFGSEHNSPTIPSNPAFLIENVHNY 240
ATTLDKASDERPLPTSLLRLSNQLSFPRATAMNSSMFGSEHNSPTIPSNPAFLIENVHNY
Sbjct: 181 ATTLDKASDERPLPTSLLRLSNQLSFPRATAMNSSMFGSEHNSPTIPSNPAFLIENVHNY 240
Query: 241 NYSMTDLGSVYPLYYGIRFRTEEPNVGSRCSVDANQQTIFLGRPVVSSAEPAELSLCARK 300
NYSMTDLGSVYPLYYGIRFRTEEPNVGSRCSVDANQQTIFLGRPVVSSAEPAELSLCARK
Sbjct: 241 NYSMTDLGSVYPLYYGIRFRTEEPNVGSRCSVDANQQTIFLGRPVVSSAEPAELSLCARK 300
Query: 301 TGNAMSRFPSEVITDTECDLSLRLGVPSQPCVSTGKPWASETGDIGSSSSHERNKFHDRP 360
TGNAMSRFPSEVITDTECDLSLRLGVPSQPCVSTGKPWASETGDIGSSSSHERNKFHDRP
Sbjct: 301 TGNAMSRFPSEVITDTECDLSLRLGVPSQPCVSTGKPWASETGDIGSSSSHERNKFHDRP 360
Query: 361 IYATKEFTFFPNRTSFDPSGSCSNMWSSDWRGQNPESFTKKRKEPIRSDEEDEPFCLPPE 420
IYATKEFTFFPNRTSFDPSGSCSNMWSSDWRGQNPESFTKKRKEPIRSDEEDEPFCLPPE
Sbjct: 361 IYATKEFTFFPNRTSFDPSGSCSNMWSSDWRGQNPESFTKKRKEPIRSDEEDEPFCLPPE 420
Query: 421 APSNWFGSRTKRPGL 436
APSNWFGSRTKRPGL
Sbjct: 421 APSNWFGSRTKRPGL 435
BLAST of CmoCh12G012250 vs. ExPASy TrEMBL
Match:
A0A6J1FDR6 (uncharacterized protein LOC111444465 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111444465 PE=4 SV=1)
HSP 1 Score: 863.6 bits (2230), Expect = 3.5e-247
Identity = 426/435 (97.93%), Postives = 426/435 (97.93%), Query Frame = 0
Query: 1 MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQIFRFWVVLNLADSLRFDCVFLVLVFVHFG 60
MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQIFRFWVVLNLADSLRFDCVFLVLVFVHFG
Sbjct: 1 MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQIFRFWVVLNLADSLRFDCVFLVLVFVHFG 60
Query: 61 ELHTMCLIRVVNENHSPATKKNKEWQEKLPIVVLKAEEIMYSKANSEVEYMNLETVWERL 120
ELHTMCLIRVVNENHSPATKKNKEWQEKLPIVVLKAEEIMYSKANSEVEYMNLETVWERL
Sbjct: 61 ELHTMCLIRVVNENHSPATKKNKEWQEKLPIVVLKAEEIMYSKANSEVEYMNLETVWERL 120
Query: 121 NDAVNTIIRRDEHSETGELLPPCVEAALNLGCVPVRASRSQRHTNPRTYLTPRTQEPSTL 180
NDAVNTIIRRDEHSET AALNLGCVPVRASRSQRHTNPRTYLTPRTQEPSTL
Sbjct: 121 NDAVNTIIRRDEHSET---------AALNLGCVPVRASRSQRHTNPRTYLTPRTQEPSTL 180
Query: 181 ATTLDKASDERPLPTSLLRLSNQLSFPRATAMNSSMFGSEHNSPTIPSNPAFLIENVHNY 240
ATTLDKASDERPLPTSLLRLSNQLSFPRATAMNSSMFGSEHNSPTIPSNPAFLIENVHNY
Sbjct: 181 ATTLDKASDERPLPTSLLRLSNQLSFPRATAMNSSMFGSEHNSPTIPSNPAFLIENVHNY 240
Query: 241 NYSMTDLGSVYPLYYGIRFRTEEPNVGSRCSVDANQQTIFLGRPVVSSAEPAELSLCARK 300
NYSMTDLGSVYPLYYGIRFRTEEPNVGSRCSVDANQQTIFLGRPVVSSAEPAELSLCARK
Sbjct: 241 NYSMTDLGSVYPLYYGIRFRTEEPNVGSRCSVDANQQTIFLGRPVVSSAEPAELSLCARK 300
Query: 301 TGNAMSRFPSEVITDTECDLSLRLGVPSQPCVSTGKPWASETGDIGSSSSHERNKFHDRP 360
TGNAMSRFPSEVITDTECDLSLRLGVPSQPCVSTGKPWASETGDIGSSSSHERNKFHDRP
Sbjct: 301 TGNAMSRFPSEVITDTECDLSLRLGVPSQPCVSTGKPWASETGDIGSSSSHERNKFHDRP 360
Query: 361 IYATKEFTFFPNRTSFDPSGSCSNMWSSDWRGQNPESFTKKRKEPIRSDEEDEPFCLPPE 420
IYATKEFTFFPNRTSFDPSGSCSNMWSSDWRGQNPESFTKKRKEPIRSDEEDEPFCLPPE
Sbjct: 361 IYATKEFTFFPNRTSFDPSGSCSNMWSSDWRGQNPESFTKKRKEPIRSDEEDEPFCLPPE 420
Query: 421 APSNWFGSRTKRPGL 436
APSNWFGSRTKRPGL
Sbjct: 421 APSNWFGSRTKRPGL 426
BLAST of CmoCh12G012250 vs. ExPASy TrEMBL
Match:
A0A6J1HRU3 (uncharacterized protein LOC111465547 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111465547 PE=4 SV=1)
HSP 1 Score: 851.7 bits (2199), Expect = 1.4e-243
Identity = 419/436 (96.10%), Postives = 424/436 (97.25%), Query Frame = 0
Query: 1 MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQIFRFWVVLNLADSLRFDCVFLVLV-FVHF 60
MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQIFRFWVVLNLADSLRFDCVFLVLV FVHF
Sbjct: 1 MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQIFRFWVVLNLADSLRFDCVFLVLVLFVHF 60
Query: 61 GELHTMCLIRVVNENHSPATKKNKEWQEKLPIVVLKAEEIMYSKANSEVEYMNLETVWER 120
GELHTMCLIRVVNENHSPATKKNKEWQEKLPIVVLKAEEIMYSKANSEVEYMNLETVWER
Sbjct: 61 GELHTMCLIRVVNENHSPATKKNKEWQEKLPIVVLKAEEIMYSKANSEVEYMNLETVWER 120
Query: 121 LNDAVNTIIRRDEHSETGELLPPCVEAALNLGCVPVRASRSQRHTNPRTYLTPRTQEPST 180
LNDAVNTIIRRDEHSETGELLPPCVEAALNLGCVPVRASRS+RH NPRTYLTPRTQEPST
Sbjct: 121 LNDAVNTIIRRDEHSETGELLPPCVEAALNLGCVPVRASRSRRHMNPRTYLTPRTQEPST 180
Query: 181 LATTLDKASDERPLPTSLLRLSNQLSFPRATAMNSSMFGSEHNSPTIPSNPAFLIENVHN 240
LATTLDKA+DER LPTS LR SNQLSFPRATAMNSSMFGSEHNSPTIPSNPAFLIENVHN
Sbjct: 181 LATTLDKATDERRLPTSSLRPSNQLSFPRATAMNSSMFGSEHNSPTIPSNPAFLIENVHN 240
Query: 241 YNYSMTDLGSVYPLYYGIRFRTEEPNVGSRCSVDANQQTIFLGRPVVSSAEPAELSLCAR 300
YNYSMTDLGSVYPLYYGIRFRTEEPN+GS CSVDANQQTIFLGRPVVSSAEPAELSLCAR
Sbjct: 241 YNYSMTDLGSVYPLYYGIRFRTEEPNLGSLCSVDANQQTIFLGRPVVSSAEPAELSLCAR 300
Query: 301 KTGNAMSRFPSEVITDTECDLSLRLGVPSQPCVSTGKPWASETGDIGSSSSHERNKFHDR 360
KTGNAMSRFPSEVITDTECDLSLRLGVPSQPCVSTG+PWASE GDIGSSSSHERNKFHDR
Sbjct: 301 KTGNAMSRFPSEVITDTECDLSLRLGVPSQPCVSTGQPWASEAGDIGSSSSHERNKFHDR 360
Query: 361 PIYATKEFTFFPNRTSFDPSGSCSNMWSSDWRGQNPESFTKKRKEPIRSDEEDEPFCLPP 420
PIYATKEF+FFPNRTSFDP GS SNMWSSDWRGQNPESFTKKRKEPIRSDEEDEPFC PP
Sbjct: 361 PIYATKEFSFFPNRTSFDPFGSFSNMWSSDWRGQNPESFTKKRKEPIRSDEEDEPFCFPP 420
Query: 421 EAPSNWFGSRTKRPGL 436
EAPSNWF RTKRPGL
Sbjct: 421 EAPSNWFDRRTKRPGL 436
BLAST of CmoCh12G012250 vs. ExPASy TrEMBL
Match:
A0A6J1HMI3 (uncharacterized protein LOC111465547 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111465547 PE=4 SV=1)
HSP 1 Score: 823.9 bits (2127), Expect = 3.1e-235
Identity = 410/436 (94.04%), Postives = 415/436 (95.18%), Query Frame = 0
Query: 1 MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQIFRFWVVLNLADSLRFDCVFLVLV-FVHF 60
MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQIFRFWVVLNLADSLRFDCVFLVLV FVHF
Sbjct: 1 MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQIFRFWVVLNLADSLRFDCVFLVLVLFVHF 60
Query: 61 GELHTMCLIRVVNENHSPATKKNKEWQEKLPIVVLKAEEIMYSKANSEVEYMNLETVWER 120
GELHTMCLIRVVNENHSPATKKNKEWQEKLPIVVLKAEEIMYSKANSEVEYMNLETVWER
Sbjct: 61 GELHTMCLIRVVNENHSPATKKNKEWQEKLPIVVLKAEEIMYSKANSEVEYMNLETVWER 120
Query: 121 LNDAVNTIIRRDEHSETGELLPPCVEAALNLGCVPVRASRSQRHTNPRTYLTPRTQEPST 180
LNDAVNTIIRRDEHSET AALNLGCVPVRASRS+RH NPRTYLTPRTQEPST
Sbjct: 121 LNDAVNTIIRRDEHSET---------AALNLGCVPVRASRSRRHMNPRTYLTPRTQEPST 180
Query: 181 LATTLDKASDERPLPTSLLRLSNQLSFPRATAMNSSMFGSEHNSPTIPSNPAFLIENVHN 240
LATTLDKA+DER LPTS LR SNQLSFPRATAMNSSMFGSEHNSPTIPSNPAFLIENVHN
Sbjct: 181 LATTLDKATDERRLPTSSLRPSNQLSFPRATAMNSSMFGSEHNSPTIPSNPAFLIENVHN 240
Query: 241 YNYSMTDLGSVYPLYYGIRFRTEEPNVGSRCSVDANQQTIFLGRPVVSSAEPAELSLCAR 300
YNYSMTDLGSVYPLYYGIRFRTEEPN+GS CSVDANQQTIFLGRPVVSSAEPAELSLCAR
Sbjct: 241 YNYSMTDLGSVYPLYYGIRFRTEEPNLGSLCSVDANQQTIFLGRPVVSSAEPAELSLCAR 300
Query: 301 KTGNAMSRFPSEVITDTECDLSLRLGVPSQPCVSTGKPWASETGDIGSSSSHERNKFHDR 360
KTGNAMSRFPSEVITDTECDLSLRLGVPSQPCVSTG+PWASE GDIGSSSSHERNKFHDR
Sbjct: 301 KTGNAMSRFPSEVITDTECDLSLRLGVPSQPCVSTGQPWASEAGDIGSSSSHERNKFHDR 360
Query: 361 PIYATKEFTFFPNRTSFDPSGSCSNMWSSDWRGQNPESFTKKRKEPIRSDEEDEPFCLPP 420
PIYATKEF+FFPNRTSFDP GS SNMWSSDWRGQNPESFTKKRKEPIRSDEEDEPFC PP
Sbjct: 361 PIYATKEFSFFPNRTSFDPFGSFSNMWSSDWRGQNPESFTKKRKEPIRSDEEDEPFCFPP 420
Query: 421 EAPSNWFGSRTKRPGL 436
EAPSNWF RTKRPGL
Sbjct: 421 EAPSNWFDRRTKRPGL 427
BLAST of CmoCh12G012250 vs. ExPASy TrEMBL
Match:
A0A6J1FIL0 (uncharacterized protein LOC111444465 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111444465 PE=4 SV=1)
HSP 1 Score: 803.1 bits (2073), Expect = 5.6e-229
Identity = 400/435 (91.95%), Postives = 400/435 (91.95%), Query Frame = 0
Query: 1 MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQIFRFWVVLNLADSLRFDCVFLVLVFVHFG 60
MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQIF
Sbjct: 1 MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQIF--------------------------- 60
Query: 61 ELHTMCLIRVVNENHSPATKKNKEWQEKLPIVVLKAEEIMYSKANSEVEYMNLETVWERL 120
RVVNENHSPATKKNKEWQEKLPIVVLKAEEIMYSKANSEVEYMNLETVWERL
Sbjct: 61 --------RVVNENHSPATKKNKEWQEKLPIVVLKAEEIMYSKANSEVEYMNLETVWERL 120
Query: 121 NDAVNTIIRRDEHSETGELLPPCVEAALNLGCVPVRASRSQRHTNPRTYLTPRTQEPSTL 180
NDAVNTIIRRDEHSETGELLPPCVEAALNLGCVPVRASRSQRHTNPRTYLTPRTQEPSTL
Sbjct: 121 NDAVNTIIRRDEHSETGELLPPCVEAALNLGCVPVRASRSQRHTNPRTYLTPRTQEPSTL 180
Query: 181 ATTLDKASDERPLPTSLLRLSNQLSFPRATAMNSSMFGSEHNSPTIPSNPAFLIENVHNY 240
ATTLDKASDERPLPTSLLRLSNQLSFPRATAMNSSMFGSEHNSPTIPSNPAFLIENVHNY
Sbjct: 181 ATTLDKASDERPLPTSLLRLSNQLSFPRATAMNSSMFGSEHNSPTIPSNPAFLIENVHNY 240
Query: 241 NYSMTDLGSVYPLYYGIRFRTEEPNVGSRCSVDANQQTIFLGRPVVSSAEPAELSLCARK 300
NYSMTDLGSVYPLYYGIRFRTEEPNVGSRCSVDANQQTIFLGRPVVSSAEPAELSLCARK
Sbjct: 241 NYSMTDLGSVYPLYYGIRFRTEEPNVGSRCSVDANQQTIFLGRPVVSSAEPAELSLCARK 300
Query: 301 TGNAMSRFPSEVITDTECDLSLRLGVPSQPCVSTGKPWASETGDIGSSSSHERNKFHDRP 360
TGNAMSRFPSEVITDTECDLSLRLGVPSQPCVSTGKPWASETGDIGSSSSHERNKFHDRP
Sbjct: 301 TGNAMSRFPSEVITDTECDLSLRLGVPSQPCVSTGKPWASETGDIGSSSSHERNKFHDRP 360
Query: 361 IYATKEFTFFPNRTSFDPSGSCSNMWSSDWRGQNPESFTKKRKEPIRSDEEDEPFCLPPE 420
IYATKEFTFFPNRTSFDPSGSCSNMWSSDWRGQNPESFTKKRKEPIRSDEEDEPFCLPPE
Sbjct: 361 IYATKEFTFFPNRTSFDPSGSCSNMWSSDWRGQNPESFTKKRKEPIRSDEEDEPFCLPPE 400
Query: 421 APSNWFGSRTKRPGL 436
APSNWFGSRTKRPGL
Sbjct: 421 APSNWFGSRTKRPGL 400
BLAST of CmoCh12G012250 vs. TAIR 10
Match:
AT3G24150.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G32295.1); Has 50 Blast hits to 50 proteins in 12 species: Archae - 0; Bacteria - 0; Metazoa - 7; Fungi - 0; Plants - 41; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )
HSP 1 Score: 246.9 bits (629), Expect = 3.0e-65
Identity = 163/419 (38.90%), Postives = 217/419 (51.79%), Query Frame = 0
Query: 1 MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQIFRFWVVLNLADSLRFDCVFLVLVFVHFG 60
MPRPGPRPYECV+RAWHSDRHQP+RGSII+QIF
Sbjct: 1 MPRPGPRPYECVKRAWHSDRHQPIRGSIIRQIF--------------------------- 60
Query: 61 ELHTMCLIRVVNENHSPATKKNKEWQEKLPIVVLKAEEIMYSKANSEVEYMNLETVWERL 120
R+ E HS AT+KNKEWQEKLP+VVLKAEEIMYSKANSE EY + +T+W R+
Sbjct: 61 --------RLAMEAHSAATRKNKEWQEKLPVVVLKAEEIMYSKANSEEEYTDADTMWNRV 120
Query: 121 NDAVNTIIRRDEHSETGELLPPCVEAALNLGCVPVRASRSQRHTNPRTYLTPRTQEPSTL 180
NDA++TIIRRDE +ETG LLPPCVEAALNLGC+ VRASRSQRH++ RTYL P+ QEP +
Sbjct: 121 NDAIDTIIRRDESTETGPLLPPCVEAALNLGCIAVRASRSQRHSSGRTYLGPKIQEPVSA 180
Query: 181 ATTLDKASDERPLPTSLLRLSNQLSFPRATAMNSSMFGSEHNSPTIPSNPAFLIENVHNY 240
+T ++ S + S + S A+ + + + P FL E++ +
Sbjct: 181 ST--NEPSYHHEYRQQAQQSSTKPSQTVQAAVPVDVLDNSNKRVATPRGYPFLHESMQMH 240
Query: 241 NYSM----------------TDLGSVYPLYYGIRFRTEEPNVGSRCSVDANQQTIFLGRP 300
+ +LGSVYPLYY +T++ ++ R + I +G P
Sbjct: 241 QKPLAIRQGTGPASAPAPAPVNLGSVYPLYYEGNNQTQQADMSFR----VPEAPIIIGMP 300
Query: 301 VVSSAEPAELSLCARKTGNAMSRFPSEVITDTECDLSLRLGVPSQPCVSTGKPWASETGD 360
+ PSE T+ CDLSLRLG+ S+P S D
Sbjct: 301 IGIK--------------------PSEEATERVCDLSLRLGISSEP---------STRID 336
Query: 361 IGSSSSHERNKFHDRPIYATKEFTFFPNRTSFDPSGSCSNMWSSDWRGQNPESFTKKRK 404
+GSS ++ + ++A + F+ W S+ GQN +S KK +
Sbjct: 361 VGSSRAYPGRNQEELCLFAEVK-----KNDRFE--------WFSNSEGQNSDSRVKKHR 336
BLAST of CmoCh12G012250 vs. TAIR 10
Match:
AT4G32295.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G24150.1); Has 39 Blast hits to 39 proteins in 10 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 201.4 bits (511), Expect = 1.4e-51
Identity = 100/171 (58.48%), Postives = 120/171 (70.18%), Query Frame = 0
Query: 1 MPRPGPRPYECVRRAWHSDRHQPMRGSIIQQIFRFWVVLNLADSLRFDCVFLVLVFVHFG 60
MPRPGPRPY+C+RRAWHSDRHQPMRG +IQ+IF
Sbjct: 1 MPRPGPRPYDCIRRAWHSDRHQPMRGLLIQEIF--------------------------- 60
Query: 61 ELHTMCLIRVVNENHSPATKKNKEWQEKLPIVVLKAEEIMYSKANSEVEYMNLETVWERL 120
R+V E HS +T+KN EWQEKLP+VVL+AEEIMYSKANSE EYM+++T+ +R
Sbjct: 61 --------RIVCEIHSQSTRKNTEWQEKLPVVVLRAEEIMYSKANSEAEYMDMKTLLDRT 120
Query: 121 NDAVNTIIRRDEHSETGELLPPCVEAALNLGCVPVRASRSQRHTNPRTYLT 172
NDA+NTIIR DE +ETGE L PC+EAAL+LGC P RASRSQR+ NPR YL+
Sbjct: 121 NDAINTIIRLDETTETGEFLQPCIEAALHLGCTPRRASRSQRNINPRCYLS 136
BLAST of CmoCh12G012250 vs. TAIR 10
Match:
AT4G32295.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G24150.1); Has 34 Blast hits to 34 proteins in 8 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 34; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 107.1 bits (266), Expect = 3.7e-23
Identity = 50/72 (69.44%), Postives = 61/72 (84.72%), Query Frame = 0
Query: 100 MYSKANSEVEYMNLETVWERLNDAVNTIIRRDEHSETGELLPPCVEAALNLGCVPVRASR 159
MYSKANSE EYM+++T+ +R NDA+NTIIR DE +ETGE L PC+EAAL+LGC P RASR
Sbjct: 1 MYSKANSEAEYMDMKTLLDRTNDAINTIIRLDETTETGEFLQPCIEAALHLGCTPRRASR 60
Query: 160 SQRHTNPRTYLT 172
SQR+ NPR YL+
Sbjct: 61 SQRNINPRCYLS 72
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1FCV2 | 1.6e-255 | 100.00 | uncharacterized protein LOC111444465 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1FDR6 | 3.5e-247 | 97.93 | uncharacterized protein LOC111444465 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1HRU3 | 1.4e-243 | 96.10 | uncharacterized protein LOC111465547 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1HMI3 | 3.1e-235 | 94.04 | uncharacterized protein LOC111465547 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1FIL0 | 5.6e-229 | 91.95 | uncharacterized protein LOC111444465 isoform X3 OS=Cucurbita moschata OX=3662 GN... | [more] |
Match Name | E-value | Identity | Description | |
AT3G24150.1 | 3.0e-65 | 38.90 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |
AT4G32295.1 | 1.4e-51 | 58.48 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |
AT4G32295.2 | 3.7e-23 | 69.44 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |