Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TCTACGAACTCCAAAATGCAATCAATCCATGCATTGGGCTTCTCCCACTCTCTTCAATTTTCCCAATCCCATTTCCATCCCAACCCCAAAAATTCTCTGCTAAGAACACTCTCCAGCCATGGAAGCAACAGGAAGGCTAGAACAAGCCTCAGTCTCAGAACATCTTGGCCCTCCATTTCCATTGCCCTATTTGGTTCAGGCTTTCTCTTAGGTCCTCTTCTCGATGGTCTCCATTCTCGGGTGAATCTCGTCGTTTATCAAATAGGATCCGTCGATATCGGCCCACTCCGCACTAACATTTGGGTAAGATTTTCGAATCGCGATTGTAGAACCCTATGATTCGTGAATTGCCACTAATTTTTAGGGCAACTGATGGTTTTACGAGCGATTTTGAATCTGAATGCTGCTCTAGGTTCCTTTCTTGCTGGGAGTATTTTACTGTACTGTTGGATTGATTCAACTCTACATAGATGAGAATTTTTCGCCGAACAGACCGGAGGGGAGTTTGGGCAAGACAGTAGCATCCTTAATGTGAGCTTTTTTTTTTTGTTTGTATTAGATTTTTTAGGGAGTGAAAATGATCTATTACAAGCGATTTTCATATAATTAGTATTCTAATATGTATTGTTTGTTACTGTTTGATCAGAGCATTGGCTTTGTTTATTGAATTGAGTGCTGAAATGTACAAAGCTGGAGTGGCTCCCAACATTGAGGCATATGCATTGTTTGCTGGGGCTGAGTTTATATGGGCATTGCTTGATAGTTCATTGCTTGGCTTCTCACTGGCTTGTGTTGTTGGCCTTCTCTGCCCTCTGGCTGAGATTCCCATTATGAAGTATGTGAACGACCCTCTTTCAATTCACCTACAAGTAGTTTATGATATTTGTTGGTTCTTGTACATTTTTATGTGATGATTTAGTTCCTATATATTTAGAAATGTAGAGGTTAATTCTCTATTGTTAGAAGTTTAGTTGTTAGTGAGGCAATCGAATGACTAAATACTTGGACTACTTTTGAGTTTGGATTATAGGGAGGTGTATATCTACTTTATTCACTTGACCCAACTTCAAGTACGTTAGTTGCTACTTGAAGAGATTCCTACGAAATTTTGAATGTCAAGAACTTAATTGTTATATTTGTAAAAATATTAGGACGAAATTGTTTTTGAGCAAAACATGAGTGCCAACAACGTTAGAAGAGGCTTAGGATGAGAAAAGAAATGTTATCAGAGGTAGACTTTATTATAATTAAAACCCTTGTTACAAGGAGCGAGGTTAGCCTTTTCGAGATCCCACATTAGTTGGAGAGGGGAACGAAGCATTCCTTATAAGAGTGTGGAAAAGTCTCCCTAGCAAACACGTTTTAAAACTGTGAGGCTGACGACGATACGTAACGAGCCAAAGTGGACAATATTTGCTAGCGATGGGCTTAGGTTGTTACAAATAGTATAAGAGCCAGACATCGGGCGTTGTGCCAACAAGGACGCTGGATCTCCAAGGGGGGTGGATTGTGAGATCCCACATCGGTTGGAGAGGGAAACGAAATATTCTTTATAAGGGTGTGAAAATCTCTCCCTAGCAGACGTGTTTTAAAACCATGAGGTTGACGGCGATGTGTAAGAAGTCAAAGTGGACAATATTTGCTAGTAGTGAGCTTAGGCTGTTATAGCTTTAGGATGGACGAAGTCTCCCCAAAAGATTAGTAGGGTAGATCGGGAGCATTGTACAAGGAGTCAAGTCATGTGATACCTTGATTACAAAAAATATGAAGATTTCATCATGAGTTAGATGCAGTTAGTACATTTCTTCTCAAACTAAAAGGAAAAAACTTGAGATGATATTATGTTCTTGGCAACTTTAAAGGTGATTCAAAAGATGCCTCGCTTTTATAAAAGAATATCCAATATTTTATGACTAATCTGATGAGCTGCCGTTCCTTTGAGCCCCAACTGATTTCATCATAGAAGCAAACGATGGTTTACAGAGAATCACTTAGATGATCAAGTGATATTCGATTCGTAAACAATTTTGTTTTTCTTCGATTCTGTTTTCGCTAACCAAATGCTACTCTACTCCTTGATCTAATGCTTATACAGGTTTTTCCATCTCTGGTATTATCCACAAGCGAATGTCGAGATCTTTGGTGAGGTAAGTGATGAAGGCACATATCAGCAAGTGATGGGAAATTCATGAGTATAAATCCTGGAAAGTTTTAAAGCTTGCTTTTGTTTAATGCAGGGGCTAATCAGCTGGACAATCACATGCTATTTTGTATACACTCCATTCTTGATTAATTTATCGAGGTGGCTCAAGTCTGTGTTGGATTCTGTTGCTGTAAAAAAGGATGGGTCTGCTTAATCAAATGTTTTTCTGCCTAATAGATTGGAAGGTTGGAGCTGGGAAGGTAAATAATTAATTTTGATGACTTGTCATTTTTATTTCTGTTTGAAGATTATTTTCTGTTTTTATAGGAAAAATCCATTAAATTGAATAGAAATTCAAATTTGGATTTCAGTAAGAGAGGAAATTAACTAAATTTTGTGTTTGTAGGGTTAGGAATTACGACTCTTTACAATGGTATGATATTGTCCACTTTGAGCCTAAGCTCTCATGACTTTGCTTTGGGCTTCCCCAAAAGACCTCATATCAATGGAGATGTATTTCCTTACTTTAGAACCCCATGATCATTCCCTAAATTAGCCAATATAGGACTCCCTTCCAATAATCCTTAACAATTTTCCCTCGAACAAACTACACTATGGAGCCTCCCTTTAATTGAGGCTCGACTCCTTCTCTAAGCCCTCAAACAAAGTAGAACCTTTGTTCGATACTTTCGTCACTTTTGATTACACCTTCGAGGTTCACAATTCTTTGTTCGACATTTGAGCATTCTATTGACATGACTAAGTTTAGAGCATGACTTTGATACCATGTTAGGATTCACGATTCTCCACAATAGTATGATATTGTCCACTTTGAACCTAAGCTCTCATGGCTTTGCTTTGGGCTTTCCTTAACTCCTCGTACCAATGGAGATGTATTTCTTTACTTATAAACTCATGATCCTTCCTTAAGTTAGGCAACGTGGAACTCCCTCCCAACAATCCTCTAACAAGTAGGAGACAATAAACTTCTAATTTCAAGCCCGGTAAGTTCAATCAAAAGTTGAAGAAGTAATTAAACTAGAAATTTAAGGATTTATTAGACATTTTATAAAATTGCAAGACATATTTAACACACATTTTGCGGTATAACTTCATTTCTACGATGCTTTCTATTCAAAATCCATAATGCAAAGACTCTGCGACCGAAGAATGTGTATTATAAACAACCAATTTGACTTGCCCACCATTAAGATTCTAGGGTTTTGTGAGAGGCCTAAATAAATGCCCAACCATTGGAGTTCATTCACATGCACCATTTTCTGAAACGAACTCCCAAATCTTATCTTCTTGAAGCCAAGTTGGGTAATAGTACTTCACTCAAATGGCTTGGGGCAATTAATATTATTAGCTCATGTGCAGTTACAGGCCATTTTCATACAAAAATAGCTACGCTCTTCATTGCCTTTCTGTTCTGCGTCCTTATTATTCCTTTTATATTGGTTTGATTCAGAAGCAAGAAAGTGGGTGTGAGTTAAATTCCACCTGCCAATAACTTAGGGTAAGTTTCATTTTGATTCATTTATTGCCCATTTGCATTAGTTGTGAGCTTTCTGCTGCTGATGCCTATGTGGGGAACGTTGGATGTCTATCATACAAAAAAAATCTCCTTTCTATTAGTACTTTTGGGTTTTGTGATGCTCGTCATCGTGTTGAATATACTTTGGATTCTTGAGTGTATCATTTGAGGAC
mRNA sequence
TCTACGAACTCCAAAATGCAATCAATCCATGCATTGGGCTTCTCCCACTCTCTTCAATTTTCCCAATCCCATTTCCATCCCAACCCCAAAAATTCTCTGCTAAGAACACTCTCCAGCCATGGAAGCAACAGGAAGGCTAGAACAAGCCTCAGTCTCAGAACATCTTGGCCCTCCATTTCCATTGCCCTATTTGGTTCAGGCTTTCTCTTAGGTCCTCTTCTCGATGGTCTCCATTCTCGGGTGAATCTCGTCGTTTATCAAATAGGATCCGTCGATATCGGCCCACTCCGCACTAACATTTGGGTTCCTTTCTTGCTGGGAGTATTTTACTGTACTGTTGGATTGATTCAACTCTACATAGATGAGAATTTTTCGCCGAACAGACCGGAGGGGAGTTTGGGCAAGACAGTAGCATCCTTAATAGCATTGGCTTTGTTTATTGAATTGAGTGCTGAAATGTACAAAGCTGGAGTGGCTCCCAACATTGAGGCATATGCATTGTTTGCTGGGGCTGAGTTTATATGGGCATTGCTTGATAGTTCATTGCTTGGCTTCTCACTGGCTTGTGTTGTTGGCCTTCTCTGCCCTCTGGCTGAGATTCCCATTATGAAGTTTTTCCATCTCTGGTATTATCCACAAGCGAATGTCGAGATCTTTGGTGAGGGGCTAATCAGCTGGACAATCACATGCTATTTTGTATACACTCCATTCTTGATTAATTTATCGAGGTGGCTCAAGTCTGTGTTGGATTCTGTTGCTGTAAAAAAGGATGGGTCTGCTTAATCAAATGTTTTTCTGCCTAATAGATTGGAAGGTTGGAGCTGGGAAGAAGCAAGAAAGTGGGTGTGAGTTAAATTCCACCTGCCAATAACTTAGGGTAAGTTTCATTTTGATTCATTTATTGCCCATTTGCATTAGTTGTGAGCTTTCTGCTGCTGATGCCTATGTGGGGAACGTTGGATGTCTATCATACAAAAAAAATCTCCTTTCTATTAGTACTTTTGGGTTTTGTGATGCTCGTCATCGTGTTGAATATACTTTGGATTCTTGAGTGTATCATTTGAGGAC
Coding sequence (CDS)
ATGCAATCAATCCATGCATTGGGCTTCTCCCACTCTCTTCAATTTTCCCAATCCCATTTCCATCCCAACCCCAAAAATTCTCTGCTAAGAACACTCTCCAGCCATGGAAGCAACAGGAAGGCTAGAACAAGCCTCAGTCTCAGAACATCTTGGCCCTCCATTTCCATTGCCCTATTTGGTTCAGGCTTTCTCTTAGGTCCTCTTCTCGATGGTCTCCATTCTCGGGTGAATCTCGTCGTTTATCAAATAGGATCCGTCGATATCGGCCCACTCCGCACTAACATTTGGGTTCCTTTCTTGCTGGGAGTATTTTACTGTACTGTTGGATTGATTCAACTCTACATAGATGAGAATTTTTCGCCGAACAGACCGGAGGGGAGTTTGGGCAAGACAGTAGCATCCTTAATAGCATTGGCTTTGTTTATTGAATTGAGTGCTGAAATGTACAAAGCTGGAGTGGCTCCCAACATTGAGGCATATGCATTGTTTGCTGGGGCTGAGTTTATATGGGCATTGCTTGATAGTTCATTGCTTGGCTTCTCACTGGCTTGTGTTGTTGGCCTTCTCTGCCCTCTGGCTGAGATTCCCATTATGAAGTTTTTCCATCTCTGGTATTATCCACAAGCGAATGTCGAGATCTTTGGTGAGGGGCTAATCAGCTGGACAATCACATGCTATTTTGTATACACTCCATTCTTGATTAATTTATCGAGGTGGCTCAAGTCTGTGTTGGATTCTGTTGCTGTAAAAAAGGATGGGTCTGCTTAA
Protein sequence
MQSIHALGFSHSLQFSQSHFHPNPKNSLLRTLSSHGSNRKARTSLSLRTSWPSISIALFGSGFLLGPLLDGLHSRVNLVVYQIGSVDIGPLRTNIWVPFLLGVFYCTVGLIQLYIDENFSPNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSLLGFSLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLSRWLKSVLDSVAVKKDGSA
Homology
BLAST of CmoCh01G002820.1 vs. ExPASy TrEMBL
Match:
A0A6J1GD48 (uncharacterized protein LOC111452860 OS=Cucurbita moschata OX=3662 GN=LOC111452860 PE=4 SV=1)
HSP 1 Score: 509.2 bits (1310), Expect = 9.8e-141
Identity = 255/255 (100.00%), Postives = 255/255 (100.00%), Query Frame = 0
Query: 1 MQSIHALGFSHSLQFSQSHFHPNPKNSLLRTLSSHGSNRKARTSLSLRTSWPSISIALFG 60
MQSIHALGFSHSLQFSQSHFHPNPKNSLLRTLSSHGSNRKARTSLSLRTSWPSISIALFG
Sbjct: 1 MQSIHALGFSHSLQFSQSHFHPNPKNSLLRTLSSHGSNRKARTSLSLRTSWPSISIALFG 60
Query: 61 SGFLLGPLLDGLHSRVNLVVYQIGSVDIGPLRTNIWVPFLLGVFYCTVGLIQLYIDENFS 120
SGFLLGPLLDGLHSRVNLVVYQIGSVDIGPLRTNIWVPFLLGVFYCTVGLIQLYIDENFS
Sbjct: 61 SGFLLGPLLDGLHSRVNLVVYQIGSVDIGPLRTNIWVPFLLGVFYCTVGLIQLYIDENFS 120
Query: 121 PNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSLLGF 180
PNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSLLGF
Sbjct: 121 PNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSLLGF 180
Query: 181 SLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLSRWL 240
SLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLSRWL
Sbjct: 181 SLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLSRWL 240
Query: 241 KSVLDSVAVKKDGSA 256
KSVLDSVAVKKDGSA
Sbjct: 241 KSVLDSVAVKKDGSA 255
BLAST of CmoCh01G002820.1 vs. ExPASy TrEMBL
Match:
A0A6J1KGW6 (uncharacterized protein LOC111493103 OS=Cucurbita maxima OX=3661 GN=LOC111493103 PE=4 SV=1)
HSP 1 Score: 485.7 bits (1249), Expect = 1.2e-133
Identity = 245/256 (95.70%), Postives = 250/256 (97.66%), Query Frame = 0
Query: 1 MQSIHALGFSHSLQFSQSHFHPNPKNSLLRTLSSHGSNRKARTSLSLRTSWPSISIALFG 60
MQSIHALGFSHSLQFSQSHFHPNPKNSLLRTL +HGSNR+ARTSLSLRTSWPSISIALFG
Sbjct: 1 MQSIHALGFSHSLQFSQSHFHPNPKNSLLRTLGTHGSNRQARTSLSLRTSWPSISIALFG 60
Query: 61 SGFLLGPLLDGLHSRVNLVVYQIGSVDIGPLRTNIWVPFLLGVFYCTVGLIQLYIDENFS 120
SGFLLGPLLDGLHSRVNLVVYQIGS+DIGPLRTNI VPFLLG+FYCTVGLIQLYIDENF
Sbjct: 61 SGFLLGPLLDGLHSRVNLVVYQIGSLDIGPLRTNICVPFLLGLFYCTVGLIQLYIDENFL 120
Query: 121 PNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSLLGF 180
PNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSLLGF
Sbjct: 121 PNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSLLGF 180
Query: 181 SLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLSRWL 240
SLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLSRWL
Sbjct: 181 SLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLSRWL 240
Query: 241 -KSVLDSVAVKKDGSA 256
SV+DS AVKKDGSA
Sbjct: 241 MMSVVDSAAVKKDGSA 256
BLAST of CmoCh01G002820.1 vs. ExPASy TrEMBL
Match:
A0A6J1DRY1 (uncharacterized protein LOC111023831 OS=Momordica charantia OX=3673 GN=LOC111023831 PE=4 SV=1)
HSP 1 Score: 406.4 bits (1043), Expect = 9.0e-110
Identity = 210/255 (82.35%), Postives = 223/255 (87.45%), Query Frame = 0
Query: 1 MQSIHALGFSHSLQFSQSHFHPNPKNSLLRTLSSHGSNRKARTSLSLRTSWPSISIALFG 60
MQSI+ LG S LQF Q F K+S L+ SHGS + RTSLSLRT+WPSISIALFG
Sbjct: 1 MQSIYGLGSSRYLQFPQFPFRSISKHSPLKPRCSHGSESR-RTSLSLRTTWPSISIALFG 60
Query: 61 SGFLLGPLLDGLHSRVNLVVYQIGSVDIGPLRTNIWVPFLLGVFYCTVGLIQLYIDENFS 120
SGFLLGPLLDGLHSRVNLVVYQ GS+D+GPL TNIWVPFLLG+FY TVGL+QLYIDENFS
Sbjct: 61 SGFLLGPLLDGLHSRVNLVVYQTGSIDVGPLHTNIWVPFLLGLFYSTVGLMQLYIDENFS 120
Query: 121 PNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSLLGF 180
N EGSLG+TVASLIALALFIELSAEMYKAGVA NIEAYALFAGAE IWA LDSSLLGF
Sbjct: 121 RNSSEGSLGRTVASLIALALFIELSAEMYKAGVADNIEAYALFAGAELIWAFLDSSLLGF 180
Query: 181 SLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLSRWL 240
SLACVVGL CPLAEIPIMKFFHLW YPQANVEIFGEG+ISW ITCYFVYTPFLINLSRWL
Sbjct: 181 SLACVVGLGCPLAEIPIMKFFHLWSYPQANVEIFGEGIISWIITCYFVYTPFLINLSRWL 240
Query: 241 KSVLDSVAVKKDGSA 256
KSV+D+ AVKKD SA
Sbjct: 241 KSVVDAAAVKKDESA 254
BLAST of CmoCh01G002820.1 vs. ExPASy TrEMBL
Match:
A0A1S3BI47 (uncharacterized protein LOC103489906 OS=Cucumis melo OX=3656 GN=LOC103489906 PE=4 SV=1)
HSP 1 Score: 402.1 bits (1032), Expect = 1.7e-108
Identity = 208/263 (79.09%), Postives = 229/263 (87.07%), Query Frame = 0
Query: 1 MQSIHALGFSHSLQFSQSH----FHPNPKNSLLRT-LSSHGSNRKARTSLSLRTSWPSIS 60
MQSI ALGFSHSLQF SH FH N + SL + +SHGSN K+R +LSLRT+WPSIS
Sbjct: 1 MQSIFALGFSHSLQFPHSHSHSYFHSNSRISLQKPHCTSHGSN-KSRPTLSLRTTWPSIS 60
Query: 61 IALFGSGFLLGPLLDGLHSRVNLVVYQIGSVDIGPLRTNIWVPFLLGVFYCTVGLIQLYI 120
I+LF SGFLLGPLLDGLHSRVNLVVY+ GS+ IGPL TNIWVPFLLG+FYCTVGLIQLY+
Sbjct: 61 ISLFASGFLLGPLLDGLHSRVNLVVYRTGSIHIGPLHTNIWVPFLLGLFYCTVGLIQLYL 120
Query: 121 DENFSPNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDS 180
DE FSP + +GSL KTVASLIAL LFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDS
Sbjct: 121 DEKFSPKQSQGSLRKTVASLIALGLFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDS 180
Query: 181 SLLGFSLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLIN 240
SLLGFSLACV+GL CPLAEIPIMKFFHLW YP+AN+EIFGEG+ISWT+TCYFVYTPFLIN
Sbjct: 181 SLLGFSLACVLGLGCPLAEIPIMKFFHLWEYPKANIEIFGEGIISWTVTCYFVYTPFLIN 240
Query: 241 LSRWLKSVLD----SVAVKKDGS 255
LSRWLKSV+D + AV +DGS
Sbjct: 241 LSRWLKSVVDADAAAAAVNEDGS 262
BLAST of CmoCh01G002820.1 vs. ExPASy TrEMBL
Match:
A0A0A0LBT1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G535110 PE=4 SV=1)
HSP 1 Score: 401.4 bits (1030), Expect = 2.9e-108
Identity = 206/257 (80.16%), Postives = 227/257 (88.33%), Query Frame = 0
Query: 1 MQSIHALGFSHSLQF--SQSHFHPNPKNSLLRT-LSSHGSNRKARTSLSLRTSWPSISIA 60
MQSI ALGFS SLQF S SHFH N + S+ + SSHGS +K R SLSLRT+WPSISI+
Sbjct: 1 MQSIFALGFSQSLQFPHSHSHFHSNSRISVQKPHCSSHGS-KKPRISLSLRTTWPSISIS 60
Query: 61 LFGSGFLLGPLLDGLHSRVNLVVYQIGSVDIGPLRTNIWVPFLLGVFYCTVGLIQLYIDE 120
LF SGFLLGPLLDGLHSRVNLVVY+ GS+ IGPL TNIWVPFLLG+FYCTVGLIQLY+DE
Sbjct: 61 LFASGFLLGPLLDGLHSRVNLVVYRTGSIHIGPLHTNIWVPFLLGLFYCTVGLIQLYLDE 120
Query: 121 NFSPNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSL 180
FS + +GSLGKTVASLIAL LFIELSAEMYKAGVA NIEAYALFAGAEFIWALLDSSL
Sbjct: 121 KFSLKQSQGSLGKTVASLIALGLFIELSAEMYKAGVADNIEAYALFAGAEFIWALLDSSL 180
Query: 181 LGFSLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLS 240
LGFSLACV+GL CPLAEIPIMKFFHLW YP+AN++IFGEG+ISWT+TCYFVYTPFLINLS
Sbjct: 181 LGFSLACVLGLGCPLAEIPIMKFFHLWEYPKANIDIFGEGIISWTVTCYFVYTPFLINLS 240
Query: 241 RWLKSVLDSVAVKKDGS 255
RWLKSV+D+ AV +D S
Sbjct: 241 RWLKSVVDAAAVNEDES 256
BLAST of CmoCh01G002820.1 vs. NCBI nr
Match:
XP_022949544.1 (uncharacterized protein LOC111452860 [Cucurbita moschata] >XP_022949545.1 uncharacterized protein LOC111452860 [Cucurbita moschata] >XP_022949546.1 uncharacterized protein LOC111452860 [Cucurbita moschata])
HSP 1 Score: 509.2 bits (1310), Expect = 2.0e-140
Identity = 255/255 (100.00%), Postives = 255/255 (100.00%), Query Frame = 0
Query: 1 MQSIHALGFSHSLQFSQSHFHPNPKNSLLRTLSSHGSNRKARTSLSLRTSWPSISIALFG 60
MQSIHALGFSHSLQFSQSHFHPNPKNSLLRTLSSHGSNRKARTSLSLRTSWPSISIALFG
Sbjct: 1 MQSIHALGFSHSLQFSQSHFHPNPKNSLLRTLSSHGSNRKARTSLSLRTSWPSISIALFG 60
Query: 61 SGFLLGPLLDGLHSRVNLVVYQIGSVDIGPLRTNIWVPFLLGVFYCTVGLIQLYIDENFS 120
SGFLLGPLLDGLHSRVNLVVYQIGSVDIGPLRTNIWVPFLLGVFYCTVGLIQLYIDENFS
Sbjct: 61 SGFLLGPLLDGLHSRVNLVVYQIGSVDIGPLRTNIWVPFLLGVFYCTVGLIQLYIDENFS 120
Query: 121 PNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSLLGF 180
PNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSLLGF
Sbjct: 121 PNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSLLGF 180
Query: 181 SLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLSRWL 240
SLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLSRWL
Sbjct: 181 SLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLSRWL 240
Query: 241 KSVLDSVAVKKDGSA 256
KSVLDSVAVKKDGSA
Sbjct: 241 KSVLDSVAVKKDGSA 255
BLAST of CmoCh01G002820.1 vs. NCBI nr
Match:
KAG7036627.1 (hypothetical protein SDJN02_00246 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 506.5 bits (1303), Expect = 1.3e-139
Identity = 253/255 (99.22%), Postives = 254/255 (99.61%), Query Frame = 0
Query: 1 MQSIHALGFSHSLQFSQSHFHPNPKNSLLRTLSSHGSNRKARTSLSLRTSWPSISIALFG 60
MQSIHALGFSHSLQFSQSHFHPNPKNSLLRTL+SHGSNRKARTSLSLRTSWPSISIALFG
Sbjct: 1 MQSIHALGFSHSLQFSQSHFHPNPKNSLLRTLASHGSNRKARTSLSLRTSWPSISIALFG 60
Query: 61 SGFLLGPLLDGLHSRVNLVVYQIGSVDIGPLRTNIWVPFLLGVFYCTVGLIQLYIDENFS 120
SGFLLGPLLDGLHSRVNLVVYQIGSVDIGPLRTNIWVPFLLGVFYCTVGLIQLYIDENFS
Sbjct: 61 SGFLLGPLLDGLHSRVNLVVYQIGSVDIGPLRTNIWVPFLLGVFYCTVGLIQLYIDENFS 120
Query: 121 PNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSLLGF 180
PNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSLLGF
Sbjct: 121 PNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSLLGF 180
Query: 181 SLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLSRWL 240
SLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLSRWL
Sbjct: 181 SLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLSRWL 240
Query: 241 KSVLDSVAVKKDGSA 256
KSVLDS AVKKDGSA
Sbjct: 241 KSVLDSAAVKKDGSA 255
BLAST of CmoCh01G002820.1 vs. NCBI nr
Match:
XP_022998487.1 (uncharacterized protein LOC111493103 [Cucurbita maxima] >XP_022998488.1 uncharacterized protein LOC111493103 [Cucurbita maxima])
HSP 1 Score: 485.7 bits (1249), Expect = 2.4e-133
Identity = 245/256 (95.70%), Postives = 250/256 (97.66%), Query Frame = 0
Query: 1 MQSIHALGFSHSLQFSQSHFHPNPKNSLLRTLSSHGSNRKARTSLSLRTSWPSISIALFG 60
MQSIHALGFSHSLQFSQSHFHPNPKNSLLRTL +HGSNR+ARTSLSLRTSWPSISIALFG
Sbjct: 1 MQSIHALGFSHSLQFSQSHFHPNPKNSLLRTLGTHGSNRQARTSLSLRTSWPSISIALFG 60
Query: 61 SGFLLGPLLDGLHSRVNLVVYQIGSVDIGPLRTNIWVPFLLGVFYCTVGLIQLYIDENFS 120
SGFLLGPLLDGLHSRVNLVVYQIGS+DIGPLRTNI VPFLLG+FYCTVGLIQLYIDENF
Sbjct: 61 SGFLLGPLLDGLHSRVNLVVYQIGSLDIGPLRTNICVPFLLGLFYCTVGLIQLYIDENFL 120
Query: 121 PNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSLLGF 180
PNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSLLGF
Sbjct: 121 PNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSLLGF 180
Query: 181 SLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLSRWL 240
SLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLSRWL
Sbjct: 181 SLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLSRWL 240
Query: 241 -KSVLDSVAVKKDGSA 256
SV+DS AVKKDGSA
Sbjct: 241 MMSVVDSAAVKKDGSA 256
BLAST of CmoCh01G002820.1 vs. NCBI nr
Match:
XP_023524606.1 (uncharacterized protein LOC111788502 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 476.1 bits (1224), Expect = 1.9e-130
Identity = 239/246 (97.15%), Postives = 242/246 (98.37%), Query Frame = 0
Query: 10 SHSLQFSQSHFHPNPKNSLLRTLSSHGSNRKARTSLSLRTSWPSISIALFGSGFLLGPLL 69
+HSLQFSQSHFHPNPK SLLRTL+SHGSNRKARTSLSLR SWPSISIALFGSGFLLGPLL
Sbjct: 159 THSLQFSQSHFHPNPKISLLRTLASHGSNRKARTSLSLRASWPSISIALFGSGFLLGPLL 218
Query: 70 DGLHSRVNLVVYQIGSVDIGPLRTNIWVPFLLGVFYCTVGLIQLYIDENFSPNRPEGSLG 129
DGLHSRVNLVVYQ+GSVDIGPLRTNI VPFLLGVFYCTVGLIQLYIDENFSPNRPEGSLG
Sbjct: 219 DGLHSRVNLVVYQMGSVDIGPLRTNICVPFLLGVFYCTVGLIQLYIDENFSPNRPEGSLG 278
Query: 130 KTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSLLGFSLACVVGLL 189
KTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSLLGFSLACVVGLL
Sbjct: 279 KTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSLLGFSLACVVGLL 338
Query: 190 CPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLSRWLKSVLDSVAV 249
CPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLSRWLKSVLDS AV
Sbjct: 339 CPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLSRWLKSVLDSAAV 398
Query: 250 KKDGSA 256
KKDGSA
Sbjct: 399 KKDGSA 404
BLAST of CmoCh01G002820.1 vs. NCBI nr
Match:
KAG6606923.1 (hypothetical protein SDJN03_00265, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 475.3 bits (1222), Expect = 3.3e-130
Identity = 237/238 (99.58%), Postives = 238/238 (100.00%), Query Frame = 0
Query: 1 MQSIHALGFSHSLQFSQSHFHPNPKNSLLRTLSSHGSNRKARTSLSLRTSWPSISIALFG 60
MQSIHALGFSHSLQFSQSHFHPNPKNSLLRTL+SHGSNRKARTSLSLRTSWPSISIALFG
Sbjct: 1 MQSIHALGFSHSLQFSQSHFHPNPKNSLLRTLASHGSNRKARTSLSLRTSWPSISIALFG 60
Query: 61 SGFLLGPLLDGLHSRVNLVVYQIGSVDIGPLRTNIWVPFLLGVFYCTVGLIQLYIDENFS 120
SGFLLGPLLDGLHSRVNLVVYQIGSVDIGPLRTNIWVPFLLGVFYCTVGLIQLYIDENFS
Sbjct: 61 SGFLLGPLLDGLHSRVNLVVYQIGSVDIGPLRTNIWVPFLLGVFYCTVGLIQLYIDENFS 120
Query: 121 PNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSLLGF 180
PNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSLLGF
Sbjct: 121 PNRPEGSLGKTVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSLLGF 180
Query: 181 SLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLSR 239
SLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLSR
Sbjct: 181 SLACVVGLLCPLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLSR 238
BLAST of CmoCh01G002820.1 vs. TAIR 10
Match:
AT4G01935.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; Has 37 Blast hits to 37 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 30; Viruses - 0; Other Eukaryotes - 7 (source: NCBI BLink). )
HSP 1 Score: 300.1 bits (767), Expect = 1.7e-81
Identity = 146/245 (59.59%), Postives = 189/245 (77.14%), Query Frame = 0
Query: 23 NPKNSLLRTLSSHGSNR-----------KARTSLSLRTSW-PSISIALFGSGFLLGPLLD 82
+P +L++ L +G NR + ++ S SW +S++LFGSGF+LGPLLD
Sbjct: 8 SPSTTLIKPLKRNGPNRSPVRKILCLSQRKQSKTSTGKSWIVPVSLSLFGSGFVLGPLLD 67
Query: 83 GLHSRVNLVVYQIGSVDIGPLRTNIWVPFLLGVFYCTVGLIQLYIDENFSPNRPEGSLGK 142
G+HSRV+LVVYQ G+ IGPL TNIWVPFLLG+FYCTVGL+QL +DE S + P GSL K
Sbjct: 68 GIHSRVDLVVYQNGAFQIGPLHTNIWVPFLLGLFYCTVGLLQLLLDETTSASPPRGSLDK 127
Query: 143 TVASLIALALFIELSAEMYKAGVAPNIEAYALFAGAEFIWALLDSSLLGFSLACVVGLLC 202
TV SL+AL F+ELSAEMYKAGV+ NIEAY LFA AEFIW LD + + F++A ++G+ C
Sbjct: 128 TVISLLALMFFLELSAEMYKAGVSDNIEAYILFALAEFIWFSLDRTWICFTIATLLGVAC 187
Query: 203 PLAEIPIMKFFHLWYYPQANVEIFGEGLISWTITCYFVYTPFLINLSRWLKSVLDSVAVK 256
PLAEIPIM+FFHLWYYP+AN+EIFG+GL++WT TCYFVYTPFLINL+RWL++V++ ++
Sbjct: 188 PLAEIPIMQFFHLWYYPEANIEIFGQGLVTWTTTCYFVYTPFLINLARWLRTVMERTTIE 247
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1GD48 | 9.8e-141 | 100.00 | uncharacterized protein LOC111452860 OS=Cucurbita moschata OX=3662 GN=LOC1114528... | [more] |
A0A6J1KGW6 | 1.2e-133 | 95.70 | uncharacterized protein LOC111493103 OS=Cucurbita maxima OX=3661 GN=LOC111493103... | [more] |
A0A6J1DRY1 | 9.0e-110 | 82.35 | uncharacterized protein LOC111023831 OS=Momordica charantia OX=3673 GN=LOC111023... | [more] |
A0A1S3BI47 | 1.7e-108 | 79.09 | uncharacterized protein LOC103489906 OS=Cucumis melo OX=3656 GN=LOC103489906 PE=... | [more] |
A0A0A0LBT1 | 2.9e-108 | 80.16 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G535110 PE=4 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
XP_022949544.1 | 2.0e-140 | 100.00 | uncharacterized protein LOC111452860 [Cucurbita moschata] >XP_022949545.1 unchar... | [more] |
KAG7036627.1 | 1.3e-139 | 99.22 | hypothetical protein SDJN02_00246 [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
XP_022998487.1 | 2.4e-133 | 95.70 | uncharacterized protein LOC111493103 [Cucurbita maxima] >XP_022998488.1 uncharac... | [more] |
XP_023524606.1 | 1.9e-130 | 97.15 | uncharacterized protein LOC111788502 [Cucurbita pepo subsp. pepo] | [more] |
KAG6606923.1 | 3.3e-130 | 99.58 | hypothetical protein SDJN03_00265, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
Match Name | E-value | Identity | Description | |
AT4G01935.1 | 1.7e-81 | 59.59 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |
Relationships
This mRNA is a part of the following gene feature(s):
The following exon feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
CmoCh01G002820.1:exon:1500 | CmoCh01G002820.1:exon:1500 | exon |
CmoCh01G002820.1:exon:1501 | CmoCh01G002820.1:exon:1501 | exon |
CmoCh01G002820.1:exon:1502 | CmoCh01G002820.1:exon:1502 | exon |
CmoCh01G002820.1:exon:1503 | CmoCh01G002820.1:exon:1503 | exon |
CmoCh01G002820.1:exon:1504 | CmoCh01G002820.1:exon:1504 | exon |
CmoCh01G002820.1:exon:1505 | CmoCh01G002820.1:exon:1505 | exon |
The following five_prime_UTR feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
CmoCh01G002820.1:five_prime_utr | CmoCh01G002820.1:five_prime_utr | five_prime_UTR |
The following CDS feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
CmoCh01G002820.1:cds | CmoCh01G002820.1:cds | CDS |
CmoCh01G002820.1:cds | CmoCh01G002820.1:cds_2 | CDS |
CmoCh01G002820.1:cds | CmoCh01G002820.1:cds_3 | CDS |
CmoCh01G002820.1:cds | CmoCh01G002820.1:cds_4 | CDS |
CmoCh01G002820.1:cds | CmoCh01G002820.1:cds_5 | CDS |
The following three_prime_UTR feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
CmoCh01G002820.1:three_prime_utr | CmoCh01G002820.1:three_prime_utr | three_prime_UTR |
CmoCh01G002820.1:three_prime_utr | CmoCh01G002820.1:three_prime_utr_2 | three_prime_UTR |
The following polypeptide feature(s) derives from this mRNA:
Feature Name | Unique Name | Type |
CmoCh01G002820.1 | CmoCh01G002820.1-protein | polypeptide |