Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAAAATCTAAGGTCCACACGCCCACTCCCTTTCTCTCCTCTCCGCCCCTTCCGCCTCCCTGCCCCCCTCCAGAAAGCGACTTATTCCTCCCCTCTTATTTAATCCCCCCCGCTCTTCCCTTCCCCATTTCATCCCACACATCCTCCACATGGACTACTCCGACCACCACTCCGTCTGCTGCCTCTGTGGCGACGTCGGTTTTCCCGCCAACCTCTTCCGCTGCTCCAACTGCTCCAACCGCTTCCAACACTCGTAAGAATTCCACTCACCAGGACTCTGTTTGATAATGTTCTTGTTTCTTGTTTATAGTCTCATCGTCTGCATTTGATAACAGTTATCAATTTTTTTTTTTCCTTTTTTTGGGCCGATTTGATTGATGGATGTTGAGTGATTGGTGCAGTTATTGCAGCAACTATTACGGGGAATCGGCGGAGGCAATTGAAGTGTGCGATTGGTGCAGGAGTGAACGGAGAAGTACGGGCCGCCGTGACTCTGCAAGGAAGTCCGTTGTCAACCATATGGATGGCGCCAAGTCCCAAAAGGGCTCCGGCGATCAACACAAAAGAGAGAGAAACTCCGGCGGAGTGCCGTCGCCGCGGCCTGCTCCACGCCGGTACAAGCTTCTCAAGGATGTTATGTGTTGAAACCCCGATGGGTTGGTTGTTCTGAATGGATTGTTAAAGATAAACAAGGCAACCCAAGAAGCAATAATGCTTTCAATTTTGATTTTCTTCACACACATATATATGTATACGTATGCGTACGTCGCTTGAAAATTAAATAAGCTACTTAATTTATTATTGGATGGCTCTCTGATTTATATTGAAGGTGTTTTCAATTATTTGAGCTTTCAATGCACTAATTTGCTCTTAGAAAAGATAAAAAGTTCATACCACATTGATGGGAAAATTTTTAGTAGGAGAAATTTTTCATACTACCTCTCCATTCATTGTTTGCTATGAATGAAGAATTGTTTGGGA
mRNA sequence
AAAAAATCTAAGGTCCACACGCCCACTCCCTTTCTCTCCTCTCCGCCCCTTCCGCCTCCCTGCCCCCCTCCAGAAAGCGACTTATTCCTCCCCTCTTATTTAATCCCCCCCGCTCTTCCCTTCCCCATTTCATCCCACACATCCTCCACATGGACTACTCCGACCACCACTCCGTCTGCTGCCTCTGTGGCGACGTCGGTTTTCCCGCCAACCTCTTCCGCTGCTCCAACTGCTCCAACCGCTTCCAACACTCTTATTGCAGCAACTATTACGGGGAATCGGCGGAGGCAATTGAAGTGTGCGATTGGTGCAGGAGTGAACGGAGAAGTACGGGCCGCCGTGACTCTGCAAGGAAGTCCGTTGTCAACCATATGGATGGCGCCAAGTCCCAAAAGGGCTCCGGCGATCAACACAAAAGAGAGAGAAACTCCGGCGGAGTGCCGTCGCCGCGGCCTGCTCCACGCCGGTACAAGCTTCTCAAGGATGTTATGTGTTGAAACCCCGATGGGTTGGTTGTTCTGAATGGATTGTTAAAGATAAACAAGGCAACCCAAGAAGCAATAATGCTTTCAATTTTGATTTTCTTCACACACATATATATGTATACGTATGCGTACGTCGCTTGAAAATTAAATAAGCTACTTAATTTATTATTGGATGGCTCTCTGATTTATATTGAAGGTGTTTTCAATTATTTGAGCTTTCAATGCACTAATTTGCTCTTAGAAAAGATAAAAAGTTCATACCACATTGATGGGAAAATTTTTAGTAGGAGAAATTTTTCATACTACCTCTCCATTCATTGTTTGCTATGAATGAAGAATTGTTTGGGA
Coding sequence (CDS)
ATGGACTACTCCGACCACCACTCCGTCTGCTGCCTCTGTGGCGACGTCGGTTTTCCCGCCAACCTCTTCCGCTGCTCCAACTGCTCCAACCGCTTCCAACACTCTTATTGCAGCAACTATTACGGGGAATCGGCGGAGGCAATTGAAGTGTGCGATTGGTGCAGGAGTGAACGGAGAAGTACGGGCCGCCGTGACTCTGCAAGGAAGTCCGTTGTCAACCATATGGATGGCGCCAAGTCCCAAAAGGGCTCCGGCGATCAACACAAAAGAGAGAGAAACTCCGGCGGAGTGCCGTCGCCGCGGCCTGCTCCACGCCGGTACAAGCTTCTCAAGGATGTTATGTGTTGA
Protein sequence
MDYSDHHSVCCLCGDVGFPANLFRCSNCSNRFQHSYCSNYYGESAEAIEVCDWCRSERRSTGRRDSARKSVVNHMDGAKSQKGSGDQHKRERNSGGVPSPRPAPRRYKLLKDVMC
Homology
BLAST of Lcy10g018440 vs. ExPASy TrEMBL
Match:
A0A6J1IH20 (uncharacterized protein LOC111472884 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111472884 PE=4 SV=1)
HSP 1 Score: 164.5 bits (415), Expect = 2.7e-37
Identity = 83/107 (77.57%), Postives = 87/107 (81.31%), Query Frame = 0
Query: 9 VCCLCGDVGFPANLFRCSNCSNRFQHSYCSNYYGESAEAIEVCDWCRSERRSTGRRDSAR 68
VCCLCGDVGFPANLFRC+ CS+RFQHSYCSNYYGESAEAIEVCDWCR ERR GRR SA
Sbjct: 18 VCCLCGDVGFPANLFRCTLCSHRFQHSYCSNYYGESAEAIEVCDWCRCERR-CGRRGSAA 77
Query: 69 KSVVNHMDGAKSQKGSGDQHKRERNSGGVPSPRPAPRRYKLLKDVMC 116
+ G SQK S Q KRERNSGG+PSPR APRRYKLLKDVMC
Sbjct: 78 RKF-----GVASQKKSSGQDKRERNSGGMPSPRVAPRRYKLLKDVMC 118
BLAST of Lcy10g018440 vs. ExPASy TrEMBL
Match:
A0A6J1IAU2 (uncharacterized protein LOC111472884 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111472884 PE=4 SV=1)
HSP 1 Score: 163.3 bits (412), Expect = 6.0e-37
Identity = 84/107 (78.50%), Postives = 88/107 (82.24%), Query Frame = 0
Query: 9 VCCLCGDVGFPANLFRCSNCSNRFQHSYCSNYYGESAEAIEVCDWCRSERRSTGRRDSAR 68
VCCLCGDVGFPANLFRC+ CS+RFQHSYCSNYYGESAEAIEVCDWCR ERR GRR SA
Sbjct: 18 VCCLCGDVGFPANLFRCTLCSHRFQHSYCSNYYGESAEAIEVCDWCRCERR-CGRRGSAA 77
Query: 69 KSVVNHMDGAKSQKGSGDQHKRERNSGGVPSPRPAPRRYKLLKDVMC 116
+ G SQK SG Q KRERNSGG+PSPR APRRYKLLKDVMC
Sbjct: 78 RKF-----GVASQKSSG-QDKRERNSGGMPSPRVAPRRYKLLKDVMC 117
BLAST of Lcy10g018440 vs. ExPASy TrEMBL
Match:
A0A5B6YSM7 (Uncharacterized protein (Fragment) OS=Davidia involucrata OX=16924 GN=Din_004305 PE=4 SV=1)
HSP 1 Score: 143.3 bits (360), Expect = 6.4e-31
Identity = 72/122 (59.02%), Postives = 83/122 (68.03%), Query Frame = 0
Query: 5 DHHSVCCLCGDVGFPANLFRCSNCSNRFQHSYCSNYYGESAEAIEVCDWCRSERRSTGRR 64
D +VCC+CGDVGFP LFRCS C NRFQHSYCSNYY ES+E IE+CDWC+SE RS
Sbjct: 28 DLQTVCCMCGDVGFPDKLFRCSKCHNRFQHSYCSNYYSESSEPIELCDWCQSEERSARHG 87
Query: 65 DSARKSVVNHMDG--AKSQKGSGD---QHKRE------RNSGGVPSPRPAPRRYKLLKDV 116
S++KS H G ++S+ GD QH RE +N G PSPRP RRYKLLKDV
Sbjct: 88 SSSKKSSTGHEAGVTSRSEYSGGDKVKQHDREESAEKGKNPSGTPSPRPTTRRYKLLKDV 147
BLAST of Lcy10g018440 vs. ExPASy TrEMBL
Match:
A0A1Q3D081 (Uncharacterized protein OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_29323 PE=4 SV=1)
HSP 1 Score: 138.3 bits (347), Expect = 2.1e-29
Identity = 73/123 (59.35%), Postives = 86/123 (69.92%), Query Frame = 0
Query: 5 DHHSVCCLCGDVGFPANLFRCSNCSNRFQHSYCSNYYGESAEAIEVCDWCRSERRSTGRR 64
D +VCC+CGDVGFP LFRC C NRFQHSYCSNYY E AE IE+CDWC+SE R++ R
Sbjct: 3 DLQTVCCMCGDVGFPDKLFRCYKCRNRFQHSYCSNYYSEFAEPIELCDWCQSEERTSSRH 62
Query: 65 -DSARKSVVNHMDGAKSQ-KGSGD---QHKRE-------RNSGGVPSPRPAPRRYKLLKD 116
S++KS V + GA ++ + SGD QH RE +N GVPSPRP RRYKLLKD
Sbjct: 63 GSSSKKSAVGNDVGATNRSEYSGDKIKQHDREEGTDQKGKNPSGVPSPRPTTRRYKLLKD 122
BLAST of Lcy10g018440 vs. ExPASy TrEMBL
Match:
A0A2C9VTF1 (Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_05G050600 PE=4 SV=1)
HSP 1 Score: 138.3 bits (347), Expect = 2.1e-29
Identity = 70/123 (56.91%), Postives = 85/123 (69.11%), Query Frame = 0
Query: 5 DHHSVCCLCGDVGFPANLFRCSNCSNRFQHSYCSNYYGESAEAIEVCDWCRSERRSTGRR 64
D +VCC+CGDVGFP LFRCS C +RFQHSYCSNYY E +E+IE+CDWC+SE R+
Sbjct: 3 DLQTVCCMCGDVGFPDKLFRCSKCRHRFQHSYCSNYYSELSESIELCDWCQSEERNARHG 62
Query: 65 DSARKSVVNHMDGAKSQKG--SGD---QHKRE-------RNSGGVPSPRPAPRRYKLLKD 116
+S++KS V H G + + SGD QH RE ++ GVPSPR A RRYKLLKD
Sbjct: 63 NSSKKSAVGHDSGGITNRSEYSGDKIKQHDREESTTEKGKSPSGVPSPRTATRRYKLLKD 122
BLAST of Lcy10g018440 vs. NCBI nr
Match:
XP_022974249.1 (uncharacterized protein LOC111472884 isoform X1 [Cucurbita maxima])
HSP 1 Score: 164.5 bits (415), Expect = 5.5e-37
Identity = 83/107 (77.57%), Postives = 87/107 (81.31%), Query Frame = 0
Query: 9 VCCLCGDVGFPANLFRCSNCSNRFQHSYCSNYYGESAEAIEVCDWCRSERRSTGRRDSAR 68
VCCLCGDVGFPANLFRC+ CS+RFQHSYCSNYYGESAEAIEVCDWCR ERR GRR SA
Sbjct: 18 VCCLCGDVGFPANLFRCTLCSHRFQHSYCSNYYGESAEAIEVCDWCRCERR-CGRRGSAA 77
Query: 69 KSVVNHMDGAKSQKGSGDQHKRERNSGGVPSPRPAPRRYKLLKDVMC 116
+ G SQK S Q KRERNSGG+PSPR APRRYKLLKDVMC
Sbjct: 78 RKF-----GVASQKKSSGQDKRERNSGGMPSPRVAPRRYKLLKDVMC 118
BLAST of Lcy10g018440 vs. NCBI nr
Match:
XP_023540879.1 (uncharacterized protein LOC111801126 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 163.7 bits (413), Expect = 9.4e-37
Identity = 85/115 (73.91%), Postives = 90/115 (78.26%), Query Frame = 0
Query: 4 SDHH---SVCCLCGDVGFPANLFRCSNCSNRFQHSYCSNYYGESAEAIEVCDWCRSERRS 63
+ HH VCCLCGDVGFPANLFRC+ CS+RFQHSYCSNYYGE AEAIEVCDWCR ERR
Sbjct: 2 ASHHPLPPVCCLCGDVGFPANLFRCTLCSHRFQHSYCSNYYGELAEAIEVCDWCRCERRC 61
Query: 64 TGRRDSARKSVVNHMDGAKSQKGSGDQHKRERNSGGVPSPRPAPRRYKLLKDVMC 116
R +ARKS G SQK SG Q KRERNSGG+PSPR APRRYKLLKDVMC
Sbjct: 62 GRRGSAARKS------GVASQKSSG-QDKRERNSGGMPSPRVAPRRYKLLKDVMC 109
BLAST of Lcy10g018440 vs. NCBI nr
Match:
XP_022974251.1 (uncharacterized protein LOC111472884 isoform X2 [Cucurbita maxima])
HSP 1 Score: 163.3 bits (412), Expect = 1.2e-36
Identity = 84/107 (78.50%), Postives = 88/107 (82.24%), Query Frame = 0
Query: 9 VCCLCGDVGFPANLFRCSNCSNRFQHSYCSNYYGESAEAIEVCDWCRSERRSTGRRDSAR 68
VCCLCGDVGFPANLFRC+ CS+RFQHSYCSNYYGESAEAIEVCDWCR ERR GRR SA
Sbjct: 18 VCCLCGDVGFPANLFRCTLCSHRFQHSYCSNYYGESAEAIEVCDWCRCERR-CGRRGSAA 77
Query: 69 KSVVNHMDGAKSQKGSGDQHKRERNSGGVPSPRPAPRRYKLLKDVMC 116
+ G SQK SG Q KRERNSGG+PSPR APRRYKLLKDVMC
Sbjct: 78 RKF-----GVASQKSSG-QDKRERNSGGMPSPRVAPRRYKLLKDVMC 117
BLAST of Lcy10g018440 vs. NCBI nr
Match:
KAG6597027.1 (hypothetical protein SDJN03_10207, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 162.2 bits (409), Expect = 2.7e-36
Identity = 82/107 (76.64%), Postives = 87/107 (81.31%), Query Frame = 0
Query: 9 VCCLCGDVGFPANLFRCSNCSNRFQHSYCSNYYGESAEAIEVCDWCRSERRSTGRRDSAR 68
VCCLCGDVGFPANLFRC+ CS+RFQHSYCSNYYGE+AEAIE CDWCR ERR R +AR
Sbjct: 14 VCCLCGDVGFPANLFRCTLCSHRFQHSYCSNYYGETAEAIEACDWCRCERRCGRRGSAAR 73
Query: 69 KSVVNHMDGAKSQKGSGDQHKRERNSGGVPSPRPAPRRYKLLKDVMC 116
KS G SQK SG Q KRERNSGG+PSPR APRRYKLLKDVMC
Sbjct: 74 KS------GVASQKSSG-QDKRERNSGGMPSPRVAPRRYKLLKDVMC 113
BLAST of Lcy10g018440 vs. NCBI nr
Match:
KAG7028505.1 (hypothetical protein SDJN02_09686, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 161.4 bits (407), Expect = 4.7e-36
Identity = 82/107 (76.64%), Postives = 86/107 (80.37%), Query Frame = 0
Query: 9 VCCLCGDVGFPANLFRCSNCSNRFQHSYCSNYYGESAEAIEVCDWCRSERRSTGRRDSAR 68
VCCLCGDVGFPANLFRC+ CS+RFQHSYCSNYYGE AEAIE CDWCR ERR R +AR
Sbjct: 12 VCCLCGDVGFPANLFRCTLCSHRFQHSYCSNYYGEMAEAIEACDWCRCERRCGRRGSAAR 71
Query: 69 KSVVNHMDGAKSQKGSGDQHKRERNSGGVPSPRPAPRRYKLLKDVMC 116
KS G SQK SG Q KRERNSGG+PSPR APRRYKLLKDVMC
Sbjct: 72 KS------GVASQKSSG-QDKRERNSGGMPSPRVAPRRYKLLKDVMC 111
BLAST of Lcy10g018440 vs. TAIR 10
Match:
AT3G60520.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G02070.1); Has 107 Blast hits to 107 proteins in 14 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 107; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 109.0 bits (271), Expect = 2.6e-24
Identity = 59/122 (48.36%), Postives = 73/122 (59.84%), Query Frame = 0
Query: 9 VCCLCGDVGFPANLFRCSNCSNRFQHSYCSNYYGESAEAIEVCDWCRSERRS-TGRR--- 68
VCC+CGDVGF LF CS C NRFQHSYCS+YY E A+ I++CDWC+ E +S TG +
Sbjct: 8 VCCMCGDVGFFDKLFHCSKCLNRFQHSYCSSYYKEQADPIKICDWCQCEAKSRTGAKHGV 67
Query: 69 --DSARKSVVNHMDGAKSQ---------KGSGDQHKRERNSGGVPSPRPAPRRYKLLKDV 116
S+++S + Q S ++ GVPSPRPA RRYKLLKDV
Sbjct: 68 NGGSSKRSYRSEYSSPHHQIKQQEINQTTSSSIPPAADKGKTGVPSPRPATRRYKLLKDV 127
BLAST of Lcy10g018440 vs. TAIR 10
Match:
AT1G02070.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G60520.1); Has 98 Blast hits to 98 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 98; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 107.5 bits (267), Expect = 7.5e-24
Identity = 59/129 (45.74%), Postives = 72/129 (55.81%), Query Frame = 0
Query: 9 VCCLCGDVGFPANLFRCSNCSNRFQHSYCSNYYGESAEAIEVCDWCRSERR--------- 68
VCC+CGDVGF LF C +C RFQHSYCSNYYG+ AE E+CDWCRS+ R
Sbjct: 4 VCCMCGDVGFSDKLFSCGHCRCRFQHSYCSNYYGQFAEPTEICDWCRSDDRKLSNVARHG 63
Query: 69 -STGRRDSARKSVVNHMDGAKSQKGSGDQHKRERN------------SGGVPSPRPAPRR 116
S+ ++ S+ N +S+ G + K N GGV SP+ A RR
Sbjct: 64 GSSSKKPSSSVKYENDFSN-RSEYSPGHRIKHNNNRHDQVAKGVAGDGGGVTSPKTATRR 123
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1IH20 | 2.7e-37 | 77.57 | uncharacterized protein LOC111472884 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1IAU2 | 6.0e-37 | 78.50 | uncharacterized protein LOC111472884 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A5B6YSM7 | 6.4e-31 | 59.02 | Uncharacterized protein (Fragment) OS=Davidia involucrata OX=16924 GN=Din_004305... | [more] |
A0A1Q3D081 | 2.1e-29 | 59.35 | Uncharacterized protein OS=Cephalotus follicularis OX=3775 GN=CFOL_v3_29323 PE=4... | [more] |
A0A2C9VTF1 | 2.1e-29 | 56.91 | Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_05G050600 PE=4 SV=... | [more] |
Match Name | E-value | Identity | Description | |
XP_022974249.1 | 5.5e-37 | 77.57 | uncharacterized protein LOC111472884 isoform X1 [Cucurbita maxima] | [more] |
XP_023540879.1 | 9.4e-37 | 73.91 | uncharacterized protein LOC111801126 [Cucurbita pepo subsp. pepo] | [more] |
XP_022974251.1 | 1.2e-36 | 78.50 | uncharacterized protein LOC111472884 isoform X2 [Cucurbita maxima] | [more] |
KAG6597027.1 | 2.7e-36 | 76.64 | hypothetical protein SDJN03_10207, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
KAG7028505.1 | 4.7e-36 | 76.64 | hypothetical protein SDJN02_09686, partial [Cucurbita argyrosperma subsp. argyro... | [more] |
Match Name | E-value | Identity | Description | |
AT3G60520.1 | 2.6e-24 | 48.36 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |
AT1G02070.1 | 7.5e-24 | 45.74 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |