Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
TCAAACTTGAGAAATCATTGAAACTTCTTTTTCACATTACCATCACAATGTTCTTGATCAGAGTTCAAGATACTAAGCCCATCACGGACGCAACCTTCCTATTCAAGCAATTCATTAACGAAAAAGCCGACTTAGATTTCAACCGCAACAGCTTCAGCATAATTGCCTCAAACCCTTCCCTTCGCTTCATAGCAAGGTTTTACATTTCCAAGAAATATTGCCAAGACTTTTTCATCAATCAAACTCACATTGCCAAAATTTCCCTTCCATCCTTTAGTGATACCATCATGACTGCCGCTGCTACCCGCTTTGATACAATGAGTATCACTCTTCCAAGCCCTTATAAAATGACCCTTACATTCGAGACATCAAGTGAGTATCTATTAATACCCCACAACAACATTGCCAAGAACTATGATTCTTGTTTTCTTTACCTTTCAAGAACTCACTTTTCTTAATAGCTCCTCCAGGGCGTGTGCCTCGGTCGAATTCACGTGCGCTGCCACTGTCACCTACGCAAGCGATGGATTTTGGAAGGCCTTTACTCGCAAAACATTTCACCATTAAACCCGAATGTTTTAGACACATTCTTACCGAACTACCTAACTCGCAAGAT
mRNA sequence
TCAAACTTGAGAAATCATTGAAACTTCTTTTTCACATTACCATCACAATGTTCTTGATCAGAGTTCAAGATACTAAGCCCATCACGGACGCAACCTTCCTATTCAAGCAATTCATTAACGAAAAAGCCGACTTAGATTTCAACCGCAACAGCTTCAGCATAATTGCCTCAAACCCTTCCCTTCGCTTCATAGCAAGGTTTTACATTTCCAAGAAATATTGCCAAGACTTTTTCATCAATCAAACTCACATTGCCAAAATTTCCCTTCCATCCTTTAGTGATACCATCATGACTGCCGCTGCTACCCGCTTTGATACAATGAGTATCACTCTTCCAAGCCCTTATAAAATGACCCTTACATTCGAGACATCAAGGCGTGTGCCTCGGTCGAATTCACGTGCGCTGCCACTGTCACCTACGCAAGCGATGGATTTTGGAAGGCCTTTACTCGCAAAACATTTCACCATTAAACCCGAATGTTTTAGACACATTCTTACCGAACTACCTAACTCGCAAGAT
Coding sequence (CDS)
ATGTTCTTGATCAGAGTTCAAGATACTAAGCCCATCACGGACGCAACCTTCCTATTCAAGCAATTCATTAACGAAAAAGCCGACTTAGATTTCAACCGCAACAGCTTCAGCATAATTGCCTCAAACCCTTCCCTTCGCTTCATAGCAAGGTTTTACATTTCCAAGAAATATTGCCAAGACTTTTTCATCAATCAAACTCACATTGCCAAAATTTCCCTTCCATCCTTTAGTGATACCATCATGACTGCCGCTGCTACCCGCTTTGATACAATGAGTATCACTCTTCCAAGCCCTTATAAAATGACCCTTACATTCGAGACATCAAGGCGTGTGCCTCGGTCGAATTCACGTGCGCTGCCACTGTCACCTACGCAAGCGATGGATTTTGGAAGGCCTTTACTCGCAAAACATTTCACCATTAAACCCGAATGTTTTAGACACATTCTTACCGAACTACCTAACTCGCAAGAT
Protein sequence
MFLIRVQDTKPITDATFLFKQFINEKADLDFNRNSFSIIASNPSLRFIARFYISKKYCQDFFINQTHIAKISLPSFSDTIMTAAATRFDTMSITLPSPYKMTLTFETSRRVPRSNSRALPLSPTQAMDFGRPLLAKHFTIKPECFRHILTELPNSQD
Homology
BLAST of Cp4.1LG04g06570 vs. NCBI nr
Match:
KAG6588842.1 (hypothetical protein SDJN03_17407, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 294 bits (752), Expect = 1.62e-98
Identity = 152/160 (95.00%), Postives = 153/160 (95.62%), Query Frame = 0
Query: 1 MFLIRVQDTKPITDATFLFKQFINEKADLDFNRNSFSIIASNPSLRFIARFYISKKYCQD 60
MFLIRVQDTKPITDATFLFKQFINEKADL FNRNSFSIIASNPSLRFIARFYISKKYCQD
Sbjct: 1 MFLIRVQDTKPITDATFLFKQFINEKADLGFNRNSFSIIASNPSLRFIARFYISKKYCQD 60
Query: 61 FFINQTHIAKISLPSFSDTIMTAAATRFDTMSITLPSPYKMTLTFETSR---RVPRSNSR 120
FFINQTHIAKISLPSFSD IMTAAATRFDTMSITLPSPYKMTLTFE S RVPRSNSR
Sbjct: 61 FFINQTHIAKISLPSFSDAIMTAAATRFDTMSITLPSPYKMTLTFEISTPPGRVPRSNSR 120
Query: 121 ALPLSPTQAMDFGRPLLAKHFTIKPECFRHILTELPNSQD 157
ALPLSPTQAMDFGRPLL+KHFTIKPECFRHILTELPNSQD
Sbjct: 121 ALPLSPTQAMDFGRPLLSKHFTIKPECFRHILTELPNSQD 160
BLAST of Cp4.1LG04g06570 vs. NCBI nr
Match:
XP_022928486.1 (uncharacterized protein LOC111435280 isoform X2 [Cucurbita moschata])
HSP 1 Score: 213 bits (542), Expect = 1.21e-66
Identity = 110/157 (70.06%), Postives = 127/157 (80.89%), Query Frame = 0
Query: 1 MFLIRVQDTKPITDATFLFKQFINEKADLDFNRNSFSIIASNPSLRFIARFYISKKYCQD 60
M LIRVQD PITDA FLF +FIN +ADL+F NS +IIA+NP+LRFIA YISK +CQD
Sbjct: 1 MLLIRVQDANPITDANFLFAEFINHEADLEFKPNSLTIIATNPTLRFIATLYISKTFCQD 60
Query: 61 FFINQTHIAKISLPSFSDTIMTAAATRFDTMSITLPSPYKMTLTFETSRRVPRSNSRALP 120
F INQTHIA++SL SF D IMTAAAT FDT++ITLPS Y M LTFETSRRV +S+ RALP
Sbjct: 61 FTINQTHIARVSLTSFIDAIMTAAATSFDTLAITLPSAYIMILTFETSRRVLQSHPRALP 120
Query: 121 LSPTQAMDFGRPLLAKHFTIKPECFRHILTELPNSQD 157
+SP+ MDF +P+LAKHFTIK ECFR IL ELP QD
Sbjct: 121 MSPSLMMDFPKPILAKHFTIKAECFRRILEELPLWQD 157
BLAST of Cp4.1LG04g06570 vs. NCBI nr
Match:
XP_022989711.1 (uncharacterized protein LOC111486711 [Cucurbita maxima])
HSP 1 Score: 211 bits (536), Expect = 1.08e-65
Identity = 109/160 (68.12%), Postives = 128/160 (80.00%), Query Frame = 0
Query: 1 MFLIRVQDTKPITDATFLFKQFINEKADLDFNRNSFSIIASNPSLRFIARFYISKKYCQD 60
M LIRVQD PITDA FLF +FIN +ADL+F NS +IIA+NP+LRFIA YISK +CQD
Sbjct: 1 MLLIRVQDANPITDANFLFAEFINHEADLEFKPNSLTIIATNPTLRFIATLYISKSFCQD 60
Query: 61 FFINQTHIAKISLPSFSDTIMTAAATRFDTMSITLPSPYKMTLTFETSR---RVPRSNSR 120
F INQTHIA++SL SF D IMTAAATRFDT++ITLPS Y M LTFETS RVP+S+ R
Sbjct: 61 FTINQTHIARVSLTSFIDAIMTAAATRFDTLAITLPSAYIMILTFETSTPSGRVPQSHPR 120
Query: 121 ALPLSPTQAMDFGRPLLAKHFTIKPECFRHILTELPNSQD 157
ALP+SP+ +DF +P+LAKHFTIK ECFR +L ELP QD
Sbjct: 121 ALPMSPSLMVDFPKPILAKHFTIKAECFRRVLAELPLLQD 160
BLAST of Cp4.1LG04g06570 vs. NCBI nr
Match:
XP_022928485.1 (uncharacterized protein LOC111435280 isoform X1 [Cucurbita moschata])
HSP 1 Score: 205 bits (522), Expect = 1.43e-63
Identity = 109/160 (68.12%), Postives = 126/160 (78.75%), Query Frame = 0
Query: 1 MFLIRVQDTKPITDATFLFKQFINEKADLDFNRNSFSIIASNPSLRFIARFYISKKYCQD 60
M LIRVQD PITDA FLF +FIN +ADL+F NS +IIA+NP+LRFIA YISK +CQD
Sbjct: 1 MLLIRVQDANPITDANFLFAEFINHEADLEFKPNSLTIIATNPTLRFIATLYISKTFCQD 60
Query: 61 FFINQTHIAKISLPSFSDTIMTAAATRFDTMSITLPSPYKMTLTFETSR---RVPRSNSR 120
F INQTHIA++SL SF D IMTAAAT FDT++ITLPS Y M LTFETS RV +S+ R
Sbjct: 61 FTINQTHIARVSLTSFIDAIMTAAATSFDTLAITLPSAYIMILTFETSTPSGRVLQSHPR 120
Query: 121 ALPLSPTQAMDFGRPLLAKHFTIKPECFRHILTELPNSQD 157
ALP+SP+ MDF +P+LAKHFTIK ECFR IL ELP QD
Sbjct: 121 ALPMSPSLMMDFPKPILAKHFTIKAECFRRILEELPLWQD 160
BLAST of Cp4.1LG04g06570 vs. NCBI nr
Match:
KAG6588843.1 (hypothetical protein SDJN03_17408, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 193 bits (491), Expect = 1.73e-58
Identity = 109/190 (57.37%), Postives = 126/190 (66.32%), Query Frame = 0
Query: 1 MFLIRVQDTKPITDATFLFKQFINEKADLDFNRNSFSIIASNPSLRFIARFYISKKYCQD 60
M LIRVQD PITDA FLF +FI+ +ADL+F NS +IIA NP+LRFIA YISK +CQD
Sbjct: 1 MLLIRVQDANPITDANFLFAEFISHEADLEFKPNSLTIIAKNPTLRFIATLYISKTFCQD 60
Query: 61 FFINQTHIAKISLPSFSDTIMTAAATRFDTMSITLPSPYKMTLTFETSR----------- 120
F INQTHIA++SL SF D IMTAAATRFDT++ITLPS Y M LTFETS
Sbjct: 61 FTINQTHIARVSLISFIDAIMTAAATRFDTLAITLPSAYIMILTFETSSEYLLIPHNNIA 120
Query: 121 ----------------------RVPRSNSRALPLSPTQAMDFGRPLLAKHFTIKPECFRH 157
RV +S+ RALP+SP+ MDF +P+LAKHFTIK ECFR
Sbjct: 121 KNYDSCFLYLSRTHFLFIAPSGRVLQSHPRALPMSPSLMMDFPKPILAKHFTIKAECFRR 180
BLAST of Cp4.1LG04g06570 vs. ExPASy TrEMBL
Match:
A0A6J1EKE7 (uncharacterized protein LOC111435280 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111435280 PE=4 SV=1)
HSP 1 Score: 213 bits (542), Expect = 5.85e-67
Identity = 110/157 (70.06%), Postives = 127/157 (80.89%), Query Frame = 0
Query: 1 MFLIRVQDTKPITDATFLFKQFINEKADLDFNRNSFSIIASNPSLRFIARFYISKKYCQD 60
M LIRVQD PITDA FLF +FIN +ADL+F NS +IIA+NP+LRFIA YISK +CQD
Sbjct: 1 MLLIRVQDANPITDANFLFAEFINHEADLEFKPNSLTIIATNPTLRFIATLYISKTFCQD 60
Query: 61 FFINQTHIAKISLPSFSDTIMTAAATRFDTMSITLPSPYKMTLTFETSRRVPRSNSRALP 120
F INQTHIA++SL SF D IMTAAAT FDT++ITLPS Y M LTFETSRRV +S+ RALP
Sbjct: 61 FTINQTHIARVSLTSFIDAIMTAAATSFDTLAITLPSAYIMILTFETSRRVLQSHPRALP 120
Query: 121 LSPTQAMDFGRPLLAKHFTIKPECFRHILTELPNSQD 157
+SP+ MDF +P+LAKHFTIK ECFR IL ELP QD
Sbjct: 121 MSPSLMMDFPKPILAKHFTIKAECFRRILEELPLWQD 157
BLAST of Cp4.1LG04g06570 vs. ExPASy TrEMBL
Match:
A0A6J1JKX5 (uncharacterized protein LOC111486711 OS=Cucurbita maxima OX=3661 GN=LOC111486711 PE=4 SV=1)
HSP 1 Score: 211 bits (536), Expect = 5.22e-66
Identity = 109/160 (68.12%), Postives = 128/160 (80.00%), Query Frame = 0
Query: 1 MFLIRVQDTKPITDATFLFKQFINEKADLDFNRNSFSIIASNPSLRFIARFYISKKYCQD 60
M LIRVQD PITDA FLF +FIN +ADL+F NS +IIA+NP+LRFIA YISK +CQD
Sbjct: 1 MLLIRVQDANPITDANFLFAEFINHEADLEFKPNSLTIIATNPTLRFIATLYISKSFCQD 60
Query: 61 FFINQTHIAKISLPSFSDTIMTAAATRFDTMSITLPSPYKMTLTFETSR---RVPRSNSR 120
F INQTHIA++SL SF D IMTAAATRFDT++ITLPS Y M LTFETS RVP+S+ R
Sbjct: 61 FTINQTHIARVSLTSFIDAIMTAAATRFDTLAITLPSAYIMILTFETSTPSGRVPQSHPR 120
Query: 121 ALPLSPTQAMDFGRPLLAKHFTIKPECFRHILTELPNSQD 157
ALP+SP+ +DF +P+LAKHFTIK ECFR +L ELP QD
Sbjct: 121 ALPMSPSLMVDFPKPILAKHFTIKAECFRRVLAELPLLQD 160
BLAST of Cp4.1LG04g06570 vs. ExPASy TrEMBL
Match:
A0A6J1ERT1 (uncharacterized protein LOC111435280 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111435280 PE=4 SV=1)
HSP 1 Score: 205 bits (522), Expect = 6.93e-64
Identity = 109/160 (68.12%), Postives = 126/160 (78.75%), Query Frame = 0
Query: 1 MFLIRVQDTKPITDATFLFKQFINEKADLDFNRNSFSIIASNPSLRFIARFYISKKYCQD 60
M LIRVQD PITDA FLF +FIN +ADL+F NS +IIA+NP+LRFIA YISK +CQD
Sbjct: 1 MLLIRVQDANPITDANFLFAEFINHEADLEFKPNSLTIIATNPTLRFIATLYISKTFCQD 60
Query: 61 FFINQTHIAKISLPSFSDTIMTAAATRFDTMSITLPSPYKMTLTFETSR---RVPRSNSR 120
F INQTHIA++SL SF D IMTAAAT FDT++ITLPS Y M LTFETS RV +S+ R
Sbjct: 61 FTINQTHIARVSLTSFIDAIMTAAATSFDTLAITLPSAYIMILTFETSTPSGRVLQSHPR 120
Query: 121 ALPLSPTQAMDFGRPLLAKHFTIKPECFRHILTELPNSQD 157
ALP+SP+ MDF +P+LAKHFTIK ECFR IL ELP QD
Sbjct: 121 ALPMSPSLMMDFPKPILAKHFTIKAECFRRILEELPLWQD 160
BLAST of Cp4.1LG04g06570 vs. ExPASy TrEMBL
Match:
A0A6J1EK29 (uncharacterized protein LOC111435280 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111435280 PE=4 SV=1)
HSP 1 Score: 147 bits (370), Expect = 6.55e-42
Identity = 75/80 (93.75%), Postives = 76/80 (95.00%), Query Frame = 0
Query: 81 MTAAATRFDTMSITLPSPYKMTLTFETSR---RVPRSNSRALPLSPTQAMDFGRPLLAKH 140
MTAAATRFDTMSITLPSPYKMTLTFETS RVPRSNSRALPLSPTQAMDFGRPLL+KH
Sbjct: 1 MTAAATRFDTMSITLPSPYKMTLTFETSTPPGRVPRSNSRALPLSPTQAMDFGRPLLSKH 60
Query: 141 FTIKPECFRHILTELPNSQD 157
FTIKPECFRHILTELPNSQD
Sbjct: 61 FTIKPECFRHILTELPNSQD 80
BLAST of Cp4.1LG04g06570 vs. ExPASy TrEMBL
Match:
A0A6J1JKZ5 (uncharacterized protein LOC111486730 OS=Cucurbita maxima OX=3661 GN=LOC111486730 PE=4 SV=1)
HSP 1 Score: 138 bits (348), Expect = 1.40e-38
Identity = 72/80 (90.00%), Postives = 74/80 (92.50%), Query Frame = 0
Query: 81 MTAAATRFDTMSITLPSPYKMTLTFETSR---RVPRSNSRALPLSPTQAMDFGRPLLAKH 140
MTAAATRFDTMSITLPSPYK+TLTFETS RVPRSNSRALPLSPT AMDFG+PLLAKH
Sbjct: 1 MTAAATRFDTMSITLPSPYKITLTFETSTPPGRVPRSNSRALPLSPTLAMDFGKPLLAKH 60
Query: 141 FTIKPECFRHILTELPNSQD 157
FTIKPE FRHILTELPNSQD
Sbjct: 61 FTIKPEFFRHILTELPNSQD 80
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
KAG6588842.1 | 1.62e-98 | 95.00 | hypothetical protein SDJN03_17407, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
XP_022928486.1 | 1.21e-66 | 70.06 | uncharacterized protein LOC111435280 isoform X2 [Cucurbita moschata] | [more] |
XP_022989711.1 | 1.08e-65 | 68.13 | uncharacterized protein LOC111486711 [Cucurbita maxima] | [more] |
XP_022928485.1 | 1.43e-63 | 68.13 | uncharacterized protein LOC111435280 isoform X1 [Cucurbita moschata] | [more] |
KAG6588843.1 | 1.73e-58 | 57.37 | hypothetical protein SDJN03_17408, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1EKE7 | 5.85e-67 | 70.06 | uncharacterized protein LOC111435280 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1JKX5 | 5.22e-66 | 68.13 | uncharacterized protein LOC111486711 OS=Cucurbita maxima OX=3661 GN=LOC111486711... | [more] |
A0A6J1ERT1 | 6.93e-64 | 68.13 | uncharacterized protein LOC111435280 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1EK29 | 6.55e-42 | 93.75 | uncharacterized protein LOC111435280 isoform X3 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1JKZ5 | 1.40e-38 | 90.00 | uncharacterized protein LOC111486730 OS=Cucurbita maxima OX=3661 GN=LOC111486730... | [more] |
Match Name | E-value | Identity | Description | |