Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TTCCCCGGTTCCTCGTCTTAGATCTCTCTGAGCCTTGTTTCCCTAAGCCGAAAGACATGGGTGCCGCGAACCGCTGCAGATTCCGGGAACTTTTCCTTCCACCGCACGCGCGAACAAATTGAAGTAAAATCAAAACAACAATGTTAGGGTTTCTTAGTCGGCCATGCCAATGGAGTTCTCCATCTCTCCCATTTCTCTCTTCAACTTCATCATCTTCTTCCCTATCCCCAAGTTCTCTACGCTACAAATTCACTCTCCATTGCGCGTTTCTCAGACCGCAATCTCAAATTCCTAGAAACCGCGCAAGATTCACGGCGTTTTCGAGCAATAACGGCAATGGATTGGGCGGAAATATAAAGGAAAGAGAAGGAGGAAGAAATGGGGCGAAGGGCTCCAATGGCGGCGATGATTTGAGAAAAGAACGAGGGCCGATTTTCAATATCAAATGGACTGAGCTTCTGATCGATCCGGATCCTGATAACTTATTGGCGGTTGCGTTGACTGGTTTGCTTGCTTGGGCAAGTGTTCAGGTTTTGTGGCAGCTATTCTTCATCTCTTTGGCTATTTTAGTGGCGGCTCTTAAGTACTCTTTTATTGCTGCGCTTCTTATTTTCATTTTAATTACATTACTCTAGAAAAACATCAACAACAACAACAACAACAATCTTGTATAGTCCCTTTTATGTTTGATTTATTGGATTTCAATTGTAACTCTATGTCCTCTGTGTATAAAAATGTGTTTGTCTAGTATTAAGAGAAGGTGAAATTAAAATTTTAGTAGGTTTTTGTCACGTTCTGAAGTTTGAAA
mRNA sequence
TTCCCCGGTTCCTCGTCTTAGATCTCTCTGAGCCTTGTTTCCCTAAGCCGAAAGACATGGGTGCCGCGAACCGCTGCAGATTCCGGGAACTTTTCCTTCCACCGCACGCGCGAACAAATTGAAGTAAAATCAAAACAACAATGTTAGGGTTTCTTAGTCGGCCATGCCAATGGAGTTCTCCATCTCTCCCATTTCTCTCTTCAACTTCATCATCTTCTTCCCTATCCCCAAGTTCTCTACGCTACAAATTCACTCTCCATTGCGCGTTTCTCAGACCGCAATCTCAAATTCCTAGAAACCGCGCAAGATTCACGGCGTTTTCGAGCAATAACGGCAATGGATTGGGCGGAAATATAAAGGAAAGAGAAGGAGGAAGAAATGGGGCGAAGGGCTCCAATGGCGGCGATGATTTGAGAAAAGAACGAGGGCCGATTTTCAATATCAAATGGACTGAGCTTCTGATCGATCCGGATCCTGATAACTTATTGGCGGTTGCGTTGACTGGTTTGCTTGCTTGGGCAAGTGTTCAGGTTTTGTGGCAGCTATTCTTCATCTCTTTGGCTATTTTAGTGGCGGCTCTTAAGTACTCTTTTATTGCTGCGCTTCTTATTTTCATTTTAATTACATTACTCTAGAAAAACATCAACAACAACAACAACAACAATCTTGTATAGTCCCTTTTATGTTTGATTTATTGGATTTCAATTGTAACTCTATGTCCTCTGTGTATAAAAATGTGTTTGTCTAGTATTAAGAGAAGGTGAAATTAAAATTTTAGTAGGTTTTTGTCACGTTCTGAAGTTTGAAA
Coding sequence (CDS)
ATGTTAGGGTTTCTTAGTCGGCCATGCCAATGGAGTTCTCCATCTCTCCCATTTCTCTCTTCAACTTCATCATCTTCTTCCCTATCCCCAAGTTCTCTACGCTACAAATTCACTCTCCATTGCGCGTTTCTCAGACCGCAATCTCAAATTCCTAGAAACCGCGCAAGATTCACGGCGTTTTCGAGCAATAACGGCAATGGATTGGGCGGAAATATAAAGGAAAGAGAAGGAGGAAGAAATGGGGCGAAGGGCTCCAATGGCGGCGATGATTTGAGAAAAGAACGAGGGCCGATTTTCAATATCAAATGGACTGAGCTTCTGATCGATCCGGATCCTGATAACTTATTGGCGGTTGCGTTGACTGGTTTGCTTGCTTGGGCAAGTGTTCAGGTTTTGTGGCAGCTATTCTTCATCTCTTTGGCTATTTTAGTGGCGGCTCTTAAGTACTCTTTTATTGCTGCGCTTCTTATTTTCATTTTAATTACATTACTCTAG
Protein sequence
MLGFLSRPCQWSSPSLPFLSSTSSSSSLSPSSLRYKFTLHCAFLRPQSQIPRNRARFTAFSSNNGNGLGGNIKEREGGRNGAKGSNGGDDLRKERGPIFNIKWTELLIDPDPDNLLAVALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL
Homology
BLAST of Tan0003444 vs. NCBI nr
Match:
XP_022139143.1 (uncharacterized protein LOC111010120 [Momordica charantia])
HSP 1 Score: 236.1 bits (601), Expect = 2.1e-58
Identity = 136/168 (80.95%), Postives = 144/168 (85.71%), Query Frame = 0
Query: 1 MLGFLSRPCQWSSPSLPFLSSTSSSSSLSPSSL----RYKFTLHCAFLRPQSQIPRNRAR 60
MLGF + PCQWSS S+ LSST + SS S SL R+KFTLH A L +SQIPRNRAR
Sbjct: 1 MLGFRTLPCQWSSASVRLLSSTPTPSSSSKISLRTVPRFKFTLHYALLMTRSQIPRNRAR 60
Query: 61 FTAFSSNNGNGLGGNIKEREGGRNGAKGSNGGDDLRKERGPIFNIKWTELLIDPDPDNLL 120
FTAFS N NGLGGNIKEREG R GAKGSNGGDDL+KERGP+FNIKW ELLIDPDPDN+L
Sbjct: 61 FTAFSGNGDNGLGGNIKEREGERTGAKGSNGGDDLKKERGPVFNIKWAELLIDPDPDNIL 120
Query: 121 AVALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 165
AVALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL
Sbjct: 121 AVALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 168
BLAST of Tan0003444 vs. NCBI nr
Match:
KAG7032002.1 (hypothetical protein SDJN02_06044, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 210.3 bits (534), Expect = 1.3e-50
Identity = 129/166 (77.71%), Postives = 136/166 (81.93%), Query Frame = 0
Query: 1 MLGFLSRPCQWS-SPSLPFLSSTSSSSSLSPSSLR-YKFTLHCAFLRPQSQIPRNRARFT 60
MLGFL+ P QW SPSLP SS S S+ S S +KF LH +SQIP R RFT
Sbjct: 3 MLGFLTIPYQWKISPSLPSSSSPFSLSTRSSLSFSLFKFPLHYL----ESQIPGKRKRFT 62
Query: 61 AFSSNNGNGLGGNIKEREGGRNGAKGSNGGDDLRKERGPIFNIKWTELLIDPDPDNLLAV 120
A +SNN NGLGGNIKEREG RNGAKGSNG DDLRKERGP+FNIKW ELLIDPDPDN+LAV
Sbjct: 63 ALASNNDNGLGGNIKEREGERNGAKGSNGDDDLRKERGPVFNIKWAELLIDPDPDNILAV 122
Query: 121 ALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 165
ALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL
Sbjct: 123 ALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 164
BLAST of Tan0003444 vs. NCBI nr
Match:
XP_023549272.1 (uncharacterized protein LOC111807677 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 210.3 bits (534), Expect = 1.3e-50
Identity = 129/166 (77.71%), Postives = 137/166 (82.53%), Query Frame = 0
Query: 1 MLGFLSRPCQWS-SPSLPFLSSTSSSSSLSPSSL-RYKFTLHCAFLRPQSQIPRNRARFT 60
MLGFL+ P QW SPSL SS S S+ S S +KF LH + +SQIP R RFT
Sbjct: 3 MLGFLTIPYQWKISPSLSSSSSPFSLSTRSSLSFPLFKFPLHYS----ESQIPGKRKRFT 62
Query: 61 AFSSNNGNGLGGNIKEREGGRNGAKGSNGGDDLRKERGPIFNIKWTELLIDPDPDNLLAV 120
A +SNN NGLGGNIKEREG RNGAKGSNGGDDLRKERGP+FNIKW ELLIDPDPDN+LAV
Sbjct: 63 ALASNNDNGLGGNIKEREGERNGAKGSNGGDDLRKERGPVFNIKWAELLIDPDPDNILAV 122
Query: 121 ALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 165
ALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL
Sbjct: 123 ALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 164
BLAST of Tan0003444 vs. NCBI nr
Match:
XP_022957096.1 (uncharacterized protein LOC111458579 [Cucurbita moschata])
HSP 1 Score: 208.8 bits (530), Expect = 3.6e-50
Identity = 128/166 (77.11%), Postives = 136/166 (81.93%), Query Frame = 0
Query: 1 MLGFLSRPCQWS-SPSLPFLSSTSSSSSLSPSSLR-YKFTLHCAFLRPQSQIPRNRARFT 60
MLGFL+ P QW SPSLP SS S S+ S S +KF LH + +SQIP R RFT
Sbjct: 3 MLGFLTIPYQWKISPSLPSSSSPFSLSTRSSLSFSLFKFPLHYS----ESQIPGKRKRFT 62
Query: 61 AFSSNNGNGLGGNIKEREGGRNGAKGSNGGDDLRKERGPIFNIKWTELLIDPDPDNLLAV 120
A +SNN NGLGGNIKEREG RNGAKGS G DDLRKERGP+FNIKW ELLIDPDPDN+LAV
Sbjct: 63 ALASNNDNGLGGNIKEREGERNGAKGSKGDDDLRKERGPVFNIKWAELLIDPDPDNILAV 122
Query: 121 ALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 165
ALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL
Sbjct: 123 ALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 164
BLAST of Tan0003444 vs. NCBI nr
Match:
XP_022993576.1 (uncharacterized protein LOC111489528 [Cucurbita maxima])
HSP 1 Score: 200.3 bits (508), Expect = 1.3e-47
Identity = 127/172 (73.84%), Postives = 134/172 (77.91%), Query Frame = 0
Query: 1 MLGFLSRPCQWS-SPSLPFLSSTSSSSSLSPSSLR-------YKFTLHCAFLRPQSQIPR 60
MLGFL+ P QW SPSL SSSSS P S R +KF LH + +SQI
Sbjct: 3 MLGFLTIPYQWKISPSL------SSSSSPFPLSTRSFLSFPLFKFPLHYS----ESQISG 62
Query: 61 NRARFTAFSSNNGNGLGGNIKEREGGRNGAKGSNGGDDLRKERGPIFNIKWTELLIDPDP 120
R RF A +SNN NGLGGNIKEREG RNGAKGS G DDLRKERGP+FNIKW ELLIDPDP
Sbjct: 63 KRKRFAALASNNDNGLGGNIKEREGERNGAKGSKGDDDLRKERGPVFNIKWAELLIDPDP 122
Query: 121 DNLLAVALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 165
DN+LAVALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL
Sbjct: 123 DNILAVALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 164
BLAST of Tan0003444 vs. ExPASy TrEMBL
Match:
A0A6J1CF05 (uncharacterized protein LOC111010120 OS=Momordica charantia OX=3673 GN=LOC111010120 PE=4 SV=1)
HSP 1 Score: 236.1 bits (601), Expect = 1.0e-58
Identity = 136/168 (80.95%), Postives = 144/168 (85.71%), Query Frame = 0
Query: 1 MLGFLSRPCQWSSPSLPFLSSTSSSSSLSPSSL----RYKFTLHCAFLRPQSQIPRNRAR 60
MLGF + PCQWSS S+ LSST + SS S SL R+KFTLH A L +SQIPRNRAR
Sbjct: 1 MLGFRTLPCQWSSASVRLLSSTPTPSSSSKISLRTVPRFKFTLHYALLMTRSQIPRNRAR 60
Query: 61 FTAFSSNNGNGLGGNIKEREGGRNGAKGSNGGDDLRKERGPIFNIKWTELLIDPDPDNLL 120
FTAFS N NGLGGNIKEREG R GAKGSNGGDDL+KERGP+FNIKW ELLIDPDPDN+L
Sbjct: 61 FTAFSGNGDNGLGGNIKEREGERTGAKGSNGGDDLKKERGPVFNIKWAELLIDPDPDNIL 120
Query: 121 AVALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 165
AVALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL
Sbjct: 121 AVALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 168
BLAST of Tan0003444 vs. ExPASy TrEMBL
Match:
A0A6J1GYA2 (uncharacterized protein LOC111458579 OS=Cucurbita moschata OX=3662 GN=LOC111458579 PE=4 SV=1)
HSP 1 Score: 208.8 bits (530), Expect = 1.8e-50
Identity = 128/166 (77.11%), Postives = 136/166 (81.93%), Query Frame = 0
Query: 1 MLGFLSRPCQWS-SPSLPFLSSTSSSSSLSPSSLR-YKFTLHCAFLRPQSQIPRNRARFT 60
MLGFL+ P QW SPSLP SS S S+ S S +KF LH + +SQIP R RFT
Sbjct: 3 MLGFLTIPYQWKISPSLPSSSSPFSLSTRSSLSFSLFKFPLHYS----ESQIPGKRKRFT 62
Query: 61 AFSSNNGNGLGGNIKEREGGRNGAKGSNGGDDLRKERGPIFNIKWTELLIDPDPDNLLAV 120
A +SNN NGLGGNIKEREG RNGAKGS G DDLRKERGP+FNIKW ELLIDPDPDN+LAV
Sbjct: 63 ALASNNDNGLGGNIKEREGERNGAKGSKGDDDLRKERGPVFNIKWAELLIDPDPDNILAV 122
Query: 121 ALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 165
ALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL
Sbjct: 123 ALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 164
BLAST of Tan0003444 vs. ExPASy TrEMBL
Match:
A0A6J1JWP5 (uncharacterized protein LOC111489528 OS=Cucurbita maxima OX=3661 GN=LOC111489528 PE=4 SV=1)
HSP 1 Score: 200.3 bits (508), Expect = 6.3e-48
Identity = 127/172 (73.84%), Postives = 134/172 (77.91%), Query Frame = 0
Query: 1 MLGFLSRPCQWS-SPSLPFLSSTSSSSSLSPSSLR-------YKFTLHCAFLRPQSQIPR 60
MLGFL+ P QW SPSL SSSSS P S R +KF LH + +SQI
Sbjct: 3 MLGFLTIPYQWKISPSL------SSSSSPFPLSTRSFLSFPLFKFPLHYS----ESQISG 62
Query: 61 NRARFTAFSSNNGNGLGGNIKEREGGRNGAKGSNGGDDLRKERGPIFNIKWTELLIDPDP 120
R RF A +SNN NGLGGNIKEREG RNGAKGS G DDLRKERGP+FNIKW ELLIDPDP
Sbjct: 63 KRKRFAALASNNDNGLGGNIKEREGERNGAKGSKGDDDLRKERGPVFNIKWAELLIDPDP 122
Query: 121 DNLLAVALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 165
DN+LAVALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL
Sbjct: 123 DNILAVALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 164
BLAST of Tan0003444 vs. ExPASy TrEMBL
Match:
A0A5D3CD16 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold475G002130 PE=4 SV=1)
HSP 1 Score: 190.3 bits (482), Expect = 6.5e-45
Identity = 122/166 (73.49%), Postives = 129/166 (77.71%), Query Frame = 0
Query: 1 MLGFLSRPCQWS-SPSLPFLSSTSSSSSLSPSSLRYKFTLHCAFLRPQSQIPRNRARFTA 60
MLGFL+ P Q SPSL L S SS SSL YKF LH F + I NR RFTA
Sbjct: 1 MLGFLTIPHQLKISPSLASLPSISSPSSLFLP--LYKFPLHHTFFNSKFLISSNRRRFTA 60
Query: 61 FSSNNGNGLGGNIKEREGGRNGAK-GSNGGDDLRKERGPIFNIKWTELLIDPDPDNLLAV 120
+SN GG+IKEREG RNGAK SNGGDDL+KERGP+FNIKW ELLIDPDPDN+LAV
Sbjct: 61 SASNKNTEFGGSIKEREGERNGAKSSSNGGDDLKKERGPVFNIKWAELLIDPDPDNILAV 120
Query: 121 ALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 165
ALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL
Sbjct: 121 ALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 164
BLAST of Tan0003444 vs. ExPASy TrEMBL
Match:
A0A1S3BFX0 (uncharacterized protein LOC103489407 OS=Cucumis melo OX=3656 GN=LOC103489407 PE=4 SV=1)
HSP 1 Score: 190.3 bits (482), Expect = 6.5e-45
Identity = 122/166 (73.49%), Postives = 129/166 (77.71%), Query Frame = 0
Query: 1 MLGFLSRPCQWS-SPSLPFLSSTSSSSSLSPSSLRYKFTLHCAFLRPQSQIPRNRARFTA 60
MLGFL+ P Q SPSL L S SS SSL YKF LH F + I NR RFTA
Sbjct: 23 MLGFLTIPHQLKISPSLASLPSISSPSSLFLP--LYKFPLHHTFFNSKFLISSNRRRFTA 82
Query: 61 FSSNNGNGLGGNIKEREGGRNGAK-GSNGGDDLRKERGPIFNIKWTELLIDPDPDNLLAV 120
+SN GG+IKEREG RNGAK SNGGDDL+KERGP+FNIKW ELLIDPDPDN+LAV
Sbjct: 83 SASNKNTEFGGSIKEREGERNGAKSSSNGGDDLKKERGPVFNIKWAELLIDPDPDNILAV 142
Query: 121 ALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 165
ALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL
Sbjct: 143 ALTGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 186
BLAST of Tan0003444 vs. TAIR 10
Match:
AT4G40045.1 (unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 86.3 bits (212), Expect = 2.5e-17
Identity = 60/104 (57.69%), Postives = 75/104 (72.12%), Query Frame = 0
Query: 61 SSNNGNGLGGNIKEREGGRNGAKGSNGGDDLRKERGPIFNIKWTELLIDPDPDNLLAVAL 120
++ NGN + KE GG N ++ G+ +K++ F+ KW ELL +PD DN +AV L
Sbjct: 41 NAQNGN---DSAKESSGGGNRPVTNDDGNGSKKDQFAGFSFKWGELL-NPDQDNFVAVGL 100
Query: 121 TGLLAWASVQVLWQLFFISLAILVAALKYSFIAALLIFILITLL 165
G+L WAS+QVL QLFFIS AILVAALKYSFIAALLIFIL+TLL
Sbjct: 101 AGVLTWASLQVLSQLFFISFAILVAALKYSFIAALLIFILVTLL 140
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_022139143.1 | 2.1e-58 | 80.95 | uncharacterized protein LOC111010120 [Momordica charantia] | [more] |
KAG7032002.1 | 1.3e-50 | 77.71 | hypothetical protein SDJN02_06044, partial [Cucurbita argyrosperma subsp. argyro... | [more] |
XP_023549272.1 | 1.3e-50 | 77.71 | uncharacterized protein LOC111807677 [Cucurbita pepo subsp. pepo] | [more] |
XP_022957096.1 | 3.6e-50 | 77.11 | uncharacterized protein LOC111458579 [Cucurbita moschata] | [more] |
XP_022993576.1 | 1.3e-47 | 73.84 | uncharacterized protein LOC111489528 [Cucurbita maxima] | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1CF05 | 1.0e-58 | 80.95 | uncharacterized protein LOC111010120 OS=Momordica charantia OX=3673 GN=LOC111010... | [more] |
A0A6J1GYA2 | 1.8e-50 | 77.11 | uncharacterized protein LOC111458579 OS=Cucurbita moschata OX=3662 GN=LOC1114585... | [more] |
A0A6J1JWP5 | 6.3e-48 | 73.84 | uncharacterized protein LOC111489528 OS=Cucurbita maxima OX=3661 GN=LOC111489528... | [more] |
A0A5D3CD16 | 6.5e-45 | 73.49 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold... | [more] |
A0A1S3BFX0 | 6.5e-45 | 73.49 | uncharacterized protein LOC103489407 OS=Cucumis melo OX=3656 GN=LOC103489407 PE=... | [more] |
Match Name | E-value | Identity | Description | |
AT4G40045.1 | 2.5e-17 | 57.69 | unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae -... | [more] |