Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, CDS, polypeptide
CTTTCTTAGCTTCTTATTCCCCTTCTTTTCTTCACTCATAAAACTCATCTCTTTTGACTAAATTTCAACAGCCGAAAGCAGCATTCAAGATTATAAGTTACTGCAATCAATCCATCGCAAGCATTCCACAGCAATGGCTTCCCATTCAGATCATCCGCCATTTTCCTTTCGCAATGTCATGAAGCATGCTTTATCCCGCACTCGTTCTATGTTATCCAAATTACTACCTCAATTACATGAAACTAATTCAGCAAATCAATTTCAGATCATTACCAGATTTATTCACGCGATGGCGCCAATCCGAAGGAGATTCGACGAACCGCAACATCGCACACTTCCATTGTACGTCGTTAACGAGTTCCAGAGCCATCCAAGGACGATCCGCTCCACGAGCTTCGGGGAGATGAATTTGAGCCTCACATTTCAGGTGGCCATAGGATTGCTCGTCACTTCGCAGATTCCATCTTCTCGATTCCTGCAAATCGCAGAGGTAATGACGGTGATTAGTTTTGGAGTTTCGTTTTGTGGCGTTTTTCTTCGAAATTCCTTTCCGAGATTCGGAAACAATCTCGAGAAATTCGGTTCGGTTTTAACCTCGATGATCTTCTTCCTAATGACGACCTCCTTTCTTCCGGCGAGAATTCGCTGGATAAGTTGGCCGGTGTTTGCTCTGTCAATGGCGGCCTTCTTGTTCTCGCTCTTCAGATGA
mRNA sequence
CTTTCTTAGCTTCTTATTCCCCTTCTTTTCTTCACTCATAAAACTCATCTCTTTTGACTAAATTTCAACAGCCGAAAGCAGCATTCAAGATTATAAGTTACTGCAATCAATCCATCGCAAGCATTCCACAGCAATGGCTTCCCATTCAGATCATCCGCCATTTTCCTTTCGCAATGTCATGAAGCATGCTTTATCCCGCACTCGTTCTATGTTATCCAAATTACTACCTCAATTACATGAAACTAATTCAGCAAATCAATTTCAGATCATTACCAGATTTATTCACGCGATGGCGCCAATCCGAAGGAGATTCGACGAACCGCAACATCGCACACTTCCATTGTACGTCGTTAACGAGTTCCAGAGCCATCCAAGGACGATCCGCTCCACGAGCTTCGGGGAGATGAATTTGAGCCTCACATTTCAGGTGGCCATAGGATTGCTCGTCACTTCGCAGATTCCATCTTCTCGATTCCTGCAAATCGCAGAGGTAATGACGGTGATTAGTTTTGGAGTTTCGTTTTGTGGCGTTTTTCTTCGAAATTCCTTTCCGAGATTCGGAAACAATCTCGAGAAATTCGGTTCGGTTTTAACCTCGATGATCTTCTTCCTAATGACGACCTCCTTTCTTCCGGCGAGAATTCGCTGGATAAGTTGGCCGGTGTTTGCTCTGTCAATGGCGGCCTTCTTGTTCTCGCTCTTCAGATGA
Coding sequence (CDS)
ATGGCTTCCCATTCAGATCATCCGCCATTTTCCTTTCGCAATGTCATGAAGCATGCTTTATCCCGCACTCGTTCTATGTTATCCAAATTACTACCTCAATTACATGAAACTAATTCAGCAAATCAATTTCAGATCATTACCAGATTTATTCACGCGATGGCGCCAATCCGAAGGAGATTCGACGAACCGCAACATCGCACACTTCCATTGTACGTCGTTAACGAGTTCCAGAGCCATCCAAGGACGATCCGCTCCACGAGCTTCGGGGAGATGAATTTGAGCCTCACATTTCAGGTGGCCATAGGATTGCTCGTCACTTCGCAGATTCCATCTTCTCGATTCCTGCAAATCGCAGAGGTAATGACGGTGATTAGTTTTGGAGTTTCGTTTTGTGGCGTTTTTCTTCGAAATTCCTTTCCGAGATTCGGAAACAATCTCGAGAAATTCGGTTCGGTTTTAACCTCGATGATCTTCTTCCTAATGACGACCTCCTTTCTTCCGGCGAGAATTCGCTGGATAAGTTGGCCGGTGTTTGCTCTGTCAATGGCGGCCTTCTTGTTCTCGCTCTTCAGATGA
Protein sequence
MASHSDHPPFSFRNVMKHALSRTRSMLSKLLPQLHETNSANQFQIITRFIHAMAPIRRRFDEPQHRTLPLYVVNEFQSHPRTIRSTSFGEMNLSLTFQVAIGLLVTSQIPSSRFLQIAEVMTVISFGVSFCGVFLRNSFPRFGNNLEKFGSVLTSMIFFLMTTSFLPARIRWISWPVFALSMAAFLFSLFR
Homology
BLAST of Lcy13g003850 vs. ExPASy TrEMBL
Match:
A0A6J1B1Z4 (uncharacterized protein LOC110423379 isoform X1 OS=Herrania umbratica OX=108875 GN=LOC110423379 PE=4 SV=1)
HSP 1 Score: 84.7 bits (208), Expect = 4.5e-13
Identity = 59/196 (30.10%), Positives = 97/196 (49.49%), Query Frame = 0
Query: 8 PPFSFRNVMKHALSRTRSMLSKLLP---------QLHETNSANQFQIITRFIHAMAPIRR 67
PPFS + +K R +LS+ P + ++S ITR A +
Sbjct: 13 PPFSLLSTIKEVFHRVVRILSQFRPCSVANSLILPITRSSSLPTSSAITRMRFYTASYQW 72
Query: 68 RFDEPQHRTLPLYV-VNEFQSHPRTIR-STSFGEMNLSLTFQVAIGLLVTSQIPSSRFL- 127
R +PLY+ +E + H R S G+ LSL+FQ+ + L +S + + L
Sbjct: 73 RCQALD--CIPLYINYSEIEMHSYQPRPPVSLGKTILSLSFQIVVALAPSSSMGQAHHLL 132
Query: 128 --QIAEVMTVISFGVSFCGVFLRNSFPRFGNNLEKFGSVLTSMIFFLMTTSFLPARIRWI 187
++ +++F SF G+FLR+S+P+ N +E GS++ ++ FF+MT+ FLP W+
Sbjct: 133 PIDFVKISMIMAFAASFSGIFLRSSYPKMANIIENIGSLIAAVGFFIMTSIFLPGNFSWV 192
Query: 188 SWPVFALSMAAFLFSL 190
+W A S+ AF SL
Sbjct: 193 NWLACAFSLLAFFSSL 206
BLAST of Lcy13g003850 vs. ExPASy TrEMBL
Match:
A0A061EGV0 (Ileal sodium/bile acid cotransporter, putative OS=Theobroma cacao OX=3641 GN=TCM_011398 PE=4 SV=1)
HSP 1 Score: 80.5 bits (197), Expect = 8.5e-12
Identity = 59/197 (29.95%), Positives = 101/197 (51.27%), Query Frame = 0
Query: 9 PFSFRNVMKHALSRTRSMLSKLLP---------QLHETNSANQFQIITRFIHAMAPIRRR 68
PFS + +K R +LS+ P + ++S + ITR A + R
Sbjct: 14 PFSLLSTIKEVFHRVVRILSQFRPCSDANSLILPITRSSSLPTSRAITRMRFYTASYQWR 73
Query: 69 FDEPQHRTLPLYVVN---EFQSH-PRTIRSTSFGEMNLSLTFQVAIGLLVTSQIPSSRF- 128
+PL + + E QS+ PR S G+ LSL+FQ+ + L ++S + +
Sbjct: 74 SQALD--CIPLSINSLEIEMQSYQPRP--PVSLGKTILSLSFQIVVALALSSSMGQTHHV 133
Query: 129 --LQIAEVMTVISFGVSFCGVFLRNSFPRFGNNLEKFGSVLTSMIFFLMTTSFLPARIRW 188
+ I ++ +++F SF G+FLR+S+P+ N +E GS++ ++ FF+MT+ FLP + W
Sbjct: 134 LPIDIVKISMIMAFAASFSGIFLRSSYPKMANIIENIGSLIAAVGFFIMTSIFLPGNLYW 193
Query: 189 ISWPVFALSMAAFLFSL 190
++W A S+ AF SL
Sbjct: 194 VTWLACAFSLLAFFSSL 206
BLAST of Lcy13g003850 vs. ExPASy TrEMBL
Match:
A0A1R3HPA4 (Uncharacterized protein OS=Corchorus olitorius OX=93759 GN=COLO4_27768 PE=4 SV=1)
HSP 1 Score: 76.3 bits (186), Expect = 1.6e-10
Identity = 38/103 (36.89%), Positives = 63/103 (61.17%), Query Frame = 0
Query: 75 EFQSHPRTIRSTSFGEMNLSLTFQVAIGLLVTSQIPSSRFL--QIAEVMTVISFGVSFCG 134
E QS+ + S + G+ +SLTFQV + L ++ + L QI +V +++F SF G
Sbjct: 33 EMQSYQQRPNSANLGKTIMSLTFQVVVALALSMGQSHHQLLSIQIVKVSMIMAFAASFSG 92
Query: 135 VFLRNSFPRFGNNLEKFGSVLTSMIFFLMTTSFLPARIRWISW 176
+FLRNS+P+ +E GS+ ++ FF+MT+ FLP + W++W
Sbjct: 93 IFLRNSYPKSARIVENTGSIAAAVGFFIMTSIFLPVKFSWVAW 135
BLAST of Lcy13g003850 vs. ExPASy TrEMBL
Match:
A0A5D2MJL9 (Uncharacterized protein OS=Gossypium tomentosum OX=34277 GN=ES332_A13G114200v1 PE=4 SV=1)
HSP 1 Score: 71.2 bits (173), Expect = 5.1e-09
Identity = 62/206 (30.10%), Positives = 93/206 (45.15%), Query Frame = 0
Query: 1 MASHSDHPPFSFRNVMKHALSRTRSMLSKL-----------LPQLHETNSANQFQIITRF 60
M ++PPFS +++KH+ MLS+ LP T A ITR
Sbjct: 1 MVMPPNNPPFSILSIIKHSFHDVGIMLSQFRHSFEANNPLTLPISRSTPQATA-NSITR- 60
Query: 61 IHAMAPIRRRFDEPQHRTLPLYVVNEFQSHPRTIR--STSFGEMNLSLTFQVAIGLLVTS 120
I + P + + + E QSH + R S S G+ +SLTFQ L +S
Sbjct: 61 IRCCSAAGSWGASPICNYINSFEI-EMQSHHQQPRPNSVSLGKTIMSLTFQAVFALAPSS 120
Query: 121 QIPSS----RFLQIAEVMTVISFGVSFCGVFLRNSFPRFGNNLEKFGSVLTSMIFFLMTT 180
+ L + V++F SFCG+FL S PR + + SV+ ++ FF+M++
Sbjct: 121 STEQADHHHTLLPWSAASMVMAFAASFCGIFLHTSHPRIASIIGNTVSVIAALGFFIMSS 180
Query: 181 SFLPARIRWISWPVFALSMAAFLFSL 190
FLP W++W LS+ AF SL
Sbjct: 181 IFLPGNFAWVTWLACGLSLLAFFLSL 203
BLAST of Lcy13g003850 vs. ExPASy TrEMBL
Match:
A0A5J5T342 (Uncharacterized protein OS=Gossypium barbadense OX=3634 GN=ES319_A13G106300v1 PE=4 SV=1)
HSP 1 Score: 71.2 bits (173), Expect = 5.1e-09
Identity = 61/201 (30.35%), Positives = 92/201 (45.77%), Query Frame = 0
Query: 6 DHPPFSFRNVMKHALSRTRSMLSKL-----------LPQLHETNSANQFQIITRFIHAMA 65
++PPFS +++KH+ MLS+ LP T A ITR I +
Sbjct: 4 NNPPFSILSIIKHSFHDVGIMLSQFRHSFEANNPLTLPISRSTPQATA-NSITR-IRCCS 63
Query: 66 PIRRRFDEPQHRTLPLYVVNEFQSHPRTIR--STSFGEMNLSLTFQVAIGLLVTSQIPSS 125
P + + + E QSH + R S S G+ +SLTFQ L +S +
Sbjct: 64 AAGSWGASPICNYINSFEI-EMQSHHQQPRPNSVSLGKTIMSLTFQAVFALAPSSSTEQA 123
Query: 126 ----RFLQIAEVMTVISFGVSFCGVFLRNSFPRFGNNLEKFGSVLTSMIFFLMTTSFLPA 185
L + V++F SFCG+FL S PR + + SV+ ++ FF+M++ FLP
Sbjct: 124 DHHHTLLPWSAASMVMAFAASFCGIFLHTSHPRIASIIGNTVSVIAALGFFIMSSIFLPG 183
Query: 186 RIRWISWPVFALSMAAFLFSL 190
W++W LS+ AF SL
Sbjct: 184 NFAWVTWLACGLSLLAFFLSL 201
BLAST of Lcy13g003850 vs. NCBI nr
Match:
XP_021293273.1 (uncharacterized protein LOC110423379 isoform X1 [Herrania umbratica])
HSP 1 Score: 84.7 bits (208), Expect = 9.3e-13
Identity = 59/196 (30.10%), Positives = 97/196 (49.49%), Query Frame = 0
Query: 8 PPFSFRNVMKHALSRTRSMLSKLLP---------QLHETNSANQFQIITRFIHAMAPIRR 67
PPFS + +K R +LS+ P + ++S ITR A +
Sbjct: 13 PPFSLLSTIKEVFHRVVRILSQFRPCSVANSLILPITRSSSLPTSSAITRMRFYTASYQW 72
Query: 68 RFDEPQHRTLPLYV-VNEFQSHPRTIR-STSFGEMNLSLTFQVAIGLLVTSQIPSSRFL- 127
R +PLY+ +E + H R S G+ LSL+FQ+ + L +S + + L
Sbjct: 73 RCQALD--CIPLYINYSEIEMHSYQPRPPVSLGKTILSLSFQIVVALAPSSSMGQAHHLL 132
Query: 128 --QIAEVMTVISFGVSFCGVFLRNSFPRFGNNLEKFGSVLTSMIFFLMTTSFLPARIRWI 187
++ +++F SF G+FLR+S+P+ N +E GS++ ++ FF+MT+ FLP W+
Sbjct: 133 PIDFVKISMIMAFAASFSGIFLRSSYPKMANIIENIGSLIAAVGFFIMTSIFLPGNFSWV 192
Query: 188 SWPVFALSMAAFLFSL 190
+W A S+ AF SL
Sbjct: 193 NWLACAFSLLAFFSSL 206
BLAST of Lcy13g003850 vs. NCBI nr
Match:
EOY01534.1 (Ileal sodium/bile acid cotransporter, putative [Theobroma cacao])
HSP 1 Score: 80.5 bits (197), Expect = 1.7e-11
Identity = 59/197 (29.95%), Positives = 101/197 (51.27%), Query Frame = 0
Query: 9 PFSFRNVMKHALSRTRSMLSKLLP---------QLHETNSANQFQIITRFIHAMAPIRRR 68
PFS + +K R +LS+ P + ++S + ITR A + R
Sbjct: 14 PFSLLSTIKEVFHRVVRILSQFRPCSDANSLILPITRSSSLPTSRAITRMRFYTASYQWR 73
Query: 69 FDEPQHRTLPLYVVN---EFQSH-PRTIRSTSFGEMNLSLTFQVAIGLLVTSQIPSSRF- 128
+PL + + E QS+ PR S G+ LSL+FQ+ + L ++S + +
Sbjct: 74 SQALD--CIPLSINSLEIEMQSYQPRP--PVSLGKTILSLSFQIVVALALSSSMGQTHHV 133
Query: 129 --LQIAEVMTVISFGVSFCGVFLRNSFPRFGNNLEKFGSVLTSMIFFLMTTSFLPARIRW 188
+ I ++ +++F SF G+FLR+S+P+ N +E GS++ ++ FF+MT+ FLP + W
Sbjct: 134 LPIDIVKISMIMAFAASFSGIFLRSSYPKMANIIENIGSLIAAVGFFIMTSIFLPGNLYW 193
Query: 189 ISWPVFALSMAAFLFSL 190
++W A S+ AF SL
Sbjct: 194 VTWLACAFSLLAFFSSL 206
BLAST of Lcy13g003850 vs. NCBI nr
Match:
OMO72209.1 (hypothetical protein COLO4_27768 [Corchorus olitorius])
HSP 1 Score: 76.3 bits (186), Expect = 3.3e-10
Identity = 38/103 (36.89%), Positives = 63/103 (61.17%), Query Frame = 0
Query: 75 EFQSHPRTIRSTSFGEMNLSLTFQVAIGLLVTSQIPSSRFL--QIAEVMTVISFGVSFCG 134
E QS+ + S + G+ +SLTFQV + L ++ + L QI +V +++F SF G
Sbjct: 33 EMQSYQQRPNSANLGKTIMSLTFQVVVALALSMGQSHHQLLSIQIVKVSMIMAFAASFSG 92
Query: 135 VFLRNSFPRFGNNLEKFGSVLTSMIFFLMTTSFLPARIRWISW 176
+FLRNS+P+ +E GS+ ++ FF+MT+ FLP + W++W
Sbjct: 93 IFLRNSYPKSARIVENTGSIAAAVGFFIMTSIFLPVKFSWVAW 135
BLAST of Lcy13g003850 vs. NCBI nr
Match:
PPR92765.1 (hypothetical protein GOBAR_AA27906 [Gossypium barbadense] >TYG86151.1 hypothetical protein ES288_A13G111900v1 [Gossypium darwinii] >TYH91418.1 hypothetical protein ES332_A13G114200v1 [Gossypium tomentosum])
HSP 1 Score: 71.2 bits (173), Expect = 1.1e-08
Identity = 62/206 (30.10%), Positives = 93/206 (45.15%), Query Frame = 0
Query: 1 MASHSDHPPFSFRNVMKHALSRTRSMLSKL-----------LPQLHETNSANQFQIITRF 60
M ++PPFS +++KH+ MLS+ LP T A ITR
Sbjct: 1 MVMPPNNPPFSILSIIKHSFHDVGIMLSQFRHSFEANNPLTLPISRSTPQATA-NSITR- 60
Query: 61 IHAMAPIRRRFDEPQHRTLPLYVVNEFQSHPRTIR--STSFGEMNLSLTFQVAIGLLVTS 120
I + P + + + E QSH + R S S G+ +SLTFQ L +S
Sbjct: 61 IRCCSAAGSWGASPICNYINSFEI-EMQSHHQQPRPNSVSLGKTIMSLTFQAVFALAPSS 120
Query: 121 QIPSS----RFLQIAEVMTVISFGVSFCGVFLRNSFPRFGNNLEKFGSVLTSMIFFLMTT 180
+ L + V++F SFCG+FL S PR + + SV+ ++ FF+M++
Sbjct: 121 STEQADHHHTLLPWSAASMVMAFAASFCGIFLHTSHPRIASIIGNTVSVIAALGFFIMSS 180
Query: 181 SFLPARIRWISWPVFALSMAAFLFSL 190
FLP W++W LS+ AF SL
Sbjct: 181 IFLPGNFAWVTWLACGLSLLAFFLSL 203
BLAST of Lcy13g003850 vs. NCBI nr
Match:
KAB2048329.1 (hypothetical protein ES319_A13G106300v1 [Gossypium barbadense])
HSP 1 Score: 71.2 bits (173), Expect = 1.1e-08
Identity = 61/201 (30.35%), Positives = 92/201 (45.77%), Query Frame = 0
Query: 6 DHPPFSFRNVMKHALSRTRSMLSKL-----------LPQLHETNSANQFQIITRFIHAMA 65
++PPFS +++KH+ MLS+ LP T A ITR I +
Sbjct: 4 NNPPFSILSIIKHSFHDVGIMLSQFRHSFEANNPLTLPISRSTPQATA-NSITR-IRCCS 63
Query: 66 PIRRRFDEPQHRTLPLYVVNEFQSHPRTIR--STSFGEMNLSLTFQVAIGLLVTSQIPSS 125
P + + + E QSH + R S S G+ +SLTFQ L +S +
Sbjct: 64 AAGSWGASPICNYINSFEI-EMQSHHQQPRPNSVSLGKTIMSLTFQAVFALAPSSSTEQA 123
Query: 126 ----RFLQIAEVMTVISFGVSFCGVFLRNSFPRFGNNLEKFGSVLTSMIFFLMTTSFLPA 185
L + V++F SFCG+FL S PR + + SV+ ++ FF+M++ FLP
Sbjct: 124 DHHHTLLPWSAASMVMAFAASFCGIFLHTSHPRIASIIGNTVSVIAALGFFIMSSIFLPG 183
Query: 186 RIRWISWPVFALSMAAFLFSL 190
W++W LS+ AF SL
Sbjct: 184 NFAWVTWLACGLSLLAFFLSL 201
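The percentages on each HSP statistics line above are the reported counts divided by the alignment length. As a minimal sketch, assuming the "count/length (pct%)" layout shown in these records, the values can be recomputed in Python. The helper name `hsp_percentages` and the regular expression are illustrative assumptions, not part of the database or of any BLAST tool.

```python
import re

# Hypothetical pattern for one HSP statistics line in the layout used above.
# It accepts both "Positives" and the misspelled "Postives" label.
HSP_FIELD = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+) \(\d+\.\d+%\)")


def hsp_percentages(line):
    """Return {field: (count, length, recomputed_percent)} for one stats line."""
    result = {}
    for name, count, length in HSP_FIELD.findall(line):
        # Normalise the label so both spellings map to the same key.
        key = "Identity" if name == "Identity" else "Positives"
        count, length = int(count), int(length)
        result[key] = (count, length, round(100.0 * count / length, 2))
    return result


if __name__ == "__main__":
    stats = "Identity = 59/196 (30.10%), Positives = 97/196 (49.49%), Query Frame = 0"
    for field, (count, length, pct) in hsp_percentages(stats).items():
        print(f"{field}: {count}/{length} = {pct:.2f}%")
```

Recomputing the percentages this way is only a consistency check on the reported figures; it does not re-run the alignment.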
The following BLAST results are available for this feature:
ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1B1Z4 | 4.5e-13 | 30.10 | uncharacterized protein LOC110423379 isoform X1 OS=Herrania umbratica OX=108875 ...
A0A061EGV0 | 8.5e-12 | 29.95 | Ileal sodium/bile acid cotransporter, putative OS=Theobroma cacao OX=3641 GN=TCM...
A0A1R3HPA4 | 1.6e-10 | 36.89 | Uncharacterized protein OS=Corchorus olitorius OX=93759 GN=COLO4_27768 PE=4 SV=1
A0A5D2MJL9 | 5.1e-09 | 30.10 | Uncharacterized protein OS=Gossypium tomentosum OX=34277 GN=ES332_A13G114200v1 P...
A0A5J5T342 | 5.1e-09 | 30.35 | Uncharacterized protein OS=Gossypium barbadense OX=3634 GN=ES319_A13G106300v1 PE...
NCBI nr
Match Name | E-value | Identity (%) | Description
XP_021293273.1 | 9.3e-13 | 30.10 | uncharacterized protein LOC110423379 isoform X1 [Herrania umbratica]
EOY01534.1 | 1.7e-11 | 29.95 | Ileal sodium/bile acid cotransporter, putative [Theobroma cacao]
OMO72209.1 | 3.3e-10 | 36.89 | hypothetical protein COLO4_27768 [Corchorus olitorius]
PPR92765.1 | 1.1e-08 | 30.10 | hypothetical protein GOBAR_AA27906 [Gossypium barbadense] >TYG86151.1 hypothetic...
KAB2048329.1 | 1.1e-08 | 30.35 | hypothetical protein ES319_A13G106300v1 [Gossypium barbadense]