Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGGATTGTCATGTACTTAAAAATTTTAAACATATTTGATCCAATCAAAGAAAATTTTTTACCATCTAACTAAACTGATCGAGGAATTTCTAGAAATCTTCACTTGTGAATAACACTGAGAGATTATATATAGCATATGTGAGAGAGATAACAAACTCTCTCAACAAACTCAACAATTTAACAAACTCTAGTTACAACAAACTCAACAATTTAACAAACTCAAGTTATTAATATCATAACTAATTACATAATTAATTATTACTCTGCTAATATGCCTCCTCAAACTGAAGCACCCAACGGAAGCAAAAGTTTTGAACATAAATTTGTAAAAGACGGTCCAGACAAGGGCTATGTCAGAATGTCTGCGAGCTGATCAGTAGAGGCAATGTACTTGACATCGAGATCACCATGAACAACTTTCTGCCAAACAAAATGGTAATCAACCTCGATATGCTTTGTTCGAGCATGAAAAATAGGATTCCGAGCCAAAGCAATAGCCGAAACACTGTCACACAAAATTACAGGAGGATGAGAAACAAATATATGCATTTCACACAAGAGTTGCCTTAACCAATAAAGCTCAGCAGCAGTGACAGCAAGACTTCGATATTGAGCCTCAATAGAAGATCGTGAAACAATTGCTTGCTTCTTAGTAGACCAAGACACAGGCATATTACCAATAAAGAGAAGAAAACCAGACGTAGAACACCTATCCTTTGAGTCACCAGCCCAATCAGAATCACAATAGGCACAAATATTTAAAGATGAACCAGTAAAAGAAATACCAAGTGAACGAGTTCCAAACAAATATCTCAGAATACGTTTGGCAGCAGAGTAATGGGTAGTAGAAGGGCAATGCATATATTGGCTAACCTTGTTTACAGCAAATGAAATATCAGGTCGGGTACACGTTAAGTATTGTAAAGAACCATCAACCTGTCTATACAACAAGGGATTTTCAAAAGGAGGAGCACCATCATACATATCACAAGCAGTAGACATAGGTGTAGAGCATTGCTTGGCATTAACCATCCCAATTTTCAGAAGAAGATTAGAAAGATATTTGGACTGATTCACAAAGAAACCAGTTGACTACTTATGTACTTCAAGCCCCAGAAAATATTTCAAATACACCAAATCAGTAATATGAAATTCACTTCGTAATGCTACCTTTAAGCTCAGAAACATAAGTAGGGTCATTACTGGTTATGACAATATCATCAACGTAAAGTAGCAAGCATGTGAGTGAACCATTTACACGACGAACGAACAAAGAAGTATCAACCAGAGAGCCAACAAAACCAAGAGTGAGTAAATGAGAAGTAAATCGCTCAAACCAAGCACGTGGGGCTTGTTTTAACCCAAACAATGACTTATGTAATTTACACACACGATGGGGATAGATAGCATCTTTAAAACTGGGAGGCTGCATCATATATACGTCTTCCTTTAGAGAACCATGTAAAAAGGCATTCTTGACATCCAATTGAAACAAATCCCACTTATAAAAAGCTGCTAAAGCAAGAATGACCCTGACAGTAGTTTTTTTTAACAACGGACTAAAGGTTTCATAAAAATCCAAACCTTCCAACTGATTAAATCCCTCAGCAACTAGTCGACCCTTATAACGAGAAACTGAACCATCAGGGTTATATTTAGTGTGAAACACCCATTTACATCCAACTATATTCATACTATCAGTGGGAGGTACTAGAGTCCAAGTCCCTTGTTCTTCAAAAATTCTTAGCTTGTCTCCATTCAAGAATCTTAGAAGCAGCAGTATAAGAAGTTGGTTCAGTCAAAGAACTATTAACAACAAAAAATTTCTTTTTAAAAGTACCAGACTTGGACCGTGTTTGCATTGGATGTCTATTAGAGGCAACAACATCAGTAACAACAGCCTCATTTGGAGCAGATAAAATGGACTCATTAGGTGGAATAATAGCTTCAACAGACGCAACAGGCAAGGAATTAGAAACACTAGTATGATCACAAGTAC
TTGGGATATTGCAACCAACATCAGCACAAGTAGCAGACGGATAAAAAGAAGTCTCAGAGAGAAATTTATCAACCAAAGCAGGAAAAGTAGGATCCATTGGAGTCATATTAGACATAGAACCAATATTAGTAGCAGACATTCCAGCGAAAGGAAACACATGCTCATGGAAGATAACATGCCGAGAGATAATCATCTTCTTGGTAGACTGATTATAGCAAAGATATCCCTTGTAATCAACTGTATATCCCATAAAAATATGTTGTTCGGTCTTTGGTAGTAACTTAGCGACGTGTACGAACAAGATTTCTAATGAAGAATGCCTTTAAAACTGAAAAAGTTAGTAACAATAGCATTCACAAATTCACCCCCTCCGTCAGACCTGAAGATTTTGACTTTTTGAGACAATAGATTTTCAATAAGCGAAATAAACTATGGAATAATAGTAAATGCATCAGACTTATAGACCATTGGAAAAATCCATGTATAACGGCTGAAGTCATCCACAAAGCAAATATAATATCTATGGCCAGTAATAGATAAAACAGGGGAAGGGCCCCCACACATCACTGTGTAACAATTCAAGAGGCTTAGTAGTCATTGTTACTGAAGACTGAAAAGGTAATTTGCTCATTTTGCCTTTAAGACAACCAACACATTCAATAGTATTAATAGACTGTCCAATAGAAATAGAAGACTGAGAGACAAAATCTTACGAAGGACATTGGGAGATGGATGTCCAAGTCTAAAGTGCCACAAATCGGTTGTCTTTTTATTCAAAGTAAAAGCAGAAACATGGAAACCAGAGTAAACAACTGACGGTTGTGAAACAACTTGATTAGAGGACGAAACAACCTCACCACCTTTGAAACCACCAAATGGAAGGGGATACAAACCATCTTTACTATGTTCTCTGTACATAACCTTGCCCATTACTTTGTCTTGAATAAGAAACCAATGAGCATCAAACACAAAGATACAATTATTATCATGATAGCACTGAGAGAGAAGATTGGCAGAAATATCCGATACACATAGAATATTGGATAAGGATAAGTCATTATCAGATATCGAAAGAGTACCAAAGCCGGTCTAAGCAACAGAAAATGTTTGGCCACTAGCAACAACAATCAAATCTTCACCATTATAGTTTCCATTTAAATTCAGGTTAGACATATCAAATGTAACGTGAGAGTTACATCCACTATCAGCAAGCCAAAATGAGGAATTCGTAGCAGGATCAACAACAACAACCATAACAACAAGTTGAGACGGAGGATGACGTCCTTGAAAGGAAAAATTCATCCGGTTGTAACAGTCGAGTGCTCCATGGTCAGGTTTATTGCAAGTCTGGCGAACAACTCATCCAAAAGAGTTAATAAAAGAACTCGATGATGAAGATGAGTTAGAATTACCAGTGGAATTCTGGTAATTCGGAGAAGGTTGGGACCTAGAAGAGTTTGATCCGGAGCCTCCATTTCCACGTCCACGACCACGTGGTTAAAAATTTAAGCTGCTAGGAGATTGACAGCCACGTCCACGAAAGGAATTTGAGGAGCCTCGTCCAAATGTTGCCGCAATAGCTGATGGATTAAAAATTGAAGTAGTTTTATTCTGTTGCTCAATGAATTTGGCTTCAGACTTGAGTAGAGCATAAAGCTCCTCAAGTGTTACAGCTTCTCTTCGAGTTCGAATAGAGGTTTGCAATGAATTGTACTCAACGGGAAGTCCATGCAATGTATATAACAACACATCTTCGTCATCAATAGTTACCGCAACATTGGCCAATTGATCGCGAATTTCCTTGATTCGAGGAAGATAATCTTCAATTGACTCCGAAGCAGTTTTCGAGATAGTATGAAGAGCAGATTTAAGCTCGTGGATGTGAGACCTGGTGAGAGAAGAAAATTGCTTCTTAAGAGTTATCCAAACTTCTCTTGAGGTTTTGCAGCCAACGACGAAGGAGAAGGCATTCTTCGACAAGGTTGTGATAATCAAGGTAA
TTAGGGCATTGTCTTGAGTAGTCCACTGAGTGAAGACTGGATTGACTGTAGTAGATCGTGTTCCATCTTCTTGAAGAGAAATTTTGTCAGGCCGAGGAGTAGATCCATCGATTACTCCAAAAGTGAATGTGCCTTCTGTAACACCCGAGTTAGGAGGGTACATGAATGAACCGAGGCCACACCCGAATGAGAGGGATCCTAAGGACATGAAAGTATAGTCAAAATAAGCTTAAAAGAATTGATAGATACTACTTATACCAACAAGGTGCACCTTCCTTTTCGGTGGCTCAATTATAGGAACTTTGAAGTTAAGCGTGCTTGGCTTAGAGAAGTTCTATGTTGAGTGACCTCCTAAGAATTTTCTTAGGAAGCATGTGAGTGAGGACAAAACATGCTAAAAGGACTCACTCTTAGAAGCAGTTCAGATAAGTATGGTGACGTCGCCAGACCGTAGGGGATGTGGGGGAATGCCGAGGCCATGAGGTGCCGAATCCGGATTCTGAATCCTAGGCCTGGGGCGTTACAGATGGTTTCTTAGGAAGCATGTGAGTGAGGACAAAGCAAGCTAAAAGGACCCTGTGCCCGGATTCTGAATCCTAGGCCTGGGGCGTTACAAATGGTTTCTTAGGAAGCATGTGAGTGAGGACAAAACAAGCTAAAAGGACATTGTGTTGGTTTGTGAGGACAATATTCACTCTCAGAAGCAGTTCAGATAAGTATGGTGACGTCGTCAGGTCGTAGGGGATGTAGGGGAATGCCGGAGTCATGAGGTGCCGAATCTGGATTCCGAATCTTGGGCCTGTGGGGTTACAGATTTAGCCGAAGGTTGTGCGGTACGAGAGGGTGTCCGGTTGGCTAGGGAGCTCGGTTTTCAGCCTTTCCAGATTGAGAAGGACTCCTTACGGGTTCACCGCTTGTTGACGGCACCTTATGAAGATTTGTCGGAGCTGGGTGTCTTGCTGGATGAGGTAAAGCGGTGTGGTTTGTCTGGCTCTGCGAACTTGGTCTTGTTTACTCGTCGATCAGGCAATATGGCGGCTCACTCTTTGGCAAAACTTGCGATGGACTTTGCATTAGATCGAGTATGGTTGGAAGAATGGCCCAGAGATATTTCTCCTATGATTTCTACTGAATGTTTAGTTTCTTTTGATGTTGATTTGTAA
mRNA sequence
ATGGGATTGTCATGTTGTGCGGTACGAGAGGGTGTCCGGTTGGCTAGGGAGCTCGGTTTTCAGCCTTTCCAGATTGAGAAGGACTCCTTACGGGTTCACCGCTTGTTGACGGCACCTTATGAAGATTTGTCGGAGCTGGGTGTCTTGCTGGATGAGGTAAAGCGGTGTGGTTTGTCTGGCTCTGCGAACTTGGTCTTGTTTACTCGTCGATCAGGCAATATGGCGGCTCACTCTTTGGCAAAACTTGCGATGGACTTTGCATTAGATCGAGTATGGTTGGAAGAATGGCCCAGAGATATTTCTCCTATGATTTCTACTGAATGTTTAGTTTCTTTTGATGTTGATTTGTAA
Coding sequence (CDS)
ATGGGATTGTCATGTTGTGCGGTACGAGAGGGTGTCCGGTTGGCTAGGGAGCTCGGTTTTCAGCCTTTCCAGATTGAGAAGGACTCCTTACGGGTTCACCGCTTGTTGACGGCACCTTATGAAGATTTGTCGGAGCTGGGTGTCTTGCTGGATGAGGTAAAGCGGTGTGGTTTGTCTGGCTCTGCGAACTTGGTCTTGTTTACTCGTCGATCAGGCAATATGGCGGCTCACTCTTTGGCAAAACTTGCGATGGACTTTGCATTAGATCGAGTATGGTTGGAAGAATGGCCCAGAGATATTTCTCCTATGATTTCTACTGAATGTTTAGTTTCTTTTGATGTTGATTTGTAA
Protein sequence
MGLSCCAVREGVRLARELGFQPFQIEKDSLRVHRLLTAPYEDLSELGVLLDEVKRCGLSGSANLVLFTRRSGNMAAHSLAKLAMDFALDRVWLEEWPRDISPMISTECLVSFDVDL
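The protein sequence above corresponds to a codon-by-codon translation of the coding sequence. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1); the page does not state which table was used, and the function name `translate` is illustrative only. For brevity it is run on the first ten codons of the CDS plus a stop codon rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), assumed here; codons are enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above, followed by a stop codon.
print(translate("ATGGGATTGTCATGTTGTGCGGTACGAGAG" + "TAA"))  # MGLSCCAVRE
```

Passing the full coding sequence to `translate` should reproduce the protein sequence shown above, if the standard-table assumption holds.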
Homology
BLAST of Lcy11g010470 vs. ExPASy TrEMBL
Match:
A0A6J1DBJ7 (uncharacterized protein LOC111018973 OS=Momordica charantia OX=3673 GN=LOC111018973 PE=4 SV=1)
HSP 1 Score: 86.3 bits (212), Expect = 9.4e-14
Identity = 50/104 (48.08%), Positives = 65/104 (62.50%), Query Frame = 0
Query: 7 AVREGVRLARELGFQPFQIEKDSLRVHRLLTAPYEDLSELGVLLDEVKRCGLSGSANLV- 66
AV EG+ LA E GF FQIE DSLR+ LLT D SE+GVL +K LS A V
Sbjct: 161 AVYEGILLAVEAGFIRFQIETDSLRIFNLLTTDCVDDSEVGVLCSVIK-LFLSSHAERVS 220
Query: 67 -LFTRRSGNMAAHSLAKLAMDFALDRVWLEEWPRDISPMISTEC 109
FT R+GN AH LA+LA+ ++W+EEWP +IS +++ +C
Sbjct: 221 FSFTHRNGNAXAHLLAQLALTSPHLQIWVEEWPDEISSVLAVDC 263
BLAST of Lcy11g010470 vs. ExPASy TrEMBL
Match:
A0A6J1C467 (uncharacterized protein LOC111007775 OS=Momordica charantia OX=3673 GN=LOC111007775 PE=4 SV=1)
HSP 1 Score: 78.2 bits (191), Expect = 2.5e-11
Identity = 44/106 (41.51%), Positives = 60/106 (56.60%), Query Frame = 0
Query: 5 CCAVREGVRLARELGFQPFQIEKDSLRVHRLLTAPYEDLSELGVLLDEVKRCGLSGSANL 64
C A +EGV LA E G PFQIE DS +V LL ED SE+GVL ++ + S ++
Sbjct: 128 CLAAQEGVCLAIEAGLIPFQIETDSSQVFNLLRTDCEDESEIGVLASSIRH--IVSSLHI 187
Query: 65 ---VLFTRRSGNMAAHSLAKLAMDFALDRVWLEEWPRDISPMISTE 108
F R GN AH+LA++ M VW+EEW D+S +I+ +
Sbjct: 188 GGGFSFVNREGNSGAHTLARMGMVSESFHVWVEEWLSDLSEVIAAD 231
BLAST of Lcy11g010470 vs. ExPASy TrEMBL
Match:
A0A6J1CIF1 (uncharacterized protein LOC111011237 OS=Momordica charantia OX=3673 GN=LOC111011237 PE=4 SV=1)
HSP 1 Score: 75.1 bits (183), Expect = 2.2e-10
Identity = 36/100 (36.00%), Positives = 55/100 (55.00%), Query Frame = 0
Query: 10 EGVRLARELGFQPFQIEKDSLRVHRLLTAPYEDLSELGVLLDEVKRCGLSGSANLVLFTR 69
EG++LA ++G P +E DS R+ L + P EDLSE G ++ + K F +
Sbjct: 94 EGLQLASKIGVNPVILETDSSRIFNLFSQPSEDLSETGEIVLKAKNFWTQSLHASFNFVK 153
Query: 70 RSGNMAAHSLAKLAMDFALDRVWLEEWPRDISPMISTECL 110
R GN AAH LA+ A+ +W+E+WP ++ + ECL
Sbjct: 154 REGNKAAHMLARRALLLREFSIWMEDWPLELKSCLEMECL 193
BLAST of Lcy11g010470 vs. ExPASy TrEMBL
Match:
A0A1R3JXX0 (RNase H domain-containing protein OS=Corchorus olitorius OX=93759 GN=COLO4_13164 PE=4 SV=1)
HSP 1 Score: 72.8 bits (177), Expect = 1.1e-09
Identity = 36/104 (34.62%), Positives = 55/104 (52.88%), Query Frame = 0
Query: 5 CCAVREGVRLARELGFQPFQIEKDSLRVHRLLTAPYEDLSELGVLLDEVKRCGLSGSANL 64
C A + + A+++GF +E D+L + R +TA D S +GV + E+K S + L
Sbjct: 198 CLAALKAITWAKDMGFNNIVLEGDALSIIRKVTASMPDFSPIGVYIAEIKVLSSSFVSCL 257
Query: 65 VLFTRRSGNMAAHSLAKLAMDFALDRVWLEEWPRDISPMISTEC 109
R GN+ AH LA L + R+W+EE P I ++ TEC
Sbjct: 258 FSHVHRDGNVIAHDLASLGSSLSETRIWIEEVPDSIVAVLQTEC 301
BLAST of Lcy11g010470 vs. ExPASy TrEMBL
Match:
A0A2P5ENY9 (Ribonuclease H-like domain containing protein OS=Trema orientale OX=63057 GN=TorRG33x02_169030 PE=4 SV=1)
HSP 1 Score: 71.6 bits (174), Expect = 2.4e-09
Identity = 40/107 (37.38%), Positives = 61/107 (57.01%), Query Frame = 0
Query: 5 CCAVREGVRLARELGFQPFQIEKDSLRVHRLLTAPYEDLSELGVLLDEVKRCGLSGSANL 64
C A+REG+ A+E +E DSLRV L Y+ +E ++LD+VK L
Sbjct: 93 CFAIREGLAFAKESFLHVRMVETDSLRVVNAL-GRYDKYAEESLILDDVKCLLLEADDGS 152
Query: 65 VLFTRRSGNMAAHSLAKLAMDFALDRVWLEEWPRDISPMISTECLVS 112
+F R+GN A+H+LA+ A+ + WLEE P IS ++++E LV+
Sbjct: 153 YMFILRNGNRASHTLARFALSLSSPLYWLEESPGCISHIVASELLVT 198
BLAST of Lcy11g010470 vs. NCBI nr
Match:
XP_022150944.1 (uncharacterized protein LOC111018973 [Momordica charantia])
HSP 1 Score: 86.3 bits (212), Expect = 1.9e-13
Identity = 50/104 (48.08%), Positives = 65/104 (62.50%), Query Frame = 0
Query: 7 AVREGVRLARELGFQPFQIEKDSLRVHRLLTAPYEDLSELGVLLDEVKRCGLSGSANLV- 66
AV EG+ LA E GF FQIE DSLR+ LLT D SE+GVL +K LS A V
Sbjct: 161 AVYEGILLAVEAGFIRFQIETDSLRIFNLLTTDCVDDSEVGVLCSVIK-LFLSSHAERVS 220
Query: 67 -LFTRRSGNMAAHSLAKLAMDFALDRVWLEEWPRDISPMISTEC 109
FT R+GN AH LA+LA+ ++W+EEWP +IS +++ +C
Sbjct: 221 FSFTHRNGNAXAHLLAQLALTSPHLQIWVEEWPDEISSVLAVDC 263
BLAST of Lcy11g010470 vs. NCBI nr
Match:
XP_022135942.1 (uncharacterized protein LOC111007775 [Momordica charantia])
HSP 1 Score: 78.2 bits (191), Expect = 5.3e-11
Identity = 44/106 (41.51%), Positives = 60/106 (56.60%), Query Frame = 0
Query: 5 CCAVREGVRLARELGFQPFQIEKDSLRVHRLLTAPYEDLSELGVLLDEVKRCGLSGSANL 64
C A +EGV LA E G PFQIE DS +V LL ED SE+GVL ++ + S ++
Sbjct: 128 CLAAQEGVCLAIEAGLIPFQIETDSSQVFNLLRTDCEDESEIGVLASSIRH--IVSSLHI 187
Query: 65 ---VLFTRRSGNMAAHSLAKLAMDFALDRVWLEEWPRDISPMISTE 108
F R GN AH+LA++ M VW+EEW D+S +I+ +
Sbjct: 188 GGGFSFVNREGNSGAHTLARMGMVSESFHVWVEEWLSDLSEVIAAD 231
BLAST of Lcy11g010470 vs. NCBI nr
Match:
XP_022140628.1 (uncharacterized protein LOC111011237 [Momordica charantia])
HSP 1 Score: 75.1 bits (183), Expect = 4.5e-10
Identity = 36/100 (36.00%), Positives = 55/100 (55.00%), Query Frame = 0
Query: 10 EGVRLARELGFQPFQIEKDSLRVHRLLTAPYEDLSELGVLLDEVKRCGLSGSANLVLFTR 69
EG++LA ++G P +E DS R+ L + P EDLSE G ++ + K F +
Sbjct: 94 EGLQLASKIGVNPVILETDSSRIFNLFSQPSEDLSETGEIVLKAKNFWTQSLHASFNFVK 153
Query: 70 RSGNMAAHSLAKLAMDFALDRVWLEEWPRDISPMISTECL 110
R GN AAH LA+ A+ +W+E+WP ++ + ECL
Sbjct: 154 REGNKAAHMLARRALLLREFSIWMEDWPLELKSCLEMECL 193
BLAST of Lcy11g010470 vs. NCBI nr
Match:
OMO99660.1 (hypothetical protein COLO4_13164 [Corchorus olitorius])
HSP 1 Score: 72.8 bits (177), Expect = 2.2e-09
Identity = 36/104 (34.62%), Positives = 55/104 (52.88%), Query Frame = 0
Query: 5 CCAVREGVRLARELGFQPFQIEKDSLRVHRLLTAPYEDLSELGVLLDEVKRCGLSGSANL 64
C A + + A+++GF +E D+L + R +TA D S +GV + E+K S + L
Sbjct: 198 CLAALKAITWAKDMGFNNIVLEGDALSIIRKVTASMPDFSPIGVYIAEIKVLSSSFVSCL 257
Query: 65 VLFTRRSGNMAAHSLAKLAMDFALDRVWLEEWPRDISPMISTEC 109
R GN+ AH LA L + R+W+EE P I ++ TEC
Sbjct: 258 FSHVHRDGNVIAHDLASLGSSLSETRIWIEEVPDSIVAVLQTEC 301
BLAST of Lcy11g010470 vs. NCBI nr
Match:
XP_023923041.1 (uncharacterized protein LOC112034451 [Quercus suber])
HSP 1 Score: 72.4 bits (176), Expect = 2.9e-09
Identity = 39/105 (37.14%), Positives = 63/105 (60.00%), Query Frame = 0
Query: 7 AVREGVRLARELGFQPFQIEKDSLRVHRLLTAPYEDLSELGVLLDEVKRCGLSGSANLVL 66
A + LAR+LGFQ +E DSL + + L + ++LS +G+L+D+VK G S +
Sbjct: 110 AALKAASLARDLGFQNVILEGDSLCLIKALKSAEDNLSPIGLLVDDVKWVGRSFEQLVYS 169
Query: 67 FTRRSGNMAAHSLAKLAMDFALDRVWLEEWPRDISPMISTECLVS 112
+R+GN AHSLAK A+ +VW+E+ P I+ ++ + +VS
Sbjct: 170 HVKRNGNSVAHSLAKNALRIPDSQVWMEDVPSHITSILDLDGIVS 214
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
A0A6J1DBJ7 | 9.4e-14 | 48.08 | uncharacterized protein LOC111018973 OS=Momordica charantia OX=3673 GN=LOC111018...
A0A6J1C467 | 2.5e-11 | 41.51 | uncharacterized protein LOC111007775 OS=Momordica charantia OX=3673 GN=LOC111007...
A0A6J1CIF1 | 2.2e-10 | 36.00 | uncharacterized protein LOC111011237 OS=Momordica charantia OX=3673 GN=LOC111011...
A0A1R3JXX0 | 1.1e-09 | 34.62 | RNase H domain-containing protein OS=Corchorus olitorius OX=93759 GN=COLO4_13164...
A0A2P5ENY9 | 2.4e-09 | 37.38 | Ribonuclease H-like domain containing protein OS=Trema orientale OX=63057 GN=Tor...