Moc10g15060 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc10g15060
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Gag protease polyprotein
Location: chr10: 11440259 .. 11440840 (+)
RNA-Seq Expression: Moc10g15060
Synteny: Moc10g15060
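The Location field can be turned into a feature length with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, as is usual for genome browser displays; the function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'chrom: start .. end (strand)' into its parts plus the span length.

    Assumes 1-based, inclusive coordinates, so length = end - start + 1.
    """
    m = re.fullmatch(r"(\w+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)
    return chrom, start, end, strand, end - start + 1

print(parse_location("chr10: 11440259 .. 11440840 (+)"))
# ('chr10', 11440259, 11440840, '+', 582)
```

Under that assumption the gene spans 582 bp on the plus strand of chr10.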
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGACTGGGTGGGCTTGGTGGAGGTGCAAACCTTGAGACAGTGGTTCAATCTCCTAGTCAGACGGAAATTCCACCAATGGTTCAACTTCCTGGTCAGATGGAGAATCCACCAATGGGTCAAACTTCTGGACAGTTAGAGCCTACTATAGCGTCGTCTTTAATGATGGAGACTTTCCAAACACTGTTTCAAACGATAGTCTACAACCAAATGACTTAGTTGGCTCAGGATCAAGGGAGCATGTCAACAAAGGCTAAATATCTACGAGATTTTAAGAAGTACGACACTCACTCTTTTGATGGACTATCTGTAGATTTGACGTTGGCAGAGGCTTGGTTGTCATCGATAGAGACTATCTCTCATTACATGAGGTGTCCGGAGGAACAAAAAGTGCAGTGTGTAGTCTTTATGCTGAAAGATGATGCCTTTTTGTGGTGGGAGTGTGCCAAGAGGTCTATTGATGTGAGTGGAGGCCCGGTCACATGGTTGCAGTTCAAGGAGGCTTTCTTTCAACAATATTACCCAACGATCACCCGGTTTAGGAAACAAGCGGATTCGAACTTGAAGCAAGGCAATAGATGA

mRNA sequence

ATGAGACTGGGTGGGCTTGGTGGAGGTGCAAACCTTGAGACAGTGGTTCAATCTCCTAGTCAGACGGAAATTCCACCAATGGTTCAACTTCCTGGTCAGATGGAGAATCCACCAATGGGTCAAACTTCTGGACACATGTCAACAAAGGCTAAATATCTACGAGATTTTAAGAAGTACGACACTCACTCTTTTGATGGACTATCTGTAGATTTGACGTTGGCAGAGGCTTGGTTGTCATCGATAGAGACTATCTCTCATTACATGAGGTGTCCGGAGGAACAAAAAGTGCAGTGTGTAGTCTTTATGCTGAAAGATGATGCCTTTTTGTGGTGGGAGTGTGCCAAGAGGTCTATTGATGTGAGTGGAGGCCCGGTCACATGGTTGCAGTTCAAGGAGGCTTTCTTTCAACAATATTACCCAACGATCACCCGGTTTAGGAAACAAGCGGATTCGAACTTGAAGCAAGGCAATAGATGA

Coding sequence (CDS)

ATGAGACTGGGTGGGCTTGGTGGAGGTGCAAACCTTGAGACAGTGGTTCAATCTCCTAGTCAGACGGAAATTCCACCAATGGTTCAACTTCCTGGTCAGATGGAGAATCCACCAATGGGTCAAACTTCTGGACACATGTCAACAAAGGCTAAATATCTACGAGATTTTAAGAAGTACGACACTCACTCTTTTGATGGACTATCTGTAGATTTGACGTTGGCAGAGGCTTGGTTGTCATCGATAGAGACTATCTCTCATTACATGAGGTGTCCGGAGGAACAAAAAGTGCAGTGTGTAGTCTTTATGCTGAAAGATGATGCCTTTTTGTGGTGGGAGTGTGCCAAGAGGTCTATTGATGTGAGTGGAGGCCCGGTCACATGGTTGCAGTTCAAGGAGGCTTTCTTTCAACAATATTACCCAACGATCACCCGGTTTAGGAAACAAGCGGATTCGAACTTGAAGCAAGGCAATAGATGA

Protein sequence

MRLGGLGGGANLETVVQSPSQTEIPPMVQLPGQMENPPMGQTSGHMSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVVFMLKDDAFLWWECAKRSIDVSGGPVTWLQFKEAFFQQYYPTITRFRKQADSNLKQGNR
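The relationship between the coding sequence and the protein sequence can be checked by translating the CDS codon by codon. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed: NCBI table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First four codons of the CDS above.
print(translate("ATGAGACTGGGT"))  # MRLG
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is a quick consistency check on the annotation.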
Homology
BLAST of Moc10g15060 vs. NCBI nr
Match: XP_022156662.1 (uncharacterized protein LOC111023512 [Momordica charantia])

HSP 1 Score: 185.7 bits (470), Expect = 3.2e-43
Identity = 93/133 (69.92%), Positives = 102/133 (76.69%), Query Frame = 0

Query: 27  MVQLPGQMENPPMGQTSGHMSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISH 86
           +VQ     +   + Q  G +S +AKYLRDFKKYD  SFDGLSVD  LAEAWLS +ETI  
Sbjct: 7   LVQTTVSNQMTQLTQNRGSISIEAKYLRDFKKYDPRSFDGLSVDPMLAEAWLSLMETIFR 66

Query: 87  YMRCPEEQKVQCVVFMLKDDAFLWWECAKRSIDVSGGPVTWLQFKEAFFQQYYPTITRFR 146
           YMRC EEQKVQC VFMLKDDAFLWWE  +R IDVSGGPVTWLQFKEAFFQQYYP IT +R
Sbjct: 67  YMRCLEEQKVQCDVFMLKDDAFLWWESTERPIDVSGGPVTWLQFKEAFFQQYYPAITWYR 126

Query: 147 KQAD-SNLKQGNR 159
           KQ +  NLKQ NR
Sbjct: 127 KQVEFLNLKQDNR 139
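The percentages in an HSP summary line are the match counts divided by the alignment length (gap columns included). A small sketch, using a hypothetical helper `hsp_fractions`, recomputes them from the fractions shown in the line:

```python
import re

def hsp_fractions(line):
    """Return {label: (matches, alignment_length, percent)} for each 'Label = a/b' pair."""
    out = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matches, length = int(num), int(den)
        out[label] = (matches, length, round(100 * matches / length, 2))
    return out

stats = hsp_fractions("Identity = 93/133 (69.92%), Positives = 102/133 (76.69%)")
print(stats["Identity"])   # (93, 133, 69.92)
print(stats["Positives"])  # (102, 133, 76.69)
```

The regular expression keys on whatever label precedes each fraction, so it does not depend on the exact spelling of the field names.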

BLAST of Moc10g15060 vs. NCBI nr
Match: XP_038891712.1 (uncharacterized protein LOC120081110 [Benincasa hispida])

HSP 1 Score: 137.5 bits (345), Expect = 1.0e-28
Identity = 68/136 (50.00%), Positives = 90/136 (66.18%), Query Frame = 0

Query: 25  PPMVQLPG-QMENPPMGQTSGHMSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIET 84
           PP  Q+P  Q +N    Q+    S +AK+LRDFKKY+  +F+G   D T AE W+S IET
Sbjct: 19  PPQDQVPQVQTQNMIQNQSMSGFSVEAKHLRDFKKYNPSTFNGSLKDPTNAELWISFIET 78

Query: 85  ISHYMRCPEEQKVQCVVFMLKDDAFLWWECAKRSIDVSGGPVTWLQFKEAFFQQYYPTIT 144
           I  YM+CPE+QKVQC VFML D A +WW+ A+R + V G PVTW QFKE F+ +Y+    
Sbjct: 79  IFRYMKCPEDQKVQCAVFMLSDKAQIWWQLAERMLGVGGDPVTWEQFKERFYAKYFSANL 138

Query: 145 RFRKQAD-SNLKQGNR 159
           R+ KQ +   L+QG+R
Sbjct: 139 RYNKQREFLELRQGHR 154

BLAST of Moc10g15060 vs. NCBI nr
Match: XP_038883046.1 (uncharacterized protein LOC120074107 [Benincasa hispida] >XP_038883047.1 uncharacterized protein LOC120074107 [Benincasa hispida] >XP_038883048.1 uncharacterized protein LOC120074107 [Benincasa hispida] >XP_038883049.1 uncharacterized protein LOC120074107 [Benincasa hispida] >XP_038883050.1 uncharacterized protein LOC120074107 [Benincasa hispida] >XP_038883051.1 uncharacterized protein LOC120074107 [Benincasa hispida] >XP_038883052.1 uncharacterized protein LOC120074107 [Benincasa hispida] >XP_038883053.1 uncharacterized protein LOC120074107 [Benincasa hispida] >XP_038883054.1 uncharacterized protein LOC120074107 [Benincasa hispida])

HSP 1 Score: 136.3 bits (342), Expect = 2.2e-28
Identity = 70/148 (47.30%), Positives = 93/148 (62.84%), Query Frame = 0

Query: 10  ANLETVVQSPSQTEIPPMVQLPGQMENPPMGQTSGHMSTKAKYLRDFKKYDTHSFDGLSV 69
           A +E   Q+ +Q E  P+ QL    +  P  +    +S +AK+LRDF+KYD  SFDG   
Sbjct: 32  AMVEQTAQTQAQEEPMPLEQL---QQTRPQNRDMQGLSLEAKHLRDFRKYDPRSFDGSLG 91

Query: 70  DLTLAEAWLSSIETISHYMRCPEEQKVQCVVFMLKDDAFLWWECAKRSIDVSGGPVTWLQ 129
           D T A+ WLSSIETI  +MRCPEE K+QC VFML  +  +WW   ++ ID  G   TW Q
Sbjct: 92  DPTKAKMWLSSIETIFRFMRCPEEHKLQCTVFMLIGNVEIWWCSVEKMIDTGGKLTTWEQ 151

Query: 130 FKEAFFQQYYPTITRFRKQAD-SNLKQG 157
           FKE F+++Y+   TR+ KQA+  NLKQG
Sbjct: 152 FKERFYEKYFSANTRYNKQAEFLNLKQG 176

BLAST of Moc10g15060 vs. NCBI nr
Match: XP_038896687.1 (uncharacterized protein LOC120084949 [Benincasa hispida])

HSP 1 Score: 135.6 bits (340), Expect = 3.8e-28
Identity = 61/119 (51.26%), Positives = 85/119 (71.43%), Query Frame = 0

Query: 41  QTSGHMSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVV 100
           Q+   +S +AK+LRDF+KY+  +F+G   DLT AE W+SSIETI  YM+CPE+QKVQC +
Sbjct: 5   QSMSGLSVEAKHLRDFRKYNPSTFNGSLKDLTNAELWISSIETIFRYMKCPEDQKVQCAI 64

Query: 101 FMLKDDAFLWWECAKRSIDVSGGPVTWLQFKEAFFQQYYPTITRFRKQAD-SNLKQGNR 159
           FML D A +WW+ A+R + V G PVTW QFKE F+ +Y+    ++ KQ +   L+QG+R
Sbjct: 65  FMLTDKAQIWWQSAERMLGVGGDPVTWEQFKERFYAKYFSANLKYNKQREFLELRQGHR 123

BLAST of Moc10g15060 vs. NCBI nr
Match: XP_038887018.1 (uncharacterized protein LOC120077183 [Benincasa hispida])

HSP 1 Score: 132.5 bits (332), Expect = 3.2e-27
Identity = 62/111 (55.86%), Positives = 80/111 (72.07%), Query Frame = 0

Query: 46  MSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVVFMLKD 105
           +S +AK+LRDF+K+D+ SFDG   D T A+ WLSSIETI H+MRC EE K+QC VFML  
Sbjct: 4   LSLEAKHLRDFRKFDSRSFDGSLRDPTKAKMWLSSIETIFHFMRCLEEHKLQCAVFMLTG 63

Query: 106 DAFLWWECAKRSIDVSGGPVTWLQFKEAFFQQYYPTITRFRKQAD-SNLKQ 156
           +A +WW  A++ ID SGG  TW QFKE F++ Y+   TR+ KQ +  NLKQ
Sbjct: 64  NAEIWWRLAEKIIDTSGGLATWEQFKEHFYEMYFSANTRYNKQTEFLNLKQ 114

BLAST of Moc10g15060 vs. ExPASy TrEMBL
Match: A0A6J1DSJ6 (uncharacterized protein LOC111023512 OS=Momordica charantia OX=3673 GN=LOC111023512 PE=4 SV=1)

HSP 1 Score: 185.7 bits (470), Expect = 1.5e-43
Identity = 93/133 (69.92%), Positives = 102/133 (76.69%), Query Frame = 0

Query: 27  MVQLPGQMENPPMGQTSGHMSTKAKYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISH 86
           +VQ     +   + Q  G +S +AKYLRDFKKYD  SFDGLSVD  LAEAWLS +ETI  
Sbjct: 7   LVQTTVSNQMTQLTQNRGSISIEAKYLRDFKKYDPRSFDGLSVDPMLAEAWLSLMETIFR 66

Query: 87  YMRCPEEQKVQCVVFMLKDDAFLWWECAKRSIDVSGGPVTWLQFKEAFFQQYYPTITRFR 146
           YMRC EEQKVQC VFMLKDDAFLWWE  +R IDVSGGPVTWLQFKEAFFQQYYP IT +R
Sbjct: 67  YMRCLEEQKVQCDVFMLKDDAFLWWESTERPIDVSGGPVTWLQFKEAFFQQYYPAITWYR 126

Query: 147 KQAD-SNLKQGNR 159
           KQ +  NLKQ NR
Sbjct: 127 KQVEFLNLKQDNR 139

BLAST of Moc10g15060 vs. ExPASy TrEMBL
Match: A0A5D3E4V0 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold455G001880 PE=4 SV=1)

HSP 1 Score: 124.0 bits (310), Expect = 5.5e-25
Identity = 66/155 (42.58%), Positives = 88/155 (56.77%), Query Frame = 0

Query: 2   RLGGLGGGANLETVVQSPSQTEIPPMVQLPGQMENPPMGQTSGHMSTKAKYLRDFKKYDT 61
           R GG GG        +      + P VQ   Q  NP        +S +AK+LRDF+KY+ 
Sbjct: 26  RRGGRGG--------RGRGAGRVQPEVQPVAQATNPTAPVVPDQLSAEAKHLRDFRKYNP 85

Query: 62  HSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVVFMLKDDAFLWWECAKRSIDVS 121
            +FDG   D T A+ WLSS+ETI  YM+CPE+QKVQC VFML D    WWE  +R +   
Sbjct: 86  TTFDGSLEDPTRAQLWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTTWWETIERMLGGD 145

Query: 122 GGPVTWLQFKEAFFQQYYPTITR-FRKQADSNLKQ 156
            G +TW QFKE+FF +++    R  ++Q   NL+Q
Sbjct: 146 VGQITWQQFKESFFAKFFSASLRDAKRQEFLNLEQ 172

BLAST of Moc10g15060 vs. ExPASy TrEMBL
Match: A0A5A7SW90 (Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold285G002460 PE=4 SV=1)

HSP 1 Score: 124.0 bits (310), Expect = 5.5e-25
Identity = 66/155 (42.58%), Positives = 88/155 (56.77%), Query Frame = 0

Query: 2   RLGGLGGGANLETVVQSPSQTEIPPMVQLPGQMENPPMGQTSGHMSTKAKYLRDFKKYDT 61
           R GG GG        +      + P VQ   Q  NP        +S +AK+LRDF+KY+ 
Sbjct: 26  RRGGRGG--------RGRGAGRVQPEVQPVAQATNPTAPVVPDQLSAEAKHLRDFRKYNP 85

Query: 62  HSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVVFMLKDDAFLWWECAKRSIDVS 121
            +FDG   D T A+ WLSS+ETI  YM+CPE+QKVQC VFML D    WWE  +R +   
Sbjct: 86  TTFDGSLEDPTRAQLWLSSLETIFRYMKCPEDQKVQCAVFMLTDRGTTWWETIERMLGGD 145

Query: 122 GGPVTWLQFKEAFFQQYYPTITR-FRKQADSNLKQ 156
            G +TW QFKE+FF +++    R  ++Q   NL+Q
Sbjct: 146 VGQITWQQFKESFFAKFFSASLRDAKRQEFLNLEQ 172

BLAST of Moc10g15060 vs. ExPASy TrEMBL
Match: A0A5D3BCA2 (Gag protease polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold110G002280 PE=4 SV=1)

HSP 1 Score: 123.2 bits (308), Expect = 9.4e-25
Identity = 61/146 (41.78%), Positives = 90/146 (61.64%), Query Frame = 0

Query: 13  ETVVQSPSQTEIPPMVQLPGQMENPPMGQ-TSGHMSTKAKYLRDFKKYDTHSFDGLSVDL 72
           + ++Q   Q +  P    P     P + Q     +S +AK+LRDF+KY++ +F+G   D 
Sbjct: 124 DLIMQMREQQQPAPPAPAPAPASVPVVPQVVPNQLSAEAKHLRDFRKYNSTTFNGSLEDP 183

Query: 73  TLAEAWLSSIETISHYMRCPEEQKVQCVVFMLKDDAFLWWECAKRSIDVSGGPVTWLQFK 132
           T A+ WLSS+ETI  YM+CPE+QKVQCVVFML D   +WWE  +R +    G +TW QFK
Sbjct: 184 TRAQLWLSSLETIFRYMKCPEDQKVQCVVFMLTDRGTVWWETTERMLGGDVGQITWQQFK 243

Query: 133 EAFFQQYYPTITR-FRKQADSNLKQG 157
           E+F+ +++    R  ++Q   NL+QG
Sbjct: 244 ESFYAKFFSASLRDAKRQEFLNLEQG 269

BLAST of Moc10g15060 vs. ExPASy TrEMBL
Match: A0A5A7T014 (Gag protease polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold20G001610 PE=4 SV=1)

HSP 1 Score: 123.2 bits (308), Expect = 9.4e-25
Identity = 61/146 (41.78%), Positives = 90/146 (61.64%), Query Frame = 0

Query: 13  ETVVQSPSQTEIPPMVQLPGQMENPPMGQ-TSGHMSTKAKYLRDFKKYDTHSFDGLSVDL 72
           + ++Q   Q +  P    P     P + Q     +S +AK+LRDF+KY++ +F+G   D 
Sbjct: 373 DLIMQMREQQQPAPPAPAPAPASVPVVPQVVPNQLSAEAKHLRDFRKYNSTTFNGSLEDP 432

Query: 73  TLAEAWLSSIETISHYMRCPEEQKVQCVVFMLKDDAFLWWECAKRSIDVSGGPVTWLQFK 132
           T A+ WLSS+ETI  YM+CPE+QKVQCVVFML D   +WWE  +R +    G +TW QFK
Sbjct: 433 TRAQLWLSSLETIFRYMKCPEDQKVQCVVFMLTDRGTVWWETTERMLGGDVGQITWQQFK 492

Query: 133 EAFFQQYYPTITR-FRKQADSNLKQG 157
           E+F+ +++    R  ++Q   NL+QG
Sbjct: 493 ESFYAKFFSASLRDAKRQEFLNLEQG 518

The following BLAST results are available for this feature:
BLAST vs. NCBI nr
Match Name      E-value  Identity  Description
XP_022156662.1  3.2e-43  69.92     uncharacterized protein LOC111023512 [Momordica charantia]
XP_038891712.1  1.0e-28  50.00     uncharacterized protein LOC120081110 [Benincasa hispida]
XP_038883046.1  2.2e-28  47.30     uncharacterized protein LOC120074107 [Benincasa hispida] >XP_038883047.1 unchara...
XP_038896687.1  3.8e-28  51.26     uncharacterized protein LOC120084949 [Benincasa hispida]
XP_038887018.1  3.2e-27  55.86     uncharacterized protein LOC120077183 [Benincasa hispida]

BLAST vs. ExPASy TrEMBL
Match Name  E-value  Identity  Description
A0A6J1DSJ6  1.5e-43  69.92     uncharacterized protein LOC111023512 OS=Momordica charantia OX=3673 GN=LOC111023...
A0A5D3E4V0  5.5e-25  42.58     Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold45...
A0A5A7SW90  5.5e-25  42.58     Reverse transcriptase OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold28...
A0A5D3BCA2  9.4e-25  41.78     Gag protease polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffol...
A0A5A7T014  9.4e-25  41.78     Gag protease polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffol...
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR Term   IPR Description             Source       Source Term  Source Description   Alignment
IPR005162  Retrotransposon gag domain  PFAM         PF03732      Retrotrans_gag       coord: 99..157; e-value: 5.8E-5; score: 23.3
None       No IPR available            MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 1..26
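
The alignment coordinates in the InterPro table can be used to cut the annotated regions out of the protein sequence. The sketch below assumes the coordinates are 1-based and inclusive residue positions; `region` is a hypothetical helper, and the protein string is copied from the Sequences section.

```python
def region(seq, start, end):
    """Return residues start..end of seq, treating coordinates as 1-based and inclusive."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"coordinates {start}..{end} fall outside 1..{len(seq)}")
    return seq[start - 1:end]

PROTEIN = (
    "MRLGGLGGGANLETVVQSPSQTEIPPMVQLPGQMENPPMGQTSGHMSTKA"
    "KYLRDFKKYDTHSFDGLSVDLTLAEAWLSSIETISHYMRCPEEQKVQCVV"
    "FMLKDDAFLWWECAKRSIDVSGGPVTWLQFKEAFFQQYYPTITRFRKQAD"
    "SNLKQGNR"
)

gag_domain = region(PROTEIN, 99, 157)   # PF03732 Retrotrans_gag match
disordered = region(PROTEIN, 1, 26)     # mobidb-lite disorder prediction

print(len(gag_domain), gag_domain)
print(len(disordered), disordered)
```

Under that coordinate assumption the Pfam match covers 59 residues and the predicted disordered region covers the first 26.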

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Moc10g15060.1  Moc10g15060.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009987 cellular process
biological_process GO:0043170 macromolecule metabolic process
biological_process GO:0006807 nitrogen compound metabolic process
biological_process GO:0044238 primary metabolic process
cellular_component GO:0110165 cellular anatomical entity
molecular_function GO:0016787 hydrolase activity