Moc05g31440 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc05g31440
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Ulp1-like peptidase
Location: chr5: 23599568 .. 23603249 (+)
RNA-Seq Expression: Moc05g31440
Synteny: Moc05g31440
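
The location string above can be turned into numeric coordinates for downstream use. The following is a minimal sketch, not part of the database record; the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (the usual GFF convention), which this page does not state.

```python
import re

def parse_location(text):
    """Split a string like 'chr5: 23599568 .. 23603249 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

seqid, start, end, strand = parse_location("chr5: 23599568 .. 23603249 (+)")
print(seqid, strand, span_length(start, end))  # chr5 + 3682
```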
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGCTTCAAGAACCTACATCACATTCTCCAAAAACCGACTCCCACAGCACAAAATCCTCGAAAACACTAACTGAAACTCAAGATCGAAATCGAATTTCAGACTCAGGCCTAGGGTTTCGAGCGAAGAAGAAGAAGAAGAAGGGTGGTAGTAGATCTAAAACGAGCGGTTCCATTAATACTGTGAGCTTCCAGTAAAATCGTGTCCACTCTAGGTACTACGTGGCACGAGGAGATATTCTTGGATTTTACACGTGTCGAGTACTGAATGGACAGGATTTGAGATTAAGGAGACTGTGGGCCCAGAATTATTGGCGAGCAAGGGCTTTCTCGTAATTGTATGGGGACGGTAGTAGTACAAGGTCGATCGGGAACATGGGAAAATGACTCATATAACCCTCCAGGTTTCTATTTTGCTCAAATAACCCTGCCCTCTAAAATTGTTTTCAATGACCTACTTTATTATTTTAATTAATTTAAAAAAGAATAAAATTACATTTGTACCCCTGAAGTCAAAAAAACCAACCGTTGGCCTTTCCCCTCTTTCTTTTCCCATTCGTTTTTCCTTCGTCTTCTTCTTCTTATCCTCTTCGATTCCCTCCGTTTCCTTCGGCCATCTCCTTGACTTAGACCTCGTATTTAACGGGCCACTGATACACAACATCCTACTTAGGGAGATCAAGGATAGTACACCTGACACCATTAGCTTCAACCTGTTTGGGAGTAAGGTGTCATTTGGGCGGAGGGAGTTCGACATTATTTCTGGGCTTAAGTATGATAGGGGTCTAGTTAGAAAAGACACATCTCCCCACAGACTTAGGGCTCTTTACTTTAATGATAGCAACGAAGTTCTGTTGAGTGAATTTGAGAAGATTTATTTAGCCGCACGGTTCGAGGATGACTTCGACGCGGTCAAGGTATCTATTGTATACTTAGTAGAGTTAGTTCTGTTGGGGAGGGAGAGGACCCTGAAGTACGACTATACATTGCTGGGAATAGTCGATGATTGGGAAACGTGCTGCAACCACGACTGGGGGATGTTGTCCTTTGATAAGACTATATATAGTCTGAAGCGCGGCCCGACGAAGAGGTCGAAGGATGGCGGGTTCAGGAAATCGTACAGTCTCTACGGTTTCCCTTGGGCGTTCCAGGTATGAGCCTATATACTTGTTTACAGTTTTCGTTCCTTTTAACCAATCAATAACGTCGAGCACTTATATACTTACAGGTATGGCCTACGAGACTATATCTTCCTTATCTGGACGTGTGGCCAATAAGGTAAATTCGGACGTCGTGCCACGGATTCTTCGATGGAGGTGCGGCCATTCAACTGCATGGCATGTGCTTGATAGAGAGATTTTTCGATCTAGCACAGTAAGTTCATTCTTGTTTATACTTATAAGTTTCTTTAATTTCCATATTTATGCGGACACATTCATTACCTTTGGCGTAGGGAAGAACTCGATCAATAGAAGCAACGGATGCTGAGACGACCTTCCTAGATAGGACGTTCGAACCACCGGAGCCCGAAGATGAGGACGAAGTCTCGCGCGAGAACAATGCTACTGAACCATCAAGTGCATGTGCAGGACCAGAGAAGGACGACGGGAGACAAGGAGGAATGTCGACAAAGGTGTTAGAGAAGACGTCCATGAAGAGGCTGAGGAGAAAGAGGAGAATGGACGTGGTAAGAAGGTACGCATGTCTAGTGTGCGTCTGAGGAAAGTTGAGAAACGGATCAAGCGCATGGACAAGCGCATGGACGACGGGTTCGAAGGTATTAAGACTGAATTAAAATACATCCGGAAGTTCTTGCGAAGAATCGCTAAGGTAATAACAGTTTTCACTTATAATAACTACGATGTCATTTTCATATAGCGATAACTAACTTGTTATCCATTGTATAGGGTTTGCCTGTCGACCCGAATGACATGAGAAGAGGCAGCAACCGTGATGGTACTGGACAAGGAGATGGTCCGAGTGATGGTCCTGGGCCAGGA
GATGGTCCGAGTGATGCACCCAGTGGGGGACCTAGTGATCGAGGTGATGGTCCCAAGGATGGCAATGGTTCTACACCATCTCCTAGGGACGTGGACGACACAGATGACATGATCATCAATCCCCCCCCATGTGATCGCGCAGGAGACAAAGGAACATCAGTCCGGCCAAACGACCGGGAACCAGGTTGAACCAGAGGGCGAGGTATGTGTTCGTAATCTATAAGTTCAAATTCGATCATAAGCATGGTCTAACGAATGTTACTCTGCCCTGTATCAACAACATTCACCTGCCTTGACGCACAAGGAAGACATGGGTACAGAGGACGTGCACAAGGAAGATATTGGTCTACATGCGCGTTGTGAGGCTGCCCCTCTCGAGCAGACTCCTGTTCAGAGCAGACAGGTGGACCATATTACGATCGATTCGCATCCGTTAGAGTCATCTATGGACGACGAGGTATTTATTTTTGACTTTTCAACTCTAAACACAAGATTTTATTTGCTAACATAATTTATCCAGGACGAATACGCCGAGGACTTCACGGACTCTGATGCGAAAGAGCCGGGAGCGACGTCGCAACCAGACCTGGATGAGGTATGTGCGGTATCGCAGCCTGTTGAACGTCAGAACCCTCGGCGGGGGTCTCGGAAGAGGAAGCTCCCATGAAAGCTTCGCGGGTCGTTCAATGTCATGGTGGACGGGAAGCGGAAGAAGGTAATGCGATATGACCCACTAGTCCACGTCCCCTCTGAACAAGTCCAGAAGTTTCATGCTTGGCTGGCGAACCCTAACACCGACCGCGCCACTCGCAAATCATGCTACGGTGATCGAGGAAAGACATGGTTTCGTGACCTTATCAACTCGGGCAAGTGGATGACGAGTGAGGTACGGGTTCGAAACCCCAAATATGTGTTGTACTGGTATTTAGTTTTTTTCGAACAACAATGACTTATATAGGTGATCGATTCGTTGTTCATGTTCACGCGGAACAAACTCGAGCAACGGCATGACTTGTGTTCTCGAAGATTCACCACCGGTGACATTGTCCTTGCGGTATGTAATATATATATATATATATCTCATACGCTGACCGTTTTTACAAATTCATGTGCTAACCGAAATTTTTTATTCCTTGCAGAACTTTCTTCGACGAACAGACGGACTATATCAACGCATGACAGCCCCGAACGCTGTACCTGCGAGAGTTGCAACGGAATACGATTGGGCTGGGAAGTGTAAGACCATCCTGAGCTATATGGACGGGACGCACACAGACTATCAAACACGATGGCTTGATCTGGATGCTATTTACCTGCCATACAACATCGGTGGCAACCATTGGATTATGCTACACATCGATCTGCAGGAGGGTGAGATCATTGTGTGGGATTCTATAAGGTCGATGACACCCTTTCCAGCTCTGGAGTCCGAGTTGAGGCCAATGACTGTTGTCCTACCAACGTTTATGCACAGGGCCGGTGTTCAGATACTGAGGCCGACACTACCGAATACGCCATGGCGCATTCGTCAAGTAACGTCCGCGCCCCAGCAAAGCGAGTCTGGTGACTGTGGAATGTTTTGCGTTAAATATTTCGAGTATGATGTAACAGGGTCAAATATGACCAGCTTAACTCAGGACAACATTAGTTTTTTTAGGGAGAAATTGGCCATAGAAATGTGA

mRNA sequence

ATGGAGCTTCAAGAACCTACATCACATTCTCCAAAAACCGACTCCCACAGCACAAAATCCTCGAAAACACTAACTGAAACTCAAGATCGAAATCGAATTTCAGACTCAGGCCTAGGGGAGATCAAGGATAGTACACCTGACACCATTAGCTTCAACCTGTTTGGGAGTAAGGTGTCATTTGGGCGGAGGGAGTTCGACATTATTTCTGGGCTTAAGTATGATAGGGGTCTAGTTAGAAAAGACACATCTCCCCACAGACTTAGGGCTCTTTACTTTAATGATAGCAACGAAGTTCTGTTGAGTGAATTTGAGAAGATTTATTTAGCCGCACGGTTCGAGGATGACTTCGACGCGGTCAAGGTATCTATTGTATACTTAGTAGAGTTAGTTCTGTTGGGGAGGGAGAGGACCCTGAAGTACGACTATACATTGCTGGGAATAGTCGATGATTGGGAAACGTGCTGCAACCACGACTGGGGGATGTTGTCCTTTGATAAGACTATATATAGTCTGAAGCGCGGCCCGACGAAGAGGTCGAAGGATGGCGGGTTCAGGAAATCGTACAGTCTCTACGGTTTCCCTTGGGCGTTCCAGGTAAATTCGGACGTCGTGCCACGGATTCTTCGATGGAGGTGCGGCCATTCAACTGCATGGCATGTGCTTGATAGAGAGATTTTTCGATCTAGCACAGGAAGAACTCGATCAATAGAAGCAACGGATGCTGAGACGACCTTCCTAGATAGGACGTTCGAACCACCGGAGCCCGAAGATGAGGACGAAGTCTCGCGCGAGAACAATGCTACTGAACCATCAAGTGCATGTGCAGGACCAGAGAAGGACGACGGGAGACAAGGAGGAATGTCGACAAAGGTGTTAGAGAAGACGTCCATGAAGAGGCTGAGGAGAAAGAGGAGAATGGACGTGCGCATGGACAAGCGCATGGACGACGGGTTCGAAGGTATTAAGACTGAATTAAAATACATCCGGAAGTTCTTGCGAAGAATCGCTAAGGGTTTGCCTGTCGACCCGAATGACATGAGAAGAGGCAGCAACCGTGATGGTACTGGACAAGGAGATGGTCCGAGTGATGGTCCTGGGCCAGGAGATGGTCCGAGTGATGCACCCAGTGGGGGACCTAGTGATCGAGGTGATGGTCCCAAGGATGGCAATGGTTCTACACCATCTCCTAGGGACGTGGACGACACAGATGACATGATCATCAATCCCCCCCCATGTGATCGCGCAGGAGACAAAGGAACATCAGTCCGGCCAAACGACCGGGAACCAGAGGACGTGCACAAGGAAGATATTGGTCTACATGCGCGTTGTGAGGCTGCCCCTCTCGAGCAGACTCCTGTTCAGAGCAGACAGGTGGACCATATTACGATCGATTCGCATCCGTTAGAGTCATCTATGGACGACGAGGACGAATACGCCGAGGACTTCACGGACTCTGATGCGAAAGAGCCGGGAGCGACGTCGCAACCAGACCTGGATGAGCTTCGCGGGTCGTTCAATGTCATGGTGGACGGGAAGCGGAAGAAGGTAATGCGATATGACCCACTAGTCCACGTCCCCTCTGAACAAGTCCAGAAGTTTCATGCTTGGCTGGCGAACCCTAACACCGACCGCGCCACTCGCAAATCATGCTACGGTGATCGAGGAAAGACATGGTTTCGTGACCTTATCAACTCGGGCAAGTGGATGACGAGTGAGGTGATCGATTCGTTGTTCATGTTCACGCGGAACAAACTCGAGCAACGGCATGACTTGTGTTCTCGAAGATTCACCACCGGTGACATTAACTTTCTTCGACGAACAGACGGACTATATCAACGCATGACAGCCCCGAACGCTGTACCTGCGAGAGTTGCAACGGAATACGATTGGGCTGGGAAGTGTAAGACCATCCTGAGCTATATGGACGGGACGCACACAGACTATCAAACACGATGGCTTGATCTGGATGCTATTTACCTGCCATACAACATCGGTGGCAA
CCATTGGATTATGCTACACATCGATCTGCAGGAGGGTGAGATCATTGTGTGGGATTCTATAAGGTCGATGACACCCTTTCCAGCTCTGGAGTCCGAGTTGAGGCCAATGACTGTTGTCCTACCAACGTTTATGCACAGGGCCGGTGTTCAGATACTGAGGCCGACACTACCGAATACGCCATGGCGCATTCGTCAAGTAACGTCCGCGCCCCAGCAAAGCGAGTCTGGTGACTGTGGAATGTTTTGCGTTAAATATTTCGAGTATGATGTAACAGGGTCAAATATGACCAGCTTAACTCAGGACAACATTAGTTTTTTTAGGGAGAAATTGGCCATAGAAATGTGA

Coding sequence (CDS)

ATGGAGCTTCAAGAACCTACATCACATTCTCCAAAAACCGACTCCCACAGCACAAAATCCTCGAAAACACTAACTGAAACTCAAGATCGAAATCGAATTTCAGACTCAGGCCTAGGGGAGATCAAGGATAGTACACCTGACACCATTAGCTTCAACCTGTTTGGGAGTAAGGTGTCATTTGGGCGGAGGGAGTTCGACATTATTTCTGGGCTTAAGTATGATAGGGGTCTAGTTAGAAAAGACACATCTCCCCACAGACTTAGGGCTCTTTACTTTAATGATAGCAACGAAGTTCTGTTGAGTGAATTTGAGAAGATTTATTTAGCCGCACGGTTCGAGGATGACTTCGACGCGGTCAAGGTATCTATTGTATACTTAGTAGAGTTAGTTCTGTTGGGGAGGGAGAGGACCCTGAAGTACGACTATACATTGCTGGGAATAGTCGATGATTGGGAAACGTGCTGCAACCACGACTGGGGGATGTTGTCCTTTGATAAGACTATATATAGTCTGAAGCGCGGCCCGACGAAGAGGTCGAAGGATGGCGGGTTCAGGAAATCGTACAGTCTCTACGGTTTCCCTTGGGCGTTCCAGGTAAATTCGGACGTCGTGCCACGGATTCTTCGATGGAGGTGCGGCCATTCAACTGCATGGCATGTGCTTGATAGAGAGATTTTTCGATCTAGCACAGGAAGAACTCGATCAATAGAAGCAACGGATGCTGAGACGACCTTCCTAGATAGGACGTTCGAACCACCGGAGCCCGAAGATGAGGACGAAGTCTCGCGCGAGAACAATGCTACTGAACCATCAAGTGCATGTGCAGGACCAGAGAAGGACGACGGGAGACAAGGAGGAATGTCGACAAAGGTGTTAGAGAAGACGTCCATGAAGAGGCTGAGGAGAAAGAGGAGAATGGACGTGCGCATGGACAAGCGCATGGACGACGGGTTCGAAGGTATTAAGACTGAATTAAAATACATCCGGAAGTTCTTGCGAAGAATCGCTAAGGGTTTGCCTGTCGACCCGAATGACATGAGAAGAGGCAGCAACCGTGATGGTACTGGACAAGGAGATGGTCCGAGTGATGGTCCTGGGCCAGGAGATGGTCCGAGTGATGCACCCAGTGGGGGACCTAGTGATCGAGGTGATGGTCCCAAGGATGGCAATGGTTCTACACCATCTCCTAGGGACGTGGACGACACAGATGACATGATCATCAATCCCCCCCCATGTGATCGCGCAGGAGACAAAGGAACATCAGTCCGGCCAAACGACCGGGAACCAGAGGACGTGCACAAGGAAGATATTGGTCTACATGCGCGTTGTGAGGCTGCCCCTCTCGAGCAGACTCCTGTTCAGAGCAGACAGGTGGACCATATTACGATCGATTCGCATCCGTTAGAGTCATCTATGGACGACGAGGACGAATACGCCGAGGACTTCACGGACTCTGATGCGAAAGAGCCGGGAGCGACGTCGCAACCAGACCTGGATGAGCTTCGCGGGTCGTTCAATGTCATGGTGGACGGGAAGCGGAAGAAGGTAATGCGATATGACCCACTAGTCCACGTCCCCTCTGAACAAGTCCAGAAGTTTCATGCTTGGCTGGCGAACCCTAACACCGACCGCGCCACTCGCAAATCATGCTACGGTGATCGAGGAAAGACATGGTTTCGTGACCTTATCAACTCGGGCAAGTGGATGACGAGTGAGGTGATCGATTCGTTGTTCATGTTCACGCGGAACAAACTCGAGCAACGGCATGACTTGTGTTCTCGAAGATTCACCACCGGTGACATTAACTTTCTTCGACGAACAGACGGACTATATCAACGCATGACAGCCCCGAACGCTGTACCTGCGAGAGTTGCAACGGAATACGATTGGGCTGGGAAGTGTAAGACCATCCTGAGCTATATGGACGGGACGCACACAGACTATCAAACACGATGGCTTGATCTGGATGCTATTTACCTGCCATACAACATCGGTGGCAA
CCATTGGATTATGCTACACATCGATCTGCAGGAGGGTGAGATCATTGTGTGGGATTCTATAAGGTCGATGACACCCTTTCCAGCTCTGGAGTCCGAGTTGAGGCCAATGACTGTTGTCCTACCAACGTTTATGCACAGGGCCGGTGTTCAGATACTGAGGCCGACACTACCGAATACGCCATGGCGCATTCGTCAAGTAACGTCCGCGCCCCAGCAAAGCGAGTCTGGTGACTGTGGAATGTTTTGCGTTAAATATTTCGAGTATGATGTAACAGGGTCAAATATGACCAGCTTAACTCAGGACAACATTAGTTTTTTTAGGGAGAAATTGGCCATAGAAATGTGA

Protein sequence

MELQEPTSHSPKTDSHSTKSSKTLTETQDRNRISDSGLGEIKDSTPDTISFNLFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSNEVLLSEFEKIYLAARFEDDFDAVKVSIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLKRGPTKRSKDGGFRKSYSLYGFPWAFQVNSDVVPRILRWRCGHSTAWHVLDREIFRSSTGRTRSIEATDAETTFLDRTFEPPEPEDEDEVSRENNATEPSSACAGPEKDDGRQGGMSTKVLEKTSMKRLRRKRRMDVRMDKRMDDGFEGIKTELKYIRKFLRRIAKGLPVDPNDMRRGSNRDGTGQGDGPSDGPGPGDGPSDAPSGGPSDRGDGPKDGNGSTPSPRDVDDTDDMIINPPPCDRAGDKGTSVRPNDREPEDVHKEDIGLHARCEAAPLEQTPVQSRQVDHITIDSHPLESSMDDEDEYAEDFTDSDAKEPGATSQPDLDELRGSFNVMVDGKRKKVMRYDPLVHVPSEQVQKFHAWLANPNTDRATRKSCYGDRGKTWFRDLINSGKWMTSEVIDSLFMFTRNKLEQRHDLCSRRFTTGDINFLRRTDGLYQRMTAPNAVPARVATEYDWAGKCKTILSYMDGTHTDYQTRWLDLDAIYLPYNIGGNHWIMLHIDLQEGEIIVWDSIRSMTPFPALESELRPMTVVLPTFMHRAGVQILRPTLPNTPWRIRQVTSAPQQSESGDCGMFCVKYFEYDVTGSNMTSLTQDNISFFREKLAIEM
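
The protein sequence corresponds to a translation of the coding sequence. As a hedged illustration, the sketch below translates a CDS with the standard genetic code (NCBI translation table 1); that the annotation pipeline used this table is an assumption, and `translate` is a hypothetical helper. The CDS is wrapped over two lines on this page, so the lines need to be joined before translating the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First 30 and last 15 nucleotides of the CDS shown above.
print(translate("ATGGAGCTTCAAGAACCTACATCACATTCT"))  # MELQEPTSHS
print(translate("GCCATAGAAATGTGA"))                 # AIEM
```

Both fragments agree with the start (MELQEPTSHS) and end (AIEM) of the protein sequence listed above.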
Homology
BLAST of Moc05g31440 vs. NCBI nr
Match: XP_022154561.1 (uncharacterized protein LOC111021802 [Momordica charantia])

HSP 1 Score: 319.7 bits (818), Expect = 7.0e-83
Identity = 169/328 (51.52%), Positives = 220/328 (67.07%), Query Frame = 0

Query: 38  LGEIKDSTPDTISFNLFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSNE 97
           L E+++STP+TISFNLF  ++SF R +F +ISGLKY R  VR++T PHRL  LYFND  +
Sbjct: 27  LREVEESTPNTISFNLFRRRMSFERTKFHLISGLKYVRTPVRENTLPHRLMTLYFNDKTD 86

Query: 98  VLLSEFEKIYLAARFEDDFDAVKVSIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNH 157
           ++LS+FEK+Y AARFEDD+D VKV IVY+V + LLGRER +K+D+TLLGIVDDWE CCN+
Sbjct: 87  LVLSDFEKMYTAARFEDDYDVVKVLIVYMVGIGLLGRERMVKFDHTLLGIVDDWEVCCNY 146

Query: 158 DWGMLSFDKTIYSLKRGPTKRSKDGGFRKSYSLYGFPWAFQVNS---------------- 217
           +W  LSF+KTI SL+RGP K SKDG  RKSYSLYGFPW FQV +                
Sbjct: 147 NWASLSFEKTINSLQRGPLKMSKDGKLRKSYSLYGFPWVFQVWAYDTISSLSMRVANKVL 206

Query: 218 -DVVPRILRWRCGHSTAWHVLDREIFRSSTGRTRSIEATDAETTFLDRTFEPPEPEDEDE 277
            D VP I +WR  HSTAWHVLDR+IF S+ GRTR+++ TD ET+FL+R+F+PP  +D+D 
Sbjct: 207 YDTVPHIFKWRYDHSTAWHVLDRDIFCSTKGRTRTLDETDVETSFLNRSFDPPVSDDDDV 266

Query: 278 VSRENNATEPSSACAGPEKDDGRQGGMSTKVLEK----------------TSMKRLRRKR 333
           +    +   PS+   G + DD  +G    +++EK                 S  RL+R  
Sbjct: 267 MEERGDNAGPSAVREGSQYDDESRGA-DVEMVEKDAELENEETKGKNKVCISTGRLKRVE 326
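
The Identity and Positives figures in each HSP header are counts divided by the alignment length. The sketch below reproduces the percentages for this hit and counts identical columns in a pair of aligned rows; `percent` and `count_identities` are hypothetical helper names, and the aligned rows used are the final block of the XP_022159253.1 alignment on this page.

```python
def percent(matches, aligned_length):
    """Percentage as printed in the HSP header, rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)

def count_identities(query_row, sbjct_row):
    """Count columns where both aligned rows carry the same residue."""
    if len(query_row) != len(sbjct_row):
        raise ValueError("aligned rows must have equal length")
    return sum(q == s and q != "-" for q, s in zip(query_row, sbjct_row))

print(percent(169, 328))  # 51.52
print(percent(220, 328))  # 67.07

query = "HDWGMLSFDKTIYSLKRGPTKRSKDGGFRKSYSLYGFP"
sbjct = "HDWGMMSFDKTIYSLKRGPTKRSKDGGFRKLYSLYGFP"
print(count_identities(query, sbjct))  # 36
```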

BLAST of Moc05g31440 vs. NCBI nr
Match: XP_022159253.1 (uncharacterized protein LOC111025666 [Momordica charantia] >XP_022159254.1 uncharacterized protein LOC111025667 [Momordica charantia])

HSP 1 Score: 285.4 bits (729), Expect = 1.5e-72
Identity = 138/158 (87.34%), Positives = 147/158 (93.04%), Query Frame = 0

Query: 38  LGEIKD-STPDTISFNLFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSN 97
           L EI+D STP+TISFNLFGSKV F RREFDIISGLKYDR  VRKDTSPHRLRALYFNDSN
Sbjct: 65  LREIEDSSTPNTISFNLFGSKVLFRRREFDIISGLKYDRSPVRKDTSPHRLRALYFNDSN 124

Query: 98  EVLLSEFEKIYLAARFEDDFDAVKVSIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCN 157
           ++LLS+FEKIY+  RFEDDFDA K+SIVYL+ELVLLGRERTLKYDYTLLGIVDD ETCCN
Sbjct: 125 DILLSDFEKIYIVTRFEDDFDAAKISIVYLMELVLLGRERTLKYDYTLLGIVDDCETCCN 184

Query: 158 HDWGMLSFDKTIYSLKRGPTKRSKDGGFRKSYSLYGFP 195
           HDWGM+SFDKTIYSLKRGPTKRSKDGGFRK YSLYGFP
Sbjct: 185 HDWGMMSFDKTIYSLKRGPTKRSKDGGFRKLYSLYGFP 222

BLAST of Moc05g31440 vs. NCBI nr
Match: XP_022154364.1 (uncharacterized protein LOC111021646 [Momordica charantia])

HSP 1 Score: 269.6 bits (688), Expect = 8.3e-68
Identity = 116/206 (56.31%), Positives = 157/206 (76.21%), Query Frame = 0

Query: 579 MFTRNKLEQRHDLCSRRFTTGDI---NFLRRTDGLYQRMTAPNAVPARVATEYDWAGKCK 638
           MF  NKL+ R +LC R+FTTGD+   NFLR TDG+Y  M +PN + +RVA++YDW G+  
Sbjct: 1   MFVCNKLKLRPNLCRRKFTTGDVLISNFLRSTDGVYVMMQSPNVIASRVASDYDWEGRAW 60

Query: 639 TILSYMDGTHTDYQTRWLDLDAIYLPYNIGGNHWIMLHIDLQEGEIIVWDSIRSMTPFPA 698
           ++LSY+DGTH+D  TRW+D+DA+YLPYNIGG HWI++ ID  EGE+IVWDS  +MTP P 
Sbjct: 61  SMLSYIDGTHSDNDTRWMDVDAVYLPYNIGGVHWIVICIDFDEGELIVWDSFMNMTPLPQ 120

Query: 699 LESELRPMTVVLPTFMHRAGVQILRPTLPNTPWRIRQVTSAPQQSESGDCGMFCVKYFEY 758
           LE EL+PM  ++PT + R GV + +P +P TPWRIR+V+SAPQQ   GDCG+FC+ +FEY
Sbjct: 121 LEQELKPMITIIPTLICRVGVHLYKPNIPLTPWRIRRVSSAPQQGMDGDCGIFCINFFEY 180

Query: 759 DVTGSNMTSLTQDNISFFREKLAIEM 782
           DVT  +  +LTQ  +SFFR + A+++
Sbjct: 181 DVTSCSFDTLTQSRMSFFRRQFAVQL 206

BLAST of Moc05g31440 vs. NCBI nr
Match: XP_022156465.1 (uncharacterized protein LOC111023353 [Momordica charantia])

HSP 1 Score: 267.7 bits (683), Expect = 3.2e-67
Identity = 144/248 (58.06%), Positives = 171/248 (68.95%), Query Frame = 0

Query: 38  LGEIKDSTPDTISFNLFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSNE 97
           L E++DSTP+TISFNLFG +VSFGRREFD+ISGL YDR  VRK T  H+LR LYFND   
Sbjct: 27  LREVEDSTPNTISFNLFGRRVSFGRREFDLISGLHYDRSPVRKVTHSHKLRTLYFNDRTN 86

Query: 98  VLLSEFEKIYLAARFEDDFDAVKVSIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNH 157
            +LS+F K+Y+AA F+DDFD +KVSI+Y+VELVLLGRE T+K+D  LLG+VDDWE CCNH
Sbjct: 87  DVLSDFVKLYIAALFKDDFDVIKVSIIYMVELVLLGRETTMKFDQILLGVVDDWELCCNH 146

Query: 158 DWGMLSFDKTIYSLKRGPTKRSKDGGFRKSYSLYGFPWAFQVNSDVVPRILRWRCGHSTA 217
           D   LSFDKTI SL RGPT  +KD G RKSYSLYGFPW FQV                  
Sbjct: 147 DLASLSFDKTIRSLHRGPTNMAKDFGLRKSYSLYGFPWVFQV------------------ 206

Query: 218 WHVLDREIFRSSTGRTRSIEATDAETTFLDRTFEPPEPEDEDEVSRENNATEPSSACAGP 277
           W         +   RTR +EATDAET F+ RTFEPPEPED+D   R+ +A  PS+   G 
Sbjct: 207 W---------TYERRTRRLEATDAETNFMRRTFEPPEPEDDD--VRDFDA-GPSTVREGT 244

Query: 278 EKDDGRQG 286
           +  D  +G
Sbjct: 267 QNPDVGRG 244

BLAST of Moc05g31440 vs. NCBI nr
Match: XP_022158744.1 (uncharacterized protein LOC111025209 [Momordica charantia])

HSP 1 Score: 263.8 bits (673), Expect = 4.6e-66
Identity = 131/195 (67.18%), Positives = 150/195 (76.92%), Query Frame = 0

Query: 53  LFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSNEVLLSEFEKIYLAARF 112
           L G+KVSFGRREFDIISGLKY R  VRK T P R   LYFN+S ++LLSE EK+Y + RF
Sbjct: 59  LLGTKVSFGRREFDIISGLKYSRSPVRKITYPQRHGTLYFNNSTDLLLSELEKMYTSIRF 118

Query: 113 EDDFDAVKVSIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLK 172
           EDD DAVKV +VY VELVLLGRER+ K+D+ LLGIVDDWE CCNHDW +LSFDKTIYSL+
Sbjct: 119 EDDIDAVKV-LVYFVELVLLGRERSTKFDHILLGIVDDWEACCNHDWALLSFDKTIYSLQ 178

Query: 173 RGPTKRSKDGGFRKSYSLYGFPWAFQ-----------------VNSDVVPRILRWRCGHS 231
           RG + +SK+GG RKSYSLYGFPWAFQ                 V+ DVVPRIL+WR  HS
Sbjct: 179 RGASNKSKEGGLRKSYSLYGFPWAFQVWAYEIISSLSGHIVTIVSQDVVPRILQWRYEHS 238

BLAST of Moc05g31440 vs. ExPASy Swiss-Prot
Match: Q94F30 (Ubiquitin-like-specific protease ESD4 OS=Arabidopsis thaliana OX=3702 GN=ESD4 PE=1 SV=1)

HSP 1 Score: 51.2 bits (121), Expect = 6.1e-05
Identity = 32/129 (24.81%), Positives = 60/129 (46.51%), Query Frame = 0

Query: 653 LDLDAIYLPYNIGGNHWIMLHIDLQEGEIIVWDSIRSMTPFPALESELRPMTVVLPTFMH 712
           +D D I++P +  G HW +  I+ +E +++  DS+  + P          +   L  +M 
Sbjct: 366 IDCDMIFVPIH-RGVHWTLAVINNRESKLLYLDSLNGVDPM---------ILNALAKYMG 425

Query: 713 RAGVQILRPTLPNTPWRIRQVTSAPQQSESGDCGMFCVKYFEYDVTGSNMTSLTQDNISF 772
               +     +    W +  V   PQQ    DCGMF +KY ++   G  +   +Q+++ +
Sbjct: 426 DEANEKSGKKIDANSWDMEFVEDLPQQKNGYDCGMFMLKYIDFFSRGLGL-CFSQEHMPY 483

Query: 773 FREKLAIEM 782
           FR + A E+
Sbjct: 486 FRLRTAKEI 483

BLAST of Moc05g31440 vs. ExPASy TrEMBL
Match: A0A6J1DP34 (uncharacterized protein LOC111021802 OS=Momordica charantia OX=3673 GN=LOC111021802 PE=4 SV=1)

HSP 1 Score: 319.7 bits (818), Expect = 3.4e-83
Identity = 169/328 (51.52%), Positives = 220/328 (67.07%), Query Frame = 0

Query: 38  LGEIKDSTPDTISFNLFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSNE 97
           L E+++STP+TISFNLF  ++SF R +F +ISGLKY R  VR++T PHRL  LYFND  +
Sbjct: 27  LREVEESTPNTISFNLFRRRMSFERTKFHLISGLKYVRTPVRENTLPHRLMTLYFNDKTD 86

Query: 98  VLLSEFEKIYLAARFEDDFDAVKVSIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNH 157
           ++LS+FEK+Y AARFEDD+D VKV IVY+V + LLGRER +K+D+TLLGIVDDWE CCN+
Sbjct: 87  LVLSDFEKMYTAARFEDDYDVVKVLIVYMVGIGLLGRERMVKFDHTLLGIVDDWEVCCNY 146

Query: 158 DWGMLSFDKTIYSLKRGPTKRSKDGGFRKSYSLYGFPWAFQVNS---------------- 217
           +W  LSF+KTI SL+RGP K SKDG  RKSYSLYGFPW FQV +                
Sbjct: 147 NWASLSFEKTINSLQRGPLKMSKDGKLRKSYSLYGFPWVFQVWAYDTISSLSMRVANKVL 206

Query: 218 -DVVPRILRWRCGHSTAWHVLDREIFRSSTGRTRSIEATDAETTFLDRTFEPPEPEDEDE 277
            D VP I +WR  HSTAWHVLDR+IF S+ GRTR+++ TD ET+FL+R+F+PP  +D+D 
Sbjct: 207 YDTVPHIFKWRYDHSTAWHVLDRDIFCSTKGRTRTLDETDVETSFLNRSFDPPVSDDDDV 266

Query: 278 VSRENNATEPSSACAGPEKDDGRQGGMSTKVLEK----------------TSMKRLRRKR 333
           +    +   PS+   G + DD  +G    +++EK                 S  RL+R  
Sbjct: 267 MEERGDNAGPSAVREGSQYDDESRGA-DVEMVEKDAELENEETKGKNKVCISTGRLKRVE 326

BLAST of Moc05g31440 vs. ExPASy TrEMBL
Match: A0A6J1DYB1 (uncharacterized protein LOC111025666 OS=Momordica charantia OX=3673 GN=LOC111025666 PE=4 SV=1)

HSP 1 Score: 285.4 bits (729), Expect = 7.1e-73
Identity = 138/158 (87.34%), Positives = 147/158 (93.04%), Query Frame = 0

Query: 38  LGEIKD-STPDTISFNLFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSN 97
           L EI+D STP+TISFNLFGSKV F RREFDIISGLKYDR  VRKDTSPHRLRALYFNDSN
Sbjct: 65  LREIEDSSTPNTISFNLFGSKVLFRRREFDIISGLKYDRSPVRKDTSPHRLRALYFNDSN 124

Query: 98  EVLLSEFEKIYLAARFEDDFDAVKVSIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCN 157
           ++LLS+FEKIY+  RFEDDFDA K+SIVYL+ELVLLGRERTLKYDYTLLGIVDD ETCCN
Sbjct: 125 DILLSDFEKIYIVTRFEDDFDAAKISIVYLMELVLLGRERTLKYDYTLLGIVDDCETCCN 184

Query: 158 HDWGMLSFDKTIYSLKRGPTKRSKDGGFRKSYSLYGFP 195
           HDWGM+SFDKTIYSLKRGPTKRSKDGGFRK YSLYGFP
Sbjct: 185 HDWGMMSFDKTIYSLKRGPTKRSKDGGFRKLYSLYGFP 222

BLAST of Moc05g31440 vs. ExPASy TrEMBL
Match: A0A6J1DLV0 (uncharacterized protein LOC111021646 OS=Momordica charantia OX=3673 GN=LOC111021646 PE=3 SV=1)

HSP 1 Score: 269.6 bits (688), Expect = 4.0e-68
Identity = 116/206 (56.31%), Positives = 157/206 (76.21%), Query Frame = 0

Query: 579 MFTRNKLEQRHDLCSRRFTTGDI---NFLRRTDGLYQRMTAPNAVPARVATEYDWAGKCK 638
           MF  NKL+ R +LC R+FTTGD+   NFLR TDG+Y  M +PN + +RVA++YDW G+  
Sbjct: 1   MFVCNKLKLRPNLCRRKFTTGDVLISNFLRSTDGVYVMMQSPNVIASRVASDYDWEGRAW 60

Query: 639 TILSYMDGTHTDYQTRWLDLDAIYLPYNIGGNHWIMLHIDLQEGEIIVWDSIRSMTPFPA 698
           ++LSY+DGTH+D  TRW+D+DA+YLPYNIGG HWI++ ID  EGE+IVWDS  +MTP P 
Sbjct: 61  SMLSYIDGTHSDNDTRWMDVDAVYLPYNIGGVHWIVICIDFDEGELIVWDSFMNMTPLPQ 120

Query: 699 LESELRPMTVVLPTFMHRAGVQILRPTLPNTPWRIRQVTSAPQQSESGDCGMFCVKYFEY 758
           LE EL+PM  ++PT + R GV + +P +P TPWRIR+V+SAPQQ   GDCG+FC+ +FEY
Sbjct: 121 LEQELKPMITIIPTLICRVGVHLYKPNIPLTPWRIRRVSSAPQQGMDGDCGIFCINFFEY 180

Query: 759 DVTGSNMTSLTQDNISFFREKLAIEM 782
           DVT  +  +LTQ  +SFFR + A+++
Sbjct: 181 DVTSCSFDTLTQSRMSFFRRQFAVQL 206

BLAST of Moc05g31440 vs. ExPASy TrEMBL
Match: A0A6J1DQC8 (uncharacterized protein LOC111023353 OS=Momordica charantia OX=3673 GN=LOC111023353 PE=4 SV=1)

HSP 1 Score: 267.7 bits (683), Expect = 1.5e-67
Identity = 144/248 (58.06%), Positives = 171/248 (68.95%), Query Frame = 0

Query: 38  LGEIKDSTPDTISFNLFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSNE 97
           L E++DSTP+TISFNLFG +VSFGRREFD+ISGL YDR  VRK T  H+LR LYFND   
Sbjct: 27  LREVEDSTPNTISFNLFGRRVSFGRREFDLISGLHYDRSPVRKVTHSHKLRTLYFNDRTN 86

Query: 98  VLLSEFEKIYLAARFEDDFDAVKVSIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNH 157
            +LS+F K+Y+AA F+DDFD +KVSI+Y+VELVLLGRE T+K+D  LLG+VDDWE CCNH
Sbjct: 87  DVLSDFVKLYIAALFKDDFDVIKVSIIYMVELVLLGRETTMKFDQILLGVVDDWELCCNH 146

Query: 158 DWGMLSFDKTIYSLKRGPTKRSKDGGFRKSYSLYGFPWAFQVNSDVVPRILRWRCGHSTA 217
           D   LSFDKTI SL RGPT  +KD G RKSYSLYGFPW FQV                  
Sbjct: 147 DLASLSFDKTIRSLHRGPTNMAKDFGLRKSYSLYGFPWVFQV------------------ 206

Query: 218 WHVLDREIFRSSTGRTRSIEATDAETTFLDRTFEPPEPEDEDEVSRENNATEPSSACAGP 277
           W         +   RTR +EATDAET F+ RTFEPPEPED+D   R+ +A  PS+   G 
Sbjct: 207 W---------TYERRTRRLEATDAETNFMRRTFEPPEPEDDD--VRDFDA-GPSTVREGT 244

Query: 278 EKDDGRQG 286
           +  D  +G
Sbjct: 267 QNPDVGRG 244

BLAST of Moc05g31440 vs. ExPASy TrEMBL
Match: A0A6J1E0A9 (uncharacterized protein LOC111025209 OS=Momordica charantia OX=3673 GN=LOC111025209 PE=4 SV=1)

HSP 1 Score: 263.8 bits (673), Expect = 2.2e-66
Identity = 131/195 (67.18%), Positives = 150/195 (76.92%), Query Frame = 0

Query: 53  LFGSKVSFGRREFDIISGLKYDRGLVRKDTSPHRLRALYFNDSNEVLLSEFEKIYLAARF 112
           L G+KVSFGRREFDIISGLKY R  VRK T P R   LYFN+S ++LLSE EK+Y + RF
Sbjct: 59  LLGTKVSFGRREFDIISGLKYSRSPVRKITYPQRHGTLYFNNSTDLLLSELEKMYTSIRF 118

Query: 113 EDDFDAVKVSIVYLVELVLLGRERTLKYDYTLLGIVDDWETCCNHDWGMLSFDKTIYSLK 172
           EDD DAVKV +VY VELVLLGRER+ K+D+ LLGIVDDWE CCNHDW +LSFDKTIYSL+
Sbjct: 119 EDDIDAVKV-LVYFVELVLLGRERSTKFDHILLGIVDDWEACCNHDWALLSFDKTIYSLQ 178

Query: 173 RGPTKRSKDGGFRKSYSLYGFPWAFQ-----------------VNSDVVPRILRWRCGHS 231
           RG + +SK+GG RKSYSLYGFPWAFQ                 V+ DVVPRIL+WR  HS
Sbjct: 179 RGASNKSKEGGLRKSYSLYGFPWAFQVWAYEIISSLSGHIVTIVSQDVVPRILQWRYEHS 238

BLAST of Moc05g31440 vs. TAIR 10
Match: AT5G45570.1 (Ulp1 protease family protein )

HSP 1 Score: 71.2 bits (173), Expect = 4.0e-12
Identity = 37/131 (28.24%), Positives = 66/131 (50.38%), Query Frame = 0

Query: 652 WLDLDAIYLPYNIGGNHWIMLHIDLQEGEIIVWDSIRSMTPFPALESELRPMTVVLPTFM 711
           ++D+D +Y    + GNHW+ L IDL    + V+DSI S+T    +  +   +  ++P  +
Sbjct: 765 FVDVDHLYAYLFVNGNHWVALDIDLTNKRVNVYDSIPSLTTDTEMAIQCMFVMTMIPAML 824

Query: 712 HR-AGVQILRPTLPNTPWRIRQVTSAPQQSESGDCGMFCVKYFEYDVTGSNMTSLTQDNI 771
                 +  R +     W  +++T  P+  + GDC ++ +KY E    G +   L  +N+
Sbjct: 825 SSFIPSKQRRRSYSKLEW--KRITKIPENLDPGDCAIYSIKYIECLALGKSFDGLCDENM 884

Query: 772 SFFREKLAIEM 782
              R KLA+EM
Sbjct: 885 QSLRTKLAVEM 893

BLAST of Moc05g31440 vs. TAIR 10
Match: AT4G15880.1 (Cysteine proteinases superfamily protein )

HSP 1 Score: 51.2 bits (121), Expect = 4.3e-06
Identity = 32/129 (24.81%), Positives = 60/129 (46.51%), Query Frame = 0

Query: 653 LDLDAIYLPYNIGGNHWIMLHIDLQEGEIIVWDSIRSMTPFPALESELRPMTVVLPTFMH 712
           +D D I++P +  G HW +  I+ +E +++  DS+  + P          +   L  +M 
Sbjct: 366 IDCDMIFVPIH-RGVHWTLAVINNRESKLLYLDSLNGVDPM---------ILNALAKYMG 425

Query: 713 RAGVQILRPTLPNTPWRIRQVTSAPQQSESGDCGMFCVKYFEYDVTGSNMTSLTQDNISF 772
               +     +    W +  V   PQQ    DCGMF +KY ++   G  +   +Q+++ +
Sbjct: 426 DEANEKSGKKIDANSWDMEFVEDLPQQKNGYDCGMFMLKYIDFFSRGLGL-CFSQEHMPY 483

Query: 773 FREKLAIEM 782
           FR + A E+
Sbjct: 486 FRLRTAKEI 483

The following BLAST results are available for this feature:

NCBI nr
Match Name | E-value | Identity (%) | Description
XP_022154561.1 | 7.0e-83 | 51.52 | uncharacterized protein LOC111021802 [Momordica charantia]
XP_022159253.1 | 1.5e-72 | 87.34 | uncharacterized protein LOC111025666 [Momordica charantia] >XP_022159254.1 uncha...
XP_022154364.1 | 8.3e-68 | 56.31 | uncharacterized protein LOC111021646 [Momordica charantia]
XP_022156465.1 | 3.2e-67 | 58.06 | uncharacterized protein LOC111023353 [Momordica charantia]
XP_022158744.1 | 4.6e-66 | 67.18 | uncharacterized protein LOC111025209 [Momordica charantia]

ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q94F30 | 6.1e-05 | 24.81 | Ubiquitin-like-specific protease ESD4 OS=Arabidopsis thaliana OX=3702 GN=ESD4 PE...

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1DP34 | 3.4e-83 | 51.52 | uncharacterized protein LOC111021802 OS=Momordica charantia OX=3673 GN=LOC111021...
A0A6J1DYB1 | 7.1e-73 | 87.34 | uncharacterized protein LOC111025666 OS=Momordica charantia OX=3673 GN=LOC111025...
A0A6J1DLV0 | 4.0e-68 | 56.31 | uncharacterized protein LOC111021646 OS=Momordica charantia OX=3673 GN=LOC111021...
A0A6J1DQC8 | 1.5e-67 | 58.06 | uncharacterized protein LOC111023353 OS=Momordica charantia OX=3673 GN=LOC111023...
A0A6J1E0A9 | 2.2e-66 | 67.18 | uncharacterized protein LOC111025209 OS=Momordica charantia OX=3673 GN=LOC111025...

TAIR 10
Match Name | E-value | Identity (%) | Description
AT5G45570.1 | 4.0e-12 | 28.24 | Ulp1 protease family protein
AT4G15880.1 | 4.3e-06 | 24.81 | Cysteine proteinases superfamily protein

InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | GENE3D | 3.40.395.10 | Adenoviral Proteinase; Chain A | coord: 514..781; e-value: 1.8E-26; score: 94.9
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..41
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 340..432
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 23..37
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 242..308
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 464..499
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1..22
None | No IPR available | PANTHER | PTHR12606:SF109 | BNACNNG52470D PROTEIN | coord: 606..778
None | No IPR available | PANTHER | PTHR12606 | SENTRIN/SUMO-SPECIFIC PROTEASE | coord: 606..778
IPR003653 | Ulp1 protease family, C-terminal catalytic domain | PFAM | PF02902 | Peptidase_C48 | coord: 642..779; e-value: 1.0E-22; score: 81.1
IPR003653 | Ulp1 protease family, C-terminal catalytic domain | PROSITE | PS50600 | ULP_PROTEASE | coord: 535..756; score: 14.594009
IPR015410 | Domain of unknown function DUF1985 | PFAM | PF09331 | DUF1985 | coord: 41..168; e-value: 3.8E-13; score: 49.6
IPR038765 | Papain-like cysteine peptidase superfamily | SUPERFAMILY | 54001 | Cysteine proteinases | coord: 522..780
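
The `coord` values in the InterPro annotations refer to residue positions in the protein sequence. A minimal sketch of extracting the residues covered by an annotation follows; `slice_domain` is a hypothetical helper, and treating the coordinates as 1-based and inclusive is an assumption. The example uses only the first 50 residues of the protein from the Sequences section.

```python
def slice_domain(protein, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(protein)):
        raise ValueError(f"coordinates {start}..{end} fall outside the sequence")
    return protein[start - 1:end]

def domain_length(start, end):
    # Number of residues spanned by a 1-based inclusive range.
    return end - start + 1

# First 50 residues of the Moc05g31440 protein listed on this page.
prefix = "MELQEPTSHSPKTDSHSTKSSKTLTETQDRNRISDSGLGEIKDSTPDTIS"

print(slice_domain(prefix, 38, 43))  # LGEIKD
print(domain_length(642, 779))       # 138
```

Residues 38 to 43 (LGEIKD) match the start of the query rows in the BLAST alignments above, which begin at position 38; the PF02902 match at 642..779 spans 138 residues under the same convention.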

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Moc05g31440.1 | Moc05g31440.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006508 | proteolysis
molecular_function | GO:0008234 | cysteine-type peptidase activity