MS001081 (gene) Bitter gourd (TR) v1

Overview
Name: MS001081
Type: gene
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: ARID domain-containing protein
Location: scaffold36: 1333870 .. 1334826 (+)
RNA-Seq Expression: MS001081
Synteny: MS001081
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGATCCACTAAAAGAAGAGATCCCAAAGCCATTGCCAAGCCCACTGCTCCGCCGGCCGATCTTCTCGTCTGTTTTCCGGCACGTGCCCGTCTCACCCTACTCCCCAAACCCGCCTGCAGTCCGGCCAGAGCTCCGGCCGACTCCGCCCTCACCCGCCGCCACCACCACCCCAAGAAACCGCCCGCCGTTGCGGCGGCGCCGGCCCATGCGAGTCCTCTGCTGTGGGGTGCCAAGGAGATGGGGTCGGAGCCCACGTCCCCGAAAGTCACGTGCGCCGGCCAGATCAAAATCCGGCCACACGCCACCAAAAACTGGCAATCCGTGATGGAAGAAATCGAGAGAATTCACAAGAAGAAGAAAATACCCAATTCGAGAAATCGAAATCGAAATCCCCTCGGCTTCAAGAGAGAAATCGTCAATTTTCTCTCGTGTTTACGCGGCTTCCGCTTCGATTTCCGCTGCTTCAGAGGCATCCCCGAATCCGACGTCACAACAGAGGACGAATCCGAATCCGAACCCGAATCCGAATCCGAAGAAGAACCCGCCGAAGCTTCGAGAACCATGTTCTCGAAATGGTTCATGGTTTTACAGGAAGAAGAAGAAGAAACCAGACCCAGAAACGGCGGCGTTTCAGCGGCGGAGGCAGAGCCGTCCTCTGGCCCTCCCCCAAATGCTCTGTTACTGATGCGTTGCAGGTCGGCGCCGTCAATGGGGTGCCTCCAACGGAACCCAAAAGCAAAAGCAGAGGAAGAGCAGCAATCGAAAATTAGCTTGAAGTTGTTAATGGAGAAAGAGGAGGTGAATCCGAAAAATGAAAGCTTGGTGGTGATGGATTATGATGCTGATTTCTACAGACTTTCATCGGATATTGCTAAGGAGACGTGGGTTGTGAGTGGATCCAATAAATCCAATGAATCTGATGACCCCTTGATTAGAAGTCGAAGTTGGAAGAGA

mRNA sequence

ATGAGATCCACTAAAAGAAGAGATCCCAAAGCCATTGCCAAGCCCACTGCTCCGCCGGCCGATCTTCTCGTCTGTTTTCCGGCACGTGCCCGTCTCACCCTACTCCCCAAACCCGCCTGCAGTCCGGCCAGAGCTCCGGCCGACTCCGCCCTCACCCGCCGCCACCACCACCCCAAGAAACCGCCCGCCGTTGCGGCGGCGCCGGCCCATGCGAGTCCTCTGCTGTGGGGTGCCAAGGAGATGGGGTCGGAGCCCACGTCCCCGAAAGTCACGTGCGCCGGCCAGATCAAAATCCGGCCACACGCCACCAAAAACTGGCAATCCGTGATGGAAGAAATCGAGAGAATTCACAAGAAGAAGAAAATACCCAATTCGAGAAATCGAAATCGAAATCCCCTCGGCTTCAAGAGAGAAATCGTCAATTTTCTCTCGTGTTTACGCGGCTTCCGCTTCGATTTCCGCTGCTTCAGAGGCATCCCCGAATCCGACGTCACAACAGAGGACGAATCCGAATCCGAACCCGAATCCGAATCCGAAGAAGAACCCGCCGAAGCTTCGAGAACCATGTTCTCGAAATGGTTCATGGTTTTACAGGAAGAAGAAGAAGAAACCAGACCCAGAAACGGCGGCGTTTCAGCGGCGGAGGCAGAGCCGTCCTCTGGCCCTCCCCCAAATGCTCTGTTACTGATGCGTTGCAGGTCGGCGCCGTCAATGGGGTGCCTCCAACGGAACCCAAAAGCAAAAGCAGAGGAAGAGCAGCAATCGAAAATTAGCTTGAAGTTGTTAATGGAGAAAGAGGAGGTGAATCCGAAAAATGAAAGCTTGGTGGTGATGGATTATGATGCTGATTTCTACAGACTTTCATCGGATATTGCTAAGGAGACGTGGGTTGTGAGTGGATCCAATAAATCCAATGAATCTGATGACCCCTTGATTAGAAGTCGAAGTTGGAAGAGA

Coding sequence (CDS)

ATGAGATCCACTAAAAGAAGAGATCCCAAAGCCATTGCCAAGCCCACTGCTCCGCCGGCCGATCTTCTCGTCTGTTTTCCGGCACGTGCCCGTCTCACCCTACTCCCCAAACCCGCCTGCAGTCCGGCCAGAGCTCCGGCCGACTCCGCCCTCACCCGCCGCCACCACCACCCCAAGAAACCGCCCGCCGTTGCGGCGGCGCCGGCCCATGCGAGTCCTCTGCTGTGGGGTGCCAAGGAGATGGGGTCGGAGCCCACGTCCCCGAAAGTCACGTGCGCCGGCCAGATCAAAATCCGGCCACACGCCACCAAAAACTGGCAATCCGTGATGGAAGAAATCGAGAGAATTCACAAGAAGAAGAAAATACCCAATTCGAGAAATCGAAATCGAAATCCCCTCGGCTTCAAGAGAGAAATCGTCAATTTTCTCTCGTGTTTACGCGGCTTCCGCTTCGATTTCCGCTGCTTCAGAGGCATCCCCGAATCCGACGTCACAACAGAGGACGAATCCGAATCCGAACCCGAATCCGAATCCGAAGAAGAACCCGCCGAAGCTTCGAGAACCATGTTCTCGAAATGGTTCATGGTTTTACAGGAAGAAGAAGAAGAAACCAGACCCAGAAACGGCGGCGTTTCAGCGGCGGAGGCAGAGCCGTCCTCTGGCCCTCCCCCAAATGCTCTGTTACTGATGCGTTGCAGGTCGGCGCCGTCAATGGGGTGCCTCCAACGGAACCCAAAAGCAAAAGCAGAGGAAGAGCAGCAATCGAAAATTAGCTTGAAGTTGTTAATGGAGAAAGAGGAGGTGAATCCGAAAAATGAAAGCTTGGTGGTGATGGATTATGATGCTGATTTCTACAGACTTTCATCGGATATTGCTAAGGAGACGTGGGTTGTGAGTGGATCCAATAAATCCAATGAATCTGATGACCCCTTGATTAGAAGTCGAAGTTGGAAGAGA

Protein sequence

MRSTKRRDPKAIAKPTAPPADLLVCFPARARLTLLPKPACSPARAPADSALTRRHHHPKKPPAVAAAPAHASPLLWGAKEMGSEPTSPKVTCAGQIKIRPHATKNWQSVMEEIERIHKKKKIPNSRNRNRNPLGFKREIVNFLSCLRGFRFDFRCFRGIPESDVTTEDESESEPESESEEEPAEASRTMFSKWFMVLQEEEEETRPRNGGVSAAEAEPSSGPPPNALLLMRCRSAPSMGCLQRNPKAKAEEEQQSKISLKLLMEKEEVNPKNESLVVMDYDADFYRLSSDIAKETWVVSGSNKSNESDDPLIRSRSWKR
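
The relationship between the location, the coding sequence, and the protein sequence can be checked with a short script. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the function names `feature_length` and `translate` are hypothetical helpers, not part of the database. It translates only the first 30 bases of the CDS shown above and computes the feature length from the inclusive coordinates.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), laid out in the
# conventional TCAG codon order. Assumption: this table applies here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon (no stop handling)."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # First 30 bases of the CDS listed on this page.
    cds_prefix = "ATGAGATCCACTAAAAGAAGAGATCCCAAA"
    print(translate(cds_prefix))             # MRSTKRRDPK
    length = feature_length(1333870, 1334826)
    print(length, length // 3)               # 957 319
```

The translated prefix matches the start of the protein sequence, and the 957-base span corresponds to 319 codons, which is consistent with the InterPro coordinates ending at position 319.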
Homology
BLAST of MS001081 vs. NCBI nr
Match: XP_022131621.1 (uncharacterized protein LOC111004752 [Momordica charantia])

HSP 1 Score: 614.0 bits (1582), Expect = 7.3e-172
Identity = 313/315 (99.37%), Positives = 315/315 (100.00%), Query Frame = 0

Query: 5   KRRDPKAIAKPTAPPADLLVCFPARARLTLLPKPACSPARAPADSALTRRHHHPKKPPAV 64
           KRRDPKAIAKPTAPPADLLVCFPARARLTLLPKPACSPARAPADSALTRRHHHPKKPPAV
Sbjct: 2   KRRDPKAIAKPTAPPADLLVCFPARARLTLLPKPACSPARAPADSALTRRHHHPKKPPAV 61

Query: 65  AAAPAHASPLLWGAKEMGSEPTSPKVTCAGQIKIRPHATKNWQSVMEEIERIHKKKKIPN 124
           AAAPAHASPLLWGAKEMGSEPTSPKVTCAGQIKIRPHATKNWQSVMEEIERIHKKKKIPN
Sbjct: 62  AAAPAHASPLLWGAKEMGSEPTSPKVTCAGQIKIRPHATKNWQSVMEEIERIHKKKKIPN 121

Query: 125 SRNRNRNPLGFKREIVNFLSCLRGFRFDFRCFRGIPESDVTTEDESESEPESESEEEPAE 184
           SRNRN+NPLGFKREIVNFLSCLRGFRFDFRCFRGIPESDVTTEDESESEPESESEEEPAE
Sbjct: 122 SRNRNQNPLGFKREIVNFLSCLRGFRFDFRCFRGIPESDVTTEDESESEPESESEEEPAE 181

Query: 185 ASRTMFSKWFMVLQEEEEETRPRNGGVSAAEAEPSSGPPPNALLLMRCRSAPSMGCLQRN 244
           ASRTMFSKWFMVLQEEEEETRPRNGGVSAAEAEPSSGPPPNALLLMRCRSAPSMGCLQRN
Sbjct: 182 ASRTMFSKWFMVLQEEEEETRPRNGGVSAAEAEPSSGPPPNALLLMRCRSAPSMGCLQRN 241

Query: 245 PKAKAEEEQQSKISLKLLMEKEEVNPKNESLVVMDYDADFYRLSSDIAKETWVVSGSNKS 304
           PKAKAEEEQQSKISLKLLMEKEEVNPKNESLVVMDYDADFYRLS+DIAKETWVVSGSNKS
Sbjct: 242 PKAKAEEEQQSKISLKLLMEKEEVNPKNESLVVMDYDADFYRLSADIAKETWVVSGSNKS 301

Query: 305 NESDDPLIRSRSWKR 320
           NESDDPLIRSRSWKR
Sbjct: 302 NESDDPLIRSRSWKR 316
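
The percentages in each HSP summary follow directly from the reported counts (identical or positive positions divided by the alignment length). The following is a minimal sketch of that arithmetic; `parse_hsp_stats` is a hypothetical helper, and the regular expression only assumes the line layout used on this page.

```python
import re

# Matches the HSP summary line as printed on this page. "Posi?tives"
# tolerates the misspelling "Postives" that some BLAST viewers emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP summary line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    identical, aligned, positive, aligned_pos = map(int, match.groups())
    return {
        "identical": identical,
        "positive": positive,
        "aligned": aligned,
        # Percentages are counts over alignment length, rounded to 2 places.
        "identity_pct": round(100 * identical / aligned, 2),
        "positive_pct": round(100 * positive / aligned_pos, 2),
    }


if __name__ == "__main__":
    line = "Identity = 313/315 (99.37%), Positives = 315/315 (100.00%), Query Frame = 0"
    print(parse_hsp_stats(line))
```

For the top hit, 313 identical positions over a 315-residue alignment gives 99.37%, matching the reported value.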

BLAST of MS001081 vs. NCBI nr
Match: XP_022962582.1 (uncharacterized protein LOC111462985 [Cucurbita moschata])

HSP 1 Score: 394.0 bits (1011), Expect = 1.2e-105
Identity = 232/323 (71.83%), Positives = 255/323 (78.95%), Query Frame = 0

Query: 17  APPADLLVCFPARARLTLLPKPACSPARAPADSALTRRHHHPKKPPAVAAAPAHASPLLW 76
           APPADLLVCFPARARLTLLPKP CSPARA   SA   R H  K PP    + + ASPLLW
Sbjct: 2   APPADLLVCFPARARLTLLPKPTCSPARA---SAEPHRRHQKKAPP---PSQSQASPLLW 61

Query: 77  GAKEMGSEPTSPKVTCAGQIKIRPHA---TKNWQSVMEEIERIHKKKKIPNSRN--RNRN 136
            AKEM SEPTSPKVTCAGQIKI+PHA   TKNWQSVMEEIERIHKKKKIPN  N  R++N
Sbjct: 62  -AKEMASEPTSPKVTCAGQIKIKPHARRSTKNWQSVMEEIERIHKKKKIPNWGNQIRDQN 121

Query: 137 PLGFKREIVNFLSCLRGFRFDFRCFRGIPES-DVTTEDESESEPESES--EEEPAEAS-- 196
           P GFKREIVNFLSCLRGFRFDFRCFRG PES D+TTE+E E E ESE+  EEEP EA+  
Sbjct: 122 PFGFKREIVNFLSCLRGFRFDFRCFRGFPESDDITTEEEDEEEYESEAEYEEEPTEATSK 181

Query: 197 -RTMFSKWFMVLQ----EEEEETRPRNGGVSAAEAEPSSGPPPNALLLMRCRSAPSMGCL 256
            RTMFSKWFMVLQ    +EEEET+PRN      E +P S PPPNALLLMRCRSAPS G +
Sbjct: 182 RRTMFSKWFMVLQDEEEDEEEETKPRN---DTGELQPCSVPPPNALLLMRCRSAPSTGWI 241

Query: 257 QRNPK----AKAEEEQQSKISLKLLMEKEEVN-PKNESLVVMDYDADFYRLSSDIAKETW 316
           +R PK     + +EE+QSKISLKLLME+E+V   K ESLVVMDYDADFY+LSSDIAKETW
Sbjct: 242 ERKPKQEQQEQEQEEKQSKISLKLLMEEEKVAVAKKESLVVMDYDADFYKLSSDIAKETW 301

Query: 317 VVSGSNKSNESDDPLIRSRSWKR 320
           VVSGS+ S  +DDP +RSRSWKR
Sbjct: 302 VVSGSSSSRCNDDPFLRSRSWKR 314

BLAST of MS001081 vs. NCBI nr
Match: KAG6598680.1 (hypothetical protein SDJN03_08458, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 393.7 bits (1010), Expect = 1.6e-105
Identity = 235/334 (70.36%), Positives = 259/334 (77.54%), Query Frame = 0

Query: 6   RRDPKAIAKPTAPPADLLVCFPARARLTLLPKPACSPARAPADSALTRRHHHPKKPPAVA 65
           R +PK   K  APPADLLVCFPARARLTLLPKP CSPARA   SA   R H  K PP   
Sbjct: 4   RNNPK--PKSMAPPADLLVCFPARARLTLLPKPTCSPARA---SAEPHRRHQKKAPP--- 63

Query: 66  AAPAHASPLLWGAKEMGSEPTSPKVTCAGQIKIRPHA---TKNWQSVMEEIERIHKKKKI 125
            + + ASPLLW AKEM SEPTSPKVTCAGQIKI+PHA   TKNWQSVMEEIERIHKKKKI
Sbjct: 64  PSQSQASPLLW-AKEMASEPTSPKVTCAGQIKIKPHARRSTKNWQSVMEEIERIHKKKKI 123

Query: 126 PNSRN--RNRNPLGFKREIVNFLSCLRGFRFDFRCFRGIPES-DVTTEDESESEPESES- 185
           PN  N  +++NP GFKREIVNFLSCLRGFRFDFRCFRG PES D+TTE+E E E ESE+ 
Sbjct: 124 PNWGNQIQDQNPFGFKREIVNFLSCLRGFRFDFRCFRGFPESDDITTEEEDEEEYESEAE 183

Query: 186 -EEEPAEAS---RTMFSKWFMVLQ----EEEEETRPRNGGVSAAEAEPSSGPPPNALLLM 245
            EEEP EA+   RTMFSKWFMVLQ    +EEEET+PRN      E +P S PPPNALLLM
Sbjct: 184 YEEEPTEATSKRRTMFSKWFMVLQDEEEDEEEETKPRN---DTGELQPCSVPPPNALLLM 243

Query: 246 RCRSAPSMGCLQRNPK----AKAEEEQQSKISLKLLMEKE-EVNPKNESLVVMDYDADFY 305
           RCRSAPS G ++R PK     + +EE+QSKISLKLLME+E E   K ESLVVMDYDADFY
Sbjct: 244 RCRSAPSTGWIERKPKQEQQEQEQEEKQSKISLKLLMEEEKEAVAKKESLVVMDYDADFY 303

Query: 306 RLSSDIAKETWVVSGSNKSNESDDPLIRSRSWKR 320
           +LSSDIAKETWVVSGS+ S  +DDP +RSRSWKR
Sbjct: 304 KLSSDIAKETWVVSGSSSSRCNDDPFLRSRSWKR 325

BLAST of MS001081 vs. NCBI nr
Match: XP_023546364.1 (uncharacterized protein LOC111805493 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 392.1 bits (1006), Expect = 4.5e-105
Identity = 236/342 (69.01%), Positives = 259/342 (75.73%), Query Frame = 0

Query: 1   MRSTKRRDPKAIAKPTAPPADLLVCFPARARLTLLPKPACSPARAPADSALTRRHHHPKK 60
           M+S     PK+I    APPADLLVCFPARARLTLLPKP CSPARA   SA   R H  K 
Sbjct: 1   MKSRNNTKPKSI----APPADLLVCFPARARLTLLPKPTCSPARA---SAEPHRRHQKKA 60

Query: 61  PPAVAAAPAHASPLLWGAKEMGSEPTSPKVTCAGQIKIRPHA---TKNWQSVMEEIERIH 120
           PP    + + ASPLLW AKEM SEPTSPKVTCAGQIKI+PHA   TKNWQSVMEEIERIH
Sbjct: 61  PP---PSQSQASPLLW-AKEMASEPTSPKVTCAGQIKIKPHARRSTKNWQSVMEEIERIH 120

Query: 121 KKKKIPNSRN--RNRNPLGFKREIVNFLSCLRGFRFDFRCFRGIPES-DVTTEDES---- 180
           KKKKIPN  N  +++NP GFKREIVNFLSCLRGFRFDFRCFRG PES D+TTE+E     
Sbjct: 121 KKKKIPNWGNQIQDQNPFGFKREIVNFLSCLRGFRFDFRCFRGFPESDDITTEEEDEEEY 180

Query: 181 ESEPESESEEEPAEAS-----RTMFSKWFMVLQ---EEEEETRPRNGGVSAAEAEPSSGP 240
           ESEPESE+E E A        RTMFSKWFMVLQ   EEEEET+PRN      E +P S P
Sbjct: 181 ESEPESEAEYEEAHTEATSKRRTMFSKWFMVLQDEEEEEEETKPRN---DTGELQPCSVP 240

Query: 241 PPNALLLMRCRSAPSMGCLQRNPK----AKAEEEQQSKISLKLLMEKE-EVNPKNESLVV 300
           PPNALLLMRCRSAPS G ++R PK     + +EE+QSKISLKLLME+E E   K ESLVV
Sbjct: 241 PPNALLLMRCRSAPSTGWIERKPKQEQQEQEQEEKQSKISLKLLMEEEKEAVAKKESLVV 300

Query: 301 MDYDADFYRLSSDIAKETWVVSGSNKSNESDDPLIRSRSWKR 320
           MDYDADFY+LSSDIAKETWVVSGS+ S  +DDP +RSRSWKR
Sbjct: 301 MDYDADFYKLSSDIAKETWVVSGSSTSRCNDDPFLRSRSWKR 328

BLAST of MS001081 vs. NCBI nr
Match: KAA0064832.1 (myotubularin-related protein [Cucumis melo var. makuwa])

HSP 1 Score: 374.8 bits (961), Expect = 7.5e-100
Identity = 234/346 (67.63%), Positives = 258/346 (74.57%), Query Frame = 0

Query: 1   MRSTKRRDPKAIAKPTAPPADLLVCFPARARLTLLPKPACSPARAPADSALTRRHHHPKK 60
           MRS KRR+  + +KPTAPPADLLVCFPARA LTLLP    SPARAPA+    RRH+   +
Sbjct: 1   MRSMKRRN-HSKSKPTAPPADLLVCFPARAHLTLLP----SPARAPAEP--HRRHYRKAQ 60

Query: 61  PPAVAAAPAHASPLLWGAKEMGSEPTSPKVTCAGQIKIRPHA---TKNWQSVMEEIERIH 120
           P          SPL W AKEM SEPTSPKVTCAGQIKIRPHA   TKNWQSVMEEIERIH
Sbjct: 61  P----------SPLPW-AKEMSSEPTSPKVTCAGQIKIRPHAHRSTKNWQSVMEEIERIH 120

Query: 121 KKKKIPNSRNRNRNPLGFKREIVNFLSCLRGFRFDFRCFRGIPESDVTTEDES------- 180
            KKK P    RN+NPLGFKREIVNFLSCLRGFRFDFRCFRG P+SD+TT+DE        
Sbjct: 121 NKKKNP---IRNQNPLGFKREIVNFLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEEEFE 180

Query: 181 ----ESEPESESEEEPAEASRTMFSKWFMVLQEEEEETRP-RNGGVSAAEAEPS--SGPP 240
               ESE ESESEEEP    RTMFS+WFMVLQEEEEET+P  N  VS+ E +PS  S PP
Sbjct: 181 QHEPESESESESEEEPTR-GRTMFSEWFMVLQEEEEETKPINNDAVSSLEFQPSSVSVPP 240

Query: 241 PNALLLMRCRSAPSMGCLQRNPKAKAEEEQQ----SKISLKLLM--EKEEVNPKNESLVV 300
           PNALLLMRCRSAP+   L R PK + E+EQ+    SKISLKLLM  EKE V  K +SL+V
Sbjct: 241 PNALLLMRCRSAPN--SLLRKPKQEQEQEQEEEEKSKISLKLLMEEEKEMVTAKKKSLMV 300

Query: 301 MDYDADFYRLSSDIAKETWVVSGSNKSNES----DDPLIRSRSWKR 320
           MDYDADFY+LSSDIAKETWVVSGS+ S+ S    DDPL+RSRSWKR
Sbjct: 301 MDYDADFYKLSSDIAKETWVVSGSSTSSSSSRCNDDPLLRSRSWKR 322

BLAST of MS001081 vs. ExPASy TrEMBL
Match: A0A6J1BQ03 (uncharacterized protein LOC111004752 OS=Momordica charantia OX=3673 GN=LOC111004752 PE=4 SV=1)

HSP 1 Score: 614.0 bits (1582), Expect = 3.5e-172
Identity = 313/315 (99.37%), Positives = 315/315 (100.00%), Query Frame = 0

Query: 5   KRRDPKAIAKPTAPPADLLVCFPARARLTLLPKPACSPARAPADSALTRRHHHPKKPPAV 64
           KRRDPKAIAKPTAPPADLLVCFPARARLTLLPKPACSPARAPADSALTRRHHHPKKPPAV
Sbjct: 2   KRRDPKAIAKPTAPPADLLVCFPARARLTLLPKPACSPARAPADSALTRRHHHPKKPPAV 61

Query: 65  AAAPAHASPLLWGAKEMGSEPTSPKVTCAGQIKIRPHATKNWQSVMEEIERIHKKKKIPN 124
           AAAPAHASPLLWGAKEMGSEPTSPKVTCAGQIKIRPHATKNWQSVMEEIERIHKKKKIPN
Sbjct: 62  AAAPAHASPLLWGAKEMGSEPTSPKVTCAGQIKIRPHATKNWQSVMEEIERIHKKKKIPN 121

Query: 125 SRNRNRNPLGFKREIVNFLSCLRGFRFDFRCFRGIPESDVTTEDESESEPESESEEEPAE 184
           SRNRN+NPLGFKREIVNFLSCLRGFRFDFRCFRGIPESDVTTEDESESEPESESEEEPAE
Sbjct: 122 SRNRNQNPLGFKREIVNFLSCLRGFRFDFRCFRGIPESDVTTEDESESEPESESEEEPAE 181

Query: 185 ASRTMFSKWFMVLQEEEEETRPRNGGVSAAEAEPSSGPPPNALLLMRCRSAPSMGCLQRN 244
           ASRTMFSKWFMVLQEEEEETRPRNGGVSAAEAEPSSGPPPNALLLMRCRSAPSMGCLQRN
Sbjct: 182 ASRTMFSKWFMVLQEEEEETRPRNGGVSAAEAEPSSGPPPNALLLMRCRSAPSMGCLQRN 241

Query: 245 PKAKAEEEQQSKISLKLLMEKEEVNPKNESLVVMDYDADFYRLSSDIAKETWVVSGSNKS 304
           PKAKAEEEQQSKISLKLLMEKEEVNPKNESLVVMDYDADFYRLS+DIAKETWVVSGSNKS
Sbjct: 242 PKAKAEEEQQSKISLKLLMEKEEVNPKNESLVVMDYDADFYRLSADIAKETWVVSGSNKS 301

Query: 305 NESDDPLIRSRSWKR 320
           NESDDPLIRSRSWKR
Sbjct: 302 NESDDPLIRSRSWKR 316

BLAST of MS001081 vs. ExPASy TrEMBL
Match: A0A6J1HD24 (uncharacterized protein LOC111462985 OS=Cucurbita moschata OX=3662 GN=LOC111462985 PE=4 SV=1)

HSP 1 Score: 394.0 bits (1011), Expect = 5.8e-106
Identity = 232/323 (71.83%), Positives = 255/323 (78.95%), Query Frame = 0

Query: 17  APPADLLVCFPARARLTLLPKPACSPARAPADSALTRRHHHPKKPPAVAAAPAHASPLLW 76
           APPADLLVCFPARARLTLLPKP CSPARA   SA   R H  K PP    + + ASPLLW
Sbjct: 2   APPADLLVCFPARARLTLLPKPTCSPARA---SAEPHRRHQKKAPP---PSQSQASPLLW 61

Query: 77  GAKEMGSEPTSPKVTCAGQIKIRPHA---TKNWQSVMEEIERIHKKKKIPNSRN--RNRN 136
            AKEM SEPTSPKVTCAGQIKI+PHA   TKNWQSVMEEIERIHKKKKIPN  N  R++N
Sbjct: 62  -AKEMASEPTSPKVTCAGQIKIKPHARRSTKNWQSVMEEIERIHKKKKIPNWGNQIRDQN 121

Query: 137 PLGFKREIVNFLSCLRGFRFDFRCFRGIPES-DVTTEDESESEPESES--EEEPAEAS-- 196
           P GFKREIVNFLSCLRGFRFDFRCFRG PES D+TTE+E E E ESE+  EEEP EA+  
Sbjct: 122 PFGFKREIVNFLSCLRGFRFDFRCFRGFPESDDITTEEEDEEEYESEAEYEEEPTEATSK 181

Query: 197 -RTMFSKWFMVLQ----EEEEETRPRNGGVSAAEAEPSSGPPPNALLLMRCRSAPSMGCL 256
            RTMFSKWFMVLQ    +EEEET+PRN      E +P S PPPNALLLMRCRSAPS G +
Sbjct: 182 RRTMFSKWFMVLQDEEEDEEEETKPRN---DTGELQPCSVPPPNALLLMRCRSAPSTGWI 241

Query: 257 QRNPK----AKAEEEQQSKISLKLLMEKEEVN-PKNESLVVMDYDADFYRLSSDIAKETW 316
           +R PK     + +EE+QSKISLKLLME+E+V   K ESLVVMDYDADFY+LSSDIAKETW
Sbjct: 242 ERKPKQEQQEQEQEEKQSKISLKLLMEEEKVAVAKKESLVVMDYDADFYKLSSDIAKETW 301

Query: 317 VVSGSNKSNESDDPLIRSRSWKR 320
           VVSGS+ S  +DDP +RSRSWKR
Sbjct: 302 VVSGSSSSRCNDDPFLRSRSWKR 314

BLAST of MS001081 vs. ExPASy TrEMBL
Match: A0A5A7V8Z9 (Myotubularin-related protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold82G001440 PE=4 SV=1)

HSP 1 Score: 374.8 bits (961), Expect = 3.6e-100
Identity = 234/346 (67.63%), Positives = 258/346 (74.57%), Query Frame = 0

Query: 1   MRSTKRRDPKAIAKPTAPPADLLVCFPARARLTLLPKPACSPARAPADSALTRRHHHPKK 60
           MRS KRR+  + +KPTAPPADLLVCFPARA LTLLP    SPARAPA+    RRH+   +
Sbjct: 1   MRSMKRRN-HSKSKPTAPPADLLVCFPARAHLTLLP----SPARAPAEP--HRRHYRKAQ 60

Query: 61  PPAVAAAPAHASPLLWGAKEMGSEPTSPKVTCAGQIKIRPHA---TKNWQSVMEEIERIH 120
           P          SPL W AKEM SEPTSPKVTCAGQIKIRPHA   TKNWQSVMEEIERIH
Sbjct: 61  P----------SPLPW-AKEMSSEPTSPKVTCAGQIKIRPHAHRSTKNWQSVMEEIERIH 120

Query: 121 KKKKIPNSRNRNRNPLGFKREIVNFLSCLRGFRFDFRCFRGIPESDVTTEDES------- 180
            KKK P    RN+NPLGFKREIVNFLSCLRGFRFDFRCFRG P+SD+TT+DE        
Sbjct: 121 NKKKNP---IRNQNPLGFKREIVNFLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEEEFE 180

Query: 181 ----ESEPESESEEEPAEASRTMFSKWFMVLQEEEEETRP-RNGGVSAAEAEPS--SGPP 240
               ESE ESESEEEP    RTMFS+WFMVLQEEEEET+P  N  VS+ E +PS  S PP
Sbjct: 181 QHEPESESESESEEEPTR-GRTMFSEWFMVLQEEEEETKPINNDAVSSLEFQPSSVSVPP 240

Query: 241 PNALLLMRCRSAPSMGCLQRNPKAKAEEEQQ----SKISLKLLM--EKEEVNPKNESLVV 300
           PNALLLMRCRSAP+   L R PK + E+EQ+    SKISLKLLM  EKE V  K +SL+V
Sbjct: 241 PNALLLMRCRSAPN--SLLRKPKQEQEQEQEEEEKSKISLKLLMEEEKEMVTAKKKSLMV 300

Query: 301 MDYDADFYRLSSDIAKETWVVSGSNKSNES----DDPLIRSRSWKR 320
           MDYDADFY+LSSDIAKETWVVSGS+ S+ S    DDPL+RSRSWKR
Sbjct: 301 MDYDADFYKLSSDIAKETWVVSGSSTSSSSSRCNDDPLLRSRSWKR 322

BLAST of MS001081 vs. ExPASy TrEMBL
Match: A0A1S3BDB9 (uncharacterized protein LOC103488403 OS=Cucumis melo OX=3656 GN=LOC103488403 PE=4 SV=1)

HSP 1 Score: 369.4 bits (947), Expect = 1.5e-98
Identity = 231/343 (67.35%), Positives = 255/343 (74.34%), Query Frame = 0

Query: 5   KRRDPKAIAKPTAPPADLLVCFPARARLTLLPKPACSPARAPADSALTRRHHHPKKPPAV 64
           KRR+  + +KPTAPPADLLVCFPARA LTLLP    SPARAPA+    RRH+   +P   
Sbjct: 2   KRRN-HSKSKPTAPPADLLVCFPARAHLTLLP----SPARAPAEP--HRRHYRKAQP--- 61

Query: 65  AAAPAHASPLLWGAKEMGSEPTSPKVTCAGQIKIRPHA---TKNWQSVMEEIERIHKKKK 124
                  SPL W AKEM SEPTSPKVTCAGQIKIRPHA   TKNWQSVMEEIERIH KKK
Sbjct: 62  -------SPLPW-AKEMSSEPTSPKVTCAGQIKIRPHAHRSTKNWQSVMEEIERIHNKKK 121

Query: 125 IPNSRNRNRNPLGFKREIVNFLSCLRGFRFDFRCFRGIPESDVTTEDES----------- 184
            P    RN+NPLGFKREIVNFLSCLRGFRFDFRCFRG P+SD+TT+DE            
Sbjct: 122 NP---IRNQNPLGFKREIVNFLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEEEFEQHEP 181

Query: 185 ESEPESESEEEPAEASRTMFSKWFMVLQEEEEETRP-RNGGVSAAEAEPS--SGPPPNAL 244
           ESE ESESEEEP    RTMFS+WFMVLQEEEEET+P  N  VS+ E +PS  S PPPNAL
Sbjct: 182 ESESESESEEEPTR-GRTMFSEWFMVLQEEEEETKPINNDAVSSLEFQPSSVSVPPPNAL 241

Query: 245 LLMRCRSAPSMGCLQRNPKAKAEEEQQ-----SKISLKLLM--EKEEVNPKNESLVVMDY 304
           LLMRCRSAP+   L R PK + E+EQ+     SKISLKLLM  EKE V  K +SL+VMDY
Sbjct: 242 LLMRCRSAPN--SLLRKPKQEQEQEQEQEEEKSKISLKLLMEEEKEMVTAKKKSLMVMDY 301

Query: 305 DADFYRLSSDIAKETWVVSGSNKSNES----DDPLIRSRSWKR 320
           DADFY+LSSDIAKETWVVSGS+ S+ S    DDPL+RSRSWKR
Sbjct: 302 DADFYKLSSDIAKETWVVSGSSTSSSSSRCNDDPLLRSRSWKR 320

BLAST of MS001081 vs. ExPASy TrEMBL
Match: A0A0A0LMI6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G345900 PE=4 SV=1)

HSP 1 Score: 362.8 bits (930), Expect = 1.4e-96
Identity = 229/354 (64.69%), Positives = 255/354 (72.03%), Query Frame = 0

Query: 1   MRSTKRRDPKAIAKPTAPPADLLVCFPARARLTLLPKPACSPARAPADSALTRRHHHPKK 60
           MRS KRR+  + +KPTAPPADLLVCFPARA LTLLP    SPAR PA+    RRH+   +
Sbjct: 1   MRSMKRRN-HSKSKPTAPPADLLVCFPARAHLTLLP----SPARPPAEP--HRRHYRKAQ 60

Query: 61  PPAVAAAPAHASPLLWGAKEMGSEPTSPKVTCAGQIKIRPH---ATKNWQSVMEEIERIH 120
           P          SPL W AKEM SEPTSPKVTCAGQIKIRPH   +TKNWQSVMEEIERIH
Sbjct: 61  P----------SPLPW-AKEMSSEPTSPKVTCAGQIKIRPHSHRSTKNWQSVMEEIERIH 120

Query: 121 KKKKIPNSRNRNRNPLGFKREIVNFLSCLRGFRFDFRCFRGIPESDVTTEDES------- 180
            KKK P    RN+NP GFKREIVNFLSCLRGFRFDFRCFRG P+SD+TT+DE        
Sbjct: 121 NKKKNP---IRNQNPFGFKREIVNFLSCLRGFRFDFRCFRGFPQSDITTDDEDDEEEVFE 180

Query: 181 ----ESEPESESEEEPAEASRTMFSKWFMVLQEEEEETRP-RNGGVSAAEAEPS--SGPP 240
               ESE +SESEEEP    RTMFS+WFMVLQE EEET+P  N  VS+ E +PS  S PP
Sbjct: 181 QHEPESESQSESEEEPTR-GRTMFSEWFMVLQEGEEETKPINNDAVSSLEFQPSSVSVPP 240

Query: 241 PNALLLMRCRSAPSMGCLQRNPK------------AKAEEEQQSKISLKLLM--EKEEVN 300
           PNALLLMRCRSAP+   L R PK             + EEE++SKISLKLLM  EKE V 
Sbjct: 241 PNALLLMRCRSAPN--SLLRKPKQQEEKEEEEEEEEEEEEEEKSKISLKLLMEEEKEMVT 300

Query: 301 PKNESLVVMDYDADFYRLSSDIAKETWVVSGSNKSNES----DDPLIRSRSWKR 320
            K +SL+VMDYDADFY+LSSDIAKETWVVSGS+ S+ S    DDPL+RSRSWKR
Sbjct: 301 AKKKSLMVMDYDADFYKLSSDIAKETWVVSGSSTSSSSSRCNDDPLLRSRSWKR 330

BLAST of MS001081 vs. TAIR 10
Match: AT1G78110.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 7 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G22230.1); Has 5452 Blast hits to 3541 proteins in 289 species: Archae - 4; Bacteria - 165; Metazoa - 1756; Fungi - 532; Plants - 205; Viruses - 141; Other Eukaryotes - 2649 (source: NCBI BLink). )

HSP 1 Score: 243.0 bits (619), Expect = 3.2e-64
Identity = 160/345 (46.38%), Positives = 207/345 (60.00%), Query Frame = 0

Query: 20  ADLLVCFPARARLTLLPKPACSPARAPADSALTRRHHHPKKPPAVA--AAPAHASPLLW- 79
           ADLLVCFP+R  L L PKP CSP+R P+DS+  RR HH ++   ++      H SP+LW 
Sbjct: 18  ADLLVCFPSRTHLALTPKPICSPSR-PSDSSTNRRPHHRRQLSKLSGGGGGGHGSPVLWA 77

Query: 80  ---GAKEMG----SEPTSPKVTCAGQIKIRPHAT----KNWQSVMEEIERIHKKKKIPNS 139
               +K MG    +EPTSPKVTCAGQIK+RP       KNWQSVMEEIERIH        
Sbjct: 78  KQASSKNMGGDEIAEPTSPKVTCAGQIKVRPSKCGGRGKNWQSVMEEIERIHD------- 137

Query: 140 RNRNRNP-LGFKREIVNFLSCLRGFRFDFRCFRGIPESDVTTEDESESEPESESEEEPAE 199
            NR+++   G K++++ FL+CLR  +FDFRCF     +DVT++D+ E + + + EEE  E
Sbjct: 138 -NRSQSKFFGLKKDVMGFLTCLRNIKFDFRCFGDFRHADVTSDDDEEEDDDDDEEEEVVE 197

Query: 200 A-----SRTMFSKWFMVLQEEE---EETRPRN-----GGVSAAEAEPSSGPPPNALLLMR 259
                 S+T+FSKWFMVLQEE+   ++ +  N       +   E EP+  PPPNALLLMR
Sbjct: 198 GEEEENSKTVFSKWFMVLQEEQNNKDDDKNNNKCDEKRDLEDTETEPAV-PPPNALLLMR 257

Query: 260 CRSAPSMGCLQRNPKAKAEEEQQ------------------SKISLKLLMEKEEVNPKNE 319
           CRSAP+   L+   K K E+E++                   K  L+ LME+E++     
Sbjct: 258 CRSAPAKSWLEERMKVKTEQEKREEQKEEKETEDQETSMKTKKKDLRSLMEEEKM----- 317

BLAST of MS001081 vs. TAIR 10
Match: AT1G22230.1 (unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G78110.1); Has 2358 Blast hits to 1759 proteins in 159 species: Archae - 2; Bacteria - 36; Metazoa - 1046; Fungi - 203; Plants - 157; Viruses - 72; Other Eukaryotes - 842 (source: NCBI BLink). )

HSP 1 Score: 199.5 bits (506), Expect = 4.0e-51
Identity = 139/331 (41.99%), Positives = 185/331 (55.89%), Query Frame = 0

Query: 14  KPTAPPADLLVCFPARARLTLLPKPACSPARAPADSALTRRH---HHPKKPPAVAAAPAH 73
           K +   ADL+VCFP+RA L+L  K   SP+     S+  RR    HH +    ++++   
Sbjct: 8   KSSGYSADLMVCFPSRAHLSLPSKSISSPS-----SSFNRRQNAPHHRRSISKLSSSGGG 67

Query: 74  ASPLLWGAKEMGSEPTSPKVTCAGQIKIRPH----ATKNWQSVMEEIERIHKKKKIPNSR 133
                 G +E+  EPTSPKVTCAGQIK+R        KNWQS+M EIE+IH+ K      
Sbjct: 68  VRQNRGGGREVVEEPTSPKVTCAGQIKVRSSKRDGGGKNWQSLMAEIEKIHRSKS----- 127

Query: 134 NRNRNPLGFKREIVNFLSCLRGFRFDFRCFRGIPESDVTTEDESESEPESE--SEEEPAE 193
                  G KR+++ FL+CLR   FDFRCF   P  D+ ++DE E E E E   EE+  E
Sbjct: 128 --ESKFFGIKRDVMGFLTCLRD--FDFRCFGAFPPVDIISDDEEEDEEEEEEDEEEDEDE 187

Query: 194 ASRTMFSKWFMVLQEEEEETRPRNGGVSAAEAEPSSGPPPNALLLMRCRSAPSMGCLQRN 253
           +S T+FSKW MVL E++      +G  +      ++ PPPNALLLMRCRSAP     +  
Sbjct: 188 SSGTVFSKWLMVLHEKQNNEECVDGKENVFSDVETAVPPPNALLLMRCRSAPVKNWSEEK 247

Query: 254 P----------KAKAEEEQQ------SKISLKLLMEKEEVNPKNESLVVMDYDADFYRLS 313
                      K   EEE++      +K  L+ LME+E    K  +LVVM+YD ++Y+LS
Sbjct: 248 KEETEEGDNRVKQSGEEEEEEKDRVGNKKDLRSLMEEE----KKMNLVVMNYDTNYYKLS 307

Query: 314 SDIAKETWVVSGSNKSNESDDPLIRSRSWKR 320
           +DIAKETWVV G        DPL RSRSWK+
Sbjct: 308 NDIAKETWVVGG------IQDPLFRSRSWKK 314

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name      E-value   Identity (%)  Description
XP_022131621.1  7.3e-172  99.37         uncharacterized protein LOC111004752 [Momordica charantia]
XP_022962582.1  1.2e-105  71.83         uncharacterized protein LOC111462985 [Cucurbita moschata]
KAG6598680.1    1.6e-105  70.36         hypothetical protein SDJN03_08458, partial [Cucurbita argyrosperma subsp. sorori...
XP_023546364.1  4.5e-105  69.01         uncharacterized protein LOC111805493 [Cucurbita pepo subsp. pepo]
KAA0064832.1    7.5e-100  67.63         myotubularin-related protein [Cucumis melo var. makuwa]

vs. ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A6J1BQ03      3.5e-172  99.37         uncharacterized protein LOC111004752 OS=Momordica charantia OX=3673 GN=LOC111004...
A0A6J1HD24      5.8e-106  71.83         uncharacterized protein LOC111462985 OS=Cucurbita moschata OX=3662 GN=LOC1114629...
A0A5A7V8Z9      3.6e-100  67.63         Myotubularin-related protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_sca...
A0A1S3BDB9      1.5e-98   67.35         uncharacterized protein LOC103488403 OS=Cucumis melo OX=3656 GN=LOC103488403 PE=...
A0A0A0LMI6      1.4e-96   64.69         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G345900 PE=4 SV=1

vs. TAIR 10
Match Name      E-value   Identity (%)  Description
AT1G78110.1     3.2e-64   46.38         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G22230.1     4.0e-51   41.99         unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein matc...

InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term    Source Description                  Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                 coord: 38..68
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                 coord: 165..180
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                 coord: 298..319
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                 coord: 201..228
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                 coord: 163..186
None      No IPR available  PANTHER      PTHR33448      CHLOROPLAST PROTEIN HCF243-RELATED  coord: 15..319
None      No IPR available  PANTHER      PTHR33448:SF3  OS09G0370000 PROTEIN                coord: 15..319
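
Two of the mobidb-lite disorder predictions listed above overlap (163..186 contains 165..180), so summing the raw ranges would double-count residues. The following is a minimal sketch of merging the ranges before counting; `merge_intervals` and `covered_length` are hypothetical helper names, and the coordinates are assumed to be 1-based and inclusive, as is usual for InterPro-style output.

```python
def merge_intervals(intervals):
    """Merge overlapping or adjacent 1-based inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        # Overlap or direct adjacency with the previous interval: extend it.
        if merged and start <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def covered_length(intervals):
    """Number of distinct positions covered by the intervals."""
    return sum(end - start + 1 for start, end in merge_intervals(intervals))


if __name__ == "__main__":
    # mobidb-lite coordinates as listed in the InterPro table on this page.
    disorder = [(38, 68), (165, 180), (298, 319), (201, 228), (163, 186)]
    print(merge_intervals(disorder))
    print(covered_length(disorder))
```

Under these assumptions the five predictions collapse to four regions (38..68, 163..186, 201..228, 298..319) covering 105 residues.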

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
MS001081.1    MS001081.1   mRNA