MS022205 (gene) Bitter gourd (TR) v1

Overview
Name: MS022205
Type: gene
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: LEA_2 domain-containing protein
Location: scaffold47: 1667851 .. 1670865 (+)
RNA-Seq Expression: MS022205
Synteny: MS022205
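
The Location field can be turned into a feature length with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive at both ends, which is the usual convention for GFF-style annotation but is not stated on this page. The helper name `span_length` is hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates.

    Assumption: both ends are inclusive (GFF-style), so one is added.
    """
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


# Coordinates copied from the Location field of this record (scaffold47, + strand).
gene_length = span_length(1667851, 1670865)
print(gene_length)  # 3015
```

If the inclusive-coordinate assumption holds, the genomic sequence listed for this feature would be expected to have the same length.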
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCACGCGAAATCGTACTCGGAGGTGACGAGCGTGGAGCAGTCGTCGCCGGCGCGATCGCCGCGGCGGCCGCTCTACTACGTGCAGAGCCCGTCGAACCACGACGTGGAGAAAATGTCGTACGGGTCGAGCCCGATGGGGTCGCCGCCGCACCACTTCTACCACGCCTCCCCCATCCACCACTCCCGCGAGTCCTCCACCTCCCGCTTCTCCGCCTCCCTCAAGCCTAACCCCCGCAACCTCGCCGCCTGGAGGAAGCTCCACCGCCCCCTCGAATCCGACGCCGACGACGACGATGACGACGCCACCGCCGCCGCCGACGACCGCGATTCCGAATGGACCCGGAAGTTCCGGCTTTACTTGTTCTTGTTCGTTTTCTTCGTACTTCTCTTCACTGTGTTCTCCCTCATCCTCTGGGGCGCCAGCAGATCCTTCCACCCCCAAATCATCCTTCAGGTACACCCCATTGCAAATTTATTTTCATTGATTTTGGTTTTCATCTCTAATTTTGACTTTTTCATCTAGTTTTACACGCATTAATCGCTTATTATTATTATTATTATTATTATTATTTTTTGTGTGAAAAGCACATTAATGGCTTTTAGTGTTTGTGTTAACAAGAAATTGTTTTGTAGCTGGGACATTAAGTGGGAAGAAATGTATACATATTATTCTAAATATATATATATACATACACATTTTCTTTTGAATTCTAAGTTTCTAACATGTTGGGGGGATTGGAGCTCCAAACTAATTTAACTATAGAAAATATGAATAATATTTATGTATTGAATGGTAATGAATTTTTTTTAGTACAATAAATAGGAGGTATTAAGTTTGACTTTTGATCTCAAGAAAAGGAGCATGAATCTCCTTACTTGTGGTTCTAACTTAAGATAATTATTTGAAATCTAAAAATTATTTTTTTAATAATAACTTAAATGTACTTATAAGGGGTAAAAATATACGCAAAAAATTACAAGTATAATATTTAAAAATAAAAAATAAAAAATTTAAAAGAATACATTGCATAGTTTGAAGAGAAATTACAGCCACTTATCCCCAACCGGAATTCGAATCACCAAATCCCTATATTGTACTAGAAAAAGAAATTAAGAAAAAAGGGAAATATACTCAAATGGGATTTAAATAGGTTAGTTTATTTATTCTTTGGCAATTTCAGTACCTGAAAAGGTGGAAATTGTTATTATTAAATATATATGAAAAGCCGAAAAGTAGCATGTTTTGAGTTGGGAAAAATAAAAAAGGTGATAATATTTATATTATATTTATTGTGGAACAGAGTATGGTATTCGAGAGGTTTAACGTACAAGCGGGGAGTGATGCGGGGGGAGTGGCAACTGATCTGATGTCATTAAATTCGACGGTCAGGATCAAGTACAGAAATCCGGCCACGTTTTTCGGTGTCCACGTCAGCTCTTCCCCCATCCAGCTCAACTATTTCCAGCTCCAAATCGCCTCCGGCCAGGTATTTTTATTGTTTTAATTACATTTAGGCATGGCTGGCACGTCGTATCGTATCGTAGCCCGTAGGTTTGTGTGTTTTTGTTGCGGGTTTTGGCCCTTAGGTTTTGAATATTACAAAAGGGTGGTGGGGATCGGGGAATAGTGTAAATAGTGGTTTTTGTGGGGGTGAGGAATGGTAATTTGAGGAGACTTTTCAGACATTGGGAGCTGGGACGATTATAATTCGCAATCAGAAGTCTGCAAAGTCTGAGCAGTAGTAGTCTTTTTCATACTGGCAGTCTCTTTTGTAATTTTCCCGTTGGAGATAAAAAGGAGGATTTTTGGAAACTGTCCTATTTTATTCTTTTTTCACATTTTAAATTATCATAGGTTCAATCTTTTTTTTTAATCATATAGATTATTGTACCATGACCTTAGAATTTTTATGAGGTTTGTTTCAAGCCTCTCAAAACTTTTTTATTAATTGTTTAATAGGCGGATAATGTAGGTTTTTATTTATTTATTATTTTTTT
TTGTAGTTTTATGTATATTGATGTGCTGACGTGGGTGTCTATTAGATCGATGACATAATAAAAATGATCTAAAAAGTCACGATTTTATAGTTTTTTTCCTTCTAATTTCTCCATCAATATTCAAAATTGAAAAATTTTGTATATAAAAAATTCAGATGTAAAATTTGGTCCATCCGTCCTTCATAGTTCATAGATTTAAACACCCACGTCAGCACATCAATATACATAAATTTTTTTTAGTACAAGTTATCGGGATGAGGATCGAACTCTCAACTTCATAAGTTGGATATATATACTTTAATTCGTTAAACTATGCGTGGATGACCAAAAATAGTTAATTTTTAAGGTTAGGAACTAAAGTAAATATAAATTTAGGAATGAAAATGAGATTTAAACTAGAGCTTTATCTCAAAATTTTCTTCTCTCCCTTTTTCCTCTCTAAGCTATTGATCCTATCCTTTTCAATTTCTATTTAAATTAGGTATAATATGTTCGTATCCTCATTCCAACTGACTAATAATTATATAGATATATAGATGTTAATTACCAATCAAACATTTTCCTTTAAAATTTTGCAATGGAGAAAATGAGAGGGAGTAAAGGAAATGAAACAAAGAATCAAATAATGCTTTAGCAATGGTACATATAGGTGGAGATCAAACATTTAACCTTAAAGGAAAGAAGTAGAGATGTTTTAACAACTGAACTTATATATATATATTTAAATTAGATGATGGAGTTCTACGAAAAAAGGCAGAGCTCTCGGAGGGTGGCGACAGCGGTGGCGGGGCACCAGGTGCCGCTGTACGGTGGGATCGCGGTGATCGGAAACTGGAGGGAGCAGCGACAGGAGGGGGTGGAGGTGCCGCTGAACCTGACGGTGGCGGTGAGGTCAAGAGCTTACATTCTGGGGAAGCTGGTGAAGTCCACATTCCACACCACAATTACTTGCTCACTCACTCTCAGAACTAAGAATCTTGGCAAATCCCACTCTCTCAACAATTCTTGCATTTAC

mRNA sequence

ATGCACGCGAAATCGTACTCGGAGGTGACGAGCGTGGAGCAGTCGTCGCCGGCGCGATCGCCGCGGCGGCCGCTCTACTACGTGCAGAGCCCGTCGAACCACGACGTGGAGAAAATGTCGTACGGGTCGAGCCCGATGGGGTCGCCGCCGCACCACTTCTACCACGCCTCCCCCATCCACCACTCCCGCGAGTCCTCCACCTCCCGCTTCTCCGCCTCCCTCAAGCCTAACCCCCGCAACCTCGCCGCCTGGAGGAAGCTCCACCGCCCCCTCGAATCCGACGCCGACGACGACGATGACGACGCCACCGCCGCCGCCGACGACCGCGATTCCGAATGGACCCGGAAGTTCCGGCTTTACTTGTTCTTGTTCGTTTTCTTCGTACTTCTCTTCACTGTGTTCTCCCTCATCCTCTGGGGCGCCAGCAGATCCTTCCACCCCCAAATCATCCTTCAGAGTATGGTATTCGAGAGGTTTAACGTACAAGCGGGGAGTGATGCGGGGGGAGTGGCAACTGATCTGATGTCATTAAATTCGACGGTCAGGATCAAGTACAGAAATCCGGCCACGTTTTTCGGTGTCCACGTCAGCTCTTCCCCCATCCAGCTCAACTATTTCCAGCTCCAAATCGCCTCCGGCCAGATGATGGAGTTCTACGAAAAAAGGCAGAGCTCTCGGAGGGTGGCGACAGCGGTGGCGGGGCACCAGGTGCCGCTGTACGGTGGGATCGCGGTGATCGGAAACTGGAGGGAGCAGCGACAGGAGGGGGTGGAGGTGCCGCTGAACCTGACGGTGGCGGTGAGGTCAAGAGCTTACATTCTGGGGAAGCTGGTGAAGTCCACATTCCACACCACAATTACTTGCTCACTCACTCTCAGAACTAAGAATCTTGGCAAATCCCACTCTCTCAACAATTCTTGCATTTAC

Coding sequence (CDS)

ATGCACGCGAAATCGTACTCGGAGGTGACGAGCGTGGAGCAGTCGTCGCCGGCGCGATCGCCGCGGCGGCCGCTCTACTACGTGCAGAGCCCGTCGAACCACGACGTGGAGAAAATGTCGTACGGGTCGAGCCCGATGGGGTCGCCGCCGCACCACTTCTACCACGCCTCCCCCATCCACCACTCCCGCGAGTCCTCCACCTCCCGCTTCTCCGCCTCCCTCAAGCCTAACCCCCGCAACCTCGCCGCCTGGAGGAAGCTCCACCGCCCCCTCGAATCCGACGCCGACGACGACGATGACGACGCCACCGCCGCCGCCGACGACCGCGATTCCGAATGGACCCGGAAGTTCCGGCTTTACTTGTTCTTGTTCGTTTTCTTCGTACTTCTCTTCACTGTGTTCTCCCTCATCCTCTGGGGCGCCAGCAGATCCTTCCACCCCCAAATCATCCTTCAGAGTATGGTATTCGAGAGGTTTAACGTACAAGCGGGGAGTGATGCGGGGGGAGTGGCAACTGATCTGATGTCATTAAATTCGACGGTCAGGATCAAGTACAGAAATCCGGCCACGTTTTTCGGTGTCCACGTCAGCTCTTCCCCCATCCAGCTCAACTATTTCCAGCTCCAAATCGCCTCCGGCCAGATGATGGAGTTCTACGAAAAAAGGCAGAGCTCTCGGAGGGTGGCGACAGCGGTGGCGGGGCACCAGGTGCCGCTGTACGGTGGGATCGCGGTGATCGGAAACTGGAGGGAGCAGCGACAGGAGGGGGTGGAGGTGCCGCTGAACCTGACGGTGGCGGTGAGGTCAAGAGCTTACATTCTGGGGAAGCTGGTGAAGTCCACATTCCACACCACAATTACTTGCTCACTCACTCTCAGAACTAAGAATCTTGGCAAATCCCACTCTCTCAACAATTCTTGCATTTAC

Protein sequence

MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIHHSRESSTSRFSASLKPNPRNLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKFRLYLFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSLNSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQVPLYGGIAVIGNWREQRQEGVEVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTKNLGKSHSLNNSCIY
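
The protein sequence above corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch of that step, assuming the standard nuclear genetic code; the function name `translate` and the compact table layout are illustrative choices, not part of the database's pipeline. Only the first 30 bases of the CDS are used here to keep the example short.

```python
# Standard genetic code, laid out for codons enumerated in T, C, A, G order
# at each of the three positions (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks stop codons.

    Codons containing characters other than A/C/G/T become 'X'.
    Trailing bases that do not fill a complete codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[i:i + 3]
        try:
            index = (16 * BASES.index(codon[0])
                     + 4 * BASES.index(codon[1])
                     + BASES.index(codon[2]))
        except ValueError:
            protein.append("X")
            continue
        protein.append(AMINO_ACIDS[index])
    return "".join(protein)


# First 30 bases of the CDS listed for MS022205.
cds_prefix = "ATGCACGCGAAATCGTACTCGGAGGTGACG"
print(translate(cds_prefix))  # MHAKSYSEVT
```

The output matches the first ten residues of the protein sequence shown for this feature. Passing the full CDS string to `translate` should reproduce the complete protein under the same assumption.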
Homology
BLAST of MS022205 vs. NCBI nr
Match: XP_022135688.1 (uncharacterized protein LOC111007587 isoform X1 [Momordica charantia] >XP_022135689.1 uncharacterized protein LOC111007587 isoform X2 [Momordica charantia])

HSP 1 Score: 599.4 bits (1544), Expect = 1.8e-167
Identity = 306/309 (99.03%), Positives = 306/309 (99.03%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60
           MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH
Sbjct: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60

Query: 61  HSRESSTSRFSASLKPNPRNLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKFRLY 120
           HSRESSTSRFSASLKPN RNLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKFRLY
Sbjct: 61  HSRESSTSRFSASLKPNXRNLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKFRLY 120

Query: 121 LFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSLNST 180
           LFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSLNST
Sbjct: 121 LFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSLNST 180

Query: 181 VRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQVPLY 240
           VRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQVPLY
Sbjct: 181 VRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQVPLY 240

Query: 241 GGIAVIGNWREQRQEGVEVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTKNLGKS 300
           GGIAVIGNWREQRQEGVEVPLNLTVAVRSRAYILGKLVKSTFH TITCSLTLRTKNLGK 
Sbjct: 241 GGIAVIGNWREQRQEGVEVPLNLTVAVRSRAYILGKLVKSTFHXTITCSLTLRTKNLGKF 300

Query: 301 HSLNNSCIY 310
           HSLNNSCIY
Sbjct: 301 HSLNNSCIY 309
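
The Identity and Positives percentages reported for each HSP are the match counts divided by the alignment length. A small sketch of that calculation follows; the helper name `percent` is hypothetical, and two-decimal rounding is assumed because that is how the values appear on this page.

```python
def percent(matches: int, aligned_length: int) -> str:
    """Return matches / aligned_length as a percentage with two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return f"{100 * matches / aligned_length:.2f}"


# Counts copied from the HSP summary lines of the first two NCBI nr hits.
print(percent(306, 309))  # 99.03  (identity, XP_022135688.1)
print(percent(261, 312))  # 83.65  (identity, XP_038888376.1)
print(percent(283, 312))  # 90.71  (positives, XP_038888376.1)
```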

BLAST of MS022205 vs. NCBI nr
Match: XP_038888376.1 (uncharacterized protein LOC120078225 [Benincasa hispida])

HSP 1 Score: 513.1 bits (1320), Expect = 1.7e-141
Identity = 261/312 (83.65%), Positives = 283/312 (90.71%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60
           MHAKSYSEVTS++QSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH
Sbjct: 1   MHAKSYSEVTSMDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60

Query: 61  HSRESSTSRFSASLKPNPR---NLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKF 120
           HSRESSTSRFSASLK NP    NL+AWRKLHRP +SD DD++DD     DDRDS+W RKF
Sbjct: 61  HSRESSTSRFSASLKNNPNRNGNLSAWRKLHRPQDSD-DDEEDDEDEENDDRDSKWNRKF 120

Query: 121 RLYLFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSL 180
           RLYLFLF+ FVLLFTVFSLILWGASRSFHPQI++QSMVFE+FNVQAGSD GGVATDLMSL
Sbjct: 121 RLYLFLFLLFVLLFTVFSLILWGASRSFHPQILIQSMVFEKFNVQAGSDPGGVATDLMSL 180

Query: 181 NSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQV 240
           NSTVRI YRNPATFFGVHVSS+P  L Y+QLQIASGQM EFY+KRQSSRRV T+VAGHQ+
Sbjct: 181 NSTVRITYRNPATFFGVHVSSTPFHLQYYQLQIASGQMEEFYQKRQSSRRVKTSVAGHQI 240

Query: 241 PLYGGIAVIGNWREQRQE--GVEVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTK 300
           PLYGGI+ IGNWR+QRQ+  GVE+PLNLTVAVRSRAYILG+LVKSTFHTTITC +TL TK
Sbjct: 241 PLYGGISAIGNWRDQRQDGVGVEIPLNLTVAVRSRAYILGRLVKSTFHTTITCPITLSTK 300

Query: 301 NLGKSHSLNNSC 308
            LGK HS NNSC
Sbjct: 301 KLGKFHSFNNSC 311

BLAST of MS022205 vs. NCBI nr
Match: XP_008447896.1 (PREDICTED: uncharacterized protein LOC103490245 [Cucumis melo])

HSP 1 Score: 508.4 bits (1308), Expect = 4.2e-140
Identity = 258/314 (82.17%), Positives = 282/314 (89.81%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60
           MHAKSYSEVTSV+QSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH
Sbjct: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60

Query: 61  HSRESSTSRFSASLKPNPR---NLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKF 120
           HSRESSTSRFSASLK N     N++AWRKLH   +SD DD++DD     +DRDS+W RKF
Sbjct: 61  HSRESSTSRFSASLKSNQNRNGNVSAWRKLHLAEDSDDDDEEDDGDEENEDRDSKWNRKF 120

Query: 121 RLYLFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSL 180
           RLYL LF+FFVLLFTVFSLILWGAS+SFHPQI++QSMVF +FNVQAGSD GGVATDLMSL
Sbjct: 121 RLYLILFLFFVLLFTVFSLILWGASKSFHPQILIQSMVFSKFNVQAGSDPGGVATDLMSL 180

Query: 181 NSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQV 240
           NSTVRI YRNPATFFGVHVSS+P QL+YFQLQIASGQM EFY+KRQSSRR+ T+VAGHQV
Sbjct: 181 NSTVRISYRNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRRMVTSVAGHQV 240

Query: 241 PLYGGIAVIGNWREQRQE--GVEVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTK 300
           PLYGGI+ IGNWR+QRQ+  GVEV LNLTVAVRSRAYILG+LVKSTFHTTITC +TL TK
Sbjct: 241 PLYGGISAIGNWRDQRQDGVGVEVSLNLTVAVRSRAYILGRLVKSTFHTTITCPITLSTK 300

Query: 301 NLGKSHSLNNSCIY 310
            LGKSHS NN+C Y
Sbjct: 301 KLGKSHSFNNTCTY 314

BLAST of MS022205 vs. NCBI nr
Match: XP_004144875.1 (uncharacterized protein LOC101215215 [Cucumis sativus] >KGN43297.1 hypothetical protein Csa_020295 [Cucumis sativus])

HSP 1 Score: 507.7 bits (1306), Expect = 7.2e-140
Identity = 256/314 (81.53%), Positives = 282/314 (89.81%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60
           MHAKSYSEVTSV+QSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH
Sbjct: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60

Query: 61  HSRESSTSRFSASLKPNPR---NLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKF 120
           HSRESSTSRFSASLK N     N++AWRKLH   +SD DD++DD     +DRDS+W RKF
Sbjct: 61  HSRESSTSRFSASLKINQNRNGNVSAWRKLHHAQDSDGDDEEDDEEEENEDRDSKWNRKF 120

Query: 121 RLYLFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSL 180
           RLYL LF+FF+LLFTVFSLILWGAS+SFHPQI++QSMVF +FNVQAGSD GGVATDLMSL
Sbjct: 121 RLYLILFLFFILLFTVFSLILWGASKSFHPQILIQSMVFSKFNVQAGSDPGGVATDLMSL 180

Query: 181 NSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQV 240
           NSTVRI Y+NPATFFGVHVSS+PIQL+Y QLQ+ASGQM EFY+KRQSSRRV T+VAGHQV
Sbjct: 181 NSTVRISYKNPATFFGVHVSSTPIQLHYLQLQVASGQMEEFYQKRQSSRRVVTSVAGHQV 240

Query: 241 PLYGGIAVIGNWREQRQE--GVEVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTK 300
           PLYGGI+ IGNWR+QRQ+  GVEV LNLTVAVRSRAYILG+LVKSTFHTTITC +TL T 
Sbjct: 241 PLYGGISAIGNWRDQRQDGAGVEVSLNLTVAVRSRAYILGRLVKSTFHTTITCPITLSTN 300

Query: 301 NLGKSHSLNNSCIY 310
            LGKSHS NN+CIY
Sbjct: 301 KLGKSHSFNNTCIY 314

BLAST of MS022205 vs. NCBI nr
Match: XP_022928427.1 (uncharacterized protein LOC111435243 [Cucurbita moschata])

HSP 1 Score: 501.5 bits (1290), Expect = 5.1e-138
Identity = 256/312 (82.05%), Positives = 278/312 (89.10%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60
           MHAKSYSEVTSV+QSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPH FYHASPIH
Sbjct: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH 60

Query: 61  HSRESSTSRFSASLKPNPR---NLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKF 120
           HSRESSTSRFSASLK N     NL+AWRKLHRP   D ++DDDD      DRDS+W RKF
Sbjct: 61  HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEDDDDDGDNDGDRDSKWNRKF 120

Query: 121 RLYLFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSL 180
           RLYLFLFV FVLLFTVFSLILWGAS+SFHPQI++QSMVFE+FNVQAGSD GGVATDLMSL
Sbjct: 121 RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL 180

Query: 181 NSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQV 240
           NSTVRI Y+NPATFFGVHVSS+P QL+YFQLQIASGQM EFY+KRQSSR+V T+V+GHQV
Sbjct: 181 NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV 240

Query: 241 PLYGGIAVIGNWREQRQEGVEVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTKNL 300
           PLYGGI+ IGNWR+QRQ+GVEV LNLTVAVRSRAYILG+LVKSTFHT ITC +TL  K L
Sbjct: 241 PLYGGISAIGNWRDQRQDGVEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNKKL 300

Query: 301 GKSHSLNNSCIY 310
           GKSHS N +C Y
Sbjct: 301 GKSHSFNKTCTY 312

BLAST of MS022205 vs. ExPASy TrEMBL
Match: A0A6J1C5K4 (uncharacterized protein LOC111007587 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111007587 PE=4 SV=1)

HSP 1 Score: 599.4 bits (1544), Expect = 8.8e-168
Identity = 306/309 (99.03%), Positives = 306/309 (99.03%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60
           MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH
Sbjct: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60

Query: 61  HSRESSTSRFSASLKPNPRNLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKFRLY 120
           HSRESSTSRFSASLKPN RNLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKFRLY
Sbjct: 61  HSRESSTSRFSASLKPNXRNLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKFRLY 120

Query: 121 LFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSLNST 180
           LFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSLNST
Sbjct: 121 LFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSLNST 180

Query: 181 VRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQVPLY 240
           VRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQVPLY
Sbjct: 181 VRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQVPLY 240

Query: 241 GGIAVIGNWREQRQEGVEVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTKNLGKS 300
           GGIAVIGNWREQRQEGVEVPLNLTVAVRSRAYILGKLVKSTFH TITCSLTLRTKNLGK 
Sbjct: 241 GGIAVIGNWREQRQEGVEVPLNLTVAVRSRAYILGKLVKSTFHXTITCSLTLRTKNLGKF 300

Query: 301 HSLNNSCIY 310
           HSLNNSCIY
Sbjct: 301 HSLNNSCIY 309

BLAST of MS022205 vs. ExPASy TrEMBL
Match: A0A1S3BJ42 (uncharacterized protein LOC103490245 OS=Cucumis melo OX=3656 GN=LOC103490245 PE=4 SV=1)

HSP 1 Score: 508.4 bits (1308), Expect = 2.0e-140
Identity = 258/314 (82.17%), Positives = 282/314 (89.81%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60
           MHAKSYSEVTSV+QSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH
Sbjct: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60

Query: 61  HSRESSTSRFSASLKPNPR---NLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKF 120
           HSRESSTSRFSASLK N     N++AWRKLH   +SD DD++DD     +DRDS+W RKF
Sbjct: 61  HSRESSTSRFSASLKSNQNRNGNVSAWRKLHLAEDSDDDDEEDDGDEENEDRDSKWNRKF 120

Query: 121 RLYLFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSL 180
           RLYL LF+FFVLLFTVFSLILWGAS+SFHPQI++QSMVF +FNVQAGSD GGVATDLMSL
Sbjct: 121 RLYLILFLFFVLLFTVFSLILWGASKSFHPQILIQSMVFSKFNVQAGSDPGGVATDLMSL 180

Query: 181 NSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQV 240
           NSTVRI YRNPATFFGVHVSS+P QL+YFQLQIASGQM EFY+KRQSSRR+ T+VAGHQV
Sbjct: 181 NSTVRISYRNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRRMVTSVAGHQV 240

Query: 241 PLYGGIAVIGNWREQRQE--GVEVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTK 300
           PLYGGI+ IGNWR+QRQ+  GVEV LNLTVAVRSRAYILG+LVKSTFHTTITC +TL TK
Sbjct: 241 PLYGGISAIGNWRDQRQDGVGVEVSLNLTVAVRSRAYILGRLVKSTFHTTITCPITLSTK 300

Query: 301 NLGKSHSLNNSCIY 310
            LGKSHS NN+C Y
Sbjct: 301 KLGKSHSFNNTCTY 314

BLAST of MS022205 vs. ExPASy TrEMBL
Match: A0A0A0K4T2 (LEA_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G018790 PE=4 SV=1)

HSP 1 Score: 507.7 bits (1306), Expect = 3.5e-140
Identity = 256/314 (81.53%), Positives = 282/314 (89.81%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60
           MHAKSYSEVTSV+QSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH
Sbjct: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60

Query: 61  HSRESSTSRFSASLKPNPR---NLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKF 120
           HSRESSTSRFSASLK N     N++AWRKLH   +SD DD++DD     +DRDS+W RKF
Sbjct: 61  HSRESSTSRFSASLKINQNRNGNVSAWRKLHHAQDSDGDDEEDDEEEENEDRDSKWNRKF 120

Query: 121 RLYLFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSL 180
           RLYL LF+FF+LLFTVFSLILWGAS+SFHPQI++QSMVF +FNVQAGSD GGVATDLMSL
Sbjct: 121 RLYLILFLFFILLFTVFSLILWGASKSFHPQILIQSMVFSKFNVQAGSDPGGVATDLMSL 180

Query: 181 NSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQV 240
           NSTVRI Y+NPATFFGVHVSS+PIQL+Y QLQ+ASGQM EFY+KRQSSRRV T+VAGHQV
Sbjct: 181 NSTVRISYKNPATFFGVHVSSTPIQLHYLQLQVASGQMEEFYQKRQSSRRVVTSVAGHQV 240

Query: 241 PLYGGIAVIGNWREQRQE--GVEVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTK 300
           PLYGGI+ IGNWR+QRQ+  GVEV LNLTVAVRSRAYILG+LVKSTFHTTITC +TL T 
Sbjct: 241 PLYGGISAIGNWRDQRQDGAGVEVSLNLTVAVRSRAYILGRLVKSTFHTTITCPITLSTN 300

Query: 301 NLGKSHSLNNSCIY 310
            LGKSHS NN+CIY
Sbjct: 301 KLGKSHSFNNTCIY 314

BLAST of MS022205 vs. ExPASy TrEMBL
Match: A0A6J1EJW4 (uncharacterized protein LOC111435243 OS=Cucurbita moschata OX=3662 GN=LOC111435243 PE=4 SV=1)

HSP 1 Score: 501.5 bits (1290), Expect = 2.5e-138
Identity = 256/312 (82.05%), Positives = 278/312 (89.10%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60
           MHAKSYSEVTSV+QSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPH FYHASPIH
Sbjct: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH 60

Query: 61  HSRESSTSRFSASLKPNPR---NLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKF 120
           HSRESSTSRFSASLK N     NL+AWRKLHRP   D ++DDDD      DRDS+W RKF
Sbjct: 61  HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEDDDDDGDNDGDRDSKWNRKF 120

Query: 121 RLYLFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSL 180
           RLYLFLFV FVLLFTVFSLILWGAS+SFHPQI++QSMVFE+FNVQAGSD GGVATDLMSL
Sbjct: 121 RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL 180

Query: 181 NSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQV 240
           NSTVRI Y+NPATFFGVHVSS+P QL+YFQLQIASGQM EFY+KRQSSR+V T+V+GHQV
Sbjct: 181 NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV 240

Query: 241 PLYGGIAVIGNWREQRQEGVEVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTKNL 300
           PLYGGI+ IGNWR+QRQ+GVEV LNLTVAVRSRAYILG+LVKSTFHT ITC +TL  K L
Sbjct: 241 PLYGGISAIGNWRDQRQDGVEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNKKL 300

Query: 301 GKSHSLNNSCIY 310
           GKSHS N +C Y
Sbjct: 301 GKSHSFNKTCTY 312

BLAST of MS022205 vs. ExPASy TrEMBL
Match: A0A6J1JK28 (uncharacterized protein LOC111486495 OS=Cucurbita maxima OX=3661 GN=LOC111486495 PE=4 SV=1)

HSP 1 Score: 498.4 bits (1282), Expect = 2.1e-137
Identity = 254/312 (81.41%), Positives = 278/312 (89.10%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHHFYHASPIH 60
           MHAKSYSEVTSV+QSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPH FYHASPIH
Sbjct: 1   MHAKSYSEVTSVDQSSPARSPRRPLYYVQSPSNHDVEKMSYGSSPMGSPPHPFYHASPIH 60

Query: 61  HSRESSTSRFSASLKPNPR---NLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKF 120
           HSRESSTSRFSASLK N     NL+AWRKLHRP   D ++++DD      DRDS+W RKF
Sbjct: 61  HSRESSTSRFSASLKNNMNRNGNLSAWRKLHRPPGYDEEEEEDDDGDNDGDRDSKWNRKF 120

Query: 121 RLYLFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSL 180
           RLYLFLFV FVLLFTVFSLILWGAS+SFHPQI++QSMVFE+FNVQAGSD GGVATDLMSL
Sbjct: 121 RLYLFLFVLFVLLFTVFSLILWGASKSFHPQILVQSMVFEKFNVQAGSDPGGVATDLMSL 180

Query: 181 NSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQV 240
           NSTVRI Y+NPATFFGVHVSS+P QL+YFQLQIASGQM EFY+KRQSSR+V T+V+GHQV
Sbjct: 181 NSTVRITYKNPATFFGVHVSSTPFQLHYFQLQIASGQMEEFYQKRQSSRKVTTSVSGHQV 240

Query: 241 PLYGGIAVIGNWREQRQEGVEVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTKNL 300
           PLYGGI+ IGNWR+QRQ+GVEV LNLTVAVRSRAYILG+LVKSTFHT ITC +TL  K L
Sbjct: 241 PLYGGISAIGNWRDQRQDGVEVLLNLTVAVRSRAYILGRLVKSTFHTKITCPVTLSNKKL 300

Query: 301 GKSHSLNNSCIY 310
           GKSHS N +C Y
Sbjct: 301 GKSHSFNKTCTY 312

BLAST of MS022205 vs. TAIR 10
Match: AT2G41990.1 (CONTAINS InterPro DOMAIN/s: Late embryogenesis abundant protein, group 2 (InterPro:IPR004864); BEST Arabidopsis thaliana protein match is: Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (TAIR:AT4G35170.1); Has 172 Blast hits to 168 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 172; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 278.1 bits (710), Expect = 8.6e-75
Identity = 163/313 (52.08%), Positives = 210/313 (67.09%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQS--SPARSPRRPLYYVQSPSNHDVEKMSYGS--SPMGSPPH-HFYH 60
           MHAK+ SE TS++ +  SP RS  RPLYYVQSPSNHDVEKMS+GS  S MGSP H H+YH
Sbjct: 1   MHAKTDSEATSIDAAALSPPRSAIRPLYYVQSPSNHDVEKMSFGSGCSLMGSPTHPHYYH 60

Query: 61  ASPIHHSRESSTSRFSASLKPNPRNLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTR 120
            SPIHHSRESSTSRFS       R L +++ +         +D DD T   DD D    R
Sbjct: 61  CSPIHHSRESSTSRFS------DRALLSYKSIRE--RRRYINDGDDKTDGGDDDDP--FR 120

Query: 121 KFRLYLFLFVFFVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLM 180
             RLY++L +  + LFTVFSLILWGAS+S+ P++ ++ M+    N+QAG+D  GV TD++
Sbjct: 121 NVRLYVWLLLSVIFLFTVFSLILWGASKSYPPKVTVKGMLVRDLNLQAGNDLSGVPTDML 180

Query: 181 SLNSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGH 240
           SLNSTVRI YRNP+TFF VHV++SP+ L+Y  L ++SG+M +F   R     V T V GH
Sbjct: 181 SLNSTVRIYYRNPSTFFAVHVTASPLLLHYSNLLLSSGEMNKFTVGRNGETNVVTVVQGH 240

Query: 241 QVPLYGGIAVIGNWREQRQEGVEVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTK 300
           Q+PLYGG++          + + +PLNLT+ + S+AYILG+LV S F+T I CS TL   
Sbjct: 241 QIPLYGGVSF-------HLDTLSLPLNLTIVLHSKAYILGRLVTSKFYTRIICSFTLDAN 296

Query: 301 NLGKSHSLNNSCI 309
           +L KS SL  SCI
Sbjct: 301 HLPKSISLLRSCI 296

BLAST of MS022205 vs. TAIR 10
Match: AT1G45688.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G42860.1); Has 258 Blast hits to 242 proteins in 39 species: Archae - 0; Bacteria - 11; Metazoa - 10; Fungi - 14; Plants - 198; Viruses - 17; Other Eukaryotes - 8 (source: NCBI BLink). )

HSP 1 Score: 228.4 bits (581), Expect = 7.8e-60
Identity = 148/339 (43.66%), Positives = 205/339 (60.47%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPS--NHDVEK--MSYGS----SPMGSPPHH 60
           MHAK+ SEVTS+  SSPARSPRRP+YYVQSPS  +HD EK   S+ S    SPMGSPPH 
Sbjct: 1   MHAKTDSEVTSLAASSPARSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLSPMGSPPHS 60

Query: 61  FYHASPIHHSRESSTSRFSASLKPNPR----NLAAWRKLHRPLESDADDDDDDATAAADD 120
             H+S   HSRESS+SRFS SLKP  R    N  + RK H   +   +    +     DD
Sbjct: 61  --HSSMGRHSRESSSSRFSGSLKPGSRKVNPNDGSKRKGHGGEKQWKECAVIEEEGLLDD 120

Query: 121 RDSEWTRKFRLYLFLFVF-FVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDA 180
            D +     R Y+  F+  F +LF  FSLIL+GA++   P+I ++S+ FE   +QAG DA
Sbjct: 121 GDRDGGVPRRCYVLAFIVGFFILFGFFSLILYGAAKPMKPKITVKSITFETLKIQAGQDA 180

Query: 181 GGVATDLMSLNSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRR 240
           GGV TD++++N+T+R+ YRN  TFFGVHV+S+PI L++ Q++I SG + +FY+ R+S R 
Sbjct: 181 GGVGTDMITMNATLRMLYRNTGTFFGVHVTSTPIDLSFSQIKIGSGSVKKFYQGRKSERT 240

Query: 241 VATAVAGHQVPLYGGIAVI-------GNWREQRQEG------------VEVPLNLTVAVR 300
           V   V G ++PLYG  + +          + ++++G              VP+ L+  VR
Sbjct: 241 VLVHVIGEKIPLYGSGSTLLPPAPPAPLPKPKKKKGAPVPIPDPPAPPAPVPMTLSFVVR 300

Query: 301 SRAYILGKLVKSTFHTTITCSLTLRTKNLGKSHSLNNSC 308
           SRAY+LGKLV+  F+  I C +    KNL K   +  +C
Sbjct: 301 SRAYVLGKLVQPKFYKKIECDINFEHKNLNKHIVITKNC 337

BLAST of MS022205 vs. TAIR 10
Match: AT4G35170.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 224.9 bits (572), Expect = 8.6e-59
Identity = 137/296 (46.28%), Positives = 188/296 (63.51%), Query Frame = 0

Query: 14  QSSPARSPRRPLYYVQSPSNHDVEKMSYGS--SPMGSPPHHFYHASPIHHSRESSTSRFS 73
           +SSP ++ R+P+Y V SP N DV+K+S GS  SP GSP +     S   H   + +S + 
Sbjct: 7   RSSP-QNTRKPVYVVHSPPNTDVDKISTGSGFSPFGSPLNDQGQVSNFQHHSVAESSSYP 66

Query: 74  ASLKPNPRNLAAWRKLHRPLESDADDDDDDATAAADDRDSEWTRKFRLYLFLFVFFVLLF 133
            S  P  RN  +  ++H       +D+D D     D++    TR +   LF     VL F
Sbjct: 67  RSSGP-LRNEYSSVQVHDLDRRTHEDEDYDEMDGPDEKRRRITRFYSCLLFT---LVLAF 126

Query: 134 TVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGVATDLMSLNSTVRIKYRNPATF 193
           T+F LILWG S+SF P   L+ MV E  NVQ+G+D  GV TD+++LNSTVRI YRNPATF
Sbjct: 127 TLFCLILWGVSKSFAPIATLKEMVLENLNVQSGNDQSGVLTDMLTLNSTVRILYRNPATF 186

Query: 194 FGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVATAVAGHQVPLYGGIAVIGNWRE 253
           F VHV+S+P+QL+Y QL +ASGQM EF ++R+S R + T V G Q+PLYGG+  +   R 
Sbjct: 187 FTVHVTSAPLQLSYSQLILASGQMGEFSQRRKSERIIETKVFGDQIPLYGGVPALFGQRA 246

Query: 254 QRQEGVEVPLNLTVAVRSRAYILGKLVKSTFHTTITCSLTLRTKNLGKSHSLNNSC 308
           +  + V +PLNLT  +R+RAY+LG+LVK+TFH+ I CS+T     LGK+  L+ SC
Sbjct: 247 EPDQ-VVLPLNLTFTLRARAYVLGRLVKTTFHSNIKCSITFYGDKLGKTLDLSKSC 296

BLAST of MS022205 vs. TAIR 10
Match: AT5G42860.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 19 plant structures; EXPRESSED DURING: 11 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G45688.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 204.1 bits (518), Expect = 1.6e-52
Identity = 136/335 (40.60%), Positives = 189/335 (56.42%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPS--NHDVEKMSYG-------SSPMGSPPH 60
           MHAK+ SEVTS+  SSP RSPRRP Y+VQSPS  +HD EK +         +SPMGSPPH
Sbjct: 1   MHAKTDSEVTSLSASSPTRSPRRPAYFVQSPSRDSHDGEKTATSFHSTPVLTSPMGSPPH 60

Query: 61  HFYHASPIHHSRESSTSRFSASLKPNPRNLAAWRKLHRPLESDADDDDDDATAAADDRDS 120
                        SS+SRFS  +  + R   A  K    +E +   DD D    A  R  
Sbjct: 61  -----------SHSSSSRFS-KINGSKRKGHAGEKQFAMIEEEGLLDDGDREQEALPR-- 120

Query: 121 EWTRKFRLYLFLFVF-FVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDAGGV 180
                 R Y+  F+  F LLF  FSLIL+ A++   P+I ++S+ FE+  VQAG DAGG+
Sbjct: 121 ------RCYVLAFIVGFSLLFAFFSLILYAAAKPQKPKISVKSITFEQLKVQAGQDAGGI 180

Query: 181 ATDLMSLNSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQMMEFYEKRQSSRRVAT 240
            TD++++N+T+R+ YRN  TFFGVHV+SSPI L++ Q+ I SG + +FY+ R+S R V  
Sbjct: 181 GTDMITMNATLRMLYRNTGTFFGVHVTSSPIDLSFSQITIGSGSIKKFYQSRKSQRTVVV 240

Query: 241 AVAGHQVPLYGGIAVI-------GNWREQRQEG-----------VEVPLNLTVAVRSRAY 300
            V G ++PLYG  + +          + ++++G             VP+ L   VRSRAY
Sbjct: 241 NVLGDKIPLYGSGSTLVPPPPPAPIPKPKKKKGPIVIVEPPAPPAPVPMRLNFTVRSRAY 300

Query: 301 ILGKLVKSTFHTTITCSLTLRTKNLGKSHSLNNSC 308
           +LGKLV+  F+  I C +    K L K   + N+C
Sbjct: 301 VLGKLVQPKFYKRIVCLINFEHKKLSKHIPITNNC 315

BLAST of MS022205 vs. TAIR 10
Match: AT1G45688.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G42860.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 176.4 bits (446), Expect = 3.5e-44
Identity = 113/228 (49.56%), Positives = 150/228 (65.79%), Query Frame = 0

Query: 1   MHAKSYSEVTSVEQSSPARSPRRPLYYVQSPS--NHDVEK--MSYGS----SPMGSPPHH 60
           MHAK+ SEVTS+  SSPARSPRRP+YYVQSPS  +HD EK   S+ S    SPMGSPPH 
Sbjct: 1   MHAKTDSEVTSLAASSPARSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLSPMGSPPHS 60

Query: 61  FYHASPIHHSRESSTSRFSASLKPNPR----NLAAWRKLHRPLESDADDDDDDATAAADD 120
             H+S   HSRESS+SRFS SLKP  R    N  + RK H   +   +    +     DD
Sbjct: 61  --HSSMGRHSRESSSSRFSGSLKPGSRKVNPNDGSKRKGHGGEKQWKECAVIEEEGLLDD 120

Query: 121 RDSEWTRKFRLYLFLFVF-FVLLFTVFSLILWGASRSFHPQIILQSMVFERFNVQAGSDA 180
            D +     R Y+  F+  F +LF  FSLIL+GA++   P+I ++S+ FE   +QAG DA
Sbjct: 121 GDRDGGVPRRCYVLAFIVGFFILFGFFSLILYGAAKPMKPKITVKSITFETLKIQAGQDA 180

Query: 181 GGVATDLMSLNSTVRIKYRNPATFFGVHVSSSPIQLNYFQLQIASGQM 216
           GGV TD++++N+T+R+ YRN  TFFGVHV+S+PI L++ Q++I SG +
Sbjct: 181 GGVGTDMITMNATLRMLYRNTGTFFGVHVTSTPIDLSFSQIKIGSGSV 226

The following BLAST results are available for this feature:

NCBI nr
Match Name	E-value	Identity (%)	Description
XP_022135688.1	1.8e-167	99.03	uncharacterized protein LOC111007587 isoform X1 [Momordica charantia] >XP_022135...
XP_038888376.1	1.7e-141	83.65	uncharacterized protein LOC120078225 [Benincasa hispida]
XP_008447896.1	4.2e-140	82.17	PREDICTED: uncharacterized protein LOC103490245 [Cucumis melo]
XP_004144875.1	7.2e-140	81.53	uncharacterized protein LOC101215215 [Cucumis sativus] >KGN43297.1 hypothetical ...
XP_022928427.1	5.1e-138	82.05	uncharacterized protein LOC111435243 [Cucurbita moschata]

ExPASy TrEMBL
Match Name	E-value	Identity (%)	Description
A0A6J1C5K4	8.8e-168	99.03	uncharacterized protein LOC111007587 isoform X1 OS=Momordica charantia OX=3673 G...
A0A1S3BJ42	2.0e-140	82.17	uncharacterized protein LOC103490245 OS=Cucumis melo OX=3656 GN=LOC103490245 PE=...
A0A0A0K4T2	3.5e-140	81.53	LEA_2 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G018790 PE=4 ...
A0A6J1EJW4	2.5e-138	82.05	uncharacterized protein LOC111435243 OS=Cucurbita moschata OX=3662 GN=LOC1114352...
A0A6J1JK28	2.1e-137	81.41	uncharacterized protein LOC111486495 OS=Cucurbita maxima OX=3661 GN=LOC111486495...

TAIR 10
Match Name	E-value	Identity (%)	Description
AT2G41990.1	8.6e-75	52.08	CONTAINS InterPro DOMAIN/s: Late embryogenesis abundant protein, group 2 (InterP...
AT1G45688.1	7.8e-60	43.66	unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G35170.1	8.6e-59	46.28	Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
AT5G42860.1	1.6e-52	40.60	unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G45688.2	3.5e-44	49.56	unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR004864	Late embryogenesis abundant protein, LEA_2 subgroup	PFAM	PF03168	LEA_2	coord: 183..289; e-value: 3.5E-9; score: 37.1
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1..57
None	No IPR available	PANTHER	PTHR31852	LATE EMBRYOGENESIS ABUNDANT (LEA) HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY	coord: 52..309
None	No IPR available	PANTHER	PTHR31852:SF175	LATE EMBRYOGENESIS ABUNDANT PROTEIN	coord: 52..309

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
MS022205.1	MS022205.1	mRNA