MC10g1033 (gene) Bitter gourd (Dali-11) v1

Overview
NameMC10g1033
Typegene
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionDUF4477 domain-containing protein
LocationMC10: 10044239 .. 10054374 (+)
RNA-Seq ExpressionMC10g1033
SyntenyMC10g1033
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: utr5CDSpolypeptideutr3
Hold the cursor over a type above to highlight its positions in the sequence below.
AACGGACCTTCCTTCAACCTAAAACAAAATCTCTGCAGTTCCCAGCTTCAGGCGACGGCGCACTCCGGCGAGAGATCGGTCCTTCACGGCGAACTCCACCTCAGGCGACGGCGAACTCAGGTGCGGCTCCCCAGTTTTCATATACTGCCTCCTCCGCCTCGCCCTCTCCGGCTTGGGTGGCTGCGGTCGCGGCAGTTGCAGTCTCTGTAGTCGCCGTTTCCCACTTGAAACTTGTGCTTCTCTGATTTGTTCTTTGATTTTGGCATTGAAATTTGCAAGAAAGAATAGAATGTTAAGTAGTTATTGGCTTGTAATTTTTTTTTTCCTTCTACTCTTTTTGATTTATTTAAAAGCATTTGGAACATATCGGTATTATTCAATAACTCGAAACTTGTGGTTTGCTGCTTGATTCTGGCATTGCAATATATTGCAAGGAAAAAAAAGAAAAGAATGTTAAGTAGTATTGGCCTTTATTTTTTGGTGTTCCTTTTGACTCTCTTGATTTATTTCAAGGCATTTAGAACATATCGGTGTTATACAACGGCTTGAAACTTGCTTTCATGATTTGTTGCGTGATTATGGCATTGCAATATATTGTAATATATTGCGAGAAAGAAAACAGAATGTTAAGTAGTTATTGGTCTTTATATATATTTTTTTAAAAACTAAGGATAGAGACACCAGAGAGACATACACGAGAAAAGAGAGAAGAAATAAATTCTCAAATTGATTGATCAAAGATGAGGAATAGAGGCACTATTTATAGGTTATAAGAGGAAACAAAATCTAGGAATAACCTCCTAATTCCCTAACAAAAAAGGAGATTTATTTGCTAAATTTAAACACAAAATTTAAACAACTTCTAACTAAGTCCACATAATCTTAACTTCTAACTAATTCAACACTCCCCCTCAAGCTGGACTATACAGATTGAACAGTTTCAGCTTGATCAACAACCTTTCAAATACGGGTCTCAAGAGTCCTTTTGTTAGGATATCTGCAACTTGATCTGATGATGGAACATACATCAGATTTATAATTTTCTCTTCAATCTTCTCCTTGATAAAATGTCTGTCAACCTGAACATGTTTAGTACGATCGTGATGCACTGGATTGTGTGCAATAGCAATTGCCGCCTTGTTGTCACAATAGACTTCCATTGAACTATCAAACCTGATCTTGAGTTCTTGGAGGAGGAGTTTCAGCCACATTATCTCACAAATACCATGAGATAATGCCCTGAATTCAGCCTCAGCTCTACTTCTTGCAACAACTGATTGTTTCTTACTCTTCCATGTAACAAGATTACCCCATACGAAGGTACAATATCCTGTCGTAGATCTCCTATCTTGAATAGAACCTGCCCAATCAGCATCTGTAAACACCTTAATCTTTCTGTCATCATCCTTTCCGAAAAATAATCCAAGCCCTGGAGTTCCTTTAAGATATCTAAGAATATGATACACGGCTTCCATGTGTTCTTCAGTAGGAGCATGCATGTATTGACTTACAACACTCACTGCAAATGCAATATCCGGCCTAGTGTGGGATAAATAGATTAATCTTCCAACAAGACGCTGATATCTCCCTTTATCAATGGGTACCCCATCTAGGCTCTCTCCCAACTTATGATTGGGACTTAATTGGAGTATCAGATGGTTTACATCCAAGCATTCCGGTTTCCTTCAAGAGGTCTAGTACATATTTTCTTTCACACACAGATATACCAGTTCTGCTCCTAGCAACTTATCCCTAGAAAATACCGAAGTGTTCCCAGGTCCTTTATTTCGAACTCCTTTCCAAGTGCCTGTTTTAGTCTCTCTATTTCTTCATGATGGTCACCTGTGACAATAATATCATCGACATAAACAATGAGGATGGTGATCTTACCTTGAGAGTACCTATAGAACATTGTATGGTCAGTTTGAGCTTGGGTATATCCTTGTTTACAAACAGCCTTTGAAAATCGCCCGAACCATGCATGTGGGGATTGTTATAAACCATAGAGAGATTTCTTCAGCTTACACACCTTTCCATCATACTTATCAATGTCAAATCCAGGAGGAAGGTCCATGTACACCTCTTCTTCCAAATCTCCATTGAGGAATGCATTTTTGACATCCAGTTGTTGTAGTTTCCAATCCAGATTTGCAGCAAGAGACAAGAGGACACGGATCGTGTTTAATTTTACAACGGGTGCAAATGTCTCCTGATAGTCTACCCCAAAAGTCTGAGAGTATCCTTTGGCCACCAATCTTGCTTTATACCTTTCAATGGACCCATCAGCCTTATATTTGACAGTAAAGATCCATCGGCATCCTACCGTTGGTTTATCTTGAGGACGATCAACTACTGTCCACGTATCATTAAGCTTCAAAGCCTTCATTTCATCATCTACCGCATTTTTCCACTGTGGATTGCTCATAGCCTCTTGAATGGATTTTGGTACCTCATTGGCTTCCAGATTGAGAGCCAAGGTCTGAACTGCTGAAGATAAATGACTATAACCAATACATCGAGCAATTGGATGTTGAGTACACGATCTCACACCCTTCCTAAGTGCAATAGGAAGATCAAGATCATTAGTATTGTCTTGAGGCACAAGGGTAGATTGTTCAGTACCTAAGGACGAGATGGATTCATGACTTTGCTGTTGAGGCTGTGGTGGCTCCACTCCTCTTTGAACCGTTTGTCGCCGAGAATAAACAAGAGTCTCAGGTTGCTTGTCACTTCGATTTGGGTTGACTTGATCATCTTCATTATTACAGGAAATAATAGGAACTGACTCTTCTATCTGAGTAATTGGGACGATCGGGATAGGTATTTTTGGCTGAGGTGACTCGGTATTTAGACCATCAATCTCAGGTATTTTTGACTGAGGGGACTCAATTCTGGTATTTAGATCGTCAATCTCCCAAAAATTTCCATCAAGACTACTTGTAATAGGCTCAGTCTCCCCCTGAGGGGAATTTAGGGAGAAAAAAGATTGAGACTCAAAAGATGTGACATCCATGGTAGGGTAAAATTTATTGGTGAATGGACAATAGCATTTATAGCCTTTTTGAGTAGGAGAGTAACCTAGGAAGACACACTTTCTGGCTTTTGGCTCCAATTTGTGACGTTCATGATCATAAACATGAGTGAATGCAGTACATCCAAATATTTTGAGGGGAATTGTGAGGACAGATCGGATTTGCGGATAGATAGTGAGTAAGATTTCTAGTGGAGTGTTAAATTTCAGAACACGAGAGGGCATCCGATTAATGAGATAACAAGCAGTAAGGACAGCATCACCCCAAAAACAGTTAGGAACAAAATTGGTGAACATGAGAGCACGAGCAGTTTCTAAAAGATGCCTATTTTTCCTTTCAGCTATACCATTTTGTTGGGGGGTAATGACACAGGAACTTTGGTGAATGATTCCATTTTGAGACAGGAAAGTTCCCAATATTTGATTAAAATATTCTGTCCCCTTATCTGTGCGAAGAATTTTGATTCTCGTATTAAATTGATTATGAATGAGAGAATGAAACTGTTGAAAGATTGATGCAGTTTCTGATTTTTCTTTAAGTAAATAGACCCAACAAACCCTAGTGTGGTCATCAGTGAAGGTAAGGAACCAACGTTTATCGGACATAGTTTTGATGCGAGACGGCCCCCATAAATCACTATGAATCATTGTGAATGGACTGGATGGTATATACTCAACAGAAGGATAAGATGTACGAGAGTGTTTTGCAAGTTGCACAATTCGCAAGTAAAAAGGTCATCTTTATGAAAAAGTGAAGGGTACAAATGTTTAAGATACTGAAAACTCGGGTGCCCTAAACAGCGATGTAAAAGTAGAATTTGTTGTTCCCTAGAATTCGACTCACTACCAACCTGGAGAACTTGTTTATTTCTTAGACTTGGTCCTCGGAAGTAGTAAAGCCCTTCAAAGCCATCAGCATTGCCAATCGTCGTTCCCGTTATCAAGTCCTGAAACAAACACTTAGAGTCAATGAACAGGGCTTGACACTTTAAATCATGAGTCAACTTCTGAACAGAAATTAGATTGCAACATAATTTAGGCACATGGAGCACTGAGTGCAATGTAATGTTTGGGCTAAGAATAACAGAACCAAAGCCCTTAATAATGGCCGATAACCCATCTGCAAGCTTGACATGTGTCTGAATCGGGTTGGGTGAGTACATGGTAAACATATCATGAAAAGCGGTCATATGATCAGTTGCACCCGAATCTAAGATCCACTGATCGGAATGCTGCTGACTTGTTAAAGCTGCACTAAAAATACCTCGTTGTGCCACAAAACTTGACGAAGGAGTAGACTCAACCGGCGGTGTTAAGAGGCGATAGAGCTGTTCAAGTTGTGCCTTCGAAAATGGAGGGAGAGAGGTGGCCAAATCTTGGGAGTTTGACACACTAGGGCCCACTTGATAGCCGGAGGAGCTGGTTCGAGAAGATGGAGTATTTGTAGGTGGAGGTGGCCGATAATCCCCTGATAAATTCTTCCGTTGAGGACGACCATGGAGTTCCCAACACCGATCTTTTGTATGGTTTGTGCGCTTACAATGATCACACCATAGGTTGTTCCGACGAGTAGATCGGGATGAAGGTGGTGGTGGACCTCGTGCCGCCAGAGCTGATGATTCCACTGAGAGGGACAGAGGTTTTGTGTGTGTATCACCCATCATCACACGTTTACGGCTTGACTCCCAACGAACTTCTGCGAAGATTTCGTCAATGGCTGGGATTGGCTTTGTGGCAAGTAAGCGGCCACGCACATCATCTAATTCCTGACGAAGACCTGCAAGAAAATCATAAATTCGTTCCTTCTCGACGTGTTTGCGAAAGCGTTCAGCATCCTTCAAGTTCTCCCATTCGAGATTGAGACATAAATCAAGTTCAGCCCACAACCTATGTAATGAGCTGTAGTATTGGGTGACATCGGATTCACCTTGTCGTAAGGAACGTGCCTTATTGCGTAATTCAAACAATTGAGCCGAGTTATCAAAATCAGAAAAAGCCATAGTGAGCGCATTCCAAAGATCATTTGCTGTTGAGTAGAAGATGAAGGATTCTTTAATGTCCTCCTCCATCGAGTTAATGAGCCATGCCATGACCATTGAGTTTTGTGCATCCCACACGGAAAAAGAAGGGTCAGCTTCATCTGATTCTGCTATCATGCCATTGATGTACCCGAGTCGACTGCGGCCACGAATAACTAGGAGGGCCGATCGAGACCATTGCAGAAAATTTTTTCCGTTGAGTTTTGGAGAGGTAATTTGGAGCGAAGTGTTTTCAGAGATAGAGATGTGAGGTGTCGTGGATGTGCTTTTAGGGAGAGAAATTGAGGTCGTATTCGATCCGTCGGAACTCTCATCCTTGCGCACATTAGTCATCATGGCTGTTTTAACCATTCTGGTGTCAATCTAAACCTAGCTCTGATACCATAAAAACTAAGGATAGAGACACCAGAGAGACATACACGAGAAAAGAGAGAAGAAATAAATTCTCAAATTGATTGATCAAAGATGAGGAATAGAGACACTATTTATAGGTTACAAGAGGAAACAAAATCTAGGAATAACCTCCTAACTTCCTTAACAAAAAAGGAGATTTATTTGCTAAATTTAAACACAAAATCTAAACAACTTTAAAACAACTTCTAACTAAGTCCACATAATCTTAACTTCTAACTAATTCAACAATTTTTTGTTTCCCTTTGACCATTTTTATTTATTTCAAGGCATTTGGAATGTATTGGTGCTATTCAATAGCTTGAAACTTCTGCTTCCATGATTTTTTGCTTGATTATTGCATTGTAATACATTGGAATCTATTGCAAGAAAAAAAACTAAATGGCTTAAGTAGTTATTTCAAGGATTTGGGACATATTGGTGTTATTCAATGACTTGAAGCTTGTGCTTCCATGATTTTTTGTTTGATTCTAGCATTGCACTACACTGCAATATATTGTAAGAAAAATGGAAAAGAATGTTGAGTAGTCATTGGCCTTGTATATTTTTGGTTTTCTTTTGACTCTTTTGATTTATTTCAAGGTATTTGGCATGTATCGGTGTTGTTCAGTGGCTTGTAACTTGTGCTTCCATGATTTGTTGCTTGATTTTGGCATTACAATACATGACAATATATTGCAAGAAAACATAATGTTGAGTAGTCATTGGCCTTGTCTTTCTTTTTCTTTTTCTTTTTCTTTGACTCTTTTGATTTATTTCAAGGTATTTGGAACATATCGGTATGGCTTGAAACTCAATTCAAATTTGGCACTAATCACGCTTTCATATTCAAAGTTTAATTTGAATCACATTGATTTATAGTCAAATCAGTATCATTTCACAGCATGTGAAAATTGTTTGTGTTTCAATCCAATGCCAACAATGTGACAAAAAATTTGCAAGAGGTGAGATGTGCGTTTTTACCGGGATGTGTAAATCATGTACAACCAACTTTAACATTACAACCCTGCAGCTAGCATTAATTTGAATTCGTGTTTGTATTTCTTAGGTTTTGGTTTTCTAATAACTAATTTTTCTGGAGATTCCATGGCTGCTTCTGATGCTGAGAACCTTGAGGAGAAGTTGGCATCTTTGCTTGGCCAGCTCCACTTAGAAAGTGGCATTCTACACAAAATGATTTATAAGAATAAGAACCAGCACCGCCGGAGTTCCTATTTTCGTTATCTTTTGCAGGTTTTCATTTTTCCCCCTTCTAATGGAGCTTTCTGTTATGGTTGAGAGTAGAAAACTTAACTGAAAGAAGAGAAATACATGGGTTTTTTCTAAAAGAGTAGTCAAATTTTGCTTTACTGTATATGTGTTTTTTTGGGTAGATTATAAATATATATGTTAGTTTATGTAGATACATGTTCATGTTGATTACACCAGTTGGAAGAAAAGTGGACACTAAGATTCAGAAACAAGATGTCATCTGTAGTTTAGTTTGTGTGATAGCTCATTAATTGTTGCATGAAGAATTTGAGGCTTTTGCCATTGACTGTAATTTTGTCTTGCTTCTATACATTTTTAGGTAAGGAGAGATTTAAGACTTCTACAGGCTACCAAGTTGGAGGAGTTGGTAAGTTCCTGCTTTCAAGTTATCGATGGAAAGAAACCTAAGCAAAAGATTCATCTTTTAGAAAGGTATTAGATTTTGAATTCCTTCTCCCTGTGGATTTTCTGTCCAAGGAATATAAAAAGAGAATGTCTAGACTAAGTGAGATATATTGTTCCTTTGATCTTCCGCTTGGACTTTCCATTTTATTACAGTTTGAAACGGAGGAAATGTGAAGTTGGGAAATATAATTTCATGGAACGGCTTCTGGGGGCTGCACGGTTACTGTCAGAGGTGATCTTTCCCTTGAGTTTTTCTTCTTTGTATGACTAGTTAGGGGCATTTATATAAAACTTCTATTCCCCTTCTATTTGTTTATGTTTCAATTGAAAAAAAAAAGGATATAGATTCATTTAACTAAAACTATTTTCCTCAATGTAAAACGTTCTATTTTCACTCATAAAAACTGGTTTTCTAGTAGTTGGGCTTTTTTTCTATAAATAATGTCCATCAACTATCTTCATAAGAGGTCAATGAAGAGTTATTCTTGTTAATAGGACCATAGTGGGGCCTGATCCTGAAGTTACGGGAAAATTACTCCAAAGAGGACAAATCATTTTCACTTTATTTCATGGATTGCACCTCGTCCCAAAGTAGTGATTGGATGTTTTTCCAAATCTCTTGCAGATGGTAGAGCCAATTTTCAAGGCAGCTACGTATCCTTTTTTATCTATTGGTACAATGCACTATGGCTCTGTTGGTTTTGCTTGCTGCTTTCTATTTGCTCAAATTCACTATAGTTTCCATGTGAATCTTTAAAATGGTGTTTCGATGGAGTTCTTCAATAGCTATTCAATTAGCAAGCCCTGCATTACACAGTTTTATGTATGAAAGCTAGCTTATTTTACTTGGTTAACTTATCCATGAAAAGCTCGTTGACAAGAAAATACAATATGTAACTTCATGTTTTACTTGATAGAAAATGTGAAGTTGTTAACTTGTATTTCTGCAGCAAACCATCTCAGCATCTTTCAATTTTTATAGACATTTGCGTGTGAAAAGGTTTTTTATCATACCGATATATTTCATTGTTTTGGATTCTTGCACCAGCCTACTTTTCATTTTTACATCTTTGTTGCATTTTGCACGTGACGATGGATCCCCACTAGATGGTAGTTATGTGAAATGAGTGCCTTGTTTTCAGTTTCTGCCTCTTAATATTTTCATCGTTGCTTTTAATTTGGTGCATTTAAAATCTTCCGCCTGATGCACTAGAGATTATTGTATATAATACCAATGTTTTGTATCATGCACAATTGTTTAGTACTTCTCTGCATAAGTGACAGTTATTTTGACATGATTTCTTCTTAGCTAGAGCTATTTGTTGTTGGATCATGGAAACTAATTTGCAGAAATGCATTATGCAAAATGTCACCCTTGACTATTTAACAGCGAGATATCTATATTGCTTGCTCGAATGTTTTTCACAGGCTTTTGTTTTATGATTTTGGCATTACTGGCGCGGATTCGGGTGTTAGTTCAACAAGTAAGTTCCATTTTTATTATTTAATATTTAAGAGAAGCAGAAGGCATTTTGTATTCGAGTATATCGGTTGACTGATCACTGATTTCATTTTCCTGAAAAGCGATGGCTGAATCTCTCATTCAAGGGATGGGGAAAGTTTTTGAATCCTAGTCTTTAGGAAATGAAATATTTCTTGGTTAAGTAATAGGTGGGGGGACCTGCTGACAATGGTCAAGGTTAGAATTTTCTTAAGAACTGTATGCTTGCAAATATTTTTAGGATTTTATTTGCATGACTAGTTGACCATACATTGTGATTATTCTGCAACGGACATTGTAACAAAAATTACTATTGTCAGACTTTGAGTTCCTCTATGCTTATGAGAATGAGGTTTGCATTTCCATTGAGCATTTGTTTTTCCTTTTTCTTGTTGTATAGATATTACTCAATGTCGTTTCGGTATTCAACATGGTTTCATCTATCTCGCAAAAGAAACATATAGTTAGAATAAACCAGGAAGGAATTGAGGTATACGTGCTCCACTCTTCCCTTCCTTTCTAATTTTCTCTTCAAAAAGTAGAAGTTTTAAAATTATTTTCAAATCTATTTACAGGTCTTTCGAGAATTCTTCCCAACAAATGATGAATTTGTCTTATTAGAATGTGTTTGGAAAGAAGACAAATTTGTGTTGCAAGAGACGAAACAAAAAATTGAGACTAGGAATTGGGAAGAACATCCTGGACCAAGTGTTTCTTCGGCCACCTCAGCTGTACGATATCAGAGCATTGAATCTTTTCTTGAAGGTAAGATTTCAATTCTTTGTATGCAGTCTTGATCTTAGTTTCGTCTCATCTCAAAAAGAAAATAAACCCAAGTTTGCTTCCTTACTGTATTGCTTCATTTTGGGTGTAGATGATGAATCTGATATCAAGCAGGCAGATGCAAATCAAAGCATTGAAGGTGTTGACCTTATGAAAATGAGTAAGAATGACTTGCTAGCAAGCCTCTCCAAGGAAGATAATACCACAACAAGAGACGGTTCGGTCTGTCCTGCGGAGACCTCGAGCAAGACATTGTTGCCCCAAGAAGGTAGTTTGCTCATGAACTCAAGCCCCTCCTCAGTTGGTGCGAAAAAATCGGACTCAAAGAGACCAGCATTTGTTTCAGTGAAAAATCCGAAACCGATTGCGTCTAGTGCAGTTGGAATTCAATTCAATGAAACAAAAGTTGATAGTGAGTTAGAGGAGGATCAATTTTTCACTTTGCTCACAGGTGGGGGAGCCAAAAGTAGTCTGTTCTAGCAGAGTATTTTAGCAAGCAGTTAAATTTTTTTGCAAAAGCATATATTTTATTTTCAATGTCAAATTGTCCAATAGTATTTATTCATAAATTTTAACAAGCTGTGCAGTAAAATTTTTGTAGGGTTATATTGAGCTTGAAAACAAGTTGAAACCTTGAACCATTAAGTTAGCTACAATTTTGTATAGATGATTTAGAGCCTACTGGATAACATTCTCAATTTTTGTTTAATTTATA

mRNA sequence

AACGGACCTTCCTTCAACCTAAAACAAAATCTCTGCAGTTCCCAGCTTCAGGCGACGGCGCACTCCGGCGAGAGATCGGTCCTTCACGGCGAACTCCACCTCAGGCGACGGCGAACTCAGGTTTTGGTTTTCTAATAACTAATTTTTCTGGAGATTCCATGGCTGCTTCTGATGCTGAGAACCTTGAGGAGAAGTTGGCATCTTTGCTTGGCCAGCTCCACTTAGAAAGTGGCATTCTACACAAAATGATTTATAAGAATAAGAACCAGCACCGCCGGAGTTCCTATTTTCGTTATCTTTTGCAGGTAAGGAGAGATTTAAGACTTCTACAGGCTACCAAGTTGGAGGAGTTGGTAAGTTCCTGCTTTCAAGTTATCGATGGAAAGAAACCTAAGCAAAAGATTCATCTTTTAGAAAGTTTGAAACGGAGGAAATGTGAAGTTGGGAAATATAATTTCATGGAACGGCTTCTGGGGGCTGCACGGTTACTGTCAGAGGTGATCTTTCCCTTGAGTTTTTCTTCTTTCGAGATATCTATATTGCTTGCTCGAATGTTTTTCACAGGCTTTTGTTTTATGATTTTGGCATTACTGGCGCGGATTCGGGTGTTAGTTCAACAAATATTACTCAATGTCGTTTCGGTATTCAACATGGTTTCATCTATCTCGCAAAAGAAACATATAGTTAGAATAAACCAGGAAGGAATTGAGGTCTTTCGAGAATTCTTCCCAACAAATGATGAATTTGTCTTATTAGAATGTGTTTGGAAAGAAGACAAATTTGTGTTGCAAGAGACGAAACAAAAAATTGAGACTAGGAATTGGGAAGAACATCCTGGACCAAGTGTTTCTTCGGCCACCTCAGCTGTACGATATCAGAGCATTGAATCTTTTCTTGAAGATGATGAATCTGATATCAAGCAGGCAGATGCAAATCAAAGCATTGAAGGTGTTGACCTTATGAAAATGAGTAAGAATGACTTGCTAGCAAGCCTCTCCAAGGAAGATAATACCACAACAAGAGACGGTTCGGTCTGTCCTGCGGAGACCTCGAGCAAGACATTGTTGCCCCAAGAAGGTAGTTTGCTCATGAACTCAAGCCCCTCCTCAGTTGGTGCGAAAAAATCGGACTCAAAGAGACCAGCATTTGTTTCAGTGAAAAATCCGAAACCGATTGCGTCTAGTGCAGTTGGAATTCAATTCAATGAAACAAAAGTTGATAGTGAGTTAGAGGAGGATCAATTTTTCACTTTGCTCACAGGTGGGGGAGCCAAAAGTAGTCTGTTCTAGCAGAGTATTTTAGCAAGCAGTTAAATTTTTTTGCAAAAGCATATATTTTATTTTCAATGTCAAATTGTCCAATAGTATTTATTCATAAATTTTAACAAGCTGTGCAGTAAAATTTTTGTAGGGTTATATTGAGCTTGAAAACAAGTTGAAACCTTGAACCATTAAGTTAGCTACAATTTTGTATAGATGATTTAGAGCCTACTGGATAACATTCTCAATTTTTGTTTAATTTATA

Coding sequence (CDS)

CGGACCTTCCTTCAACCTAAAACAAAATCTCTGCAGTTCCCAGCTTCAGGCGACGGCGCACTCCGGCGAGAGATCGGTCCTTCACGGCGAACTCCACCTCAGGCGACGGCGAACTCAGGTTTTGGTTTTCTAATAACTAATTTTTCTGGAGATTCCATGGCTGCTTCTGATGCTGAGAACCTTGAGGAGAAGTTGGCATCTTTGCTTGGCCAGCTCCACTTAGAAAGTGGCATTCTACACAAAATGATTTATAAGAATAAGAACCAGCACCGCCGGAGTTCCTATTTTCGTTATCTTTTGCAGGTAAGGAGAGATTTAAGACTTCTACAGGCTACCAAGTTGGAGGAGTTGGTAAGTTCCTGCTTTCAAGTTATCGATGGAAAGAAACCTAAGCAAAAGATTCATCTTTTAGAAAGTTTGAAACGGAGGAAATGTGAAGTTGGGAAATATAATTTCATGGAACGGCTTCTGGGGGCTGCACGGTTACTGTCAGAGGTGATCTTTCCCTTGAGTTTTTCTTCTTTCGAGATATCTATATTGCTTGCTCGAATGTTTTTCACAGGCTTTTGTTTTATGATTTTGGCATTACTGGCGCGGATTCGGGTGTTAGTTCAACAAATATTACTCAATGTCGTTTCGGTATTCAACATGGTTTCATCTATCTCGCAAAAGAAACATATAGTTAGAATAAACCAGGAAGGAATTGAGGTCTTTCGAGAATTCTTCCCAACAAATGATGAATTTGTCTTATTAGAATGTGTTTGGAAAGAAGACAAATTTGTGTTGCAAGAGACGAAACAAAAAATTGAGACTAGGAATTGGGAAGAACATCCTGGACCAAGTGTTTCTTCGGCCACCTCAGCTGTACGATATCAGAGCATTGAATCTTTTCTTGAAGATGATGAATCTGATATCAAGCAGGCAGATGCAAATCAAAGCATTGAAGGTGTTGACCTTATGAAAATGAGTAAGAATGACTTGCTAGCAAGCCTCTCCAAGGAAGATAATACCACAACAAGAGACGGTTCGGTCTGTCCTGCGGAGACCTCGAGCAAGACATTGTTGCCCCAAGAAGGTAGTTTGCTCATGAACTCAAGCCCCTCCTCAGTTGGTGCGAAAAAATCGGACTCAAAGAGACCAGCATTTGTTTCAGTGAAAAATCCGAAACCGATTGCGTCTAGTGCAGTTGGAATTCAATTCAATGAAACAAAAGTTGATAGTGAGTTAGAGGAGGATCAATTTTTCACTTTGCTCACAGGTGGGGGAGCCAAAAGTAGTCTGTTCTAG

Protein sequence

RTFLQPKTKSLQFPASGDGALRREIGPSRRTPPQATANSGFGFLITNFSGDSMAASDAENLEEKLASLLGQLHLESGILHKMIYKNKNQHRRSSYFRYLLQVRRDLRLLQATKLEELVSSCFQVIDGKKPKQKIHLLESLKRRKCEVGKYNFMERLLGAARLLSEVIFPLSFSSFEISILLARMFFTGFCFMILALLARIRVLVQQILLNVVSVFNMVSSISQKKHIVRINQEGIEVFREFFPTNDEFVLLECVWKEDKFVLQETKQKIETRNWEEHPGPSVSSATSAVRYQSIESFLEDDESDIKQADANQSIEGVDLMKMSKNDLLASLSKEDNTTTRDGSVCPAETSSKTLLPQEGSLLMNSSPSSVGAKKSDSKRPAFVSVKNPKPIASSAVGIQFNETKVDSELEEDQFFTLLTGGGAKSSLF
Homology
BLAST of MC10g1033 vs. NCBI nr
Match: XP_022151030.1 (uncharacterized protein LOC111019049 [Momordica charantia] >XP_022151031.1 uncharacterized protein LOC111019049 [Momordica charantia])

HSP 1 Score: 683 bits (1762), Expect = 1.06e-245
Identity = 367/376 (97.61%), Postives = 372/376 (98.94%), Query Frame = 0

Query: 53  MAASDAENLEEKLASLLGQLHLESGILHKMIYKNKNQHRRSSYFRYLLQVRRDLRLLQAT 112
           MAASDAENLEEKLASLLGQLHLESGILHKMIYKNKNQHRRSSYFRYLLQVRRDLRLLQAT
Sbjct: 1   MAASDAENLEEKLASLLGQLHLESGILHKMIYKNKNQHRRSSYFRYLLQVRRDLRLLQAT 60

Query: 113 KLEELVSSCFQVIDGKKPKQKIHLLESLKRRKCEVGKYNFMERLLGAARLLSEVIFPLSF 172
           KLEELVSSCFQVIDGKKPKQKIHLLESLKRRKCEVGKYNFMERLLGAARLLSE++ P+  
Sbjct: 61  KLEELVSSCFQVIDGKKPKQKIHLLESLKRRKCEVGKYNFMERLLGAARLLSEMVEPIFK 120

Query: 173 SSFEISILLARMFFTGFCFMILALLARIRVLVQQILLNVVSVFNMVSSISQKKHIVRINQ 232
           ++ EISILLARMFFTGFCFMILALLARIRVLVQQILLNVVSVFNMVSSISQKKHIVRINQ
Sbjct: 121 AATEISILLARMFFTGFCFMILALLARIRVLVQQILLNVVSVFNMVSSISQKKHIVRINQ 180

Query: 233 EGIEVFREFFPTNDEFVLLECVWKEDKFVLQETKQKIETRNWEEHPGPSVSSATSAVRYQ 292
           EGIEVFREFFPTNDEFVLLECVWKEDKFVLQETKQKIETRNWEEHPGPSVSSATSAVRYQ
Sbjct: 181 EGIEVFREFFPTNDEFVLLECVWKEDKFVLQETKQKIETRNWEEHPGPSVSSATSAVRYQ 240

Query: 293 SIESFLEDDESDIKQADANQSIEGVDLMKMSKNDLLASLSKEDNTTTRDGSVCPAETSSK 352
           SIESFLEDDESDIKQADANQSIEGVDLMKMSKNDLLASLSKEDNTTTRDGSVCPAETSSK
Sbjct: 241 SIESFLEDDESDIKQADANQSIEGVDLMKMSKNDLLASLSKEDNTTTRDGSVCPAETSSK 300

Query: 353 TLLPQEGSLLMNSSPSSVGAKKSDSKRPAFVSVKNPKPIASSAVGIQFNETKVDSELEED 412
           TLLPQEGSLLMNSSPSSVGAKKSDSKRPAFVSVKNPKPIASSAVGIQFNETKVDSELEED
Sbjct: 301 TLLPQEGSLLMNSSPSSVGAKKSDSKRPAFVSVKNPKPIASSAVGIQFNETKVDSELEED 360

Query: 413 QFFTLLTGGGAKSSLF 428
           QFFTLLTGGGAKSSLF
Sbjct: 361 QFFTLLTGGGAKSSLF 376

BLAST of MC10g1033 vs. NCBI nr
Match: XP_022157203.1 (uncharacterized protein LOC111023972 isoform X2 [Momordica charantia])

HSP 1 Score: 590 bits (1521), Expect = 5.38e-209
Identity = 317/377 (84.08%), Postives = 347/377 (92.04%), Query Frame = 0

Query: 53  MAASDAENLEEKLASLLGQLHLESGILHKMIYKNKNQHRRSSYFRYLLQVRRDLRLLQAT 112
           MA+S+AEN EEKL SLLGQLHLESGILHKMIYKNKNQHRR SYFRYLLQV RDLRLLQAT
Sbjct: 1   MASSEAENFEEKLTSLLGQLHLESGILHKMIYKNKNQHRRGSYFRYLLQVGRDLRLLQAT 60

Query: 113 KLEELVSSCFQVIDGKKPKQKIHLLESLKRRKCEVGKYNFMERLLGAARLLSEVIFPLSF 172
           KLE+LVSSCFQVI GKKPKQKIHLLESLKRRKCEVGKYNFMERLLGAARLLSE++ P+  
Sbjct: 61  KLEDLVSSCFQVIGGKKPKQKIHLLESLKRRKCEVGKYNFMERLLGAARLLSEMVEPIFK 120

Query: 173 SSFEISILLARMFFTGFCFMILALLARIRVLVQQILLNVVSVFNMVSSISQKKHIVRINQ 232
           ++ EIS LLAR FFTGFCFMILALLARIRVLVQQIL++VVSVFNMVSSISQKKH V INQ
Sbjct: 121 AATEISTLLARRFFTGFCFMILALLARIRVLVQQILIDVVSVFNMVSSISQKKHTVTINQ 180

Query: 233 EGIEVFREFFPTNDEFVLLECVWKEDKFVLQETKQKIETRNWEEHPGPSVSSATSAVRYQ 292
           EGI+VFREF+PTN+EFV L+CVWKEDKFVLQETKQ  E+RNW+E+ GPSVS +TSA++Y 
Sbjct: 181 EGIQVFREFYPTNEEFVFLQCVWKEDKFVLQETKQNFESRNWQENLGPSVSLSTSAIQYT 240

Query: 293 SIESFLEDDESDIKQADANQSIEGVDLMKMSK-NDLLASLSKEDNTTTRDGSVCPAETSS 352
           SIESFLEDDES IKQA+ NQSIEG+DLMKMSK NDLLASLSK+DNT T+DGSVCP ETSS
Sbjct: 241 SIESFLEDDESAIKQAEVNQSIEGLDLMKMSKKNDLLASLSKKDNTATKDGSVCPTETSS 300

Query: 353 KTLLPQEGSLLMNSSPSSVGAKKSDSKRPAFVSVKNPKPIASSAVGIQFNETKVDSELEE 412
           KTLLPQEGSLL+NSSP+SVGA+KSD+KRPAFVSVKNP PI+ SAVGIQFNETKVDSE +E
Sbjct: 301 KTLLPQEGSLLVNSSPTSVGAEKSDTKRPAFVSVKNPNPISCSAVGIQFNETKVDSEEKE 360

Query: 413 DQFFTLLTGGGAKSSLF 428
           D FFTLLT G AKSSLF
Sbjct: 361 DPFFTLLTDGEAKSSLF 377

BLAST of MC10g1033 vs. NCBI nr
Match: XP_022157200.1 (uncharacterized protein LOC111023972 isoform X1 [Momordica charantia] >XP_022157201.1 uncharacterized protein LOC111023972 isoform X1 [Momordica charantia] >XP_022157202.1 uncharacterized protein LOC111023972 isoform X1 [Momordica charantia])

HSP 1 Score: 567 bits (1462), Expect = 3.86e-199
Identity = 320/433 (73.90%), Postives = 348/433 (80.37%), Query Frame = 0

Query: 53  MAASDAENLEEKLASLLGQLHLESGILHKMIYKNKNQHRRSSYFRYLLQVRRDLRLLQAT 112
           MA+S+AEN EEKL SLLGQLHLESGILHKMIYKNKNQHRR SYFRYLLQV RDLRLLQAT
Sbjct: 1   MASSEAENFEEKLTSLLGQLHLESGILHKMIYKNKNQHRRGSYFRYLLQVGRDLRLLQAT 60

Query: 113 KLEELVSSCFQVIDGKKPKQKIHLLESLKRRKCEVGKYNFMERLLGAARLLSEVIFPL-- 172
           KLE+LVSSCFQVI GKKPKQKIHLLESLKRRKCEVGKYNFMERLLGAARLLSE++ P+  
Sbjct: 61  KLEDLVSSCFQVIGGKKPKQKIHLLESLKRRKCEVGKYNFMERLLGAARLLSEMVEPIFK 120

Query: 173 -------------------------------------SFSSF-----------------E 232
                                                SF  F                 E
Sbjct: 121 AATQYYTTNQQPSLINFWWVTHSRKEKRSYGPSLCGPSFGVFGWKETIWSSTIKRSLLSE 180

Query: 233 ISILLARMFFTGFCFMILALLARIRVLVQQILLNVVSVFNMVSSISQKKHIVRINQEGIE 292
           IS LLAR FFTGFCFMILALLARIRVLVQQIL++VVSVFNMVSSISQKKH V INQEGI+
Sbjct: 181 ISTLLARRFFTGFCFMILALLARIRVLVQQILIDVVSVFNMVSSISQKKHTVTINQEGIQ 240

Query: 293 VFREFFPTNDEFVLLECVWKEDKFVLQETKQKIETRNWEEHPGPSVSSATSAVRYQSIES 352
           VFREF+PTN+EFV L+CVWKEDKFVLQETKQ  E+RNW+E+ GPSVS +TSA++Y SIES
Sbjct: 241 VFREFYPTNEEFVFLQCVWKEDKFVLQETKQNFESRNWQENLGPSVSLSTSAIQYTSIES 300

Query: 353 FLEDDESDIKQADANQSIEGVDLMKMSK-NDLLASLSKEDNTTTRDGSVCPAETSSKTLL 412
           FLEDDES IKQA+ NQSIEG+DLMKMSK NDLLASLSK+DNT T+DGSVCP ETSSKTLL
Sbjct: 301 FLEDDESAIKQAEVNQSIEGLDLMKMSKKNDLLASLSKKDNTATKDGSVCPTETSSKTLL 360

Query: 413 PQEGSLLMNSSPSSVGAKKSDSKRPAFVSVKNPKPIASSAVGIQFNETKVDSELEEDQFF 428
           PQEGSLL+NSSP+SVGA+KSD+KRPAFVSVKNP PI+ SAVGIQFNETKVDSE +ED FF
Sbjct: 361 PQEGSLLVNSSPTSVGAEKSDTKRPAFVSVKNPNPISCSAVGIQFNETKVDSEEKEDPFF 420

BLAST of MC10g1033 vs. NCBI nr
Match: XP_023520444.1 (uncharacterized protein LOC111783829 [Cucurbita pepo subsp. pepo] >XP_023520445.1 uncharacterized protein LOC111783829 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 531 bits (1368), Expect = 1.29e-185
Identity = 293/384 (76.30%), Postives = 330/384 (85.94%), Query Frame = 0

Query: 53  MAASDAENLEEKLASLLGQLHLESGILHKMIYKNKNQHRRSSYFRYLLQVRRDLRLLQAT 112
           MA+S AEN +EKLAS+L QL+LE GILHKMIYKNKNQHRRSSYFRYLLQVRRDLRLLQA 
Sbjct: 1   MASSVAENRQEKLASMLDQLYLECGILHKMIYKNKNQHRRSSYFRYLLQVRRDLRLLQAA 60

Query: 113 KLEELVSSCFQVIDGKKPKQKIHLLESLKRRKCEVGKYNFMERLLGAARLLSEVIFPLSF 172
           KLEELV+SCFQVIDGKKPKQKIH LESLKRRKCEVGKYNFME+LLGAARLLSE++ P+  
Sbjct: 61  KLEELVNSCFQVIDGKKPKQKIHFLESLKRRKCEVGKYNFMEQLLGAARLLSEMVEPIFK 120

Query: 173 SSFEISILLARMFFTGFCFMILALLARIRVLVQQILLNVVSVFNMVSSISQKKHIVRINQ 232
           ++ EISILLAR FFTGFCF+ILALLARIRVLVQQILL+VVS+FNMV+SIS+KKH+V INQ
Sbjct: 121 AATEISILLARTFFTGFCFIILALLARIRVLVQQILLDVVSIFNMVASISKKKHVVTINQ 180

Query: 233 EGIEVFREFFPTNDEFVLLECVWKEDKFVLQETKQKIETRNWEEHPGPSVSSATSAVRYQ 292
           EGI+VFREF+PTNDEFVLLECVWKEDKF+LQE KQ++ T N EEH GP+VSSA S VRYQ
Sbjct: 181 EGIQVFREFYPTNDEFVLLECVWKEDKFILQENKQEVATNNQEEHIGPNVSSAASTVRYQ 240

Query: 293 SIESFLEDDESDIKQADANQSI-EGVDLMKMSKNDLLASLSK-------EDNTTTRDGSV 352
           S++SFL DDE   KQA+ANQS  E +DLMKMSKNDLLAS SK       +D T T+D S+
Sbjct: 241 SLKSFLGDDEPATKQAEANQSNDEALDLMKMSKNDLLASPSKRVNDISVKDITETKDSSI 300

Query: 353 CPAETSSKTLLPQEGSLLMNSSPSSVGAKKSDSKRPAFVSVKNPKPIASSAVGIQFNETK 412
            PA TSS+T +P+EGS L+NSSPS VGAKK  SKRPAFVS+K P PI +SAVGIQFNETK
Sbjct: 301 SPAATSSQTFVPREGSSLVNSSPSLVGAKKLHSKRPAFVSIKPPNPITTSAVGIQFNETK 360

Query: 413 VDSELEEDQFFTLLTGGGAKSSLF 428
            DS  +ED FF LLTGG  KSSLF
Sbjct: 361 ADSVEKEDPFFALLTGGKRKSSLF 384

BLAST of MC10g1033 vs. NCBI nr
Match: KAG7015502.1 (hypothetical protein SDJN02_23138 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 531 bits (1367), Expect = 1.83e-185
Identity = 293/384 (76.30%), Postives = 330/384 (85.94%), Query Frame = 0

Query: 53  MAASDAENLEEKLASLLGQLHLESGILHKMIYKNKNQHRRSSYFRYLLQVRRDLRLLQAT 112
           MA+S AEN +EKLAS+L QL+LE GILHKMIYKNKNQHRRSSYFRYLLQVRRDLRLLQA 
Sbjct: 1   MASSVAENRQEKLASMLDQLYLECGILHKMIYKNKNQHRRSSYFRYLLQVRRDLRLLQAA 60

Query: 113 KLEELVSSCFQVIDGKKPKQKIHLLESLKRRKCEVGKYNFMERLLGAARLLSEVIFPLSF 172
           KLEELV+SCFQVIDGKKPKQKIH LESLKRRKCEVGKYNFME+LLGAARLLSE++ P+  
Sbjct: 61  KLEELVNSCFQVIDGKKPKQKIHFLESLKRRKCEVGKYNFMEQLLGAARLLSEMVEPIFK 120

Query: 173 SSFEISILLARMFFTGFCFMILALLARIRVLVQQILLNVVSVFNMVSSISQKKHIVRINQ 232
           ++ EISILLAR FFTGFCF+ILALLARIRVLVQQILL+VVS+FNMV+SIS+KKH+V INQ
Sbjct: 121 AATEISILLARTFFTGFCFIILALLARIRVLVQQILLDVVSIFNMVASISKKKHVVTINQ 180

Query: 233 EGIEVFREFFPTNDEFVLLECVWKEDKFVLQETKQKIETRNWEEHPGPSVSSATSAVRYQ 292
           EGI+VFREF+PTNDEFVLLECVWKEDKF+LQE KQ++ T N EEH GP+VSS  SAVRYQ
Sbjct: 181 EGIQVFREFYPTNDEFVLLECVWKEDKFILQENKQEVATNNQEEHIGPNVSSTASAVRYQ 240

Query: 293 SIESFLEDDESDIKQADANQSI-EGVDLMKMSKNDLLASLSK-------EDNTTTRDGSV 352
           S+ESFL DDE   KQA+ANQS  E +DLMKM+KNDLLAS SK       +D T T+D S+
Sbjct: 241 SLESFLGDDELATKQAEANQSNDEALDLMKMNKNDLLASPSKRVNDISVKDITETKDSSI 300

Query: 353 CPAETSSKTLLPQEGSLLMNSSPSSVGAKKSDSKRPAFVSVKNPKPIASSAVGIQFNETK 412
            PA TSS+T +P+EGS L+NSSPS VGAKK  SKRPAFVS+K P PI +SAVGIQFNETK
Sbjct: 301 SPAATSSQTFVPREGSSLVNSSPSLVGAKKLHSKRPAFVSIKLPNPITTSAVGIQFNETK 360

Query: 413 VDSELEEDQFFTLLTGGGAKSSLF 428
            DS  +ED FF LLTGG  KSSLF
Sbjct: 361 ADSVEKEDPFFALLTGGKRKSSLF 384

BLAST of MC10g1033 vs. ExPASy TrEMBL
Match: A0A6J1DA39 (uncharacterized protein LOC111019049 OS=Momordica charantia OX=3673 GN=LOC111019049 PE=4 SV=1)

HSP 1 Score: 683 bits (1762), Expect = 5.13e-246
Identity = 367/376 (97.61%), Postives = 372/376 (98.94%), Query Frame = 0

Query: 53  MAASDAENLEEKLASLLGQLHLESGILHKMIYKNKNQHRRSSYFRYLLQVRRDLRLLQAT 112
           MAASDAENLEEKLASLLGQLHLESGILHKMIYKNKNQHRRSSYFRYLLQVRRDLRLLQAT
Sbjct: 1   MAASDAENLEEKLASLLGQLHLESGILHKMIYKNKNQHRRSSYFRYLLQVRRDLRLLQAT 60

Query: 113 KLEELVSSCFQVIDGKKPKQKIHLLESLKRRKCEVGKYNFMERLLGAARLLSEVIFPLSF 172
           KLEELVSSCFQVIDGKKPKQKIHLLESLKRRKCEVGKYNFMERLLGAARLLSE++ P+  
Sbjct: 61  KLEELVSSCFQVIDGKKPKQKIHLLESLKRRKCEVGKYNFMERLLGAARLLSEMVEPIFK 120

Query: 173 SSFEISILLARMFFTGFCFMILALLARIRVLVQQILLNVVSVFNMVSSISQKKHIVRINQ 232
           ++ EISILLARMFFTGFCFMILALLARIRVLVQQILLNVVSVFNMVSSISQKKHIVRINQ
Sbjct: 121 AATEISILLARMFFTGFCFMILALLARIRVLVQQILLNVVSVFNMVSSISQKKHIVRINQ 180

Query: 233 EGIEVFREFFPTNDEFVLLECVWKEDKFVLQETKQKIETRNWEEHPGPSVSSATSAVRYQ 292
           EGIEVFREFFPTNDEFVLLECVWKEDKFVLQETKQKIETRNWEEHPGPSVSSATSAVRYQ
Sbjct: 181 EGIEVFREFFPTNDEFVLLECVWKEDKFVLQETKQKIETRNWEEHPGPSVSSATSAVRYQ 240

Query: 293 SIESFLEDDESDIKQADANQSIEGVDLMKMSKNDLLASLSKEDNTTTRDGSVCPAETSSK 352
           SIESFLEDDESDIKQADANQSIEGVDLMKMSKNDLLASLSKEDNTTTRDGSVCPAETSSK
Sbjct: 241 SIESFLEDDESDIKQADANQSIEGVDLMKMSKNDLLASLSKEDNTTTRDGSVCPAETSSK 300

Query: 353 TLLPQEGSLLMNSSPSSVGAKKSDSKRPAFVSVKNPKPIASSAVGIQFNETKVDSELEED 412
           TLLPQEGSLLMNSSPSSVGAKKSDSKRPAFVSVKNPKPIASSAVGIQFNETKVDSELEED
Sbjct: 301 TLLPQEGSLLMNSSPSSVGAKKSDSKRPAFVSVKNPKPIASSAVGIQFNETKVDSELEED 360

Query: 413 QFFTLLTGGGAKSSLF 428
           QFFTLLTGGGAKSSLF
Sbjct: 361 QFFTLLTGGGAKSSLF 376

BLAST of MC10g1033 vs. ExPASy TrEMBL
Match: A0A6J1DSS0 (uncharacterized protein LOC111023972 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111023972 PE=4 SV=1)

HSP 1 Score: 590 bits (1521), Expect = 2.61e-209
Identity = 317/377 (84.08%), Postives = 347/377 (92.04%), Query Frame = 0

Query: 53  MAASDAENLEEKLASLLGQLHLESGILHKMIYKNKNQHRRSSYFRYLLQVRRDLRLLQAT 112
           MA+S+AEN EEKL SLLGQLHLESGILHKMIYKNKNQHRR SYFRYLLQV RDLRLLQAT
Sbjct: 1   MASSEAENFEEKLTSLLGQLHLESGILHKMIYKNKNQHRRGSYFRYLLQVGRDLRLLQAT 60

Query: 113 KLEELVSSCFQVIDGKKPKQKIHLLESLKRRKCEVGKYNFMERLLGAARLLSEVIFPLSF 172
           KLE+LVSSCFQVI GKKPKQKIHLLESLKRRKCEVGKYNFMERLLGAARLLSE++ P+  
Sbjct: 61  KLEDLVSSCFQVIGGKKPKQKIHLLESLKRRKCEVGKYNFMERLLGAARLLSEMVEPIFK 120

Query: 173 SSFEISILLARMFFTGFCFMILALLARIRVLVQQILLNVVSVFNMVSSISQKKHIVRINQ 232
           ++ EIS LLAR FFTGFCFMILALLARIRVLVQQIL++VVSVFNMVSSISQKKH V INQ
Sbjct: 121 AATEISTLLARRFFTGFCFMILALLARIRVLVQQILIDVVSVFNMVSSISQKKHTVTINQ 180

Query: 233 EGIEVFREFFPTNDEFVLLECVWKEDKFVLQETKQKIETRNWEEHPGPSVSSATSAVRYQ 292
           EGI+VFREF+PTN+EFV L+CVWKEDKFVLQETKQ  E+RNW+E+ GPSVS +TSA++Y 
Sbjct: 181 EGIQVFREFYPTNEEFVFLQCVWKEDKFVLQETKQNFESRNWQENLGPSVSLSTSAIQYT 240

Query: 293 SIESFLEDDESDIKQADANQSIEGVDLMKMSK-NDLLASLSKEDNTTTRDGSVCPAETSS 352
           SIESFLEDDES IKQA+ NQSIEG+DLMKMSK NDLLASLSK+DNT T+DGSVCP ETSS
Sbjct: 241 SIESFLEDDESAIKQAEVNQSIEGLDLMKMSKKNDLLASLSKKDNTATKDGSVCPTETSS 300

Query: 353 KTLLPQEGSLLMNSSPSSVGAKKSDSKRPAFVSVKNPKPIASSAVGIQFNETKVDSELEE 412
           KTLLPQEGSLL+NSSP+SVGA+KSD+KRPAFVSVKNP PI+ SAVGIQFNETKVDSE +E
Sbjct: 301 KTLLPQEGSLLVNSSPTSVGAEKSDTKRPAFVSVKNPNPISCSAVGIQFNETKVDSEEKE 360

Query: 413 DQFFTLLTGGGAKSSLF 428
           D FFTLLT G AKSSLF
Sbjct: 361 DPFFTLLTDGEAKSSLF 377

BLAST of MC10g1033 vs. ExPASy TrEMBL
Match: A0A6J1DTY3 (uncharacterized protein LOC111023972 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111023972 PE=4 SV=1)

HSP 1 Score: 567 bits (1462), Expect = 1.87e-199
Identity = 320/433 (73.90%), Postives = 348/433 (80.37%), Query Frame = 0

Query: 53  MAASDAENLEEKLASLLGQLHLESGILHKMIYKNKNQHRRSSYFRYLLQVRRDLRLLQAT 112
           MA+S+AEN EEKL SLLGQLHLESGILHKMIYKNKNQHRR SYFRYLLQV RDLRLLQAT
Sbjct: 1   MASSEAENFEEKLTSLLGQLHLESGILHKMIYKNKNQHRRGSYFRYLLQVGRDLRLLQAT 60

Query: 113 KLEELVSSCFQVIDGKKPKQKIHLLESLKRRKCEVGKYNFMERLLGAARLLSEVIFPL-- 172
           KLE+LVSSCFQVI GKKPKQKIHLLESLKRRKCEVGKYNFMERLLGAARLLSE++ P+  
Sbjct: 61  KLEDLVSSCFQVIGGKKPKQKIHLLESLKRRKCEVGKYNFMERLLGAARLLSEMVEPIFK 120

Query: 173 -------------------------------------SFSSF-----------------E 232
                                                SF  F                 E
Sbjct: 121 AATQYYTTNQQPSLINFWWVTHSRKEKRSYGPSLCGPSFGVFGWKETIWSSTIKRSLLSE 180

Query: 233 ISILLARMFFTGFCFMILALLARIRVLVQQILLNVVSVFNMVSSISQKKHIVRINQEGIE 292
           IS LLAR FFTGFCFMILALLARIRVLVQQIL++VVSVFNMVSSISQKKH V INQEGI+
Sbjct: 181 ISTLLARRFFTGFCFMILALLARIRVLVQQILIDVVSVFNMVSSISQKKHTVTINQEGIQ 240

Query: 293 VFREFFPTNDEFVLLECVWKEDKFVLQETKQKIETRNWEEHPGPSVSSATSAVRYQSIES 352
           VFREF+PTN+EFV L+CVWKEDKFVLQETKQ  E+RNW+E+ GPSVS +TSA++Y SIES
Sbjct: 241 VFREFYPTNEEFVFLQCVWKEDKFVLQETKQNFESRNWQENLGPSVSLSTSAIQYTSIES 300

Query: 353 FLEDDESDIKQADANQSIEGVDLMKMSK-NDLLASLSKEDNTTTRDGSVCPAETSSKTLL 412
           FLEDDES IKQA+ NQSIEG+DLMKMSK NDLLASLSK+DNT T+DGSVCP ETSSKTLL
Sbjct: 301 FLEDDESAIKQAEVNQSIEGLDLMKMSKKNDLLASLSKKDNTATKDGSVCPTETSSKTLL 360

Query: 413 PQEGSLLMNSSPSSVGAKKSDSKRPAFVSVKNPKPIASSAVGIQFNETKVDSELEEDQFF 428
           PQEGSLL+NSSP+SVGA+KSD+KRPAFVSVKNP PI+ SAVGIQFNETKVDSE +ED FF
Sbjct: 361 PQEGSLLVNSSPTSVGAEKSDTKRPAFVSVKNPNPISCSAVGIQFNETKVDSEEKEDPFF 420

BLAST of MC10g1033 vs. ExPASy TrEMBL
Match: A0A6J1F0Y0 (uncharacterized protein LOC111438493 OS=Cucurbita moschata OX=3662 GN=LOC111438493 PE=4 SV=1)

HSP 1 Score: 526 bits (1354), Expect = 8.37e-184
Identity = 290/384 (75.52%), Postives = 329/384 (85.68%), Query Frame = 0

Query: 53  MAASDAENLEEKLASLLGQLHLESGILHKMIYKNKNQHRRSSYFRYLLQVRRDLRLLQAT 112
           MA+S AEN +EKLAS+L QL+LE GILHKMIYKNKNQHRRSSYF+YLLQVRRDLRLLQA 
Sbjct: 1   MASSVAENRQEKLASMLDQLYLECGILHKMIYKNKNQHRRSSYFQYLLQVRRDLRLLQAA 60

Query: 113 KLEELVSSCFQVIDGKKPKQKIHLLESLKRRKCEVGKYNFMERLLGAARLLSEVIFPLSF 172
           KLEELV+SCFQVIDGKKPKQKIH LESLKRRKCEVGKYNFME+LLGAARLLSE++ P+  
Sbjct: 61  KLEELVNSCFQVIDGKKPKQKIHFLESLKRRKCEVGKYNFMEQLLGAARLLSEMVEPIFK 120

Query: 173 SSFEISILLARMFFTGFCFMILALLARIRVLVQQILLNVVSVFNMVSSISQKKHIVRINQ 232
           ++ EISILLAR FFTGFCF+ILALLARIRVLVQQILL+VVS+FNMV+SIS+KKH+V INQ
Sbjct: 121 AATEISILLARTFFTGFCFIILALLARIRVLVQQILLDVVSIFNMVASISKKKHVVTINQ 180

Query: 233 EGIEVFREFFPTNDEFVLLECVWKEDKFVLQETKQKIETRNWEEHPGPSVSSATSAVRYQ 292
           EGI+VFREF+PTNDEFVLLECVWK+DKF+LQE KQ++ T N EEH GP+VSSA S V YQ
Sbjct: 181 EGIQVFREFYPTNDEFVLLECVWKDDKFILQENKQEVATNNQEEHIGPNVSSAASTVLYQ 240

Query: 293 SIESFLEDDESDIKQADANQSI-EGVDLMKMSKNDLLASLSK-------EDNTTTRDGSV 352
           S+ESFL D+E   KQA+ANQS  E +DLMKMSKNDLLAS SK       +D T T+D S+
Sbjct: 241 SLESFLGDEEPATKQAEANQSNDEALDLMKMSKNDLLASPSKRVNDISVKDITETKDSSI 300

Query: 353 CPAETSSKTLLPQEGSLLMNSSPSSVGAKKSDSKRPAFVSVKNPKPIASSAVGIQFNETK 412
            PA TSS+T +P+EGS L+NSSPS VGAKK  SKRPAFVS+K P PI +SAVGIQFNETK
Sbjct: 301 SPAATSSQTFVPREGSSLVNSSPSLVGAKKLHSKRPAFVSIKPPSPITTSAVGIQFNETK 360

Query: 413 VDSELEEDQFFTLLTGGGAKSSLF 428
            DS  +ED FF LLTGG  KSSLF
Sbjct: 361 ADSVEKEDPFFALLTGGKRKSSLF 384

BLAST of MC10g1033 vs. ExPASy TrEMBL
Match: A0A6J1J4A8 (uncharacterized protein LOC111483286 OS=Cucurbita maxima OX=3661 GN=LOC111483286 PE=4 SV=1)

HSP 1 Score: 523 bits (1346), Expect = 1.33e-182
Identity = 290/384 (75.52%), Postives = 328/384 (85.42%), Query Frame = 0

Query: 53  MAASDAENLEEKLASLLGQLHLESGILHKMIYKNKNQHRRSSYFRYLLQVRRDLRLLQAT 112
           MA+S+AEN +EKLAS+L QL LESGILHKMIYKNKNQHRRS YFRYLLQVRRDLRLLQA 
Sbjct: 1   MASSEAENRQEKLASMLDQLCLESGILHKMIYKNKNQHRRSFYFRYLLQVRRDLRLLQAA 60

Query: 113 KLEELVSSCFQVIDGKKPKQKIHLLESLKRRKCEVGKYNFMERLLGAARLLSEVIFPLSF 172
           KL+EL+SSCFQVIDGKKPKQKIH LESLKRRKCEVGKYNFME+LLGA+RLLSE++ P+  
Sbjct: 61  KLKELISSCFQVIDGKKPKQKIHFLESLKRRKCEVGKYNFMEQLLGASRLLSEMVEPIFK 120

Query: 173 SSFEISILLARMFFTGFCFMILALLARIRVLVQQILLNVVSVFNMVSSISQKKHIVRINQ 232
           ++ EISILLAR FFTGFCF+ILALLARI VLVQQILL+VVS+FNMV+SIS+KKH+V INQ
Sbjct: 121 AATEISILLARTFFTGFCFIILALLARIWVLVQQILLDVVSIFNMVASISKKKHVVTINQ 180

Query: 233 EGIEVFREFFPTNDEFVLLECVWKEDKFVLQETKQKIETRNWEEHPGPSVSSATSAVRYQ 292
           EGI+VFREF+PTNDEFVLLECVWKE KF+LQE KQ++ T N EEH GP+VSSA S VRYQ
Sbjct: 181 EGIQVFREFYPTNDEFVLLECVWKEGKFILQENKQEVATNNQEEHIGPNVSSAASTVRYQ 240

Query: 293 SIESFLEDDESDIKQADANQSIE-GVDLMKMSKNDLLAS-------LSKEDNTTTRDGSV 352
           S+ESFL DDE   KQA+ANQS E GVDLMKMSKNDLLAS       +S +D T T+D S+
Sbjct: 241 SLESFLGDDEPATKQAEANQSNEKGVDLMKMSKNDLLASPSERVNEISVKDITETKDSSI 300

Query: 353 CPAETSSKTLLPQEGSLLMNSSPSSVGAKKSDSKRPAFVSVKNPKPIASSAVGIQFNETK 412
            PA TSS+T +P+ GS L+NSSPS VGAKK  SKRPAFVS+K P PI +SAVGIQFNETK
Sbjct: 301 SPAATSSQTFMPRGGSSLVNSSPSLVGAKKLHSKRPAFVSIKPPNPITTSAVGIQFNETK 360

Query: 413 VDSELEEDQFFTLLTGGGAKSSLF 428
            DS +EED FF  LTGG  KSSLF
Sbjct: 361 ADS-VEEDPFFAFLTGGKPKSSLF 383

BLAST of MC10g1033 vs. TAIR 10
Match: AT1G50910.1 (unknown protein; Has 1105 Blast hits to 802 proteins in 217 species: Archae - 2; Bacteria - 177; Metazoa - 445; Fungi - 210; Plants - 58; Viruses - 6; Other Eukaryotes - 207 (source: NCBI BLink). )

HSP 1 Score: 237.7 bits (605), Expect = 1.8e-62
Identity = 165/371 (44.47%), Postives = 225/371 (60.65%), Query Frame = 0

Query: 53  MAASDAENLEEKLASLLGQLHLESGILHKMIYKNKNQHRRSSYFRYLLQVRRDLRLLQAT 112
           M  +  + LEEKL S L QL LE  +  +M+YKNKNQHRR SYF+YLL+VRR+LRLL+  
Sbjct: 1   MDDTQVKALEEKLKSQLSQLELEQAVFERMVYKNKNQHRRCSYFQYLLKVRRELRLLRTA 60

Query: 113 KLEELVSSCFQVIDGKKPKQKIHLLESLKRRKCEVGKYNFMERLLGAARLLSEVIFPLSF 172
            +E ++  CF VI G+  KQKIH+LESLK +K + GK N +ERLLGA RLLS++  P+  
Sbjct: 61  NMEGMLRPCFHVISGRISKQKIHVLESLKLKKSDTGKPNILERLLGALRLLSQMTEPILK 120

Query: 173 SSFEISILLARMFFTGFCFMILALLARIRVLVQQILLNVVSVFNMVSSISQKKHIVRINQ 232
           ++  IS LLAR FF GF    LALLAR+RVLVQQILL+ VSVFN V+S S KK  V+I Q
Sbjct: 121 AASGISTLLARSFFIGFSVTFLALLARLRVLVQQILLDAVSVFNSVTSTSLKKQSVKIAQ 180

Query: 233 EGIEVFREFFPTNDEFV-LLECVWKEDKFVLQETKQKIE-TRNWEEHPGPSVSSATSAVR 292
           +G+EVFREF+P  +E V LL+CVWK DK+VL ET Q  E ++  E++    V++  S V+
Sbjct: 181 DGVEVFREFYPKEEECVTLLDCVWKTDKYVLLETLQNSENSKPMEKNVSEDVTTRDSLVQ 240

Query: 293 YQSIESFLEDDESDIKQAD-----ANQSIEGVDLMKMSKNDLLASLSKEDNTTTRDGSVC 352
           YQ+  S L +D S + +AD       +S   +D    SK ++   L  ED+    D +  
Sbjct: 241 YQTSVSSLAEDLSPLLRADNSGATVRESSTPIDEAASSKTNI--GLQPEDSENPEDATTR 300

Query: 353 PAETSSKTL---LPQEGSLLMNSSPSSVGAKKSDSKRPAFVSVKNPKPIASSAVGIQFNE 412
                 +T    L ++ S L+ +  S V  ++S +            PIA +A     N 
Sbjct: 301 DCSVQYETFVSPLGEDLSPLLEADNSGVTLRESST------------PIAEAASSKTNNA 357

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_022151030.11.06e-24597.61uncharacterized protein LOC111019049 [Momordica charantia] >XP_022151031.1 uncha... [more]
XP_022157203.15.38e-20984.08uncharacterized protein LOC111023972 isoform X2 [Momordica charantia][more]
XP_022157200.13.86e-19973.90uncharacterized protein LOC111023972 isoform X1 [Momordica charantia] >XP_022157... [more]
XP_023520444.11.29e-18576.30uncharacterized protein LOC111783829 [Cucurbita pepo subsp. pepo] >XP_023520445.... [more]
KAG7015502.11.83e-18576.30hypothetical protein SDJN02_23138 [Cucurbita argyrosperma subsp. argyrosperma][more]
Match NameE-valueIdentityDescription
A0A6J1DA395.13e-24697.61uncharacterized protein LOC111019049 OS=Momordica charantia OX=3673 GN=LOC111019... [more]
A0A6J1DSS02.61e-20984.08uncharacterized protein LOC111023972 isoform X2 OS=Momordica charantia OX=3673 G... [more]
A0A6J1DTY31.87e-19973.90uncharacterized protein LOC111023972 isoform X1 OS=Momordica charantia OX=3673 G... [more]
A0A6J1F0Y08.37e-18475.52uncharacterized protein LOC111438493 OS=Cucurbita moschata OX=3662 GN=LOC1114384... [more]
A0A6J1J4A81.33e-18275.52uncharacterized protein LOC111483286 OS=Cucurbita maxima OX=3661 GN=LOC111483286... [more]
Match NameE-valueIdentityDescription
AT1G50910.11.8e-6244.47unknown protein; Has 1105 Blast hits to 802 proteins in 217 species: Archae - 2;... [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR027951Domain of unknown function DUF4477PFAMPF14780DUF4477coord: 61..216
e-value: 2.9E-11
score: 43.4
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 331..383
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..37
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 331..373
NoneNo IPR availablePANTHERPTHR34786OS09G0504900 PROTEINcoord: 55..428
NoneNo IPR availablePANTHERPTHR34786:SF1OS09G0504900 PROTEINcoord: 55..428

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
MC10g1033.1MC10g1033.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane