Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTCAACCAGCGAATTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCAATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAAGGGCAAGGTCACGACGGCCTACCAACGGAACCCCTCCGCAGGTCGGCACGGAACACCGCGCCCGCCCTACCGCCCGCGCAGCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTATAACGAAATGGTGCTAGCTGCATGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCATGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAAGAACGTCCCGAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAATACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCTGAAGTTCAAAGCTCCTGCCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCGGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCACCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCATCATCAGGCATAAGGAGGGCGAGACGCTGCAGGAGTATGTCACTAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCGAAGAAAGTCATCGATGGACATGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAAAACGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTCTGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTAAGGATCTAATTCCAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAATCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGAGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGCGAATCAGTCATCCCAGAGGGTTGCATCAACTTGCCGGTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAGTTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGTCATTCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCAGAGGAGAACAGACCGCTTCGAGGGAATGCTATGCCGCCGCACTCAAAGGCTCATCGGTTTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGCTCGAGGCCGACCTGCCAAGGAAGGAGTTTGCCGCACCCACCGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGGTAAGCATAGGAACCAAGCTGGGGGCCACCGACAGAGAGGAGCTAATCCACTTCCTCAGATCCAACTCGGACGTCTTTGCGTGGTCCCATGAGGACATGCCTGGCATTGACCCGCGAATTATGACGCATCGCCTCAGCATAGATCCATCATTCCGACCTGTGAAACAAAAGAGAAGACCTATAAACAAGGAGAGGAGTGATGTAATTGTTGAGGAAGTTAACAAACTTTTGAAAGCTGAATACATAAGAGAAATTTCGTATCCCGAATGGCTCTCCAATGTTGTATTAGTTAAAAAATCTAACGGCAAGTGGAGAATGTGCGTAGATTTTACGAACTTAAATAAGGCATGTCCGAAAGATTGCTTCCCACTGCCGAGGATTGATCAGCTCGTGGACGTCACAGCCGGGCACGAAATGCTCACCTTCATGGACGCCTACTCTGGGTACAACCAAATCAAGATGCATGTCCCAGATGAAGGTCATACCGCTTTCATAACAGACCAAGGTCTGTACTGCTACAAGGTCATGCCCTTCGGGTTAAAGAACGCAGGAGCAACCTACCAGAGAATGGTGAACAAAATGTTCGCCAAGCAGATCGCCCGGAATATGGAAGTGTATGTGGACGACATACTTGTCAAGAGCAAGCAGTCTAAGTCACATCTCTCCGACCTGGCCGAAGCCTTCGAGGTTCTGAGGGCATATCAAATGAAGCTCAACCCTGCTAAGTGTGCCTTTGGAGTCTCCTAGGGAAAATTCCTTGGCTTCATGGTAAACAACCGGGGAATCGAGGCCAACCCCGAAAAGATTAAAGCCGTGATCGAGATGGAGGCACCTAAAACGCTGAAACAGCTTCAGTGCCTCAATGGCAGGATTGCGGCCCTGAACCGGTTTGTTTCAAGGTCGACAGATAAGTGCCTCCCTTTCTTCAAGGTCTTACGAAAGAAAGGACCGTTTGAATGGACAGCGGAGTGCGAGCAAGCGTTTCAGCAATTGAAGAGCTACCTCTGTTCGGCACCCTTGCTTGTCAAACCCGTGCCGGGGGACAAGCTCCAATTGTACCTAGCAGTGTCTGACAGTGCCGTCAGCTCGGCCCTAATCAGGCAAGAGGAAGCGCGGCAAAACCCGGTCTACTACACAAGCAACGCTATGACCGAAGCCGAGACTAGATACCCTCAGATGGAGAAGTTGGCTCTCGCTTTGGTCACCTCGGCCCGACGACTTAGACCATACTTCCAAGCCCATACGGTGGTGGTGCTCACTATCTTGCCCCTTAAAAGTATCTTCCACAAGCCGGAAGCTTCCGCAAGCCTAATGAAGTGGGCAATAGAGCTAAGTGAGTACGACATCCAGTTCGAACCCAGAACTGCGTTGAAAGGACAAGCAGCGGCAGATTTCATAGCCGAGCTCACACCACCTTCCGAGCTGAGCGAGTCCGACCTACCTTGGACAGTCTATGTCGACGGATCCTCCAATGAGAAGGGGTGCGGAGCCGGGGTCCTCCTGCTCGGACCAGGAGGTGAGCGATTTGAGTACGCCTTGCGGTTCAGCTTCCGGACTTCTAACAACGAGGCTGAGTATGAAGCATTTATTGCTGGCCTGCGAATCGCTCGAGCATTGGGGGCCTCTTGTGTTAAGGTCTTCAGTGACTCCCAGCTGGTTGTGAGCCAGATCAAGGACGAATACCAAGCCAAAGACACCCGAATGGAGAAGTATTTGGGCAAGGTCAGATCACACCTCGCCCAGTTTCGAACTTACGAGGTAAGCAGGATTCCACGAGCAGAAAATTCTAATGCTGACGCCTTGGCCAAGTTAGCATCGGCGTACGAGACCGATCTCGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCACGGACTTCATTAGGGGCAATTCACCACAAGATCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTATTGAAATGCCTAACCCCTGAAGAGGGCTTGTACGTCCTCAGAGAGATCCACGAAGGAGTGTGCGGCAATCACTCAGGCGCCCGGTCGCTGTCAGCCAAAGTGATCCGGCAAGGATACTATTGGCCGACCCTCAGCCAGGATGCCAAGAAGTTCGTTAGAACTTGCGACAATTGCCAACGCTACGGAACCATAATCCACCAACCTCCCGAGCTGCTCACCCCCATCTCGGCCCCATGGCCATTCGCGCGGTGGGGGGTAGATATCATTGGTCCTTTCCCTTTGGGCAAGGGCCAGACCAAGTTCGCTATGGTTGCTGTGGATTACTTCACCAAGTGGGCCGAGGCCGAGGCGCTCTCCCACATAACGGAATCCAGGGCCACATCCTTCGTATGGACGAATATCATATGTCGCTTTGGTATACCGCAGGCCATAGTGACAGACAATGGGAAGCAGTTTGACAACGTCAAGTTCAAAGACTTTTGCAGCAAACTTGGCATAAGTCATCTCAGCTCGTCCCCCGCACATCCGCAAGCAAATGGGCAGGTGGAGGCAGTCAACAAGATCATCAAGCGCGACATCAAACTTAGACTGGACTCCAAGAAAGGCAGGTGGGCCGAGGAGCTACCAGAGGTTCTATGGTCGTACCGGACCACCCAACGAGAGTCGACAGGTGAGACCCCGTTCTCCCTGGCCTTCGGCTCCGAAGCTGTAGTCCCGGTTGAGATCGGCATACCATCTGACAGAGTAGAGCATTACGAGCCTACGACGAATGAGGATGGGCTACTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCTAAACGCATGTGGGTGCCCTTGATCCGACCTGGGAGGGCCCGTTTAAGATCAAGGGCATAGTCCGACCTGGGACGTACATATTGGTCGATCTGAAAGGAGACGTCCTCGCGCACTCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGAAATGACAAAAGGATTTTCAATGCATCTGTAAAGACTGTTCCAAAAGAATTATGATCGGAATAAATGTGATGATTTAATTTCATGCTTCCGAGTTCGACCAGAAATTAAATGGGGGCCGCGGACTCCCACGCGATCGCATTCCAGCAGTTGGTTCAAATTCAACCCTCCGAAGCCTAAGGGTACGAGGTGCGATGCCAAAGCCACTGACGAACTTAAAGTTCAAAACCTTCAAGGCAAAGGGGCGATGTGAAAAGTTCAAAATGATCAAGCCTCCGAACTTGGGGGTACGAGGAATGATATGA
mRNA sequence
ATGGCTCAACCAGCGAATTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCAATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAAGGGCAAGGTCACGACGGCCTACCAACGGAACCCCTCCGCAGGTCGGCACGGAACACCGCGCCCGCCCTACCGCCCGCGCAGCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTATAACGAAATGGTGCTAGCTGCATGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCATGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAAGAACGTCCCGAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAATACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCTGAAGTTCAAAGCTCCTGCCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCGGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCACCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCATCATCAGGCATAAGGAGGGCGAGACGCTGCAGGAGTATGTCACTAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCGAAGAAAGTCATCGATGGACATGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAAAACGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTCTGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTAAGGATCTAATTCCAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAATCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGAGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGCGAATCAGTCATCCCAGAGGGTTGCATCAACTTGCCGGTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAGTTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGTCATTCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCAGAGGAGAACAGACCGCTTCGAGGGAATGCTATGCCGCCGCACTCAAAGGCTCATCGGTTTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGCTCGAGGCCGACCTGCCAAGGAAGGAGTTTGCCGCACCCACCGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGATCTCGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCACGGACTTCATTAGGGGCAATTCACCACAAGATCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGAAATTAAATGGGGGCCGCGGACTCCCACGCGATCGCATTCCAGCAGTTGGTTCAAATTCAACCCTCCGAAGCCTAAGGGTACGAGGTGCGATGCCAAAGCCACTGACGAACTTAAAGTTCAAAACCTTCAAGGCAAAGGGGCGATGTGAAAAGTTCAAAATGATCAAGCCTCCGAACTTGGGGGTACGAGGAATGATATGA
Coding sequence (CDS)
ATGGCTCAACCAGCGAATTCGACCAATACGACAGATCGAAGGACTCTAGCTGCCAGCAATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAAGGGCAAGGTCACGACGGCCTACCAACGGAACCCCTCCGCAGGTCGGCACGGAACACCGCGCCCGCCCTACCGCCCGCGCAGCCGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTATAACGAAATGGTGCTAGCTGCATGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCATGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAAGAACGTCCCGAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAATACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCTGAAGTTCAAAGCTCCTGCCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCGGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCACCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCATCATCAGGCATAAGGAGGGCGAGACGCTGCAGGAGTATGTCACTAGATTCCAGGAGGAGCAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCGAAGAAAGTCATCGATGGACATGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAAAACGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTCTGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTAAGGATCTAATTCCAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATACCATTTTTGGAGGGCCAAGCGGGGGTCAATCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGCGCGAGGTGTGCATCATCAGAGAGCAGGGGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGCGAATCAGTCATCCCAGAGGGTTGCATCAACTTGCCGGTCACGCTGGGGCAGGACCAAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAGTTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGTCATTCCCTCAACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCGTGGGCACGGTCAGAGGAGAACAGACCGCTTCGAGGGAATGCTATGCCGCCGCACTCAAAGGCTCATCGGTTTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGCTCGAGGCCGACCTGCCAAGGAAGGAGTTTGCCGCACCCACCGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGATCTCGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCACGGACTTCATTAGGGGCAATTCACCACAAGATCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGAAATTAAATGGGGGCCGCGGACTCCCACGCGATCGCATTCCAGCAGTTGGTTCAAATTCAACCCTCCGAAGCCTAAGGGTACGAGGTGCGATGCCAAAGCCACTGACGAACTTAAAGTTCAAAACCTTCAAGGCAAAGGGGCGATGTGAAAAGTTCAAAATGATCAAGCCTCCGAACTTGGGGGTACGAGGAATGATATGA
Protein sequence
MAQPANSTNTTDRRTLAASNAHQREVGAAAVEGQGHDGLPTEPLRRSARNTAPALPPAQPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRSMEAMYNEMVLAACAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEALKAKCEQKYDSLNDGDLGESPFTSDVLEAPIPLKFKAPAVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPAGSISTYSQLRREFLTQFSSRHYDKKTATHLAIIRHKEGETLQEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIKDLIPDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCINLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALETLRDGTLELEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILEPDLMEIGAPESSWMDPITDFIRGNSPQDPKERRKLARRAARFVKLNGGRGLPRDRIPAVGSNSTLRSLRVRGAMPKPLTNLKFKTFKAKGRCEKFKMIKPPNLGVRGMI
Homology
BLAST of Moc06g14420 vs. NCBI nr
Match:
XP_022150760.1 (uncharacterized protein LOC111018823 [Momordica charantia])
HSP 1 Score: 962.2 bits (2486), Expect = 3.3e-276
Identity = 519/669 (77.58%), Postives = 549/669 (82.06%), Query Frame = 0
Query: 187 SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKYDSLNDGDLGESPFT 246
SSNQQAESSHNPA G+ITREEFDQLRG+L+AQVEALKAKCEQK LNDGDLGESPFT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 247 SDVLEAPIPLKFKAPAVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 306
SDVLE AP VK YDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 307 LWYRRLPAGSISTYSQLRREFLTQFSSRHYDKKTATHLAIIRHKEGETLQEYVTRFQEEQ 366
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 367 LKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRP 426
LKVA SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDG ELLRTKTGRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 427 ERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT 486
ER I RGRSGKDE+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEILT
Sbjct: 242 ERGIDRGRSGKDEKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILT 301
Query: 487 NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIKDLIPDGYFK 546
NIE+SGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQI+DLI D YFK
Sbjct: 302 NIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFK 361
Query: 547 KFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREV 606
KFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAARREV
Sbjct: 362 KFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREV 421
Query: 607 CIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTY 666
CIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TY
Sbjct: 422 CIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTY 481
Query: 667 LALGWTRSQLKRSPTPLVGFSGESVIPEGCINLPVTLGQDQTRVTQMAEFVVVDGRSAYN 726
LALGWTRSQLK+S TPLVGFS ESVIPEGCI+LPVTLG DQT+VTQMAEFVV+DGRSAYN
Sbjct: 482 LALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYN 541
Query: 727 AIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALETL- 786
AIFGRPIIHSFR IPSTLHQVLKYSTPNGVG VRGEQ ASRECYA+ALKGSSVCALETL
Sbjct: 542 AIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETLV 601
Query: 787 -RDGTLELEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE 846
RDGTLE +A+LPR+EFAAPTEELELVPLL + +E +L + +D+ +E
Sbjct: 602 SRDGTLEFKANLPRREFAAPTEELELVPLLRYKYNENIDHEQELDEKSSLNKIDDDIGVE 609
Query: 847 --PDLMEIG 849
P+ + +G
Sbjct: 662 GMPEPLNVG 609
BLAST of Moc06g14420 vs. NCBI nr
Match:
XP_022137317.1 (uncharacterized protein LOC111008813 [Momordica charantia])
HSP 1 Score: 958.7 bits (2477), Expect = 3.6e-275
Identity = 491/528 (92.99%), Postives = 503/528 (95.27%), Query Frame = 0
Query: 191 QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKYDSLNDGDLGESPFTSDVL 250
+AESS N PAG+ITREEFDQLRG+LDAQVEALKAKCEQK LNDGDLGESPFTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 251 EAPIPLKFKAPAVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR 310
EAPIP KFKAP VKPYDG+KDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 311 RLPAGSISTYSQLRREFLTQFSSRHYDKKTATHLAIIRHKEGETLQEYVTRFQEEQLKVA 370
RLPA SISTYSQLRREFL FSSRHYDKKTATHLA IR KEGETL+EYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 371 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKI 430
HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDG ELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 431 GRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 490
GRGRSGKD E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 491 DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIKDLIPDGYFKKFV 550
+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQI++LI DGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 551 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCII 610
GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELARAARREVCII
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 611 REQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLAL 670
REQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 671 GWTRSQLKRSPTPLVGFSGESVIPEGCINLPVTLGQDQTRVTQMAEFV 715
GWTRSQLK+SPTPLVGFSGESVIPEG I+LPVTLGQDQT+VTQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc06g14420 vs. NCBI nr
Match:
XP_022152854.1 (uncharacterized protein LOC111020479 [Momordica charantia])
HSP 1 Score: 927.2 bits (2395), Expect = 1.2e-265
Identity = 511/790 (64.68%), Postives = 566/790 (71.65%), Query Frame = 0
Query: 1 MAQPANSTNTTDRRTLAASNAHQREVGAAAVEGQGHDGLPTEPLRRSARNTAPALPPAQP 60
M QPANSTNT DRR LAA++ HQREVGA VEGQGH+ L TEPL RSAR T P LPPA P
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRSMEAMYNEMVLAACAG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 SRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 PSRSHRSSNQQAESSHNP--AGIITREEFDQLRGELDAQVEALKAKCEQKYDSLNDGDLG 240
AESS+NP G+ITREEFDQL+ + DAQVEALKA+CE+K S +DGDLG
Sbjct: 181 -----------AESSYNPITPGVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLG 240
Query: 241 ESPFTSDVLEAPIPLKFKAPAVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL 300
E F+SD+LEA IP KFK P +KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Sbjct: 241 ELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIAL 300
Query: 301 TGSARLWYRRLPAGSISTYSQLRREFLTQFSSRHYDKKTATHLAIIRHKEGETLQEYVTR 360
TGSARLWYRRLPA ISTYSQLR+EF++QFSSRHYD+KT THLA IR KEGETL+EYVTR
Sbjct: 301 TGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTR 360
Query: 361 FQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGHELLRT 420
F EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDG ELLRT
Sbjct: 361 FPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRT 420
Query: 421 KTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTI 480
KTGRPE+ I +GR+GKD+ +AD KS+DKG S SS R +YRR+ + +SRPYE +TPTTI
Sbjct: 421 KTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTI 480
Query: 481 PISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIKDL 540
PI EILTNIE++GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQI+DL
Sbjct: 481 PIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDL 540
Query: 541 IPDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA 600
I DGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN K+KELA
Sbjct: 541 IQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKELA 600
Query: 601 RAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASAN 660
R ARREVCIIREQ PT I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASAN
Sbjct: 601 REARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASAN 650
Query: 661 ILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCINLPVTLGQDQTRVTQMAEFVVV 720
ILSL TYLALGWTRSQLK+SPTPLVGFSGES+ EGCI+LPV++ QD T+VTQMAEFVV+
Sbjct: 661 ILSLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVI 650
Query: 721 DGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSV 780
DGRSAYNAIFGRPIIHSFR +PSTLHQVLKYST NGVGTVRGE SRECYA+ K SSV
Sbjct: 721 DGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSSV 650
Query: 781 CALE--TLRD 785
CALE T+RD
Sbjct: 781 CALEEQTIRD 650
BLAST of Moc06g14420 vs. NCBI nr
Match:
XP_022152110.1 (uncharacterized protein LOC111019899 [Momordica charantia])
HSP 1 Score: 792.0 bits (2044), Expect = 5.8e-225
Identity = 408/446 (91.48%), Postives = 420/446 (94.17%), Query Frame = 0
Query: 375 MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIGRGRSGK 434
MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDG ELLRT KIG+GRSGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRT-------KIGQGRSGK 60
Query: 435 D-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKL 494
D E DPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE+SGMEKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 495 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIKDLIPDGYFKKFVGKPRTSS 554
LKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QI+DLI DGYFKKFVGKPRTSS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 555 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTC 614
AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRK+LARAARREVCIIREQ PTC
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 615 PITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 674
PITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 675 KRSPTPLVGFSGESVIPEGCINLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 734
K+SPTPLVGFSGESV+PEGCI+LPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 735 FRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALETL--RDGTLELEA 794
FR IPSTLHQVLKYSTPNGVGTVRGEQTASRECYA+ LKG+SVCALETL RDGTLE EA
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 795 DLPRKEFAAPTEELELVPLLSPEKQL 818
DLP +EFAAP EELELVPLLS EKQ+
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQV 439
BLAST of Moc06g14420 vs. NCBI nr
Match:
XP_022150613.1 (uncharacterized protein LOC111018708, partial [Momordica charantia])
HSP 1 Score: 778.1 bits (2008), Expect = 8.7e-221
Identity = 395/422 (93.60%), Postives = 403/422 (95.50%), Query Frame = 0
Query: 228 KYDSLNDGDLGESPFTSDVLEAPIPLKFKAPAVKPYDGTKDPKDYVEVFEGLMDFQAASD 287
K DSLNDGDLGES FTSDVLEAPIP KFKAP VKPYDG+KDPKDYVEVFEGLMDF AASD
Sbjct: 44 KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFHAASD 103
Query: 288 AIKCRAFQIALTGSARLWYRRLPAGSISTYSQLRREFLTQFSSRHYDKKTATHLAIIRHK 347
AIKCRAFQIALTGSARLWYRRLPA SISTYSQLRREFL QFSSR Y KKT THLA IR K
Sbjct: 104 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRQYGKKTETHLATIRQK 163
Query: 348 EGETLQEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAK 407
EG TL+EYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAK
Sbjct: 164 EGGTLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPTTFAEVLQKAK 223
Query: 408 KVIDGHELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSR 467
KVIDG ELLRTKTGRP+RKIGRGRSGKD ERADPKSKDKGSFSSGRAEYRRAE+GPT+SR
Sbjct: 224 KVIDGQELLRTKTGRPDRKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGPTKSR 283
Query: 468 PYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 527
PYERFTPTTIPISEILTNIE+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Sbjct: 284 PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 343
Query: 528 WELKRQIKDLIPDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 587
WELKRQI+DLI DGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Sbjct: 344 WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 403
Query: 588 QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRR 647
QSGHKRKELARAARREVCIIREQGPTCPITFDGAD EEVHLPHNDA VIAPLIDHVVVRR
Sbjct: 404 QSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPHNDARVIAPLIDHVVVRR 463
Query: 648 VL 649
VL
Sbjct: 464 VL 465
BLAST of Moc06g14420 vs. ExPASy TrEMBL
Match:
A0A6J1D9E1 (uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018823 PE=4 SV=1)
HSP 1 Score: 962.2 bits (2486), Expect = 1.6e-276
Identity = 519/669 (77.58%), Postives = 549/669 (82.06%), Query Frame = 0
Query: 187 SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKYDSLNDGDLGESPFT 246
SSNQQAESSHNPA G+ITREEFDQLRG+L+AQVEALKAKCEQK LNDGDLGESPFT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 247 SDVLEAPIPLKFKAPAVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 306
SDVLE AP VK YDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 307 LWYRRLPAGSISTYSQLRREFLTQFSSRHYDKKTATHLAIIRHKEGETLQEYVTRFQEEQ 366
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 367 LKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRP 426
LKVA SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDG ELLRTKTGRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 427 ERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILT 486
ER I RGRSGKDE+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPISEILT
Sbjct: 242 ERGIDRGRSGKDEKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILT 301
Query: 487 NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIKDLIPDGYFK 546
NIE+SGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQI+DLI D YFK
Sbjct: 302 NIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFK 361
Query: 547 KFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREV 606
KFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAARREV
Sbjct: 362 KFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREV 421
Query: 607 CIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTY 666
CIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TY
Sbjct: 422 CIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTY 481
Query: 667 LALGWTRSQLKRSPTPLVGFSGESVIPEGCINLPVTLGQDQTRVTQMAEFVVVDGRSAYN 726
LALGWTRSQLK+S TPLVGFS ESVIPEGCI+LPVTLG DQT+VTQMAEFVV+DGRSAYN
Sbjct: 482 LALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYN 541
Query: 727 AIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALETL- 786
AIFGRPIIHSFR IPSTLHQVLKYSTPNGVG VRGEQ ASRECYA+ALKGSSVCALETL
Sbjct: 542 AIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETLV 601
Query: 787 -RDGTLELEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE 846
RDGTLE +A+LPR+EFAAPTEELELVPLL + +E +L + +D+ +E
Sbjct: 602 SRDGTLEFKANLPRREFAAPTEELELVPLLRYKYNENIDHEQELDEKSSLNKIDDDIGVE 609
Query: 847 --PDLMEIG 849
P+ + +G
Sbjct: 662 GMPEPLNVG 609
BLAST of Moc06g14420 vs. ExPASy TrEMBL
Match:
A0A6J1C7X5 (uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008813 PE=4 SV=1)
HSP 1 Score: 958.7 bits (2477), Expect = 1.7e-275
Identity = 491/528 (92.99%), Postives = 503/528 (95.27%), Query Frame = 0
Query: 191 QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKYDSLNDGDLGESPFTSDVL 250
+AESS N PAG+ITREEFDQLRG+LDAQVEALKAKCEQK LNDGDLGESPFTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 251 EAPIPLKFKAPAVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR 310
EAPIP KFKAP VKPYDG+KDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 311 RLPAGSISTYSQLRREFLTQFSSRHYDKKTATHLAIIRHKEGETLQEYVTRFQEEQLKVA 370
RLPA SISTYSQLRREFL FSSRHYDKKTATHLA IR KEGETL+EYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 371 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKI 430
HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDG ELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 431 GRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 490
GRGRSGKD E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 491 DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIKDLIPDGYFKKFV 550
+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQI++LI DGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 551 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCII 610
GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELARAARREVCII
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 611 REQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLAL 670
REQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 671 GWTRSQLKRSPTPLVGFSGESVIPEGCINLPVTLGQDQTRVTQMAEFV 715
GWTRSQLK+SPTPLVGFSGESVIPEG I+LPVTLGQDQT+VTQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc06g14420 vs. ExPASy TrEMBL
Match:
A0A6J1DHB3 (uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020479 PE=4 SV=1)
HSP 1 Score: 927.2 bits (2395), Expect = 5.6e-266
Identity = 511/790 (64.68%), Postives = 566/790 (71.65%), Query Frame = 0
Query: 1 MAQPANSTNTTDRRTLAASNAHQREVGAAAVEGQGHDGLPTEPLRRSARNTAPALPPAQP 60
M QPANSTNT DRR LAA++ HQREVGA VEGQGH+ L TEPL RSAR T P LPPA P
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRSMEAMYNEMVLAACAG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 SRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 PSRSHRSSNQQAESSHNP--AGIITREEFDQLRGELDAQVEALKAKCEQKYDSLNDGDLG 240
AESS+NP G+ITREEFDQL+ + DAQVEALKA+CE+K S +DGDLG
Sbjct: 181 -----------AESSYNPITPGVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLG 240
Query: 241 ESPFTSDVLEAPIPLKFKAPAVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL 300
E F+SD+LEA IP KFK P +KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Sbjct: 241 ELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIAL 300
Query: 301 TGSARLWYRRLPAGSISTYSQLRREFLTQFSSRHYDKKTATHLAIIRHKEGETLQEYVTR 360
TGSARLWYRRLPA ISTYSQLR+EF++QFSSRHYD+KT THLA IR KEGETL+EYVTR
Sbjct: 301 TGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTR 360
Query: 361 FQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGHELLRT 420
F EEQLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDG ELLRT
Sbjct: 361 FPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRT 420
Query: 421 KTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTI 480
KTGRPE+ I +GR+GKD+ +AD KS+DKG S SS R +YRR+ + +SRPYE +TPTTI
Sbjct: 421 KTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTI 480
Query: 481 PISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIKDL 540
PI EILTNIE++GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQI+DL
Sbjct: 481 PIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDL 540
Query: 541 IPDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA 600
I DGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN K+KELA
Sbjct: 541 IQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKELA 600
Query: 601 RAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASAN 660
R ARREVCIIREQ PT I F+ ADLE VHLPHNDALVIAPLID V+VRR+LVDGGASAN
Sbjct: 601 REARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASAN 650
Query: 661 ILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCINLPVTLGQDQTRVTQMAEFVVV 720
ILSL TYLALGWTRSQLK+SPTPLVGFSGES+ EGCI+LPV++ QD T+VTQMAEFVV+
Sbjct: 661 ILSLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVI 650
Query: 721 DGRSAYNAIFGRPIIHSFRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSV 780
DGRSAYNAIFGRPIIHSFR +PSTLHQVLKYST NGVGTVRGE SRECYA+ K SSV
Sbjct: 721 DGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSSV 650
Query: 781 CALE--TLRD 785
CALE T+RD
Sbjct: 781 CALEEQTIRD 650
BLAST of Moc06g14420 vs. ExPASy TrEMBL
Match:
A0A6J1DD03 (uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019899 PE=4 SV=1)
HSP 1 Score: 792.0 bits (2044), Expect = 2.8e-225
Identity = 408/446 (91.48%), Postives = 420/446 (94.17%), Query Frame = 0
Query: 375 MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGHELLRTKTGRPERKIGRGRSGK 434
MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDG ELLRT KIG+GRSGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRT-------KIGQGRSGK 60
Query: 435 D-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKL 494
D E DPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE+SGMEKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 495 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIKDLIPDGYFKKFVGKPRTSS 554
LKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QI+DLI DGYFKKFVGKPRTSS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 555 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCIIREQGPTC 614
AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRK+LARAARREVCIIREQ PTC
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 615 PITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 674
PITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 675 KRSPTPLVGFSGESVIPEGCINLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 734
K+SPTPLVGFSGESV+PEGCI+LPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 735 FRVIPSTLHQVLKYSTPNGVGTVRGEQTASRECYAAALKGSSVCALETL--RDGTLELEA 794
FR IPSTLHQVLKYSTPNGVGTVRGEQTASRECYA+ LKG+SVCALETL RDGTLE EA
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 795 DLPRKEFAAPTEELELVPLLSPEKQL 818
DLP +EFAAP EELELVPLLS EKQ+
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQV 439
BLAST of Moc06g14420 vs. ExPASy TrEMBL
Match:
A0A6J1D9W7 (uncharacterized protein LOC111018708 OS=Momordica charantia OX=3673 GN=LOC111018708 PE=4 SV=1)
HSP 1 Score: 778.1 bits (2008), Expect = 4.2e-221
Identity = 395/422 (93.60%), Postives = 403/422 (95.50%), Query Frame = 0
Query: 228 KYDSLNDGDLGESPFTSDVLEAPIPLKFKAPAVKPYDGTKDPKDYVEVFEGLMDFQAASD 287
K DSLNDGDLGES FTSDVLEAPIP KFKAP VKPYDG+KDPKDYVEVFEGLMDF AASD
Sbjct: 44 KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFHAASD 103
Query: 288 AIKCRAFQIALTGSARLWYRRLPAGSISTYSQLRREFLTQFSSRHYDKKTATHLAIIRHK 347
AIKCRAFQIALTGSARLWYRRLPA SISTYSQLRREFL QFSSR Y KKT THLA IR K
Sbjct: 104 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRQYGKKTETHLATIRQK 163
Query: 348 EGETLQEYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAK 407
EG TL+EYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAK
Sbjct: 164 EGGTLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPTTFAEVLQKAK 223
Query: 408 KVIDGHELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSR 467
KVIDG ELLRTKTGRP+RKIGRGRSGKD ERADPKSKDKGSFSSGRAEYRRAE+GPT+SR
Sbjct: 224 KVIDGQELLRTKTGRPDRKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGPTKSR 283
Query: 468 PYERFTPTTIPISEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 527
PYERFTPTTIPISEILTNIE+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Sbjct: 284 PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 343
Query: 528 WELKRQIKDLIPDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 587
WELKRQI+DLI DGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Sbjct: 344 WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 403
Query: 588 QSGHKRKELARAARREVCIIREQGPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRR 647
QSGHKRKELARAARREVCIIREQGPTCPITFDGAD EEVHLPHNDA VIAPLIDHVVVRR
Sbjct: 404 QSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPHNDARVIAPLIDHVVVRR 463
Query: 648 VL 649
VL
Sbjct: 464 VL 465
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022150760.1 | 3.3e-276 | 77.58 | uncharacterized protein LOC111018823 [Momordica charantia] | [more] |
XP_022137317.1 | 3.6e-275 | 92.99 | uncharacterized protein LOC111008813 [Momordica charantia] | [more] |
XP_022152854.1 | 1.2e-265 | 64.68 | uncharacterized protein LOC111020479 [Momordica charantia] | [more] |
XP_022152110.1 | 5.8e-225 | 91.48 | uncharacterized protein LOC111019899 [Momordica charantia] | [more] |
XP_022150613.1 | 8.7e-221 | 93.60 | uncharacterized protein LOC111018708, partial [Momordica charantia] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1D9E1 | 1.6e-276 | 77.58 | uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018... | [more] |
A0A6J1C7X5 | 1.7e-275 | 92.99 | uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008... | [more] |
A0A6J1DHB3 | 5.6e-266 | 64.68 | uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020... | [more] |
A0A6J1DD03 | 2.8e-225 | 91.48 | uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019... | [more] |
A0A6J1D9W7 | 4.2e-221 | 93.60 | uncharacterized protein LOC111018708 OS=Momordica charantia OX=3673 GN=LOC111018... | [more] |
Match Name | E-value | Identity | Description | |