Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTTCAACCCGCAAACTCGACCAATACGGCGGATCGAAGGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAAAGGGGCAAGGTCACGACGGCCTAGCAACAAAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGTCTGCGCACCCGAGGACATCCAAGGCCACCCGTGGTCGAGGTGGGACCTCTAAGAAGGGTGCCCGGGGTCCAGCCCCAGCTCTAACAAGTGAGGACTTTGATGCACTCCAGAGAGAGATGGAGGCAATGTGCATACGAATGCGGTCCATGGAGGAAATGTATAACGAAATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAATGACGCGCGTGGACGTACGCGAGCAAAGGTGTTCCCACCTAGACCCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAAAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCACACAAGAGCTCCAACCAGCAGGCTAAATCCTCTCACAACCCAGCAACTCCCGCAGGGGTGATCACAAGGGAGAAGTTCGACCAACTGAGGGGCAAGATCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAGAGAAGGTTCACTAAACGATGGCGACTTGAGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCTGAAGTTCAAAGCTCCTACCGTGAAGCCTTATCATGGGTCGAGGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAAACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGACACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGTGTGAGACGCTGCGGAAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTAGGGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAATCGGCCGACCGGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGTAGAAAGAGAAAATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAGCGGACCTACCAGGAGCCGACCTTACGAACGCTTTACCCCGACCACGATTCCAATTTCCGAGATCCTAACAAACATCGAGGAATCTGGAATGGAAAAACTACTCAATCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCAGACTGCTGGGAGTTGAAGCATCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGAGAAAGCCCAGGACCAGCTCGGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGTTGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAAATGTGCATCATCAGGGAGCAGGGGCCGACCTACCCAATCACCTTCGACGGTGCAGACTTGGGGGAAGTCCACCTGCACCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGTCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGCCGCTGGTTGGGTTCTCTGGAGAATCAGTCATTCTAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGATCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGCAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATCCCACCCACAATGGCGTGGGCACGATCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGTTCCTCGGTCTGCGCCCTCGAAACTCTCACCGGTAGGGACGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGAAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGGTAAGCATAGGAACCAAGCTAGGGGCCACCGACAGAGAGGAGCTAATCCACTTCCTCAGATCCAACTCGGACGTCTTTGCGTGGTCCCATGAGGACATGCCTGGCATTGACCCGCGAATTATGACGCATCGCCTCAGCATAGATCCATCATTCCGACTTGTGAAACAAAAGAGAAGACCTATAAACAAGGAGAGGAGTGATGTAATTGTTGAGGAAGTTAGCAAACTTTTGAAAGCTGAATACATAAGAGAAATTTCGTATCCCGAGTGGCTCTCCAATGTTGTATTAGTTAAAAAATCTAACGGAAAGTGGAGAATGTGCGTAGATTTTACGAACTTAAATAAGGCATGCCCGAAAGATTGCTTCCTACTGCCGAGGATTGATCAGCTCGTGGACGCCACAGCCGGGCACGAACTGCTCACCTTCATGGACGCCTACTCTGGTTACAACCAAATCAAGATGCATGTCCCAGATGAAGGTCATACCGCTTTCATAACAGACCAAGGTCTGTACTGCTACAAGGTCATGCCCTTCGAGTTAAAGAACGCAGGAGCGACCTACCAGAGAATGGTGAACAAAATGTTCGCCAAGCAGATCGGCCGGAATATGGAAGTGTATGTGGACGACATGCTTGTCAAGAGCAAGCAGTCTAAGTCGCATCTCTCCGATCTGACCGAAGCCTTCGAGGTTCTGAGGGCATATCAAATGAAGCTCAACCCAGCTAAATGTGCCTTCGGAGTCTCTTCGGGAAAATTCCTTGGCTTCATGGTGAACAACCGGGGAATCGAGGCCAACCCCGAAAAGATTAAAGCCGTGATCGAGATGGAGGCACCTAAGACGCTGAAGCAGCTTCAGTGCCTCAATGGCAGGATTACGGCCCTGAACCGGTTTGTTTCAAGATCGACAGATAAGTGCCTTCCTTTCTTCAAAGTCCTACGAAAGAAAGGGCCATTTGAATGGACAGCGGAGTGCGAACAAGCATTTCAGCAATTGAAAAGCTACCTCTGTTCGGCACCTTTGCTCGCCAAGCCCATGCCGGGGGACAAGCTCCAATTGTACTTAGCAGTGTCTGATAGTGCCGTCAGCTCGGCCCTAATCAGGCAAGAGGAAGCGCGGCAAAACCCGGTCTACTACACAAGCAAGGCTATGACCGAAGCTGAGACTAGATACCCTCAGATGGAGAAGTTGGCTCTCGCTTTAGTCACCTCGCCCCGACGGCTTAGACCATACTTTCAAGCCCATACGGTGGTGGTGCTCACTAACTTGCCCCGAAAAAGCATCTTCCATAAGCCAGAAGCTTCTGGACGCCTGATGAAGTGGGCAATGGAGCTAAGTGAGTACGACATCCAGTTCGAACCTAGAACTGCGTTGAAAGGACAAGCAGCAGCAGATTTCATAGCCGAGCTCACACCACCTTCCGAGCTGAGCGAGTCCGACCTACCGTGGGCAATCTATGTTGACGGATCCTCTAATGAGAAGGGGTGCGGAGCCGGGGTCCTCTTGCTCGAACCAGGAGGCGAGCGATTTGAGTATGCCTTGCGGTTCGGCTTCCGGACTTCTAACAACGAGGCTGAGTATGAAGCATTTATTGTCGGCCTGTGAATCGCTAGAGCATTGGGGGCCTCTTGTGTTAAGGTCTTCAGTGACTCACAGCTGGTTGTAAGCCAGATCAAGGACGAGTACCAAGTCAAAGACACCTGAATGGAGAAGTATTTGGACAAGGTCAGATCGTACCTCGCCCAGTTTCGAACTTACGAAGTAAGCCGGGTTCCACGAGTAGAAAATTCTAATGCGGACGCCTTGGCCAAGTTAGCATCGGCATACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAAAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTTGTCCGAGGTGGAGCATTGTACCGACGCGGCTTTTCCCTGCCCCTATTGAGATGCCTAACCCCTGAAGAGGGCCTGTACGTCCTCAGAGAGATCCACGAAGGAGTGTGCAGCAATCACTCAGGCGCCCGGTCGCTGTCAGCCAAAGTGATCCGACAAGGATACTATTGGCCGACCCTCAGCCAGGACGCCAAGAAGTTCGTTTGAACTTGCGACAATTGCCAACGCTACGGAACCATAATCCACCAACCTCCCGAGCTGCTCACCCCCATCTCGGCCCCGTGGCCATTCACGCAGTGGGGGGTAGATATCATTGGTCCTTTCCCTTTGGGCAAGGGCCAGACCAAGTTCGCTGTGGTTGCTGTGGATTACTTCACAAAGTGGGCCGAGGCCGAGGCGCTCTCCCACATAACGGAATCCAGAGTCACGTCCTTCGTATGGACAAATATCATATGTCGTTTTGGTATACCGCAGGCCATTGTGACAGACAATGGGAAGCAGTTTGACAACGCCAAGTTCAAAGATTTTTGCAGCAAGCTCGGCATAAGTCACCTTAGCTCGTCCCCCGCACATCCGCAAGCAAATGGGCAGGTGGAGGCAGTCGACAAGATCATCAAGCGAGGCATCAAACTTAGACTGGACTCCAAGAAAGGCAGGTGGGCCGAGGAGCTACCAGAGATTCTATGGTCGTACCGGACCACCCAAAGAGAATCGACGGGTGAGACCCCGTTCTCCCTGGCCTTCGGCTCCGAAGCTGTAGTCCCGGTTGAGATCGGCATGCCATCTGACAGAGTAGAGCATTACGAGCCTACGGCAAATGAGGAAGCGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAACGCCCGCGTTCGACCTCGGACCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACCCATGTGGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGACGTCAAGGGCATAGTTCGACCTGGGACGTACGTATTGGCCGATCTGCAAGGAGACGTCCTCGCGCACCCGTGGAATGCGGAGCACCTGAAGCGTTATTATCCTTGA
mRNA sequence
ATGGTTCAACCCGCAAACTCGACCAATACGGCGGATCGAAGGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAAAGGGGCAAGGTCACGACGGCCTAGCAACAAAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGTCTGCGCACCCGAGGACATCCAAGGCCACCCGTGGTCGAGGTGGGACCTCTAAGAAGGGTGCCCGGGGTCCAGCCCCAGCTCTAACAAGTGAGGACTTTGATGCACTCCAGAGAGAGATGGAGGCAATGTGCATACGAATGCGGTCCATGGAGGAAATGTATAACGAAATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAATGACGCGCGTGGACGTACGCGAGCAAAGGTGTTCCCACCTAGACCCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAAAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCACACAAGAGCTCCAACCAGCAGGCTAAATCCTCTCACAACCCAGCAACTCCCGCAGGGGTGATCACAAGGGAGAAGTTCGACCAACTGAGGGGCAAGATCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAGAGAAGGTTCACTAAACGATGGCGACTTGAGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCTGAAGTTCAAAGCTCCTACCGTGAAGCCTTATCATGGGTCGAGGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAAACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGACACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGTGTGAGACGCTGCGGAAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTAGGGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAATCGGCCGACCGGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGTAGAAAGAGAAAATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAGCGGACCTACCAGGAGCCGACCTTACGAACGCTTTACCCCGACCACGATTCCAATTTCCGAGATCCTAACAAACATCGAGGAATCTGGAATGGAAAAACTACTCAATCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCAGACTGCTGGGAGTTGAAGCATCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGAGAAAGCCCAGGACCAGCTCGGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGTTGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAAATGTGCATCATCAGGGAGCAGGGGCCGACCTACCCAATCACCTTCGACGGTGCAGACTTGGGGGAAGTCCACCTGCACCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGTCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGCCGCTGGTTGGGTTCTCTGGAGAATCAGTCATTCTAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGATCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGCAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATCCCACCCACAATGGCGTGGGCACGATCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGTTCCTCGGTCTGCGCCCTCGAAACTCTCACCGGTAGGGACGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGAAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAAAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTTGTCCGAGGTGGAGCATTGTACCGACGCGGCTTTTCCCTGCCCCTATTGAGATGCCTAACCCCTGAAGAGGGCCTGGTCCAAACCCATGTGGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGACGTCAAGGGCATAGTTCGACCTGGGACGTACGTATTGGCCGATCTGCAAGGAGACGTCCTCGCGCACCCGTGGAATGCGGAGCACCTGAAGCGTTATTATCCTTGA
Coding sequence (CDS)
ATGGTTCAACCCGCAAACTCGACCAATACGGCGGATCGAAGGACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGCGGTAAAGGGGCAAGGTCACGACGGCCTAGCAACAAAACCCCTCCGCAGGTCGGCACGGATCACCGCGCCTGCCCTACCGTCTGCGCACCCGAGGACATCCAAGGCCACCCGTGGTCGAGGTGGGACCTCTAAGAAGGGTGCCCGGGGTCCAGCCCCAGCTCTAACAAGTGAGGACTTTGATGCACTCCAGAGAGAGATGGAGGCAATGTGCATACGAATGCGGTCCATGGAGGAAATGTATAACGAAATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAATGACGCGCGTGGACGTACGCGAGCAAAGGTGTTCCCACCTAGACCCAGCCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAAAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCTCACACAAGAGCTCCAACCAGCAGGCTAAATCCTCTCACAACCCAGCAACTCCCGCAGGGGTGATCACAAGGGAGAAGTTCGACCAACTGAGGGGCAAGATCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAGAGAAGGTTCACTAAACGATGGCGACTTGAGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCTGAAGTTCAAAGCTCCTACCGTGAAGCCTTATCATGGGTCGAGGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAAACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGACACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGTGTGAGACGCTGCGGAAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGGTGAAACTTAGGGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAATCGGCCGACCGGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGTAGAAAGAGAAAATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAGCGGACCTACCAGGAGCCGACCTTACGAACGCTTTACCCCGACCACGATTCCAATTTCCGAGATCCTAACAAACATCGAGGAATCTGGAATGGAAAAACTACTCAATCGTCCTGAGAAGCTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCAGACTGCTGGGAGTTGAAGCATCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAATTTGTGAGAAAGCCCAGGACCAGCTCGGCAGAGAAAAAGGAAGAGCGAAAGCGTTCAAGGACGTTGCCCCGGCGCACTGACCGACCTGCGGTCATCAATACCATTTTCGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCTCGTGCAGCCAGGCGCGAAATGTGCATCATCAGGGAGCAGGGGCCGACCTACCCAATCACCTTCGACGGTGCAGACTTGGGGGAAGTCCACCTGCACCACAATGATGCACTTGTGATCGCTCCCTTGATTGATCATGTGGTGGTCAGGAGGGTGCTGGTAGACGGAGTCGCATCTGCTAACATCCTGTCCTTACCGACCTACCTCGCCCTGGGATGGACGAGGTCGCAATTGAAGAAAAGCCCGCCGCTGGTTGGGTTCTCTGGAGAATCAGTCATTCTAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGATCAAACTCAGGTCACCCAAATGGCCGAGTTCGTGGTAATTGACGGCAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATCCCACCCACAATGGCGTGGGCACGATCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAAGGTTCCTCGGTCTGCGCCCTCGAAACTCTCACCGGTAGGGACGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGAAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAAAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATTGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGGCGAGCAGCTCGGTTCGTTGTCCGAGGTGGAGCATTGTACCGACGCGGCTTTTCCCTGCCCCTATTGAGATGCCTAACCCCTGAAGAGGGCCTGGTCCAAACCCATGTGGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGACGTCAAGGGCATAGTTCGACCTGGGACGTACGTATTGGCCGATCTGCAAGGAGACGTCCTCGCGCACCCGTGGAATGCGGAGCACCTGAAGCGTTATTATCCTTGA
Protein sequence
MVQPANSTNTADRRTLAASDAHQREVGAAAVKGQGHDGLATKPLRRSARITAPALPSAHPRTSKATRGRGGTSKKGARGPAPALTSEDFDALQREMEAMCIRMRSMEEMYNEMILAAGAGSRSENRMTRVDVREQRCSHLDPAEEERPEDNESEGYTRQRGDLREHLNRKKGSSLRKGQSPSRSHKSSNQQAKSSHNPATPAGVITREKFDQLRGKIDAQVEALKAKCEQREGSLNDGDLRESPFTSDVLEAPIPLKFKAPTVKPYHGSRDPKDYVEVFEGLMDFQAASNAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKECETLRKYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLREEAPATFAEVLQKAKKVIDGQELLRTKIGRPERKIGRGRSGKDVERENPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLNRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKHQIEDLIQDGYFKKFVRKPRTSSAEKKEERKRSRTLPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREMCIIREQGPTYPITFDGADLGEVHLHHNDALVIAPLIDHVVVRRVLVDGVASANILSLPTYLALGWTRSQLKKSPPLVGFSGESVILEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTHNGVGTIRGEQTASRECYASALKGSSVCALETLTGRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQTDLARSVPVEILDNPSISKPDLMEIGAPESSWMDPIADFIRGNSPQDPKERRKLARRAARFVVRGGALYRRGFSLPLLRCLTPEEGLVQTHVGALDPAWEGPFDVKGIVRPGTYVLADLQGDVLAHPWNAEHLKRYYP
Homology
BLAST of Moc02g13720 vs. NCBI nr
Match:
XP_022137317.1 (uncharacterized protein LOC111008813 [Momordica charantia])
HSP 1 Score: 933.7 bits (2412), Expect = 1.3e-267
Identity = 484/528 (91.67%), Postives = 499/528 (94.51%), Query Frame = 0
Query: 191 QAKSSHNPATPAGVITREKFDQLRGKIDAQVEALKAKCEQREGSLNDGDLRESPFTSDVL 250
+A+SS NPATPAGVITRE+FDQLRG++DAQVEALKAKCEQ+EG LNDGDL ESPFTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 251 EAPIPLKFKAPTVKPYHGSRDPKDYVEVFEGLMDFQAASNAIKCRAFQIALTGSARLWYR 310
EAPIP KFKAPTVKPY GS+DPKDYVEVFE LMDFQAAS+AIKCRAF+IALTGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 311 RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKECETLRKYVTRFQEEQLKVA 370
RLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKE ETLR+YVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 371 HCSDDSAMCYFLTGLADEALTVKLREEAPATFAEVLQKAKKVIDGQELLRTKIGRPERKI 430
HCSDDSAMCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRTK GRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 431 GRGRSGKDVERENPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIE 490
GRGRSGKD+E +PKSKDKGSFSSGRAEYRRAE+GPTRSRPYERFTPTTIPISEILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 491 ESGMEKLLNRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKHQIEDLIQDGYFKKFV 550
ESGMEKLL RPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 551 RKPRTSSAEKKEERKRSRTLPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREMCII 610
KPRTSSAEKKEERKRSRT PRRTDRPAVINTIFGGPSGGQSG KRKELARAARRE+CII
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 611 REQGPTYPITFDGADLGEVHLHHNDALVIAPLIDHVVVRRVLVDGVASANILSLPTYLAL 670
REQ PT PITFDGADL EVHL HNDALVIAPLIDHVVV RVLVDG SANILSLPTYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 671 GWTRSQLKKSP-PLVGFSGESVILEGCIDLPVTLGQDQTQVTQMAEFV 718
GWTRSQLKKSP PLVGFSGESVI EG IDLPVTLGQDQTQVTQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc02g13720 vs. NCBI nr
Match:
XP_022150760.1 (uncharacterized protein LOC111018823 [Momordica charantia])
HSP 1 Score: 930.2 bits (2403), Expect = 1.4e-266
Identity = 505/631 (80.03%), Postives = 525/631 (83.20%), Query Frame = 0
Query: 187 SSNQQAKSSHNPATPAGVITREKFDQLRGKIDAQVEALKAKCEQREGSLNDGDLRESPFT 246
SSNQQA+SSHNPATP GVITRE+FDQLRGK++AQVEALKAKCEQ+EG LNDGDL ESPFT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 247 SDVLEAPIPLKFKAPTVKPYHGSRDPKDYVEVFEGLMDFQAASNAIKCRAFQIALTGSAR 306
SDVLE APTVK Y GS+DPKDYVEVFEGLMDFQAAS+AIKCRAFQIALTGSAR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 307 LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKECETLRKYVTRFQEEQ 366
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 367 LKVAHCSDDSAMCYFLTGLADEALTVKLREEAPATFAEVLQKAKKVIDGQELLRTKIGRP 426
LKVA SDDSAMCYFLTGLADEALTVKL +EAPATFAEVLQKAKKVIDGQELLRTK GRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 427 ERKIGRGRSGKDVERENPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEIL 486
ER I RGRSGKD E+ + KSKDKGSFSSGRAE+RRA +GPTRSRPYERFTPTTIPISEIL
Sbjct: 242 ERGIDRGRSGKD-EKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEIL 301
Query: 487 TNIEESGMEKLLNRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKHQIEDLIQDGYF 546
TNIEESGMEKLL RPEKLRGAPERR+KDKYCRFHREH HNTSD WELK QIEDLIQD YF
Sbjct: 302 TNIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYF 361
Query: 547 KKFVRKPRTSSAEKKEERKRSRTLPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARRE 606
KKFV KPRTSSAEKKEERK SRT RR DRPAVINTIFGGPSGGQSGHKRKELARAARRE
Sbjct: 362 KKFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARRE 421
Query: 607 MCIIREQGPTYPITFDGADLGEVHLHHNDALVIAPLIDHVVVRRVLVDGVASANILSLPT 666
+CIIREQ PT PITFD ADL EVHL HNDALVIAPLIDHVVVRRVLVD SANI+SL T
Sbjct: 422 VCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLT 481
Query: 667 YLALGWTRSQLKKS-PPLVGFSGESVILEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAY 726
YLALGWTRSQLKKS PLVGFS ESVI EGCIDLPVTLG DQTQVTQMAEFVVIDGRSAY
Sbjct: 482 YLALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAY 541
Query: 727 NAIFGRPIIHSFRAIPSTLHQVLKYPTHNGVGTIRGEQTASRECYASALKGSSVCALETL 786
NAIFGRPIIHSFRAIPSTLHQVLKY T NGVG +RGEQ ASRECYASALKGSSVCALETL
Sbjct: 542 NAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETL 570
Query: 787 TGRDGTLEFEADLPRKEFAAPTEELELVPLL 817
RDGTLEF+A+LPR+EFAAPTEELELVPLL
Sbjct: 602 VSRDGTLEFKANLPRREFAAPTEELELVPLL 570
BLAST of Moc02g13720 vs. NCBI nr
Match:
XP_022152854.1 (uncharacterized protein LOC111020479 [Momordica charantia])
HSP 1 Score: 920.2 bits (2377), Expect = 1.4e-263
Identity = 513/791 (64.85%), Postives = 564/791 (71.30%), Query Frame = 0
Query: 1 MVQPANSTNTADRRTLAASDAHQREVGAAAVKGQGHDGLATKPLRRSARITAPALPSAHP 60
MVQPANSTNTADRR LAA+ HQREVGA V+GQGH+ L T+PL RSARIT P LP AHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARGPAPALTSEDFDALQREMEAMCIRMRSMEEMYNEMILAAGAG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 SRSENRMTRVDVREQRCSHLDPAEEERPEDNESEGYTRQRGDLREHLNRKKGSSLRKGQS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 PSRSHKSSNQQAKSSHNPATPAGVITREKFDQLRGKIDAQVEALKAKCEQREGSLNDGDL 240
A+SS+NP TP GVITRE+FDQL+ K DAQVEALKA+CE++E S +DGDL
Sbjct: 181 -----------AESSYNPITP-GVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDL 240
Query: 241 RESPFTSDVLEAPIPLKFKAPTVKPYHGSRDPKDYVEVFEGLMDFQAASNAIKCRAFQIA 300
E F+SD+LEA IP KFK PT+KPY GS+DPKDYVEVFE LMDFQAA++AIKC AFQIA
Sbjct: 241 GELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIA 300
Query: 301 LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKECETLRKYVT 360
LTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKE ETLR+YVT
Sbjct: 301 LTGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVT 360
Query: 361 RFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLREEAPATFAEVLQKAKKVIDGQELLR 420
RF EEQLKVAHCSDDSAMCYFLTGLADE LTVKLREEAPATFAEVLQK KKVIDGQELLR
Sbjct: 361 RFPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLR 420
Query: 421 TKIGRPERKIGRGRSGKDVERENPKSKDKG-SFSSGRAEYRRAESGPTRSRPYERFTPTT 480
TK GRPE+ I +GR+GKD + + KS+DKG S SS R +YRR+ S +SRPYE +TPTT
Sbjct: 421 TKTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTT 480
Query: 481 IPISEILTNIEESGMEKLLNRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKHQIED 540
IPI EILTNIEE+GMEKLL RPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELK QIED
Sbjct: 481 IPIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIED 540
Query: 541 LIQDGYFKKFVRKPRTSSAEKKEERKRSRTLPRRTDRPAVINTIFGGPSGGQSGHKRKEL 600
LIQDGYFKKFV KPR++S EKKEERKR RT PRR DRPAVIN K+KEL
Sbjct: 541 LIQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKEL 600
Query: 601 ARAARREMCIIREQGPTYPITFDGADLGEVHLHHNDALVIAPLIDHVVVRRVLVDGVASA 660
AR ARRE+CIIREQ PT I F+ ADL VHL HNDALVIAPLID V+VRR+LVDG ASA
Sbjct: 601 AREARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASA 650
Query: 661 NILSLPTYLALGWTRSQLKKSP-PLVGFSGESVILEGCIDLPVTLGQDQTQVTQMAEFVV 720
NILSL TYLALGWTRSQLKKSP PLVGFSGES+ LEGCIDLPV++ QD TQVTQMAEFVV
Sbjct: 661 NILSLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVV 650
Query: 721 IDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTHNGVGTIRGEQTASRECYASALKGSS 780
IDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY T NGVGT+RGE SRECYAS K SS
Sbjct: 721 IDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSS 650
Query: 781 VCALETLTGRD 790
VCALE T RD
Sbjct: 781 VCALEEQTIRD 650
BLAST of Moc02g13720 vs. NCBI nr
Match:
XP_022152110.1 (uncharacterized protein LOC111019899 [Momordica charantia])
HSP 1 Score: 774.6 bits (1999), Expect = 9.7e-220
Identity = 403/448 (89.96%), Postives = 416/448 (92.86%), Query Frame = 0
Query: 378 MCYFLTGLADEALTVKLREEAPATFAEVLQKAKKVIDGQELLRTKIGRPERKIGRGRSGK 437
MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT KIG+GRSGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRT-------KIGQGRSGK 60
Query: 438 DVERENPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 497
D+E +PKSKDKGSFS+GRAEYRRAE+GPTRSRPYERFTPTTIPISEILTNIEESGMEKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 498 LNRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKHQIEDLIQDGYFKKFVRKPRTSS 557
L RPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFV KPRTSS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 558 AEKKEERKRSRTLPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREMCIIREQGPTY 617
AEKKEERKRSRT PRRTDRPAVINTIFGGPSGGQSGHKRK+LARAARRE+CIIREQ PT
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 618 PITFDGADLGEVHLHHNDALVIAPLIDHVVVRRVLVDGVASANILSLPTYLALGWTRSQL 677
PITFD ADL EVHL HNDALVIAPLIDHVVVRRVLVDG ASANILSLPTYLALGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 678 KKSP-PLVGFSGESVILEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHS 737
KKSP PLVGFSGESV+ EGCIDLPVTLGQDQT+VTQMAEFVV+DGRSAYNAIFGRPIIHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 738 FRAIPSTLHQVLKYPTHNGVGTIRGEQTASRECYASALKGSSVCALETLTGRDGTLEFEA 797
FRAIPSTLHQVLKY T NGVGT+RGEQTASRECYAS LKG+SVCALETLT RDGTLEFEA
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 798 DLPRKEFAAPTEELELVPLLSPEKQTDL 825
DLP +EFAAP EELELVPLLS EKQ L
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQVQL 441
BLAST of Moc02g13720 vs. NCBI nr
Match:
XP_022158414.1 (uncharacterized protein LOC111024904 [Momordica charantia])
HSP 1 Score: 756.1 bits (1951), Expect = 3.6e-214
Identity = 405/562 (72.06%), Postives = 448/562 (79.72%), Query Frame = 0
Query: 283 MDFQAASNAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAT 342
MDFQAA++AIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTAT
Sbjct: 1 MDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSWHYDRKTAT 60
Query: 343 HLATIRQKECETLRKYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLREEAPATF 402
HLATIRQKE ETLR+YVTRFQEEQLKVAHCSDDSAMCYFLT LADE LTVKL EEAP TF
Sbjct: 61 HLATIRQKERETLREYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETLTVKLGEEAPTTF 120
Query: 403 AEVLQKAKKVIDGQELLRTKIGRPERKIGRGRSGKDVERENPKSKDKGSFSS-GRAEYRR 462
EVLQKAKKVIDGQELLRTK GRPE++I + + ++ + + KS+DKGS SS R EYRR
Sbjct: 121 VEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRR 180
Query: 463 AESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLNRPEKLRGAPERRSKDKYCRFHR 522
ESGP+RSRPYER+T +TIPISEILTNIEESGMEKLL RPEKLRG E+R+K+KYCRFHR
Sbjct: 181 LESGPSRSRPYERYTSSTIPISEILTNIEESGMEKLLKRPEKLRGDLEKRNKEKYCRFHR 240
Query: 523 EHGHNTSDCWELKHQIEDLIQDGYFKKFVRKPRTSSAEKKEERKRSRTLPRRTDRPAVIN 582
+HGHNT+ CWELK QIEDLIQDGYFKKFV KPR++S EKKEERKRSRT PRR DRPAVIN
Sbjct: 241 DHGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVIN 300
Query: 583 TIFGGPSGGQSGHKRKELARAARREMCIIREQGPTYPITFDGADLGEVHLHHNDALVIAP 642
TIFGGP+GGQSG+KRKELAR ARRE+CIIRE PT ITF ADL VHL HNDALVIA
Sbjct: 301 TIFGGPNGGQSGNKRKELAREARREVCIIREHKPTCSITFGDADLEGVHLPHNDALVIAS 360
Query: 643 LIDHVVVRRVLVDGVASANILSLPTYLALGWTRSQLKKSPPLVGFSGESVILEGCIDLPV 702
LIDH +VRRVL+DG GCIDLPV
Sbjct: 361 LIDHDLVRRVLIDG---------------------------------------GCIDLPV 420
Query: 703 TLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTHNGVGTIRG 762
T+GQD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY T N VG +RG
Sbjct: 421 TIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNEVGMVRG 480
Query: 763 EQTASRECYASALKGSSVCALETLTGRDGTLEFEADLP---RKEFAAPTEELELVPLLSP 822
EQ SRECYASALKGS+VCALE T R E EADLP +++F PTEELELVPLLSP
Sbjct: 481 EQKTSRECYASALKGSAVCALEEQTNRGKLQESEADLPKEGKRQFPPPTEELELVPLLSP 523
Query: 823 EKQTDLARSVPVEILDNPSISK 841
E+Q + + V ++ P K
Sbjct: 541 ERQANPEKIKTVLEMEAPKTLK 523
BLAST of Moc02g13720 vs. ExPASy TrEMBL
Match:
A0A6J1C7X5 (uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008813 PE=4 SV=1)
HSP 1 Score: 933.7 bits (2412), Expect = 6.1e-268
Identity = 484/528 (91.67%), Postives = 499/528 (94.51%), Query Frame = 0
Query: 191 QAKSSHNPATPAGVITREKFDQLRGKIDAQVEALKAKCEQREGSLNDGDLRESPFTSDVL 250
+A+SS NPATPAGVITRE+FDQLRG++DAQVEALKAKCEQ+EG LNDGDL ESPFTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 251 EAPIPLKFKAPTVKPYHGSRDPKDYVEVFEGLMDFQAASNAIKCRAFQIALTGSARLWYR 310
EAPIP KFKAPTVKPY GS+DPKDYVEVFE LMDFQAAS+AIKCRAF+IALTGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 311 RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKECETLRKYVTRFQEEQLKVA 370
RLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKE ETLR+YVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 371 HCSDDSAMCYFLTGLADEALTVKLREEAPATFAEVLQKAKKVIDGQELLRTKIGRPERKI 430
HCSDDSAMCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRTK GRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 431 GRGRSGKDVERENPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIE 490
GRGRSGKD+E +PKSKDKGSFSSGRAEYRRAE+GPTRSRPYERFTPTTIPISEILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 491 ESGMEKLLNRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKHQIEDLIQDGYFKKFV 550
ESGMEKLL RPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 551 RKPRTSSAEKKEERKRSRTLPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREMCII 610
KPRTSSAEKKEERKRSRT PRRTDRPAVINTIFGGPSGGQSG KRKELARAARRE+CII
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 611 REQGPTYPITFDGADLGEVHLHHNDALVIAPLIDHVVVRRVLVDGVASANILSLPTYLAL 670
REQ PT PITFDGADL EVHL HNDALVIAPLIDHVVV RVLVDG SANILSLPTYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 671 GWTRSQLKKSP-PLVGFSGESVILEGCIDLPVTLGQDQTQVTQMAEFV 718
GWTRSQLKKSP PLVGFSGESVI EG IDLPVTLGQDQTQVTQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc02g13720 vs. ExPASy TrEMBL
Match:
A0A6J1D9E1 (uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018823 PE=4 SV=1)
HSP 1 Score: 930.2 bits (2403), Expect = 6.7e-267
Identity = 505/631 (80.03%), Postives = 525/631 (83.20%), Query Frame = 0
Query: 187 SSNQQAKSSHNPATPAGVITREKFDQLRGKIDAQVEALKAKCEQREGSLNDGDLRESPFT 246
SSNQQA+SSHNPATP GVITRE+FDQLRGK++AQVEALKAKCEQ+EG LNDGDL ESPFT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 247 SDVLEAPIPLKFKAPTVKPYHGSRDPKDYVEVFEGLMDFQAASNAIKCRAFQIALTGSAR 306
SDVLE APTVK Y GS+DPKDYVEVFEGLMDFQAAS+AIKCRAFQIALTGSAR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 307 LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKECETLRKYVTRFQEEQ 366
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 367 LKVAHCSDDSAMCYFLTGLADEALTVKLREEAPATFAEVLQKAKKVIDGQELLRTKIGRP 426
LKVA SDDSAMCYFLTGLADEALTVKL +EAPATFAEVLQKAKKVIDGQELLRTK GRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 427 ERKIGRGRSGKDVERENPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEIL 486
ER I RGRSGKD E+ + KSKDKGSFSSGRAE+RRA +GPTRSRPYERFTPTTIPISEIL
Sbjct: 242 ERGIDRGRSGKD-EKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEIL 301
Query: 487 TNIEESGMEKLLNRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKHQIEDLIQDGYF 546
TNIEESGMEKLL RPEKLRGAPERR+KDKYCRFHREH HNTSD WELK QIEDLIQD YF
Sbjct: 302 TNIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYF 361
Query: 547 KKFVRKPRTSSAEKKEERKRSRTLPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARRE 606
KKFV KPRTSSAEKKEERK SRT RR DRPAVINTIFGGPSGGQSGHKRKELARAARRE
Sbjct: 362 KKFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARRE 421
Query: 607 MCIIREQGPTYPITFDGADLGEVHLHHNDALVIAPLIDHVVVRRVLVDGVASANILSLPT 666
+CIIREQ PT PITFD ADL EVHL HNDALVIAPLIDHVVVRRVLVD SANI+SL T
Sbjct: 422 VCIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLT 481
Query: 667 YLALGWTRSQLKKS-PPLVGFSGESVILEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAY 726
YLALGWTRSQLKKS PLVGFS ESVI EGCIDLPVTLG DQTQVTQMAEFVVIDGRSAY
Sbjct: 482 YLALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAY 541
Query: 727 NAIFGRPIIHSFRAIPSTLHQVLKYPTHNGVGTIRGEQTASRECYASALKGSSVCALETL 786
NAIFGRPIIHSFRAIPSTLHQVLKY T NGVG +RGEQ ASRECYASALKGSSVCALETL
Sbjct: 542 NAIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETL 570
Query: 787 TGRDGTLEFEADLPRKEFAAPTEELELVPLL 817
RDGTLEF+A+LPR+EFAAPTEELELVPLL
Sbjct: 602 VSRDGTLEFKANLPRREFAAPTEELELVPLL 570
BLAST of Moc02g13720 vs. ExPASy TrEMBL
Match:
A0A6J1DHB3 (uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020479 PE=4 SV=1)
HSP 1 Score: 920.2 bits (2377), Expect = 7.0e-264
Identity = 513/791 (64.85%), Postives = 564/791 (71.30%), Query Frame = 0
Query: 1 MVQPANSTNTADRRTLAASDAHQREVGAAAVKGQGHDGLATKPLRRSARITAPALPSAHP 60
MVQPANSTNTADRR LAA+ HQREVGA V+GQGH+ L T+PL RSARIT P LP AHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARGPAPALTSEDFDALQREMEAMCIRMRSMEEMYNEMILAAGAG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 SRSENRMTRVDVREQRCSHLDPAEEERPEDNESEGYTRQRGDLREHLNRKKGSSLRKGQS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 PSRSHKSSNQQAKSSHNPATPAGVITREKFDQLRGKIDAQVEALKAKCEQREGSLNDGDL 240
A+SS+NP TP GVITRE+FDQL+ K DAQVEALKA+CE++E S +DGDL
Sbjct: 181 -----------AESSYNPITP-GVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDL 240
Query: 241 RESPFTSDVLEAPIPLKFKAPTVKPYHGSRDPKDYVEVFEGLMDFQAASNAIKCRAFQIA 300
E F+SD+LEA IP KFK PT+KPY GS+DPKDYVEVFE LMDFQAA++AIKC AFQIA
Sbjct: 241 GELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIA 300
Query: 301 LTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKECETLRKYVT 360
LTGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKE ETLR+YVT
Sbjct: 301 LTGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVT 360
Query: 361 RFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLREEAPATFAEVLQKAKKVIDGQELLR 420
RF EEQLKVAHCSDDSAMCYFLTGLADE LTVKLREEAPATFAEVLQK KKVIDGQELLR
Sbjct: 361 RFPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLR 420
Query: 421 TKIGRPERKIGRGRSGKDVERENPKSKDKG-SFSSGRAEYRRAESGPTRSRPYERFTPTT 480
TK GRPE+ I +GR+GKD + + KS+DKG S SS R +YRR+ S +SRPYE +TPTT
Sbjct: 421 TKTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTT 480
Query: 481 IPISEILTNIEESGMEKLLNRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKHQIED 540
IPI EILTNIEE+GMEKLL RPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELK QIED
Sbjct: 481 IPIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIED 540
Query: 541 LIQDGYFKKFVRKPRTSSAEKKEERKRSRTLPRRTDRPAVINTIFGGPSGGQSGHKRKEL 600
LIQDGYFKKFV KPR++S EKKEERKR RT PRR DRPAVIN K+KEL
Sbjct: 541 LIQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKEL 600
Query: 601 ARAARREMCIIREQGPTYPITFDGADLGEVHLHHNDALVIAPLIDHVVVRRVLVDGVASA 660
AR ARRE+CIIREQ PT I F+ ADL VHL HNDALVIAPLID V+VRR+LVDG ASA
Sbjct: 601 AREARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASA 650
Query: 661 NILSLPTYLALGWTRSQLKKSP-PLVGFSGESVILEGCIDLPVTLGQDQTQVTQMAEFVV 720
NILSL TYLALGWTRSQLKKSP PLVGFSGES+ LEGCIDLPV++ QD TQVTQMAEFVV
Sbjct: 661 NILSLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVV 650
Query: 721 IDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTHNGVGTIRGEQTASRECYASALKGSS 780
IDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY T NGVGT+RGE SRECYAS K SS
Sbjct: 721 IDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSS 650
Query: 781 VCALETLTGRD 790
VCALE T RD
Sbjct: 781 VCALEEQTIRD 650
BLAST of Moc02g13720 vs. ExPASy TrEMBL
Match:
A0A6J1DD03 (uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019899 PE=4 SV=1)
HSP 1 Score: 774.6 bits (1999), Expect = 4.7e-220
Identity = 403/448 (89.96%), Postives = 416/448 (92.86%), Query Frame = 0
Query: 378 MCYFLTGLADEALTVKLREEAPATFAEVLQKAKKVIDGQELLRTKIGRPERKIGRGRSGK 437
MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT KIG+GRSGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRT-------KIGQGRSGK 60
Query: 438 DVERENPKSKDKGSFSSGRAEYRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 497
D+E +PKSKDKGSFS+GRAEYRRAE+GPTRSRPYERFTPTTIPISEILTNIEESGMEKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 498 LNRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKHQIEDLIQDGYFKKFVRKPRTSS 557
L RPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFV KPRTSS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 558 AEKKEERKRSRTLPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREMCIIREQGPTY 617
AEKKEERKRSRT PRRTDRPAVINTIFGGPSGGQSGHKRK+LARAARRE+CIIREQ PT
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 618 PITFDGADLGEVHLHHNDALVIAPLIDHVVVRRVLVDGVASANILSLPTYLALGWTRSQL 677
PITFD ADL EVHL HNDALVIAPLIDHVVVRRVLVDG ASANILSLPTYLALGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 678 KKSP-PLVGFSGESVILEGCIDLPVTLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHS 737
KKSP PLVGFSGESV+ EGCIDLPVTLGQDQT+VTQMAEFVV+DGRSAYNAIFGRPIIHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 738 FRAIPSTLHQVLKYPTHNGVGTIRGEQTASRECYASALKGSSVCALETLTGRDGTLEFEA 797
FRAIPSTLHQVLKY T NGVGT+RGEQTASRECYAS LKG+SVCALETLT RDGTLEFEA
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 798 DLPRKEFAAPTEELELVPLLSPEKQTDL 825
DLP +EFAAP EELELVPLLS EKQ L
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQVQL 441
BLAST of Moc02g13720 vs. ExPASy TrEMBL
Match:
A0A6J1DZB9 (uncharacterized protein LOC111024904 OS=Momordica charantia OX=3673 GN=LOC111024904 PE=4 SV=1)
HSP 1 Score: 756.1 bits (1951), Expect = 1.7e-214
Identity = 405/562 (72.06%), Postives = 448/562 (79.72%), Query Frame = 0
Query: 283 MDFQAASNAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAT 342
MDFQAA++AIKCRAFQIALTGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTAT
Sbjct: 1 MDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSWHYDRKTAT 60
Query: 343 HLATIRQKECETLRKYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLREEAPATF 402
HLATIRQKE ETLR+YVTRFQEEQLKVAHCSDDSAMCYFLT LADE LTVKL EEAP TF
Sbjct: 61 HLATIRQKERETLREYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETLTVKLGEEAPTTF 120
Query: 403 AEVLQKAKKVIDGQELLRTKIGRPERKIGRGRSGKDVERENPKSKDKGSFSS-GRAEYRR 462
EVLQKAKKVIDGQELLRTK GRPE++I + + ++ + + KS+DKGS SS R EYRR
Sbjct: 121 VEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRR 180
Query: 463 AESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLNRPEKLRGAPERRSKDKYCRFHR 522
ESGP+RSRPYER+T +TIPISEILTNIEESGMEKLL RPEKLRG E+R+K+KYCRFHR
Sbjct: 181 LESGPSRSRPYERYTSSTIPISEILTNIEESGMEKLLKRPEKLRGDLEKRNKEKYCRFHR 240
Query: 523 EHGHNTSDCWELKHQIEDLIQDGYFKKFVRKPRTSSAEKKEERKRSRTLPRRTDRPAVIN 582
+HGHNT+ CWELK QIEDLIQDGYFKKFV KPR++S EKKEERKRSRT PRR DRPAVIN
Sbjct: 241 DHGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVIN 300
Query: 583 TIFGGPSGGQSGHKRKELARAARREMCIIREQGPTYPITFDGADLGEVHLHHNDALVIAP 642
TIFGGP+GGQSG+KRKELAR ARRE+CIIRE PT ITF ADL VHL HNDALVIA
Sbjct: 301 TIFGGPNGGQSGNKRKELAREARREVCIIREHKPTCSITFGDADLEGVHLPHNDALVIAS 360
Query: 643 LIDHVVVRRVLVDGVASANILSLPTYLALGWTRSQLKKSPPLVGFSGESVILEGCIDLPV 702
LIDH +VRRVL+DG GCIDLPV
Sbjct: 361 LIDHDLVRRVLIDG---------------------------------------GCIDLPV 420
Query: 703 TLGQDQTQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYPTHNGVGTIRG 762
T+GQD TQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKY T N VG +RG
Sbjct: 421 TIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNEVGMVRG 480
Query: 763 EQTASRECYASALKGSSVCALETLTGRDGTLEFEADLP---RKEFAAPTEELELVPLLSP 822
EQ SRECYASALKGS+VCALE T R E EADLP +++F PTEELELVPLLSP
Sbjct: 481 EQKTSRECYASALKGSAVCALEEQTNRGKLQESEADLPKEGKRQFPPPTEELELVPLLSP 523
Query: 823 EKQTDLARSVPVEILDNPSISK 841
E+Q + + V ++ P K
Sbjct: 541 ERQANPEKIKTVLEMEAPKTLK 523
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1C7X5 | 6.1e-268 | 91.67 | uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008... | [more] |
A0A6J1D9E1 | 6.7e-267 | 80.03 | uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018... | [more] |
A0A6J1DHB3 | 7.0e-264 | 64.85 | uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020... | [more] |
A0A6J1DD03 | 4.7e-220 | 89.96 | uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019... | [more] |
A0A6J1DZB9 | 1.7e-214 | 72.06 | uncharacterized protein LOC111024904 OS=Momordica charantia OX=3673 GN=LOC111024... | [more] |
Match Name | E-value | Identity | Description | |