Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGTTCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTGGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAGGGGCAAGGTCACGACGGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGAATCACAGCGCCTGCCCTGCCGCCTGCACACCCGAGGACGTCCAAGGCCATCCGTGGTCGAGGTGGGACCTCTAAGAAGGGCGCTCGAGGTCCAGCCCCGACTCCAACAAGCGAGGATTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACGCAAATGCGCTCCATGGAGGCGATGTATAACGAAATGGTGCTAGCTGCAGGCGCGGGGTCCCGATCTGAAAATCGGGCGACGCGCATGCACGTACGCGAGCAAAGGGGCTCCCACCTCGGCCCGGCCGAGGAGGAACGTCCCGAAGACAACGGGAGTGAGGGGTACGCTCGCCAGAGGGGAGACCTCCGCGAGCATCTCAACCGAAAGAGAGGCTCGTCTCTCCAAAAAGGGCAGTCGTCATCCCGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAAGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCGATCCCCCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGAGACGAAGGACCCCAAGGACTATGTTGAAGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTCCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCCCAGTTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTACGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGGGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGAAGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGTTATTTCCTCACAGGTCTAGCCGATGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCAAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGAGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCCTACGAGCGCTTCACCCCAACAACGATTCCAATCTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTATTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTTCATCGGGAGCACGACCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAAGATCTGATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCGGGACCGGCTCAGCAGAGAAAAAGGAGGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATACCATCTTTGGAGGGCCAAGCGGGGGTCAATCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGACGCGAGGTGTGCGTCATCAGGGAGCAGGGGCCGACCTGCCTAATCACCTTCGACGGTGTAGACTTGGAGGAGGTACACCTGCCCCACAATGATGCTCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTTAGAAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAGTCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGAACCAAACCCGGATCACTTAAATGGCCGAGTTCGTGGTAGTTGACGGTAGGTCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCCTTTCGGGCCATTCCTTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAACGGCGTGGGCACGGTCCGAGGAGAACAAGCCGCTTCGAGGGAGTGTTATGCCGCCGCACTCAAAGGCCCATCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGGTAAGCATAGGAACCAAGCTGGGGGCCACCGACAGAGAGGAGCTAATCCACTTCCTCAGATCCAACTTGGACGTCTTTGCATGGTCCCATGAGGACATGCCTGACATTGACCCGCGAATTATGACGCATCGCCTCAGCATAGATCCATCATTCCGACCTGTGAAACAAAAGAGAAGACTTATAAACAAGGAGAGGAGTGATGTAATTGTTGAGGAAGTTAACAAACTTTTGACAGCTGAATACATAAGAGAAATTTCGTATCCCGAGTGGCTCTCCAATGTTGTATTAGTTAAAAAATCTAACGGCAAGTGGAGAATGTGCGTAGATTTTACGAACTTAAATAAGGCATGCCCGAAAGATTGCTTCCCACTGCCGAGGATCGATCAGCTCGTCGACGCTACAGCCGGGCACGAACTGCTCACTTTCATGGACGCCTACTCTGGGTACAACCAAATCAAGATGCATGTCCCAGATGAAGGTCATACCGCTTTCATAACAGACCAAGGTCTGTATTGCTACAAGGTCATGCCCTTCGGGTTAAAGAACGCAGGAGCGACCTACCAGAGAATGGTGAACAAAATGTTCGCCAAGCAGATCGGCCGGAATATGGAAGTGTATGTGGACGACATGCTTGTCAAGAGCAAGCAGTCTAAGTCGCATCTCTCCGACCTGGCCGAAGCCTTCGAGGTTCTGAGGACATATCAAATGAAGCTCAACCCTGCTAAGTGTGCCTTTGGAGTCTCCTCGGGAAAATTCCTTGGCTTCATGGTAAACAACCGGGGAATCGAGGCCAACCCCGAAAAGATTAAAGCCGTCATCGAGATGGAGGCACCTAAAACGCTGAAACAGCTTCAGTGCCTCAATGGCAGGATTGCGGCCCTGAACCGGTTTGTTTCAAGGTCAACGGACAAGTGCCTCCCTTTCTTCAAGGTCTTACGAAAGAAAGGGCCGTTTGAATGGACGGCGGAGTGCGAGCAAGCGTTTCAGCAATTGAAGAACTACCTCTGTTCGGCACCCTTGCTTGTCAAGCCTATGCCGGGGGACAAGCTCCAATTATACCTAGCAGTGTCTGACAGTGCCGTCAGCTCGGCCCTAATCCGGCAAGAGGAAGCGCGGCAAAACCCGGTCTACTACACAAGCAAGGCTATGACCGAAGCCGAGGCTAGATACCCTCAGATGGAGAAGTTGGCTCTCGCTTTGGTCACCTCGGCCCGACGACTTAGACCATACTTCCAAGCCCATACGATGCTGGTGCTCACTAACTCGCCCCTTAAAAGTATCTTCCACAAGCCGGAAGCTTCCGGACGCCTAATGAAGTGGGCGATAGAGCTAAGTGAGTACGACATCCAGTTCGAACCCATAACTGCGTTGAAAGGGCAAACAGCGGCAGATTTCATAGCCGAGCTCACACCACCTTCCGAGCTGAGCGGGTCCGACCTGCCTTGGACAGTCTACGTCGACGGATCCTCCAATGAGAAGGGGTGCGGAGTCGGGATCCTCTTGCTCAGACCAGGGGGTGAACGATTTGAGTATGCCTTGCGGTTCAGCTTCCGGACTTCTAACAACGAGGCTGAGTATGAAGCATTTATTGCTGGCCTGCGAATCGCTCGAGCATTGGGGGCCTCTTGTGTTAAGGTCTTCAGTGACTCCCAGCTGGTTGTGAGCCAGATCAAGGACGAATACCAAGCCAAAGACACCCGAATGGAGAAGTATTTGAGCAAGGTCAGATCATACCTCGCCCAGTTCCGAACTTACGAAGTAAGCCGGATTCCGCGGGCGAAAAATTCTAATGCTGACGCCTTGGCCAAGTTAGCATCAGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCAGATCTGATGAAGATCGGCGCTCCAGAACCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCGAGGAGCGCAGAAAGTTGGTAAGAAGGGCAGCTCGGTTCGTGATCCGAGATGGGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTGTTGAAATGCCTAACCTCTGAAGTGGGCCTGTACGTCCTCAGAGAGATCCACGAAGGAGTGTGCGGCAATCACTCAGGCGCCCGGTCGCTGTCAGCCAAAGTGATCCGACAAGGATACTATTGGCCGACCCTCAGCTAGGACGCCAAGAAGTTCGTTAGAACTTGCGACAATTGCCAACGCTACGGAACCATAATCCACCAACCTCCCGAGCTGCTCACCCTCATCTCGGCCCCGTGGCCATTCGCGCAGTGGGGGGTGGATATCATTGGCCCTTTCCCTTTGGGCAAGAGCCAGACCAAGTTCGCGGTGGTTGTTGTGGATTACTTCACCAAGTGGGCCGAGGCCGAAGCGCTCTCCCACATAACGGAATCTAGGGTCACGCCCTTCGTATGGACAAGCATCATATGTCGCTTTGGTATACCACAGGCCATAGTGACAGACAATGGGAAGCAGTTTGACAACGCCAAGTTCAAAGACTTTTGCAGCAAACTTGGCATAAGTCACCTCAGCTCGTCCCCCGCACATCCGCAAGCGAATGGGCAGGTGGAGGCGGTCAACAAGATCATCAAGCGAGGCATCAAACTTAGATTGGACTCCAAGAAAGGCAGGTGGGCCGAGGAGTTACCAGAAGTCCTATGGTCGTACCGGACCACCCAACGGGGGTCGACGGGTGAGACCCCATTCTCCCTGGCCTTCGGCTCCGAAGCTGTGGTCCCGGTTGAGATCGGCATGCCATCTGATAGAGTAGAGCGTTACGAGCCTTCGATGAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCACTGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGGATGGCCAGACATTATAATGCCCGCGTTCGACCTCGGGACTTTCAGGTCGGCCATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTCGATCCGGCCTGGGAAGGCCCATTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACGTATTGGCCGATCTGGAAGGAGATGTCCTCGCGCACCCGTGTAACGCGAAACACCTGAAGCGTTATTATCCTTGA
mRNA sequence
ATGGTTCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTGGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAGGGGCAAGGTCACGACGGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGAATCACAGCGCCTGCCCTGCCGCCTGCACACCCGAGGACGTCCAAGGCCATCCGTGGTCGAGGTGGGACCTCTAAGAAGGGCGCTCGAGGTCCAGCCCCGACTCCAACAAGCGAGGATTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACGCAAATGCGCTCCATGGAGGCGATGTATAACGAAATGGTGCTAGCTGCAGGCGCGGGGTCCCGATCTGAAAATCGGGCGACGCGCATGCACGTACGCGAGCAAAGGGGCTCCCACCTCGGCCCGGCCGAGGAGGAACGTCCCGAAGACAACGGGAGTGAGGGGTACGCTCGCCAGAGGGGAGACCTCCGCGAGCATCTCAACCGAAAGAGAGGCTCGTCTCTCCAAAAAGGGCAGTCGTCATCCCGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAAGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCGATCCCCCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGAGACGAAGGACCCCAAGGACTATGTTGAAGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTCCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCCCAGTTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTACGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGGGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGAAGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGTTATTTCCTCACAGGTCTAGCCGATGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCAAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGAGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCCTACGAGCGCTTCACCCCAACAACGATTCCAATCTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTATTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTTCATCGGGAGCACGACCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAAGATCTGATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCGGGACCGGCTCAGCAGAGAAAAAGGAGGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATACCATCTTTGGAGGGCCAAGCGGGGGTCAATCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGACGCGAGGTGTGCGTCATCAGGGAGCAGGGGCCGACCTGCCTAATCACCTTCGACGGTGTAGACTTGGAGGAGGTACACCTGCCCCACAATGATGCTCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTTAGAAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAGTCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCCTTTCGGGCCATTCCTTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAACGGCGTGGGCACGGTCCGAGGAGAACAAGCCGCTTCGAGGGAGTGTTATGCCGCCGCACTCAAAGGCCCATCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCAGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCAGATCTGATGAAGATCGGCGCTCCAGAACCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCGAGGAGCGCAGAAAGTTGGTAAGAAGGGCAGCTCGGTTCGTGATCCGAGATGGGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTGTTGAAATGCCTAACCTCTGAAATCGGCATGCCATCTGATAGAGTAGAGCGTTACGAGCCTTCGATGAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCACTGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGGATGGCCAGACATTATAATGCCCGCGTTCGACCTCGGGACTTTCAGGTCGGCCATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTCGATCCGGCCTGGGAAGGCCCATTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACGTATTGGCCGATCTGGAAGGAGATGTCCTCGCGCACCCGTGTAACGCGAAACACCTGAAGCGTTATTATCCTTGA
Coding sequence (CDS)
ATGGTTCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTGGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAGGGGCAAGGTCACGACGGCCTAGCAGCGGAACCCCTCCGCAGGTCGGCACGAATCACAGCGCCTGCCCTGCCGCCTGCACACCCGAGGACGTCCAAGGCCATCCGTGGTCGAGGTGGGACCTCTAAGAAGGGCGCTCGAGGTCCAGCCCCGACTCCAACAAGCGAGGATTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACGCAAATGCGCTCCATGGAGGCGATGTATAACGAAATGGTGCTAGCTGCAGGCGCGGGGTCCCGATCTGAAAATCGGGCGACGCGCATGCACGTACGCGAGCAAAGGGGCTCCCACCTCGGCCCGGCCGAGGAGGAACGTCCCGAAGACAACGGGAGTGAGGGGTACGCTCGCCAGAGGGGAGACCTCCGCGAGCATCTCAACCGAAAGAGAGGCTCGTCTCTCCAAAAAGGGCAGTCGTCATCCCGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCCGCAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAAGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCGATCCCCCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGAGACGAAGGACCCCAAGGACTATGTTGAAGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTCCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCCCAGTTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTACGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGGGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGAAGCAATTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGTTATTTCCTCACAGGTCTAGCCGATGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCAAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGAGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCCTACGAGCGCTTCACCCCAACAACGATTCCAATCTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTATTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTTCATCGGGAGCACGACCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAAGATCTGATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCGGGACCGGCTCAGCAGAGAAAAAGGAGGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAATACCATCTTTGGAGGGCCAAGCGGGGGTCAATCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGACGCGAGGTGTGCGTCATCAGGGAGCAGGGGCCGACCTGCCTAATCACCTTCGACGGTGTAGACTTGGAGGAGGTACACCTGCCCCACAATGATGCTCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTTAGAAGAGTGCTGGTAGACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTCGCAATTGAAGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAGTCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCCTTTCGGGCCATTCCTTCAACACTTCATCAAGTTTTGAAGTATCCCACCCCCAACGGCGTGGGCACGGTCCGAGGAGAACAAGCCGCTTCGAGGGAGTGTTATGCCGCCGCACTCAAAGGCCCATCGGTCTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCAGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCAGATCTGATGAAGATCGGCGCTCCAGAACCCTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCGAGGAGCGCAGAAAGTTGGTAAGAAGGGCAGCTCGGTTCGTGATCCGAGATGGGGCATTGTACCGACGTGGCTTTTCCCTGCCTCTGTTGAAATGCCTAACCTCTGAAATCGGCATGCCATCTGATAGAGTAGAGCGTTACGAGCCTTCGATGAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCACTGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGGATGGCCAGACATTATAATGCCCGCGTTCGACCTCGGGACTTTCAGGTCGGCCATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTCGATCCGGCCTGGGAAGGCCCATTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACGTATTGGCCGATCTGGAAGGAGATGTCCTCGCGCACCCGTGTAACGCGAAACACCTGAAGCGTTATTATCCTTGA
Protein sequence
MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKAIRGRGGTSKKGARGPAPTPTSEDFDALQREMEAMRTQMRSMEAMYNEMVLAAGAGSRSENRATRMHVREQRGSHLGPAEEERPEDNGSEGYARQRGDLREHLNRKRGSSLQKGQSSSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDETKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSRGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLFKRPEKLRGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCVIREQGPTCLITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYAAALKGPSVCALETLRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILEPDLMKIGAPEPSWMDPIADFIRGNSPQDPEERRKLVRRAARFVIRDGALYRRGFSLPLLKCLTSEIGMPSDRVERYEPSMNEEELLLNLDLLEERRALAQLRLAEYQGRMARHYNARVRPRDFQVGHLVLRRVQTHVGALDPAWEGPFEIKGIVRPGTYVLADLEGDVLAHPCNAKHLKRYYP
Homology
BLAST of Moc11g13700 vs. NCBI nr
Match:
XP_022137317.1 (uncharacterized protein LOC111008813 [Momordica charantia])
HSP 1 Score: 907.5 bits (2344), Expect = 1.0e-259
Identity = 472/524 (90.08%), Postives = 484/524 (92.37%), Query Frame = 0
Query: 191 QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVL 250
+AESS N PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+ LNDGDLGESPFTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 251 EAPIPPKFKAPTVKPYDETKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR 310
EAPIPPKFKAPTVKPYD +KDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 311 RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVA 370
RLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQE+QLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 371 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 430
HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 431 GRGRSGKD-ERADPKSKDKGSFSRGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 490
GRGRSGKD E ADPKSKDKGSFS GRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 491 DSGMEKLFKRPEKLRGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFV 550
+SGMEKL KRPEKLRGAPERRSKDKYCRFHREH HNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 551 GKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCVI 610
GKP T SAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELARAARREVC+I
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 611 REQGPTCLITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLAL 670
REQ PTC ITFDG DLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 671 GWTRSQLKRSPTPLVGFSGESAYNAIFGRPIIHSFRAIPSTLHQ 711
GWTRSQLK+SPTPLVGFSGES I F +P TL Q
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESV--------IPEGFIDLPVTLGQ 518
BLAST of Moc11g13700 vs. NCBI nr
Match:
XP_022150760.1 (uncharacterized protein LOC111018823 [Momordica charantia])
HSP 1 Score: 880.2 bits (2273), Expect = 1.7e-251
Identity = 489/671 (72.88%), Postives = 513/671 (76.45%), Query Frame = 0
Query: 187 SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFT 246
SSNQQAESSHNPA G+ITREEFDQLRG+L+AQVEALKAKCEQK+ LNDGDLGESPFT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 247 SDVLEAPIPPKFKAPTVKPYDETKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 306
SDVLE APTVK YD +KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 307 LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQ 366
LW FQE Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 367 LKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP 426
LKVA SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 427 ERKIGRGRSGKDERADPKSKDKGSFSRGRAEYRRAENGPTRSRPYERFTPTTIPISEILT 486
ER I RGRSGKDE+AD KSKDKGSFS GRAE+RRA NGPTRSRPYERFTPTTIPISEILT
Sbjct: 242 ERGIDRGRSGKDEKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILT 301
Query: 487 NIEDSGMEKLFKRPEKLRGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFK 546
NIE+SGMEKL KRPEKLRGAPERR+KDKYCRFHREHDHNTSD WELKRQIEDLIQD YFK
Sbjct: 302 NIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFK 361
Query: 547 KFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREV 606
KFVGKP T SAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAARREV
Sbjct: 362 KFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREV 421
Query: 607 CVIREQGPTCLITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTY 666
C+IREQ PTC ITFD DLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TY
Sbjct: 422 CIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTY 481
Query: 667 LALGWTRSQLKRSPTPLVGFS---------------------------------GESAYN 726
LALGWTRSQLK+S TPLVGFS G SAYN
Sbjct: 482 LALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYN 541
Query: 727 AIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYAAALKGPSVCALETL- 786
AIFGRPIIHSFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYA+ALKG SVCALETL
Sbjct: 542 AIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETLV 601
Query: 787 -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE 820
RDGTLEF+A+LPR+EFAAPTEELELVPLL + +E +L + +D+
Sbjct: 602 SRDGTLEFKANLPRREFAAPTEELELVPLLRYKYNENIDHEQELDEKSSLNKIDD----- 605
BLAST of Moc11g13700 vs. NCBI nr
Match:
XP_022152854.1 (uncharacterized protein LOC111020479 [Momordica charantia])
HSP 1 Score: 867.5 bits (2240), Expect = 1.1e-247
Identity = 490/790 (62.03%), Postives = 538/790 (68.10%), Query Frame = 0
Query: 1 MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHP 60
MVQPANSTNT DRR LAA+ HQREVGA VEGQGH+ L EPL RSARIT P LPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKAIRGRGGTSKKGARGPAPTPTSEDFDALQREMEAMRTQMRSMEAMYNEMVLAAGAG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 SRSENRATRMHVREQRGSHLGPAEEERPEDNGSEGYARQRGDLREHLNRKRGSSLQKGQS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 SSRSHRSSNQQAESSHNP--AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLG 240
AESS+NP G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLG
Sbjct: 181 -----------AESSYNPITPGVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLG 240
Query: 241 ESPFTSDVLEAPIPPKFKAPTVKPYDETKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL 300
E F+SD+LEA IPPKFK PT+KPYD +KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Sbjct: 241 ELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIAL 300
Query: 301 TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTR 360
TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTR
Sbjct: 301 TGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTR 360
Query: 361 FQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRT 420
F E+QLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDGQELLRT
Sbjct: 361 FPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRT 420
Query: 421 KTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSRGRAEYRRAENGPTRSRPYERFTPTTI 480
KTGRPE+ I +GR+GKD+ +AD KS+DKG S S R +YRR+ + +SRPYE +TPTTI
Sbjct: 421 KTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTI 480
Query: 481 PISEILTNIEDSGMEKLFKRPEKLRGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDL 540
PI EILTNIE++GMEKL KRPEKLRG PE+R+ DKYCRFHR+H HNTS+ WELKRQIEDL
Sbjct: 481 PIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDL 540
Query: 541 IQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA 600
IQDGYFKKFVGKP + S EKKEERKR RTPPRR DRPAVIN K+KELA
Sbjct: 541 IQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKELA 600
Query: 601 RAARREVCVIREQGPTCLITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASAN 660
R ARREVC+IREQ PT I F+ DLE VHLPHNDALVIAPLID V+VRR+LVDGGASAN
Sbjct: 601 REARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASAN 650
Query: 661 ILSLPTYLALGWTRSQLKRSPTPLVGFSGE------------------------------ 720
ILSL TYLALGWTRSQLK+SPTPLVGFSGE
Sbjct: 661 ILSLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVI 650
Query: 721 ---SAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYAAALKGPSV 752
SAYNAIFGRPIIHSFRA+PSTLHQVLKY T NGVGTVRGE SRECYA+ K SV
Sbjct: 721 DGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSSV 650
BLAST of Moc11g13700 vs. NCBI nr
Match:
XP_022150613.1 (uncharacterized protein LOC111018708, partial [Momordica charantia])
HSP 1 Score: 776.9 bits (2005), Expect = 2.0e-220
Identity = 396/422 (93.84%), Postives = 404/422 (95.73%), Query Frame = 0
Query: 228 KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDETKDPKDYVEVFEGLMDFQAASD 287
KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYD +KDPKDYVEVFEGLMDF AASD
Sbjct: 44 KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFHAASD 103
Query: 288 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQK 347
AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSR Y KKT THLATIRQK
Sbjct: 104 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRQYGKKTETHLATIRQK 163
Query: 348 EGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAK 407
EG TLREYVTRFQE+QLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAK
Sbjct: 164 EGGTLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPTTFAEVLQKAK 223
Query: 408 KVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSRGRAEYRRAENGPTRSR 467
KVIDGQELLRTKTGRP+RKIGRGRSGKD ERADPKSKDKGSFS GRAEYRRAE+GPT+SR
Sbjct: 224 KVIDGQELLRTKTGRPDRKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGPTKSR 283
Query: 468 PYERFTPTTIPISEILTNIEDSGMEKLFKRPEKLRGAPERRSKDKYCRFHREHDHNTSDC 527
PYERFTPTTIPISEILTNIE+SGMEKL KRPEKLRGAPERRSKDKYCRFHREH HNTSDC
Sbjct: 284 PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 343
Query: 528 WELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 587
WELKRQIEDLIQDGYFKKFVGKP T SAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Sbjct: 344 WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 403
Query: 588 QSGHKRKELARAARREVCVIREQGPTCLITFDGVDLEEVHLPHNDALVIAPLIDHVVVRR 647
QSGHKRKELARAARREVC+IREQGPTC ITFDG D EEVHLPHNDA VIAPLIDHVVVRR
Sbjct: 404 QSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPHNDARVIAPLIDHVVVRR 463
Query: 648 VL 649
VL
Sbjct: 464 VL 465
BLAST of Moc11g13700 vs. NCBI nr
Match:
XP_022156542.1 (uncharacterized protein LOC111023421 [Momordica charantia])
HSP 1 Score: 760.8 bits (1963), Expect = 1.5e-215
Identity = 391/405 (96.54%), Postives = 394/405 (97.28%), Query Frame = 0
Query: 200 GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPT 259
GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPT
Sbjct: 26 GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPT 85
Query: 260 VKPYDETKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQ 319
VKPYD TKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLP RSISTYSQ
Sbjct: 86 VKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPXRSISTYSQ 145
Query: 320 LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFL 379
LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQE+QLKVAHCSDDSAMCYFL
Sbjct: 146 LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL 205
Query: 380 TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERA 439
TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD ERA
Sbjct: 206 TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERA 265
Query: 440 DPKSKDKGSFSRGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLFKRPE 499
DPKSKDKGSFS GRAEYRRAENGPTRSRPYERFTPTTIPI EILTNIE+SGMEKL KRPE
Sbjct: 266 DPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPE 325
Query: 500 KLRGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKE 559
KLRGAPERRSKDKYCRFHREH HNTSD WELKRQIEDLIQDGYFKKFVGKP T SAEKKE
Sbjct: 326 KLRGAPERRSKDKYCRFHREHGHNTSDFWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKE 385
Query: 560 ERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREV 604
ERKRSRTPPRRTDRPAVINTIFGGPSGGQ GHKRKELARAARRE+
Sbjct: 386 ERKRSRTPPRRTDRPAVINTIFGGPSGGQLGHKRKELARAARREL 430
BLAST of Moc11g13700 vs. ExPASy TrEMBL
Match:
A0A6J1C7X5 (uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008813 PE=4 SV=1)
HSP 1 Score: 907.5 bits (2344), Expect = 4.8e-260
Identity = 472/524 (90.08%), Postives = 484/524 (92.37%), Query Frame = 0
Query: 191 QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVL 250
+AESS N PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+ LNDGDLGESPFTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 251 EAPIPPKFKAPTVKPYDETKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR 310
EAPIPPKFKAPTVKPYD +KDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 311 RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVA 370
RLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQE+QLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 371 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 430
HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 431 GRGRSGKD-ERADPKSKDKGSFSRGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 490
GRGRSGKD E ADPKSKDKGSFS GRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 491 DSGMEKLFKRPEKLRGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFV 550
+SGMEKL KRPEKLRGAPERRSKDKYCRFHREH HNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 551 GKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCVI 610
GKP T SAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELARAARREVC+I
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 611 REQGPTCLITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLAL 670
REQ PTC ITFDG DLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SANILSLPTYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 671 GWTRSQLKRSPTPLVGFSGESAYNAIFGRPIIHSFRAIPSTLHQ 711
GWTRSQLK+SPTPLVGFSGES I F +P TL Q
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESV--------IPEGFIDLPVTLGQ 518
BLAST of Moc11g13700 vs. ExPASy TrEMBL
Match:
A0A6J1D9E1 (uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018823 PE=4 SV=1)
HSP 1 Score: 880.2 bits (2273), Expect = 8.2e-252
Identity = 489/671 (72.88%), Postives = 513/671 (76.45%), Query Frame = 0
Query: 187 SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFT 246
SSNQQAESSHNPA G+ITREEFDQLRG+L+AQVEALKAKCEQK+ LNDGDLGESPFT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 247 SDVLEAPIPPKFKAPTVKPYDETKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 306
SDVLE APTVK YD +KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 307 LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQ 366
LW FQE Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 367 LKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP 426
LKVA SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 427 ERKIGRGRSGKDERADPKSKDKGSFSRGRAEYRRAENGPTRSRPYERFTPTTIPISEILT 486
ER I RGRSGKDE+AD KSKDKGSFS GRAE+RRA NGPTRSRPYERFTPTTIPISEILT
Sbjct: 242 ERGIDRGRSGKDEKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILT 301
Query: 487 NIEDSGMEKLFKRPEKLRGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFK 546
NIE+SGMEKL KRPEKLRGAPERR+KDKYCRFHREHDHNTSD WELKRQIEDLIQD YFK
Sbjct: 302 NIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFK 361
Query: 547 KFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREV 606
KFVGKP T SAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAARREV
Sbjct: 362 KFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREV 421
Query: 607 CVIREQGPTCLITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTY 666
C+IREQ PTC ITFD DLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SANI+SL TY
Sbjct: 422 CIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTY 481
Query: 667 LALGWTRSQLKRSPTPLVGFS---------------------------------GESAYN 726
LALGWTRSQLK+S TPLVGFS G SAYN
Sbjct: 482 LALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYN 541
Query: 727 AIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYAAALKGPSVCALETL- 786
AIFGRPIIHSFRAIPSTLHQVLKY TPNGVG VRGEQ ASRECYA+ALKG SVCALETL
Sbjct: 542 AIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETLV 601
Query: 787 -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE 820
RDGTLEF+A+LPR+EFAAPTEELELVPLL + +E +L + +D+
Sbjct: 602 SRDGTLEFKANLPRREFAAPTEELELVPLLRYKYNENIDHEQELDEKSSLNKIDD----- 605
BLAST of Moc11g13700 vs. ExPASy TrEMBL
Match:
A0A6J1DHB3 (uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020479 PE=4 SV=1)
HSP 1 Score: 867.5 bits (2240), Expect = 5.5e-248
Identity = 490/790 (62.03%), Postives = 538/790 (68.10%), Query Frame = 0
Query: 1 MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLAAEPLRRSARITAPALPPAHP 60
MVQPANSTNT DRR LAA+ HQREVGA VEGQGH+ L EPL RSARIT P LPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKAIRGRGGTSKKGARGPAPTPTSEDFDALQREMEAMRTQMRSMEAMYNEMVLAAGAG 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 SRSENRATRMHVREQRGSHLGPAEEERPEDNGSEGYARQRGDLREHLNRKRGSSLQKGQS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 SSRSHRSSNQQAESSHNP--AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLG 240
AESS+NP G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLG
Sbjct: 181 -----------AESSYNPITPGVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLG 240
Query: 241 ESPFTSDVLEAPIPPKFKAPTVKPYDETKDPKDYVEVFEGLMDFQAASDAIKCRAFQIAL 300
E F+SD+LEA IPPKFK PT+KPYD +KDPKDYVEVFE LMDFQAA+DAIKC AFQIAL
Sbjct: 241 ELSFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIAL 300
Query: 301 TGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTR 360
TGSARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTR
Sbjct: 301 TGSARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTR 360
Query: 361 FQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRT 420
F E+QLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDGQELLRT
Sbjct: 361 FPEEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRT 420
Query: 421 KTGRPERKIGRGRSGKDE-RADPKSKDKG-SFSRGRAEYRRAENGPTRSRPYERFTPTTI 480
KTGRPE+ I +GR+GKD+ +AD KS+DKG S S R +YRR+ + +SRPYE +TPTTI
Sbjct: 421 KTGRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTI 480
Query: 481 PISEILTNIEDSGMEKLFKRPEKLRGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDL 540
PI EILTNIE++GMEKL KRPEKLRG PE+R+ DKYCRFHR+H HNTS+ WELKRQIEDL
Sbjct: 481 PIFEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDL 540
Query: 541 IQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELA 600
IQDGYFKKFVGKP + S EKKEERKR RTPPRR DRPAVIN K+KELA
Sbjct: 541 IQDGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKELA 600
Query: 601 RAARREVCVIREQGPTCLITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASAN 660
R ARREVC+IREQ PT I F+ DLE VHLPHNDALVIAPLID V+VRR+LVDGGASAN
Sbjct: 601 REARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASAN 650
Query: 661 ILSLPTYLALGWTRSQLKRSPTPLVGFSGE------------------------------ 720
ILSL TYLALGWTRSQLK+SPTPLVGFSGE
Sbjct: 661 ILSLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVI 650
Query: 721 ---SAYNAIFGRPIIHSFRAIPSTLHQVLKYPTPNGVGTVRGEQAASRECYAAALKGPSV 752
SAYNAIFGRPIIHSFRA+PSTLHQVLKY T NGVGTVRGE SRECYA+ K SV
Sbjct: 721 DGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSSV 650
BLAST of Moc11g13700 vs. ExPASy TrEMBL
Match:
A0A6J1D9W7 (uncharacterized protein LOC111018708 OS=Momordica charantia OX=3673 GN=LOC111018708 PE=4 SV=1)
HSP 1 Score: 776.9 bits (2005), Expect = 9.8e-221
Identity = 396/422 (93.84%), Postives = 404/422 (95.73%), Query Frame = 0
Query: 228 KDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDETKDPKDYVEVFEGLMDFQAASD 287
KDDSLNDGDLGES FTSDVLEAPIPPKFKAPTVKPYD +KDPKDYVEVFEGLMDF AASD
Sbjct: 44 KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFHAASD 103
Query: 288 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQK 347
AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSR Y KKT THLATIRQK
Sbjct: 104 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRQYGKKTETHLATIRQK 163
Query: 348 EGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAK 407
EG TLREYVTRFQE+QLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAK
Sbjct: 164 EGGTLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPTTFAEVLQKAK 223
Query: 408 KVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSRGRAEYRRAENGPTRSR 467
KVIDGQELLRTKTGRP+RKIGRGRSGKD ERADPKSKDKGSFS GRAEYRRAE+GPT+SR
Sbjct: 224 KVIDGQELLRTKTGRPDRKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGPTKSR 283
Query: 468 PYERFTPTTIPISEILTNIEDSGMEKLFKRPEKLRGAPERRSKDKYCRFHREHDHNTSDC 527
PYERFTPTTIPISEILTNIE+SGMEKL KRPEKLRGAPERRSKDKYCRFHREH HNTSDC
Sbjct: 284 PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 343
Query: 528 WELKRQIEDLIQDGYFKKFVGKPGTGSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 587
WELKRQIEDLIQDGYFKKFVGKP T SAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Sbjct: 344 WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 403
Query: 588 QSGHKRKELARAARREVCVIREQGPTCLITFDGVDLEEVHLPHNDALVIAPLIDHVVVRR 647
QSGHKRKELARAARREVC+IREQGPTC ITFDG D EEVHLPHNDA VIAPLIDHVVVRR
Sbjct: 404 QSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPHNDARVIAPLIDHVVVRR 463
Query: 648 VL 649
VL
Sbjct: 464 VL 465
BLAST of Moc11g13700 vs. ExPASy TrEMBL
Match:
A0A6J1DS95 (uncharacterized protein LOC111023421 OS=Momordica charantia OX=3673 GN=LOC111023421 PE=4 SV=1)
HSP 1 Score: 760.8 bits (1963), Expect = 7.3e-216
Identity = 391/405 (96.54%), Postives = 394/405 (97.28%), Query Frame = 0
Query: 200 GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPT 259
GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPT
Sbjct: 26 GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPT 85
Query: 260 VKPYDETKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQ 319
VKPYD TKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLP RSISTYSQ
Sbjct: 86 VKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPXRSISTYSQ 145
Query: 320 LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFL 379
LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQE+QLKVAHCSDDSAMCYFL
Sbjct: 146 LRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFL 205
Query: 380 TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERA 439
TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD ERA
Sbjct: 206 TGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERA 265
Query: 440 DPKSKDKGSFSRGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEDSGMEKLFKRPE 499
DPKSKDKGSFS GRAEYRRAENGPTRSRPYERFTPTTIPI EILTNIE+SGMEKL KRPE
Sbjct: 266 DPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEESGMEKLLKRPE 325
Query: 500 KLRGAPERRSKDKYCRFHREHDHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTGSAEKKE 559
KLRGAPERRSKDKYCRFHREH HNTSD WELKRQIEDLIQDGYFKKFVGKP T SAEKKE
Sbjct: 326 KLRGAPERRSKDKYCRFHREHGHNTSDFWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKE 385
Query: 560 ERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREV 604
ERKRSRTPPRRTDRPAVINTIFGGPSGGQ GHKRKELARAARRE+
Sbjct: 386 ERKRSRTPPRRTDRPAVINTIFGGPSGGQLGHKRKELARAARREL 430
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022137317.1 | 1.0e-259 | 90.08 | uncharacterized protein LOC111008813 [Momordica charantia] | [more] |
XP_022150760.1 | 1.7e-251 | 72.88 | uncharacterized protein LOC111018823 [Momordica charantia] | [more] |
XP_022152854.1 | 1.1e-247 | 62.03 | uncharacterized protein LOC111020479 [Momordica charantia] | [more] |
XP_022150613.1 | 2.0e-220 | 93.84 | uncharacterized protein LOC111018708, partial [Momordica charantia] | [more] |
XP_022156542.1 | 1.5e-215 | 96.54 | uncharacterized protein LOC111023421 [Momordica charantia] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1C7X5 | 4.8e-260 | 90.08 | uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008... | [more] |
A0A6J1D9E1 | 8.2e-252 | 72.88 | uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018... | [more] |
A0A6J1DHB3 | 5.5e-248 | 62.03 | uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020... | [more] |
A0A6J1D9W7 | 9.8e-221 | 93.84 | uncharacterized protein LOC111018708 OS=Momordica charantia OX=3673 GN=LOC111018... | [more] |
A0A6J1DS95 | 7.3e-216 | 96.54 | uncharacterized protein LOC111023421 OS=Momordica charantia OX=3673 GN=LOC111023... | [more] |
Match Name | E-value | Identity | Description | |