Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGTTCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTGGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGTGGCGGCAGAAGGGCAAGGTCACGACGGCCTGGCAGCGGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGCCCTGCCGCCTGCACACCCGAGGACGTCCAAGGCCACCCGTGGTCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGACTCCAACAAGCGAGGATTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCTTCATGGAGGCAATGTATAACGACATGGTGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGGGCGACGCGCATAGACGCATGCGAGCAAAGGGGTTCCCACCTCGGCCCGGCCGAGGAGGAACGTCCCGAAGACAACGGGAGTGAGGGGTACACTCACCAGAAGGGAGACCTCCGCGAGCATCTCAACCGAAAGAGAGGCTCGTCTCTCCGAAAGGGGCAATCGCCATCCCGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCCGCAGGGATAATCACAAGGGAGGAATTCGACCAGCTAAGGGGGGAGCTCGATGCTCAAGTGGAGGCCCTAAAGGCCAAATGTGAGCAGAAGGACGATTCACTGAACGATGGCGACTTGGGAGAATCGTCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAAGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGTTTGTGGTACCGGAGATTGCCAGCCAGATCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTACGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGGGAGACGCTGCGGGAGTATGTTACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTACTTCCTCACCGGTCTAGCCGATGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAAGACAAAGGATCCTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCAACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTTCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTACTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAAGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCGGGACTAGCTCAGCAGAGAAAAAGGAGGAGCGAAAGCGTTCTAGAACGCCACCTCGGCGCACCGACCGACCTGCGGTCATCAACACCATTTTTGGAGGACCAAGCGGGGGTCAATCCGGGCATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGACGCGAGGTGTGCGTCATCAGGGAGCACGGGCCGACCTGCCCAATCACCTTCGACGGTGTAGACTTGGAGGAGGTACATCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTTAGGAGAGTACTGGTAGACGGGGGCGCATCCGCTAACGTCCTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTC
GCAATTGAGGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGCTGCATCGACTTGCCGGTCACGCTGGGGCAGAACCAAACCCGGATCACTCAAATGGCCGAGTTCGTGGTAGTTGATGGTAGGTCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCCTTTTGGGCCATTCCTTCAACAGTTCATCAAGTTTTGAAGTATCCCACCCCCAGCGGCGTGGGCACGGTCCGAGGAGAGCAAACCGCTTCGAGGGAGTGTTATGCCGCCGCACTCAAAGGCCCATCAGTTTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACCGAGGAGCTCGAGCTTGTTCCTCTGCTTAGCCCCGAGAAGCAGGTAAGCATAGGAACCAAGCTGGGGGCCACCGACAGAGAGGAGCTAATCCACTTCCTCAGATCCAACTCGGACGTCTTTGCATGGTCCCATGAGGACATGCTTGGCATCGACCCGCGAATTATGACGCATCGCCTCAGCATAGATCCATCATTCCGACCTGTGAAACAAAAGAGAAGACCTATAAACAAGGAGAGAAGTGATGTAATTGTTGAGAAAGTTAACAAACTTTTGAAAGCTGAATACATAAGAGAAATTTCGTATCCCGAGTGGCTCTCCAATGTTGTATTAGTTAAAAAATCTAACGGCAAGTGGAGAATGTGCGTAGACTTTACGAATTTAAATAAGGCATGTCCGAAGGATTGCTTCCCACTGCCGAGGATTGATCAGCTCGTGGACGCCATAGCTGGGCACGAACTGCTCACTTTCATGGACGCCTACTCTGGGTACAACCAAATCAAGATGCATGTCTCAGATGAAGGTCATACCGCTTTTATAACAGACCAAGGTCTGTACTGCTACAAGGTCATGCCCTTCGGGTTAAAAAACGCAGGAGCGACCTACCAGAGAATGGTGAACAAAATGTTCGCCAAGCAGATCGGCCGGAATATGGAGGTGTATGTGGACGACATGCTTGTCAAGAGCAAGCAGTCTGAGTCGCATCTCTCCGACCTGGCCGAAGCCTTCGAGGTTCTGAGGGCATATCAAATGAAGCTCAACCCTGCCAAGTGTGCCTTTGGAGTCTCCTCGGGAAAATTCCTTGGCTTCATGGTAAACAACCGGGGGATCGAGGCCAACCCCGAAAAGATTAGAGCCGTGATCGAGATGGAGGCACCTAAAACGCTGAAACAGCTTCAGTGCCTCAATGGCAGGATTGCGGCCCTGAACCGGTTTGCTTCAAGGTCAACGGACAAGTGCCTCCCTTTCTTCAAGGTTTTACGAAAGAAAGGGCCGTTTGAATGGACGGCGGAGTGCAAGCAAGCGTTTCAGCAATTGAAGAACTACCTCTGTTCGGCACCCTTGCTTGCCAAGCCTATGTCGGGAGACAAGCTCCAATTATACCTAGCAGTGTCTGACAGTGCCGTCAGCTCGGCCCTAATCCGGCAAGAGGAAGCGCGGCAAAACCCGGTCTACTACACAAGCAAGGCTATGACCGAAGCCGAGGCTAGATACCCTCAGATGGAGAAGTTGGCTCTCGCTTTGGTCACCTCGGCCCGACGACTTAGACCATACTTCCAAGCCCATACGGTGGTGGTACTCACTAACTCGCCCCTTAAAAGTATCTTCCACAAGCCGGAAGCTTCCGGGCGCCTAATGAAGTGAGCGATAGAGTTAAGTGAGTACGACATCCAGTTCGAACCCAGAACTGCGTTAAAAGGGCAAGCAGCGGCAGATTTCATAGCCGAGCTCACACCACTTTCCGAGCTGAGCGGGACCGACCTGCCTTGGACAGTCTACGTCGACGGATCCTCCAATGAGAAGGGGTGCGGAGCCGGGGTCCTCTTGCTCGGACCAGGGGGTGAACGATTTGAGTATGCCTTGCGGTTCAGCTTCCGGACTTCTAACA
ACGAGGCTGAGTATGAAGCATTTATTGCTGGCCTGCGAATCGCTCGAGCATTGGGGGCCTCTTGTGTTAAGGTCTTCAGTGACTCCCAGCTGGTTGTGAACCAGATCAAGGACGAATACCAAGCCAAAGACACCCGAATGGAGAAGTATTTGGGCAAGGTCAGATCATACCTCAACCAGTTTCGAACTTACGAAGTAAGCCGGATTCCGCGGGCGGAGAATTCTAATGCTGACGCCTTGGCCAAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAACCAGATCTGATGGAGATCGGCGCTCCAGAACCCTCATGGATGGACCCGATCGCAGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGACAGGCAGCTCGGTTCGTGATCCGAGATGGGGCATTGTACCGACGTGGTTTTTCCCTGCCTCTGTTGAAATGCCTAACCCCTGAAGAGGGCCTGTACGTCCTTAGAGAGATCCACGAAGGAGTGTGCGGCAATCACTCAGGCGCCCGGTCGCTGTCAGCCAAAGTGATCCGACAAGGATACTATTGGCCGACCCTCAGCCAGGACGCCAAGAAGTTCGTTAGAACTTGCGACAATTGCTAACGCTACGGAACCATAATCCACCAACCTCCCGAGCTGCTCACCCCCATCTCGGCCCCGTGGCCATTCGCGCAGTGGGGGGTGGATATCATTGGCCCTTTCCCTTTGGGCAAGGGCCAGACCAAGTTCGCTGTGGTTGCTGTGGATTACTTCACCAAGTGGGCCGAGGCCGAAGCGCTCTCCCACATAACGGAATTCAGGGTCACGTCCTTCGTATGGACGAACATCATATGTCGCTTTGGTATACCACAGGCCATAGTGACAGACAATGGCAAGCAGTTTGACAACGCCAAGTTCAAAGACTTTTGCAGCAAACTTGGCATAAGTCATCTCAGCTCGTCCCCCGCACATCCGCAAGCAAATGGGCAGGTGGAGGCGGTCAACAAGATCATCAAGCGAGGCATCAAACTTAGACTGGACTCCAAGAAAGCCAGGTGGGCCGAGGAGCTACCCGAAGTTCTATGGTCGTACCGGACCACCCAGCGGGGGTCGACGGGGAGACCCCGTTTTCCCTGGCTTTCGGCTCCGAAGCTGTAGTCCCGATTGAGATCGGCATGCCATCTGACAGAGTAGAGCATTACGAGCCTTCGACAAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGACGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGTAGGATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGCCATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAAGGCCCATTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACGTATTGGCCGATCCGAAAGGAGATGTCCTCGCGCACCCGTGA
mRNA sequence
ATGGTTCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTGGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGTGGCGGCAGAAGGGCAAGGTCACGACGGCCTGGCAGCGGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGCCCTGCCGCCTGCACACCCGAGGACGTCCAAGGCCACCCGTGGTCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGACTCCAACAAGCGAGGATTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCTTCATGGAGGCAATGTATAACGACATGGTGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGGGCGACGCGCATAGACGCATGCGAGCAAAGGGGTTCCCACCTCGGCCCGGCCGAGGAGGAACGTCCCGAAGACAACGGGAGTGAGGGGTACACTCACCAGAAGGGAGACCTCCGCGAGCATCTCAACCGAAAGAGAGGCTCGTCTCTCCGAAAGGGGCAATCGCCATCCCGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCCGCAGGGATAATCACAAGGGAGGAATTCGACCAGCTAAGGGGGGAGCTCGATGCTCAAGTGGAGGCCCTAAAGGCCAAATGTGAGCAGAAGGACGATTCACTGAACGATGGCGACTTGGGAGAATCGTCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAAGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGTTTGTGGTACCGGAGATTGCCAGCCAGATCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTACGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGGGAGACGCTGCGGGAGTATGTTACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTACTTCCTCACCGGTCTAGCCGATGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAAGACAAAGGATCCTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCAACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTTCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTACTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAAGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCGGGACTAGCTCAGCAGAGAAAAAGGAGGAGCGAAAGCGTTCTAGAACGCCACCTCGGCGCACCGACCGACCTGCGGTCATCAACACCATTTTTGGAGGACCAAGCGGGGGTCAATCCGGGCATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGACGCGAGGTGTGCGTCATCAGGGAGCACGGGCCGACCTGCCCAATCACCTTCGACGGTGTAGACTTGGAGGAGGTACATCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTTAGGAGAGTACTGGTAGACGGGGGCGCATCCGCTAACGTCCTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTC
GCAATTGAGGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGCTGCATCGACTTGCCGGTCACGCTGGGGCAGAACCAAACCCGGATCACTCAAATGGCCGAGTTCGTGGTAGTTGATGGTAGGTCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCCTTTTGGGCCATTCCTTCAACAGTTCATCAAGTTTTGAAGTATCCCACCCCCAGCGGCGTGGGCACGGTCCGAGGAGAGCAAACCGCTTCGAGGGAGTGTTATGCCGCCGCACTCAAAGGCCCATCAGTTTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACCGAGGAGCTCGAGCTTGTTCCTCTGCTTAGCCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAACCAGATCTGATGGAGATCGGCGCTCCAGAACCCTCATGGATGGACCCGATCGCAGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGACAGGCAGCTCGGTTCGTGATCCGAGATGGGGCATTGTACCGACGTGGTTTTTCCCTGCCTCTGTTGAAATGCCTAACCCCTGAAGAGGGCCTAGTAGAGCATTACGAGCCTTCGACAAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGACGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGTAGGATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGCCATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAAGGCCCATTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACGTATTGGCCGATCCGAAAGGAGATGTCCTCGCGCACCCGTGA
Coding sequence (CDS)
ATGGTTCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTGGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGTGGCGGCAGAAGGGCAAGGTCACGACGGCCTGGCAGCGGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGCCCTGCCGCCTGCACACCCGAGGACGTCCAAGGCCACCCGTGGTCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCCGACTCCAACAAGCGAGGATTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCTTCATGGAGGCAATGTATAACGACATGGTGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGGGCGACGCGCATAGACGCATGCGAGCAAAGGGGTTCCCACCTCGGCCCGGCCGAGGAGGAACGTCCCGAAGACAACGGGAGTGAGGGGTACACTCACCAGAAGGGAGACCTCCGCGAGCATCTCAACCGAAAGAGAGGCTCGTCTCTCCGAAAGGGGCAATCGCCATCCCGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCCGCAGGGATAATCACAAGGGAGGAATTCGACCAGCTAAGGGGGGAGCTCGATGCTCAAGTGGAGGCCCTAAAGGCCAAATGTGAGCAGAAGGACGATTCACTGAACGATGGCGACTTGGGAGAATCGTCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAAGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGTTTGTGGTACCGGAGATTGCCAGCCAGATCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTACGACAAAAAGACAGCGACCCATCTCGCCACCATCAGGCAGAAGGAGGGGGAGACGCTGCGGGAGTATGTTACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTCCGATGACTCGGCCATGTGCTACTTCCTCACCGGTCTAGCCGATGAGGCCCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTCCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAAGACAAAGGATCCTTCTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCAACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTTCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGCGTCCGGAGAAACTTCGGGGAGCCCCGGAGAGGCGCAGCAAGGACAAGTACTGCCGCTTCCATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAATTGAAGCGCCAAATTGAAGATCTAATTCAAGACGGCTACTTCAAGAAGTTTGTGGGAAAGCCCGGGACTAGCTCAGCAGAGAAAAAGGAGGAGCGAAAGCGTTCTAGAACGCCACCTCGGCGCACCGACCGACCTGCGGTCATCAACACCATTTTTGGAGGACCAAGCGGGGGTCAATCCGGGCATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGACGCGAGGTGTGCGTCATCAGGGAGCACGGGCCGACCTGCCCAATCACCTTCGACGGTGTAGACTTGGAGGAGGTACATCTGCCCCACAATGATGCCCTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTTAGGAGAGTACTGGTAGACGGGGGCGCATCCGCTAACGTCCTGTCCTTACCGACCTACCTCGCCTTGGGCTGGACGAGGTC
GCAATTGAGGAGAAGCCCGACGCCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGCTGCATCGACTTGCCGGTCACGCTGGGGCAGAACCAAACCCGGATCACTCAAATGGCCGAGTTCGTGGTAGTTGATGGTAGGTCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCCTTTTGGGCCATTCCTTCAACAGTTCATCAAGTTTTGAAGTATCCCACCCCCAGCGGCGTGGGCACGGTCCGAGGAGAGCAAACCGCTTCGAGGGAGTGTTATGCCGCCGCACTCAAAGGCCCATCAGTTTGCGCCCTCGAAACTCTCAGGGATGGGACGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACCGAGGAGCTCGAGCTTGTTCCTCTGCTTAGCCCCGAGAAGCAGTTAGCATCGGCGTACGAGACCGACCTGGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAACCAGATCTGATGGAGATCGGCGCTCCAGAACCCTCATGGATGGACCCGATCGCAGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGAGCGCAGAAAGTTGGCAAGACAGGCAGCTCGGTTCGTGATCCGAGATGGGGCATTGTACCGACGTGGTTTTTCCCTGCCTCTGTTGAAATGCCTAACCCCTGAAGAGGGCCTAGTAGAGCATTACGAGCCTTCGACAAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGACGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGTAGGATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGCCATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGATCCGGCCTGGGAAGGCCCATTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACGTATTGGCCGATCCGAAAGGAGATGTCCTCGCGCACCCGTGA
Protein sequence
MVQPANSTNTTDRRTLAASDAHQREVGAVAAEGQGHDGLAAEPLRRSARITAPALPPAHPRTSKATRGRGGTSKKGARGPAPTPTSEDFDALQREMEAMRFMEAMYNDMVLAAGAGSRSENRATRIDACEQRGSHLGPAEEERPEDNGSEGYTHQKGDLREHLNRKRGSSLRKGQSPSRSHRSSNQQAESSHNPAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCVIREHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSLPTYLALGWTRSQLRRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHSFWAIPSTVHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALETLRDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILEPDLMEIGAPEPSWMDPIADFIRGNSPQDPKERRKLARQAARFVIRDGALYRRGFSLPLLKCLTPEEGLVEHYEPSTNEEELLLNLDLLEERRAMAQLRLAEYQGRMARHYNARVRPRAFQVGHLVLRRVQTHVGALDPAWEGPFEIKGIVRPGTYVLADPKGDVLAHP
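The protein sequence above should be the conceptual translation of the coding sequence (CDS). As a minimal sketch, assuming the standard nuclear genetic code applies to this gene, the check below translates the first and last few codons of the CDS and compares them with the two ends of the protein. The function name `translate` and the short excerpts are illustrative choices, not part of the database record.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Excerpts copied from the CDS above: the first 30 and the last 21 nucleotides.
cds_start = "ATGGTTCAACCAGCGAATTCGACCAATACG"
cds_end = "GATGTCCTCGCGCACCCGTGA"

print(translate(cds_start))
print(translate(cds_end))
```

Running the same function over the full CDS and comparing the result with the listed protein is a quick way to confirm that the two records agree.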
Homology
BLAST of Moc03g20370 vs. NCBI nr
Match:
XP_022137317.1 (uncharacterized protein LOC111008813 [Momordica charantia])
HSP 1 Score: 961.8 bits (2485), Expect = 4.5e-276
Identity = 491/528 (92.99%), Positives = 506/528 (95.83%), Query Frame = 0
Query: 187 QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVL 246
+AESS N PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+ LNDGDLGES FTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 247 EAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR 306
EAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 307 RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 366
RLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 367 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 426
HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 427 GRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIE 486
GRGRSGKD E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPI EILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 487 DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV 546
+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 547 GKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCVI 606
GKP TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELARAARREVC+I
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 607 REHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSLPTYLAL 666
RE PTCPITFDG DLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SAN+LSLPTYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 667 GWTRSQLRRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMAEFV 711
GWTRSQL++SPTPLVGFSGESVIPEG IDLPVTLGQ+QT++TQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
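The identity and positives percentages in each HSP header are the match counts divided by the alignment length. As a small sketch, the helper below (the name `percent` is hypothetical) reproduces the figures reported for the first two hits, assuming the values are rounded to two decimal places.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the HSP headers of the first two hits.
print(percent(491, 528))  # identity of the first hit
print(percent(506, 528))  # positives of the first hit
print(percent(513, 671))  # identity of the second hit
```

Note that the denominator is the alignment length including gaps, not the length of the query protein, which is why a short partial hit can show a higher percentage than a longer one.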
BLAST of Moc03g20370 vs. NCBI nr
Match:
XP_022150760.1 (uncharacterized protein LOC111018823 [Momordica charantia])
HSP 1 Score: 952.2 bits (2460), Expect = 3.6e-273
Identity = 513/671 (76.45%), Positives = 546/671 (81.37%), Query Frame = 0
Query: 183 SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFT 242
SSNQQAESSHNPA G+ITREEFDQLRG+L+AQVEALKAKCEQK+ LNDGDLGES FT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 243 SDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 302
SDVLE APTVK YDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 303 LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQ 362
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 363 LKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP 422
LKVA SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 423 ERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILT 482
ER I RGRSGKDE+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPI EILT
Sbjct: 242 ERGIDRGRSGKDEKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILT 301
Query: 483 NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFK 542
NIE+SGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFK
Sbjct: 302 NIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFK 361
Query: 543 KFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREV 602
KFVGKP TSSAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAARREV
Sbjct: 362 KFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREV 421
Query: 603 CVIREHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSLPTY 662
C+IRE PTCPITFD DLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SAN++SL TY
Sbjct: 422 CIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTY 481
Query: 663 LALGWTRSQLRRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYN 722
LALGWTRSQL++S TPLVGFS ESVIPEGCIDLPVTLG +QT++TQMAEFVV+DGRSAYN
Sbjct: 482 LALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYN 541
Query: 723 AIFGRPIIHSFWAIPSTVHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALETL- 782
AIFGRPIIHSF AIPST+HQVLKY TP+GVG VRGEQ ASRECYA+ALKG SVCALETL
Sbjct: 542 AIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETLV 601
Query: 783 -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE 842
RDGTLEF+A+LPR+EFAAPTEELELVPLL + +E +L + +D+
Sbjct: 602 SRDGTLEFKANLPRREFAAPTEELELVPLLRYKYNENIDHEQELDEKSSLNKIDD----- 605
Query: 843 PDLMEIGAPEP 849
D+ G PEP
Sbjct: 662 -DIGVEGMPEP 605
BLAST of Moc03g20370 vs. NCBI nr
Match:
XP_022152854.1 (uncharacterized protein LOC111020479 [Momordica charantia])
HSP 1 Score: 940.3 bits (2429), Expect = 1.4e-269
Identity = 512/786 (65.14%), Positives = 571/786 (72.65%), Query Frame = 0
Query: 1 MVQPANSTNTTDRRTLAASDAHQREVGAVAAEGQGHDGLAAEPLRRSARITAPALPPAHP 60
MVQPANSTNT DRR LAA+ HQREVGA EGQGH+ L EPL RSARIT P LPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARGPAPTPTSEDFDALQREMEAMRFMEAMYNDMVLAAGAGSRSE 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 NRATRIDACEQRGSHLGPAEEERPEDNGSEGYTHQKGDLREHLNRKRGSSLRKGQSPSRS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 HRSSNQQAESSHNP--AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSF 240
AESS+NP G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE SF
Sbjct: 181 -------AESSYNPITPGVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLGELSF 240
Query: 241 TSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSA 300
+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSA
Sbjct: 241 SSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIALTGSA 300
Query: 301 RLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEE 360
RLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EE
Sbjct: 301 RLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTRFPEE 360
Query: 361 QLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGR 420
QLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDGQELLRTKTGR
Sbjct: 361 QLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRTKTGR 420
Query: 421 PERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFE 480
PE+ I +GR+GKD+ +AD KS+DKG S SS R +YRR+ + +SRPYE +TPTTIPIFE
Sbjct: 421 PEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTIPIFE 480
Query: 481 ILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDG 540
ILTNIE++GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDG
Sbjct: 481 ILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDLIQDG 540
Query: 541 YFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAAR 600
YFKKFVGKP ++S EKKEERKR RTPPRR DRPAVIN K+KELAR AR
Sbjct: 541 YFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKELAREAR 600
Query: 601 REVCVIREHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSL 660
REVC+IRE PT I F+ DLE VHLPHNDALVIAPLID V+VRR+LVDGGASAN+LSL
Sbjct: 601 REVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASANILSL 650
Query: 661 PTYLALGWTRSQLRRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRS 720
TYLALGWTRSQL++SPTPLVGFSGES+ EGCIDLPV++ Q+ T++TQMAEFVV+DGRS
Sbjct: 661 STYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVIDGRS 650
Query: 721 AYNAIFGRPIIHSFWAIPSTVHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALE 780
AYNAIFGRPIIHSF A+PST+HQVLKY T +GVGTVRGE SRECYA+ K SVCALE
Sbjct: 721 AYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSSVCALE 650
BLAST of Moc03g20370 vs. NCBI nr
Match:
XP_022150613.1 (uncharacterized protein LOC111018708, partial [Momordica charantia])
HSP 1 Score: 792.7 bits (2046), Expect = 3.6e-225
Identity = 402/422 (95.26%), Positives = 409/422 (96.92%), Query Frame = 0
Query: 224 KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD 283
KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDF AASD
Sbjct: 44 KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFHAASD 103
Query: 284 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQK 343
AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSR Y KKT THLATIRQK
Sbjct: 104 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRQYGKKTETHLATIRQK 163
Query: 344 EGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAK 403
EG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAK
Sbjct: 164 EGGTLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPTTFAEVLQKAK 223
Query: 404 KVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSR 463
KVIDGQELLRTKTGRP+RKIGRGRSGKD ERADPKSKDKGSFSSGRAEYRRAE+GPT+SR
Sbjct: 224 KVIDGQELLRTKTGRPDRKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGPTKSR 283
Query: 464 PYERFTPTTIPIFEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 523
PYERFTPTTIPI EILTNIE+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Sbjct: 284 PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 343
Query: 524 WELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 583
WELKRQIEDLIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Sbjct: 344 WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 403
Query: 584 QSGHKRKELARAARREVCVIREHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRR 643
QSGHKRKELARAARREVC+IRE GPTCPITFDG D EEVHLPHNDA VIAPLIDHVVVRR
Sbjct: 404 QSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPHNDARVIAPLIDHVVVRR 463
Query: 644 VL 645
VL
Sbjct: 464 VL 465
BLAST of Moc03g20370 vs. NCBI nr
Match:
XP_022152110.1 (uncharacterized protein LOC111019899 [Momordica charantia])
HSP 1 Score: 783.1 bits (2021), Expect = 2.9e-222
Identity = 401/446 (89.91%), Positives = 417/446 (93.50%), Query Frame = 0
Query: 371 MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGK 430
MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT KIG+GRSGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRT-------KIGQGRSGK 60
Query: 431 D-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEDSGMEKL 490
D E DPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTPTTIPI EILTNIE+SGMEKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 491 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSS 550
LKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKP TSS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 551 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCVIREHGPTC 610
AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRK+LARAARREVC+IRE PTC
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 611 PITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSLPTYLALGWTRSQL 670
PITFD DL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASAN+LSLPTYLALGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 671 RRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHS 730
++SPTPLVGFSGESV+PEGCIDLPVTLGQ+QTR+TQMAEFVVVDGRSAYNAIFGRPIIHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 731 FWAIPSTVHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALETL--RDGTLEFEA 790
F AIPST+HQVLKY TP+GVGTVRGEQTASRECYA+ LKG SVCALETL RDGTLEFEA
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 791 DLPRKEFAAPTEELELVPLLSPEKQL 814
DLP +EFAAP EELELVPLLS EKQ+
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQV 439
BLAST of Moc03g20370 vs. ExPASy TrEMBL
Match:
A0A6J1C7X5 (uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008813 PE=4 SV=1)
HSP 1 Score: 961.8 bits (2485), Expect = 2.2e-276
Identity = 491/528 (92.99%), Positives = 506/528 (95.83%), Query Frame = 0
Query: 187 QAESSHN---PAGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFTSDVL 246
+AESS N PAG+ITREEFDQLRG+LDAQVEALKAKCEQK+ LNDGDLGES FTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 247 EAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR 306
EAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 307 RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 366
RLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 367 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 426
HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 427 GRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIE 486
GRGRSGKD E ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPI EILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 487 DSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFV 546
+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 547 GKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCVI 606
GKP TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELARAARREVC+I
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 607 REHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSLPTYLAL 666
RE PTCPITFDG DLEEVHLPHNDALVIAPLIDHVVV RVLVDGG SAN+LSLPTYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 667 GWTRSQLRRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMAEFV 711
GWTRSQL++SPTPLVGFSGESVIPEG IDLPVTLGQ+QT++TQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc03g20370 vs. ExPASy TrEMBL
Match:
A0A6J1D9E1 (uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018823 PE=4 SV=1)
HSP 1 Score: 952.2 bits (2460), Expect = 1.7e-273
Identity = 513/671 (76.45%), Positives = 546/671 (81.37%), Query Frame = 0
Query: 183 SSNQQAESSHNPA---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSFT 242
SSNQQAESSHNPA G+ITREEFDQLRG+L+AQVEALKAKCEQK+ LNDGDLGES FT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 243 SDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 302
SDVLE APTVK YDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 303 LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQ 362
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 363 LKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP 422
LKVA SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 423 ERKIGRGRSGKDERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILT 482
ER I RGRSGKDE+AD KSKDKGSFSSGRAE+RRA NGPTRSRPYERFTPTTIPI EILT
Sbjct: 242 ERGIDRGRSGKDEKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILT 301
Query: 483 NIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFK 542
NIE+SGMEKLLKRPEKLRGAPERR+KDKYCRFHREH HNTSD WELKRQIEDLIQD YFK
Sbjct: 302 NIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFK 361
Query: 543 KFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREV 602
KFVGKP TSSAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAARREV
Sbjct: 362 KFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREV 421
Query: 603 CVIREHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSLPTY 662
C+IRE PTCPITFD DLEEVHLPHNDALVIAPLIDHVVVRRVLVD G SAN++SL TY
Sbjct: 422 CIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTY 481
Query: 663 LALGWTRSQLRRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYN 722
LALGWTRSQL++S TPLVGFS ESVIPEGCIDLPVTLG +QT++TQMAEFVV+DGRSAYN
Sbjct: 482 LALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYN 541
Query: 723 AIFGRPIIHSFWAIPSTVHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALETL- 782
AIFGRPIIHSF AIPST+HQVLKY TP+GVG VRGEQ ASRECYA+ALKG SVCALETL
Sbjct: 542 AIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETLV 601
Query: 783 -RDGTLEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSILE 842
RDGTLEF+A+LPR+EFAAPTEELELVPLL + +E +L + +D+
Sbjct: 602 SRDGTLEFKANLPRREFAAPTEELELVPLLRYKYNENIDHEQELDEKSSLNKIDD----- 605
Query: 843 PDLMEIGAPEP 849
D+ G PEP
Sbjct: 662 -DIGVEGMPEP 605
BLAST of Moc03g20370 vs. ExPASy TrEMBL
Match:
A0A6J1DHB3 (uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020479 PE=4 SV=1)
HSP 1 Score: 940.3 bits (2429), Expect = 6.8e-270
Identity = 512/786 (65.14%), Positives = 571/786 (72.65%), Query Frame = 0
Query: 1 MVQPANSTNTTDRRTLAASDAHQREVGAVAAEGQGHDGLAAEPLRRSARITAPALPPAHP 60
MVQPANSTNT DRR LAA+ HQREVGA EGQGH+ L EPL RSARIT P LPPAHP
Sbjct: 1 MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60
Query: 61 RTSKATRGRGGTSKKGARGPAPTPTSEDFDALQREMEAMRFMEAMYNDMVLAAGAGSRSE 120
+ SK
Sbjct: 61 KPSK-------------------------------------------------------- 120
Query: 121 NRATRIDACEQRGSHLGPAEEERPEDNGSEGYTHQKGDLREHLNRKRGSSLRKGQSPSRS 180
Sbjct: 121 ------------------------------------------------------------ 180
Query: 181 HRSSNQQAESSHNP--AGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESSF 240
AESS+NP G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE SF
Sbjct: 181 -------AESSYNPITPGVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLGELSF 240
Query: 241 TSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSA 300
+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQAA+DAIKC AFQIALTGSA
Sbjct: 241 SSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIALTGSA 300
Query: 301 RLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEE 360
RLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATIRQKEGETLREYVTRF EE
Sbjct: 301 RLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTRFPEE 360
Query: 361 QLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGR 420
QLKVAHCSDDSAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDGQELLRTKTGR
Sbjct: 361 QLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRTKTGR 420
Query: 421 PERKIGRGRSGKDE-RADPKSKDKG-SFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFE 480
PE+ I +GR+GKD+ +AD KS+DKG S SS R +YRR+ + +SRPYE +TPTTIPIFE
Sbjct: 421 PEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTIPIFE 480
Query: 481 ILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDG 540
ILTNIE++GMEKLLKRPEKLRG PE+R+ DKYCRFHR+HGHNTS+ WELKRQIEDLIQDG
Sbjct: 481 ILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDLIQDG 540
Query: 541 YFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAAR 600
YFKKFVGKP ++S EKKEERKR RTPPRR DRPAVIN K+KELAR AR
Sbjct: 541 YFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKELAREAR 600
Query: 601 REVCVIREHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSL 660
REVC+IRE PT I F+ DLE VHLPHNDALVIAPLID V+VRR+LVDGGASAN+LSL
Sbjct: 601 REVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASANILSL 650
Query: 661 PTYLALGWTRSQLRRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRS 720
TYLALGWTRSQL++SPTPLVGFSGES+ EGCIDLPV++ Q+ T++TQMAEFVV+DGRS
Sbjct: 661 STYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVIDGRS 650
Query: 721 AYNAIFGRPIIHSFWAIPSTVHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALE 780
AYNAIFGRPIIHSF A+PST+HQVLKY T +GVGTVRGE SRECYA+ K SVCALE
Sbjct: 721 AYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSSVCALE 650
BLAST of Moc03g20370 vs. ExPASy TrEMBL
Match:
A0A6J1D9W7 (uncharacterized protein LOC111018708 OS=Momordica charantia OX=3673 GN=LOC111018708 PE=4 SV=1)
HSP 1 Score: 792.7 bits (2046), Expect = 1.8e-225
Identity = 402/422 (95.26%), Positives = 409/422 (96.92%), Query Frame = 0
Query: 224 KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASD 283
KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDF AASD
Sbjct: 44 KDDSLNDGDLGESSFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFHAASD 103
Query: 284 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQK 343
AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSR Y KKT THLATIRQK
Sbjct: 104 AIKCRAFQIALTGSARLWYRRLPARSISTYSQLRREFLAQFSSRQYGKKTETHLATIRQK 163
Query: 344 EGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAK 403
EG TLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGE+AP TFAEVLQKAK
Sbjct: 164 EGGTLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTVKLGEKAPTTFAEVLQKAK 223
Query: 404 KVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFSSGRAEYRRAENGPTRSR 463
KVIDGQELLRTKTGRP+RKIGRGRSGKD ERADPKSKDKGSFSSGRAEYRRAE+GPT+SR
Sbjct: 224 KVIDGQELLRTKTGRPDRKIGRGRSGKDVERADPKSKDKGSFSSGRAEYRRAESGPTKSR 283
Query: 464 PYERFTPTTIPIFEILTNIEDSGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 523
PYERFTPTTIPI EILTNIE+SGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC
Sbjct: 284 PYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDC 343
Query: 524 WELKRQIEDLIQDGYFKKFVGKPGTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 583
WELKRQIEDLIQDGYFKKFVGKP TSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG
Sbjct: 344 WELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGG 403
Query: 584 QSGHKRKELARAARREVCVIREHGPTCPITFDGVDLEEVHLPHNDALVIAPLIDHVVVRR 643
QSGHKRKELARAARREVC+IRE GPTCPITFDG D EEVHLPHNDA VIAPLIDHVVVRR
Sbjct: 404 QSGHKRKELARAARREVCIIREQGPTCPITFDGADSEEVHLPHNDARVIAPLIDHVVVRR 463
Query: 644 VL 645
VL
Sbjct: 464 VL 465
BLAST of Moc03g20370 vs. ExPASy TrEMBL
Match:
A0A6J1DD03 (uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019899 PE=4 SV=1)
HSP 1 Score: 783.1 bits (2021), Expect = 1.4e-222
Identity = 401/446 (89.91%), Positives = 417/446 (93.50%), Query Frame = 0
Query: 371 MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGK 430
MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT KIG+GRSGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRT-------KIGQGRSGK 60
Query: 431 D-ERADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPIFEILTNIEDSGMEKL 490
D E DPKSKDKGSFS+GRAEYRRAENGPTRSRPYERFTPTTIPI EILTNIE+SGMEKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 491 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPGTSS 550
LKRPEKLRGAPERRSKDKYCRFHREHGHNTSD WELK QIEDLIQDGYFKKFVGKP TSS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 551 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARREVCVIREHGPTC 610
AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRK+LARAARREVC+IRE PTC
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 611 PITFDGVDLEEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANVLSLPTYLALGWTRSQL 670
PITFD DL EVHLPHNDALVIAPLIDHVVVRRVLVDGGASAN+LSLPTYLALGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 671 RRSPTPLVGFSGESVIPEGCIDLPVTLGQNQTRITQMAEFVVVDGRSAYNAIFGRPIIHS 730
++SPTPLVGFSGESV+PEGCIDLPVTLGQ+QTR+TQMAEFVVVDGRSAYNAIFGRPIIHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 731 FWAIPSTVHQVLKYPTPSGVGTVRGEQTASRECYAAALKGPSVCALETL--RDGTLEFEA 790
F AIPST+HQVLKY TP+GVGTVRGEQTASRECYA+ LKG SVCALETL RDGTLEFEA
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 791 DLPRKEFAAPTEELELVPLLSPEKQL 814
DLP +EFAAP EELELVPLLS EKQ+
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQV 439
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022137317.1 | 4.5e-276 | 92.99 | uncharacterized protein LOC111008813 [Momordica charantia]
XP_022150760.1 | 3.6e-273 | 76.45 | uncharacterized protein LOC111018823 [Momordica charantia]
XP_022152854.1 | 1.4e-269 | 65.14 | uncharacterized protein LOC111020479 [Momordica charantia]
XP_022150613.1 | 3.6e-225 | 95.26 | uncharacterized protein LOC111018708, partial [Momordica charantia]
XP_022152110.1 | 2.9e-222 | 89.91 | uncharacterized protein LOC111019899 [Momordica charantia]
Match Name | E-value | Identity (%) | Description
A0A6J1C7X5 | 2.2e-276 | 92.99 | uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008...
A0A6J1D9E1 | 1.7e-273 | 76.45 | uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018...
A0A6J1DHB3 | 6.8e-270 | 65.14 | uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020...
A0A6J1D9W7 | 1.8e-225 | 95.26 | uncharacterized protein LOC111018708 OS=Momordica charantia OX=3673 GN=LOC111018...
A0A6J1DD03 | 1.4e-222 | 89.91 | uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019...
Match Name | E-value | Identity | Description | |