Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTAAGTATGAGGGCCGAGGTGAACCTGGCCCAGGTCCGACCTACCGGGGAGCTCGGTAGGGGCCAATGTGAGCAACTGTCAGCGGCGTGATTATCGGGCGTAACAGTCGGTCCCGAGATTACCGGGCGTAACCGTCGGTCCCGAGATTAGCGGGACTACCACCCATAAGTAGAAGGCCTCACTCCACGATCAGGTACGATGATTTCTAACCTCGAACTAAACTGGGAATCCGACTTATACTGACTTGATCGTCGGAGTGCTCACCTTTTTGTGCAGGTCCGCGCAAGTGTTCAGATCGGCCCGGAAGCCGAGTTCGAGTTGCAATCTGAAATACGTTGTTGTGCATATTCTTGCATAAACATTTGGCGCTGTCTGTGGGGACGACAATCTAAGTCATCCCAATTCTTTTAAACCAACACGCAAGCGACCATGGTTCAACCAGCGAACTCGACCAATACGACAGATCGAAGGACTTTAGCTGCCAGCGATGCCCACCAAAGGGAGGTCGGAGCAGCAGCGGTAGAGGGGCAAGGTCACGACGGCCTAACAACGGAACCCCTCCGCAGGTCGACACGGATCACCGCGCCTGCCCTACCGCCTATTTTAATTGAGGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAACCCCGACTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCACCATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGTAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGACATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGATTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGCCAGAAGGAGGGTGAAACGCTGCGGGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGCTGCATACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCTCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCAGCCGAGCTGAGTGTCGAAGGGCGGAGAGCGGACCTACCAGGAGTCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACAAACATCGAGGAATCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCTGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCTATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAGTTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAAAAAAAGGAGGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAACACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGTGCGAGGTGTGCATCATCAGGGAGCAGGAGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAAACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGATGGACGAGGTCACAATTGAAGAGAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCGAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCATGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAGGGCTCATCGGTCTGCGCCCTCGAAACGCTCGCCGGTAGGGATGGGGCGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGGTAAGCATAGGAACCAAGCTGGGGGCCACCGACAGAGAGGAGCTAATCCACTTCCTCAGATCCAACTCGGACGTCTTTGCGTGGTCCCATGAGGACATGCCTGGCATTGACCCGCGAATTATGACGCATCGCCTCAGCATAGATCCATCATTCTGACCTGTGAAACAAAAGAGAAGACCTATAAATAAGGAAATGAGTGATGTAATTGTTGAGGAAGTTAACAAACTTTTGAAAGCTGAATACATAAGAGAAATTTCGTATCCCGAGTGGCTCTCCAATGTTGTATTAGTTAAAAAATCTAACGGCAAGTGGAGAATGTGCGTAGATTTTACGAACTTAAATAAGGCATGCCCGAAAGATTGCTTCCCACTGCCGAGGATTGATCAGCTCGTGGACGCCACAGCCGGGCACGAACTGCTCACCTTCATGGACGCCTACTCTGGGTACAACCAAATCAAGATGCATGTCCCAAATGAAGGTCATACCGCTTTCATAACAGACCAAGGTCTGTACTGCTACAAGGTCATGCCCTTCGGATTAAAGAACGCAGGAGCGACCTACCAGTGAATGGTGAACAAAATGTTCGCCAAGCAGATCGGTCGGAATATGGAAGTGTATATAGACGACATGCTTGTCAAGAGCAAGCAGTCTAAGTCGCATCTCTCCAATCTGACCGAAGCCTTCGAGGTTCTGAGGGCATATCGAATGAAGCTCAACCCCGCTAAGTGTGCCTTTGGAGTCTCTTCGGGAAAATTCTTCGGCTTCATGGTGAATAACCGGGGAATCAAGGCCAACCCCGAAAAGATTAAAGCCGTGACCGAGATGGAGGCACCGAAGACGCTGAAGCAGCTTCAGTGTCTCAATGGCAGGATTGCGGCCCTGAGCCGGTTTGTTTCAAGATCGACAGATAAGTGCCTTCCTTTCTTCAAAATCCTACGAAAGAAAGGGCCGTTTCAATGGACAGCGGAGTGCGAGCAAGCGTTTCAGCAATTGAAGAGCTACCTCTGTTCGGCACCTTTGCTCGCCAAGCCCATGCCGGGAGACAAGCTCCAATTGTACTTAGCAGTATCTGACAGTGCCGTCAGCTCGGCAAAACCCGATTTACTACACAAGCAAGGCTATGACCGAAGCCGAGACTAGATACCCTCAGATGGAGAAGTTGGCTCTCGCTTTGGTCACCTCGGCCCGACGGCTCAGACCATACTTTCAAGCCCATACGGTGGTGGTGCTCACTAACTTGCCCCTAAAAAGCATCTTCCATAAGCCGAAAGCTTTTGGACGCCTAATGAAGTGGGCAATAGAGCTAAGTGAGTACGACATCCAGTTCGAACCCAGAACTGCGTTGAAAGGACAAGCAGCGGCAGATTTCATACCCGAGCTCACACCACCTTCCGAGCTGAGCGAGTCCGACCTACCTTGGACAGTCTATGTCGACGGATCCTCCAATGAGAAGGGGTGCGGAGCTGGGGTCCTCTTGCTCGGACCAGGGGGCGAGCGATTTGAGTATGCCTTGCGGTTCGGCTTCCGGACTTCTAACAACGAGGCTGAGTATGAAGCATTTATTGCCGGCCTACGAATCGCTAGAGCATTGGGGGCCTCTTGTGTTAAGGTCTTCAGTGACTCACAGCTGGTTGTGAGCCAGATCAAGGACGAGTACCAAGCCAAAGACACCCGAATGGAGAAATATTTGGGCAAGGTCAGATCGTACCTCGCCCAGTTTCGAACTTACGAAGTAAGCCGGATTCCGCGAGCAGAAAATTCTAATACTGACGCCTTGGCCAAGTTAGCATCGGCGTACGAGACTGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCTTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGTGCGCAGAAAGTTGGCACGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGGGCATTGTACCGACGTGGCTTTTCTCTGCCTCTATTGAGATGCCTAACCCCTAAAGAGGGCCTGTACGTCCTCAGAGAGATCCACGAAGGAGTGTGCGACAATCACTCAGGCGCCCGGTCGCTGTCAGCCAAAGTGATCCGACAAGGATACTATTGGCCGCCCCTCAGCCAGGACGCCAAGAAGTTCGTTAGAACTTGCGACAATTGCCAACGCTACGGAACCATAATCCACCAACCTCCCGAGCTGCTCACCCCCATCTCGGCCCTGTGGCCATTCGCGCAGTGGGGGGTAGATATCATTGGTCCTCTCCCTTTGGGCAAGGGCCAGACCAAGTTCGCTGTGGTTGCTGTGGATTACTTCACCAAGTGGGCCGAGGCCGAGGCGCTCTTCCACATAACGGAATCCAGGGTCACGTCCTTCGTATGGACGAATGTCATATGTCGCTTTGGTATACCGCAGGCCATAGTGACAGACAATGGGAAGCAGTTTGACAACGCCAAGTTCAAAGACTTTTGCAGCAAACTTGGCATAAGTCATCTTAGCTCGTCCCCCGCACATCCGCAAGCAAATGGGCAGGTGGAGGCAGTCAACAAGATCATTAAGCGAGGCATCAAACTTAGACTGGACTCCAAGAAAGGCAGGTGGGCCGAGGAGCTACCAGAGGTTCTATGGTCGTACCGGACCACCCAACGAGAGTCGACGGGTGAGACCCCGTTCTCCCTGGCCTTCGACTCCGAAGCTGTAGTCCCGGTTGAGATCGGCATGCCATCTGACAGAGTAGAGCATTTCGAACCTACGGCAAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTAGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGCTATTATCCCAGAAATGCAAAAAGGATTTTCAATGAATCTGTAAAGACTGTTCCAAAAGAATTATGA
mRNA sequence
ATGCTAAGTATGAGGGCCGAGGTGAACCTGGCCCAGGTCCGCGCAAGTGTTCAGATCGGCCCGGAAGCCGAGTTCGAGTTGCAATCTGAAATACGTTGTTGTGCATATTCTTGCATAAACATTTGGCGCTGTCTGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAACCCCGACTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCACCATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGTAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGACATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGATTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGCCAGAAGGAGGGTGAAACGCTGCGGGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGCTGCATACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCTCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCAGCCGAGCTGAGTGTCGAAGGGCGGAGAGCGGACCTACCAGGAGTCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACAAACATCGAGGAATCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCTGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCTATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAGTTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAAAAAAAGGAGGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAACACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGTGCGAGGTGTGCATCATCAGGGAGCAGGAGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAAACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGATGGACGAGGTCACAATTGAAGAGAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCGAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCATGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAGGGCTCATCGGTCTGCGCCCTCGAAACGCTCGCCGGTAGGGATGGGGCGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACTGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCTTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGTGCGCAGAAAGTTGGCACGGCGGGCAGCTCGAGTAGAGCATTTCGAACCTACGGCAAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTAGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGCTATTATCCCAGAAATGCAAAAAGGATTTTCAATGAATCTGTAAAGACTGTTCCAAAAGAATTATGA
Coding sequence (CDS)
ATGCTAAGTATGAGGGCCGAGGTGAACCTGGCCCAGGTCCGCGCAAGTGTTCAGATCGGCCCGGAAGCCGAGTTCGAGTTGCAATCTGAAATACGTTGTTGTGCATATTCTTGCATAAACATTTGGCGCTGTCTGACGTCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAACCCCGACTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCACCATGGAGGAAATGTATAACGAAATGATGCTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCGTGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACGTCCCGAAGACAACGAGAGCGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCCGAAAAGGGCAGTCACCATCCCGCAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAATCCCGTAGGGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGATGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGACATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGATTACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGCCAGAAGGAGGGTGAAACGCTGCGGGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGCTGCATACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTAGCCGACGAAGCTCTCACGGTGAAACTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCAGCCGAGCTGAGTGTCGAAGGGCGGAGAGCGGACCTACCAGGAGTCGACCTTACGAACGCTTCACCCCGACCACGATTCCAATTTCCGAGATCCTAACAAACATCGAGGAATCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCTGGAGAGGCGCAGCAAGGACAAGTATTGCCGCTTCTATCGGGAGCACGGCCACAACACGTCGGACTGCTGGGAGTTGAAGCGCCAAATTGAGGATCTAATTCAAGATGGCTACTTCAAGAAGTTTGTGGGAAAGCCCAGGACCAGCTCAGCAGAAAAAAAGGAGGAGCGAAAGCGTTCAAGGACGCCACCCCGGCGCACCGACCGACCTGCGGTCATCAACACCATTTTTGGAGGGCCAAGCGGGGGTCAGTCCGGACATAAAAGAAAGGAGTTAGCCCGTGCAGCCAGGTGCGAGGTGTGCATCATCAGGGAGCAGGAGCCGACCTGCCCAATCACCTTCGACGGTGCAGACTTGGAGGAGGTCCACCTGCCCCACAATGATGCACTTGTGATTGCTCCCTTGATTGATCATGTGGTGGTCAGGAGAGTGCTGGTAAACGGGGGCGCATCCGCTAACATCCTGTCCTTACCGACCTACCTCGCCTTGGGATGGACGAGGTCACAATTGAAGAGAAGCCCGACACCGCTGGTTGGGTTCTCTGGAGAATCGGTCATCCCAGAGGGTTGCATCGACTTGCCGGTCACGCTGGGGCAGGACCGAACTCGGGTCACTCAAATGGCCGAGTTCGTGGTAATTGACGGTAGATCGGCCTATAACGCCATCTTTGGGAGACCCATCATCCACTCATTTCGGGCCATTCCCTCGACACTTCATCAAGTTTTGAAGTATTCCACCCCCAATGGCATGGGCACGGTCCGAGGAGAACAGACCGCTTCGAGGGAGTGTTATGCCTCCGCACTCAAGGGCTCATCGGTCTGCGCCCTCGAAACGCTCGCCGGTAGGGATGGGGCGCTCGAGTTCGAGGCCGACCTGCCGAGGAAGGAGTTTGCCGCACCCACTGAGGAGCTCGAGCTTGTTCCTCTGCTTAGTCCCGAGAAGCAGTTAGCATCGGCGTACGAGACTGACCTGGCCAGGTCGGTCCCCGTCGAGATCTTAGATAATCCCTCGATCTCAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCTTCATGGATGGACCCGATCGCGGACTTCATTAGGGGCAACTCACCACAAGACCCCAAGGTGCGCAGAAAGTTGGCACGGCGGGCAGCTCGAGTAGAGCATTTCGAACCTACGGCAAATGAGGAAGAGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAATGCCCGCGTTCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTAGGTGCCCTTGATCCGGCCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACCTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGCTATTATCCCAGAAATGCAAAAAGGATTTTCAATGAATCTGTAAAGACTGTTCCAAAAGAATTATGA
Protein sequence
MLSMRAEVNLAQVRASVQIGPEAEFELQSEIRCCAYSCINIWRCLTSKATRGRGGTSKKGARGPTPTPTSENFDALQREMEAMRTQMRTMEEMYNEMMLAAGAGSRSENRVTRVQRGSHLGPVEEERPEDNESEGYTRQRGDLREHLNRKRGSSLRKGQSPSRRSSNQQAESSHNPVGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQATSDAIKCRAFQIAITGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATISQKEGETLREYVTRFQEEQLKAAYCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSSSRAECRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCALETLAGRDGALEFEADLPRKEFAAPTEELELVPLLSPEKQLASAYETDLARSVPVEILDNPSISEPDLMEIGAPESSWMDPIADFIRGNSPQDPKVRRKLARRAARVEHFEPTANEEELLLNLDLLEERRAMAQLRLAEYQGRMARHYNARVRPRAFQVGHLVLRRVQTHVGALDPAWEGPFEIKGIVRPGTYILADLKGDVLAHPWNAEHLKRYYPRNAKRIFNESVKTVPKEL
Homology
BLAST of Moc11g29330 vs. NCBI nr
Match:
XP_022150760.1 (uncharacterized protein LOC111018823 [Momordica charantia])
HSP 1 Score: 953.7 bits (2464), Expect = 1.2e-273
Identity = 510/630 (80.95%), Postives = 531/630 (84.29%), Query Frame = 0
Query: 165 SSNQQAESSHNPV---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFT 224
SSNQQAESSHNP G+ITREEFDQLRG+L+AQVEALKAKCEQK+ LNDGDLGESPFT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 225 SDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQATSDAIKCRAFQIAITGSAR 284
SDVLE APTVK YDG+KDPKDYVEVFEGLMDFQA SDAIKCRAFQIA+TGSAR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 285 LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATISQKEGETLREYVTRFQEEQ 344
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 345 LKAAYCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP 404
LK A SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 405 ERKIGRGRSGKDERADPKSKDKGSFSSSRAECRRAESGPTRSRPYERFTPTTIPISEILT 464
ER I RGRSGKDE+AD KSKDKGSFSS RAE RRA +GPTRSRPYERFTPTTIPISEILT
Sbjct: 242 ERGIDRGRSGKDEKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILT 301
Query: 465 NIEESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFK 524
NIEESGMEKLLKRPEKLRGA ERR+KDKYCRF+REH HNTSD WELKRQIEDLIQD YFK
Sbjct: 302 NIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFK 361
Query: 525 KFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARCEV 584
KFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAAR EV
Sbjct: 362 KFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREV 421
Query: 585 CIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTY 644
CIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLV+ G SANI+SL TY
Sbjct: 422 CIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTY 481
Query: 645 LALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYN 704
LALGWTRSQLK+S TPLVGFS ESVIPEGCIDLPVTLG D+T+VTQMAEFVVIDGRSAYN
Sbjct: 482 LALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYN 541
Query: 705 AIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCALETLA 764
AIFGRPIIHSFRAIPSTLHQVLKYSTPNG+G VRGEQ ASRECYASALKGSSVCALETL
Sbjct: 542 AIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETLV 570
Query: 765 GRDGALEFEADLPRKEFAAPTEELELVPLL 792
RDG LEF+A+LPR+EFAAPTEELELVPLL
Sbjct: 602 SRDGTLEFKANLPRREFAAPTEELELVPLL 570
BLAST of Moc11g29330 vs. NCBI nr
Match:
XP_022137317.1 (uncharacterized protein LOC111008813 [Momordica charantia])
HSP 1 Score: 947.2 bits (2447), Expect = 1.1e-271
Identity = 488/528 (92.42%), Postives = 503/528 (95.27%), Query Frame = 0
Query: 169 QAESSHN---PVGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVL 228
+AESS N P G+ITREEFDQLRG+LDAQVEALKAKCEQK+ LNDGDLGESPFTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 229 EAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQATSDAIKCRAFQIAITGSARLWYR 288
EAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQA SDAIKCRAF+IA+TGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 289 RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATISQKEGETLREYVTRFQEEQLKAA 348
RLPA SISTYSQLRREFLA FSSRHYDKKTATHLATI QKEGETLREYVTRFQEEQLK A
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 349 YCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 408
+CSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 409 GRGRSGKD-ERADPKSKDKGSFSSSRAECRRAESGPTRSRPYERFTPTTIPISEILTNIE 468
GRGRSGKD E ADPKSKDKGSFSS RAE RRAE+GPTRSRPYERFTPTTIPISEILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 469 ESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFV 528
ESGMEKLLKRPEKLRGA ERRSKDKYCRF+REHGHNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 529 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARCEVCII 588
GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELARAAR EVCII
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 589 REQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLAL 648
REQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLV+GG SANILSLPTYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 649 GWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFV 693
GWTRSQLK+SPTPLVGFSGESVIPEG IDLPVTLGQD+T+VTQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc11g29330 vs. NCBI nr
Match:
XP_022152854.1 (uncharacterized protein LOC111020479 [Momordica charantia])
HSP 1 Score: 897.9 bits (2319), Expect = 7.9e-257
Identity = 470/608 (77.30%), Postives = 521/608 (85.69%), Query Frame = 0
Query: 161 PSRRSSNQQAESSHNPV--GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGES 220
P +AESS+NP+ G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE
Sbjct: 56 PPAHPKPSKAESSYNPITPGVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLGEL 115
Query: 221 PFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQATSDAIKCRAFQIAITG 280
F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQA +DAIKC AFQIA+TG
Sbjct: 116 SFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIALTG 175
Query: 281 SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATISQKEGETLREYVTRFQ 340
SARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATI QKEGETLREYVTRF
Sbjct: 176 SARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTRFP 235
Query: 341 EEQLKAAYCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKT 400
EEQLK A+CSDDSAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDGQELLRTKT
Sbjct: 236 EEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRTKT 295
Query: 401 GRPERKIGRGRSGKDE-RADPKSKDKG-SFSSSRAECRRAESGPTRSRPYERFTPTTIPI 460
GRPE+ I +GR+GKD+ +AD KS+DKG S SSSR + RR+ S +SRPYE +TPTTIPI
Sbjct: 296 GRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTIPI 355
Query: 461 SEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQ 520
EILTNIEE+GMEKLLKRPEKLRG E+R+ DKYCRF+R+HGHNTS+ WELKRQIEDLIQ
Sbjct: 356 FEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDLIQ 415
Query: 521 DGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARA 580
DGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN K+KELAR
Sbjct: 416 DGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKELARE 475
Query: 581 ARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANIL 640
AR EVCIIREQ PT I F+ ADLE VHLPHNDALVIAPLID V+VRR+LV+GGASANIL
Sbjct: 476 ARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASANIL 535
Query: 641 SLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDG 700
SL TYLALGWTRSQLK+SPTPLVGFSGES+ EGCIDLPV++ QD T+VTQMAEFVVIDG
Sbjct: 536 SLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVIDG 595
Query: 701 RSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCA 760
RSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NG+GTVRGE SRECYAS K SSVCA
Sbjct: 596 RSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSSVCA 650
Query: 761 LETLAGRD 765
LE RD
Sbjct: 656 LEEQTIRD 650
BLAST of Moc11g29330 vs. NCBI nr
Match:
XP_022152110.1 (uncharacterized protein LOC111019899 [Momordica charantia])
HSP 1 Score: 785.0 bits (2026), Expect = 7.5e-223
Identity = 405/446 (90.81%), Postives = 419/446 (93.95%), Query Frame = 0
Query: 353 MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGK 412
MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT KIG+GRSGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRT-------KIGQGRSGK 60
Query: 413 D-ERADPKSKDKGSFSSSRAECRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 472
D E DPKSKDKGSFS+ RAE RRAE+GPTRSRPYERFTPTTIPISEILTNIEESGMEKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 473 LKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSS 532
LKRPEKLRGA ERRSKDKYCRF+REHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 533 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARCEVCIIREQEPTC 592
AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRK+LARAAR EVCIIREQ PTC
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 593 PITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQL 652
PITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLV+GGASANILSLPTYLALGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 653 KRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHS 712
K+SPTPLVGFSGESV+PEGCIDLPVTLGQD+TRVTQMAEFVV+DGRSAYNAIFGRPIIHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 713 FRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCALETLAGRDGALEFEA 772
FRAIPSTLHQVLKYSTPNG+GTVRGEQTASRECYAS LKG+SVCALETL RDG LEFEA
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 773 DLPRKEFAAPTEELELVPLLSPEKQL 798
DLP +EFAAP EELELVPLLS EKQ+
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQV 439
BLAST of Moc11g29330 vs. NCBI nr
Match:
XP_022158414.1 (uncharacterized protein LOC111024904 [Momordica charantia])
HSP 1 Score: 775.0 bits (2000), Expect = 7.7e-220
Identity = 410/544 (75.37%), Postives = 451/544 (82.90%), Query Frame = 0
Query: 258 MDFQATSDAIKCRAFQIAITGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAT 317
MDFQA +DAIKCRAFQIA+TGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTAT
Sbjct: 1 MDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSWHYDRKTAT 60
Query: 318 HLATISQKEGETLREYVTRFQEEQLKAAYCSDDSAMCYFLTGLADEALTVKLGEEAPATF 377
HLATI QKE ETLREYVTRFQEEQLK A+CSDDSAMCYFLT LADE LTVKLGEEAP TF
Sbjct: 61 HLATIRQKERETLREYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETLTVKLGEEAPTTF 120
Query: 378 AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDER-ADPKSKDKGSFSS-SRAECRR 437
EVLQKAKKVIDGQELLRTKTGRPE++I + + +++R AD KS+DKGS SS SR E RR
Sbjct: 121 VEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRR 180
Query: 438 AESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFYR 497
ESGP+RSRPYER+T +TIPISEILTNIEESGMEKLLKRPEKLRG LE+R+K+KYCRF+R
Sbjct: 181 LESGPSRSRPYERYTSSTIPISEILTNIEESGMEKLLKRPEKLRGDLEKRNKEKYCRFHR 240
Query: 498 EHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN 557
+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Sbjct: 241 DHGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVIN 300
Query: 558 TIFGGPSGGQSGHKRKELARAARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAP 617
TIFGGP+GGQSG+KRKELAR AR EVCIIRE +PTC ITF ADLE VHLPHNDALVIA
Sbjct: 301 TIFGGPNGGQSGNKRKELAREARREVCIIREHKPTCSITFGDADLEGVHLPHNDALVIAS 360
Query: 618 LIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLP 677
LIDH +VRRVL++G GCIDLP
Sbjct: 361 LIDHDLVRRVLIDG----------------------------------------GCIDLP 420
Query: 678 VTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVR 737
VT+GQD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN +G VR
Sbjct: 421 VTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNEVGMVR 480
Query: 738 GEQTASRECYASALKGSSVCALETLAGRDGALEFEADLP---RKEFAAPTEELELVPLLS 797
GEQ SRECYASALKGS+VCALE R E EADLP +++F PTEELELVPLLS
Sbjct: 481 GEQKTSRECYASALKGSAVCALEEQTNRGKLQESEADLPKEGKRQFPPPTEELELVPLLS 504
BLAST of Moc11g29330 vs. ExPASy TrEMBL
Match:
A0A6J1D9E1 (uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018823 PE=4 SV=1)
HSP 1 Score: 953.7 bits (2464), Expect = 5.9e-274
Identity = 510/630 (80.95%), Postives = 531/630 (84.29%), Query Frame = 0
Query: 165 SSNQQAESSHNPV---GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFT 224
SSNQQAESSHNP G+ITREEFDQLRG+L+AQVEALKAKCEQK+ LNDGDLGESPFT
Sbjct: 2 SSNQQAESSHNPATPDGVITREEFDQLRGKLNAQVEALKAKCEQKEGPLNDGDLGESPFT 61
Query: 225 SDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQATSDAIKCRAFQIAITGSAR 284
SDVLE APTVK YDG+KDPKDYVEVFEGLMDFQA SDAIKCRAFQIA+TGSAR
Sbjct: 62 SDVLE--------APTVKSYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAR 121
Query: 285 LWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATISQKEGETLREYVTRFQEEQ 344
LW FQE+Q
Sbjct: 122 LW-----------------------------------------------------FQEDQ 181
Query: 345 LKAAYCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRP 404
LK A SDDSAMCYFLTGLADEALTVKLG+EAPATFAEVLQKAKKVIDGQELLRTKTGRP
Sbjct: 182 LKVAQSSDDSAMCYFLTGLADEALTVKLGKEAPATFAEVLQKAKKVIDGQELLRTKTGRP 241
Query: 405 ERKIGRGRSGKDERADPKSKDKGSFSSSRAECRRAESGPTRSRPYERFTPTTIPISEILT 464
ER I RGRSGKDE+AD KSKDKGSFSS RAE RRA +GPTRSRPYERFTPTTIPISEILT
Sbjct: 242 ERGIDRGRSGKDEKADLKSKDKGSFSSGRAEFRRAVNGPTRSRPYERFTPTTIPISEILT 301
Query: 465 NIEESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFK 524
NIEESGMEKLLKRPEKLRGA ERR+KDKYCRF+REH HNTSD WELKRQIEDLIQD YFK
Sbjct: 302 NIEESGMEKLLKRPEKLRGAPERRNKDKYCRFHREHDHNTSDRWELKRQIEDLIQDDYFK 361
Query: 525 KFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARCEV 584
KFVGKPRTSSAEKKEERK SRTP RR DRPAVINTIFGGPSGGQSGHKRKELARAAR EV
Sbjct: 362 KFVGKPRTSSAEKKEERKLSRTPLRRIDRPAVINTIFGGPSGGQSGHKRKELARAARREV 421
Query: 585 CIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTY 644
CIIREQ PTCPITFD ADLEEVHLPHNDALVIAPLIDHVVVRRVLV+ G SANI+SL TY
Sbjct: 422 CIIREQRPTCPITFDSADLEEVHLPHNDALVIAPLIDHVVVRRVLVDEGVSANIVSLLTY 481
Query: 645 LALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYN 704
LALGWTRSQLK+S TPLVGFS ESVIPEGCIDLPVTLG D+T+VTQMAEFVVIDGRSAYN
Sbjct: 482 LALGWTRSQLKKSTTPLVGFSRESVIPEGCIDLPVTLGHDQTQVTQMAEFVVIDGRSAYN 541
Query: 705 AIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCALETLA 764
AIFGRPIIHSFRAIPSTLHQVLKYSTPNG+G VRGEQ ASRECYASALKGSSVCALETL
Sbjct: 542 AIFGRPIIHSFRAIPSTLHQVLKYSTPNGVGMVRGEQIASRECYASALKGSSVCALETLV 570
Query: 765 GRDGALEFEADLPRKEFAAPTEELELVPLL 792
RDG LEF+A+LPR+EFAAPTEELELVPLL
Sbjct: 602 SRDGTLEFKANLPRREFAAPTEELELVPLL 570
BLAST of Moc11g29330 vs. ExPASy TrEMBL
Match:
A0A6J1C7X5 (uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008813 PE=4 SV=1)
HSP 1 Score: 947.2 bits (2447), Expect = 5.5e-272
Identity = 488/528 (92.42%), Postives = 503/528 (95.27%), Query Frame = 0
Query: 169 QAESSHN---PVGIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVL 228
+AESS N P G+ITREEFDQLRG+LDAQVEALKAKCEQK+ LNDGDLGESPFTSDVL
Sbjct: 3 KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62
Query: 229 EAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQATSDAIKCRAFQIAITGSARLWYR 288
EAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQA SDAIKCRAF+IA+TGSARLWYR
Sbjct: 63 EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122
Query: 289 RLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATISQKEGETLREYVTRFQEEQLKAA 348
RLPA SISTYSQLRREFLA FSSRHYDKKTATHLATI QKEGETLREYVTRFQEEQLK A
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182
Query: 349 YCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 408
+CSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242
Query: 409 GRGRSGKD-ERADPKSKDKGSFSSSRAECRRAESGPTRSRPYERFTPTTIPISEILTNIE 468
GRGRSGKD E ADPKSKDKGSFSS RAE RRAE+GPTRSRPYERFTPTTIPISEILTNIE
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIE 302
Query: 469 ESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFV 528
ESGMEKLLKRPEKLRGA ERRSKDKYCRF+REHGHNTSD WELKRQIE+LIQDGYFKKFV
Sbjct: 303 ESGMEKLLKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKRQIENLIQDGYFKKFV 362
Query: 529 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARCEVCII 588
GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSG KRKELARAAR EVCII
Sbjct: 363 GKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGRKRKELARAARREVCII 422
Query: 589 REQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLAL 648
REQ PTCPITFDGADLEEVHLPHNDALVIAPLIDHVVV RVLV+GG SANILSLPTYLAL
Sbjct: 423 REQRPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVXRVLVDGGTSANILSLPTYLAL 482
Query: 649 GWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFV 693
GWTRSQLK+SPTPLVGFSGESVIPEG IDLPVTLGQD+T+VTQMAEFV
Sbjct: 483 GWTRSQLKKSPTPLVGFSGESVIPEGFIDLPVTLGQDQTQVTQMAEFV 530
BLAST of Moc11g29330 vs. ExPASy TrEMBL
Match:
A0A6J1DHB3 (uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020479 PE=4 SV=1)
HSP 1 Score: 897.9 bits (2319), Expect = 3.8e-257
Identity = 470/608 (77.30%), Postives = 521/608 (85.69%), Query Frame = 0
Query: 161 PSRRSSNQQAESSHNPV--GIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGES 220
P +AESS+NP+ G+ITREEFDQL+ + DAQVEALKA+CE+K+ S +DGDLGE
Sbjct: 56 PPAHPKPSKAESSYNPITPGVITREEFDQLKSKFDAQVEALKARCEKKESSFDDGDLGEL 115
Query: 221 PFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQATSDAIKCRAFQIAITG 280
F+SD+LEA IPPKFK PT+KPYDG+KDPKDYVEVFE LMDFQA +DAIKC AFQIA+TG
Sbjct: 116 SFSSDILEALIPPKFKTPTMKPYDGSKDPKDYVEVFESLMDFQAATDAIKCCAFQIALTG 175
Query: 281 SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATISQKEGETLREYVTRFQ 340
SARLWYRRLPAR ISTYSQLR+EF++QFSSRHYD+KT THLATI QKEGETLREYVTRF
Sbjct: 176 SARLWYRRLPARLISTYSQLRKEFISQFSSRHYDRKTPTHLATIRQKEGETLREYVTRFP 235
Query: 341 EEQLKAAYCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKT 400
EEQLK A+CSDDSAMCYFLTGLADE LTVKL EEAPATFAEVLQK KKVIDGQELLRTKT
Sbjct: 236 EEQLKVAHCSDDSAMCYFLTGLADETLTVKLREEAPATFAEVLQKTKKVIDGQELLRTKT 295
Query: 401 GRPERKIGRGRSGKDE-RADPKSKDKG-SFSSSRAECRRAESGPTRSRPYERFTPTTIPI 460
GRPE+ I +GR+GKD+ +AD KS+DKG S SSSR + RR+ S +SRPYE +TPTTIPI
Sbjct: 296 GRPEKNIDQGRAGKDKGKADSKSRDKGPSSSSSRVDYRRSNSSHNQSRPYEHYTPTTIPI 355
Query: 461 SEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQ 520
EILTNIEE+GMEKLLKRPEKLRG E+R+ DKYCRF+R+HGHNTS+ WELKRQIEDLIQ
Sbjct: 356 FEILTNIEETGMEKLLKRPEKLRGDPEKRNTDKYCRFHRDHGHNTSNYWELKRQIEDLIQ 415
Query: 521 DGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARA 580
DGYFKKFVGKPR++S EKKEERKR RTPPRR DRPAVIN K+KELAR
Sbjct: 416 DGYFKKFVGKPRSNSVEKKEERKRLRTPPRRDDRPAVIN-------------KKKELARE 475
Query: 581 ARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANIL 640
AR EVCIIREQ PT I F+ ADLE VHLPHNDALVIAPLID V+VRR+LV+GGASANIL
Sbjct: 476 ARREVCIIREQRPTSSIAFNHADLEGVHLPHNDALVIAPLIDLVLVRRILVDGGASANIL 535
Query: 641 SLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDG 700
SL TYLALGWTRSQLK+SPTPLVGFSGES+ EGCIDLPV++ QD T+VTQMAEFVVIDG
Sbjct: 536 SLSTYLALGWTRSQLKKSPTPLVGFSGESISLEGCIDLPVSIRQDDTQVTQMAEFVVIDG 595
Query: 701 RSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCA 760
RSAYNAIFGRPIIHSFRA+PSTLHQVLKYST NG+GTVRGE SRECYAS K SSVCA
Sbjct: 596 RSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTLNGVGTVRGELKTSRECYASVPKRSSVCA 650
Query: 761 LETLAGRD 765
LE RD
Sbjct: 656 LEEQTIRD 650
BLAST of Moc11g29330 vs. ExPASy TrEMBL
Match:
A0A6J1DD03 (uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019899 PE=4 SV=1)
HSP 1 Score: 785.0 bits (2026), Expect = 3.6e-223
Identity = 405/446 (90.81%), Postives = 419/446 (93.95%), Query Frame = 0
Query: 353 MCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGK 412
MCYFLTGLADEALTVKL EEAPATFAEVLQKAKKVIDGQELLRT KIG+GRSGK
Sbjct: 1 MCYFLTGLADEALTVKLVEEAPATFAEVLQKAKKVIDGQELLRT-------KIGQGRSGK 60
Query: 413 D-ERADPKSKDKGSFSSSRAECRRAESGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 472
D E DPKSKDKGSFS+ RAE RRAE+GPTRSRPYERFTPTTIPISEILTNIEESGMEKL
Sbjct: 61 DMENTDPKSKDKGSFSNGRAEYRRAENGPTRSRPYERFTPTTIPISEILTNIEESGMEKL 120
Query: 473 LKRPEKLRGALERRSKDKYCRFYREHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSS 532
LKRPEKLRGA ERRSKDKYCRF+REHGHNTSD WELK QIEDLIQDGYFKKFVGKPRTSS
Sbjct: 121 LKRPEKLRGAPERRSKDKYCRFHREHGHNTSDYWELKSQIEDLIQDGYFKKFVGKPRTSS 180
Query: 533 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKELARAARCEVCIIREQEPTC 592
AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRK+LARAAR EVCIIREQ PTC
Sbjct: 181 AEKKEERKRSRTPPRRTDRPAVINTIFGGPSGGQSGHKRKKLARAARREVCIIREQRPTC 240
Query: 593 PITFDGADLEEVHLPHNDALVIAPLIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQL 652
PITFD ADL EVHLPHNDALVIAPLIDHVVVRRVLV+GGASANILSLPTYLALGWTRSQL
Sbjct: 241 PITFDXADLXEVHLPHNDALVIAPLIDHVVVRRVLVDGGASANILSLPTYLALGWTRSQL 300
Query: 653 KRSPTPLVGFSGESVIPEGCIDLPVTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHS 712
K+SPTPLVGFSGESV+PEGCIDLPVTLGQD+TRVTQMAEFVV+DGRSAYNAIFGRPIIHS
Sbjct: 301 KKSPTPLVGFSGESVVPEGCIDLPVTLGQDQTRVTQMAEFVVVDGRSAYNAIFGRPIIHS 360
Query: 713 FRAIPSTLHQVLKYSTPNGMGTVRGEQTASRECYASALKGSSVCALETLAGRDGALEFEA 772
FRAIPSTLHQVLKYSTPNG+GTVRGEQTASRECYAS LKG+SVCALETL RDG LEFEA
Sbjct: 361 FRAIPSTLHQVLKYSTPNGVGTVRGEQTASRECYASXLKGTSVCALETLTSRDGTLEFEA 420
Query: 773 DLPRKEFAAPTEELELVPLLSPEKQL 798
DLP +EFAAP EELELVPLLS EKQ+
Sbjct: 421 DLPXREFAAPXEELELVPLLSXEKQV 439
BLAST of Moc11g29330 vs. ExPASy TrEMBL
Match:
A0A6J1DZB9 (uncharacterized protein LOC111024904 OS=Momordica charantia OX=3673 GN=LOC111024904 PE=4 SV=1)
HSP 1 Score: 775.0 bits (2000), Expect = 3.7e-220
Identity = 410/544 (75.37%), Postives = 451/544 (82.90%), Query Frame = 0
Query: 258 MDFQATSDAIKCRAFQIAITGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTAT 317
MDFQA +DAIKCRAFQIA+TGSARLWYRRLPARSISTYSQLR+EF++QFSS HYD+KTAT
Sbjct: 1 MDFQAATDAIKCRAFQIALTGSARLWYRRLPARSISTYSQLRKEFISQFSSWHYDRKTAT 60
Query: 318 HLATISQKEGETLREYVTRFQEEQLKAAYCSDDSAMCYFLTGLADEALTVKLGEEAPATF 377
HLATI QKE ETLREYVTRFQEEQLK A+CSDDSAMCYFLT LADE LTVKLGEEAP TF
Sbjct: 61 HLATIRQKERETLREYVTRFQEEQLKVAHCSDDSAMCYFLTSLADETLTVKLGEEAPTTF 120
Query: 378 AEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDER-ADPKSKDKGSFSS-SRAECRR 437
EVLQKAKKVIDGQELLRTKTGRPE++I + + +++R AD KS+DKGS SS SR E RR
Sbjct: 121 VEVLQKAKKVIDGQELLRTKTGRPEKQIDQKKLSQEKRKADSKSRDKGSSSSASRTEYRR 180
Query: 438 AESGPTRSRPYERFTPTTIPISEILTNIEESGMEKLLKRPEKLRGALERRSKDKYCRFYR 497
ESGP+RSRPYER+T +TIPISEILTNIEESGMEKLLKRPEKLRG LE+R+K+KYCRF+R
Sbjct: 181 LESGPSRSRPYERYTSSTIPISEILTNIEESGMEKLLKRPEKLRGDLEKRNKEKYCRFHR 240
Query: 498 EHGHNTSDCWELKRQIEDLIQDGYFKKFVGKPRTSSAEKKEERKRSRTPPRRTDRPAVIN 557
+HGHNT+ CWELKRQIEDLIQDGYFKKFVGKPR++S EKKEERKRSRTPPRR DRPAVIN
Sbjct: 241 DHGHNTTSCWELKRQIEDLIQDGYFKKFVGKPRSNSVEKKEERKRSRTPPRREDRPAVIN 300
Query: 558 TIFGGPSGGQSGHKRKELARAARCEVCIIREQEPTCPITFDGADLEEVHLPHNDALVIAP 617
TIFGGP+GGQSG+KRKELAR AR EVCIIRE +PTC ITF ADLE VHLPHNDALVIA
Sbjct: 301 TIFGGPNGGQSGNKRKELAREARREVCIIREHKPTCSITFGDADLEGVHLPHNDALVIAS 360
Query: 618 LIDHVVVRRVLVNGGASANILSLPTYLALGWTRSQLKRSPTPLVGFSGESVIPEGCIDLP 677
LIDH +VRRVL++G GCIDLP
Sbjct: 361 LIDHDLVRRVLIDG----------------------------------------GCIDLP 420
Query: 678 VTLGQDRTRVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAIPSTLHQVLKYSTPNGMGTVR 737
VT+GQD T+VTQMAEFVVIDGRSAYNAIFGRPIIHSFRA+PSTLHQVLKYSTPN +G VR
Sbjct: 421 VTIGQDATQVTQMAEFVVIDGRSAYNAIFGRPIIHSFRAVPSTLHQVLKYSTPNEVGMVR 480
Query: 738 GEQTASRECYASALKGSSVCALETLAGRDGALEFEADLP---RKEFAAPTEELELVPLLS 797
GEQ SRECYASALKGS+VCALE R E EADLP +++F PTEELELVPLLS
Sbjct: 481 GEQKTSRECYASALKGSAVCALEEQTNRGKLQESEADLPKEGKRQFPPPTEELELVPLLS 504
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1D9E1 | 5.9e-274 | 80.95 | uncharacterized protein LOC111018823 OS=Momordica charantia OX=3673 GN=LOC111018... | [more] |
A0A6J1C7X5 | 5.5e-272 | 92.42 | uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008... | [more] |
A0A6J1DHB3 | 3.8e-257 | 77.30 | uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020... | [more] |
A0A6J1DD03 | 3.6e-223 | 90.81 | uncharacterized protein LOC111019899 OS=Momordica charantia OX=3673 GN=LOC111019... | [more] |
A0A6J1DZB9 | 3.7e-220 | 75.37 | uncharacterized protein LOC111024904 OS=Momordica charantia OX=3673 GN=LOC111024... | [more] |
Match Name | E-value | Identity | Description | |