Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, CDS, polypeptide, three_prime_UTR
GGTAAGGAGTAAGGCAGTGAATGGCAGGTGGGTGTTGTATGATTTTTTTTTTGGGCAAGGAGTGAGGCAGTGAAATTAACTGGTGGAGATACGACCATTAGAAATTTATAATGCTATGGTATGAAAAATTGTCATGGATATTTTTTGAACACGCGTGCATCAGGATGTGTTATGGTCAGTCAAATGTTGAAATTCTCGTTTGTTTATGTCTGAATGAATACTTTTATTTGTTGTATCTAATCTATAAGCCGTTTACTGTTTGCACAACTTAAATACAATATTTTTAGATTTATCTTATCTAGCATCAAATTGTTGCTTATTGAGACATTTGTATATATGGAGTTAACTGCCTTTTACCTATGTACCTTTACAATGTTATTTATGTGAATGCCCTGCAGAAGTCCTTGGCCCAGTTTCCCTGAGTCTGGTACCTCGGTTGGTTTTTCTCTTCCTTCTCCCTCTGTTGTTCTTTGTGCTGTTAAATGTTAATGTTCCAAAAAAAATTCCATGCATATCTTTCTTCCTCTTTTTGGTTGCAGAATGGCCCAGGGTCCAGTTCTACAAATCCTTTTGGTTCTGATGCTAAAAATGGTACACTTAATAGGGTGTTTTGCTTTAGTAAACGATGCTTTATTTTGCACGAATGTTATTTATGAGAAACTGGTTGTTGCTTTATTACAGATGAAGATTCTCCTTGGATTAGCAAGTTGACTCCAGAGGCAAGCACATCCTGGGGTGCTGCAAAATCAAGTGTAGATACTGCAAACGACGGTCAAGCCTCTGGATGGGGAAAGAGCGATTCTAAAATTTGTTCAGATGGCAATGCCTCTGGTGCTTTGGGCAAAACAGTAGTACCTAGTGGAGATTCTGCAGGCTTTACAGATTCAGAATCTGGAGGTTGGAAAAAGAATCAAAGTGCTAATTTTGGTGATGATAACGCTCCAGTCGAAACTTCTGCCGATAGGTGGGGCAGCAAAAGTAGATCAAGTGGAAGCTGGGGCGATCAAAATGCTTCAACCACTGTCTCTGAGATTCAGCCAGCTGGCAAAGGAAATGCAGGTGCCTGGAATGTAGGCACTGCCAAGGATGAGTCAGGTGGGTGGGGTAAACCAAAGAACGTTGGGGATGTTGGCAGTTCTGCTTGGAACAAATCTACTGCTGGTGATGGAGACGGTCAAAACGACAGTTGGAATAAACCAAAACCTTCTAGTCACGATGGAAATGTTGGAAAAAAAGAATGGGGACAGGGTAATGAAGCCAGTGATAATGGTAACAAATGGCAGAGTTCAAGGTCTGATGGTGGAAAAAAATGGGGCACTAATGAAGCAGAACGTGAGGGTGGAAGCAGTTGGAATACATCCAAGTCGTCTGATGTTGGTCCTGCTAGTTGGAAAGATAAACCCGACTCCTCAAGTTTGACAGCTCCTAAAGGAGATCAATGGGCAGAAGGTTGGGATAAGCAGCACAGTTCAAATGATACAAAAGCCTCTGATGACAATTCTTCTTGGAATAAAAAACCTGTTGAAAGTGGGAAAGATGGCGAGCTTAAGAATCAAGGCAGTGGTTGGAATGTTGGAAAAACTTCTGGTGGAGATTCTGCATCCGGATGGGGTCAGACCAGCAAGGAAGCTGATTTGAGTGATCAAGCAGGTAGCTGGGGTTCTAATTGGAAAAAGAATTCTGATACCAGGAATGAAGATTCTAGTTCGGCCAAGAAAAGCAGCTGGGGTTCTGGAAGTGGAAATTCTAACTGGGGAGAGAAAAGCAATTGGAACTCAGGAAATGAATTTAATGCTATCACCGGTGGTGCTGAAGCTCAAACTGACGTTTCTAATGATACCTCAGGTTATGGGAGTTGGAAGCCAGAAAGTAGTGATAGAGGAGGATATCGTGGAAGAGGTGGTTTTAGAGGAAGGGGTGAAAGGGGCCGGTTTGGTGGCAGAGGCAGATCTGATAGAGGTGGTTTTGGCAGAGGTGGTTCAGACAGAGGTGGTTTTGGA
GGCAGAGGCCGTGGGAGGTGGAACAGTGAAGGTGGTTCAAATGATGGTGAAAATAAAGGATGGAGTGGTGGTGGCGGCGGCGGCAGCGACAACAAAGGATGGGGCAGTGGTGGTGGAGGCAGCGACAATAAAGGATGGAGCAGTGGTGGTGACGGCAGTAACAATAAAGGATGGAGTACTGGTGGTGAAGGCAGTGGCAATAAAGGTGGTGGTGACAACAAAGGCTGGGGTAGTGGCAGTGGCGGCAGCAGCGACGATAAAGGATGGAGTGGTGGTGGAAATGGTGTTGGTGGTGGTGACAATAAAGGATGGGGCAGTGGTGGTGGCGGTAGTAACGACAATAAAGGATGGAGTGGTGGTGACAACAAAGGATGGGGCAGTGCTGGCGGCAGCAGCGACAATAAAGGGTGGAGTAGTGGTGGCAGTGGTGGCGACAATAAAGGATGGAGTAGTGGAAGTGGTGGTGGCGGTGACGGTTGCGGAGATAAAGGGTGGAGTAGCGGTGGTGGTGGCGACAACAAAGGATGGGGTGGTGGTGGTGAGTCTGGTGACAAGGGGTGGAGTAGTGGTGGTAGTCGCGAATGGGAGAAATCTGGATCAGATCGAGGTGGATTTGGTGGCAGAGGTCGTGGAAGATGGAGCAGTGGAAGTGGTTCAAATGACAGTGACAGTGGGAGTGGGGGGTGGAGTGGTGGTGGTGATAGATCAGACCGAGGTGGAGGTGGATTTAGAGGTAGAGGCCGTGGAAGATGGAACCAAGAAGGTTCCTACGATGGTGACAATGGAGGGCAAAGAGGTGGATATGGAGGCCGGGGACGTGGGAGATGGAATCAAGAAAATGGTTCAAATGAAGGTGGTGACAATGGTGGGTGGAGTGGTGGTAGAGGTGGATTTGGAAGTCGAGGCCGTGGAAGTTGGAATCAAGACGATGGTGGTTCAGGAGGATGGAGTGGAGGAAATGGTGGTAGAGGCGGATTTGGCGGCCGAGGTCGTGGAAGGAGGAATCAAGACAGTTCAAATGATGGCAATAATGATGATAAACCTGCAAGCTGGAGCACAGGTTCAGGTAATAGTGGAGGGTGGAACAGTGGCGGTGGTGGTGCAGGATCTTGGAACCAAGGAGGTGATGAAAAGAACCAGCAGCAACATAGTTGGAAGTCTAGCAATGATGGCGGTCAGGGATCTGGTTGGAAGGAACCAAGTGGGAGTGATCACAATAATTGGGAATCATCTGGTTCTTCTGGTGCAGGTAATAGTTCTGGGTGGAACAATTCGACTACAGGCAAGGAGACGGAAGAGAGTGGTGGTCACAACAGCTGGAACCAAACGACAAAGACGGATTCTCAAGGTGGAGGCTGGCAGAAGTCAGCATCTAGTTGGAATGCCGGAACTGAGAATCAAACAGTGACCAAGGATGTGAGTTCAGTTTCTAAAGATGGAGGTTGGGGAAAGTCTGCTGAGCCATCAACCCTGGACAAAGAAAAAGCCAACGTAGGTGCCCAAGGAGGAGGAGCAGCAGGTTGGGAGAAGCCTACATCTAGTTGGAATACAGAGCAGTCTCGAGGCGAAAACAACAGCGGAGGAGGACGAGGAGGAGCTCGGGGTAAATAAGGAAATCATTAAACATAATCTTCATAACATTATGCCCCTTTTGGGTCATAATGGATGTTCTAATAATCACAAACTATTGAATTGAGCATGCTTTGATTAACTAAGGAAATCATTAGGATTCAAATATCTTTCTCATTGTTTGAGAGTCTGTCTTGACCAATTGCTTCAATGATATTGTTTCTTGTTATCCTTTTTCCTCTTTGCGTTTTCTCCTAATGCACGTTGTTTGTCTATGAATTCTCGATTTTCAAGCGTATGTGTTAACAATTGATCTTGACAGTTGATAATAACTTTTGAATTTCCGAATGACAATGGTGTGGCACAGTGCAGGGCGGTGTTTTGCTTGTTTTGGGGGAGACAGACAATCATCCTCACATCCATTTTCCACA
TGCTGTGTATGTAATAATAGACAAGAGATGGGGTCGTTTCAATAGATTCCATTTTTTCAATGTTTCAC
mRNA sequence
GGTAAGGAGTAAGGCAGTGAATGGCAGGTGGGTGTTGTATGATTTTTTTTTTGGGCAAGGAGTGAGGCAGTGAAATTAACTGGTGGAGATACGACCATTAGAAATTTATAATGCTATGGTATGAAAAATTGTCATGGATATTTTTTGAACACGCGTGCATCAGGATGTGTTATGGTCAGTCAAATGTTGAAATTCTCGTTTGTTTATGTCTGAATGAATACTTTTATTTGTTGTATCTAATCTATAAGCCGTTTACTGTTTGCACAACTTAAATACAATATTTTTAGATTTATCTTATCTAGCATCAAATTGTTGCTTATTGAGACATTTGTATATATGGAGTTAACTGCCTTTTACCTATGTACCTTTACAATGTTATTTATGTGAATGCCCTGCAGAAGTCCTTGGCCCAGTTTCCCTGAGTCTGGTACCTCGAATGGCCCAGGGTCCAGTTCTACAAATCCTTTTGGTTCTGATGCTAAAAATGATGAAGATTCTCCTTGGATTAGCAAGTTGACTCCAGAGGCAAGCACATCCTGGGGTGCTGCAAAATCAAGTGTAGATACTGCAAACGACGGTCAAGCCTCTGGATGGGGAAAGAGCGATTCTAAAATTTGTTCAGATGGCAATGCCTCTGGTGCTTTGGGCAAAACAGTAGTACCTAGTGGAGATTCTGCAGGCTTTACAGATTCAGAATCTGGAGGTTGGAAAAAGAATCAAAGTGCTAATTTTGGTGATGATAACGCTCCAGTCGAAACTTCTGCCGATAGGTGGGGCAGCAAAAGTAGATCAAGTGGAAGCTGGGGCGATCAAAATGCTTCAACCACTGTCTCTGAGATTCAGCCAGCTGGCAAAGGAAATGCAGGTGCCTGGAATGTAGGCACTGCCAAGGATGAGTCAGGTGGGTGGGGTAAACCAAAGAACGTTGGGGATGTTGGCAGTTCTGCTTGGAACAAATCTACTGCTGGTGATGGAGACGGTCAAAACGACAGTTGGAATAAACCAAAACCTTCTAGTCACGATGGAAATGTTGGAAAAAAAGAATGGGGACAGGGTAATGAAGCCAGTGATAATGGTAACAAATGGCAGAGTTCAAGGTCTGATGGTGGAAAAAAATGGGGCACTAATGAAGCAGAACGTGAGGGTGGAAGCAGTTGGAATACATCCAAGTCGTCTGATGTTGGTCCTGCTAGTTGGAAAGATAAACCCGACTCCTCAAGTTTGACAGCTCCTAAAGGAGATCAATGGGCAGAAGGTTGGGATAAGCAGCACAGTTCAAATGATACAAAAGCCTCTGATGACAATTCTTCTTGGAATAAAAAACCTGTTGAAAGTGGGAAAGATGGCGAGCTTAAGAATCAAGGCAGTGGTTGGAATGTTGGAAAAACTTCTGGTGGAGATTCTGCATCCGGATGGGGTCAGACCAGCAAGGAAGCTGATTTGAGTGATCAAGCAGGTAGCTGGGGTTCTAATTGGAAAAAGAATTCTGATACCAGGAATGAAGATTCTAGTTCGGCCAAGAAAAGCAGCTGGGGTTCTGGAAGTGGAAATTCTAACTGGGGAGAGAAAAGCAATTGGAACTCAGGAAATGAATTTAATGCTATCACCGGTGGTGCTGAAGCTCAAACTGACGTTTCTAATGATACCTCAGGTTATGGGAGTTGGAAGCCAGAAAGTAGTGATAGAGGAGGATATCGTGGAAGAGGTGGTTTTAGAGGAAGGGGTGAAAGGGGCCGGTTTGGTGGCAGAGGCAGATCTGATAGAGGTGGTTTTGGCAGAGGTGGTTCAGACAGAGGTGGTTTTGGAGGCAGAGGCCGTGGGAGGTGGAACAGTGAAGGTGGTTCAAATGATGGTGAAAATAAAGGATGGAGTGGTGGTGGCGGCGGCGGCAGCGACAACAAAGGATGGGGCAGTGGTGGTGGAGGCAGCGACAATAAAGGATGGAGCAGTGGTGGTGACGGCAGTAACAATAAAGGATGGAGTACTGGTGGTGAAGGCAG
TGGCAATAAAGGTGGTGGTGACAACAAAGGCTGGGGTAGTGGCAGTGGCGGCAGCAGCGACGATAAAGGATGGAGTGGTGGTGGAAATGGTGTTGGTGGTGGTGACAATAAAGGATGGGGCAGTGGTGGTGGCGGTAGTAACGACAATAAAGGATGGAGTGGTGGTGACAACAAAGGATGGGGCAGTGCTGGCGGCAGCAGCGACAATAAAGGGTGGAGTAGTGGTGGCAGTGGTGGCGACAATAAAGGATGGAGTAGTGGAAGTGGTGGTGGCGGTGACGGTTGCGGAGATAAAGGGTGGAGTAGCGGTGGTGGTGGCGACAACAAAGGATGGGGTGGTGGTGGTGAGTCTGGTGACAAGGGGTGGAGTAGTGGTGGTAGTCGCGAATGGGAGAAATCTGGATCAGATCGAGGTGGATTTGGTGGCAGAGGTCGTGGAAGATGGAGCAGTGGAAGTGGTTCAAATGACAGTGACAGTGGGAGTGGGGGGTGGAGTGGTGGTGGTGATAGATCAGACCGAGGTGGAGGTGGATTTAGAGGTAGAGGCCGTGGAAGATGGAACCAAGAAGGTTCCTACGATGGTGACAATGGAGGGCAAAGAGGTGGATATGGAGGCCGGGGACGTGGGAGATGGAATCAAGAAAATGGTTCAAATGAAGGTGGTGACAATGGTGGGTGGAGTGGTGGTAGAGGTGGATTTGGAAGTCGAGGCCGTGGAAGTTGGAATCAAGACGATGGTGGTTCAGGAGGATGGAGTGGAGGAAATGGTGGTAGAGGCGGATTTGGCGGCCGAGGTCGTGGAAGGAGGAATCAAGACAGTTCAAATGATGGCAATAATGATGATAAACCTGCAAGCTGGAGCACAGGTTCAGGTAATAGTGGAGGGTGGAACAGTGGCGGTGGTGGTGCAGGATCTTGGAACCAAGGAGGTGATGAAAAGAACCAGCAGCAACATAGTTGGAAGTCTAGCAATGATGGCGGTCAGGGATCTGGTTGGAAGGAACCAAGTGGGAGTGATCACAATAATTGGGAATCATCTGGTTCTTCTGGTGCAGGTAATAGTTCTGGGTGGAACAATTCGACTACAGGCAAGGAGACGGAAGAGAGTGGTGGTCACAACAGCTGGAACCAAACGACAAAGACGGATTCTCAAGGTGGAGGCTGGCAGAAGTCAGCATCTAGTTGGAATGCCGGAACTGAGAATCAAACAGTGACCAAGGATGTGAGTTCAGTTTCTAAAGATGGAGGTTGGGGAAAGTCTGCTGAGCCATCAACCCTGGACAAAGAAAAAGCCAACGTAGGTGCCCAAGGAGGAGGAGCAGCAGGTTGGGAGAAGCCTACATCTAGTTGGAATACAGAGCAGTCTCGAGGCGAAAACAACAGCGGAGGAGGACGAGGAGGAGCTCGGGGTAAATAAGGAAATCATTAAACATAATCTTCATAACATTATGCCCCTTTTGGGTCATAATGGATGTTCTAATAATCACAAACTATTGAATTGAGCATGCTTTGATTAACTAAGGAAATCATTAGGATTCAAATATCTTTCTCATTGTTTGAGAGTCTGTCTTGACCAATTGCTTCAATGATATTGTTTCTTGTTATCCTTTTTCCTCTTTGCGTTTTCTCCTAATGCACGTTGTTTGTCTATGAATTCTCGATTTTCAAGCGTATGTGTTAACAATTGATCTTGACAGTTGATAATAACTTTTGAATTTCCGAATGACAATGGTGTGGCACAGTGCAGGGCGGTGTTTTGCTTGTTTTGGGGGAGACAGACAATCATCCTCACATCCATTTTCCACATGCTGTGTATGTAATAATAGACAAGAGATGGGGTCGTTTCAATAGATTCCATTTTTTCAATGTTTCAC
Coding sequence (CDS)
ATGCCCTGCAGAAGTCCTTGGCCCAGTTTCCCTGAGTCTGGTACCTCGAATGGCCCAGGGTCCAGTTCTACAAATCCTTTTGGTTCTGATGCTAAAAATGATGAAGATTCTCCTTGGATTAGCAAGTTGACTCCAGAGGCAAGCACATCCTGGGGTGCTGCAAAATCAAGTGTAGATACTGCAAACGACGGTCAAGCCTCTGGATGGGGAAAGAGCGATTCTAAAATTTGTTCAGATGGCAATGCCTCTGGTGCTTTGGGCAAAACAGTAGTACCTAGTGGAGATTCTGCAGGCTTTACAGATTCAGAATCTGGAGGTTGGAAAAAGAATCAAAGTGCTAATTTTGGTGATGATAACGCTCCAGTCGAAACTTCTGCCGATAGGTGGGGCAGCAAAAGTAGATCAAGTGGAAGCTGGGGCGATCAAAATGCTTCAACCACTGTCTCTGAGATTCAGCCAGCTGGCAAAGGAAATGCAGGTGCCTGGAATGTAGGCACTGCCAAGGATGAGTCAGGTGGGTGGGGTAAACCAAAGAACGTTGGGGATGTTGGCAGTTCTGCTTGGAACAAATCTACTGCTGGTGATGGAGACGGTCAAAACGACAGTTGGAATAAACCAAAACCTTCTAGTCACGATGGAAATGTTGGAAAAAAAGAATGGGGACAGGGTAATGAAGCCAGTGATAATGGTAACAAATGGCAGAGTTCAAGGTCTGATGGTGGAAAAAAATGGGGCACTAATGAAGCAGAACGTGAGGGTGGAAGCAGTTGGAATACATCCAAGTCGTCTGATGTTGGTCCTGCTAGTTGGAAAGATAAACCCGACTCCTCAAGTTTGACAGCTCCTAAAGGAGATCAATGGGCAGAAGGTTGGGATAAGCAGCACAGTTCAAATGATACAAAAGCCTCTGATGACAATTCTTCTTGGAATAAAAAACCTGTTGAAAGTGGGAAAGATGGCGAGCTTAAGAATCAAGGCAGTGGTTGGAATGTTGGAAAAACTTCTGGTGGAGATTCTGCATCCGGATGGGGTCAGACCAGCAAGGAAGCTGATTTGAGTGATCAAGCAGGTAGCTGGGGTTCTAATTGGAAAAAGAATTCTGATACCAGGAATGAAGATTCTAGTTCGGCCAAGAAAAGCAGCTGGGGTTCTGGAAGTGGAAATTCTAACTGGGGAGAGAAAAGCAATTGGAACTCAGGAAATGAATTTAATGCTATCACCGGTGGTGCTGAAGCTCAAACTGACGTTTCTAATGATACCTCAGGTTATGGGAGTTGGAAGCCAGAAAGTAGTGATAGAGGAGGATATCGTGGAAGAGGTGGTTTTAGAGGAAGGGGTGAAAGGGGCCGGTTTGGTGGCAGAGGCAGATCTGATAGAGGTGGTTTTGGCAGAGGTGGTTCAGACAGAGGTGGTTTTGGAGGCAGAGGCCGTGGGAGGTGGAACAGTGAAGGTGGTTCAAATGATGGTGAAAATAAAGGATGGAGTGGTGGTGGCGGCGGCGGCAGCGACAACAAAGGATGGGGCAGTGGTGGTGGAGGCAGCGACAATAAAGGATGGAGCAGTGGTGGTGACGGCAGTAACAATAAAGGATGGAGTACTGGTGGTGAAGGCAGTGGCAATAAAGGTGGTGGTGACAACAAAGGCTGGGGTAGTGGCAGTGGCGGCAGCAGCGACGATAAAGGATGGAGTGGTGGTGGAAATGGTGTTGGTGGTGGTGACAATAAAGGATGGGGCAGTGGTGGTGGCGGTAGTAACGACAATAAAGGATGGAGTGGTGGTGACAACAAAGGATGGGGCAGTGCTGGCGGCAGCAGCGACAATAAAGGGTGGAGTAGTGGTGGCAGTGGTGGCGACAATAAAGGATGGAGTAGTGGAAGTGGTGGTGGCGGTGACGGTTGCGGAGATAAAGGGTGGAGTAGCGGTGGTGGTGGCGACAACAAAGGATGGGGTGGTGGTGGTGAGTCTGGTGACAAGGGGTGGAGTAGTGGTGGTAGTCGCGA
ATGGGAGAAATCTGGATCAGATCGAGGTGGATTTGGTGGCAGAGGTCGTGGAAGATGGAGCAGTGGAAGTGGTTCAAATGACAGTGACAGTGGGAGTGGGGGGTGGAGTGGTGGTGGTGATAGATCAGACCGAGGTGGAGGTGGATTTAGAGGTAGAGGCCGTGGAAGATGGAACCAAGAAGGTTCCTACGATGGTGACAATGGAGGGCAAAGAGGTGGATATGGAGGCCGGGGACGTGGGAGATGGAATCAAGAAAATGGTTCAAATGAAGGTGGTGACAATGGTGGGTGGAGTGGTGGTAGAGGTGGATTTGGAAGTCGAGGCCGTGGAAGTTGGAATCAAGACGATGGTGGTTCAGGAGGATGGAGTGGAGGAAATGGTGGTAGAGGCGGATTTGGCGGCCGAGGTCGTGGAAGGAGGAATCAAGACAGTTCAAATGATGGCAATAATGATGATAAACCTGCAAGCTGGAGCACAGGTTCAGGTAATAGTGGAGGGTGGAACAGTGGCGGTGGTGGTGCAGGATCTTGGAACCAAGGAGGTGATGAAAAGAACCAGCAGCAACATAGTTGGAAGTCTAGCAATGATGGCGGTCAGGGATCTGGTTGGAAGGAACCAAGTGGGAGTGATCACAATAATTGGGAATCATCTGGTTCTTCTGGTGCAGGTAATAGTTCTGGGTGGAACAATTCGACTACAGGCAAGGAGACGGAAGAGAGTGGTGGTCACAACAGCTGGAACCAAACGACAAAGACGGATTCTCAAGGTGGAGGCTGGCAGAAGTCAGCATCTAGTTGGAATGCCGGAACTGAGAATCAAACAGTGACCAAGGATGTGAGTTCAGTTTCTAAAGATGGAGGTTGGGGAAAGTCTGCTGAGCCATCAACCCTGGACAAAGAAAAAGCCAACGTAGGTGCCCAAGGAGGAGGAGCAGCAGGTTGGGAGAAGCCTACATCTAGTTGGAATACAGAGCAGTCTCGAGGCGAAAACAACAGCGGAGGAGGACGAGGAGGAGCTCGGGGTAAATAA
Protein sequence
MPCRSPWPSFPESGTSNGPGSSSTNPFGSDAKNDEDSPWISKLTPEASTSWGAAKSSVDTANDGQASGWGKSDSKICSDGNASGALGKTVVPSGDSAGFTDSESGGWKKNQSANFGDDNAPVETSADRWGSKSRSSGSWGDQNASTTVSEIQPAGKGNAGAWNVGTAKDESGGWGKPKNVGDVGSSAWNKSTAGDGDGQNDSWNKPKPSSHDGNVGKKEWGQGNEASDNGNKWQSSRSDGGKKWGTNEAEREGGSSWNTSKSSDVGPASWKDKPDSSSLTAPKGDQWAEGWDKQHSSNDTKASDDNSSWNKKPVESGKDGELKNQGSGWNVGKTSGGDSASGWGQTSKEADLSDQAGSWGSNWKKNSDTRNEDSSSAKKSSWGSGSGNSNWGEKSNWNSGNEFNAITGGAEAQTDVSNDTSGYGSWKPESSDRGGYRGRGGFRGRGERGRFGGRGRSDRGGFGRGGSDRGGFGGRGRGRWNSEGGSNDGENKGWSGGGGGGSDNKGWGSGGGGSDNKGWSSGGDGSNNKGWSTGGEGSGNKGGGDNKGWGSGSGGSSDDKGWSGGGNGVGGGDNKGWGSGGGGSNDNKGWSGGDNKGWGSAGGSSDNKGWSSGGSGGDNKGWSSGSGGGGDGCGDKGWSSGGGGDNKGWGGGGESGDKGWSSGGSREWEKSGSDRGGFGGRGRGRWSSGSGSNDSDSGSGGWSGGGDRSDRGGGGFRGRGRGRWNQEGSYDGDNGGQRGGYGGRGRGRWNQENGSNEGGDNGGWSGGRGGFGSRGRGSWNQDDGGSGGWSGGNGGRGGFGGRGRGRRNQDSSNDGNNDDKPASWSTGSGNSGGWNSGGGGAGSWNQGGDEKNQQQHSWKSSNDGGQGSGWKEPSGSDHNNWESSGSSGAGNSSGWNNSTTGKETEESGGHNSWNQTTKTDSQGGGWQKSASSWNAGTENQTVTKDVSSVSKDGGWGKSAEPSTLDKEKANVGAQGGGAAGWEKPTSSWNTEQSRGENNSGGGRGGARGK*
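The protein sequence above should be the conceptual translation of the coding sequence. As a minimal sketch of how that relationship can be checked, the Python below translates a nucleotide string with the standard genetic code. The function name `translate` and the short test fragments are illustrative choices, not part of the database; the fragments are copied from the first 36 and last 9 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; unknown codons become 'X'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    usable = len(cds) - len(cds) % 3    # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# Hypothetical usage: the first 12 codons and the last 3 codons of the CDS.
cds_start = "ATGCCCTGCAGAAGTCCTTGGCCCAGTTTCCCTGAG"
cds_end = "GGTAAATAA"
print(translate(cds_start))  # MPCRSPWPSFPE
print(translate(cds_end))    # GK*
```

Both fragments agree with the start (`MPCRSPWPSFPE`) and end (`GK*`) of the listed protein, where `*` marks the stop codon. Translating the full CDS would require joining its lines into one string first.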
Homology
BLAST of CSPI02G16050 vs. ExPASy TrEMBL
Match:
A0A5D3BLU8 (Protein RNA-directed DNA methylation 3 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold360G00100 PE=4 SV=1)
HSP 1 Score: 1544.3 bits (3997), Expect = 0.0e+00
Identity = 915/1065 (85.92%), Positives = 939/1065 (88.17%), Query Frame = 0
Query: 4 RSPWPSFPESGTSNGPGSSSTNPFGSDAKNDEDSPWISKLTPEASTSWGAAKSSVDTAND 63
RSPWPSFPESGTSNGPGSSSTNPFGSDA NDEDSPWISK TPEASTSWGAAKSSVDTAND
Sbjct: 679 RSPWPSFPESGTSNGPGSSSTNPFGSDAINDEDSPWISKSTPEASTSWGAAKSSVDTAND 738
Query: 64 GQASGWGKSDSKICSDGNASGALGKTVVPSGDSAGFTDSESGGWKKNQSANFGDDNAPVE 123
GQASGWGK DSK CSDGNASGA GKTV PSG SAGFTDSESGGWKKNQSANFGDD P E
Sbjct: 739 GQASGWGKGDSKTCSDGNASGAWGKTVAPSGHSAGFTDSESGGWKKNQSANFGDDKTPAE 798
Query: 124 TSADRWGSKSRSSGSWGDQNASTTVSEIQPAGKGNAGAWNVGTAKDESGGWGKPKNVGDV 183
T+ADRWGSKSRSSGSWGDQNASTTVSE+QPAGKGNAGAWN GTA+DESGGWGKPKN GDV
Sbjct: 799 TAADRWGSKSRSSGSWGDQNASTTVSEVQPAGKGNAGAWNEGTAQDESGGWGKPKNFGDV 858
Query: 184 GSSAWNKSTAGDGDGQNDSWNKPKPSSHDGNVGKKEWGQGNEASDNGNKWQSSRSDGGKK 243
GSSAWNKSTAGD DG+NDSWNKPKPSSHDG+VGKKEWGQGNEASDNGNKWQSSRSDGGKK
Sbjct: 859 GSSAWNKSTAGDRDGENDSWNKPKPSSHDGSVGKKEWGQGNEASDNGNKWQSSRSDGGKK 918
Query: 244 WGTNEAEREGGSSWNTSKSSDVGPASWKDKPDSSSLTAPKGDQWAEGWDKQHSSNDTKAS 303
WG+NEAE EGGSSWNTSKSSDV AS KDKPDSSSLTAPKGDQWA GWDKQHSSNDTKAS
Sbjct: 919 WGSNEAEPEGGSSWNTSKSSDVVSASRKDKPDSSSLTAPKGDQWAGGWDKQHSSNDTKAS 978
Query: 304 DDNSSWNKKPVESGKDGELKNQGSGWNVGKTSGGDSASGWGQTSKEADLSDQAGSWGSNW 363
DDNS WNKK VESGKDGE+KNQGSGWNVGKTSGGDSASGWGQTSKEA LSDQAGSWGSNW
Sbjct: 979 DDNSPWNKKSVESGKDGEIKNQGSGWNVGKTSGGDSASGWGQTSKEAGLSDQAGSWGSNW 1038
Query: 364 KKNSDTRNEDSSSAKKSSWGSGSGNSNWGEKSNWNSGNEFNAITGGAEAQTDVSNDTSGY 423
KKNS NEDSSSAKKSSWGSG GNSNWGEKSNWNSGNEFNA TGGAEAQTDVSNDTS Y
Sbjct: 1039 KKNSGAGNEDSSSAKKSSWGSGGGNSNWGEKSNWNSGNEFNATTGGAEAQTDVSNDTSSY 1098
Query: 424 GSWKPESSDRGGYRGRGGFRGRGERGRFGGRGRSDRGGFGRGGSDRGGFGGRGRGRWNSE 483
GSWKPESSDRGGYRGRGGFRGRGERGRFGGRGRSDRGGFGRGGSDRGGFGGRGRGRWNSE
Sbjct: 1099 GSWKPESSDRGGYRGRGGFRGRGERGRFGGRGRSDRGGFGRGGSDRGGFGGRGRGRWNSE 1158
Query: 484 GGSNDGENKGWSGGGG---------------------GGSDNKGWGSGGGGSDNKGWSSG 543
GGSNDGENKGWS GGG GGSD KGWGSGGGGSDNKGWSSG
Sbjct: 1159 GGSNDGENKGWSAGGGNDNKGWSSGGVSDSKGWSSGDGGSDIKGWGSGGGGSDNKGWSSG 1218
Query: 544 GDGSNNKGWSTGGEGSGNK-------GGGDNKGWGSGSGGSSDDKGWSGGGNGVGGGDNK 603
GDGS+NKGWS+GG G NK GGGD+KGWGS GGSSD+KGWS G+GVGG DNK
Sbjct: 1219 GDGSDNKGWSSGGSGGDNKGWSTGGEGGGDSKGWGSSGGGSSDNKGWS-SGSGVGGHDNK 1278
Query: 604 GWGSGGGGSNDNKGWS-------GGDNKGWGSAGGSSDNKGWSSGG----SGGDNKGWSS 663
GWGSGGGGS+DNKGWS GGDNKGWGS GG SDNKGW+SGG GGDNKGW S
Sbjct: 1279 GWGSGGGGSSDNKGWSSGGSGVAGGDNKGWGSGGGGSDNKGWNSGGESGVDGGDNKGWGS 1338
Query: 664 -------------GSGGGGDGCGDKGWSSGGGGDNKGWGGGGESGD-KGWSSGGSREWEK 723
GSGGGGDG GDKGWSSGGGG+NKGWGGGGESGD KGWSSGGSREWEK
Sbjct: 1339 GGGGNNKGWSSGGGSGGGGDGTGDKGWSSGGGGNNKGWGGGGESGDNKGWSSGGSREWEK 1398
Query: 724 SGSDRGGFGGRGRGRWSSGSGSNDSDSGSGGWS-GGGDR----SDRGGGGFRGRGRGRWN 783
SGSD GGFGGRGRGRWSSGSGSND DSGSGGWS GGGDR S+RGGGGFRGRGRGRWN
Sbjct: 1399 SGSDGGGFGGRGRGRWSSGSGSNDGDSGSGGWSGGGGDREKFGSERGGGGFRGRGRGRWN 1458
Query: 784 QE-GSYDGDNGGQRGGYGGRGRGRWNQENGSNEGGDNGGWSGGRGGFGSRGRGSWNQDDG 843
QE GSYDGDNGG+RGGYGGRGRGRWNQENGSNEGGDN GGRGGFG RGRG WNQDDG
Sbjct: 1459 QEGGSYDGDNGGRRGGYGGRGRGRWNQENGSNEGGDN----GGRGGFGGRGRGRWNQDDG 1518
Query: 844 GSGGWSGGNGGRGGFGGRGRGRRNQDSSNDGNNDDKPASWSTGSGNSGGWNSGGGGAGSW 903
GSGGWSGG GGRGGFGGRGRGRRNQD SN+ NNDDKPASWS GSGNSGGW+S GGAGSW
Sbjct: 1519 GSGGWSGG-GGRGGFGGRGRGRRNQDGSNESNNDDKPASWSAGSGNSGGWSS-SGGAGSW 1578
Query: 904 NQGGDEKNQQQHSWKSSNDGGQGSGWKEPSGSDHNNWESSGSSGAGNSSGWNNSTTGKET 963
NQGGDEKN+QQHSWKSSNDGGQGSGWKEPSG+DHNNW+SSGSSGAGNSSGWNNST KE
Sbjct: 1579 NQGGDEKNEQQHSWKSSNDGGQGSGWKEPSGNDHNNWKSSGSSGAGNSSGWNNSTAAKEM 1638
Query: 964 EESGGHNSWNQTTKTDSQGGGWQKSASSWNAGTENQTVTKDVSSVSKDGGWGKSAEPSTL 1010
EESGG NSW+Q+TKTDSQGGGWQKSASSWNAGTENQT TKDVSS SKDGGWGKS EPSTL
Sbjct: 1639 EESGGQNSWSQSTKTDSQGGGWQKSASSWNAGTENQTATKDVSSGSKDGGWGKSVEPSTL 1698
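The percentages in each hit header are the reported counts divided by the alignment length. The sketch below recomputes them from a header line as a sanity check. The helper name `header_percentages` is hypothetical, and the regular expression simply picks up every `name = count/total` pair, so it does not depend on the exact field labels.

```python
import re


def header_percentages(header: str) -> dict:
    """Recompute the percentage for every 'name = count/total' field."""
    return {
        name: round(100 * int(count) / int(total), 2)
        for name, count, total in re.findall(r"(\w+) = (\d+)/(\d+)", header)
    }


# Header of the first TrEMBL hit (A0A5D3BLU8), copied from the report.
header = "Identity = 915/1065 (85.92%), Positives = 939/1065 (88.17%), Query Frame = 0"
print(header_percentages(header))  # {'Identity': 85.92, 'Positives': 88.17}
```

The `Query Frame = 0` field is skipped because it has no `count/total` part.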
BLAST of CSPI02G16050 vs. ExPASy TrEMBL
Match:
A0A0A0LN22 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G325960 PE=4 SV=1)
HSP 1 Score: 1357.8 bits (3513), Expect = 0.0e+00
Identity = 789/1006 (78.43%), Positives = 790/1006 (78.53%), Query Frame = 0
Query: 4 RSPWPSFPESGTSNGPGSSSTNPFGSDAKNDEDSPWISKLTPEASTSWGAAKSSVDTAND 63
RSPWPSFPESGTSNGPGSSSTNPFGSDAKNDEDSPWISKLTPEASTSWGAAKSSVDTAND
Sbjct: 680 RSPWPSFPESGTSNGPGSSSTNPFGSDAKNDEDSPWISKLTPEASTSWGAAKSSVDTAND 739
Query: 64 GQASGWGKSDSKICSDGNASGALGKTVVPSGDSAGFTDSESGGWKKNQSANFGDDNAPVE 123
GQASGWGKSDSKICSDGNASGALGKTVVPSGDSAGFTDSESGGWKKNQSANFGDDNAPVE
Sbjct: 740 GQASGWGKSDSKICSDGNASGALGKTVVPSGDSAGFTDSESGGWKKNQSANFGDDNAPVE 799
Query: 124 TSADRWGSKSRSSGSWGDQNASTTVSEIQPAGKGNAGAWNVGTAKDESGGWGKPKNVGDV 183
TSADRWGSKSRSSGSWGDQNASTTVSEIQPAGKGNAGAWNVGTAKDESGGWGKPKNVGDV
Sbjct: 800 TSADRWGSKSRSSGSWGDQNASTTVSEIQPAGKGNAGAWNVGTAKDESGGWGKPKNVGDV 859
Query: 184 GSSAWNKSTAGDGDGQNDSWNKPKPSSHDGNVGKKEWGQGNEASDNGNKWQSSRSDGGKK 243
GSSAWNKSTAGDGDGQN SWNKPKPS+HDGNVGKKEWGQGNEASDNGNKWQSSRSDGGKK
Sbjct: 860 GSSAWNKSTAGDGDGQNGSWNKPKPSNHDGNVGKKEWGQGNEASDNGNKWQSSRSDGGKK 919
Query: 244 WGTNEAEREGGSSWNTSKSSDVGPASWKDKPDSSSLTAPKGDQWAEGWDKQHSSNDTKAS 303
WGTNEAEREGGSSWNTSKSSDVGPASWKDKPDSSSLTAPKGDQWAEGWDKQHSSNDTKAS
Sbjct: 920 WGTNEAEREGGSSWNTSKSSDVGPASWKDKPDSSSLTAPKGDQWAEGWDKQHSSNDTKAS 979
Query: 304 DDNSSWNKKPVESGKDGELKNQGSGWNVGKTSGGDSASGWGQTSKEADLSDQAGSWGSNW 363
DDNSSWNKKPVESGKDGELKNQGSGWNVGKTSGGDSASGWGQTSKEADLSDQAGSWGSNW
Sbjct: 980 DDNSSWNKKPVESGKDGELKNQGSGWNVGKTSGGDSASGWGQTSKEADLSDQAGSWGSNW 1039
Query: 364 KKNSDTRNEDSSSAKKSSWGSGSGNSNWGEKSNWNSGNEFNAITGGAEAQTDVSNDTSGY 423
KKNSDTRNEDSSSAKKSSWGSGSGNSNWGEKSNWNSGNEFNAITGGAEAQTDVSNDTSGY
Sbjct: 1040 KKNSDTRNEDSSSAKKSSWGSGSGNSNWGEKSNWNSGNEFNAITGGAEAQTDVSNDTSGY 1099
Query: 424 GSWKPESSDRGGYRGRGGFRGRGERGRFGGRGRSDRGGFGRGGSDRGGFGGRGRGRWNSE 483
GSWKPESSDRGGYRGRGGFRGRGERGRFGGRGRSDRGGFGRGGSDRGGFGGRGRGRWNSE
Sbjct: 1100 GSWKPESSDRGGYRGRGGFRGRGERGRFGGRGRSDRGGFGRGGSDRGGFGGRGRGRWNSE 1159
Query: 484 GGSNDGENKGWSGGGGGGSDNKGWGSGGGGSDNKGWSSGGDGSNNKGWSTGGEGSGNKGG 543
GGSNDGENKGWSGGGGGGSDNKGWGSGGGGSDNKGWSSGGDGSNNKGWSTGGEGSGNKGG
Sbjct: 1160 GGSNDGENKGWSGGGGGGSDNKGWGSGGGGSDNKGWSSGGDGSNNKGWSTGGEGSGNKGG 1219
Query: 544 GDNKGWGSGSGGSSDDKGWSGGGNGVGGGDNKGWGSGGGGSNDNKGWSGGDNKGWGSAGG 603
GDNKGWGSGSGGSSDDKGWSGGGNGVGGGDNKGWGSGGGGSNDNKGWSGGDNKGWGSAGG
Sbjct: 1220 GDNKGWGSGSGGSSDDKGWSGGGNGVGGGDNKGWGSGGGGSNDNKGWSGGDNKGWGSAGG 1279
Query: 604 SSDNKGWSSGGSGGDNKGWSSGSGGGGDGCGDKGWSSGGGGDNKGWGGGGESGDKGWSSG 663
SSDNKGWSSGGSGGDNKGWSS GGDG
Sbjct: 1280 SSDNKGWSSGGSGGDNKGWSS----GGDG------------------------------- 1339
Query: 664 GSREWEKSGSDRGGFGGRGRGRWSSGSGSNDSDSGSGGWSGGGDRSDRGGGGFRGRGRGR 723
Sbjct: 1340 ------------------------------------------------------------ 1399
Query: 724 WNQEGSYDGDNGGQRGGYGGRGRGRWNQENGSNEGGDNGGWSGGRGGFGSRGRGSWNQDD 783
Sbjct: 1400 ------------------------------------------------------------ 1459
Query: 784 GGSGGWSGGNGGRGGFGGRGRGRRNQDSSNDGNNDDKPASWSTGSGNSGGWNSGGGGAGS 843
S
Sbjct: 1460 ----------------------------------------------------------RS 1472
Query: 844 WNQGGDEKNQQQHSWKSSNDGGQGSGWKEPSGSDHNNWESSGSSGAGNSSGWNNSTTGKE 903
WNQGGDEKNQQQHSWKSSNDGGQGSGWKEPSGSDHNNWESSGSSGAGNSSGWNNSTTGKE
Sbjct: 1520 WNQGGDEKNQQQHSWKSSNDGGQGSGWKEPSGSDHNNWESSGSSGAGNSSGWNNSTTGKE 1472
Query: 904 TEESGGHNSWNQTTKTDSQGGGWQKSASSWNAGTENQTVTKDVSSVSKDGGWGKSAEPST 963
TEESGGHNSWNQTTKTDSQGGGWQKSASSWNAGTENQTVTKDVSSVSKDGGWGKSAEPST
Sbjct: 1580 TEESGGHNSWNQTTKTDSQGGGWQKSASSWNAGTENQTVTKDVSSVSKDGGWGKSAEPST 1472
Query: 964 LDKEKANVGAQGGGAAGWEKPTSSWNTEQSRGENNSGGGRGGARGK 1010
LDKE ANVGAQGGGAAGWEKPTSSWNTEQSRGENNSGGGRGGARGK
Sbjct: 1640 LDKEIANVGAQGGGAAGWEKPTSSWNTEQSRGENNSGGGRGGARGK 1472
BLAST of CSPI02G16050 vs. ExPASy TrEMBL
Match:
A0A1S4E184 (protein RNA-directed DNA methylation 3 OS=Cucumis melo OX=3656 GN=LOC103496462 PE=4 SV=1)
HSP 1 Score: 931.8 bits (2407), Expect = 2.4e-267
Identity = 545/631 (86.37%), Positives = 561/631 (88.91%), Query Frame = 0
Query: 4 RSPWPSFPESGTSNGPGSSSTNPFGSDAKNDEDSPWISKLTPEASTSWGAAKSSVDTAND 63
RSPWPSFPESGTSNGPGSSSTNPFGSDA NDEDSPWISK TPEASTSWGAAKSSVDTAND
Sbjct: 619 RSPWPSFPESGTSNGPGSSSTNPFGSDAINDEDSPWISKSTPEASTSWGAAKSSVDTAND 678
Query: 64 GQASGWGKSDSKICSDGNASGALGKTVVPSGDSAGFTDSESGGWKKNQSANFGDDNAPVE 123
GQASGWGK DSK CSDGNASGA GKTV PSG SAGFTDSESGGWKKNQSANFGDD P E
Sbjct: 679 GQASGWGKGDSKTCSDGNASGAWGKTVAPSGHSAGFTDSESGGWKKNQSANFGDDKTPAE 738
Query: 124 TSADRWGSKSRSSGSWGDQNASTTVSEIQPAGKGNAGAWNVGTAKDESGGWGKPKNVGDV 183
T+ADRWGSKSRSSGSWGDQNASTTVSE+QPAGKGNAGAWN GTA+DESGGWGKPKN GDV
Sbjct: 739 TAADRWGSKSRSSGSWGDQNASTTVSEVQPAGKGNAGAWNEGTAQDESGGWGKPKNFGDV 798
Query: 184 GSSAWNKSTAGDGDGQNDSWNKPKPSSHDGNVGKKEWGQGNEASDNGNKWQSSRSDGGKK 243
GSSAWNKSTAGD DG+NDSWNKPKPSSHDG+VGKKEWGQGNEASDNGNKWQSSRSDGGKK
Sbjct: 799 GSSAWNKSTAGDRDGENDSWNKPKPSSHDGSVGKKEWGQGNEASDNGNKWQSSRSDGGKK 858
Query: 244 WGTNEAEREGGSSWNTSKSSDVGPASWKDKPDSSSLTAPKGDQWAEGWDKQHSSNDTKAS 303
WG+NEAE EGGSSWNTSKSSDV AS KDKPDSSSLTAPKGDQWA GWDKQHSSNDTKAS
Sbjct: 859 WGSNEAEPEGGSSWNTSKSSDVVSASRKDKPDSSSLTAPKGDQWAGGWDKQHSSNDTKAS 918
Query: 304 DDNSSWNKKPVESGKDGELKNQGSGWNVGKTSGGDSASGWGQTSKEADLSDQAGSWGSNW 363
DDNS WNKK VESGKDGE+KNQGSGWNVGKTSGGDSASGWGQTSKEA LSDQAGSWGSNW
Sbjct: 919 DDNSPWNKKSVESGKDGEIKNQGSGWNVGKTSGGDSASGWGQTSKEAGLSDQAGSWGSNW 978
Query: 364 KKNSDTRNEDSSSAKKSSWGSGSGNSNWGEKSNWNSGNEFNAITGGAEAQTDVSNDTSGY 423
KKNS NEDSSSAKKSSWGSG GNSNWGEKSNWNSGNEFNA TGGAEAQTDVSNDTS Y
Sbjct: 979 KKNSGAGNEDSSSAKKSSWGSGGGNSNWGEKSNWNSGNEFNATTGGAEAQTDVSNDTSSY 1038
Query: 424 GSWKPESSDRGGYRGRGGFRGRGERGRFGGRGRSDRGGFGRGGSDRGGFGGRGRGRWNSE 483
GSWKPESSDRGGYRGRGGFRGRGERGRFGGRGRSDRGGFGRGGSDRGGFGGRGRGRWNSE
Sbjct: 1039 GSWKPESSDRGGYRGRGGFRGRGERGRFGGRGRSDRGGFGRGGSDRGGFGGRGRGRWNSE 1098
Query: 484 GGSNDGENKGWSGGGGGGSDNKGWGSGGGGSDNKGWSSGGDGSNNKGWSTGGEGSGNKGG 543
GGSNDGENKGWS GGG+DNKGW S GG SD+KGWSSG GS+ KGW +GG GG
Sbjct: 1099 GGSNDGENKGWS--AGGGNDNKGW-SSGGVSDSKGWSSGDGGSDIKGWGSGG------GG 1158
Query: 544 GDNKGWGSGSGGSSDDKGWSGGGNGVGGGDNKGWGSGGGGSNDNKGWSGGDNKGWGSA-G 603
DNKGW SG G SD+KGWS GG+ GGDNKGW +GG G GGD+KGWGS+ G
Sbjct: 1159 SDNKGWSSG-GDGSDNKGWSSGGS---GGDNKGWSTGGEG--------GGDSKGWGSSGG 1218
Query: 604 GSSDNKGWSSG-GSGG-DNKGWSSGSGGGGD 632
GSSDNKGWSSG G GG DNKGW SG GG D
Sbjct: 1219 GSSDNKGWSSGSGVGGHDNKGWGSGGGGSSD 1228
BLAST of CSPI02G16050 vs. ExPASy TrEMBL
Match:
A0A6J1FGX3 (protein RNA-directed DNA methylation 3-like OS=Cucurbita moschata OX=3662 GN=LOC111445374 PE=4 SV=1)
HSP 1 Score: 928.7 bits (2399), Expect = 2.1e-266
Identity = 665/1096 (60.68%), Positives = 720/1096 (65.69%), Query Frame = 0
Query: 5 SPWPSFPESGTSNGPGSSSTNPFGS---DAKNDEDSPWISKLTPEASTSWGAAKSSVDTA 64
+PWPSFPES T NGPGSSSTNP GS DAK DEDSPW+SK TP+ASTSWGAAKSSVDTA
Sbjct: 675 NPWPSFPESSTLNGPGSSSTNPIGSESFDAKKDEDSPWVSKSTPDASTSWGAAKSSVDTA 734
Query: 65 NDGQASGWGKSDS------KICSDGNASGALGKTVVPSGDSAGFT--------------- 124
N+GQASGWGKSDS K CSDGNASGA GKT VPSGDSAG T
Sbjct: 735 NNGQASGWGKSDSWGKTIAKTCSDGNASGAWGKTAVPSGDSAGLTEHTWDKWDKGKQVSS 794
Query: 125 ------------------------DSESGGWKKNQSANFGDDNAPVETSADRWGSKSRSS 184
D+ESGGWKK QSANF DD P E++ D ++ +
Sbjct: 795 DNQTGNWGDGTSGKNEHSAWSRDKDAESGGWKKTQSANFDDDKTPAESAGDCTNPEAENK 854
Query: 185 GSWGDQNASTTVSEIQPAGKGNAGAWNVGTAKDESGGWGKPKNVGDVGSSAWNKSTAGDG 244
+ N T++ Q + GN +DE+GGWGKPKNVG+ GSSAWNKST+GDG
Sbjct: 855 VNPSGWNEGTSMKGSQTSNWGN---------QDETGGWGKPKNVGNGGSSAWNKSTSGDG 914
Query: 245 DGQNDSWNKPKPSSHDGNVGKKEWGQGNEASDNGNKWQSSRSDGGKKWGTNEAEREGGSS 304
+NDSWNKPK SHD N+GKK WGQ NEASDNGNKWQSSRSDGG KWGTNE+E EGG
Sbjct: 915 AVENDSWNKPKLCSHDENIGKKGWGQSNEASDNGNKWQSSRSDGGTKWGTNESEHEGG-- 974
Query: 305 WNTSKSSDVGPASWKDKPDSSSLTAPKGDQWAEGWDKQHSSNDTKASDDNSSWNKKPVES 364
KDK DSSSLT P+GDQ GWDKQ SSNDTKAS++NS WNKK VES
Sbjct: 975 --------------KDKSDSSSLTTPRGDQSVGGWDKQRSSNDTKASEENSPWNKKSVES 1034
Query: 365 GKDGELKNQGSGWNVGKTSGGDSASGWGQTSKEADLSDQAGSWGSNWKKNSDTRNEDSSS 424
GKDGELKNQGSGWNVGKTSGGDSASGWGQ SKEA SD AG+WGSNWKKNSD NEDSS
Sbjct: 1035 GKDGELKNQGSGWNVGKTSGGDSASGWGQASKEAGSSDLAGNWGSNWKKNSDVGNEDSSL 1094
Query: 425 AKKSSWGSGSGNSNWGEKSNWNSGNEFNA--ITGGAEAQTDVSNDTSGYGSWKPESSDRG 484
AKKS+W SGSGNSNWGEKSNWNSGNE+NA TGGAEAQTDVSNDTSGYGSW+ E+SDRG
Sbjct: 1095 AKKSNWSSGSGNSNWGEKSNWNSGNEYNANHSTGGAEAQTDVSNDTSGYGSWREENSDRG 1154
Query: 485 GYRGRGGFRGRGERGRFGGRGRSDRGGFGRGGSDRGGFGGRGRGRWNSEGGSNDGENKGW 544
GYRGRGGFRGRGERGRFGGRGRSDRGGFG RGGFGGRGRGRWNSEGGSN G+NKGW
Sbjct: 1155 GYRGRGGFRGRGERGRFGGRGRSDRGGFG-----RGGFGGRGRGRWNSEGGSNGGDNKGW 1214
Query: 545 SGGGGGGSDNKGWGS-GGGGSDNKGWSSGGDGSNNKGWSTGGEGSGNKGGGDNKGWGSGS 604
S GGGG DN+GW S GGGG DNKGWSSGG G +NKGWS+G
Sbjct: 1215 SSGGGG--DNRGWSSGGGGGDDNKGWSSGGGGEDNKGWSSG------------------- 1274
Query: 605 GGSSDDKGWSGGGNGVGGGDNKGWGSGGGGSNDNKGWSGGDNKGWGSAGGSSDNKGWSSG 664
G GGGDNKGW SGGGG DNKGWSSG
Sbjct: 1275 --------------GAGGGDNKGWSSGGGG----------------------DNKGWSSG 1334
Query: 665 GSGGDNKGWSSGSGGGGDGCGDKGWSSGGGGDNKGWGGGGESGDKGWSSGGSREWEKSGS 724
G GGDNKGWSS GGGDNKGW GGG GGS +WE+SGS
Sbjct: 1335 G-GGDNKGWSS-----------------GGGDNKGWSGGG--------GGGSSDWERSGS 1394
Query: 725 DRGGFGGRGRGRWSSGSGSNDSDSGSGGWSGGGDRSDRGGGGFRGRGRGRWNQE-GSYDG 784
DRGGFGGRGRGRWSSGSGSND D G SDR GGFRGRGRGRWNQE GS+DG
Sbjct: 1395 DRGGFGGRGRGRWSSGSGSNDGDREKFG-------SDR--GGFRGRGRGRWNQEGGSHDG 1454
Query: 785 DNG---GQRGGYGGRGRGRWNQENGSNE---GGDNGGWSGGRGGFGSRGRGSWNQDDGGS 844
DNG G RGGYGGRGRGRWNQE SN+ GG GGWS G GG GSW
Sbjct: 1455 DNGGWRGGRGGYGGRGRGRWNQEGDSNDGDNGGGGGGWSSGGGG------GSW------- 1514
Query: 845 GGWSGGNGGRGGFGGRGRGRRNQDSSNDGNNDDKPASWSTGSGNSGGWNSGGGGAGSWNQ 904
G SG + G GGFGGRG G+ NQ+ GS N+G W+S GGGAGSWNQ
Sbjct: 1515 -GKSGSDRG-GGFGGRGSGKWNQEG---------------GSNNNGRWSS-GGGAGSWNQ 1574
Query: 905 GGDEKNQQQHSWKSSNDGG------QGSGWKEP-SGSDHNNWESSGSSGAGNSSGWNNST 964
GGDEKNQ Q SWK SNDGG QGSGWKEP SG++ NNW+SSGSSGA ++SGWN ST
Sbjct: 1575 GGDEKNQPQ-SWKPSNDGGDKFGGQQGSGWKEPTSGNEPNNWKSSGSSGADHNSGWNKST 1611
Query: 965 TGKETEESGGH-NSWNQTTKTDSQ--GGGWQKSASSWNAGTENQTVTKDVSSVSKDGGWG 1008
TGKE EES NSW+Q TK D GGGWQK AS WN GTENQTV DV++ +KDGGWG
Sbjct: 1635 TGKEIEESDSQKNSWSQATKVDDAHGGGGWQKPASGWNGGTENQTVKNDVNASAKDGGWG 1611
BLAST of CSPI02G16050 vs. ExPASy TrEMBL
Match:
A0A6J1JV02 (protein RNA-directed DNA methylation 3-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111489134 PE=4 SV=1)
HSP 1 Score: 913.7 bits (2360), Expect = 6.9e-262
Identity = 649/1089 (59.60%), Positives = 708/1089 (65.01%), Query Frame = 0
Query: 5 SPWPSFPESGTSNGPGSSSTNPFGS---DAKNDEDSPWISKLTPEASTSWGAAKSSVDTA 64
+PWPSFPES T NGPGSSSTNP GS DA DEDSPW+SK TP+ASTSWGAAKSSVDTA
Sbjct: 675 NPWPSFPESSTLNGPGSSSTNPIGSESFDANKDEDSPWVSKSTPDASTSWGAAKSSVDTA 734
Query: 65 NDGQASGWGKSDS------KICSDGNASGALGKTVVPSGDSAGFT--------------- 124
N+GQASGWGKSDS K CSDGNASGA GKT VPSGDSAG T
Sbjct: 735 NNGQASGWGKSDSWGKTIAKTCSDGNASGAWGKTAVPSGDSAGLTENTWDKWDKGKQVSS 794
Query: 125 ------------------------DSESGGWKKNQSANFGDDNAPVETSADRWGSKSRSS 184
D+ESGGWKK QSANF DD P E++ D ++ +
Sbjct: 795 DNQTGNWDNGTSGKNEHSAWSRDKDAESGGWKKTQSANFDDDKTPAESAGDWTNPEAENK 854
Query: 185 GSWGDQNASTTVSEIQPAGKGNAGAWNVGTAKDESGGWGKPKNVGDVGSSAWNKSTAGDG 244
+ N T++ Q + GN +DE+GGWGKPKNVG+ GSSAWNKST+GDG
Sbjct: 855 VNPSGWNEGTSMKGSQTSNWGN---------QDETGGWGKPKNVGNGGSSAWNKSTSGDG 914
Query: 245 DGQNDSWNKPKPSSHDGNVGKKEWGQGNEASDNGNKWQSSRSDGGKKWGTNEAEREGGSS 304
+NDSWNKPK SHD ++GKK WGQ NEASDNGNKWQSSRSDGG KWGTNE+E EGG
Sbjct: 915 AVENDSWNKPKLFSHDESIGKKGWGQSNEASDNGNKWQSSRSDGGTKWGTNESEHEGG-- 974
Query: 305 WNTSKSSDVGPASWKDKPDSSSLTAPKGDQWAEGWDKQHSSNDTKASDDNSSWNKKPVES 364
KDK DSSSLT P+GDQ GWDKQ SSNDTKAS++NS WNKK VES
Sbjct: 975 --------------KDKSDSSSLTTPRGDQSVGGWDKQRSSNDTKASEENSPWNKKSVES 1034
Query: 365 GKDGELKNQGSGWNVGKTSGGDSASGWGQTSKEADLSDQAGSWGSNWKKNSDTRNEDSSS 424
GKDGELKNQGSGWNVGKTSGGDSASGWGQ SKEA SD G+WGSNWKKNSD NEDSS
Sbjct: 1035 GKDGELKNQGSGWNVGKTSGGDSASGWGQASKEAGSSDLVGNWGSNWKKNSDVGNEDSSL 1094
Query: 425 AKKSSWGSGSGNSNWGEKSNWNSGNEFNA--ITGGAEAQTDVSNDTSGYGSWKPESSDRG 484
AKKS+W SGSGNSNWGEKSNWNSGNE+NA TGGAEAQTDVSNDTSGYGSW+ E+SDRG
Sbjct: 1095 AKKSNWSSGSGNSNWGEKSNWNSGNEYNANHSTGGAEAQTDVSNDTSGYGSWRGENSDRG 1154
Query: 485 GYRGRGGFRGRGERGRFGGRGRSDRGGFGRGGSDRGGFGGRGRGRWNSEGGSNDGENKGW 544
GYRGRGGFRGRGERGRFGGRGRSDRGGFG RGGFGGRGRGRWNSEGGSN G+NKGW
Sbjct: 1155 GYRGRGGFRGRGERGRFGGRGRSDRGGFG-----RGGFGGRGRGRWNSEGGSNGGDNKGW 1214
Query: 545 SGGGGGGSDNKGWGSGGGGSDNKGWSSGGDGSNNKGWSTGGEGSGNKGGGDNKGWGSGSG 604
S GGGGG DNKGW SGGGG DN+GWSSGG G +NKGWS+GG
Sbjct: 1215 SSGGGGG-DNKGWSSGGGGGDNRGWSSGGGGDDNKGWSSGG------------------- 1274
Query: 605 GSSDDKGWSGGGNGVGGGDNKGWGSGGGGSNDNKGWSGGDNKGWGSAGGSSDNKGWSSGG 664
GDNKGWG GG
Sbjct: 1275 -------------------------------------AGDNKGWG-------------GG 1334
Query: 665 SGGDNKGWSSGSGGGGDGCGDKGWSSGGGGDNKGWGGGGESGDKGWSSGGSREWEKSGSD 724
GGDNKGWSS GGGDNKGW GGG GGS +WEKSGSD
Sbjct: 1335 GGGDNKGWSS-----------------GGGDNKGWSGGG--------GGGSSDWEKSGSD 1394
Query: 725 RGGFGGRGRGRWSSGSGSNDSDSGSGGWSGGGDRSDRGGGGFRGRGRGRWNQE-GSYDGD 784
RGGFGGRGRGRWSSGSGS+D D G SDR GGFRGRGRGRWNQ+ GS+DGD
Sbjct: 1395 RGGFGGRGRGRWSSGSGSHDGDREKFG-------SDR--GGFRGRGRGRWNQDGGSHDGD 1454
Query: 785 NG---GQRGGYGGRGRGRWNQENGSNEGGDNGGWSGGRGGFGSRGRGSWNQDDGGSGGWS 844
NG G RGGYGGRGRGRWNQE SN+ GDNGG G W+ +GGS G S
Sbjct: 1455 NGGWRGGRGGYGGRGRGRWNQEGDSND-GDNGG-------------GGWSSGNGGSWGKS 1514
Query: 845 GGNGGRGGFGGRGRGRRNQDSSNDGNNDDKPASWSTGSGNSGGWNSGGGGAGSWNQGGDE 904
G + G GGFGGRG G+ NQ+ GS N+GGW+S GGGAGSWNQGGDE
Sbjct: 1515 GSDIG-GGFGGRGSGKWNQEG---------------GSNNNGGWSS-GGGAGSWNQGGDE 1574
Query: 905 KNQQQHSWKSSNDGG------QGSGWKEP-SGSDHNNWESSGSSGAGNSSGWNNSTTGKE 964
KNQ Q WK SNDGG QGSGWKEP SG++ NNW+SSGSSGA ++SGWN STTGKE
Sbjct: 1575 KNQAQ-GWKPSNDGGDKFGGQQGSGWKEPTSGNEPNNWKSSGSSGADHNSGWNKSTTGKE 1592
Query: 965 TEESGGH-NSWNQTTKTDSQ-GGGWQKSASSWNAGTENQTVTKDVSSVSKDGGWGKSA-- 1006
EES NSW+Q TK D+Q GGGWQK+AS WN TENQTV DV+ +KDGGWGK A
Sbjct: 1635 MEESDSQKNSWSQATKVDAQGGGGWQKAASGWNGETENQTVKSDVNVSAKDGGWGKPASG 1592
BLAST of CSPI02G16050 vs. NCBI nr
Match:
XP_031736955.1 (protein RNA-directed DNA methylation 3 isoform X1 [Cucumis sativus])
HSP 1 Score: 1804.3 bits (4672), Expect = 0.0e+00
Identity = 1002/1010 (99.21%), Positives = 1003/1010 (99.31%), Query Frame = 0
Query: 4 RSPWPSFPESGTSNGPGSSSTNPFGSDAKNDEDSPWISKLTPEASTSWGAAKSSVDTAND 63
RSPWPSFPESGTSNGPGSSSTNPFGSDAKNDEDSPWISKLTPEASTSWGAAKSSVDTAND
Sbjct: 680 RSPWPSFPESGTSNGPGSSSTNPFGSDAKNDEDSPWISKLTPEASTSWGAAKSSVDTAND 739
Query: 64 GQASGWGKSDSKICSDGNASGALGKTVVPSGDSAGFTDSESGGWKKNQSANFGDDNAPVE 123
GQASGWGKSDSKICSDGNASGALGKTVVPSGDSAGFTDSESGGWKKNQSANFGDDNAPVE
Sbjct: 740 GQASGWGKSDSKICSDGNASGALGKTVVPSGDSAGFTDSESGGWKKNQSANFGDDNAPVE 799
Query: 124 TSADRWGSKSRSSGSWGDQNASTTVSEIQPAGKGNAGAWNVGTAKDESGGWGKPKNVGDV 183
TSADRWGSKSRSSGSWGDQNASTTVSEIQPAGKGNAGAWNVGTAKDESGGWGKPKNVGDV
Sbjct: 800 TSADRWGSKSRSSGSWGDQNASTTVSEIQPAGKGNAGAWNVGTAKDESGGWGKPKNVGDV 859
Query: 184 GSSAWNKSTAGDGDGQNDSWNKPKPSSHDGNVGKKEWGQGNEASDNGNKWQSSRSDGGKK 243
GSSAWNKSTAGDGDGQN SWNKPKPS+HDGNVGKKEWGQGNEASDNGNKWQSSRSDGGKK
Sbjct: 860 GSSAWNKSTAGDGDGQNGSWNKPKPSNHDGNVGKKEWGQGNEASDNGNKWQSSRSDGGKK 919
Query: 244 WGTNEAEREGGSSWNTSKSSDVGPASWKDKPDSSSLTAPKGDQWAEGWDKQHSSNDTKAS 303
WGTNEAEREGGSSWNTSKSSDVGPASWKDKPDSSSLTAPKGDQWAEGWDKQHSSNDTKAS
Sbjct: 920 WGTNEAEREGGSSWNTSKSSDVGPASWKDKPDSSSLTAPKGDQWAEGWDKQHSSNDTKAS 979
Query: 304 DDNSSWNKKPVESGKDGELKNQGSGWNVGKTSGGDSASGWGQTSKEADLSDQAGSWGSNW 363
DDNSSWNKKPVESGKDGELKNQGSGWNVGKTSGGDSASGWGQTSKEADLSDQAGSWGSNW
Sbjct: 980 DDNSSWNKKPVESGKDGELKNQGSGWNVGKTSGGDSASGWGQTSKEADLSDQAGSWGSNW 1039
Query: 364 KKNSDTRNEDSSSAKKSSWGSGSGNSNWGEKSNWNSGNEFNAITGGAEAQTDVSNDTSGY 423
KKNSDTRNEDSSSAKKSSWGSGSGNSNWGEKSNWNSGNEFNAITGGAEAQTDVSNDTSGY
Sbjct: 1040 KKNSDTRNEDSSSAKKSSWGSGSGNSNWGEKSNWNSGNEFNAITGGAEAQTDVSNDTSGY 1099
Query: 424 GSWKPESSDRGGYRGRGGFRGRGERGRFGGRGRSDRGGFGRGGSDRGGFGGRGRGRWNSE 483
GSWKPESSDRGGYRGRGGFRGRGERGRFGGRGRSDRGGFGRGGSDRGGFGGRGRGRWNSE
Sbjct: 1100 GSWKPESSDRGGYRGRGGFRGRGERGRFGGRGRSDRGGFGRGGSDRGGFGGRGRGRWNSE 1159
Query: 484 GGSNDGENKGWSGGGGGGSDNKGWGSGGGGSDNKGWSSGGDGSNNKGWSTGGEGSGNKGG 543
GGSNDGENKGWSGGGGGGSDNKGWGSGGGGSDNKGWSSGGDGSNNKGWSTGGEGSGNKGG
Sbjct: 1160 GGSNDGENKGWSGGGGGGSDNKGWGSGGGGSDNKGWSSGGDGSNNKGWSTGGEGSGNKGG 1219
Query: 544 GDNKGWGSGSGGSSDDKGWSGGGNGVGGGDNKGWGSGGGGSNDNKGWSGGDNKGWGSAGG 603
GDNKGWGSGSGGSSDDKGWSGGGNGVGGGDNKGWGSGGGGSNDNKGWSGGDNKGWGSAGG
Sbjct: 1220 GDNKGWGSGSGGSSDDKGWSGGGNGVGGGDNKGWGSGGGGSNDNKGWSGGDNKGWGSAGG 1279
Query: 604 SSDNKGWSSGGSGGDNKGWSSGSGGGGDGCGDKGWSSGGGGDNKGWGGGGESGDKGWSSG 663
SSDNKGWSSGGSGGDNKGWSSGSGGGGDGCGDKGWSSGGGGDNKGWGGGGESGDKGWSSG
Sbjct: 1280 SSDNKGWSSGGSGGDNKGWSSGSGGGGDGCGDKGWSSGGGGDNKGWGGGGESGDKGWSSG 1339
Query: 664 GSREWEKSGSDRGGFGGRGRGRWSSGSGSNDSDSGSGGWSGGGDRSDRGGGGFRGRGRGR 723
GSREWEKSGSDRGGFGGRGRGRWSSGSGSNDSDSGSGGWSGGGDRSDR GGGFRGRGRGR
Sbjct: 1340 GSREWEKSGSDRGGFGGRGRGRWSSGSGSNDSDSGSGGWSGGGDRSDRSGGGFRGRGRGR 1399
Query: 724 WNQEGSYDGDNGGQRGGYGGRGRGRWNQENGSNEGGDNGGWSGGRGGFGSRGRGSWNQDD 783
WNQEGSYDGDNGGQRGGYGGRGRGRWNQENGSNEGGDNGGWSGGRGGFGSRGRGSWNQDD
Sbjct: 1400 WNQEGSYDGDNGGQRGGYGGRGRGRWNQENGSNEGGDNGGWSGGRGGFGSRGRGSWNQDD 1459
Query: 784 GGSGGWSGGNGGRGGFGGRGRGRRNQDSSNDGNNDDKPASWSTGSGNSGGWNS----GGG 843
GGSGGWSGGNGGRGGFGGRGRGRRNQDSSNDGNNDDKPASWSTGSGNSGGWNS GGG
Sbjct: 1460 GGSGGWSGGNGGRGGFGGRGRGRRNQDSSNDGNNDDKPASWSTGSGNSGGWNSGGGGGGG 1519
Query: 844 GAGSWNQGGDEKNQQQHSWKSSNDGGQGSGWKEPSGSDHNNWESSGSSGAGNSSGWNNST 903
GAGSWNQGGDEKNQQQHSWKSSNDGGQGSGWKEPSGSDHNNWESSGSSGAGNSSGWNNST
Sbjct: 1520 GAGSWNQGGDEKNQQQHSWKSSNDGGQGSGWKEPSGSDHNNWESSGSSGAGNSSGWNNST 1579
Query: 904 TGKETEESGGHNSWNQTTKTDSQGGGWQKSASSWNAGTENQTVTKDVSSVSKDGGWGKSA 963
TGKETEESGGHNSWNQTTKTDSQGGGWQKSASSWNAGTENQTVTKDVSSVSKDGGWGKSA
Sbjct: 1580 TGKETEESGGHNSWNQTTKTDSQGGGWQKSASSWNAGTENQTVTKDVSSVSKDGGWGKSA 1639
Query: 964 EPSTLDKEKANVGAQGGGAAGWEKPTSSWNTEQSRGENNSGGGRGGARGK 1010
EPSTLDKE ANVGAQGGGAAGWEKPTSSWNTEQSRGENNSGGGRGGARGK
Sbjct: 1640 EPSTLDKEIANVGAQGGGAAGWEKPTSSWNTEQSRGENNSGGGRGGARGK 1689
BLAST of CSPI02G16050 vs. NCBI nr
Match:
XP_031736956.1 (protein RNA-directed DNA methylation 3 isoform X2 [Cucumis sativus])
HSP 1 Score: 1804.3 bits (4672), Expect = 0.0e+00
Identity = 1002/1010 (99.21%), Positives = 1003/1010 (99.31%), Query Frame = 0
Query: 4 RSPWPSFPESGTSNGPGSSSTNPFGSDAKNDEDSPWISKLTPEASTSWGAAKSSVDTAND 63
RSPWPSFPESGTSNGPGSSSTNPFGSDAKNDEDSPWISKLTPEASTSWGAAKSSVDTAND
Sbjct: 620 RSPWPSFPESGTSNGPGSSSTNPFGSDAKNDEDSPWISKLTPEASTSWGAAKSSVDTAND 679
Query: 64 GQASGWGKSDSKICSDGNASGALGKTVVPSGDSAGFTDSESGGWKKNQSANFGDDNAPVE 123
GQASGWGKSDSKICSDGNASGALGKTVVPSGDSAGFTDSESGGWKKNQSANFGDDNAPVE
Sbjct: 680 GQASGWGKSDSKICSDGNASGALGKTVVPSGDSAGFTDSESGGWKKNQSANFGDDNAPVE 739
Query: 124 TSADRWGSKSRSSGSWGDQNASTTVSEIQPAGKGNAGAWNVGTAKDESGGWGKPKNVGDV 183
TSADRWGSKSRSSGSWGDQNASTTVSEIQPAGKGNAGAWNVGTAKDESGGWGKPKNVGDV
Sbjct: 740 TSADRWGSKSRSSGSWGDQNASTTVSEIQPAGKGNAGAWNVGTAKDESGGWGKPKNVGDV 799
Query: 184 GSSAWNKSTAGDGDGQNDSWNKPKPSSHDGNVGKKEWGQGNEASDNGNKWQSSRSDGGKK 243
GSSAWNKSTAGDGDGQN SWNKPKPS+HDGNVGKKEWGQGNEASDNGNKWQSSRSDGGKK
Sbjct: 800 GSSAWNKSTAGDGDGQNGSWNKPKPSNHDGNVGKKEWGQGNEASDNGNKWQSSRSDGGKK 859
Query: 244 WGTNEAEREGGSSWNTSKSSDVGPASWKDKPDSSSLTAPKGDQWAEGWDKQHSSNDTKAS 303
WGTNEAEREGGSSWNTSKSSDVGPASWKDKPDSSSLTAPKGDQWAEGWDKQHSSNDTKAS
Sbjct: 860 WGTNEAEREGGSSWNTSKSSDVGPASWKDKPDSSSLTAPKGDQWAEGWDKQHSSNDTKAS 919
Query: 304 DDNSSWNKKPVESGKDGELKNQGSGWNVGKTSGGDSASGWGQTSKEADLSDQAGSWGSNW 363
DDNSSWNKKPVESGKDGELKNQGSGWNVGKTSGGDSASGWGQTSKEADLSDQAGSWGSNW
Sbjct: 920 DDNSSWNKKPVESGKDGELKNQGSGWNVGKTSGGDSASGWGQTSKEADLSDQAGSWGSNW 979
Query: 364 KKNSDTRNEDSSSAKKSSWGSGSGNSNWGEKSNWNSGNEFNAITGGAEAQTDVSNDTSGY 423
KKNSDTRNEDSSSAKKSSWGSGSGNSNWGEKSNWNSGNEFNAITGGAEAQTDVSNDTSGY
Sbjct: 980 KKNSDTRNEDSSSAKKSSWGSGSGNSNWGEKSNWNSGNEFNAITGGAEAQTDVSNDTSGY 1039
Query: 424 GSWKPESSDRGGYRGRGGFRGRGERGRFGGRGRSDRGGFGRGGSDRGGFGGRGRGRWNSE 483
GSWKPESSDRGGYRGRGGFRGRGERGRFGGRGRSDRGGFGRGGSDRGGFGGRGRGRWNSE
Sbjct: 1040 GSWKPESSDRGGYRGRGGFRGRGERGRFGGRGRSDRGGFGRGGSDRGGFGGRGRGRWNSE 1099
Query: 484 GGSNDGENKGWSGGGGGGSDNKGWGSGGGGSDNKGWSSGGDGSNNKGWSTGGEGSGNKGG 543
GGSNDGENKGWSGGGGGGSDNKGWGSGGGGSDNKGWSSGGDGSNNKGWSTGGEGSGNKGG
Sbjct: 1100 GGSNDGENKGWSGGGGGGSDNKGWGSGGGGSDNKGWSSGGDGSNNKGWSTGGEGSGNKGG 1159
Query: 544 GDNKGWGSGSGGSSDDKGWSGGGNGVGGGDNKGWGSGGGGSNDNKGWSGGDNKGWGSAGG 603
GDNKGWGSGSGGSSDDKGWSGGGNGVGGGDNKGWGSGGGGSNDNKGWSGGDNKGWGSAGG
Sbjct: 1160 GDNKGWGSGSGGSSDDKGWSGGGNGVGGGDNKGWGSGGGGSNDNKGWSGGDNKGWGSAGG 1219
Query: 604 SSDNKGWSSGGSGGDNKGWSSGSGGGGDGCGDKGWSSGGGGDNKGWGGGGESGDKGWSSG 663
SSDNKGWSSGGSGGDNKGWSSGSGGGGDGCGDKGWSSGGGGDNKGWGGGGESGDKGWSSG
Sbjct: 1220 SSDNKGWSSGGSGGDNKGWSSGSGGGGDGCGDKGWSSGGGGDNKGWGGGGESGDKGWSSG 1279
Query: 664 GSREWEKSGSDRGGFGGRGRGRWSSGSGSNDSDSGSGGWSGGGDRSDRGGGGFRGRGRGR 723
GSREWEKSGSDRGGFGGRGRGRWSSGSGSNDSDSGSGGWSGGGDRSDR GGGFRGRGRGR
Sbjct: 1280 GSREWEKSGSDRGGFGGRGRGRWSSGSGSNDSDSGSGGWSGGGDRSDRSGGGFRGRGRGR 1339
Query: 724 WNQEGSYDGDNGGQRGGYGGRGRGRWNQENGSNEGGDNGGWSGGRGGFGSRGRGSWNQDD 783
WNQEGSYDGDNGGQRGGYGGRGRGRWNQENGSNEGGDNGGWSGGRGGFGSRGRGSWNQDD
Sbjct: 1340 WNQEGSYDGDNGGQRGGYGGRGRGRWNQENGSNEGGDNGGWSGGRGGFGSRGRGSWNQDD 1399
Query: 784 GGSGGWSGGNGGRGGFGGRGRGRRNQDSSNDGNNDDKPASWSTGSGNSGGWNS----GGG 843
GGSGGWSGGNGGRGGFGGRGRGRRNQDSSNDGNNDDKPASWSTGSGNSGGWNS GGG
Sbjct: 1400 GGSGGWSGGNGGRGGFGGRGRGRRNQDSSNDGNNDDKPASWSTGSGNSGGWNSGGGGGGG 1459
Query: 844 GAGSWNQGGDEKNQQQHSWKSSNDGGQGSGWKEPSGSDHNNWESSGSSGAGNSSGWNNST 903
GAGSWNQGGDEKNQQQHSWKSSNDGGQGSGWKEPSGSDHNNWESSGSSGAGNSSGWNNST
Sbjct: 1460 GAGSWNQGGDEKNQQQHSWKSSNDGGQGSGWKEPSGSDHNNWESSGSSGAGNSSGWNNST 1519
Query: 904 TGKETEESGGHNSWNQTTKTDSQGGGWQKSASSWNAGTENQTVTKDVSSVSKDGGWGKSA 963
TGKETEESGGHNSWNQTTKTDSQGGGWQKSASSWNAGTENQTVTKDVSSVSKDGGWGKSA
Sbjct: 1520 TGKETEESGGHNSWNQTTKTDSQGGGWQKSASSWNAGTENQTVTKDVSSVSKDGGWGKSA 1579
Query: 964 EPSTLDKEKANVGAQGGGAAGWEKPTSSWNTEQSRGENNSGGGRGGARGK 1010
EPSTLDKE ANVGAQGGGAAGWEKPTSSWNTEQSRGENNSGGGRGGARGK
Sbjct: 1580 EPSTLDKEIANVGAQGGGAAGWEKPTSSWNTEQSRGENNSGGGRGGARGK 1629
BLAST of CSPI02G16050 vs. NCBI nr
Match:
KAE8652013.1 (hypothetical protein Csa_006830 [Cucumis sativus])
HSP 1 Score: 1802.3 bits (4667), Expect = 0.0e+00
Identity = 1001/1009 (99.21%), Positives = 1002/1009 (99.31%), Query Frame = 0
Query: 5 SPWPSFPESGTSNGPGSSSTNPFGSDAKNDEDSPWISKLTPEASTSWGAAKSSVDTANDG 64
SPWPSFPESGTSNGPGSSSTNPFGSDAKNDEDSPWISKLTPEASTSWGAAKSSVDTANDG
Sbjct: 620 SPWPSFPESGTSNGPGSSSTNPFGSDAKNDEDSPWISKLTPEASTSWGAAKSSVDTANDG 679
Query: 65 QASGWGKSDSKICSDGNASGALGKTVVPSGDSAGFTDSESGGWKKNQSANFGDDNAPVET 124
QASGWGKSDSKICSDGNASGALGKTVVPSGDSAGFTDSESGGWKKNQSANFGDDNAPVET
Sbjct: 680 QASGWGKSDSKICSDGNASGALGKTVVPSGDSAGFTDSESGGWKKNQSANFGDDNAPVET 739
Query: 125 SADRWGSKSRSSGSWGDQNASTTVSEIQPAGKGNAGAWNVGTAKDESGGWGKPKNVGDVG 184
SADRWGSKSRSSGSWGDQNASTTVSEIQPAGKGNAGAWNVGTAKDESGGWGKPKNVGDVG
Sbjct: 740 SADRWGSKSRSSGSWGDQNASTTVSEIQPAGKGNAGAWNVGTAKDESGGWGKPKNVGDVG 799
Query: 185 SSAWNKSTAGDGDGQNDSWNKPKPSSHDGNVGKKEWGQGNEASDNGNKWQSSRSDGGKKW 244
SSAWNKSTAGDGDGQN SWNKPKPS+HDGNVGKKEWGQGNEASDNGNKWQSSRSDGGKKW
Sbjct: 800 SSAWNKSTAGDGDGQNGSWNKPKPSNHDGNVGKKEWGQGNEASDNGNKWQSSRSDGGKKW 859
Query: 245 GTNEAEREGGSSWNTSKSSDVGPASWKDKPDSSSLTAPKGDQWAEGWDKQHSSNDTKASD 304
GTNEAEREGGSSWNTSKSSDVGPASWKDKPDSSSLTAPKGDQWAEGWDKQHSSNDTKASD
Sbjct: 860 GTNEAEREGGSSWNTSKSSDVGPASWKDKPDSSSLTAPKGDQWAEGWDKQHSSNDTKASD 919
Query: 305 DNSSWNKKPVESGKDGELKNQGSGWNVGKTSGGDSASGWGQTSKEADLSDQAGSWGSNWK 364
DNSSWNKKPVESGKDGELKNQGSGWNVGKTSGGDSASGWGQTSKEADLSDQAGSWGSNWK
Sbjct: 920 DNSSWNKKPVESGKDGELKNQGSGWNVGKTSGGDSASGWGQTSKEADLSDQAGSWGSNWK 979
Query: 365 KNSDTRNEDSSSAKKSSWGSGSGNSNWGEKSNWNSGNEFNAITGGAEAQTDVSNDTSGYG 424
KNSDTRNEDSSSAKKSSWGSGSGNSNWGEKSNWNSGNEFNAITGGAEAQTDVSNDTSGYG
Sbjct: 980 KNSDTRNEDSSSAKKSSWGSGSGNSNWGEKSNWNSGNEFNAITGGAEAQTDVSNDTSGYG 1039
Query: 425 SWKPESSDRGGYRGRGGFRGRGERGRFGGRGRSDRGGFGRGGSDRGGFGGRGRGRWNSEG 484
SWKPESSDRGGYRGRGGFRGRGERGRFGGRGRSDRGGFGRGGSDRGGFGGRGRGRWNSEG
Sbjct: 1040 SWKPESSDRGGYRGRGGFRGRGERGRFGGRGRSDRGGFGRGGSDRGGFGGRGRGRWNSEG 1099
Query: 485 GSNDGENKGWSGGGGGGSDNKGWGSGGGGSDNKGWSSGGDGSNNKGWSTGGEGSGNKGGG 544
GSNDGENKGWSGGGGGGSDNKGWGSGGGGSDNKGWSSGGDGSNNKGWSTGGEGSGNKGGG
Sbjct: 1100 GSNDGENKGWSGGGGGGSDNKGWGSGGGGSDNKGWSSGGDGSNNKGWSTGGEGSGNKGGG 1159
Query: 545 DNKGWGSGSGGSSDDKGWSGGGNGVGGGDNKGWGSGGGGSNDNKGWSGGDNKGWGSAGGS 604
DNKGWGSGSGGSSDDKGWSGGGNGVGGGDNKGWGSGGGGSNDNKGWSGGDNKGWGSAGGS
Sbjct: 1160 DNKGWGSGSGGSSDDKGWSGGGNGVGGGDNKGWGSGGGGSNDNKGWSGGDNKGWGSAGGS 1219
Query: 605 SDNKGWSSGGSGGDNKGWSSGSGGGGDGCGDKGWSSGGGGDNKGWGGGGESGDKGWSSGG 664
SDNKGWSSGGSGGDNKGWSSGSGGGGDGCGDKGWSSGGGGDNKGWGGGGESGDKGWSSGG
Sbjct: 1220 SDNKGWSSGGSGGDNKGWSSGSGGGGDGCGDKGWSSGGGGDNKGWGGGGESGDKGWSSGG 1279
Query: 665 SREWEKSGSDRGGFGGRGRGRWSSGSGSNDSDSGSGGWSGGGDRSDRGGGGFRGRGRGRW 724
SREWEKSGSDRGGFGGRGRGRWSSGSGSNDSDSGSGGWSGGGDRSDR GGGFRGRGRGRW
Sbjct: 1280 SREWEKSGSDRGGFGGRGRGRWSSGSGSNDSDSGSGGWSGGGDRSDRSGGGFRGRGRGRW 1339
Query: 725 NQEGSYDGDNGGQRGGYGGRGRGRWNQENGSNEGGDNGGWSGGRGGFGSRGRGSWNQDDG 784
NQEGSYDGDNGGQRGGYGGRGRGRWNQENGSNEGGDNGGWSGGRGGFGSRGRGSWNQDDG
Sbjct: 1340 NQEGSYDGDNGGQRGGYGGRGRGRWNQENGSNEGGDNGGWSGGRGGFGSRGRGSWNQDDG 1399
Query: 785 GSGGWSGGNGGRGGFGGRGRGRRNQDSSNDGNNDDKPASWSTGSGNSGGWNS----GGGG 844
GSGGWSGGNGGRGGFGGRGRGRRNQDSSNDGNNDDKPASWSTGSGNSGGWNS GGGG
Sbjct: 1400 GSGGWSGGNGGRGGFGGRGRGRRNQDSSNDGNNDDKPASWSTGSGNSGGWNSGGGGGGGG 1459
Query: 845 AGSWNQGGDEKNQQQHSWKSSNDGGQGSGWKEPSGSDHNNWESSGSSGAGNSSGWNNSTT 904
AGSWNQGGDEKNQQQHSWKSSNDGGQGSGWKEPSGSDHNNWESSGSSGAGNSSGWNNSTT
Sbjct: 1460 AGSWNQGGDEKNQQQHSWKSSNDGGQGSGWKEPSGSDHNNWESSGSSGAGNSSGWNNSTT 1519
Query: 905 GKETEESGGHNSWNQTTKTDSQGGGWQKSASSWNAGTENQTVTKDVSSVSKDGGWGKSAE 964
GKETEESGGHNSWNQTTKTDSQGGGWQKSASSWNAGTENQTVTKDVSSVSKDGGWGKSAE
Sbjct: 1520 GKETEESGGHNSWNQTTKTDSQGGGWQKSASSWNAGTENQTVTKDVSSVSKDGGWGKSAE 1579
Query: 965 PSTLDKEKANVGAQGGGAAGWEKPTSSWNTEQSRGENNSGGGRGGARGK 1010
PSTLDKE ANVGAQGGGAAGWEKPTSSWNTEQSRGENNSGGGRGGARGK
Sbjct: 1580 PSTLDKEIANVGAQGGGAAGWEKPTSSWNTEQSRGENNSGGGRGGARGK 1628
BLAST of CSPI02G16050 vs. NCBI nr
Match:
TYJ99969.1 (protein RNA-directed DNA methylation 3 [Cucumis melo var. makuwa])
HSP 1 Score: 1544.3 bits (3997), Expect = 0.0e+00
Identity = 915/1065 (85.92%), Positives = 939/1065 (88.17%), Query Frame = 0
Query: 4 RSPWPSFPESGTSNGPGSSSTNPFGSDAKNDEDSPWISKLTPEASTSWGAAKSSVDTAND 63
RSPWPSFPESGTSNGPGSSSTNPFGSDA NDEDSPWISK TPEASTSWGAAKSSVDTAND
Sbjct: 679 RSPWPSFPESGTSNGPGSSSTNPFGSDAINDEDSPWISKSTPEASTSWGAAKSSVDTAND 738
Query: 64 GQASGWGKSDSKICSDGNASGALGKTVVPSGDSAGFTDSESGGWKKNQSANFGDDNAPVE 123
GQASGWGK DSK CSDGNASGA GKTV PSG SAGFTDSESGGWKKNQSANFGDD P E
Sbjct: 739 GQASGWGKGDSKTCSDGNASGAWGKTVAPSGHSAGFTDSESGGWKKNQSANFGDDKTPAE 798
Query: 124 TSADRWGSKSRSSGSWGDQNASTTVSEIQPAGKGNAGAWNVGTAKDESGGWGKPKNVGDV 183
T+ADRWGSKSRSSGSWGDQNASTTVSE+QPAGKGNAGAWN GTA+DESGGWGKPKN GDV
Sbjct: 799 TAADRWGSKSRSSGSWGDQNASTTVSEVQPAGKGNAGAWNEGTAQDESGGWGKPKNFGDV 858
Query: 184 GSSAWNKSTAGDGDGQNDSWNKPKPSSHDGNVGKKEWGQGNEASDNGNKWQSSRSDGGKK 243
GSSAWNKSTAGD DG+NDSWNKPKPSSHDG+VGKKEWGQGNEASDNGNKWQSSRSDGGKK
Sbjct: 859 GSSAWNKSTAGDRDGENDSWNKPKPSSHDGSVGKKEWGQGNEASDNGNKWQSSRSDGGKK 918
Query: 244 WGTNEAEREGGSSWNTSKSSDVGPASWKDKPDSSSLTAPKGDQWAEGWDKQHSSNDTKAS 303
WG+NEAE EGGSSWNTSKSSDV AS KDKPDSSSLTAPKGDQWA GWDKQHSSNDTKAS
Sbjct: 919 WGSNEAEPEGGSSWNTSKSSDVVSASRKDKPDSSSLTAPKGDQWAGGWDKQHSSNDTKAS 978
Query: 304 DDNSSWNKKPVESGKDGELKNQGSGWNVGKTSGGDSASGWGQTSKEADLSDQAGSWGSNW 363
DDNS WNKK VESGKDGE+KNQGSGWNVGKTSGGDSASGWGQTSKEA LSDQAGSWGSNW
Sbjct: 979 DDNSPWNKKSVESGKDGEIKNQGSGWNVGKTSGGDSASGWGQTSKEAGLSDQAGSWGSNW 1038
Query: 364 KKNSDTRNEDSSSAKKSSWGSGSGNSNWGEKSNWNSGNEFNAITGGAEAQTDVSNDTSGY 423
KKNS NEDSSSAKKSSWGSG GNSNWGEKSNWNSGNEFNA TGGAEAQTDVSNDTS Y
Sbjct: 1039 KKNSGAGNEDSSSAKKSSWGSGGGNSNWGEKSNWNSGNEFNATTGGAEAQTDVSNDTSSY 1098
Query: 424 GSWKPESSDRGGYRGRGGFRGRGERGRFGGRGRSDRGGFGRGGSDRGGFGGRGRGRWNSE 483
GSWKPESSDRGGYRGRGGFRGRGERGRFGGRGRSDRGGFGRGGSDRGGFGGRGRGRWNSE
Sbjct: 1099 GSWKPESSDRGGYRGRGGFRGRGERGRFGGRGRSDRGGFGRGGSDRGGFGGRGRGRWNSE 1158
Query: 484 GGSNDGENKGWSGGGG---------------------GGSDNKGWGSGGGGSDNKGWSSG 543
GGSNDGENKGWS GGG GGSD KGWGSGGGGSDNKGWSSG
Sbjct: 1159 GGSNDGENKGWSAGGGNDNKGWSSGGVSDSKGWSSGDGGSDIKGWGSGGGGSDNKGWSSG 1218
Query: 544 GDGSNNKGWSTGGEGSGNK-------GGGDNKGWGSGSGGSSDDKGWSGGGNGVGGGDNK 603
GDGS+NKGWS+GG G NK GGGD+KGWGS GGSSD+KGWS G+GVGG DNK
Sbjct: 1219 GDGSDNKGWSSGGSGGDNKGWSTGGEGGGDSKGWGSSGGGSSDNKGWS-SGSGVGGHDNK 1278
Query: 604 GWGSGGGGSNDNKGWS-------GGDNKGWGSAGGSSDNKGWSSGG----SGGDNKGWSS 663
GWGSGGGGS+DNKGWS GGDNKGWGS GG SDNKGW+SGG GGDNKGW S
Sbjct: 1279 GWGSGGGGSSDNKGWSSGGSGVAGGDNKGWGSGGGGSDNKGWNSGGESGVDGGDNKGWGS 1338
Query: 664 -------------GSGGGGDGCGDKGWSSGGGGDNKGWGGGGESGD-KGWSSGGSREWEK 723
GSGGGGDG GDKGWSSGGGG+NKGWGGGGESGD KGWSSGGSREWEK
Sbjct: 1339 GGGGNNKGWSSGGGSGGGGDGTGDKGWSSGGGGNNKGWGGGGESGDNKGWSSGGSREWEK 1398
Query: 724 SGSDRGGFGGRGRGRWSSGSGSNDSDSGSGGWS-GGGDR----SDRGGGGFRGRGRGRWN 783
SGSD GGFGGRGRGRWSSGSGSND DSGSGGWS GGGDR S+RGGGGFRGRGRGRWN
Sbjct: 1399 SGSDGGGFGGRGRGRWSSGSGSNDGDSGSGGWSGGGGDREKFGSERGGGGFRGRGRGRWN 1458
Query: 784 QE-GSYDGDNGGQRGGYGGRGRGRWNQENGSNEGGDNGGWSGGRGGFGSRGRGSWNQDDG 843
QE GSYDGDNGG+RGGYGGRGRGRWNQENGSNEGGDN GGRGGFG RGRG WNQDDG
Sbjct: 1459 QEGGSYDGDNGGRRGGYGGRGRGRWNQENGSNEGGDN----GGRGGFGGRGRGRWNQDDG 1518
Query: 844 GSGGWSGGNGGRGGFGGRGRGRRNQDSSNDGNNDDKPASWSTGSGNSGGWNSGGGGAGSW 903
GSGGWSGG GGRGGFGGRGRGRRNQD SN+ NNDDKPASWS GSGNSGGW+S GGAGSW
Sbjct: 1519 GSGGWSGG-GGRGGFGGRGRGRRNQDGSNESNNDDKPASWSAGSGNSGGWSS-SGGAGSW 1578
Query: 904 NQGGDEKNQQQHSWKSSNDGGQGSGWKEPSGSDHNNWESSGSSGAGNSSGWNNSTTGKET 963
NQGGDEKN+QQHSWKSSNDGGQGSGWKEPSG+DHNNW+SSGSSGAGNSSGWNNST KE
Sbjct: 1579 NQGGDEKNEQQHSWKSSNDGGQGSGWKEPSGNDHNNWKSSGSSGAGNSSGWNNSTAAKEM 1638
Query: 964 EESGGHNSWNQTTKTDSQGGGWQKSASSWNAGTENQTVTKDVSSVSKDGGWGKSAEPSTL 1010
EESGG NSW+Q+TKTDSQGGGWQKSASSWNAGTENQT TKDVSS SKDGGWGKS EPSTL
Sbjct: 1639 EESGGQNSWSQSTKTDSQGGGWQKSASSWNAGTENQTATKDVSSGSKDGGWGKSVEPSTL 1698
BLAST of CSPI02G16050 vs. NCBI nr
Match:
KAG6578795.1 (Protein RNA-directed DNA methylation 3, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 954.5 bits (2466), Expect = 7.2e-274
Identity = 686/1106 (62.03%), Positives = 745/1106 (67.36%), Query Frame = 0
Query: 5 SPWPSFPESGTSNGPGSSSTNPFGS---DAKNDEDSPWISKLTPEASTSWGAAKSSVDTA 64
+PWPSFPES T NGPGSSSTNP GS DAK DEDSPW+SK TP+ASTSWGAAKSSVDTA
Sbjct: 675 NPWPSFPESSTLNGPGSSSTNPIGSESFDAKKDEDSPWVSKSTPDASTSWGAAKSSVDTA 734
Query: 65 NDGQASGWGKSDS------KICSDGNASGALGKTVVPSGDSAGFT--------------- 124
N+GQASGWGKSDS K CSDGNASGA GKT VPSGDSAG T
Sbjct: 735 NNGQASGWGKSDSWGKTIAKTCSDGNASGAWGKTAVPSGDSAGLTEHTWDKWDKGKQVSS 794
Query: 125 ------------------------DSESGGWKKNQSANFGDDNAPVETSADRWGSKSRSS 184
D+ESGGWKK QSANF DD P E++ D ++ +
Sbjct: 795 DNQTGNWGDGTSGKNEHSAWSRDKDAESGGWKKTQSANFDDDKTPAESAGDWTNPEAENK 854
Query: 185 GSWGDQNASTTVSEIQPAGKGNAGAWNVGTAKDESGGWGKPKNVGDVGSSAWNKSTAGDG 244
+ N T++ Q + GN +DE+GGWGKPKNVG+ GSSAWNKST+GDG
Sbjct: 855 VNPSGWNEGTSMKGSQTSNWGN---------QDETGGWGKPKNVGNGGSSAWNKSTSGDG 914
Query: 245 DGQNDSWNKPKPSSHDGNVGKKEWGQGNEASDNGNKWQSSRSDGGKKWGTNEAEREGGSS 304
+NDSWNKPK +HD ++GKK WGQ NEASDNGNKWQSSRSDGG KWGTNE+E EGG
Sbjct: 915 AVENDSWNKPKLCNHDESIGKKGWGQSNEASDNGNKWQSSRSDGGTKWGTNESEHEGG-- 974
Query: 305 WNTSKSSDVGPASWKDKPDSSSLTAPKGDQWAEGWDKQHSSNDTKASDDNSSWNKKPVES 364
KDK DSSSLT P+GDQ GWDKQ SSNDTKAS++NS WNKK VES
Sbjct: 975 --------------KDKSDSSSLTTPRGDQSVGGWDKQRSSNDTKASEENSPWNKKSVES 1034
Query: 365 GKDGELKNQGSGWNVGKTSGGDSASGWGQTSKEADLSDQAGSWGSNWKKNSDTRNEDSSS 424
GKDGELKNQGSGWNVGKTSGGDSASGWGQ SKEA SD AG+WGSNWKKNSD NEDSS
Sbjct: 1035 GKDGELKNQGSGWNVGKTSGGDSASGWGQASKEAGSSDLAGNWGSNWKKNSDVGNEDSSL 1094
Query: 425 AKKSSWGSGSGNSNWGEKSNWNSGNEFNA--ITGGAEAQTDVSNDTSGYGSWKPESSDRG 484
AKKS+W SGSGNSNWGEKSNWNSGNE+NA TGGAEAQTDVSNDTSGYGSW+ E+SDRG
Sbjct: 1095 AKKSNWSSGSGNSNWGEKSNWNSGNEYNANHSTGGAEAQTDVSNDTSGYGSWRGENSDRG 1154
Query: 485 GYRGRGGFRGRGERGRFGGRGRSDRGGFGRGGSDRGGFGGRGRGRWNSEGGSNDGENKGW 544
GYRGRGGFRGRGERGRFGGRGRSDRGGFG RGGFGGRGRGRWNSEGGSN G+NKGW
Sbjct: 1155 GYRGRGGFRGRGERGRFGGRGRSDRGGFG-----RGGFGGRGRGRWNSEGGSNGGDNKGW 1214
Query: 545 SGGGGGGSDNKGWGS-GGGGSDNKGWSSGGDGSNNKGWSTGGEGSGNKGGGDNKGWGSGS 604
S GGGG DN+GW S GGGG DNKGWSSGG G +NKGWS+GG GGGDNKGW SG
Sbjct: 1215 SSGGGG--DNRGWSSGGGGGDDNKGWSSGGGGDDNKGWSSGG-----AGGGDNKGWSSG- 1274
Query: 605 GGSSDDKGWSGGGNGVGGGDNKGWGSGGGGSNDNKGWSGGDNKGWGSAGGSSDNKGWSSG 664
GGGDNKGW SGGGG DNKGWSGG GG DNKGWSSG
Sbjct: 1275 ----------------GGGDNKGWSSGGGG--DNKGWSGG--------GG--DNKGWSSG 1334
Query: 665 GSGGDNKGWSSGSGGGGDGCGDKGWSSGGGGDNKGWGGGGESGDKGWSSGGSREWEKSGS 724
GGDNKGWSS GGGDNKGW GGG G GGS +WEKSGS
Sbjct: 1335 --GGDNKGWSS-----------------GGGDNKGWSGGGGGG------GGSSDWEKSGS 1394
Query: 725 DRGGFGGRGRGRWSSGSGSNDSDSGSGGWSGGGDRSDRGGGGFRGRGRGRWNQE-GSYDG 784
DRGGFGGRGRGRWSSGS SND D G SDR GGF+GRGRGRWNQE GS+DG
Sbjct: 1395 DRGGFGGRGRGRWSSGSSSNDGDREKFG-------SDR--GGFKGRGRGRWNQEGGSHDG 1454
Query: 785 DNG---GQRGGYGGRGRGRWNQENGSNEGGDNGGWSGGRGGFGSRGRGSWNQDDGGSGGW 844
DNG G RGGYGGRGRGRWNQE SN+ GDNGG GG+ S G GGS G
Sbjct: 1455 DNGGWRGGRGGYGGRGRGRWNQEGDSND-GDNGG-----GGWSSCG-------GGGSWGK 1514
Query: 845 SGGNGGRGGFGGRGRGRRNQDSSNDGNNDDKPASWSTGSGNSGGWNSGGGGAGSWNQGGD 904
SG + G GGFGGRG G+ NQ+ GS N+GGW+S GGGAGSWNQGGD
Sbjct: 1515 SGSDRG-GGFGGRGSGKWNQEG---------------GSNNNGGWSS-GGGAGSWNQGGD 1574
Query: 905 EKNQQQHSWKSSNDGG------QGSGWKEP-SGSDHNNWESSGSSGAGNSSGWNNSTTGK 964
+KNQ Q SWK SNDGG QGSGWKEP SG++ NNW+SSGSSGA ++SGWN STTGK
Sbjct: 1575 DKNQPQ-SWKPSNDGGDKFGGQQGSGWKEPTSGNEPNNWKSSGSSGADHNSGWNKSTTGK 1634
Query: 965 ETEESGGH-NSWNQTTKTDSQ-GGGWQKSASSWNAGTENQTVTKDVSSVSKDGGWGKSA- 1003
E EES G NSW+Q TK D+Q GGGWQK AS WN GTENQTV DV++ +KDGGWGK A
Sbjct: 1635 EIEESDGQKNSWSQATKVDAQGGGGWQKPASGWNGGTENQTVKNDVNASAKDGGWGKPAS 1645
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
A0A5D3BLU8 | 0.0e+00 | 85.92 | Protein RNA-directed DNA methylation 3 OS=Cucumis melo var. makuwa OX=1194695 GN...
A0A0A0LN22 | 0.0e+00 | 78.43 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G325960 PE=4 SV=1
A0A1S4E184 | 2.4e-267 | 86.37 | protein RNA-directed DNA methylation 3 OS=Cucumis melo OX=3656 GN=LOC103496462 P...
A0A6J1FGX3 | 2.1e-266 | 60.68 | protein RNA-directed DNA methylation 3-like OS=Cucurbita moschata OX=3662 GN=LOC...
A0A6J1JV02 | 6.9e-262 | 59.60 | protein RNA-directed DNA methylation 3-like isoform X2 OS=Cucurbita maxima OX=36...
Match Name | E-value | Identity | Description
XP_031736955.1 | 0.0e+00 | 99.21 | protein RNA-directed DNA methylation 3 isoform X1 [Cucumis sativus]
XP_031736956.1 | 0.0e+00 | 99.21 | protein RNA-directed DNA methylation 3 isoform X2 [Cucumis sativus]
KAE8652013.1 | 0.0e+00 | 99.21 | hypothetical protein Csa_006830 [Cucumis sativus]
TYJ99969.1 | 0.0e+00 | 85.92 | protein RNA-directed DNA methylation 3 [Cucumis melo var. makuwa]
KAG6578795.1 | 7.2e-274 | 62.03 | Protein RNA-directed DNA methylation 3, partial [Cucurbita argyrosperma subsp. s...
Match Name | E-value | Identity | Description | |