Homology
BLAST of HG10003230 vs. NCBI nr
Match:
XP_038905038.1 (DNA-directed RNA polymerase IV subunit 1 isoform X1 [Benincasa hispida] >XP_038905039.1 DNA-directed RNA polymerase IV subunit 1 isoform X1 [Benincasa hispida] >XP_038905040.1 DNA-directed RNA polymerase IV subunit 1 isoform X1 [Benincasa hispida] >XP_038905041.1 DNA-directed RNA polymerase IV subunit 1 isoform X1 [Benincasa hispida])
HSP 1 Score: 2546.2 bits (6598), Expect = 0.0e+00
Identity = 1292/1496 (86.36%), Positives = 1324/1496 (88.50%), Query Frame = 0
Query: 1 MIHMEDEQDGELPIPSGLVTGINFSVSTQQDTENIAVMTVDASNEVSDPKLGLPNPSYQC 60
MIHMEDEQDGELPIPSGL+TGINFSV+TQQDTENIAV+TVDA++EVSDPKLGLPNPSYQC
Sbjct: 14 MIHMEDEQDGELPIPSGLLTGINFSVATQQDTENIAVLTVDAASEVSDPKLGLPNPSYQC 73
Query: 61 TTCGASSLKSCEG------LTRGVIHS----------TKDSPPHGDV-FEVDSEVEDPTS 120
TTCGASSLKSCEG +IH K P + E+ +VEDPTS
Sbjct: 74 TTCGASSLKSCEGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRQELWGKVEDPTS 133
Query: 121 DYHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKRVAKGGLPS 180
DYHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQK+VAKGGLPS
Sbjct: 134 DYHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKKVAKGGLPS 193
Query: 181 DYWNFIPKDEQQEDSYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPATDSLFLNSFPVTP 240
DYWNFIPKDEQQE+SYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPATDSLFLNSFPVTP
Sbjct: 194 DYWNFIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPATDSLFLNSFPVTP 253
Query: 241 NSHRVTEMAHSFSNGQRLIF--------------------------------LSPEKLQS 300
NSHRVTE+ HSFSNGQRLIF LSPEKLQS
Sbjct: 254 NSHRVTELTHSFSNGQRLIFDERTRAYKKVVDFRGTANELGSRILDCLKISKLSPEKLQS 313
Query: 301 KDLVYQQKKIKDTATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAER 360
KDLVYQQKKIKDTATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAER
Sbjct: 314 KDLVYQQKKIKDTATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAER 373
Query: 361 LQISEHLSSWNMKKLSTSCYLHLVEKGEIYVRREGRLVRVRNVLELNMGDTIYRPLADGD 420
LQISEHLSSWNMKKLSTSCYL+LV KGEI+VRREGRLVRVRNVLELNMGDTIYRPLADGD
Sbjct: 374 LQISEHLSSWNMKKLSTSCYLNLVVKGEIFVRREGRLVRVRNVLELNMGDTIYRPLADGD 433
Query: 421 VVLVNRPPSIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARV 480
VVLVNRPPSIHQHSLIAL VKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARV
Sbjct: 434 VVLVNRPPSIHQHSLIALYVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARV 493
Query: 481 EVRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQLQMLTLHQLLS 540
EVRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLN FQMQQLQML LHQLL
Sbjct: 494 EVRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNAFQMQQLQMLALHQLLP 553
Query: 541 PAILKAPLLRNCAWTGKQLFSTLLPPDFDYSSPSHCVFIKNGELISSEGSYWLRDSGRNL 600
PAI+KAPL RNCAWTGKQLFS LLPPDFDYSSPSH VFIKNGELISSEGSYWLRDSGRNL
Sbjct: 554 PAIVKAPLFRNCAWTGKQLFSILLPPDFDYSSPSHNVFIKNGELISSEGSYWLRDSGRNL 613
Query: 601 FQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKNMMDDIFCGLQ 660
FQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDS+SHKNMMDDIFCGLQ
Sbjct: 614 FQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSHSHKNMMDDIFCGLQ 673
Query: 661 EAEETCNLKQLMVDSHKDILTGDDEDNQHVLSIAVERLSYEKQKSAALNQASVDAFKKVF 720
EAEETCNLKQLMVDSHKDILTG+DEDNQH+LSI +ERL YEKQKS ALNQASVDAFKKVF
Sbjct: 674 EAEETCNLKQLMVDSHKDILTGNDEDNQHMLSIEMERLIYEKQKSVALNQASVDAFKKVF 733
Query: 721 RDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFSLPHKLTCSA 780
RDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLV LSFSLPHKL+CSA
Sbjct: 734 RDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVFLSFSLPHKLSCSA 793
Query: 781 WNSQKMPRYIQKDGLPDRTPSFIPYAVVESSFLSGLNPFECFAHSVTNRDSSFSDNAEVP 840
WNSQKMPRYIQKDGLPDRT SFIPYAVVE+SFLSGLNPFECFAHSVTNRDSSFSDNAEVP
Sbjct: 794 WNSQKMPRYIQKDGLPDRTQSFIPYAVVENSFLSGLNPFECFAHSVTNRDSSFSDNAEVP 853
Query: 841 GTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDGENNNRDRDIGG 900
GTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELD ENNN+DRDIGG
Sbjct: 854 GTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDSENNNKDRDIGG 913
Query: 901 HPVGSLAACAMSEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLFLSEKLS 960
HPVGSLAACAMSEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLFLSEKLS
Sbjct: 914 HPVGSLAACAMSEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLFLSEKLS 973
Query: 961 KRSYGFEYGALGVKNHLERVIFKDIVSNVMI----------------------------- 1020
KRSYGFEYGALGVKNHLERV+FKDIVS VMI
Sbjct: 974 KRSYGFEYGALGVKNHLERVMFKDIVSTVMIIFSPQPSRKKHFSPWVCHFHVCKEILKKR 1033
Query: 1021 ---------------------------------TDCPLADSLREDGDTVCLTVTIAENTK 1080
DCPLADS+REDGDTVCLTVTIAENTK
Sbjct: 1034 RLKMNSVIHSLNIRCDSVRQEGRMNLPSLQIISQDCPLADSVREDGDTVCLTVTIAENTK 1093
Query: 1081 NSFLQLDFIQDLLIHFLLGTVIRGFAEIDRVDIAWNDRPKVPKPRCNHGELYLRVTMSGE 1140
NSFLQLDFIQDLLIHFLLGTVIRGFAEIDRVDIAWNDRPKVPKPRCNHGELYLRVTMSGE
Sbjct: 1094 NSFLQLDFIQDLLIHFLLGTVIRGFAEIDRVDIAWNDRPKVPKPRCNHGELYLRVTMSGE 1153
Query: 1141 GNSRFWATLMNNCLPIMDLIDWSRSHPDNTHSLCLAYGIDSGWKYFLNSLESATLDIGKT 1200
GNSRFWATL+NNCLP+MDLIDWSRSHPDNTHSLCLAYGIDSGWKYFLNSL SATLDIGKT
Sbjct: 1154 GNSRFWATLVNNCLPVMDLIDWSRSHPDNTHSLCLAYGIDSGWKYFLNSLVSATLDIGKT 1213
Query: 1201 IRPKHLLLVANSLSATGEFIGLNVKGLSHQREHALVKTPFMQACFSTPGACFVKAAKAGI 1260
IR +HLLL+ANSLSATGEF+GLNVKGLSHQREHALVKTPFMQACFS+PGACFVKAAKAG
Sbjct: 1214 IRLEHLLLIANSLSATGEFVGLNVKGLSHQREHALVKTPFMQACFSSPGACFVKAAKAGS 1273
Query: 1261 KDNLSGSLDALAWGRIPSLGTGGQFDILYSGRGHELNKPVDVYNLLGGQSICEKQNAKIE 1320
KDNLSGSLDALAWGRIPSLGTGGQFDILYSGRGHELNKPVDVYNLLGGQS CEKQNAKI
Sbjct: 1274 KDNLSGSLDALAWGRIPSLGTGGQFDILYSGRGHELNKPVDVYNLLGGQSTCEKQNAKIG 1333
Query: 1321 SLDKNNISEKYSAQLVLKNGGSTIKGLKKLDSVSKSILREFLTLNDIQKLSFTLRTILHK 1380
SLDKNNISEKYSAQLVLKNGGSTIKGLKKLD+VSKSILREFLTLNDIQKLSF LRTILHK
Sbjct: 1334 SLDKNNISEKYSAQLVLKNGGSTIKGLKKLDNVSKSILREFLTLNDIQKLSFALRTILHK 1393
Query: 1381 YSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIKPTMQCLFSLGSISEVRLSGRLKK 1386
YSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDI K
Sbjct: 1394 YSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDI-----------------------K 1453
BLAST of HG10003230 vs. NCBI nr
Match:
XP_038905042.1 (DNA-directed RNA polymerase IV subunit 1 isoform X2 [Benincasa hispida])
HSP 1 Score: 2546.2 bits (6598), Expect = 0.0e+00
Identity = 1292/1496 (86.36%), Positives = 1324/1496 (88.50%), Query Frame = 0
Query: 1 MIHMEDEQDGELPIPSGLVTGINFSVSTQQDTENIAVMTVDASNEVSDPKLGLPNPSYQC 60
MIHMEDEQDGELPIPSGL+TGINFSV+TQQDTENIAV+TVDA++EVSDPKLGLPNPSYQC
Sbjct: 1 MIHMEDEQDGELPIPSGLLTGINFSVATQQDTENIAVLTVDAASEVSDPKLGLPNPSYQC 60
Query: 61 TTCGASSLKSCEG------LTRGVIHS----------TKDSPPHGDV-FEVDSEVEDPTS 120
TTCGASSLKSCEG +IH K P + E+ +VEDPTS
Sbjct: 61 TTCGASSLKSCEGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRQELWGKVEDPTS 120
Query: 121 DYHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKRVAKGGLPS 180
DYHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQK+VAKGGLPS
Sbjct: 121 DYHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKKVAKGGLPS 180
Query: 181 DYWNFIPKDEQQEDSYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPATDSLFLNSFPVTP 240
DYWNFIPKDEQQE+SYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPATDSLFLNSFPVTP
Sbjct: 181 DYWNFIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPATDSLFLNSFPVTP 240
Query: 241 NSHRVTEMAHSFSNGQRLIF--------------------------------LSPEKLQS 300
NSHRVTE+ HSFSNGQRLIF LSPEKLQS
Sbjct: 241 NSHRVTELTHSFSNGQRLIFDERTRAYKKVVDFRGTANELGSRILDCLKISKLSPEKLQS 300
Query: 301 KDLVYQQKKIKDTATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAER 360
KDLVYQQKKIKDTATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAER
Sbjct: 301 KDLVYQQKKIKDTATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAER 360
Query: 361 LQISEHLSSWNMKKLSTSCYLHLVEKGEIYVRREGRLVRVRNVLELNMGDTIYRPLADGD 420
LQISEHLSSWNMKKLSTSCYL+LV KGEI+VRREGRLVRVRNVLELNMGDTIYRPLADGD
Sbjct: 361 LQISEHLSSWNMKKLSTSCYLNLVVKGEIFVRREGRLVRVRNVLELNMGDTIYRPLADGD 420
Query: 421 VVLVNRPPSIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARV 480
VVLVNRPPSIHQHSLIAL VKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARV
Sbjct: 421 VVLVNRPPSIHQHSLIALYVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARV 480
Query: 481 EVRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQLQMLTLHQLLS 540
EVRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLN FQMQQLQML LHQLL
Sbjct: 481 EVRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNAFQMQQLQMLALHQLLP 540
Query: 541 PAILKAPLLRNCAWTGKQLFSTLLPPDFDYSSPSHCVFIKNGELISSEGSYWLRDSGRNL 600
PAI+KAPL RNCAWTGKQLFS LLPPDFDYSSPSH VFIKNGELISSEGSYWLRDSGRNL
Sbjct: 541 PAIVKAPLFRNCAWTGKQLFSILLPPDFDYSSPSHNVFIKNGELISSEGSYWLRDSGRNL 600
Query: 601 FQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKNMMDDIFCGLQ 660
FQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDS+SHKNMMDDIFCGLQ
Sbjct: 601 FQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSHSHKNMMDDIFCGLQ 660
Query: 661 EAEETCNLKQLMVDSHKDILTGDDEDNQHVLSIAVERLSYEKQKSAALNQASVDAFKKVF 720
EAEETCNLKQLMVDSHKDILTG+DEDNQH+LSI +ERL YEKQKS ALNQASVDAFKKVF
Sbjct: 661 EAEETCNLKQLMVDSHKDILTGNDEDNQHMLSIEMERLIYEKQKSVALNQASVDAFKKVF 720
Query: 721 RDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFSLPHKLTCSA 780
RDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLV LSFSLPHKL+CSA
Sbjct: 721 RDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVFLSFSLPHKLSCSA 780
Query: 781 WNSQKMPRYIQKDGLPDRTPSFIPYAVVESSFLSGLNPFECFAHSVTNRDSSFSDNAEVP 840
WNSQKMPRYIQKDGLPDRT SFIPYAVVE+SFLSGLNPFECFAHSVTNRDSSFSDNAEVP
Sbjct: 781 WNSQKMPRYIQKDGLPDRTQSFIPYAVVENSFLSGLNPFECFAHSVTNRDSSFSDNAEVP 840
Query: 841 GTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDGENNNRDRDIGG 900
GTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELD ENNN+DRDIGG
Sbjct: 841 GTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDSENNNKDRDIGG 900
Query: 901 HPVGSLAACAMSEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLFLSEKLS 960
HPVGSLAACAMSEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLFLSEKLS
Sbjct: 901 HPVGSLAACAMSEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLFLSEKLS 960
Query: 961 KRSYGFEYGALGVKNHLERVIFKDIVSNVMI----------------------------- 1020
KRSYGFEYGALGVKNHLERV+FKDIVS VMI
Sbjct: 961 KRSYGFEYGALGVKNHLERVMFKDIVSTVMIIFSPQPSRKKHFSPWVCHFHVCKEILKKR 1020
Query: 1021 ---------------------------------TDCPLADSLREDGDTVCLTVTIAENTK 1080
DCPLADS+REDGDTVCLTVTIAENTK
Sbjct: 1021 RLKMNSVIHSLNIRCDSVRQEGRMNLPSLQIISQDCPLADSVREDGDTVCLTVTIAENTK 1080
Query: 1081 NSFLQLDFIQDLLIHFLLGTVIRGFAEIDRVDIAWNDRPKVPKPRCNHGELYLRVTMSGE 1140
NSFLQLDFIQDLLIHFLLGTVIRGFAEIDRVDIAWNDRPKVPKPRCNHGELYLRVTMSGE
Sbjct: 1081 NSFLQLDFIQDLLIHFLLGTVIRGFAEIDRVDIAWNDRPKVPKPRCNHGELYLRVTMSGE 1140
Query: 1141 GNSRFWATLMNNCLPIMDLIDWSRSHPDNTHSLCLAYGIDSGWKYFLNSLESATLDIGKT 1200
GNSRFWATL+NNCLP+MDLIDWSRSHPDNTHSLCLAYGIDSGWKYFLNSL SATLDIGKT
Sbjct: 1141 GNSRFWATLVNNCLPVMDLIDWSRSHPDNTHSLCLAYGIDSGWKYFLNSLVSATLDIGKT 1200
Query: 1201 IRPKHLLLVANSLSATGEFIGLNVKGLSHQREHALVKTPFMQACFSTPGACFVKAAKAGI 1260
IR +HLLL+ANSLSATGEF+GLNVKGLSHQREHALVKTPFMQACFS+PGACFVKAAKAG
Sbjct: 1201 IRLEHLLLIANSLSATGEFVGLNVKGLSHQREHALVKTPFMQACFSSPGACFVKAAKAGS 1260
Query: 1261 KDNLSGSLDALAWGRIPSLGTGGQFDILYSGRGHELNKPVDVYNLLGGQSICEKQNAKIE 1320
KDNLSGSLDALAWGRIPSLGTGGQFDILYSGRGHELNKPVDVYNLLGGQS CEKQNAKI
Sbjct: 1261 KDNLSGSLDALAWGRIPSLGTGGQFDILYSGRGHELNKPVDVYNLLGGQSTCEKQNAKIG 1320
Query: 1321 SLDKNNISEKYSAQLVLKNGGSTIKGLKKLDSVSKSILREFLTLNDIQKLSFTLRTILHK 1380
SLDKNNISEKYSAQLVLKNGGSTIKGLKKLD+VSKSILREFLTLNDIQKLSF LRTILHK
Sbjct: 1321 SLDKNNISEKYSAQLVLKNGGSTIKGLKKLDNVSKSILREFLTLNDIQKLSFALRTILHK 1380
Query: 1381 YSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIKPTMQCLFSLGSISEVRLSGRLKK 1386
YSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDI K
Sbjct: 1381 YSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDI-----------------------K 1440
BLAST of HG10003230 vs. NCBI nr
Match:
TYK19428.1 (DNA-directed RNA polymerase IV subunit 1 [Cucumis melo var. makuwa])
HSP 1 Score: 2505.7 bits (6493), Expect = 0.0e+00
Identity = 1278/1523 (83.91%), Positives = 1320/1523 (86.67%), Query Frame = 0
Query: 1 MIHMEDEQDGELPIPSGLVTGINFSVSTQQDTENIAVMTVDASNEVSDPKLGLPNPSYQC 60
MIHMEDEQDGELPIPSG +TGINFSVS QQD ENIAV+TVDA++EVSDPKLGLPNPSYQC
Sbjct: 14 MIHMEDEQDGELPIPSGRLTGINFSVSNQQDIENIAVITVDAASEVSDPKLGLPNPSYQC 73
Query: 61 TTCGASSLKSCEG------LTRGVIHS----------TKDSP------------------ 120
TTCGASSLK CEG +IH K P
Sbjct: 74 TTCGASSLKFCEGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRQELWGKVRKKMM 133
Query: 121 ----PHGDVFEVDSEVEDPTSDYHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMV 180
+ FE+ VEDPTSDY+RPKGCRYCFGSLKDWYPPMRFKLSTTDMF+KSMIMV
Sbjct: 134 TLCHRNSSPFEIFIWVEDPTSDYNRPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMV 193
Query: 181 EVKENMSKKYQKRVAKGGLPSDYWNFIPKDEQQEDSYCRPNRKILTHAQVHYLLKDIDPK 240
EVKENMSKKYQKRVAKGGLPSDYW+FIPKDEQQE+SYCRPNRKILTHAQVHYLLKDIDPK
Sbjct: 194 EVKENMSKKYQKRVAKGGLPSDYWDFIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPK 253
Query: 241 FLKKFVPATDSLFLNSFPVTPNSHRVTEMAHSFSNGQRL--------------------- 300
FLKKFVPA DSLFLNSFPVTPNSHRVTEMAHSFSNGQRL
Sbjct: 254 FLKKFVPAIDSLFLNSFPVTPNSHRVTEMAHSFSNGQRLIFDERTRAYKKVVDFRGTANE 313
Query: 301 -----------------IFLSPEKLQSKDLVYQQKKIKDTATSSYGLRWIKDVVLGKRSD 360
IFLSPEKLQSKDLVYQQKKIKDTATSS GLRWIKDVVLGKRSD
Sbjct: 314 LGSRVLDCLKISKAIYKIFLSPEKLQSKDLVYQQKKIKDTATSSSGLRWIKDVVLGKRSD 373
Query: 361 HCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEKGEIYVRR 420
HCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEKGEIYVRR
Sbjct: 374 HCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEKGEIYVRR 433
Query: 421 EGRLVRVRNVLELNMGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVKLLPVSSVLSLNP 480
EGRLVRVRNVLELNMGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVKLLPVS+VLSLNP
Sbjct: 434 EGRLVRVRNVLELNMGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVKLLPVSAVLSLNP 493
Query: 481 LCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVSLDRQLINGQSGRNLLSLSHDSLTAAH 540
LCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVSLDRQLINGQSGRNLLSLSHDSLTAAH
Sbjct: 494 LCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVSLDRQLINGQSGRNLLSLSHDSLTAAH 553
Query: 541 LIMEDGVSLNLFQMQQLQMLTLHQLLSPAILKAPLLRNCAWTGKQLFSTLLPPDFDYSSP 600
LI+EDGVSLNLFQMQQLQMLTLHQLL PAI+K+PLLRNCAWTGKQLFS LLPPDF+YSSP
Sbjct: 554 LILEDGVSLNLFQMQQLQMLTLHQLLPPAIVKSPLLRNCAWTGKQLFSILLPPDFEYSSP 613
Query: 601 SHCVFIKNGELISSEGSYWLRDSGRNLFQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLS 660
SH VFI+ GELISSEGSYWLRDSGRNLFQALIEHCEGKTLDYL DAQGVLCEWLSMRGLS
Sbjct: 614 SHNVFIEKGELISSEGSYWLRDSGRNLFQALIEHCEGKTLDYLRDAQGVLCEWLSMRGLS 673
Query: 661 VSLSDLYLSVDSYSHKNMMDDIFCGLQEAEETCNLKQLMVDSHKDILTGDDEDNQHVLSI 720
VSLSDLYLSVDSYSHKNMMDDIFCGLQEAEETCNLKQLMVDSHK+ILTG+DEDNQH+LSI
Sbjct: 674 VSLSDLYLSVDSYSHKNMMDDIFCGLQEAEETCNLKQLMVDSHKEILTGNDEDNQHLLSI 733
Query: 721 AVERLSYEKQKSAALNQASVDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLV 780
AVE L YEKQKSAALNQASVDAFKKVFRDIQNLV+KYSGKDNSLLTMFKAGSKGNL+KLV
Sbjct: 734 AVEHLIYEKQKSAALNQASVDAFKKVFRDIQNLVHKYSGKDNSLLTMFKAGSKGNLMKLV 793
Query: 781 QHSMCLGLQHSLVTLSFSLPHKLTCSAWNSQKMPRYIQKDGLPDRTPSFIPYAVVESSFL 840
QHSMCLGLQHSLVTLSFSLPHKL+CSAWNSQKMPRYIQ+DGLPDRT SFIPYAVVE+SFL
Sbjct: 794 QHSMCLGLQHSLVTLSFSLPHKLSCSAWNSQKMPRYIQEDGLPDRTQSFIPYAVVENSFL 853
Query: 841 SGLNPFECFAHSVTNRDSSFSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQF 900
SGLNPFECFAHSVTNRDSSFSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQF
Sbjct: 854 SGLNPFECFAHSVTNRDSSFSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQF 913
Query: 901 SYDIDRPTSVSNELDGENNNRDRDIGGHPVGSLAACAMSEAAYSALDQPISLLEASPLLN 960
YDIDRPTSVS+E D E NNRDRDIGGHPVGSLAACA SEAAYSALDQPISLLEASPLLN
Sbjct: 914 CYDIDRPTSVSSESDSE-NNRDRDIGGHPVGSLAACAFSEAAYSALDQPISLLEASPLLN 973
Query: 961 LKRVLECGSKRNSTKQTFSLFLSEKLSKRSYGFEYGALGVKNHLERVIFKDIVSNVMI-- 1020
LKRVLECGSKRNSTKQTFSLFLSEKLSKRSYGFEYGALGVKNHLERV+FKDIVS+VMI
Sbjct: 974 LKRVLECGSKRNSTKQTFSLFLSEKLSKRSYGFEYGALGVKNHLERVMFKDIVSSVMIIF 1033
Query: 1021 ------------------------------------------------------------ 1080
Sbjct: 1034 SPQPSRKKHFSPWVCHFHVCKDILKKRRLKMNSVIHSLNMRCDSVRQEGRMNLPSLQIIT 1093
Query: 1081 TDCPLADSLREDGDTVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDRVDI 1140
DCPLADSL EDGDTVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDRVDI
Sbjct: 1094 QDCPLADSLTEDGDTVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDRVDI 1153
Query: 1141 AWNDRPKVPKPRCNHGELYLRVTMSGEGNSRFWATLMNNCLPIMDLIDWSRSHPDNTHSL 1200
WNDRPKVPKPRC+HGELYLRVTMSGEGNSRFWATL+NNCLPIMDLIDW+RSHPDNTHSL
Sbjct: 1154 TWNDRPKVPKPRCSHGELYLRVTMSGEGNSRFWATLINNCLPIMDLIDWTRSHPDNTHSL 1213
Query: 1201 CLAYGIDSGWKYFLNSLESATLDIGKTIRPKHLLLVANSLSATGEFIGLNVKGLSHQREH 1260
CLAYGIDSGWKYFLNSLE ATLDIGKTIR +HLLLVANSLSATGEF+GLNVKGL+HQREH
Sbjct: 1214 CLAYGIDSGWKYFLNSLECATLDIGKTIRLEHLLLVANSLSATGEFVGLNVKGLTHQREH 1273
Query: 1261 ALVKTPFMQACFSTPGACFVKAAKAGIKDNLSGSLDALAWGRIPSLGTGGQFDILYSGRG 1320
ALVKTPFMQACFS+PGAC +KAAKAGIKDNLSGSLDALAWGR+PSLGTGGQFDILYSG+G
Sbjct: 1274 ALVKTPFMQACFSSPGACLIKAAKAGIKDNLSGSLDALAWGRMPSLGTGGQFDILYSGKG 1333
Query: 1321 HELNKPVDVYNLLGGQSICEKQNAKIESLDKNNISEKYSAQLVLKNGGSTIKGLKKLDSV 1380
HELNKPVDVYNLLGGQS CEKQNAKIES+DKNNISEKYSAQLVLKNGGSTIKGLK+LDSV
Sbjct: 1334 HELNKPVDVYNLLGGQSTCEKQNAKIESVDKNNISEKYSAQLVLKNGGSTIKGLKRLDSV 1393
Query: 1381 SKSILREFLTLNDIQKLSFTLRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGA 1386
SKSILR+FLTLNDIQKLSF LRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGA
Sbjct: 1394 SKSILRKFLTLNDIQKLSFALRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGA 1453
BLAST of HG10003230 vs. NCBI nr
Match:
XP_011650447.1 (DNA-directed RNA polymerase IV subunit 1 isoform X1 [Cucumis sativus] >XP_011650449.1 DNA-directed RNA polymerase IV subunit 1 isoform X1 [Cucumis sativus] >KGN55963.1 hypothetical protein Csa_010842 [Cucumis sativus])
HSP 1 Score: 2503.4 bits (6487), Expect = 0.0e+00
Identity = 1270/1496 (84.89%), Positives = 1315/1496 (87.90%), Query Frame = 0
Query: 1 MIHMEDEQDGELPIPSGLVTGINFSVSTQQDTENIAVMTVDASNEVSDPKLGLPNPSYQC 60
MIHMEDEQDGELPIPSGL+TGINFSVS QQD ENIAV+TVDA+NEVSDPKLGLPNPSYQC
Sbjct: 14 MIHMEDEQDGELPIPSGLLTGINFSVSNQQDIENIAVITVDAANEVSDPKLGLPNPSYQC 73
Query: 61 TTCGASSLKSCEG------LTRGVIHS----------TKDSPPHGDV-FEVDSEVEDPTS 120
TTCGASSLK CEG +IH K P V E+ +VEDPTS
Sbjct: 74 TTCGASSLKFCEGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKSVRQELWGKVEDPTS 133
Query: 121 DYHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKRVAKGGLPS 180
DY+RPKGCRYCFGSLKDWYPPMRFKLSTTDMF+KSMIMVEVKENMSKKYQKRVAKGGLPS
Sbjct: 134 DYNRPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKRVAKGGLPS 193
Query: 181 DYWNFIPKDEQQEDSYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPATDSLFLNSFPVTP 240
DYW+FIPKDEQQE+SYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPA DSLFLNSFPVTP
Sbjct: 194 DYWDFIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLFLNSFPVTP 253
Query: 241 NSHRVTEMAHSFSNGQRLIF--------------------------------LSPEKLQS 300
NSHRVTEMAHSFSNGQRLIF LSPEKLQ+
Sbjct: 254 NSHRVTEMAHSFSNGQRLIFDERTRAYKKVVDFRGTANELGSRVLDCLKISKLSPEKLQN 313
Query: 301 KDLVYQQKKIKDTATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAER 360
KDLVYQQKKIKDTATSS GLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAER
Sbjct: 314 KDLVYQQKKIKDTATSSSGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAER 373
Query: 361 LQISEHLSSWNMKKLSTSCYLHLVEKGEIYVRREGRLVRVRNVLELNMGDTIYRPLADGD 420
LQISEHLSSWNMKKLSTSCYLHLVEKGEIYVRREGRLVRVRNVLELNMGDTIYRPLADGD
Sbjct: 374 LQISEHLSSWNMKKLSTSCYLHLVEKGEIYVRREGRLVRVRNVLELNMGDTIYRPLADGD 433
Query: 421 VVLVNRPPSIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARV 480
+VLVNRPPSIHQHSLIALSVKLLPVS+VLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARV
Sbjct: 434 IVLVNRPPSIHQHSLIALSVKLLPVSAVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARV 493
Query: 481 EVRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQLQMLTLHQLLS 540
EVRELVSLD+QL NGQSGRNLLSLSHDSLTAAHLI+EDGVSLNLFQMQQLQMLTLHQLL
Sbjct: 494 EVRELVSLDKQLTNGQSGRNLLSLSHDSLTAAHLILEDGVSLNLFQMQQLQMLTLHQLLP 553
Query: 541 PAILKAPLLRNCAWTGKQLFSTLLPPDFDYSSPSHCVFIKNGELISSEGSYWLRDSGRNL 600
PAI+K+PLLRNCAWTGKQLFS LLPPDFDYSSPSH VFI+ GELISSEGSYWLRDSGRNL
Sbjct: 554 PAIVKSPLLRNCAWTGKQLFSILLPPDFDYSSPSHNVFIEKGELISSEGSYWLRDSGRNL 613
Query: 601 FQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKNMMDDIFCGLQ 660
FQALIEHCEGKTLDYL DAQGVLCEWLS RGLSVSLSDLYLSVDSYSH+NMMDDIFCGLQ
Sbjct: 614 FQALIEHCEGKTLDYLRDAQGVLCEWLSTRGLSVSLSDLYLSVDSYSHENMMDDIFCGLQ 673
Query: 661 EAEETCNLKQLMVDSHKDILTGDDEDNQHVLSIAVERLSYEKQKSAALNQASVDAFKKVF 720
EAEETCNLKQLMVDSHK+IL G+DEDNQH+LSIAVERL YEKQKSAALNQASVDAFKKVF
Sbjct: 674 EAEETCNLKQLMVDSHKEILIGNDEDNQHLLSIAVERLIYEKQKSAALNQASVDAFKKVF 733
Query: 721 RDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFSLPHKLTCSA 780
RDIQNLVYKYSGKDNSLLTMFKAGSKGNL+KLVQHSMCLGLQHSLVTLSFSLPHKL+C+A
Sbjct: 734 RDIQNLVYKYSGKDNSLLTMFKAGSKGNLMKLVQHSMCLGLQHSLVTLSFSLPHKLSCAA 793
Query: 781 WNSQKMPRYIQKDGLPDRTPSFIPYAVVESSFLSGLNPFECFAHSVTNRDSSFSDNAEVP 840
WNSQKMPRYIQKDGLPDRT SFIPYAVVE+SFLSGLNPFECFAHSVTNRDSSFSDNAEVP
Sbjct: 794 WNSQKMPRYIQKDGLPDRTQSFIPYAVVENSFLSGLNPFECFAHSVTNRDSSFSDNAEVP 853
Query: 841 GTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDGENNNRDRDIGG 900
GTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQF YDIDRPTS E + ENNNRDR IGG
Sbjct: 854 GTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFCYDIDRPTS---ESESENNNRDRGIGG 913
Query: 901 HPVGSLAACAMSEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLFLSEKLS 960
HPVGSLAACA+SEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLFLSEKLS
Sbjct: 914 HPVGSLAACAISEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLFLSEKLS 973
Query: 961 KRSYGFEYGALGVKNHLERVIFKDIVSNVMI----------------------------- 1020
KRSYGFEYGALGVKNHLERV+FKDIVS+VMI
Sbjct: 974 KRSYGFEYGALGVKNHLERVMFKDIVSSVMIIFSPLPSRKKHFSPWVCHFHVCKEILKKR 1033
Query: 1021 ---------------------------------TDCPLADSLREDGDTVCLTVTIAENTK 1080
DCPLADSL EDGDTVCLTVTIAENTK
Sbjct: 1034 RLKMNSVIHSLNMRCDSMRQEGRMNLPSLQIITQDCPLADSLTEDGDTVCLTVTIAENTK 1093
Query: 1081 NSFLQLDFIQDLLIHFLLGTVIRGFAEIDRVDIAWNDRPKVPKPRCNHGELYLRVTMSGE 1140
NSFLQLDFIQDLLIHFLLGTVIRGF EIDRVDI WNDRPKVPKPRC+HGELYLRVTMSGE
Sbjct: 1094 NSFLQLDFIQDLLIHFLLGTVIRGFTEIDRVDITWNDRPKVPKPRCSHGELYLRVTMSGE 1153
Query: 1141 GNSRFWATLMNNCLPIMDLIDWSRSHPDNTHSLCLAYGIDSGWKYFLNSLESATLDIGKT 1200
GNSRFWATLMNNCLPIMDLIDW+RSHPDNTHSLCLAYGIDSGWKYFLNSLESATLD+GKT
Sbjct: 1154 GNSRFWATLMNNCLPIMDLIDWTRSHPDNTHSLCLAYGIDSGWKYFLNSLESATLDVGKT 1213
Query: 1201 IRPKHLLLVANSLSATGEFIGLNVKGLSHQREHALVKTPFMQACFSTPGACFVKAAKAGI 1260
IR +HLLLV+NSLSATGEF+GLNVKGL+HQREHALVKTPFMQACFS+PGAC +KAAKAGI
Sbjct: 1214 IRLEHLLLVSNSLSATGEFVGLNVKGLTHQREHALVKTPFMQACFSSPGACMIKAAKAGI 1273
Query: 1261 KDNLSGSLDALAWGRIPSLGTGGQFDILYSGRGHELNKPVDVYNLLGGQSICEKQNAKIE 1320
KDNLSGSLDALAWGR+PSLGTGGQFDILYSG+GHELNKPVDVYNLLGGQS CEKQN KIE
Sbjct: 1274 KDNLSGSLDALAWGRMPSLGTGGQFDILYSGKGHELNKPVDVYNLLGGQSTCEKQNTKIE 1333
Query: 1321 SLDKNNISEKYSAQLVLKNGGSTIKGLKKLDSVSKSILREFLTLNDIQKLSFTLRTILHK 1380
SLDKN ISEKYSAQL+LKNGGSTIKGLK+LDSVSKSILR+FLTLNDIQKLSF LRTILHK
Sbjct: 1334 SLDKNTISEKYSAQLMLKNGGSTIKGLKRLDSVSKSILRKFLTLNDIQKLSFALRTILHK 1393
Query: 1381 YSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIKPTMQCLFSLGSISEVRLSGRLKK 1386
YSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDI K
Sbjct: 1394 YSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDI-----------------------K 1453
BLAST of HG10003230 vs. NCBI nr
Match:
XP_038905043.1 (DNA-directed RNA polymerase IV subunit 1 isoform X3 [Benincasa hispida] >XP_038905044.1 DNA-directed RNA polymerase IV subunit 1 isoform X3 [Benincasa hispida])
HSP 1 Score: 2484.9 bits (6439), Expect = 0.0e+00
Identity = 1262/1464 (86.20%), Positives = 1292/1464 (88.25%), Query Frame = 0
Query: 33 ENIAVMTVDASNEVSDPKLGLPNPSYQCTTCGASSLKSCEG------LTRGVIHS----- 92
ENIAV+TVDA++EVSDPKLGLPNPSYQCTTCGASSLKSCEG +IH
Sbjct: 14 ENIAVLTVDAASEVSDPKLGLPNPSYQCTTCGASSLKSCEGHFGVIKFPYTIIHPYFLSE 73
Query: 93 -----TKDSPPHGDV-FEVDSEVEDPTSDYHRPKGCRYCFGSLKDWYPPMRFKLSTTDMF 152
K P + E+ +VEDPTSDYHRPKGCRYCFGSLKDWYPPMRFKLSTTDMF
Sbjct: 74 VAQVLNKVCPGCKSIRQELWGKVEDPTSDYHRPKGCRYCFGSLKDWYPPMRFKLSTTDMF 133
Query: 153 RKSMIMVEVKENMSKKYQKRVAKGGLPSDYWNFIPKDEQQEDSYCRPNRKILTHAQVHYL 212
RKSMIMVEVKENMSKKYQK+VAKGGLPSDYWNFIPKDEQQE+SYCRPNRKILTHAQVHYL
Sbjct: 134 RKSMIMVEVKENMSKKYQKKVAKGGLPSDYWNFIPKDEQQEESYCRPNRKILTHAQVHYL 193
Query: 213 LKDIDPKFLKKFVPATDSLFLNSFPVTPNSHRVTEMAHSFSNGQRLIF------------ 272
LKDIDPKFLKKFVPATDSLFLNSFPVTPNSHRVTE+ HSFSNGQRLIF
Sbjct: 194 LKDIDPKFLKKFVPATDSLFLNSFPVTPNSHRVTELTHSFSNGQRLIFDERTRAYKKVVD 253
Query: 273 --------------------LSPEKLQSKDLVYQQKKIKDTATSSYGLRWIKDVVLGKRS 332
LSPEKLQSKDLVYQQKKIKDTATSSYGLRWIKDVVLGKRS
Sbjct: 254 FRGTANELGSRILDCLKISKLSPEKLQSKDLVYQQKKIKDTATSSYGLRWIKDVVLGKRS 313
Query: 333 DHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEKGEIYVR 392
DHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYL+LV KGEI+VR
Sbjct: 314 DHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLNLVVKGEIFVR 373
Query: 393 REGRLVRVRNVLELNMGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVKLLPVSSVLSLN 452
REGRLVRVRNVLELNMGDTIYRPLADGDVVLVNRPPSIHQHSLIAL VKLLPVSSVLSLN
Sbjct: 374 REGRLVRVRNVLELNMGDTIYRPLADGDVVLVNRPPSIHQHSLIALYVKLLPVSSVLSLN 433
Query: 453 PLCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVSLDRQLINGQSGRNLLSLSHDSLTAA 512
PLCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVSLDRQLINGQSGRNLLSLSHDSLTAA
Sbjct: 434 PLCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVSLDRQLINGQSGRNLLSLSHDSLTAA 493
Query: 513 HLIMEDGVSLNLFQMQQLQMLTLHQLLSPAILKAPLLRNCAWTGKQLFSTLLPPDFDYSS 572
HLIMEDGVSLN FQMQQLQML LHQLL PAI+KAPL RNCAWTGKQLFS LLPPDFDYSS
Sbjct: 494 HLIMEDGVSLNAFQMQQLQMLALHQLLPPAIVKAPLFRNCAWTGKQLFSILLPPDFDYSS 553
Query: 573 PSHCVFIKNGELISSEGSYWLRDSGRNLFQALIEHCEGKTLDYLHDAQGVLCEWLSMRGL 632
PSH VFIKNGELISSEGSYWLRDSGRNLFQALIEHCEGKTLDYLHDAQGVLCEWLSMRGL
Sbjct: 554 PSHNVFIKNGELISSEGSYWLRDSGRNLFQALIEHCEGKTLDYLHDAQGVLCEWLSMRGL 613
Query: 633 SVSLSDLYLSVDSYSHKNMMDDIFCGLQEAEETCNLKQLMVDSHKDILTGDDEDNQHVLS 692
SVSLSDLYLSVDS+SHKNMMDDIFCGLQEAEETCNLKQLMVDSHKDILTG+DEDNQH+LS
Sbjct: 614 SVSLSDLYLSVDSHSHKNMMDDIFCGLQEAEETCNLKQLMVDSHKDILTGNDEDNQHMLS 673
Query: 693 IAVERLSYEKQKSAALNQASVDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKL 752
I +ERL YEKQKS ALNQASVDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKL
Sbjct: 674 IEMERLIYEKQKSVALNQASVDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKL 733
Query: 753 VQHSMCLGLQHSLVTLSFSLPHKLTCSAWNSQKMPRYIQKDGLPDRTPSFIPYAVVESSF 812
VQHSMCLGLQHSLV LSFSLPHKL+CSAWNSQKMPRYIQKDGLPDRT SFIPYAVVE+SF
Sbjct: 734 VQHSMCLGLQHSLVFLSFSLPHKLSCSAWNSQKMPRYIQKDGLPDRTQSFIPYAVVENSF 793
Query: 813 LSGLNPFECFAHSVTNRDSSFSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQ 872
LSGLNPFECFAHSVTNRDSSFSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQ
Sbjct: 794 LSGLNPFECFAHSVTNRDSSFSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQ 853
Query: 873 FSYDIDRPTSVSNELDGENNNRDRDIGGHPVGSLAACAMSEAAYSALDQPISLLEASPLL 932
FSYDIDRPTSVSNELD ENNN+DRDIGGHPVGSLAACAMSEAAYSALDQPISLLEASPLL
Sbjct: 854 FSYDIDRPTSVSNELDSENNNKDRDIGGHPVGSLAACAMSEAAYSALDQPISLLEASPLL 913
Query: 933 NLKRVLECGSKRNSTKQTFSLFLSEKLSKRSYGFEYGALGVKNHLERVIFKDIVSNVMI- 992
NLKRVLECGSKRNSTKQTFSLFLSEKLSKRSYGFEYGALGVKNHLERV+FKDIVS VMI
Sbjct: 914 NLKRVLECGSKRNSTKQTFSLFLSEKLSKRSYGFEYGALGVKNHLERVMFKDIVSTVMII 973
Query: 993 ------------------------------------------------------------ 1052
Sbjct: 974 FSPQPSRKKHFSPWVCHFHVCKEILKKRRLKMNSVIHSLNIRCDSVRQEGRMNLPSLQII 1033
Query: 1053 -TDCPLADSLREDGDTVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDRVD 1112
DCPLADS+REDGDTVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDRVD
Sbjct: 1034 SQDCPLADSVREDGDTVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDRVD 1093
Query: 1113 IAWNDRPKVPKPRCNHGELYLRVTMSGEGNSRFWATLMNNCLPIMDLIDWSRSHPDNTHS 1172
IAWNDRPKVPKPRCNHGELYLRVTMSGEGNSRFWATL+NNCLP+MDLIDWSRSHPDNTHS
Sbjct: 1094 IAWNDRPKVPKPRCNHGELYLRVTMSGEGNSRFWATLVNNCLPVMDLIDWSRSHPDNTHS 1153
Query: 1173 LCLAYGIDSGWKYFLNSLESATLDIGKTIRPKHLLLVANSLSATGEFIGLNVKGLSHQRE 1232
LCLAYGIDSGWKYFLNSL SATLDIGKTIR +HLLL+ANSLSATGEF+GLNVKGLSHQRE
Sbjct: 1154 LCLAYGIDSGWKYFLNSLVSATLDIGKTIRLEHLLLIANSLSATGEFVGLNVKGLSHQRE 1213
Query: 1233 HALVKTPFMQACFSTPGACFVKAAKAGIKDNLSGSLDALAWGRIPSLGTGGQFDILYSGR 1292
HALVKTPFMQACFS+PGACFVKAAKAG KDNLSGSLDALAWGRIPSLGTGGQFDILYSGR
Sbjct: 1214 HALVKTPFMQACFSSPGACFVKAAKAGSKDNLSGSLDALAWGRIPSLGTGGQFDILYSGR 1273
Query: 1293 GHELNKPVDVYNLLGGQSICEKQNAKIESLDKNNISEKYSAQLVLKNGGSTIKGLKKLDS 1352
GHELNKPVDVYNLLGGQS CEKQNAKI SLDKNNISEKYSAQLVLKNGGSTIKGLKKLD+
Sbjct: 1274 GHELNKPVDVYNLLGGQSTCEKQNAKIGSLDKNNISEKYSAQLVLKNGGSTIKGLKKLDN 1333
Query: 1353 VSKSILREFLTLNDIQKLSFTLRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVG 1386
VSKSILREFLTLNDIQKLSF LRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVG
Sbjct: 1334 VSKSILREFLTLNDIQKLSFALRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVG 1393
BLAST of HG10003230 vs. ExPASy Swiss-Prot
Match:
Q9LQ02 (DNA-directed RNA polymerase IV subunit 1 OS=Arabidopsis thaliana OX=3702 GN=NRPD1 PE=1 SV=1)
HSP 1 Score: 1271.9 bits (3290), Expect = 0.0e+00
Identity = 712/1492 (47.72%), Positives = 936/1492 (62.73%), Query Frame = 0
Query: 4 MEDEQDGELPIPSGLVTGINFSVSTQQDTENIAVMTVDASNEVSDPKLGLPNPSYQCTTC 63
MED+ + EL +P G +T I FS+S D + ++V+ V+A N+V+D +LGLPNP C TC
Sbjct: 1 MEDDCE-ELQVPVGTLTSIGFSISNNNDRDKMSVLEVEAPNQVTDSRLGLPNPDSVCRTC 60
Query: 64 GASSLKSCEGLTRGVIHSTKDSPPHGDVFEVDSEVED--PTSDYHR----------PKGC 123
G+ K CEG GVI+ + EV + + P Y R P+ C
Sbjct: 61 GSKDRKVCEG-HFGVINFAYSIINPYFLKEVAALLNKICPGCKYIRKKQFQITEDQPERC 120
Query: 124 RYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKRVAKGGLPSDYWNFIPK 183
RYC +L YP M+F+++T ++FR+S I+VEV E K +KR LP DYW+F+P+
Sbjct: 121 RYC--TLNTGYPLMKFRVTTKEVFRRSGIVVEVNEESLMKLKKRGVL-TLPPDYWSFLPQ 180
Query: 184 DEQQEDSYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPATDSLFLNSFPVTPNSHRVTEM 243
D ++S +P R+I+THAQV+ LL ID + +KK +P +SL L SFPVTPN +RVTE+
Sbjct: 181 DSNIDESCLKPTRRIITHAQVYALLLGIDQRLIKKDIPMFNSLGLTSFPVTPNGYRVTEI 240
Query: 244 AHSFSNGQRLIFLSPEKLQSK----------------DLVYQQKKIKDTATSS------- 303
H F NG RLIF ++ K + + + +T +SS
Sbjct: 241 VHQF-NGARLIFDERTRIYKKLVGFEGNTLELSSRVMECMQYSRLFSETVSSSKDSANPY 300
Query: 304 ---------YGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLS 363
GLR++KDV+LGKRSDH FR VVVGDP+++L+EIGIP +A+RLQ+SEHL+
Sbjct: 301 QKKSDTPKLCGLRFMKDVLLGKRSDHTFRTVVVGDPSLKLNEIGIPESIAKRLQVSEHLN 360
Query: 364 SWNMKKLSTSCYLHLVEKGEIYVRREGRLVRVRNVLELNMGDTIYRPLADGDVVLVNRPP 423
N ++L TS L++ E++VRR RLV ++ V +L GD I+R L DGD VL+NRPP
Sbjct: 361 QCNKERLVTSFVPTLLDNKEMHVRRGDRLVAIQ-VNDLQTGDKIFRSLMDGDTVLMNRPP 420
Query: 424 SIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVSL 483
SIHQHSLIA++V++LP +SV+SLNP+CC PFRGDFDGDCLHGYVPQS++A+VE+ ELV+L
Sbjct: 421 SIHQHSLIAMTVRILPTTSVVSLNPICCLPFRGDFDGDCLHGYVPQSIQAKVELDELVAL 480
Query: 484 DRQLINGQSGRNLLSLSHDSLTAAHLI-MEDGVSLNLFQMQQLQMLTLHQLLSPAILKA- 543
D+QLIN Q+GRNLLSL DSLTAA+L+ +E LN QMQQLQM QL PAI+KA
Sbjct: 481 DKQLINRQNGRNLLSLGQDSLTAAYLVNVEKNCYLNRAQMQQLQMYCPFQLPPPAIIKAS 540
Query: 544 PLLRNCAWTGKQLFSTLLPPDFDYSSPSHCVFIKNGELIS-SEGSYWLRDSGRNLFQALI 603
P WTG QLF L PP FDY+ P + V + NGEL+S SEGS WLRD N + L+
Sbjct: 541 PSSTEPQWTGMQLFGMLFPPGFDYTYPLNNVVVSNGELLSFSEGSAWLRDGEGNFIERLL 600
Query: 604 EHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKNMMDDIFCGLQEAEET 663
+H +GK LD ++ AQ +L +WL MRGLSVSL+DLYLS D S KN+ ++I GL+EAE+
Sbjct: 601 KHDKGKVLDIIYSAQEMLSQWLLMRGLSVSLADLYLSSDLQSRKNLTEEISYGLREAEQV 660
Query: 664 CNLKQLMVDSHKDILTGDDEDNQHVLSIAVERLSYEKQKSAALNQASVDAFKKVFRDIQN 723
CN +QLMV+S +D L + ED + + R YE+QKSA L++ +V AFK +RD+Q
Sbjct: 661 CNKQQLMVESWRDFLAVNGEDKEEDSVSDLARFCYERQKSATLSELAVSAFKDAYRDVQA 720
Query: 724 LVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFSLPHKLTCSAWNSQK 783
L Y+Y + NS L M KAGSKGN+ KLVQHSMC+GLQ+S V+LSF P +LTC+AWN
Sbjct: 721 LAYRYGDQSNSFLIMSKAGSKGNIGKLVQHSMCIGLQNSAVSLSFGFPRELTCAAWNDPN 780
Query: 784 MPRYIQKDGLPDRTPSFIPYAVVESSFLSGLNPFECFAHSVTNRDSSFSDNAEVPGTLTR 843
P K T S++PY V+E+SFL+GLNP E F HSVT+RDSSFS NA++PGTL+R
Sbjct: 781 SPLRGAKGKDSTTTESYVPYGVIENSFLTGLNPLESFVHSVTSRDSSFSGNADLPGTLSR 840
Query: 844 KLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDGENNNRDRDIGGHPVGS 903
+L F MRDIY AYDGTVRN++GNQLVQF+Y+ D P DI G +GS
Sbjct: 841 RLMFFMRDIYAAYDGTVRNSFGNQLVQFTYETDGPV--------------EDITGEALGS 900
Query: 904 LAACAMSEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLFLSEKLSKRSYG 963
L+ACA+SEAAYSALDQPISLLE SPLLNLK VLECGSK+ +QT SL+LSE LSK+ +G
Sbjct: 901 LSACALSEAAYSALDQPISLLETSPLLNLKNVLECGSKKGQREQTMSLYLSEYLSKKKHG 960
Query: 964 FEYGALGVKNHLERVIFKDIVSNVMI-------TDCPLA--------------------- 1023
FEYG+L +KNHLE++ F +IVS MI T PL+
Sbjct: 961 FEYGSLEIKNHLEKLSFSEIVSTSMIIFSPSSNTKVPLSPWVCHFHISEKVLKRKQLSAE 1020
Query: 1024 ---DSLRED-------------------------------GDTVCLTVTIAENTKNSFLQ 1083
SL E D VC+TVT+ E +K+S L+
Sbjct: 1021 SVVSSLNEQYKSRNRELKLDIVDLDIQNTNHCSSDDQAMKDDNVCITVTVVEASKHSVLE 1080
Query: 1084 LDFIQDLLIHFLLGTVIRGFAEIDRVDIAWNDRPKVPKPRCNH--GELYLRVTMSGE-GN 1143
LD I+ +LI FLL + ++G I +V+I W DRPK PK NH GELYL+VTM G+ G
Sbjct: 1081 LDAIRLVLIPFLLDSPVKGDQGIKKVNILWTDRPKAPKRNGNHLAGELYLKVTMYGDRGK 1140
Query: 1144 SRFWATLMNNCLPIMDLIDWSRSHPDNTHSLCLAYGIDSGWKYFLNSLESATLDIGKTIR 1203
W L+ CLPIMD+IDW RSHPDN C YGID+G F+ +LESA D GK I
Sbjct: 1141 RNCWTALLETCLPIMDMIDWGRSHPDNIRQCCSVYGIDAGRSIFVANLESAVSDTGKEIL 1200
Query: 1204 PKHLLLVANSLSATGEFIGLNVKGLSHQREHALVKTPFMQACFSTPGACFVKAAKAGIKD 1263
+HLLLVA+SLS TGEF+ LN KG S QR+ PF QACFS+P CF+KAAK G++D
Sbjct: 1201 REHLLLVADSLSVTGEFVALNAKGWSKQRQVESTPAPFTQACFSSPSQCFLKAAKEGVRD 1260
Query: 1264 NLSGSLDALAWGRIPSLGTGGQFDILYSGRGHELNKPVDVYNLLGGQSICEKQNAKIESL 1323
+L GS+DALAWG++P GTG QF+I+ S + H PVDVY+LL + N+ +
Sbjct: 1261 DLQGSIDALAWGKVPGFGTGDQFEIIISPKVHGFTTPVDVYDLLSSTKTMRRTNSAPK-- 1320
Query: 1324 DKNNISEKYSAQLVLKNGGSTIKGLKKLD--SVSKSILREFLTLNDIQKLSFTLRTILHK 1382
S+K + Q + +K +K LD + S+LR T +I+ LS +L+ ILH
Sbjct: 1321 -----SDKATVQPFGLLHSAFLKDIKVLDGKGIPMSLLRTIFTWKNIELLSQSLKRILHS 1380
BLAST of HG10003230 vs. ExPASy Swiss-Prot
Match:
Q5D869 (DNA-directed RNA polymerase V subunit 1 OS=Arabidopsis thaliana OX=3702 GN=NRPE1 PE=1 SV=1)
HSP 1 Score: 315.1 bits (806), Expect = 4.0e-84
Identity = 343/1356 (25.29%), Positives = 551/1356 (40.63%), Query Frame = 0
Query: 4 MEDEQDGELPIPSGLVTGINFSVSTQQD--TENIAVMTVDASNEVSDPKLGLPNPSYQCT 63
ME+E E I G + GI F++++ + ++I+ ++ +++++ LGLP +C
Sbjct: 1 MEEESTSE--ILDGEIVGITFALASHHEICIQSISESAINHPSQLTNAFLGLPLEFGKCE 60
Query: 64 TCGASSLKSCEGLTRGVIHSTKDSPPHGDVFEVDSEVEDPTSDYHRPKGCRYCFGSLKDW 123
+CGA+ CEG G I V E+ + + K + G L D
Sbjct: 61 SCGATEPDKCEG-HFGYIQLPVPIYHPAHVNELKQMLSLLCLKCLKIKKAKGTSGGLAD- 120
Query: 124 YPPMRFKLSTTDMFRKSMIMVEVKENMSKKY--QKRVAKGGLPSDYWNFIPKDEQQEDS- 183
+L S I ++ + + Y K ++ L WNF+ + + S
Sbjct: 121 ------RLLGVCCEEASQISIKDRASDGASYLELKLPSRSRLQPGCWNFLERYGYRYGSD 180
Query: 184 YCRPNRKILTHAQVHYLLKDIDPKFLKKFVP----ATDSLFLNSFPVTPNSHRVTEMAHS 243
Y RP L +V +L+ I + KK + L PV PN V E +
Sbjct: 181 YTRP----LLAREVKEILRRIPEESRKKLTAKGHIPQEGYILEYLPVPPNCLSVPEASDG 240
Query: 244 FSNGQRLIFLSPEKLQSKDLV--------------------------------------- 303
FS + + P +++ KD++
Sbjct: 241 FST----MSVDPSRIELKDVLKKVIAIKSSRSGETNFESHKAEASEMFRVVDTYLQVRGT 300
Query: 304 ----------YQQKKIKDTATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIP 363
Y KI D+++S ++ + + K S R V+ GD ++E+GIP
Sbjct: 301 AKAARNIDMRYGVSKISDSSSSKAWTEKMRTLFIRKGSGFSSRSVITGDAYRHVNEVGIP 360
Query: 364 CHVAERLQISEHLSSWN----MKKLSTSCYLHLVEKGEIYVRREGRLVRVRNVLELNMGD 423
+A+R+ E +S N K + L + Y R+G + EL G
Sbjct: 361 IEIAQRITFEERVSVHNRGYLQKLVDDKLCLSYTQGSTTYSLRDGS----KGHTELKPGQ 420
Query: 424 TIYRPLADGDVVLVNRPPSIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHG 483
++R + DGDVV +NRPP+ H+HSL AL V + + + +NPL CSP DFDGDC+H
Sbjct: 421 VVHRRVMDGDVVFINRPPTTHKHSLQALRV-YVHEDNTVKINPLMCSPLSADFDGDCVHL 480
Query: 484 YVPQSLEARVEVRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQL 543
+ PQSL A+ EV EL S+++QL++ +G+ +L + DSL + +++E V L+ QQL
Sbjct: 481 FYPQSLSAKAEVMELFSVEKQLLSSHTGQLILQMGSDSLLSLRVMLE-RVFLDKATAQQL 540
Query: 544 QMLTLHQLLSPAILKAPLLRNCAWTGKQLFSTLLPPDFDYSSPSHCVFIKNGELISSEGS 603
M L PA+ K+ AWT Q+ P S + +L+ +
Sbjct: 541 AMYGSLSLPPPALRKSS-KSGPAWTVFQILQLAFPERL--SCKGDRFLVDGSDLLKFDFG 600
Query: 604 YWLRDSGRN--LFQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSH 663
S N + +E +TL + Q +L E L G S+SL DL S S
Sbjct: 601 VDAMGSIINEIVTSIFLEKGPKETLGFFDSLQPLLMESLFAEGFSLSLEDL-----SMSR 660
Query: 664 KNMMDDIFCGLQEAEETCNLKQLMVDSHKDILTGDDEDNQHVLSIAVERLSYEKQKSAAL 723
+M D+ L E + + +L + S++D L ++ S K K A
Sbjct: 661 ADM--DVIHNLIIREISPMVSRLRL-SYRDELQLEN--------------SIHKVKEVAA 720
Query: 724 NQASVDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTL 783
N F I+NL+ K NS +T KLVQ + LGLQ S
Sbjct: 721 N------FMLKSYSIRNLI---DIKSNSAIT-----------KLVQQTGFLGLQLSDKKK 780
Query: 784 SFSLPHKLTCSAWNSQKMPRYIQKDGLPDRTPSFIPYAVVESSFLSGLNPFECFAHSVTN 843
++ + + +K R I G + +V+ F GL+P+E AHS+
Sbjct: 781 FYTKTLVEDMAIFCKRKYGR-ISSSG---------DFGIVKGCFFHGLDPYEEMAHSIAA 840
Query: 844 RD--SSFSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNE 903
R+ S PGTL + L ++RDI DGTVRN N ++QF Y +
Sbjct: 841 REVIVRSSRGLAEPGTLFKNLMAVLRDIVITNDGTVRNTCSNSVIQFKYGV--------- 900
Query: 904 LDGENNNRDRDIGGHPVGSLAACAMSEAAYSALDQPISLLEASPLLN-----LKRVLEC- 963
D E ++ G PVG LAA AMS AY A +L++SP N +K VL C
Sbjct: 901 -DSERGHQGLFEAGEPVGVLAATAMSNPAYKA------VLDSSPNSNSSWELMKEVLLCK 960
Query: 964 -GSKRNSTKQTFSLFLSEKLSKRSYGFEYGALGVKNHLERVIFKDIVSNVMI-------- 1023
+ + + L+L+E + + E A V+N L +V KD ++
Sbjct: 961 VNFQNTTNDRRVILYLNECHCGKRFCQENAACTVRNKLNKVSLKDTAVEFLVEYRKQPTI 1020
Query: 1024 --------------------------------------------------TD-------- 1083
TD
Sbjct: 1021 SEIFGIDSCLHGHIHLNKTLLQDWNISMQDIHQKCEDVINSLGQKKKKKATDDFKRTSLS 1080
Query: 1084 ----CPLADSLREDG-DTVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDR 1143
C D G D CLT + + LD + + + LL VI+G + I
Sbjct: 1081 VSECCSFRDPCGSKGSDMPCLTFSYNATDPDLERTLDVLCNTVYPVLLEIVIKGDSRICS 1140
Query: 1144 VDIAWNDRPKVPKPRCNH----GELYLRVTMSGEG---NSRFWATLMNNCLPIMDLIDWS 1203
+I WN R H GE L VT+ + W ++++CL ++ LID
Sbjct: 1141 ANIIWNSSDMTTWIRNRHASRRGEWVLDVTVEKSAVKQSGDAWRVVIDSCLSVLHLIDTK 1200
Query: 1204 RSHPDNTHSLCLAYGIDSGWKYFLNSLESATLDIGKTIRPKHLLLVANSLSATGEFIGLN 1207
RS P + + G+ ++ + L ++ + K + +H++L+AN+++ +G +G N
Sbjct: 1201 RSIPYSVKQVQELLGLSCAFEQAVQRLSASVRMVSKGVLKEHIILLANNMTCSGTMLGFN 1259
BLAST of HG10003230 vs. ExPASy Swiss-Prot
Match:
P17546 (DNA-directed RNA polymerase II subunit RPB1-A OS=Trypanosoma brucei brucei OX=5702 GN=TRP4.8 PE=1 SV=1)
HSP 1 Score: 193.0 bits (489), Expect = 2.3e-47
Identity = 162/594 (27.27%), Positives = 281/594 (47.31%), Query Frame = 0
Query: 259 KKIKDTATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHL 318
K + + YG ++ ++GKR D R V+ GDPNI++ E+G+P VA L E +
Sbjct: 331 KSLTERLKGKYGR--LRGNLMGKRVDFSARTVITGDPNIDVDEVGVPFSVAMTLTFPERV 390
Query: 319 SSWNMKKLSTSCYLHLVEKGEIYVRREG------RLVRVRNVLELNMGDTIYRPLADGDV 378
++ N K+L T V Y+ L+R R+ + LN+GD + R + +GDV
Sbjct: 391 NTVNKKRL-TEFARRTVYPSANYIHHPNGTITKLALLRDRSKVTLNIGDVVERHVINGDV 450
Query: 379 VLVNRPPSIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVE 438
VL NR P++H+ S++ V++L S+ LN C +P+ DFDGD ++ +VPQSL + E
Sbjct: 451 VLFNRQPTLHRMSMMGHRVRVLNYST-FRLNLSCTTPYNADFDGDEMNLHVPQSLLTKAE 510
Query: 439 VRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQLQM-LTLHQLLS 498
+ E++ + + ++ + + DSL ++ + + L+ + +Q + + L L QL
Sbjct: 511 LIEMMMVPKNFVSPNKSAPCMGIVQDSLLGSYRLTDKDTFLDKYFVQSVALWLDLWQLPI 570
Query: 499 PAILKAPLLRNCAWTGKQLFSTLLP-----------PDFDYSSPSHCVFIKNGELISSEG 558
PAILK L WTGKQ+FS +LP P F ++ V I+ G+L+
Sbjct: 571 PAILKPRPL----WTGKQVFSLILPEVNHPATPQDRPPFPHN--DSVVMIRRGQLLCGPI 630
Query: 559 SYWLRDS--GRNLFQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYS 618
+ + + G + EH + +++ Q V +L G SV + D D+
Sbjct: 631 TKSIVGAAPGSLIHVIFNEHGSDEVARFINGVQRVTTFFLLNFGFSVGVQDTVADSDTL- 690
Query: 619 HKNMMDDIFCGLQEAEETCNLKQLMVDSHKDILTGDDEDNQHVLSIAVERLSYEKQKSAA 678
M+D+ + N++++ ++ L N+ ++ S+E ++A
Sbjct: 691 --RQMNDVLVKTRR-----NVEKIGAAANNRTL------NRKAGMTLLQ--SFEADVNSA 750
Query: 679 LNQASVDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT 738
LN+ +A KK +++ + NS M +AGSKG L + Q ++ +G Q+
Sbjct: 751 LNKCREEAAKKALSNVR--------RTNSFKVMIEAGSKGTDLNICQIAVFVGQQN---V 810
Query: 739 LSFSLPHKLTCSAWNSQKMPRYIQKD-GLPDRTPSFIPYAVVESSFLSGLNPFECFAHSV 798
+P + + +P ++ D G R + ++ GL P E F H++
Sbjct: 811 AGSRIPF-----GFRRRTLPHFMLDDYGETSR-------GMANRGYVEGLKPHEFFFHTM 870
Query: 799 TNRDSSFSDNAEV--PGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDID 830
R+ + G L RKL + D++ AYDGTVRNA ++L+QF Y D
Sbjct: 871 AGREGLIDTAVKTSDTGYLQRKLIKALEDVHAAYDGTVRNA-NDELIQFMYGED 874
BLAST of HG10003230 vs. ExPASy Swiss-Prot
Match:
P17545 (DNA-directed RNA polymerase II subunit RPB1-B OS=Trypanosoma brucei brucei OX=5702 GN=TRP5.9 PE=1 SV=1)
HSP 1 Score: 191.8 bits (486), Expect = 5.1e-47
Identity = 161/594 (27.10%), Positives = 281/594 (47.31%), Query Frame = 0
Query: 259 KKIKDTATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHL 318
K + + YG ++ ++GKR D R V+ GDPNI++ E+G+P VA L E +
Sbjct: 331 KSLTERLKGKYGR--LRGNLMGKRVDFSARTVITGDPNIDVDEVGVPFSVAMTLTFPERV 390
Query: 319 SSWNMKKLSTSCYLHLVEKGEIYVRREG------RLVRVRNVLELNMGDTIYRPLADGDV 378
++ N K+L T V Y+ L+R R+ + LN+GD + R + +GDV
Sbjct: 391 NTINKKRL-TEFARRTVYPSANYIHHPNGTITKLALLRDRSKVTLNIGDVVERHVINGDV 450
Query: 379 VLVNRPPSIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVE 438
VL NR P++H+ S++ V++L ++ LN C +P+ DFDGD ++ +VPQSL + E
Sbjct: 451 VLFNRQPTLHRMSMMGHRVRVLNYNT-FRLNLSCTTPYNADFDGDEMNLHVPQSLLTKAE 510
Query: 439 VRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQLQM-LTLHQLLS 498
+ E++ + + ++ + + DSL ++ + + L+ + +Q + + L L QL
Sbjct: 511 LIEMMMVPKNFVSPNKSAPCMGIVQDSLLGSYRLTDKDTFLDKYFVQSVALWLDLWQLPI 570
Query: 499 PAILKAPLLRNCAWTGKQLFSTLLP-----------PDFDYSSPSHCVFIKNGELISSEG 558
PAILK L WTGKQ+FS +LP P F ++ V I+ G+L+
Sbjct: 571 PAILKPRPL----WTGKQVFSLILPEVNHPATPQDRPPFPHN--DSVVMIRRGQLLCGPI 630
Query: 559 SYWLRDS--GRNLFQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYS 618
+ + + G + EH + +++ Q V +L G SV + D D+
Sbjct: 631 TKSIVGAAPGSLIHVIFNEHGSDEVARFINGVQRVTTFFLLNFGFSVGVQDTVADSDTL- 690
Query: 619 HKNMMDDIFCGLQEAEETCNLKQLMVDSHKDILTGDDEDNQHVLSIAVERLSYEKQKSAA 678
M+D+ + N++++ ++ L N+ ++ S+E ++A
Sbjct: 691 --RQMNDVLVKTRR-----NVEKIGAAANNRTL------NRKAGMTLLQ--SFEADVNSA 750
Query: 679 LNQASVDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVT 738
LN+ +A KK +++ + NS M +AGSKG L + Q ++ +G Q+
Sbjct: 751 LNKCREEAAKKALSNVR--------RTNSFKVMIEAGSKGTDLNICQIAVFVGQQN---V 810
Query: 739 LSFSLPHKLTCSAWNSQKMPRYIQKD-GLPDRTPSFIPYAVVESSFLSGLNPFECFAHSV 798
+P + + +P ++ D G R + ++ GL P E F H++
Sbjct: 811 AGSRIPF-----GFRRRTLPHFMLDDYGETSR-------GMANRGYVEGLKPHEFFFHTM 870
Query: 799 TNRDSSFSDNAEV--PGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDID 830
R+ + G L RKL + D++ AYDGTVRNA ++L+QF Y D
Sbjct: 871 AGREGLIDTAVKTSDTGYLQRKLIKALEDVHAAYDGTVRNA-NDELIQFMYGED 874
BLAST of HG10003230 vs. ExPASy Swiss-Prot
Match:
P35084 (DNA-directed RNA polymerase II subunit rpb1 OS=Dictyostelium discoideum OX=44689 GN=polr2a PE=2 SV=2)
HSP 1 Score: 190.3 bits (482), Expect = 1.5e-46
Identity = 161/603 (26.70%), Positives = 273/603 (45.27%), Query Frame = 0
Query: 274 IKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLH 333
I+ ++GKR D R V+ DPN+ + ++G+P +A L E ++ +N+ K+
Sbjct: 341 IRGNLMGKRVDFSARTVITADPNLSIDQVGVPRSIALNLTYPETVTPFNIDKMR-----E 400
Query: 334 LVEKG-------EIYVRREG-----RLVRVRNVLELNMGDTIYRPLADGDVVLVNRPPSI 393
L+ G + +R +G R V+ + L G + R + DGDVV+ NR PS+
Sbjct: 401 LIRNGPSEHPGAKYIIREDGTRFDLRFVKKVSDTHLECGYKVERHINDGDVVIFNRQPSL 460
Query: 394 HQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVSLDR 453
H+ S++ +K++P S+ LN SP+ DFDGD ++ +VPQ+LE R EV E++ + R
Sbjct: 461 HKMSMMGHRIKVMPYST-FRLNLSVTSPYNADFDGDEMNLHVPQTLETRAEVIEIMMVPR 520
Query: 454 QLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQLQMLTLHQLLS-------PAI 513
Q+++ QS R ++ + D+L + L + F + L M L L S PAI
Sbjct: 521 QIVSPQSNRPVMGIVQDTLLGSRLF----TKRDCFMEKDLVMNILMWLPSWDGKVPPPAI 580
Query: 514 LKAPLLRNCAWTGKQLFSTLLP--------PDFDYSSPSHC------VFIKNGELISSEG 573
LK L WTGKQLFS ++P + P+ C V I+ GEL++ G
Sbjct: 581 LKPKQL----WTGKQLFSLIIPDINLIRFTSTHNDKEPNECSAGDTRVIIERGELLA--G 640
Query: 574 SYWLRD----SGRNLFQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLY----- 633
R +G + + EH ++ Q V+ WL RG ++ + D
Sbjct: 641 ILCKRSLGAANGSIIHVVMNEHGHDTCRLFIDQTQTVVNHWLINRGFTMGIGDTIADSAT 700
Query: 634 ---LSVDSYSHKNMMDDIFCGLQEAEETCNLKQLMVDSHKDILTGDDEDNQHVLSIAVER 693
+++ S KN + ++ Q + C + +++
Sbjct: 701 MAKVTLTISSAKNQVKELIIKAQNKQFECQPGKSVIE----------------------- 760
Query: 694 LSYEKQKSAALNQASVDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSM 753
++E++ + LN+A A + +DN+L M AGSKG+ + + Q
Sbjct: 761 -TFEQKVNQVLNKARDTAGSSAQDSL--------SEDNNLKAMVTAGSKGSFINISQMMA 820
Query: 754 CLGLQHSLVTLSFSLPHKLTCSAWNSQKMPRYIQKDGLPDRTPSFIPYAVVESSFLSGLN 813
C+G Q ++ K + S+ +P + + D P+ VE+S+L GL
Sbjct: 821 CVGQQ--------NVEGKRIPFGFQSRTLPHFTKDDYGPESR------GFVENSYLRGLT 880
Query: 814 PFECFAHSVTNRDSSFSDNAEV--PGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSY 830
P E F H++ R+ + G + R+L M D+ YD TVRN+ G+ ++QF+Y
Sbjct: 881 PQEFFFHAMGGREGLIDTAVKTSETGYIQRRLVKAMEDVSIKYDATVRNSLGD-VIQFAY 880
BLAST of HG10003230 vs. ExPASy TrEMBL
Match:
A0A5D3D780 (DNA-directed RNA polymerase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold443G00770 PE=4 SV=1)
HSP 1 Score: 2505.7 bits (6493), Expect = 0.0e+00
Identity = 1278/1523 (83.91%), Positives = 1320/1523 (86.67%), Query Frame = 0
Query: 1 MIHMEDEQDGELPIPSGLVTGINFSVSTQQDTENIAVMTVDASNEVSDPKLGLPNPSYQC 60
MIHMEDEQDGELPIPSG +TGINFSVS QQD ENIAV+TVDA++EVSDPKLGLPNPSYQC
Sbjct: 14 MIHMEDEQDGELPIPSGRLTGINFSVSNQQDIENIAVITVDAASEVSDPKLGLPNPSYQC 73
Query: 61 TTCGASSLKSCEG------LTRGVIHS----------TKDSP------------------ 120
TTCGASSLK CEG +IH K P
Sbjct: 74 TTCGASSLKFCEGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRQELWGKVRKKMM 133
Query: 121 ----PHGDVFEVDSEVEDPTSDYHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMV 180
+ FE+ VEDPTSDY+RPKGCRYCFGSLKDWYPPMRFKLSTTDMF+KSMIMV
Sbjct: 134 TLCHRNSSPFEIFIWVEDPTSDYNRPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMV 193
Query: 181 EVKENMSKKYQKRVAKGGLPSDYWNFIPKDEQQEDSYCRPNRKILTHAQVHYLLKDIDPK 240
EVKENMSKKYQKRVAKGGLPSDYW+FIPKDEQQE+SYCRPNRKILTHAQVHYLLKDIDPK
Sbjct: 194 EVKENMSKKYQKRVAKGGLPSDYWDFIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPK 253
Query: 241 FLKKFVPATDSLFLNSFPVTPNSHRVTEMAHSFSNGQRL--------------------- 300
FLKKFVPA DSLFLNSFPVTPNSHRVTEMAHSFSNGQRL
Sbjct: 254 FLKKFVPAIDSLFLNSFPVTPNSHRVTEMAHSFSNGQRLIFDERTRAYKKVVDFRGTANE 313
Query: 301 -----------------IFLSPEKLQSKDLVYQQKKIKDTATSSYGLRWIKDVVLGKRSD 360
IFLSPEKLQSKDLVYQQKKIKDTATSS GLRWIKDVVLGKRSD
Sbjct: 314 LGSRVLDCLKISKAIYKIFLSPEKLQSKDLVYQQKKIKDTATSSSGLRWIKDVVLGKRSD 373
Query: 361 HCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEKGEIYVRR 420
HCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEKGEIYVRR
Sbjct: 374 HCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEKGEIYVRR 433
Query: 421 EGRLVRVRNVLELNMGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVKLLPVSSVLSLNP 480
EGRLVRVRNVLELNMGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVKLLPVS+VLSLNP
Sbjct: 434 EGRLVRVRNVLELNMGDTIYRPLADGDVVLVNRPPSIHQHSLIALSVKLLPVSAVLSLNP 493
Query: 481 LCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVSLDRQLINGQSGRNLLSLSHDSLTAAH 540
LCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVSLDRQLINGQSGRNLLSLSHDSLTAAH
Sbjct: 494 LCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVSLDRQLINGQSGRNLLSLSHDSLTAAH 553
Query: 541 LIMEDGVSLNLFQMQQLQMLTLHQLLSPAILKAPLLRNCAWTGKQLFSTLLPPDFDYSSP 600
LI+EDGVSLNLFQMQQLQMLTLHQLL PAI+K+PLLRNCAWTGKQLFS LLPPDF+YSSP
Sbjct: 554 LILEDGVSLNLFQMQQLQMLTLHQLLPPAIVKSPLLRNCAWTGKQLFSILLPPDFEYSSP 613
Query: 601 SHCVFIKNGELISSEGSYWLRDSGRNLFQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLS 660
SH VFI+ GELISSEGSYWLRDSGRNLFQALIEHCEGKTLDYL DAQGVLCEWLSMRGLS
Sbjct: 614 SHNVFIEKGELISSEGSYWLRDSGRNLFQALIEHCEGKTLDYLRDAQGVLCEWLSMRGLS 673
Query: 661 VSLSDLYLSVDSYSHKNMMDDIFCGLQEAEETCNLKQLMVDSHKDILTGDDEDNQHVLSI 720
VSLSDLYLSVDSYSHKNMMDDIFCGLQEAEETCNLKQLMVDSHK+ILTG+DEDNQH+LSI
Sbjct: 674 VSLSDLYLSVDSYSHKNMMDDIFCGLQEAEETCNLKQLMVDSHKEILTGNDEDNQHLLSI 733
Query: 721 AVERLSYEKQKSAALNQASVDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLV 780
AVE L YEKQKSAALNQASVDAFKKVFRDIQNLV+KYSGKDNSLLTMFKAGSKGNL+KLV
Sbjct: 734 AVEHLIYEKQKSAALNQASVDAFKKVFRDIQNLVHKYSGKDNSLLTMFKAGSKGNLMKLV 793
Query: 781 QHSMCLGLQHSLVTLSFSLPHKLTCSAWNSQKMPRYIQKDGLPDRTPSFIPYAVVESSFL 840
QHSMCLGLQHSLVTLSFSLPHKL+CSAWNSQKMPRYIQ+DGLPDRT SFIPYAVVE+SFL
Sbjct: 794 QHSMCLGLQHSLVTLSFSLPHKLSCSAWNSQKMPRYIQEDGLPDRTQSFIPYAVVENSFL 853
Query: 841 SGLNPFECFAHSVTNRDSSFSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQF 900
SGLNPFECFAHSVTNRDSSFSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQF
Sbjct: 854 SGLNPFECFAHSVTNRDSSFSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQF 913
Query: 901 SYDIDRPTSVSNELDGENNNRDRDIGGHPVGSLAACAMSEAAYSALDQPISLLEASPLLN 960
YDIDRPTSVS+E D E NNRDRDIGGHPVGSLAACA SEAAYSALDQPISLLEASPLLN
Sbjct: 914 CYDIDRPTSVSSESDSE-NNRDRDIGGHPVGSLAACAFSEAAYSALDQPISLLEASPLLN 973
Query: 961 LKRVLECGSKRNSTKQTFSLFLSEKLSKRSYGFEYGALGVKNHLERVIFKDIVSNVMI-- 1020
LKRVLECGSKRNSTKQTFSLFLSEKLSKRSYGFEYGALGVKNHLERV+FKDIVS+VMI
Sbjct: 974 LKRVLECGSKRNSTKQTFSLFLSEKLSKRSYGFEYGALGVKNHLERVMFKDIVSSVMIIF 1033
Query: 1021 ------------------------------------------------------------ 1080
Sbjct: 1034 SPQPSRKKHFSPWVCHFHVCKDILKKRRLKMNSVIHSLNMRCDSVRQEGRMNLPSLQIIT 1093
Query: 1081 TDCPLADSLREDGDTVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDRVDI 1140
DCPLADSL EDGDTVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDRVDI
Sbjct: 1094 QDCPLADSLTEDGDTVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDRVDI 1153
Query: 1141 AWNDRPKVPKPRCNHGELYLRVTMSGEGNSRFWATLMNNCLPIMDLIDWSRSHPDNTHSL 1200
WNDRPKVPKPRC+HGELYLRVTMSGEGNSRFWATL+NNCLPIMDLIDW+RSHPDNTHSL
Sbjct: 1154 TWNDRPKVPKPRCSHGELYLRVTMSGEGNSRFWATLINNCLPIMDLIDWTRSHPDNTHSL 1213
Query: 1201 CLAYGIDSGWKYFLNSLESATLDIGKTIRPKHLLLVANSLSATGEFIGLNVKGLSHQREH 1260
CLAYGIDSGWKYFLNSLE ATLDIGKTIR +HLLLVANSLSATGEF+GLNVKGL+HQREH
Sbjct: 1214 CLAYGIDSGWKYFLNSLECATLDIGKTIRLEHLLLVANSLSATGEFVGLNVKGLTHQREH 1273
Query: 1261 ALVKTPFMQACFSTPGACFVKAAKAGIKDNLSGSLDALAWGRIPSLGTGGQFDILYSGRG 1320
ALVKTPFMQACFS+PGAC +KAAKAGIKDNLSGSLDALAWGR+PSLGTGGQFDILYSG+G
Sbjct: 1274 ALVKTPFMQACFSSPGACLIKAAKAGIKDNLSGSLDALAWGRMPSLGTGGQFDILYSGKG 1333
Query: 1321 HELNKPVDVYNLLGGQSICEKQNAKIESLDKNNISEKYSAQLVLKNGGSTIKGLKKLDSV 1380
HELNKPVDVYNLLGGQS CEKQNAKIES+DKNNISEKYSAQLVLKNGGSTIKGLK+LDSV
Sbjct: 1334 HELNKPVDVYNLLGGQSTCEKQNAKIESVDKNNISEKYSAQLVLKNGGSTIKGLKRLDSV 1393
Query: 1381 SKSILREFLTLNDIQKLSFTLRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGA 1386
SKSILR+FLTLNDIQKLSF LRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGA
Sbjct: 1394 SKSILRKFLTLNDIQKLSFALRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGA 1453
BLAST of HG10003230 vs. ExPASy TrEMBL
Match:
A0A0A0L2L4 (DNA-directed RNA polymerase OS=Cucumis sativus OX=3659 GN=Csa_3G039340 PE=4 SV=1)
HSP 1 Score: 2503.4 bits (6487), Expect = 0.0e+00
Identity = 1270/1496 (84.89%), Positives = 1315/1496 (87.90%), Query Frame = 0
Query: 1 MIHMEDEQDGELPIPSGLVTGINFSVSTQQDTENIAVMTVDASNEVSDPKLGLPNPSYQC 60
MIHMEDEQDGELPIPSGL+TGINFSVS QQD ENIAV+TVDA+NEVSDPKLGLPNPSYQC
Sbjct: 14 MIHMEDEQDGELPIPSGLLTGINFSVSNQQDIENIAVITVDAANEVSDPKLGLPNPSYQC 73
Query: 61 TTCGASSLKSCEG------LTRGVIHS----------TKDSPPHGDV-FEVDSEVEDPTS 120
TTCGASSLK CEG +IH K P V E+ +VEDPTS
Sbjct: 74 TTCGASSLKFCEGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKSVRQELWGKVEDPTS 133
Query: 121 DYHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKRVAKGGLPS 180
DY+RPKGCRYCFGSLKDWYPPMRFKLSTTDMF+KSMIMVEVKENMSKKYQKRVAKGGLPS
Sbjct: 134 DYNRPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKRVAKGGLPS 193
Query: 181 DYWNFIPKDEQQEDSYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPATDSLFLNSFPVTP 240
DYW+FIPKDEQQE+SYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPA DSLFLNSFPVTP
Sbjct: 194 DYWDFIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLFLNSFPVTP 253
Query: 241 NSHRVTEMAHSFSNGQRLIF--------------------------------LSPEKLQS 300
NSHRVTEMAHSFSNGQRLIF LSPEKLQ+
Sbjct: 254 NSHRVTEMAHSFSNGQRLIFDERTRAYKKVVDFRGTANELGSRVLDCLKISKLSPEKLQN 313
Query: 301 KDLVYQQKKIKDTATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAER 360
KDLVYQQKKIKDTATSS GLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAER
Sbjct: 314 KDLVYQQKKIKDTATSSSGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAER 373
Query: 361 LQISEHLSSWNMKKLSTSCYLHLVEKGEIYVRREGRLVRVRNVLELNMGDTIYRPLADGD 420
LQISEHLSSWNMKKLSTSCYLHLVEKGEIYVRREGRLVRVRNVLELNMGDTIYRPLADGD
Sbjct: 374 LQISEHLSSWNMKKLSTSCYLHLVEKGEIYVRREGRLVRVRNVLELNMGDTIYRPLADGD 433
Query: 421 VVLVNRPPSIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARV 480
+VLVNRPPSIHQHSLIALSVKLLPVS+VLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARV
Sbjct: 434 IVLVNRPPSIHQHSLIALSVKLLPVSAVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARV 493
Query: 481 EVRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQLQMLTLHQLLS 540
EVRELVSLD+QL NGQSGRNLLSLSHDSLTAAHLI+EDGVSLNLFQMQQLQMLTLHQLL
Sbjct: 494 EVRELVSLDKQLTNGQSGRNLLSLSHDSLTAAHLILEDGVSLNLFQMQQLQMLTLHQLLP 553
Query: 541 PAILKAPLLRNCAWTGKQLFSTLLPPDFDYSSPSHCVFIKNGELISSEGSYWLRDSGRNL 600
PAI+K+PLLRNCAWTGKQLFS LLPPDFDYSSPSH VFI+ GELISSEGSYWLRDSGRNL
Sbjct: 554 PAIVKSPLLRNCAWTGKQLFSILLPPDFDYSSPSHNVFIEKGELISSEGSYWLRDSGRNL 613
Query: 601 FQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKNMMDDIFCGLQ 660
FQALIEHCEGKTLDYL DAQGVLCEWLS RGLSVSLSDLYLSVDSYSH+NMMDDIFCGLQ
Sbjct: 614 FQALIEHCEGKTLDYLRDAQGVLCEWLSTRGLSVSLSDLYLSVDSYSHENMMDDIFCGLQ 673
Query: 661 EAEETCNLKQLMVDSHKDILTGDDEDNQHVLSIAVERLSYEKQKSAALNQASVDAFKKVF 720
EAEETCNLKQLMVDSHK+IL G+DEDNQH+LSIAVERL YEKQKSAALNQASVDAFKKVF
Sbjct: 674 EAEETCNLKQLMVDSHKEILIGNDEDNQHLLSIAVERLIYEKQKSAALNQASVDAFKKVF 733
Query: 721 RDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFSLPHKLTCSA 780
RDIQNLVYKYSGKDNSLLTMFKAGSKGNL+KLVQHSMCLGLQHSLVTLSFSLPHKL+C+A
Sbjct: 734 RDIQNLVYKYSGKDNSLLTMFKAGSKGNLMKLVQHSMCLGLQHSLVTLSFSLPHKLSCAA 793
Query: 781 WNSQKMPRYIQKDGLPDRTPSFIPYAVVESSFLSGLNPFECFAHSVTNRDSSFSDNAEVP 840
WNSQKMPRYIQKDGLPDRT SFIPYAVVE+SFLSGLNPFECFAHSVTNRDSSFSDNAEVP
Sbjct: 794 WNSQKMPRYIQKDGLPDRTQSFIPYAVVENSFLSGLNPFECFAHSVTNRDSSFSDNAEVP 853
Query: 841 GTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDGENNNRDRDIGG 900
GTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQF YDIDRPTS E + ENNNRDR IGG
Sbjct: 854 GTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFCYDIDRPTS---ESESENNNRDRGIGG 913
Query: 901 HPVGSLAACAMSEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLFLSEKLS 960
HPVGSLAACA+SEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLFLSEKLS
Sbjct: 914 HPVGSLAACAISEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLFLSEKLS 973
Query: 961 KRSYGFEYGALGVKNHLERVIFKDIVSNVMI----------------------------- 1020
KRSYGFEYGALGVKNHLERV+FKDIVS+VMI
Sbjct: 974 KRSYGFEYGALGVKNHLERVMFKDIVSSVMIIFSPLPSRKKHFSPWVCHFHVCKEILKKR 1033
Query: 1021 ---------------------------------TDCPLADSLREDGDTVCLTVTIAENTK 1080
DCPLADSL EDGDTVCLTVTIAENTK
Sbjct: 1034 RLKMNSVIHSLNMRCDSMRQEGRMNLPSLQIITQDCPLADSLTEDGDTVCLTVTIAENTK 1093
Query: 1081 NSFLQLDFIQDLLIHFLLGTVIRGFAEIDRVDIAWNDRPKVPKPRCNHGELYLRVTMSGE 1140
NSFLQLDFIQDLLIHFLLGTVIRGF EIDRVDI WNDRPKVPKPRC+HGELYLRVTMSGE
Sbjct: 1094 NSFLQLDFIQDLLIHFLLGTVIRGFTEIDRVDITWNDRPKVPKPRCSHGELYLRVTMSGE 1153
Query: 1141 GNSRFWATLMNNCLPIMDLIDWSRSHPDNTHSLCLAYGIDSGWKYFLNSLESATLDIGKT 1200
GNSRFWATLMNNCLPIMDLIDW+RSHPDNTHSLCLAYGIDSGWKYFLNSLESATLD+GKT
Sbjct: 1154 GNSRFWATLMNNCLPIMDLIDWTRSHPDNTHSLCLAYGIDSGWKYFLNSLESATLDVGKT 1213
Query: 1201 IRPKHLLLVANSLSATGEFIGLNVKGLSHQREHALVKTPFMQACFSTPGACFVKAAKAGI 1260
IR +HLLLV+NSLSATGEF+GLNVKGL+HQREHALVKTPFMQACFS+PGAC +KAAKAGI
Sbjct: 1214 IRLEHLLLVSNSLSATGEFVGLNVKGLTHQREHALVKTPFMQACFSSPGACMIKAAKAGI 1273
Query: 1261 KDNLSGSLDALAWGRIPSLGTGGQFDILYSGRGHELNKPVDVYNLLGGQSICEKQNAKIE 1320
KDNLSGSLDALAWGR+PSLGTGGQFDILYSG+GHELNKPVDVYNLLGGQS CEKQN KIE
Sbjct: 1274 KDNLSGSLDALAWGRMPSLGTGGQFDILYSGKGHELNKPVDVYNLLGGQSTCEKQNTKIE 1333
Query: 1321 SLDKNNISEKYSAQLVLKNGGSTIKGLKKLDSVSKSILREFLTLNDIQKLSFTLRTILHK 1380
SLDKN ISEKYSAQL+LKNGGSTIKGLK+LDSVSKSILR+FLTLNDIQKLSF LRTILHK
Sbjct: 1334 SLDKNTISEKYSAQLMLKNGGSTIKGLKRLDSVSKSILRKFLTLNDIQKLSFALRTILHK 1393
Query: 1381 YSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIKPTMQCLFSLGSISEVRLSGRLKK 1386
YSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDI K
Sbjct: 1394 YSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDI-----------------------K 1453
BLAST of HG10003230 vs. ExPASy TrEMBL
Match:
A0A1S4DY39 (DNA-directed RNA polymerase OS=Cucumis melo OX=3656 GN=LOC103490982 PE=4 SV=1)
HSP 1 Score: 2479.5 bits (6425), Expect = 0.0e+00
Identity = 1266/1506 (84.06%), Positives = 1310/1506 (86.99%), Query Frame = 0
Query: 1 MIHMEDEQDGELPIPSGLVTGINFSVSTQQDTENIAVMTVDASNEVSDPKLGLPNPSYQC 60
MIHMEDEQDGELPIPSG +TGINFSVS QQD ENIAV+TVDA++EVSDPKLGLPNPSYQC
Sbjct: 14 MIHMEDEQDGELPIPSGRLTGINFSVSNQQDIENIAVITVDAASEVSDPKLGLPNPSYQC 73
Query: 61 TTCGASSLKSCEG------LTRGVIHS----------TKDSPPHGDV-FEVDSEVEDPTS 120
TTCGASSLK CEG +IH K P + E+ +VEDPTS
Sbjct: 74 TTCGASSLKFCEGHFGVIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRQELWGKVEDPTS 133
Query: 121 DYHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKRVAKGGLPS 180
DY+RPKGCRYCFGSLKDWYPPMRFKLSTTDMF+KSMIMVEVKENMSKKYQKRVAKGGLPS
Sbjct: 134 DYNRPKGCRYCFGSLKDWYPPMRFKLSTTDMFKKSMIMVEVKENMSKKYQKRVAKGGLPS 193
Query: 181 DYWNFIPKDEQQEDSYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPATDSLFLNSFPVTP 240
DYW+FIPKDEQQE+SYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPA DSLFLNSFPVTP
Sbjct: 194 DYWDFIPKDEQQEESYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPAIDSLFLNSFPVTP 253
Query: 241 NSHRVTEMAHSFSNGQRLIF---------------------------LSPEKLQSKDLVY 300
NSHRVTEMAHSFSNGQRLIF L K K V+
Sbjct: 254 NSHRVTEMAHSFSNGQRLIFDERTRAYKKVVDFRGTANELGSRVLDCLKISKAIYKIFVF 313
Query: 301 ---------------QQKKIKDTATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSE 360
KKIKDTATSS GLRWIKDVVLGKRSDHCFRMVVVGDPNIELSE
Sbjct: 314 CLNHPREVTKXRFGLPAKKIKDTATSSSGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSE 373
Query: 361 IGIPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEKGEIYVRREGRLVRVRNVLELNMGD 420
IGIPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEKGEIYVRREGRLVRVRNVLELNMGD
Sbjct: 374 IGIPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEKGEIYVRREGRLVRVRNVLELNMGD 433
Query: 421 TIYRPLADGDVVLVNRPPSIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHG 480
TIYRPLADGDVVLVNRPPSIHQHSLIALSVKLLPVS+VLSLNPLCCSPFRGDFDGDCLHG
Sbjct: 434 TIYRPLADGDVVLVNRPPSIHQHSLIALSVKLLPVSAVLSLNPLCCSPFRGDFDGDCLHG 493
Query: 481 YVPQSLEARVEVRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQL 540
YVPQSLEARVEVRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLI+EDGVSLNLFQMQQL
Sbjct: 494 YVPQSLEARVEVRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLILEDGVSLNLFQMQQL 553
Query: 541 QMLTLHQLLSPAILKAPLLRNCAWTGKQLFSTLLPPDFDYSSPSHCVFIKNGELISSEGS 600
QMLTLHQLL PAI+K+PLLRNCAWTGKQLFS LLPPDF+YSSPSH VFI+ GELISSEGS
Sbjct: 554 QMLTLHQLLPPAIVKSPLLRNCAWTGKQLFSILLPPDFEYSSPSHNVFIEKGELISSEGS 613
Query: 601 YWLRDSGRNLFQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKN 660
YWLRDSGRNLFQALIEHCEGKTLDYL DAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKN
Sbjct: 614 YWLRDSGRNLFQALIEHCEGKTLDYLRDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKN 673
Query: 661 MMDDIFCGLQEAEETCNLKQLMVDSHKDILTGDDEDNQHVLSIAVERLSYEKQKSAALNQ 720
MMDDIFCGLQEAEETCNLKQLMVDSHK+ILTG+DEDNQH+LSIAVE L YEKQKSAALNQ
Sbjct: 674 MMDDIFCGLQEAEETCNLKQLMVDSHKEILTGNDEDNQHLLSIAVEHLIYEKQKSAALNQ 733
Query: 721 ASVDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSF 780
ASVDAFKKVFRDIQNLV+KYSGKDNSLLTMFKAGSKGNL+KLVQHSMCLGLQHSLVTLSF
Sbjct: 734 ASVDAFKKVFRDIQNLVHKYSGKDNSLLTMFKAGSKGNLMKLVQHSMCLGLQHSLVTLSF 793
Query: 781 SLPHKLTCSAWNSQKMPRYIQKDGLPDRTPSFIPYAVVESSFLSGLNPFECFAHSVTNRD 840
SLPHKL+CSAWNSQKMPRYIQ+DGLPDRT SFIPYAVVE+SFLSGLNPFECFAHSVTNRD
Sbjct: 794 SLPHKLSCSAWNSQKMPRYIQEDGLPDRTQSFIPYAVVENSFLSGLNPFECFAHSVTNRD 853
Query: 841 SSFSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDGE 900
SSFSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQF YDIDRPTSVS+E D E
Sbjct: 854 SSFSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFCYDIDRPTSVSSESDSE 913
Query: 901 NNNRDRDIGGHPVGSLAACAMSEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQT 960
NNRDRDIGGHPVGSLAACA SEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQT
Sbjct: 914 -NNRDRDIGGHPVGSLAACAFSEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQT 973
Query: 961 FSLFLSEKLSKRSYGFEYGALGVKNHLERVIFKDIVSNVMI------------------- 1020
FSLFLSEKLSKRSYGFEYGALGVKNHLERV+FKDIVS+VMI
Sbjct: 974 FSLFLSEKLSKRSYGFEYGALGVKNHLERVMFKDIVSSVMIIFSPQPSRKKHFSPWVCHF 1033
Query: 1021 -------------------------------------------TDCPLADSLREDGDTVC 1080
DCPLADSL EDGDTVC
Sbjct: 1034 HVCKDILKKRRLKMNSVIHSLNMRCDSVRQEGRMNLPSLQIITQDCPLADSLTEDGDTVC 1093
Query: 1081 LTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDRVDIAWNDRPKVPKPRCNHGE 1140
LTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDRVDI WNDRPKVPKPRC+HGE
Sbjct: 1094 LTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDRVDITWNDRPKVPKPRCSHGE 1153
Query: 1141 LYLRVTMSGEGNSRFWATLMNNCLPIMDLIDWSRSHPDNTHSLCLAYGIDSGWKYFLNSL 1200
LYLRVTMSGEGNSRFWATL+NNCLPIMDLIDW+RSHPDNTHSLCLAYGIDSGWKYFLNSL
Sbjct: 1154 LYLRVTMSGEGNSRFWATLINNCLPIMDLIDWTRSHPDNTHSLCLAYGIDSGWKYFLNSL 1213
Query: 1201 ESATLDIGKTIRPKHLLLVANSLSATGEFIGLNVKGLSHQREHALVKTPFMQACFSTPGA 1260
E ATLDIGKTIR +HLLLVANSLSATGEF+GLNVKGL+HQREHALVKTPFMQACFS+PGA
Sbjct: 1214 ECATLDIGKTIRLEHLLLVANSLSATGEFVGLNVKGLTHQREHALVKTPFMQACFSSPGA 1273
Query: 1261 CFVKAAKAGIKDNLSGSLDALAWGRIPSLGTGGQFDILYSGRGHELNKPVDVYNLLGGQS 1320
C +KAAKAGIKDNLSGSLDALAWGR+PSLGTGGQFDILYSG+GHELNKPVDVYNLLGGQS
Sbjct: 1274 CLIKAAKAGIKDNLSGSLDALAWGRMPSLGTGGQFDILYSGKGHELNKPVDVYNLLGGQS 1333
Query: 1321 ICEKQNAKIESLDKNNISEKYSAQLVLKNGGSTIKGLKKLDSVSKSILREFLTLNDIQKL 1380
CEKQNAKIES+DKNNISEKYSAQLVLKNGGSTIKGLK+LDSVSKSILR+FLTLNDIQKL
Sbjct: 1334 TCEKQNAKIESVDKNNISEKYSAQLVLKNGGSTIKGLKRLDSVSKSILRKFLTLNDIQKL 1393
Query: 1381 SFTLRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIKPTMQCLFSLGSIS 1386
SF LRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDI
Sbjct: 1394 SFALRTILHKYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDI-------------- 1453
BLAST of HG10003230 vs. ExPASy TrEMBL
Match:
A0A6J1FKU9 (DNA-directed RNA polymerase OS=Cucurbita moschata OX=3662 GN=LOC111444908 PE=4 SV=1)
HSP 1 Score: 2384.0 bits (6177), Expect = 0.0e+00
Identity = 1213/1497 (81.03%), Postives = 1278/1497 (85.37%), Query Frame = 0
Query: 1 MIHMEDEQDGELPIPSGLVTGINFSVSTQQDTENIAVMTVDASNEVSDPKLGLPNPSYQC 60
M HMEDEQD EL IPSG++ G+NFSVSTQQD ENIAV+ ++A+ EVSDPKLGLPNPSYQC
Sbjct: 14 MNHMEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVSDPKLGLPNPSYQC 73
Query: 61 TTCGASSLKSCEG------LTRGVIHS----------TKDSPPHGDV-FEVDSEVEDPTS 120
TTCGAS LK CEG +IH K P + E+ +VEDPTS
Sbjct: 74 TTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRRELWGKVEDPTS 133
Query: 121 DYHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKRVAKGGLPS 180
D+HRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKRVA+GGLP
Sbjct: 134 DFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKRVARGGLPP 193
Query: 181 DYWNFIPKDEQQEDSYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPATDSLFLNSFPVTP 240
DYWNFIPKDEQQE+SYCRPNRK+LTHAQVHYLLKDIDPKFLKKFV ATDSLFLNSFPVTP
Sbjct: 194 DYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSATDSLFLNSFPVTP 253
Query: 241 NSHRVTEMAHSFSNGQRLIF--------------------------------LSPEKLQS 300
N HRVTEM HSFS+GQRL+F LSPEKL+S
Sbjct: 254 NCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDCLKISKLSPEKLES 313
Query: 301 KDLVYQQKKIKDTATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAER 360
KDL+YQQKKIKDTATSS GLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAER
Sbjct: 314 KDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAER 373
Query: 361 LQISEHLSSWNMKKLSTSCYLHLVEKGEIYVRREGRLVRVRNVLELNMGDTIYRPLADGD 420
LQISEHLSSWNMKKLSTSCYL LVEKGEI+VRREGRLVRVR+VLEL+MGDTIYRPLADGD
Sbjct: 374 LQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELSMGDTIYRPLADGD 433
Query: 421 VVLVNRPPSIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARV 480
VVLVNRPPSIHQHSLIALSV++LPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARV
Sbjct: 434 VVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARV 493
Query: 481 EVRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQLQMLTLHQLLS 540
E+RELV+LDRQL+NGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQ+QQLQM LHQLL
Sbjct: 494 ELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQIQQLQMFALHQLLP 553
Query: 541 PAILKAPLLRNCAWTGKQLFSTLLPPDFDYSSPSHCVFIKNGELISSEGSYWLRDSGRNL 600
PAI+KAP R+CAWTGKQLFS LPPDFDYSSPSH V IKNGEL+SSEGSYWLRD+GRN
Sbjct: 554 PAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHIKNGELLSSEGSYWLRDTGRNP 613
Query: 601 FQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKNMMDDIFCGLQ 660
FQALIEHCEG+TL+YLH AQ VLCEWLSMRGLSVSLSDLYLSVDS+SHKNMMDDIFCGLQ
Sbjct: 614 FQALIEHCEGRTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHSHKNMMDDIFCGLQ 673
Query: 661 EAEETCNLKQLMVDSHKDILTGDDEDNQHVLSIAVERLSYEKQKSAALNQASVDAFKKVF 720
EAEETCNL QLMVDSHKD+LTGDDE NQHVLSI VE LSYEKQKSAALNQASVDAFK+VF
Sbjct: 674 EAEETCNLIQLMVDSHKDVLTGDDEGNQHVLSIEVEHLSYEKQKSAALNQASVDAFKRVF 733
Query: 721 RDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFSLPHKLTCSA 780
R+IQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSF LPHKL+CS+
Sbjct: 734 REIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFGLPHKLSCSS 793
Query: 781 WNSQKMPRYIQKDGLPDRTPSFIPYAVVESSFLSGLNPFECFAHSVTNRDSSFSDNAEVP 840
WNSQKMPRYIQKDGL DRT SFIPYAVVE+SFLSGLNPFECFAHSVTNRDSSFSDNAEVP
Sbjct: 794 WNSQKMPRYIQKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVTNRDSSFSDNAEVP 853
Query: 841 GTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDGENNNRDRDIGG 900
GTLTRKLTFLMRDIY AYD TVRNAYGNQLVQFSYD D P S SNELDGENNN +RDIGG
Sbjct: 854 GTLTRKLTFLMRDIYNAYDRTVRNAYGNQLVQFSYDTDSPMSTSNELDGENNNTNRDIGG 913
Query: 901 HPVGSLAACAMSEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLFLSEKLS 960
PVGSLAACA+SEAAYSALDQPISLLE SPLLNLK+VLECGSKRNS KQTFSLFL EKLS
Sbjct: 914 QPVGSLAACALSEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSPKQTFSLFLLEKLS 973
Query: 961 KRSYGFEYGALGVKNHLERVIFKDIVSNVMI----------------------------- 1020
KRSYGFEYGALGVKNHLERVIFKDIVS+VMI
Sbjct: 974 KRSYGFEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWVCHFHVCKEILKKR 1033
Query: 1021 ---------------------------------TDCPLADSLREDGDTVCLTVTIAENTK 1080
DC LADS REDGDTVCLTVTIAENTK
Sbjct: 1034 RLKISSVIHSLNMRCDSVRQEAKINLPFLHISTQDCSLADSSREDGDTVCLTVTIAENTK 1093
Query: 1081 NSFLQLDFIQDLLIHFLLGTVIRGFAEIDRVDIAWNDRPKVPKPRC-NHGELYLRVTMSG 1140
NSFLQLDFIQDLLIHFLLGTVIRGFAEID+VDI+WNDRPKVPKP C +HGELYLRVTMSG
Sbjct: 1094 NSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCKSHGELYLRVTMSG 1153
Query: 1141 EGNSRFWATLMNNCLPIMDLIDWSRSHPDNTHSLCLAYGIDSGWKYFLNSLESATLDIGK 1200
EGNSRFWATLMN+CLPIMDLIDWSRSHPDN HS C+AYGIDSG YFLNSLESATLDIGK
Sbjct: 1154 EGNSRFWATLMNHCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYFLNSLESATLDIGK 1213
Query: 1201 TIRPKHLLLVANSLSATGEFIGLNVKGLSHQREHALVKTPFMQACFSTPGACFVKAAKAG 1260
TIR +HLLLVAN+LSATGEF+GLNVKG+S QREHALVKTPFMQACFS+PGA FVKAAKAG
Sbjct: 1214 TIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFSSPGASFVKAAKAG 1273
Query: 1261 IKDNLSGSLDALAWGRIPSLGTGGQFDILYSGRGHELNKPVDVYNLLGGQSICEKQNAKI 1320
IKD+LSGSLDALAWG+IPS+GTGGQFDILYSG+GHELNKPVDVYNLLG Q ICEK N KI
Sbjct: 1274 IKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELNKPVDVYNLLGSQGICEKPNVKI 1333
Query: 1321 ESLDKNNISEKYSAQLVLKNGGSTIKGLKKLDSVSKSILREFLTLNDIQKLSFTLRTILH 1380
ESLDKN I EKYSA +V KNGGSTIKGLKKLDSVSKSILREFLTLNDIQKLS TLR+IL
Sbjct: 1334 ESLDKNTIYEKYSA-VVHKNGGSTIKGLKKLDSVSKSILREFLTLNDIQKLSHTLRSILR 1393
Query: 1381 KYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIKPTMQCLFSLGSISEVRLSGRLK 1386
KYSLNERLNEVDKSTLMMALYFHP RDEKIGVGAQDI
Sbjct: 1394 KYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDI----------------------- 1453
BLAST of HG10003230 vs. ExPASy TrEMBL
Match:
A0A6J1KSL8 (DNA-directed RNA polymerase OS=Cucurbita maxima OX=3661 GN=LOC111498320 PE=4 SV=1)
HSP 1 Score: 2381.3 bits (6170), Expect = 0.0e+00
Identity = 1211/1497 (80.90%), Positives = 1278/1497 (85.37%), Query Frame = 0
Query: 1 MIHMEDEQDGELPIPSGLVTGINFSVSTQQDTENIAVMTVDASNEVSDPKLGLPNPSYQC 60
M HMEDEQD EL IPSG++ G+NFSVSTQQD ENIAV+ ++A+ EVSDPKLGLPNPSYQC
Sbjct: 14 MNHMEDEQDSELQIPSGVLVGVNFSVSTQQDMENIAVINIEAACEVSDPKLGLPNPSYQC 73
Query: 61 TTCGASSLKSCEG------LTRGVIHS----------TKDSPPHGDV-FEVDSEVEDPTS 120
TTCGAS LK CEG +IH K P + E+ +VEDPTS
Sbjct: 74 TTCGASVLKCCEGHFGAIKFPYTIIHPYFLSEVAQVLNKVCPGCKSIRRELWGKVEDPTS 133
Query: 121 DYHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKRVAKGGLPS 180
D+HRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKRVA+GGLP
Sbjct: 134 DFHRPKGCRYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKRVARGGLPP 193
Query: 181 DYWNFIPKDEQQEDSYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPATDSLFLNSFPVTP 240
DYWNFIPKDEQQE+SYCRPNRK+LTHAQVHYLLKDIDPKFLKKFV ATDSLFLNSFPVTP
Sbjct: 194 DYWNFIPKDEQQEESYCRPNRKVLTHAQVHYLLKDIDPKFLKKFVSATDSLFLNSFPVTP 253
Query: 241 NSHRVTEMAHSFSNGQRLIF--------------------------------LSPEKLQS 300
N HRVTEM HSFS+GQRL+F LSPEKL+S
Sbjct: 254 NCHRVTEMTHSFSSGQRLVFDERTRAYKKLVDFRGTANELGSRVLDCLKISKLSPEKLES 313
Query: 301 KDLVYQQKKIKDTATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAER 360
KDL+YQQKKIKDTATSS GLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAER
Sbjct: 314 KDLIYQQKKIKDTATSSNGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAER 373
Query: 361 LQISEHLSSWNMKKLSTSCYLHLVEKGEIYVRREGRLVRVRNVLELNMGDTIYRPLADGD 420
LQISEHLSSWNMKKLSTSCYL LVEKGEI+VRREGRLVRVR+VLEL+MGDTIYRPLADGD
Sbjct: 374 LQISEHLSSWNMKKLSTSCYLRLVEKGEIFVRREGRLVRVRHVLELSMGDTIYRPLADGD 433
Query: 421 VVLVNRPPSIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARV 480
VVLVNRPPSIHQHSLIALSV++LPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARV
Sbjct: 434 VVLVNRPPSIHQHSLIALSVRVLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARV 493
Query: 481 EVRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQLQMLTLHQLLS 540
E+RELV+LDRQL+NGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQ+QQLQM LHQLL
Sbjct: 494 ELRELVALDRQLVNGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQIQQLQMFALHQLLP 553
Query: 541 PAILKAPLLRNCAWTGKQLFSTLLPPDFDYSSPSHCVFIKNGELISSEGSYWLRDSGRNL 600
PAI+KAP R+CAWTGKQLFS LPPDFDYSSPSH V IKNGEL+SSEGSYWLRD+GRN
Sbjct: 554 PAIVKAPSFRSCAWTGKQLFSIFLPPDFDYSSPSHRVHIKNGELLSSEGSYWLRDTGRNP 613
Query: 601 FQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKNMMDDIFCGLQ 660
FQALIEHCEG TL+YLH AQ VLCEWLSMRGLSVSLSDLYLSVDS+SHKNMMDDIFCGLQ
Sbjct: 614 FQALIEHCEGMTLNYLHIAQRVLCEWLSMRGLSVSLSDLYLSVDSHSHKNMMDDIFCGLQ 673
Query: 661 EAEETCNLKQLMVDSHKDILTGDDEDNQHVLSIAVERLSYEKQKSAALNQASVDAFKKVF 720
EAEETCNL QLMVDSHKD LTGDDE NQHVLSI VE LSYEKQKSAALNQASVDAFK+VF
Sbjct: 674 EAEETCNLIQLMVDSHKDALTGDDEGNQHVLSIEVEHLSYEKQKSAALNQASVDAFKRVF 733
Query: 721 RDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFSLPHKLTCSA 780
R+IQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSF LPHKL+CS+
Sbjct: 734 REIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFGLPHKLSCSS 793
Query: 781 WNSQKMPRYIQKDGLPDRTPSFIPYAVVESSFLSGLNPFECFAHSVTNRDSSFSDNAEVP 840
WNSQKMPRYI+KDGL DRT SFIPYAVVE+SFLSGLNPFECFAHSVTNRDSSFSDNAEVP
Sbjct: 794 WNSQKMPRYIRKDGLADRTQSFIPYAVVENSFLSGLNPFECFAHSVTNRDSSFSDNAEVP 853
Query: 841 GTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDGENNNRDRDIGG 900
GTLTRKLTFLMRDIY AYDGTVRNAYGNQLVQFSYD D PTS+SNELDGENNN +RDIGG
Sbjct: 854 GTLTRKLTFLMRDIYNAYDGTVRNAYGNQLVQFSYDTDSPTSISNELDGENNNTNRDIGG 913
Query: 901 HPVGSLAACAMSEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLFLSEKLS 960
PVGSLAACA+SEAAYSALDQPISLLE SPLLNLK+VLECGSKRNS KQ FSLFL EKLS
Sbjct: 914 QPVGSLAACAISEAAYSALDQPISLLETSPLLNLKKVLECGSKRNSPKQIFSLFLLEKLS 973
Query: 961 KRSYGFEYGALGVKNHLERVIFKDIVSNVMI----------------------------- 1020
KRSYG+EYGALGVKNHLERVIFKDIVS+VMI
Sbjct: 974 KRSYGYEYGALGVKNHLERVIFKDIVSSVMIIFAPEPSRKRHFSPWVCHFHVCKEILKKR 1033
Query: 1021 ---------------------------------TDCPLADSLREDGDTVCLTVTIAENTK 1080
DC LADS REDGDTVCLTVTIAENTK
Sbjct: 1034 RLKISSVIHSLNMRCDSMRQEAKINLPFLHISTQDCSLADSSREDGDTVCLTVTIAENTK 1093
Query: 1081 NSFLQLDFIQDLLIHFLLGTVIRGFAEIDRVDIAWNDRPKVPKPRC-NHGELYLRVTMSG 1140
NSFLQLDFIQDLLIHFLLGTVIRGFAEID+VDI+WNDRPKVPKP C +HGELYLRVTMSG
Sbjct: 1094 NSFLQLDFIQDLLIHFLLGTVIRGFAEIDKVDISWNDRPKVPKPHCKSHGELYLRVTMSG 1153
Query: 1141 EGNSRFWATLMNNCLPIMDLIDWSRSHPDNTHSLCLAYGIDSGWKYFLNSLESATLDIGK 1200
EGNSRFWATLMNNCLPIMDLIDWSRSHPDN HS C+AYGIDSG YFLNSLESATLDIGK
Sbjct: 1154 EGNSRFWATLMNNCLPIMDLIDWSRSHPDNIHSFCMAYGIDSGRNYFLNSLESATLDIGK 1213
Query: 1201 TIRPKHLLLVANSLSATGEFIGLNVKGLSHQREHALVKTPFMQACFSTPGACFVKAAKAG 1260
TIR +HLLLVAN+LSATGEF+GLNVKG+S QREHALVKTPFMQACFS+PGA FVKAAKAG
Sbjct: 1214 TIRHEHLLLVANTLSATGEFVGLNVKGVSRQREHALVKTPFMQACFSSPGASFVKAAKAG 1273
Query: 1261 IKDNLSGSLDALAWGRIPSLGTGGQFDILYSGRGHELNKPVDVYNLLGGQSICEKQNAKI 1320
IKD+LSGSLDALAWG+IPS+GTGGQFDILYSG+GHEL+KPVDVYNLLG Q ICEK N K+
Sbjct: 1274 IKDSLSGSLDALAWGKIPSMGTGGQFDILYSGKGHELSKPVDVYNLLGSQGICEKPNVKM 1333
Query: 1321 ESLDKNNISEKYSAQLVLKNGGSTIKGLKKLDSVSKSILREFLTLNDIQKLSFTLRTILH 1380
ESLDKN I EKYSA +V KNGGSTIKGLKKLDSVSKSILREFLTLNDIQKLS TLR+IL
Sbjct: 1334 ESLDKNTIYEKYSA-VVHKNGGSTIKGLKKLDSVSKSILREFLTLNDIQKLSHTLRSILR 1393
Query: 1381 KYSLNERLNEVDKSTLMMALYFHPHRDEKIGVGAQDIKPTMQCLFSLGSISEVRLSGRLK 1386
KYSLNERLNEVDKSTLMMALYFHP RDEKIGVGAQDI
Sbjct: 1394 KYSLNERLNEVDKSTLMMALYFHPQRDEKIGVGAQDI----------------------- 1453
BLAST of HG10003230 vs. TAIR 10
Match:
AT1G63020.1 (nuclear RNA polymerase D1A)
HSP 1 Score: 1271.9 bits (3290), Expect = 0.0e+00
Identity = 712/1492 (47.72%), Positives = 936/1492 (62.73%), Query Frame = 0
Query: 4 MEDEQDGELPIPSGLVTGINFSVSTQQDTENIAVMTVDASNEVSDPKLGLPNPSYQCTTC 63
MED+ + EL +P G +T I FS+S D + ++V+ V+A N+V+D +LGLPNP C TC
Sbjct: 1 MEDDCE-ELQVPVGTLTSIGFSISNNNDRDKMSVLEVEAPNQVTDSRLGLPNPDSVCRTC 60
Query: 64 GASSLKSCEGLTRGVIHSTKDSPPHGDVFEVDSEVED--PTSDYHR----------PKGC 123
G+ K CEG GVI+ + EV + + P Y R P+ C
Sbjct: 61 GSKDRKVCEG-HFGVINFAYSIINPYFLKEVAALLNKICPGCKYIRKKQFQITEDQPERC 120
Query: 124 RYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKRVAKGGLPSDYWNFIPK 183
RYC +L YP M+F+++T ++FR+S I+VEV E K +KR LP DYW+F+P+
Sbjct: 121 RYC--TLNTGYPLMKFRVTTKEVFRRSGIVVEVNEESLMKLKKRGVL-TLPPDYWSFLPQ 180
Query: 184 DEQQEDSYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPATDSLFLNSFPVTPNSHRVTEM 243
D ++S +P R+I+THAQV+ LL ID + +KK +P +SL L SFPVTPN +RVTE+
Sbjct: 181 DSNIDESCLKPTRRIITHAQVYALLLGIDQRLIKKDIPMFNSLGLTSFPVTPNGYRVTEI 240
Query: 244 AHSFSNGQRLIFLSPEKLQSK----------------DLVYQQKKIKDTATSS------- 303
H F NG RLIF ++ K + + + +T +SS
Sbjct: 241 VHQF-NGARLIFDERTRIYKKLVGFEGNTLELSSRVMECMQYSRLFSETVSSSKDSANPY 300
Query: 304 ---------YGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLS 363
GLR++KDV+LGKRSDH FR VVVGDP+++L+EIGIP +A+RLQ+SEHL+
Sbjct: 301 QKKSDTPKLCGLRFMKDVLLGKRSDHTFRTVVVGDPSLKLNEIGIPESIAKRLQVSEHLN 360
Query: 364 SWNMKKLSTSCYLHLVEKGEIYVRREGRLVRVRNVLELNMGDTIYRPLADGDVVLVNRPP 423
N ++L TS L++ E++VRR RLV ++ V +L GD I+R L DGD VL+NRPP
Sbjct: 361 QCNKERLVTSFVPTLLDNKEMHVRRGDRLVAIQ-VNDLQTGDKIFRSLMDGDTVLMNRPP 420
Query: 424 SIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVSL 483
SIHQHSLIA++V++LP +SV+SLNP+CC PFRGDFDGDCLHGYVPQS++A+VE+ ELV+L
Sbjct: 421 SIHQHSLIAMTVRILPTTSVVSLNPICCLPFRGDFDGDCLHGYVPQSIQAKVELDELVAL 480
Query: 484 DRQLINGQSGRNLLSLSHDSLTAAHLI-MEDGVSLNLFQMQQLQMLTLHQLLSPAILKA- 543
D+QLIN Q+GRNLLSL DSLTAA+L+ +E LN QMQQLQM QL PAI+KA
Sbjct: 481 DKQLINRQNGRNLLSLGQDSLTAAYLVNVEKNCYLNRAQMQQLQMYCPFQLPPPAIIKAS 540
Query: 544 PLLRNCAWTGKQLFSTLLPPDFDYSSPSHCVFIKNGELIS-SEGSYWLRDSGRNLFQALI 603
P WTG QLF L PP FDY+ P + V + NGEL+S SEGS WLRD N + L+
Sbjct: 541 PSSTEPQWTGMQLFGMLFPPGFDYTYPLNNVVVSNGELLSFSEGSAWLRDGEGNFIERLL 600
Query: 604 EHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKNMMDDIFCGLQEAEET 663
+H +GK LD ++ AQ +L +WL MRGLSVSL+DLYLS D S KN+ ++I GL+EAE+
Sbjct: 601 KHDKGKVLDIIYSAQEMLSQWLLMRGLSVSLADLYLSSDLQSRKNLTEEISYGLREAEQV 660
Query: 664 CNLKQLMVDSHKDILTGDDEDNQHVLSIAVERLSYEKQKSAALNQASVDAFKKVFRDIQN 723
CN +QLMV+S +D L + ED + + R YE+QKSA L++ +V AFK +RD+Q
Sbjct: 661 CNKQQLMVESWRDFLAVNGEDKEEDSVSDLARFCYERQKSATLSELAVSAFKDAYRDVQA 720
Query: 724 LVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFSLPHKLTCSAWNSQK 783
L Y+Y + NS L M KAGSKGN+ KLVQHSMC+GLQ+S V+LSF P +LTC+AWN
Sbjct: 721 LAYRYGDQSNSFLIMSKAGSKGNIGKLVQHSMCIGLQNSAVSLSFGFPRELTCAAWNDPN 780
Query: 784 MPRYIQKDGLPDRTPSFIPYAVVESSFLSGLNPFECFAHSVTNRDSSFSDNAEVPGTLTR 843
P K T S++PY V+E+SFL+GLNP E F HSVT+RDSSFS NA++PGTL+R
Sbjct: 781 SPLRGAKGKDSTTTESYVPYGVIENSFLTGLNPLESFVHSVTSRDSSFSGNADLPGTLSR 840
Query: 844 KLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDGENNNRDRDIGGHPVGS 903
+L F MRDIY AYDGTVRN++GNQLVQF+Y+ D P DI G +GS
Sbjct: 841 RLMFFMRDIYAAYDGTVRNSFGNQLVQFTYETDGPV--------------EDITGEALGS 900
Query: 904 LAACAMSEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLFLSEKLSKRSYG 963
L+ACA+SEAAYSALDQPISLLE SPLLNLK VLECGSK+ +QT SL+LSE LSK+ +G
Sbjct: 901 LSACALSEAAYSALDQPISLLETSPLLNLKNVLECGSKKGQREQTMSLYLSEYLSKKKHG 960
Query: 964 FEYGALGVKNHLERVIFKDIVSNVMI-------TDCPLA--------------------- 1023
FEYG+L +KNHLE++ F +IVS MI T PL+
Sbjct: 961 FEYGSLEIKNHLEKLSFSEIVSTSMIIFSPSSNTKVPLSPWVCHFHISEKVLKRKQLSAE 1020
Query: 1024 ---DSLRED-------------------------------GDTVCLTVTIAENTKNSFLQ 1083
SL E D VC+TVT+ E +K+S L+
Sbjct: 1021 SVVSSLNEQYKSRNRELKLDIVDLDIQNTNHCSSDDQAMKDDNVCITVTVVEASKHSVLE 1080
Query: 1084 LDFIQDLLIHFLLGTVIRGFAEIDRVDIAWNDRPKVPKPRCNH--GELYLRVTMSGE-GN 1143
LD I+ +LI FLL + ++G I +V+I W DRPK PK NH GELYL+VTM G+ G
Sbjct: 1081 LDAIRLVLIPFLLDSPVKGDQGIKKVNILWTDRPKAPKRNGNHLAGELYLKVTMYGDRGK 1140
Query: 1144 SRFWATLMNNCLPIMDLIDWSRSHPDNTHSLCLAYGIDSGWKYFLNSLESATLDIGKTIR 1203
W L+ CLPIMD+IDW RSHPDN C YGID+G F+ +LESA D GK I
Sbjct: 1141 RNCWTALLETCLPIMDMIDWGRSHPDNIRQCCSVYGIDAGRSIFVANLESAVSDTGKEIL 1200
Query: 1204 PKHLLLVANSLSATGEFIGLNVKGLSHQREHALVKTPFMQACFSTPGACFVKAAKAGIKD 1263
+HLLLVA+SLS TGEF+ LN KG S QR+ PF QACFS+P CF+KAAK G++D
Sbjct: 1201 REHLLLVADSLSVTGEFVALNAKGWSKQRQVESTPAPFTQACFSSPSQCFLKAAKEGVRD 1260
Query: 1264 NLSGSLDALAWGRIPSLGTGGQFDILYSGRGHELNKPVDVYNLLGGQSICEKQNAKIESL 1323
+L GS+DALAWG++P GTG QF+I+ S + H PVDVY+LL + N+ +
Sbjct: 1261 DLQGSIDALAWGKVPGFGTGDQFEIIISPKVHGFTTPVDVYDLLSSTKTMRRTNSAPK-- 1320
Query: 1324 DKNNISEKYSAQLVLKNGGSTIKGLKKLD--SVSKSILREFLTLNDIQKLSFTLRTILHK 1382
S+K + Q + +K +K LD + S+LR T +I+ LS +L+ ILH
Sbjct: 1321 -----SDKATVQPFGLLHSAFLKDIKVLDGKGIPMSLLRTIFTWKNIELLSQSLKRILHS 1380
BLAST of HG10003230 vs. TAIR 10
Match:
AT1G63020.2 (nuclear RNA polymerase D1A)
HSP 1 Score: 1271.9 bits (3290), Expect = 0.0e+00
Identity = 712/1492 (47.72%), Positives = 936/1492 (62.73%), Query Frame = 0
Query: 4 MEDEQDGELPIPSGLVTGINFSVSTQQDTENIAVMTVDASNEVSDPKLGLPNPSYQCTTC 63
MED+ + EL +P G +T I FS+S D + ++V+ V+A N+V+D +LGLPNP C TC
Sbjct: 1 MEDDCE-ELQVPVGTLTSIGFSISNNNDRDKMSVLEVEAPNQVTDSRLGLPNPDSVCRTC 60
Query: 64 GASSLKSCEGLTRGVIHSTKDSPPHGDVFEVDSEVED--PTSDYHR----------PKGC 123
G+ K CEG GVI+ + EV + + P Y R P+ C
Sbjct: 61 GSKDRKVCEG-HFGVINFAYSIINPYFLKEVAALLNKICPGCKYIRKKQFQITEDQPERC 120
Query: 124 RYCFGSLKDWYPPMRFKLSTTDMFRKSMIMVEVKENMSKKYQKRVAKGGLPSDYWNFIPK 183
RYC +L YP M+F+++T ++FR+S I+VEV E K +KR LP DYW+F+P+
Sbjct: 121 RYC--TLNTGYPLMKFRVTTKEVFRRSGIVVEVNEESLMKLKKRGVL-TLPPDYWSFLPQ 180
Query: 184 DEQQEDSYCRPNRKILTHAQVHYLLKDIDPKFLKKFVPATDSLFLNSFPVTPNSHRVTEM 243
D ++S +P R+I+THAQV+ LL ID + +KK +P +SL L SFPVTPN +RVTE+
Sbjct: 181 DSNIDESCLKPTRRIITHAQVYALLLGIDQRLIKKDIPMFNSLGLTSFPVTPNGYRVTEI 240
Query: 244 AHSFSNGQRLIFLSPEKLQSK----------------DLVYQQKKIKDTATSS------- 303
H F NG RLIF ++ K + + + +T +SS
Sbjct: 241 VHQF-NGARLIFDERTRIYKKLVGFEGNTLELSSRVMECMQYSRLFSETVSSSKDSANPY 300
Query: 304 ---------YGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLS 363
GLR++KDV+LGKRSDH FR VVVGDP+++L+EIGIP +A+RLQ+SEHL+
Sbjct: 301 QKKSDTPKLCGLRFMKDVLLGKRSDHTFRTVVVGDPSLKLNEIGIPESIAKRLQVSEHLN 360
Query: 364 SWNMKKLSTSCYLHLVEKGEIYVRREGRLVRVRNVLELNMGDTIYRPLADGDVVLVNRPP 423
N ++L TS L++ E++VRR RLV ++ V +L GD I+R L DGD VL+NRPP
Sbjct: 361 QCNKERLVTSFVPTLLDNKEMHVRRGDRLVAIQ-VNDLQTGDKIFRSLMDGDTVLMNRPP 420
Query: 424 SIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVSL 483
SIHQHSLIA++V++LP +SV+SLNP+CC PFRGDFDGDCLHGYVPQS++A+VE+ ELV+L
Sbjct: 421 SIHQHSLIAMTVRILPTTSVVSLNPICCLPFRGDFDGDCLHGYVPQSIQAKVELDELVAL 480
Query: 484 DRQLINGQSGRNLLSLSHDSLTAAHLI-MEDGVSLNLFQMQQLQMLTLHQLLSPAILKA- 543
D+QLIN Q+GRNLLSL DSLTAA+L+ +E LN QMQQLQM QL PAI+KA
Sbjct: 481 DKQLINRQNGRNLLSLGQDSLTAAYLVNVEKNCYLNRAQMQQLQMYCPFQLPPPAIIKAS 540
Query: 544 PLLRNCAWTGKQLFSTLLPPDFDYSSPSHCVFIKNGELIS-SEGSYWLRDSGRNLFQALI 603
P WTG QLF L PP FDY+ P + V + NGEL+S SEGS WLRD N + L+
Sbjct: 541 PSSTEPQWTGMQLFGMLFPPGFDYTYPLNNVVVSNGELLSFSEGSAWLRDGEGNFIERLL 600
Query: 604 EHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSHKNMMDDIFCGLQEAEET 663
+H +GK LD ++ AQ +L +WL MRGLSVSL+DLYLS D S KN+ ++I GL+EAE+
Sbjct: 601 KHDKGKVLDIIYSAQEMLSQWLLMRGLSVSLADLYLSSDLQSRKNLTEEISYGLREAEQV 660
Query: 664 CNLKQLMVDSHKDILTGDDEDNQHVLSIAVERLSYEKQKSAALNQASVDAFKKVFRDIQN 723
CN +QLMV+S +D L + ED + + R YE+QKSA L++ +V AFK +RD+Q
Sbjct: 661 CNKQQLMVESWRDFLAVNGEDKEEDSVSDLARFCYERQKSATLSELAVSAFKDAYRDVQA 720
Query: 724 LVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTLSFSLPHKLTCSAWNSQK 783
L Y+Y + NS L M KAGSKGN+ KLVQHSMC+GLQ+S V+LSF P +LTC+AWN
Sbjct: 721 LAYRYGDQSNSFLIMSKAGSKGNIGKLVQHSMCIGLQNSAVSLSFGFPRELTCAAWNDPN 780
Query: 784 MPRYIQKDGLPDRTPSFIPYAVVESSFLSGLNPFECFAHSVTNRDSSFSDNAEVPGTLTR 843
P K T S++PY V+E+SFL+GLNP E F HSVT+RDSSFS NA++PGTL+R
Sbjct: 781 SPLRGAKGKDSTTTESYVPYGVIENSFLTGLNPLESFVHSVTSRDSSFSGNADLPGTLSR 840
Query: 844 KLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNELDGENNNRDRDIGGHPVGS 903
+L F MRDIY AYDGTVRN++GNQLVQF+Y+ D P DI G +GS
Sbjct: 841 RLMFFMRDIYAAYDGTVRNSFGNQLVQFTYETDGPV--------------EDITGEALGS 900
Query: 904 LAACAMSEAAYSALDQPISLLEASPLLNLKRVLECGSKRNSTKQTFSLFLSEKLSKRSYG 963
L+ACA+SEAAYSALDQPISLLE SPLLNLK VLECGSK+ +QT SL+LSE LSK+ +G
Sbjct: 901 LSACALSEAAYSALDQPISLLETSPLLNLKNVLECGSKKGQREQTMSLYLSEYLSKKKHG 960
Query: 964 FEYGALGVKNHLERVIFKDIVSNVMI-------TDCPLA--------------------- 1023
FEYG+L +KNHLE++ F +IVS MI T PL+
Sbjct: 961 FEYGSLEIKNHLEKLSFSEIVSTSMIIFSPSSNTKVPLSPWVCHFHISEKVLKRKQLSAE 1020
Query: 1024 ---DSLRED-------------------------------GDTVCLTVTIAENTKNSFLQ 1083
SL E D VC+TVT+ E +K+S L+
Sbjct: 1021 SVVSSLNEQYKSRNRELKLDIVDLDIQNTNHCSSDDQAMKDDNVCITVTVVEASKHSVLE 1080
Query: 1084 LDFIQDLLIHFLLGTVIRGFAEIDRVDIAWNDRPKVPKPRCNH--GELYLRVTMSGE-GN 1143
LD I+ +LI FLL + ++G I +V+I W DRPK PK NH GELYL+VTM G+ G
Sbjct: 1081 LDAIRLVLIPFLLDSPVKGDQGIKKVNILWTDRPKAPKRNGNHLAGELYLKVTMYGDRGK 1140
Query: 1144 SRFWATLMNNCLPIMDLIDWSRSHPDNTHSLCLAYGIDSGWKYFLNSLESATLDIGKTIR 1203
W L+ CLPIMD+IDW RSHPDN C YGID+G F+ +LESA D GK I
Sbjct: 1141 RNCWTALLETCLPIMDMIDWGRSHPDNIRQCCSVYGIDAGRSIFVANLESAVSDTGKEIL 1200
Query: 1204 PKHLLLVANSLSATGEFIGLNVKGLSHQREHALVKTPFMQACFSTPGACFVKAAKAGIKD 1263
+HLLLVA+SLS TGEF+ LN KG S QR+ PF QACFS+P CF+KAAK G++D
Sbjct: 1201 REHLLLVADSLSVTGEFVALNAKGWSKQRQVESTPAPFTQACFSSPSQCFLKAAKEGVRD 1260
Query: 1264 NLSGSLDALAWGRIPSLGTGGQFDILYSGRGHELNKPVDVYNLLGGQSICEKQNAKIESL 1323
+L GS+DALAWG++P GTG QF+I+ S + H PVDVY+LL + N+ +
Sbjct: 1261 DLQGSIDALAWGKVPGFGTGDQFEIIISPKVHGFTTPVDVYDLLSSTKTMRRTNSAPK-- 1320
Query: 1324 DKNNISEKYSAQLVLKNGGSTIKGLKKLD--SVSKSILREFLTLNDIQKLSFTLRTILHK 1382
S+K + Q + +K +K LD + S+LR T +I+ LS +L+ ILH
Sbjct: 1321 -----SDKATVQPFGLLHSAFLKDIKVLDGKGIPMSLLRTIFTWKNIELLSQSLKRILHS 1380
BLAST of HG10003230 vs. TAIR 10
Match:
AT2G40030.1 (nuclear RNA polymerase D1B)
HSP 1 Score: 315.1 bits (806), Expect = 2.8e-85
Identity = 343/1356 (25.29%), Positives = 551/1356 (40.63%), Query Frame = 0
Query: 4 MEDEQDGELPIPSGLVTGINFSVSTQQD--TENIAVMTVDASNEVSDPKLGLPNPSYQCT 63
ME+E E I G + GI F++++ + ++I+ ++ +++++ LGLP +C
Sbjct: 1 MEEESTSE--ILDGEIVGITFALASHHEICIQSISESAINHPSQLTNAFLGLPLEFGKCE 60
Query: 64 TCGASSLKSCEGLTRGVIHSTKDSPPHGDVFEVDSEVEDPTSDYHRPKGCRYCFGSLKDW 123
+CGA+ CEG G I V E+ + + K + G L D
Sbjct: 61 SCGATEPDKCEG-HFGYIQLPVPIYHPAHVNELKQMLSLLCLKCLKIKKAKGTSGGLAD- 120
Query: 124 YPPMRFKLSTTDMFRKSMIMVEVKENMSKKY--QKRVAKGGLPSDYWNFIPKDEQQEDS- 183
+L S I ++ + + Y K ++ L WNF+ + + S
Sbjct: 121 ------RLLGVCCEEASQISIKDRASDGASYLELKLPSRSRLQPGCWNFLERYGYRYGSD 180
Query: 184 YCRPNRKILTHAQVHYLLKDIDPKFLKKFVP----ATDSLFLNSFPVTPNSHRVTEMAHS 243
Y RP L +V +L+ I + KK + L PV PN V E +
Sbjct: 181 YTRP----LLAREVKEILRRIPEESRKKLTAKGHIPQEGYILEYLPVPPNCLSVPEASDG 240
Query: 244 FSNGQRLIFLSPEKLQSKDLV--------------------------------------- 303
FS + + P +++ KD++
Sbjct: 241 FST----MSVDPSRIELKDVLKKVIAIKSSRSGETNFESHKAEASEMFRVVDTYLQVRGT 300
Query: 304 ----------YQQKKIKDTATSSYGLRWIKDVVLGKRSDHCFRMVVVGDPNIELSEIGIP 363
Y KI D+++S ++ + + K S R V+ GD ++E+GIP
Sbjct: 301 AKAARNIDMRYGVSKISDSSSSKAWTEKMRTLFIRKGSGFSSRSVITGDAYRHVNEVGIP 360
Query: 364 CHVAERLQISEHLSSWN----MKKLSTSCYLHLVEKGEIYVRREGRLVRVRNVLELNMGD 423
+A+R+ E +S N K + L + Y R+G + EL G
Sbjct: 361 IEIAQRITFEERVSVHNRGYLQKLVDDKLCLSYTQGSTTYSLRDGS----KGHTELKPGQ 420
Query: 424 TIYRPLADGDVVLVNRPPSIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHG 483
++R + DGDVV +NRPP+ H+HSL AL V + + + +NPL CSP DFDGDC+H
Sbjct: 421 VVHRRVMDGDVVFINRPPTTHKHSLQALRV-YVHEDNTVKINPLMCSPLSADFDGDCVHL 480
Query: 484 YVPQSLEARVEVRELVSLDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQL 543
+ PQSL A+ EV EL S+++QL++ +G+ +L + DSL + +++E V L+ QQL
Sbjct: 481 FYPQSLSAKAEVMELFSVEKQLLSSHTGQLILQMGSDSLLSLRVMLE-RVFLDKATAQQL 540
Query: 544 QMLTLHQLLSPAILKAPLLRNCAWTGKQLFSTLLPPDFDYSSPSHCVFIKNGELISSEGS 603
M L PA+ K+ AWT Q+ P S + +L+ +
Sbjct: 541 AMYGSLSLPPPALRKSS-KSGPAWTVFQILQLAFPERL--SCKGDRFLVDGSDLLKFDFG 600
Query: 604 YWLRDSGRN--LFQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSH 663
S N + +E +TL + Q +L E L G S+SL DL S S
Sbjct: 601 VDAMGSIINEIVTSIFLEKGPKETLGFFDSLQPLLMESLFAEGFSLSLEDL-----SMSR 660
Query: 664 KNMMDDIFCGLQEAEETCNLKQLMVDSHKDILTGDDEDNQHVLSIAVERLSYEKQKSAAL 723
+M D+ L E + + +L + S++D L ++ S K K A
Sbjct: 661 ADM--DVIHNLIIREISPMVSRLRL-SYRDELQLEN--------------SIHKVKEVAA 720
Query: 724 NQASVDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQHSLVTL 783
N F I+NL+ K NS +T KLVQ + LGLQ S
Sbjct: 721 N------FMLKSYSIRNLI---DIKSNSAIT-----------KLVQQTGFLGLQLSDKKK 780
Query: 784 SFSLPHKLTCSAWNSQKMPRYIQKDGLPDRTPSFIPYAVVESSFLSGLNPFECFAHSVTN 843
++ + + +K R I G + +V+ F GL+P+E AHS+
Sbjct: 781 FYTKTLVEDMAIFCKRKYGR-ISSSG---------DFGIVKGCFFHGLDPYEEMAHSIAA 840
Query: 844 RD--SSFSDNAEVPGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPTSVSNE 903
R+ S PGTL + L ++RDI DGTVRN N ++QF Y +
Sbjct: 841 REVIVRSSRGLAEPGTLFKNLMAVLRDIVITNDGTVRNTCSNSVIQFKYGV--------- 900
Query: 904 LDGENNNRDRDIGGHPVGSLAACAMSEAAYSALDQPISLLEASPLLN-----LKRVLEC- 963
D E ++ G PVG LAA AMS AY A +L++SP N +K VL C
Sbjct: 901 -DSERGHQGLFEAGEPVGVLAATAMSNPAYKA------VLDSSPNSNSSWELMKEVLLCK 960
Query: 964 -GSKRNSTKQTFSLFLSEKLSKRSYGFEYGALGVKNHLERVIFKDIVSNVMI-------- 1023
+ + + L+L+E + + E A V+N L +V KD ++
Sbjct: 961 VNFQNTTNDRRVILYLNECHCGKRFCQENAACTVRNKLNKVSLKDTAVEFLVEYRKQPTI 1020
Query: 1024 --------------------------------------------------TD-------- 1083
TD
Sbjct: 1021 SEIFGIDSCLHGHIHLNKTLLQDWNISMQDIHQKCEDVINSLGQKKKKKATDDFKRTSLS 1080
Query: 1084 ----CPLADSLREDG-DTVCLTVTIAENTKNSFLQLDFIQDLLIHFLLGTVIRGFAEIDR 1143
C D G D CLT + + LD + + + LL VI+G + I
Sbjct: 1081 VSECCSFRDPCGSKGSDMPCLTFSYNATDPDLERTLDVLCNTVYPVLLEIVIKGDSRICS 1140
Query: 1144 VDIAWNDRPKVPKPRCNH----GELYLRVTMSGEG---NSRFWATLMNNCLPIMDLIDWS 1203
+I WN R H GE L VT+ + W ++++CL ++ LID
Sbjct: 1141 ANIIWNSSDMTTWIRNRHASRRGEWVLDVTVEKSAVKQSGDAWRVVIDSCLSVLHLIDTK 1200
Query: 1204 RSHPDNTHSLCLAYGIDSGWKYFLNSLESATLDIGKTIRPKHLLLVANSLSATGEFIGLN 1207
RS P + + G+ ++ + L ++ + K + +H++L+AN+++ +G +G N
Sbjct: 1201 RSIPYSVKQVQELLGLSCAFEQAVQRLSASVRMVSKGVLKEHIILLANNMTCSGTMLGFN 1259
BLAST of HG10003230 vs. TAIR 10
Match:
AT4G35800.1 (RNA polymerase II large subunit)
HSP 1 Score: 173.3 bits (438), Expect = 1.3e-42
Identity = 162/610 (26.56%), Positives = 282/610 (46.23%), Query Frame = 0
Query: 274 IKDVVLGKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLH 333
I+ ++GKR D R V+ DP I + E+G+P +A L E ++ +N+++L
Sbjct: 347 IRGNLMGKRVDFSARTVITPDPTINIDELGVPWSIALNLTYPETVTPYNIERLK-----E 406
Query: 334 LVEKG----------EIYVRREGRLVRVRNVLE-----LNMGDTIYRPLADGDVVLVNRP 393
LV+ G + +R +G+ + +R + + L +G + R L DGD VL NR
Sbjct: 407 LVDYGPHPPPGKTGAKYIIRDDGQRLDLRYLKKSSDQHLELGYKVERHLQDGDFVLFNRQ 466
Query: 394 PSIHQHSLIALSVKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVS 453
PS+H+ S++ ++++P S+ LN SP+ DFDGD ++ +VPQS E R EV EL+
Sbjct: 467 PSLHKMSIMGHRIRIMPYST-FRLNLSVTSPYNADFDGDEMNMHVPQSFETRAEVLELMM 526
Query: 454 LDRQLINGQSGRNLLSLSHDSLTAAHLIMEDGVSLNLFQMQQLQMLTL-------HQLLS 513
+ + +++ Q+ R ++ + D+L I + F + + M TL ++ +
Sbjct: 527 VPKCIVSPQANRPVMGIVQDTLLGCRKI----TKRDTFIEKDVFMNTLMWWEDFDGKVPA 586
Query: 514 PAILKAPLLRNCAWTGKQLFSTLLPPDFD--------------YSSPSHC-VFIKNGELI 573
PAILK L WTGKQ+F+ ++P + + +P V I+ GEL+
Sbjct: 587 PAILKPRPL----WTGKQVFNLIIPKQINLLRYSAWHADTETGFITPGDTQVRIERGELL 646
Query: 574 SSE-GSYWLRDSGRNLFQALIEHC-EGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSV 633
+ L S +L + E +L Q ++ WL G ++ +
Sbjct: 647 AGTLCKKTLGTSNGSLVHVIWEEVGPDAARKFLGHTQWLVNYWLLQNGFTIGIG------ 706
Query: 634 DSYSHKNMMDDIFCGLQEAEETCNLKQLMVDSHKDILTGDDEDNQHVLSIAVERLSYEKQ 693
D+ + + M+ I E N K + D + G + D + ++ R ++E +
Sbjct: 707 DTIADSSTMEKI------NETISNAKTAVKDLIRQ-FQGKELDPEPGRTM---RDTFENR 766
Query: 694 KSAALNQASVDAFKKVFRDIQNLVYKYSGKDNSLLTMFKAGSKGNLLKLVQHSMCLGLQH 753
+ LN+A DA + + + N+L M AGSKG+ + + Q + C+G Q
Sbjct: 767 VNQVLNKARDDAGSSAQKSL--------AETNNLKAMVTAGSKGSFINISQMTACVGQQ- 826
Query: 754 SLVTLSFSLPHKLTCSAWNSQKMPRYIQKDGLPDRTPSFIPYAVVESSFLSGLNPFECFA 813
++ K ++ + +P + + D P+ VE+S+L GL P E F
Sbjct: 827 -------NVEGKRIPFGFDGRTLPHFTKDDYGPESR------GFVENSYLRGLTPQEFFF 886
Query: 814 HSVTNRDSSFSDNAEV--PGTLTRKLTFLMRDIYTAYDGTVRNAYGNQLVQFSYDIDRPT 840
H++ R+ + G + R+L M DI YDGTVRN+ G+ ++QF Y D
Sbjct: 887 HAMGGREGLIDTAVKTSETGYIQRRLVKAMEDIMVKYDGTVRNSLGD-VIQFLYGEDGMD 903
BLAST of HG10003230 vs. TAIR 10
Match:
AT5G60040.1 (nuclear RNA polymerase C1)
HSP 1 Score: 147.1 bits (370), Expect = 1.0e-34
Identity = 171/609 (28.08%), Positives = 266/609 (43.68%), Query Frame = 0
Query: 280 GKRSDHCFRMVVVGDPNIELSEIGIPCHVAERLQISEHLSSWNMKKLSTSCYLHLVEK-- 339
GKR + R V+ DPN++++E+GIP +A+ L E +S N++KL C + K
Sbjct: 359 GKRVEFTGRTVISPDPNLKITEVGIPILMAQILTFPECVSRHNIEKL-RQCVRNGPNKYP 418
Query: 340 GEIYVR----REGRLV---RVRNVLELNMGDTIYRPLADGDVVLVNRPPSIHQHSLIALS 399
G VR LV R R EL +G + R L +GDVVL NR PS+H+ S++
Sbjct: 419 GARNVRYPDGSSRTLVGDYRKRIADELAIGCIVDRHLQEGDVVLFNRQPSLHRMSIMCHR 478
Query: 400 VKLLPVSSVLSLNPLCCSPFRGDFDGDCLHGYVPQSLEARVEVRELVSLDRQLINGQSGR 459
+++P + L N C+P+ DFDGD ++ +VPQ+ EAR E L+ + L ++G
Sbjct: 479 ARIMPWRT-LRFNESVCNPYNADFDGDEMNMHVPQTEEARTEAITLMGVQNNLCTPKNGE 538
Query: 460 NLLSLSHDSLTAAHLIME-----DGVSLNLFQMQQLQMLTLHQLLSPAILKAPLLRNCAW 519
L++ + D LT++ LI D + +L + L +P ILK L W
Sbjct: 539 ILVASTQDFLTSSFLITRKDTFYDRAAFSLICSYMGDGMDSIDLPTPTILKPIEL----W 598
Query: 520 TGKQLFSTLLPP-------------DFDYSSPSH-----------CVFIKNGELISSE-G 579
TGKQ+FS LL P + ++ H V+ +N ELIS + G
Sbjct: 599 TGKQIFSVLLRPNASIRVYVTLNVKEKNFKKGEHGFDETMCINDGWVYFRNSELISGQLG 658
Query: 580 SYWLRDSGRN-LFQALIEHCEGKTLDYLHDAQGVLCEWLSMRGLSVSLSDLYLSVDSYSH 639
L + ++ L+ L+ DY A V L+ LS ++ + +S
Sbjct: 659 KATLGNGNKDGLYSILLR-------DYNSHAAAVCMNRLA------KLSARWIGIHGFSI 718
Query: 640 KNMMDDIFCGLQEAEETCNLKQLMVDS-HKDILTGDDEDNQHVLSIAVERLSYEKQKSAA 699
+DD+ G + ++E + Q D H+ I +E N+ L Q A
Sbjct: 719 G--IDDVQPGEELSKERKDSIQFGYDQCHRKI----EEFNRGNL-----------QLKAG 778
Query: 700 LNQASVDAFKKVFRDIQNLVYKYSGK--------DNSLLTMFKAGSKGNLLKLVQHSMCL 759
L+ A + + I N + + +GK NS L M + GSKG+ + + Q C+
Sbjct: 779 LDGAK--SLEAEITGILNTIREATGKACMSGLHWRNSPLIMSQCGSKGSPINISQMVACV 838
Query: 760 GLQHSLVTLSFSLPHKLTCSAWNSQKMPRYIQKDGLPDRTPSFIP--------YAVVESS 819
G Q N + P DG DR+ P V +S
Sbjct: 839 GQQ-----------------TVNGHRAP-----DGFIDRSLPHFPRMSKSPAAKGFVANS 898
Query: 820 FLSGLNPFECFAHSVTNRDSSFSDNAEVPGT--LTRKLTFLMRDIYTAYDGTVRNAYGNQ 830
F SGL E F H++ R+ + T ++R+L + D+ YD TVRNA G
Sbjct: 899 FYSGLTATEFFFHTMGGREGLVDTAVKTASTGYMSRRLMKALEDLLVHYDNTVRNASG-C 906
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038905038.1 | 0.0e+00 | 86.36 | DNA-directed RNA polymerase IV subunit 1 isoform X1 [Benincasa hispida] >XP_0389...
XP_038905042.1 | 0.0e+00 | 86.36 | DNA-directed RNA polymerase IV subunit 1 isoform X2 [Benincasa hispida]
TYK19428.1 | 0.0e+00 | 83.91 | DNA-directed RNA polymerase IV subunit 1 [Cucumis melo var. makuwa]
XP_011650447.1 | 0.0e+00 | 84.89 | DNA-directed RNA polymerase IV subunit 1 isoform X1 [Cucumis sativus] >XP_011650...
XP_038905043.1 | 0.0e+00 | 86.20 | DNA-directed RNA polymerase IV subunit 1 isoform X3 [Benincasa hispida] >XP_0389...
Match Name | E-value | Identity (%) | Description
Q9LQ02 | 0.0e+00 | 47.72 | DNA-directed RNA polymerase IV subunit 1 OS=Arabidopsis thaliana OX=3702 GN=NRPD...
Q5D869 | 4.0e-84 | 25.29 | DNA-directed RNA polymerase V subunit 1 OS=Arabidopsis thaliana OX=3702 GN=NRPE1...
P17546 | 2.3e-47 | 27.27 | DNA-directed RNA polymerase II subunit RPB1-A OS=Trypanosoma brucei brucei OX=57...
P17545 | 5.1e-47 | 27.10 | DNA-directed RNA polymerase II subunit RPB1-B OS=Trypanosoma brucei brucei OX=57...
P35084 | 1.5e-46 | 26.70 | DNA-directed RNA polymerase II subunit rpb1 OS=Dictyostelium discoideum OX=44689...
Match Name | E-value | Identity (%) | Description
A0A5D3D780 | 0.0e+00 | 83.91 | DNA-directed RNA polymerase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaf...
A0A0A0L2L4 | 0.0e+00 | 84.89 | DNA-directed RNA polymerase OS=Cucumis sativus OX=3659 GN=Csa_3G039340 PE=4 SV=1
A0A1S4DY39 | 0.0e+00 | 84.06 | DNA-directed RNA polymerase OS=Cucumis melo OX=3656 GN=LOC103490982 PE=4 SV=1
A0A6J1FKU9 | 0.0e+00 | 81.03 | DNA-directed RNA polymerase OS=Cucurbita moschata OX=3662 GN=LOC111444908 PE=4 S...
A0A6J1KSL8 | 0.0e+00 | 80.90 | DNA-directed RNA polymerase OS=Cucurbita maxima OX=3661 GN=LOC111498320 PE=4 SV=...
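
The identity percentages in the tables above match the per-HSP statistics lines in the alignments, where each percentage is the number of matching positions divided by the alignment length (for example 1211/1497 for A0A6J1KSL8). The following is a minimal sketch of how those statistics lines could be parsed and the ratio recomputed. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical and not part of any BLAST tool; the regular expression assumes the exact line layout shown in this report and tolerates a missing "i" in the positives label.

```python
import re

# Pattern for the statistics line of one HSP as printed in this report, e.g.
# "Identity = 1211/1497 (80.90%), Positives = 1278/1497 (85.37%), Query Frame = 0"
# "Posi?tives" also accepts the label when it is written without the first "i".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts from an HSP statistics line and recompute percentages.

    Hypothetical helper for illustration only.
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identical, aligned, reported_pct, positives, pos_aligned, _ = match.groups()
    identical, aligned = int(identical), int(aligned)
    positives, pos_aligned = int(positives), int(pos_aligned)
    return {
        "identical": identical,
        "aligned": aligned,
        # Percent identity = identical positions / alignment length.
        "identity_pct": round(100.0 * identical / aligned, 2),
        "positives": positives,
        # Percent positives = conservative matches / alignment length.
        "positives_pct": round(100.0 * positives / pos_aligned, 2),
        # Value printed in the report, kept for cross-checking.
        "reported_identity_pct": float(reported_pct),
    }


if __name__ == "__main__":
    example = ("Identity = 1211/1497 (80.90%), "
               "Positives = 1278/1497 (85.37%), Query Frame = 0")
    stats = parse_hsp_stats(example)
    print(stats["identity_pct"], stats["positives_pct"])
```

Recomputing the ratio rather than trusting the printed value gives a quick consistency check between an alignment block and the corresponding summary-table row.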