Homology
BLAST of HG10002132 vs. NCBI nr
Match:
XP_038893419.1 (putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Benincasa hispida])
HSP 1 Score: 1107.8 bits (2864), Expect = 0.0e+00
Identity = 582/692 (84.10%), Positives = 602/692 (86.99%), Query Frame = 0
Query: 1 MAKNQSVLIKETVFKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
MAKNQSVLIK+TV+KLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC
Sbjct: 1 MAKNQSVLIKDTVYKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
Query: 61 HSNLSSDNTRKGRYRISLKEHKVYDLQETYKYCSSTCLINSRAFSGRLQDERCSVMNPEK 120
HSNLSSDNTR+GRYRISLKEHKVYDLQETYKYCSSTCLINSRAFSGRLQ+ERCSVMNPEK
Sbjct: 61 HSNLSSDNTRRGRYRISLKEHKVYDLQETYKYCSSTCLINSRAFSGRLQNERCSVMNPEK 120
Query: 121 LKETLRLFENLSLDSKEKMGNNSDLGLEIQEKIDSNIGEVPIEDWMGPSNAIEGYVPHSD 180
LKE LRLFENLSLDSKE +GNN DLGLEIQE I+SN GEVPIE+WMGPSNAIEGYVPH D
Sbjct: 121 LKEILRLFENLSLDSKENVGNNCDLGLEIQENIESNTGEVPIEEWMGPSNAIEGYVPHRD 180
Query: 181 HKIMTLPSKDGKESKDGSKAKSKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSGLKEMS 240
HKIMTLPSKDGKESKDGSKAK KPLGGGKDFFSD SFTSTI+TDEEYSVSKISSGLKEM+
Sbjct: 181 HKIMTLPSKDGKESKDGSKAKIKPLGGGKDFFSDLSFTSTILTDEEYSVSKISSGLKEMA 240
Query: 241 LDTNSKIQTGEFCVKESNEQFTILETPHAPAPTKNNVGRKAKGSKERTKVSATEESTKNL 300
DT+SKIQTGE C KES +QFTILETPHAPAPTKN+VGRKA+GSKERTKVSAT+EST NL
Sbjct: 241 FDTDSKIQTGELCGKESKDQFTILETPHAPAPTKNSVGRKARGSKERTKVSATKESTNNL 300
Query: 301 SDAPSTSNQCNTNCNLMTEEPRGGSNDLSGTGIKSSLKQPGKKNLRRSVTWADEKTDDTS 360
SDAPSTSNQCNTNCNLMTEEPRGGSNDLSGT IKSSLKQPGKKNL RSVTWADEKT DTS
Sbjct: 301 SDAPSTSNQCNTNCNLMTEEPRGGSNDLSGTEIKSSLKQPGKKNLHRSVTWADEKTVDTS 360
Query: 361 IINPPEVREMGKTKECSRITNNLVNSDNDNEDLLRFESAEACAMALSQAAEAITSGKNEV 420
IIN PEVREMGK KECSRIT NLVNSDNDN DLLR ESAEACAMAL+QAAEAI+SG+NEV
Sbjct: 361 IINLPEVREMGKKKECSRITRNLVNSDNDNGDLLRVESAEACAMALTQAAEAISSGQNEV 420
Query: 421 SDAVSEAGIIILPRPSDANEEASTNPVNAFEPHSLSEKSNKLGELRSDLFDPNDSWYDAP 480
SDAVSEAGIIILPRP+D NEEASTNPVNA EPHS SEKSNKLG LRSDLFDPNDSWYDAP
Sbjct: 421 SDAVSEAGIIILPRPNDGNEEASTNPVNASEPHSSSEKSNKLGVLRSDLFDPNDSWYDAP 480
Query: 481 PEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSTDGRS 540
PEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVS DGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSADGRS 540
Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGRCWYLSKQGIVGLKSERGEGVGVE 600
SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGM
Sbjct: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGM------------------------- 600
Query: 601 GGFLDGHLLDTMTFLDALPAFRMKQWQ-------------------------------VL 660
GHLLDTMTFLDALPAFRMKQWQ VL
Sbjct: 601 -----GHLLDTMTFLDALPAFRMKQWQVIVLLFIEALSVCQIPSLASHMSNSRSLYHKVL 660
Query: 661 DRAQIRPDEFELMKDHILPLGRTAQFSVENDA 662
DRAQIR DE+E+MKDHILPLGR AQFS ENDA
Sbjct: 661 DRAQIRSDEYEVMKDHILPLGRIAQFSGENDA 662
BLAST of HG10002132 vs. NCBI nr
Match:
XP_031739958.1 (putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cucumis sativus] >KGN52984.1 hypothetical protein Csa_015280 [Cucumis sativus])
HSP 1 Score: 1059.3 bits (2738), Expect = 1.4e-305
Identity = 557/692 (80.49%), Positives = 590/692 (85.26%), Query Frame = 0
Query: 1 MAKNQSVLIKETVFKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
MAKNQSVLIK+TV+KLQLAL EGI+NENQLFAAGSLMSRSDYEDVVTERSIA+LCGYPLC
Sbjct: 1 MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60
Query: 61 HSNLSSDNTRKGRYRISLKEHKVYDLQETYKYCSSTCLINSRAFSGRLQDERCSVMNPEK 120
HSNL SDNTR+GRYRISLKEHKVYDL+ETYKYCSS CLINSRAFSGRLQDERCSVMNP+K
Sbjct: 61 HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120
Query: 121 LKETLRLFENLSLDSKEKMGNNSDLGLEIQEKIDSNIGEVPIEDWMGPSNAIEGYVPHSD 180
LKE L+LFEN+SLDSKE MGNN D GLEIQEKI+SNIGEVPIE+WMGPSNAIEGYVPH D
Sbjct: 121 LKEILKLFENMSLDSKENMGNNCDSGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVPHRD 180
Query: 181 HKIMTLPSKDGKESKDGSKAKSKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSGLKEMS 240
HK+MTL SKDGKESKDGSKAK KPLGGGKDFFSDFS TSTIITDEEYSVSKISSGLKEM+
Sbjct: 181 HKVMTLHSKDGKESKDGSKAKIKPLGGGKDFFSDFSITSTIITDEEYSVSKISSGLKEMA 240
Query: 241 LDTNSKIQTGEFCVKESNEQFTILETPHAPAPTKNNVGRKAKGSKERTKVSATEESTKNL 300
LDTNSK QTGEFC KESN+QF ILETPHAPAP KN+VGRKA+GSKERTKVSAT+EST NL
Sbjct: 241 LDTNSKNQTGEFCGKESNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSATKESTDNL 300
Query: 301 SDAPSTSNQCNTNCNLMTEEPRGGSNDLSGTGIKSSLKQPGKKNLRRSVTWADEKTDDTS 360
SDAPSTS +TN NLMTEEPRGG NDLSGT +KSSLK+PGKKNL RSVTWADEKTDD S
Sbjct: 301 SDAPSTSKNRSTNFNLMTEEPRGGFNDLSGTELKSSLKKPGKKNLCRSVTWADEKTDDAS 360
Query: 361 IINPPEVREMGKTKECSRITNNLVNSDNDNEDLLRFESAEACAMALSQAAEAITSGKNEV 420
I+N PEV EMGKTKECSR T+NLVN DNDNED+LR ESAEACAMALSQAAEAITSG++EV
Sbjct: 361 IMNLPEVGEMGKTKECSRTTSNLVNFDNDNEDILRVESAEACAMALSQAAEAITSGQSEV 420
Query: 421 SDAVSEAGIIILPRPSDANEEASTNPVNAFEPHSLSEKSNKLGELRSDLFDPNDSWYDAP 480
SDAVSEAGIIILP PSDANEEAST+PVNA EPHS SEKSNKLG LRSDLFDP+DSWYDAP
Sbjct: 421 SDAVSEAGIIILPHPSDANEEASTDPVNASEPHSFSEKSNKLGVLRSDLFDPSDSWYDAP 480
Query: 481 PEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSTDGRS 540
PEGFSLTLSSFATMWMAIFAW+TSSSLAYIYGKDDKFHEEFLYIDGKEYP KIVS DGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGKEYPSKIVSADGRS 540
Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGRCWYLSKQGIVGLKSERGEGVGVE 600
SEIKQTLAGCLTR+IPGLASEL LSTPIS LE+GM
Sbjct: 541 SEIKQTLAGCLTRAIPGLASELNLSTPISRLENGM------------------------- 600
Query: 601 GGFLDGHLLDTMTFLDALPAFRMKQWQ-------------------------------VL 660
HLLDTMTFLDALPAFRMKQWQ VL
Sbjct: 601 -----AHLLDTMTFLDALPAFRMKQWQVIVLLFIEALSVSRIPSLASHMSSSRNLYHKVL 660
Query: 661 DRAQIRPDEFELMKDHILPLGRTAQFSVENDA 662
DRAQIR DE+E+M+DHILPLGRTAQ S ENDA
Sbjct: 661 DRAQIRSDEYEIMRDHILPLGRTAQLSDENDA 662
BLAST of HG10002132 vs. NCBI nr
Match:
XP_008454119.1 (PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cucumis melo] >XP_008454120.1 PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cucumis melo])
HSP 1 Score: 1047.0 bits (2706), Expect = 7.0e-302
Identity = 555/692 (80.20%), Positives = 584/692 (84.39%), Query Frame = 0
Query: 1 MAKNQSVLIKETVFKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
MAKNQS LIK+TV+KLQLAL EGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC
Sbjct: 1 MAKNQSALIKDTVYKLQLALYEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
Query: 61 HSNLSSDNTRKGRYRISLKEHKVYDLQETYKYCSSTCLINSRAFSGRLQDERCSVMNPEK 120
HSNL SDNTR+GRYRISLKEHKVYDL+ETYKYCSS CLINSRAFSGRLQDERCSVMNP K
Sbjct: 61 HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPAK 120
Query: 121 LKETLRLFENLSLDSKEKMGNNSDLGLEIQEKIDSNIGEVPIEDWMGPSNAIEGYVPHSD 180
LKE L+LFEN+SLDSKE MGNN D GLEIQEKI+S+IGEVPIE+WMGPSNAIEGYVPH D
Sbjct: 121 LKEILKLFENMSLDSKENMGNNCDSGLEIQEKIESSIGEVPIEEWMGPSNAIEGYVPHRD 180
Query: 181 HKIMTLPSKDGKESKDGSKAKSKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSGLKEMS 240
HKIMTLPSKDGKESKDGS AK KPLGGGKDFFSDFSFTSTIITDEEYSVSKISS LKEM+
Sbjct: 181 HKIMTLPSKDGKESKDGSTAKIKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSSLKEMA 240
Query: 241 LDTNSKIQTGEFCVKESNEQFTILETPHAPAPTKNNVGRKAKGSKERTKVSATEESTKNL 300
LDTNSKIQTGEFC KESN+QFTILET HA AP KN+VG KA+GSKERTKVSATEEST NL
Sbjct: 241 LDTNSKIQTGEFCGKESNDQFTILETSHARAPPKNSVGHKARGSKERTKVSATEESTNNL 300
Query: 301 SDAPSTSNQCNTNCNLMTEEPRGGSNDLSGTGIKSSLKQPGKKNLRRSVTWADEKTDDTS 360
SDAPSTSN +TN NL+TEEP+GG NDL GT IKSSLKQPGKKNLRRSVTWADEK DDTS
Sbjct: 301 SDAPSTSNNRSTNFNLVTEEPKGGFNDLRGTEIKSSLKQPGKKNLRRSVTWADEKIDDTS 360
Query: 361 IINPPEVREMGKTKECSRITNNLVNSDNDNEDLLRFESAEACAMALSQAAEAITSGKNEV 420
+N PEV E GKTKECSRIT+NLVN DNDNEDL+R ESAEACAMALSQAAEAITSG++EV
Sbjct: 361 -MNLPEVGEKGKTKECSRITSNLVNFDNDNEDLIRVESAEACAMALSQAAEAITSGQSEV 420
Query: 421 SDAVSEAGIIILPRPSDANEEASTNPVNAFEPHSLSEKSNKLGELRSDLFDPNDSWYDAP 480
S+AVSEAGIIILP PSDANEEAST PV A EPHS SEKSNKLG L SDLFDP+DSWYDAP
Sbjct: 421 SEAVSEAGIIILPHPSDANEEASTEPVKASEPHSFSEKSNKLGVLHSDLFDPSDSWYDAP 480
Query: 481 PEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSTDGRS 540
PEGFSLTLSSFATMWMAIFAW+TSSSLAYIYGKDDKFHEEF YIDGKEYP KIVS DGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFQYIDGKEYPSKIVSADGRS 540
Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGRCWYLSKQGIVGLKSERGEGVGVE 600
SEIKQTLAGCLTR+IPGLASELKLSTPIS LE+GM
Sbjct: 541 SEIKQTLAGCLTRAIPGLASELKLSTPISRLEYGM------------------------- 600
Query: 601 GGFLDGHLLDTMTFLDALPAFRMKQWQ-------------------------------VL 660
HLLDTMTFLDALPAFRMKQWQ VL
Sbjct: 601 -----AHLLDTMTFLDALPAFRMKQWQVIVLLFMEALSVCRIPSLASHMSSSRNLYHKVL 660
Query: 661 DRAQIRPDEFELMKDHILPLGRTAQFSVENDA 662
DRAQI+ DE+E+MKDHILPLG TAQ SVENDA
Sbjct: 661 DRAQIQSDEYEIMKDHILPLGLTAQLSVENDA 661
BLAST of HG10002132 vs. NCBI nr
Match:
KAA0044516.1 (putative RNA polymerase II subunit B1 CTD phosphatase RPAP2-like protein [Cucumis melo var. makuwa])
HSP 1 Score: 1043.5 bits (2697), Expect = 7.8e-301
Identity = 552/692 (79.77%), Positives = 585/692 (84.54%), Query Frame = 0
Query: 1 MAKNQSVLIKETVFKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
MAKNQS LIK+TV+KLQLAL EGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC
Sbjct: 1 MAKNQSALIKDTVYKLQLALYEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
Query: 61 HSNLSSDNTRKGRYRISLKEHKVYDLQETYKYCSSTCLINSRAFSGRLQDERCSVMNPEK 120
HSNL SDNTR+GRYRISLKEHKVYDL+ETYKYCSS CLINSRAFSGRLQDERCSVMNP K
Sbjct: 61 HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPAK 120
Query: 121 LKETLRLFENLSLDSKEKMGNNSDLGLEIQEKIDSNIGEVPIEDWMGPSNAIEGYVPHSD 180
LK+ L+LFEN+SLDSKE +GNN D GLEIQEKI+S+IGEVPIE+WMGPSNAIEGYVPH D
Sbjct: 121 LKDILKLFENMSLDSKENVGNNCDSGLEIQEKIESSIGEVPIEEWMGPSNAIEGYVPHRD 180
Query: 181 HKIMTLPSKDGKESKDGSKAKSKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSGLKEMS 240
HKIMTLPSKDGKESKDGS AK KPLGGGKDFFSDFSFTSTIITDEEYSVSKISS LKEM+
Sbjct: 181 HKIMTLPSKDGKESKDGSTAKIKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSSLKEMA 240
Query: 241 LDTNSKIQTGEFCVKESNEQFTILETPHAPAPTKNNVGRKAKGSKERTKVSATEESTKNL 300
LDTNSKIQTGEFC KESN+QFTILET HA AP KN+VG KA+GSKERTKVSATEEST NL
Sbjct: 241 LDTNSKIQTGEFCGKESNDQFTILETSHARAPPKNSVGHKARGSKERTKVSATEESTNNL 300
Query: 301 SDAPSTSNQCNTNCNLMTEEPRGGSNDLSGTGIKSSLKQPGKKNLRRSVTWADEKTDDTS 360
SDAPSTSN +TN NL+TEEP+GG NDL GT IKSSLKQPGKKNLRRSVTWADEK DDTS
Sbjct: 301 SDAPSTSNNRSTNFNLVTEEPKGGFNDLRGTEIKSSLKQPGKKNLRRSVTWADEKIDDTS 360
Query: 361 IINPPEVREMGKTKECSRITNNLVNSDNDNEDLLRFESAEACAMALSQAAEAITSGKNEV 420
+N PEV E GKTKECSRIT+NLVN DNDNEDLLR ESAEACAMALSQAAEAITSG++EV
Sbjct: 361 -MNLPEVGEKGKTKECSRITSNLVNFDNDNEDLLRVESAEACAMALSQAAEAITSGQSEV 420
Query: 421 SDAVSEAGIIILPRPSDANEEASTNPVNAFEPHSLSEKSNKLGELRSDLFDPNDSWYDAP 480
S+AVSEAGIIILP PSDANEEAST+PV A EPHS SEKSNKLG L SDLFDP++SWYDAP
Sbjct: 421 SEAVSEAGIIILPHPSDANEEASTDPVKASEPHSFSEKSNKLGVLHSDLFDPSESWYDAP 480
Query: 481 PEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSTDGRS 540
PEGFSLTLSSFATMWMAIFAW+TSSSLAYIYGKDDKFHEEF YIDGKEYP KIVS DGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFQYIDGKEYPSKIVSADGRS 540
Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGRCWYLSKQGIVGLKSERGEGVGVE 600
SEIKQTLAGCLTR+IPGLASELKLSTP+S LE+GM
Sbjct: 541 SEIKQTLAGCLTRAIPGLASELKLSTPVSRLEYGM------------------------- 600
Query: 601 GGFLDGHLLDTMTFLDALPAFRMKQWQ-------------------------------VL 660
HLLDTMTFLDALPAFRMKQWQ VL
Sbjct: 601 -----AHLLDTMTFLDALPAFRMKQWQVIVLLFIEALSVCRIPSLASHMSSSRNLYHKVL 660
Query: 661 DRAQIRPDEFELMKDHILPLGRTAQFSVENDA 662
DRAQI+ DE+E+MKDHILPLG TAQ SVENDA
Sbjct: 661 DRAQIQSDEYEIMKDHILPLGLTAQLSVENDA 661
BLAST of HG10002132 vs. NCBI nr
Match:
XP_022955995.1 (putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [Cucurbita moschata])
HSP 1 Score: 1038.5 bits (2684), Expect = 2.5e-299
Identity = 549/692 (79.34%), Positives = 580/692 (83.82%), Query Frame = 0
Query: 1 MAKNQSVLIKETVFKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
MAKNQ++LIK+TV+KLQLALL+GI NENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC
Sbjct: 1 MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
Query: 61 HSNLSSDNTRKGRYRISLKEHKVYDLQETYKYCSSTCLINSRAFSGRLQDERCSVMNPEK 120
SNL SDNTRKGRYRISLKEHKVYDL+ETYKYCSSTCLINSRAFSGRLQDERCSVMNP K
Sbjct: 61 QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK 120
Query: 121 LKETLRLFENLSLDSKEKMGNNSDLGLEIQEKIDSNIGEVPIEDWMGPSNAIEGYVPHSD 180
LKE LRLFENLSLDSKE N+ DLGLEIQEKI+S+IGEVPIE+WMGPSNAIEGYVPH +
Sbjct: 121 LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIESSIGEVPIEEWMGPSNAIEGYVPHRN 180
Query: 181 HKIMTLPSKDGKESKDGSKAKSKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSGLKEMS 240
H IMT P KDGKE KDGSKAK K LG GKDFFSDFSF +T+ITDEEYSVSKISSGLKEM+
Sbjct: 181 HNIMTSPRKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMT 240
Query: 241 LDTNSKIQTGEFCVKESNEQFTILETPHAPAPTKNNVGRKAKGSKERTKVSATEESTKNL 300
DT SK QTGEFC K+SNEQFTILETPH PAPTKN+VGRKA+GSKERT VSAT ES NL
Sbjct: 241 FDTKSKAQTGEFCGKQSNEQFTILETPHGPAPTKNSVGRKARGSKERTNVSATAESNNNL 300
Query: 301 SDAPSTSNQCNTNCNLMTEEPRGGSNDLSGTGIKSSLKQPGKKNLRRSVTWADEKTDDTS 360
SDAPSTSN C+TNCN+ TEEP GGSNDL+ T IKSSLKQPGKKNLRRSVTWAD KTD+TS
Sbjct: 301 SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS 360
Query: 361 IINPPEVREMGKTKECSRITNNLVNSDNDNEDLLRFESAEACAMALSQAAEAITSGKNEV 420
IIN PE REMGKTKECSR+T+NLVN+DN NED+LR ESAEACAMALSQAAEAITSGKNEV
Sbjct: 361 IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGKNEV 420
Query: 421 SDAVSEAGIIILPRPSDANEEASTNPVNAFEPHSLSEKSNKLGELRSDLFDPNDSWYDAP 480
SDAVSEAGIIILPRPSDANEEASTN N EPHS SEKSNK G LRSDLFDPNDSWYD+P
Sbjct: 421 SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDPNDSWYDSP 480
Query: 481 PEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSTDGRS 540
PEGFSLTLSSFATMWMAIFAW+TSSSLAYIYGKD+KFHEEF YIDG+EYPRKIVS DGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS 540
Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGRCWYLSKQGIVGLKSERGEGVGVE 600
SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMG C
Sbjct: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMG-C---------------------- 600
Query: 601 GGFLDGHLLDTMTFLDALPAFRMKQWQ-------------------------------VL 660
LLDTMTFLDALPAFR KQWQ VL
Sbjct: 601 -------LLDTMTFLDALPAFRTKQWQVIVLLFIEALSVCRIPSLDSQVSHSRSLFHKVL 660
Query: 661 DRAQIRPDEFELMKDHILPLGRTAQFSVENDA 662
DRAQIR DE+E +KDHILPLGRTAQF ENDA
Sbjct: 661 DRAQIRSDEYETLKDHILPLGRTAQFPGENDA 662
BLAST of HG10002132 vs. ExPASy Swiss-Prot
Match:
F4K1B1 (Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Arabidopsis thaliana OX=3702 GN=At5g26760 PE=2 SV=1)
HSP 1 Score: 383.6 bits (984), Expect = 4.4e-105
Identity = 288/789 (36.50%), Positives = 388/789 (49.18%), Query Frame = 0
Query: 1 MAK-NQSVLIKETVFKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPL 60
MAK N+++ I + V KLQL +LE ++NQLFAA LMSRSDYEDVVTER+IA LCGY L
Sbjct: 1 MAKDNEAIAINDAVHKLQLYMLENTTDQNQLFAARKLMSRSDYEDVVTERAIAKLCGYTL 60
Query: 61 CHSNLSSDNTRKGRYRISLKEHKVYDLQETYKYCSSTCLINSRAFSGRLQDERCSVMNPE 120
C L SD +R+G+YRISLK+HKVYDLQET K+CS+ CLI+S+ FSG LQ+ R +
Sbjct: 61 CQRFLPSDVSRRGKYRISLKDHKVYDLQETSKFCSAGCLIDSKTFSGSLQEARTLEFDSV 120
Query: 121 KLKETLRLFENLSLDSKEKMGNNSDLGLE---IQEKIDSNIGEVPIEDWMGPSNAIEGYV 180
KL E L LF + SL+ K + N DL L I+E E+ +E WMGPSNA+EGYV
Sbjct: 121 KLNEILDLFGD-SLEVKGSLDVNKDLDLSKLMIKENFGVRGEELSLEKWMGPSNAVEGYV 180
Query: 181 PHSDHKIMTLPSKDGKESKDGSKAKSKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSGL 240
P K S D K + ++ K + FTST+I + SVSK+
Sbjct: 181 PFDRSK----SSNDSKATTQSNQEK-----------HEMDFTSTVIMPDVNSVSKLPPQT 240
Query: 241 KEMSLDTNSKIQTGEFCVKESN---EQFTILETPHAPAPTKNNVGRKAKG-SKERTKV-- 300
K+ S S G+ +KE + K G G ++E+T V
Sbjct: 241 KQASTVVESVDGKGKTVLKEQTVVPPTKKVSRFRREKEKEKKTFGVDGMGCAQEKTTVLP 300
Query: 301 -------SATEESTKN-----------------------LSDAPSTSNQCNTNCNLMTE- 360
+ E+ KN +S P S + + +C L +
Sbjct: 301 RKILSFCNEIEKDFKNFGFDEMGLASSAMMSDGYGVEYSVSKQPQCSMEDSLSCKLKGDL 360
Query: 361 EPRGGSNDLSGTG----------------------------------------------- 420
+ G N LSG+
Sbjct: 361 QTLDGKNTLSGSSSGSNTKGSKTKPEKSRKKIISVEYHANSYEDGEEILAAESYERHKAQ 420
Query: 421 ---------IKSSLKQPGKKNLRRSVTWADEKTDDTSIINPPEVREMGKTKECSRITNNL 480
KS LK G K L RSVTWAD+ + EVR S +N++
Sbjct: 421 DVCSSSEIVTKSCLKISGSKKLSRSVTWADQNDGRGDLC---EVRNNDNAAGPSLSSNDI 480
Query: 481 VNSDNDNEDLLRFESAEACAMALSQAAEAITSGKNEVSDAVSEAGIIILPR----PSDAN 540
D L R AEA A ALSQAAEA++SG ++ SDA ++AGII+LP +
Sbjct: 481 ----EDVNSLSRLALAEALATALSQAAEAVSSGNSDASDATAKAGIILLPSTHQLDEEVT 540
Query: 541 EEASTNPVNAFEPHSLSEKSNKLGELRSDLFDPNDSWYDAPPEGFSLTLSSFATMWMAIF 600
EE S + EP +L + NK G SDLFD + SW+D PPEGF+LTLS+FA MW ++F
Sbjct: 541 EEHSEEEMTEEEP-TLLKWPNKPGIPDSDLFDRDQSWFDGPPEGFNLTLSNFAVMWDSLF 600
Query: 601 AWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSTDGRSSEIKQTLAGCLTRSIPGLA 660
W++SSSLAYIYGK++ HEEFL ++GKEYPR+I+ DG SSEIKQT+AGCL R++P +
Sbjct: 601 GWVSSSSLAYIYGKEESAHEEFLLVNGKEYPRRIIMVDGLSSEIKQTIAGCLARALPRVV 660
Query: 661 SELKLSTPISSLEHGMGRCWYLSKQGIVGLKSERGEGVGVEGGFLDGHLLDTMTFLDALP 662
+ L+L IS LE G+ G LL+TM+ A+P
Sbjct: 661 THLRLPIAISELEKGL------------------------------GSLLETMSLTGAVP 720
BLAST of HG10002132 vs. ExPASy Swiss-Prot
Match:
A2Y040 (Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Oryza sativa subsp. indica OX=39946 GN=OsI_18345 PE=3 SV=1)
HSP 1 Score: 306.2 bits (783), Expect = 8.9e-82
Identity = 250/763 (32.77%), Positives = 361/763 (47.31%), Query Frame = 0
Query: 2 AKNQSVLIKETVFKLQLALLEG--IQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPL 61
A+ + + V ++Q+AL +G E L AA SL+S DY DVVTERSIA+ CGYP
Sbjct: 11 ARMKPTTVASAVHRVQMALYDGAAASREPLLRAAASLLSGPDYADVVTERSIADACGYPA 70
Query: 62 CHSNLSSDNTR---KGRYRISLKEHKVYDLQETYKYCSSTCLINSRAFSGRLQDERCSVM 121
C + L S++ R R+RISL+EH+VYDL+E K+CS CL+ S AF L +R +
Sbjct: 71 CPNPLPSEDARGKAAPRFRISLREHRVYDLEEARKFCSERCLVASAAFGASLPPDRPFGV 130
Query: 122 NPEKLKETLRLFE---------------NLSLDSKEKMGNNSDLGLEIQEKIDSNIGEVP 181
+P++L + LFE S D KE +EI EK + GEV
Sbjct: 131 SPDRLDALVALFEGGGGGGGDGGLALGFGASGDGKEVEEGRK---VEIMEKEAAGTGEVT 190
Query: 182 IEDWMGPSNAIEGYVPHSDHKIMTLPSKDGKESKDGSKAKSKPLGGGKDFFSDFSFTSTI 241
+++W+GPS+AIEGYVP D +++ P K+ K++ S +S + D + S S +
Sbjct: 191 LQEWIGPSDAIEGYVPRRD-RVVGGPKKEAKQNDACSAEQSSNI--NVDSRNASSGESGM 250
Query: 242 ITDEEYSVSK---ISSGLKEMSLDTNSKIQTGEFCVKES---NEQFTILETPHAPAPTKN 301
+ E K + LK D ++ + + C+ +S + +LE K
Sbjct: 251 VLTENTKAKKKEATKTPLKMFKQDEDNDMLSS--CISDSIVKQLEDVVLEEKKDKKKNKA 310
Query: 302 NVGRKAKGSKERTKVSATEE-------STKNLSDAPS------TSNQCNTNCNLM-TEEP 361
G G + K + ST + D S Q N + +++ E+P
Sbjct: 311 AKGTSRVGKSKPAKRPVGRDGHEVDFTSTIIMGDHGSEMMDHGALGQYNFSSSILANEQP 370
Query: 362 RGG------------------------------SNDLSGTGIKSSLKQPGKKNLRRSVTW 421
S+D ++SSLK G KN RSV W
Sbjct: 371 SSSQYAAIDSVQAYTEELDELFSNAVNIAKDETSDDSGRCTLRSSLKAVGSKNAGRSVKW 430
Query: 422 ADEKTDDTSIINPPEVREMGKTKECSR-ITNNLVNSDNDNEDLLRFESAEACAMALSQAA 481
ADE G E SR ++ S + +R ESAEACA AL +AA
Sbjct: 431 ADEN---------------GSVLETSRAFVSHSSKSQESMDSSVRRESAEACAAALIEAA 490
Query: 482 EAITSGKNEVSDAVSEAGIIILP---------RPSDANEEASTNPVNAFEPHSLSEKSNK 541
EAI+SG +EV DAVS+AGIIILP D +++A N + + + + K
Sbjct: 491 EAISSGTSEVEDAVSKAGIIILPDMVNQQQYNNDYDNDKDAGENEIFEID-RGVVKWPKK 550
Query: 542 LGELRSDLFDPNDSWYDAPPEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEF 601
L +D+FD +DSW+D PPEGFSLTLSSFATMW A+F W++ SSLAY+YG D+ E+
Sbjct: 551 TVLLDTDMFDVDDSWHDTPPEGFSLTLSSFATMWAALFGWVSRSSLAYVYGLDESSMEDL 610
Query: 602 LYIDGKEYPRKIVSTDGRSSEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGRCWYL 655
L G+E P+K V DG SSEI++ L C+ ++P L S L++ P+S LE +
Sbjct: 611 LIAGGRECPQKRVLNDGHSSEIRRALDTCVCNALPVLVSNLRMQIPVSKLEITL------ 670
BLAST of HG10002132 vs. ExPASy Swiss-Prot
Match:
Q6AVZ9 (Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Oryza sativa subsp. japonica OX=39947 GN=Os05g0134300 PE=3 SV=1)
HSP 1 Score: 304.7 bits (779), Expect = 2.6e-81
Identity = 249/763 (32.63%), Positives = 360/763 (47.18%), Query Frame = 0
Query: 2 AKNQSVLIKETVFKLQLALLEG--IQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPL 61
A+ + + V ++Q+AL +G E L AA SL+S DY DVVTERSIA+ CGYP
Sbjct: 11 ARMKPTTVASAVHRVQMALYDGAAASREPLLRAAASLLSGPDYADVVTERSIADACGYPA 70
Query: 62 CHSNLSSDNTR---KGRYRISLKEHKVYDLQETYKYCSSTCLINSRAFSGRLQDERCSVM 121
C + L S++ R R+RISL+EH+VYDL+E K+CS CL+ S AF L +R +
Sbjct: 71 CPNPLPSEDARGKAAPRFRISLREHRVYDLEEARKFCSERCLVASAAFGASLPPDRPFGV 130
Query: 122 NPEKLKETLRLFE---------------NLSLDSKEKMGNNSDLGLEIQEKIDSNIGEVP 181
+P++L + LFE S D KE +EI EK + GEV
Sbjct: 131 SPDRLDALVALFEGGGGGGDDGGLALGFGASGDGKEVEEGRK---VEIMEKEAAGTGEVT 190
Query: 182 IEDWMGPSNAIEGYVPHSDHKIMTLPSKDGKESKDGSKAKSKPLGGGKDFFSDFSFTSTI 241
+++W+GPS+AIEGYVP D +++ P K+ K++ S +S + D + S S +
Sbjct: 191 LQEWIGPSDAIEGYVPRRD-RVVGGPKKEAKQNDACSAEQSSNI--NVDSRNASSGESGM 250
Query: 242 ITDEEYSVSK---ISSGLKEMSLDTNSKIQTGEFCVKES---NEQFTILETPHAPAPTKN 301
+ E K + LK D ++ + + C+ +S + +LE K
Sbjct: 251 VLTENTKAKKKEATKTPLKMFKQDEDNDMLSS--CISDSIVKQLEDVVLEEKKDKKKNKA 310
Query: 302 NVGRKAKGSKERTKVSATEE-------STKNLSDAPS------TSNQCNTNCNLM-TEEP 361
G G + K + ST + D S Q N + +++ E+P
Sbjct: 311 AKGTSRVGKSKPAKRPVGRDGHEVDFTSTIIMGDRGSEMMDHGALGQYNFSSSILANEQP 370
Query: 362 RGG------------------------------SNDLSGTGIKSSLKQPGKKNLRRSVTW 421
S+D ++SSLK G KN SV W
Sbjct: 371 SSSQYAAIDSVQAYTEELDELFSNAVNIAKDETSDDSGRCTLRSSLKAVGSKNAGHSVKW 430
Query: 422 ADEKTDDTSIINPPEVREMGKTKECSR-ITNNLVNSDNDNEDLLRFESAEACAMALSQAA 481
ADE G E SR ++ S + +R ESAEACA AL +AA
Sbjct: 431 ADEN---------------GSVLETSRAFVSHSSKSQESMDSSVRRESAEACAAALIEAA 490
Query: 482 EAITSGKNEVSDAVSEAGIIILP---------RPSDANEEASTNPVNAFEPHSLSEKSNK 541
EAI+SG +EV DAVS+AGIIILP D +++A N + + + + K
Sbjct: 491 EAISSGTSEVEDAVSKAGIIILPDMVNQQQYNNDYDNDKDAGENEIFEID-RGVVKWPKK 550
Query: 542 LGELRSDLFDPNDSWYDAPPEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEF 601
L +D+FD +DSW+D PPEGFSLTLSSFATMW A+F W++ SSLAY+YG D+ E+
Sbjct: 551 TVLLDTDMFDVDDSWHDTPPEGFSLTLSSFATMWAALFGWVSRSSLAYVYGLDESSMEDL 610
Query: 602 LYIDGKEYPRKIVSTDGRSSEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGRCWYL 655
L G+E P+K V DG SSEI++ L C+ ++P L S L++ P+S LE +
Sbjct: 611 LIAGGRECPQKRVLNDGHSSEIRRALDTCVCNALPVLVSNLRMQIPVSKLEITL------ 670
BLAST of HG10002132 vs. ExPASy Swiss-Prot
Match:
Q8IXW5 (Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 OS=Homo sapiens OX=9606 GN=RPAP2 PE=1 SV=1)
HSP 1 Score: 65.5 bits (158), Expect = 2.6e-09
Identity = 106/450 (23.56%), Positives = 185/450 (41.11%), Query Frame = 0
Query: 20 LLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLCHSNLSSDNTRKGRYRISLK 79
LLE E L G ++ + Y DVV ERSI LCGYPLC L K +Y+IS K
Sbjct: 65 LLEENITEEFLMECGRFITPAHYSDVVDERSIVKLCGYPLCQKKLGI--VPKQKYKISTK 124
Query: 80 EHKVYDLQETYKYCSSTCLINSRAFSGRL--------QDERCSVMNPEKLKETLRLFENL 139
+KVYD+ E +CS+ C S+ F ++ ++ER K +++ E +
Sbjct: 125 TNKVYDITERKSFCSNFCYQASKFFEAQIPKTPVWVREEERHPDFQLLKEEQSGHSGEEV 184
Query: 140 SLDSKEKMGNNSDLGLEIQEKIDSNIGEVPIEDWMG-----PSNAIEGYVPHSDHKIMTL 199
L SK ++ D +++ +S+ + S+ + G P+S + L
Sbjct: 185 QLCSKAIKTSDIDNPSHFEKQYESSSSSTHSDSSSDNEQDFVSSILPGNRPNSTNIRPQL 244
Query: 200 PSKDGKESKDGSKAKSKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSGLKEMSLDTNSK 259
K + K G KA SK D+E +V ++ L + LD+ K
Sbjct: 245 HQKSIMKKKAGHKANSKH------------------KDKEQTVVDVTEQLGDCKLDSQEK 304
Query: 260 IQTGEFCVKESNEQFTILET-PHAPAPTKNNVGRKAKGSKERTKVSATEESTKNLSDAPS 319
T E +++ N Q + T P ++N+ ++ E T V +++S ++ +
Sbjct: 305 DATCELPLQKVNTQSSSNSTLPERLKASENSESEYSR--SEITLVGISKKSAEHFKRKFA 364
Query: 320 TSNQCNTNCNLMTEEPRGGSNDLSGTGIKSSLK---QPGKKNLRRSV--TWADEKTDDTS 379
SNQ + + + SS++ + GK+NL + + T + KT++T
Sbjct: 365 KSNQVSRS-------------------VSSSVQVCPEVGKRNLLKVLKETLIEWKTEETL 424
Query: 380 IINPPEVREMGKTKECSRITNNLVNSDNDNEDLLRFESAEACAMALSQAAEAITSGKNEV 439
+ C + +LV + D +D++ + A SQ N +
Sbjct: 425 RF----LYGQNYASVCLKPEASLVKEELDEDDIISDPDSHFPAWRESQ---------NSL 460
Query: 440 SDAV--SEAGIIILPRPSDANEEASTNPVN 449
+++ +G I P PS N + T +N
Sbjct: 485 DESLPFRGSGTAIKPLPSYENLKKETEKLN 460
BLAST of HG10002132 vs. ExPASy Swiss-Prot
Match:
Q5RA37 (Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 OS=Pongo abelii OX=9601 GN=RPAP2 PE=2 SV=1)
HSP 1 Score: 65.1 bits (157), Expect = 3.4e-09
Identity = 106/450 (23.56%), Positives = 184/450 (40.89%), Query Frame = 0
Query: 20 LLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLCHSNLSSDNTRKGRYRISLK 79
LLE E L G ++ + Y DVV ERSI LCGYPLC L K +Y+IS K
Sbjct: 65 LLEENITEEFLMECGKFITPAHYSDVVDERSIVKLCGYPLCQKKLGI--VPKQKYKISTK 124
Query: 80 EHKVYDLQETYKYCSSTCLINSRAFSGRL--------QDERCSVMNPEKLKETLRLFENL 139
+KVYD+ E +CS+ C S+ F ++ ++ER K +++ E +
Sbjct: 125 TNKVYDITERKSFCSNFCYQASKFFEAQIPKTPVWVREEERHPDFQLLKEQQSGHSGEEV 184
Query: 140 SLDSKEKMGNNSDLGLEIQEKIDSNIGEVPIEDWMG-----PSNAIEGYVPHSDHKIMTL 199
L SK ++ D +++ +S+ + S+ + G P+S L
Sbjct: 185 QLCSKAIKTSDIDNPSHFEKQYESSSSSTHSDSSSDNEQDFVSSILPGNRPNSTSIRPQL 244
Query: 200 PSKDGKESKDGSKAKSKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSGLKEMSLDTNSK 259
K + K G KA SK D+E +V ++ L + LD+ K
Sbjct: 245 HQKSIMKKKAGHKANSKH------------------KDKEQTVIDVTEQLGDCKLDSQEK 304
Query: 260 IQTGEFCVKESNEQFTILET-PHAPAPTKNNVGRKAKGSKERTKVSATEESTKNLSDAPS 319
T E +++ N Q + T P ++N+ ++ E T V +++S ++ +
Sbjct: 305 DATCELPLQKVNTQSSSNSTLPERLKASENSESEYSR--SEITLVGISKKSAEHFKRKFA 364
Query: 320 TSNQCNTNCNLMTEEPRGGSNDLSGTGIKSSLK---QPGKKNLRRSV--TWADEKTDDTS 379
SNQ + + + SS++ + GK+NL + + T + KT++T
Sbjct: 365 KSNQVSRS-------------------VSSSVQVCPEVGKRNLLKILKETLIEWKTEETL 424
Query: 380 IINPPEVREMGKTKECSRITNNLVNSDNDNEDLLRFESAEACAMALSQAAEAITSGKNEV 439
+ C + +LV + D +D++ + A SQ N +
Sbjct: 425 RF----LYGQNYASVCLKPEASLVKEELDEDDIISDPDSHFPAWRESQ---------NSL 460
Query: 440 SDAV--SEAGIIILPRPSDANEEASTNPVN 449
+++ +G I P PS N + T +N
Sbjct: 485 DESLPFRGSGTAIKPLPSYENLKKETEKLN 460
BLAST of HG10002132 vs. ExPASy TrEMBL
Match:
A0A0A0KVU3 (RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucumis sativus OX=3659 GN=Csa_4G009360 PE=3 SV=1)
HSP 1 Score: 1059.3 bits (2738), Expect = 6.6e-306
Identity = 557/692 (80.49%), Positives = 590/692 (85.26%), Query Frame = 0
Query: 1 MAKNQSVLIKETVFKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
MAKNQSVLIK+TV+KLQLAL EGI+NENQLFAAGSLMSRSDYEDVVTERSIA+LCGYPLC
Sbjct: 1 MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60
Query: 61 HSNLSSDNTRKGRYRISLKEHKVYDLQETYKYCSSTCLINSRAFSGRLQDERCSVMNPEK 120
HSNL SDNTR+GRYRISLKEHKVYDL+ETYKYCSS CLINSRAFSGRLQDERCSVMNP+K
Sbjct: 61 HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120
Query: 121 LKETLRLFENLSLDSKEKMGNNSDLGLEIQEKIDSNIGEVPIEDWMGPSNAIEGYVPHSD 180
LKE L+LFEN+SLDSKE MGNN D GLEIQEKI+SNIGEVPIE+WMGPSNAIEGYVPH D
Sbjct: 121 LKEILKLFENMSLDSKENMGNNCDSGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVPHRD 180
Query: 181 HKIMTLPSKDGKESKDGSKAKSKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSGLKEMS 240
HK+MTL SKDGKESKDGSKAK KPLGGGKDFFSDFS TSTIITDEEYSVSKISSGLKEM+
Sbjct: 181 HKVMTLHSKDGKESKDGSKAKIKPLGGGKDFFSDFSITSTIITDEEYSVSKISSGLKEMA 240
Query: 241 LDTNSKIQTGEFCVKESNEQFTILETPHAPAPTKNNVGRKAKGSKERTKVSATEESTKNL 300
LDTNSK QTGEFC KESN+QF ILETPHAPAP KN+VGRKA+GSKERTKVSAT+EST NL
Sbjct: 241 LDTNSKNQTGEFCGKESNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSATKESTDNL 300
Query: 301 SDAPSTSNQCNTNCNLMTEEPRGGSNDLSGTGIKSSLKQPGKKNLRRSVTWADEKTDDTS 360
SDAPSTS +TN NLMTEEPRGG NDLSGT +KSSLK+PGKKNL RSVTWADEKTDD S
Sbjct: 301 SDAPSTSKNRSTNFNLMTEEPRGGFNDLSGTELKSSLKKPGKKNLCRSVTWADEKTDDAS 360
Query: 361 IINPPEVREMGKTKECSRITNNLVNSDNDNEDLLRFESAEACAMALSQAAEAITSGKNEV 420
I+N PEV EMGKTKECSR T+NLVN DNDNED+LR ESAEACAMALSQAAEAITSG++EV
Sbjct: 361 IMNLPEVGEMGKTKECSRTTSNLVNFDNDNEDILRVESAEACAMALSQAAEAITSGQSEV 420
Query: 421 SDAVSEAGIIILPRPSDANEEASTNPVNAFEPHSLSEKSNKLGELRSDLFDPNDSWYDAP 480
SDAVSEAGIIILP PSDANEEAST+PVNA EPHS SEKSNKLG LRSDLFDP+DSWYDAP
Sbjct: 421 SDAVSEAGIIILPHPSDANEEASTDPVNASEPHSFSEKSNKLGVLRSDLFDPSDSWYDAP 480
Query: 481 PEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSTDGRS 540
PEGFSLTLSSFATMWMAIFAW+TSSSLAYIYGKDDKFHEEFLYIDGKEYP KIVS DGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGKEYPSKIVSADGRS 540
Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGRCWYLSKQGIVGLKSERGEGVGVE 600
SEIKQTLAGCLTR+IPGLASEL LSTPIS LE+GM
Sbjct: 541 SEIKQTLAGCLTRAIPGLASELNLSTPISRLENGM------------------------- 600
Query: 601 GGFLDGHLLDTMTFLDALPAFRMKQWQ-------------------------------VL 660
HLLDTMTFLDALPAFRMKQWQ VL
Sbjct: 601 -----AHLLDTMTFLDALPAFRMKQWQVIVLLFIEALSVSRIPSLASHMSSSRNLYHKVL 660
Query: 661 DRAQIRPDEFELMKDHILPLGRTAQFSVENDA 662
DRAQIR DE+E+M+DHILPLGRTAQ S ENDA
Sbjct: 661 DRAQIRSDEYEIMRDHILPLGRTAQLSDENDA 662
BLAST of HG10002132 vs. ExPASy TrEMBL
Match:
A0A1S3BXZ9 (RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucumis melo OX=3656 GN=LOC103494620 PE=3 SV=1)
HSP 1 Score: 1047.0 bits (2706), Expect = 3.4e-302
Identity = 555/692 (80.20%), Positives = 584/692 (84.39%), Query Frame = 0
Query: 1 MAKNQSVLIKETVFKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
MAKNQS LIK+TV+KLQLAL EGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC
Sbjct: 1 MAKNQSALIKDTVYKLQLALYEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
Query: 61 HSNLSSDNTRKGRYRISLKEHKVYDLQETYKYCSSTCLINSRAFSGRLQDERCSVMNPEK 120
HSNL SDNTR+GRYRISLKEHKVYDL+ETYKYCSS CLINSRAFSGRLQDERCSVMNP K
Sbjct: 61 HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPAK 120
Query: 121 LKETLRLFENLSLDSKEKMGNNSDLGLEIQEKIDSNIGEVPIEDWMGPSNAIEGYVPHSD 180
LKE L+LFEN+SLDSKE MGNN D GLEIQEKI+S+IGEVPIE+WMGPSNAIEGYVPH D
Sbjct: 121 LKEILKLFENMSLDSKENMGNNCDSGLEIQEKIESSIGEVPIEEWMGPSNAIEGYVPHRD 180
Query: 181 HKIMTLPSKDGKESKDGSKAKSKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSGLKEMS 240
HKIMTLPSKDGKESKDGS AK KPLGGGKDFFSDFSFTSTIITDEEYSVSKISS LKEM+
Sbjct: 181 HKIMTLPSKDGKESKDGSTAKIKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSSLKEMA 240
Query: 241 LDTNSKIQTGEFCVKESNEQFTILETPHAPAPTKNNVGRKAKGSKERTKVSATEESTKNL 300
LDTNSKIQTGEFC KESN+QFTILET HA AP KN+VG KA+GSKERTKVSATEEST NL
Sbjct: 241 LDTNSKIQTGEFCGKESNDQFTILETSHARAPPKNSVGHKARGSKERTKVSATEESTNNL 300
Query: 301 SDAPSTSNQCNTNCNLMTEEPRGGSNDLSGTGIKSSLKQPGKKNLRRSVTWADEKTDDTS 360
SDAPSTSN +TN NL+TEEP+GG NDL GT IKSSLKQPGKKNLRRSVTWADEK DDTS
Sbjct: 301 SDAPSTSNNRSTNFNLVTEEPKGGFNDLRGTEIKSSLKQPGKKNLRRSVTWADEKIDDTS 360
Query: 361 IINPPEVREMGKTKECSRITNNLVNSDNDNEDLLRFESAEACAMALSQAAEAITSGKNEV 420
+N PEV E GKTKECSRIT+NLVN DNDNEDL+R ESAEACAMALSQAAEAITSG++EV
Sbjct: 361 -MNLPEVGEKGKTKECSRITSNLVNFDNDNEDLIRVESAEACAMALSQAAEAITSGQSEV 420
Query: 421 SDAVSEAGIIILPRPSDANEEASTNPVNAFEPHSLSEKSNKLGELRSDLFDPNDSWYDAP 480
S+AVSEAGIIILP PSDANEEAST PV A EPHS SEKSNKLG L SDLFDP+DSWYDAP
Sbjct: 421 SEAVSEAGIIILPHPSDANEEASTEPVKASEPHSFSEKSNKLGVLHSDLFDPSDSWYDAP 480
Query: 481 PEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSTDGRS 540
PEGFSLTLSSFATMWMAIFAW+TSSSLAYIYGKDDKFHEEF YIDGKEYP KIVS DGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFQYIDGKEYPSKIVSADGRS 540
Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGRCWYLSKQGIVGLKSERGEGVGVE 600
SEIKQTLAGCLTR+IPGLASELKLSTPIS LE+GM
Sbjct: 541 SEIKQTLAGCLTRAIPGLASELKLSTPISRLEYGM------------------------- 600
Query: 601 GGFLDGHLLDTMTFLDALPAFRMKQWQ-------------------------------VL 660
HLLDTMTFLDALPAFRMKQWQ VL
Sbjct: 601 -----AHLLDTMTFLDALPAFRMKQWQVIVLLFMEALSVCRIPSLASHMSSSRNLYHKVL 660
Query: 661 DRAQIRPDEFELMKDHILPLGRTAQFSVENDA 662
DRAQI+ DE+E+MKDHILPLG TAQ SVENDA
Sbjct: 661 DRAQIQSDEYEIMKDHILPLGLTAQLSVENDA 661
BLAST of HG10002132 vs. ExPASy TrEMBL
Match:
A0A5A7TQX7 (RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold46G002560 PE=3 SV=1)
HSP 1 Score: 1043.5 bits (2697), Expect = 3.8e-301
Identity = 552/692 (79.77%), Positives = 585/692 (84.54%), Query Frame = 0
Query: 1 MAKNQSVLIKETVFKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
MAKNQS LIK+TV+KLQLAL EGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC
Sbjct: 1 MAKNQSALIKDTVYKLQLALYEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
Query: 61 HSNLSSDNTRKGRYRISLKEHKVYDLQETYKYCSSTCLINSRAFSGRLQDERCSVMNPEK 120
HSNL SDNTR+GRYRISLKEHKVYDL+ETYKYCSS CLINSRAFSGRLQDERCSVMNP K
Sbjct: 61 HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPAK 120
Query: 121 LKETLRLFENLSLDSKEKMGNNSDLGLEIQEKIDSNIGEVPIEDWMGPSNAIEGYVPHSD 180
LK+ L+LFEN+SLDSKE +GNN D GLEIQEKI+S+IGEVPIE+WMGPSNAIEGYVPH D
Sbjct: 121 LKDILKLFENMSLDSKENVGNNCDSGLEIQEKIESSIGEVPIEEWMGPSNAIEGYVPHRD 180
Query: 181 HKIMTLPSKDGKESKDGSKAKSKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSGLKEMS 240
HKIMTLPSKDGKESKDGS AK KPLGGGKDFFSDFSFTSTIITDEEYSVSKISS LKEM+
Sbjct: 181 HKIMTLPSKDGKESKDGSTAKIKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSSLKEMA 240
Query: 241 LDTNSKIQTGEFCVKESNEQFTILETPHAPAPTKNNVGRKAKGSKERTKVSATEESTKNL 300
LDTNSKIQTGEFC KESN+QFTILET HA AP KN+VG KA+GSKERTKVSATEEST NL
Sbjct: 241 LDTNSKIQTGEFCGKESNDQFTILETSHARAPPKNSVGHKARGSKERTKVSATEESTNNL 300
Query: 301 SDAPSTSNQCNTNCNLMTEEPRGGSNDLSGTGIKSSLKQPGKKNLRRSVTWADEKTDDTS 360
SDAPSTSN +TN NL+TEEP+GG NDL GT IKSSLKQPGKKNLRRSVTWADEK DDTS
Sbjct: 301 SDAPSTSNNRSTNFNLVTEEPKGGFNDLRGTEIKSSLKQPGKKNLRRSVTWADEKIDDTS 360
Query: 361 IINPPEVREMGKTKECSRITNNLVNSDNDNEDLLRFESAEACAMALSQAAEAITSGKNEV 420
+N PEV E GKTKECSRIT+NLVN DNDNEDLLR ESAEACAMALSQAAEAITSG++EV
Sbjct: 361 -MNLPEVGEKGKTKECSRITSNLVNFDNDNEDLLRVESAEACAMALSQAAEAITSGQSEV 420
Query: 421 SDAVSEAGIIILPRPSDANEEASTNPVNAFEPHSLSEKSNKLGELRSDLFDPNDSWYDAP 480
S+AVSEAGIIILP PSDANEEAST+PV A EPHS SEKSNKLG L SDLFDP++SWYDAP
Sbjct: 421 SEAVSEAGIIILPHPSDANEEASTDPVKASEPHSFSEKSNKLGVLHSDLFDPSESWYDAP 480
Query: 481 PEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSTDGRS 540
PEGFSLTLSSFATMWMAIFAW+TSSSLAYIYGKDDKFHEEF YIDGKEYP KIVS DGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFQYIDGKEYPSKIVSADGRS 540
Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGRCWYLSKQGIVGLKSERGEGVGVE 600
SEIKQTLAGCLTR+IPGLASELKLSTP+S LE+GM
Sbjct: 541 SEIKQTLAGCLTRAIPGLASELKLSTPVSRLEYGM------------------------- 600
Query: 601 GGFLDGHLLDTMTFLDALPAFRMKQWQ-------------------------------VL 660
HLLDTMTFLDALPAFRMKQWQ VL
Sbjct: 601 -----AHLLDTMTFLDALPAFRMKQWQVIVLLFIEALSVCRIPSLASHMSSSRNLYHKVL 660
Query: 661 DRAQIRPDEFELMKDHILPLGRTAQFSVENDA 662
DRAQI+ DE+E+MKDHILPLG TAQ SVENDA
Sbjct: 661 DRAQIQSDEYEIMKDHILPLGLTAQLSVENDA 661
BLAST of HG10002132 vs. ExPASy TrEMBL
Match:
A0A6J1GWL9 (RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucurbita moschata OX=3662 GN=LOC111457827 PE=3 SV=1)
HSP 1 Score: 1038.5 bits (2684), Expect = 1.2e-299
Identity = 549/692 (79.34%), Positives = 580/692 (83.82%), Query Frame = 0
Query: 1 MAKNQSVLIKETVFKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
MAKNQ++LIK+TV+KLQLALL+GI NENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC
Sbjct: 1 MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
Query: 61 HSNLSSDNTRKGRYRISLKEHKVYDLQETYKYCSSTCLINSRAFSGRLQDERCSVMNPEK 120
SNL SDNTRKGRYRISLKEHKVYDL+ETYKYCSSTCLINSRAFSGRLQDERCSVMNP K
Sbjct: 61 QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK 120
Query: 121 LKETLRLFENLSLDSKEKMGNNSDLGLEIQEKIDSNIGEVPIEDWMGPSNAIEGYVPHSD 180
LKE LRLFENLSLDSKE N+ DLGLEIQEKI+S+IGEVPIE+WMGPSNAIEGYVPH +
Sbjct: 121 LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIESSIGEVPIEEWMGPSNAIEGYVPHRN 180
Query: 181 HKIMTLPSKDGKESKDGSKAKSKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSGLKEMS 240
H IMT P KDGKE KDGSKAK K LG GKDFFSDFSF +T+ITDEEYSVSKISSGLKEM+
Sbjct: 181 HNIMTSPRKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMT 240
Query: 241 LDTNSKIQTGEFCVKESNEQFTILETPHAPAPTKNNVGRKAKGSKERTKVSATEESTKNL 300
DT SK QTGEFC K+SNEQFTILETPH PAPTKN+VGRKA+GSKERT VSAT ES NL
Sbjct: 241 FDTKSKAQTGEFCGKQSNEQFTILETPHGPAPTKNSVGRKARGSKERTNVSATAESNNNL 300
Query: 301 SDAPSTSNQCNTNCNLMTEEPRGGSNDLSGTGIKSSLKQPGKKNLRRSVTWADEKTDDTS 360
SDAPSTSN C+TNCN+ TEEP GGSNDL+ T IKSSLKQPGKKNLRRSVTWAD KTD+TS
Sbjct: 301 SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS 360
Query: 361 IINPPEVREMGKTKECSRITNNLVNSDNDNEDLLRFESAEACAMALSQAAEAITSGKNEV 420
IIN PE REMGKTKECSR+T+NLVN+DN NED+LR ESAEACAMALSQAAEAITSGKNEV
Sbjct: 361 IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGKNEV 420
Query: 421 SDAVSEAGIIILPRPSDANEEASTNPVNAFEPHSLSEKSNKLGELRSDLFDPNDSWYDAP 480
SDAVSEAGIIILPRPSDANEEASTN N EPHS SEKSNK G LRSDLFDPNDSWYD+P
Sbjct: 421 SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDPNDSWYDSP 480
Query: 481 PEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSTDGRS 540
PEGFSLTLSSFATMWMAIFAW+TSSSLAYIYGKD+KFHEEF YIDG+EYPRKIVS DGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS 540
Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGRCWYLSKQGIVGLKSERGEGVGVE 600
SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMG C
Sbjct: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMG-C---------------------- 600
Query: 601 GGFLDGHLLDTMTFLDALPAFRMKQWQ-------------------------------VL 660
LLDTMTFLDALPAFR KQWQ VL
Sbjct: 601 -------LLDTMTFLDALPAFRTKQWQVIVLLFIEALSVCRIPSLDSQVSHSRSLFHKVL 660
Query: 661 DRAQIRPDEFELMKDHILPLGRTAQFSVENDA 662
DRAQIR DE+E +KDHILPLGRTAQF ENDA
Sbjct: 661 DRAQIRSDEYETLKDHILPLGRTAQFPGENDA 662
BLAST of HG10002132 vs. ExPASy TrEMBL
Match:
A0A6J1IY57 (RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucurbita maxima OX=3661 GN=LOC111479539 PE=3 SV=1)
HSP 1 Score: 1032.3 bits (2668), Expect = 8.7e-298
Identity = 545/692 (78.76%), Positives = 581/692 (83.96%), Query Frame = 0
Query: 1 MAKNQSVLIKETVFKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
MAKNQ++LIK+TV+KLQLALL+GI NENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC
Sbjct: 1 MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60
Query: 61 HSNLSSDNTRKGRYRISLKEHKVYDLQETYKYCSSTCLINSRAFSGRLQDERCSVMNPEK 120
SNL SDNTRKGRYRISLKEHKVYDL+ETYKYCSSTCLINSRAFSGRLQDERCSVMNP K
Sbjct: 61 QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK 120
Query: 121 LKETLRLFENLSLDSKEKMGNNSDLGLEIQEKIDSNIGEVPIEDWMGPSNAIEGYVPHSD 180
LKE LRLFENLSLDSKE N+ DLGLEIQEKI+S+IGEVPIE+WMGPSNAIEGYVPH +
Sbjct: 121 LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIESSIGEVPIEEWMGPSNAIEGYVPHRN 180
Query: 181 HKIMTLPSKDGKESKDGSKAKSKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSGLKEMS 240
H IMTLPSKDGKE KDGSKAK K LG KDFFSDFSF ST+ITDEEYSVSKISSGLKEM+
Sbjct: 181 HNIMTLPSKDGKELKDGSKAKIKQLGVEKDFFSDFSFASTVITDEEYSVSKISSGLKEMT 240
Query: 241 LDTNSKIQTGEFCVKESNEQFTILETPHAPAPTKNNVGRKAKGSKERTKVSATEESTKNL 300
DT SK QTGEFC K+SNEQFTILETPH PAPTKN+VGRKA+G+KERT VSAT ES NL
Sbjct: 241 FDTKSKAQTGEFCGKQSNEQFTILETPHGPAPTKNSVGRKARGTKERTNVSATAESNNNL 300
Query: 301 SDAPSTSNQCNTNCNLMTEEPRGGSNDLSGTGIKSSLKQPGKKNLRRSVTWADEKTDDTS 360
SD+PSTSN CNTNCN+ TEEP+GGSN+L+ T IKSSLKQPGKKNLRRSVTWAD KTD+TS
Sbjct: 301 SDSPSTSNHCNTNCNITTEEPKGGSNELNETQIKSSLKQPGKKNLRRSVTWADAKTDETS 360
Query: 361 IINPPEVREMGKTKECSRITNNLVNSDNDNEDLLRFESAEACAMALSQAAEAITSGKNEV 420
IIN PE REMGKTKECSR+T+NLVN+DN NED+LR ESAEACAMALSQAAEAITSG+NEV
Sbjct: 361 IINLPEDREMGKTKECSRMTSNLVNADNGNEDMLRVESAEACAMALSQAAEAITSGQNEV 420
Query: 421 SDAVSEAGIIILPRPSDANEEASTNPVNAFEPHSLSEKSNKLGELRSDLFDPNDSWYDAP 480
SDAVSEAGIIILPRPSDANEE STN N EP+S SEKSNK G L SDLFDP DSWYD+P
Sbjct: 421 SDAVSEAGIIILPRPSDANEEVSTNGKNISEPYSSSEKSNKPGILHSDLFDPEDSWYDSP 480
Query: 481 PEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSTDGRS 540
PEGFSLTLSSFATMWMAIFAW+TSSSLAYIYGKD+KFHEEF YIDG+EYPRKIVS DGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS 540
Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGRCWYLSKQGIVGLKSERGEGVGVE 600
SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMG C
Sbjct: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMG-C---------------------- 600
Query: 601 GGFLDGHLLDTMTFLDALPAFRMKQWQ-------------------------------VL 660
LLDTMTFLDALPAFRMKQWQ VL
Sbjct: 601 -------LLDTMTFLDALPAFRMKQWQVIVLLFIEALSVCRIPSLDSQVSNSRSLFHKVL 660
Query: 661 DRAQIRPDEFELMKDHILPLGRTAQFSVENDA 662
DRAQIR +E+E +KDHILPLGRTAQFS ENDA
Sbjct: 661 DRAQIRSNEYETLKDHILPLGRTAQFSGENDA 662
BLAST of HG10002132 vs. TAIR 10
Match:
AT5G26760.2 (unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF408 (InterPro:IPR007308); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )
HSP 1 Score: 383.6 bits (984), Expect = 3.1e-106
Identity = 288/789 (36.50%), Positives = 388/789 (49.18%), Query Frame = 0
Query: 1 MAK-NQSVLIKETVFKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPL 60
MAK N+++ I + V KLQL +LE ++NQLFAA LMSRSDYEDVVTER+IA LCGY L
Sbjct: 1 MAKDNEAIAINDAVHKLQLYMLENTTDQNQLFAARKLMSRSDYEDVVTERAIAKLCGYTL 60
Query: 61 CHSNLSSDNTRKGRYRISLKEHKVYDLQETYKYCSSTCLINSRAFSGRLQDERCSVMNPE 120
C L SD +R+G+YRISLK+HKVYDLQET K+CS+ CLI+S+ FSG LQ+ R +
Sbjct: 61 CQRFLPSDVSRRGKYRISLKDHKVYDLQETSKFCSAGCLIDSKTFSGSLQEARTLEFDSV 120
Query: 121 KLKETLRLFENLSLDSKEKMGNNSDLGLE---IQEKIDSNIGEVPIEDWMGPSNAIEGYV 180
KL E L LF + SL+ K + N DL L I+E E+ +E WMGPSNA+EGYV
Sbjct: 121 KLNEILDLFGD-SLEVKGSLDVNKDLDLSKLMIKENFGVRGEELSLEKWMGPSNAVEGYV 180
Query: 181 PHSDHKIMTLPSKDGKESKDGSKAKSKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSGL 240
P K S D K + ++ K + FTST+I + SVSK+
Sbjct: 181 PFDRSK----SSNDSKATTQSNQEK-----------HEMDFTSTVIMPDVNSVSKLPPQT 240
Query: 241 KEMSLDTNSKIQTGEFCVKESN---EQFTILETPHAPAPTKNNVGRKAKG-SKERTKV-- 300
K+ S S G+ +KE + K G G ++E+T V
Sbjct: 241 KQASTVVESVDGKGKTVLKEQTVVPPTKKVSRFRREKEKEKKTFGVDGMGCAQEKTTVLP 300
Query: 301 -------SATEESTKN-----------------------LSDAPSTSNQCNTNCNLMTE- 360
+ E+ KN +S P S + + +C L +
Sbjct: 301 RKILSFCNEIEKDFKNFGFDEMGLASSAMMSDGYGVEYSVSKQPQCSMEDSLSCKLKGDL 360
Query: 361 EPRGGSNDLSGTG----------------------------------------------- 420
+ G N LSG+
Sbjct: 361 QTLDGKNTLSGSSSGSNTKGSKTKPEKSRKKIISVEYHANSYEDGEEILAAESYERHKAQ 420
Query: 421 ---------IKSSLKQPGKKNLRRSVTWADEKTDDTSIINPPEVREMGKTKECSRITNNL 480
KS LK G K L RSVTWAD+ + EVR S +N++
Sbjct: 421 DVCSSSEIVTKSCLKISGSKKLSRSVTWADQNDGRGDLC---EVRNNDNAAGPSLSSNDI 480
Query: 481 VNSDNDNEDLLRFESAEACAMALSQAAEAITSGKNEVSDAVSEAGIIILPR----PSDAN 540
D L R AEA A ALSQAAEA++SG ++ SDA ++AGII+LP +
Sbjct: 481 ----EDVNSLSRLALAEALATALSQAAEAVSSGNSDASDATAKAGIILLPSTHQLDEEVT 540
Query: 541 EEASTNPVNAFEPHSLSEKSNKLGELRSDLFDPNDSWYDAPPEGFSLTLSSFATMWMAIF 600
EE S + EP +L + NK G SDLFD + SW+D PPEGF+LTLS+FA MW ++F
Sbjct: 541 EEHSEEEMTEEEP-TLLKWPNKPGIPDSDLFDRDQSWFDGPPEGFNLTLSNFAVMWDSLF 600
Query: 601 AWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSTDGRSSEIKQTLAGCLTRSIPGLA 660
W++SSSLAYIYGK++ HEEFL ++GKEYPR+I+ DG SSEIKQT+AGCL R++P +
Sbjct: 601 GWVSSSSLAYIYGKEESAHEEFLLVNGKEYPRRIIMVDGLSSEIKQTIAGCLARALPRVV 660
Query: 661 SELKLSTPISSLEHGMGRCWYLSKQGIVGLKSERGEGVGVEGGFLDGHLLDTMTFLDALP 662
+ L+L IS LE G+ G LL+TM+ A+P
Sbjct: 661 THLRLPIAISELEKGL------------------------------GSLLETMSLTGAVP 720
BLAST of HG10002132 vs. TAIR 10
Match:
AT5G26760.1 (unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )
HSP 1 Score: 232.6 bits (592), Expect = 8.9e-61
Identity = 158/424 (37.26%), Positives = 222/424 (52.36%), Query Frame = 0
Query: 274 KNNVGRKAKGSKERTKVSATEESTKNLSDAPSTSNQCNTNCNLMTEE--PRGGSNDLSGT 333
KN + + GS + + E+S K + +N ++ E R + D+ +
Sbjct: 45 KNTLSGSSSGSNTKGSKTKPEKSRKKIISVEYHANSYEDGEEILAAESYERHKAQDVCSS 104
Query: 334 G---IKSSLKQPGKKNLRRSVTWADEKTDDTSIINPPEVREMGKTKECSRITNNLVNSDN 393
KS LK G K L RSVTWAD+ + EVR S +N++
Sbjct: 105 SEIVTKSCLKISGSKKLSRSVTWADQNDGRGDLC---EVRNNDNAAGPSLSSNDI----E 164
Query: 394 DNEDLLRFESAEACAMALSQAAEAITSGKNEVSDAVSEAGIIILPR----PSDANEEAST 453
D L R AEA A ALSQAAEA++SG ++ SDA ++AGII+LP + EE S
Sbjct: 165 DVNSLSRLALAEALATALSQAAEAVSSGNSDASDATAKAGIILLPSTHQLDEEVTEEHSE 224
Query: 454 NPVNAFEPHSLSEKSNKLGELRSDLFDPNDSWYDAPPEGFSLTLSSFATMWMAIFAWITS 513
+ EP +L + NK G SDLFD + SW+D PPEGF+LTLS+FA MW ++F W++S
Sbjct: 225 EEMTEEEP-TLLKWPNKPGIPDSDLFDRDQSWFDGPPEGFNLTLSNFAVMWDSLFGWVSS 284
Query: 514 SSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSTDGRSSEIKQTLAGCLTRSIPGLASELKL 573
SSLAYIYGK++ HEEFL ++GKEYPR+I+ DG SSEIKQT+AGCL R++P + + L+L
Sbjct: 285 SSLAYIYGKEESAHEEFLLVNGKEYPRRIIMVDGLSSEIKQTIAGCLARALPRVVTHLRL 344
Query: 574 STPISSLEHGMGRCWYLSKQGIVGLKSERGEGVGVEGGFLDGHLLDTMTFLDALPAFRMK 633
IS LE G+ G LL+TM+ A+P+FR+K
Sbjct: 345 PIAISELEKGL------------------------------GSLLETMSLTGAVPSFRVK 404
Query: 634 QW---------------------------QVLDRAQIRPDEFELMKDHILPLGRTAQFSV 662
+W ++L+ + I +E+E MKD +LPLGR QF+
Sbjct: 405 EWLVIVLLFLDALSVSRIPRIAPYISNRDKILEGSGIGNEEYETMKDILLPLGRVPQFAT 430
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038893419.1 | 0.0e+00 | 84.10 | putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Benincasa h...
XP_031739958.1 | 1.4e-305 | 80.49 | putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cucumis sat...
XP_008454119.1 | 7.0e-302 | 80.20 | PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [...
KAA0044516.1 | 7.8e-301 | 79.77 | putative RNA polymerase II subunit B1 CTD phosphatase RPAP2-like protein [Cucumi...
XP_022955995.1 | 2.5e-299 | 79.34 | putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog isoform X1 [...
Match Name | E-value | Identity (%) | Description
F4K1B1 | 4.4e-105 | 36.50 | Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Arabidops...
A2Y040 | 8.9e-82 | 32.77 | Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Oryza sat...
Q6AVZ9 | 2.6e-81 | 32.63 | Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Oryza sat...
Q8IXW5 | 2.6e-09 | 23.56 | Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 OS=Homo sapiens OX=9...
Q5RA37 | 3.4e-09 | 23.56 | Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 OS=Pongo abelii OX=9...
Match Name | E-value | Identity (%) | Description
A0A0A0KVU3 | 6.6e-306 | 80.49 | RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucumis sativus OX...
A0A1S3BXZ9 | 3.4e-302 | 80.20 | RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucumis melo OX=36...
A0A5A7TQX7 | 3.8e-301 | 79.77 | RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucumis melo var. ...
A0A6J1GWL9 | 1.2e-299 | 79.34 | RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucurbita moschata...
A0A6J1IY57 | 8.7e-298 | 78.76 | RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucurbita maxima O...
Match Name | E-value | Identity (%) | Description
AT5G26760.2 | 3.1e-106 | 36.50 | unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF408 ...
AT5G26760.1 | 8.9e-61 | 37.26 | unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae -...
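The identity and positives figures in the HSP header lines are simple ratios of matching columns to aligned columns (for example, 552/692 gives 79.77%). As a minimal sketch of how those percentages can be re-derived from a header line, the Python below parses one such line and recomputes both values. The function name `parse_hsp_stats` and the returned field names are hypothetical, chosen for illustration only, and the pattern assumes the header wording used in this report.

```python
import re

# Pattern for the statistics line of an HSP header as it appears in this report.
# "Posi?tives" also accepts the "Postives" spelling some report pages emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP statistics line and recompute its percentages.

    Hypothetical helper: returns the raw counts, the percentages as
    reported in the line, and the percentages recomputed from the counts.
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
        # Recomputed as 100 * matches / aligned columns, rounded to 2 places.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
    }


if __name__ == "__main__":
    header = ("Identity = 552/692 (79.77%), "
              "Positives = 585/692 (84.54%), Query Frame = 0")
    print(parse_hsp_stats(header))
```

Comparing the recomputed values with the reported ones is a quick consistency check when these lines are copied between the alignment blocks and the summary tables.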