CaUC07G137190 (gene) Watermelon (USVL246-FR2) v1

Overview
Name: CaUC07G137190
Type: gene
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog
Location: Ciama_Chr07: 27557716 .. 27562914 (-)
RNA-Seq Expression: CaUC07G137190
Synteny: CaUC07G137190
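
The location field in the overview can be split into sequence ID, coordinates and strand. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state its convention); `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'seqid: start .. end (strand)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both end points count.
    return end - start + 1

seqid, start, end, strand = parse_location("Ciama_Chr07: 27557716 .. 27562914 (-)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the gene spans 5199 bp on the minus strand of Ciama_Chr07.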
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCAAAGAATCAGTCTGTTTTGATTAAAGACACAGTATATAAATTACAGCTTGCACTCCTTGAGGGCATTCAAAATGAGAACCAGCTATTTGCGGCTGGGTCTCTGATGTCTCGGAGTGACTATGAAGATGTGGTGACTGAGCGGTCTATTGCAAACCTCTGTGGATTTCCATTATGCCATTCTAATTTGTCATCTGATAACACTAGGAAAGGCCGGTACAGAGTTTCATTGAAAGAACATAAGGTGTATGATTTACAAGAGACATATAAGTACTGCTCTTCCACTTGCCTCATTAACAGCCGTGCCTTTTCTGCAAGATTGCAAGATGAGAGATGTTCGGTTATGAATCCAGAGAAACTTGAAGAAACTCTTAGACTGTTTGAGAATCTGAGTTTGGATTCTAAGGAAAATGCAGGGAATATTCGTGATTTGGGGCTTGAAATTCAGGAGAAGATAGATAGCAATATTGGAGAAGTTCCCATTGAAGATTGGATGGGTCCATCAAATGCAATTGAAGGCTATGTGCCTCACAGAGATCATAAGATCATGACTTTGCCCAGCAAGGATGGCAAAGAATCCAAGGATGGTAACTTTTTTTTCTTTTTTTTTTCTTTGTATGCCGAAGTTTTATTGTGTTTGGTTGTTGAATTCCATATTGGAGAAGCTTATGCATTAAAAAATGTTCCTTGCAAAAGGAAAAATGTTCTTAGAATGCTAAAATGGAGAAGTGGAACTGTAGATTTGTTTGCATTGGTGGTTAAAGTGAATTTAAGTACGCTGTCATAGGGTCTAATGCTAAAAGTTCGCCATTGAACTTTTTCTAGTATGCCGGCAACTTGTTGTGTTTGATTGTTAAATTCGATTCATATCGAATTTAATGCATTGTTGGGATGTCCCTTTGAGAAAAGCTGAAGAGAAAAAAAGGATTCTTAGAATACCCAAGAAAAAGGGAAAAAAAGAAAGTGAAACAGTGTATTTATTTGCTCCTGTTAGTTTAATTGAGTTTTGGTGCACTATTTTAGGTTCTAAAGCTAAAAATAAGCCATTGGGCGGTGGAAAGGATTTCTTCAGCGACCTCTCCTTCACGAGTACTATAATCACAGATGAAGAGTATAGTGTTTCAAAGATATCATCTGGTTTGAAAGAGATGGCTTTTGATACTAATTCGAAGACACAAACTGGAGAATTCTGTGTTAAAGACTCAAATGAACAATTTACCATTTTGGAAACCCCACATGCTCCAGCTCCCACAAAAAACAGTATTGAACGGAAGGCAAGAGGATCCAAAGAAAGGACTAAAGTATCAGCCACCAAAGAGAGTACTAATAATTTGTCTGATGCTCCTTCGACTTCAAATCAGAGCAATACTAATTTCAATTTAGTGACAGAAGAACCAAGAGGTGGATCCAATGATCTTAGCGGAACTGAGATCAAATCCTCCCTTAAACAACCAGGCAAGAAAAACCTGCATCGCTCTGTAACTTGGGCAGATGAAAAGACTGATGAAACCAGTATTATAAACCTTCCAGAGGTCAGAGAAATGGGGAAGACAAAGGAATGTTCAAGAATTACAAGCAATTTGGTGAATTCCGACAATGATGATGAGGACCTATTACGGCTTGAATCTGCTGAAGCTTGTGCAATGGCACTGAGCCAGGCAGCTGAAGCAATTACTTCTGGGCAAAATGAGGTCTCTGATGCAGGTAATGTACTCTATGCATGATGATTGAGGTTAGTTATTGTACTACATGCTATAAAGATTATATTACTTGTCTGGCATTGGCCATTAGATGAGCTCTGCTGAAACTATTTCTATTTTTCCCTATTTTGATGTATAAACCAATCAGAGATTTAATTTTTGTTGTGTTTCTATTGGTTATTAGCAAATATATTATCATCCCCTTCTCTGCGTTCTTTTTCCTTGAAACATTGAAACTTCATCCTACAAAAAAAAAAAGGTGGCTTGAGTGGTTATAAAGTTGCTGTATGAATATGAA
CTAGATGGTTACTCATATAGTTTTTCTTTCTGTGTTCTGTTCTTTGTGATATCTCACAAATAGAAGAAACTTCATATAGTTTTTTTGAGTTCGGTTCTTTGTAATATTCAGTGTCTGAAGCTGGAATTATTATATTGCCACGCCCAATTGGTGCTAATGAAGAGGCATCTACTAATCCTGTCAACGCATCTGAACCACATTCATTCTCAGAGAAGTCAAACAGACTTGGGGAATTACGTTCGGATCTATTTGATCCCAGTGACTCTTGGTATGATGCGCCACCAGAGGGTTTCAGCCTTACTGTAAGCTCCTTTTCTTTCTTTGGATGTACCTTCTCTTCCTAACCTAAAAATATTCTTTTTCTCACCACATTTACCCTGTTGTTTCCAGTTATCTTCTTTTGCAACCATGTGGATGGCAATCTTTGCATGGATAACATCATCTTCCCTAGCCTACATTTATGGAAAAGATGATAAGTTTCATGAGGAATTTCTATATATTGATGGGAAGGAGTATCCAAGGAAAATTGTCTCTGCTGATGGCCGATCTTCTGAAATCAAGCAAACACTTGCTGGATGTCTAACACGGTCAATACCTGGACTTGCTTCTGAACTTAAGCTATCAACCCCAATATCAAGTTTGGAGCACGGGATGGTAATGATAAATATTCATGACATTGTCTTTTTCTCTCAATGACTTTTGTTGGTGCTAATTGAGAACTTACATTAGTTTGCACATATGCAGTTCTTATTATTGTTATTATTTATTTTGTGTTGATATATATGTATAGAGGTAGTATCTCTGGTACACTGGTACCATTCCCCAGAGGAGTGGCTCTCATTTATATTTACTAGCTATGGTAAATTCATGGTTTTGATACTTCAAGACACCGCATCTTGATCTAGTTTTTGGAAAACGCTGATGGGAGGAAGGATGGGAGGTTGAATTGGATTCTCTTGACGTATTTAGGCTTTGTTGAATATGATTTTCAATAGAAGTTTTGAAGCTTTTTTTACTTGGCAAGTCCTTCATGGTTGTGCTAACATGTTGGATCGGTTTGTGAGGAAGTTGCCTTTGCTAGTTGGGCCTTTTTGTTGTATTCTTTGTTGGAAGGCGGAGGAAGACTTGGAATATATTCTTTGGCACTATGGTTACGTGAATAGTGTTTGGGATTCCTTCCTTCAGGAGTTTGGCTTGATGTATGTTCATCACAAAAATGGTAGCGATATGATTGAGGGATTCCTCCTCGATCCGCCTTTTGGAGAGGGGCTGATTTTTATGGCTTGCGGGGTTGTGTGCAATTATGTGGGTACTGTGGGGTGAGCAAAATAATAGGGTTATTAAGGGTTTGGATGGGATCCTTCAGAGACTTGGTCCCTTGTTCATTTTCATGTCTCTTCATGGGCTTTGATTTCGAAGACCTTTTGTAATTATTCTATAGATACTATCATGCGTAGTTGGAGGCCCTTCTTGTAGAGGGAGCTCCCTTTTTTGTGGGTTTGGTTTTTTGTATGTCCGTATGTTCTTCCATTTTTTCTCAATGAAAAATGTATTTTCCATTTAAAAAATTGGGAGAGACTTCCATTCAAAATCGCCTTTGGGAGATTGCTGAAGCAAACAGAGGAGTGTATATTCTTATCTCACTGTCAGTAATTTAACTAAGCATTTTGGTCTTGGAGATTTAAGTTATTCGGAACACCAAAGTTTTAAACCTTGCTATACTCTCGAAATGGTGGGGGTTGTTGGTATTTTTTAGGGCAGGTGCTTGTATCTTTTAAAAACAGAGCATTGTTGGTTTGAAGTTTGTGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAAGGGGGAGGACTCTTGGATGTATGTATCCCTATCTCCTCTTTGAACTCGGTTTTTTTGGCTTGTGGTTTCTATTCTATTGTTACTATTAAAGACTGGATGGAATTCTGCATGGAATTTTAAATTAAATTTAGAAGGCATTCCCTTGGTTTTGGAAGGTCTT
ATACCACATGAGAGAAGGATACTGTTAATCCAACTTTGGCATATGAGTGAAGTTGATTCGTGTTCTTGTGCAGGGGCACTTGTTAGACACCATGACTTTCCTTGATGCACTTCCAGCATTCAGAATGAAGCAGTGGCAAGTAATTGTTCTATTATTTATAGAAGCTTTATCTGTTTGCCGGATTCCTTCACTTGCCTCCCACATGTCAAATAGTAGAAGTCTGTATCACAAGGTTTGGCTCCCTTTCCGCCCTCTTGCTTAGAAAAGAATGCTTTGTGTTGAACGAAATGTTAAATAAGCAGTTTAAGAGTTAAAAATGATAATGTAGATGATCTAAAATATGCTCTACTTATCAATTAGCTAATGACTAGTTCTAAAGCTGTATACTAGGAGTTTCTGAATTGTATGCGTAGAATTGATATCAAATCAACTTCTTTGTGCTCTTTTTGTAGTGCCCAATGAAAATTGGTTGTTGACATGCTCTTATTCCTTTCCTTGACTGGAATTTTTTCGTTTTGAGGAAATCGATAAGTTGTTGAAATTGCCGCACATACTTTATCTGATCAATATTCTATAAAAATGTTACCTTATTTATTGTCCTTTTCAATTCAAGCACTTCTGTAGTTTTTATTTTCTTCATATTTTTGCATCTGCAATCTTTTTTCTATTTCTTTTGTGCCTGCTTGTATATTCATATTTATCAGTCGAACAAATCTCAGCATTGTTACCCGAGAAATAAATATTTGTGTTGTCATTTTTATTTTACTTTATTTTTGAGGTCGACAGATATTTCGAGAAATAAATATTTGTGTTGTCATTTTTATTTTACTTTATTTTTGAGGACGACAGATATTTGTTTCTTGATTCTAATGGTGGATATTCCTCTTTCAATCCAAAATTTGAAAATTGTGCCGTAATATTGTTTAGTTATCGGAAAAAAGGTATTACATAGTTCAATGATGCCCTCTATAGTTCATGTCATGCTCTTGTGTAGCGTAAGTGATTGATCATGAAGAGTACATGTAAATGCCATATTCCTTGATTTTAAAATTGGGGTAAAGTTTAACATCCTGCTCTCACTCTCGTGTACATAGGTGCTTGATCGTGCTCAGATACGATCCGACGAATACGAGGTTATGAAAGATCATATATTACCGCTTGGTCGAACAGCTCAATTTTCAGGCAACAATGATGCCTAA

mRNA sequence

ATGGCAAAGAATCAGTCTGTTTTGATTAAAGACACAGTATATAAATTACAGCTTGCACTCCTTGAGGGCATTCAAAATGAGAACCAGCTATTTGCGGCTGGGTCTCTGATGTCTCGGAGTGACTATGAAGATGTGGTGACTGAGCGGTCTATTGCAAACCTCTGTGGATTTCCATTATGCCATTCTAATTTGTCATCTGATAACACTAGGAAAGGCCGGTACAGAGTTTCATTGAAAGAACATAAGGTGTATGATTTACAAGAGACATATAAGTACTGCTCTTCCACTTGCCTCATTAACAGCCGTGCCTTTTCTGCAAGATTGCAAGATGAGAGATGTTCGGTTATGAATCCAGAGAAACTTGAAGAAACTCTTAGACTGTTTGAGAATCTGAGTTTGGATTCTAAGGAAAATGCAGGGAATATTCGTGATTTGGGGCTTGAAATTCAGGAGAAGATAGATAGCAATATTGGAGAAGTTCCCATTGAAGATTGGATGGGTCCATCAAATGCAATTGAAGGCTATGTGCCTCACAGAGATCATAAGATCATGACTTTGCCCAGCAAGGATGGCAAAGAATCCAAGGATGGTTCTAAAGCTAAAAATAAGCCATTGGGCGGTGGAAAGGATTTCTTCAGCGACCTCTCCTTCACGAGTACTATAATCACAGATGAAGAGTATAGTGTTTCAAAGATATCATCTGGTTTGAAAGAGATGGCTTTTGATACTAATTCGAAGACACAAACTGGAGAATTCTGTGTTAAAGACTCAAATGAACAATTTACCATTTTGGAAACCCCACATGCTCCAGCTCCCACAAAAAACAGTATTGAACGGAAGGCAAGAGGATCCAAAGAAAGGACTAAAGTATCAGCCACCAAAGAGAGTACTAATAATTTGTCTGATGCTCCTTCGACTTCAAATCAGAGCAATACTAATTTCAATTTAGTGACAGAAGAACCAAGAGGTGGATCCAATGATCTTAGCGGAACTGAGATCAAATCCTCCCTTAAACAACCAGGCAAGAAAAACCTGCATCGCTCTGTAACTTGGGCAGATGAAAAGACTGATGAAACCAGTATTATAAACCTTCCAGAGGTCAGAGAAATGGGGAAGACAAAGGAATGTTCAAGAATTACAAGCAATTTGGTGAATTCCGACAATGATGATGAGGACCTATTACGGCTTGAATCTGCTGAAGCTTGTGCAATGGCACTGAGCCAGGCAGCTGAAGCAATTACTTCTGGGCAAAATGAGGTCTCTGATGCAGTGTCTGAAGCTGGAATTATTATATTGCCACGCCCAATTGGTGCTAATGAAGAGGCATCTACTAATCCTGTCAACGCATCTGAACCACATTCATTCTCAGAGAAGTCAAACAGACTTGGGGAATTACGTTCGGATCTATTTGATCCCAGTGACTCTTGGTATGATGCGCCACCAGAGGGTTTCAGCCTTACTTTATCTTCTTTTGCAACCATGTGGATGGCAATCTTTGCATGGATAACATCATCTTCCCTAGCCTACATTTATGGAAAAGATGATAAGTTTCATGAGGAATTTCTATATATTGATGGGAAGGAGTATCCAAGGAAAATTGTCTCTGCTGATGGCCGATCTTCTGAAATCAAGCAAACACTTGCTGGATGTCTAACACGGTCAATACCTGGACTTGCTTCTGAACTTAAGCTATCAACCCCAATATCAAGTTTGGAGCACGGGATGGGGCACTTGTTAGACACCATGACTTTCCTTGATGCACTTCCAGCATTCAGAATGAAGCAGTGGCAAGTAATTGTTCTATTATTTATAGAAGCTTTATCTGTTTGCCGGATTCCTTCACTTGCCTCCCACATGTCAAATAGTAGAAGTCTGTATCACAAGGTGCTTGATCGTGCTCAGATACGATCCGACGAATACGAGGTTATGAAAGATCATATATTACCGCTTGGTCGAACAGCTCAATTTTCAGGCAACAATGATGCCTAA

Coding sequence (CDS)

ATGGCAAAGAATCAGTCTGTTTTGATTAAAGACACAGTATATAAATTACAGCTTGCACTCCTTGAGGGCATTCAAAATGAGAACCAGCTATTTGCGGCTGGGTCTCTGATGTCTCGGAGTGACTATGAAGATGTGGTGACTGAGCGGTCTATTGCAAACCTCTGTGGATTTCCATTATGCCATTCTAATTTGTCATCTGATAACACTAGGAAAGGCCGGTACAGAGTTTCATTGAAAGAACATAAGGTGTATGATTTACAAGAGACATATAAGTACTGCTCTTCCACTTGCCTCATTAACAGCCGTGCCTTTTCTGCAAGATTGCAAGATGAGAGATGTTCGGTTATGAATCCAGAGAAACTTGAAGAAACTCTTAGACTGTTTGAGAATCTGAGTTTGGATTCTAAGGAAAATGCAGGGAATATTCGTGATTTGGGGCTTGAAATTCAGGAGAAGATAGATAGCAATATTGGAGAAGTTCCCATTGAAGATTGGATGGGTCCATCAAATGCAATTGAAGGCTATGTGCCTCACAGAGATCATAAGATCATGACTTTGCCCAGCAAGGATGGCAAAGAATCCAAGGATGGTTCTAAAGCTAAAAATAAGCCATTGGGCGGTGGAAAGGATTTCTTCAGCGACCTCTCCTTCACGAGTACTATAATCACAGATGAAGAGTATAGTGTTTCAAAGATATCATCTGGTTTGAAAGAGATGGCTTTTGATACTAATTCGAAGACACAAACTGGAGAATTCTGTGTTAAAGACTCAAATGAACAATTTACCATTTTGGAAACCCCACATGCTCCAGCTCCCACAAAAAACAGTATTGAACGGAAGGCAAGAGGATCCAAAGAAAGGACTAAAGTATCAGCCACCAAAGAGAGTACTAATAATTTGTCTGATGCTCCTTCGACTTCAAATCAGAGCAATACTAATTTCAATTTAGTGACAGAAGAACCAAGAGGTGGATCCAATGATCTTAGCGGAACTGAGATCAAATCCTCCCTTAAACAACCAGGCAAGAAAAACCTGCATCGCTCTGTAACTTGGGCAGATGAAAAGACTGATGAAACCAGTATTATAAACCTTCCAGAGGTCAGAGAAATGGGGAAGACAAAGGAATGTTCAAGAATTACAAGCAATTTGGTGAATTCCGACAATGATGATGAGGACCTATTACGGCTTGAATCTGCTGAAGCTTGTGCAATGGCACTGAGCCAGGCAGCTGAAGCAATTACTTCTGGGCAAAATGAGGTCTCTGATGCAGTGTCTGAAGCTGGAATTATTATATTGCCACGCCCAATTGGTGCTAATGAAGAGGCATCTACTAATCCTGTCAACGCATCTGAACCACATTCATTCTCAGAGAAGTCAAACAGACTTGGGGAATTACGTTCGGATCTATTTGATCCCAGTGACTCTTGGTATGATGCGCCACCAGAGGGTTTCAGCCTTACTTTATCTTCTTTTGCAACCATGTGGATGGCAATCTTTGCATGGATAACATCATCTTCCCTAGCCTACATTTATGGAAAAGATGATAAGTTTCATGAGGAATTTCTATATATTGATGGGAAGGAGTATCCAAGGAAAATTGTCTCTGCTGATGGCCGATCTTCTGAAATCAAGCAAACACTTGCTGGATGTCTAACACGGTCAATACCTGGACTTGCTTCTGAACTTAAGCTATCAACCCCAATATCAAGTTTGGAGCACGGGATGGGGCACTTGTTAGACACCATGACTTTCCTTGATGCACTTCCAGCATTCAGAATGAAGCAGTGGCAAGTAATTGTTCTATTATTTATAGAAGCTTTATCTGTTTGCCGGATTCCTTCACTTGCCTCCCACATGTCAAATAGTAGAAGTCTGTATCACAAGGTGCTTGATCGTGCTCAGATACGATCCGACGAATACGAGGTTATGAAAGATCATATATTACCGCTTGGTCGAACAGCTCAATTTTCAGGCAACAATGATGCCTAA

Protein sequence

MAKNQSVLIKDTVYKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGFPLCHSNLSSDNTRKGRYRVSLKEHKVYDLQETYKYCSSTCLINSRAFSARLQDERCSVMNPEKLEETLRLFENLSLDSKENAGNIRDLGLEIQEKIDSNIGEVPIEDWMGPSNAIEGYVPHRDHKIMTLPSKDGKESKDGSKAKNKPLGGGKDFFSDLSFTSTIITDEEYSVSKISSGLKEMAFDTNSKTQTGEFCVKDSNEQFTILETPHAPAPTKNSIERKARGSKERTKVSATKESTNNLSDAPSTSNQSNTNFNLVTEEPRGGSNDLSGTEIKSSLKQPGKKNLHRSVTWADEKTDETSIINLPEVREMGKTKECSRITSNLVNSDNDDEDLLRLESAEACAMALSQAAEAITSGQNEVSDAVSEAGIIILPRPIGANEEASTNPVNASEPHSFSEKSNRLGELRSDLFDPSDSWYDAPPEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSADGRSSEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGHLLDTMTFLDALPAFRMKQWQVIVLLFIEALSVCRIPSLASHMSNSRSLYHKVLDRAQIRSDEYEVMKDHILPLGRTAQFSGNNDA
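
The protein sequence above can be derived from the CDS by codon-wise translation. The sketch below assumes the standard genetic code and a CDS that starts in frame; `translate` is a hypothetical helper written for illustration, not a function of the database.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: do not emit a residue
            break
        protein.append(amino_acid)
    return "".join(protein)

# First eight codons and last six codons of the CDS listed above.
print(translate("ATGGCAAAGAATCAGTCTGTTTTG"))
print(translate("GGCAACAATGATGCCTAA"))
```

The two fragments correspond to the first eight residues and the last five residues of the protein sequence.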
Homology
BLAST of CaUC07G137190 vs. NCBI nr
Match: XP_038893419.1 (putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Benincasa hispida])

HSP 1 Score: 1194.9 bits (3090), Expect = 0.0e+00
Identity = 615/662 (92.90%), Positives = 636/662 (96.07%), Query Frame = 0
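
The percentages in an HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them from the `a/b` fractions; `parse_hsp_counts` and `percent` are hypothetical helper names, and the parser reads any `Label = a/b` field without depending on the label text.

```python
import re

def parse_hsp_counts(line):
    """Return {label: (matches, alignment_length)} for each 'Label = a/b' field."""
    return {
        label: (int(matches), int(length))
        for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", line)
    }

def percent(matches, length):
    # Share of alignment columns, as a percentage rounded to two decimals.
    return round(100.0 * matches / length, 2)

counts = parse_hsp_counts("Identity = 615/662 (92.90%)")
for label, (matches, length) in counts.items():
    print(label, percent(matches, length))
```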

Query: 1   MAKNQSVLIKDTVYKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGFPLC 60
           MAKNQSVLIKDTVYKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCG+PLC
Sbjct: 1   MAKNQSVLIKDTVYKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60

Query: 61  HSNLSSDNTRKGRYRVSLKEHKVYDLQETYKYCSSTCLINSRAFSARLQDERCSVMNPEK 120
           HSNLSSDNTR+GRYR+SLKEHKVYDLQETYKYCSSTCLINSRAFS RLQ+ERCSVMNPEK
Sbjct: 61  HSNLSSDNTRRGRYRISLKEHKVYDLQETYKYCSSTCLINSRAFSGRLQNERCSVMNPEK 120

Query: 121 LEETLRLFENLSLDSKENAGNIRDLGLEIQEKIDSNIGEVPIEDWMGPSNAIEGYVPHRD 180
           L+E LRLFENLSLDSKEN GN  DLGLEIQE I+SN GEVPIE+WMGPSNAIEGYVPHRD
Sbjct: 121 LKEILRLFENLSLDSKENVGNNCDLGLEIQENIESNTGEVPIEEWMGPSNAIEGYVPHRD 180

Query: 181 HKIMTLPSKDGKESKDGSKAKNKPLGGGKDFFSDLSFTSTIITDEEYSVSKISSGLKEMA 240
           HKIMTLPSKDGKESKDGSKAK KPLGGGKDFFSDLSFTSTI+TDEEYSVSKISSGLKEMA
Sbjct: 181 HKIMTLPSKDGKESKDGSKAKIKPLGGGKDFFSDLSFTSTILTDEEYSVSKISSGLKEMA 240

Query: 241 FDTNSKTQTGEFCVKDSNEQFTILETPHAPAPTKNSIERKARGSKERTKVSATKESTNNL 300
           FDT+SK QTGE C K+S +QFTILETPHAPAPTKNS+ RKARGSKERTKVSATKESTNNL
Sbjct: 241 FDTDSKIQTGELCGKESKDQFTILETPHAPAPTKNSVGRKARGSKERTKVSATKESTNNL 300

Query: 301 SDAPSTSNQSNTNFNLVTEEPRGGSNDLSGTEIKSSLKQPGKKNLHRSVTWADEKTDETS 360
           SDAPSTSNQ NTN NL+TEEPRGGSNDLSGTEIKSSLKQPGKKNLHRSVTWADEKT +TS
Sbjct: 301 SDAPSTSNQCNTNCNLMTEEPRGGSNDLSGTEIKSSLKQPGKKNLHRSVTWADEKTVDTS 360

Query: 361 IINLPEVREMGKTKECSRITSNLVNSDNDDEDLLRLESAEACAMALSQAAEAITSGQNEV 420
           IINLPEVREMGK KECSRIT NLVNSDND+ DLLR+ESAEACAMAL+QAAEAI+SGQNEV
Sbjct: 361 IINLPEVREMGKKKECSRITRNLVNSDNDNGDLLRVESAEACAMALTQAAEAISSGQNEV 420

Query: 421 SDAVSEAGIIILPRPIGANEEASTNPVNASEPHSFSEKSNRLGELRSDLFDPSDSWYDAP 480
           SDAVSEAGIIILPRP   NEEASTNPVNASEPHS SEKSN+LG LRSDLFDP+DSWYDAP
Sbjct: 421 SDAVSEAGIIILPRPNDGNEEASTNPVNASEPHSSSEKSNKLGVLRSDLFDPNDSWYDAP 480

Query: 481 PEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSADGRS 540
           PEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSADGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSADGRS 540

Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGHLLDTMTFLDALPAFRMKQWQVIV 600
           SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGHLLDTMTFLDALPAFRMKQWQVIV
Sbjct: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGHLLDTMTFLDALPAFRMKQWQVIV 600

Query: 601 LLFIEALSVCRIPSLASHMSNSRSLYHKVLDRAQIRSDEYEVMKDHILPLGRTAQFSGNN 660
           LLFIEALSVC+IPSLASHMSNSRSLYHKVLDRAQIRSDEYEVMKDHILPLGR AQFSG N
Sbjct: 601 LLFIEALSVCQIPSLASHMSNSRSLYHKVLDRAQIRSDEYEVMKDHILPLGRIAQFSGEN 660

Query: 661 DA 663
           DA
Sbjct: 661 DA 662
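
In the alignment blocks, the unlabeled middle row marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for any other mismatch or gap. A minimal sketch for tallying one such row follows; `summarize_midline` is a hypothetical helper, and the input must be the midline alone, without the leading indentation.

```python
def summarize_midline(midline):
    """Count identical, conservative ('+') and remaining columns of a BLAST midline."""
    identities = sum(ch.isalpha() for ch in midline)
    conservative = midline.count("+")
    other = len(midline) - identities - conservative
    return identities, conservative, other

# Excerpt of the midline for query positions 61-80 of the first hit.
print(summarize_midline("HSNLSSDNTR+GRYR+SLKE"))
```

Summing the identical and conservative columns over all blocks of an HSP gives the numerator of its positives figure.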

BLAST of CaUC07G137190 vs. NCBI nr
Match: XP_031739958.1 (putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cucumis sativus] >KGN52984.1 hypothetical protein Csa_015280 [Cucumis sativus])

HSP 1 Score: 1144.4 bits (2959), Expect = 0.0e+00
Identity = 586/662 (88.52%), Positives = 622/662 (93.96%), Query Frame = 0

Query: 1   MAKNQSVLIKDTVYKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGFPLC 60
           MAKNQSVLIKDTVYKLQLAL EGI+NENQLFAAGSLMSRSDYEDVVTERSIA+LCG+PLC
Sbjct: 1   MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60

Query: 61  HSNLSSDNTRKGRYRVSLKEHKVYDLQETYKYCSSTCLINSRAFSARLQDERCSVMNPEK 120
           HSNL SDNTR+GRYR+SLKEHKVYDL+ETYKYCSS CLINSRAFS RLQDERCSVMNP+K
Sbjct: 61  HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120

Query: 121 LEETLRLFENLSLDSKENAGNIRDLGLEIQEKIDSNIGEVPIEDWMGPSNAIEGYVPHRD 180
           L+E L+LFEN+SLDSKEN GN  D GLEIQEKI+SNIGEVPIE+WMGPSNAIEGYVPHRD
Sbjct: 121 LKEILKLFENMSLDSKENMGNNCDSGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVPHRD 180

Query: 181 HKIMTLPSKDGKESKDGSKAKNKPLGGGKDFFSDLSFTSTIITDEEYSVSKISSGLKEMA 240
           HK+MTL SKDGKESKDGSKAK KPLGGGKDFFSD S TSTIITDEEYSVSKISSGLKEMA
Sbjct: 181 HKVMTLHSKDGKESKDGSKAKIKPLGGGKDFFSDFSITSTIITDEEYSVSKISSGLKEMA 240

Query: 241 FDTNSKTQTGEFCVKDSNEQFTILETPHAPAPTKNSIERKARGSKERTKVSATKESTNNL 300
            DTNSK QTGEFC K+SN+QF ILETPHAPAP KNS+ RKARGSKERTKVSATKEST+NL
Sbjct: 241 LDTNSKNQTGEFCGKESNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSATKESTDNL 300

Query: 301 SDAPSTSNQSNTNFNLVTEEPRGGSNDLSGTEIKSSLKQPGKKNLHRSVTWADEKTDETS 360
           SDAPSTS   +TNFNL+TEEPRGG NDLSGTE+KSSLK+PGKKNL RSVTWADEKTD+ S
Sbjct: 301 SDAPSTSKNRSTNFNLMTEEPRGGFNDLSGTELKSSLKKPGKKNLCRSVTWADEKTDDAS 360

Query: 361 IINLPEVREMGKTKECSRITSNLVNSDNDDEDLLRLESAEACAMALSQAAEAITSGQNEV 420
           I+NLPEV EMGKTKECSR TSNLVN DND+ED+LR+ESAEACAMALSQAAEAITSGQ+EV
Sbjct: 361 IMNLPEVGEMGKTKECSRTTSNLVNFDNDNEDILRVESAEACAMALSQAAEAITSGQSEV 420

Query: 421 SDAVSEAGIIILPRPIGANEEASTNPVNASEPHSFSEKSNRLGELRSDLFDPSDSWYDAP 480
           SDAVSEAGIIILP P  ANEEAST+PVNASEPHSFSEKSN+LG LRSDLFDPSDSWYDAP
Sbjct: 421 SDAVSEAGIIILPHPSDANEEASTDPVNASEPHSFSEKSNKLGVLRSDLFDPSDSWYDAP 480

Query: 481 PEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSADGRS 540
           PEGFSLTLSSFATMWMAIFAW+TSSSLAYIYGKDDKFHEEFLYIDGKEYP KIVSADGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGKEYPSKIVSADGRS 540

Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGHLLDTMTFLDALPAFRMKQWQVIV 600
           SEIKQTLAGCLTR+IPGLASEL LSTPIS LE+GM HLLDTMTFLDALPAFRMKQWQVIV
Sbjct: 541 SEIKQTLAGCLTRAIPGLASELNLSTPISRLENGMAHLLDTMTFLDALPAFRMKQWQVIV 600

Query: 601 LLFIEALSVCRIPSLASHMSNSRSLYHKVLDRAQIRSDEYEVMKDHILPLGRTAQFSGNN 660
           LLFIEALSV RIPSLASHMS+SR+LYHKVLDRAQIRSDEYE+M+DHILPLGRTAQ S  N
Sbjct: 601 LLFIEALSVSRIPSLASHMSSSRNLYHKVLDRAQIRSDEYEIMRDHILPLGRTAQLSDEN 660

Query: 661 DA 663
           DA
Sbjct: 661 DA 662

BLAST of CaUC07G137190 vs. NCBI nr
Match: XP_008454119.1 (PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cucumis melo] >XP_008454120.1 PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cucumis melo])

HSP 1 Score: 1127.9 bits (2916), Expect = 0.0e+00
Identity = 582/662 (87.92%), Positives = 614/662 (92.75%), Query Frame = 0

Query: 1   MAKNQSVLIKDTVYKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGFPLC 60
           MAKNQS LIKDTVYKLQLAL EGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCG+PLC
Sbjct: 1   MAKNQSALIKDTVYKLQLALYEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60

Query: 61  HSNLSSDNTRKGRYRVSLKEHKVYDLQETYKYCSSTCLINSRAFSARLQDERCSVMNPEK 120
           HSNL SDNTR+GRYR+SLKEHKVYDL+ETYKYCSS CLINSRAFS RLQDERCSVMNP K
Sbjct: 61  HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPAK 120

Query: 121 LEETLRLFENLSLDSKENAGNIRDLGLEIQEKIDSNIGEVPIEDWMGPSNAIEGYVPHRD 180
           L+E L+LFEN+SLDSKEN GN  D GLEIQEKI+S+IGEVPIE+WMGPSNAIEGYVPHRD
Sbjct: 121 LKEILKLFENMSLDSKENMGNNCDSGLEIQEKIESSIGEVPIEEWMGPSNAIEGYVPHRD 180

Query: 181 HKIMTLPSKDGKESKDGSKAKNKPLGGGKDFFSDLSFTSTIITDEEYSVSKISSGLKEMA 240
           HKIMTLPSKDGKESKDGS AK KPLGGGKDFFSD SFTSTIITDEEYSVSKISS LKEMA
Sbjct: 181 HKIMTLPSKDGKESKDGSTAKIKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSSLKEMA 240

Query: 241 FDTNSKTQTGEFCVKDSNEQFTILETPHAPAPTKNSIERKARGSKERTKVSATKESTNNL 300
            DTNSK QTGEFC K+SN+QFTILET HA AP KNS+  KARGSKERTKVSAT+ESTNNL
Sbjct: 241 LDTNSKIQTGEFCGKESNDQFTILETSHARAPPKNSVGHKARGSKERTKVSATEESTNNL 300

Query: 301 SDAPSTSNQSNTNFNLVTEEPRGGSNDLSGTEIKSSLKQPGKKNLHRSVTWADEKTDETS 360
           SDAPSTSN  +TNFNLVTEEP+GG NDL GTEIKSSLKQPGKKNL RSVTWADEK D+TS
Sbjct: 301 SDAPSTSNNRSTNFNLVTEEPKGGFNDLRGTEIKSSLKQPGKKNLRRSVTWADEKIDDTS 360

Query: 361 IINLPEVREMGKTKECSRITSNLVNSDNDDEDLLRLESAEACAMALSQAAEAITSGQNEV 420
            +NLPEV E GKTKECSRITSNLVN DND+EDL+R+ESAEACAMALSQAAEAITSGQ+EV
Sbjct: 361 -MNLPEVGEKGKTKECSRITSNLVNFDNDNEDLIRVESAEACAMALSQAAEAITSGQSEV 420

Query: 421 SDAVSEAGIIILPRPIGANEEASTNPVNASEPHSFSEKSNRLGELRSDLFDPSDSWYDAP 480
           S+AVSEAGIIILP P  ANEEAST PV ASEPHSFSEKSN+LG L SDLFDPSDSWYDAP
Sbjct: 421 SEAVSEAGIIILPHPSDANEEASTEPVKASEPHSFSEKSNKLGVLHSDLFDPSDSWYDAP 480

Query: 481 PEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSADGRS 540
           PEGFSLTLSSFATMWMAIFAW+TSSSLAYIYGKDDKFHEEF YIDGKEYP KIVSADGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFQYIDGKEYPSKIVSADGRS 540

Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGHLLDTMTFLDALPAFRMKQWQVIV 600
           SEIKQTLAGCLTR+IPGLASELKLSTPIS LE+GM HLLDTMTFLDALPAFRMKQWQVIV
Sbjct: 541 SEIKQTLAGCLTRAIPGLASELKLSTPISRLEYGMAHLLDTMTFLDALPAFRMKQWQVIV 600

Query: 601 LLFIEALSVCRIPSLASHMSNSRSLYHKVLDRAQIRSDEYEVMKDHILPLGRTAQFSGNN 660
           LLF+EALSVCRIPSLASHMS+SR+LYHKVLDRAQI+SDEYE+MKDHILPLG TAQ S  N
Sbjct: 601 LLFMEALSVCRIPSLASHMSSSRNLYHKVLDRAQIQSDEYEIMKDHILPLGLTAQLSVEN 660

Query: 661 DA 663
           DA
Sbjct: 661 DA 661

BLAST of CaUC07G137190 vs. NCBI nr
Match: KAA0044516.1 (putative RNA polymerase II subunit B1 CTD phosphatase RPAP2-like protein [Cucumis melo var. makuwa])

HSP 1 Score: 1127.5 bits (2915), Expect = 0.0e+00
Identity = 581/662 (87.76%), Positives = 615/662 (92.90%), Query Frame = 0

Query: 1   MAKNQSVLIKDTVYKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGFPLC 60
           MAKNQS LIKDTVYKLQLAL EGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCG+PLC
Sbjct: 1   MAKNQSALIKDTVYKLQLALYEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60

Query: 61  HSNLSSDNTRKGRYRVSLKEHKVYDLQETYKYCSSTCLINSRAFSARLQDERCSVMNPEK 120
           HSNL SDNTR+GRYR+SLKEHKVYDL+ETYKYCSS CLINSRAFS RLQDERCSVMNP K
Sbjct: 61  HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPAK 120

Query: 121 LEETLRLFENLSLDSKENAGNIRDLGLEIQEKIDSNIGEVPIEDWMGPSNAIEGYVPHRD 180
           L++ L+LFEN+SLDSKEN GN  D GLEIQEKI+S+IGEVPIE+WMGPSNAIEGYVPHRD
Sbjct: 121 LKDILKLFENMSLDSKENVGNNCDSGLEIQEKIESSIGEVPIEEWMGPSNAIEGYVPHRD 180

Query: 181 HKIMTLPSKDGKESKDGSKAKNKPLGGGKDFFSDLSFTSTIITDEEYSVSKISSGLKEMA 240
           HKIMTLPSKDGKESKDGS AK KPLGGGKDFFSD SFTSTIITDEEYSVSKISS LKEMA
Sbjct: 181 HKIMTLPSKDGKESKDGSTAKIKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSSLKEMA 240

Query: 241 FDTNSKTQTGEFCVKDSNEQFTILETPHAPAPTKNSIERKARGSKERTKVSATKESTNNL 300
            DTNSK QTGEFC K+SN+QFTILET HA AP KNS+  KARGSKERTKVSAT+ESTNNL
Sbjct: 241 LDTNSKIQTGEFCGKESNDQFTILETSHARAPPKNSVGHKARGSKERTKVSATEESTNNL 300

Query: 301 SDAPSTSNQSNTNFNLVTEEPRGGSNDLSGTEIKSSLKQPGKKNLHRSVTWADEKTDETS 360
           SDAPSTSN  +TNFNLVTEEP+GG NDL GTEIKSSLKQPGKKNL RSVTWADEK D+TS
Sbjct: 301 SDAPSTSNNRSTNFNLVTEEPKGGFNDLRGTEIKSSLKQPGKKNLRRSVTWADEKIDDTS 360

Query: 361 IINLPEVREMGKTKECSRITSNLVNSDNDDEDLLRLESAEACAMALSQAAEAITSGQNEV 420
            +NLPEV E GKTKECSRITSNLVN DND+EDLLR+ESAEACAMALSQAAEAITSGQ+EV
Sbjct: 361 -MNLPEVGEKGKTKECSRITSNLVNFDNDNEDLLRVESAEACAMALSQAAEAITSGQSEV 420

Query: 421 SDAVSEAGIIILPRPIGANEEASTNPVNASEPHSFSEKSNRLGELRSDLFDPSDSWYDAP 480
           S+AVSEAGIIILP P  ANEEAST+PV ASEPHSFSEKSN+LG L SDLFDPS+SWYDAP
Sbjct: 421 SEAVSEAGIIILPHPSDANEEASTDPVKASEPHSFSEKSNKLGVLHSDLFDPSESWYDAP 480

Query: 481 PEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSADGRS 540
           PEGFSLTLSSFATMWMAIFAW+TSSSLAYIYGKDDKFHEEF YIDGKEYP KIVSADGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFQYIDGKEYPSKIVSADGRS 540

Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGHLLDTMTFLDALPAFRMKQWQVIV 600
           SEIKQTLAGCLTR+IPGLASELKLSTP+S LE+GM HLLDTMTFLDALPAFRMKQWQVIV
Sbjct: 541 SEIKQTLAGCLTRAIPGLASELKLSTPVSRLEYGMAHLLDTMTFLDALPAFRMKQWQVIV 600

Query: 601 LLFIEALSVCRIPSLASHMSNSRSLYHKVLDRAQIRSDEYEVMKDHILPLGRTAQFSGNN 660
           LLFIEALSVCRIPSLASHMS+SR+LYHKVLDRAQI+SDEYE+MKDHILPLG TAQ S  N
Sbjct: 601 LLFIEALSVCRIPSLASHMSSSRNLYHKVLDRAQIQSDEYEIMKDHILPLGLTAQLSVEN 660

Query: 661 DA 663
           DA
Sbjct: 661 DA 661

BLAST of CaUC07G137190 vs. NCBI nr
Match: KAG6581990.1 (putative RNA polymerase II subunit B1 CTD phosphatase RPAP2-like protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1113.2 bits (2878), Expect = 0.0e+00
Identity = 575/662 (86.86%), Positives = 607/662 (91.69%), Query Frame = 0

Query: 1   MAKNQSVLIKDTVYKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGFPLC 60
           MAKNQ+ LIKDTVYKLQLALL+GI NENQLFAAGSLMSRSDYEDVVTERSIANLCG+PLC
Sbjct: 1   MAKNQTTLIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60

Query: 61  HSNLSSDNTRKGRYRVSLKEHKVYDLQETYKYCSSTCLINSRAFSARLQDERCSVMNPEK 120
            SNL SDNTRKGRYR+SLKEHKVYDL+ETYKYCSSTCLINSRAFS RLQDERCSVMNP K
Sbjct: 61  QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK 120

Query: 121 LEETLRLFENLSLDSKENAGNIRDLGLEIQEKIDSNIGEVPIEDWMGPSNAIEGYVPHRD 180
           L+E LRLFENLSLDSKEN  N  DLGLEIQEKI+S+IGEVPIE+WMGPSNAIEGYVPHR+
Sbjct: 121 LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIESSIGEVPIEEWMGPSNAIEGYVPHRN 180

Query: 181 HKIMTLPSKDGKESKDGSKAKNKPLGGGKDFFSDLSFTSTIITDEEYSVSKISSGLKEMA 240
           H IMTLPSKDGKE KDGSKAK K LG GKDFFSD SF +T+ITDEEYSVSKISSGLKEM 
Sbjct: 181 HNIMTLPSKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMT 240

Query: 241 FDTNSKTQTGEFCVKDSNEQFTILETPHAPAPTKNSIERKARGSKERTKVSATKESTNNL 300
           FDT SK QTGEFC K SNEQFTILETPH PAPTKNS+ RKARGSKERT VSAT ES NNL
Sbjct: 241 FDTKSKEQTGEFCGKQSNEQFTILETPHGPAPTKNSVGRKARGSKERTNVSATAESNNNL 300

Query: 301 SDAPSTSNQSNTNFNLVTEEPRGGSNDLSGTEIKSSLKQPGKKNLHRSVTWADEKTDETS 360
           SDAPSTSN  +TN N+ TEEP GGSNDL+ T+IKSSLKQPGKKNL RSVTWAD KTDETS
Sbjct: 301 SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS 360

Query: 361 IINLPEVREMGKTKECSRITSNLVNSDNDDEDLLRLESAEACAMALSQAAEAITSGQNEV 420
           IINLPE REMGKTKECSR+TSNLVN+DN +ED+LR+ESAEACAMALSQAAEAITSGQNEV
Sbjct: 361 IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGQNEV 420

Query: 421 SDAVSEAGIIILPRPIGANEEASTNPVNASEPHSFSEKSNRLGELRSDLFDPSDSWYDAP 480
           SDAVSEAGIIILPRP  ANEEASTN  N SEPHS SEKSN+ G LRSDLFDP DSWYD+P
Sbjct: 421 SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDPDDSWYDSP 480

Query: 481 PEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSADGRS 540
           PEGFSLTLSSFATMWMAIFAW+TSSSLAYIYGKD+KFHEEF YIDG+EYPRKIVSADGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS 540

Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGHLLDTMTFLDALPAFRMKQWQVIV 600
           SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMG LLDTMTFLDALPAFR KQWQVIV
Sbjct: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGCLLDTMTFLDALPAFRTKQWQVIV 600

Query: 601 LLFIEALSVCRIPSLASHMSNSRSLYHKVLDRAQIRSDEYEVMKDHILPLGRTAQFSGNN 660
           LLFIEALSVCRIPSL S +S+SRSL+HKVLDRAQI+SDEYE +KDHILPLGRTAQF G N
Sbjct: 601 LLFIEALSVCRIPSLDSQVSHSRSLFHKVLDRAQIQSDEYETLKDHILPLGRTAQFPGEN 660

Query: 661 DA 663
           DA
Sbjct: 661 DA 662

BLAST of CaUC07G137190 vs. ExPASy Swiss-Prot
Match: F4K1B1 (Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Arabidopsis thaliana OX=3702 GN=At5g26760 PE=2 SV=1)

HSP 1 Score: 427.6 bits (1098), Expect = 2.6e-118
Identity = 308/768 (40.10%), Positives = 416/768 (54.17%), Query Frame = 0

Query: 1   MAK-NQSVLIKDTVYKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGFPL 60
           MAK N+++ I D V+KLQL +LE   ++NQLFAA  LMSRSDYEDVVTER+IA LCG+ L
Sbjct: 1   MAKDNEAIAINDAVHKLQLYMLENTTDQNQLFAARKLMSRSDYEDVVTERAIAKLCGYTL 60

Query: 61  CHSNLSSDNTRKGRYRVSLKEHKVYDLQETYKYCSSTCLINSRAFSARLQDERCSVMNPE 120
           C   L SD +R+G+YR+SLK+HKVYDLQET K+CS+ CLI+S+ FS  LQ+ R    +  
Sbjct: 61  CQRFLPSDVSRRGKYRISLKDHKVYDLQETSKFCSAGCLIDSKTFSGSLQEARTLEFDSV 120

Query: 121 KLEETLRLFENLSLDSKENAGNIRDLGLE---IQEKIDSNIGEVPIEDWMGPSNAIEGYV 180
           KL E L LF + SL+ K +    +DL L    I+E       E+ +E WMGPSNA+EGYV
Sbjct: 121 KLNEILDLFGD-SLEVKGSLDVNKDLDLSKLMIKENFGVRGEELSLEKWMGPSNAVEGYV 180

Query: 181 PHRDHKIMTLPSKDGKESKDGSKAKNKPLGGGKDFFSDLSFTSTIITDEEYSVSKISSGL 240
           P    K     S D K +   ++ K+           ++ FTST+I  +  SVSK+    
Sbjct: 181 PFDRSK----SSNDSKATTQSNQEKH-----------EMDFTSTVIMPDVNSVSKLPPQT 240

Query: 241 KEMAFDTNSKTQTGEFCVKD-------------------------------SNEQFTILE 300
           K+ +    S    G+  +K+                               + E+ T+L 
Sbjct: 241 KQASTVVESVDGKGKTVLKEQTVVPPTKKVSRFRREKEKEKKTFGVDGMGCAQEKTTVL- 300

Query: 301 TPHAPAPTKNSIER--KARGSKERTKVSA------------------------------- 360
            P       N IE+  K  G  E    S+                               
Sbjct: 301 -PRKILSFCNEIEKDFKNFGFDEMGLASSAMMSDGYGVEYSVSKQPQCSMEDSLSCKLKG 360

Query: 361 ---TKESTNNLSDAPSTSNQSNTNF-------NLVTEEPRGGSND--------------- 420
              T +  N LS + S SN   +          +++ E    S +               
Sbjct: 361 DLQTLDGKNTLSGSSSGSNTKGSKTKPEKSRKKIISVEYHANSYEDGEEILAAESYERHK 420

Query: 421 ----LSGTEI--KSSLKQPGKKNLHRSVTWADEKTDETSIINLPEVREMGKTKECSRITS 480
                S +EI  KS LK  G K L RSVTWAD+        +L EVR        S    
Sbjct: 421 AQDVCSSSEIVTKSCLKISGSKKLSRSVTWADQNDGRG---DLCEVRNNDNAAGPSL--- 480

Query: 481 NLVNSDNDDED---LLRLESAEACAMALSQAAEAITSGQNEVSDAVSEAGIIILPRPIGA 540
               S ND ED   L RL  AEA A ALSQAAEA++SG ++ SDA ++AGII+LP     
Sbjct: 481 ----SSNDIEDVNSLSRLALAEALATALSQAAEAVSSGNSDASDATAKAGIILLPSTHQL 540

Query: 541 NEEA----STNPVNASEPHSFSEKSNRLGELRSDLFDPSDSWYDAPPEGFSLTLSSFATM 600
           +EE     S   +   EP +  +  N+ G   SDLFD   SW+D PPEGF+LTLS+FA M
Sbjct: 541 DEEVTEEHSEEEMTEEEP-TLLKWPNKPGIPDSDLFDRDQSWFDGPPEGFNLTLSNFAVM 600

Query: 601 WMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSADGRSSEIKQTLAGCLTRS 660
           W ++F W++SSSLAYIYGK++  HEEFL ++GKEYPR+I+  DG SSEIKQT+AGCL R+
Sbjct: 601 WDSLFGWVSSSSLAYIYGKEESAHEEFLLVNGKEYPRRIIMVDGLSSEIKQTIAGCLARA 660

Query: 661 IPGLASELKLSTPISSLEHGMGHLLDTMTFLDALPAFRMKQWQVIVLLFIEALSVCRIPS 663
           +P + + L+L   IS LE G+G LL+TM+   A+P+FR+K+W VIVLLF++ALSV RIP 
Sbjct: 661 LPRVVTHLRLPIAISELEKGLGSLLETMSLTGAVPSFRVKEWLVIVLLFLDALSVSRIPR 720

BLAST of CaUC07G137190 vs. ExPASy Swiss-Prot
Match: A2Y040 (Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Oryza sativa subsp. indica OX=39946 GN=OsI_18345 PE=3 SV=1)

HSP 1 Score: 361.7 bits (927), Expect = 1.8e-98
Identity = 266/735 (36.19%), Positives = 388/735 (52.79%), Query Frame = 0

Query: 2   AKNQSVLIKDTVYKLQLALLEG--IQNENQLFAAGSLMSRSDYEDVVTERSIANLCGFPL 61
           A+ +   +   V+++Q+AL +G     E  L AA SL+S  DY DVVTERSIA+ CG+P 
Sbjct: 11  ARMKPTTVASAVHRVQMALYDGAAASREPLLRAAASLLSGPDYADVVTERSIADACGYPA 70

Query: 62  CHSNLSSDNTR---KGRYRVSLKEHKVYDLQETYKYCSSTCLINSRAFSARLQDERCSVM 121
           C + L S++ R     R+R+SL+EH+VYDL+E  K+CS  CL+ S AF A L  +R   +
Sbjct: 71  CPNPLPSEDARGKAAPRFRISLREHRVYDLEEARKFCSERCLVASAAFGASLPPDRPFGV 130

Query: 122 NPEKLEETLRLFEN-----------LSLDSKENAGNIRD-LGLEIQEKIDSNIGEVPIED 181
           +P++L+  + LFE            L   +  +   + +   +EI EK  +  GEV +++
Sbjct: 131 SPDRLDALVALFEGGGGGGGDGGLALGFGASGDGKEVEEGRKVEIMEKEAAGTGEVTLQE 190

Query: 182 WMGPSNAIEGYVPHRDHKIMTLPSKDGKES---------------------------KDG 241
           W+GPS+AIEGYVP RD +++  P K+ K++                            + 
Sbjct: 191 WIGPSDAIEGYVPRRD-RVVGGPKKEAKQNDACSAEQSSNINVDSRNASSGESGMVLTEN 250

Query: 242 SKAKNK-----PLGGGKDFFSDLSFTSTIITD--------------EEYSVSKISSGLKE 301
           +KAK K     PL   K    D    S+ I+D              ++   +K + G   
Sbjct: 251 TKAKKKEATKTPLKMFKQ-DEDNDMLSSCISDSIVKQLEDVVLEEKKDKKKNKAAKGTSR 310

Query: 302 MAFDTNSKTQTGEFCVKDSNE---QFTILETPHAPAPTKNSIERKARGSKERTKVSATKE 361
           +     +K   G    +D +E     TI+   H        ++  A G    +      E
Sbjct: 311 VGKSKPAKRPVG----RDGHEVDFTSTIIMGDHG----SEMMDHGALGQYNFSSSILANE 370

Query: 362 STNNLSDAPSTSNQSNTN------FNLVTEEPRGGSNDLSGTEIKSSLKQPGKKNLHRSV 421
             ++   A   S Q+ T        N V       S+D     ++SSLK  G KN  RSV
Sbjct: 371 QPSSSQYAAIDSVQAYTEELDELFSNAVNIAKDETSDDSGRCTLRSSLKAVGSKNAGRSV 430

Query: 422 TWADEKTDETSIINLPEVREMGKTKECSR-ITSNLVNSDNDDEDLLRLESAEACAMALSQ 481
            WADE                G   E SR   S+   S    +  +R ESAEACA AL +
Sbjct: 431 KWADEN---------------GSVLETSRAFVSHSSKSQESMDSSVRRESAEACAAALIE 490

Query: 482 AAEAITSGQNEVSDAVSEAGIIILPRPIGANEEAS--TNPVNASEPHSFS------EKSN 541
           AAEAI+SG +EV DAVS+AGIIILP  +   +  +   N  +A E   F       +   
Sbjct: 491 AAEAISSGTSEVEDAVSKAGIIILPDMVNQQQYNNDYDNDKDAGENEIFEIDRGVVKWPK 550

Query: 542 RLGELRSDLFDPSDSWYDAPPEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEE 601
           +   L +D+FD  DSW+D PPEGFSLTLSSFATMW A+F W++ SSLAY+YG D+   E+
Sbjct: 551 KTVLLDTDMFDVDDSWHDTPPEGFSLTLSSFATMWAALFGWVSRSSLAYVYGLDESSMED 610

Query: 602 FLYIDGKEYPRKIVSADGRSSEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGHLLD 656
            L   G+E P+K V  DG SSEI++ L  C+  ++P L S L++  P+S LE  +G+LLD
Sbjct: 611 LLIAGGRECPQKRVLNDGHSSEIRRALDTCVCNALPVLVSNLRMQIPVSKLEITLGYLLD 670

BLAST of CaUC07G137190 vs. ExPASy Swiss-Prot
Match: Q6AVZ9 (Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Oryza sativa subsp. japonica OX=39947 GN=Os05g0134300 PE=3 SV=1)

HSP 1 Score: 360.1 bits (923), Expect = 5.2e-98
Identity = 253/732 (34.56%), Positives = 385/732 (52.60%), Query Frame = 0

Query: 2   AKNQSVLIKDTVYKLQLALLEG--IQNENQLFAAGSLMSRSDYEDVVTERSIANLCGFPL 61
           A+ +   +   V+++Q+AL +G     E  L AA SL+S  DY DVVTERSIA+ CG+P 
Sbjct: 11  ARMKPTTVASAVHRVQMALYDGAAASREPLLRAAASLLSGPDYADVVTERSIADACGYPA 70

Query: 62  CHSNLSSDNTR---KGRYRVSLKEHKVYDLQETYKYCSSTCLINSRAFSARLQDERCSVM 121
           C + L S++ R     R+R+SL+EH+VYDL+E  K+CS  CL+ S AF A L  +R   +
Sbjct: 71  CPNPLPSEDARGKAAPRFRISLREHRVYDLEEARKFCSERCLVASAAFGASLPPDRPFGV 130

Query: 122 NPEKLEETLRLFEN-----------LSLDSKENAGNIRD-LGLEIQEKIDSNIGEVPIED 181
           +P++L+  + LFE            L   +  +   + +   +EI EK  +  GEV +++
Sbjct: 131 SPDRLDALVALFEGGGGGGDDGGLALGFGASGDGKEVEEGRKVEIMEKEAAGTGEVTLQE 190

Query: 182 WMGPSNAIEGYVPHRDHKIMTLPSKDGKESKDGSKAKNKPLGGGKDFFSDLSFTSTIITD 241
           W+GPS+AIEGYVP RD +++  P K+ K++ D   A+           +    +  ++T+
Sbjct: 191 WIGPSDAIEGYVPRRD-RVVGGPKKEAKQN-DACSAEQSSNINVDSRNASSGESGMVLTE 250

Query: 242 EEYSVSKISSGLKEMAFDTNSKTQTGEFCVKDSNEQFTILETPHAPAPTKNSIERKARGS 301
              +  K ++      F  +        C+ DS     I++        +   ++K + +
Sbjct: 251 NTKAKKKEATKTPLKMFKQDEDNDMLSSCISDS-----IVKQLEDVVLEEKKDKKKNKAA 310

Query: 302 KERTKVSATK----------------------ESTNNLSDAPSTSNQSNTNFNLVTEEPR 361
           K  ++V  +K                      +  + + D  +    + ++  L  E+P 
Sbjct: 311 KGTSRVGKSKPAKRPVGRDGHEVDFTSTIIMGDRGSEMMDHGALGQYNFSSSILANEQPS 370

Query: 362 GG------------------------------SNDLSGTEIKSSLKQPGKKNLHRSVTWA 421
                                           S+D     ++SSLK  G KN   SV WA
Sbjct: 371 SSQYAAIDSVQAYTEELDELFSNAVNIAKDETSDDSGRCTLRSSLKAVGSKNAGHSVKWA 430

Query: 422 DEKTDETSIINLPEVREMGKTKECSR-ITSNLVNSDNDDEDLLRLESAEACAMALSQAAE 481
           DE                G   E SR   S+   S    +  +R ESAEACA AL +AAE
Sbjct: 431 DEN---------------GSVLETSRAFVSHSSKSQESMDSSVRRESAEACAAALIEAAE 490

Query: 482 AITSGQNEVSDAVSEAGIIILPRPIGANEEAS--TNPVNASEPHSFS------EKSNRLG 541
           AI+SG +EV DAVS+AGIIILP  +   +  +   N  +A E   F       +   +  
Sbjct: 491 AISSGTSEVEDAVSKAGIIILPDMVNQQQYNNDYDNDKDAGENEIFEIDRGVVKWPKKTV 550

Query: 542 ELRSDLFDPSDSWYDAPPEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLY 601
            L +D+FD  DSW+D PPEGFSLTLSSFATMW A+F W++ SSLAY+YG D+   E+ L 
Sbjct: 551 LLDTDMFDVDDSWHDTPPEGFSLTLSSFATMWAALFGWVSRSSLAYVYGLDESSMEDLLI 610

Query: 602 IDGKEYPRKIVSADGRSSEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGHLLDTMT 656
             G+E P+K V  DG SSEI++ L  C+  ++P L S L++  P+S LE  +G+LLDTM+
Sbjct: 611 AGGRECPQKRVLNDGHSSEIRRALDTCVCNALPVLVSNLRMQIPVSKLEITLGYLLDTMS 670

BLAST of CaUC07G137190 vs. ExPASy Swiss-Prot
Match: Q8IXW5 (Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 OS=Homo sapiens OX=9606 GN=RPAP2 PE=1 SV=1)

HSP 1 Score: 65.9 bits (159), Expect = 2.0e-09
Identity = 95/391 (24.30%), Positives = 163/391 (41.69%), Query Frame = 0

Query: 20  LLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGFPLCHSNLSSDNTRKGRYRVSLK 79
           LLE    E  L   G  ++ + Y DVV ERSI  LCG+PLC   L      K +Y++S K
Sbjct: 65  LLEENITEEFLMECGRFITPAHYSDVVDERSIVKLCGYPLCQKKLGI--VPKQKYKISTK 124

Query: 80  EHKVYDLQETYKYCSSTCLINSRAFSARL--------QDERCSVMNPEKLEETLRLFENL 139
            +KVYD+ E   +CS+ C   S+ F A++        ++ER       K E++    E +
Sbjct: 125 TNKVYDITERKSFCSNFCYQASKFFEAQIPKTPVWVREEERHPDFQLLKEEQSGHSGEEV 184

Query: 140 SLDSKENAGNIRDLGLEIQEKIDSNIGEVPIEDWMG-----PSNAIEGYVPHRDHKIMTL 199
            L SK    +  D     +++ +S+      +          S+ + G  P+  +    L
Sbjct: 185 QLCSKAIKTSDIDNPSHFEKQYESSSSSTHSDSSSDNEQDFVSSILPGNRPNSTNIRPQL 244

Query: 200 PSKDGKESKDGSKAKNKPLGGGKDFFSDLSFTSTIITDEEYSVSKISSGLKEMAFDTNSK 259
             K   + K G KA +K                    D+E +V  ++  L +   D+  K
Sbjct: 245 HQKSIMKKKAGHKANSKH------------------KDKEQTVVDVTEQLGDCKLDSQEK 304

Query: 260 TQTGEFCVKDSNEQFTILET-PHAPAPTKNSIERKARGSKERTKVSATKESTNNLSDAPS 319
             T E  ++  N Q +   T P     ++NS    +R   E T V  +K+S  +     +
Sbjct: 305 DATCELPLQKVNTQSSSNSTLPERLKASENSESEYSR--SEITLVGISKKSAEHFKRKFA 364

Query: 320 TSNQSNTNFNLVTEEPRGGSNDLSGTEIKSSLKQPGKKNLHRSV--TWADEKTDETSIIN 379
            SNQ + + +             S  ++     + GK+NL + +  T  + KT+ET    
Sbjct: 365 KSNQVSRSVS-------------SSVQV---CPEVGKRNLLKVLKETLIEWKTEET---- 413

Query: 380 LPEVREMGKTKECSRITSNLVNSDNDDEDLL 395
           L  +        C +  ++LV  + D++D++
Sbjct: 425 LRFLYGQNYASVCLKPEASLVKEELDEDDII 413

BLAST of CaUC07G137190 vs. ExPASy Swiss-Prot
Match: Q5RA37 (Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 OS=Pongo abelii OX=9601 GN=RPAP2 PE=2 SV=1)

HSP 1 Score: 64.3 bits (155), Expect = 5.9e-09
Identity = 94/391 (24.04%), Positives = 162/391 (41.43%), Query Frame = 0

Query: 20  LLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGFPLCHSNLSSDNTRKGRYRVSLK 79
           LLE    E  L   G  ++ + Y DVV ERSI  LCG+PLC   L      K +Y++S K
Sbjct: 65  LLEENITEEFLMECGKFITPAHYSDVVDERSIVKLCGYPLCQKKLGI--VPKQKYKISTK 124

Query: 80  EHKVYDLQETYKYCSSTCLINSRAFSARL--------QDERCSVMNPEKLEETLRLFENL 139
            +KVYD+ E   +CS+ C   S+ F A++        ++ER       K +++    E +
Sbjct: 125 TNKVYDITERKSFCSNFCYQASKFFEAQIPKTPVWVREEERHPDFQLLKEQQSGHSGEEV 184

Query: 140 SLDSKENAGNIRDLGLEIQEKIDSNIGEVPIEDWMG-----PSNAIEGYVPHRDHKIMTL 199
            L SK    +  D     +++ +S+      +          S+ + G  P+       L
Sbjct: 185 QLCSKAIKTSDIDNPSHFEKQYESSSSSTHSDSSSDNEQDFVSSILPGNRPNSTSIRPQL 244

Query: 200 PSKDGKESKDGSKAKNKPLGGGKDFFSDLSFTSTIITDEEYSVSKISSGLKEMAFDTNSK 259
             K   + K G KA +K                    D+E +V  ++  L +   D+  K
Sbjct: 245 HQKSIMKKKAGHKANSKH------------------KDKEQTVIDVTEQLGDCKLDSQEK 304

Query: 260 TQTGEFCVKDSNEQFTILET-PHAPAPTKNSIERKARGSKERTKVSATKESTNNLSDAPS 319
             T E  ++  N Q +   T P     ++NS    +R   E T V  +K+S  +     +
Sbjct: 305 DATCELPLQKVNTQSSSNSTLPERLKASENSESEYSR--SEITLVGISKKSAEHFKRKFA 364

Query: 320 TSNQSNTNFNLVTEEPRGGSNDLSGTEIKSSLKQPGKKNLHRSV--TWADEKTDETSIIN 379
            SNQ + + +             S  ++     + GK+NL + +  T  + KT+ET    
Sbjct: 365 KSNQVSRSVS-------------SSVQV---CPEVGKRNLLKILKETLIEWKTEET---- 413

Query: 380 LPEVREMGKTKECSRITSNLVNSDNDDEDLL 395
           L  +        C +  ++LV  + D++D++
Sbjct: 425 LRFLYGQNYASVCLKPEASLVKEELDEDDII 413

BLAST of CaUC07G137190 vs. ExPASy TrEMBL
Match: A0A0A0KVU3 (RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucumis sativus OX=3659 GN=Csa_4G009360 PE=3 SV=1)

HSP 1 Score: 1144.4 bits (2959), Expect = 0.0e+00
Identity = 586/662 (88.52%), Positives = 622/662 (93.96%), Query Frame = 0

Query: 1   MAKNQSVLIKDTVYKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGFPLC 60
           MAKNQSVLIKDTVYKLQLAL EGI+NENQLFAAGSLMSRSDYEDVVTERSIA+LCG+PLC
Sbjct: 1   MAKNQSVLIKDTVYKLQLALYEGIKNENQLFAAGSLMSRSDYEDVVTERSIADLCGYPLC 60

Query: 61  HSNLSSDNTRKGRYRVSLKEHKVYDLQETYKYCSSTCLINSRAFSARLQDERCSVMNPEK 120
           HSNL SDNTR+GRYR+SLKEHKVYDL+ETYKYCSS CLINSRAFS RLQDERCSVMNP+K
Sbjct: 61  HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPDK 120

Query: 121 LEETLRLFENLSLDSKENAGNIRDLGLEIQEKIDSNIGEVPIEDWMGPSNAIEGYVPHRD 180
           L+E L+LFEN+SLDSKEN GN  D GLEIQEKI+SNIGEVPIE+WMGPSNAIEGYVPHRD
Sbjct: 121 LKEILKLFENMSLDSKENMGNNCDSGLEIQEKIESNIGEVPIEEWMGPSNAIEGYVPHRD 180

Query: 181 HKIMTLPSKDGKESKDGSKAKNKPLGGGKDFFSDLSFTSTIITDEEYSVSKISSGLKEMA 240
           HK+MTL SKDGKESKDGSKAK KPLGGGKDFFSD S TSTIITDEEYSVSKISSGLKEMA
Sbjct: 181 HKVMTLHSKDGKESKDGSKAKIKPLGGGKDFFSDFSITSTIITDEEYSVSKISSGLKEMA 240

Query: 241 FDTNSKTQTGEFCVKDSNEQFTILETPHAPAPTKNSIERKARGSKERTKVSATKESTNNL 300
            DTNSK QTGEFC K+SN+QF ILETPHAPAP KNS+ RKARGSKERTKVSATKEST+NL
Sbjct: 241 LDTNSKNQTGEFCGKESNDQFAILETPHAPAPPKNSVGRKARGSKERTKVSATKESTDNL 300

Query: 301 SDAPSTSNQSNTNFNLVTEEPRGGSNDLSGTEIKSSLKQPGKKNLHRSVTWADEKTDETS 360
           SDAPSTS   +TNFNL+TEEPRGG NDLSGTE+KSSLK+PGKKNL RSVTWADEKTD+ S
Sbjct: 301 SDAPSTSKNRSTNFNLMTEEPRGGFNDLSGTELKSSLKKPGKKNLCRSVTWADEKTDDAS 360

Query: 361 IINLPEVREMGKTKECSRITSNLVNSDNDDEDLLRLESAEACAMALSQAAEAITSGQNEV 420
           I+NLPEV EMGKTKECSR TSNLVN DND+ED+LR+ESAEACAMALSQAAEAITSGQ+EV
Sbjct: 361 IMNLPEVGEMGKTKECSRTTSNLVNFDNDNEDILRVESAEACAMALSQAAEAITSGQSEV 420

Query: 421 SDAVSEAGIIILPRPIGANEEASTNPVNASEPHSFSEKSNRLGELRSDLFDPSDSWYDAP 480
           SDAVSEAGIIILP P  ANEEAST+PVNASEPHSFSEKSN+LG LRSDLFDPSDSWYDAP
Sbjct: 421 SDAVSEAGIIILPHPSDANEEASTDPVNASEPHSFSEKSNKLGVLRSDLFDPSDSWYDAP 480

Query: 481 PEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSADGRS 540
           PEGFSLTLSSFATMWMAIFAW+TSSSLAYIYGKDDKFHEEFLYIDGKEYP KIVSADGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFLYIDGKEYPSKIVSADGRS 540

Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGHLLDTMTFLDALPAFRMKQWQVIV 600
           SEIKQTLAGCLTR+IPGLASEL LSTPIS LE+GM HLLDTMTFLDALPAFRMKQWQVIV
Sbjct: 541 SEIKQTLAGCLTRAIPGLASELNLSTPISRLENGMAHLLDTMTFLDALPAFRMKQWQVIV 600

Query: 601 LLFIEALSVCRIPSLASHMSNSRSLYHKVLDRAQIRSDEYEVMKDHILPLGRTAQFSGNN 660
           LLFIEALSV RIPSLASHMS+SR+LYHKVLDRAQIRSDEYE+M+DHILPLGRTAQ S  N
Sbjct: 601 LLFIEALSVSRIPSLASHMSSSRNLYHKVLDRAQIRSDEYEIMRDHILPLGRTAQLSDEN 660

Query: 661 DA 663
           DA
Sbjct: 661 DA 662

BLAST of CaUC07G137190 vs. ExPASy TrEMBL
Match: A0A1S3BXZ9 (RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucumis melo OX=3656 GN=LOC103494620 PE=3 SV=1)

HSP 1 Score: 1127.9 bits (2916), Expect = 0.0e+00
Identity = 582/662 (87.92%), Positives = 614/662 (92.75%), Query Frame = 0

Query: 1   MAKNQSVLIKDTVYKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGFPLC 60
           MAKNQS LIKDTVYKLQLAL EGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCG+PLC
Sbjct: 1   MAKNQSALIKDTVYKLQLALYEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60

Query: 61  HSNLSSDNTRKGRYRVSLKEHKVYDLQETYKYCSSTCLINSRAFSARLQDERCSVMNPEK 120
           HSNL SDNTR+GRYR+SLKEHKVYDL+ETYKYCSS CLINSRAFS RLQDERCSVMNP K
Sbjct: 61  HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPAK 120

Query: 121 LEETLRLFENLSLDSKENAGNIRDLGLEIQEKIDSNIGEVPIEDWMGPSNAIEGYVPHRD 180
           L+E L+LFEN+SLDSKEN GN  D GLEIQEKI+S+IGEVPIE+WMGPSNAIEGYVPHRD
Sbjct: 121 LKEILKLFENMSLDSKENMGNNCDSGLEIQEKIESSIGEVPIEEWMGPSNAIEGYVPHRD 180

Query: 181 HKIMTLPSKDGKESKDGSKAKNKPLGGGKDFFSDLSFTSTIITDEEYSVSKISSGLKEMA 240
           HKIMTLPSKDGKESKDGS AK KPLGGGKDFFSD SFTSTIITDEEYSVSKISS LKEMA
Sbjct: 181 HKIMTLPSKDGKESKDGSTAKIKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSSLKEMA 240

Query: 241 FDTNSKTQTGEFCVKDSNEQFTILETPHAPAPTKNSIERKARGSKERTKVSATKESTNNL 300
            DTNSK QTGEFC K+SN+QFTILET HA AP KNS+  KARGSKERTKVSAT+ESTNNL
Sbjct: 241 LDTNSKIQTGEFCGKESNDQFTILETSHARAPPKNSVGHKARGSKERTKVSATEESTNNL 300

Query: 301 SDAPSTSNQSNTNFNLVTEEPRGGSNDLSGTEIKSSLKQPGKKNLHRSVTWADEKTDETS 360
           SDAPSTSN  +TNFNLVTEEP+GG NDL GTEIKSSLKQPGKKNL RSVTWADEK D+TS
Sbjct: 301 SDAPSTSNNRSTNFNLVTEEPKGGFNDLRGTEIKSSLKQPGKKNLRRSVTWADEKIDDTS 360

Query: 361 IINLPEVREMGKTKECSRITSNLVNSDNDDEDLLRLESAEACAMALSQAAEAITSGQNEV 420
            +NLPEV E GKTKECSRITSNLVN DND+EDL+R+ESAEACAMALSQAAEAITSGQ+EV
Sbjct: 361 -MNLPEVGEKGKTKECSRITSNLVNFDNDNEDLIRVESAEACAMALSQAAEAITSGQSEV 420

Query: 421 SDAVSEAGIIILPRPIGANEEASTNPVNASEPHSFSEKSNRLGELRSDLFDPSDSWYDAP 480
           S+AVSEAGIIILP P  ANEEAST PV ASEPHSFSEKSN+LG L SDLFDPSDSWYDAP
Sbjct: 421 SEAVSEAGIIILPHPSDANEEASTEPVKASEPHSFSEKSNKLGVLHSDLFDPSDSWYDAP 480

Query: 481 PEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSADGRS 540
           PEGFSLTLSSFATMWMAIFAW+TSSSLAYIYGKDDKFHEEF YIDGKEYP KIVSADGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFQYIDGKEYPSKIVSADGRS 540

Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGHLLDTMTFLDALPAFRMKQWQVIV 600
           SEIKQTLAGCLTR+IPGLASELKLSTPIS LE+GM HLLDTMTFLDALPAFRMKQWQVIV
Sbjct: 541 SEIKQTLAGCLTRAIPGLASELKLSTPISRLEYGMAHLLDTMTFLDALPAFRMKQWQVIV 600

Query: 601 LLFIEALSVCRIPSLASHMSNSRSLYHKVLDRAQIRSDEYEVMKDHILPLGRTAQFSGNN 660
           LLF+EALSVCRIPSLASHMS+SR+LYHKVLDRAQI+SDEYE+MKDHILPLG TAQ S  N
Sbjct: 601 LLFMEALSVCRIPSLASHMSSSRNLYHKVLDRAQIQSDEYEIMKDHILPLGLTAQLSVEN 660

Query: 661 DA 663
           DA
Sbjct: 661 DA 661

BLAST of CaUC07G137190 vs. ExPASy TrEMBL
Match: A0A5A7TQX7 (RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold46G002560 PE=3 SV=1)

HSP 1 Score: 1127.5 bits (2915), Expect = 0.0e+00
Identity = 581/662 (87.76%), Positives = 615/662 (92.90%), Query Frame = 0

Query: 1   MAKNQSVLIKDTVYKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGFPLC 60
           MAKNQS LIKDTVYKLQLAL EGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCG+PLC
Sbjct: 1   MAKNQSALIKDTVYKLQLALYEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60

Query: 61  HSNLSSDNTRKGRYRVSLKEHKVYDLQETYKYCSSTCLINSRAFSARLQDERCSVMNPEK 120
           HSNL SDNTR+GRYR+SLKEHKVYDL+ETYKYCSS CLINSRAFS RLQDERCSVMNP K
Sbjct: 61  HSNLPSDNTRRGRYRISLKEHKVYDLEETYKYCSSACLINSRAFSGRLQDERCSVMNPAK 120

Query: 121 LEETLRLFENLSLDSKENAGNIRDLGLEIQEKIDSNIGEVPIEDWMGPSNAIEGYVPHRD 180
           L++ L+LFEN+SLDSKEN GN  D GLEIQEKI+S+IGEVPIE+WMGPSNAIEGYVPHRD
Sbjct: 121 LKDILKLFENMSLDSKENVGNNCDSGLEIQEKIESSIGEVPIEEWMGPSNAIEGYVPHRD 180

Query: 181 HKIMTLPSKDGKESKDGSKAKNKPLGGGKDFFSDLSFTSTIITDEEYSVSKISSGLKEMA 240
           HKIMTLPSKDGKESKDGS AK KPLGGGKDFFSD SFTSTIITDEEYSVSKISS LKEMA
Sbjct: 181 HKIMTLPSKDGKESKDGSTAKIKPLGGGKDFFSDFSFTSTIITDEEYSVSKISSSLKEMA 240

Query: 241 FDTNSKTQTGEFCVKDSNEQFTILETPHAPAPTKNSIERKARGSKERTKVSATKESTNNL 300
            DTNSK QTGEFC K+SN+QFTILET HA AP KNS+  KARGSKERTKVSAT+ESTNNL
Sbjct: 241 LDTNSKIQTGEFCGKESNDQFTILETSHARAPPKNSVGHKARGSKERTKVSATEESTNNL 300

Query: 301 SDAPSTSNQSNTNFNLVTEEPRGGSNDLSGTEIKSSLKQPGKKNLHRSVTWADEKTDETS 360
           SDAPSTSN  +TNFNLVTEEP+GG NDL GTEIKSSLKQPGKKNL RSVTWADEK D+TS
Sbjct: 301 SDAPSTSNNRSTNFNLVTEEPKGGFNDLRGTEIKSSLKQPGKKNLRRSVTWADEKIDDTS 360

Query: 361 IINLPEVREMGKTKECSRITSNLVNSDNDDEDLLRLESAEACAMALSQAAEAITSGQNEV 420
            +NLPEV E GKTKECSRITSNLVN DND+EDLLR+ESAEACAMALSQAAEAITSGQ+EV
Sbjct: 361 -MNLPEVGEKGKTKECSRITSNLVNFDNDNEDLLRVESAEACAMALSQAAEAITSGQSEV 420

Query: 421 SDAVSEAGIIILPRPIGANEEASTNPVNASEPHSFSEKSNRLGELRSDLFDPSDSWYDAP 480
           S+AVSEAGIIILP P  ANEEAST+PV ASEPHSFSEKSN+LG L SDLFDPS+SWYDAP
Sbjct: 421 SEAVSEAGIIILPHPSDANEEASTDPVKASEPHSFSEKSNKLGVLHSDLFDPSESWYDAP 480

Query: 481 PEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSADGRS 540
           PEGFSLTLSSFATMWMAIFAW+TSSSLAYIYGKDDKFHEEF YIDGKEYP KIVSADGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWVTSSSLAYIYGKDDKFHEEFQYIDGKEYPSKIVSADGRS 540

Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGHLLDTMTFLDALPAFRMKQWQVIV 600
           SEIKQTLAGCLTR+IPGLASELKLSTP+S LE+GM HLLDTMTFLDALPAFRMKQWQVIV
Sbjct: 541 SEIKQTLAGCLTRAIPGLASELKLSTPVSRLEYGMAHLLDTMTFLDALPAFRMKQWQVIV 600

Query: 601 LLFIEALSVCRIPSLASHMSNSRSLYHKVLDRAQIRSDEYEVMKDHILPLGRTAQFSGNN 660
           LLFIEALSVCRIPSLASHMS+SR+LYHKVLDRAQI+SDEYE+MKDHILPLG TAQ S  N
Sbjct: 601 LLFIEALSVCRIPSLASHMSSSRNLYHKVLDRAQIQSDEYEIMKDHILPLGLTAQLSVEN 660

Query: 661 DA 663
           DA
Sbjct: 661 DA 661

BLAST of CaUC07G137190 vs. ExPASy TrEMBL
Match: A0A6J1IY57 (RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucurbita maxima OX=3661 GN=LOC111479539 PE=3 SV=1)

HSP 1 Score: 1111.7 bits (2874), Expect = 0.0e+00
Identity = 573/662 (86.56%), Positives = 608/662 (91.84%), Query Frame = 0

Query: 1   MAKNQSVLIKDTVYKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGFPLC 60
           MAKNQ++LIKDTVYKLQLALL+GI NENQLFAAGSLMSRSDYEDVVTERSIANLCG+PLC
Sbjct: 1   MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60

Query: 61  HSNLSSDNTRKGRYRVSLKEHKVYDLQETYKYCSSTCLINSRAFSARLQDERCSVMNPEK 120
            SNL SDNTRKGRYR+SLKEHKVYDL+ETYKYCSSTCLINSRAFS RLQDERCSVMNP K
Sbjct: 61  QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK 120

Query: 121 LEETLRLFENLSLDSKENAGNIRDLGLEIQEKIDSNIGEVPIEDWMGPSNAIEGYVPHRD 180
           L+E LRLFENLSLDSKEN  N  DLGLEIQEKI+S+IGEVPIE+WMGPSNAIEGYVPHR+
Sbjct: 121 LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIESSIGEVPIEEWMGPSNAIEGYVPHRN 180

Query: 181 HKIMTLPSKDGKESKDGSKAKNKPLGGGKDFFSDLSFTSTIITDEEYSVSKISSGLKEMA 240
           H IMTLPSKDGKE KDGSKAK K LG  KDFFSD SF ST+ITDEEYSVSKISSGLKEM 
Sbjct: 181 HNIMTLPSKDGKELKDGSKAKIKQLGVEKDFFSDFSFASTVITDEEYSVSKISSGLKEMT 240

Query: 241 FDTNSKTQTGEFCVKDSNEQFTILETPHAPAPTKNSIERKARGSKERTKVSATKESTNNL 300
           FDT SK QTGEFC K SNEQFTILETPH PAPTKNS+ RKARG+KERT VSAT ES NNL
Sbjct: 241 FDTKSKAQTGEFCGKQSNEQFTILETPHGPAPTKNSVGRKARGTKERTNVSATAESNNNL 300

Query: 301 SDAPSTSNQSNTNFNLVTEEPRGGSNDLSGTEIKSSLKQPGKKNLHRSVTWADEKTDETS 360
           SD+PSTSN  NTN N+ TEEP+GGSN+L+ T+IKSSLKQPGKKNL RSVTWAD KTDETS
Sbjct: 301 SDSPSTSNHCNTNCNITTEEPKGGSNELNETQIKSSLKQPGKKNLRRSVTWADAKTDETS 360

Query: 361 IINLPEVREMGKTKECSRITSNLVNSDNDDEDLLRLESAEACAMALSQAAEAITSGQNEV 420
           IINLPE REMGKTKECSR+TSNLVN+DN +ED+LR+ESAEACAMALSQAAEAITSGQNEV
Sbjct: 361 IINLPEDREMGKTKECSRMTSNLVNADNGNEDMLRVESAEACAMALSQAAEAITSGQNEV 420

Query: 421 SDAVSEAGIIILPRPIGANEEASTNPVNASEPHSFSEKSNRLGELRSDLFDPSDSWYDAP 480
           SDAVSEAGIIILPRP  ANEE STN  N SEP+S SEKSN+ G L SDLFDP DSWYD+P
Sbjct: 421 SDAVSEAGIIILPRPSDANEEVSTNGKNISEPYSSSEKSNKPGILHSDLFDPEDSWYDSP 480

Query: 481 PEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSADGRS 540
           PEGFSLTLSSFATMWMAIFAW+TSSSLAYIYGKD+KFHEEF YIDG+EYPRKIVSADGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS 540

Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGHLLDTMTFLDALPAFRMKQWQVIV 600
           SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMG LLDTMTFLDALPAFRMKQWQVIV
Sbjct: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGCLLDTMTFLDALPAFRMKQWQVIV 600

Query: 601 LLFIEALSVCRIPSLASHMSNSRSLYHKVLDRAQIRSDEYEVMKDHILPLGRTAQFSGNN 660
           LLFIEALSVCRIPSL S +SNSRSL+HKVLDRAQIRS+EYE +KDHILPLGRTAQFSG N
Sbjct: 601 LLFIEALSVCRIPSLDSQVSNSRSLFHKVLDRAQIRSNEYETLKDHILPLGRTAQFSGEN 660

Query: 661 DA 663
           DA
Sbjct: 661 DA 662

BLAST of CaUC07G137190 vs. ExPASy TrEMBL
Match: A0A6J1GWL9 (RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucurbita moschata OX=3662 GN=LOC111457827 PE=3 SV=1)

HSP 1 Score: 1110.9 bits (2872), Expect = 0.0e+00
Identity = 573/662 (86.56%), Positives = 607/662 (91.69%), Query Frame = 0

Query: 1   MAKNQSVLIKDTVYKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGFPLC 60
           MAKNQ++LIKDTVYKLQLALL+GI NENQLFAAGSLMSRSDYEDVVTERSIANLCG+PLC
Sbjct: 1   MAKNQTILIKDTVYKLQLALLDGIHNENQLFAAGSLMSRSDYEDVVTERSIANLCGYPLC 60

Query: 61  HSNLSSDNTRKGRYRVSLKEHKVYDLQETYKYCSSTCLINSRAFSARLQDERCSVMNPEK 120
            SNL SDNTRKGRYR+SLKEHKVYDL+ETYKYCSSTCLINSRAFS RLQDERCSVMNP K
Sbjct: 61  QSNLPSDNTRKGRYRISLKEHKVYDLEETYKYCSSTCLINSRAFSGRLQDERCSVMNPGK 120

Query: 121 LEETLRLFENLSLDSKENAGNIRDLGLEIQEKIDSNIGEVPIEDWMGPSNAIEGYVPHRD 180
           L+E LRLFENLSLDSKEN  N  DLGLEIQEKI+S+IGEVPIE+WMGPSNAIEGYVPHR+
Sbjct: 121 LKEILRLFENLSLDSKENTRNSCDLGLEIQEKIESSIGEVPIEEWMGPSNAIEGYVPHRN 180

Query: 181 HKIMTLPSKDGKESKDGSKAKNKPLGGGKDFFSDLSFTSTIITDEEYSVSKISSGLKEMA 240
           H IMT P KDGKE KDGSKAK K LG GKDFFSD SF +T+ITDEEYSVSKISSGLKEM 
Sbjct: 181 HNIMTSPRKDGKELKDGSKAKIKQLGVGKDFFSDFSFATTVITDEEYSVSKISSGLKEMT 240

Query: 241 FDTNSKTQTGEFCVKDSNEQFTILETPHAPAPTKNSIERKARGSKERTKVSATKESTNNL 300
           FDT SK QTGEFC K SNEQFTILETPH PAPTKNS+ RKARGSKERT VSAT ES NNL
Sbjct: 241 FDTKSKAQTGEFCGKQSNEQFTILETPHGPAPTKNSVGRKARGSKERTNVSATAESNNNL 300

Query: 301 SDAPSTSNQSNTNFNLVTEEPRGGSNDLSGTEIKSSLKQPGKKNLHRSVTWADEKTDETS 360
           SDAPSTSN  +TN N+ TEEP GGSNDL+ T+IKSSLKQPGKKNL RSVTWAD KTDETS
Sbjct: 301 SDAPSTSNHCSTNCNITTEEPNGGSNDLNETQIKSSLKQPGKKNLRRSVTWADAKTDETS 360

Query: 361 IINLPEVREMGKTKECSRITSNLVNSDNDDEDLLRLESAEACAMALSQAAEAITSGQNEV 420
           IINLPE REMGKTKECSR+TSNLVN+DN +ED+LR+ESAEACAMALSQAAEAITSG+NEV
Sbjct: 361 IINLPEDREMGKTKECSRMTSNLVNADNGNEDILRVESAEACAMALSQAAEAITSGKNEV 420

Query: 421 SDAVSEAGIIILPRPIGANEEASTNPVNASEPHSFSEKSNRLGELRSDLFDPSDSWYDAP 480
           SDAVSEAGIIILPRP  ANEEASTN  N SEPHS SEKSN+ G LRSDLFDP+DSWYD+P
Sbjct: 421 SDAVSEAGIIILPRPSDANEEASTNGENISEPHSSSEKSNKPGILRSDLFDPNDSWYDSP 480

Query: 481 PEGFSLTLSSFATMWMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSADGRS 540
           PEGFSLTLSSFATMWMAIFAW+TSSSLAYIYGKD+KFHEEF YIDG+EYPRKIVSADGRS
Sbjct: 481 PEGFSLTLSSFATMWMAIFAWMTSSSLAYIYGKDEKFHEEFQYIDGREYPRKIVSADGRS 540

Query: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGHLLDTMTFLDALPAFRMKQWQVIV 600
           SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMG LLDTMTFLDALPAFR KQWQVIV
Sbjct: 541 SEIKQTLAGCLTRSIPGLASELKLSTPISSLEHGMGCLLDTMTFLDALPAFRTKQWQVIV 600

Query: 601 LLFIEALSVCRIPSLASHMSNSRSLYHKVLDRAQIRSDEYEVMKDHILPLGRTAQFSGNN 660
           LLFIEALSVCRIPSL S +S+SRSL+HKVLDRAQIRSDEYE +KDHILPLGRTAQF G N
Sbjct: 601 LLFIEALSVCRIPSLDSQVSHSRSLFHKVLDRAQIRSDEYETLKDHILPLGRTAQFPGEN 660

Query: 661 DA 663
           DA
Sbjct: 661 DA 662

BLAST of CaUC07G137190 vs. TAIR 10
Match: AT5G26760.2 (unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF408 (InterPro:IPR007308); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 427.6 bits (1098), Expect = 1.9e-119
Identity = 308/768 (40.10%), Positives = 416/768 (54.17%), Query Frame = 0

Query: 1   MAK-NQSVLIKDTVYKLQLALLEGIQNENQLFAAGSLMSRSDYEDVVTERSIANLCGFPL 60
           MAK N+++ I D V+KLQL +LE   ++NQLFAA  LMSRSDYEDVVTER+IA LCG+ L
Sbjct: 1   MAKDNEAIAINDAVHKLQLYMLENTTDQNQLFAARKLMSRSDYEDVVTERAIAKLCGYTL 60

Query: 61  CHSNLSSDNTRKGRYRVSLKEHKVYDLQETYKYCSSTCLINSRAFSARLQDERCSVMNPE 120
           C   L SD +R+G+YR+SLK+HKVYDLQET K+CS+ CLI+S+ FS  LQ+ R    +  
Sbjct: 61  CQRFLPSDVSRRGKYRISLKDHKVYDLQETSKFCSAGCLIDSKTFSGSLQEARTLEFDSV 120

Query: 121 KLEETLRLFENLSLDSKENAGNIRDLGLE---IQEKIDSNIGEVPIEDWMGPSNAIEGYV 180
           KL E L LF + SL+ K +    +DL L    I+E       E+ +E WMGPSNA+EGYV
Sbjct: 121 KLNEILDLFGD-SLEVKGSLDVNKDLDLSKLMIKENFGVRGEELSLEKWMGPSNAVEGYV 180

Query: 181 PHRDHKIMTLPSKDGKESKDGSKAKNKPLGGGKDFFSDLSFTSTIITDEEYSVSKISSGL 240
           P    K     S D K +   ++ K+           ++ FTST+I  +  SVSK+    
Sbjct: 181 PFDRSK----SSNDSKATTQSNQEKH-----------EMDFTSTVIMPDVNSVSKLPPQT 240

Query: 241 KEMAFDTNSKTQTGEFCVKD-------------------------------SNEQFTILE 300
           K+ +    S    G+  +K+                               + E+ T+L 
Sbjct: 241 KQASTVVESVDGKGKTVLKEQTVVPPTKKVSRFRREKEKEKKTFGVDGMGCAQEKTTVL- 300

Query: 301 TPHAPAPTKNSIER--KARGSKERTKVSA------------------------------- 360
            P       N IE+  K  G  E    S+                               
Sbjct: 301 -PRKILSFCNEIEKDFKNFGFDEMGLASSAMMSDGYGVEYSVSKQPQCSMEDSLSCKLKG 360

Query: 361 ---TKESTNNLSDAPSTSNQSNTNF-------NLVTEEPRGGSND--------------- 420
              T +  N LS + S SN   +          +++ E    S +               
Sbjct: 361 DLQTLDGKNTLSGSSSGSNTKGSKTKPEKSRKKIISVEYHANSYEDGEEILAAESYERHK 420

Query: 421 ----LSGTEI--KSSLKQPGKKNLHRSVTWADEKTDETSIINLPEVREMGKTKECSRITS 480
                S +EI  KS LK  G K L RSVTWAD+        +L EVR        S    
Sbjct: 421 AQDVCSSSEIVTKSCLKISGSKKLSRSVTWADQNDGRG---DLCEVRNNDNAAGPSL--- 480

Query: 481 NLVNSDNDDED---LLRLESAEACAMALSQAAEAITSGQNEVSDAVSEAGIIILPRPIGA 540
               S ND ED   L RL  AEA A ALSQAAEA++SG ++ SDA ++AGII+LP     
Sbjct: 481 ----SSNDIEDVNSLSRLALAEALATALSQAAEAVSSGNSDASDATAKAGIILLPSTHQL 540

Query: 541 NEEA----STNPVNASEPHSFSEKSNRLGELRSDLFDPSDSWYDAPPEGFSLTLSSFATM 600
           +EE     S   +   EP +  +  N+ G   SDLFD   SW+D PPEGF+LTLS+FA M
Sbjct: 541 DEEVTEEHSEEEMTEEEP-TLLKWPNKPGIPDSDLFDRDQSWFDGPPEGFNLTLSNFAVM 600

Query: 601 WMAIFAWITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSADGRSSEIKQTLAGCLTRS 660
           W ++F W++SSSLAYIYGK++  HEEFL ++GKEYPR+I+  DG SSEIKQT+AGCL R+
Sbjct: 601 WDSLFGWVSSSSLAYIYGKEESAHEEFLLVNGKEYPRRIIMVDGLSSEIKQTIAGCLARA 660

Query: 661 IPGLASELKLSTPISSLEHGMGHLLDTMTFLDALPAFRMKQWQVIVLLFIEALSVCRIPS 663
           +P + + L+L   IS LE G+G LL+TM+   A+P+FR+K+W VIVLLF++ALSV RIP 
Sbjct: 661 LPRVVTHLRLPIAISELEKGLGSLLETMSLTGAVPSFRVKEWLVIVLLFLDALSVSRIPR 720

BLAST of CaUC07G137190 vs. TAIR 10
Match: AT5G26760.1 (unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 279.6 bits (714), Expect = 6.3e-75
Identity = 180/401 (44.89%), Positives = 247/401 (61.60%), Query Frame = 0

Query: 274 KNSIERKARGSKERTKVSATKESTNNLSDAPSTSNQSNTNFNLVTEE--PRGGSNDL-SG 333
           KN++   + GS  +   +  ++S   +      +N       ++  E   R  + D+ S 
Sbjct: 45  KNTLSGSSSGSNTKGSKTKPEKSRKKIISVEYHANSYEDGEEILAAESYERHKAQDVCSS 104

Query: 334 TEI--KSSLKQPGKKNLHRSVTWADEKTDETSIINLPEVREMGKTKECSRITSNLVNSDN 393
           +EI  KS LK  G K L RSVTWAD+        +L EVR        S        S N
Sbjct: 105 SEIVTKSCLKISGSKKLSRSVTWADQNDGRG---DLCEVRNNDNAAGPSL-------SSN 164

Query: 394 DDED---LLRLESAEACAMALSQAAEAITSGQNEVSDAVSEAGIIILPRPIGANEEA--- 453
           D ED   L RL  AEA A ALSQAAEA++SG ++ SDA ++AGII+LP     +EE    
Sbjct: 165 DIEDVNSLSRLALAEALATALSQAAEAVSSGNSDASDATAKAGIILLPSTHQLDEEVTEE 224

Query: 454 -STNPVNASEPHSFSEKSNRLGELRSDLFDPSDSWYDAPPEGFSLTLSSFATMWMAIFAW 513
            S   +   EP +  +  N+ G   SDLFD   SW+D PPEGF+LTLS+FA MW ++F W
Sbjct: 225 HSEEEMTEEEP-TLLKWPNKPGIPDSDLFDRDQSWFDGPPEGFNLTLSNFAVMWDSLFGW 284

Query: 514 ITSSSLAYIYGKDDKFHEEFLYIDGKEYPRKIVSADGRSSEIKQTLAGCLTRSIPGLASE 573
           ++SSSLAYIYGK++  HEEFL ++GKEYPR+I+  DG SSEIKQT+AGCL R++P + + 
Sbjct: 285 VSSSSLAYIYGKEESAHEEFLLVNGKEYPRRIIMVDGLSSEIKQTIAGCLARALPRVVTH 344

Query: 574 LKLSTPISSLEHGMGHLLDTMTFLDALPAFRMKQWQVIVLLFIEALSVCRIPSLASHMSN 633
           L+L   IS LE G+G LL+TM+   A+P+FR+K+W VIVLLF++ALSV RIP +A ++SN
Sbjct: 345 LRLPIAISELEKGLGSLLETMSLTGAVPSFRVKEWLVIVLLFLDALSVSRIPRIAPYISN 404

Query: 634 SRSLYHKVLDRAQIRSDEYEVMKDHILPLGRTAQFSGNNDA 663
                 K+L+ + I ++EYE MKD +LPLGR  QF+  + A
Sbjct: 405 R----DKILEGSGIGNEEYETMKDILLPLGRVPQFATRSGA 430

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_038893419.1	0.0e+00	92.90	putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Benincasa h...
XP_031739958.1	0.0e+00	88.52	putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [Cucumis sat...
XP_008454119.1	0.0e+00	87.92	PREDICTED: putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog [...
KAA0044516.1	0.0e+00	87.76	putative RNA polymerase II subunit B1 CTD phosphatase RPAP2-like protein [Cucumi...
KAG6581990.1	0.0e+00	86.86	putative RNA polymerase II subunit B1 CTD phosphatase RPAP2-like protein, partia...

Match Name	E-value	Identity	Description
F4K1B1	2.6e-118	40.10	Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Arabidops...
A2Y040	1.8e-98	36.19	Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Oryza sat...
Q6AVZ9	5.2e-98	34.56	Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Oryza sat...
Q8IXW5	2.0e-09	24.30	Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 OS=Homo sapiens OX=9...
Q5RA37	5.9e-09	24.04	Putative RNA polymerase II subunit B1 CTD phosphatase RPAP2 OS=Pongo abelii OX=9...

Match Name	E-value	Identity	Description
A0A0A0KVU3	0.0e+00	88.52	RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucumis sativus OX...
A0A1S3BXZ9	0.0e+00	87.92	RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucumis melo OX=36...
A0A5A7TQX7	0.0e+00	87.76	RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucumis melo var. ...
A0A6J1IY57	0.0e+00	86.56	RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucurbita maxima O...
A0A6J1GWL9	0.0e+00	86.56	RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog OS=Cucurbita moschata...

Match Name	E-value	Identity	Description
AT5G26760.2	1.9e-119	40.10	unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF408 ...
AT5G26760.1	6.3e-75	44.89	unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae -...

InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL246-FR2) v1
Date Performed: 2022-01-31
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR038534	Rtr1/RPAP2 domain superfamily	GENE3D	1.25.40.820		coord: 2..147; e-value: 1.1E-29; score: 105.3
IPR007308	Rtr1/RPAP2 domain	PFAM	PF04181	RPAP2_Rtr1	coord: 36..108; e-value: 3.0E-21; score: 75.5
IPR007308	Rtr1/RPAP2 domain	PROSITE	PS51479	ZF_RTR1	coord: 32..117; score: 19.475332
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 185..205
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 278..292
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 185..203
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 263..346
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 293..338
IPR039693	Rtr1/RPAP2	PANTHER	PTHR14732	UNCHARACTERIZED	coord: 6..657
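
The MobiDB-lite rows above list five overlapping disorder predictions. As a hedged illustration (not part of the original annotation pipeline), the sketch below merges those inclusive residue ranges into non-overlapping regions and counts the residues covered. The function name `merge_intervals` is a hypothetical helper, and ranges that are merely adjacent are deliberately left unmerged.

```python
def merge_intervals(intervals):
    """Merge overlapping inclusive (start, end) residue ranges."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Coordinates copied from the MOBIDB_LITE rows of this record.
disorder = [(185, 205), (278, 292), (185, 203), (263, 346), (293, 338)]
merged = merge_intervals(disorder)
covered = sum(end - start + 1 for start, end in merged)

if __name__ == "__main__":
    print(merged, covered)
```

For this record the five predictions collapse into two regions, both downstream of the Rtr1/RPAP2 domain hits at residues 2..147.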

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CaUC07G137190.1	CaUC07G137190.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0070940	dephosphorylation of RNA polymerase II C-terminal domain
cellular_component	GO:0005737	cytoplasm
cellular_component	GO:0005634	nucleus
molecular_function	GO:0046872	metal ion binding
molecular_function	GO:0043175	RNA polymerase core enzyme binding
molecular_function	GO:0008420	RNA polymerase II CTD heptapeptide repeat phosphatase activity
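
For downstream use, the GO assignments above can be grouped by ontology category. This is a minimal sketch under the assumption that each row is whitespace-separated as category, accession, then term name; `group_go_terms` is a hypothetical helper, not a function of any GO library.

```python
from collections import defaultdict

# Rows copied from the GO assignments of this record.
GO_ROWS = """\
biological_process GO:0070940 dephosphorylation of RNA polymerase II C-terminal domain
cellular_component GO:0005737 cytoplasm
cellular_component GO:0005634 nucleus
molecular_function GO:0046872 metal ion binding
molecular_function GO:0043175 RNA polymerase core enzyme binding
molecular_function GO:0008420 RNA polymerase II CTD heptapeptide repeat phosphatase activity
"""


def group_go_terms(text):
    """Map each GO category to a list of (accession, term name) pairs."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue
        # Split on any whitespace (spaces or tabs); the term name keeps its spaces.
        category, accession, name = line.split(None, 2)
        grouped[category].append((accession, name))
    return dict(grouped)


if __name__ == "__main__":
    for category, terms in group_go_terms(GO_ROWS).items():
        print(category, [accession for accession, _ in terms])
```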