ClCG04G001902 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG04G001902
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Ribonuclease H
Location: CG_Chr04: 6619834 .. 6621989 (-)
RNA-Seq Expression: ClCG04G001902
Synteny: ClCG04G001902
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGTTGTGTGTTGGGGCAGCATGACTCTACAGGAAGGAAAGAGCAAGCTGTTTATTATTTGAGCAAGAAGTTCACAAGTTACGAGTTGAAATACTCATTGTTAGAAAAAACATGTTGTGCCTTAGCGTGGACAGCTCAAAGATTGAGACAGTATATGTTGTATTACACAACATTGCTTATCTCAAAAATGGATCCTATAAAGTACATTTTTGAAAAGCCATCTTTGTCTGGTAGAATAGCCAAATGGCAGGTATTGCTATCAGAATATGATATTGTCTATGTCACCAAAAAGGCCATAAAAGGGAGTGCAATTGCCGATTGTCTAGCTGAATTACCTGTTGAAGACTACGAGCCAATGAAATTTGAATTTCCGGATGAAGACATCATGACGATATCCACATTGAATGCAACACAAGATCCTGAAACATGGACAATGTTGTTTGATGGAGCCACGAACGAAATGGGGCATGGTGTAGGGGCTATTTTGATATCTCCTGATGGGAAGTTGTATCCTTTAACTGCTAAATTATACTTCGATTGCACGAATAACATGGCTGAGTATGAAGCATGCAGTATGGGAATTCAGATGGCCTATGACATGAAGATAAAAAAACTGCAAGTTTTCGGGGATTCTCTTTTAGTAGTACATCAATTGAATGGGGAGTGGGAGACAAGAGACTCTAAATTGATTCCCTATAATAAGTATATCCGAGAATTGGCCCAAACTTTTGAGTCAATCACGTTCGAGCATGTCCCACGCGAAAATAACCAGGTTGCGGATGCATTGGCCACCTTGTCTGCCATGTTTAATGTGGCTCGCAATGAAGAAATTCAGCCTATAAGTATTGAGAAGCGTGAAACACCAGCATATTGCCTAAGTGTTGAGCAGGAACCTGACGGGAAGCCTTGGTATCATGACATTAAGCACTACATCACATGTCGAGAATATCCGCTAGGAGCTTCTGAAAACGATAAGCGCACCATTAGAAAGTTGGCCATGAGTTTCTTTTTGAATGGAGATGTGTTGTACAAGAGAAATTACGATATGACTCTCCTGAGATGTGTTGATGCTTTTGAAGCAAAGAGAATTCTAGAAGAAGTTCACGAGGGAGTTTGTGGGACGCATACAAATGGACACATGATGTCAAGACAAGCTTTACGTGCGGGATATTATTGGTTGACTATGGAGTCAGATTGCATTAAGTATGCAAGAAAATGCCACAAATGCCAGATATATGCAGATAAAGTGCATGCTCCATCTTCCCCCCTACACGTGTTAACGGCCTCGTGGCCTTTCTCTATGTGGGGAATGGATGTAATTGGGCCCATTGAGCCAAAGGTGTCAAATGGTCATCGATTCATTTTGGTAGCCATAAACTACTTTACTAAGTGGGTGGAGGCTGCCTCGTACAAGAGTGTTACCAAGCAAGCCGTTGTCAAGTTCATAAGAAAGGACATTATATGTTGGTATGGCTTGCCTAAACGTATCATCACTGATAATGGGAGGAACTTAAATAACAAATTGATGGAGGAATTGTGTACCCAGTTCAAAGTCAAACATTCTAACACTACTCCTTATCGCCCTAGGATGAATGGAGCAGTGGAGGCGGCAAATAAAAATATCAAGAGAATTATCCAGAAAATGACGGTCACATATAAAGATTGGCATGAAATGTTACCGTTTGCGCTACACGGTTACCGAACATCAGTTCGTACATCGACTGGGGCAACGCCATTTTCTTTGGTGTATGGTTTGGAGGCTGTCTTGCCTATTGAGGTCGAGGTGCCATCCCTCACGGTAATTCAAGAAGTAGAGTTAGAAGAAGCAGAATGGATTCAAACAAGGTATGAGCAGTTAAATCTGATAGAAGAAAGAAGAATGACAACATTATGTAGAGGACAATTGTATCAGAAGAAGGTAGCGCGTGCTACGACAGAAAAGTTCGACGTCGTTGTTTTCAGGAAGGAGATTTGGTGTTGAAAAGGATCTTGCCATCT
CAGAAGGATCATAGAGGAAAGTGGACCCCTAATTATGAGGGACCATATGTGGTAAAAAAGGCATTCACTGGAGGAGCTTTGATTTTGACAAATATGGATGGGACCGATTTGCCCAATCCGTTAAGTGTAGATTACGTGAAGAAGTACTATGCATAG

mRNA sequence

ATGGGTTGTGTGTTGGGGCAGCATGACTCTACAGGAAGGAAAGAGCAAGCTGTTTATTATTTGAGCAAGAAGTTCACAAGTTACGAGTTGAAATACTCATTGTTAGAAAAAACATGTTGTGCCTTAGCGTGGACAGCTCAAAGATTGAGACAGTATATGTTGTATTACACAACATTGCTTATCTCAAAAATGGATCCTATAAAGTACATTTTTGAAAAGCCATCTTTGTCTGGTAGAATAGCCAAATGGCAGGTATTGCTATCAGAATATGATATTGTCTATGTCACCAAAAAGGCCATAAAAGGGAGTGCAATTGCCGATTGTCTAGCTGAATTACCTGTTGAAGACTACGAGCCAATGAAATTTGAATTTCCGGATGAAGACATCATGACGATATCCACATTGAATGCAACACAAGATCCTGAAACATGGACAATGTTGTTTGATGGAGCCACGAACGAAATGGGGCATGGTGTAGGGGCTATTTTGATATCTCCTGATGGGAAGTTGTATCCTTTAACTGCTAAATTATACTTCGATTGCACGAATAACATGGCTGAGTATGAAGCATGCAGTATGGGAATTCAGATGGCCTATGACATGAAGATAAAAAAACTGCAAGTTTTCGGGGATTCTCTTTTAGTAGTACATCAATTGAATGGGGAGTGGGAGACAAGAGACTCTAAATTGATTCCCTATAATAAGTATATCCGAGAATTGGCCCAAACTTTTGAGTCAATCACGTTCGAGCATGTCCCACGCGAAAATAACCAGGTTGCGGATGCATTGGCCACCTTGTCTGCCATGTTTAATGTGGCTCGCAATGAAGAAATTCAGCCTATAAGTATTGAGAAGCGTGAAACACCAGCATATTGCCTAAGTGTTGAGCAGGAACCTGACGGGAAGCCTTGGTATCATGACATTAAGCACTACATCACATGTCGAGAATATCCGCTAGGAGCTTCTGAAAACGATAAGCGCACCATTAGAAAGTTGGCCATGAGTTTCTTTTTGAATGGAGATGTGTTGTACAAGAGAAATTACGATATGACTCTCCTGAGATGTGTTGATGCTTTTGAAGCAAAGAGAATTCTAGAAGAAGTTCACGAGGGAGTTTGTGGGACGCATACAAATGGACACATGATGTCAAGACAAGCTTTACGTGCGGGATATTATTGGTTGACTATGGAGTCAGATTGCATTAAGTATGCAAGAAAATGCCACAAATGCCAGATATATGCAGATAAAGTGCATGCTCCATCTTCCCCCCTACACGTGTTAACGGCCTCGTGGCCTTTCTCTATGTGGGGAATGGATGTAATTGGGCCCATTGAGCCAAAGGTGTCAAATGGTCATCGATTCATTTTGGTAGCCATAAACTACTTTACTAAGTGGGTGGAGGCTGCCTCGTACAAGAGTGTTACCAAGCAAGCCGTTGTCAAGTTCATAAGAAAGGACATTATATGTTGGTATGGCTTGCCTAAACGTATCATCACTGATAATGGGAGGAACTTAAATAACAAATTGATGGAGGAATTGTGTACCCAGTTCAAAGTCAAACATTCTAACACTACTCCTTATCGCCCTAGGATGAATGGAGCAGTGGAGGCGGCAAATAAAAATATCAAGAGAATTATCCAGAAAATGACGGTCACATATAAAGATTGGCATGAAATGTTACCGTTTGCGCTACACGGTTACCGAACATCAGTTCGTACATCGACTGGGGCAACGCCATTTTCTTTGGTGTATGGTTTGGAGGCTGTCTTGCCTATTGAGGTCGAGGTGCCATCCCTCACGGTAATTCAAGAAGTAGAGTTAGAAGAAGCAGAATGGATTCAAACAAGAAAAGTTCGACGTCGTTGTTTTCAGGAAGGAGATTTGGTGTTGAAAAGGATCTTGCCATCTCAGAAGGATCATAGAGGAAAGTGGACCCCTAATTATGAGGGACCATATGTGGTAAAAAAGGCATTCACTGGAGGAGCTTTGATTTTGACAAA
TATGGATGGGACCGATTTGCCCAATCCGTTAAGTGTAGATTACGTGAAGAAGTACTATGCATAG

Coding sequence (CDS)

ATGGGTTGTGTGTTGGGGCAGCATGACTCTACAGGAAGGAAAGAGCAAGCTGTTTATTATTTGAGCAAGAAGTTCACAAGTTACGAGTTGAAATACTCATTGTTAGAAAAAACATGTTGTGCCTTAGCGTGGACAGCTCAAAGATTGAGACAGTATATGTTGTATTACACAACATTGCTTATCTCAAAAATGGATCCTATAAAGTACATTTTTGAAAAGCCATCTTTGTCTGGTAGAATAGCCAAATGGCAGGTATTGCTATCAGAATATGATATTGTCTATGTCACCAAAAAGGCCATAAAAGGGAGTGCAATTGCCGATTGTCTAGCTGAATTACCTGTTGAAGACTACGAGCCAATGAAATTTGAATTTCCGGATGAAGACATCATGACGATATCCACATTGAATGCAACACAAGATCCTGAAACATGGACAATGTTGTTTGATGGAGCCACGAACGAAATGGGGCATGGTGTAGGGGCTATTTTGATATCTCCTGATGGGAAGTTGTATCCTTTAACTGCTAAATTATACTTCGATTGCACGAATAACATGGCTGAGTATGAAGCATGCAGTATGGGAATTCAGATGGCCTATGACATGAAGATAAAAAAACTGCAAGTTTTCGGGGATTCTCTTTTAGTAGTACATCAATTGAATGGGGAGTGGGAGACAAGAGACTCTAAATTGATTCCCTATAATAAGTATATCCGAGAATTGGCCCAAACTTTTGAGTCAATCACGTTCGAGCATGTCCCACGCGAAAATAACCAGGTTGCGGATGCATTGGCCACCTTGTCTGCCATGTTTAATGTGGCTCGCAATGAAGAAATTCAGCCTATAAGTATTGAGAAGCGTGAAACACCAGCATATTGCCTAAGTGTTGAGCAGGAACCTGACGGGAAGCCTTGGTATCATGACATTAAGCACTACATCACATGTCGAGAATATCCGCTAGGAGCTTCTGAAAACGATAAGCGCACCATTAGAAAGTTGGCCATGAGTTTCTTTTTGAATGGAGATGTGTTGTACAAGAGAAATTACGATATGACTCTCCTGAGATGTGTTGATGCTTTTGAAGCAAAGAGAATTCTAGAAGAAGTTCACGAGGGAGTTTGTGGGACGCATACAAATGGACACATGATGTCAAGACAAGCTTTACGTGCGGGATATTATTGGTTGACTATGGAGTCAGATTGCATTAAGTATGCAAGAAAATGCCACAAATGCCAGATATATGCAGATAAAGTGCATGCTCCATCTTCCCCCCTACACGTGTTAACGGCCTCGTGGCCTTTCTCTATGTGGGGAATGGATGTAATTGGGCCCATTGAGCCAAAGGTGTCAAATGGTCATCGATTCATTTTGGTAGCCATAAACTACTTTACTAAGTGGGTGGAGGCTGCCTCGTACAAGAGTGTTACCAAGCAAGCCGTTGTCAAGTTCATAAGAAAGGACATTATATGTTGGTATGGCTTGCCTAAACGTATCATCACTGATAATGGGAGGAACTTAAATAACAAATTGATGGAGGAATTGTGTACCCAGTTCAAAGTCAAACATTCTAACACTACTCCTTATCGCCCTAGGATGAATGGAGCAGTGGAGGCGGCAAATAAAAATATCAAGAGAATTATCCAGAAAATGACGGTCACATATAAAGATTGGCATGAAATGTTACCGTTTGCGCTACACGGTTACCGAACATCAGTTCGTACATCGACTGGGGCAACGCCATTTTCTTTGGTGTATGGTTTGGAGGCTGTCTTGCCTATTGAGGTCGAGGTGCCATCCCTCACGGTAATTCAAGAAGTAGAGTTAGAAGAAGCAGAATGGATTCAAACAAGAAAAGTTCGACGTCGTTGTTTTCAGGAAGGAGATTTGGTGTTGAAAAGGATCTTGCCATCTCAGAAGGATCATAGAGGAAAGTGGACCCCTAATTATGAGGGACCATATGTGGTAAAAAAGGCATTCACTGGAGGAGCTTTGATTTTGACAAA
TATGGATGGGACCGATTTGCCCAATCCGTTAAGTGTAGATTACGTGAAGAAGTACTATGCATAG

Protein sequence

MGCVLGQHDSTGRKEQAVYYLSKKFTSYELKYSLLEKTCCALAWTAQRLRQYMLYYTTLLISKMDPIKYIFEKPSLSGRIAKWQVLLSEYDIVYVTKKAIKGSAIADCLAELPVEDYEPMKFEFPDEDIMTISTLNATQDPETWTMLFDGATNEMGHGVGAILISPDGKLYPLTAKLYFDCTNNMAEYEACSMGIQMAYDMKIKKLQVFGDSLLVVHQLNGEWETRDSKLIPYNKYIRELAQTFESITFEHVPRENNQVADALATLSAMFNVARNEEIQPISIEKRETPAYCLSVEQEPDGKPWYHDIKHYITCREYPLGASENDKRTIRKLAMSFFLNGDVLYKRNYDMTLLRCVDAFEAKRILEEVHEGVCGTHTNGHMMSRQALRAGYYWLTMESDCIKYARKCHKCQIYADKVHAPSSPLHVLTASWPFSMWGMDVIGPIEPKVSNGHRFILVAINYFTKWVEAASYKSVTKQAVVKFIRKDIICWYGLPKRIITDNGRNLNNKLMEELCTQFKVKHSNTTPYRPRMNGAVEAANKNIKRIIQKMTVTYKDWHEMLPFALHGYRTSVRTSTGATPFSLVYGLEAVLPIEVEVPSLTVIQEVELEEAEWIQTRKVRRRCFQEGDLVLKRILPSQKDHRGKWTPNYEGPYVVKKAFTGGALILTNMDGTDLPNPLSVDYVKKYYA
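
The protein sequence above should correspond to a frame-0 translation of the coding sequence under the standard genetic code. The following is a minimal sketch of that check, not the pipeline the database used: the function name `translate` and the two short excerpts (`cds_start`, `cds_end`) are illustrative choices, with the excerpts copied from the first 30 and last 27 nucleotides of the CDS listed above.

```python
# Standard genetic code, laid out in the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a nucleotide string in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# Excerpts from the CDS above: the first 30 nt and the final 27 nt
# (the latter ends in the TAG stop codon).
cds_start = "ATGGGTTGTGTGTTGGGGCAGCATGACTCT"
cds_end = "GATTACGTGAAGAAGTACTATGCATAG"

print(translate(cds_start))
print(translate(cds_end))
```

Applied to the full CDS, the same function should reproduce the complete protein sequence, provided the sequence is first joined into a single string without line breaks.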
Homology
BLAST of ClCG04G001902 vs. NCBI nr
Match: XP_022147189.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111016200 [Momordica charantia])

HSP 1 Score: 1115.5 bits (2884), Expect = 0.0e+00
Identity = 519/718 (72.28%), Positives = 607/718 (84.54%), Query Frame = 0

Query: 1    MGCVLGQHDSTGRKEQAVYYLSKKFTSYELKYSLLEKTCCALAWTAQRLRQYMLYYTTLL 60
            MGCVLGQHD +GRKEQA+YYLSKKFT  E +YS +EKTCCALAW A+RLRQYMLYYTT L
Sbjct: 1547 MGCVLGQHDDSGRKEQAIYYLSKKFTDCETRYSQVEKTCCALAWAARRLRQYMLYYTTWL 1606

Query: 61   ISKMDPIKYIFEKPSLSGRIAKWQVLLSEYDIVYVTKKAIKGSAIADCLAELPVEDYEPM 120
            ISKMDPIKYIFEKPSLSG IA+WQVLLSEYDIVYVT+KAIKGSA+AD LA+ P+ DY P+
Sbjct: 1607 ISKMDPIKYIFEKPSLSGGIARWQVLLSEYDIVYVTQKAIKGSALADYLAQQPINDYIPV 1666

Query: 121  KFEFPDEDIMTISTLNATQDPETWTMLFDGATNEMGHGVGAILISPDGKLYPLTAKLYFD 180
            KF+FPDE I TI+    + DP+TWTM+FDGA+NE+GHG+GAILISP G+LYPL A+L FD
Sbjct: 1667 KFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHGIGAILISPKGELYPLIARLCFD 1726

Query: 181  CTNNMAEYEACSMGIQMAYDMKIKKLQVFGDSLLVVHQLNGEWETRDSKLIPYNKYIREL 240
            C +NMAEYEACSMG+Q A DMK+KKL+VFGDS+LV+HQL GEWETRD KL+PY ++I EL
Sbjct: 1727 CKHNMAEYEACSMGVQAAIDMKVKKLKVFGDSMLVIHQLRGEWETRDVKLLPYKQFITEL 1786

Query: 241  AQTFESITFEHVPRENNQVADALATLSAMFNVARNEEIQPISIEKRETPAYCLSVEQEPD 300
            +Q F+ I+F+++PRENNQVADALATL+ MFN+  NE+++PI + +R+ PA C+S+E+EPD
Sbjct: 1787 SQEFDEISFDYLPRENNQVADALATLAVMFNLELNEDVRPIKVGRRDVPASCMSIEEEPD 1846

Query: 301  GKPWYHDIKHYITCREYPLGASENDKRTIRKLAMSFFLNGDVLYKRNYDMTLLRCVDAFE 360
            GKPW+HDIK YI  +EYP  ASENDKRT+RKLA+ FFLNG++LYKRN+DM LLRCV+  +
Sbjct: 1847 GKPWFHDIKQYIKSKEYPPNASENDKRTLRKLAIKFFLNGEILYKRNHDMVLLRCVEGRD 1906

Query: 361  AKRILEEVHEGVCGTHTNGHMMSRQALRAGYYWLTMESDCIKYARKCHKCQIYADKVHAP 420
            A RI+EE+HEGVCGTH NGHMM+RQ LRAGYYWLT+E+DCIKYARKCHKCQIY+DK HAP
Sbjct: 1907 ANRIMEEIHEGVCGTHANGHMMARQILRAGYYWLTIETDCIKYARKCHKCQIYSDKTHAP 1966

Query: 421  SSPLHVLTASWPFSMWGMDVIGPIEPKVSNGHRFILVAINYFTKWVEAASYKSVTKQAVV 480
            +S LH LTA WPFSMWGMDVIGPIEPK SNGH+FILVAI+YFTKWVEAASY+ VTK  VV
Sbjct: 1967 ASHLHALTAPWPFSMWGMDVIGPIEPKASNGHQFILVAIDYFTKWVEAASYRDVTKGVVV 2026

Query: 481  KFIRKDIICWYGLPKRIITDNGRNLNNKLMEELCTQFKVKHSNTTPYRPRMNGAVEAANK 540
            KFI+K+IIC YGLPK II+DN RNLNNKLM EL  QFK+KH N+TPYRP+MNGAVEAANK
Sbjct: 2027 KFIKKEIICRYGLPKTIISDNARNLNNKLMSELYEQFKIKHLNSTPYRPKMNGAVEAANK 2086

Query: 541  NIKRIIQKMTVTYKDWHEMLPFALHGYRTSVRTSTGATPFSLVYGLEAVLPIEVEVPSLT 600
            NIKRI++KMTVTY+DWHEMLPFALHGYRTSVRTSTGATPFSLVYG+E VLPIEVE+PSL 
Sbjct: 2087 NIKRIVEKMTVTYRDWHEMLPFALHGYRTSVRTSTGATPFSLVYGMEVVLPIEVEIPSLR 2146

Query: 601  VIQEVELEEAEWIQTR-------------------------------KVRRRCFQEGDLV 660
            VI E +L+EAEW+Q R                               KV  R F+E DLV
Sbjct: 2147 VIMEAKLQEAEWVQRRYEQLNFVEEKRLTALCRRQLYQRRMMKAYDKKVHPRRFKEEDLV 2206

Query: 661  LKRILPSQKDHRGKWTPNYEGPYVVKKAFTGGALILTNMDGTDLPNPLSVDYVKKYYA 688
            LKRILP QKDHRGKWTPNYEGP+VVKKAF+GGAL+L NMDGT+  NP+  D+V+KYYA
Sbjct: 2207 LKRILPLQKDHRGKWTPNYEGPFVVKKAFSGGALVLANMDGTEFXNPVKADHVRKYYA 2264
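
The Identity and Positives percentages reported for this HSP are ratios of matching columns to the alignment length (718 columns, which counts gap columns as well as aligned residues). As a small sketch of that arithmetic, using a hypothetical helper named `percent` and the counts reported above:

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, to two decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the HSP summary above.
identity = percent(519, 718)
positives = percent(607, 718)

print(f"Identity  = {identity:.2f}%")
print(f"Positives = {positives:.2f}%")
```

Because gap columns are part of the denominator, a long gap such as the one in the 601-660 query block lowers both percentages even where the flanking residues align well.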

BLAST of ClCG04G001902 vs. NCBI nr
Match: XP_022158986.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111025431 [Momordica charantia])

HSP 1 Score: 1104.7 bits (2856), Expect = 0.0e+00
Identity = 515/718 (71.73%), Positives = 603/718 (83.98%), Query Frame = 0

Query: 1    MGCVLGQHDSTGRKEQAVYYLSKKFTSYELKYSLLEKTCCALAWTAQRLRQYMLYYTTLL 60
            MGCVLGQHD +GRKEQA+YYLSKKFT  E +YS +EKTCCALAW  +RLRQYMLYYTT L
Sbjct: 1156 MGCVLGQHDDSGRKEQAIYYLSKKFTDCETRYSQVEKTCCALAWATRRLRQYMLYYTTWL 1215

Query: 61   ISKMDPIKYIFEKPSLSGRIAKWQVLLSEYDIVYVTKKAIKGSAIADCLAELPVEDYEPM 120
            ISKMDPIKYIFEKPSLSGRIA+WQVLLSEYDIVYVT+KAIKGSA+AD LA+ P+ DY P+
Sbjct: 1216 ISKMDPIKYIFEKPSLSGRIARWQVLLSEYDIVYVTRKAIKGSALADYLAQQPINDYIPV 1275

Query: 121  KFEFPDEDIMTISTLNATQDPETWTMLFDGATNEMGHGVGAILISPDGKLYPLTAKLYFD 180
            KF+FPDE I TI+    + DP+TWTM+FDGA+NE+GHG+G ILISP G+LYPLTA+L FD
Sbjct: 1276 KFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHGIGGILISPKGELYPLTARLCFD 1335

Query: 181  CTNNMAEYEACSMGIQMAYDMKIKKLQVFGDSLLVVHQLNGEWETRDSKLIPYNKYIREL 240
            CT+NMAEYEACSMG+Q A DMK+KKL+VFGDS+LV+HQL GEWETRD KL+PY + I EL
Sbjct: 1336 CTHNMAEYEACSMGVQAAVDMKVKKLKVFGDSMLVIHQLRGEWETRDVKLLPYKQLITEL 1395

Query: 241  AQTFESITFEHVPRENNQVADALATLSAMFNVARNEEIQPISIEKRETPAYCLSVEQEPD 300
            +Q F+ I+F+++PRENNQVADALATL+ MFN+  NE++ PI + +R+ PA C+S+E+EPD
Sbjct: 1396 SQEFDEISFDYLPRENNQVADALATLAVMFNLELNEDVSPIKVGRRDVPASCMSIEEEPD 1455

Query: 301  GKPWYHDIKHYITCREYPLGASENDKRTIRKLAMSFFLNGDVLYKRNYDMTLLRCVDAFE 360
            G PW+H+IK YI  +EYP  ASENDKRT+RKLAM FFLNG++LYKRN+DM LLRCV+  +
Sbjct: 1456 GNPWFHNIKXYIKSKEYPPNASENDKRTLRKLAMKFFLNGEILYKRNHDMVLLRCVEGRD 1515

Query: 361  AKRILEEVHEGVCGTHTNGHMMSRQALRAGYYWLTMESDCIKYARKCHKCQIYADKVHAP 420
            A RI+EEVHEGVCGTH NGHMM+RQ LRAGYYWLT+ +DCIKYARKCHKCQIY+DK HAP
Sbjct: 1516 ANRIMEEVHEGVCGTHANGHMMARQILRAGYYWLTIXTDCIKYARKCHKCQIYSDKTHAP 1575

Query: 421  SSPLHVLTASWPFSMWGMDVIGPIEPKVSNGHRFILVAINYFTKWVEAASYKSVTKQAVV 480
            +S LH LTA WPFSMWGMD+IGPIEPK SNGHRFILVAI+YFTKWVEAAS + VTK  VV
Sbjct: 1576 ASHLHTLTAPWPFSMWGMDLIGPIEPKASNGHRFILVAIDYFTKWVEAASDRDVTKGVVV 1635

Query: 481  KFIRKDIICWYGLPKRIITDNGRNLNNKLMEELCTQFKVKHSNTTPYRPRMNGAVEAANK 540
            KFI+ +IIC YGLP+ II+DN RNLNNKLM ELC  FK+KH N+TPYRP+MNGAVEAANK
Sbjct: 1636 KFIKNEIICRYGLPQTIISDNARNLNNKLMSELCEHFKIKHFNSTPYRPKMNGAVEAANK 1695

Query: 541  NIKRIIQKMTVTYKDWHEMLPFALHGYRTSVRTSTGATPFSLVYGLEAVLPIEVEVPSLT 600
            NIKRI++KMTVTY+DWHEMLPFALHGYRTSVRTSTGATPFSLVYG++AVLPIEVE+PSL 
Sbjct: 1696 NIKRIVEKMTVTYRDWHEMLPFALHGYRTSVRTSTGATPFSLVYGMKAVLPIEVEIPSLR 1755

Query: 601  VIQEVELEEAEWIQTR-------------------------------KVRRRCFQEGDLV 660
            VI E +L+EAEW+Q R                               KV  R F+EGDLV
Sbjct: 1756 VIMEAKLQEAEWVQRRYEQLNFVEEKRLTALCRGQLYQRRMMKAYDKKVHPRRFREGDLV 1815

Query: 661  LKRILPSQKDHRGKWTPNYEGPYVVKKAFTGGALILTNMDGTDLPNPLSVDYVKKYYA 688
            LK ILP QKDHRGKWT NYEGP+VVKKAF+GGAL+L NMDGT+  NP++ D+V+KYYA
Sbjct: 1816 LKIILPLQKDHRGKWTANYEGPFVVKKAFSGGALVLANMDGTEFLNPVNSDHVRKYYA 1873

BLAST of ClCG04G001902 vs. NCBI nr
Match: XP_022157796.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111024415 [Momordica charantia])

HSP 1 Score: 1104.7 bits (2856), Expect = 0.0e+00
Identity = 518/718 (72.14%), Positives = 600/718 (83.57%), Query Frame = 0

Query: 1    MGCVLGQHDSTGRKEQAVYYLSKKFTSYELKYSLLEKTCCALAWTAQRLRQYMLYYTTLL 60
            MGCVLGQHD +GRKEQA+YYLSKKFT  E +YS +EKTCCALAW A+RLRQYMLYYTT L
Sbjct: 985  MGCVLGQHDDSGRKEQAIYYLSKKFTDCETRYSQVEKTCCALAWVARRLRQYMLYYTTWL 1044

Query: 61   ISKMDPIKYIFEKPSLSGRIAKWQVLLSEYDIVYVTKKAIKGSAIADCLAELPVEDYEPM 120
            ISKMDPIKYIFEKPSLSGRIA+WQVLLSEYDIVYVT+KAIKGSA+AD LA+ P+ DY P+
Sbjct: 1045 ISKMDPIKYIFEKPSLSGRIARWQVLLSEYDIVYVTQKAIKGSALADYLAQQPINDYIPV 1104

Query: 121  KFEFPDEDIMTISTLNATQDPETWTMLFDGATNEMGHGVGAILISPDGKLYPLTAKLYFD 180
            KF+FPDE I TI+    + DP+TWTM+FDGA+NE+GH +GAILISP G+LYPLT KL FD
Sbjct: 1105 KFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHEIGAILISPKGELYPLTTKLCFD 1164

Query: 181  CTNNMAEYEACSMGIQMAYDMKIKKLQVFGDSLLVVHQLNGEWETRDSKLIPYNKYIREL 240
            CT+NMAEYEACSMG+Q A DMK+KK +VFGDS LV+HQL GEWETRD KL+PY + I EL
Sbjct: 1165 CTHNMAEYEACSMGVQAAIDMKVKKFKVFGDSTLVIHQLRGEWETRDVKLLPYKQLITEL 1224

Query: 241  AQTFESITFEHVPRENNQVADALATLSAMFNVARNEEIQPISIEKRETPAYCLSVEQEPD 300
            +Q F+ I+F+++PRENNQVADALATL+ MFN+  NE+++PI + +R+ PA C+S+E+EPD
Sbjct: 1225 SQEFDEISFDYLPRENNQVADALATLAVMFNLELNEDVRPIKVGRRDVPASCMSIEEEPD 1284

Query: 301  GKPWYHDIKHYITCREYPLGASENDKRTIRKLAMSFFLNGDVLYKRNYDMTLLRCVDAFE 360
            G PW+HDIK YI  +EY   ASENDKRT+RKLAM FFLNG++LYKRN+DM LLRCV+  +
Sbjct: 1285 GNPWFHDIKQYIKSKEYQPNASENDKRTLRKLAMKFFLNGEILYKRNHDMVLLRCVEGRD 1344

Query: 361  AKRILEEVHEGVCGTHTNGHMMSRQALRAGYYWLTMESDCIKYARKCHKCQIYADKVHAP 420
            A RI+EEVHEGVCGTH NGHMM+RQ LRAGYYWLT+E+DCIKYARKCHKCQIY+DK HAP
Sbjct: 1345 ANRIMEEVHEGVCGTHANGHMMARQILRAGYYWLTIETDCIKYARKCHKCQIYSDKTHAP 1404

Query: 421  SSPLHVLTASWPFSMWGMDVIGPIEPKVSNGHRFILVAINYFTKWVEAASYKSVTKQAVV 480
            +S LH LTA WPFSMWGMDVIGPIEPK S+GHRFILVAI+YFTKWVEAASY+ VTK  VV
Sbjct: 1405 TSHLHTLTAPWPFSMWGMDVIGPIEPKASSGHRFILVAIDYFTKWVEAASYRDVTKGVVV 1464

Query: 481  KFIRKDIICWYGLPKRIITDNGRNLNNKLMEELCTQFKVKHSNTTPYRPRMNGAVEAANK 540
            KFI+K IIC YGLP+ II+DN RNLNNKLM ELC QFK+KH N+TPYRP+MNGAVEAANK
Sbjct: 1465 KFIKKKIICRYGLPETIISDNARNLNNKLMSELCEQFKIKHLNSTPYRPKMNGAVEAANK 1524

Query: 541  NIKRIIQKMTVTYKDWHEMLPFALHGYRTSVRTSTGATPFSLVYGLEAVLPIEVEVPSLT 600
            NIKRI++KMTVTY DWHEMLPFALHGYRTSVRTSTG TPFSLVYG+E VL IEVE+PSL 
Sbjct: 1525 NIKRIVEKMTVTYIDWHEMLPFALHGYRTSVRTSTGTTPFSLVYGMEVVLLIEVEIPSLR 1584

Query: 601  VIQEVELEEAEWIQTR-------------------------------KVRRRCFQEGDLV 660
            VI E +L  AEW+Q R                               KV  R F+EGDLV
Sbjct: 1585 VIMEAKLXRAEWVQRRYEQLNFVEEKRLTALCRGQLYQRRMMKAYDEKVHPRRFREGDLV 1644

Query: 661  LKRILPSQKDHRGKWTPNYEGPYVVKKAFTGGALILTNMDGTDLPNPLSVDYVKKYYA 688
            LKRILP QKDHRGKWTPNYEGP+VVKKAF+GGAL+L NMDGT+  NP++ D+V+KYYA
Sbjct: 1645 LKRILPLQKDHRGKWTPNYEGPFVVKKAFSGGALVLANMDGTEFLNPVNSDHVRKYYA 1702

BLAST of ClCG04G001902 vs. NCBI nr
Match: XP_022150030.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111018303 [Momordica charantia])

HSP 1 Score: 1104.0 bits (2854), Expect = 0.0e+00
Identity = 516/718 (71.87%), Positives = 606/718 (84.40%), Query Frame = 0

Query: 1    MGCVLGQHDSTGRKEQAVYYLSKKFTSYELKYSLLEKTCCALAWTAQRLRQYMLYYTTLL 60
            +GCVLGQHD +GRKEQA+YYLSKKFT  E +YS +EKTCCALAW A+RLRQYMLYYTT L
Sbjct: 952  IGCVLGQHDDSGRKEQAIYYLSKKFTDCETRYSQVEKTCCALAWVARRLRQYMLYYTTWL 1011

Query: 61   ISKMDPIKYIFEKPSLSGRIAKWQVLLSEYDIVYVTKKAIKGSAIADCLAELPVEDYEPM 120
            ISKMDPI+YIFEKPSLSGRIA+WQVLLSEYDIVYVT+KAIKGSA+AD LA+ P+ DY P+
Sbjct: 1012 ISKMDPIRYIFEKPSLSGRIARWQVLLSEYDIVYVTQKAIKGSALADYLAQQPINDYIPV 1071

Query: 121  KFEFPDEDIMTISTLNATQDPETWTMLFDGATNEMGHGVGAILISPDGKLYPLTAKLYFD 180
            KF+FPDE I TI+    + DP+TWTM+FDGA+NE+GHG+GAILISP G+LYPLTA+L FD
Sbjct: 1072 KFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHGIGAILISPKGELYPLTARLCFD 1131

Query: 181  CTNNMAEYEACSMGIQMAYDMKIKKLQVFGDSLLVVHQLNGEWETRDSKLIPYNKYIREL 240
            CT+NMAEYEACSMG+Q A DMK+ K +VFGDS+LV+HQL GEWE RD KL+PY + I EL
Sbjct: 1132 CTHNMAEYEACSMGVQAAVDMKV-KXKVFGDSMLVIHQLRGEWEIRDVKLLPYKQLITEL 1191

Query: 241  AQTFESITFEHVPRENNQVADALATLSAMFNVARNEEIQPISIEKRETPAYCLSVEQEPD 300
            +Q F+ I+F+++PRENNQVADALATL+ MFN+  NE+++PI + +R+  A C+S+E+EPD
Sbjct: 1192 SQEFDEISFDYLPRENNQVADALATLAVMFNLELNEDVRPIKVGRRDVSASCMSIEEEPD 1251

Query: 301  GKPWYHDIKHYITCREYPLGASENDKRTIRKLAMSFFLNGDVLYKRNYDMTLLRCVDAFE 360
            G PW+HDIK YI  +EYP  ASENDKRT+RKLAM FFLN ++LYKRN+DM LLRCV+  +
Sbjct: 1252 GNPWFHDIKQYIKSKEYPPNASENDKRTLRKLAMKFFLNREILYKRNHDMVLLRCVEXRD 1311

Query: 361  AKRILEEVHEGVCGTHTNGHMMSRQALRAGYYWLTMESDCIKYARKCHKCQIYADKVHAP 420
            A RI+EEVHE VCGTH NGHM++RQ LRAGYYWLT+E+DCIKYARKCHKCQIY+DK HAP
Sbjct: 1312 ANRIMEEVHEEVCGTHANGHMIARQILRAGYYWLTIETDCIKYARKCHKCQIYSDKTHAP 1371

Query: 421  SSPLHVLTASWPFSMWGMDVIGPIEPKVSNGHRFILVAINYFTKWVEAASYKSVTKQAVV 480
            +S LH LTA WPFSMWGMDVIGPIEPK SNGHRFILVAI+YFTKWVEAASY+ VTK  VV
Sbjct: 1372 ASHLHTLTAPWPFSMWGMDVIGPIEPKASNGHRFILVAIDYFTKWVEAASYRDVTKGVVV 1431

Query: 481  KFIRKDIICWYGLPKRIITDNGRNLNNKLMEELCTQFKVKHSNTTPYRPRMNGAVEAANK 540
            KFI+K+IIC YGLP+ II+DN RNLNNKLM ELC QFK+KH N+TPYRP+MNGAVEAANK
Sbjct: 1432 KFIKKEIICRYGLPETIISDNARNLNNKLMSELCEQFKIKHLNSTPYRPKMNGAVEAANK 1491

Query: 541  NIKRIIQKMTVTYKDWHEMLPFALHGYRTSVRTSTGATPFSLVYGLEAVLPIEVEVPSLT 600
            NIKRI++KMTVTY+DWHEMLPFALHGYRTSVRTSTGATPFSLVYG+EAVLPIEVE+PSL 
Sbjct: 1492 NIKRIVEKMTVTYRDWHEMLPFALHGYRTSVRTSTGATPFSLVYGMEAVLPIEVEIPSLR 1551

Query: 601  VIQEVELEEAEWIQTR-------------------------------KVRRRCFQEGDLV 660
            VI E +L+EAEW+Q R                               KV  R F+EGDLV
Sbjct: 1552 VIMEAKLQEAEWVQRRYEQLNFVEEKRLTALYRGQLYQRRMMKAYDKKVHSRRFREGDLV 1611

Query: 661  LKRILPSQKDHRGKWTPNYEGPYVVKKAFTGGALILTNMDGTDLPNPLSVDYVKKYYA 688
            LKRILP QKDHRGKWTPNYEGP+V+KKAF+GGAL+L NMDGT+  NP++ D+V+KYYA
Sbjct: 1612 LKRILPLQKDHRGKWTPNYEGPFVLKKAFSGGALVLANMDGTEFLNPINSDHVRKYYA 1668

BLAST of ClCG04G001902 vs. NCBI nr
Match: XP_022143495.1 (LOW QUALITY PROTEIN: uncharacterized protein LOC111013372 [Momordica charantia])

HSP 1 Score: 1094.7 bits (2830), Expect = 0.0e+00
Identity = 512/718 (71.31%), Positives = 598/718 (83.29%), Query Frame = 0

Query: 1    MGCVLGQHDSTGRKEQAVYYLSKKFTSYELKYSLLEKTCCALAWTAQRLRQYMLYYTTLL 60
            MGCVLGQHD +GRKEQA+YYLSKKFT  E +YS +EKTCCALAW A+RLRQYMLYYTT L
Sbjct: 1437 MGCVLGQHDDSGRKEQAIYYLSKKFTDCETRYSQVEKTCCALAWVARRLRQYMLYYTTWL 1496

Query: 61   ISKMDPIKYIFEKPSLSGRIAKWQVLLSEYDIVYVTKKAIKGSAIADCLAELPVEDYEPM 120
            ISKMDPIKYIFEK SLS RIA+ QVLLSEYDIVYVT+KAIKGSA+AD LA+ P+ DY P+
Sbjct: 1497 ISKMDPIKYIFEKSSLSXRIARXQVLLSEYDIVYVTQKAIKGSALADYLAQQPINDYIPV 1556

Query: 121  KFEFPDEDIMTISTLNATQDPETWTMLFDGATNEMGHGVGAILISPDGKLYPLTAKLYFD 180
            KF+FPDE I TI+    + DP+TWTM+FDGA+NE+GHG+GAILISP G+LYPLTA+L FD
Sbjct: 1557 KFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHGIGAILISPKGELYPLTARLCFD 1616

Query: 181  CTNNMAEYEACSMGIQMAYDMKIKKLQVFGDSLLVVHQLNGEWETRDSKLIPYNKYIREL 240
            CT+NMAEYEACSMG+Q A DMK+KKL+VFGDS+LV+HQL GEWETRD KL+PY + I EL
Sbjct: 1617 CTHNMAEYEACSMGVQAAVDMKVKKLKVFGDSMLVIHQLRGEWETRDVKLLPYKQLITEL 1676

Query: 241  AQTFESITFEHVPRENNQVADALATLSAMFNVARNEEIQPISIEKRETPAYCLSVEQEPD 300
            +Q F+ ++F+++PRENNQV DALATL+ MFN+  NE++ PI + +R+ PA C+S+E+EPD
Sbjct: 1677 SQEFDEMSFDYLPRENNQVXDALATLAVMFNLELNEDVCPIKVGRRDVPASCMSIEEEPD 1736

Query: 301  GKPWYHDIKHYITCREYPLGASENDKRTIRKLAMSFFLNGDVLYKRNYDMTLLRCVDAFE 360
            G PW+HDIK YI  +EYP  ASENDKRT RKLAM FFLNG++LYKRN+DM LLRCV+  +
Sbjct: 1737 GNPWFHDIKQYINSKEYPPNASENDKRTFRKLAMKFFLNGEILYKRNHDMVLLRCVEGRD 1796

Query: 361  AKRILEEVHEGVCGTHTNGHMMSRQALRAGYYWLTMESDCIKYARKCHKCQIYADKVHAP 420
            A RI+EEVHEGVC TH NGHM++RQ LRAGYYWLT+ +DCIKYARKCHKCQIYADK HAP
Sbjct: 1797 ANRIMEEVHEGVCDTHANGHMIARQILRAGYYWLTIXTDCIKYARKCHKCQIYADKTHAP 1856

Query: 421  SSPLHVLTASWPFSMWGMDVIGPIEPKVSNGHRFILVAINYFTKWVEAASYKSVTKQAVV 480
            +S LH LTA WPFSMWGMDVIGPIEPK SNGHRFILVAI+YFT WVEAASY+ VTK  VV
Sbjct: 1857 ASHLHTLTAPWPFSMWGMDVIGPIEPKASNGHRFILVAIDYFTNWVEAASYRDVTKGVVV 1916

Query: 481  KFIRKDIICWYGLPKRIITDNGRNLNNKLMEELCTQFKVKHSNTTPYRPRMNGAVEAANK 540
            KFI+K+IIC YGLP+ II+DN RNLNNKL  ELC QFK+KH N+TPYRP+MNGAVEAANK
Sbjct: 1917 KFIKKEIICRYGLPETIISDNARNLNNKLXSELCEQFKIKHLNSTPYRPKMNGAVEAANK 1976

Query: 541  NIKRIIQKMTVTYKDWHEMLPFALHGYRTSVRTSTGATPFSLVYGLEAVLPIEVEVPSLT 600
            NIKRI++KMTVTY+DWH MLPFALHGYRTSVRTSTGATPFSLVYG+  VLPIEVE+PSL 
Sbjct: 1977 NIKRIVEKMTVTYRDWHGMLPFALHGYRTSVRTSTGATPFSLVYGMXVVLPIEVEIPSLR 2036

Query: 601  VIQEVELEEAEWIQTR-------------------------------KVRRRCFQEGDLV 660
            VI E +L+EAEW+Q R                                V  R F+EGDLV
Sbjct: 2037 VIMEAKLQEAEWVQRRYEQLDFVEEKRLTALCRGQLYQSRMMKAYDENVHPRRFREGDLV 2096

Query: 661  LKRILPSQKDHRGKWTPNYEGPYVVKKAFTGGALILTNMDGTDLPNPLSVDYVKKYYA 688
            LKRILP QKDHRGKWTPNYEGP++VKKAF+GGAL+L NMDGT+  NP++ D+V+KYYA
Sbjct: 2097 LKRILPLQKDHRGKWTPNYEGPFLVKKAFSGGALVLANMDGTEFLNPVNXDHVRKYYA 2154

BLAST of ClCG04G001902 vs. ExPASy Swiss-Prot
Match: Q9UR07 (Transposon Tf2-11 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-11 PE=3 SV=1)

HSP 1 Score: 111.7 bits (278), Expect = 3.3e-23
Identity = 103/485 (21.24%), Positives = 205/485 (42.27%), Query Frame = 0

Query: 203  IKKLQVFGDSLLVVHQLNGEWETRDSKLIPYNKYIRELAQTFESITFEHVPRENNQVADA 262
            I+  ++  D   ++ ++  E E  + +L  +  ++++      +    + P   N +ADA
Sbjct: 774  IEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDF-----NFEINYRPGSANHIADA 833

Query: 263  LATLSAMFNVARNEEIQPISIEKRETPAYCLSVEQEPDGKPWYHDIKHYITCREYPLGA- 322
            L+ +         +E +PI  +  +     ++     D      D K+ +   EY     
Sbjct: 834  LSRIV--------DETEPIPKDSEDNSINFVNQISITD------DFKNQVV-TEYTNDTK 893

Query: 323  -----SENDKRTIRKLAMSFFLNGDVLYKRNYDMTLLRCVDAFEAKRILEEVHEGVCGTH 382
                 +  DKR    + +      D L   + D  LL   D    + I+++ HE     H
Sbjct: 894  LLNLLNNEDKRVEENIQLK-----DGLLINSKDQILLP-NDTQLTRTIIKKYHEEGKLIH 953

Query: 383  TNGHMMSRQALRAGYYWLTMESDCIKYARKCHKCQIYADKVHAPSSPLHVLTAS-WPFSM 442
                +++   LR  + W  +     +Y + CH CQI   + H P  PL  +  S  P+  
Sbjct: 954  PGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWES 1013

Query: 443  WGMDVIGPIEPKVSNGHRFILVAINYFTKW-VEAASYKSVTKQAVVKFIRKDIICWYGLP 502
              MD I  +    S+G+  + V ++ F+K  +     KS+T +   +   + +I ++G P
Sbjct: 1014 LSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNP 1073

Query: 503  KRIITDNGRNLNNKLMEELCTQFKVKHSNTTPYRPRMNGAVEAANKNIKRIIQKMTVTYK 562
            K II DN     ++  ++   ++      + PYRP+ +G  E  N+ ++++++ +  T+ 
Sbjct: 1074 KEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHP 1133

Query: 563  D-WHEMLPFALHGYRTSVRTSTGATPFSLVYGLEAVLPIEVEVPSLTVIQEVELEEA--- 622
            + W + +      Y  ++ ++T  TPF +V+     L   +E+PS +   +   +E    
Sbjct: 1134 NTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFSDKTDENSQETIQV 1193

Query: 623  -----EWIQTRKVRRRC-----------FQEGDLVL-KRILPSQKDHRGKWTPNYEGP-Y 658
                 E + T  ++ +            FQ GDLV+ KR          K  P++ GP Y
Sbjct: 1194 FQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRTKTGFLHKSNKLAPSFAGPFY 1228

BLAST of ClCG04G001902 vs. ExPASy Swiss-Prot
Match: P0CT41 (Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-12 PE=3 SV=1)

HSP 1 Score: 111.7 bits (278), Expect = 3.3e-23
Identity = 103/485 (21.24%), Positives = 205/485 (42.27%), Query Frame = 0

Query: 203  IKKLQVFGDSLLVVHQLNGEWETRDSKLIPYNKYIRELAQTFESITFEHVPRENNQVADA 262
            I+  ++  D   ++ ++  E E  + +L  +  ++++      +    + P   N +ADA
Sbjct: 774  IEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDF-----NFEINYRPGSANHIADA 833

Query: 263  LATLSAMFNVARNEEIQPISIEKRETPAYCLSVEQEPDGKPWYHDIKHYITCREYPLGA- 322
            L+ +         +E +PI  +  +     ++     D      D K+ +   EY     
Sbjct: 834  LSRIV--------DETEPIPKDSEDNSINFVNQISITD------DFKNQVV-TEYTNDTK 893

Query: 323  -----SENDKRTIRKLAMSFFLNGDVLYKRNYDMTLLRCVDAFEAKRILEEVHEGVCGTH 382
                 +  DKR    + +      D L   + D  LL   D    + I+++ HE     H
Sbjct: 894  LLNLLNNEDKRVEENIQLK-----DGLLINSKDQILLP-NDTQLTRTIIKKYHEEGKLIH 953

Query: 383  TNGHMMSRQALRAGYYWLTMESDCIKYARKCHKCQIYADKVHAPSSPLHVLTAS-WPFSM 442
                +++   LR  + W  +     +Y + CH CQI   + H P  PL  +  S  P+  
Sbjct: 954  PGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWES 1013

Query: 443  WGMDVIGPIEPKVSNGHRFILVAINYFTKW-VEAASYKSVTKQAVVKFIRKDIICWYGLP 502
              MD I  +    S+G+  + V ++ F+K  +     KS+T +   +   + +I ++G P
Sbjct: 1014 LSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNP 1073

Query: 503  KRIITDNGRNLNNKLMEELCTQFKVKHSNTTPYRPRMNGAVEAANKNIKRIIQKMTVTYK 562
            K II DN     ++  ++   ++      + PYRP+ +G  E  N+ ++++++ +  T+ 
Sbjct: 1074 KEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHP 1133

Query: 563  D-WHEMLPFALHGYRTSVRTSTGATPFSLVYGLEAVLPIEVEVPSLTVIQEVELEEA--- 622
            + W + +      Y  ++ ++T  TPF +V+     L   +E+PS +   +   +E    
Sbjct: 1134 NTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFSDKTDENSQETIQV 1193

Query: 623  -----EWIQTRKVRRRC-----------FQEGDLVL-KRILPSQKDHRGKWTPNYEGP-Y 658
                 E + T  ++ +            FQ GDLV+ KR          K  P++ GP Y
Sbjct: 1194 FQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRTKTGFLHKSNKLAPSFAGPFY 1228

BLAST of ClCG04G001902 vs. ExPASy Swiss-Prot
Match: P0CT34 (Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-1 PE=3 SV=1)

HSP 1 Score: 111.7 bits (278), Expect = 3.3e-23
Identity = 103/485 (21.24%), Positives = 205/485 (42.27%), Query Frame = 0

Query: 203  IKKLQVFGDSLLVVHQLNGEWETRDSKLIPYNKYIRELAQTFESITFEHVPRENNQVADA 262
            I+  ++  D   ++ ++  E E  + +L  +  ++++      +    + P   N +ADA
Sbjct: 774  IEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDF-----NFEINYRPGSANHIADA 833

Query: 263  LATLSAMFNVARNEEIQPISIEKRETPAYCLSVEQEPDGKPWYHDIKHYITCREYPLGA- 322
            L+ +         +E +PI  +  +     ++     D      D K+ +   EY     
Sbjct: 834  LSRIV--------DETEPIPKDSEDNSINFVNQISITD------DFKNQVV-TEYTNDTK 893

Query: 323  -----SENDKRTIRKLAMSFFLNGDVLYKRNYDMTLLRCVDAFEAKRILEEVHEGVCGTH 382
                 +  DKR    + +      D L   + D  LL   D    + I+++ HE     H
Sbjct: 894  LLNLLNNEDKRVEENIQLK-----DGLLINSKDQILLP-NDTQLTRTIIKKYHEEGKLIH 953

Query: 383  TNGHMMSRQALRAGYYWLTMESDCIKYARKCHKCQIYADKVHAPSSPLHVLTAS-WPFSM 442
                +++   LR  + W  +     +Y + CH CQI   + H P  PL  +  S  P+  
Sbjct: 954  PGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWES 1013

Query: 443  WGMDVIGPIEPKVSNGHRFILVAINYFTKW-VEAASYKSVTKQAVVKFIRKDIICWYGLP 502
              MD I  +    S+G+  + V ++ F+K  +     KS+T +   +   + +I ++G P
Sbjct: 1014 LSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNP 1073

Query: 503  KRIITDNGRNLNNKLMEELCTQFKVKHSNTTPYRPRMNGAVEAANKNIKRIIQKMTVTYK 562
            K II DN     ++  ++   ++      + PYRP+ +G  E  N+ ++++++ +  T+ 
Sbjct: 1074 KEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHP 1133

Query: 563  D-WHEMLPFALHGYRTSVRTSTGATPFSLVYGLEAVLPIEVEVPSLTVIQEVELEEA--- 622
            + W + +      Y  ++ ++T  TPF +V+     L   +E+PS +   +   +E    
Sbjct: 1134 NTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFSDKTDENSQETIQV 1193

Query: 623  -----EWIQTRKVRRRC-----------FQEGDLVL-KRILPSQKDHRGKWTPNYEGP-Y 658
                 E + T  ++ +            FQ GDLV+ KR          K  P++ GP Y
Sbjct: 1194 FQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRTKTGFLHKSNKLAPSFAGPFY 1228

BLAST of ClCG04G001902 vs. ExPASy Swiss-Prot
Match: P0CT35 (Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-2 PE=3 SV=1)

HSP 1 Score: 111.7 bits (278), Expect = 3.3e-23
Identity = 103/485 (21.24%), Positives = 205/485 (42.27%), Query Frame = 0

Query: 203  IKKLQVFGDSLLVVHQLNGEWETRDSKLIPYNKYIRELAQTFESITFEHVPRENNQVADA 262
            I+  ++  D   ++ ++  E E  + +L  +  ++++      +    + P   N +ADA
Sbjct: 774  IEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDF-----NFEINYRPGSANHIADA 833

Query: 263  LATLSAMFNVARNEEIQPISIEKRETPAYCLSVEQEPDGKPWYHDIKHYITCREYPLGA- 322
            L+ +         +E +PI  +  +     ++     D      D K+ +   EY     
Sbjct: 834  LSRIV--------DETEPIPKDSEDNSINFVNQISITD------DFKNQVV-TEYTNDTK 893

Query: 323  -----SENDKRTIRKLAMSFFLNGDVLYKRNYDMTLLRCVDAFEAKRILEEVHEGVCGTH 382
                 +  DKR    + +      D L   + D  LL   D    + I+++ HE     H
Sbjct: 894  LLNLLNNEDKRVEENIQLK-----DGLLINSKDQILLP-NDTQLTRTIIKKYHEEGKLIH 953

Query: 383  TNGHMMSRQALRAGYYWLTMESDCIKYARKCHKCQIYADKVHAPSSPLHVLTAS-WPFSM 442
                +++   LR  + W  +     +Y + CH CQI   + H P  PL  +  S  P+  
Sbjct: 954  PGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWES 1013

Query: 443  WGMDVIGPIEPKVSNGHRFILVAINYFTKW-VEAASYKSVTKQAVVKFIRKDIICWYGLP 502
              MD I  +    S+G+  + V ++ F+K  +     KS+T +   +   + +I ++G P
Sbjct: 1014 LSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNP 1073

Query: 503  KRIITDNGRNLNNKLMEELCTQFKVKHSNTTPYRPRMNGAVEAANKNIKRIIQKMTVTYK 562
            K II DN     ++  ++   ++      + PYRP+ +G  E  N+ ++++++ +  T+ 
Sbjct: 1074 KEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHP 1133

Query: 563  D-WHEMLPFALHGYRTSVRTSTGATPFSLVYGLEAVLPIEVEVPSLTVIQEVELEEA--- 622
            + W + +      Y  ++ ++T  TPF +V+     L   +E+PS +   +   +E    
Sbjct: 1134 NTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFSDKTDENSQETIQV 1193

Query: 623  -----EWIQTRKVRRRC-----------FQEGDLVL-KRILPSQKDHRGKWTPNYEGP-Y 658
                 E + T  ++ +            FQ GDLV+ KR          K  P++ GP Y
Sbjct: 1194 FQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRTKTGFLHKSNKLAPSFAGPFY 1228

BLAST of ClCG04G001902 vs. ExPASy Swiss-Prot
Match: P0CT36 (Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-3 PE=1 SV=1)

HSP 1 Score: 111.7 bits (278), Expect = 3.3e-23
Identity = 103/485 (21.24%), Positives = 205/485 (42.27%), Query Frame = 0

Query: 203  IKKLQVFGDSLLVVHQLNGEWETRDSKLIPYNKYIRELAQTFESITFEHVPRENNQVADA 262
            I+  ++  D   ++ ++  E E  + +L  +  ++++      +    + P   N +ADA
Sbjct: 774  IEPFKILTDHRNLIGRITNESEPENKRLARWQLFLQDF-----NFEINYRPGSANHIADA 833

Query: 263  LATLSAMFNVARNEEIQPISIEKRETPAYCLSVEQEPDGKPWYHDIKHYITCREYPLGA- 322
            L+ +         +E +PI  +  +     ++     D      D K+ +   EY     
Sbjct: 834  LSRIV--------DETEPIPKDSEDNSINFVNQISITD------DFKNQVV-TEYTNDTK 893

Query: 323  -----SENDKRTIRKLAMSFFLNGDVLYKRNYDMTLLRCVDAFEAKRILEEVHEGVCGTH 382
                 +  DKR    + +      D L   + D  LL   D    + I+++ HE     H
Sbjct: 894  LLNLLNNEDKRVEENIQLK-----DGLLINSKDQILLP-NDTQLTRTIIKKYHEEGKLIH 953

Query: 383  TNGHMMSRQALRAGYYWLTMESDCIKYARKCHKCQIYADKVHAPSSPLHVLTAS-WPFSM 442
                +++   LR  + W  +     +Y + CH CQI   + H P  PL  +  S  P+  
Sbjct: 954  PGIELLTNIILRR-FTWKGIRKQIQEYVQNCHTCQINKSRNHKPYGPLQPIPPSERPWES 1013

Query: 443  WGMDVIGPIEPKVSNGHRFILVAINYFTKW-VEAASYKSVTKQAVVKFIRKDIICWYGLP 502
              MD I  +    S+G+  + V ++ F+K  +     KS+T +   +   + +I ++G P
Sbjct: 1014 LSMDFITALPE--SSGYNALFVVVDRFSKMAILVPCTKSITAEQTARMFDQRVIAYFGNP 1073

Query: 503  KRIITDNGRNLNNKLMEELCTQFKVKHSNTTPYRPRMNGAVEAANKNIKRIIQKMTVTYK 562
            K II DN     ++  ++   ++      + PYRP+ +G  E  N+ ++++++ +  T+ 
Sbjct: 1074 KEIIADNDHIFTSQTWKDFAHKYNFVMKFSLPYRPQTDGQTERTNQTVEKLLRCVCSTHP 1133

Query: 563  D-WHEMLPFALHGYRTSVRTSTGATPFSLVYGLEAVLPIEVEVPSLTVIQEVELEEA--- 622
            + W + +      Y  ++ ++T  TPF +V+     L   +E+PS +   +   +E    
Sbjct: 1134 NTWVDHISLVQQSYNNAIHSATQMTPFEIVHRYSPALS-PLELPSFSDKTDENSQETIQV 1193

Query: 623  -----EWIQTRKVRRRC-----------FQEGDLVL-KRILPSQKDHRGKWTPNYEGP-Y 658
                 E + T  ++ +            FQ GDLV+ KR          K  P++ GP Y
Sbjct: 1194 FQTVKEHLNTNNIKMKKYFDMKIQEIEEFQPGDLVMVKRTKTGFLHKSNKLAPSFAGPFY 1228
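
The percentages in an HSP summary line are the match counts divided by the alignment length. The following is a minimal sketch of that arithmetic, assuming the summary-line layout used in this record; the function name `hsp_percentages` is hypothetical and not part of any BLAST tool.

```python
import re


def hsp_percentages(summary: str) -> dict:
    """Recompute percentages from 'Label = n/m (...)' fields of an HSP summary line.

    Assumption: fields look like 'Identity = 103/485 (21.24%)'.
    Fields without an n/m count (e.g. 'Query Frame = 0') are ignored.
    """
    result = {}
    for label, matched, length in re.findall(r"(\w+) = (\d+)/(\d+)", summary):
        result[label] = round(100 * int(matched) / int(length), 2)
    return result


line = "Identity = 103/485 (21.24%), Positives = 205/485 (42.27%), Query Frame = 0"
print(hsp_percentages(line))
```

For the Tf2-3 hit this reproduces the reported 21.24% identity (103 of 485 aligned positions) and 42.27% positives (205 of 485).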

BLAST of ClCG04G001902 vs. ExPASy TrEMBL
Match: A0A6J1D099 (Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111016200 PE=4 SV=1)

HSP 1 Score: 1115.5 bits (2884), Expect = 0.0e+00
Identity = 519/718 (72.28%), Positives = 607/718 (84.54%), Query Frame = 0

Query: 1    MGCVLGQHDSTGRKEQAVYYLSKKFTSYELKYSLLEKTCCALAWTAQRLRQYMLYYTTLL 60
            MGCVLGQHD +GRKEQA+YYLSKKFT  E +YS +EKTCCALAW A+RLRQYMLYYTT L
Sbjct: 1547 MGCVLGQHDDSGRKEQAIYYLSKKFTDCETRYSQVEKTCCALAWAARRLRQYMLYYTTWL 1606

Query: 61   ISKMDPIKYIFEKPSLSGRIAKWQVLLSEYDIVYVTKKAIKGSAIADCLAELPVEDYEPM 120
            ISKMDPIKYIFEKPSLSG IA+WQVLLSEYDIVYVT+KAIKGSA+AD LA+ P+ DY P+
Sbjct: 1607 ISKMDPIKYIFEKPSLSGGIARWQVLLSEYDIVYVTQKAIKGSALADYLAQQPINDYIPV 1666

Query: 121  KFEFPDEDIMTISTLNATQDPETWTMLFDGATNEMGHGVGAILISPDGKLYPLTAKLYFD 180
            KF+FPDE I TI+    + DP+TWTM+FDGA+NE+GHG+GAILISP G+LYPL A+L FD
Sbjct: 1667 KFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHGIGAILISPKGELYPLIARLCFD 1726

Query: 181  CTNNMAEYEACSMGIQMAYDMKIKKLQVFGDSLLVVHQLNGEWETRDSKLIPYNKYIREL 240
            C +NMAEYEACSMG+Q A DMK+KKL+VFGDS+LV+HQL GEWETRD KL+PY ++I EL
Sbjct: 1727 CKHNMAEYEACSMGVQAAIDMKVKKLKVFGDSMLVIHQLRGEWETRDVKLLPYKQFITEL 1786

Query: 241  AQTFESITFEHVPRENNQVADALATLSAMFNVARNEEIQPISIEKRETPAYCLSVEQEPD 300
            +Q F+ I+F+++PRENNQVADALATL+ MFN+  NE+++PI + +R+ PA C+S+E+EPD
Sbjct: 1787 SQEFDEISFDYLPRENNQVADALATLAVMFNLELNEDVRPIKVGRRDVPASCMSIEEEPD 1846

Query: 301  GKPWYHDIKHYITCREYPLGASENDKRTIRKLAMSFFLNGDVLYKRNYDMTLLRCVDAFE 360
            GKPW+HDIK YI  +EYP  ASENDKRT+RKLA+ FFLNG++LYKRN+DM LLRCV+  +
Sbjct: 1847 GKPWFHDIKQYIKSKEYPPNASENDKRTLRKLAIKFFLNGEILYKRNHDMVLLRCVEGRD 1906

Query: 361  AKRILEEVHEGVCGTHTNGHMMSRQALRAGYYWLTMESDCIKYARKCHKCQIYADKVHAP 420
            A RI+EE+HEGVCGTH NGHMM+RQ LRAGYYWLT+E+DCIKYARKCHKCQIY+DK HAP
Sbjct: 1907 ANRIMEEIHEGVCGTHANGHMMARQILRAGYYWLTIETDCIKYARKCHKCQIYSDKTHAP 1966

Query: 421  SSPLHVLTASWPFSMWGMDVIGPIEPKVSNGHRFILVAINYFTKWVEAASYKSVTKQAVV 480
            +S LH LTA WPFSMWGMDVIGPIEPK SNGH+FILVAI+YFTKWVEAASY+ VTK  VV
Sbjct: 1967 ASHLHALTAPWPFSMWGMDVIGPIEPKASNGHQFILVAIDYFTKWVEAASYRDVTKGVVV 2026

Query: 481  KFIRKDIICWYGLPKRIITDNGRNLNNKLMEELCTQFKVKHSNTTPYRPRMNGAVEAANK 540
            KFI+K+IIC YGLPK II+DN RNLNNKLM EL  QFK+KH N+TPYRP+MNGAVEAANK
Sbjct: 2027 KFIKKEIICRYGLPKTIISDNARNLNNKLMSELYEQFKIKHLNSTPYRPKMNGAVEAANK 2086

Query: 541  NIKRIIQKMTVTYKDWHEMLPFALHGYRTSVRTSTGATPFSLVYGLEAVLPIEVEVPSLT 600
            NIKRI++KMTVTY+DWHEMLPFALHGYRTSVRTSTGATPFSLVYG+E VLPIEVE+PSL 
Sbjct: 2087 NIKRIVEKMTVTYRDWHEMLPFALHGYRTSVRTSTGATPFSLVYGMEVVLPIEVEIPSLR 2146

Query: 601  VIQEVELEEAEWIQTR-------------------------------KVRRRCFQEGDLV 660
            VI E +L+EAEW+Q R                               KV  R F+E DLV
Sbjct: 2147 VIMEAKLQEAEWVQRRYEQLNFVEEKRLTALCRRQLYQRRMMKAYDKKVHPRRFKEEDLV 2206

Query: 661  LKRILPSQKDHRGKWTPNYEGPYVVKKAFTGGALILTNMDGTDLPNPLSVDYVKKYYA 688
            LKRILP QKDHRGKWTPNYEGP+VVKKAF+GGAL+L NMDGT+  NP+  D+V+KYYA
Sbjct: 2207 LKRILPLQKDHRGKWTPNYEGPFVVKKAFSGGALVLANMDGTEFXNPVKADHVRKYYA 2264

BLAST of ClCG04G001902 vs. ExPASy TrEMBL
Match: A0A6J1E2J7 (Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111025431 PE=4 SV=1)

HSP 1 Score: 1104.7 bits (2856), Expect = 0.0e+00
Identity = 515/718 (71.73%), Positives = 603/718 (83.98%), Query Frame = 0

Query: 1    MGCVLGQHDSTGRKEQAVYYLSKKFTSYELKYSLLEKTCCALAWTAQRLRQYMLYYTTLL 60
            MGCVLGQHD +GRKEQA+YYLSKKFT  E +YS +EKTCCALAW  +RLRQYMLYYTT L
Sbjct: 1156 MGCVLGQHDDSGRKEQAIYYLSKKFTDCETRYSQVEKTCCALAWATRRLRQYMLYYTTWL 1215

Query: 61   ISKMDPIKYIFEKPSLSGRIAKWQVLLSEYDIVYVTKKAIKGSAIADCLAELPVEDYEPM 120
            ISKMDPIKYIFEKPSLSGRIA+WQVLLSEYDIVYVT+KAIKGSA+AD LA+ P+ DY P+
Sbjct: 1216 ISKMDPIKYIFEKPSLSGRIARWQVLLSEYDIVYVTRKAIKGSALADYLAQQPINDYIPV 1275

Query: 121  KFEFPDEDIMTISTLNATQDPETWTMLFDGATNEMGHGVGAILISPDGKLYPLTAKLYFD 180
            KF+FPDE I TI+    + DP+TWTM+FDGA+NE+GHG+G ILISP G+LYPLTA+L FD
Sbjct: 1276 KFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHGIGGILISPKGELYPLTARLCFD 1335

Query: 181  CTNNMAEYEACSMGIQMAYDMKIKKLQVFGDSLLVVHQLNGEWETRDSKLIPYNKYIREL 240
            CT+NMAEYEACSMG+Q A DMK+KKL+VFGDS+LV+HQL GEWETRD KL+PY + I EL
Sbjct: 1336 CTHNMAEYEACSMGVQAAVDMKVKKLKVFGDSMLVIHQLRGEWETRDVKLLPYKQLITEL 1395

Query: 241  AQTFESITFEHVPRENNQVADALATLSAMFNVARNEEIQPISIEKRETPAYCLSVEQEPD 300
            +Q F+ I+F+++PRENNQVADALATL+ MFN+  NE++ PI + +R+ PA C+S+E+EPD
Sbjct: 1396 SQEFDEISFDYLPRENNQVADALATLAVMFNLELNEDVSPIKVGRRDVPASCMSIEEEPD 1455

Query: 301  GKPWYHDIKHYITCREYPLGASENDKRTIRKLAMSFFLNGDVLYKRNYDMTLLRCVDAFE 360
            G PW+H+IK YI  +EYP  ASENDKRT+RKLAM FFLNG++LYKRN+DM LLRCV+  +
Sbjct: 1456 GNPWFHNIKXYIKSKEYPPNASENDKRTLRKLAMKFFLNGEILYKRNHDMVLLRCVEGRD 1515

Query: 361  AKRILEEVHEGVCGTHTNGHMMSRQALRAGYYWLTMESDCIKYARKCHKCQIYADKVHAP 420
            A RI+EEVHEGVCGTH NGHMM+RQ LRAGYYWLT+ +DCIKYARKCHKCQIY+DK HAP
Sbjct: 1516 ANRIMEEVHEGVCGTHANGHMMARQILRAGYYWLTIXTDCIKYARKCHKCQIYSDKTHAP 1575

Query: 421  SSPLHVLTASWPFSMWGMDVIGPIEPKVSNGHRFILVAINYFTKWVEAASYKSVTKQAVV 480
            +S LH LTA WPFSMWGMD+IGPIEPK SNGHRFILVAI+YFTKWVEAAS + VTK  VV
Sbjct: 1576 ASHLHTLTAPWPFSMWGMDLIGPIEPKASNGHRFILVAIDYFTKWVEAASDRDVTKGVVV 1635

Query: 481  KFIRKDIICWYGLPKRIITDNGRNLNNKLMEELCTQFKVKHSNTTPYRPRMNGAVEAANK 540
            KFI+ +IIC YGLP+ II+DN RNLNNKLM ELC  FK+KH N+TPYRP+MNGAVEAANK
Sbjct: 1636 KFIKNEIICRYGLPQTIISDNARNLNNKLMSELCEHFKIKHFNSTPYRPKMNGAVEAANK 1695

Query: 541  NIKRIIQKMTVTYKDWHEMLPFALHGYRTSVRTSTGATPFSLVYGLEAVLPIEVEVPSLT 600
            NIKRI++KMTVTY+DWHEMLPFALHGYRTSVRTSTGATPFSLVYG++AVLPIEVE+PSL 
Sbjct: 1696 NIKRIVEKMTVTYRDWHEMLPFALHGYRTSVRTSTGATPFSLVYGMKAVLPIEVEIPSLR 1755

Query: 601  VIQEVELEEAEWIQTR-------------------------------KVRRRCFQEGDLV 660
            VI E +L+EAEW+Q R                               KV  R F+EGDLV
Sbjct: 1756 VIMEAKLQEAEWVQRRYEQLNFVEEKRLTALCRGQLYQRRMMKAYDKKVHPRRFREGDLV 1815

Query: 661  LKRILPSQKDHRGKWTPNYEGPYVVKKAFTGGALILTNMDGTDLPNPLSVDYVKKYYA 688
            LK ILP QKDHRGKWT NYEGP+VVKKAF+GGAL+L NMDGT+  NP++ D+V+KYYA
Sbjct: 1816 LKIILPLQKDHRGKWTANYEGPFVVKKAFSGGALVLANMDGTEFLNPVNSDHVRKYYA 1873

BLAST of ClCG04G001902 vs. ExPASy TrEMBL
Match: A0A6J1DZ90 (Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111024415 PE=4 SV=1)

HSP 1 Score: 1104.7 bits (2856), Expect = 0.0e+00
Identity = 518/718 (72.14%), Positives = 600/718 (83.57%), Query Frame = 0

Query: 1    MGCVLGQHDSTGRKEQAVYYLSKKFTSYELKYSLLEKTCCALAWTAQRLRQYMLYYTTLL 60
            MGCVLGQHD +GRKEQA+YYLSKKFT  E +YS +EKTCCALAW A+RLRQYMLYYTT L
Sbjct: 985  MGCVLGQHDDSGRKEQAIYYLSKKFTDCETRYSQVEKTCCALAWVARRLRQYMLYYTTWL 1044

Query: 61   ISKMDPIKYIFEKPSLSGRIAKWQVLLSEYDIVYVTKKAIKGSAIADCLAELPVEDYEPM 120
            ISKMDPIKYIFEKPSLSGRIA+WQVLLSEYDIVYVT+KAIKGSA+AD LA+ P+ DY P+
Sbjct: 1045 ISKMDPIKYIFEKPSLSGRIARWQVLLSEYDIVYVTQKAIKGSALADYLAQQPINDYIPV 1104

Query: 121  KFEFPDEDIMTISTLNATQDPETWTMLFDGATNEMGHGVGAILISPDGKLYPLTAKLYFD 180
            KF+FPDE I TI+    + DP+TWTM+FDGA+NE+GH +GAILISP G+LYPLT KL FD
Sbjct: 1105 KFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHEIGAILISPKGELYPLTTKLCFD 1164

Query: 181  CTNNMAEYEACSMGIQMAYDMKIKKLQVFGDSLLVVHQLNGEWETRDSKLIPYNKYIREL 240
            CT+NMAEYEACSMG+Q A DMK+KK +VFGDS LV+HQL GEWETRD KL+PY + I EL
Sbjct: 1165 CTHNMAEYEACSMGVQAAIDMKVKKFKVFGDSTLVIHQLRGEWETRDVKLLPYKQLITEL 1224

Query: 241  AQTFESITFEHVPRENNQVADALATLSAMFNVARNEEIQPISIEKRETPAYCLSVEQEPD 300
            +Q F+ I+F+++PRENNQVADALATL+ MFN+  NE+++PI + +R+ PA C+S+E+EPD
Sbjct: 1225 SQEFDEISFDYLPRENNQVADALATLAVMFNLELNEDVRPIKVGRRDVPASCMSIEEEPD 1284

Query: 301  GKPWYHDIKHYITCREYPLGASENDKRTIRKLAMSFFLNGDVLYKRNYDMTLLRCVDAFE 360
            G PW+HDIK YI  +EY   ASENDKRT+RKLAM FFLNG++LYKRN+DM LLRCV+  +
Sbjct: 1285 GNPWFHDIKQYIKSKEYQPNASENDKRTLRKLAMKFFLNGEILYKRNHDMVLLRCVEGRD 1344

Query: 361  AKRILEEVHEGVCGTHTNGHMMSRQALRAGYYWLTMESDCIKYARKCHKCQIYADKVHAP 420
            A RI+EEVHEGVCGTH NGHMM+RQ LRAGYYWLT+E+DCIKYARKCHKCQIY+DK HAP
Sbjct: 1345 ANRIMEEVHEGVCGTHANGHMMARQILRAGYYWLTIETDCIKYARKCHKCQIYSDKTHAP 1404

Query: 421  SSPLHVLTASWPFSMWGMDVIGPIEPKVSNGHRFILVAINYFTKWVEAASYKSVTKQAVV 480
            +S LH LTA WPFSMWGMDVIGPIEPK S+GHRFILVAI+YFTKWVEAASY+ VTK  VV
Sbjct: 1405 TSHLHTLTAPWPFSMWGMDVIGPIEPKASSGHRFILVAIDYFTKWVEAASYRDVTKGVVV 1464

Query: 481  KFIRKDIICWYGLPKRIITDNGRNLNNKLMEELCTQFKVKHSNTTPYRPRMNGAVEAANK 540
            KFI+K IIC YGLP+ II+DN RNLNNKLM ELC QFK+KH N+TPYRP+MNGAVEAANK
Sbjct: 1465 KFIKKKIICRYGLPETIISDNARNLNNKLMSELCEQFKIKHLNSTPYRPKMNGAVEAANK 1524

Query: 541  NIKRIIQKMTVTYKDWHEMLPFALHGYRTSVRTSTGATPFSLVYGLEAVLPIEVEVPSLT 600
            NIKRI++KMTVTY DWHEMLPFALHGYRTSVRTSTG TPFSLVYG+E VL IEVE+PSL 
Sbjct: 1525 NIKRIVEKMTVTYIDWHEMLPFALHGYRTSVRTSTGTTPFSLVYGMEVVLLIEVEIPSLR 1584

Query: 601  VIQEVELEEAEWIQTR-------------------------------KVRRRCFQEGDLV 660
            VI E +L  AEW+Q R                               KV  R F+EGDLV
Sbjct: 1585 VIMEAKLXRAEWVQRRYEQLNFVEEKRLTALCRGQLYQRRMMKAYDEKVHPRRFREGDLV 1644

Query: 661  LKRILPSQKDHRGKWTPNYEGPYVVKKAFTGGALILTNMDGTDLPNPLSVDYVKKYYA 688
            LKRILP QKDHRGKWTPNYEGP+VVKKAF+GGAL+L NMDGT+  NP++ D+V+KYYA
Sbjct: 1645 LKRILPLQKDHRGKWTPNYEGPFVVKKAFSGGALVLANMDGTEFLNPVNSDHVRKYYA 1702

BLAST of ClCG04G001902 vs. ExPASy TrEMBL
Match: A0A6J1D7C7 (Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111018303 PE=4 SV=1)

HSP 1 Score: 1104.0 bits (2854), Expect = 0.0e+00
Identity = 516/718 (71.87%), Positives = 606/718 (84.40%), Query Frame = 0

Query: 1    MGCVLGQHDSTGRKEQAVYYLSKKFTSYELKYSLLEKTCCALAWTAQRLRQYMLYYTTLL 60
            +GCVLGQHD +GRKEQA+YYLSKKFT  E +YS +EKTCCALAW A+RLRQYMLYYTT L
Sbjct: 952  IGCVLGQHDDSGRKEQAIYYLSKKFTDCETRYSQVEKTCCALAWVARRLRQYMLYYTTWL 1011

Query: 61   ISKMDPIKYIFEKPSLSGRIAKWQVLLSEYDIVYVTKKAIKGSAIADCLAELPVEDYEPM 120
            ISKMDPI+YIFEKPSLSGRIA+WQVLLSEYDIVYVT+KAIKGSA+AD LA+ P+ DY P+
Sbjct: 1012 ISKMDPIRYIFEKPSLSGRIARWQVLLSEYDIVYVTQKAIKGSALADYLAQQPINDYIPV 1071

Query: 121  KFEFPDEDIMTISTLNATQDPETWTMLFDGATNEMGHGVGAILISPDGKLYPLTAKLYFD 180
            KF+FPDE I TI+    + DP+TWTM+FDGA+NE+GHG+GAILISP G+LYPLTA+L FD
Sbjct: 1072 KFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHGIGAILISPKGELYPLTARLCFD 1131

Query: 181  CTNNMAEYEACSMGIQMAYDMKIKKLQVFGDSLLVVHQLNGEWETRDSKLIPYNKYIREL 240
            CT+NMAEYEACSMG+Q A DMK+ K +VFGDS+LV+HQL GEWE RD KL+PY + I EL
Sbjct: 1132 CTHNMAEYEACSMGVQAAVDMKV-KXKVFGDSMLVIHQLRGEWEIRDVKLLPYKQLITEL 1191

Query: 241  AQTFESITFEHVPRENNQVADALATLSAMFNVARNEEIQPISIEKRETPAYCLSVEQEPD 300
            +Q F+ I+F+++PRENNQVADALATL+ MFN+  NE+++PI + +R+  A C+S+E+EPD
Sbjct: 1192 SQEFDEISFDYLPRENNQVADALATLAVMFNLELNEDVRPIKVGRRDVSASCMSIEEEPD 1251

Query: 301  GKPWYHDIKHYITCREYPLGASENDKRTIRKLAMSFFLNGDVLYKRNYDMTLLRCVDAFE 360
            G PW+HDIK YI  +EYP  ASENDKRT+RKLAM FFLN ++LYKRN+DM LLRCV+  +
Sbjct: 1252 GNPWFHDIKQYIKSKEYPPNASENDKRTLRKLAMKFFLNREILYKRNHDMVLLRCVEXRD 1311

Query: 361  AKRILEEVHEGVCGTHTNGHMMSRQALRAGYYWLTMESDCIKYARKCHKCQIYADKVHAP 420
            A RI+EEVHE VCGTH NGHM++RQ LRAGYYWLT+E+DCIKYARKCHKCQIY+DK HAP
Sbjct: 1312 ANRIMEEVHEEVCGTHANGHMIARQILRAGYYWLTIETDCIKYARKCHKCQIYSDKTHAP 1371

Query: 421  SSPLHVLTASWPFSMWGMDVIGPIEPKVSNGHRFILVAINYFTKWVEAASYKSVTKQAVV 480
            +S LH LTA WPFSMWGMDVIGPIEPK SNGHRFILVAI+YFTKWVEAASY+ VTK  VV
Sbjct: 1372 ASHLHTLTAPWPFSMWGMDVIGPIEPKASNGHRFILVAIDYFTKWVEAASYRDVTKGVVV 1431

Query: 481  KFIRKDIICWYGLPKRIITDNGRNLNNKLMEELCTQFKVKHSNTTPYRPRMNGAVEAANK 540
            KFI+K+IIC YGLP+ II+DN RNLNNKLM ELC QFK+KH N+TPYRP+MNGAVEAANK
Sbjct: 1432 KFIKKEIICRYGLPETIISDNARNLNNKLMSELCEQFKIKHLNSTPYRPKMNGAVEAANK 1491

Query: 541  NIKRIIQKMTVTYKDWHEMLPFALHGYRTSVRTSTGATPFSLVYGLEAVLPIEVEVPSLT 600
            NIKRI++KMTVTY+DWHEMLPFALHGYRTSVRTSTGATPFSLVYG+EAVLPIEVE+PSL 
Sbjct: 1492 NIKRIVEKMTVTYRDWHEMLPFALHGYRTSVRTSTGATPFSLVYGMEAVLPIEVEIPSLR 1551

Query: 601  VIQEVELEEAEWIQTR-------------------------------KVRRRCFQEGDLV 660
            VI E +L+EAEW+Q R                               KV  R F+EGDLV
Sbjct: 1552 VIMEAKLQEAEWVQRRYEQLNFVEEKRLTALYRGQLYQRRMMKAYDKKVHSRRFREGDLV 1611

Query: 661  LKRILPSQKDHRGKWTPNYEGPYVVKKAFTGGALILTNMDGTDLPNPLSVDYVKKYYA 688
            LKRILP QKDHRGKWTPNYEGP+V+KKAF+GGAL+L NMDGT+  NP++ D+V+KYYA
Sbjct: 1612 LKRILPLQKDHRGKWTPNYEGPFVLKKAFSGGALVLANMDGTEFLNPINSDHVRKYYA 1668

BLAST of ClCG04G001902 vs. ExPASy TrEMBL
Match: A0A6J1CNY7 (Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111013372 PE=4 SV=1)

HSP 1 Score: 1094.7 bits (2830), Expect = 0.0e+00
Identity = 512/718 (71.31%), Positives = 598/718 (83.29%), Query Frame = 0

Query: 1    MGCVLGQHDSTGRKEQAVYYLSKKFTSYELKYSLLEKTCCALAWTAQRLRQYMLYYTTLL 60
            MGCVLGQHD +GRKEQA+YYLSKKFT  E +YS +EKTCCALAW A+RLRQYMLYYTT L
Sbjct: 1437 MGCVLGQHDDSGRKEQAIYYLSKKFTDCETRYSQVEKTCCALAWVARRLRQYMLYYTTWL 1496

Query: 61   ISKMDPIKYIFEKPSLSGRIAKWQVLLSEYDIVYVTKKAIKGSAIADCLAELPVEDYEPM 120
            ISKMDPIKYIFEK SLS RIA+ QVLLSEYDIVYVT+KAIKGSA+AD LA+ P+ DY P+
Sbjct: 1497 ISKMDPIKYIFEKSSLSXRIARXQVLLSEYDIVYVTQKAIKGSALADYLAQQPINDYIPV 1556

Query: 121  KFEFPDEDIMTISTLNATQDPETWTMLFDGATNEMGHGVGAILISPDGKLYPLTAKLYFD 180
            KF+FPDE I TI+    + DP+TWTM+FDGA+NE+GHG+GAILISP G+LYPLTA+L FD
Sbjct: 1557 KFDFPDEYISTITASEESLDPQTWTMMFDGASNELGHGIGAILISPKGELYPLTARLCFD 1616

Query: 181  CTNNMAEYEACSMGIQMAYDMKIKKLQVFGDSLLVVHQLNGEWETRDSKLIPYNKYIREL 240
            CT+NMAEYEACSMG+Q A DMK+KKL+VFGDS+LV+HQL GEWETRD KL+PY + I EL
Sbjct: 1617 CTHNMAEYEACSMGVQAAVDMKVKKLKVFGDSMLVIHQLRGEWETRDVKLLPYKQLITEL 1676

Query: 241  AQTFESITFEHVPRENNQVADALATLSAMFNVARNEEIQPISIEKRETPAYCLSVEQEPD 300
            +Q F+ ++F+++PRENNQV DALATL+ MFN+  NE++ PI + +R+ PA C+S+E+EPD
Sbjct: 1677 SQEFDEMSFDYLPRENNQVXDALATLAVMFNLELNEDVCPIKVGRRDVPASCMSIEEEPD 1736

Query: 301  GKPWYHDIKHYITCREYPLGASENDKRTIRKLAMSFFLNGDVLYKRNYDMTLLRCVDAFE 360
            G PW+HDIK YI  +EYP  ASENDKRT RKLAM FFLNG++LYKRN+DM LLRCV+  +
Sbjct: 1737 GNPWFHDIKQYINSKEYPPNASENDKRTFRKLAMKFFLNGEILYKRNHDMVLLRCVEGRD 1796

Query: 361  AKRILEEVHEGVCGTHTNGHMMSRQALRAGYYWLTMESDCIKYARKCHKCQIYADKVHAP 420
            A RI+EEVHEGVC TH NGHM++RQ LRAGYYWLT+ +DCIKYARKCHKCQIYADK HAP
Sbjct: 1797 ANRIMEEVHEGVCDTHANGHMIARQILRAGYYWLTIXTDCIKYARKCHKCQIYADKTHAP 1856

Query: 421  SSPLHVLTASWPFSMWGMDVIGPIEPKVSNGHRFILVAINYFTKWVEAASYKSVTKQAVV 480
            +S LH LTA WPFSMWGMDVIGPIEPK SNGHRFILVAI+YFT WVEAASY+ VTK  VV
Sbjct: 1857 ASHLHTLTAPWPFSMWGMDVIGPIEPKASNGHRFILVAIDYFTNWVEAASYRDVTKGVVV 1916

Query: 481  KFIRKDIICWYGLPKRIITDNGRNLNNKLMEELCTQFKVKHSNTTPYRPRMNGAVEAANK 540
            KFI+K+IIC YGLP+ II+DN RNLNNKL  ELC QFK+KH N+TPYRP+MNGAVEAANK
Sbjct: 1917 KFIKKEIICRYGLPETIISDNARNLNNKLXSELCEQFKIKHLNSTPYRPKMNGAVEAANK 1976

Query: 541  NIKRIIQKMTVTYKDWHEMLPFALHGYRTSVRTSTGATPFSLVYGLEAVLPIEVEVPSLT 600
            NIKRI++KMTVTY+DWH MLPFALHGYRTSVRTSTGATPFSLVYG+  VLPIEVE+PSL 
Sbjct: 1977 NIKRIVEKMTVTYRDWHGMLPFALHGYRTSVRTSTGATPFSLVYGMXVVLPIEVEIPSLR 2036

Query: 601  VIQEVELEEAEWIQTR-------------------------------KVRRRCFQEGDLV 660
            VI E +L+EAEW+Q R                                V  R F+EGDLV
Sbjct: 2037 VIMEAKLQEAEWVQRRYEQLDFVEEKRLTALCRGQLYQSRMMKAYDENVHPRRFREGDLV 2096

Query: 661  LKRILPSQKDHRGKWTPNYEGPYVVKKAFTGGALILTNMDGTDLPNPLSVDYVKKYYA 688
            LKRILP QKDHRGKWTPNYEGP++VKKAF+GGAL+L NMDGT+  NP++ D+V+KYYA
Sbjct: 2097 LKRILPLQKDHRGKWTPNYEGPFLVKKAFSGGALVLANMDGTEFLNPVNXDHVRKYYA 2154

BLAST of ClCG04G001902 vs. TAIR 10
Match: AT3G01410.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 84.0 bits (206), Expect = 5.3e-16
Identity = 48/142 (33.80%), Positives = 76/142 (53.52%), Query Frame = 0

Query: 131 TISTLNATQDPETWTMLFDGAT--NEMGHGVGAILISPDGKLYPLTAKLYFDCTNNMAEY 190
           ++ T    +  ++ T+ FDGA+  N    G GA+L + D  +     +   + TNN+AEY
Sbjct: 142 SLLTRTPIRQNDSCTIEFDGASKGNPGKAGAGAVLRASDNSVLFYLREGVGNATNNVAEY 201

Query: 191 EACSMGIQMAYDMKIKKLQVFGDSLLVVHQLNGEWETRDSKLIPYNKYIRELAQTFESIT 250
            A  +G++ A D   K + V GDS+LV  Q+ G W+T   K+    K  +EL  +F++  
Sbjct: 202 RALLLGLRSALDKGFKNVHVLGDSMLVCMQVQGAWKTNHPKMAELCKQAKELMNSFKTFD 261

Query: 251 FEHVPRENNQVADALATLSAMF 271
            +H+ RE N  AD  A  SA+F
Sbjct: 262 IKHIAREKNSEADKQAN-SAIF 282

BLAST of ClCG04G001902 vs. TAIR 10
Match: AT3G01410.2 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 84.0 bits (206), Expect = 5.3e-16
Identity = 48/142 (33.80%), Positives = 76/142 (53.52%), Query Frame = 0

Query: 131 TISTLNATQDPETWTMLFDGAT--NEMGHGVGAILISPDGKLYPLTAKLYFDCTNNMAEY 190
           ++ T    +  ++ T+ FDGA+  N    G GA+L + D  +     +   + TNN+AEY
Sbjct: 142 SLLTRTPIRQNDSCTIEFDGASKGNPGKAGAGAVLRASDNSVLFYLREGVGNATNNVAEY 201

Query: 191 EACSMGIQMAYDMKIKKLQVFGDSLLVVHQLNGEWETRDSKLIPYNKYIRELAQTFESIT 250
            A  +G++ A D   K + V GDS+LV  Q+ G W+T   K+    K  +EL  +F++  
Sbjct: 202 RALLLGLRSALDKGFKNVHVLGDSMLVCMQVQGAWKTNHPKMAELCKQAKELMNSFKTFD 261

Query: 251 FEHVPRENNQVADALATLSAMF 271
            +H+ RE N  AD  A  SA+F
Sbjct: 262 IKHIAREKNSEADKQAN-SAIF 282

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
XP_022147189.1 | 0.0e+00 | 72.28 | LOW QUALITY PROTEIN: uncharacterized protein LOC111016200 [Momordica charantia]
XP_022158986.1 | 0.0e+00 | 71.73 | LOW QUALITY PROTEIN: uncharacterized protein LOC111025431 [Momordica charantia]
XP_022157796.1 | 0.0e+00 | 72.14 | LOW QUALITY PROTEIN: uncharacterized protein LOC111024415 [Momordica charantia]
XP_022150030.1 | 0.0e+00 | 71.87 | LOW QUALITY PROTEIN: uncharacterized protein LOC111018303 [Momordica charantia]
XP_022143495.1 | 0.0e+00 | 71.31 | LOW QUALITY PROTEIN: uncharacterized protein LOC111013372 [Momordica charantia]

Match Name | E-value | Identity | Description
Q9UR07 | 3.3e-23 | 21.24 | Transposon Tf2-11 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24...
P0CT41 | 3.3e-23 | 21.24 | Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24...
P0CT34 | 3.3e-23 | 21.24 | Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
P0CT35 | 3.3e-23 | 21.24 | Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
P0CT36 | 3.3e-23 | 21.24 | Transposon Tf2-3 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...

Match Name | E-value | Identity | Description
A0A6J1D099 | 0.0e+00 | 72.28 | Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111016200 PE=4 SV=1
A0A6J1E2J7 | 0.0e+00 | 71.73 | Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111025431 PE=4 SV=1
A0A6J1DZ90 | 0.0e+00 | 72.14 | Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111024415 PE=4 SV=1
A0A6J1D7C7 | 0.0e+00 | 71.87 | Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111018303 PE=4 SV=1
A0A6J1CNY7 | 0.0e+00 | 71.31 | Ribonuclease H OS=Momordica charantia OX=3673 GN=LOC111013372 PE=4 SV=1

Match Name | E-value | Identity | Description
AT3G01410.1 | 5.3e-16 | 33.80 | Polynucleotidyl transferase, ribonuclease H-like superfamily protein
AT3G01410.2 | 5.3e-16 | 33.80 | Polynucleotidyl transferase, ribonuclease H-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001584 | Integrase, catalytic core | PFAM | PF00665 | rve | coord: 432..529; e-value: 1.8E-12; score: 47.4
IPR001584 | Integrase, catalytic core | PROSITE | PS50994 | INTEGRASE | coord: 428..587; score: 23.540461
None | No IPR available | GENE3D | 1.10.340.70 | - | coord: 322..413; e-value: 2.3E-13; score: 52.2
None | No IPR available | PANTHER | PTHR24559:SF322 | RNA-DIRECTED DNA POLYMERASE (REVERSE TRANSCRIPTASE), RIBONUCLEASE H-LIKE PROTEIN | coord: 12..505
None | No IPR available | PANTHER | PTHR24559 | TRANSPOSON TY3-I GAG-POL POLYPROTEIN | coord: 12..505
None | No IPR available | CDD | cd09279 | RNase_HI_like | coord: 144..266; e-value: 1.63815E-45; score: 156.479
IPR036397 | Ribonuclease H superfamily | GENE3D | 3.30.420.10 | - | coord: 424..623; e-value: 2.3E-54; score: 185.7
IPR036397 | Ribonuclease H superfamily | GENE3D | 3.30.420.10 | - | coord: 145..282; e-value: 1.3E-32; score: 114.6
IPR041373 | Reverse transcriptase, RNase H-like domain | PFAM | PF17917 | RT_RNaseH | coord: 1..90; e-value: 9.7E-18; score: 64.4
IPR041588 | Integrase zinc-binding domain | PFAM | PF17921 | Integrase_H2C2 | coord: 361..413; e-value: 1.4E-7; score: 31.4
IPR002156 | Ribonuclease H domain | PFAM | PF13456 | RVT_3 | coord: 148..265; e-value: 1.9E-24; score: 85.8
IPR002156 | Ribonuclease H domain | PROSITE | PS50879 | RNASE_H | coord: 140..269; score: 15.590564
IPR012337 | Ribonuclease H-like superfamily | SUPERFAMILY | 53098 | Ribonuclease H-like | coord: 430..593
IPR012337 | Ribonuclease H-like superfamily | SUPERFAMILY | 53098 | Ribonuclease H-like | coord: 142..264
IPR043502 | DNA/RNA polymerase superfamily | SUPERFAMILY | 56672 | DNA/RNA polymerases | coord: 2..90
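
The alignment coordinates in the InterPro annotations can be ordered along the protein to read off a domain layout. The following is a minimal sketch using only the four Pfam hits listed in this record; the tuple layout and the helper names `domain_order` and `any_overlap` are hypothetical choices for illustration.

```python
# Pfam hits copied from the InterPro annotations of this record:
# (source term, source description, start, end), coordinates in residues.
pfam_hits = [
    ("PF00665", "rve", 432, 529),
    ("PF17917", "RT_RNaseH", 1, 90),
    ("PF17921", "Integrase_H2C2", 361, 413),
    ("PF13456", "RVT_3", 148, 265),
]


def domain_order(hits):
    """Return domain names sorted by start coordinate (N- to C-terminus)."""
    return [name for _, name, _, _ in sorted(hits, key=lambda h: h[2])]


def any_overlap(hits):
    """Return True if any two consecutive hits (by start) share residues."""
    ordered = sorted(hits, key=lambda h: h[2])
    return any(prev[3] >= nxt[2] for prev, nxt in zip(ordered, ordered[1:]))


print(" - ".join(domain_order(pfam_hits)))
print("overlapping Pfam hits:", any_overlap(pfam_hits))
```

Sorted this way, the Pfam hits run from the reverse transcriptase RNase H-like domain through the ribonuclease H domain and the integrase zinc-binding domain to the integrase catalytic core, with no overlapping coordinates.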

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
ClCG04G001902.1 | ClCG04G001902.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0071897 DNA biosynthetic process
biological_process GO:0015074 DNA integration
biological_process GO:0006310 DNA recombination
biological_process GO:0090502 RNA phosphodiester bond hydrolysis, endonucleolytic
cellular_component GO:0030430 host cell cytoplasm
cellular_component GO:0005634 nucleus
molecular_function GO:0003887 DNA-directed DNA polymerase activity
molecular_function GO:0003723 RNA binding
molecular_function GO:0004523 RNA-DNA hybrid ribonuclease activity
molecular_function GO:0003676 nucleic acid binding