CmaCh09G001540 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh09G001540
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: DNA-3-methyladenine glycosylase 1
Location: Cma_Chr09: 658957 .. 663729 (-)
RNA-Seq Expression: CmaCh09G001540
Synteny: CmaCh09G001540
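The location field can be read programmatically. The sketch below is illustrative only: the helper names (`parse_location`, `span_length`, `reverse_complement`) are hypothetical, and it assumes the coordinates are 1-based and inclusive, which is the usual genome-browser convention but is not stated on this page. Because the feature lies on the minus strand, its transcribed sequence would correspond to the reverse complement of the reference strand over that interval.

```python
import re

def parse_location(text):
    """Parse 'Chrom: start .. end (strand)' into (chrom, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

def reverse_complement(seq):
    """Reverse-complement a DNA string (A/C/G/T only, case preserved)."""
    return seq.translate(str.maketrans("ACGTacgt", "TGCAtgca"))[::-1]

chrom, start, end, strand = parse_location("Cma_Chr09: 658957 .. 663729 (-)")
length = span_length(start, end)
```

Under the inclusive-coordinate assumption, the interval spans 4773 bp.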
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CACACGCCAACAAAGGAGCGTGAAGAAGCGGAGAGTTGATTTCCCAGACTGGAGGGGAAATCGAAAGCCAGTTGGGTTAGAGAACTGAGGAGCGAGCGGCCAGAGACAGAGATAGAGAAGAGAAAGAGACTGAAAGAAGAACGTCACCGGGATTTCATTCTCCTCAGTCGCTGGTTATACCATTCACAAACTACTTGTCTATTCTTTAACCTAATTCAAAATCCCTAATCTTTTCAATTTCGCATTTACTTTTCAATTTCTCTGTTAGTCTCAATCCAGTTCTTGTTCTTTCACCTGGGGCATCGAGTTTTGCCCTTCTTCCGATTCTCCAACGCCGCTGATCATGGTTGTGTACAGTGTTTCTTTCAACTGAATTGCCTGGGTGTAAGGAGATAGCGCAGATTGAATCCTTTACATATGGGAGAGCACACGCAAGTGCAGGTTCAGACTCAGACCCAAAGCCAATCTCAAGCGCAATCGCAAGCTCAGAACACGTTTCATGAATCCTCTAACTCCACAACCACTATCGCCCAAGCTACTGTGTCATTAAGCGAGGTGATGAATGCGCCATCGCAAACCTCTTCTCCGCCATCCAAAATGCCCTTGCGTCCGCGGAAGATCCGGAAGCTCTCGCCTGATGAATCGGATCCAAATTCCTCTCAGGTTGTTGCCATTCCGGATGGGCCGAAACCTATAGCCACCAGCAAATCTAACAAGAGCAAGACGGCCCAGCAACGCGCCGCATTCGCGTCTGCCCCGGTAATGCTTGCCCGATCACTTTCCTGTGAAGGCGAGGTGGAAGTCGCGCTTCGACATCTCCGGAATGCCGATCCGCTCCTTGCACCTTTGATCGACCTTCATCAACGTCCTACCTTCGACAGTTTTCAAACCCCATTCCTTGCCCTTACTAGAAGTATCCTTTATCAGCAGCTGGCTTATAAAGCTGGCACCTCAATCTACACTCGTTTCATCGCTCTCTGTGGCGGCGAGGCTGGCGTTCTTCCTGAAACTGTACTTGCCTTGAGCCCTCAACAGCTCAGGCAAATTGGAATTTCGGGTCGTAAATCTAGTTACCTTCATGATCTTGCGAGGAAGTATCAAAATGGGATTCTTTCAGACCCCGCAATTGTGAATATGGATGATAAATCGCTTTTCACGATGCTTACAATGGTCAATGGAATTGGGTCTTGGTCTGTTCATATGTTCATGATTTTCTCGCTGCACAGACCAGACGTGCTTCCTATCAATGATCTTAATGTTCGCAAAGGTGTTCAACTTCTCTACAATCTTGAAGAGTTGCCTCGACCATCGCAGATGGATCACTTATGTGAGAAGTGGAGGCCGTATCGATCAGTTGGGTCGTGGTATATGTGGAGGTTTGCTGAGGCAAAGGGGGCTTCTTCAAGCGCAGCAGCAATAGCTGCTGGTGCTAGTTTACAACTGCAGCAACAAGAGCACCAGCCAGAGCACCAGCATCAACAGCAGCCGCAGCTTCTTGACCCACTTAATAGCATTCTCAATCTTGGGTAAAACCTTTATGGCTTTGTTATCTTCTCTTGTTTATATTTTTCCACGTATTGCATGTTGTTTAAGTTGAAAAATCTCATCTTCCTCTATCGATGTAGAAACTAAAATAGTTTTTCTTTTTTGTGTTGGAGCTTCCATTATTTGTAGTCCCAAATTGTGTTTCTTGCATCCATTGTTTATCAATGCGGCATACCATACTTATACATGGTGTTTAGAGTGAAGTTTTTTTTTTTTCTTCTTCTTTTAATTGCATCTAGCTTCTAACGAAAGATGTCTGAATATCTAAATTGACGATACTTATTTGCATGTTTCTTGTGATATATGAAAGTAAGCTTTGCGTGGTATATTTCGACCTTTTTTCTAGCATCTTTATGCTTCTTTTTCTTGTCTGTACTCTTTTAACTCTATAACGTGCTTCCAATTATCTGCATTGTTTTCTCAGATACTAATTTTCTTGCAATCCTTTTACTTAT
GAAATAAGCAGAAAGTTTCATCGATATAATGAACTATACAAAGCCACAACCTATGGGCTATATTAAATACTTGTTCCCTTTATTGTTTAGATGCTTAGTGAAAATGAGCACCGTTCATGCTTGTTAGCCACATTATTTTCGTCTTCGGTTTCTGAAGGCTTCTTATGGTAGGCGAACATAGCATAGGCCAATATCATTGAAACAAAGCCACATGCTAAATTTGATAATATAGGAGCAGATTTCTGCATTTGTGGAATATTTATTCAAGTCAGTCATCTGTAGTGAAAAGTAGTTCTTCAGAATATTAGAAAGGTATATCATGTTCGAAGTTTAAGATGGTAACTTGAGTATACCTTGGGTGTAGTTGGATCATTGATCTTCTTGGAGTGGCAAGGAATGTGGAAGATGAATATGGAAGTGAGGATAACTCTTTAAGGGGCATCGATTATTCATATCCTCCTCCTCTTCTTCTTCTCCTCTTCTTTGGTGCTTTCACGTGGTGAATGCATACTGTTTTTCTCCCCCTTATGCCGTTTTTGTTTCTTTTTCTTTATTTTCTTGGAAGTCTTGAAAGTTTGTAGTCTGAACTTGATAGATTGACAGACACATGTTTGGAATTTGTCCTTGTTTTGATTTGCTAATATTAGAAGTCTAGAACTCTCAGATTTCGATTTTATATTCAATTTTTTATGAAGCCATAGCTTACCACTATCAATGATTTGAAAAGCGCAAAAAAGGTGCGCTTTTCAGAAAAGACAAAGGTAAGGTGCTCCTATGAAGCACACTTGAGGCCCCGTCTCTGTGCTTGAAGAAACCTTTTTTTTTTTTTTAATTTCTAATTTTTTAGGAAGAGATTCAGAAGTGTTCTTAAAAAGTTCGTTTTTCACCAGATTCTTTTATAAAAGTTCTAATTTCTTAAACTTCTCATTATCTTATTAGTTGTAAACCTATATTTTCCATTTATCAAAAAGTTATTTGCATTTTTGTATTTGAGTGCTCCCAAAAAAGCCATCATTTTTTTTTTATCTGCACGCTTTGCAGAAGTCCTAGAAGAGAGTACTGTGTTTTAGTGTGCTGCACTTTGAAAAACACTTGCCACTTATGTTGATTCTATAGTTGAGGTTTGGAGAAGCCATGCAATTTCTCTGTTGGATCCAGAAAATTTAATTTGTATCGCTAAGGTTTATGGTCATGAACACAAATCTTGCTTGTTAATGCTTTTTTTTTTTTTTATAATTGAAAACGATGAGGCAGCAAGATAGGCCCCTTTTCTGGAATAAAGAGAGAACAATCTGTGGAGCACTTGACAAAAACTTATTGTTGCTGCCCCAGGATCATCTTAGGCATCTGTTCTGTCTTGGCCGTTGGTTCTTGAATGATTGCTCTCTCGTCTCTTTTCCTATATTTTGGATCATTGGGAAGCAGTAACAAACTTTTCTCTGTTCTTTAATTGGTGTATCGTTCAATTCCTTTATTTGCTACTCTTCATGTGATTCGTGTTGCTCATGTGGATCATGTTGTTTATGTTTAGGGCCTGTGCTTGGGGGCAGTGACTCGGATCGAAAAGAGTACATCTTTGCAGATAGCCCAATCAGTTCATCCACTGAATGATAAATATGCCAATTATGTGGGAGTAGAAGACTCAGCAATGAATATTTGGCTCCTTGAGGTTGGATACTCTAGACTTCAATATTAAATAATGAAATGACATGTGCTAGTTTCCTTTTCTAGGTAGGTTGATGTCACTTTTAAGGCCGTCAACATGTATCATGTGGCAAGGGATATTGATAGAATGTTCCTCTTGAATGTCTGTGCTTTCAATGTGATTTAGCGCGAGTTCGACTTCGAGAAACCCACTTGTTTTCTTGCGCATATTCAATGACAGAACAATCTGTAATGACTTTATGATCCATCTTTTACCCTTTTCTCTACTTCAATTTTTCTGAAACATCGATCTTAGTTTGTACATAGAAAATAATCATTCATGCGGATAATCGATG
GCTGGCTGTGACGTCTGCTATTCGGACTCGTAATCCTATCTCATCTTGCCTCATCTATTCCACATGGGAGAAAATGCATTATTGCTCAGTAGCCTCATTCATCGTATAATTTACCTTTGAGAACTTGATTTATAGGGACTATGAAGCATGAATTCTGATGATTTTAGCTGGAGAAACGCTGCAGTTGGATTTGTAAACTTGTCACTTCGAATCAAATGGCTTTGAATATCTGAAAGTGACAAGATTTGGGCCGACTACTTGTTGAAATGAACTTTAGAATCAATCAAAGTTGAATGAAAGAAATTACATTGGTTTGAAGTTTCTAAATGTAACCAAGCGAATTGAATGCACTCTTGCTTTTAATCTATTTTTTCTGAATTTATTTGCACCATTTATTAATTGGTTACATATTTAATAACCTACGTATGATTACTTTTACTTTCTTTTACGTATGATTATTTTTTCTTTCTTTTATGTATGATTACTTTTTCTTTCTTTTTCGATTCAATGTGATTGATAGAATATGTTAGAAGTAATTGAAGTATGTTCAGTGTTAATTTAGTATTAACACTATTTACCTTTATAACAAAAAATAAAATATTTTTACCATCAATATATTGAGGAAAATTCTTATCATCAATAGTTAAAAGAGTTACCATCATCAATTGTTAAAAGAGTATTATTGAAAGTCGTTTGTAAAATGAAGTGTTATCGGGTGCATTTATTTATATATATATATATATATTTATTTATTTAGTTTAAAGGTATTTTAA

mRNA sequence

CACACGCCAACAAAGGAGCGTGAAGAAGCGGAGAGTTGATTTCCCAGACTGGAGGGGAAATCGAAAGCCAGTTGGGTTAGAGAACTGAGGAGCGAGCGGCCAGAGACAGAGATAGAGAAGAGAAAGAGACTGAAAGAAGAACGTCACCGGGATTTCATTCTCCTCAGTCGCTGGTTATACCATTCACAAACTACTTGTCTATTCTTTAACCTAATTCAAAATCCCTAATCTTTTCAATTTCGCATTTACTTTTCAATTTCTCTGTTAGTCTCAATCCAGTTCTTGTTCTTTCACCTGGGGCATCGAGTTTTGCCCTTCTTCCGATTCTCCAACGCCGCTGATCATGGTTGTGTACAGTGTTTCTTTCAACTGAATTGCCTGGGTGTAAGGAGATAGCGCAGATTGAATCCTTTACATATGGGAGAGCACACGCAAGTGCAGGTTCAGACTCAGACCCAAAGCCAATCTCAAGCGCAATCGCAAGCTCAGAACACGTTTCATGAATCCTCTAACTCCACAACCACTATCGCCCAAGCTACTGTGTCATTAAGCGAGGTGATGAATGCGCCATCGCAAACCTCTTCTCCGCCATCCAAAATGCCCTTGCGTCCGCGGAAGATCCGGAAGCTCTCGCCTGATGAATCGGATCCAAATTCCTCTCAGGTTGTTGCCATTCCGGATGGGCCGAAACCTATAGCCACCAGCAAATCTAACAAGAGCAAGACGGCCCAGCAACGCGCCGCATTCGCGTCTGCCCCGGTAATGCTTGCCCGATCACTTTCCTGTGAAGGCGAGGTGGAAGTCGCGCTTCGACATCTCCGGAATGCCGATCCGCTCCTTGCACCTTTGATCGACCTTCATCAACGTCCTACCTTCGACAGTTTTCAAACCCCATTCCTTGCCCTTACTAGAAGTATCCTTTATCAGCAGCTGGCTTATAAAGCTGGCACCTCAATCTACACTCGTTTCATCGCTCTCTGTGGCGGCGAGGCTGGCGTTCTTCCTGAAACTGTACTTGCCTTGAGCCCTCAACAGCTCAGGCAAATTGGAATTTCGGGTCGTAAATCTAGTTACCTTCATGATCTTGCGAGGAAGTATCAAAATGGGATTCTTTCAGACCCCGCAATTGTGAATATGGATGATAAATCGCTTTTCACGATGCTTACAATGGTCAATGGAATTGGGTCTTGGTCTGTTCATATGTTCATGATTTTCTCGCTGCACAGACCAGACGTGCTTCCTATCAATGATCTTAATGTTCGCAAAGGTGTTCAACTTCTCTACAATCTTGAAGAGTTGCCTCGACCATCGCAGATGGATCACTTATGTGAGAAGTGGAGGCCGTATCGATCAGTTGGGTCGTGGTATATGTGGAGGTTTGCTGAGGCAAAGGGGGCTTCTTCAAGCGCAGCAGCAATAGCTGCTGGTGCTAGTTTACAACTGCAGCAACAAGAGCACCAGCCAGAGCACCAGCATCAACAGCAGCCGCAGCTTCTTGACCCACTTAATAGCATTCTCAATCTTGGATGCTTAGTGAAAATGAGCACCGGCCTGTGCTTGGGGGCAGTGACTCGGATCGAAAAGAGTACATCTTTGCAGATAGCCCAATCAGTTCATCCACTGAATGATAAATATGCCAATTATGTGGGAGTAGAAGACTCAGCAATGAATATTTGGCTCCTTGAGGTTGATGTCACTTTTAAGGCCGTCAACATGTATCATGTGGCAAGGGATATTGATAGAATGTTCCTCTTGAATGTCTGTGCTTTCAATGTATTTTAA

Coding sequence (CDS)

ATGGGAGAGCACACGCAAGTGCAGGTTCAGACTCAGACCCAAAGCCAATCTCAAGCGCAATCGCAAGCTCAGAACACGTTTCATGAATCCTCTAACTCCACAACCACTATCGCCCAAGCTACTGTGTCATTAAGCGAGGTGATGAATGCGCCATCGCAAACCTCTTCTCCGCCATCCAAAATGCCCTTGCGTCCGCGGAAGATCCGGAAGCTCTCGCCTGATGAATCGGATCCAAATTCCTCTCAGGTTGTTGCCATTCCGGATGGGCCGAAACCTATAGCCACCAGCAAATCTAACAAGAGCAAGACGGCCCAGCAACGCGCCGCATTCGCGTCTGCCCCGGTAATGCTTGCCCGATCACTTTCCTGTGAAGGCGAGGTGGAAGTCGCGCTTCGACATCTCCGGAATGCCGATCCGCTCCTTGCACCTTTGATCGACCTTCATCAACGTCCTACCTTCGACAGTTTTCAAACCCCATTCCTTGCCCTTACTAGAAGTATCCTTTATCAGCAGCTGGCTTATAAAGCTGGCACCTCAATCTACACTCGTTTCATCGCTCTCTGTGGCGGCGAGGCTGGCGTTCTTCCTGAAACTGTACTTGCCTTGAGCCCTCAACAGCTCAGGCAAATTGGAATTTCGGGTCGTAAATCTAGTTACCTTCATGATCTTGCGAGGAAGTATCAAAATGGGATTCTTTCAGACCCCGCAATTGTGAATATGGATGATAAATCGCTTTTCACGATGCTTACAATGGTCAATGGAATTGGGTCTTGGTCTGTTCATATGTTCATGATTTTCTCGCTGCACAGACCAGACGTGCTTCCTATCAATGATCTTAATGTTCGCAAAGGTGTTCAACTTCTCTACAATCTTGAAGAGTTGCCTCGACCATCGCAGATGGATCACTTATGTGAGAAGTGGAGGCCGTATCGATCAGTTGGGTCGTGGTATATGTGGAGGTTTGCTGAGGCAAAGGGGGCTTCTTCAAGCGCAGCAGCAATAGCTGCTGGTGCTAGTTTACAACTGCAGCAACAAGAGCACCAGCCAGAGCACCAGCATCAACAGCAGCCGCAGCTTCTTGACCCACTTAATAGCATTCTCAATCTTGGATGCTTAGTGAAAATGAGCACCGGCCTGTGCTTGGGGGCAGTGACTCGGATCGAAAAGAGTACATCTTTGCAGATAGCCCAATCAGTTCATCCACTGAATGATAAATATGCCAATTATGTGGGAGTAGAAGACTCAGCAATGAATATTTGGCTCCTTGAGGTTGATGTCACTTTTAAGGCCGTCAACATGTATCATGTGGCAAGGGATATTGATAGAATGTTCCTCTTGAATGTCTGTGCTTTCAATGTATTTTAA

Protein sequence

MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVSLSEVMNAPSQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQMDHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLLDPLNSILNLGCLVKMSTGLCLGAVTRIEKSTSLQIAQSVHPLNDKYANYVGVEDSAMNIWLLEVDVTFKAVNMYHVARDIDRMFLLNVCAFNVF
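The relationship between the coding sequence and the protein sequence can be checked with a small translation routine. This is a minimal sketch assuming the standard genetic code; `translate` and `CODON_TABLE` are illustrative names, not part of any database tooling. Only the first eight codons and the last four codons of the CDS listed above are used, to keep the example short.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGGAGAGCACACGCAAGTGCAG"  # first 8 codons of the CDS above
cds_end = "AATGTATTTTAA"                # last 4 codons, including the stop

n_terminus = translate(cds_start)
c_terminus = translate(cds_end)
```

The two fragments translate to `MGEHTQVQ` and `NVF`, matching the first and last residues of the protein sequence above.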
Homology
BLAST of CmaCh09G001540 vs. ExPASy Swiss-Prot
Match: Q92383 (DNA-3-methyladenine glycosylase 1 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=mag1 PE=1 SV=1)

HSP 1 Score: 100.1 bits (248), Expect = 6.6e-20
Identity = 51/163 (31.29%), Positives = 88/163 (53.99%), Query Frame = 0

Query: 159 PFLALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSS 218
           P+  L R++  QQL  KA  +I+ RF ++        PE +  +  + +R  G S RK  
Sbjct: 50  PYEELIRAVASQQLHSKAANAIFNRFKSISNNGQFPTPEEIRDMDFEIMRACGFSARKID 109

Query: 219 YLHDLARKYQNGIL-SDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPIN 278
            L  +A    +G++ +      + ++ L   LT + GIG W+V M +IFSL+R DV+P +
Sbjct: 110 SLKSIAEATISGLIPTKEEAERLSNEELIERLTQIKGIGRWTVEMLLIFSLNRDDVMPAD 169

Query: 279 DLNVRKGVQLLYNLEELPRPSQMDHLCEKWRPYRSVGSWYMWR 321
           DL++R G + L+ L ++P    +    E   P+R+  +WY+W+
Sbjct: 170 DLSIRNGYRYLHRLPKIPTKMYVLKHSEICAPFRTAAAWYLWK 212

BLAST of CmaCh09G001540 vs. ExPASy Swiss-Prot
Match: O31544 (Putative DNA-3-methyladenine glycosylase YfjP OS=Bacillus subtilis (strain 168) OX=224308 GN=yfjP PE=3 SV=1)

HSP 1 Score: 94.0 bits (232), Expect = 4.8e-18
Identity = 57/212 (26.89%), Positives = 100/212 (47.17%), Query Frame = 0

Query: 117 LARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKA 176
           + R    E  ++  L H       L+ + + H         + +  + + I++QQL    
Sbjct: 78  IKRIFQWENHLQHVLDHFSKTS--LSAIFEEHAGTPLVLDYSVYNCMMKCIIHQQLNLSF 137

Query: 177 GTSIYTRFIALCGGEA-GV----LPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGI 236
             ++  RF+   G +  GV     PET+  L  Q LR +  S RK+ Y  D +R    G 
Sbjct: 138 AYTLTERFVHAFGEQKDGVWCYPKPETIAELDYQDLRDLQFSMRKAEYTIDTSRMIAEGT 197

Query: 237 LSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNL 296
           LS   + +M D+ +   L  + GIG W+V   ++F L RP++ P+ D+ ++  ++  + L
Sbjct: 198 LSLSELPHMADEDIMKKLIKIRGIGPWTVQNVLMFGLGRPNLFPLADIGLQNAIKRHFQL 257

Query: 297 EELPRPSQMDHLCEKWRPYRSVGSWYMWRFAE 324
           ++ P    M  + ++W PY S  S Y+WR  E
Sbjct: 258 DDKPAKDVMLAMSKEWEPYLSYASLYLWRSIE 287

BLAST of CmaCh09G001540 vs. ExPASy Swiss-Prot
Match: O94468 (Alkylbase DNA glycosidase-like protein mag2 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=mag2 PE=1 SV=1)

HSP 1 Score: 88.6 bits (218), Expect = 2.0e-16
Identity = 51/204 (25.00%), Positives = 100/204 (49.02%), Query Frame = 0

Query: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSF--QTPFLALTRSILYQQLAYKAGT 180
           +S + + + A +HL + D   + L+      T        P+  + R+I  Q+L+  A  
Sbjct: 1   MSKDSDYKRAEKHLSSIDNKWSSLVKKVGPCTLTPHPEHAPYEGIIRAITSQKLSDAATN 60

Query: 181 SIYTRFIALCG-GEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQN-GILSDPA 240
           SI  +F   C   +    P+ ++    + L + G S  KS  +H +A    N  I S   
Sbjct: 61  SIINKFCTQCSDNDEFPTPKQIMETDVETLHECGFSKLKSQEIHIVAEAALNKQIPSKSE 120

Query: 241 IVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPR 300
           I  M ++ L   L+ + G+  W++ M+ IF+L R D++P +D  ++   +  + L   P+
Sbjct: 121 IEKMSEEELMESLSKIKGVKRWTIEMYSIFTLGRLDIMPADDSTLKNEAKEFFGLSSKPQ 180

Query: 301 PSQMDHLCEKWRPYRSVGSWYMWR 321
             +++ L +  +PYR++ +WY+W+
Sbjct: 181 TEEVEKLTKPCKPYRTIAAWYLWQ 204

BLAST of CmaCh09G001540 vs. ExPASy TrEMBL
Match: A0A6J1IIA5 (probable DNA-3-methyladenine glycosylase 2 OS=Cucurbita maxima OX=3661 GN=LOC111476532 PE=4 SV=1)

HSP 1 Score: 707.6 bits (1825), Expect = 3.4e-200
Identity = 370/370 (100.00%), Positives = 370/370 (100.00%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVSLSEVMNAPSQTSSPPSK 60
           MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVSLSEVMNAPSQTSSPPSK
Sbjct: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVSLSEVMNAPSQTSSPPSK 60

Query: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120
           MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS
Sbjct: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120

Query: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180
           LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI
Sbjct: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180

Query: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240
           YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM
Sbjct: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240

Query: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM 300
           DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Sbjct: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM 300

Query: 301 DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLL 360
           DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLL
Sbjct: 301 DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLL 360

Query: 361 DPLNSILNLG 371
           DPLNSILNLG
Sbjct: 361 DPLNSILNLG 370

BLAST of CmaCh09G001540 vs. ExPASy TrEMBL
Match: A0A6J1F7I4 (probable DNA-3-methyladenine glycosylase 2 OS=Cucurbita moschata OX=3662 GN=LOC111443071 PE=4 SV=1)

HSP 1 Score: 705.3 bits (1819), Expect = 1.7e-199
Identity = 368/370 (99.46%), Positives = 370/370 (100.00%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVSLSEVMNAPSQTSSPPSK 60
           MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATV+LSEVMNAPSQTSSPPSK
Sbjct: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSK 60

Query: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120
           MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS
Sbjct: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120

Query: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180
           LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI
Sbjct: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180

Query: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240
           YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM
Sbjct: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240

Query: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM 300
           DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Sbjct: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM 300

Query: 301 DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLL 360
           DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQP+HQHQQQPQLL
Sbjct: 301 DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPQHQHQQQPQLL 360

Query: 361 DPLNSILNLG 371
           DPLNSILNLG
Sbjct: 361 DPLNSILNLG 370

BLAST of CmaCh09G001540 vs. ExPASy TrEMBL
Match: A0A6J1CKY0 (uncharacterized protein LOC111012247 OS=Momordica charantia OX=3673 GN=LOC111012247 PE=4 SV=1)

HSP 1 Score: 657.5 bits (1695), Expect = 4.0e-185
Identity = 340/370 (91.89%), Positives = 356/370 (96.22%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVSLSEVMNAPSQTSSPPSK 60
           MGE TQVQVQTQTQSQSQ QSQAQNT H+SSNSTT+IAQATV+LSEVMNAP+QTSSPPSK
Sbjct: 1   MGEQTQVQVQTQTQSQSQPQSQAQNTIHDSSNSTTSIAQATVALSEVMNAPTQTSSPPSK 60

Query: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120
           MPLRPRKIRKLSPDESD NSSQ+  + DGPKPI++ KSNK+KTAQQRAAFASAP++ ARS
Sbjct: 61  MPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNKNKTAQQRAAFASAPILPARS 120

Query: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180
           LSCEGEVE+ALRHLRNADPLLAPLIDLHQRP FDSFQTPFLALTRSILYQQLAYKAGTSI
Sbjct: 121 LSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSI 180

Query: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240
           YTRFIALCGGEAGVLPETVL+L+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM
Sbjct: 181 YTRFIALCGGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240

Query: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM 300
           DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Sbjct: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM 300

Query: 301 DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLL 360
           D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQ EHQH QQPQLL
Sbjct: 301 DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQQPQLL 360

Query: 361 DPLNSILNLG 371
           DP+NSILNLG
Sbjct: 361 DPINSILNLG 370

BLAST of CmaCh09G001540 vs. ExPASy TrEMBL
Match: A0A5A7TDX5 (DNA-3-methyladenine glycosylase 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold242G00470 PE=4 SV=1)

HSP 1 Score: 655.6 bits (1690), Expect = 1.5e-184
Identity = 347/373 (93.03%), Positives = 353/373 (94.64%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVSLSEVMNAPSQTSSPPSK 60
           MGE TQVQVQTQTQSQ Q QSQAQNTFHESSNSTT IAQATV LSEVMNAPSQ SSPPSK
Sbjct: 87  MGEQTQVQVQTQTQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSK 146

Query: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120
           MPLRPRKIRKLSP+ESDPNSS VVAIPDGPKPIAT KSNKSKTAQQRAAFASA V LARS
Sbjct: 147 MPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNKSKTAQQRAAFASATVPLARS 206

Query: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180
           LSCEGEVE+ALRHLRNADPLLA LIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI
Sbjct: 207 LSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 266

Query: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240
           YTRFIALCGGEAGVLPETVL+L+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM
Sbjct: 267 YTRFIALCGGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 326

Query: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM 300
           DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Sbjct: 327 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM 386

Query: 301 DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEH---QHQQQP 360
           D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQ+H  EH   QH QQP
Sbjct: 387 DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQDHHQEHQHPQHPQQP 446

Query: 361 QLLDPLNSILNLG 371
           QLLDPLN ILNLG
Sbjct: 447 QLLDPLNGILNLG 459

BLAST of CmaCh09G001540 vs. ExPASy TrEMBL
Match: A0A1S3CRJ5 (DNA-3-methyladenine glycosylase 1 OS=Cucumis melo OX=3656 GN=LOC103503942 PE=4 SV=1)

HSP 1 Score: 655.6 bits (1690), Expect = 1.5e-184
Identity = 347/373 (93.03%), Positives = 353/373 (94.64%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVSLSEVMNAPSQTSSPPSK 60
           MGE TQVQVQTQTQSQ Q QSQAQNTFHESSNSTT IAQATV LSEVMNAPSQ SSPPSK
Sbjct: 1   MGEQTQVQVQTQTQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSK 60

Query: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120
           MPLRPRKIRKLSP+ESDPNSS VVAIPDGPKPIAT KSNKSKTAQQRAAFASA V LARS
Sbjct: 61  MPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNKSKTAQQRAAFASATVPLARS 120

Query: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180
           LSCEGEVE+ALRHLRNADPLLA LIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI
Sbjct: 121 LSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180

Query: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240
           YTRFIALCGGEAGVLPETVL+L+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM
Sbjct: 181 YTRFIALCGGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240

Query: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM 300
           DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Sbjct: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM 300

Query: 301 DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEH---QHQQQP 360
           D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQ+H  EH   QH QQP
Sbjct: 301 DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQDHHQEHQHPQHPQQP 360

Query: 361 QLLDPLNSILNLG 371
           QLLDPLN ILNLG
Sbjct: 361 QLLDPLNGILNLG 373

BLAST of CmaCh09G001540 vs. NCBI nr
Match: KAG7024195.1 (mag1 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 838.6 bits (2165), Expect = 2.6e-239
Identity = 442/457 (96.72%), Positives = 445/457 (97.37%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVSLSEVMNAPSQTSSPPSK 60
           MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATV+LSEVMNAPSQTSSPPSK
Sbjct: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSK 60

Query: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120
           MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS
Sbjct: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120

Query: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180
           LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI
Sbjct: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180

Query: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240
           YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM
Sbjct: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240

Query: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM 300
           DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Sbjct: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM 300

Query: 301 DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLL 360
           DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLL
Sbjct: 301 DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLL 360

Query: 361 DPLNSILNLG---CLVKMSTGLCLGAVTRIEKSTSLQIAQSVHPLNDKYANYVGVEDSAM 420
           DPLNSILNLG     V    GLCLGAVTRIEKS SLQIAQSVH LNDKYANYVGVEDSAM
Sbjct: 361 DPLNSILNLGKRFGSVLKKFGLCLGAVTRIEKSKSLQIAQSVHLLNDKYANYVGVEDSAM 420

Query: 421 NIWLLEVDVTFKAVNMYHVARDIDRMFLLNVCAFNVF 455
           +IWLLEVDVTFKAVNMYHVAR+ID MFLLNVCAFNVF
Sbjct: 421 DIWLLEVDVTFKAVNMYHVARNIDIMFLLNVCAFNVF 457

BLAST of CmaCh09G001540 vs. NCBI nr
Match: XP_022976000.1 (probable DNA-3-methyladenine glycosylase 2 [Cucurbita maxima])

HSP 1 Score: 707.6 bits (1825), Expect = 6.9e-200
Identity = 370/370 (100.00%), Positives = 370/370 (100.00%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVSLSEVMNAPSQTSSPPSK 60
           MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVSLSEVMNAPSQTSSPPSK
Sbjct: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVSLSEVMNAPSQTSSPPSK 60

Query: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120
           MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS
Sbjct: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120

Query: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180
           LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI
Sbjct: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180

Query: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240
           YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM
Sbjct: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240

Query: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM 300
           DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Sbjct: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM 300

Query: 301 DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLL 360
           DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLL
Sbjct: 301 DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLL 360

Query: 361 DPLNSILNLG 371
           DPLNSILNLG
Sbjct: 361 DPLNSILNLG 370

BLAST of CmaCh09G001540 vs. NCBI nr
Match: XP_023536439.1 (probable DNA-3-methyladenine glycosylase 2 [Cucurbita pepo subsp. pepo] >KAG6591312.1 hypothetical protein SDJN03_13658, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 706.4 bits (1822), Expect = 1.5e-199
Identity = 369/370 (99.73%), Positives = 370/370 (100.00%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVSLSEVMNAPSQTSSPPSK 60
           MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATV+LSEVMNAPSQTSSPPSK
Sbjct: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSK 60

Query: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120
           MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS
Sbjct: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120

Query: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180
           LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI
Sbjct: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180

Query: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240
           YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM
Sbjct: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240

Query: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM 300
           DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Sbjct: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM 300

Query: 301 DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLL 360
           DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLL
Sbjct: 301 DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLL 360

Query: 361 DPLNSILNLG 371
           DPLNSILNLG
Sbjct: 361 DPLNSILNLG 370

BLAST of CmaCh09G001540 vs. NCBI nr
Match: XP_022936456.1 (probable DNA-3-methyladenine glycosylase 2 [Cucurbita moschata])

HSP 1 Score: 705.3 bits (1819), Expect = 3.4e-199
Identity = 368/370 (99.46%), Positives = 370/370 (100.00%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVSLSEVMNAPSQTSSPPSK 60
           MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATV+LSEVMNAPSQTSSPPSK
Sbjct: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVALSEVMNAPSQTSSPPSK 60

Query: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120
           MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS
Sbjct: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120

Query: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180
           LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI
Sbjct: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180

Query: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240
           YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM
Sbjct: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240

Query: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM 300
           DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Sbjct: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM 300

Query: 301 DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLL 360
           DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQP+HQHQQQPQLL
Sbjct: 301 DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPQHQHQQQPQLL 360

Query: 361 DPLNSILNLG 371
           DPLNSILNLG
Sbjct: 361 DPLNSILNLG 370

BLAST of CmaCh09G001540 vs. NCBI nr
Match: XP_022142016.1 (uncharacterized protein LOC111012247 [Momordica charantia])

HSP 1 Score: 657.5 bits (1695), Expect = 8.2e-185
Identity = 340/370 (91.89%), Positives = 356/370 (96.22%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQSQAQNTFHESSNSTTTIAQATVSLSEVMNAPSQTSSPPSK 60
           MGE TQVQVQTQTQSQSQ QSQAQNT H+SSNSTT+IAQATV+LSEVMNAP+QTSSPPSK
Sbjct: 1   MGEQTQVQVQTQTQSQSQPQSQAQNTIHDSSNSTTSIAQATVALSEVMNAPTQTSSPPSK 60

Query: 61  MPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFASAPVMLARS 120
           MPLRPRKIRKLSPDESD NSSQ+  + DGPKPI++ KSNK+KTAQQRAAFASAP++ ARS
Sbjct: 61  MPLRPRKIRKLSPDESDANSSQIAPMTDGPKPISSGKSNKNKTAQQRAAFASAPILPARS 120

Query: 121 LSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180
           LSCEGEVE+ALRHLRNADPLLAPLIDLHQRP FDSFQTPFLALTRSILYQQLAYKAGTSI
Sbjct: 121 LSCEGEVEIALRHLRNADPLLAPLIDLHQRPIFDSFQTPFLALTRSILYQQLAYKAGTSI 180

Query: 181 YTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240
           YTRFIALCGGEAGVLPETVL+L+PQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM
Sbjct: 181 YTRFIALCGGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240

Query: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM 300
           DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM
Sbjct: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM 300

Query: 301 DHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQPEHQHQQQPQLL 360
           D LCEKWRPYRSVGSWYMWR AEAKGASSSAAA+AAGASLQLQQQEHQ EHQH QQPQLL
Sbjct: 301 DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAALAAGASLQLQQQEHQQEHQHSQQPQLL 360

Query: 361 DPLNSILNLG 371
           DP+NSILNLG
Sbjct: 361 DPINSILNLG 370

BLAST of CmaCh09G001540 vs. TAIR 10
Match: AT1G75230.2 (DNA glycosylase superfamily protein )

HSP 1 Score: 412.5 bits (1059), Expect = 4.3e-115
Identity = 235/397 (59.19%), Positives = 283/397 (71.28%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQS---QAQNTFHESSNSTTTIAQATVS----LSEVMNAPSQ 60
           MGEH+  Q  + T   +Q +S   +  N     +N   + + A VS     S  + AP  
Sbjct: 1   MGEHSPSQPSSHTLPPNQPESPNHETPNPIPPETNDDDSASSAGVSGSIVSSTTIEAPQV 60

Query: 61  T-----SSPPSKMPLRPRKIRKLSPDES-------DPNSSQVVAIPDGPKPIATSKSNKS 120
           T     SSPP+K+PLRPRKIRKLSPD+        + N SQ+           T  + KS
Sbjct: 61  TELGNVSSPPTKIPLRPRKIRKLSPDDDASDGFNPEHNLSQMT---------TTKPATKS 120

Query: 121 KTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFL 180
           K +Q R    + P + ARSL+CEGE+E AL HLR+ DPLLA LID+H  PTF++FQTPFL
Sbjct: 121 KLSQSRT--VTVPRIQARSLTCEGELEAALHHLRSVDPLLASLIDIHPPPTFETFQTPFL 180

Query: 181 ALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLH 240
           AL RSILYQQLA KAG SIYTRF+ALCGGE GV+PE VL L+PQQLRQIG+SGRK+SYLH
Sbjct: 181 ALIRSILYQQLAAKAGNSIYTRFVALCGGENGVVPENVLPLTPQQLRQIGVSGRKASYLH 240

Query: 241 DLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNV 300
           DLARKYQNGILSD  IVNMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL V
Sbjct: 241 DLARKYQNGILSDSGIVNMDEKSLFTMLTMVNGIGSWSVHMFMINSLHRPDVLPVNDLGV 300

Query: 301 RKGVQLLYNLEELPRPSQMDHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASL- 360
           RKGVQ+L  +E+LPRPS+M+ LCEKWRPYRSV SWY+WR  E+K    +AAA  AGA+L 
Sbjct: 301 RKGVQMLNGMEDLPRPSKMEQLCEKWRPYRSVASWYLWRLIESKNTPPNAAAATAGAALS 360

Query: 361 -----QLQQQEHQPEHQ--HQQQPQLLDPLNSILNLG 371
                 +QQQE + +HQ   QQQPQL+DPLN++ ++G
Sbjct: 361 FPQLEDIQQQEQEQQHQQHQQQQPQLMDPLNNVFSIG 386

BLAST of CmaCh09G001540 vs. TAIR 10
Match: AT1G75230.1 (DNA glycosylase superfamily protein )

HSP 1 Score: 412.5 bits (1059), Expect = 4.3e-115
Identity = 235/397 (59.19%), Positives = 283/397 (71.28%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQS---QAQNTFHESSNSTTTIAQATVS----LSEVMNAPSQ 60
           MGEH+  Q  + T   +Q +S   +  N     +N   + + A VS     S  + AP  
Sbjct: 1   MGEHSPSQPSSHTLPPNQPESPNHETPNPIPPETNDDDSASSAGVSGSIVSSTTIEAPQV 60

Query: 61  T-----SSPPSKMPLRPRKIRKLSPDES-------DPNSSQVVAIPDGPKPIATSKSNKS 120
           T     SSPP+K+PLRPRKIRKLSPD+        + N SQ+           T  + KS
Sbjct: 61  TELGNVSSPPTKIPLRPRKIRKLSPDDDASDGFNPEHNLSQMT---------TTKPATKS 120

Query: 121 KTAQQRAAFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFL 180
           K +Q R    + P + ARSL+CEGE+E AL HLR+ DPLLA LID+H  PTF++FQTPFL
Sbjct: 121 KLSQSRT--VTVPRIQARSLTCEGELEAALHHLRSVDPLLASLIDIHPPPTFETFQTPFL 180

Query: 181 ALTRSILYQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLH 240
           AL RSILYQQLA KAG SIYTRF+ALCGGE GV+PE VL L+PQQLRQIG+SGRK+SYLH
Sbjct: 181 ALIRSILYQQLAAKAGNSIYTRFVALCGGENGVVPENVLPLTPQQLRQIGVSGRKASYLH 240

Query: 241 DLARKYQNGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNV 300
           DLARKYQNGILSD  IVNMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL V
Sbjct: 241 DLARKYQNGILSDSGIVNMDEKSLFTMLTMVNGIGSWSVHMFMINSLHRPDVLPVNDLGV 300

Query: 301 RKGVQLLYNLEELPRPSQMDHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASL- 360
           RKGVQ+L  +E+LPRPS+M+ LCEKWRPYRSV SWY+WR  E+K    +AAA  AGA+L 
Sbjct: 301 RKGVQMLNGMEDLPRPSKMEQLCEKWRPYRSVASWYLWRLIESKNTPPNAAAATAGAALS 360

Query: 361 -----QLQQQEHQPEHQ--HQQQPQLLDPLNSILNLG 371
                 +QQQE + +HQ   QQQPQL+DPLN++ ++G
Sbjct: 361 FPQLEDIQQQEQEQQHQQHQQQQPQLMDPLNNVFSIG 386

BLAST of CmaCh09G001540 vs. TAIR 10
Match: AT1G19480.1 (DNA glycosylase superfamily protein )

HSP 1 Score: 392.9 bits (1008), Expect = 3.5e-109
Identity = 222/382 (58.12%), Positives = 274/382 (71.73%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQS---------QAQNTFHESSNSTTTIAQATVSLSEVMNAP 60
           MGE +  Q  TQ QS  Q+           ++ +   +S+  + +I  +T   +  +   
Sbjct: 1   MGEQSPSQPSTQCQSHPQSPKPDTHNLIPPESTDECLDSAGVSGSIVSSTTIDARRITEL 60

Query: 61  SQTSSPPSKMPLRPRKIRKLSPD---ESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRA 120
              SSPPSK+PLRPRKIRKL+ D     +   ++ ++      P+AT   +  K      
Sbjct: 61  GNVSSPPSKIPLRPRKIRKLTLDGDVSGEDYKAEDISSSQVNSPLATDGKSPGKGKLSHL 120

Query: 121 AFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSIL 180
              + P + AR L+CEGE+E A+ +LRNADPLLA LID+H  PTF+SF+TPFLAL R+IL
Sbjct: 121 RAITVPRIQARPLTCEGELETAIHYLRNADPLLAALIDVHPPPTFESFKTPFLALIRNIL 180

Query: 181 YQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQ 240
           YQQLA KAG SIYTRF++LCGGE  V+PETVL+L+PQQLRQIG+SGRK+SYLHDLARKYQ
Sbjct: 181 YQQLAMKAGNSIYTRFVSLCGGENLVVPETVLSLNPQQLRQIGVSGRKASYLHDLARKYQ 240

Query: 241 NGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLL 300
           NGILSD AI+NMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VRKGVQLL
Sbjct: 241 NGILSDSAILNMDEKSLFTMLTMVNGIGSWSVHMFMINSLHRPDVLPVNDLGVRKGVQLL 300

Query: 301 YNLEELPRPSQMDHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQ 360
           Y L++LPRPSQM+  C KWRPYRSVGSWYMWR  EAK  S+S AA+AAG SL    ++ Q
Sbjct: 301 YGLDDLPRPSQMEQHCAKWRPYRSVGSWYMWRLIEAKSTSTS-AAVAAGVSLP-PLEDIQ 360

Query: 361 PEHQHQQQPQLLDPLNSILNLG 371
            EHQ Q   QL+DPLN + ++G
Sbjct: 361 QEHQQQ---QLMDPLNGVFSIG 377

BLAST of CmaCh09G001540 vs. TAIR 10
Match: AT1G19480.2 (DNA glycosylase superfamily protein )

HSP 1 Score: 392.9 bits (1008), Expect = 3.5e-109
Identity = 222/382 (58.12%), Positives = 274/382 (71.73%), Query Frame = 0

Query: 1   MGEHTQVQVQTQTQSQSQAQS---------QAQNTFHESSNSTTTIAQATVSLSEVMNAP 60
           MGE +  Q  TQ QS  Q+           ++ +   +S+  + +I  +T   +  +   
Sbjct: 1   MGEQSPSQPSTQCQSHPQSPKPDTHNLIPPESTDECLDSAGVSGSIVSSTTIDARRITEL 60

Query: 61  SQTSSPPSKMPLRPRKIRKLSPD---ESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRA 120
              SSPPSK+PLRPRKIRKL+ D     +   ++ ++      P+AT   +  K      
Sbjct: 61  GNVSSPPSKIPLRPRKIRKLTLDGDVSGEDYKAEDISSSQVNSPLATDGKSPGKGKLSHL 120

Query: 121 AFASAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLHQRPTFDSFQTPFLALTRSIL 180
              + P + AR L+CEGE+E A+ +LRNADPLLA LID+H  PTF+SF+TPFLAL R+IL
Sbjct: 121 RAITVPRIQARPLTCEGELETAIHYLRNADPLLAALIDVHPPPTFESFKTPFLALIRNIL 180

Query: 181 YQQLAYKAGTSIYTRFIALCGGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQ 240
           YQQLA KAG SIYTRF++LCGGE  V+PETVL+L+PQQLRQIG+SGRK+SYLHDLARKYQ
Sbjct: 181 YQQLAMKAGNSIYTRFVSLCGGENLVVPETVLSLNPQQLRQIGVSGRKASYLHDLARKYQ 240

Query: 241 NGILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLL 300
           NGILSD AI+NMD+KSLFTMLTMVNGIGSWSVHMFMI SLHRPDVLP+NDL VRKGVQLL
Sbjct: 241 NGILSDSAILNMDEKSLFTMLTMVNGIGSWSVHMFMINSLHRPDVLPVNDLGVRKGVQLL 300

Query: 301 YNLEELPRPSQMDHLCEKWRPYRSVGSWYMWRFAEAKGASSSAAAIAAGASLQLQQQEHQ 360
           Y L++LPRPSQM+  C KWRPYRSVGSWYMWR  EAK  S+S AA+AAG SL    ++ Q
Sbjct: 301 YGLDDLPRPSQMEQHCAKWRPYRSVGSWYMWRLIEAKSTSTS-AAVAAGVSLP-PLEDIQ 360

Query: 361 PEHQHQQQPQLLDPLNSILNLG 371
            EHQ Q   QL+DPLN + ++G
Sbjct: 361 QEHQQQ---QLMDPLNGVFSIG 377

BLAST of CmaCh09G001540 vs. TAIR 10
Match: AT3G50880.1 (DNA glycosylase superfamily protein )

HSP 1 Score: 273.5 bits (698), Expect = 3.1e-73
Identity = 153/276 (55.43%), Positives = 184/276 (66.67%), Query Frame = 0

Query: 52  SQTSSPPSKMPLRPRKIRKLSPDESDPNSSQVVAIPDGPKPIATSKSNKSKTAQQRAAFA 111
           S+ S   S++  RPRKIRK+S D S             P+ I T               A
Sbjct: 29  SEVSGSSSRIRFRPRKIRKVSSDPS-------------PRIIIT---------------A 88

Query: 112 SAPVMLARSLSCEGEVEVALRHLRNADPLLAPLIDLH-QRPTFDSFQTPFLALTRSILYQ 171
           S P      LS +  V++ALRHL+++D LL  LI  H   P FDS  TPFL+L RSILYQ
Sbjct: 89  SPP------LSTKSTVDIALRHLQSSDELLGALITTHNDPPLFDSSNTPFLSLARSILYQ 148

Query: 172 QLAYKAGTSIYTRFIALC-GGEAGVLPETVLALSPQQLRQIGISGRKSSYLHDLARKYQN 231
           QLA KA   IY RFI+L  GGEAGV+PE+V++LS   LR+IG+SGRK+SYLHDLA KY N
Sbjct: 149 QLATKAAKCIYDRFISLFNGGEAGVVPESVISLSAVDLRKIGVSGRKASYLHDLADKYNN 208

Query: 232 GILSDPAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLY 291
           G+LSD  I+ M D+ L   LT+V GIG W+VHMFMIFSLHRPDVLP+ DL VRKGV+ LY
Sbjct: 209 GVLSDELILKMSDEELIDRLTLVKGIGVWTVHMFMIFSLHRPDVLPVGDLGVRKGVKDLY 268

Query: 292 NLEELPRPSQMDHLCEKWRPYRSVGSWYMWRFAEAK 326
            L+ LP P QM+ LCEKWRPYRSVGSWYMWR  E++
Sbjct: 269 GLKNLPGPLQMEQLCEKWRPYRSVGSWYMWRLIESR 270
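
The percentages in the HSP summary lines above are plain ratios of matching positions to alignment length. The sketch below recomputes them as a sanity check. The helper names (`parse_hsp_stats`, `recomputed_percent`) are hypothetical and not part of any BLAST tool; the input strings are the figures reported for the three distinct TAIR 10 alignments.

```python
import re

# Figures taken from the HSP summary lines of the TAIR 10 alignments.
HSP_LINES = [
    "Identity = 235/397 (59.19%), Positives = 283/397 (71.28%)",
    "Identity = 222/382 (58.12%), Positives = 274/382 (71.73%)",
    "Identity = 153/276 (55.43%), Positives = 184/276 (66.67%)",
]

# Captures: statistic name, matching positions, alignment length, reported percent.
PATTERN = re.compile(r"(Identity|Positives) = (\d+)/(\d+) \((\d+\.\d+)%\)")


def parse_hsp_stats(line):
    """Return {name: (matches, aligned_length, reported_percent)} for one line."""
    return {
        name: (int(matches), int(length), float(percent))
        for name, matches, length, percent in PATTERN.findall(line)
    }


def recomputed_percent(matches, aligned_length):
    """Percentage of aligned positions that match, rounded to two decimals."""
    return round(100.0 * matches / aligned_length, 2)


if __name__ == "__main__":
    for line in HSP_LINES:
        for name, (matches, length, reported) in parse_hsp_stats(line).items():
            print(name, matches, length, reported, recomputed_percent(matches, length))
```

Note that the denominator is the alignment length including gap columns (397, 382, 276), not the 371-residue query length.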

The following BLAST results are available for this feature:
Match Name  E-value  Identity  Description
Q92383      6.6e-20  31.29     DNA-3-methyladenine glycosylase 1 OS=Schizosaccharomyces pombe (strain 972 / ATC...
O31544      4.8e-18  26.89     Putative DNA-3-methyladenine glycosylase YfjP OS=Bacillus subtilis (strain 168) ...
O94468      2.0e-16  25.00     Alkylbase DNA glycosidase-like protein mag2 OS=Schizosaccharomyces pombe (strain...
Match Name  E-value   Identity  Description
A0A6J1IIA5  3.4e-200  100.00    probable DNA-3-methyladenine glycosylase 2 OS=Cucurbita maxima OX=3661 GN=LOC111...
A0A6J1F7I4  1.7e-199  99.46     probable DNA-3-methyladenine glycosylase 2 OS=Cucurbita moschata OX=3662 GN=LOC1...
A0A6J1CKY0  4.0e-185  91.89     uncharacterized protein LOC111012247 OS=Momordica charantia OX=3673 GN=LOC111012...
A0A5A7TDX5  1.5e-184  93.03     DNA-3-methyladenine glycosylase 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A1S3CRJ5  1.5e-184  93.03     DNA-3-methyladenine glycosylase 1 OS=Cucumis melo OX=3656 GN=LOC103503942 PE=4 S...
Match Name      E-value   Identity  Description
KAG7024195.1    2.6e-239  96.72     mag1 [Cucurbita argyrosperma subsp. argyrosperma]
XP_022976000.1  6.9e-200  100.00    probable DNA-3-methyladenine glycosylase 2 [Cucurbita maxima]
XP_023536439.1  1.5e-199  99.73     probable DNA-3-methyladenine glycosylase 2 [Cucurbita pepo subsp. pepo] >KAG6591...
XP_022936456.1  3.4e-199  99.46     probable DNA-3-methyladenine glycosylase 2 [Cucurbita moschata]
XP_022142016.1  8.2e-185  91.89     uncharacterized protein LOC111012247 [Momordica charantia]
Match Name   E-value   Identity  Description
AT1G75230.2  4.3e-115  59.19     DNA glycosylase superfamily protein
AT1G75230.1  4.3e-115  59.19     DNA glycosylase superfamily protein
AT1G19480.1  3.5e-109  58.12     DNA glycosylase superfamily protein
AT1G19480.2  3.5e-109  58.12     DNA glycosylase superfamily protein
AT3G50880.1  3.1e-73   55.43     DNA glycosylase superfamily protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term   IPR Description   Source       Source Term    Source Description               Alignment
IPR003265  HhH-GPD domain    SMART        SM00478        endo3end                         coord: 168..323; e-value: 7.4E-11; score: 52.1
IPR003265  HhH-GPD domain    PFAM         PF00730        HhH-GPD                          coord: 165..308; e-value: 4.6E-18; score: 65.6
IPR003265  HhH-GPD domain    CDD          cd00056        ENDO3c                           coord: 160..321; e-value: 1.25174E-27; score: 105.787
None       No IPR available  GENE3D       1.10.340.30    Hypothetical protein; domain 2   coord: 157..270; e-value: 6.6E-64; score: 216.9
None       No IPR available  GENE3D       1.10.1670.40   -                                coord: 132..320; e-value: 6.6E-64; score: 216.9
None       No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction              coord: 1..103
None       No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction              coord: 1..60
None       No IPR available  PANTHER      PTHR43003      DNA-3-METHYLADENINE GLYCOSYLASE  coord: 2..371
None       No IPR available  PANTHER      PTHR43003:SF7  BNAC05G15000D PROTEIN            coord: 2..371
IPR011257  DNA glycosylase   SUPERFAMILY  48150          DNA-glycosylase                  coord: 155..321
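
The four glycosylase-domain hits in the InterPro table (SMART, PFAM, CDD, SUPERFAMILY) report slightly different boundaries for the same region of the protein. As a rough illustration, the sketch below computes the length of each hit and the stretch covered by all four. It assumes the coordinates are 1-based, inclusive residue positions; the function names are hypothetical, and the overlap is only the intersection of the predictions, not an annotated domain boundary.

```python
# Coordinates copied from the InterPro table; assumed 1-based and inclusive.
DOMAIN_HITS = {
    "SMART SM00478": (168, 323),
    "PFAM PF00730": (165, 308),
    "CDD cd00056": (160, 321),
    "SUPERFAMILY 48150": (155, 321),
}


def span_length(start, end):
    """Number of residues in a 1-based inclusive range."""
    return end - start + 1


def common_region(ranges):
    """Intersection of inclusive ranges, or None if they do not all overlap."""
    start = max(s for s, _ in ranges)
    end = min(e for _, e in ranges)
    return (start, end) if start <= end else None


if __name__ == "__main__":
    for name, (start, end) in DOMAIN_HITS.items():
        print(f"{name}: {start}..{end} ({span_length(start, end)} residues)")
    print("covered by all four hits:", common_region(list(DOMAIN_HITS.values())))
```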

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh09G001540.1  CmaCh09G001540.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006285      base-excision repair, AP site formation
biological_process  GO:0006307      DNA dealkylation involved in DNA repair
biological_process  GO:0006284      base-excision repair
biological_process  GO:0006281      DNA repair
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0005634      nucleus
cellular_component  GO:0032993      protein-DNA complex
molecular_function  GO:0032131      alkylated DNA binding
molecular_function  GO:0008725      DNA-3-methyladenine glycosylase activity
molecular_function  GO:0043916      DNA-7-methylguanine glycosylase activity
molecular_function  GO:0003824      catalytic activity