Lsi10G015020 (gene) Bottle gourd (USVL1VR-Ls) v1

Overview
Name: Lsi10G015020
Type: gene
Organism: Lagenaria siceraria (Bottle gourd (USVL1VR-Ls) v1)
Description: DNA glycosylase superfamily protein
Location: chr10: 19228028 .. 19230756 (-)
RNA-Seq Expression: Lsi10G015020
Synteny: Lsi10G015020
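The Location field packs chromosome, coordinates, and strand into one string. A minimal sketch of how it might be parsed follows. It assumes the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state), and the names `Location` and `parse_location` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a parsed Location field."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive at both ends.
        return self.end - self.start + 1


# Matches strings of the form "chr10: 19228028 .. 19230756 (-)".
_LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")


def parse_location(text: str) -> Location:
    """Split a Location string into sequence id, start, end and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("chr10: 19228028 .. 19230756 (-)")
print(loc.seqid, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 2729 bp on the minus strand of chr10.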
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TCTCTCATCCAATTCCTTTATAAAACCCATCTCTCTCCCATTCACTCTCTCTTCACTAATTCCCATTTCGCCATTTCTCTCCCTTTTTCTTCTCTCTCTTTCTCTCTCTCTCATTTTTCCCAAAACTAAAAACCAAAAAAAACGATGTGTCGTTCCGAGGAGACCTTGGAAGCTACTACTGTCGTCGTTGATTCCAAATTCAATGCCCGTCCTGTCCTTCAACCCACTTGCAACCGTGTCCTCGACCGCCGTAATTCCCTAAAAAAAAACCCTTCTCTCAAACCCCCTTCCGCCGCCGTCTCCCCCACCTCCCCCAAATCTAAATCCCCCCGTCCTCCGGCCACCAAGCGGGCGAATGACGGAAATAATCCCATGAACTCTAGCTCCGACAAGATCCTTATTCCGGCCGCCGCGAACGGTGGCGGGTCTCTGTCACGGCCGAGAGCTACCTTGGATAGAAAGAAATCGAAAAGCTTCAAATTGGGTGGAAATGGGAATGTTGTGATTTGTGATAATGGTGGATATGAGGTGGCGTCGTTGAGCTACGCTTCTTCTTTGATCACTGAGTCGCCGGGAAGTATCGCCGCCGTGAGAAGAGAGCAGGTGGCTCTGCAACAGGCGCAGAGGAAGATGAGAATTGCTCATTATGGAAGATCTAAATCTGCTCGTTTTGAAAAAATTGTTCCTCTTGATTCTAAAATTAAACCCGCTGTTGAAGATAGAAGATGTAGCTTCATCACTCCCAATTCAGGTAACTCACTCTTTTTTTTTAATATATATATAAATTTCCTCTGTTCATCATATTCATAAATTAACTTAAATTCTCTCTCTGTTTTTAGATCCCATTTATGTTGCTTACCATGATGAGGAATGGGGCGTCCCTGTTCATGATGACAGGTGAGTTTATCTCTCTTCTTTTTTTTCTCTCTCTTCTTTTGGTTTTTTGAAGTACCAACTTGAACTCGAATCTTTTAACTTCAAGAGATTATTCTGAACTAAACTCATTTATGTTATTAAAATATTAGATATAATTAAATTTACATTAATTTATTAATTTAAATTTTTTAATTTAATAGTGATTTAATATAAAAATAGGTTGCACCTAATTTCATGTTTCCATTCTCTTACCCACATAAAAAAAATACTTTTGTTTATAAAATTTAGTTTTAATAGCAATTTTTACCCGATATTTTAACCCGAACGCTTGTAATGTAATTTTTACTTCAATTAATCTTAATTTTTAGTTATTGGATATTTTATAATTTTTTAACCTCTCAAACCTGAGTGTTAAAAGTATTAATATAATTTTCTAAAAAAAATAAAGTAATATTAATATAATTAATTTGACTTTTAACTCGCATAAATTTTTTGAATACAGGATGTTGTTTGAATTGCTGGTTCTAAGCGTAGCCCAGGTGGGTTCGGATTGGACTTCAATTTTGAAGAAACGCCAAGATTTCAGGTGCTAAAAAACAAACAATTTCCAATTAATTAATCAAAAAAAATTTATTTATTTTTTTCCCAATTCTCTAATTTAATTTTTGTTCTTTATCCAGAAATGCATTTTCAAGTTTCGATTCAGAAATTGTGGCAAATTTCTCCGACAAACAGATGGTTTCAATCAGCTCAGAATATGGCATCGACATTAACAGAGTCCGAGGAGTCGTCGACAACGCAATCCGAATCCTCCAGGTAATGAAAATTTAATTTTATTTAATTTTTTTAATTAAAAGATTAACTTATTAGTAAAATCCAAACCCATTGACTTAATAGTCAGCCCCATTTAATTAATATTCTAATTACTCCATGTTAATTAAAAAAAAAATTTGACACTTTCTTCTTTCTCCGAACTTTTATGAATTACTTGGTGACATTTTTTATTGTTTGTTTAATTTAAAAAAAAAAAAACAGATTAAGAAGGAATTTGGGTCATTCGACAAATACATTTGGGGATTTGTGAACAACAAGCCGTTTTCACCGCAGTACAAATCCGGCCACAA
AATTCCGGTGAAAACATCAAAATCAGAGACCATAAGCAAAGACATGGTCCGACGAGGTTTCCGGTCGGTCGGACCGACGGTGGTCCACTCCTTCATGCAAGCCGCCGGTCTGACCAACGACCATCTGACCAGTTGCCACAGGCACCTCCACTGCACCTTAATCGCCGCCGGCCGCCGCACTCCGGCGACGACGACGACGGCAGAAGTGGAGGAGACGACAGAGACGGTGGCAGCTTCTGAAACTCTCTAGAATTGACTCGAGAATTTAATTAACAGACAAAAAGAAAAAGTGATAACCTTTACGAGGAGTCCATCAATGATGATTTGCTTGCTAATTAACTAGATAACTATTTTTTTTTTTGGGTTTTTTTTTTCTTTTTTGTGGGGTTTGTGTATATTAATGTCTATATAAATAGACTTGTAAGAGGAGAAAAAAATGAAAGAAAAAAGAGATTGTGGGGTTGTGAATTTGTGTGAAAGTTTTTTTTTTCTTTTTTTGGCGAGAATTTTAGTGAAAGTGCTTGTATAATTAGAAGAAAAATGAAAAGGAAAAAAAAAAGAAAAAGAAAATTATTTGAAGTGGTAGGGCTAGAATAGAAGACAGACAGCATGTGCTTGTGCAATTGGGAGGCAATGGCATGTGAGTGTGTCAGTTTGCTTTTGTAAATTCCCATGTGATCCATCCAAATTTCAAACATTATTAATCATTATTATATTATTTTTATTTTA

mRNA sequence

TCTCTCATCCAATTCCTTTATAAAACCCATCTCTCTCCCATTCACTCTCTCTTCACTAATTCCCATTTCGCCATTTCTCTCCCTTTTTCTTCTCTCTCTTTCTCTCTCTCTCATTTTTCCCAAAACTAAAAACCAAAAAAAACGATGTGTCGTTCCGAGGAGACCTTGGAAGCTACTACTGTCGTCGTTGATTCCAAATTCAATGCCCGTCCTGTCCTTCAACCCACTTGCAACCGTGTCCTCGACCGCCGTAATTCCCTAAAAAAAAACCCTTCTCTCAAACCCCCTTCCGCCGCCGTCTCCCCCACCTCCCCCAAATCTAAATCCCCCCGTCCTCCGGCCACCAAGCGGGCGAATGACGGAAATAATCCCATGAACTCTAGCTCCGACAAGATCCTTATTCCGGCCGCCGCGAACGGTGGCGGGTCTCTGTCACGGCCGAGAGCTACCTTGGATAGAAAGAAATCGAAAAGCTTCAAATTGGGTGGAAATGGGAATGTTGTGATTTGTGATAATGGTGGATATGAGGTGGCGTCGTTGAGCTACGCTTCTTCTTTGATCACTGAGTCGCCGGGAAGTATCGCCGCCGTGAGAAGAGAGCAGGTGGCTCTGCAACAGGCGCAGAGGAAGATGAGAATTGCTCATTATGGAAGATCTAAATCTGCTCGTTTTGAAAAAATTGTTCCTCTTGATTCTAAAATTAAACCCGCTGTTGAAGATAGAAGATGTAGCTTCATCACTCCCAATTCAGATCCCATTTATGTTGCTTACCATGATGAGGAATGGGGCGTCCCTGTTCATGATGACAGGATGTTGTTTGAATTGCTGGTTCTAAGCGTAGCCCAGGTGGGTTCGGATTGGACTTCAATTTTGAAGAAACGCCAAGATTTCAGAAATGCATTTTCAAGTTTCGATTCAGAAATTGTGGCAAATTTCTCCGACAAACAGATGGTTTCAATCAGCTCAGAATATGGCATCGACATTAACAGAGTCCGAGGAGTCGTCGACAACGCAATCCGAATCCTCCAGATTAAGAAGGAATTTGGGTCATTCGACAAATACATTTGGGGATTTGTGAACAACAAGCCGTTTTCACCGCAGTACAAATCCGGCCACAAAATTCCGGTGAAAACATCAAAATCAGAGACCATAAGCAAAGACATGGTCCGACGAGGTTTCCGGTCGGTCGGACCGACGGTGGTCCACTCCTTCATGCAAGCCGCCGGTCTGACCAACGACCATCTGACCAGTTGCCACAGGCACCTCCACTGCACCTTAATCGCCGCCGGCCGCCGCACTCCGGCGACGACGACGACGGCAGAAGTGGAGGAGACGACAGAGACGGTGGCAGCTTCTGAAACTCTCTAGAATTGACTCGAGAATTTAATTAACAGACAAAAAGAAAAAGTGATAACCTTTACGAGGAGTCCATCAATGATGATTTGCTTGCTAATTAACTAGATAACTATTTTTTTTTTTGGGTTTTTTTTTTCTTTTTTGTGGGGTTTGTGTATATTAATGTCTATATAAATAGACTTGTAAGAGGAGAAAAAAATGAAAGAAAAAAGAGATTGTGGGGTTGTGAATTTGTGTGAAAGTTTTTTTTTTCTTTTTTTGGCGAGAATTTTAGTGAAAGTGCTTGTATAATTAGAAGAAAAATGAAAAGGAAAAAAAAAAGAAAAAGAAAATTATTTGAAGTGGTAGGGCTAGAATAGAAGACAGACAGCATGTGCTTGTGCAATTGGGAGGCAATGGCATGTGAGTGTGTCAGTTTGCTTTTGTAAATTCCCATGTGATCCATCCAAATTTCAAACATTATTAATCATTATTATATTATTTTTATTTTA

Coding sequence (CDS)

ATGTGTCGTTCCGAGGAGACCTTGGAAGCTACTACTGTCGTCGTTGATTCCAAATTCAATGCCCGTCCTGTCCTTCAACCCACTTGCAACCGTGTCCTCGACCGCCGTAATTCCCTAAAAAAAAACCCTTCTCTCAAACCCCCTTCCGCCGCCGTCTCCCCCACCTCCCCCAAATCTAAATCCCCCCGTCCTCCGGCCACCAAGCGGGCGAATGACGGAAATAATCCCATGAACTCTAGCTCCGACAAGATCCTTATTCCGGCCGCCGCGAACGGTGGCGGGTCTCTGTCACGGCCGAGAGCTACCTTGGATAGAAAGAAATCGAAAAGCTTCAAATTGGGTGGAAATGGGAATGTTGTGATTTGTGATAATGGTGGATATGAGGTGGCGTCGTTGAGCTACGCTTCTTCTTTGATCACTGAGTCGCCGGGAAGTATCGCCGCCGTGAGAAGAGAGCAGGTGGCTCTGCAACAGGCGCAGAGGAAGATGAGAATTGCTCATTATGGAAGATCTAAATCTGCTCGTTTTGAAAAAATTGTTCCTCTTGATTCTAAAATTAAACCCGCTGTTGAAGATAGAAGATGTAGCTTCATCACTCCCAATTCAGATCCCATTTATGTTGCTTACCATGATGAGGAATGGGGCGTCCCTGTTCATGATGACAGGATGTTGTTTGAATTGCTGGTTCTAAGCGTAGCCCAGGTGGGTTCGGATTGGACTTCAATTTTGAAGAAACGCCAAGATTTCAGAAATGCATTTTCAAGTTTCGATTCAGAAATTGTGGCAAATTTCTCCGACAAACAGATGGTTTCAATCAGCTCAGAATATGGCATCGACATTAACAGAGTCCGAGGAGTCGTCGACAACGCAATCCGAATCCTCCAGATTAAGAAGGAATTTGGGTCATTCGACAAATACATTTGGGGATTTGTGAACAACAAGCCGTTTTCACCGCAGTACAAATCCGGCCACAAAATTCCGGTGAAAACATCAAAATCAGAGACCATAAGCAAAGACATGGTCCGACGAGGTTTCCGGTCGGTCGGACCGACGGTGGTCCACTCCTTCATGCAAGCCGCCGGTCTGACCAACGACCATCTGACCAGTTGCCACAGGCACCTCCACTGCACCTTAATCGCCGCCGGCCGCCGCACTCCGGCGACGACGACGACGGCAGAAGTGGAGGAGACGACAGAGACGGTGGCAGCTTCTGAAACTCTCTAG

Protein sequence

MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKNPSLKPPSAAVSPTSPKSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSRPRATLDRKKSKSFKLGGNGNVVICDNGGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKIVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIAAGRRTPATTTTAEVEETTETVAASETL
Homology
BLAST of Lsi10G015020 vs. ExPASy Swiss-Prot
Match: Q7VG78 (Probable GMP synthase [glutamine-hydrolyzing] OS=Helicobacter hepaticus (strain ATCC 51449 / 3B1) OX=235279 GN=guaA PE=3 SV=1)

HSP 1 Score: 167.5 bits (423), Expect = 3.0e-40
Identity = 82/187 (43.85%), Positives = 116/187 (62.03%), Query Frame = 0

Query: 191 EDRRCSFITPNSD---PIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQ 250
           E  RC++ T   +    +Y  YHD EWG P+H+D+ LFE LVL   Q G  W +ILKKR+
Sbjct: 784 EKVRCAWATDKDEAARKLYEDYHDTEWGEPLHEDKKLFEHLVLEGFQAGLSWITILKKRE 843

Query: 251 DFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINR--VRGVVDNAIRILQIKKEFGSFDK 310
            FR AF  FD  IVAN+ + ++  +    GI  NR  +   + NA   + +++EFGSFDK
Sbjct: 844 AFRVAFDDFDPHIVANYDEDKIKELMRNEGIIRNRAKIEAAIINAKAFMAVQREFGSFDK 903

Query: 311 YIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTND 370
           YIWGFV  KP    ++S   +P  T  S+ I+KD+ +RGF+ VG T +++ MQ+ G+ ND
Sbjct: 904 YIWGFVGGKPIINAFESIADLPASTPLSDKIAKDLKKRGFKFVGTTTMYAMMQSIGMVND 963

Query: 371 HLTSCHR 373
           HLTSC +
Sbjct: 964 HLTSCFK 970
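The percentages in an HSP summary line are the match counts divided by the alignment length. A minimal sketch that recomputes them from the counts is shown below. The function name `hsp_percentages` is hypothetical, and the regular expression is an assumption about the line layout seen on this page rather than a general BLAST parser.

```python
import re

# Captures "<label> = <count>/<length>" pairs such as "Identity = 82/187".
_COUNT_RE = re.compile(r"(\w+) = (\d+)/(\d+)")


def hsp_percentages(summary_line: str) -> dict:
    """Return {label: percentage} recomputed from the count/length pairs."""
    result = {}
    for label, count, length in _COUNT_RE.findall(summary_line):
        result[label] = round(100 * int(count) / int(length), 2)
    return result


line = "Identity = 82/187 (43.85%), Positives = 116/187 (62.03%), Query Frame = 0"
print(hsp_percentages(line))
```

For the first Swiss-Prot hit this gives 43.85 for identity and 62.03 for positives, matching the values printed in the summary line.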

BLAST of Lsi10G015020 vs. ExPASy Swiss-Prot
Match: P05100 (DNA-3-methyladenine glycosylase 1 OS=Escherichia coli (strain K12) OX=83333 GN=tag PE=1 SV=1)

HSP 1 Score: 147.5 bits (371), Expect = 3.2e-34
Identity = 70/179 (39.11%), Positives = 110/179 (61.45%), Query Frame = 0

Query: 194 RCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAF 253
           RC ++  + DP+Y+AYHD EWGVP  D + LFE++ L   Q G  W ++LKKR+++R  F
Sbjct: 3   RCGWV--SQDPLYIAYHDNEWGVPETDSKKLFEMICLEGQQAGLSWITVLKKRENYRACF 62

Query: 254 SSFDSEIVANFSDKQMVSISSEYGIDINR--VRGVVDNAIRILQIKKEFGSFDKYIWGFV 313
             FD   VA   ++ +  +  + GI  +R  ++ ++ NA   LQ+++    F  ++W FV
Sbjct: 63  HQFDPVKVAAMQEEDVERLVQDAGIIRHRGKIQAIIGNARAYLQMEQNGEPFVDFVWSFV 122

Query: 314 NNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC 371
           N++P   Q  +  +IP  TS S+ +SK + +RGF+ VG T+ +SFMQA GL NDH+  C
Sbjct: 123 NHQPQVTQATTLSEIPTSTSASDALSKALKKRGFKFVGTTICYSFMQACGLVNDHVVGC 179

BLAST of Lsi10G015020 vs. ExPASy Swiss-Prot
Match: P44321 (DNA-3-methyladenine glycosylase OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) OX=71421 GN=tag PE=3 SV=1)

HSP 1 Score: 129.0 bits (323), Expect = 1.2e-28
Identity = 65/179 (36.31%), Positives = 99/179 (55.31%), Query Frame = 0

Query: 194 RCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAF 253
           RC ++   S  IY+ YHD+EWG P  D + LFE + L   Q G  W ++LKKR+ +R AF
Sbjct: 4   RCPWVGEQS--IYIDYHDKEWGKPEFDSQKLFEKICLEGQQAGLSWITVLKKRESYREAF 63

Query: 254 SSFDSEIVANFSDKQMVSISSEYGIDINRVR--GVVDNAIRILQIKKEFGSFDKYIWGFV 313
             FD + +A  +   + +     G+  +R +   +V NA   L ++K   +F  +IW FV
Sbjct: 64  HQFDPKKIAKMTALDIDACMQNSGLIRHRAKLEAIVKNAKAYLAMEKCGENFSDFIWSFV 123

Query: 314 NNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC 371
           N+KP          +P KT  S+ +SK + +RGF  +G T  ++FMQ+ GL +DHL  C
Sbjct: 124 NHKPIVNDVPDLRSVPTKTEVSKALSKALKKRGFVFIGETTCYAFMQSMGLVDDHLNDC 180

BLAST of Lsi10G015020 vs. ExPASy TrEMBL
Match: A0A5A7UM21 (Putative GMP synthase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold104G00320 PE=4 SV=1)

HSP 1 Score: 705.7 bits (1820), Expect = 1.1e-199
Identity = 375/405 (92.59%), Positives = 386/405 (95.31%), Query Frame = 0

Query: 1   MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-NPSLKPPS--AAVSPTSP 60
           MCRSEE LEAT+VVVDSKFN+RPVLQPTCNRVLDRRNSLKK +PSLKPPS  AAVSPTSP
Sbjct: 1   MCRSEEALEATSVVVDSKFNSRPVLQPTCNRVLDRRNSLKKQHPSLKPPSPAAAVSPTSP 60

Query: 61  KSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSRPRATLDRKKSKSFKLGGNG 120
           KSKSPRPPATKRANDGNNPMNSSS+KILIPAAA      SRPRATLDRKKSKSFKLGGNG
Sbjct: 61  KSKSPRPPATKRANDGNNPMNSSSEKILIPAAA------SRPRATLDRKKSKSFKLGGNG 120

Query: 121 NVVICDNGGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFE 180
           N VICDNGG+EVA   YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFE
Sbjct: 121 N-VICDNGGFEVA---YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFE 180

Query: 181 KIVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGS 240
           KIVPLDSKIKP+VEDRRCSFITPNSDPIYVAYHDEEWGVPVHDD+MLFELLVLSVAQVGS
Sbjct: 181 KIVPLDSKIKPSVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGS 240

Query: 241 DWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK 300
           DWTSILKKRQDFRNAFSSFDSEIVANFS+KQMVSIS+EYGIDINRVRGVVDN+IRILQIK
Sbjct: 241 DWTSILKKRQDFRNAFSSFDSEIVANFSEKQMVSISTEYGIDINRVRGVVDNSIRILQIK 300

Query: 301 KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFM 360
           KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFM
Sbjct: 301 KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFM 360

Query: 361 QAAGLTNDHLTSCHRHLHCTLIAAGRRTPA-TTTTAEVEETTETV 402
           QAAGLTNDHLT+CHRHLHCTLIAAGRRTPA TTTT EVEE T  V
Sbjct: 361 QAAGLTNDHLTTCHRHLHCTLIAAGRRTPAPTTTTPEVEEDTAAV 395

BLAST of Lsi10G015020 vs. ExPASy TrEMBL
Match: A0A0A0KED6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G134890 PE=4 SV=1)

HSP 1 Score: 705.3 bits (1819), Expect = 1.5e-199
Identity = 379/410 (92.44%), Positives = 389/410 (94.88%), Query Frame = 0

Query: 1   MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-NPSLKPPS-AAVSPTSPK 60
           MCRSEETLEAT+VVVDSKFN+RPVLQPT NRVLDRRNSLKK +PSLKPPS AAVSPTSPK
Sbjct: 1   MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPSAAAVSPTSPK 60

Query: 61  SKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSRPRATLDRKKSKSFKLGGNGN 120
           SKSPRPPATKRANDGNNPMNSSS+KILIPAA      +SRPRATLDRKKSKSFKLGGNGN
Sbjct: 61  SKSPRPPATKRANDGNNPMNSSSEKILIPAA------VSRPRATLDRKKSKSFKLGGNGN 120

Query: 121 VVICDNGGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEK 180
            VICDNGG+EVA   YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEK
Sbjct: 121 -VICDNGGFEVA---YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEK 180

Query: 181 IVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSD 240
           IVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDD+MLFELLVLSVAQVGSD
Sbjct: 181 IVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSD 240

Query: 241 WTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKK 300
           WTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSIS+EYGIDINRVRGVVDNAIRILQIKK
Sbjct: 241 WTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKK 300

Query: 301 EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQ 360
           EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQ
Sbjct: 301 EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQ 360

Query: 361 AAGLTNDHLTSCHRHLHCTLIAAGRRTPA-TTTTAEVEETTETVAASETL 408
           AAGLTNDHLT+CHRHLHCTLIAAGRRTPA TTTT EVE   +T A  ETL
Sbjct: 361 AAGLTNDHLTTCHRHLHCTLIAAGRRTPAPTTTTPEVE---DTAAVCETL 397

BLAST of Lsi10G015020 vs. ExPASy TrEMBL
Match: A0A6J1FSP1 (uncharacterized protein LOC111448434 OS=Cucurbita moschata OX=3662 GN=LOC111448434 PE=4 SV=1)

HSP 1 Score: 675.2 bits (1741), Expect = 1.7e-190
Identity = 361/411 (87.83%), Positives = 375/411 (91.24%), Query Frame = 0

Query: 1   MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKNPSLKPPSAAVSPTSPKSK 60
           MCRSE+ LEAT+VVVDSKF ARPVLQPTCNRVLDRRNSLK     KPPSAAVSPTSPKSK
Sbjct: 1   MCRSEQALEATSVVVDSKFTARPVLQPTCNRVLDRRNSLK-----KPPSAAVSPTSPKSK 60

Query: 61  SPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSRPRATLDRKKSKSFKLGGNGNVV 120
           SPRPPATKRAND  NPMNSSSDKILIPAAA     LSRP+A LDRKKSKSFKL GNGNVV
Sbjct: 61  SPRPPATKRAND-TNPMNSSSDKILIPAAA-----LSRPKAALDRKKSKSFKLAGNGNVV 120

Query: 121 ICDN----GGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF 180
           ICDN    GG+EVASLSYASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF
Sbjct: 121 ICDNVAGGGGFEVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF 180

Query: 181 EKIVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVG 240
           +K+VPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDD+MLFELLVLSVAQVG
Sbjct: 181 DKVVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVG 240

Query: 241 SDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI 300
           SDWTSILKKRQDFRNAFSSF +E VA FSDKQM+SISSEYGIDINRVRGVVDNAIRIL+I
Sbjct: 241 SDWTSILKKRQDFRNAFSSFVAETVAIFSDKQMLSISSEYGIDINRVRGVVDNAIRILEI 300

Query: 301 KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSF 360
           KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSF
Sbjct: 301 KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMIRRGFRSVGPTVVHSF 360

Query: 361 MQAAGLTNDHLTSCHRHLHCTLIAAGRRTPATTTTAEVEETTETVAASETL 408
           MQAAGLTNDHLTSCHRHLHC++ AA RR PA      VEETT    ASETL
Sbjct: 361 MQAAGLTNDHLTSCHRHLHCSITAADRRAPAVV----VEETT---TASETL 393

BLAST of Lsi10G015020 vs. ExPASy TrEMBL
Match: A0A6J1J7H3 (uncharacterized protein LOC111484173 OS=Cucurbita maxima OX=3661 GN=LOC111484173 PE=4 SV=1)

HSP 1 Score: 670.6 bits (1729), Expect = 4.1e-189
Identity = 358/411 (87.10%), Positives = 374/411 (91.00%), Query Frame = 0

Query: 1   MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKNPSLKPPSAAVSPTSPKSK 60
           MCRSE+ LEAT+VVVDSKF ARPVLQPTCNRVLDRRNSLK     KPPSAAVSPTSPKSK
Sbjct: 1   MCRSEQALEATSVVVDSKFTARPVLQPTCNRVLDRRNSLK-----KPPSAAVSPTSPKSK 60

Query: 61  SPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSRPRATLDRKKSKSFKLGGNGNVV 120
           SPRPPATKRAN+  NPMNSSSDKILIPAAA     LSRP+A LDRKKSKSFKL GNGNVV
Sbjct: 61  SPRPPATKRANE-TNPMNSSSDKILIPAAA-----LSRPKAALDRKKSKSFKLAGNGNVV 120

Query: 121 ICDN----GGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF 180
           ICDN    GG+EVASLSYASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF
Sbjct: 121 ICDNVAGGGGFEVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF 180

Query: 181 EKIVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVG 240
           +K+VPLDSKIKPAVE RRCSFITPNSDPIYVAYHDEEWGVPVHDD+MLFELLVLSVAQVG
Sbjct: 181 DKVVPLDSKIKPAVEHRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVG 240

Query: 241 SDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI 300
           SDWTSILKKRQDFRNAFSSF +E VA FSDKQM+SISSEYGIDINRVRGVVDNAIRIL+I
Sbjct: 241 SDWTSILKKRQDFRNAFSSFVAETVAIFSDKQMLSISSEYGIDINRVRGVVDNAIRILEI 300

Query: 301 KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSF 360
           KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSF
Sbjct: 301 KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMIRRGFRSVGPTVVHSF 360

Query: 361 MQAAGLTNDHLTSCHRHLHCTLIAAGRRTPATTTTAEVEETTETVAASETL 408
           MQ AGLTNDHLTSCHRHLHC++ AAGRR PA      VEETT    ASE+L
Sbjct: 361 MQGAGLTNDHLTSCHRHLHCSITAAGRRAPAVV----VEETT---TASESL 393

BLAST of Lsi10G015020 vs. ExPASy TrEMBL
Match: A0A6J1D778 (uncharacterized protein LOC111017989 OS=Momordica charantia OX=3673 GN=LOC111017989 PE=4 SV=1)

HSP 1 Score: 602.8 bits (1553), Expect = 1.0e-168
Identity = 328/402 (81.59%), Positives = 348/402 (86.57%), Query Frame = 0

Query: 1   MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKN-PSLKPPSAAVSPTSPKS 60
           MCRSE+ +EAT+VV       R VLQPTCNR L RRNSLKK  PS  PP +  SP SPKS
Sbjct: 1   MCRSEQVMEATSVVA----VGRAVLQPTCNR-LHRRNSLKKQPPSPSPPLSPPSPASPKS 60

Query: 61  KSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSRPRATLDRKKSKSFKLGGNGNV 120
           KSPRPPATKRAND    MNSSSDK+++PAAA       RPRA LDRKKSKSFKLGG+G  
Sbjct: 61  KSPRPPATKRANDAATAMNSSSDKLVLPAAA-------RPRA-LDRKKSKSFKLGGSG-- 120

Query: 121 VICDNGGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEKI 180
                      SLSYASSLITESPGSIAAVRREQVALQQAQRKM+IAHYGRSKSARFEKI
Sbjct: 121 -----ADEAAPSLSYASSLITESPGSIAAVRREQVALQQAQRKMKIAHYGRSKSARFEKI 180

Query: 181 VPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDW 240
           VP+DSK KPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVH+D++LFELLVLSVAQVGSDW
Sbjct: 181 VPIDSKTKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHEDKVLFELLVLSVAQVGSDW 240

Query: 241 TSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKKE 300
           TSILKKRQDFRNAFSSFD+E VANFSDKQMVSIS+EYGIDINRVRGVVDNAIRIL+IKKE
Sbjct: 241 TSILKKRQDFRNAFSSFDAETVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILEIKKE 300

Query: 301 FGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQA 360
           FGSFDKYIWGFVN+KPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQA
Sbjct: 301 FGSFDKYIWGFVNHKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQA 360

Query: 361 AGLTNDHLTSCHRHLHCTLIAAGRRTPATTTTAEVEETTETV 402
           AGLTNDHLTSCHRHL CTL+AAGRR P      EVEET+ET+
Sbjct: 361 AGLTNDHLTSCHRHLRCTLLAAGRRAP---PAVEVEETSETL 379

BLAST of Lsi10G015020 vs. NCBI nr
Match: XP_038902889.1 (uncharacterized protein LOC120089476 [Benincasa hispida])

HSP 1 Score: 752.7 bits (1942), Expect = 1.7e-213
Identity = 391/410 (95.37%), Positives = 395/410 (96.34%), Query Frame = 0

Query: 1   MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKNPSLKPPS---AAVSPTSP 60
           MCRSEE LEA+TVVVDSKFNARPVLQPTCNRVLDRRNSLKK PSLKPPS   AAVSPTSP
Sbjct: 1   MCRSEEALEASTVVVDSKFNARPVLQPTCNRVLDRRNSLKKQPSLKPPSAAVAAVSPTSP 60

Query: 61  KSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSRPRATLDRKKSKSFKLGGNG 120
           KSKSPRPPATKRANDGNNPMNSSSDKILIPAA NGGGS+SRPRATLDRKKSKSFKLGGNG
Sbjct: 61  KSKSPRPPATKRANDGNNPMNSSSDKILIPAATNGGGSVSRPRATLDRKKSKSFKLGGNG 120

Query: 121 NVVICDNGGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFE 180
           NVVICDNGGYEVA LSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFE
Sbjct: 121 NVVICDNGGYEVAPLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFE 180

Query: 181 KIVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGS 240
           KIVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDD+MLFELLVLSVAQVGS
Sbjct: 181 KIVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGS 240

Query: 241 DWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK 300
           DWTSILKKRQDFRNAFSSFDSEIVA FSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK
Sbjct: 241 DWTSILKKRQDFRNAFSSFDSEIVAVFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK 300

Query: 301 KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFM 360
           KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFM
Sbjct: 301 KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFM 360

Query: 361 QAAGLTNDHLTSCHRHLHCTLIAAGRRTPATTTTAEVEETTETVAASETL 408
           QAAGLTNDHLT+CHRHLHCTLIAAGRRT ATTTT EVEET    A SETL
Sbjct: 361 QAAGLTNDHLTTCHRHLHCTLIAAGRRTTATTTTTEVEETATATAGSETL 410

BLAST of Lsi10G015020 vs. NCBI nr
Match: KAA0054725.1 (putative GMP synthase [Cucumis melo var. makuwa] >TYJ95615.1 putative GMP synthase [Cucumis melo var. makuwa])

HSP 1 Score: 705.7 bits (1820), Expect = 2.4e-199
Identity = 375/405 (92.59%), Positives = 386/405 (95.31%), Query Frame = 0

Query: 1   MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-NPSLKPPS--AAVSPTSP 60
           MCRSEE LEAT+VVVDSKFN+RPVLQPTCNRVLDRRNSLKK +PSLKPPS  AAVSPTSP
Sbjct: 1   MCRSEEALEATSVVVDSKFNSRPVLQPTCNRVLDRRNSLKKQHPSLKPPSPAAAVSPTSP 60

Query: 61  KSKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSRPRATLDRKKSKSFKLGGNG 120
           KSKSPRPPATKRANDGNNPMNSSS+KILIPAAA      SRPRATLDRKKSKSFKLGGNG
Sbjct: 61  KSKSPRPPATKRANDGNNPMNSSSEKILIPAAA------SRPRATLDRKKSKSFKLGGNG 120

Query: 121 NVVICDNGGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFE 180
           N VICDNGG+EVA   YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFE
Sbjct: 121 N-VICDNGGFEVA---YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFE 180

Query: 181 KIVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGS 240
           KIVPLDSKIKP+VEDRRCSFITPNSDPIYVAYHDEEWGVPVHDD+MLFELLVLSVAQVGS
Sbjct: 181 KIVPLDSKIKPSVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGS 240

Query: 241 DWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK 300
           DWTSILKKRQDFRNAFSSFDSEIVANFS+KQMVSIS+EYGIDINRVRGVVDN+IRILQIK
Sbjct: 241 DWTSILKKRQDFRNAFSSFDSEIVANFSEKQMVSISTEYGIDINRVRGVVDNSIRILQIK 300

Query: 301 KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFM 360
           KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFM
Sbjct: 301 KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFM 360

Query: 361 QAAGLTNDHLTSCHRHLHCTLIAAGRRTPA-TTTTAEVEETTETV 402
           QAAGLTNDHLT+CHRHLHCTLIAAGRRTPA TTTT EVEE T  V
Sbjct: 361 QAAGLTNDHLTTCHRHLHCTLIAAGRRTPAPTTTTPEVEEDTAAV 395

BLAST of Lsi10G015020 vs. NCBI nr
Match: XP_004139917.2 (uncharacterized protein LOC101218536 [Cucumis sativus] >KGN46782.1 hypothetical protein Csa_020741 [Cucumis sativus])

HSP 1 Score: 705.3 bits (1819), Expect = 3.1e-199
Identity = 379/410 (92.44%), Positives = 389/410 (94.88%), Query Frame = 0

Query: 1   MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKK-NPSLKPPS-AAVSPTSPK 60
           MCRSEETLEAT+VVVDSKFN+RPVLQPT NRVLDRRNSLKK +PSLKPPS AAVSPTSPK
Sbjct: 1   MCRSEETLEATSVVVDSKFNSRPVLQPTGNRVLDRRNSLKKQHPSLKPPSAAAVSPTSPK 60

Query: 61  SKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSRPRATLDRKKSKSFKLGGNGN 120
           SKSPRPPATKRANDGNNPMNSSS+KILIPAA      +SRPRATLDRKKSKSFKLGGNGN
Sbjct: 61  SKSPRPPATKRANDGNNPMNSSSEKILIPAA------VSRPRATLDRKKSKSFKLGGNGN 120

Query: 121 VVICDNGGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEK 180
            VICDNGG+EVA   YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEK
Sbjct: 121 -VICDNGGFEVA---YASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARFEK 180

Query: 181 IVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSD 240
           IVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDD+MLFELLVLSVAQVGSD
Sbjct: 181 IVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVGSD 240

Query: 241 WTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIKK 300
           WTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSIS+EYGIDINRVRGVVDNAIRILQIKK
Sbjct: 241 WTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISTEYGIDINRVRGVVDNAIRILQIKK 300

Query: 301 EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQ 360
           EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQ
Sbjct: 301 EFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQ 360

Query: 361 AAGLTNDHLTSCHRHLHCTLIAAGRRTPA-TTTTAEVEETTETVAASETL 408
           AAGLTNDHLT+CHRHLHCTLIAAGRRTPA TTTT EVE   +T A  ETL
Sbjct: 361 AAGLTNDHLTTCHRHLHCTLIAAGRRTPAPTTTTPEVE---DTAAVCETL 397

BLAST of Lsi10G015020 vs. NCBI nr
Match: XP_022943791.1 (uncharacterized protein LOC111448434 [Cucurbita moschata])

HSP 1 Score: 675.2 bits (1741), Expect = 3.4e-190
Identity = 361/411 (87.83%), Positives = 375/411 (91.24%), Query Frame = 0

Query: 1   MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKNPSLKPPSAAVSPTSPKSK 60
           MCRSE+ LEAT+VVVDSKF ARPVLQPTCNRVLDRRNSLK     KPPSAAVSPTSPKSK
Sbjct: 1   MCRSEQALEATSVVVDSKFTARPVLQPTCNRVLDRRNSLK-----KPPSAAVSPTSPKSK 60

Query: 61  SPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSRPRATLDRKKSKSFKLGGNGNVV 120
           SPRPPATKRAND  NPMNSSSDKILIPAAA     LSRP+A LDRKKSKSFKL GNGNVV
Sbjct: 61  SPRPPATKRAND-TNPMNSSSDKILIPAAA-----LSRPKAALDRKKSKSFKLAGNGNVV 120

Query: 121 ICDN----GGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF 180
           ICDN    GG+EVASLSYASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF
Sbjct: 121 ICDNVAGGGGFEVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF 180

Query: 181 EKIVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVG 240
           +K+VPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDD+MLFELLVLSVAQVG
Sbjct: 181 DKVVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVG 240

Query: 241 SDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI 300
           SDWTSILKKRQDFRNAFSSF +E VA FSDKQM+SISSEYGIDINRVRGVVDNAIRIL+I
Sbjct: 241 SDWTSILKKRQDFRNAFSSFVAETVAIFSDKQMLSISSEYGIDINRVRGVVDNAIRILEI 300

Query: 301 KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSF 360
           KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSF
Sbjct: 301 KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMIRRGFRSVGPTVVHSF 360

Query: 361 MQAAGLTNDHLTSCHRHLHCTLIAAGRRTPATTTTAEVEETTETVAASETL 408
           MQAAGLTNDHLTSCHRHLHC++ AA RR PA      VEETT    ASETL
Sbjct: 361 MQAAGLTNDHLTSCHRHLHCSITAADRRAPAVV----VEETT---TASETL 393

BLAST of Lsi10G015020 vs. NCBI nr
Match: KAG6570606.1 (hypothetical protein SDJN03_29521, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 674.9 bits (1740), Expect = 4.5e-190
Identity = 361/411 (87.83%), Positives = 374/411 (91.00%), Query Frame = 0

Query: 1   MCRSEETLEATTVVVDSKFNARPVLQPTCNRVLDRRNSLKKNPSLKPPSAAVSPTSPKSK 60
           MCRSE+ LEAT VVVDSKF ARPVLQPTCNRVLDRRNSLK     KPPSAAVSPTSPKSK
Sbjct: 1   MCRSEQALEATAVVVDSKFTARPVLQPTCNRVLDRRNSLK-----KPPSAAVSPTSPKSK 60

Query: 61  SPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSRPRATLDRKKSKSFKLGGNGNVV 120
           SPRPPATKRAND  NPMNSSSDKILIPAAA     LSRP+A LDRKKSKSFKL GNGNVV
Sbjct: 61  SPRPPATKRAND-TNPMNSSSDKILIPAAA-----LSRPKAALDRKKSKSFKLAGNGNVV 120

Query: 121 ICDN----GGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF 180
           ICDN    GG+EVASLSYASSLIT+SPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF
Sbjct: 121 ICDNVAGGGGFEVASLSYASSLITDSPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF 180

Query: 181 EKIVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVG 240
           +K+VPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDD+MLFELLVLSVAQVG
Sbjct: 181 DKVVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDKMLFELLVLSVAQVG 240

Query: 241 SDWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQI 300
           SDWTSILKKRQDFRNAFSSF +E VA FSDKQM+SISSEYGIDINRVRGVVDNAIRIL+I
Sbjct: 241 SDWTSILKKRQDFRNAFSSFVAETVAIFSDKQMLSISSEYGIDINRVRGVVDNAIRILEI 300

Query: 301 KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSF 360
           KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDM+RRGFRSVGPTVVHSF
Sbjct: 301 KKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMIRRGFRSVGPTVVHSF 360

Query: 361 MQAAGLTNDHLTSCHRHLHCTLIAAGRRTPATTTTAEVEETTETVAASETL 408
           MQAAGLTNDHLTSCHRHLHC++ AA RR PA      VEETT    ASETL
Sbjct: 361 MQAAGLTNDHLTSCHRHLHCSITAADRRAPAVV----VEETT---TASETL 393

BLAST of Lsi10G015020 vs. TAIR 10
Match: AT3G12710.1 (DNA glycosylase superfamily protein)

HSP 1 Score: 356.7 bits (914), Expect = 2.5e-98
Identity = 186/291 (63.92%), Positives = 222/291 (76.29%), Query Frame = 0

Query: 93  GGSLSRPRATLDRKKSKSFKLGGNGNVVICDNGGYEVASLSYASSLITESPGSIAAVRRE 152
           G   ++ R +L+RKKSKSFK G                  SY+S LITE+PGSIAAVRRE
Sbjct: 37  GNGAAKVRGSLERKKSKSFKEGD-----------------SYSSWLITEAPGSIAAVRRE 96

Query: 153 QVALQQAQRKMRIAHYGRSKSA---RFEKIVPLDSKIKPAVEDRRCSFITPNSDPIYVAY 212
           QVA QQA RK++IAHYGRSKS       K+VPL +   P    +RCSF+TP SDPIYVAY
Sbjct: 97  QVAAQQALRKLKIAHYGRSKSTINFTSSKVVPLLNP-NPNPHPQRCSFLTPTSDPIYVAY 156

Query: 213 HDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNAFSSFDSEIVANFSDKQM 272
           HDEEWGVPVHDD+ LFELL LS AQVGSDWTS L+KR D+R AF  F++E+VA  ++K+M
Sbjct: 157 HDEEWGVPVHDDKTLFELLTLSGAQVGSDWTSTLRKRHDYRKAFMEFEAEVVAKLTEKEM 216

Query: 273 VSISSEYGIDINRVRGVVDNAIRILQIKKEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVK 332
            +IS EY I++++VRGVV+NA +I++IKK F S +KY+WGFVN+KP S  YK GHKIPVK
Sbjct: 217 NAISIEYKIEMSKVRGVVENAKKIVEIKKAFVSLEKYLWGFVNHKPISTNYKLGHKIPVK 276

Query: 333 TSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSCHRHLHCTLIA 381
           TSKSE+ISKDMVRRGFR VGPTVVHSFMQAAGLTNDHL +C RH  CTL+A
Sbjct: 277 TSKSESISKDMVRRGFRFVGPTVVHSFMQAAGLTNDHLITCCRHAPCTLLA 309

BLAST of Lsi10G015020 vs. TAIR 10
Match: AT5G44680.1 (DNA glycosylase superfamily protein)

HSP 1 Score: 334.7 bits (857), Expect = 1.0e-91
Identity = 200/384 (52.08%), Positives = 267/384 (69.53%), Query Frame = 0

Query: 1   MCRSEETLEATTVVVDSKFNARPVLQPTCNRV--LDRRNSLKKNPSLKPPSAAVSPTSPK 60
           MC S+  L+  T    S+ N RPVLQP  N+V  LDRRNSLKK+P  KP    ++P + K
Sbjct: 1   MCSSK--LKNLTQENISQINGRPVLQPKSNQVPTLDRRNSLKKSPP-KP----LNPIASK 60

Query: 61  SKSPRPPATKRANDGNNPMNSSSDKILIPAAANGGGSLSRPRATLDRKKSKSFKLGGNGN 120
             SPRP +       + P++ ++  +  PA +     L R  +T    KSK      N  
Sbjct: 61  IPSPRPISLI-----SPPLSPNTKSLRKPAGS--CKELLRSSST----KSKPVISPEN-- 120

Query: 121 VVICDNGGYEVASLSYASSLITESPGSIAAVRREQVALQQAQRKMRIAHYGRSKSARF-E 180
                +GGY+         ++ + PGSIAA RRE+VA++Q +RK +I+HYGR KS +  E
Sbjct: 121 ----SDGGYKEV---MPMVIVQKQPGSIAAARREEVAMKQEERKKKISHYGRIKSVKSNE 180

Query: 181 KIVPLDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGS 240
           K + ++ + K     +RCSFIT +SDPIYVAYHD+EWGVPVHDD +LFELLVL+ AQVGS
Sbjct: 181 KNLNVEHEKK-----KRCSFITTSSDPIYVAYHDKEWGVPVHDDNLLFELLVLTGAQVGS 240

Query: 241 DWTSILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDINRVRGVVDNAIRILQIK 300
           DWTS+LK+R  FR AFS F++E+VA+F++K++ SI ++YGI++++V  VVDNA +IL++K
Sbjct: 241 DWTSVLKRRNTFREAFSGFEAELVADFNEKKIQSIVNDYGINLSQVLAVVDNAKQILKVK 300

Query: 301 KEFGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFM 360
           ++ GSF+KYIWGF+ +KP + +Y S  KIPVKTSKSETISKDMVRRGFR VGPTV+HS M
Sbjct: 301 RDLGSFNKYIWGFMKHKPVTTKYTSCQKIPVKTSKSETISKDMVRRGFRFVGPTVIHSLM 352

Query: 361 QAAGLTNDHLTSCHRHLHCTLIAA 382
           QAAGLTNDHL +C RHL CT +AA
Sbjct: 361 QAAGLTNDHLITCPRHLECTAMAA 352

BLAST of Lsi10G015020 vs. TAIR 10
Match: AT5G57970.1 (DNA glycosylase superfamily protein)

HSP 1 Score: 224.6 bits (571), Expect = 1.5e-58
Identity = 105/197 (53.30%), Positives = 141/197 (71.57%), Query Frame = 0

Query: 182 LDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTS 241
           LDS    +   +RC+++TPNSDP Y+ +HDEEWGVPVHDD+ LFELLVLS A     W +
Sbjct: 143 LDSPPNGSETKKRCTWVTPNSDPCYIVFHDEEWGVPVHDDKRLFELLVLSGALAEHTWPT 202

Query: 242 ILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDIN--RVRGVVDNAIRILQIKKE 301
           IL KRQ FR  F+ FD   +   ++K+++   S     ++  ++R V++NA +IL++ +E
Sbjct: 203 ILSKRQAFREVFADFDPNAIVKINEKKIIGPGSPASTLLSDLKLRAVIENARQILKVIEE 262

Query: 302 FGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQA 361
           +GSFDKYIW FV NK    +++   ++P KT K+E ISKD+VRRGFRSVGPTVV+SFMQA
Sbjct: 263 YGSFDKYIWSFVKNKAIVSKFRYQRQVPAKTPKAEVISKDLVRRGFRSVGPTVVYSFMQA 322

Query: 362 AGLTNDHLTSCHRHLHC 377
           AG+TNDHLTSC R  HC
Sbjct: 323 AGITNDHLTSCFRFHHC 339

BLAST of Lsi10G015020 vs. TAIR 10
Match: AT5G57970.2 (DNA glycosylase superfamily protein)

HSP 1 Score: 224.6 bits (571), Expect = 1.5e-58
Identity = 105/197 (53.30%), Positives = 141/197 (71.57%), Query Frame = 0

Query: 182 LDSKIKPAVEDRRCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTS 241
           LDS    +   +RC+++TPNSDP Y+ +HDEEWGVPVHDD+ LFELLVLS A     W +
Sbjct: 143 LDSPPNGSETKKRCTWVTPNSDPCYIVFHDEEWGVPVHDDKRLFELLVLSGALAEHTWPT 202

Query: 242 ILKKRQDFRNAFSSFDSEIVANFSDKQMVSISSEYGIDIN--RVRGVVDNAIRILQIKKE 301
           IL KRQ FR  F+ FD   +   ++K+++   S     ++  ++R V++NA +IL++ +E
Sbjct: 203 ILSKRQAFREVFADFDPNAIVKINEKKIIGPGSPASTLLSDLKLRAVIENARQILKVIEE 262

Query: 302 FGSFDKYIWGFVNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQA 361
           +GSFDKYIW FV NK    +++   ++P KT K+E ISKD+VRRGFRSVGPTVV+SFMQA
Sbjct: 263 YGSFDKYIWSFVKNKAIVSKFRYQRQVPAKTPKAEVISKDLVRRGFRSVGPTVVYSFMQA 322

Query: 362 AGLTNDHLTSCHRHLHC 377
           AG+TNDHLTSC R  HC
Sbjct: 323 AGITNDHLTSCFRFHHC 339

BLAST of Lsi10G015020 vs. TAIR 10
Match: AT1G75090.1 (DNA glycosylase superfamily protein )

HSP 1 Score: 214.9 bits (546), Expect = 1.2e-55
Identity = 101/205 (49.27%), Positives = 145/205 (70.73%), Query Frame = 0

Query: 193 RRCSFITPNSDPIYVAYHDEEWGVPVHDDRMLFELLVLSVAQVGSDWTSILKKRQDFRNA 252
           +RC +ITPNSDPIYV +HDEEWGVPV DD+ LFELLV S A     W SIL++R DFR  
Sbjct: 119 KRCHWITPNSDPIYVLFHDEEWGVPVRDDKKLFELLVFSQALAEFSWPSILRRRDDFRKL 178

Query: 253 FSSFDSEIVANFSDKQMVSISSEYGIDIN--RVRGVVDNAIRILQIKKEFGSFDKYIWGF 312
           F  FD   +A F++K+++S+     + ++  ++R +V+NA  +L++K+EFGSF  Y W F
Sbjct: 179 FEEFDPSAIAQFTEKRLMSLRVNGCLILSEQKLRAIVENAKSVLKVKQEFGSFSNYCWRF 238

Query: 313 VNNKPFSPQYKSGHKIPVKTSKSETISKDMVRRGFRSVGPTVVHSFMQAAGLTNDHLTSC 372
           VN+KP    Y+ G ++PVK+ K+E ISKDM++RGFR VGPTV++SF+QA+G+ NDHLT+C
Sbjct: 239 VNHKPLRNGYRYGRQVPVKSPKAEYISKDMMQRGFRCVGPTVMYSFLQASGIVNDHLTAC 298

Query: 373 HRHLHCTLIAAGRRTPATTTTAEVE 396
            R+  C  +   R T +  T  +++
Sbjct: 299 FRYQECN-VETERETKSHETETKLD 322
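
The percentages in each HSP statistics line follow directly from the reported counts: identical (or positive-scoring) positions divided by the alignment length. As a minimal sketch, assuming the line layout shown in the HSP blocks above, the following Python snippet parses such a line and recomputes both percentages. The function name `parse_hsp_stats` and the dictionary keys are illustrative choices, not part of any BLAST tool.

```python
import re

# Matches the HSP statistics line as laid out on this page.
# "Posi?tives" accepts the label spelled either "Positives" or "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP statistics line and recompute its percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "align_len": int(ident_len),
        # Recomputed from the counts, rounded to two decimals like the report.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values exactly as printed in the report, for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 101/205 (49.27%), Positives = 145/205 (70.73%), Query Frame = 0"
    )
    print(stats["identity_pct"], stats["positive_pct"])
```

For the AT1G75090.1 hit, 101 identical positions over an alignment length of 205 gives 49.27%, and 145 positive positions gives 70.73%, matching the reported values.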

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
Q7VG78          3.0e-40   43.85         Probable GMP synthase [glutamine-hydrolyzing] OS=Helicobacter hepaticus (strain ...
P05100          3.2e-34   39.11         DNA-3-methyladenine glycosylase 1 OS=Escherichia coli (strain K12) OX=83333 GN=t...
P44321          1.2e-28   36.31         DNA-3-methyladenine glycosylase OS=Haemophilus influenzae (strain ATCC 51907 / D...

Match Name      E-value   Identity (%)  Description
A0A5A7UM21      1.1e-199  92.59         Putative GMP synthase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold10...
A0A0A0KED6      1.5e-199  92.44         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G134890 PE=4 SV=1
A0A6J1FSP1      1.7e-190  87.83         uncharacterized protein LOC111448434 OS=Cucurbita moschata OX=3662 GN=LOC1114484...
A0A6J1J7H3      4.1e-189  87.10         uncharacterized protein LOC111484173 OS=Cucurbita maxima OX=3661 GN=LOC111484173...
A0A6J1D778      1.0e-168  81.59         uncharacterized protein LOC111017989 OS=Momordica charantia OX=3673 GN=LOC111017...

Match Name      E-value   Identity (%)  Description
XP_038902889.1  1.7e-213  95.37         uncharacterized protein LOC120089476 [Benincasa hispida]
KAA0054725.1    2.4e-199  92.59         putative GMP synthase [Cucumis melo var. makuwa] >TYJ95615.1 putative GMP syntha...
XP_004139917.2  3.1e-199  92.44         uncharacterized protein LOC101218536 [Cucumis sativus] >KGN46782.1 hypothetical ...
XP_022943791.1  3.4e-190  87.83         uncharacterized protein LOC111448434 [Cucurbita moschata]
KAG6570606.1    4.5e-190  87.83         hypothetical protein SDJN03_29521, partial [Cucurbita argyrosperma subsp. sorori...

Match Name      E-value   Identity (%)  Description
AT3G12710.1     2.5e-98   63.92         DNA glycosylase superfamily protein
AT5G44680.1     1.0e-91   52.08         DNA glycosylase superfamily protein
AT5G57970.1     1.5e-58   53.30         DNA glycosylase superfamily protein
AT5G57970.2     1.5e-58   53.30         DNA glycosylase superfamily protein
AT1G75090.1     1.2e-55   49.27         DNA glycosylase superfamily protein
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (USVL1VR-Ls) v1
Date Performed: 2021-10-18
IPR Term   IPR Description            Source       Source Term     Source Description                   Alignment
IPR005019  Methyladenine glycosylase  PFAM         PF03352         Adenine_glyco                        coord: 201..373; e-value: 2.0E-61; score: 206.9
None       No IPR available           GENE3D       1.10.340.30     Hypothetical protein; domain 2       coord: 192..374; e-value: 2.1E-65; score: 221.7
None       No IPR available           MOBIDB_LITE  mobidb-lite     disorder_prediction                  coord: 42..57
None       No IPR available           MOBIDB_LITE  mobidb-lite     disorder_prediction                  coord: 386..407
None       No IPR available           MOBIDB_LITE  mobidb-lite     disorder_prediction                  coord: 71..85
None       No IPR available           MOBIDB_LITE  mobidb-lite     disorder_prediction                  coord: 28..109
None       No IPR available           PANTHER      PTHR31116       OS04G0501200 PROTEIN                 coord: 1..381
None       No IPR available           PANTHER      PTHR31116:SF20  DNA GLYCOSYLASE SUPERFAMILY PROTEIN  coord: 1..381
IPR011257  DNA glycosylase            SUPERFAMILY  48150           DNA-glycosylase                      coord: 193..376

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Lsi10G015020.1  Lsi10G015020.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006284      base-excision repair
biological_process  GO:0006281      DNA repair
molecular_function  GO:0008725      DNA-3-methyladenine glycosylase activity
molecular_function  GO:0003824      catalytic activity