HG10000507 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10000507
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Endoglucanase
Location: Chr09: 6020769 .. 6024770 (+)
RNA-Seq Expression: HG10000507
Synteny: HG10000507
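
The Location field in the overview can be split into its parts programmatically. The following is a minimal sketch, not part of the database itself: the function names parse_location and span_length are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which is the usual convention for genome browser displays but is not stated on this page.

```python
import re


def parse_location(text):
    """Split a string like 'Chr09: 6020769 .. 6024770 (+)' into its parts.

    Returns (seqid, start, end, strand). Hypothetical helper for illustration.
    """
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    seqid, start, end, strand = parse_location("Chr09: 6020769 .. 6024770 (+)")
    print(seqid, start, end, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the interval listed for HG10000507 spans 4002 bases on the plus strand of Chr09.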
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTATCTTGTTCTTCGAAGGCCAACGATCGGGGTTCTTGCCGCAAGATCAGCGGCGACGGGTGCTAACTCCGGCCTTGGCGACGGCTGGACTTACCGAACTGACCTCACCGGCGGCTATTACGACGCCGGTGATAATGTCAAGTTCGGTTTTCCGATGGCTTTCACTACCACGCTTTTGTCTTGGAGTGTCATTGAGTTTGGCGACTTGATGCCGCCGGCGGAGTTGAGGAATTCGTTGGTTGCCATTCGTTGGGCGACGGATTATCTCCTCAAAACGGTGTCGCAGCCTAACCGGATTTTTGTTCAGGTAATGAAATTTTTGCAGTTTCAGACCTTTCAGTGCAGTTAGTGGCAGTGTGGAAAATTTGATGATATTTATGGGTAAATTCATTTCGTGATGGGGTTACTCAGCCCAACGTGACCACGTGGACTCTCTCTGTCCTTCGATTGGTCGGCGTTTCCCCTCCTGGGCCCCGATGGGTCTCACCTAAATTTTCCTTTTCTTTTCGCTTTTTATTGGGAAACTTAATCTAATTACTAATACAAAAATATAATTTTTCTTACCAAAAATACCATTTTACCCTTGTGCATTACTTGTTCTTTGTATTTTGTGCTTTAGCTCTAACATTGAATTTACACACCACTTTTGATTTTTCAGTGAAGATTTAAATAACAAATTGTTTTTCATATTTCAACGCATTTTATCAAAATTTTCGTTATTTTCTCAACTTATTTGGAGTTTAATTTACACGATTTTTTAAAAATGAATTTAGCTTGACCGTGGAAAGTATATGCTAATGATCAAGAGATAAGTTTCTGATTATAGTCTCCATCATCGTGCTAAAAAAATAGTTTTAAAAAACTTACATGATCGTAACTCTCAAATCCAAAATCTTTTCTATATTAATAAAGTATTTTTTAATTCTTAGGAGTACTATATGAAATTGGAATAAGTACATTTGTATAAAAGGATAAACCTAGTATTTATAGTACAAAAAAAAAAAAAGAATAATACTTAGGTGCATCTCGAGTTACACTCATTTTTGTGCTGCCCAGAAATTAGTATAGTCCGATATGCACCAAATATTTTCTAAGAAAATAGACCATTATAGTATTTTAGCCATTTTTTTTAAGAAAAAAAATTGTTTTATAACTTATATGTTAATTTCTTATTTTGAAGAAGAAGAAAAAAAAGGAACAAAAAAATGAGTGGCACCAAAGTAAAGTGAAAGGACCCCATTGGAATTCCGTGAATGATTAAGGAGTTAAAGACCCTAAGAAAAAGATTGTCCTAACAGTACACCCATGTTCCCTAATATTTAAGGGCATATGTCCCCAACCAACCCCTCTCTCATTAATGCCTTCTTTTTCCAAATTTTTAATTAAAGACTAAATTTGTAATTTAACTTATAGGTGGGTGATCCGAGTGCGGATCATTTTTGTTGGGAGAGACCAGAAGACATGGATACGGCGAGGACAGTGTACGCCGTGGACGCACCGGGAACTGCATCTGACGTGGCAGGAGAGACAGCAGCAGCGTTAGCAGCAGCATCGATGGCATTTCGATCGTCGGATCCCGGATACGCCGAAACATTGTTACGAAATGGAATTAGGACATTCGAGTTGGCCGATACCTATAGAGGAGCTTATAGTGATAACGCTAACATTAGAGATGGTGTTTGTCCTTTTTATTGGCATTTCGATCGTCGGATCCCGGATACGCCGAAACATTGTTACGAAATGGAATTAGGACATTCGAGTTGGCCGATACCTATAGAGGAGCTTATAGTGATAACGCTAACATTAGAGATGGTGTTTGTCCTTTTTATTGCGATTTTGATGGCTATCAGGTTCGAGATTTTAATTCTCCTTTGAAATTTATATTAAATTGCAAAAAAATTTTTTAAGATTTTTCTTTTAATAATAATAATGATATTAACAAAATAAAATAATTTCTATTTAGATCCACCCGTGATGGCACAATTTTGTCTGTCATCA
GACAATTTGACTTGTTTTTTTTCTTATTTTGTCATTTCTTGATGCGTCTACTAAATGGGAAAAAGTAGGAAAAGAAATTATCACCACTTTTTTACATATTTACAGAATTTTTATTCGTTGAGAAAAACTTAATAAATCGACATTTTATTTTATTTTATTTTCACAAAAGAAGAAAAGTTTAATTCCCGTAATGTTAAGTGTCATATGATTTGGAATTTGGGCTTAATAACTATTGAATGTAATATGGGACAGGACGAGCTATTATGGGGAGCTGCTTGGTTAAGGAGGGCGAGTCAAAACGAGTCGTATTTGAATTATATTCAAGACAACGGTAAAACACTTGGTGCTGAAGATAGCTTCAATGAATTTGGGTGGGACAACAAGCATGCTGGTCTTAACGTTCTCGTCTCTAAGGTATCTTTTCTACTCTTCTATTTTATCCTATCCTTCCTTCTTATTGCATTCAATTAAAGTCAGAGTTATATATTTTTATTTTCTAAGATTTAAATATCATCTTGGTACTTAAATTTTCAAGTTTATTTTATTTTAATTTTTAAACTTTTAATGTATATATTTTAATTACCGAAAAACTTTTTAAATACAACTATTTTATTTTGGGCTATCTTATTTTTCCATCTAGGCCTCAACGATTGTCTCAACTCCTCTTACGTAACTAAACTTGTGTTTATATTAATATGTTAGTCCACATACTCAAAGTTATAAAAATAAGTAAAGGTAAAGATCAAAATGGTTGGGTTTTTCTAATGAGAAAAATATATTTTTTGCTATTAAGTTTTGAGTAGTTTTTTATCCCTATGTTTTAAAATGTTTCATTTTCGTCTTAAGATTACAAAATGTTATATTTTTAGAGATAATTTTAAAAATAAAAAAAAATAAGTAAAACTATTTACACAAAATAACAATATTTTTAGATAGTTGTGATAGACGTTGATAAAAATCTATCAGTAATAAAAATGATAGAAGTCTATGAATGTTTATCAGTATTTAAATAATTTGACATTTTTTGTATTCGTAATTTTTTTTTTATATTTTTATAATCAAGTGTTGAGTTTAATTCTCATAAAGTCTTTATATTTTAATATTTATACTTTTTTAACTCTTAATTTTTACTAATATTTATTTTTTATCTTTAATATTAACTCTTAGTAAATTGATTTTTAAACAAATTTTAATAGTGATGGAAGTTAGTGAAAACTAATTTAATTATAATTCATTTAATCTTTTAATTACTGAATAGAAAAGTGAATATTTAGAGAGAAAAACTAAGGATAAAAATGTAAATGTAAAAAATTATGATCAAATGGAAGTAAAAAAAATATAGGGACTAAATAGTGGAGTATTTTGAAAATTTAAGACCCAAAATAAAATATTTTGAATAATCTTATTTTTTGAAATGATCGGAACGTCGTCGTTTTCAGGAAGCATTAGAAGGAAACATATTCACACTCCAATCCTACAAGGCATCAGCAGACAATTTCATGTGCACTTTAATTCCAGAATCTTCTTCCTCCCACATTCAGTTCACTCCCGGCGGCCTCATCTACAAGCCCGGCGGCAGCAACCTCCAACACGCCACCTCCATCACCTTCCTCCTCCTCGCCTACGCTCGCTATCTCGACCGCCCGCTCTCCGCCGCCAGGCCAAACGCCAGGTCGACTACATCCTCGGCGACAATCCCAAAGGCATCTCCTACATGGTCGGATACGGCAACTACTTCCCTCAGCGGATCCACCACCGCGGCTCCTCCCTCCCCTCCGTACACGACCATCCGCAGCCGATCGCTTGCAAGGAAGGATCGGCGTACTTCAATTCGGCGGATCCGAACCCTAATGTACTGGTCGGAGCCCTAGTCGGTGGCCCTTGGGAAGACGATGTGTATGAGGACGATCGAGCGGATTTCCGGAAATCGGAGCCGACTACTTACATTAATGCGCCGTTTGTTGGAGTTTTGGCGTATTTTGCTGCCAATCCTGGAGATT
GA

mRNA sequence

ATGTATCTTGTTCTTCGAAGGCCAACGATCGGGGTTCTTGCCGCAAGATCAGCGGCGACGGGTGCTAACTCCGGCCTTGGCGACGGCTGGACTTACCGAACTGACCTCACCGGCGGCTATTACGACGCCGGTGATAATGTCAAGTTCGGTTTTCCGATGGCTTTCACTACCACGCTTTTGTCTTGGAGTGTCATTGAGTTTGGCGACTTGATGCCGCCGGCGGAGTTGAGGAATTCGTTGGTTGCCATTCGTTGGGCGACGGATTATCTCCTCAAAACGGTGTCGCAGCCTAACCGGATTTTTGTTCAGGTGGGTGATCCGAGTGCGGATCATTTTTGTTGGGAGAGACCAGAAGACATGGATACGGCGAGGACAGTGTACGCCGTGGACGCACCGGGAACTGCATCTGACGTGGCAGGAGAGACAGCAGCAGCGTTAGCAGCAGCATCGATGGCATTTCGATCGTCGGATCCCGGATACGCCGAAACATTGTTACGAAATGGAATTAGGACATTCGAGTTGGCCGATACCTATAGAGGAGCTTATAGTGATAACGCTAACATTAGAGATGGTGTTTGTCCTTTTTATTGGCATTTCGATCGTCGGATCCCGGATACGCCGAAACATTGTTACGAAATGGAATTAGGACATTCGAGTTGGCCGATACCTATAGAGGAGCTTATAGTGATAACGCTAACATTAGAGATGGTGTTTGTCCTTTTTATTGCGATTTTGATGGCTATCAGAATCTTCTTCCTCCCACATTCAGTTCACTCCCGGCGGCCTCATCTACAAGCCCGGCGGCAGCAACCTCCAACACGCCACCTCCATCACCTTCCTCCTCCTCGCCTACGCTCGCTATCTCGACCGCCCGCTCTCCGCCGCCAGGCCAAACGCCAGGTCGACTACATCCTCGGCGACAATCCCAAAGGCATCTCCTACATGGTCGGATACGGCAACTACTTCCCTCAGCGGATCCACCACCGCGGCTCCTCCCTCCCCTCCGTACACGACCATCCGCAGCCGATCGCTTGCAAGGAAGGATCGGCGTACTTCAATTCGGCGGATCCGAACCCTAATGTACTGGTCGGAGCCCTAGTCGGTGGCCCTTGGGAAGACGATGTGTATGAGGACGATCGAGCGGATTTCCGGAAATCGGAGCCGACTACTTACATTAATGCGCCGTTTGTTGGAGTTTTGGCGTATTTTGCTGCCAATCCTGGAGATTGA

Coding sequence (CDS)

ATGTATCTTGTTCTTCGAAGGCCAACGATCGGGGTTCTTGCCGCAAGATCAGCGGCGACGGGTGCTAACTCCGGCCTTGGCGACGGCTGGACTTACCGAACTGACCTCACCGGCGGCTATTACGACGCCGGTGATAATGTCAAGTTCGGTTTTCCGATGGCTTTCACTACCACGCTTTTGTCTTGGAGTGTCATTGAGTTTGGCGACTTGATGCCGCCGGCGGAGTTGAGGAATTCGTTGGTTGCCATTCGTTGGGCGACGGATTATCTCCTCAAAACGGTGTCGCAGCCTAACCGGATTTTTGTTCAGGTGGGTGATCCGAGTGCGGATCATTTTTGTTGGGAGAGACCAGAAGACATGGATACGGCGAGGACAGTGTACGCCGTGGACGCACCGGGAACTGCATCTGACGTGGCAGGAGAGACAGCAGCAGCGTTAGCAGCAGCATCGATGGCATTTCGATCGTCGGATCCCGGATACGCCGAAACATTGTTACGAAATGGAATTAGGACATTCGAGTTGGCCGATACCTATAGAGGAGCTTATAGTGATAACGCTAACATTAGAGATGGTGTTTGTCCTTTTTATTGGCATTTCGATCGTCGGATCCCGGATACGCCGAAACATTGTTACGAAATGGAATTAGGACATTCGAGTTGGCCGATACCTATAGAGGAGCTTATAGTGATAACGCTAACATTAGAGATGGTGTTTGTCCTTTTTATTGCGATTTTGATGGCTATCAGAATCTTCTTCCTCCCACATTCAGTTCACTCCCGGCGGCCTCATCTACAAGCCCGGCGGCAGCAACCTCCAACACGCCACCTCCATCACCTTCCTCCTCCTCGCCTACGCTCGCTATCTCGACCGCCCGCTCTCCGCCGCCAGGCCAAACGCCAGGTCGACTACATCCTCGGCGACAATCCCAAAGGCATCTCCTACATGGTCGGATACGGCAACTACTTCCCTCAGCGGATCCACCACCGCGGCTCCTCCCTCCCCTCCGTACACGACCATCCGCAGCCGATCGCTTGCAAGGAAGGATCGGCGTACTTCAATTCGGCGGATCCGAACCCTAATGTACTGGTCGGAGCCCTAGTCGGTGGCCCTTGGGAAGACGATGTGTATGAGGACGATCGAGCGGATTTCCGGAAATCGGAGCCGACTACTTACATTAATGCGCCGTTTGTTGGAGTTTTGGCGTATTTTGCTGCCAATCCTGGAGATTGA

Protein sequence

MYLVLRRPTIGVLAARSAATGANSGLGDGWTYRTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLVAIRWATDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDTARTVYAVDAPGTASDVAGETAAALAAASMAFRSSDPGYAETLLRNGIRTFELADTYRGAYSDNANIRDGVCPFYWHFDRRIPDTPKHCYEMELGHSSWPIPIEELIVITLTLEMVFVLFIAILMAIRIFFLPHSVHSRRPHLQARRQQPPTRHLHHLPPPRLRSLSRPPALRRQAKRQVDYILGDNPKGISYMVGYGNYFPQRIHHRGSSLPSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGPWEDDVYEDDRADFRKSEPTTYINAPFVGVLAYFAANPGD
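
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. The sketch below is illustrative only: it assumes the standard nuclear genetic code, builds the codon table by hand rather than relying on any external library, and the function name translate is hypothetical.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    A trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First seven codons and last three codons of the CDS listed above.
    print(translate("ATGTATCTTGTTCTTCGAAGG"))
    print(translate("GGAGATTGA"))
```

The first seven codons of the listed CDS translate to MYLVLRR, and its final codons GGA GAT TGA give GD followed by the stop, which agrees with the start and end of the protein sequence shown.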
Homology
BLAST of HG10000507 vs. NCBI nr
Match: XP_038900408.1 (endoglucanase 24-like [Benincasa hispida])

HSP 1 Score: 557.8 bits (1436), Expect = 8.0e-155
Identity = 306/440 (69.55%), Positives = 319/440 (72.50%), Query Frame = 0

Query: 22  ANSGLGDGWTYRTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLV 81
           ANSGLGDGWTY+TDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLV
Sbjct: 60  ANSGLGDGWTYQTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLV 119

Query: 82  AIRWATDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDTARTVYAVDAPGTASDVAGE 141
           AIRWATDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDTARTVYAVDAPGTASDVAGE
Sbjct: 120 AIRWATDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDTARTVYAVDAPGTASDVAGE 179

Query: 142 TAAALAAASMAFRSSDPGYAETLLRNGIRTFELADTYRGAYSDNANIRDGVCPFYWHFD- 201
           TAAALAAASMAFRSSDPGYAETLLRNGIR FELADTYRGAYSDN NIRDGVCPFY  FD 
Sbjct: 180 TAAALAAASMAFRSSDPGYAETLLRNGIRAFELADTYRGAYSDNDNIRDGVCPFYCDFDG 239

Query: 202 ------------RRIPDTPKHCYEM--------------ELGHSSWPIPIEELIVITLTL 261
                       RR      +   +              E G  +    +  L+      
Sbjct: 240 YQDELLWGAAWLRRASQNESYLNYIQDNGKTLGAEDSFNEFGWDNKHAGLNVLVSKEALE 299

Query: 262 EMVFVL--FIAILMAIRIFFLPHSVHSRRPHLQ----ARRQQPPTRHLHHLPPPRLRSLS 321
             +F L  + A         +P S  S   H+Q        +P   +L H        L+
Sbjct: 300 GNIFTLQSYKASADNFMCTLIPESSSS---HIQFTPGGLIYKPGGSNLQHATSITFLLLA 359

Query: 322 -------------------RPPALRRQAKRQVDYILGDNPKGISYMVGYGNYFPQRIHHR 381
                               P  LRRQAKRQ DYILGDNPKGISYMVGY NYFPQRIHHR
Sbjct: 360 YAHYLDRTSSTVNCGNVVVGPATLRRQAKRQADYILGDNPKGISYMVGYSNYFPQRIHHR 419

Query: 382 GSSLPSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGPWEDDVYEDDRADFRKSEPT 410
           GSSLPSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGP EDDVYEDDRADFRKSEPT
Sbjct: 420 GSSLPSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGPGEDDVYEDDRADFRKSEPT 479
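
The percentages on each HSP statistics line are the matched counts divided by the alignment length, where the alignment length counts gap columns as well as residues. The sketch below recomputes them from the raw fractions. It is a hedged illustration: parse_hsp_stats is a hypothetical helper, and the pattern accepts both "Positives" and the "Postives" spelling that some exports of this page use.

```python
import re

# Matches e.g. "Identity = 306/440" or "Positives = 319/440".
# "Posi?tives" also tolerates the misspelling "Postives".
_STAT = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")


def parse_hsp_stats(line):
    """Return {'identity': (matches, length, percent), 'positives': (...)}.

    The percent is recomputed from the fraction and rounded to 2 decimals.
    """
    stats = {}
    for label, matched, length in _STAT.findall(line):
        key = "identity" if label == "Identity" else "positives"
        matched, length = int(matched), int(length)
        stats[key] = (matched, length, round(100.0 * matched / length, 2))
    return stats


if __name__ == "__main__":
    line = "Identity = 306/440 (69.55%), Positives = 319/440 (72.50%), Query Frame = 0"
    for key, (matched, length, pct) in parse_hsp_stats(line).items():
        print(f"{key}: {matched}/{length} = {pct}%")
```

For the Benincasa hispida hit above, 306 of 440 alignment columns are identical and 319 of 440 are scored as positive, which reproduces the reported 69.55% and 72.50%.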

BLAST of HG10000507 vs. NCBI nr
Match: XP_008461017.1 (PREDICTED: endoglucanase 24-like [Cucumis melo] >KAA0045639.1 endoglucanase 24-like [Cucumis melo var. makuwa] >TYK02618.1 endoglucanase 24-like [Cucumis melo var. makuwa])

HSP 1 Score: 548.9 bits (1413), Expect = 3.7e-152
Identity = 301/439 (68.56%), Positives = 318/439 (72.44%), Query Frame = 0

Query: 22  ANSGLGDGWTYRTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLV 81
           ANSGLGDGWTY+TDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLV
Sbjct: 60  ANSGLGDGWTYKTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLV 119

Query: 82  AIRWATDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDTARTVYAVDAPGTASDVAGE 141
           AIRWATDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDT RTVYAVDAPGTASDVAGE
Sbjct: 120 AIRWATDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDTMRTVYAVDAPGTASDVAGE 179

Query: 142 TAAALAAASMAFRSSDPGYAETLLRNGIRTFELADTYRGAYSDNANIRDGVCPFYWHFD- 201
           TAAALAAASMAFRSSDPGYAETLL+NGI+ FELADTYRGAYSDNANIRDGVCPFY  FD 
Sbjct: 180 TAAALAAASMAFRSSDPGYAETLLQNGIKAFELADTYRGAYSDNANIRDGVCPFYCDFDG 239

Query: 202 ------------RRIPDTPKHCYEM--------------ELGHSSWPIPIEELIVITLTL 261
                       RR      +   +              E G  +    +  L+      
Sbjct: 240 YQDELLWGAAWLRRASKNESYLSYIQDNGKTLGAEDSFNEFGWDNKHAGLNVLVSKEALE 299

Query: 262 EMVFVL--FIAILMAIRIFFLPHSVHSRRPHLQ----ARRQQPPTRHLHHLPPPRLRSLS 321
             +F L  + A         +P S  S   H+Q        +P   +L H        L+
Sbjct: 300 GNIFTLQSYKASADNFMCTLIPESSSS---HIQYTPGGLIYKPGGSNLQHATSITFLLLA 359

Query: 322 -------------------RPPALRRQAKRQVDYILGDNPKGISYMVGYGNYFPQRIHHR 381
                               P +LRRQAK+QVDYILGDNPKGISYMVGYGNYFPQRIHHR
Sbjct: 360 YAHYLERTSSTVNCGNVVVGPASLRRQAKQQVDYILGDNPKGISYMVGYGNYFPQRIHHR 419

Query: 382 GSSLPSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGPWEDDVYEDDRADFRKSEPT 409
           GSSLPSV DHPQPIACKEGS YFNS DPNPNVLVGALVGGP EDDVYEDDRADFRKSEPT
Sbjct: 420 GSSLPSVRDHPQPIACKEGSTYFNSPDPNPNVLVGALVGGPGEDDVYEDDRADFRKSEPT 479

BLAST of HG10000507 vs. NCBI nr
Match: XP_004150145.2 (endoglucanase 24 [Cucumis sativus] >KGN61851.1 hypothetical protein Csa_006280 [Cucumis sativus])

HSP 1 Score: 545.0 bits (1403), Expect = 5.4e-151
Identity = 299/439 (68.11%), Positives = 317/439 (72.21%), Query Frame = 0

Query: 22  ANSGLGDGWTYRTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLV 81
           ANSGLGDGWTY+TDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGD MPPAELRNSLV
Sbjct: 60  ANSGLGDGWTYKTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDFMPPAELRNSLV 119

Query: 82  AIRWATDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDTARTVYAVDAPGTASDVAGE 141
           AIRWATDYLLKTVS+PNRIFVQVGDPSADH CWERPEDMDT RTVYAVDAPGTASDVAGE
Sbjct: 120 AIRWATDYLLKTVSEPNRIFVQVGDPSADHSCWERPEDMDTTRTVYAVDAPGTASDVAGE 179

Query: 142 TAAALAAASMAFRSSDPGYAETLLRNGIRTFELADTYRGAYSDNANIRDGVCPFYWHFD- 201
           TAAALAAASMAFRSS+P YAETLLRNGI+ FELADTYRGAYSDNANIRDGVCPFY  FD 
Sbjct: 180 TAAALAAASMAFRSSNPEYAETLLRNGIKAFELADTYRGAYSDNANIRDGVCPFYCDFDG 239

Query: 202 ------------RRIPDTPKHCYEM--------------ELGHSSWPIPIEELIVITLTL 261
                       RR      +   +              E G  +    +  L+      
Sbjct: 240 YQDELLWGAAWLRRASQNESYLNYIQDNGKTLGAEDSYNEFGWDNKHAGLNVLVSKEALE 299

Query: 262 EMVFVL--FIAILMAIRIFFLPHSVHSRRPHLQ----ARRQQPPTRHLHHLPPPRLRSLS 321
             +F L  + A         +P S  S   H+Q        +P   +L H        L+
Sbjct: 300 GNIFTLQSYRASADNFMCTLIPESSSS---HIQYTPGGLIYKPGGSNLQHATSITFLLLA 359

Query: 322 -------------------RPPALRRQAKRQVDYILGDNPKGISYMVGYGNYFPQRIHHR 381
                               P  LRRQAK+QVDYILG+NPKGISYMVGYGNYFPQRIHHR
Sbjct: 360 YANYLERTSSTVNCGNVVVGPATLRRQAKQQVDYILGENPKGISYMVGYGNYFPQRIHHR 419

Query: 382 GSSLPSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGPWEDDVYEDDRADFRKSEPT 409
           GSSLPSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGP EDDVYEDDRADFRKSEPT
Sbjct: 420 GSSLPSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGPGEDDVYEDDRADFRKSEPT 479

BLAST of HG10000507 vs. NCBI nr
Match: XP_022943418.1 (endoglucanase 24-like [Cucurbita moschata])

HSP 1 Score: 536.6 bits (1381), Expect = 1.9e-148
Identity = 295/439 (67.20%), Positives = 314/439 (71.53%), Query Frame = 0

Query: 22  ANSGLGDGWTYRTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLV 81
           ANSGL DGWTY+TDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAE+RNSL 
Sbjct: 60  ANSGLSDGWTYQTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAEVRNSLD 119

Query: 82  AIRWATDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDTARTVYAVDAPGTASDVAGE 141
           AIRW+TDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDT RTVYAVDAPGTASDVAGE
Sbjct: 120 AIRWSTDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDTTRTVYAVDAPGTASDVAGE 179

Query: 142 TAAALAAASMAFRSSDPGYAETLLRNGIRTFELADTYRGAYSDNANIRDGVCPFYWHFD- 201
           TAAALAAASMAFRSSDPGYAETLLRNGI+ FELAD YRGAYSDN +IR+GVCPFY  FD 
Sbjct: 180 TAAALAAASMAFRSSDPGYAETLLRNGIKAFELADNYRGAYSDNDHIREGVCPFYCDFDG 239

Query: 202 ------------RRIPDTPKHCYEM--------------ELGHSSWPIPIEELIVITLTL 261
                       RR      +   +              E G  +    +  L+      
Sbjct: 240 YQDELLWGAAWLRRASQNESYLSYIQDNGKTLGAEDSFNEFGWDNKHAGLNVLVSKEALE 299

Query: 262 EMVFVL--FIAILMAIRIFFLPHSVHSRRPHLQ----ARRQQPPTRHLHHLPPPRLRSLS 321
             +F L  + A         +P S  S   H+Q        +P   +L H        L+
Sbjct: 300 GNIFTLQSYKASADNFMCTLIPESSSS---HIQYTPGGLIYKPGGSNLQHATSITFLLLA 359

Query: 322 -------------------RPPALRRQAKRQVDYILGDNPKGISYMVGYGNYFPQRIHHR 381
                               P ALR QAKRQVDYILGDNPKGISYMVGYGNYFPQRIHHR
Sbjct: 360 YANYLDRTSATVNCGNVVVGPAALRSQAKRQVDYILGDNPKGISYMVGYGNYFPQRIHHR 419

Query: 382 GSSLPSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGPWEDDVYEDDRADFRKSEPT 409
           GSSLPSV +HPQPIACKEGS YFNSADPNPNVLVGALVGGP EDDVYEDDRADFRKSEPT
Sbjct: 420 GSSLPSVREHPQPIACKEGSTYFNSADPNPNVLVGALVGGPGEDDVYEDDRADFRKSEPT 479

BLAST of HG10000507 vs. NCBI nr
Match: KAG7010890.1 (Endoglucanase 24 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 535.0 bits (1377), Expect = 5.5e-148
Identity = 294/439 (66.97%), Positives = 314/439 (71.53%), Query Frame = 0

Query: 22  ANSGLGDGWTYRTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLV 81
           ANSGL DGWTY+TDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAE+RNSL 
Sbjct: 5   ANSGLSDGWTYQTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAEVRNSLD 64

Query: 82  AIRWATDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDTARTVYAVDAPGTASDVAGE 141
           AIRW+TDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDT RTVYAVDAPGTASDVAGE
Sbjct: 65  AIRWSTDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDTTRTVYAVDAPGTASDVAGE 124

Query: 142 TAAALAAASMAFRSSDPGYAETLLRNGIRTFELADTYRGAYSDNANIRDGVCPFYWHFD- 201
           TAAALAAAS+AFRSSDPGYAETLLRNGI+ FELAD YRGAYSDN +IR+GVCPFY  FD 
Sbjct: 125 TAAALAAASIAFRSSDPGYAETLLRNGIKAFELADNYRGAYSDNDHIREGVCPFYCDFDG 184

Query: 202 ------------RRIPDTPKHCYEM--------------ELGHSSWPIPIEELIVITLTL 261
                       RR      +   +              E G  +    +  L+      
Sbjct: 185 YQDELLWGAAWLRRASQNESYLSYIQDNGKTLGAEDSFNEFGWDNKHAGLNVLVSKEALE 244

Query: 262 EMVFVL--FIAILMAIRIFFLPHSVHSRRPHLQ----ARRQQPPTRHLHHLPPPRLRSLS 321
             +F L  + A         +P S  S   H+Q        +P   +L H        L+
Sbjct: 245 GNIFTLQSYKASADNFMCTLIPESSSS---HIQYTPGGLIYKPGGSNLQHATSITFLLLA 304

Query: 322 -------------------RPPALRRQAKRQVDYILGDNPKGISYMVGYGNYFPQRIHHR 381
                               P ALR QAKRQVDYILGDNPKGISYMVGYGNYFPQRIHHR
Sbjct: 305 YANYLDRTSATVNCGNVVVGPAALRSQAKRQVDYILGDNPKGISYMVGYGNYFPQRIHHR 364

Query: 382 GSSLPSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGPWEDDVYEDDRADFRKSEPT 409
           GSSLPSV +HPQPIACKEGS YFNSADPNPNVLVGALVGGP EDDVYEDDRADFRKSEPT
Sbjct: 365 GSSLPSVREHPQPIACKEGSTYFNSADPNPNVLVGALVGGPGEDDVYEDDRADFRKSEPT 424

BLAST of HG10000507 vs. ExPASy Swiss-Prot
Match: Q93YQ7 (Endoglucanase 24 OS=Arabidopsis thaliana OX=3702 GN=At4g39010 PE=2 SV=1)

HSP 1 Score: 445.7 bits (1145), Expect = 5.8e-124
Identity = 244/436 (55.96%), Positives = 294/436 (67.43%), Query Frame = 0

Query: 23  NSGLGDGWTYRTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLVA 82
           NSGL DGWT+  DLTGGYYDAGDNVKF FPMAFTTT+L+WSVIEFG+ MP +ELRNSLVA
Sbjct: 60  NSGLSDGWTHNIDLTGGYYDAGDNVKFNFPMAFTTTMLAWSVIEFGEFMPSSELRNSLVA 119

Query: 83  IRWATDYLLKTVSQ-PNRIFVQVGDPSADHFCWERPEDMDTARTVYAVDAPGTASDVAGE 142
           +RW+++YLLK+VSQ PNRIFVQVGDP ADH CWERPEDMDT RT YAV+AP  AS+VAGE
Sbjct: 120 LRWSSNYLLKSVSQLPNRIFVQVGDPIADHNCWERPEDMDTPRTAYAVNAPNPASEVAGE 179

Query: 143 TAAALAAASMAFRSSDPGYAETLLRNGIRTFELADTYRGAYSDNANIRDGVCPFYWHFD- 202
           T AAL+AAS+AFRSSDPGY++TLL+N ++TF+ AD YRGAYS N +I++ VCPFY  F+ 
Sbjct: 180 TTAALSAASIAFRSSDPGYSQTLLQNAVKTFQFADMYRGAYSSNDDIKNDVCPFYCDFNG 239

Query: 203 ------------RRIPDTPKHCYEME-----------LGHSSWPIPIEELIVITL--TLE 262
                       R+      +   +E           +    W   +  L V+     +E
Sbjct: 240 FQDELLWGAAWLRKATGDESYLNYIESNREPFGANDNVDEFGWDNKVGGLNVLVSKEVIE 299

Query: 263 MVFVLFIAILMAIRIFFLPHSVHSRRPHLQ----ARRQQPPTRHLHH------LPPPRLR 322
                  A   +   F       S  PH++        +P    L H      L     +
Sbjct: 300 GNMYNLEAYKASAESFMCSLVPESSGPHVEYTSAGLLYKPGGSQLQHATTISFLLLVYAQ 359

Query: 323 SLSR-------------PPALRRQAKRQVDYILGDNPKGISYMVGYGNYFPQRIHHRGSS 382
            LSR             P  LRR AK+QVDYILG+NP G+SYMVGYG  +P+RIHHRGSS
Sbjct: 360 YLSRSSLSLNCGTLTVPPDYLRRLAKKQVDYILGNNPMGLSYMVGYGERYPKRIHHRGSS 419

Query: 383 LPSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGPWEDDVYEDDRADFRKSEPTTYI 409
           LPS+ DHP+ I CK+GS YFNS +PNPNVL+GA+VGGP EDD+Y+DDR+DFRKSEPTTYI
Sbjct: 420 LPSIVDHPEAIRCKDGSVYFNSTEPNPNVLIGAVVGGPGEDDMYDDDRSDFRKSEPTTYI 479

BLAST of HG10000507 vs. ExPASy Swiss-Prot
Match: Q69NF5 (Endoglucanase 23 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU12 PE=2 SV=1)

HSP 1 Score: 386.0 bits (990), Expect = 5.5e-106
Identity = 221/445 (49.66%), Positives = 272/445 (61.12%), Query Frame = 0

Query: 14  AARSAATGANSGLGDGWTYRTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPP 73
           A + AA   +S + DG     DL GGYYDAGDNVKFGFPMAFT T+L+W V+EFGD MPP
Sbjct: 64  AGQRAAWRGDSAVSDGGAAGVDLEGGYYDAGDNVKFGFPMAFTATMLAWGVVEFGDAMPP 123

Query: 74  AELRNSLVAIRWATDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDTARTVYAVDAPG 133
           AE  ++  A+RWATDYLLKT+S P  IF+QVGDP+ DH CWERPEDMDTARTVY + A  
Sbjct: 124 AERAHAADAVRWATDYLLKTISHPGVIFIQVGDPTKDHGCWERPEDMDTARTVYNISAAR 183

Query: 134 TASDVAGETAAALAAASMAFRSSDPGYAETLLRNGIRTFELADTYRGAYSDNANIRDGVC 193
             SDVAGETAAALAAASM FR  DP YA  LL      FE AD ++GAYSD+  +R G C
Sbjct: 184 PGSDVAGETAAALAAASMVFRDDDPAYAARLLAGARSAFEFADEHKGAYSDDPELRAGGC 243

Query: 194 PFYWHFD-------------RRIPDTPKHCYEMELGHSSWPIPIEEL------------I 253
           PFY  FD             RR   + +  Y   + ++   +  E+             I
Sbjct: 244 PFYCDFDGYQDELLWGAAWLRRA--SKEGTYLDYIQNNGKTLGAEDSTNEFGWDNKHAGI 303

Query: 254 VITLTLEMV--FVLFIAILMAIRIFFLPHSV-HSRRPHLQ----ARRQQPPTRHLHHLP- 313
            + ++ E +   VL +         F+   +  S  PH+         +P   ++ H+  
Sbjct: 304 NVLVSKEFIDGEVLSLQSYKEFADGFICTLIPESSSPHITYTPGGMIYKPGGSNMQHVTS 363

Query: 314 --------PPRLRSLSR----------PPALRRQAKRQVDYILGDNPKGISYMVGYGNYF 373
                      L + SR          P  L++ A++Q DYILGDNP  +SYMVGYG+ +
Sbjct: 364 ISFLLLTYAKYLSNSSRTVNCGNVSVGPATLQQLARKQADYILGDNPMKMSYMVGYGDRY 423

Query: 374 PQRIHHRGSSLPSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGPWEDDVYEDDRAD 408
           PQRIHHRGSSLPS+  HPQ IAC +G+ Y+NS+ PNPN L+GA+VGGP EDDVYEDDRAD
Sbjct: 424 PQRIHHRGSSLPSIKSHPQRIACNDGTPYYNSSSPNPNPLIGAVVGGPGEDDVYEDDRAD 483

BLAST of HG10000507 vs. ExPASy Swiss-Prot
Match: Q9SVJ4 (Endoglucanase 22 OS=Arabidopsis thaliana OX=3702 GN=GH9B16 PE=3 SV=1)

HSP 1 Score: 340.9 bits (873), Expect = 2.0e-92
Identity = 201/440 (45.68%), Positives = 259/440 (58.86%), Query Frame = 0

Query: 23  NSGLGDGWTYRTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLVA 82
           NS L DG     DL GGYYDAGDN+KF FPMAFTTT+L+WS I+FG  M PA+LR++LVA
Sbjct: 54  NSALNDGKNLNVDLVGGYYDAGDNIKFHFPMAFTTTMLAWSAIDFGSYMSPADLRDNLVA 113

Query: 83  IRWATDYLLKTVSQ-PNRIFVQVGDPSADHFCWERPEDMDTARTVYAVDAPGTASDVAGE 142
           +RW ++YLLKTVSQ PNRIFVQVG+P+ DH CWERPEDMDT RT YA++AP  ASD+AGE
Sbjct: 114 LRWGSNYLLKTVSQLPNRIFVQVGEPTPDHQCWERPEDMDTPRTAYALEAPKPASDLAGE 173

Query: 143 TAAALAAASMAFRSSDPGYAETLLRNGIRTFELADTYRGAYSDNANIRDGVCPFY----- 202
            AAALAAAS+AF+  DP Y++ LL N +RTFE AD++RG+Y++N   +  VCPFY     
Sbjct: 174 IAAALAAASIAFKRFDPRYSKLLLDNALRTFEYADSHRGSYTNNPETKLAVCPFYCSVNG 233

Query: 203 ------WHFDRRIPDTPKHCY-------EMELGHSS------WPIPIEELIVITL----- 262
                 W        T K  Y           G  S      W   +  + V+       
Sbjct: 234 YEDELLWGAAWLRRATGKDSYIKYLVENRQSFGSDSNYFEFGWDNKVGGVNVLVAKEVFE 293

Query: 263 -----------TLEMVFVLFIAILMAIRIFFLPHSVHSRRPHLQARRQQPPT-------R 322
                      T E +   F        + + P  +  +    Q +     +        
Sbjct: 294 KNVAAIAPYKDTAEKLMCSFFLETPGAHMSYSPGGLLYKPGSSQLQNTVALSFLLLTYAN 353

Query: 323 HLHHLPPPRLRSLSRPP---ALRRQAK----RQVDYILGDNPKGISYMVGYGNYFPQRIH 382
           +L       L+ LS  P     +R A      +VDYILGDNP  +SYM+GYGN +P++IH
Sbjct: 354 YLSKSSQQPLQILSTTPLWYLTQRIANIVGFEKVDYILGDNPMKMSYMIGYGNRYPRQIH 413

Query: 383 HRGSSLPSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGPWEDDVYEDDRADFRKSE 408
           HRG+S PS+  HP P+ C EG   F+S +P+PNVLVGA++GGP  DD +   R +  ++E
Sbjct: 414 HRGASSPSITTHPTPVKCSEGWNSFSSPNPDPNVLVGAVIGGPNIDDKFVGGRTNASETE 473

BLAST of HG10000507 vs. ExPASy Swiss-Prot
Match: Q9CAC1 (Endoglucanase 8 OS=Arabidopsis thaliana OX=3702 GN=CEL1 PE=2 SV=1)

HSP 1 Score: 339.3 bits (869), Expect = 5.9e-92
Identity = 205/433 (47.34%), Positives = 250/433 (57.74%), Query Frame = 0

Query: 23  NSGLGDGWTYRTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLVA 82
           +S L DG +   DL+GGYYDAGDN+KFGFPMAFTTT+LSWS+I+FG  M P ELRN++ A
Sbjct: 59  DSALRDGSSAGVDLSGGYYDAGDNIKFGFPMAFTTTMLSWSIIDFGKTMGP-ELRNAVKA 118

Query: 83  IRWATDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDTARTVYAVDAPGTASDVAGET 142
           ++W TDYLLK  + P  +FVQVGD  +DH CWERPEDMDT RTVY +D     SDVAGET
Sbjct: 119 VKWGTDYLLKATAIPGVVFVQVGDAYSDHNCWERPEDMDTLRTVYKIDRAHPGSDVAGET 178

Query: 143 AAALAAASMAFRSSDPGYAETLLRNGIRTFELADTYRGAYSDNANIRDGVCPFYWHFDRR 202
           AAALAAAS+ FR  DP Y+  LL    R F  A+ YRGAYS+  ++   VCPFY  F+  
Sbjct: 179 AAALAAASIVFRKRDPAYSRLLLDRATRVFAFANRYRGAYSN--SLYHAVCPFYCDFNGY 238

Query: 203 IPD-----------TPKHCYE----------------MELGHSSWPIPIEELI--VITLT 262
             +           + K  Y                  E G  +    I  LI   + + 
Sbjct: 239 QDELLWGAAWLHKASRKRAYREFIVKNEVILKAGDTINEFGWDNKHAGINVLISKEVLMG 298

Query: 263 LEMVFVLFIAILMAIRIFFLPHSVHSR----RPHLQARRQQPPTRHLHHLP--------- 322
               F  F           LP   H +    R  L  +      +H+  L          
Sbjct: 299 KAEYFESFKQNADGFICSILPGISHPQVQYSRGGLLVKTGGSNMQHVTSLSFLLLAYSNY 358

Query: 323 -------PPRLRSLSRPPALRRQAKRQVDYILGDNPKGISYMVGYGNYFPQRIHHRGSSL 382
                   P     + P  LR+ AKRQVDYILGDNP G+SYMVGYG  FP+RIHHRGSS+
Sbjct: 359 LSHAKKVVPCGELTASPSLLRQIAKRQVDYILGDNPMGLSYMVGYGQKFPRRIHHRGSSV 418

Query: 383 PSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGPWEDDVYEDDRADFRKSEPTTYIN 407
           PSV  HP  I CKEGS YF S +PNPN+LVGA+VGGP   D + D R  F++SEPTTYIN
Sbjct: 419 PSVSAHPSHIGCKEGSRYFLSPNPNPNLLVGAVVGGPNVTDAFPDSRPYFQQSEPTTYIN 478

BLAST of HG10000507 vs. ExPASy Swiss-Prot
Match: Q8GY58 (Endoglucanase 23 OS=Arabidopsis thaliana OX=3702 GN=At4g39000 PE=2 SV=2)

HSP 1 Score: 335.1 bits (858), Expect = 1.1e-90
Identity = 199/440 (45.23%), Positives = 257/440 (58.41%), Query Frame = 0

Query: 23  NSGLGDGWTYRTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLVA 82
           NS L DG   +TDL GGYYDAGDNVKF FPMAFT T+L+WS ++FG  M   + R++LVA
Sbjct: 56  NSALNDGKNLKTDLVGGYYDAGDNVKFHFPMAFTATMLAWSSVDFGRYMSQHDFRHNLVA 115

Query: 83  IRWATDYLLKTVSQ-PNRIFVQVGDPSADHFCWERPEDMDTARTVYAVDAPGTASDVAGE 142
           ++WATDYLLKTVSQ PNRIFV VG+   DH CWERPEDMDT RT +A+DAP  ASD+AGE
Sbjct: 116 VKWATDYLLKTVSQLPNRIFVHVGEVQPDHDCWERPEDMDTPRTAFALDAPYPASDLAGE 175

Query: 143 TAAALAAASMAFRSSDPGYAETLLRNGIRTFELADTYRGAYSDNANIRDGVCPFYWHFD- 202
            AAALAAAS+AF+ ++P Y+  LL   ++TF+ AD++RG+Y+DN  I+  VCPFY   + 
Sbjct: 176 IAAALAAASIAFKQANPKYSAILLNKAVQTFQYADSHRGSYTDNPGIKQAVCPFYCSVNG 235

Query: 203 ------------RRIPDTPKHC-YEMELGHS----------SWPIPIEELIVITLTLEMV 262
                       RR      +  Y ++ G +           W   +  + V  L  + V
Sbjct: 236 YKDELLWGAAWLRRATGEDSYLRYLVDNGQAFGESSNYFEFGWDNKVGGVNV--LVAKEV 295

Query: 263 FVLFIAILMAIR-------IFFLPHSVHSRRPHLQ----------ARRQQPPTRHLHHLP 322
               +  + A +         FLP    +  PH+              Q   T  L  L 
Sbjct: 296 LQNNVTAIAAYKDTAEKMMCSFLP---ETNGPHMSYTPGGLIYKPGSTQLQNTAALSFLL 355

Query: 323 PPRLRSLS-------------RPPALRRQAKRQVDYILGDNPKGISYMVGYGNYFPQRIH 382
                 LS             +P +LRR  KRQVDY+LGDNP  +SYM+GYG  +P  IH
Sbjct: 356 LTYADYLSTSSQQLNCGNLKFQPDSLRRIVKRQVDYVLGDNPMKLSYMIGYGERYPGLIH 415

Query: 383 HRGSSLPSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGPWEDDVYEDDRADFRKSE 408
           HRGSS+PSV  HP    C  G   F+S +PNPN+L+GA++GGP  DD +   R +  ++E
Sbjct: 416 HRGSSIPSVTVHPAAFGCIAGWNIFSSPNPNPNILIGAVIGGPDVDDRFIGGRTNASETE 475

BLAST of HG10000507 vs. ExPASy TrEMBL
Match: A0A5A7TWL5 (Endoglucanase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold280G00260 PE=3 SV=1)

HSP 1 Score: 548.9 bits (1413), Expect = 1.8e-152
Identity = 301/439 (68.56%), Positives = 318/439 (72.44%), Query Frame = 0

Query: 22  ANSGLGDGWTYRTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLV 81
           ANSGLGDGWTY+TDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLV
Sbjct: 60  ANSGLGDGWTYKTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLV 119

Query: 82  AIRWATDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDTARTVYAVDAPGTASDVAGE 141
           AIRWATDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDT RTVYAVDAPGTASDVAGE
Sbjct: 120 AIRWATDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDTMRTVYAVDAPGTASDVAGE 179

Query: 142 TAAALAAASMAFRSSDPGYAETLLRNGIRTFELADTYRGAYSDNANIRDGVCPFYWHFD- 201
           TAAALAAASMAFRSSDPGYAETLL+NGI+ FELADTYRGAYSDNANIRDGVCPFY  FD 
Sbjct: 180 TAAALAAASMAFRSSDPGYAETLLQNGIKAFELADTYRGAYSDNANIRDGVCPFYCDFDG 239

Query: 202 ------------RRIPDTPKHCYEM--------------ELGHSSWPIPIEELIVITLTL 261
                       RR      +   +              E G  +    +  L+      
Sbjct: 240 YQDELLWGAAWLRRASKNESYLSYIQDNGKTLGAEDSFNEFGWDNKHAGLNVLVSKEALE 299

Query: 262 EMVFVL--FIAILMAIRIFFLPHSVHSRRPHLQ----ARRQQPPTRHLHHLPPPRLRSLS 321
             +F L  + A         +P S  S   H+Q        +P   +L H        L+
Sbjct: 300 GNIFTLQSYKASADNFMCTLIPESSSS---HIQYTPGGLIYKPGGSNLQHATSITFLLLA 359

Query: 322 -------------------RPPALRRQAKRQVDYILGDNPKGISYMVGYGNYFPQRIHHR 381
                               P +LRRQAK+QVDYILGDNPKGISYMVGYGNYFPQRIHHR
Sbjct: 360 YAHYLERTSSTVNCGNVVVGPASLRRQAKQQVDYILGDNPKGISYMVGYGNYFPQRIHHR 419

Query: 382 GSSLPSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGPWEDDVYEDDRADFRKSEPT 409
           GSSLPSV DHPQPIACKEGS YFNS DPNPNVLVGALVGGP EDDVYEDDRADFRKSEPT
Sbjct: 420 GSSLPSVRDHPQPIACKEGSTYFNSPDPNPNVLVGALVGGPGEDDVYEDDRADFRKSEPT 479

BLAST of HG10000507 vs. ExPASy TrEMBL
Match: A0A1S3CDA0 (Endoglucanase OS=Cucumis melo OX=3656 GN=LOC103499723 PE=3 SV=1)

HSP 1 Score: 548.9 bits (1413), Expect = 1.8e-152
Identity = 301/439 (68.56%), Positives = 318/439 (72.44%), Query Frame = 0

Query: 22  ANSGLGDGWTYRTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLV 81
           ANSGLGDGWTY+TDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLV
Sbjct: 60  ANSGLGDGWTYKTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLV 119

Query: 82  AIRWATDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDTARTVYAVDAPGTASDVAGE 141
           AIRWATDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDT RTVYAVDAPGTASDVAGE
Sbjct: 120 AIRWATDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDTMRTVYAVDAPGTASDVAGE 179

Query: 142 TAAALAAASMAFRSSDPGYAETLLRNGIRTFELADTYRGAYSDNANIRDGVCPFYWHFD- 201
           TAAALAAASMAFRSSDPGYAETLL+NGI+ FELADTYRGAYSDNANIRDGVCPFY  FD 
Sbjct: 180 TAAALAAASMAFRSSDPGYAETLLQNGIKAFELADTYRGAYSDNANIRDGVCPFYCDFDG 239

Query: 202 ------------RRIPDTPKHCYEM--------------ELGHSSWPIPIEELIVITLTL 261
                       RR      +   +              E G  +    +  L+      
Sbjct: 240 YQDELLWGAAWLRRASKNESYLSYIQDNGKTLGAEDSFNEFGWDNKHAGLNVLVSKEALE 299

Query: 262 EMVFVL--FIAILMAIRIFFLPHSVHSRRPHLQ----ARRQQPPTRHLHHLPPPRLRSLS 321
             +F L  + A         +P S  S   H+Q        +P   +L H        L+
Sbjct: 300 GNIFTLQSYKASADNFMCTLIPESSSS---HIQYTPGGLIYKPGGSNLQHATSITFLLLA 359

Query: 322 -------------------RPPALRRQAKRQVDYILGDNPKGISYMVGYGNYFPQRIHHR 381
                               P +LRRQAK+QVDYILGDNPKGISYMVGYGNYFPQRIHHR
Sbjct: 360 YAHYLERTSSTVNCGNVVVGPASLRRQAKQQVDYILGDNPKGISYMVGYGNYFPQRIHHR 419

Query: 382 GSSLPSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGPWEDDVYEDDRADFRKSEPT 409
           GSSLPSV DHPQPIACKEGS YFNS DPNPNVLVGALVGGP EDDVYEDDRADFRKSEPT
Sbjct: 420 GSSLPSVRDHPQPIACKEGSTYFNSPDPNPNVLVGALVGGPGEDDVYEDDRADFRKSEPT 479

BLAST of HG10000507 vs. ExPASy TrEMBL
Match: A0A0A0LPE9 (Endoglucanase OS=Cucumis sativus OX=3659 GN=Csa_2G251510 PE=3 SV=1)

HSP 1 Score: 545.0 bits (1403), Expect = 2.6e-151
Identity = 299/439 (68.11%), Positives = 317/439 (72.21%), Query Frame = 0

Query: 22  ANSGLGDGWTYRTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLV 81
           ANSGLGDGWTY+TDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGD MPPAELRNSLV
Sbjct: 60  ANSGLGDGWTYKTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDFMPPAELRNSLV 119

Query: 82  AIRWATDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDTARTVYAVDAPGTASDVAGE 141
           AIRWATDYLLKTVS+PNRIFVQVGDPSADH CWERPEDMDT RTVYAVDAPGTASDVAGE
Sbjct: 120 AIRWATDYLLKTVSEPNRIFVQVGDPSADHSCWERPEDMDTTRTVYAVDAPGTASDVAGE 179

Query: 142 TAAALAAASMAFRSSDPGYAETLLRNGIRTFELADTYRGAYSDNANIRDGVCPFYWHFD- 201
           TAAALAAASMAFRSS+P YAETLLRNGI+ FELADTYRGAYSDNANIRDGVCPFY  FD 
Sbjct: 180 TAAALAAASMAFRSSNPEYAETLLRNGIKAFELADTYRGAYSDNANIRDGVCPFYCDFDG 239

Query: 202 ------------RRIPDTPKHCYEM--------------ELGHSSWPIPIEELIVITLTL 261
                       RR      +   +              E G  +    +  L+      
Sbjct: 240 YQDELLWGAAWLRRASQNESYLNYIQDNGKTLGAEDSYNEFGWDNKHAGLNVLVSKEALE 299

Query: 262 EMVFVL--FIAILMAIRIFFLPHSVHSRRPHLQ----ARRQQPPTRHLHHLPPPRLRSLS 321
             +F L  + A         +P S  S   H+Q        +P   +L H        L+
Sbjct: 300 GNIFTLQSYRASADNFMCTLIPESSSS---HIQYTPGGLIYKPGGSNLQHATSITFLLLA 359

Query: 322 -------------------RPPALRRQAKRQVDYILGDNPKGISYMVGYGNYFPQRIHHR 381
                               P  LRRQAK+QVDYILG+NPKGISYMVGYGNYFPQRIHHR
Sbjct: 360 YANYLERTSSTVNCGNVVVGPATLRRQAKQQVDYILGENPKGISYMVGYGNYFPQRIHHR 419

Query: 382 GSSLPSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGPWEDDVYEDDRADFRKSEPT 409
           GSSLPSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGP EDDVYEDDRADFRKSEPT
Sbjct: 420 GSSLPSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGPGEDDVYEDDRADFRKSEPT 479

BLAST of HG10000507 vs. ExPASy TrEMBL
Match: A0A6J1FWW4 (Endoglucanase OS=Cucurbita moschata OX=3662 GN=LOC111448189 PE=3 SV=1)

HSP 1 Score: 536.6 bits (1381), Expect = 9.2e-149
Identity = 295/439 (67.20%), Positives = 314/439 (71.53%), Query Frame = 0

Query: 22  ANSGLGDGWTYRTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLV 81
           ANSGL DGWTY+TDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAE+RNSL 
Sbjct: 60  ANSGLSDGWTYQTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAEVRNSLD 119

Query: 82  AIRWATDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDTARTVYAVDAPGTASDVAGE 141
           AIRW+TDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDT RTVYAVDAPGTASDVAGE
Sbjct: 120 AIRWSTDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDTTRTVYAVDAPGTASDVAGE 179

Query: 142 TAAALAAASMAFRSSDPGYAETLLRNGIRTFELADTYRGAYSDNANIRDGVCPFYWHFD- 201
           TAAALAAASMAFRSSDPGYAETLLRNGI+ FELAD YRGAYSDN +IR+GVCPFY  FD 
Sbjct: 180 TAAALAAASMAFRSSDPGYAETLLRNGIKAFELADNYRGAYSDNDHIREGVCPFYCDFDG 239

Query: 202 ------------RRIPDTPKHCYEM--------------ELGHSSWPIPIEELIVITLTL 261
                       RR      +   +              E G  +    +  L+      
Sbjct: 240 YQDELLWGAAWLRRASQNESYLSYIQDNGKTLGAEDSFNEFGWDNKHAGLNVLVSKEALE 299

Query: 262 EMVFVL--FIAILMAIRIFFLPHSVHSRRPHLQ----ARRQQPPTRHLHHLPPPRLRSLS 321
             +F L  + A         +P S  S   H+Q        +P   +L H        L+
Sbjct: 300 GNIFTLQSYKASADNFMCTLIPESSSS---HIQYTPGGLIYKPGGSNLQHATSITFLLLA 359

Query: 322 -------------------RPPALRRQAKRQVDYILGDNPKGISYMVGYGNYFPQRIHHR 381
                               P ALR QAKRQVDYILGDNPKGISYMVGYGNYFPQRIHHR
Sbjct: 360 YANYLDRTSATVNCGNVVVGPAALRSQAKRQVDYILGDNPKGISYMVGYGNYFPQRIHHR 419

Query: 382 GSSLPSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGPWEDDVYEDDRADFRKSEPT 409
           GSSLPSV +HPQPIACKEGS YFNSADPNPNVLVGALVGGP EDDVYEDDRADFRKSEPT
Sbjct: 420 GSSLPSVREHPQPIACKEGSTYFNSADPNPNVLVGALVGGPGEDDVYEDDRADFRKSEPT 479

BLAST of HG10000507 vs. ExPASy TrEMBL
Match: A0A6J1JE94 (Endoglucanase OS=Cucurbita maxima OX=3661 GN=LOC111484228 PE=3 SV=1)

HSP 1 Score: 531.2 bits (1367), Expect = 3.9e-147
Identity = 292/439 (66.51%), Positives = 312/439 (71.07%), Query Frame = 0

Query: 22  ANSGLGDGWTYRTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLV 81
           ANSGL DGWTY+TDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPP E+RNSL 
Sbjct: 60  ANSGLSDGWTYQTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPVEVRNSLD 119

Query: 82  AIRWATDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDTARTVYAVDAPGTASDVAGE 141
           AIRW+TDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDT RTVYAVDAPGTASDVAGE
Sbjct: 120 AIRWSTDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDTMRTVYAVDAPGTASDVAGE 179

Query: 142 TAAALAAASMAFRSSDPGYAETLLRNGIRTFELADTYRGAYSDNANIRDGVCPFYWHFD- 201
           TAAALAAASMAFRSSDPGYA+TLLRNGI+ FELAD YRGAYSDN +IR+GVCPFY  FD 
Sbjct: 180 TAAALAAASMAFRSSDPGYADTLLRNGIKAFELADNYRGAYSDNDHIREGVCPFYCDFDG 239

Query: 202 ------------RRIPDTPKHCYEM--------------ELGHSSWPIPIEELIVITLTL 261
                       RR      +   +              E G  +    +  L+      
Sbjct: 240 YQDELLWGAAWLRRASQNESYLSYIQDNGKTLGAEDSFNEFGWDNKHAGLNVLVSKEALE 299

Query: 262 EMVFVL--FIAILMAIRIFFLPHSVHSRRPHLQ----ARRQQPPTRHLHHLPPPRLRSLS 321
             +F L  + A         +P S  S   H+Q        +P   +L H        L+
Sbjct: 300 GNIFTLQSYKASADNFMCTLIPESSSS---HIQYTPGGLIYKPGGSNLQHATSITFLLLA 359

Query: 322 -------------------RPPALRRQAKRQVDYILGDNPKGISYMVGYGNYFPQRIHHR 381
                               P ALR QAKRQVDYILGDNPKGISYMVGYG YFPQRIHHR
Sbjct: 360 YANYLDRTSATVNCGNVVVGPAALRSQAKRQVDYILGDNPKGISYMVGYGKYFPQRIHHR 419

Query: 382 GSSLPSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGPWEDDVYEDDRADFRKSEPT 409
           GSSLPSV +HPQPIACKEGS YFNSADPNPNVLVGALVGGP EDDVYEDDRADFRKSEPT
Sbjct: 420 GSSLPSVREHPQPIACKEGSTYFNSADPNPNVLVGALVGGPGEDDVYEDDRADFRKSEPT 479

BLAST of HG10000507 vs. TAIR 10
Match: AT4G39010.1 (glycosyl hydrolase 9B18 )

HSP 1 Score: 445.7 bits (1145), Expect = 4.1e-125
Identity = 244/436 (55.96%), Positives = 294/436 (67.43%), Query Frame = 0

Query: 23  NSGLGDGWTYRTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLVA 82
           NSGL DGWT+  DLTGGYYDAGDNVKF FPMAFTTT+L+WSVIEFG+ MP +ELRNSLVA
Sbjct: 60  NSGLSDGWTHNIDLTGGYYDAGDNVKFNFPMAFTTTMLAWSVIEFGEFMPSSELRNSLVA 119

Query: 83  IRWATDYLLKTVSQ-PNRIFVQVGDPSADHFCWERPEDMDTARTVYAVDAPGTASDVAGE 142
           +RW+++YLLK+VSQ PNRIFVQVGDP ADH CWERPEDMDT RT YAV+AP  AS+VAGE
Sbjct: 120 LRWSSNYLLKSVSQLPNRIFVQVGDPIADHNCWERPEDMDTPRTAYAVNAPNPASEVAGE 179

Query: 143 TAAALAAASMAFRSSDPGYAETLLRNGIRTFELADTYRGAYSDNANIRDGVCPFYWHFD- 202
           T AAL+AAS+AFRSSDPGY++TLL+N ++TF+ AD YRGAYS N +I++ VCPFY  F+ 
Sbjct: 180 TTAALSAASIAFRSSDPGYSQTLLQNAVKTFQFADMYRGAYSSNDDIKNDVCPFYCDFNG 239

Query: 203 ------------RRIPDTPKHCYEME-----------LGHSSWPIPIEELIVITL--TLE 262
                       R+      +   +E           +    W   +  L V+     +E
Sbjct: 240 FQDELLWGAAWLRKATGDESYLNYIESNREPFGANDNVDEFGWDNKVGGLNVLVSKEVIE 299

Query: 263 MVFVLFIAILMAIRIFFLPHSVHSRRPHLQ----ARRQQPPTRHLHH------LPPPRLR 322
                  A   +   F       S  PH++        +P    L H      L     +
Sbjct: 300 GNMYNLEAYKASAESFMCSLVPESSGPHVEYTSAGLLYKPGGSQLQHATTISFLLLVYAQ 359

Query: 323 SLSR-------------PPALRRQAKRQVDYILGDNPKGISYMVGYGNYFPQRIHHRGSS 382
            LSR             P  LRR AK+QVDYILG+NP G+SYMVGYG  +P+RIHHRGSS
Sbjct: 360 YLSRSSLSLNCGTLTVPPDYLRRLAKKQVDYILGNNPMGLSYMVGYGERYPKRIHHRGSS 419

Query: 383 LPSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGPWEDDVYEDDRADFRKSEPTTYI 409
           LPS+ DHP+ I CK+GS YFNS +PNPNVL+GA+VGGP EDD+Y+DDR+DFRKSEPTTYI
Sbjct: 420 LPSIVDHPEAIRCKDGSVYFNSTEPNPNVLIGAVVGGPGEDDMYDDDRSDFRKSEPTTYI 479

BLAST of HG10000507 vs. TAIR 10
Match: AT1G70710.1 (glycosyl hydrolase 9B1 )

HSP 1 Score: 339.3 bits (869), Expect = 4.2e-93
Identity = 205/433 (47.34%), Positives = 250/433 (57.74%), Query Frame = 0

Query: 23  NSGLGDGWTYRTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLVA 82
           +S L DG +   DL+GGYYDAGDN+KFGFPMAFTTT+LSWS+I+FG  M P ELRN++ A
Sbjct: 59  DSALRDGSSAGVDLSGGYYDAGDNIKFGFPMAFTTTMLSWSIIDFGKTMGP-ELRNAVKA 118

Query: 83  IRWATDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDTARTVYAVDAPGTASDVAGET 142
           ++W TDYLLK  + P  +FVQVGD  +DH CWERPEDMDT RTVY +D     SDVAGET
Sbjct: 119 VKWGTDYLLKATAIPGVVFVQVGDAYSDHNCWERPEDMDTLRTVYKIDRAHPGSDVAGET 178

Query: 143 AAALAAASMAFRSSDPGYAETLLRNGIRTFELADTYRGAYSDNANIRDGVCPFYWHFDRR 202
           AAALAAAS+ FR  DP Y+  LL    R F  A+ YRGAYS+  ++   VCPFY  F+  
Sbjct: 179 AAALAAASIVFRKRDPAYSRLLLDRATRVFAFANRYRGAYSN--SLYHAVCPFYCDFNGY 238

Query: 203 IPD-----------TPKHCYE----------------MELGHSSWPIPIEELI--VITLT 262
             +           + K  Y                  E G  +    I  LI   + + 
Sbjct: 239 QDELLWGAAWLHKASRKRAYREFIVKNEVILKAGDTINEFGWDNKHAGINVLISKEVLMG 298

Query: 263 LEMVFVLFIAILMAIRIFFLPHSVHSR----RPHLQARRQQPPTRHLHHLP--------- 322
               F  F           LP   H +    R  L  +      +H+  L          
Sbjct: 299 KAEYFESFKQNADGFICSILPGISHPQVQYSRGGLLVKTGGSNMQHVTSLSFLLLAYSNY 358

Query: 323 -------PPRLRSLSRPPALRRQAKRQVDYILGDNPKGISYMVGYGNYFPQRIHHRGSSL 382
                   P     + P  LR+ AKRQVDYILGDNP G+SYMVGYG  FP+RIHHRGSS+
Sbjct: 359 LSHAKKVVPCGELTASPSLLRQIAKRQVDYILGDNPMGLSYMVGYGQKFPRRIHHRGSSV 418

Query: 383 PSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGPWEDDVYEDDRADFRKSEPTTYIN 407
           PSV  HP  I CKEGS YF S +PNPN+LVGA+VGGP   D + D R  F++SEPTTYIN
Sbjct: 419 PSVSAHPSHIGCKEGSRYFLSPNPNPNLLVGAVVGGPNVTDAFPDSRPYFQQSEPTTYIN 478

BLAST of HG10000507 vs. TAIR 10
Match: AT4G39000.1 (glycosyl hydrolase 9B17 )

HSP 1 Score: 335.1 bits (858), Expect = 7.8e-92
Identity = 199/440 (45.23%), Positives = 257/440 (58.41%), Query Frame = 0

Query: 23  NSGLGDGWTYRTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLVA 82
           NS L DG   +TDL GGYYDAGDNVKF FPMAFT T+L+WS ++FG  M   + R++LVA
Sbjct: 56  NSALNDGKNLKTDLVGGYYDAGDNVKFHFPMAFTATMLAWSSVDFGRYMSQHDFRHNLVA 115

Query: 83  IRWATDYLLKTVSQ-PNRIFVQVGDPSADHFCWERPEDMDTARTVYAVDAPGTASDVAGE 142
           ++WATDYLLKTVSQ PNRIFV VG+   DH CWERPEDMDT RT +A+DAP  ASD+AGE
Sbjct: 116 VKWATDYLLKTVSQLPNRIFVHVGEVQPDHDCWERPEDMDTPRTAFALDAPYPASDLAGE 175

Query: 143 TAAALAAASMAFRSSDPGYAETLLRNGIRTFELADTYRGAYSDNANIRDGVCPFYWHFD- 202
            AAALAAAS+AF+ ++P Y+  LL   ++TF+ AD++RG+Y+DN  I+  VCPFY   + 
Sbjct: 176 IAAALAAASIAFKQANPKYSAILLNKAVQTFQYADSHRGSYTDNPGIKQAVCPFYCSVNG 235

Query: 203 ------------RRIPDTPKHC-YEMELGHS----------SWPIPIEELIVITLTLEMV 262
                       RR      +  Y ++ G +           W   +  + V  L  + V
Sbjct: 236 YKDELLWGAAWLRRATGEDSYLRYLVDNGQAFGESSNYFEFGWDNKVGGVNV--LVAKEV 295

Query: 263 FVLFIAILMAIR-------IFFLPHSVHSRRPHLQ----------ARRQQPPTRHLHHLP 322
               +  + A +         FLP    +  PH+              Q   T  L  L 
Sbjct: 296 LQNNVTAIAAYKDTAEKMMCSFLP---ETNGPHMSYTPGGLIYKPGSTQLQNTAALSFLL 355

Query: 323 PPRLRSLS-------------RPPALRRQAKRQVDYILGDNPKGISYMVGYGNYFPQRIH 382
                 LS             +P +LRR  KRQVDY+LGDNP  +SYM+GYG  +P  IH
Sbjct: 356 LTYADYLSTSSQQLNCGNLKFQPDSLRRIVKRQVDYVLGDNPMKLSYMIGYGERYPGLIH 415

Query: 383 HRGSSLPSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGPWEDDVYEDDRADFRKSE 408
           HRGSS+PSV  HP    C  G   F+S +PNPN+L+GA++GGP  DD +   R +  ++E
Sbjct: 416 HRGSSIPSVTVHPAAFGCIAGWNIFSSPNPNPNILIGAVIGGPDVDDRFIGGRTNASETE 475

BLAST of HG10000507 vs. TAIR 10
Match: AT4G02290.1 (glycosyl hydrolase 9B13 )

HSP 1 Score: 334.3 bits (856), Expect = 1.3e-91
Identity = 202/440 (45.91%), Positives = 255/440 (57.95%), Query Frame = 0

Query: 23  NSGLGDGWTYRTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLVA 82
           +SGL DG     DL GGYYDAGDN+KFGFPMAFTTT+LSWSVIEFG LM  +EL+N+ +A
Sbjct: 83  DSGLSDGSALHVDLVGGYYDAGDNIKFGFPMAFTTTMLSWSVIEFGGLM-KSELQNAKIA 142

Query: 83  IRWATDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDTARTVYAVDAPGTASDVAGET 142
           IRWATDYLLK  SQP+ I+VQVGD + DH CWERPEDMDT R+V+ VD     SDVA ET
Sbjct: 143 IRWATDYLLKATSQPDTIYVQVGDANKDHSCWERPEDMDTVRSVFKVDKNIPGSDVAAET 202

Query: 143 AAALAAASMAFRSSDPGYAETLLRNGIRTFELADTYRGAYSDNANIRDGVCPFYWHFDRR 202
           AAALAAA++ FR SDP Y++ LL+  I  F  AD YRG YS  A ++  VCPFY  +   
Sbjct: 203 AAALAAAAIVFRKSDPSYSKVLLKRAISVFAFADKYRGTYS--AGLKPDVCPFYCSYSG- 262

Query: 203 IPDTPKHCYEME-LGHSSW----PIPIEELIVITLTLEMVFVLFI------------AIL 262
                   Y+ E L  ++W       I+ L  I +  +++                 A +
Sbjct: 263 --------YQDELLWGAAWLQKATKNIKYLNYIKINGQILGAAEYDNTFGWDNKHAGARI 322

Query: 263 MAIRIFFLPH--SVHSRRPHLQ------------ARRQQPPTRHLHHLPPPRLR------ 322
           +  + F + +  ++H  + H              +  Q  P   L  +    ++      
Sbjct: 323 LLTKAFLVQNVKTLHEYKGHADNFICSVIPGAPFSSTQYTPGGLLFKMADANMQYVTSTS 382

Query: 323 ---------------------SLSRPPALRRQAKRQVDYILGDNPKGISYMVGYGNYFPQ 382
                                S+  P  LR  AKRQVDY+LGDNP  +SYMVGYG  FP+
Sbjct: 383 FLLLTYAKYLTSAKTVVHCGGSVYTPGRLRSIAKRQVDYLLGDNPLRMSYMVGYGPKFPR 442

Query: 383 RIHHRGSSLPSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGPWEDDVYEDDRADFR 405
           RIHHRGSSLP V  HP  I C +G A  NS  PNPN LVGA+VGGP + D + D+R+D+ 
Sbjct: 443 RIHHRGSSLPCVASHPAKIQCHQGFAIMNSQSPNPNFLVGAVVGGPDQHDRFPDERSDYE 502

BLAST of HG10000507 vs. TAIR 10
Match: AT1G23210.1 (glycosyl hydrolase 9B6 )

HSP 1 Score: 330.5 bits (846), Expect = 1.9e-90
Identity = 201/433 (46.42%), Positives = 250/433 (57.74%), Query Frame = 0

Query: 23  NSGLGDGWTYRTDLTGGYYDAGDNVKFGFPMAFTTTLLSWSVIEFGDLMPPAELRNSLVA 82
           +S L DG +   DLTGGYYDAGDNVKFGFPMAFTTT++SWSVI+FG  M P EL N++ A
Sbjct: 59  DSALRDGSSAGVDLTGGYYDAGDNVKFGFPMAFTTTMMSWSVIDFGKTMGP-ELENAVKA 118

Query: 83  IRWATDYLLKTVSQPNRIFVQVGDPSADHFCWERPEDMDTARTVYAVDAPGTASDVAGET 142
           I+W TDYL+K    P+ +FVQVGD  +DH CWERPEDMDT RTVY +D   + S+VAGET
Sbjct: 119 IKWGTDYLMKATQIPDVVFVQVGDAYSDHNCWERPEDMDTLRTVYKIDKDHSGSEVAGET 178

Query: 143 AAALAAASMAFRSSDPGYAETLLRNGIRTFELADTYRGAYSDNANIRDGVCPFYWHFDRR 202
           AAALAAAS+ F   DP Y++ LL    R F  A  YRGAYSD  ++   VCPFY  F+  
Sbjct: 179 AAALAAASIVFEKRDPVYSKMLLDRATRVFAFAQKYRGAYSD--SLYQAVCPFYCDFNGY 238

Query: 203 IPD-----------TPKHCYE----------------MELGHSSWPIPIEELI--VITLT 262
             +           + K  Y                  E G  +    I  L+  ++ + 
Sbjct: 239 EDELLWGAAWLHKASKKRVYREFIVKNQVILRAGDTIHEFGWDNKHAGINVLVSKMVLMG 298

Query: 263 LEMVFVLFIAILMAIRIFFLPHSVHSRRPHLQ--------ARRQQPPT----------RH 322
               F  F           LP   H +  + Q            Q  T           +
Sbjct: 299 KAEYFQSFKQNADEFICSLLPGISHPQVQYSQGGLLVKSGGSNMQHVTSLSFLLLTYSNY 358

Query: 323 LHHLPP--PRLRSLSRPPALRRQAKRQVDYILGDNPKGISYMVGYGNYFPQRIHHRGSSL 382
           L H     P     + P  LR+ AKRQVDYILGDNP  +SYMVGYG+ FPQ+IHHRGSS+
Sbjct: 359 LSHANKVVPCGEFTASPALLRQVAKRQVDYILGDNPMKMSYMVGYGSRFPQKIHHRGSSV 418

Query: 383 PSVHDHPQPIACKEGSAYFNSADPNPNVLVGALVGGPWEDDVYEDDRADFRKSEPTTYIN 407
           PSV DHP  I CK+GS YF S +PNPN+L+GA+VGGP   D + D R  F+ +EPTTYIN
Sbjct: 419 PSVVDHPDRIGCKDGSRYFFSNNPNPNLLIGAVVGGPNITDDFPDSRPYFQLTEPTTYIN 478
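
Each HSP above reports its identity and positive counts as a fraction of the aligned length. As a minimal sketch (the function name `parse_hsp_stats` and the returned keys are illustrative, not part of any BLAST tool), the statistics line can be parsed and the percentages recomputed from the raw counts like this:

```python
import re

# Matches the statistics line of one HSP. The pattern accepts both
# "Positives" and the "Postives" spelling found in some report exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP statistics line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, aligned = int(match.group(1)), int(match.group(2))
    positives = int(match.group(4))
    return {
        "identical": identical,
        "positives": positives,
        "aligned": aligned,
        # Percentages are recomputed from the counts, not copied from the text.
        "identity_pct": round(100 * identical / aligned, 2),
        "positives_pct": round(100 * positives / aligned, 2),
    }


# Statistics line of the Cucurbita moschata TrEMBL hit (A0A6J1FWW4).
stats = parse_hsp_stats(
    "Identity = 295/439 (67.20%), Postives = 314/439 (71.53%), Query Frame = 0"
)
print(stats)
```

Recomputing the percentages from the counts is a cheap consistency check on a copied report.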

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038900408.1  8.0e-155  69.55     endoglucanase 24-like [Benincasa hispida]
XP_008461017.1  3.7e-152  68.56     PREDICTED: endoglucanase 24-like [Cucumis melo] >KAA0045639.1 endoglucanase 24-l...
XP_004150145.2  5.4e-151  68.11     endoglucanase 24 [Cucumis sativus] >KGN61851.1 hypothetical protein Csa_006280 [...
XP_022943418.1  1.9e-148  67.20     endoglucanase 24-like [Cucurbita moschata]
KAG7010890.1    5.5e-148  66.97     Endoglucanase 24 [Cucurbita argyrosperma subsp. argyrosperma]

Match Name      E-value   Identity  Description
Q93YQ7          5.8e-124  55.96     Endoglucanase 24 OS=Arabidopsis thaliana OX=3702 GN=At4g39010 PE=2 SV=1
Q69NF5          5.5e-106  49.66     Endoglucanase 23 OS=Oryza sativa subsp. japonica OX=39947 GN=GLU12 PE=2 SV=1
Q9SVJ4          2.0e-92   45.68     Endoglucanase 22 OS=Arabidopsis thaliana OX=3702 GN=GH9B16 PE=3 SV=1
Q9CAC1          5.9e-92   47.34     Endoglucanase 8 OS=Arabidopsis thaliana OX=3702 GN=CEL1 PE=2 SV=1
Q8GY58          1.1e-90   45.23     Endoglucanase 23 OS=Arabidopsis thaliana OX=3702 GN=At4g39000 PE=2 SV=2

Match Name      E-value   Identity  Description
A0A5A7TWL5      1.8e-152  68.56     Endoglucanase OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold280G00260 ...
A0A1S3CDA0      1.8e-152  68.56     Endoglucanase OS=Cucumis melo OX=3656 GN=LOC103499723 PE=3 SV=1
A0A0A0LPE9      2.6e-151  68.11     Endoglucanase OS=Cucumis sativus OX=3659 GN=Csa_2G251510 PE=3 SV=1
A0A6J1FWW4      9.2e-149  67.20     Endoglucanase OS=Cucurbita moschata OX=3662 GN=LOC111448189 PE=3 SV=1
A0A6J1JE94      3.9e-147  66.51     Endoglucanase OS=Cucurbita maxima OX=3661 GN=LOC111484228 PE=3 SV=1

Match Name      E-value   Identity  Description
AT4G39010.1     4.1e-125  55.96     glycosyl hydrolase 9B18
AT1G70710.1     4.2e-93   47.34     glycosyl hydrolase 9B1
AT4G39000.1     7.8e-92   45.23     glycosyl hydrolase 9B17
AT4G02290.1     1.3e-91   45.91     glycosyl hydrolase 9B13
AT1G23210.1     1.9e-90   46.42     glycosyl hydrolase 9B6
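
The summary rows rank hits by E-value, with smaller values indicating more significant matches. The sketch below, offered only as an illustration, reproduces that ranking for the TAIR 10 hits. The helper `rank_hits` and its `max_evalue` cutoff are hypothetical and not a feature of the database.

```python
# Rows copied from the TAIR 10 summary: (match name, E-value, % identity).
tair_hits = [
    ("AT4G39010.1", "4.1e-125", 55.96),
    ("AT1G70710.1", "4.2e-93", 47.34),
    ("AT4G39000.1", "7.8e-92", 45.23),
    ("AT4G02290.1", "1.3e-91", 45.91),
    ("AT1G23210.1", "1.9e-90", 46.42),
]


def rank_hits(hits, max_evalue=1e-50):
    """Keep hits at or below max_evalue (an assumed cutoff), best first.

    Ties on E-value are broken by higher percent identity.
    """
    parsed = [(name, float(evalue), identity) for name, evalue, identity in hits]
    kept = [hit for hit in parsed if hit[1] <= max_evalue]
    return sorted(kept, key=lambda hit: (hit[1], -hit[2]))


ranked = rank_hits(tair_hits)
best_name, best_evalue, best_identity = ranked[0]
print(best_name, best_evalue, best_identity)
```

E-values this small still fit in a Python float, so plain numeric comparison is sufficient here; values reported as 0.0 would need no special handling either.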
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                                     Source       Source Term      Source Description        Alignment
IPR012341  Six-hairpin glycosidase-like superfamily            GENE3D       1.50.10.10       -                         coord: 11..201; e-value: 1.6E-62; score: 214.0
IPR012341  Six-hairpin glycosidase-like superfamily            GENE3D       1.50.10.10       -                         coord: 283..409; e-value: 1.2E-40; score: 142.0
IPR001701  Glycoside hydrolase family 9                        PFAM         PF00759          Glyco_hydro_9             coord: 292..400; e-value: 2.9E-33; score: 116.0
IPR001701  Glycoside hydrolase family 9                        PFAM         PF00759          Glyco_hydro_9             coord: 25..196; e-value: 2.9E-53; score: 181.9
None       No IPR available                                    MOBIDB_LITE  mobidb-lite      disorder_prediction       coord: 262..291
None       No IPR available                                    MOBIDB_LITE  mobidb-lite      disorder_prediction       coord: 262..282
None       No IPR available                                    PANTHER      PTHR22298:SF129  ENDOGLUCANASE 24          coord: 290..407
None       No IPR available                                    PANTHER      PTHR22298:SF129  ENDOGLUCANASE 24          coord: 21..200
None       No IPR available                                    PANTHER      PTHR22298        ENDO-1,4-BETA-GLUCANASE   coord: 290..407
None       No IPR available                                    PANTHER      PTHR22298        ENDO-1,4-BETA-GLUCANASE   coord: 21..200
IPR033126  Glycosyl hydrolases family 9, Asp/Glu active sites  PROSITE      PS00698          GH9_3                     coord: 376..394
IPR018221  Glycoside hydrolase family 9, His active site       PROSITE      PS00592          GH9_2                     coord: 303..329
IPR008928  Six-hairpin glycosidase superfamily                 SUPERFAMILY  48208            Six-hairpin glycosidases  coord: 21..406
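
The `coord` values give 1-based, inclusive residue ranges on the predicted protein, and several signatures are reported as two separate segments. As a rough sketch (the helper names `parse_coord` and `covered_residues` are made up for this example), the ranges can be parsed and the number of residues covered by a set of segments counted as follows:

```python
import re


def parse_coord(text):
    """Turn a string such as 'coord: 25..196' into an inclusive (start, end) pair."""
    match = re.search(r"(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no coordinate range in {text!r}")
    return int(match.group(1)), int(match.group(2))


def covered_residues(segments):
    """Count residues covered by 1-based inclusive segments, counting overlaps once."""
    total = 0
    covered_up_to = 0  # last residue already counted
    for start, end in sorted(segments):
        start = max(start, covered_up_to + 1)  # skip any part already counted
        if end >= start:
            total += end - start + 1
            covered_up_to = end
    return total


# The two Pfam PF00759 (Glyco_hydro_9) segments from the annotation above.
pfam_segments = [parse_coord("coord: 292..400"), parse_coord("coord: 25..196")]
print(covered_residues(pfam_segments))
```

Counting overlaps once matters when segments from different sources are combined, for example the PANTHER range 21..200 together with the Pfam range 25..196.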

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
HG10000507.1   HG10000507.1   mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0000272      polysaccharide catabolic process
biological_process  GO:0005975      carbohydrate metabolic process
molecular_function  GO:0004553      hydrolase activity, hydrolyzing O-glycosyl compounds