HG10020088 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10020088
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionlysosomal Pro-X carboxypeptidase-like
LocationChr04: 28655953 .. 28666202 (+)
RNA-Seq ExpressionHG10020088
SyntenyHG10020088
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGATGGTGTAATAATTGCGACGAGTTGGTCATTTCTGTAGAAGACGACGCGGAGTTTTCTGCTCTGACTCCGAAGTCTCCCACTTATCTAACAGTTCGCCCGCACATCGCAATTGGCACAAACTTCCAGTCTTCTGCAACAATGGCCGCTCCATCCATTTCCATTTCCCTTTCCTTTTTCCTCTTCCTTTCCCTCCATTTCACTTCTTCTTTCTCCAAATTTCCCCTTCCCTTCTCTTCTTCCCTCCTTCTCCGCCCTCAAAATCCCCCAGTCGATTCCCTCCCCCATTACCACACCAACTTCTTCACTCAGATCCTCGACCACTTCAACTTCAATCCCCAAAGCTACCACACGTTTCAGCAAAGGTATCTAATCAACGACACCTATTGGGGCGGCGCCGCCCACAATGCTCCCATCTTCGTTTACACCGGCAACGAAGGTAATATCGAATGGTTTGCTCAGAACACTGGTTTTCTCCTCCAATCCGCTCCACATTTCCGTGCGCTCGTCGTTTTCATCGAGGTATACAATCCAATTTCAATTCAATTCAATTTTGCCCTAATTTCTTTTCTGTTCATCTGCGATGGAGTAATTGTAACTGATATTGTTTGTGTTGCAGCATAGGTTTTACGGGAAATCGATTCCGTTTGGGGGAGATGAAGATGTGGCTAATAGTAATTCGAGTACGCTCGGATTTCTGAGCTCCACGCAAGCGTTGGCGGATTATGCAACTCTGATTACCGATTTGAAGAAGAATTTGACGGCGATTGATTCGCCGGTTCTGGTCTTTGGTGGCTCTTATGGAGGAAGTAAGTCAATTGCCATTCTCGATTTACTTTCCACTTTCTTGATTTCATCGCGCCATTGAGGATTTTTCTTTCCTTCGTTAATTTTCTATAGATGAATTCCTAGGAGATTGTTGGCAATGGTTCACTTACCCTGTTTCTGTTTCAATTGTCACGGAGTTCTTAATTTTAATTTTAATCTATTAGAAATGATTTTATTTTAAAATATATTTAAATTCAAATAATTGTTTTATATGACAAAACTATTAAAATTATTTTTAAATATAGCAAAATATCAAAGTGACATTTTGTTATATTTATAAATAAATTGTCTCATTTTGCTATATTTAAAAATAGTCTTATGTATATTTACTAAAATTTGTTATGCATATTTTATCTAGTATTTGTCAGTAGTAACAATATAGTCATCAAGTTTTAGTTTGTTAGGATCTAATCCCTTCTTTTAAAATTGTAACTATTTAATCCTGAATTTTTAACATATAATACTTTTAACTTACCTACCTTCAAATTTTAGTCTCTATTATGAAAAATCTCATAAAAAAATAATTGTCAATTTTTATTACCTAACAACTTAATCTTGGTAATTTATAAATAAATTTGTTTTTGATCAATTTATTAATATAAATATAACAAGTTTTACTAAATATTAATAATACTTTTTAACGCTAGAGATTAAATTTGTGAATTTGTTGGATAAGGAAAATGTGTTCTTTTTAACATGATTACATTTATTCTGAGTTTGGTGAGTGTGCAGTGCTGGCAGCATGGTTTAGATTGAAATACCCTCACATTGCGATTGGAGCTTTAGCATCATCTGCTCCCATTCTCCAATTTGAAAACATTACATCTCCTTATGCCTTCAACAACATCATCACCCAAGATTTCAAGGTCATCTCTCTTCTCTTTACTTTTCTTTTTCAAATTGCATTCTCTTCTTTACTTTTCCTTATCAAATTGCTTTTTTGACATTATTACATTAATTAAAACCTAATCTTGGGTTTTCTTTTTTTCTTTTTTTTTTTTCCAATTTAATCCATATAATTTATAAAAATGTTTCAATTTAACCTATTTTTTAAAGCTTTTTTAATTTATTCTCTTGCTGTTGGTAAATGTTAGAATTGGCAACCATATAATGCTGCATCAATTGGTGAAAATTTAAAAGAAAAAAATGTCGTAGAAATTTAGAGGAGAATTAGATACTTTTCTTTCTTAAGTTAGTTTTTTTTCTTCTATGAATTCCTACCCATTTGCTAAGTATTGTGTCATGTAAATTTATCATCAACTAATTTTATTATTTACTAAATTGCCAACTTGAGCATTACTCAACTGGTTAATATACGTATCTTTGGTCAAGAGATTTAAACTTTAAATCTTCCCACTTGAAAAAAGTCGAAATATTAGAGACTAAAGCATATATTTTAGTCTAAATTTAACAGCATAAACAAACCAAAAAAGCTAAACTTTTACCCTTCAACTTTAAAGAATTTTACAAATTACTATTATGTTGCTAATATATATTTTTTTCCCTCAAATGTTACGATCAAAGTCTCATATTAGTTAGATAAGAGAGTGATCATAAGTTTAAAAGGGATGACAACTATCTCCAATGATATGAGGTCTTTTGGATAAAACCAAAAAAAGTCATGAGAGTTTATGCCTAAACTGGATAAATATTATACCATTGTAGAGATATCTAGAGGTCCATTGGATAATGACCAAAATTCTTCTGTCACATGAAAATTTTTAATTGGATAACTGGTGGTGTAGCTCAACTATCTGAGTTTTTTTATGAAAGTGGGCTAATAACGAGATAAAATATGTTGAAACAAATTAAGCACGGAAAATATAGAAAGTAAAAAGTAAACTAACGTAATTTGAGAAAGAAAAATAGAATATAAACAATAAAACATCAAATATGGTCCCGTAATAATGATAATGTTATATTGGGGGTGAGGTTGAAAGCAAAGACCTATAAATATAATTAAAAGATCATCTTTTTCATTCAAGTAAGGACAACCCCACATGCCTTGTGTCTTTGTATATAGCAACAAATTAACAAGAAAGATGCCATGAACATTGCGACACAAATAACAAAACAACTAAAACGACGACAATACTAATAATCTAACAATAAAACCACTCTTTATAAATGTATTTAAAATTTTATTAAGGTCTTGTTTGAATTGACTTTATTACCACTTATTTAAATATTTATAAAATCAATCCAAACAGGCTCTAAATTACAAATCTACTCTCTAAACTCTTTGAAGTATCTATTTGATCTTTAAACTTTCAAAAATGTATACAAAGTCGTGAGCTTTCAATTTTATATTTAAAGAGTAATGATGTATTATATATAAATTTAAAATTTTGTGTCTAATAAAACTTTAAACTCTTTCCATTTTGTGTCTATTATAGTTTTGAATTTCTAACAATAATTAAGAAAAAAAGCAATTTTAACTCTAAACTTTCAATTTGATCAAATTGTTTCGTTAAAAAAAATTTCTTGTATGCCTTCAATGAATTATTGAATGGATGTACAATATGAAGTTGATAAAGTTTTAGAGAAAAAATGACAATAGAACTTTTTTTTCTTTTTGGTTGAAATTTATGAACTTTGTCAAAATTATTCAATAATTAATTTATGCATCGGTCAAATTTGCTTTTGTTTAAAACTTGATTTTTTTTTTTTAAATGAACGTGTTTAGGAAGATTTTAACATTATTATTTTTTATTTAAATAAGCGAGTGAATTAAAAAAAATTTGAATTGAATGTAAAAATTGCAACACTTACATAAGTAAGAGACCAATTGAATCAAATTGAAAGTTTATGGTCCAAATTGATATAACATGACAAGTTTAGAATAAGAAATTTTGTTTTTCCAAATATTTAATAACCCTAAACAAAAAACTTAAATCTACTAGACTTAGGCTTATTTATTTCACAGGTTTTTTCCATAAAATTTCAAATAAACATTTAATATCCTTAATTGTTTTATATAATTGAACAATAAAAATATAACATAATTAATTAATATCAATCAATCATTATTATTAATTTTAATTTATTAATTAAATATAAAAACATTATTTATACCTAATATATTATCTTTAACTATTTAATATATATTTTATTATTAATTGATGATTAATATTCTAATCAATCTATATTTATTTAATTATTTTTAGTAATTAAATTACGTTTAATTAATGTTTCATATAAAAAATTTCATTTATTTTTATTTTAAGTAATTAAATAATATAATATAACAATATAAATTAAGTAAGGCTAAATTTAATTTATGACTATATTTAATTAAGTAATTATTTTTAGTTTTTATTTACAAAATTAGTAAATTAAAGGAAAATTTTAGAAAAAATGTCAAAACTATTTATAAAAAATAGCAAAAACAAAATATCGATAGACTTCTATCAGCATATATATCATCTAAAAATTTTGTTATTTTGCGTAAATAGTTTGTCTTATTTTTTTATTTTTAAAATCTTTCTTAAGTTAAACACATATATTTAATTTTTAAATATTAATTATCTTTTAGGTCCAAACTACGTTAATTATTTCAAATATTTAATATTTATATCTAAAAATTTATTATCCACCAGTCCGACGAAACTTTTAAATATTTATTGATTAAAATTTATAATTTAACGAGGAAAAAGATTGCAGATAATTTGAAAGTGATTGCGTGTGGGGGAAATTTTTACAGAGTGAGAGTCAGAACTGTTACAGAGTGATTAAAGGCTCATGGCATCAGATTGAAATTACTGCAAACCAGCCAGGAGGACCCCACTTGCTTCGTAACTCTTTCAAATTTTGCAAGTAAGTTTAATATACTTTTTTTTTTTTTTTGCTGGATGTATTCTTTTTCTCTTTCCCTCTTTTTTCAATATCATTTCTTCAATGCTTTTATATTTCCCTTATTTTTTTTTAAGTTACGATTATAAATAAATTTACTCTATACTTTTGTGTGCATGTGTTTTCTTGAGGTCTACCTATAGTGTTCATTTAATTCTTGAATTTTGAATCATGTTTATATTTAATTTTAATTATTAATATTTAATATATTAATAATGTGTTGAGGAAAATTTTAGTACTATTGTAGCACACTGTTAAATAAAATTTATGTTTGTTAAGCTATATTTATTTTAGTTTTTTACAGTATCATCAACAATAATAAATAGAAGAAGCATAATTTAAAAGAAGTTTTATAAAATTTAGAAACTGAATAAATAAAGTATTACAGAGACAATGTATAATTTAACCTTTTTTTTTTTACATATTTTAATTAAAGTGATAAAGTGATGCGTATTATAAAACACGTGAATCATTTCATATAAAGAGTTGCTTTTAGGAAATATTTAAAAATAATAAAATATTGTTTCTTTATGAAAAACTATTATAAAATTGTTTAGAATAAAAACCTTGAAAAGTACTCCTCATGTTGCACTCGTTCCTTTTTCTTCTTAAATATAGAAATCTATTCTGAAATTTGTAGCTAAAGATAATTTTAGAGAGCCATTTTTTAAAAAATCTAATAATAGTGAAATATTTGAATATTAAATATTTTAGTCAAAGAAAATAATAAAATCTTAACTAATTGAATTGCGTTTTAATTAGCCTCATAGCTGTTTGATACTCGTGTTGACAACTATTAAAAGATTCATCTGATTACAATTCTTTTACATATTAAAAGAAGCATACAATTATTTCCCAATAATTTTTATTTATTTATTTTGCGAGATACTTTTTTATTTGACTATTATTATAATTTAGAATATAACATATTTACTTTCTGTACTTATTTTAAATATATATGTGTTAGATAAAAAAAATAAATATATAAGGAGTAAGGAGTAACGAGTTAAAGTATATTAATTCAGAGAGGCGGAGTCGGAATCGATTAAAAACTGGTTGTACACTGCAATTTTGTACACGGCAATGACCGATTACCCCACGCCCTCTAATTTTTTAAATCCGTTGCCAGCTTATCCAGTTAAACAGGTAAAAGTAAAACATAATTCCTTGCTCTTTCTTTAACTATTTTTTTTTAAATTAAAAATAAAAGGAAAAAAATTGGGGGAGTGAATGGTGCAGATGTGCAGGGCGATTGACGATCCAACGACTGGGAACGACACGTTTGCGAAGTTGTATGGTGTGGCTAACGTCTATTATAACTACAGTGGGACTGCCACGTGTTTTGATCTCGACGATGATTCTGATCCTCACGATCTTGGAGATTGGACTTGGCAGGTTTTTTCTCTCTTTCACTTCTTTGTTCATTCTCATTCTCTTTATTATATAACGTTATGACTTGAGTGAAGTCATTGTTTAATTAACATTTTTTAAGGTTTAATATCTTCCTCAAATCCTTATCCAATCTGATTTAAAAATCAAACTGATGGAGGTTAATTTGGAAAATTATGAAGGATTGATTATTATTTAAAAAAAAAAAACTGTGAAAAAGTATGAAGGGTTCTTTATTCTTCTATTAGTTTTACCCGTGAGCTTTTTTCATATAGTAATAATAATAATAAAGACTAGTTGAATTTTTCAATTGGACAGGGTGATGTGGATATTCAACATTAAAAAAAAAGTGAAATTACTAAAAGAAAATATAAGGGGAACTTTTTAAAATAGAAAAATAAAGGAAAGTATTTACACAAAATAAAAAAAAAAAAAATTAGTTAGTTGTGATAGATGCTAATAGAAGTCTATCAGGATCTATCAGTGATAGAAATGATAAAAATCTATCACTAATAGATGTTGATAGAAGTCTATCAATGTTTATCAGTATTTATTATTATTTTTTTTGCTATCTTTTGTAAATAGTTTGATATTTTTTCTATCTGTGAAAAATTCCTAAAGTATAATTTGCCAAAATGTAAATGTCGATGTCAAAAAATATTGAATTAACAAATGAATATTTTTATTTATTATGTTTTATAAATTTTTTTGACAATATAATTAAGATGATAATTTATGCTCTCGATGTCATATACTAAAAGTATAGAATTATTAATATGTAGATGGAAATTTAATATCATGATTATTATATGCTATCCATGTCAATGATGTTGTTTTGAAAAATAGATGATGCTGACAAAAATGAATTTGAAAAATTTGAAAAATATTAAAGATGGACAAAGAGTAAATACTTAAAAGTTTTGATATGCTTAAGCAAATTAAGTAATTTAGAGAGTTAAAGTAGGGATTTGGCAGTGATAAGGACAATTATATGATGGGAAATGATGTGATGTTTGGTGTATCCGATGTAAAAAAAAAGTGCTATAAATGGGTCAAAATATTGATTGAATATGAGATTTTACGCAACCTATTGCCAACATTATAAAGCATGTGAAGAGACAACTTTACACTGCACTTTCTTTTTTTCTTTTCTACTCTAAAATAAATAAATAAATAAGATCACTTTTTTTAAATGGAAGGAAAGCTTGTGTTTATTTTTTTTTTCTTCCCCATCTTATTTATTTTTATTTCATTTTAAACAATTTGCTTAACTAGGATTTTCTTCATATATGTAACAAAAATGGGGCATGAAATAGGAAACTATTTACCGCATGAGAGTTTAGGTCATAAAAGTTCTCATATACAAGGAATCTATTGAAGTTCTAAGTTTTGTGTAAATAACCTATTTTTAGAGATTCAAATGAGATTTGATCCAATATTTTAAATTTTAAGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTAATATTCTGCTTGTATTATCGCTAGTTGAAAATTATCAAAACAACCAAAGATGCAGTCATGATGGGCTTTACTCCTCAATTCACAATCGCTCTCTTCCCTTCTCAATTCGCTTTCTCTATGCTTAGGGATCTTTCCGCTCCACGATTCTCGCTCACTCTCTTTTCTTCTAAAAGAAAAGAAAAAAAAAAAAAATTGGCTCTTTGATTCTTCTCCAAATTCCGCTCCTTGAAGTAGAAAAAGTTGAGATGGGTTGATTTTACCTTTATTGAAGGATCTGATTTCACCAAAATAAAAAATTAAAAAATTGCAAATTGAAGAAAAAGAAGGAAGAAAGTAAACCTTGGAGCGAATAGCGAGAATCGAGGAGCAAAGTTCATTACTCGTGCATTTTTTTGTAGTTTTAGTAATTTTACTCGATAATAACATAAGTAAGAACTAAAAAAAAAAGATAGATTTTCATAAATTAAGTCAGATCTCATTTGGATTTTCAAATTCAGGCCATTAGCGTTGCTTTCCCAGTTTAATTTTGGGTTAAAATGACATTTTAATCCATATACATTTCATTCACATTCTTATATTGTTAAATATCTAAATTTTAGTGTATGTAATTTCTATAAATATTTAAATAAGTTAGTCTTCCCTTCATATAATAATATGTTATATATATGTGACTATCATAAAATATAAATTGAATTAATTAGTGGGTTTGACTTATAGAAAAGACAGTTGAGTCGAGTGATTTGATTTGAGAAGAAGGCCCCCGACAAGGCCCATACTGATTTGTGGAAAAACAGTAGCAAAGTTTGCAAGATTTTTGGAAAGAATTAGATAATCCAATTGTGAAGTTTGTGTGTTAATTAATATGATGGGAATTTTCATAAACTGCAGGCATGTACGGAGATGATATTGCCAACGGGCGCTAACACAGAAGAAAGCATATTTCCAGCCTCAACATGGCACTTTGCAGATCGACTTGATTTCTGCAAACGCTTATATGACGTGGAGCCTCGTCGTATTTGGATCCCCACACACTTTGGGGGCCATGTAAGTTTCTTTCAACCTTACCTTTCCCATCCATACCTTTTACTTTTGCCATGCATCACCAACCATCATTCTTCTTTGTCTATACCCTATTATTTGCTATCCATGAGTCGGGACCAGGGTCGGAAAATCATTCTTTGTCAACCGAGGTCCCCCCAAATTCATGAGCAAACCAAAGTTATTCCATTGATCTACTCTTTGAACTTTAGTTTGTAACAATCTGATATATCTATATGTAAGATTATTTTAAATAACATAAGGTTCCATCTTAAGACCAATTGGTAATGAGAGAAGTAGCTCGTGCATCTTATAAAAATTGTAAGATCCCTTGATTTTTCTAATATGGGTTAGATTCTCAATTTTAATATGTAATAATTTAGTTCTTATACTTTAAAATTTGAAATGATGTTTATTTAGTCTTTATCATAAAAATGTTATTAAGTTTTAATGACATTTTCCTACCTAGGTTGATAAATAGAATGATTAGGGGCTAACAAAGATTAATTTGTTACATTATAAGAAGTGACTCTTAATTTTATAGATTTTTTTCATTTGTGAGGATTAATTTTTTTTACAAATTTGAGGGTACATAGTAAATTAGTACATGCTAAAGTTCAGGGATCAAACGATTAAAAATTTAAATATACAATTGTAAATTTTACTGAAAGGCCAGTGGTTGAATTCTACATGACCTCCATATGTATACATGTTGTTTCAATTATCTTTCCTAAAAATCTTTTAATTTGTTACTTAAGAACCTTTGTTAGCATTCTCAATTAAAATGTCTAAGAGTGAGGTTTAAAAAAAATGGATGAGGTTGGTAATCTCAAATTGTTGGTGGCAAAATTGAAACTTGACCATCTATGATAGATATCTCATTGATTTGAATTTTGCCTCCAAAGATTTTTCAAATTTTGAAAAAATTAGAGTAGTTTTTTTCCCCTTAAATTAGACTAATTAAGTTGTAAGTGATTTTGGTGCAGAATATTGAGAGAGTTCTGAAGAGGTTTGGAAGCAACATAATCTTCTTCAATGGCTTGAGAGATCCTTGGAGTGGTGGAGGGTACTCTTCATATTTATCTTATCTTATGCCACATATTCTTTTGAGTACCAATTTTTGGCTCATAATATAATGTTAATATATTTTTTCATTAATGTCCTTTTGATGCCTGCCTTCAGGGTGCTGAAAAACATATCCTCAACTATAATAGCTATTGTGGCAAAGGAAGGTTAGCATTCATCATTTTTCATTAACAAGGTTCCGTTTGGATTAACTTTTTAGATGTAGAATGTGAGCACTTGAACCATAGTATTTTGTTTTGTTTCGTTTATGTGATGTTAAGATTCAAATTTCTAACCTTTTGTTTAAGGTAAGAATGTCTCGACAAAATATATATAGTTATTTTCACTTTATTTGCTTCTTTTCCCCTCTTTTTGAGTCACTTACTCACATTATTGCTGACTTAGAAACAAACACAGAGAATTTATATTGGTAATGAAACAGGAGCTCATCATGTAGACTTGAGATTCTCAAATCCTGAAGATCCAAAATGGCTGAAAGATGTGAGAAAACAGGAGCTCAATATCATTCAGGATTGGCTTTGTCAATATTACTTGGACTTGGCCCAAAACTAA

mRNA sequence

ATGAGATGGTGTAATAATTGCGACGAGTTGGTCATTTCTGTAGAAGACGACGCGGAGTTTTCTGCTCTGACTCCGAAGTCTCCCACTTATCTAACAGTTCGCCCGCACATCGCAATTGGCACAAACTTCCAGTCTTCTGCAACAATGGCCGCTCCATCCATTTCCATTTCCCTTTCCTTTTTCCTCTTCCTTTCCCTCCATTTCACTTCTTCTTTCTCCAAATTTCCCCTTCCCTTCTCTTCTTCCCTCCTTCTCCGCCCTCAAAATCCCCCAGTCGATTCCCTCCCCCATTACCACACCAACTTCTTCACTCAGATCCTCGACCACTTCAACTTCAATCCCCAAAGCTACCACACGTTTCAGCAAAGGTATCTAATCAACGACACCTATTGGGGCGGCGCCGCCCACAATGCTCCCATCTTCGTTTACACCGGCAACGAAGGTAATATCGAATGGTTTGCTCAGAACACTGGTTTTCTCCTCCAATCCGCTCCACATTTCCGTGCGCTCGTCGTTTTCATCGAGCATAGGTTTTACGGGAAATCGATTCCGTTTGGGGGAGATGAAGATGTGGCTAATAGTAATTCGAGTACGCTCGGATTTCTGAGCTCCACGCAAGCGTTGGCGGATTATGCAACTCTGATTACCGATTTGAAGAAGAATTTGACGGCGATTGATTCGCCGGTTCTGGTCTTTGGTGGCTCTTATGGAGGAATGCTGGCAGCATGGTTTAGATTGAAATACCCTCACATTGCGATTGGAGCTTTAGCATCATCTGCTCCCATTCTCCAATTTGAAAACATTACATCTCCTTATGCCTTCAACAACATCATCACCCAAGATTTCAAGAGTGAGAGTCAGAACTGTTACAGAGTGATTAAAGGCTCATGGCATCAGATTGAAATTACTGCAAACCAGCCAGGAGGACCCCACTTGCTTCGTAACTCTTTCAAATTTTGCAAGGCGATTGACGATCCAACGACTGGGAACGACACGTTTGCGAAGTTGTATGGTGTGGCTAACGTCTATTATAACTACAGTGGGACTGCCACGTGTTTTGATCTCGACGATGATTCTGATCCTCACGATCTTGGAGATTGGACTTGGCAGGCATGTACGGAGATGATATTGCCAACGGGCGCTAACACAGAAGAAAGCATATTTCCAGCCTCAACATGGCACTTTGCAGATCGACTTGATTTCTGCAAACGCTTATATGACGTGGAGCCTCGTCGTATTTGGATCCCCACACACTTTGGGGGCCATAATATTGAGAGAGTTCTGAAGAGGTTTGGAAGCAACATAATCTTCTTCAATGGCTTGAGAGATCCTTGGAGTGGTGGAGGGGTGCTGAAAAACATATCCTCAACTATAATAGCTATTGTGGCAAAGGAAGGAGCTCATCATGTAGACTTGAGATTCTCAAATCCTGAAGATCCAAAATGGCTGAAAGATGTGAGAAAACAGGAGCTCAATATCATTCAGGATTGGCTTTGTCAATATTACTTGGACTTGGCCCAAAACTAA

Coding sequence (CDS)

ATGAGATGGTGTAATAATTGCGACGAGTTGGTCATTTCTGTAGAAGACGACGCGGAGTTTTCTGCTCTGACTCCGAAGTCTCCCACTTATCTAACAGTTCGCCCGCACATCGCAATTGGCACAAACTTCCAGTCTTCTGCAACAATGGCCGCTCCATCCATTTCCATTTCCCTTTCCTTTTTCCTCTTCCTTTCCCTCCATTTCACTTCTTCTTTCTCCAAATTTCCCCTTCCCTTCTCTTCTTCCCTCCTTCTCCGCCCTCAAAATCCCCCAGTCGATTCCCTCCCCCATTACCACACCAACTTCTTCACTCAGATCCTCGACCACTTCAACTTCAATCCCCAAAGCTACCACACGTTTCAGCAAAGGTATCTAATCAACGACACCTATTGGGGCGGCGCCGCCCACAATGCTCCCATCTTCGTTTACACCGGCAACGAAGGTAATATCGAATGGTTTGCTCAGAACACTGGTTTTCTCCTCCAATCCGCTCCACATTTCCGTGCGCTCGTCGTTTTCATCGAGCATAGGTTTTACGGGAAATCGATTCCGTTTGGGGGAGATGAAGATGTGGCTAATAGTAATTCGAGTACGCTCGGATTTCTGAGCTCCACGCAAGCGTTGGCGGATTATGCAACTCTGATTACCGATTTGAAGAAGAATTTGACGGCGATTGATTCGCCGGTTCTGGTCTTTGGTGGCTCTTATGGAGGAATGCTGGCAGCATGGTTTAGATTGAAATACCCTCACATTGCGATTGGAGCTTTAGCATCATCTGCTCCCATTCTCCAATTTGAAAACATTACATCTCCTTATGCCTTCAACAACATCATCACCCAAGATTTCAAGAGTGAGAGTCAGAACTGTTACAGAGTGATTAAAGGCTCATGGCATCAGATTGAAATTACTGCAAACCAGCCAGGAGGACCCCACTTGCTTCGTAACTCTTTCAAATTTTGCAAGGCGATTGACGATCCAACGACTGGGAACGACACGTTTGCGAAGTTGTATGGTGTGGCTAACGTCTATTATAACTACAGTGGGACTGCCACGTGTTTTGATCTCGACGATGATTCTGATCCTCACGATCTTGGAGATTGGACTTGGCAGGCATGTACGGAGATGATATTGCCAACGGGCGCTAACACAGAAGAAAGCATATTTCCAGCCTCAACATGGCACTTTGCAGATCGACTTGATTTCTGCAAACGCTTATATGACGTGGAGCCTCGTCGTATTTGGATCCCCACACACTTTGGGGGCCATAATATTGAGAGAGTTCTGAAGAGGTTTGGAAGCAACATAATCTTCTTCAATGGCTTGAGAGATCCTTGGAGTGGTGGAGGGGTGCTGAAAAACATATCCTCAACTATAATAGCTATTGTGGCAAAGGAAGGAGCTCATCATGTAGACTTGAGATTCTCAAATCCTGAAGATCCAAAATGGCTGAAAGATGTGAGAAAACAGGAGCTCAATATCATTCAGGATTGGCTTTGTCAATATTACTTGGACTTGGCCCAAAACTAA

Protein sequence

MRWCNNCDELVISVEDDAEFSALTPKSPTYLTVRPHIAIGTNFQSSATMAAPSISISLSFFLFLSLHFTSSFSKFPLPFSSSLLLRPQNPPVDSLPHYHTNFFTQILDHFNFNPQSYHTFQQRYLINDTYWGGAAHNAPIFVYTGNEGNIEWFAQNTGFLLQSAPHFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGFLSSTQALADYATLITDLKKNLTAIDSPVLVFGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWHQIEITANQPGGPHLLRNSFKFCKAIDDPTTGNDTFAKLYGVANVYYNYSGTATCFDLDDDSDPHDLGDWTWQACTEMILPTGANTEESIFPASTWHFADRLDFCKRLYDVEPRRIWIPTHFGGHNIERVLKRFGSNIIFFNGLRDPWSGGGVLKNISSTIIAIVAKEGAHHVDLRFSNPEDPKWLKDVRKQELNIIQDWLCQYYLDLAQN
Homology
BLAST of HG10020088 vs. NCBI nr
Match: XP_038904032.1 (lysosomal Pro-X carboxypeptidase-like isoform X1 [Benincasa hispida])

HSP 1 Score: 885.2 bits (2286), Expect = 2.7e-253
Identity = 431/502 (85.86%), Postives = 442/502 (88.05%), Query Frame = 0

Query: 49  MAAPSISISLSFFLFLSLHFTSSFSKFPLPFSSSLLLRPQNPPVDSLPHYHTNFFTQILD 108
           MAAPSIS+SLSFFLFLS HF+SSFSK PLPFSSSLLLRPQNPP+DSL HY TNFFTQILD
Sbjct: 1   MAAPSISLSLSFFLFLSFHFSSSFSKIPLPFSSSLLLRPQNPPIDSLRHYRTNFFTQILD 60

Query: 109 HFNFNPQSYHTFQQRYLINDTYWGGAAHNAPIFVYTGNEGNIEWFAQNTGFLLQSAPHFR 168
           HFNFNP SY +FQQRYLIN TYWGGAA NAPIFVYTGNEGNIEWFAQNTGFLLQSAPHFR
Sbjct: 61  HFNFNPISYQSFQQRYLINHTYWGGAAENAPIFVYTGNEGNIEWFAQNTGFLLQSAPHFR 120

Query: 169 ALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGFLSSTQALADYATLITDLKKNLTAIDSP 228
           ALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGFLSSTQALADYATLITDLKKNL+AIDSP
Sbjct: 121 ALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGFLSSTQALADYATLITDLKKNLSAIDSP 180

Query: 229 VLVFGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFENITSPYAFNNIITQDFKSESQN 288
           VLVFGGSYGGMLAAWFRLKYPH+AIGALASSAPILQ ENITSPYAF NIITQDFKSESQN
Sbjct: 181 VLVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILQLENITSPYAFTNIITQDFKSESQN 240

Query: 289 CYRVIKGSWHQIEITANQPGGPHLLRNSFKF----------------------------- 348
           CYRVIKGSW QI+IT+NQPGGP LLRNSFKF                             
Sbjct: 241 CYRVIKGSWQQIDITSNQPGGPQLLRNSFKFCKEAESESIKNWLYTAILYTAMTDYPTPS 300

Query: 349 --------------CKAIDDPTTGNDTFAKLYGVANVYYNYSGTATCFDLDDDSDPHDLG 408
                         CKAIDDPTTGNDTFAKLYG ANVYYNY+GTATCFDLDDDSDPHDLG
Sbjct: 301 NFLNPLPAYPVKQMCKAIDDPTTGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLG 360

Query: 409 DWTWQACTEMILPTGANTEESIFPASTWHFADRLDFCKRLYDVEPRRIWIPTHFGGHNIE 468
           DWTWQACTEMILPTG NTEESIFPASTWHFADRL FCKR Y+VEPRRIWIPTH+GGHNIE
Sbjct: 361 DWTWQACTEMILPTGGNTEESIFPASTWHFADRLHFCKRFYNVEPRRIWIPTHYGGHNIE 420

Query: 469 RVLKRFGSNIIFFNGLRDPWSGGGVLKNISSTIIAIVAKEGAHHVDLRFSNPEDPKWLKD 508
           RVLKRFGSNIIFFNGLRDPWSGGGVLKNISSTIIAIVAKEGAHHVDLRFSNPEDPKWLKD
Sbjct: 421 RVLKRFGSNIIFFNGLRDPWSGGGVLKNISSTIIAIVAKEGAHHVDLRFSNPEDPKWLKD 480

BLAST of HG10020088 vs. NCBI nr
Match: XP_008442879.1 (PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis melo])

HSP 1 Score: 839.3 bits (2167), Expect = 1.7e-239
Identity = 412/501 (82.24%), Postives = 428/501 (85.43%), Query Frame = 0

Query: 50  AAPSISISLSFFLFLSLHFTSSFSKFPLPFSSSLLLRPQNPPVDSLPHYHTNFFTQILDH 109
           AAP  SISLS FLFLSLHFTSSFSK PLPF SSLLLRPQ PP+D L  Y T FFTQILDH
Sbjct: 3   AAP--SISLSIFLFLSLHFTSSFSKIPLPFHSSLLLRPQTPPIDPLLPYQTGFFTQILDH 62

Query: 110 FNFNPQSYHTFQQRYLINDTYWGGAAHNAPIFVYTGNEGNIEWFAQNTGFLLQSAPHFRA 169
           FNFNPQSY  FQQRYLINDTYWGGAAHN+PIFVYTGNEGNIEWFAQNTGFLLQSAP FRA
Sbjct: 63  FNFNPQSYQYFQQRYLINDTYWGGAAHNSPIFVYTGNEGNIEWFAQNTGFLLQSAPRFRA 122

Query: 170 LVVFIEHRFYGKSIPFGGDEDVANSNSSTLGFLSSTQALADYATLITDLKKNLTAIDSPV 229
           LVVFIEHRFYGKSIPFGGDEDVANSNSSTLGFLSSTQALADYATLITDLKKNL+A+DSPV
Sbjct: 123 LVVFIEHRFYGKSIPFGGDEDVANSNSSTLGFLSSTQALADYATLITDLKKNLSAVDSPV 182

Query: 230 LVFGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFENITSPYAFNNIITQDFKSESQNC 289
           LVFGGSYGGMLAAWFRLKYPHIA+GALASSAPILQFENITSPYAF+NI+TQDFKSESQNC
Sbjct: 183 LVFGGSYGGMLAAWFRLKYPHIAMGALASSAPILQFENITSPYAFSNIVTQDFKSESQNC 242

Query: 290 YRVIKGSWHQIEITANQPGGPHLLRNSFKF------------------------------ 349
           YRVIK SWH I+IT+  P GP LLR SFKF                              
Sbjct: 243 YRVIKESWHLIDITSTHPQGPQLLRKSFKFCKEAEAESIKNWLSTAILYTAMTDYPTPSN 302

Query: 350 -------------CKAIDDPTTGNDTFAKLYGVANVYYNYSGTATCFDLDDDSDPHDLGD 409
                        CKAIDD  TGNDTFAKLYG ANVYYNY+GTATCFDLDDDSDPHDLGD
Sbjct: 303 FLNPLPAYPVKQMCKAIDDSRTGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGD 362

Query: 410 WTWQACTEMILPTGANTEESIFPASTWHFADRLDFCKRLYDVEPRRIWIPTHFGGHNIER 469
           WTWQACTEMILPTG NT+ESIFPASTWHFADRL FCK  +DVEPRRIWIPTHFGGHNIER
Sbjct: 363 WTWQACTEMILPTGGNTKESIFPASTWHFADRLHFCKTFFDVEPRRIWIPTHFGGHNIER 422

Query: 470 VLKRFGSNIIFFNGLRDPWSGGGVLKNISSTIIAIVAKEGAHHVDLRFSNPEDPKWLKDV 508
           VLKRFGSNIIFFNGLRDPWSGGGVLKNISSTIIAIVAKEGAHHVDLRFS+ EDPKWLKDV
Sbjct: 423 VLKRFGSNIIFFNGLRDPWSGGGVLKNISSTIIAIVAKEGAHHVDLRFSSAEDPKWLKDV 482

BLAST of HG10020088 vs. NCBI nr
Match: XP_023539018.1 (lysosomal Pro-X carboxypeptidase [Cucurbita pepo subsp. pepo] >XP_023539019.1 lysosomal Pro-X carboxypeptidase [Cucurbita pepo subsp. pepo])

HSP 1 Score: 832.4 bits (2149), Expect = 2.1e-237
Identity = 405/502 (80.68%), Postives = 431/502 (85.86%), Query Frame = 0

Query: 49  MAAPSISISLSFFLFLSLHFTSSFSKFPLPFSSSLLLRPQNPPVDSLPHYHTNFFTQILD 108
           MA+PS S+SL  FLF+ LHF+SSFSK    F SSLLLRPQ PP+DSL HY T FFTQILD
Sbjct: 1   MASPSTSLSL--FLFIFLHFSSSFSKTSSLFYSSLLLRPQTPPIDSLRHYRTAFFTQILD 60

Query: 109 HFNFNPQSYHTFQQRYLINDTYWGGAAHNAPIFVYTGNEGNIEWFAQNTGFLLQSAPHFR 168
           HFNFNPQSY +FQQRYLINDTYWGGAA NAPIFVYTGNEGNIEWFAQNTGF+L+SAP FR
Sbjct: 61  HFNFNPQSYQSFQQRYLINDTYWGGAAENAPIFVYTGNEGNIEWFAQNTGFILESAPKFR 120

Query: 169 ALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGFLSSTQALADYATLITDLKKNLTAIDSP 228
           ALVVFIEHRFYGKSIPFGGDEDVAN NSSTLGFLSSTQALADYATLITDLK+NLTAIDSP
Sbjct: 121 ALVVFIEHRFYGKSIPFGGDEDVANRNSSTLGFLSSTQALADYATLITDLKRNLTAIDSP 180

Query: 229 VLVFGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFENITSPYAFNNIITQDFKSESQN 288
           V+VFGGSYGGMLAAWFRLKYPH+AIGA+ASSAPILQFENITSPY+FNNIITQDFKSESQN
Sbjct: 181 VVVFGGSYGGMLAAWFRLKYPHVAIGAVASSAPILQFENITSPYSFNNIITQDFKSESQN 240

Query: 289 CYRVIKGSWHQIEITANQPGGPHLLRNSFKF----------------------------- 348
           CYRVIKGSW +I++ ANQPGGP LLR SFKF                             
Sbjct: 241 CYRVIKGSWQKIDMVANQPGGPQLLRKSFKFCKPADSQSIQNWLYTAILYTAMTDYPTPS 300

Query: 349 --------------CKAIDDPTTGNDTFAKLYGVANVYYNYSGTATCFDLDDDSDPHDLG 408
                         CKAIDDPTTGNDTFAKLYG ANVYYNY+GTATCFDLDDDSDPHDLG
Sbjct: 301 NFLNPLPAYPVKQMCKAIDDPTTGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLG 360

Query: 409 DWTWQACTEMILPTGANTEESIFPASTWHFADRLDFCKRLYDVEPRRIWIPTHFGGHNIE 468
           DWTWQACTE+ILPTGANTE+SIFPASTWH+ADR+++CK L+DVEPRR WI THFGG NIE
Sbjct: 361 DWTWQACTELILPTGANTEDSIFPASTWHYADRVNYCKSLFDVEPRRFWITTHFGGFNIE 420

Query: 469 RVLKRFGSNIIFFNGLRDPWSGGGVLKNISSTIIAIVAKEGAHHVDLRFSNPEDPKWLKD 508
           RVLKRFGSNIIFFNGLRDPWSGGGVLKNISS+IIAIVAKEGAHHVDLRFS+P DPKWLKD
Sbjct: 421 RVLKRFGSNIIFFNGLRDPWSGGGVLKNISSSIIAIVAKEGAHHVDLRFSDPNDPKWLKD 480

BLAST of HG10020088 vs. NCBI nr
Match: XP_023005894.1 (lysosomal Pro-X carboxypeptidase [Cucurbita maxima] >XP_023005895.1 lysosomal Pro-X carboxypeptidase [Cucurbita maxima])

HSP 1 Score: 831.2 bits (2146), Expect = 4.6e-237
Identity = 405/498 (81.33%), Postives = 425/498 (85.34%), Query Frame = 0

Query: 53  SISISLSFFLFLSLHFTSSFSKFPLPFSSSLLLRPQNPPVDSLPHYHTNFFTQILDHFNF 112
           S S SLS FLFL LHF+SSFSK    FSSS LLR QNPP+DSL HY T FFTQILDHFNF
Sbjct: 3   STSTSLSLFLFLFLHFSSSFSKTSSLFSSSFLLRSQNPPIDSLRHYRTAFFTQILDHFNF 62

Query: 113 NPQSYHTFQQRYLINDTYWGGAAHNAPIFVYTGNEGNIEWFAQNTGFLLQSAPHFRALVV 172
           NPQSY +FQQRYLINDTYWGGAA NAPIFVYTGNEGNIEWFAQNTGF+L+SAP FRALVV
Sbjct: 63  NPQSYQSFQQRYLINDTYWGGAAENAPIFVYTGNEGNIEWFAQNTGFILESAPKFRALVV 122

Query: 173 FIEHRFYGKSIPFGGDEDVANSNSSTLGFLSSTQALADYATLITDLKKNLTAIDSPVLVF 232
           FIEHRFYGKSIPFGGDEDVAN NSSTLGFLSSTQALADYATLITD K+NLTAIDSPV+VF
Sbjct: 123 FIEHRFYGKSIPFGGDEDVANRNSSTLGFLSSTQALADYATLITDFKRNLTAIDSPVVVF 182

Query: 233 GGSYGGMLAAWFRLKYPHIAIGALASSAPILQFENITSPYAFNNIITQDFKSESQNCYRV 292
           GGSYGGMLAAWFRLKYPH+AIGA+ASSAPILQFENITSPY+FNNIITQDFKSESQNCYRV
Sbjct: 183 GGSYGGMLAAWFRLKYPHVAIGAVASSAPILQFENITSPYSFNNIITQDFKSESQNCYRV 242

Query: 293 IKGSWHQIEITANQPGGPHLLRNSFKF--------------------------------- 352
           IKGSW QI++ ANQPGGP LLRNSFKF                                 
Sbjct: 243 IKGSWQQIDMVANQPGGPQLLRNSFKFCKPADSQSIQNWLYTAILYTAMTDYPTPSNFLN 302

Query: 353 ----------CKAIDDPTTGNDTFAKLYGVANVYYNYSGTATCFDLDDDSDPHDLGDWTW 412
                     CKAIDDPTTGNDTFAKLYG ANVYYNYSGTATCFDLDDDSDPHDLGDWTW
Sbjct: 303 PLPAYPVKQMCKAIDDPTTGNDTFAKLYGAANVYYNYSGTATCFDLDDDSDPHDLGDWTW 362

Query: 413 QACTEMILPTGANTEESIFPASTWHFADRLDFCKRLYDVEPRRIWIPTHFGGHNIERVLK 472
           QACTE+ILPTGANTE+SIFPASTWH+ADR+ +CK L+DVEPRR WI THFGG NIERVLK
Sbjct: 363 QACTELILPTGANTEDSIFPASTWHYADRVHYCKSLFDVEPRRFWITTHFGGFNIERVLK 422

Query: 473 RFGSNIIFFNGLRDPWSGGGVLKNISSTIIAIVAKEGAHHVDLRFSNPEDPKWLKDVRKQ 508
           RFGSNIIFFNGLRDPWSGGGVLKNISS+IIAIVAKEGAHHVDLRFS+  DPKWLKDVRKQ
Sbjct: 423 RFGSNIIFFNGLRDPWSGGGVLKNISSSIIAIVAKEGAHHVDLRFSDSNDPKWLKDVRKQ 482

BLAST of HG10020088 vs. NCBI nr
Match: KAG6596591.1 (Lysosomal Pro-X carboxypeptidase, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 830.1 bits (2143), Expect = 1.0e-236
Identity = 404/502 (80.48%), Postives = 430/502 (85.66%), Query Frame = 0

Query: 49  MAAPSISISLSFFLFLSLHFTSSFSKFPLPFSSSLLLRPQNPPVDSLPHYHTNFFTQILD 108
           MA+PS S+SL  FLFL LHF+SSFSK    F SSLLLRPQ PP+DSL HY T FFTQILD
Sbjct: 1   MASPSTSLSL--FLFLFLHFSSSFSKTSSLFYSSLLLRPQTPPIDSLRHYRTAFFTQILD 60

Query: 109 HFNFNPQSYHTFQQRYLINDTYWGGAAHNAPIFVYTGNEGNIEWFAQNTGFLLQSAPHFR 168
           HFNFNPQSY +FQQRYLINDTYWGGAA NAPIFVYTGNEGNIEWFAQNTGF+L+SAP FR
Sbjct: 61  HFNFNPQSYQSFQQRYLINDTYWGGAAENAPIFVYTGNEGNIEWFAQNTGFILESAPKFR 120

Query: 169 ALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGFLSSTQALADYATLITDLKKNLTAIDSP 228
           ALVVFIEHRFYGKSIPFGGDEDVAN NSSTLGFLSSTQALADYATLITDLK+NLTAIDSP
Sbjct: 121 ALVVFIEHRFYGKSIPFGGDEDVANRNSSTLGFLSSTQALADYATLITDLKRNLTAIDSP 180

Query: 229 VLVFGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFENITSPYAFNNIITQDFKSESQN 288
           V+VFGGSYGGMLAAWFRLKYPH+AIGA+ASSAPILQFENITSPY+FNNIITQDFKSESQN
Sbjct: 181 VVVFGGSYGGMLAAWFRLKYPHVAIGAIASSAPILQFENITSPYSFNNIITQDFKSESQN 240

Query: 289 CYRVIKGSWHQIEITANQPGGPHLLRNSFKF----------------------------- 348
           CYRVIKGSW +I++ ANQPGGP LLR SFKF                             
Sbjct: 241 CYRVIKGSWQKIDMVANQPGGPQLLRKSFKFCKPADSQSIQNWLYTAILYTAMTDYPTPS 300

Query: 349 --------------CKAIDDPTTGNDTFAKLYGVANVYYNYSGTATCFDLDDDSDPHDLG 408
                         CKAIDDPTTGNDTFAKLYG ANVYYNY+GTATCFDLDDDSDPHDLG
Sbjct: 301 NFLNPLPAYPVKQMCKAIDDPTTGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLG 360

Query: 409 DWTWQACTEMILPTGANTEESIFPASTWHFADRLDFCKRLYDVEPRRIWIPTHFGGHNIE 468
           DWTWQACTE+ILPTGANTE+SIFPASTWH+ADR+ +CK L+DVEPRR WI THFGG NIE
Sbjct: 361 DWTWQACTELILPTGANTEDSIFPASTWHYADRVHYCKSLFDVEPRRFWITTHFGGFNIE 420

Query: 469 RVLKRFGSNIIFFNGLRDPWSGGGVLKNISSTIIAIVAKEGAHHVDLRFSNPEDPKWLKD 508
           RVLKRFGSNIIFFNGLRDPWSGGGVLKNISS+IIAIVAKEGAHHVDLRFS+P DPKW+KD
Sbjct: 421 RVLKRFGSNIIFFNGLRDPWSGGGVLKNISSSIIAIVAKEGAHHVDLRFSDPNDPKWVKD 480

BLAST of HG10020088 vs. ExPASy Swiss-Prot
Match: Q2TA14 (Lysosomal Pro-X carboxypeptidase OS=Bos taurus OX=9913 GN=PRCP PE=2 SV=1)

HSP 1 Score: 322.0 bits (824), Expect = 1.2e-86
Identity = 188/480 (39.17%), Postives = 254/480 (52.92%), Query Frame = 0

Query: 77  LPFSSSLLLRPQNPPVDSLPHYHTNFFTQILDHFNFNPQSYHTFQQRYLINDTYWGGAAH 136
           LP+S+S   RP          Y   +  Q +DHF FN     TF+QRYLI D YW     
Sbjct: 34  LPWSTSFRSRP-----TITLKYSIRYIQQKVDHFGFNID--RTFKQRYLIADNYW--KED 93

Query: 137 NAPIFVYTGNEGNIEWFAQNTGFLLQSAPHFRALVVFIEHRFYGKSIPFGGDEDVANSNS 196
              I  YTGNEG+I WF  NTGF+   A   +A++VF EHR+YG+S+PFG D   + S+S
Sbjct: 94  GGSILFYTGNEGDIIWFCNNTGFMWDIAEEMKAMLVFAEHRYYGESLPFGAD---SFSDS 153

Query: 197 STLGFLSSTQALADYATLITDLKKNLT-AIDSPVLVFGGSYGGMLAAWFRLKYPHIAIGA 256
             L FL++ QALAD+A LI  LK+ +  A +  V+  GGSYGGMLAAWFR+KYPH+ +GA
Sbjct: 154 RHLNFLTTEQALADFAKLIRYLKRTIPGARNQHVIALGGSYGGMLAAWFRMKYPHLVVGA 213

Query: 257 LASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWHQIEITANQPGGPHLLRN 316
           LASSAPI QF ++     F  I+T DF     NC   I+ SW  I   A +  G   L  
Sbjct: 214 LASSAPIWQFNDLVPCDIFMKIVTTDFSQSGPNCSESIRRSWDAINRLAKKGTGLRWLSE 273

Query: 317 SFKFC-----------------------KAIDDPTTGN---------------------- 376
           +   C                         +D P   N                      
Sbjct: 274 ALHLCTPLTKSQDVQRLKDWISETWVNVAMVDYPYESNFLQPLPAWPVKVVCQYFKYSNV 333

Query: 377 -DT--FAKLYGVANVYYNYSGTATCFDLDDDSDPHDLG--DWTWQACTEMILPTGANTEE 436
            DT     ++   NVYYNYSG A C ++ + +    LG   W++QACTEM++PT ++  +
Sbjct: 334 PDTVMVQNIFQALNVYYNYSGQAKCLNVSETA-TSSLGVLGWSYQACTEMVMPTCSDGVD 393

Query: 437 SIFPASTWHFADRLDFCKRLYDVEPRRIWIPTHFGGHNIERVLKRFGSNIIFFNGLRDPW 496
            +F   +W+  +  D C + + V PR  WIPT +GG NI        +NIIF NG  DPW
Sbjct: 394 DMFEPHSWNMKEYSDDCFKQWGVRPRPSWIPTMYGGKNISS-----HTNIIFSNGELDPW 453

Query: 497 SGGGVLKNISSTIIAIVAKEGAHHVDLRFSNPEDPKWLKDVRKQELNIIQDWLCQYYLDL 506
           SGGGV K+I+ T++AIV   GAHH+DLR SN  DP  ++  R  E+  ++ W+  +Y+ L
Sbjct: 454 SGGGVTKDITDTLLAIVIPNGAHHLDLRASNALDPVSVQLTRSLEVKYMKQWISDFYVRL 495

BLAST of HG10020088 vs. ExPASy Swiss-Prot
Match: Q5RBU7 (Lysosomal Pro-X carboxypeptidase OS=Pongo abelii OX=9601 GN=PRCP PE=2 SV=1)

HSP 1 Score: 316.2 bits (809), Expect = 6.6e-85
Identity = 179/470 (38.09%), Postives = 247/470 (52.55%), Query Frame = 0

Query: 87  PQNPPVDSLP----HYHTNFFTQILDHFNFNPQSYHTFQQRYLINDTYWGGAAHNAPIFV 146
           P NP   SLP    +Y   +F Q +DHF FN  +  TF QRYL+ D YW    +   I  
Sbjct: 35  PTNP--TSLPAVAKNYSVLYFQQKVDHFGFN--TVKTFNQRYLVADKYW--KKNGGSILF 94

Query: 147 YTGNEGNIEWFAQNTGFLLQSAPHFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGFL 206
           YTGNEG+I WF  NTGF+   A   +A++VF EHR+YG+S+PFG   D    +S  L FL
Sbjct: 95  YTGNEGDIIWFCNNTGFMWDVAEELKAMLVFAEHRYYGESLPFG---DNTFKDSRHLNFL 154

Query: 207 SSTQALADYATLITDLKKNLT-AIDSPVLVFGGSYGGMLAAWFRLKYPHIAIGALASSAP 266
           +S QALAD+A LI  LK+ +  A + PV+  GGSYGGMLAAWFR+KYPH+ +GALA+SAP
Sbjct: 155 TSEQALADFAELIKHLKRTIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAP 214

Query: 267 ILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWHQIEITANQPGGPHLLRNSFKFCK 326
           I QFE++     F  I+T DF+    +C   I+ SW  I   +N   G   L  +   C 
Sbjct: 215 IWQFEDLVPCGVFMKIVTTDFRKSGPHCSESIRRSWDAINRLSNTGSGLQWLTGALHLCS 274

Query: 327 ----------------------AIDDPTTGND-------------------------TFA 386
                                  +D P   N                             
Sbjct: 275 PLTSQDIQHLKDWISETWVNLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLLLQ 334

Query: 387 KLYGVANVYYNYSGTATCFDLDDDSDPHDLG--DWTWQACTEMILPTGANTEESIFPAST 446
            ++   NVYYNYSG   C ++ + +    LG   W++QACTE+++P   N  + +F   +
Sbjct: 335 NIFQALNVYYNYSGQVKCLNISETA-TSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHS 394

Query: 447 WHFADRLDFCKRLYDVEPRRIWIPTHFGGHNIERVLKRFGSNIIFFNGLRDPWSGGGVLK 503
           W+  +  D C + + V PR  WI T +GG NI        +NI+F NG  DPWSGGGV K
Sbjct: 395 WNLKELSDDCFQQWGVRPRPSWITTMYGGKNISS-----HTNIVFSNGELDPWSGGGVTK 454

BLAST of HG10020088 vs. ExPASy Swiss-Prot
Match: P42785 (Lysosomal Pro-X carboxypeptidase OS=Homo sapiens OX=9606 GN=PRCP PE=1 SV=1)

HSP 1 Score: 315.5 bits (807), Expect = 1.1e-84
Identity = 179/470 (38.09%), Postives = 247/470 (52.55%), Query Frame = 0

Query: 87  PQNPPVDSLP----HYHTNFFTQILDHFNFNPQSYHTFQQRYLINDTYWGGAAHNAPIFV 146
           P NP   SLP    +Y   +F Q +DHF FN  +  TF QRYL+ D YW    +   I  
Sbjct: 35  PTNP--TSLPAVAKNYSVLYFQQKVDHFGFN--TVKTFNQRYLVADKYW--KKNGGSILF 94

Query: 147 YTGNEGNIEWFAQNTGFLLQSAPHFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGFL 206
           YTGNEG+I WF  NTGF+   A   +A++VF EHR+YG+S+PFG   D +  +S  L FL
Sbjct: 95  YTGNEGDIIWFCNNTGFMWDVAEELKAMLVFAEHRYYGESLPFG---DNSFKDSRHLNFL 154

Query: 207 SSTQALADYATLITDLKKNLT-AIDSPVLVFGGSYGGMLAAWFRLKYPHIAIGALASSAP 266
           +S QALAD+A LI  LK+ +  A + PV+  GGSYGGMLAAWFR+KYPH+ +GALA+SAP
Sbjct: 155 TSEQALADFAELIKHLKRTIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAP 214

Query: 267 ILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWHQIEITANQPGGPHLLRNSFKFCK 326
           I QFE++     F  I+T DF+    +C   I  SW  I   +N   G   L  +   C 
Sbjct: 215 IWQFEDLVPCGVFMKIVTTDFRKSGPHCSESIHRSWDAINRLSNTGSGLQWLTGALHLCS 274

Query: 327 ----------------------AIDDPTTGND-------------------------TFA 386
                                  +D P   N                             
Sbjct: 275 PLTSQDIQHLKDWISETWVNLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLLLQ 334

Query: 387 KLYGVANVYYNYSGTATCFDLDDDSDPHDLG--DWTWQACTEMILPTGANTEESIFPAST 446
            ++   NVYYNYSG   C ++ + +    LG   W++QACTE+++P   N  + +F   +
Sbjct: 335 NIFQALNVYYNYSGQVKCLNISETA-TSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHS 394

Query: 447 WHFADRLDFCKRLYDVEPRRIWIPTHFGGHNIERVLKRFGSNIIFFNGLRDPWSGGGVLK 503
           W+  +  D C + + V PR  WI T +GG NI        +NI+F NG  DPWSGGGV K
Sbjct: 395 WNLKELSDDCFQQWGVRPRPSWITTMYGGKNISS-----HTNIVFSNGELDPWSGGGVTK 454

BLAST of HG10020088 vs. ExPASy Swiss-Prot
Match: Q7TMR0 (Lysosomal Pro-X carboxypeptidase OS=Mus musculus OX=10090 GN=Prcp PE=1 SV=2)

HSP 1 Score: 296.2 bits (757), Expect = 7.1e-79
Identity = 171/458 (37.34%), Postives = 237/458 (51.75%), Query Frame = 0

Query: 98  YHTNFFTQILDHFNFNPQSYHTFQQRYLINDTYWGGAAHNAPIFVYTGNEGNIEWFAQNT 157
           Y   +F Q +DHF F      TF+QRYL+ D +W    +   I  YTGNEG+I WF  NT
Sbjct: 46  YSVLYFEQKVDHFGF--ADMRTFKQRYLVADKHW--QRNGGSILFYTGNEGDIVWFCNNT 105

Query: 158 GFLLQSAPHFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGFLSSTQALADYATLITD 217
           GF+   A   +A++VF EHR+YG+S+PFG D   +  +S  L FL+S QALAD+A LI  
Sbjct: 106 GFMWDVAEELKAMLVFAEHRYYGESLPFGQD---SFKDSQHLNFLTSEQALADFAELIRH 165

Query: 218 LKKNLT-AIDSPVLVFGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFENITSPYAFNN 277
           L+K +  A   PV+  GGSYGGMLAAWFR+KYPHI +GALA+SAPI Q + +     F  
Sbjct: 166 LEKTIPGAQGQPVIAIGGSYGGMLAAWFRMKYPHIVVGALAASAPIWQLDGMVPCGEFMK 225

Query: 278 IITQDFKSESQNCYRVIKGSWHQIEITANQPGGPHLLRNSFKFCKAIDD---PT------ 337
           I+T DF+     C   I+ SW+ I+  +    G   L N    C  +     PT      
Sbjct: 226 IVTNDFRKSGPYCSESIRKSWNVIDKLSGSGSGLQSLTNILHLCSPLTSEKIPTLKGWIA 285

Query: 338 ------------------------------------TGNDT--FAKLYGVANVYYNYSGT 397
                                                 +DT     ++   +VYYNYSG 
Sbjct: 286 ETWVNLAMVNYPYACNFLQPLPAWPIKEVCQYLKNPNVSDTVLLQNIFQALSVYYNYSGQ 345

Query: 398 ATCFDLDDDSDPHDLGD--WTWQACTEMILPTGANTEESIFPASTWHFADRLDFCKRLYD 457
           A C ++   +    LG   W++QACTEM++P   N  + +F    W      + C   + 
Sbjct: 346 AACLNI-SQTTTSSLGSMGWSFQACTEMVMPFCTNGIDDMFEPFLWDLEKYSNDCFNQWG 405

Query: 458 VEPRRIWIPTHFGGHNIERVLKRFGSNIIFFNGLRDPWSGGGVLKNISSTIIAIVAKEGA 506
           V+PR  W+ T +GG NI        SNIIF NG  DPWSGGGV ++I+ T++AI   +GA
Sbjct: 406 VKPRPHWMTTMYGGKNISS-----HSNIIFSNGELDPWSGGGVTRDITDTLVAINIHDGA 465

BLAST of HG10020088 vs. ExPASy Swiss-Prot
Match: Q9ET22 (Dipeptidyl peptidase 2 OS=Mus musculus OX=10090 GN=Dpp7 PE=1 SV=2)

HSP 1 Score: 292.7 bits (748), Expect = 7.8e-78
Identity = 163/457 (35.67%), Postives = 241/457 (52.74%), Query Frame = 0

Query: 96  PHYHTNFFTQILDHFNFNPQSYHTFQQRYLINDTYWGGAAHNAPIFVYTGNEGNIEWFAQ 155
           P +H N+F Q +DHFNF      TF QR+L++D +W       PIF YTGNEG+I  FA 
Sbjct: 39  PDFHENYFEQYMDHFNFESFGNKTFGQRFLVSDKFW--KMGEGPIFFYTGNEGDIWSFAN 98

Query: 156 NTGFLLQSAPHFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGFLSSTQALADYATLI 215
           N+GF+++ A    AL+VF EHR+YGKS+PFG    V ++       L+  QALAD+A L+
Sbjct: 99  NSGFMVELAAQQEALLVFAEHRYYGKSLPFG----VQSTQRGYTQLLTVEQALADFAVLL 158

Query: 216 TDLKKNLTAIDSPVLVFGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFENITSPYAFN 275
             L+++L   D+P + FGGSYGGML+A+ R+KYPH+  GALA+SAP++    +   Y F 
Sbjct: 159 QALRQDLGVHDAPTIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVVAVAGLGDSYQFF 218

Query: 276 NIITQDFKSESQNCYRVIKGSWHQIEITANQPGGPHLLRNSFKFCKAIDDPTTGNDTFA- 335
             +T DF  +S  C + ++ ++ QI+    Q G    +  +F  C+++  P      F  
Sbjct: 219 RDVTADFYGQSPKCAQAVRDAFQQIKDLFLQ-GAYDTISQNFGTCQSLSSPKDLTQLFGF 278

Query: 336 -------------------------------------------KLYGVANVYYNYSGTAT 395
                                                       L  +A + YN SGT  
Sbjct: 279 ARNAFTVLAMMDYPYPTDFLGPLPANPVKVGCQRLLNEGQRIMGLRALAGLVYNSSGTEP 338

Query: 396 CFDL----DDDSDPHDLGD------WTWQACTEMILPTGANTEESIFPASTWHFADRLDF 455
           C+D+       +DP   G       W +QACTE+ L   +N    +FP   +    R  +
Sbjct: 339 CYDIYRLYQSCADPTGCGTGSDARAWDYQACTEINLTFDSNNVTDMFPEIPFSEELRQQY 398

Query: 456 CKRLYDVEPRRIWIPTHFGGHNIERVLKRFGSNIIFFNGLRDPWSGGGVLKNISSTIIAI 499
           C   + V PR+ W+ T F G ++     +  SNIIF NG  DPW+GGG+  N+S+++IA+
Sbjct: 399 CLDTWGVWPRQDWLQTSFWGGDL-----KAASNIIFSNGDLDPWAGGGIQSNLSTSVIAV 458

BLAST of HG10020088 vs. ExPASy TrEMBL
Match: A0A1S3B7I8 (lysosomal Pro-X carboxypeptidase-like OS=Cucumis melo OX=3656 GN=LOC103486644 PE=3 SV=1)

HSP 1 Score: 839.3 bits (2167), Expect = 8.3e-240
Identity = 412/501 (82.24%), Postives = 428/501 (85.43%), Query Frame = 0

Query: 50  AAPSISISLSFFLFLSLHFTSSFSKFPLPFSSSLLLRPQNPPVDSLPHYHTNFFTQILDH 109
           AAP  SISLS FLFLSLHFTSSFSK PLPF SSLLLRPQ PP+D L  Y T FFTQILDH
Sbjct: 3   AAP--SISLSIFLFLSLHFTSSFSKIPLPFHSSLLLRPQTPPIDPLLPYQTGFFTQILDH 62

Query: 110 FNFNPQSYHTFQQRYLINDTYWGGAAHNAPIFVYTGNEGNIEWFAQNTGFLLQSAPHFRA 169
           FNFNPQSY  FQQRYLINDTYWGGAAHN+PIFVYTGNEGNIEWFAQNTGFLLQSAP FRA
Sbjct: 63  FNFNPQSYQYFQQRYLINDTYWGGAAHNSPIFVYTGNEGNIEWFAQNTGFLLQSAPRFRA 122

Query: 170 LVVFIEHRFYGKSIPFGGDEDVANSNSSTLGFLSSTQALADYATLITDLKKNLTAIDSPV 229
           LVVFIEHRFYGKSIPFGGDEDVANSNSSTLGFLSSTQALADYATLITDLKKNL+A+DSPV
Sbjct: 123 LVVFIEHRFYGKSIPFGGDEDVANSNSSTLGFLSSTQALADYATLITDLKKNLSAVDSPV 182

Query: 230 LVFGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFENITSPYAFNNIITQDFKSESQNC 289
           LVFGGSYGGMLAAWFRLKYPHIA+GALASSAPILQFENITSPYAF+NI+TQDFKSESQNC
Sbjct: 183 LVFGGSYGGMLAAWFRLKYPHIAMGALASSAPILQFENITSPYAFSNIVTQDFKSESQNC 242

Query: 290 YRVIKGSWHQIEITANQPGGPHLLRNSFKF------------------------------ 349
           YRVIK SWH I+IT+  P GP LLR SFKF                              
Sbjct: 243 YRVIKESWHLIDITSTHPQGPQLLRKSFKFCKEAEAESIKNWLSTAILYTAMTDYPTPSN 302

Query: 350 -------------CKAIDDPTTGNDTFAKLYGVANVYYNYSGTATCFDLDDDSDPHDLGD 409
                        CKAIDD  TGNDTFAKLYG ANVYYNY+GTATCFDLDDDSDPHDLGD
Sbjct: 303 FLNPLPAYPVKQMCKAIDDSRTGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGD 362

Query: 410 WTWQACTEMILPTGANTEESIFPASTWHFADRLDFCKRLYDVEPRRIWIPTHFGGHNIER 469
           WTWQACTEMILPTG NT+ESIFPASTWHFADRL FCK  +DVEPRRIWIPTHFGGHNIER
Sbjct: 363 WTWQACTEMILPTGGNTKESIFPASTWHFADRLHFCKTFFDVEPRRIWIPTHFGGHNIER 422

Query: 470 VLKRFGSNIIFFNGLRDPWSGGGVLKNISSTIIAIVAKEGAHHVDLRFSNPEDPKWLKDV 508
           VLKRFGSNIIFFNGLRDPWSGGGVLKNISSTIIAIVAKEGAHHVDLRFS+ EDPKWLKDV
Sbjct: 423 VLKRFGSNIIFFNGLRDPWSGGGVLKNISSTIIAIVAKEGAHHVDLRFSSAEDPKWLKDV 482

BLAST of HG10020088 vs. ExPASy TrEMBL
Match: A0A6J1KYN4 (lysosomal Pro-X carboxypeptidase OS=Cucurbita maxima OX=3661 GN=LOC111498764 PE=3 SV=1)

HSP 1 Score: 831.2 bits (2146), Expect = 2.3e-237
Identity = 405/498 (81.33%), Postives = 425/498 (85.34%), Query Frame = 0

Query: 53  SISISLSFFLFLSLHFTSSFSKFPLPFSSSLLLRPQNPPVDSLPHYHTNFFTQILDHFNF 112
           S S SLS FLFL LHF+SSFSK    FSSS LLR QNPP+DSL HY T FFTQILDHFNF
Sbjct: 3   STSTSLSLFLFLFLHFSSSFSKTSSLFSSSFLLRSQNPPIDSLRHYRTAFFTQILDHFNF 62

Query: 113 NPQSYHTFQQRYLINDTYWGGAAHNAPIFVYTGNEGNIEWFAQNTGFLLQSAPHFRALVV 172
           NPQSY +FQQRYLINDTYWGGAA NAPIFVYTGNEGNIEWFAQNTGF+L+SAP FRALVV
Sbjct: 63  NPQSYQSFQQRYLINDTYWGGAAENAPIFVYTGNEGNIEWFAQNTGFILESAPKFRALVV 122

Query: 173 FIEHRFYGKSIPFGGDEDVANSNSSTLGFLSSTQALADYATLITDLKKNLTAIDSPVLVF 232
           FIEHRFYGKSIPFGGDEDVAN NSSTLGFLSSTQALADYATLITD K+NLTAIDSPV+VF
Sbjct: 123 FIEHRFYGKSIPFGGDEDVANRNSSTLGFLSSTQALADYATLITDFKRNLTAIDSPVVVF 182

Query: 233 GGSYGGMLAAWFRLKYPHIAIGALASSAPILQFENITSPYAFNNIITQDFKSESQNCYRV 292
           GGSYGGMLAAWFRLKYPH+AIGA+ASSAPILQFENITSPY+FNNIITQDFKSESQNCYRV
Sbjct: 183 GGSYGGMLAAWFRLKYPHVAIGAVASSAPILQFENITSPYSFNNIITQDFKSESQNCYRV 242

Query: 293 IKGSWHQIEITANQPGGPHLLRNSFKF--------------------------------- 352
           IKGSW QI++ ANQPGGP LLRNSFKF                                 
Sbjct: 243 IKGSWQQIDMVANQPGGPQLLRNSFKFCKPADSQSIQNWLYTAILYTAMTDYPTPSNFLN 302

Query: 353 ----------CKAIDDPTTGNDTFAKLYGVANVYYNYSGTATCFDLDDDSDPHDLGDWTW 412
                     CKAIDDPTTGNDTFAKLYG ANVYYNYSGTATCFDLDDDSDPHDLGDWTW
Sbjct: 303 PLPAYPVKQMCKAIDDPTTGNDTFAKLYGAANVYYNYSGTATCFDLDDDSDPHDLGDWTW 362

Query: 413 QACTEMILPTGANTEESIFPASTWHFADRLDFCKRLYDVEPRRIWIPTHFGGHNIERVLK 472
           QACTE+ILPTGANTE+SIFPASTWH+ADR+ +CK L+DVEPRR WI THFGG NIERVLK
Sbjct: 363 QACTELILPTGANTEDSIFPASTWHYADRVHYCKSLFDVEPRRFWITTHFGGFNIERVLK 422

Query: 473 RFGSNIIFFNGLRDPWSGGGVLKNISSTIIAIVAKEGAHHVDLRFSNPEDPKWLKDVRKQ 508
           RFGSNIIFFNGLRDPWSGGGVLKNISS+IIAIVAKEGAHHVDLRFS+  DPKWLKDVRKQ
Sbjct: 423 RFGSNIIFFNGLRDPWSGGGVLKNISSSIIAIVAKEGAHHVDLRFSDSNDPKWLKDVRKQ 482

BLAST of HG10020088 vs. ExPASy TrEMBL
Match: A0A0A0LE34 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G778210 PE=3 SV=1)

HSP 1 Score: 828.6 bits (2139), Expect = 1.5e-236
Identity = 402/501 (80.24%), Postives = 425/501 (84.83%), Query Frame = 0

Query: 50  AAPSISISLSFFLFLSLHFTSSFSKFPLPFSSSLLLRPQNPPVDSLPHYHTNFFTQILDH 109
           AAP  SISLS FLFLSLHFTSSFSK PL F SSLLLRPQ+ P+D L  Y T+FFTQILDH
Sbjct: 3   AAP--SISLSIFLFLSLHFTSSFSKIPLSFPSSLLLRPQSSPIDPLLPYQTSFFTQILDH 62

Query: 110 FNFNPQSYHTFQQRYLINDTYWGGAAHNAPIFVYTGNEGNIEWFAQNTGFLLQSAPHFRA 169
           FNFNPQSY +FQQRYLINDTYWGGAAHN+PIFVYTGNEGNIEWFAQNTGFLLQ APHFRA
Sbjct: 63  FNFNPQSYQSFQQRYLINDTYWGGAAHNSPIFVYTGNEGNIEWFAQNTGFLLQYAPHFRA 122

Query: 170 LVVFIEHRFYGKSIPFGGDEDVANSNSSTLGFLSSTQALADYATLITDLKKNLTAIDSPV 229
           LVVFIEHRFYGKSIPFGGDEDVANSNSS LG+LSSTQALADYATLITDLKKNL+A+DSPV
Sbjct: 123 LVVFIEHRFYGKSIPFGGDEDVANSNSSMLGYLSSTQALADYATLITDLKKNLSAVDSPV 182

Query: 230 LVFGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFENITSPYAFNNIITQDFKSESQNC 289
           LVFGGSYGGMLAAWFRLKYPHIA+GALASSAPILQ ENITSPYAFNNI+TQDFKSESQNC
Sbjct: 183 LVFGGSYGGMLAAWFRLKYPHIALGALASSAPILQLENITSPYAFNNIVTQDFKSESQNC 242

Query: 290 YRVIKGSWHQIEITANQPGGPHLLRNSFKF------------------------------ 349
           Y VIK SWH I+IT+  P GP LLR SFKF                              
Sbjct: 243 YSVIKESWHLIDITSTHPQGPQLLRKSFKFCKEAEAESIKNWLSTAIIYTAMTDYPTPSN 302

Query: 350 -------------CKAIDDPTTGNDTFAKLYGVANVYYNYSGTATCFDLDDDSDPHDLGD 409
                        CKAIDDP +GND+F KLYG AN+YYN++GT TCFDLDDDSDPHDLGD
Sbjct: 303 FLNPLPAYPVKQMCKAIDDPRSGNDSFTKLYGAANIYYNFTGTVTCFDLDDDSDPHDLGD 362

Query: 410 WTWQACTEMILPTGANTEESIFPASTWHFADRLDFCKRLYDVEPRRIWIPTHFGGHNIER 469
           W+WQACTEMILPTG NT+ESIFPASTWHFADR  FCK  +DVEPRRIWIPTHFGGHNIER
Sbjct: 363 WSWQACTEMILPTGGNTKESIFPASTWHFADRFQFCKTFFDVEPRRIWIPTHFGGHNIER 422

Query: 470 VLKRFGSNIIFFNGLRDPWSGGGVLKNISSTIIAIVAKEGAHHVDLRFSNPEDPKWLKDV 508
           VLKRFGSNIIFFNGLRDPWSGGGVLKNISSTIIAIVAKEGAHHVDLRFSNP+DPKWLKDV
Sbjct: 423 VLKRFGSNIIFFNGLRDPWSGGGVLKNISSTIIAIVAKEGAHHVDLRFSNPDDPKWLKDV 482

BLAST of HG10020088 vs. ExPASy TrEMBL
Match: A0A6J1FCY6 (lysosomal Pro-X carboxypeptidase OS=Cucurbita moschata OX=3662 GN=LOC111444258 PE=3 SV=1)

HSP 1 Score: 828.2 bits (2138), Expect = 1.9e-236
Identity = 405/502 (80.68%), Postives = 429/502 (85.46%), Query Frame = 0

Query: 49  MAAPSISISLSFFLFLSLHFTSSFSKFPLPFSSSLLLRPQNPPVDSLPHYHTNFFTQILD 108
           MA+PS S+SL  FLFL LHF+SSFSK    F SSLLLRPQ PP+DSL HY T FFTQILD
Sbjct: 1   MASPSTSLSL--FLFLFLHFSSSFSKTSSVF-SSLLLRPQTPPIDSLRHYRTAFFTQILD 60

Query: 109 HFNFNPQSYHTFQQRYLINDTYWGGAAHNAPIFVYTGNEGNIEWFAQNTGFLLQSAPHFR 168
           HFNFNPQSY +FQQRYLINDTYWGGA  NAPIFVYTGNEGNIEWFAQNTGF+L+SAP FR
Sbjct: 61  HFNFNPQSYQSFQQRYLINDTYWGGAVENAPIFVYTGNEGNIEWFAQNTGFILESAPKFR 120

Query: 169 ALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGFLSSTQALADYATLITDLKKNLTAIDSP 228
           ALVVFIEHRFYGKSIPFGGDEDVAN NSSTLGFLSSTQALADYATLITDLK+NLTAIDSP
Sbjct: 121 ALVVFIEHRFYGKSIPFGGDEDVANRNSSTLGFLSSTQALADYATLITDLKRNLTAIDSP 180

Query: 229 VLVFGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFENITSPYAFNNIITQDFKSESQN 288
           V+VFGGSYGGMLAAWFRLKYPH+AIGA+ASSAPILQFENITSPY+FNNIITQDFKSESQN
Sbjct: 181 VVVFGGSYGGMLAAWFRLKYPHVAIGAIASSAPILQFENITSPYSFNNIITQDFKSESQN 240

Query: 289 CYRVIKGSWHQIEITANQPGGPHLLRNSFKF----------------------------- 348
           CYRVIKGSW QI++ ANQPGGP LLR SFKF                             
Sbjct: 241 CYRVIKGSWQQIDMVANQPGGPQLLRKSFKFCKPADSQSIQNWLYTAILYTAMTDYPTPS 300

Query: 349 --------------CKAIDDPTTGNDTFAKLYGVANVYYNYSGTATCFDLDDDSDPHDLG 408
                         CKAIDDPTTGNDTFAKLYG ANVYYNY+GTATCFDLDDDSDPHDLG
Sbjct: 301 NFLNPLPAYPVKQMCKAIDDPTTGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLG 360

Query: 409 DWTWQACTEMILPTGANTEESIFPASTWHFADRLDFCKRLYDVEPRRIWIPTHFGGHNIE 468
           DWTWQACTE+ILPTGANTE+SIFPASTWH+ADR+ +CK L+DVEPRR WI THFGG NIE
Sbjct: 361 DWTWQACTELILPTGANTEDSIFPASTWHYADRVHYCKSLFDVEPRRFWITTHFGGFNIE 420

Query: 469 RVLKRFGSNIIFFNGLRDPWSGGGVLKNISSTIIAIVAKEGAHHVDLRFSNPEDPKWLKD 508
           RVLKRFGSNIIFFNGLRDPWSGGGVLKNISS+IIAIVAKEGAHHVDLRFS+P DPKW+KD
Sbjct: 421 RVLKRFGSNIIFFNGLRDPWSGGGVLKNISSSIIAIVAKEGAHHVDLRFSDPNDPKWVKD 480

BLAST of HG10020088 vs. ExPASy TrEMBL
Match: A0A5D3DNW4 (Lysosomal Pro-X carboxypeptidase-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G004200 PE=3 SV=1)

HSP 1 Score: 823.9 bits (2127), Expect = 3.6e-235
Identity = 412/530 (77.74%), Postives = 428/530 (80.75%), Query Frame = 0

Query: 50  AAPSISISLSFFLFLSLHFTSSFSKFPLPFSSSLLLRPQNPPVDSLPHYHTNFFTQILDH 109
           AAP  SISLS FLFLSLHFTSSFSK PLPF SSLLLRPQ PP+D L  Y T FFTQILDH
Sbjct: 3   AAP--SISLSIFLFLSLHFTSSFSKIPLPFHSSLLLRPQTPPIDPLLPYQTGFFTQILDH 62

Query: 110 FNFNPQSYHTFQQRYLINDTYWGGAAHNAPIFVYTGNEGNIEWFAQNTGFLLQSAPHFRA 169
           FNFNPQSY  FQQRYLINDTYWGGAAHN+PIFVYTGNEGNIEWFAQNTGFLLQSAP FRA
Sbjct: 63  FNFNPQSYQYFQQRYLINDTYWGGAAHNSPIFVYTGNEGNIEWFAQNTGFLLQSAPRFRA 122

Query: 170 LVVFIEHRFYGKSIPFGGDEDVANSNSSTLGFLSSTQALADYATLITDLKKNLTAIDSPV 229
           LVVFIEHRFYGKSIPFGGDEDVANSNSSTLGFLSSTQALADYATLITDLKKNL+A+DSPV
Sbjct: 123 LVVFIEHRFYGKSIPFGGDEDVANSNSSTLGFLSSTQALADYATLITDLKKNLSAVDSPV 182

Query: 230 LVFGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFENITSPYAFNNIITQDFKSESQNC 289
           LVFGGSYGGMLAAWFRLKYPHIA+GALASSAPILQFENITSPYAF+NI+TQDFKSESQNC
Sbjct: 183 LVFGGSYGGMLAAWFRLKYPHIAMGALASSAPILQFENITSPYAFSNIVTQDFKSESQNC 242

Query: 290 YRVIKGSWHQIEITANQPGGPHLLRNSFKF------------------------------ 349
           YRVIK SWH I+IT+  P GP LLR SFKF                              
Sbjct: 243 YRVIKESWHLIDITSTHPQGPQLLRKSFKFCKEAEAESIKNWLSTAILYTAMTDYPTPSN 302

Query: 350 -------------CKAIDDPTTGNDTFAKLYGVANVYYNYSGTATCFDLDDDSDPHDLGD 409
                        CKAIDD  TGNDTFAKLYG ANVYYNY+GTATCFDLDDDSDPHDLGD
Sbjct: 303 FLNPLPAYPVKQMCKAIDDSRTGNDTFAKLYGAANVYYNYTGTATCFDLDDDSDPHDLGD 362

Query: 410 WTWQ-----------------------------ACTEMILPTGANTEESIFPASTWHFAD 469
           WTWQ                             ACTEMILPTG NT+ESIFPASTWHFAD
Sbjct: 363 WTWQVLFFLLYSLSLFILISLCYVSFFFFPFIRACTEMILPTGGNTKESIFPASTWHFAD 422

Query: 470 RLDFCKRLYDVEPRRIWIPTHFGGHNIERVLKRFGSNIIFFNGLRDPWSGGGVLKNISST 508
           RL FCK  +DVEPRRIWIPTHFGGHNIERVLKRFGSNIIFFNGLRDPWSGGGVLKNISST
Sbjct: 423 RLHFCKTFFDVEPRRIWIPTHFGGHNIERVLKRFGSNIIFFNGLRDPWSGGGVLKNISST 482

BLAST of HG10020088 vs. TAIR 10
Match: AT2G24280.1 (alpha/beta-Hydrolases superfamily protein )

HSP 1 Score: 446.8 bits (1148), Expect = 2.3e-125
Identity = 231/491 (47.05%), Postives = 303/491 (61.71%), Query Frame = 0

Query: 61  FLFLSLHFTSSFSKFPLPFSSSLLLRPQNPPVDSLPHYHTNFFTQILDHFNFNPQSYHTF 120
           FLF S+   +++S       SSL L+ +         + T +F Q LDHF+F P SY  F
Sbjct: 9   FLFFSIVAEATYSPGGFHHLSSLRLKKKVSKSKHELPFETRYFPQNLDHFSFTPDSYKVF 68

Query: 121 QQRYLINDTYWGGAAHNAPIFVYTGNEGNIEWFAQNTGFLLQSAPHFRALVVFIEHRFYG 180
            Q+YLIN+ +W       PIFVYTGNEG+I+WFA NTGF+L  AP FRAL+VFIEHRFYG
Sbjct: 69  HQKYLINNRFW---RKGGPIFVYTGNEGDIDWFASNTGFMLDIAPKFRALLVFIEHRFYG 128

Query: 181 KSIPFGGDEDVANSNSSTLGFLSSTQALADYATLITDLKKNLTAIDSPVLVFGGSYGGML 240
           +S PFG     ++ ++ TLG+L+S QALADYA LI  LK+NL++  SPV+VFGGSYGGML
Sbjct: 129 ESTPFG---KKSHKSAETLGYLNSQQALADYAILIRSLKQNLSSEASPVVVFGGSYGGML 188

Query: 241 AAWFRLKYPHIAIGALASSAPILQFENITSPYAFNNIITQDFKSESQNCYRVIKGSWHQI 300
           AAWFRLKYPHI IGALASSAPIL F+NI    +F + I+QDFK  S NC++VIK SW ++
Sbjct: 189 AAWFRLKYPHITIGALASSAPILHFDNIVPLTSFYDAISQDFKDASINCFKVIKRSWEEL 248

Query: 301 EITANQPGGPHLLRNSFK------------------------------------------ 360
           E  +    G   L   F+                                          
Sbjct: 249 EAVSTMKNGLQELSKKFRTCKGLHSQYSARDWLSGAFVYTAMVNYPTAANFMAPLPGYPV 308

Query: 361 --FCKAIDDPTTGNDTFAKLYGVANVYYNYSGTATCFDLDDDSDPHDLGDWTWQACTEMI 420
              CK ID    G+    + +  A++YYNYSG+  CF+++  +D H L  W +QACTEM+
Sbjct: 309 EQMCKIIDGFPRGSSNLDRAFAAASLYYNYSGSEKCFEMEQQTDDHGLDGWQYQACTEMV 368

Query: 421 LPTGANTEESIFPASTWHFADRLDFCKRLYDVEPRRIWIPTHFGGHNIERVLKRFGSNII 480
           +P   + +  + P      A + + C   Y V+PR  WI T FGG  IE VLKRFGSNII
Sbjct: 369 MPMSCSNQSMLPPYENDSEAFQ-EQCMTRYGVKPRPHWITTEFGGMRIETVLKRFGSNII 428

Query: 481 FFNGLRDPWSGGGVLKNISSTIIAIVAKEGAHHVDLRFSNPEDPKWLKDVRKQELNIIQD 508
           F NG++DPWS GGVLKNISS+I+A+V K+GAHH DLR +  +DP+WLK+ R+QE+ II+ 
Sbjct: 429 FSNGMQDPWSRGGVLKNISSSIVALVTKKGAHHADLRAATKDDPEWLKEQRRQEVAIIEK 488

BLAST of HG10020088 vs. TAIR 10
Match: AT5G65760.1 (Serine carboxypeptidase S28 family protein )

HSP 1 Score: 439.5 bits (1129), Expect = 3.7e-123
Identity = 219/451 (48.56%), Postives = 280/451 (62.08%), Query Frame = 0

Query: 98  YHTNFFTQILDHFNFNPQSYHTFQQRYLINDTYWGGAAHNAPIFVYTGNEGNIEWFAQNT 157
           Y T FF+Q LDHF+F       F QRYLIN  +W GA+   PIF+Y GNEG+IEWFA N+
Sbjct: 58  YETKFFSQQLDHFSF--ADLPKFSQRYLINSDHWLGASALGPIFLYCGNEGDIEWFATNS 117

Query: 158 GFLLQSAPHFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGFLSSTQALADYATLITD 217
           GF+   AP F AL+VF EHR+YG+S+P+G  E+ A  N++TL +L++ QALAD+A  +TD
Sbjct: 118 GFIWDIAPKFGALLVFPEHRYYGESMPYGSREE-AYKNATTLSYLTTEQALADFAVFVTD 177

Query: 218 LKKNLTAIDSPVLVFGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFENITSPYAFNNI 277
           LK+NL+A   PV++FGGSYGGMLAAW RLKYPHIAIGALASSAPILQFE++  P  F +I
Sbjct: 178 LKRNLSAEACPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDVVPPETFYDI 237

Query: 278 ITQDFKSESQNCYRVIKGSWHQIEITANQPGGPHLLRNSFKFCKA--------------- 337
            + DFK ES +C+  IK SW  I     +  G   L  +F FC+                
Sbjct: 238 ASNDFKRESSSCFNTIKDSWDAIIAEGQKENGLLQLTKTFHFCRVLNSTDDLSDWLDSAY 297

Query: 338 -----------------------------IDDPTTGNDTFAKLYGVANVYYNYSGTATCF 397
                                        ID   +      ++Y   +VYYNY+G   CF
Sbjct: 298 SYLAMVDYPYPADFMMPLPGHPIREVCRKIDGAGSNASILDRIYAGISVYYNYTGNVDCF 357

Query: 398 DLDDDSDPHDLGDWTWQACTEMILPTGANTEESIFPASTWHFADRLDFCKRLYDVEPRRI 457
            LDD  DPH L  W WQACTEM++P  +N E S+FP   ++++   + C   + V PR  
Sbjct: 358 KLDD--DPHGLDGWNWQACTEMVMPMSSNQENSMFPGYGFNYSSYKEECWNTFRVNPRPK 417

Query: 458 WIPTHFGGHNIERVLKRFGSNIIFFNGLRDPWSGGGVLKNISSTIIAIVAKEGAHHVDLR 505
           W+ T FGGH+I   LK FGSNIIF NGL DPWSGG VLKN+S TI+A+V KEGAHH+DLR
Sbjct: 418 WVTTEFGGHDIATTLKSFGSNIIFSNGLLDPWSGGSVLKNLSDTIVALVTKEGAHHLDLR 477

BLAST of HG10020088 vs. TAIR 10
Match: AT5G22860.1 (Serine carboxypeptidase S28 family protein )

HSP 1 Score: 370.2 bits (949), Expect = 2.7e-102
Identity = 193/489 (39.47%), Postives = 279/489 (57.06%), Query Frame = 0

Query: 62  LFLSLHFTSSFSKFPLPFSSSLLLRPQNPPVDSLPHYHTN----------FFTQILDHFN 121
           L L +  TSS    PL  S    L   +  + + P   T           +F Q LDHF 
Sbjct: 8   LILFIFSTSSSYLIPLAHSKIARLGISSKTLKNEPDGSTQKVDESNLKMYYFNQTLDHFT 67

Query: 122 FNPQSYHTFQQRYLINDTYWGGAAHNAPIFVYTGNEGNIEWFAQNTGFLLQSAPHFRALV 181
           F P+SY TFQQRY I+ T+WGGA  NAPI  + G E +++      GFL  + P   AL+
Sbjct: 68  FTPESYMTFQQRYAIDSTHWGGAKANAPILAFLGEESSLDSDLAAIGFLRDNGPRLNALL 127

Query: 182 VFIEHRFYGKSIPFGGDEDVANSNSSTLGFLSSTQALADYATLITDLKKNLTAIDSPVLV 241
           V+IEHR+YG+++PFG  E+ A  N+STLG+L++ QALADYA ++  +K+  +   SP++V
Sbjct: 128 VYIEHRYYGETMPFGSAEE-ALKNASTLGYLNAAQALADYAAILLHVKEKYSTNHSPIIV 187

Query: 242 FGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFENITSPYAFNNIITQDFKSESQNCYR 301
            GGSYGGMLAAWFRLKYPHIA+GALASSAP+L FE+    + +  I+T+ FK  S+ CY 
Sbjct: 188 IGGSYGGMLAAWFRLKYPHIALGALASSAPLLYFEDTRPKFGYYYIVTKVFKEASERCYN 247

Query: 302 VIKGSWHQIEITANQPGGPHLLRNSFKFCKAIDDPTTGNDTFAKLYGVANVY-------- 361
            I+ SW +I+  A +P G  +L   FK C  ++      D    +Y  A  Y        
Sbjct: 248 TIRNSWIEIDRVAGKPNGLSILSKQFKTCAPLNGSFDIKDFLDTIYAEAVQYNRGPNFWV 307

Query: 362 ---------------YN-----------YSGTATCFDLDDDSDPHDLG-DWTWQACTEMI 421
                          YN             G  TC+D    + P +    W WQ+C+E++
Sbjct: 308 AKVCNAINANPPNRRYNLLDRIFAGVVALVGNRTCYDTKMFAQPTNNNIAWRWQSCSEIV 367

Query: 422 LPTGANTEESIFPASTWHFADRLDFCKRLYDVEPRRIWIPTHFGGHNIERVLKRFGSNII 481
           +P G + ++++FP + ++    +D CK  + V PR  WI T+FG   ++ +L++FGSNII
Sbjct: 368 MPVGYDKQDTMFPTAPFNMTSYIDGCKSYHGVTPRPHWITTYFGIQEVKLILQKFGSNII 427

Query: 482 FFNGLRDPWSGGGVLKNISSTIIAIVAKEGAHHVDLRFSNPEDPKWLKDVRKQELNIIQD 506
           F NGL DP+S GGVL++IS T++AI  K G+H +D+   + EDP+WL   R++E+ +I  
Sbjct: 428 FSNGLSDPYSVGGVLEDISDTLVAITTKNGSHCLDITLKSKEDPEWLVIQREKEIKVIDS 487

BLAST of HG10020088 vs. TAIR 10
Match: AT5G22860.2 (Serine carboxypeptidase S28 family protein )

HSP 1 Score: 317.4 bits (812), Expect = 2.1e-86
Identity = 170/433 (39.26%), Postives = 242/433 (55.89%), Query Frame = 0

Query: 62  LFLSLHFTSSFSKFPLPFSSSLLLRPQNPPVDSLPHYHTN----------FFTQILDHFN 121
           L L +  TSS    PL  S    L   +  + + P   T           +F Q LDHF 
Sbjct: 8   LILFIFSTSSSYLIPLAHSKIARLGISSKTLKNEPDGSTQKVDESNLKMYYFNQTLDHFT 67

Query: 122 FNPQSYHTFQQRYLINDTYWGGAAHNAPIFVYTGNEGNIEWFAQNTGFLLQSAPHFRALV 181
           F P+SY TFQQRY I+ T+WGGA  NAPI  + G E +++      GFL  + P   AL+
Sbjct: 68  FTPESYMTFQQRYAIDSTHWGGAKANAPILAFLGEESSLDSDLAAIGFLRDNGPRLNALL 127

Query: 182 VFIEHRFYGKSIPFGGDEDVANSNSSTLGFLSSTQALADYATLITDLKKNLTAIDSPVLV 241
           V+IEHR+YG+++PFG  E+ A  N+STLG+L++ QALADYA ++  +K+  +   SP++V
Sbjct: 128 VYIEHRYYGETMPFGSAEE-ALKNASTLGYLNAAQALADYAAILLHVKEKYSTNHSPIIV 187

Query: 242 FGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFENITSPYAFNNIITQDFKSESQNCYR 301
            GGSYGGMLAAWFRLKYPHIA+GALASSAP+L FE+    + +  I+T+ FK  S+ CY 
Sbjct: 188 IGGSYGGMLAAWFRLKYPHIALGALASSAPLLYFEDTRPKFGYYYIVTKVFKEASERCYN 247

Query: 302 VIKGSWHQIEITANQPGGPHLLRNSFKFCKAIDDPTTGNDTFAKLYGVANVY-------- 361
            I+ SW +I+  A +P G  +L   FK C  ++      D    +Y  A  Y        
Sbjct: 248 TIRNSWIEIDRVAGKPNGLSILSKQFKTCAPLNGSFDIKDFLDTIYAEAVQYNRGPNFWV 307

Query: 362 ---------------YN-----------YSGTATCFDLDDDSDPHDLG-DWTWQACTEMI 421
                          YN             G  TC+D    + P +    W WQ+C+E++
Sbjct: 308 AKVCNAINANPPNRRYNLLDRIFAGVVALVGNRTCYDTKMFAQPTNNNIAWRWQSCSEIV 367

Query: 422 LPTGANTEESIFPASTWHFADRLDFCKRLYDVEPRRIWIPTHFGGHNIERVLKRFGSNII 450
           +P G + ++++FP + ++    +D CK  + V PR  WI T+FG   ++ +L++FGSNII
Sbjct: 368 MPVGYDKQDTMFPTAPFNMTSYIDGCKSYHGVTPRPHWITTYFGIQEVKLILQKFGSNII 427

BLAST of HG10020088 vs. TAIR 10
Match: AT4G36190.1 (Serine carboxypeptidase S28 family protein )

HSP 1 Score: 108.6 bits (270), Expect = 1.5e-23
Identity = 108/388 (27.84%), Postives = 161/388 (41.49%), Query Frame = 0

Query: 102 FFTQILDHFNFNPQSYHTFQQRYLINDTYWGGAAHNAPIFVYTGNEGNIEWFAQNTGFLL 161
           +FTQ LDH  ++P  +  F+QRY     +      + PIF+    EG       N  ++ 
Sbjct: 49  WFTQTLDH--YSPSDHRKFRQRYYEYLDHL--RVPDGPIFLMICGEGPCNGITNN--YIS 108

Query: 162 QSAPHFRALVVFIEHRFYGKSIPFGGDEDVANSNSSTLGFLSSTQALADYATLITDLKKN 221
             A  F A +V +EHR+YGKS PF   + +A  N   L +LSS QAL+D AT     + +
Sbjct: 109 VLAKKFDAGIVSLEHRYYGKSSPF---KSLATKN---LKYLSSKQALSDLATFRQYYQDS 168

Query: 222 LTA-------IDSPVLVFGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFENITSPYAF 281
           L         +++P   FG SY G L+AWFRLK+PH+  G+LASSA       + + Y F
Sbjct: 169 LNVKFNRSSNVENPWFFFGVSYSGALSAWFRLKFPHLTCGSLASSAV------VRAVYEF 228

Query: 282 NNIITQDFKSESQNC------------------YRVIKGSWHQIEITAN-------QPGG 341
                Q  +S    C                   R +K  ++  E+  +          G
Sbjct: 229 PEFDQQIAESAGPECETALQETNKLLELGLKVNNRAVKALFNATELDVDADFLYLIADAG 288

Query: 342 PHLLR--NSFKFCKAIDDPTTGNDTFAKLYG------VANVYYNYSGTATCFDLDDDSDP 401
              ++  N  K C  + +         + Y          V+   S T +   L D +  
Sbjct: 289 VMAIQYGNPDKLCVPLVEAQKNGGDLVEAYAKYVREFCMGVFGQSSKTYSRKHLLDTAVT 348

Query: 402 HDLGD--WTWQACTEMILPTGANTEESIFPASTWHFADRLDFCKRLY--DVEPRRIWIPT 446
            +  D  W +Q CTE+     A   +SI  +   +    LD CK L+   V P       
Sbjct: 349 LESADRLWWFQVCTEVAYFQVAPANDSI-RSHQINTEYHLDLCKSLFGKGVYPEVDATNL 408

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038904032.12.7e-25385.86lysosomal Pro-X carboxypeptidase-like isoform X1 [Benincasa hispida][more]
XP_008442879.11.7e-23982.24PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis melo][more]
XP_023539018.12.1e-23780.68lysosomal Pro-X carboxypeptidase [Cucurbita pepo subsp. pepo] >XP_023539019.1 ly... [more]
XP_023005894.14.6e-23781.33lysosomal Pro-X carboxypeptidase [Cucurbita maxima] >XP_023005895.1 lysosomal Pr... [more]
KAG6596591.11.0e-23680.48Lysosomal Pro-X carboxypeptidase, partial [Cucurbita argyrosperma subsp. sororia... [more]
Match NameE-valueIdentityDescription
Q2TA141.2e-8639.17Lysosomal Pro-X carboxypeptidase OS=Bos taurus OX=9913 GN=PRCP PE=2 SV=1[more]
Q5RBU76.6e-8538.09Lysosomal Pro-X carboxypeptidase OS=Pongo abelii OX=9601 GN=PRCP PE=2 SV=1[more]
P427851.1e-8438.09Lysosomal Pro-X carboxypeptidase OS=Homo sapiens OX=9606 GN=PRCP PE=1 SV=1[more]
Q7TMR07.1e-7937.34Lysosomal Pro-X carboxypeptidase OS=Mus musculus OX=10090 GN=Prcp PE=1 SV=2[more]
Q9ET227.8e-7835.67Dipeptidyl peptidase 2 OS=Mus musculus OX=10090 GN=Dpp7 PE=1 SV=2[more]
Match NameE-valueIdentityDescription
A0A1S3B7I88.3e-24082.24lysosomal Pro-X carboxypeptidase-like OS=Cucumis melo OX=3656 GN=LOC103486644 PE... [more]
A0A6J1KYN42.3e-23781.33lysosomal Pro-X carboxypeptidase OS=Cucurbita maxima OX=3661 GN=LOC111498764 PE=... [more]
A0A0A0LE341.5e-23680.24Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G778210 PE=3 SV=1[more]
A0A6J1FCY61.9e-23680.68lysosomal Pro-X carboxypeptidase OS=Cucurbita moschata OX=3662 GN=LOC111444258 P... [more]
A0A5D3DNW43.6e-23577.74Lysosomal Pro-X carboxypeptidase-like OS=Cucumis melo var. makuwa OX=1194695 GN=... [more]
Match NameE-valueIdentityDescription
AT2G24280.12.3e-12547.05alpha/beta-Hydrolases superfamily protein [more]
AT5G65760.13.7e-12348.56Serine carboxypeptidase S28 family protein [more]
AT5G22860.12.7e-10239.47Serine carboxypeptidase S28 family protein [more]
AT5G22860.22.1e-8639.26Serine carboxypeptidase S28 family protein [more]
AT4G36190.11.5e-2327.84Serine carboxypeptidase S28 family protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR008758Peptidase S28PFAMPF05577Peptidase_S28coord: 363..488
e-value: 1.2E-15
score: 57.3
coord: 105..346
e-value: 1.0E-57
score: 195.9
IPR029058Alpha/Beta hydrolase foldGENE3D3.40.50.1820alpha/beta hydrolasecoord: 95..344
e-value: 3.3E-70
score: 238.5
IPR029058Alpha/Beta hydrolase foldGENE3D3.40.50.1820alpha/beta hydrolasecoord: 345..504
e-value: 2.5E-32
score: 114.3
IPR029058Alpha/Beta hydrolase foldSUPERFAMILY53474alpha/beta-Hydrolasescoord: 170..472
IPR029058Alpha/Beta hydrolase foldSUPERFAMILY53474alpha/beta-Hydrolasescoord: 132..271
NoneNo IPR availablePANTHERPTHR11010:SF80LYSOSOMAL PRO-X CARBOXYPEPTIDASE-LIKE PROTEINcoord: 58..321
NoneNo IPR availablePANTHERPTHR11010:SF80LYSOSOMAL PRO-X CARBOXYPEPTIDASE-LIKE PROTEINcoord: 318..507
NoneNo IPR availablePANTHERPTHR11010PROTEASE S28 PRO-X CARBOXYPEPTIDASE-RELATEDcoord: 58..321
NoneNo IPR availablePANTHERPTHR11010PROTEASE S28 PRO-X CARBOXYPEPTIDASE-RELATEDcoord: 318..507

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10020088.1HG10020088.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0004180 carboxypeptidase activity
molecular_function GO:0008239 dipeptidyl-peptidase activity
molecular_function GO:0008236 serine-type peptidase activity