CmoCh15G001070 (gene) Cucurbita moschata (Rifu) v1

Overview
NameCmoCh15G001070
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu) v1)
Descriptionaspartic proteinase PCS1-like
LocationCmo_Chr15: 519475 .. 525423 (-)
RNA-Seq ExpressionCmoCh15G001070
SyntenyCmoCh15G001070
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCTTCTTCCTCCGCCTGCTGCAGCTTCTTATCTGCTGCGTCTCTTTCAAACAGGGCCTCTGTTTTTCTGCGACTCAGACCATGGTTTTGCCCCTCAAAACACAGATGGGTGTCACTTCTCGGCCTTCCAATAAGCTCAGTTTTCACCATAATGTCACTTTGACTGTTTCCTTAACGCTTGGCTCGCCTCCTCAACCCGTTACTATGGTTCTCGATACAGGGAGTGAACTCTCATGGCTTCACTGCAAAAAAACCCCAAATTTGAACTCTGTTTTTAACCCACTTTCTTCCTCTTCTTACTCGCCAGTCCCCTGTGCTTCCCCTGTTTGCCGGACCCGAACCCGAGATTTACCCAACCCGGTTACCTGCGACCCAAAGAAACTCTGCCACGTCTTTGTCTCTTATGCCGACGCCTCGTCGCTCGAGGGTAATCTCGCGTCGGATACGTTTCGAGTCGGGTCATCGGCTCAACCCGGAACTTTTTTTGGGTGTATGGATTCGGGTTTCAGTTCGAATTCGGAGGAGGACGCGAAGACCACTGGGCTGATGGGGATGAACAGGGGCTCGCTCTCGTTTGTCACCCAATTGGGTTTGCCCAAATTCTCTTATTGCATATCGGGTCGTGATTCTTCTGGGGTTCTGCTTTTCGGCGACGCGAGTCTTTCTTGGCTTGGGAATTTGACCTACACGCCTTTGGTTCAAATGTCTACGCCATTGCCGTATTACGACCGAGTCGCCTACACGGTCCAACTAGACGGAATCAGAGTAGGGAACAAAATTCTGGCACTCCCGAAGTCAATATTCGCACCAGACCACACCGGCGCCGGGCAAACCATGGTAGATTCAGGGACCCAGTTCACGTTTCTTCTGGGACCAGTGTACACGGCTTTAAAGAACGAGTTTGTGGTACAAACGAAGGGCATTTTGGTCCCACTGGGTGATCCAAATTTCGTGTTTCAAGGAGCGATGGACTTGTGCTACAGAGTACCCGAGAAACAGGGGAAACTGCCGCCACTGCCGGTAGTGAGTCTGATGTTCCGTGGGGCGGAGATGGTGGTTGGCGGAGAGGTGCTAATGTATAAAGTACCGGGAATGGTAAGGGGTGGTGACCAAGTGCATTGCTTGACGTTTGGGAATTCGGATTTGTTAGGAATAGAAGCATTTGTGATTGGGCATCATCATCAACAAAACGTGTGGATGGAATTCGACTTGGTGAAATCAAGGGTTGGATTTGTAGAGACGAGGTGTGATTTGGCGGGTGAGCGGCTGGGGCTGGGGCCTTAAAAAATTAAAAAAGAACTTCGAATTGAAAAATGGCCCGAAGCACTGCCCAATCCATGGCCCACCTCAAGAGGGAGGAGGTGGCCCATTGATCGGACAGGCACGGGAACAGAGTCTCTCTCTCGGGTTTTATTTTATTTTATTTTTAAAAAAATTTTGATTGCTCTGTGAGATTCTCTGAAGAACATTGTGGAGATGTCAGGTCTTTTTGTTTTCGAAATGGTTCAAAATTTCTTTTCCATGTTTTGTTCATTGGGCTTGGTGTCGTGTTGGGTCATTCAGTTAGTACAATTTTGAATTGTATATATTTATATATATATATATATATATATATCTATATATTATTATATTATTGGTGTTTTCTGTCTTCAAGTTCTGACTTTTGATAAAGATAAAGGTTGCTAATTTTGCCAGCGCATCCTCTTAGCCCACCCCCACCTGCCGCCCCCTTTTGAGATTAAATCAATTTTTGACAAAAAGAAAAATTAAAAAAACAATAGGGTTTTTGTGAGCCGGGTGAATTAAATTGAAGATTTTTAAATATTCATGGTAAATTAATTTTTTTTAATAAAAAAGTATATTTGTGGAATTAAGAAAAAAAAAAATTAAAGAAACATGGACACTATGTAGTAGTATCACGTGAGATGAAAATCACGAGGGGGGCAGTACACACGTGTGGTTTGTTTTTAAGGAAAGATGCAAGTTGCCGACCAACCAGAGTCAATCAGTTTGACACGTGTCATAGAATAAGGCGTGGACTCTATACACCTCAACCACAAAGGCCTAACTGCCTTGTTTATTGATTTTTGTCGTTTCTATTATATTTTTTATATTTCACAAATAAACTAACCCTTTCATTTTATTAATTGTATTAGATAAATACAGAAGCAAGATATAAAATTTAAAAATTTGTTATATAAAATATTATTATATATTTCTTTAATATGTTTTTGAAAACCTATTTTATAACCGATACATTTTAAAATTGTGAGATTGATGATAAACATAACTGGTCAAAGCGGATAATATCTATTAGCAGTGGTTTCGGAACTGTTACAAATGGTATCAGAGTTAGTCACATGATGGTGCGAGACAATAGTAGGAATAGGGCTAGACCCTCTCTATAATAGATGCATTTTAAATTCGTAAGAGCTGATAACAATACGTAATGAGCTAAAACGAATAATATTTACTAATGGATGCATTTGAAGCGAAGACGCTGGCTCCCAAAATGTAGAAACCTCTCTCTCTAATAGATGCATTTGAAGTATTATTTGGACTTATTTTTGATGTCCTCCAACAAACAAATGAATAATTTAACGTAATGGATAAGACCATTATTATTATTATTAAATTTTTAAAAGATAATATGGTTAAATCTTAAAAATGGAAATAGTTTCAAAATCATAGGTAGATATAAAATTTGCATTAATGAAGTTTGAAATTAGAATTAGGGCCATAGAATGGATCGGGTCGGTTGGTTTTTGGAAGGGTATGGGCCATGGGTCGTTCGGTAACGCTCGGGAAAAGTCTAGAGATTAGAAGAAGCAAAAGCAGGCGTTAGAACATAGAATTTGACCAAATCAAAATACTACGTCGTTTTGCGCATTCAACTCCATCTCCCAAAGCCTACCTTACTAGGAAGCGAAGCCAAGCGAAACACGGGTCACATAGCAACTCAACGCGTCCGAGTCACTTCCCTCACTAGACTCGCTCCTCCATCTTCTCCGTCGCCACATACGGGTTTTCATTCTCACATAATTTTAAGATACCCCATCGCCAAAACTGTCCGATTCCAAGCAAACTCTGTCGCTTTTGGACCAACACAATCACCCCTATTTACTCGATTCTGTTCTGTAAGTCTCGGATTTCCCTCACGAAGGATTATTCTGGTTTGATTATTTGCTGATCCCTGTTCTCTTGTGCCTTAATTTCTTGGGATTCGGTTTCTAGGGTTCCTTAAATGAGTGATTTTTATTTTTATCGTTTGAATTTCACTTGTTCAACTGTCATCATAGCTTACGGGGCTGTAAAATTGTTTCGTCTGTCTGATTCCATTCATGTTTTTCGCTACATTTGACTATCTACGGGAGTTGGCTACGAATGTCGGGAATTGAGAATTTAGATGAAAAGAATTCAAAGCTGATCTGGTTCAACCGACCGATAGTTCTGATTTCGAAGTTGAGCATTTGCATTTTAGTGCAGACCTCAATCATGTCAATATTGGATGTCTGTATGCACTTCAGTTTGTGTGCAGTCATCTTTCGGTCGTTTGGTGTTCATTAGATGGCCATATGAGGGGGAATGTCACTGGATGTCTTTCCATACAATGCTGATTCATTCCTTAGAGGATATTCAGGATTGATATATTGTCTGAAGGAATCAATAAATTTGAGTTATCTAGTCTTTATGGAGCAGATGATTAGCTCATGAATGTGATTTATGCATTGAATTTGACAACTAGGAAACAATTAAGAACATGTCAATGCTTACGGATAGACCATTCTCATTGGGGCTGAGAATGATGAATGATTGGTTATGTAGTAAGGCGATGCGGGCATCATTTGATTCATTTTGTTTGTCTATTAAGATACTGGAGCCTCTTTCTTTATCAGTTCTGCAGATAATTATCATGAGTTCATCTGAATTTCATTCTGTAGATTCTCATTTCTCTGACATCAACCAGCTGTCGCAACTACTGCAGACATGAGTTTTGTTTTCCGTGGAACAAGAGTTCCAGACATAGAAAATGGTCTATCTGGATTTATACCTGAACGGCGTGCAATGGTATGCTCAATATTTTACACTCTCGACTTGTTAATACCAATACTTGAAAATTTATAGAACCTGCACTTGTCATGATCTTAATTTAAAATATATGGTTTTTCTGCTCCCAATCAGCGGGTTCATGCAGCACGTCCTGTTAACTCAAACTCACTTGCCTTTCTTGTCACAGGTACCATTTCACCTATATACAGACTAATATCTGTTTCATGTGATAATTTTGATCTATTGCATTGTTTTAACTCCTTGACCTTGTTGCAGTCCTTTTGTTGTTCATGATGTTAAATTCTCACCAGATGTCACCGAACTTTCTGGTAAGTTTTACATTTCACACTGGATGAGCAGTAGACTGGTGTTACTCTACCCTTGAGCAAATCTTTACATTTTCTTCCTTTGAACATGCATATTTAGCTCTGGCTTGTGCTTGGTGTGTTTTTGATGGCTACAACATTAAGGATGTATGCGACCTGTCAACAACTTCAGGCTCAAGCCCAAGCTAGAGCTATGGCAGCCAGTGGCCTTCTTGGGCACACTGAATTGAGGTTACATATGCCACCATCAATAGCACTTGCTACTAGAGGACGTTTACAAGGGCTAAGGCTTCAACTTGCTCTGCTTGATCGGGAATTTGATGATTTAGGTATGGTTCAAATCTAAAAAGTTTAAGTTCGTTGTTTCTTGTAAAAGTTTTTTTATGGATGATAATGTAGTAACATATTGTTTATCTTCAGATTATGAAACTTTGAGAGCTTTGGATTCTGACAATGCTCCGACAACACCTTCTATGAGTGAGGAACAAATAAATGCTCTTCCTGTTCATAAATACAAGGTTTCTGGTCCTCAAAGGTAATCAGCCCCATACTGTCTACAGTCCAGAAAGAATTAGGGTATCTTGCCTTATTCTAAGATTTGCTTGGAAGACTCATATTCTGTGGTGAAAGATTTTTCAACGATGAAATCCATGCACTTGAGCTTGCACTCATAAACTAAAGTAACGTTTTCATTAAATTATCTACAGCGACCCCTCTGTGAACCAGCAGGCTTCATCTTCAGAGTCTAATGAGGTATTTGTTAGATTTCTTTTTCCTTACGTGTTGATTCTTTTATTGCCTTAAACATGTACGTGTTCTTAAATTTTGATTGCAGAAGAGACAAGATTCAGCTAATGCAGTTGGCAGTACCAAGGCCTCGGAGGATGAACTTACGTGCAGTGTTTGCTTGGAGCAAGTAAATGTTGGTGAACTCGTACGTAGTCTGCCATGTTTACATCAGGTTTATCTCTATCTCCTTCTCTATCTCACCTTTTCTTCCAAATAAATGGTTGCCGAGATTTCATTTGAAAAGTCATGGACTTGGTTCATTGTTCTTCCCTTAAAATCTTGATACATCTTCTATCGCATGCATTCCTATACTTTTTTTAATTTGAATTTGGTTATACATGAATTCAATCTTGCAGTTCCATGCCAACTGCATAGATCCATGGCTGCGACAGCAGGGCACGTGCCCTGTTTGTAAATTCAGAGCGGTGTCTGGGTGGTCAGAACAGGGACAAGGGGAAACCGATGCGTATTCGGTTTGATTGAAGGCAAGGCAAGTTATCGAACTATGCTTGATGCCTGCTTAAAGCGCAGCGTATATGGACAGACATGGTGTAATTAGGATGAGGGTAGGAATTACAGGAAGTGCGATTGCCTTCTTTGAACTTCTACAAGCCATTTATTAATTGTATTCTTGAAGTTGATCATTTGTTATATAGTCTCTGACCATTGCGAGTGAGCAGAATTTTGTTCTTTCACTATCTTCCATGTAGTCCAAACCTTACCAGCAAGTGCTTGTTATATCTAGAACTCTTTCATGATTCTCTAATGCGTTCAAATCTCC

mRNA sequence

ATGGCCTTCTTCCTCCGCCTGCTGCAGCTTCTTATCTGCTGCGTCTCTTTCAAACAGGGCCTCTGTTTTTCTGCGACTCAGACCATGGTTTTGCCCCTCAAAACACAGATGGGTGTCACTTCTCGGCCTTCCAATAAGCTCAGTTTTCACCATAATGTCACTTTGACTGTTTCCTTAACGCTTGGCTCGCCTCCTCAACCCGTTACTATGGTTCTCGATACAGGGAGTGAACTCTCATGGCTTCACTGCAAAAAAACCCCAAATTTGAACTCTGTTTTTAACCCACTTTCTTCCTCTTCTTACTCGCCAGTCCCCTGTGCTTCCCCTGTTTGCCGGACCCGAACCCGAGATTTACCCAACCCGGTTACCTGCGACCCAAAGAAACTCTGCCACGTCTTTGTCTCTTATGCCGACGCCTCGTCGCTCGAGGGTAATCTCGCGTCGGATACGTTTCGAGTCGGGTCATCGGCTCAACCCGGAACTTTTTTTGGGTGTATGGATTCGGGTTTCAGTTCGAATTCGGAGGAGGACGCGAAGACCACTGGGCTGATGGGGATGAACAGGGGCTCGCTCTCGTTTGTCACCCAATTGGGTTTGCCCAAATTCTCTTATTGCATATCGGGTCGTGATTCTTCTGGGGTTCTGCTTTTCGGCGACGCGAGTCTTTCTTGGCTTGGGAATTTGACCTACACGCCTTTGGTTCAAATGTCTACGCCATTGCCGTATTACGACCGAGTCGCCTACACGGTCCAACTAGACGGAATCAGAGTAGGGAACAAAATTCTGGCACTCCCGAAGTCAATATTCGCACCAGACCACACCGGCGCCGGGCAAACCATGGTAGATTCAGGGACCCAGTTCACGTTTCTTCTGGGACCAGTGTACACGGCTTTAAAGAACGAGTTTGTGGTACAAACGAAGGGCATTTTGGTCCCACTGGGTGATCCAAATTTCGTGTTTCAAGGAGCGATGGACTTGTGCTACAGAGTACCCGAGAAACAGGGGAAACTGCCGCCACTGCCGGTAGTGAGTCTGATGTTCCGTGGGGCGGAGATGGTGGTTGGCGGAGAGGTGCTAATGTATAAAGTACCGGGAATGGTAAGGGGTGGTGACCAAGTGCATTGCTTGACGTTTGGGAATTCGGATTTGTTAGGAATAGAAGCATTTGTGATTGGGCATCATCATCAACAAAACGTGTGGATGGAATTCGACTTGGTGAAATCAAGGGTTGGATTTGTAGAGACGAGGTGTGATTTGGCGGCTGTCGCAACTACTGCAGACATGAGTTTTGTTTTCCGTGGAACAAGAGTTCCAGACATAGAAAATGGTCTATCTGGATTTATACCTGAACGGCGTGCAATGCGGGTTCATGCAGCACGTCCTGTTAACTCAAACTCACTTGCCTTTCTTGTCACAGTCCTTTTGTTGTTCATGATGTTAAATTCTCACCAGATGTCACCGAACTTTCTGCTCTGGCTTGTGCTTGGTGTGTTTTTGATGGCTACAACATTAAGGATGTATGCGACCTGTCAACAACTTCAGGCTCAAGCCCAAGCTAGAGCTATGGCAGCCAGTGGCCTTCTTGGGCACACTGAATTGAGGTTACATATGCCACCATCAATAGCACTTGCTACTAGAGGACGTTTACAAGGGCTAAGGCTTCAACTTGCTCTGCTTGATCGGGAATTTGATGATTTAGATTATGAAACTTTGAGAGCTTTGGATTCTGACAATGCTCCGACAACACCTTCTATGAGTGAGGAACAAATAAATGCTCTTCCTGTTCATAAATACAAGGTTTCTGGTCCTCAAAGCGACCCCTCTGTGAACCAGCAGGCTTCATCTTCAGAGTCTAATGAGAAGAGACAAGATTCAGCTAATGCAGTTGGCAGTACCAAGGCCTCGGAGGATGAACTTACGTGCAGTGTTTGCTTGGAGCAAGTAAATGTTGGTGAACTCTTCCATGCCAACTGCATAGATCCATGGCTGCGACAGCAGGGCACGTGCCCTGTTTGTAAATTCAGAGCGGTGTCTGGGTGGTCAGAACAGGGACAAGGGGAAACCGATGCGTATTCGGTTTGATTGAAGGCAAGGCAAGTTATCGAACTATGCTTGATGCCTGCTTAAAGCGCAGCGTATATGGACAGACATGGTGTAATTAGGATGAGGGTAGGAATTACAGGAAGTGCGATTGCCTTCTTTGAACTTCTACAAGCCATTTATTAATTGTATTCTTGAAGTTGATCATTTGTTATATAGTCTCTGACCATTGCGAGTGAGCAGAATTTTGTTCTTTCACTATCTTCCATGTAGTCCAAACCTTACCAGCAAGTGCTTGTTATATCTAGAACTCTTTCATGATTCTCTAATGCGTTCAAATCTCC

Coding sequence (CDS)

ATGGCCTTCTTCCTCCGCCTGCTGCAGCTTCTTATCTGCTGCGTCTCTTTCAAACAGGGCCTCTGTTTTTCTGCGACTCAGACCATGGTTTTGCCCCTCAAAACACAGATGGGTGTCACTTCTCGGCCTTCCAATAAGCTCAGTTTTCACCATAATGTCACTTTGACTGTTTCCTTAACGCTTGGCTCGCCTCCTCAACCCGTTACTATGGTTCTCGATACAGGGAGTGAACTCTCATGGCTTCACTGCAAAAAAACCCCAAATTTGAACTCTGTTTTTAACCCACTTTCTTCCTCTTCTTACTCGCCAGTCCCCTGTGCTTCCCCTGTTTGCCGGACCCGAACCCGAGATTTACCCAACCCGGTTACCTGCGACCCAAAGAAACTCTGCCACGTCTTTGTCTCTTATGCCGACGCCTCGTCGCTCGAGGGTAATCTCGCGTCGGATACGTTTCGAGTCGGGTCATCGGCTCAACCCGGAACTTTTTTTGGGTGTATGGATTCGGGTTTCAGTTCGAATTCGGAGGAGGACGCGAAGACCACTGGGCTGATGGGGATGAACAGGGGCTCGCTCTCGTTTGTCACCCAATTGGGTTTGCCCAAATTCTCTTATTGCATATCGGGTCGTGATTCTTCTGGGGTTCTGCTTTTCGGCGACGCGAGTCTTTCTTGGCTTGGGAATTTGACCTACACGCCTTTGGTTCAAATGTCTACGCCATTGCCGTATTACGACCGAGTCGCCTACACGGTCCAACTAGACGGAATCAGAGTAGGGAACAAAATTCTGGCACTCCCGAAGTCAATATTCGCACCAGACCACACCGGCGCCGGGCAAACCATGGTAGATTCAGGGACCCAGTTCACGTTTCTTCTGGGACCAGTGTACACGGCTTTAAAGAACGAGTTTGTGGTACAAACGAAGGGCATTTTGGTCCCACTGGGTGATCCAAATTTCGTGTTTCAAGGAGCGATGGACTTGTGCTACAGAGTACCCGAGAAACAGGGGAAACTGCCGCCACTGCCGGTAGTGAGTCTGATGTTCCGTGGGGCGGAGATGGTGGTTGGCGGAGAGGTGCTAATGTATAAAGTACCGGGAATGGTAAGGGGTGGTGACCAAGTGCATTGCTTGACGTTTGGGAATTCGGATTTGTTAGGAATAGAAGCATTTGTGATTGGGCATCATCATCAACAAAACGTGTGGATGGAATTCGACTTGGTGAAATCAAGGGTTGGATTTGTAGAGACGAGGTGTGATTTGGCGGCTGTCGCAACTACTGCAGACATGAGTTTTGTTTTCCGTGGAACAAGAGTTCCAGACATAGAAAATGGTCTATCTGGATTTATACCTGAACGGCGTGCAATGCGGGTTCATGCAGCACGTCCTGTTAACTCAAACTCACTTGCCTTTCTTGTCACAGTCCTTTTGTTGTTCATGATGTTAAATTCTCACCAGATGTCACCGAACTTTCTGCTCTGGCTTGTGCTTGGTGTGTTTTTGATGGCTACAACATTAAGGATGTATGCGACCTGTCAACAACTTCAGGCTCAAGCCCAAGCTAGAGCTATGGCAGCCAGTGGCCTTCTTGGGCACACTGAATTGAGGTTACATATGCCACCATCAATAGCACTTGCTACTAGAGGACGTTTACAAGGGCTAAGGCTTCAACTTGCTCTGCTTGATCGGGAATTTGATGATTTAGATTATGAAACTTTGAGAGCTTTGGATTCTGACAATGCTCCGACAACACCTTCTATGAGTGAGGAACAAATAAATGCTCTTCCTGTTCATAAATACAAGGTTTCTGGTCCTCAAAGCGACCCCTCTGTGAACCAGCAGGCTTCATCTTCAGAGTCTAATGAGAAGAGACAAGATTCAGCTAATGCAGTTGGCAGTACCAAGGCCTCGGAGGATGAACTTACGTGCAGTGTTTGCTTGGAGCAAGTAAATGTTGGTGAACTCTTCCATGCCAACTGCATAGATCCATGGCTGCGACAGCAGGGCACGTGCCCTGTTTGTAAATTCAGAGCGGTGTCTGGGTGGTCAGAACAGGGACAAGGGGAAACCGATGCGTATTCGGTTTGA

Protein sequence

MAFFLRLLQLLICCVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSPVPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDLAAVATTADMSFVFRGTRVPDIENGLSGFIPERRAMRVHAARPVNSNSLAFLVTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQAQAQARAMAASGLLGHTELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDNAPTTPSMSEEQINALPVHKYKVSGPQSDPSVNQQASSSESNEKRQDSANAVGSTKASEDELTCSVCLEQVNVGELFHANCIDPWLRQQGTCPVCKFRAVSGWSEQGQGETDAYSV
Homology
BLAST of CmoCh15G001070 vs. ExPASy Swiss-Prot
Match: Q9LZL3 (Aspartic proteinase PCS1 OS=Arabidopsis thaliana OX=3702 GN=PCS1 PE=2 SV=1)

HSP 1 Score: 521.2 bits (1341), Expect = 1.8e-146
Identity = 253/413 (61.26%), Postives = 315/413 (76.27%), Query Frame = 0

Query: 16  SFKQGLCFSATQTMVLPLKTQMGVTS-RPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDT 75
           SF      S++QT+VLPLKT++  T  RP++KL FHHNVTLTV+LT+G+PPQ ++MV+DT
Sbjct: 33  SFSSFSSSSSSQTLVLPLKTRITPTDHRPTDKLHFHHNVTLTVTLTVGTPPQNISMVIDT 92

Query: 76  GSELSWLHCKKTPNLNSV--FNPLSSSSYSPVPCASPVCRTRTRDLPNPVTCDPKKLCHV 135
           GSELSWL C ++ N N V  F+P  SSSYSP+PC+SP CRTRTRD   P +CD  KLCH 
Sbjct: 93  GSELSWLRCNRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIPASCDSDKLCHA 152

Query: 136 FVSYADASSLEGNLASDTFRVGSSAQPGT-FFGCMDSGFSSNSEEDAKTTGLMGMNRGSL 195
            +SYADASS EGNLA++ F  G+S       FGCM S   S+ EED KTTGL+GMNRGSL
Sbjct: 153 TLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSL 212

Query: 196 SFVTQLGLPKFSYCISGRDS-SGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTV 255
           SF++Q+G PKFSYCISG D   G LL GD++ +WL  L YTPL+++STPLPY+DRVAYTV
Sbjct: 213 SFISQMGFPKFSYCISGTDDFPGFLLLGDSNFTWLTPLNYTPLIRISTPLPYFDRVAYTV 272

Query: 256 QLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVVQTKGIL 315
           QL GI+V  K+L +PKS+  PDHTGAGQTMVDSGTQFTFLLGPVYTAL++ F+ +T GIL
Sbjct: 273 QLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGIL 332

Query: 316 VPLGDPNFVFQGAMDLCYRVPE---KQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMV 375
               DP+FVFQG MDLCYR+     + G L  LP VSL+F GAE+ V G+ L+Y+VP + 
Sbjct: 333 TVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLT 392

Query: 376 RGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDLA 421
            G D V+C TFGNSDL+G+EA+VIGHHHQQN+W+EFDL +SR+G     CD++
Sbjct: 393 VGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEFDLQRSRIGLAPVECDVS 445

BLAST of CmoCh15G001070 vs. ExPASy Swiss-Prot
Match: Q9M2S6 (E3 ubiquitin-protein ligase SDIR1 OS=Arabidopsis thaliana OX=3702 GN=SDIR1 PE=1 SV=1)

HSP 1 Score: 385.6 bits (989), Expect = 1.2e-105
Identity = 213/273 (78.02%), Postives = 230/273 (84.25%), Query Frame = 0

Query: 428 MSFVFRGTRVPDIENGLS-GFIPERRAMRVHAARPVNSNSLAFLVTVLLLFMMLNSHQMS 487
           MSFVFRG+R  D+E+G S GF+PERRAMRVH ARPVNSNSLAFLVTVLLLFM+LNSHQM 
Sbjct: 1   MSFVFRGSR-GDLESGFSGGFLPERRAMRVHGARPVNSNSLAFLVTVLLLFMILNSHQMP 60

Query: 488 PNFLLWLVLGVFLMATTLRMYATCQQLQAQAQARAMAASGLLGHTELRLHMPPSIALATR 547
           PNFLLWLVLGVFLMATTLRMYATCQQLQA AQA+A AASGL  HTELRLH+PPSIALATR
Sbjct: 61  PNFLLWLVLGVFLMATTLRMYATCQQLQAHAQAQAAAASGLFSHTELRLHVPPSIALATR 120

Query: 548 GRLQGLRLQLALLDREFDDLDYETLRALDSDNAPTTPSMSEEQINALPVHKYKVSGPQSD 607
           GRLQGLRLQLALLDREFDDLDYETLRALDSDN  TT SMSEE+INALPVHKYKV  P++ 
Sbjct: 121 GRLQGLRLQLALLDREFDDLDYETLRALDSDNVSTT-SMSEEEINALPVHKYKVLDPENG 180

Query: 608 PSVNQQASSSESNEKRQDSANAVGSTKASEDELTCSVCLEQVNVGEL---------FHAN 667
            S+ +QAS+S S EK  DSAN   S K +EDELTCSVCLEQV VGE+         FHA 
Sbjct: 181 CSLAKQASTSSSAEKMLDSANE--SKKGTEDELTCSVCLEQVTVGEIVRTLPCLHQFHAG 240

Query: 668 CIDPWLRQQGTCPVCKFRAVSGWSEQGQGETDA 691
           CIDPWLRQQGTCPVCKFRA SGW EQ + + DA
Sbjct: 241 CIDPWLRQQGTCPVCKFRAHSGWQEQDEIDDDA 269

BLAST of CmoCh15G001070 vs. ExPASy Swiss-Prot
Match: Q766C2 (Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 SV=1)

HSP 1 Score: 162.9 bits (411), Expect = 1.3e-38
Identity = 115/367 (31.34%), Postives = 174/367 (47.41%), Query Frame = 0

Query: 57  VSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNS----VFNPLSSSSYSPVPCASPVCR 116
           +++ +G+P    + ++DTGS+L W  C+      S    +FNP  SSS+S +PC S  C 
Sbjct: 98  MNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYC- 157

Query: 117 TRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSS 176
              +DLP+  TC+  + C     Y D S+ +G +A++TF   +S+ P   FGC   G  +
Sbjct: 158 ---QDLPSE-TCNNNE-CQYTYGYGDGSTTQGYMATETFTFETSSVPNIAFGC---GEDN 217

Query: 177 NSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCIS--GRDSSGVLLFGDASLSWLGNLTY 236
                    GL+GM  G LS  +QLG+ +FSYC++  G  S   L  G A+         
Sbjct: 218 QGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYGSSSPSTLALGSAASGVPEGSPS 277

Query: 237 TPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFL 296
           T L+  S    Y     Y + L GI VG   L +P S F     G G  ++DSGT  T+L
Sbjct: 278 TTLIHSSLNPTY-----YYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYL 337

Query: 297 LGPVYTALKNEFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGA 356
               Y A+   F   T  I +P  D +      +  C++ P   G    +P +S+ F G 
Sbjct: 338 PQDAYNAVAQAF---TDQINLPTVDES---SSGLSTCFQQP-SDGSTVQVPEISMQFDGG 397

Query: 357 EMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRV 416
            + +G + +      ++   + V CL  G+S  LGI  F  G+  QQ   + +DL    V
Sbjct: 398 VLNLGEQNI------LISPAEGVICLAMGSSSQLGISIF--GNIQQQETQVLYDLQNLAV 435

Query: 417 GFVETRC 418
            FV T+C
Sbjct: 458 SFVPTQC 435

BLAST of CmoCh15G001070 vs. ExPASy Swiss-Prot
Match: Q766C3 (Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 SV=1)

HSP 1 Score: 157.5 bits (397), Expect = 5.3e-37
Identity = 111/368 (30.16%), Postives = 182/368 (49.46%), Query Frame = 0

Query: 57  VSLTLGSPPQPVTMVLDTGSELSWLHCKKTPNL----NSVFNPLSSSSYSPVPCASPVCR 116
           ++L++G+P QP + ++DTGS+L W  C+           +FNP  SSS+S +PC+S +C+
Sbjct: 97  MNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQ 156

Query: 117 TRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSS 176
                L +P TC     C     Y D S  +G++ ++T   GS + P   FGC   G ++
Sbjct: 157 A----LSSP-TCS-NNFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGC---GENN 216

Query: 177 NSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCIS--GRDSSGVLLFGDASLSWLGNLTY 236
                    GL+GM RG LS  +QL + KFSYC++  G  +   LL G  + S       
Sbjct: 217 QGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSSTPSNLLLGSLANSVTAGSPN 276

Query: 237 TPLVQMSTPLPYYDRVAYTVQLDGIRVGNKILALPKSIFA-PDHTGAGQTMVDSGTQFTF 296
           T L+Q S+ +P +    Y + L+G+ VG+  L +  S FA   + G G  ++DSGT  T+
Sbjct: 277 TTLIQ-SSQIPTF----YYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTY 336

Query: 297 LLGPVYTALKNEFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRG 356
            +   Y +++ EF+ Q   I +P+ + +       DLC++ P     L  +P   + F G
Sbjct: 337 FVNNAYQSVRQEFISQ---INLPVVNGS---SSGFDLCFQTPSDPSNL-QIPTFVMHFDG 396

Query: 357 AEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSR 416
            ++ +  E         +   + + CL  G+S   G+  F  G+  QQN+ + +D   S 
Sbjct: 397 GDLELPSENY------FISPSNGLICLAMGSSS-QGMSIF--GNIQQQNMLVVYDTGNSV 434

Query: 417 VGFVETRC 418
           V F   +C
Sbjct: 457 VSFASAQC 434

BLAST of CmoCh15G001070 vs. ExPASy Swiss-Prot
Match: Q9LNJ3 (Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 SV=1)

HSP 1 Score: 138.3 bits (347), Expect = 3.4e-31
Identity = 118/370 (31.89%), Postives = 173/370 (46.76%), Query Frame = 0

Query: 59  LTLGSPPQPVTMVLDTGSELSWLHCKKTPNLNS----VFNPLSSSSYSPVPCASPVCRTR 118
           L +G+P + V MVLDTGS++ WL C       S    +F+P  S +Y+ +PC+SP CR  
Sbjct: 146 LGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRR- 205

Query: 119 TRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNS 178
              L +      +K C   VSY D S   G+ +++T     +   G   GC       N 
Sbjct: 206 ---LDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVALGC----GHDNE 265

Query: 179 EEDAKTTGLMGMNRGSLSFVTQLG---LPKFSYCISGRDSS---GVLLFGDASLSWLGNL 238
                  GL+G+ +G LSF  Q G     KFSYC+  R +S     ++FG+A++S +   
Sbjct: 266 GLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIAR- 325

Query: 239 TYTPLVQMSTPLPYYDRVAYTVQLDGIRV-GNKILALPKSIFAPDHTGAGQTMVDSGTQF 298
            +TPL+      P  D   Y V L GI V G ++  +  S+F  D  G G  ++DSGT  
Sbjct: 326 -FTPLLSN----PKLD-TFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSV 385

Query: 299 TFLLGPVYTALKNEFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMF 358
           T L+ P Y A+++ F V  K +      P+F      D C+ +         +P V L F
Sbjct: 386 TRLIRPAYIAMRDAFRVGAKTL---KRAPDF---SLFDTCFDLSNMNE--VKVPTVVLHF 445

Query: 359 RGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVK 418
           RGA+  V      Y +P    G     C  F  + + G+   +IG+  QQ   + +DL  
Sbjct: 446 RGAD--VSLPATNYLIPVDTNG---KFCFAFAGT-MGGLS--IIGNIQQQGFRVVYDLAS 484

BLAST of CmoCh15G001070 vs. ExPASy TrEMBL
Match: A0A7J6E7N2 (Uncharacterized protein OS=Cannabis sativa OX=3483 GN=G4B88_019942 PE=3 SV=1)

HSP 1 Score: 983.0 bits (2540), Expect = 6.3e-283
Identity = 519/784 (66.20%), Postives = 588/784 (75.00%), Query Frame = 0

Query: 5   LRLLQLLICCVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLTLGSP 64
           L+L+ +++C    +     S+  T++LPLK Q    ++PS+KLSFHHNVTLTV+LT+GSP
Sbjct: 8   LQLITIIVCNFITQIISSSSSMDTLILPLKLQ--TVTKPSSKLSFHHNVTLTVTLTVGSP 67

Query: 65  PQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSPVPCASPVCRTRTRDLPNPVTC 124
           PQ VTMVLDTGSELSWLHCKK  N+NSVFNPL+SSSYSPVPC+S +CRT+TRD   PV+C
Sbjct: 68  PQTVTMVLDTGSELSWLHCKKAQNINSVFNPLASSSYSPVPCSSSICRTQTRDFTIPVSC 127

Query: 125 DPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLM 184
           DPKKLCH  +SYADASS+EGNLAS+TF +GSS +P T FGCMDSGFSSNSEED+KTTGL+
Sbjct: 128 DPKKLCHAMLSYADASSIEGNLASETFNIGSSPRPRTIFGCMDSGFSSNSEEDSKTTGLI 187

Query: 185 GMNRGSLSFVTQLGLPKFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYD 244
           GMNRGSLSFV+Q+GL KFSYCISGRDSSG +LFG+AS +WLG L YTPLV+MS PLPYYD
Sbjct: 188 GMNRGSLSFVSQMGLAKFSYCISGRDSSGFILFGEASFAWLGPLKYTPLVKMSQPLPYYD 247

Query: 245 RVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVV 304
           RVAYTVQL GI+V NK+L L KS+F PDHTGAGQTMVDSGTQFTFLLGPVYTAL+NEF  
Sbjct: 248 RVAYTVQLLGIKVSNKLLQLSKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYTALRNEFAQ 307

Query: 305 QTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVP 364
           QTK +   L D NFVFQGAMDLCY++P  +      P V+L+F+GAEM V G+ L+Y+VP
Sbjct: 308 QTKSLAPLLKDQNFVFQGAMDLCYQIPPTRRSFSDFPAVTLIFQGAEMSVSGDKLLYRVP 367

Query: 365 GMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDLAA--- 424
           GM +G D V+C TFGNSDLLGIEAFVIGHHHQQNVW+EFDL KSRVG  E RCDLA+   
Sbjct: 368 GMRKGSDSVYCFTFGNSDLLGIEAFVIGHHHQQNVWIEFDLAKSRVGLAEVRCDLASQRL 427

Query: 425 ---------------------------------VATTA---------------------- 484
                                            VA  A                      
Sbjct: 428 GGLVVLLVLRGGTFDVVGMATGVLVAEEVVGVFVADVAAGVFDISIGLCIYQMISHCEHS 487

Query: 485 -------------------------------DMSFVFRGTRVPDIENGLSGFIPERRAMR 544
                                           MSFVFRGTR  DIE+G +GF+PERR MR
Sbjct: 488 NSSSSQYRPLTTLRIEDFENAVLPRIPDRITAMSFVFRGTR-SDIESGFTGFVPERRPMR 547

Query: 545 VHAARPVNSNSLAFLVTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQA 604
           +H+ARPVNSNSLAFLVTVLLLFM+LNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQA
Sbjct: 548 LHSARPVNSNSLAFLVTVLLLFMILNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQA 607

Query: 605 QAQARAMAASGLLGHTELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALD 664
           QAQA A AA GLLGHTELRLH+PPSI+LATRGRLQGLRLQLALLDREFDDLDYETLRALD
Sbjct: 608 QAQAHAAAAGGLLGHTELRLHIPPSISLATRGRLQGLRLQLALLDREFDDLDYETLRALD 667

Query: 665 SDNAPTTPSMSEEQINALPVHKYKVSGPQSDPSVNQQASSSESNEKRQDSANAVGSTKAS 691
           +DNAPT  SM+EE+INALPVHKYKV+  Q+  S  QQASSS S EK Q S +AVG+ KAS
Sbjct: 668 ADNAPTAHSMTEEEINALPVHKYKVADLQNSASSMQQASSSASAEK-QGSNDAVGNAKAS 727

BLAST of CmoCh15G001070 vs. ExPASy TrEMBL
Match: A0A7J6E2Q5 (Uncharacterized protein OS=Cannabis sativa OX=3483 GN=G4B88_020587 PE=3 SV=1)

HSP 1 Score: 951.8 bits (2459), Expect = 1.6e-273
Identity = 519/854 (60.77%), Postives = 585/854 (68.50%), Query Frame = 0

Query: 5   LRLLQLLIC--CVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLTLG 64
           L+L+ +++C            S+  T++LPLK Q    ++PS+KLSFHHNVTLTV+LT+G
Sbjct: 8   LQLITIIVCNLITQIISSSSSSSMDTLILPLKLQ--TVTKPSSKLSFHHNVTLTVTLTVG 67

Query: 65  SPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSPVPCASPVCRTRTRDLPNPV 124
           SPPQ VTMVLDTGSELSWLHCKK  N+NSVFNPL+SSSYSPVPC+S +CRT+TRD   PV
Sbjct: 68  SPPQTVTMVLDTGSELSWLHCKKAQNINSVFNPLASSSYSPVPCSSSICRTQTRDFTIPV 127

Query: 125 TCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTG 184
           +CDPKKLCH  +SYADASS+EGNLAS+TF +GSS +P T FGCMDSGFSSNSEED+KTTG
Sbjct: 128 SCDPKKLCHAMLSYADASSIEGNLASETFNIGSSPRPRTIFGCMDSGFSSNSEEDSKTTG 187

Query: 185 LMGMNRGSLSFVTQLGLPKFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPY 244
           L+GMNRGSLSFV+Q+GL KFSYCISGRDSSG +LFG+AS +WLG L YTPLV+MS PLPY
Sbjct: 188 LIGMNRGSLSFVSQMGLAKFSYCISGRDSSGFILFGEASFAWLGPLKYTPLVKMSQPLPY 247

Query: 245 YDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEF 304
           YDRVAYTVQL GI+V NK+L L KS+F PDHTGAGQTMVDSGTQFTFLLGPVYTAL+NEF
Sbjct: 248 YDRVAYTVQLLGIKVSNKLLQLSKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYTALRNEF 307

Query: 305 VVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYK 364
             QTK +   L D NFVFQGAMDLCY++P  +      P V+L+F+GAEM V G+ L+Y+
Sbjct: 308 AQQTKSLAPLLKDQNFVFQGAMDLCYQIPPTRRSFSDFPAVTLIFQGAEMSVSGDKLLYR 367

Query: 365 VPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDLAA- 424
           VPGM +G D V+C TFGNSDLLGIEAFVIGHHHQQNVW+EFDL KSRVG  E RCDLA+ 
Sbjct: 368 VPGMRKGSDSVYCFTFGNSDLLGIEAFVIGHHHQQNVWIEFDLAKSRVGLAEVRCDLASQ 427

Query: 425 -----------------------------------------VATTA-------------- 484
                                                    V  T               
Sbjct: 428 RLGKGEVLVVVMVAAVVISVGRGSETKTGACNWLVITVSGCVGATGVGIAAVVVAGEVFG 487

Query: 485 ------------------------------------------------------------ 544
                                                                       
Sbjct: 488 QKVLTIELEGEGRGKRTRKWIPTRRAQNIQPKIQPNNKVGVKNQISHCEHSNSSSSQYRP 547

Query: 545 ---------------------------------------DMSFVFRGTRVPDIENGLSGF 604
                                                   MSFVFRGTR  DIE+G +GF
Sbjct: 548 LTTLRIEDFENAVLPRIPDRITGCFDCLFSDFSSSRFMRSMSFVFRGTR-SDIESGFTGF 607

Query: 605 IPERRAMRVHAARPVNSNSLAFLVTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMY 664
           +PERR MR+H+ARPVNSNSLAFLVTVLLLFM+LNSHQMSPNFLLWLVLGVFLMATTLRMY
Sbjct: 608 VPERRPMRLHSARPVNSNSLAFLVTVLLLFMILNSHQMSPNFLLWLVLGVFLMATTLRMY 667

Query: 665 ATCQQLQAQAQARAMAASGLLGHTELRLHMPPSIALATRGRLQGLRLQLALLDREFDDL- 691
           ATCQQLQAQAQA A AA GLLGHTELRLH+PPSI+LATRGRLQGLRLQLALLDREFDDL 
Sbjct: 668 ATCQQLQAQAQAHAAAAGGLLGHTELRLHIPPSISLATRGRLQGLRLQLALLDREFDDLV 727

BLAST of CmoCh15G001070 vs. ExPASy TrEMBL
Match: A0A498KN45 (Uncharacterized protein OS=Malus domestica OX=3750 GN=DVH24_023080 PE=3 SV=1)

HSP 1 Score: 919.1 bits (2374), Expect = 1.1e-263
Identity = 504/769 (65.54%), Postives = 557/769 (72.43%), Query Frame = 0

Query: 5   LRLLQLLICCVSFKQGLCFS--ATQTMVLPLKTQM---GVTSRPSNKLSFHHNVTLTVSL 64
           L LLQLL         LCFS    +T++LPLKTQ    G   + +NKLSFHHNVTLT+SL
Sbjct: 7   LLLLQLLTT-------LCFSEPKPETLILPLKTQTLPHGSLPKSTNKLSFHHNVTLTISL 66

Query: 65  TLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSPVPCASPVCRTRTRDLP 124
           ++GSPPQ VTMVLDTGSELSWL CKK PN NSVFNPL+S SYSPVPC+SPVCRTRTRD P
Sbjct: 67  SVGSPPQQVTMVLDTGSELSWLRCKKAPNFNSVFNPLASKSYSPVPCSSPVCRTRTRDFP 126

Query: 125 NPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAK 184
            PV+CDPKKLCH  +SY DASS+EGNLA +TF +GSSAQPGT FGCMDSG SSN+EEDAK
Sbjct: 127 TPVSCDPKKLCHSILSYVDASSIEGNLAWETFNLGSSAQPGTIFGCMDSGSSSNAEEDAK 186

Query: 185 TTGLMGMNRGSLSFVTQLGLPKFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTP 244
           TTGLMGMNRGSLSFVTQ+G PKFSYCISGRDSSGVLLFG+A   WL  L YTPLV +STP
Sbjct: 187 TTGLMGMNRGSLSFVTQMGFPKFSYCISGRDSSGVLLFGEAKFDWLKPLNYTPLVHISTP 246

Query: 245 LPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALK 304
           LPY+DRVAYTVQL+GIRVG K+L LPKS+F PDH+GAGQTMVDSGTQFTFLLGPVYTALK
Sbjct: 247 LPYFDRVAYTVQLEGIRVGGKLLPLPKSVFVPDHSGAGQTMVDSGTQFTFLLGPVYTALK 306

Query: 305 NEFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVL 364
            EF  QTK +L  L DPNFVFQGA+DLC++VP  +  LP LP V+LMFRGAEM V GE L
Sbjct: 307 KEFTQQTKPVLNILNDPNFVFQGAIDLCFQVPTNRPSLPALPTVTLMFRGAEMSVSGERL 366

Query: 365 MYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVE----- 424
           +Y+VPGMVRGG+QV+C T+GNSDLLGIEAFVIGH+HQQNVWMEFDL KSRVG  E     
Sbjct: 367 LYRVPGMVRGGNQVYCFTYGNSDLLGIEAFVIGHYHQQNVWMEFDLEKSRVGVAEEGVVS 426

Query: 425 ---------------------------------------------------TRCDLAAVA 484
                                                               R     + 
Sbjct: 427 ISQDSNSDGDFGLSVGPRVHAAKRVRVDSNLCTLQLAPFLPPKKRLHGVVVKRRRFIRLL 486

Query: 485 TTADMSFVFRGTRVPDIENGLSGFIPERRAMRVHAARPVNSNSL-------------AFL 544
            + +MSFVFRGTR  DIE+G   FIPERRAM +    P N + L             A  
Sbjct: 487 NSTNMSFVFRGTRA-DIESGFPEFIPERRAMLLIPFGP-NFHLLPLESKGGGVSIHHALS 546

Query: 545 VTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQAQAQARAMAASGLLGH 604
           +   LLF+            LWLVLGVFLMATTLRMYATCQQLQAQAQ  A AASGLLGH
Sbjct: 547 IPTHLLFLS----------QLWLVLGVFLMATTLRMYATCQQLQAQAQVHAAAASGLLGH 606

Query: 605 TELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDNAPTTPSMSEEQI 664
           TELRL MPPSI+LATRGRLQGLRLQLALLDREFDDLDYETLRALDSDN P   SMSEE+I
Sbjct: 607 TELRLRMPPSISLATRGRLQGLRLQLALLDREFDDLDYETLRALDSDNVPAASSMSEEEI 666

Query: 665 NALPVHKYKVSGPQSDPSVNQQASSSESNEKRQDSANAVGSTKASEDELTCSVCLEQVNV 691
           NALPVHKYK  GPQ+  S  QQASSS  +E  Q++ +AVGSTKA EDELTCSVCLEQV V
Sbjct: 667 NALPVHKYKAVGPQNGASSMQQASSSVPSE-TQETVDAVGSTKAMEDELTCSVCLEQVTV 726

BLAST of CmoCh15G001070 vs. ExPASy TrEMBL
Match: A0A5J5BAH9 (Peptidase A1 domain-containing protein OS=Nyssa sinensis OX=561372 GN=F0562_026172 PE=3 SV=1)

HSP 1 Score: 917.5 bits (2370), Expect = 3.3e-263
Identity = 467/624 (74.84%), Postives = 518/624 (83.01%), Query Frame = 0

Query: 7   LLQLLICCVSFKQGLCFSATQTMVLPLKTQM---GVTSRPSNKLSFHHNVTLTVSLTLGS 66
           LLQLL+ C+  +   C S+T T++LPLKT +   G   +P NKLSFHHNV+LTV+LT+G+
Sbjct: 10  LLQLLVFCIFIQSNPCNSSTPTVILPLKTSLISSGSLPKPPNKLSFHHNVSLTVTLTVGT 69

Query: 67  PPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSPVPCASPVCRTRTRDLPNPVT 126
           PPQPVTMV+DTGSELSWL+CKKTPN  S+F+PL SSSYSP+PC+SP CRTRTRD   PV+
Sbjct: 70  PPQPVTMVIDTGSELSWLYCKKTPNTPSIFDPLRSSSYSPIPCSSPTCRTRTRDFSIPVS 129

Query: 127 CDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGL 186
           CDPKKLCH  +SYADASS+EGNLASDTF +G+S  PGT FG MDSG SSN EED+KTTGL
Sbjct: 130 CDPKKLCHATLSYADASSVEGNLASDTFHLGNSGLPGTVFGSMDSGSSSNPEEDSKTTGL 189

Query: 187 MGMNRGSLSFVTQLGLPKFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYY 246
           +GMNRGSLSFVTQ+  PKFSYCISGRDSSG+LLFG+AS  WL  L YTPLVQ+STPLPY+
Sbjct: 190 IGMNRGSLSFVTQMDFPKFSYCISGRDSSGILLFGEASFPWLQPLNYTPLVQISTPLPYF 249

Query: 247 DRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFV 306
           DRVAYTVQL+GI+V  K+LA+PKS+  PDHTGAGQTMVDSGTQFTFLLGPVYT L+NEF+
Sbjct: 250 DRVAYTVQLEGIKVSGKVLAIPKSVLVPDHTGAGQTMVDSGTQFTFLLGPVYTTLRNEFM 309

Query: 307 VQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKV 366
            QTKG+L  L DPNFVFQGAMDLCYRV   +  LPPLP VSLMFRGAEM V GE LMY+V
Sbjct: 310 QQTKGVLRVLDDPNFVFQGAMDLCYRVESTRTSLPPLPTVSLMFRGAEMSVSGERLMYRV 369

Query: 367 PGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVET-------R 426
           PG  RG D V+C TFGNSDLLGIEA+VIGHHHQQN+WMEFDLVKSRVG  E        R
Sbjct: 370 PGERRGSDSVYCFTFGNSDLLGIEAYVIGHHHQQNMWMEFDLVKSRVGLAEVRLLKFIRR 429

Query: 427 CDLAA--------------VATTADMSFVFRGTRVPDIENGLSGFIPERRAMRVHAARPV 486
            DL+                     MSFVFRGTR  DIE G  GFIPERRAMRVHAARPV
Sbjct: 430 IDLSQWGLLQRVILLLVKFFVKQKTMSFVFRGTRA-DIETGFPGFIPERRAMRVHAARPV 489

Query: 487 NSNSLAFLVTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQAQAQAR-- 546
           NSNSLAFLVTVLLLFM+LNSHQM PNFLLWLV G+FLMAT+LRMYATCQQLQAQAQA+  
Sbjct: 490 NSNSLAFLVTVLLLFMILNSHQMPPNFLLWLVFGIFLMATSLRMYATCQQLQAQAQAQAH 549

Query: 547 AMAASGLLGHTELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDNAP 605
           A AASGLLGHTELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDN P
Sbjct: 550 AAAASGLLGHTELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDNVP 609

BLAST of CmoCh15G001070 vs. ExPASy TrEMBL
Match: A0A6J1FDS6 (aspartic proteinase PCS1-like OS=Cucurbita moschata OX=3662 GN=LOC111444820 PE=3 SV=1)

HSP 1 Score: 854.7 bits (2207), Expect = 2.6e-244
Identity = 420/420 (100.00%), Postives = 420/420 (100.00%), Query Frame = 0

Query: 1   MAFFLRLLQLLICCVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLT 60
           MAFFLRLLQLLICCVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLT
Sbjct: 1   MAFFLRLLQLLICCVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLT 60

Query: 61  LGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSPVPCASPVCRTRTRDLPN 120
           LGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSPVPCASPVCRTRTRDLPN
Sbjct: 61  LGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSPVPCASPVCRTRTRDLPN 120

Query: 121 PVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKT 180
           PVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKT
Sbjct: 121 PVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKT 180

Query: 181 TGLMGMNRGSLSFVTQLGLPKFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPL 240
           TGLMGMNRGSLSFVTQLGLPKFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPL
Sbjct: 181 TGLMGMNRGSLSFVTQLGLPKFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPL 240

Query: 241 PYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKN 300
           PYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKN
Sbjct: 241 PYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKN 300

Query: 301 EFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLM 360
           EFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLM
Sbjct: 301 EFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLM 360

Query: 361 YKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDLA 420
           YKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDLA
Sbjct: 361 YKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDLA 420

BLAST of CmoCh15G001070 vs. NCBI nr
Match: KAF4354473.1 (hypothetical protein G4B88_019942 [Cannabis sativa])

HSP 1 Score: 983.0 bits (2540), Expect = 1.3e-282
Identity = 519/784 (66.20%), Postives = 588/784 (75.00%), Query Frame = 0

Query: 5   LRLLQLLICCVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLTLGSP 64
           L+L+ +++C    +     S+  T++LPLK Q    ++PS+KLSFHHNVTLTV+LT+GSP
Sbjct: 8   LQLITIIVCNFITQIISSSSSMDTLILPLKLQ--TVTKPSSKLSFHHNVTLTVTLTVGSP 67

Query: 65  PQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSPVPCASPVCRTRTRDLPNPVTC 124
           PQ VTMVLDTGSELSWLHCKK  N+NSVFNPL+SSSYSPVPC+S +CRT+TRD   PV+C
Sbjct: 68  PQTVTMVLDTGSELSWLHCKKAQNINSVFNPLASSSYSPVPCSSSICRTQTRDFTIPVSC 127

Query: 125 DPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGLM 184
           DPKKLCH  +SYADASS+EGNLAS+TF +GSS +P T FGCMDSGFSSNSEED+KTTGL+
Sbjct: 128 DPKKLCHAMLSYADASSIEGNLASETFNIGSSPRPRTIFGCMDSGFSSNSEEDSKTTGLI 187

Query: 185 GMNRGSLSFVTQLGLPKFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYD 244
           GMNRGSLSFV+Q+GL KFSYCISGRDSSG +LFG+AS +WLG L YTPLV+MS PLPYYD
Sbjct: 188 GMNRGSLSFVSQMGLAKFSYCISGRDSSGFILFGEASFAWLGPLKYTPLVKMSQPLPYYD 247

Query: 245 RVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVV 304
           RVAYTVQL GI+V NK+L L KS+F PDHTGAGQTMVDSGTQFTFLLGPVYTAL+NEF  
Sbjct: 248 RVAYTVQLLGIKVSNKLLQLSKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYTALRNEFAQ 307

Query: 305 QTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVP 364
           QTK +   L D NFVFQGAMDLCY++P  +      P V+L+F+GAEM V G+ L+Y+VP
Sbjct: 308 QTKSLAPLLKDQNFVFQGAMDLCYQIPPTRRSFSDFPAVTLIFQGAEMSVSGDKLLYRVP 367

Query: 365 GMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDLAA--- 424
           GM +G D V+C TFGNSDLLGIEAFVIGHHHQQNVW+EFDL KSRVG  E RCDLA+   
Sbjct: 368 GMRKGSDSVYCFTFGNSDLLGIEAFVIGHHHQQNVWIEFDLAKSRVGLAEVRCDLASQRL 427

Query: 425 ---------------------------------VATTA---------------------- 484
                                            VA  A                      
Sbjct: 428 GGLVVLLVLRGGTFDVVGMATGVLVAEEVVGVFVADVAAGVFDISIGLCIYQMISHCEHS 487

Query: 485 -------------------------------DMSFVFRGTRVPDIENGLSGFIPERRAMR 544
                                           MSFVFRGTR  DIE+G +GF+PERR MR
Sbjct: 488 NSSSSQYRPLTTLRIEDFENAVLPRIPDRITAMSFVFRGTR-SDIESGFTGFVPERRPMR 547

Query: 545 VHAARPVNSNSLAFLVTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQA 604
           +H+ARPVNSNSLAFLVTVLLLFM+LNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQA
Sbjct: 548 LHSARPVNSNSLAFLVTVLLLFMILNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQA 607

Query: 605 QAQARAMAASGLLGHTELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALD 664
           QAQA A AA GLLGHTELRLH+PPSI+LATRGRLQGLRLQLALLDREFDDLDYETLRALD
Sbjct: 608 QAQAHAAAAGGLLGHTELRLHIPPSISLATRGRLQGLRLQLALLDREFDDLDYETLRALD 667

Query: 665 SDNAPTTPSMSEEQINALPVHKYKVSGPQSDPSVNQQASSSESNEKRQDSANAVGSTKAS 691
           +DNAPT  SM+EE+INALPVHKYKV+  Q+  S  QQASSS S EK Q S +AVG+ KAS
Sbjct: 668 ADNAPTAHSMTEEEINALPVHKYKVADLQNSASSMQQASSSASAEK-QGSNDAVGNAKAS 727

BLAST of CmoCh15G001070 vs. NCBI nr
Match: KAF4351959.1 (hypothetical protein G4B88_020587 [Cannabis sativa])

HSP 1 Score: 951.8 bits (2459), Expect = 3.2e-273
Identity = 519/854 (60.77%), Postives = 585/854 (68.50%), Query Frame = 0

Query: 5   LRLLQLLIC--CVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLTLG 64
           L+L+ +++C            S+  T++LPLK Q    ++PS+KLSFHHNVTLTV+LT+G
Sbjct: 8   LQLITIIVCNLITQIISSSSSSSMDTLILPLKLQ--TVTKPSSKLSFHHNVTLTVTLTVG 67

Query: 65  SPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSPVPCASPVCRTRTRDLPNPV 124
           SPPQ VTMVLDTGSELSWLHCKK  N+NSVFNPL+SSSYSPVPC+S +CRT+TRD   PV
Sbjct: 68  SPPQTVTMVLDTGSELSWLHCKKAQNINSVFNPLASSSYSPVPCSSSICRTQTRDFTIPV 127

Query: 125 TCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTG 184
           +CDPKKLCH  +SYADASS+EGNLAS+TF +GSS +P T FGCMDSGFSSNSEED+KTTG
Sbjct: 128 SCDPKKLCHAMLSYADASSIEGNLASETFNIGSSPRPRTIFGCMDSGFSSNSEEDSKTTG 187

Query: 185 LMGMNRGSLSFVTQLGLPKFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPY 244
           L+GMNRGSLSFV+Q+GL KFSYCISGRDSSG +LFG+AS +WLG L YTPLV+MS PLPY
Sbjct: 188 LIGMNRGSLSFVSQMGLAKFSYCISGRDSSGFILFGEASFAWLGPLKYTPLVKMSQPLPY 247

Query: 245 YDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEF 304
           YDRVAYTVQL GI+V NK+L L KS+F PDHTGAGQTMVDSGTQFTFLLGPVYTAL+NEF
Sbjct: 248 YDRVAYTVQLLGIKVSNKLLQLSKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYTALRNEF 307

Query: 305 VVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYK 364
             QTK +   L D NFVFQGAMDLCY++P  +      P V+L+F+GAEM V G+ L+Y+
Sbjct: 308 AQQTKSLAPLLKDQNFVFQGAMDLCYQIPPTRRSFSDFPAVTLIFQGAEMSVSGDKLLYR 367

Query: 365 VPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDLAA- 424
           VPGM +G D V+C TFGNSDLLGIEAFVIGHHHQQNVW+EFDL KSRVG  E RCDLA+ 
Sbjct: 368 VPGMRKGSDSVYCFTFGNSDLLGIEAFVIGHHHQQNVWIEFDLAKSRVGLAEVRCDLASQ 427

Query: 425 -----------------------------------------VATTA-------------- 484
                                                    V  T               
Sbjct: 428 RLGKGEVLVVVMVAAVVISVGRGSETKTGACNWLVITVSGCVGATGVGIAAVVVAGEVFG 487

Query: 485 ------------------------------------------------------------ 544
                                                                       
Sbjct: 488 QKVLTIELEGEGRGKRTRKWIPTRRAQNIQPKIQPNNKVGVKNQISHCEHSNSSSSQYRP 547

Query: 545 ---------------------------------------DMSFVFRGTRVPDIENGLSGF 604
                                                   MSFVFRGTR  DIE+G +GF
Sbjct: 548 LTTLRIEDFENAVLPRIPDRITGCFDCLFSDFSSSRFMRSMSFVFRGTR-SDIESGFTGF 607

Query: 605 IPERRAMRVHAARPVNSNSLAFLVTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMY 664
           +PERR MR+H+ARPVNSNSLAFLVTVLLLFM+LNSHQMSPNFLLWLVLGVFLMATTLRMY
Sbjct: 608 VPERRPMRLHSARPVNSNSLAFLVTVLLLFMILNSHQMSPNFLLWLVLGVFLMATTLRMY 667

Query: 665 ATCQQLQAQAQARAMAASGLLGHTELRLHMPPSIALATRGRLQGLRLQLALLDREFDDL- 691
           ATCQQLQAQAQA A AA GLLGHTELRLH+PPSI+LATRGRLQGLRLQLALLDREFDDL 
Sbjct: 668 ATCQQLQAQAQAHAAAAGGLLGHTELRLHIPPSISLATRGRLQGLRLQLALLDREFDDLV 727

BLAST of CmoCh15G001070 vs. NCBI nr
Match: RXI08936.1 (hypothetical protein DVH24_023080 [Malus domestica])

HSP 1 Score: 919.1 bits (2374), Expect = 2.3e-263
Identity = 504/769 (65.54%), Postives = 557/769 (72.43%), Query Frame = 0

Query: 5   LRLLQLLICCVSFKQGLCFS--ATQTMVLPLKTQM---GVTSRPSNKLSFHHNVTLTVSL 64
           L LLQLL         LCFS    +T++LPLKTQ    G   + +NKLSFHHNVTLT+SL
Sbjct: 7   LLLLQLLTT-------LCFSEPKPETLILPLKTQTLPHGSLPKSTNKLSFHHNVTLTISL 66

Query: 65  TLGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSPVPCASPVCRTRTRDLP 124
           ++GSPPQ VTMVLDTGSELSWL CKK PN NSVFNPL+S SYSPVPC+SPVCRTRTRD P
Sbjct: 67  SVGSPPQQVTMVLDTGSELSWLRCKKAPNFNSVFNPLASKSYSPVPCSSPVCRTRTRDFP 126

Query: 125 NPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAK 184
            PV+CDPKKLCH  +SY DASS+EGNLA +TF +GSSAQPGT FGCMDSG SSN+EEDAK
Sbjct: 127 TPVSCDPKKLCHSILSYVDASSIEGNLAWETFNLGSSAQPGTIFGCMDSGSSSNAEEDAK 186

Query: 185 TTGLMGMNRGSLSFVTQLGLPKFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTP 244
           TTGLMGMNRGSLSFVTQ+G PKFSYCISGRDSSGVLLFG+A   WL  L YTPLV +STP
Sbjct: 187 TTGLMGMNRGSLSFVTQMGFPKFSYCISGRDSSGVLLFGEAKFDWLKPLNYTPLVHISTP 246

Query: 245 LPYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALK 304
           LPY+DRVAYTVQL+GIRVG K+L LPKS+F PDH+GAGQTMVDSGTQFTFLLGPVYTALK
Sbjct: 247 LPYFDRVAYTVQLEGIRVGGKLLPLPKSVFVPDHSGAGQTMVDSGTQFTFLLGPVYTALK 306

Query: 305 NEFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVL 364
            EF  QTK +L  L DPNFVFQGA+DLC++VP  +  LP LP V+LMFRGAEM V GE L
Sbjct: 307 KEFTQQTKPVLNILNDPNFVFQGAIDLCFQVPTNRPSLPALPTVTLMFRGAEMSVSGERL 366

Query: 365 MYKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVE----- 424
           +Y+VPGMVRGG+QV+C T+GNSDLLGIEAFVIGH+HQQNVWMEFDL KSRVG  E     
Sbjct: 367 LYRVPGMVRGGNQVYCFTYGNSDLLGIEAFVIGHYHQQNVWMEFDLEKSRVGVAEEGVVS 426

Query: 425 ---------------------------------------------------TRCDLAAVA 484
                                                               R     + 
Sbjct: 427 ISQDSNSDGDFGLSVGPRVHAAKRVRVDSNLCTLQLAPFLPPKKRLHGVVVKRRRFIRLL 486

Query: 485 TTADMSFVFRGTRVPDIENGLSGFIPERRAMRVHAARPVNSNSL-------------AFL 544
            + +MSFVFRGTR  DIE+G   FIPERRAM +    P N + L             A  
Sbjct: 487 NSTNMSFVFRGTRA-DIESGFPEFIPERRAMLLIPFGP-NFHLLPLESKGGGVSIHHALS 546

Query: 545 VTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQAQAQARAMAASGLLGH 604
           +   LLF+            LWLVLGVFLMATTLRMYATCQQLQAQAQ  A AASGLLGH
Sbjct: 547 IPTHLLFLS----------QLWLVLGVFLMATTLRMYATCQQLQAQAQVHAAAASGLLGH 606

Query: 605 TELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDNAPTTPSMSEEQI 664
           TELRL MPPSI+LATRGRLQGLRLQLALLDREFDDLDYETLRALDSDN P   SMSEE+I
Sbjct: 607 TELRLRMPPSISLATRGRLQGLRLQLALLDREFDDLDYETLRALDSDNVPAASSMSEEEI 666

Query: 665 NALPVHKYKVSGPQSDPSVNQQASSSESNEKRQDSANAVGSTKASEDELTCSVCLEQVNV 691
           NALPVHKYK  GPQ+  S  QQASSS  +E  Q++ +AVGSTKA EDELTCSVCLEQV V
Sbjct: 667 NALPVHKYKAVGPQNGASSMQQASSSVPSE-TQETVDAVGSTKAMEDELTCSVCLEQVTV 726

BLAST of CmoCh15G001070 vs. NCBI nr
Match: KAA8539480.1 (hypothetical protein F0562_026172 [Nyssa sinensis])

HSP 1 Score: 917.5 bits (2370), Expect = 6.7e-263
Identity = 467/624 (74.84%), Postives = 518/624 (83.01%), Query Frame = 0

Query: 7   LLQLLICCVSFKQGLCFSATQTMVLPLKTQM---GVTSRPSNKLSFHHNVTLTVSLTLGS 66
           LLQLL+ C+  +   C S+T T++LPLKT +   G   +P NKLSFHHNV+LTV+LT+G+
Sbjct: 10  LLQLLVFCIFIQSNPCNSSTPTVILPLKTSLISSGSLPKPPNKLSFHHNVSLTVTLTVGT 69

Query: 67  PPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSPVPCASPVCRTRTRDLPNPVT 126
           PPQPVTMV+DTGSELSWL+CKKTPN  S+F+PL SSSYSP+PC+SP CRTRTRD   PV+
Sbjct: 70  PPQPVTMVIDTGSELSWLYCKKTPNTPSIFDPLRSSSYSPIPCSSPTCRTRTRDFSIPVS 129

Query: 127 CDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKTTGL 186
           CDPKKLCH  +SYADASS+EGNLASDTF +G+S  PGT FG MDSG SSN EED+KTTGL
Sbjct: 130 CDPKKLCHATLSYADASSVEGNLASDTFHLGNSGLPGTVFGSMDSGSSSNPEEDSKTTGL 189

Query: 187 MGMNRGSLSFVTQLGLPKFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPLPYY 246
           +GMNRGSLSFVTQ+  PKFSYCISGRDSSG+LLFG+AS  WL  L YTPLVQ+STPLPY+
Sbjct: 190 IGMNRGSLSFVTQMDFPKFSYCISGRDSSGILLFGEASFPWLQPLNYTPLVQISTPLPYF 249

Query: 247 DRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFV 306
           DRVAYTVQL+GI+V  K+LA+PKS+  PDHTGAGQTMVDSGTQFTFLLGPVYT L+NEF+
Sbjct: 250 DRVAYTVQLEGIKVSGKVLAIPKSVLVPDHTGAGQTMVDSGTQFTFLLGPVYTTLRNEFM 309

Query: 307 VQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLMYKV 366
            QTKG+L  L DPNFVFQGAMDLCYRV   +  LPPLP VSLMFRGAEM V GE LMY+V
Sbjct: 310 QQTKGVLRVLDDPNFVFQGAMDLCYRVESTRTSLPPLPTVSLMFRGAEMSVSGERLMYRV 369

Query: 367 PGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVET-------R 426
           PG  RG D V+C TFGNSDLLGIEA+VIGHHHQQN+WMEFDLVKSRVG  E        R
Sbjct: 370 PGERRGSDSVYCFTFGNSDLLGIEAYVIGHHHQQNMWMEFDLVKSRVGLAEVRLLKFIRR 429

Query: 427 CDLAA--------------VATTADMSFVFRGTRVPDIENGLSGFIPERRAMRVHAARPV 486
            DL+                     MSFVFRGTR  DIE G  GFIPERRAMRVHAARPV
Sbjct: 430 IDLSQWGLLQRVILLLVKFFVKQKTMSFVFRGTRA-DIETGFPGFIPERRAMRVHAARPV 489

Query: 487 NSNSLAFLVTVLLLFMMLNSHQMSPNFLLWLVLGVFLMATTLRMYATCQQLQAQAQAR-- 546
           NSNSLAFLVTVLLLFM+LNSHQM PNFLLWLV G+FLMAT+LRMYATCQQLQAQAQA+  
Sbjct: 490 NSNSLAFLVTVLLLFMILNSHQMPPNFLLWLVFGIFLMATSLRMYATCQQLQAQAQAQAH 549

Query: 547 AMAASGLLGHTELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDNAP 605
           A AASGLLGHTELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDN P
Sbjct: 550 AAAASGLLGHTELRLHMPPSIALATRGRLQGLRLQLALLDREFDDLDYETLRALDSDNVP 609

BLAST of CmoCh15G001070 vs. NCBI nr
Match: XP_022938661.1 (aspartic proteinase PCS1-like [Cucurbita moschata])

HSP 1 Score: 854.7 bits (2207), Expect = 5.4e-244
Identity = 420/420 (100.00%), Postives = 420/420 (100.00%), Query Frame = 0

Query: 1   MAFFLRLLQLLICCVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLT 60
           MAFFLRLLQLLICCVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLT
Sbjct: 1   MAFFLRLLQLLICCVSFKQGLCFSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLT 60

Query: 61  LGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSPVPCASPVCRTRTRDLPN 120
           LGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSPVPCASPVCRTRTRDLPN
Sbjct: 61  LGSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSPVPCASPVCRTRTRDLPN 120

Query: 121 PVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKT 180
           PVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKT
Sbjct: 121 PVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKT 180

Query: 181 TGLMGMNRGSLSFVTQLGLPKFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPL 240
           TGLMGMNRGSLSFVTQLGLPKFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPL
Sbjct: 181 TGLMGMNRGSLSFVTQLGLPKFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPL 240

Query: 241 PYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKN 300
           PYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKN
Sbjct: 241 PYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKN 300

Query: 301 EFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLM 360
           EFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLM
Sbjct: 301 EFVVQTKGILVPLGDPNFVFQGAMDLCYRVPEKQGKLPPLPVVSLMFRGAEMVVGGEVLM 360

Query: 361 YKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDLA 420
           YKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDLA
Sbjct: 361 YKVPGMVRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDLA 420

BLAST of CmoCh15G001070 vs. TAIR 10
Match: AT2G39710.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 590.1 bits (1520), Expect = 2.3e-168
Identity = 299/424 (70.52%), Postives = 343/424 (80.90%), Query Frame = 0

Query: 4   FLRLLQLLICCVSFKQGLC--FSATQTMVLPLKTQMGVTSRPSNKLSFHHNVTLTVSLTL 63
           FLR+  LL+    F    C   S  QT++  LKTQ  +    S+KLSF HNVTLTV+L +
Sbjct: 16  FLRISVLLLI---FPLTFCKTSSTNQTLLFSLKTQK-LPQSSSDKLSFRHNVTLTVTLAV 75

Query: 64  GSPPQPVTMVLDTGSELSWLHCKKTPNLNSVFNPLSSSSYSPVPCASPVCRTRTRDLPNP 123
           G PPQ ++MVLDTGSELSWLHCKK+PNL SVFNP+SSS+YSPVPC+SP+CRTRTRDLP P
Sbjct: 76  GDPPQNISMVLDTGSELSWLHCKKSPNLGSVFNPVSSSTYSPVPCSSPICRTRTRDLPIP 135

Query: 124 VTCDPK-KLCHVFVSYADASSLEGNLASDTFRVGSSAQPGTFFGCMDSGFSSNSEEDAKT 183
            +CDPK  LCHV +SYADA+S+EGNLA +TF +GS  +PGT FGCMDSG SSNSEEDAK+
Sbjct: 136 ASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGTLFGCMDSGLSSNSEEDAKS 195

Query: 184 TGLMGMNRGSLSFVTQLGLPKFSYCISGRDSSGVLLFGDASLSWLGNLTYTPLVQMSTPL 243
           TGLMGMNRGSLSFV QLG  KFSYCISG DSSG LL GDAS SWLG + YTPLV  STPL
Sbjct: 196 TGLMGMNRGSLSFVNQLGFSKFSYCISGSDSSGFLLLGDASYSWLGPIQYTPLVLQSTPL 255

Query: 244 PYYDRVAYTVQLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKN 303
           PY+DRVAYTVQL+GIRVG+KIL+LPKS+F PDHTGAGQTMVDSGTQFTFL+GPVYTALKN
Sbjct: 256 PYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKN 315

Query: 304 EFVVQTKGILVPLGDPNFVFQGAMDLCYRV-PEKQGKLPPLPVVSLMFRGAEMVVGGEVL 363
           EF+ QTK +L  + DP+FVFQG MDLCY+V    +     LP+VSLMFRGAEM V G+ L
Sbjct: 316 EFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGLPMVSLMFRGAEMSVSGQKL 375

Query: 364 MYKVPGM-VRGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFV-ETRC 422
           +Y+V G    G ++V+C TFGNSDLLGIEAFVIGHHHQQNVWMEFDL KSRVGF    RC
Sbjct: 376 LYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQQNVWMEFDLAKSRVGFAGNVRC 435

BLAST of CmoCh15G001070 vs. TAIR 10
Match: AT5G02190.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 521.2 bits (1341), Expect = 1.3e-147
Identity = 253/413 (61.26%), Postives = 315/413 (76.27%), Query Frame = 0

Query: 16  SFKQGLCFSATQTMVLPLKTQMGVTS-RPSNKLSFHHNVTLTVSLTLGSPPQPVTMVLDT 75
           SF      S++QT+VLPLKT++  T  RP++KL FHHNVTLTV+LT+G+PPQ ++MV+DT
Sbjct: 33  SFSSFSSSSSSQTLVLPLKTRITPTDHRPTDKLHFHHNVTLTVTLTVGTPPQNISMVIDT 92

Query: 76  GSELSWLHCKKTPNLNSV--FNPLSSSSYSPVPCASPVCRTRTRDLPNPVTCDPKKLCHV 135
           GSELSWL C ++ N N V  F+P  SSSYSP+PC+SP CRTRTRD   P +CD  KLCH 
Sbjct: 93  GSELSWLRCNRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIPASCDSDKLCHA 152

Query: 136 FVSYADASSLEGNLASDTFRVGSSAQPGT-FFGCMDSGFSSNSEEDAKTTGLMGMNRGSL 195
            +SYADASS EGNLA++ F  G+S       FGCM S   S+ EED KTTGL+GMNRGSL
Sbjct: 153 TLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSL 212

Query: 196 SFVTQLGLPKFSYCISGRDS-SGVLLFGDASLSWLGNLTYTPLVQMSTPLPYYDRVAYTV 255
           SF++Q+G PKFSYCISG D   G LL GD++ +WL  L YTPL+++STPLPY+DRVAYTV
Sbjct: 213 SFISQMGFPKFSYCISGTDDFPGFLLLGDSNFTWLTPLNYTPLIRISTPLPYFDRVAYTV 272

Query: 256 QLDGIRVGNKILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVVQTKGIL 315
           QL GI+V  K+L +PKS+  PDHTGAGQTMVDSGTQFTFLLGPVYTAL++ F+ +T GIL
Sbjct: 273 QLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGIL 332

Query: 316 VPLGDPNFVFQGAMDLCYRVPE---KQGKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMV 375
               DP+FVFQG MDLCYR+     + G L  LP VSL+F GAE+ V G+ L+Y+VP + 
Sbjct: 333 TVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLT 392

Query: 376 RGGDQVHCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDLA 421
            G D V+C TFGNSDL+G+EA+VIGHHHQQN+W+EFDL +SR+G     CD++
Sbjct: 393 VGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEFDLQRSRIGLAPVECDVS 445

BLAST of CmoCh15G001070 vs. TAIR 10
Match: AT3G55530.1 (RING/U-box superfamily protein )

HSP 1 Score: 385.6 bits (989), Expect = 8.6e-107
Identity = 213/273 (78.02%), Postives = 230/273 (84.25%), Query Frame = 0

Query: 428 MSFVFRGTRVPDIENGLS-GFIPERRAMRVHAARPVNSNSLAFLVTVLLLFMMLNSHQMS 487
           MSFVFRG+R  D+E+G S GF+PERRAMRVH ARPVNSNSLAFLVTVLLLFM+LNSHQM 
Sbjct: 1   MSFVFRGSR-GDLESGFSGGFLPERRAMRVHGARPVNSNSLAFLVTVLLLFMILNSHQMP 60

Query: 488 PNFLLWLVLGVFLMATTLRMYATCQQLQAQAQARAMAASGLLGHTELRLHMPPSIALATR 547
           PNFLLWLVLGVFLMATTLRMYATCQQLQA AQA+A AASGL  HTELRLH+PPSIALATR
Sbjct: 61  PNFLLWLVLGVFLMATTLRMYATCQQLQAHAQAQAAAASGLFSHTELRLHVPPSIALATR 120

Query: 548 GRLQGLRLQLALLDREFDDLDYETLRALDSDNAPTTPSMSEEQINALPVHKYKVSGPQSD 607
           GRLQGLRLQLALLDREFDDLDYETLRALDSDN  TT SMSEE+INALPVHKYKV  P++ 
Sbjct: 121 GRLQGLRLQLALLDREFDDLDYETLRALDSDNVSTT-SMSEEEINALPVHKYKVLDPENG 180

Query: 608 PSVNQQASSSESNEKRQDSANAVGSTKASEDELTCSVCLEQVNVGEL---------FHAN 667
            S+ +QAS+S S EK  DSAN   S K +EDELTCSVCLEQV VGE+         FHA 
Sbjct: 181 CSLAKQASTSSSAEKMLDSANE--SKKGTEDELTCSVCLEQVTVGEIVRTLPCLHQFHAG 240

Query: 668 CIDPWLRQQGTCPVCKFRAVSGWSEQGQGETDA 691
           CIDPWLRQQGTCPVCKFRA SGW EQ + + DA
Sbjct: 241 CIDPWLRQQGTCPVCKFRAHSGWQEQDEIDDDA 269

BLAST of CmoCh15G001070 vs. TAIR 10
Match: AT5G37540.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 242.7 bits (618), Expect = 9.0e-64
Identity = 149/402 (37.06%), Postives = 214/402 (53.23%), Query Frame = 0

Query: 41  SRPSNKLSFHHNV----TLTVSLTLGSPPQPVTMVLDTGSELSWLHC------KKTPNLN 100
           S PS+  +F  N+     L +SL +G+P Q   +VLDTGS+LSW+ C      K  P   
Sbjct: 62  SPPSSPYTFRSNIKYSMALILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPT 121

Query: 101 SVFNPLSSSSYSPVPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDT 160
           + F+P  SSS+S +PC+ P+C+ R  D   P +CD  +LCH    YAD +  EGNL  + 
Sbjct: 122 TSFDPSLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEK 181

Query: 161 FRVGSS-AQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCISGR 220
           F   +S   P    GC        ++E     G++GMN G LSF++Q  + KFSYCI  R
Sbjct: 182 FTFSNSQTTPPLILGC--------AKESTDEKGILGMNLGRLSFISQAKISKFSYCIPTR 241

Query: 221 D------SSGVLLFGD----ASLSWLGNLTYTPLVQMSTPLPYYDRVAYTVQLDGIRVGN 280
                  S+G    GD        ++  LT+      S  +P  D +AYTV L GIR+G 
Sbjct: 242 SNRPGLASTGSFYLGDNPNSRGFKYVSLLTF----PQSQRMPNLDPLAYTVPLQGIRIGQ 301

Query: 281 KILALPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVVQTKGILVPLGDPNFV 340
           K L +P S+F PD  G+GQTMVDSG++FT L+   Y  +K E +V+  G  +  G   +V
Sbjct: 302 KRLNIPGSVFRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEE-IVRLVGSRLKKG---YV 361

Query: 341 FQGAMDLCYRVPEKQ--GKLPPLPVVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLT 400
           +    D+C+        G+L    +  L+F   E   G E+L+ K   +V  G  +HC+ 
Sbjct: 362 YGSTADMCFDGNHSMEIGRL----IGDLVF---EFGRGVEILVEKQSLLVNVGGGIHCVG 421

Query: 401 FGNSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDL 420
            G S +LG  + +IG+ HQQN+W+EFD+   RVGF +  C L
Sbjct: 422 IGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFSKAECRL 440

BLAST of CmoCh15G001070 vs. TAIR 10
Match: AT1G66180.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 236.5 bits (602), Expect = 6.4e-62
Identity = 144/396 (36.36%), Postives = 210/396 (53.03%), Query Frame = 0

Query: 40  TSRPSN-KLSFHHNVTLTVSLTLGSPPQPVTMVLDTGSELSWLHC---KKTPNLNSVFNP 99
           +S P N +  F +++ L +SL +G+PPQ   MVLDTGS+LSW+ C   K  P   + F+P
Sbjct: 56  SSPPYNFRSRFKYSMALIISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPPKPKTSFDP 115

Query: 100 LSSSSYSPVPCASPVCRTRTRDLPNPVTCDPKKLCHVFVSYADASSLEGNLASDTFRVGS 159
             SSS+S +PC+ P+C+ R  D   P +CD  +LCH    YAD +  EGNL  +     +
Sbjct: 116 SLSSSFSTLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSN 175

Query: 160 S-AQPGTFFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCI------SG 219
           +   P    GC        + E +   G++GMNRG LSFV+Q  + KFSYCI       G
Sbjct: 176 TEITPPLILGC--------ATESSDDRGILGMNRGRLSFVSQAKISKFSYCIPPKSNRPG 235

Query: 220 RDSSGVLLFGDASLSWLGNLTYTPLVQM--STPLPYYDRVAYTVQLDGIRVGNKILALPK 279
              +G    GD   S      Y  L+    S  +P  D +AYTV + GIR G K L +  
Sbjct: 236 FTPTGSFYLGDNPNS--HGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISG 295

Query: 280 SIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVVQTKGILVPLGDPNFVFQGAMDL 339
           S+F PD  G+GQTMVDSG++FT L+   Y  ++ E + +    L       +V+ G  D+
Sbjct: 296 SVFRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRL----KKGYVYGGTADM 355

Query: 340 CYRVPEKQGKLPPLP-----VVSLMFRGAEMVVGGEVLMYKVPGMVRGGDQVHCLTFGNS 399
           C+      G +  +P     +V +  RG E++V  E ++  V      G  +HC+  G S
Sbjct: 356 CF-----DGNVAMIPRLIGDLVFVFTRGVEILVPKERVLVNV------GGGIHCVGIGRS 415

Query: 400 DLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRC 418
            +LG  + +IG+ HQQN+W+EFD+   RVGF +  C
Sbjct: 416 SMLGAASNIIGNVHQQNLWVEFDVTNRRVGFAKADC 426

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9LZL31.8e-14661.26Aspartic proteinase PCS1 OS=Arabidopsis thaliana OX=3702 GN=PCS1 PE=2 SV=1[more]
Q9M2S61.2e-10578.02E3 ubiquitin-protein ligase SDIR1 OS=Arabidopsis thaliana OX=3702 GN=SDIR1 PE=1 ... [more]
Q766C21.3e-3831.34Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis OX=150966 GN=nep2 PE=1 S... [more]
Q766C35.3e-3730.16Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis OX=150966 GN=nep1 PE=1 S... [more]
Q9LNJ33.4e-3131.89Aspartyl protease family protein 2 OS=Arabidopsis thaliana OX=3702 GN=APF2 PE=2 ... [more]
Match NameE-valueIdentityDescription
A0A7J6E7N26.3e-28366.20Uncharacterized protein OS=Cannabis sativa OX=3483 GN=G4B88_019942 PE=3 SV=1[more]
A0A7J6E2Q51.6e-27360.77Uncharacterized protein OS=Cannabis sativa OX=3483 GN=G4B88_020587 PE=3 SV=1[more]
A0A498KN451.1e-26365.54Uncharacterized protein OS=Malus domestica OX=3750 GN=DVH24_023080 PE=3 SV=1[more]
A0A5J5BAH93.3e-26374.84Peptidase A1 domain-containing protein OS=Nyssa sinensis OX=561372 GN=F0562_0261... [more]
A0A6J1FDS62.6e-244100.00aspartic proteinase PCS1-like OS=Cucurbita moschata OX=3662 GN=LOC111444820 PE=3... [more]
Match NameE-valueIdentityDescription
KAF4354473.11.3e-28266.20hypothetical protein G4B88_019942 [Cannabis sativa][more]
KAF4351959.13.2e-27360.77hypothetical protein G4B88_020587 [Cannabis sativa][more]
RXI08936.12.3e-26365.54hypothetical protein DVH24_023080 [Malus domestica][more]
KAA8539480.16.7e-26374.84hypothetical protein F0562_026172 [Nyssa sinensis][more]
XP_022938661.15.4e-244100.00aspartic proteinase PCS1-like [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
AT2G39710.12.3e-16870.52Eukaryotic aspartyl protease family protein [more]
AT5G02190.11.3e-14761.26Eukaryotic aspartyl protease family protein [more]
AT3G55530.18.6e-10778.02RING/U-box superfamily protein [more]
AT5G37540.19.0e-6437.06Eukaryotic aspartyl protease family protein [more]
AT1G66180.16.4e-6236.36Eukaryotic aspartyl protease family protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 223..424
e-value: 4.0E-42
score: 145.9
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 48..218
e-value: 4.9E-36
score: 126.5
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 56..417
IPR032799Xylanase inhibitor, C-terminalPFAMPF14541TAXi_Ccoord: 247..413
e-value: 3.5E-35
score: 121.2
IPR013083Zinc finger, RING/FYVE/PHD-typeGENE3D3.30.40.10Zinc/RING finger domain, C3HC4 (zinc finger)coord: 614..679
e-value: 3.8E-10
score: 41.0
IPR032861Xylanase inhibitor, N-terminalPFAMPF14543TAXi_Ncoord: 57..219
e-value: 3.0E-39
score: 135.1
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 598..632
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 603..632
NoneNo IPR availablePANTHERPTHR47965:SF49EUKARYOTIC ASPARTYL PROTEASE FAMILY PROTEINcoord: 9..421
NoneNo IPR availableCDDcd16454RING-H2_PA-TM-RINGcoord: 640..673
e-value: 3.50644E-15
score: 67.7121
NoneNo IPR availableSUPERFAMILY57850RING/U-boxcoord: 636..679
IPR001461Aspartic peptidase A1 familyPANTHERPTHR47965ASPARTYL PROTEASE-RELATEDcoord: 9..421
IPR001969Aspartic peptidase, active sitePROSITEPS00141ASP_PROTEASEcoord: 70..81
IPR033121Peptidase family A1 domainPROSITEPS51767PEPTIDASE_A1coord: 55..413
score: 34.421074
IPR034161Pepsin-like domain, plantCDDcd05476pepsin_A_like_plantcoord: 54..417
e-value: 7.75111E-71
score: 229.457

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh15G001070.1CmoCh15G001070.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006508 proteolysis
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0004190 aspartic-type endopeptidase activity