Cp4.1LG03g17980 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG03g17980
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: GTD-binding domain-containing protein
Location: Cp4.1LG03: 12306680 .. 12311005 (+)
RNA-Seq Expression: Cp4.1LG03g17980
Synteny: Cp4.1LG03g17980
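
The Location field above packs the sequence ID, start, end, and strand into one string. A minimal sketch of how it could be parsed and the gene span computed is shown here. The helper `parse_location` is hypothetical (it is not part of the database), and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Hypothetical helper (not part of the database): parses a location string
# of the form "<seqid>: <start> .. <end> (<strand>)".
LOCATION_RE = re.compile(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    # Assumption: coordinates are 1-based and inclusive,
    # so the span length is end - start + 1.
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


location = parse_location("Cp4.1LG03: 12306680 .. 12311005 (+)")
print(location["seqid"], location["strand"], location["length"])
```

Under the stated assumption, the gene spans 4326 bp on the plus strand of Cp4.1LG03.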
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ACGTGGCAATCCCGCCGGCGCCGCATCTGGCATGAAAAATATACAGATTAAATCGACTCAGCTTCACTCAGCTCGTTTTTCTTTTTCTTTTTCTTTTTATTTTCAAAACTTCTCTGTATTTCTCATCCCATTTTCCTCAGCATCATGATCCTCCACCTCACACGACGTTACTAAATCGACTTTTATGGAAAGTTAGTAGGGTTTTTATACTCGAGAAATCGAAAGTGGAAACCGCCACCATTGAATCATTTGATTTTCTTTTCCATTTAACTTTCCGATTTTTCATTTTCATCTACTAATTTAATTGTTTTCGTTATCGGAAGCATATTGTTGTGAAGTTGCTGAGAGTTGAAAGCTTATGGAAGTTCGAAAGAAACGCTGATTTCTGTTCGGTTTTTATTCTTCCTCTGTTTAAGTTTTTGTGATTTCCATTTGCGACCAGTGATGCTCACTGTGGTGGAGTGAAGTTTGCGGTTGGATGTAGATTTCGAAGTGAAAGTTGTAGCTCTGAATTGTAGATTCAGTAAGGGACTCGTTCGATTTTCTGAACTTTTCACCGGCGAAAACGAGGTTTATTTTCTATTCTGTTGTTTTTTGTAGCTTGATTCATGGTTTCAATCTTATTTCTGTTCGTCTAGTGACTAGAAAAGTAGTTTTTCGTATTTGTTTGATTGGATAGCATTGGACGTTGAAGTTATTCTTGTGGAAAATGGCTTGTGAAGCTGTACAACTGTGGACATTTAATGGATTAGTGGCTGCGTTTCTTGATCTTGGTATAGCTTTTCTTTTATTATGTGCAACGAGTCTTGTTTTCTTTACATCCAAATTTCTGGCGCTGTTTGGGTCATGTCTGCCTTGCCCTTGTGATGGACTATTTGGGAACCTTGGTAGTGATCACTGCTTCCAAAAGTTGCTAGTGGATTGTTCGTCCAAAAGAATATCTTCAGTCCTACATTCAACTAGAGAAAAGTTCCCATTGGATTCCATGTGGGATCAAGAGCCAAAATGTTGTTTTAAGTCGATGTCGGTGCACGAGAGGAATGCGAAGGAGGCGTGTGTTGAATTCGAAGGTGAAGCATCAGGTGATTCCTGGTTTAAAACCAGATCACCTCGAGGTATGATTTATGGAGACGTTCTCAATGTCAACGAATCGCATTATAAATGCGGTGTGGGTGGCAGGAAGATTGCATCAGTGTCTCCGAATGACGTTTTTCAGTCGGATGTGGAACTGGAAGACCTTTGTCATTCTCCTTCAAGCTTCTGTGGATTTGGGGATAACAATAACGAGGATGGCTTCTTTTCTGTTGATACTGGAGGTAATGAAATTTAATGTGTACCATTTGTCATTCTTTCCCTGTTAGATCAACTTAAAAAGTCATGATCATTTATGCGAATTGAATTGCCTTTTCATGGCAGCTATGAGCTTATTAGGATAGCAAATGGCTGATAACTCTGTGTCTTCTGAACTTCATTTTTATTGTAGTGTAGCATTTATGCATTATGCTTAGGACCCTTAACTGTTTTTGGAGATTGATTGGAAAGTTCGGTATTGGAAACCGAATATGTTTCGAATATGCATCTTGGACGTTGAGATCATAAATTTTGATGTTTGGGAGAATTTCCGTTTGACAAGTCTGATATTTTGAATTTTCAAGTGAATGTCGTGAAGATTGTAGCAAAAATTTGTTTTTATTTGGTTTTGCTCATTACTTTGTTCCTTATTTACGCTAGCTAAGCTAATAAATAGTAATTTTTCCCTTCAATCAAATAAAAAAGCTTATTGATTGATTCAGGGCGCGGTCCTCCTTCTTAGAACCGATTGAGGGGGGCCTCGGCCCTGAAAGGGAAAAGTGGCCAAGCAGTATCAGTGTTTTCTAACGGAAGTTTTAAGTTTCCTTGTAAGAGAAATTTTGCATTTTCTTCATAATTTTTCATGGTGATAACTGCAATCTTGGTGTTCTGATAACATTGTAGATGAAGGGGAGGCTTCATTAGACAAC
AGCAATCAATATAAAGTATTTCCCGATCTCGAACTAGATGATTCTTATGATGAGAAAATATGCGCAGAGATGTATGGAGCATCTGCTGAAGAGGCTAGGAACAACTGCAGAGGGGAGTTATGCTTGGATGGTAATGAGAGTGATAGAATCAAACTATTGGCACAATCACTCGAAGAAGAGCAGAAAGCTCGCGCTACCTTGTACCTGGAGCTCGAGAAAGAGAGAAGCGCTGCTGCCACTGCTGCTGATGAGGCCATGGCTATGATATTACGTCTTCAAGAGGAGAAGGCTAAAATAGAAATGGAGGCTAGGCAATACCAGAGGTTGATAGAGGAGAAAACTGCTTATGATGCTGAGGAAATGAGTATTCTTAAAGAAATTCTAGTGAGGAGAGAGCAGGAAATGCATTTCCTGAAGAAGGAAGTTGAAGCTTTTAGGAAAAGTTTCTTTGATAATGATGGAGTGAGTGTTGATATGCTTGACTCTGAATTTACACCCCCATGGCTTGAATGCATCAATGGATCCATTACAGACAAGCAATCTCATGAGTTACCATCAGTGGAGTCACGAAACTTAACTTTTGAGTTCGGGGAAGAGTCCCCTTCGATTCAAGCGGATGAATTCGCTGATGCTGCGAAAGCTAGTGGCGTGTTGTTGCACCAAGTTGCCGATAATTTCGAGCTTAGTGAAGAGATTGACAACGAGTTACAAGGAAAGGGCATGGTGGAAGATAAAAATGTATACATTGTACCAGGAGAAGTAAATGAACTGGAGCCATATCCGAAAAGCAATGTGTCGAATGGCCTAGATAAGGTCGAGCAGTGCACAGAATTGACTGCTGATGAACAAGAAAAAGTTTACGACGATTCATTTGACGGGTTGGCATTTGCGGAAATAGCTCTTCCTTGTGTTGAGTATAATTTGGAAAAGCATGGGAACCATCAAAAGCAGTGGACAAGAGATTTCAAATCTGTGAATGATACAGATGCCTGTCCTCATGATATTCATGTCATTGAAGATGAAGCTAGAATGCCCAATGAAGCAAGTGCTAATGCTCGTAAAGAAACATCGGTTAATGGTAGCTCGAGTATTCCAGTTAATTGTGATAGTTCATCCTTCAGTTCGTTGCAGAATGGACTAGACATCACGAGAAGCAGCTCAGATGCCACTGGAAGATTTCCACCAATGGCTCGTTCTCGAAGCAATTCCTTGCGTTCTGAATTGCGTAGAAACTCGATGTCTGCTGTCGATTATGAAAGGTCGAAAATCGGCAATGAAGTTGAGTGGCTTAGAGGAAGGTTGAAGATTGTTCAAGAGGAAAGAGAAAAGCTCAAGTTATCTGTGGAGCAAAAAGAGAAGGAGAACAATCAATTGCAACTTTTAGAAAACATAACGAAGGTGCTGTCGGATCCTGGAATGGCAGCTCTGCAGGCTCCATTGCCTCCATCCTCTAAGGTACATTAACCAATCTTCAGTTGATATTATTAATACGCTACGTGCATAGATCTATTAGAAATCACGACTCTCTACAATGGCATGATATTGTGCACTTTGAGCATAAGCTCTCGTGGGTTTGATTTTGGTTTTCCCAAAAGGTCTCGTACCAATGGAGATGTCTTCCTTACTTATAAACCCACGATCAACCCTTTAATTAGCCGATGTGAGACTCCCCTCCCAACAATCCTCGACAAGATCTATACCTTCTTCTAATGTATTGTTTAACAAATCTAACAGATCAATGCATTGTAGAATGAAACCTTATTTTATCATTCAACTCGGTCCAAGAAAAACCATAAATGCTCTGCAATTCTAAAATTTTGCCTGCCTTTTAGTCGTTGTCAATCCATTCTTGATGGCTTTGTTTTCAGGATGTGTCAAAGAAACGTTGCTGGCGAAGCTCATCCTTAAGCATTCACAGAAGCAGCTAGCCAGTTTGAAGTGCCACCAAAAAAACGAAACGAAACGAAACGAAAGCACCGTAGCCGAACACGATACAATAC
AGCATCCTCCAGGTTAGAAATTCTAGTCTTTTTGCCCCACTGCCATACAAGACCACAAGACGTGTAAATTGAGAATAACTTGACAGGTTATGGCTTAGATCATAGGTACTTTTTCTTTTAAGTTCTGATGTGGGGTGATAGATATTGGGATATTGAGCCTCCTATAAGCTATATTTAGTAAATTGAAATCCTGTATACATCCTGTGCTTACAAAACATACTTTCTTCCTTTCTTCGTCCTCTATTCTCTGTTCTCTGTATTCTGTCTTTTGTCTTGTTGTACGTACCTGAAGTCATCAATAAATTTTTTAGCTAATATTCCTCTTC

mRNA sequence

ACGTGGCAATCCCGCCGGCGCCGCATCTGGCATGAAAAATATACAGATTAAATCGACTCAGCTTCACTCAGCTCGTTTTTCTTTTTCTTTTTCTTTTTATTTTCAAAACTTCTCTGTATTTCTCATCCCATTTTCCTCAGCATCATGATCCTCCACCTCACACGACGTTACTAAATCGACTTTTATGGAAAGTTAGTAGGGTTTTTATACTCGAGAAATCGAAAGTGGAAACCGCCACCATTGAATCATTTGATTTTCTTTTCCATTTAACTTTCCGATTTTTCATTTTCATCTACTAATTTAATTGTTTTCGTTATCGGAAGCATATTGTTGTGAAGTTGCTGAGAGTTGAAAGCTTATGGAAGTTCGAAAGAAACGCTGATTTCTGTTCGGTTTTTATTCTTCCTCTGTTTAAGTTTTTGTGATTTCCATTTGCGACCAGTGATGCTCACTGTGGTGGAGTGAAGTTTGCGGTTGGATGTAGATTTCGAAGTGAAAGTTGTAGCTCTGAATTGTAGATTCAGTAAGGGACTCGTTCGATTTTCTGAACTTTTCACCGGCGAAAACGAGGTTTATTTTCTATTCTGTTGTTTTTTGTAGCTTGATTCATGGTTTCAATCTTATTTCTGTTCGTCTAGTGACTAGAAAAGTAGTTTTTCGTATTTGTTTGATTGGATAGCATTGGACGTTGAAGTTATTCTTGTGGAAAATGGCTTGTGAAGCTGTACAACTGTGGACATTTAATGGATTAGTGGCTGCGTTTCTTGATCTTGGTATAGCTTTTCTTTTATTATGTGCAACGAGTCTTGTTTTCTTTACATCCAAATTTCTGGCGCTGTTTGGGTCATGTCTGCCTTGCCCTTGTGATGGACTATTTGGGAACCTTGGTAGTGATCACTGCTTCCAAAAGTTGCTAGTGGATTGTTCGTCCAAAAGAATATCTTCAGTCCTACATTCAACTAGAGAAAAGTTCCCATTGGATTCCATGTGGGATCAAGAGCCAAAATGTTGTTTTAAGTCGATGTCGGTGCACGAGAGGAATGCGAAGGAGGCGTGTGTTGAATTCGAAGGTGAAGCATCAGGTGATTCCTGGTTTAAAACCAGATCACCTCGAGGTATGATTTATGGAGACGTTCTCAATGTCAACGAATCGCATTATAAATGCGGTGTGGGTGGCAGGAAGATTGCATCAGTGTCTCCGAATGACGTTTTTCAGTCGGATGTGGAACTGGAAGACCTTTGTCATTCTCCTTCAAGCTTCTGTGGATTTGGGGATAACAATAACGAGGATGGCTTCTTTTCTGTTGATACTGGAGATGAAGGGGAGGCTTCATTAGACAACAGCAATCAATATAAAGTATTTCCCGATCTCGAACTAGATGATTCTTATGATGAGAAAATATGCGCAGAGATGTATGGAGCATCTGCTGAAGAGGCTAGGAACAACTGCAGAGGGGAGTTATGCTTGGATGGTAATGAGAGTGATAGAATCAAACTATTGGCACAATCACTCGAAGAAGAGCAGAAAGCTCGCGCTACCTTGTACCTGGAGCTCGAGAAAGAGAGAAGCGCTGCTGCCACTGCTGCTGATGAGGCCATGGCTATGATATTACGTCTTCAAGAGGAGAAGGCTAAAATAGAAATGGAGGCTAGGCAATACCAGAGGTTGATAGAGGAGAAAACTGCTTATGATGCTGAGGAAATGAGTATTCTTAAAGAAATTCTAGTGAGGAGAGAGCAGGAAATGCATTTCCTGAAGAAGGAAGTTGAAGCTTTTAGGAAAAGTTTCTTTGATAATGATGGAGTGAGTGTTGATATGCTTGACTCTGAATTTACACCCCCATGGCTTGAATGCATCAATGGATCCATTACAGACAAGCAATCTCATGAGTTACCATCAGTGGAGTCACGAAACTTAACTTTTGAGTTCGGGGAAGAGTCCCCTTCGATTCAAGCGGATGAATTCGCTGATGCTGCGAAAGCTAGTGGCGTGTTGTTGC
ACCAAGTTGCCGATAATTTCGAGCTTAGTGAAGAGATTGACAACGAGTTACAAGGAAAGGGCATGGTGGAAGATAAAAATGTATACATTGTACCAGGAGAAGTAAATGAACTGGAGCCATATCCGAAAAGCAATGTGTCGAATGGCCTAGATAAGGTCGAGCAGTGCACAGAATTGACTGCTGATGAACAAGAAAAAGTTTACGACGATTCATTTGACGGGTTGGCATTTGCGGAAATAGCTCTTCCTTGTGTTGAGTATAATTTGGAAAAGCATGGGAACCATCAAAAGCAGTGGACAAGAGATTTCAAATCTGTGAATGATACAGATGCCTGTCCTCATGATATTCATGTCATTGAAGATGAAGCTAGAATGCCCAATGAAGCAAGTGCTAATGCTCGTAAAGAAACATCGGTTAATGGTAGCTCGAGTATTCCAGTTAATTGTGATAGTTCATCCTTCAGTTCGTTGCAGAATGGACTAGACATCACGAGAAGCAGCTCAGATGCCACTGGAAGATTTCCACCAATGGCTCGTTCTCGAAGCAATTCCTTGCGTTCTGAATTGCGTAGAAACTCGATGTCTGCTGTCGATTATGAAAGGTCGAAAATCGGCAATGAAGTTGAGTGGCTTAGAGGAAGGTTGAAGATTGTTCAAGAGGAAAGAGAAAAGCTCAAGTTATCTGTGGAGCAAAAAGAGAAGGAGAACAATCAATTGCAACTTTTAGAAAACATAACGAAGGATGTGTCAAAGAAACGTTGCTGGCGAAGCTCATCCTTAAGCATTCACAGAAGCAGCTAGCCAGTTTGAAGTGCCACCAAAAAAACGAAACGAAACGAAACGAAAGCACCGTAGCCGAACACGATACAATACAGCATCCTCCAGGTTAGAAATTCTAGTCTTTTTGCCCCACTGCCATACAAGACCACAAGACGTGTAAATTGAGAATAACTTGACAGGTTATGGCTTAGATCATAGGTACTTTTTCTTTTAAGTTCTGATGTGGGGTGATAGATATTGGGATATTGAGCCTCCTATAAGCTATATTTAGTAAATTGAAATCCTGTATACATCCTGTGCTTACAAAACATACTTTCTTCCTTTCTTCGTCCTCTATTCTCTGTTCTCTGTATTCTGTCTTTTGTCTTGTTGTACGTACCTGAAGTCATCAATAAATTTTTTAGCTAATATTCCTCTTC

Coding sequence (CDS)

ATGGCTTGTGAAGCTGTACAACTGTGGACATTTAATGGATTAGTGGCTGCGTTTCTTGATCTTGGTATAGCTTTTCTTTTATTATGTGCAACGAGTCTTGTTTTCTTTACATCCAAATTTCTGGCGCTGTTTGGGTCATGTCTGCCTTGCCCTTGTGATGGACTATTTGGGAACCTTGGTAGTGATCACTGCTTCCAAAAGTTGCTAGTGGATTGTTCGTCCAAAAGAATATCTTCAGTCCTACATTCAACTAGAGAAAAGTTCCCATTGGATTCCATGTGGGATCAAGAGCCAAAATGTTGTTTTAAGTCGATGTCGGTGCACGAGAGGAATGCGAAGGAGGCGTGTGTTGAATTCGAAGGTGAAGCATCAGGTGATTCCTGGTTTAAAACCAGATCACCTCGAGGTATGATTTATGGAGACGTTCTCAATGTCAACGAATCGCATTATAAATGCGGTGTGGGTGGCAGGAAGATTGCATCAGTGTCTCCGAATGACGTTTTTCAGTCGGATGTGGAACTGGAAGACCTTTGTCATTCTCCTTCAAGCTTCTGTGGATTTGGGGATAACAATAACGAGGATGGCTTCTTTTCTGTTGATACTGGAGATGAAGGGGAGGCTTCATTAGACAACAGCAATCAATATAAAGTATTTCCCGATCTCGAACTAGATGATTCTTATGATGAGAAAATATGCGCAGAGATGTATGGAGCATCTGCTGAAGAGGCTAGGAACAACTGCAGAGGGGAGTTATGCTTGGATGGTAATGAGAGTGATAGAATCAAACTATTGGCACAATCACTCGAAGAAGAGCAGAAAGCTCGCGCTACCTTGTACCTGGAGCTCGAGAAAGAGAGAAGCGCTGCTGCCACTGCTGCTGATGAGGCCATGGCTATGATATTACGTCTTCAAGAGGAGAAGGCTAAAATAGAAATGGAGGCTAGGCAATACCAGAGGTTGATAGAGGAGAAAACTGCTTATGATGCTGAGGAAATGAGTATTCTTAAAGAAATTCTAGTGAGGAGAGAGCAGGAAATGCATTTCCTGAAGAAGGAAGTTGAAGCTTTTAGGAAAAGTTTCTTTGATAATGATGGAGTGAGTGTTGATATGCTTGACTCTGAATTTACACCCCCATGGCTTGAATGCATCAATGGATCCATTACAGACAAGCAATCTCATGAGTTACCATCAGTGGAGTCACGAAACTTAACTTTTGAGTTCGGGGAAGAGTCCCCTTCGATTCAAGCGGATGAATTCGCTGATGCTGCGAAAGCTAGTGGCGTGTTGTTGCACCAAGTTGCCGATAATTTCGAGCTTAGTGAAGAGATTGACAACGAGTTACAAGGAAAGGGCATGGTGGAAGATAAAAATGTATACATTGTACCAGGAGAAGTAAATGAACTGGAGCCATATCCGAAAAGCAATGTGTCGAATGGCCTAGATAAGGTCGAGCAGTGCACAGAATTGACTGCTGATGAACAAGAAAAAGTTTACGACGATTCATTTGACGGGTTGGCATTTGCGGAAATAGCTCTTCCTTGTGTTGAGTATAATTTGGAAAAGCATGGGAACCATCAAAAGCAGTGGACAAGAGATTTCAAATCTGTGAATGATACAGATGCCTGTCCTCATGATATTCATGTCATTGAAGATGAAGCTAGAATGCCCAATGAAGCAAGTGCTAATGCTCGTAAAGAAACATCGGTTAATGGTAGCTCGAGTATTCCAGTTAATTGTGATAGTTCATCCTTCAGTTCGTTGCAGAATGGACTAGACATCACGAGAAGCAGCTCAGATGCCACTGGAAGATTTCCACCAATGGCTCGTTCTCGAAGCAATTCCTTGCGTTCTGAATTGCGTAGAAACTCGATGTCTGCTGTCGATTATGAAAGGTCGAAAATCGGCAATGAAGTTGAGTGGCTTAGAGGAAGGTTGAAGATTGTTCAAGAGGAAAGAGAAAAGCTCAAGTTATCTGTGGAGCAAAAAGAGAAGGAGAACAA
TCAATTGCAACTTTTAGAAAACATAACGAAGGATGTGTCAAAGAAACGTTGCTGGCGAAGCTCATCCTTAAGCATTCACAGAAGCAGCTAG

Protein sequence

MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLGSDHCFQKLLVDCSSKRISSVLHSTREKFPLDSMWDQEPKCCFKSMSVHERNAKEACVEFEGEASGDSWFKTRSPRGMIYGDVLNVNESHYKCGVGGRKIASVSPNDVFQSDVELEDLCHSPSSFCGFGDNNNEDGFFSVDTGDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASAEEARNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMILRLQEEKAKIEMEARQYQRLIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEAFRKSFFDNDGVSVDMLDSEFTPPWLECINGSITDKQSHELPSVESRNLTFEFGEESPSIQADEFADAAKASGVLLHQVADNFELSEEIDNELQGKGMVEDKNVYIVPGEVNELEPYPKSNVSNGLDKVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGNHQKQWTRDFKSVNDTDACPHDIHVIEDEARMPNEASANARKETSVNGSSSIPVNCDSSSFSSLQNGLDITRSSSDATGRFPPMARSRSNSLRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEEREKLKLSVEQKEKENNQLQLLENITKDVSKKRCWRSSSLSIHRSS
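
The protein sequence above corresponds to the CDS read codon by codon. As a rough illustration, the sketch below translates the first 30 nucleotides of the CDS. It assumes the standard genetic code (NCBI translation table 1), which is the usual choice for plant nuclear genes but is not stated in this record; the names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code (assumed, NCBI table 1), built in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
cds_start = "ATGGCTTGTGAAGCTGTACAACTGTGGACA"
print(translate(cds_start))
```

The ten residues produced this way match the start of the protein sequence listed above (MACEAVQLWT).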
Homology
BLAST of Cp4.1LG03g17980 vs. ExPASy Swiss-Prot
Match: Q0WNW4 (Myosin-binding protein 3 OS=Arabidopsis thaliana OX=3702 GN=MYOB3 PE=1 SV=1)

HSP 1 Score: 91.3 bits (225), Expect = 4.7e-17
Identity = 56/121 (46.28%), Positives = 86/121 (71.07%), Query Frame = 0

Query: 238 ASAEEARNNCRGELCLDGNESDR-IKLLAQSLEEEQKARATLYLELEKERSAAATAADEA 297
           A+AE+A +       +DG +  R I+ L +++  EQ+A   LY ELE+ERSA+A +A++ 
Sbjct: 333 AAAEDAGDGNVLVSEMDGGDPLRTIERLRETVRAEQEALRDLYAELEEERSASAISANQT 392

Query: 298 MAMILRLQEEKAKIEMEARQYQRLIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEAF 357
           MAMI RLQEEKAK++MEA QYQR++EE+  YD E + +L  ++V+RE+E   L++E+E +
Sbjct: 393 MAMITRLQEEKAKVQMEALQYQRMMEEQAEYDQEALQLLNHLMVKREKEKEQLQRELEVY 452
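
The percentages in each HSP summary line are the match counts divided by the alignment length. A small sketch of that arithmetic follows, using the counts from the HSP above; the function name `hsp_percentages` is hypothetical, and the regular expression assumes the "Label = matches/length" layout used on this page.

```python
import re


def hsp_percentages(stats_line):
    """Return {label: percent} for every 'Label = matches/length' pair."""
    result = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        # Percentage of aligned columns, rounded to two decimals as on the page.
        result[label] = round(100 * int(matches) / int(length), 2)
    return result


stats = hsp_percentages("Identity = 56/121 (46.28%), Positives = 86/121 (71.07%)")
print(stats)
```

Here 56 identical columns out of 121 aligned columns give 46.28% identity, and 86 identical or conservatively substituted columns give 71.07% positives.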

BLAST of Cp4.1LG03g17980 vs. ExPASy Swiss-Prot
Match: Q9CAC4 (Myosin-binding protein 2 OS=Arabidopsis thaliana OX=3702 GN=MYOB2 PE=1 SV=1)

HSP 1 Score: 85.9 bits (211), Expect = 2.0e-15
Identity = 57/118 (48.31%), Positives = 80/118 (67.80%), Query Frame = 0

Query: 242 EARNNCRGELCLDGNES-DRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI 301
           E R +  G  C +G  + D++K     L+EE+KA   LY ELE ER+A+A AA E MAMI
Sbjct: 396 EQRVSVDGIECPEGVLTVDKLKF---ELQEERKALHALYEELEVERNASAVAASETMAMI 455

Query: 302 LRLQEEKAKIEMEARQYQRLIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEAFRK 359
            RL EEKA ++MEA QYQR++EE+  +D E + +L E++V RE+E   L+KE+E +RK
Sbjct: 456 NRLHEEKAAMQMEALQYQRMMEEQAEFDQEALQLLNELMVNREKENAELEKELEVYRK 510

BLAST of Cp4.1LG03g17980 vs. ExPASy Swiss-Prot
Match: Q9LMC8 (Probable myosin-binding protein 5 OS=Arabidopsis thaliana OX=3702 GN=MYOB5 PE=2 SV=1)

HSP 1 Score: 81.6 bits (200), Expect = 3.7e-14
Identity = 69/207 (33.33%), Positives = 113/207 (54.59%), Query Frame = 0

Query: 253 LDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMILRLQEEKAKIEM 312
           LDG+    ++ L + +  ++K+   LY+EL++ERSA+A AA+ AMAMI RLQ EKA ++M
Sbjct: 295 LDGDSI--LQHLNRQVRLDRKSLMDLYMELDEERSASAVAANNAMAMITRLQAEKAAVQM 354

Query: 313 EARQYQRLIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEAFRKSF---FDNDGVSVD 372
           EA QYQR+++E+  YD E +  +  +LV+RE+EM  L+  +E +R  +    +  G + +
Sbjct: 355 EALQYQRMMDEQAEYDQEALQSMNGLLVKREEEMKELEAGIEVYRLRYGLLREERGEAEE 414

Query: 373 MLDSEFTP--PWLECINGSITD----KQSHELPSVESRNLTFEFGEESPS---------I 432
            LD E  P      C +    D    K S E     +  +  E  +E+ S          
Sbjct: 415 FLDEETKPVSDLPVCSSNHEEDLEQMKDSAEDSIGNNGVMIIEEEKENGSRKDMLVKEIS 474

Query: 433 QADEFADAAKASGVLLHQVADNFELSE 442
           +  E  +A ++ G LL Q++D  ++SE
Sbjct: 475 EITERLNAIESKGELLQQISDVLDVSE 499

BLAST of Cp4.1LG03g17980 vs. ExPASy Swiss-Prot
Match: F4HVS6 (Probable myosin-binding protein 6 OS=Arabidopsis thaliana OX=3702 GN=MYOB6 PE=2 SV=1)

HSP 1 Score: 81.3 bits (199), Expect = 4.9e-14
Identity = 45/97 (46.39%), Positives = 69/97 (71.13%), Query Frame = 0

Query: 264 LAQSLEEEQKARATLYLELEKERSAAATAADEAMAMILRLQEEKAKIEMEARQYQRLIEE 323
           L + +  ++K+   LY+EL++ERSA+A AA+EAMAMI RLQ EKA ++MEA QYQR+++E
Sbjct: 305 LKKEVRLDKKSLIDLYMELDEERSASAVAANEAMAMITRLQAEKAAVQMEALQYQRMMDE 364

Query: 324 KTAYDAEEMSILKEILVRREQEMHFLKKEVEAFRKSF 361
           +  YD E +  +   L +RE+EM  L+ E E +R+ +
Sbjct: 365 QAEYDQEALQSMSSELAKREEEMKELEAEFEVYREKY 401

BLAST of Cp4.1LG03g17980 vs. ExPASy Swiss-Prot
Match: Q9FG14 (Myosin-binding protein 7 OS=Arabidopsis thaliana OX=3702 GN=MYOB7 PE=1 SV=1)

HSP 1 Score: 78.2 bits (191), Expect = 4.1e-13
Identity = 42/99 (42.42%), Positives = 74/99 (74.75%), Query Frame = 0

Query: 259 DRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMILRLQEEKAKIEMEARQYQ 318
           + ++LL +++  +Q++   LY EL++ER+AA+TAA EAM+MILRLQ +KA+++ME RQ++
Sbjct: 69  NELELLRETVSSQQQSIQDLYEELDEERNAASTAASEAMSMILRLQRDKAELQMELRQFK 128

Query: 319 RLIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEAFR 358
           R  EEK  +D +E+  L++++ +REQ +  L  E +A++
Sbjct: 129 RFAEEKMEHDQQELLDLEDLIYKREQTIQALTFEAQAYK 167

BLAST of Cp4.1LG03g17980 vs. NCBI nr
Match: XP_023528191.1 (uncharacterized protein LOC111791180 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1347 bits (3486), Expect = 0.0
Identity = 696/715 (97.34%), Positives = 696/715 (97.34%), Query Frame = 0

Query: 1   MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG 60
           MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG
Sbjct: 1   MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG 60

Query: 61  SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSMWDQEPKCCFKSMSVHERNAKEACVEFE 120
           SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSMWDQEPKCCFKSMSVHERNAKEACVEFE
Sbjct: 61  SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSMWDQEPKCCFKSMSVHERNAKEACVEFE 120

Query: 121 GEASGDSWFKTRSPRGMIYGDVLNVNESHYKCGVGGRKIASVSPNDVFQSDVELEDLCHS 180
           GEASGDSWFKTRSPRGMIYGDVLNVNESHYKCGVGGRKIASVSPNDVFQSDVELEDLCHS
Sbjct: 121 GEASGDSWFKTRSPRGMIYGDVLNVNESHYKCGVGGRKIASVSPNDVFQSDVELEDLCHS 180

Query: 181 PSSFCGFGDNNNEDGFFSVDTGDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA 240
           PSSFCGFGDNNNEDGFFSVDTGDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA
Sbjct: 181 PSSFCGFGDNNNEDGFFSVDTGDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA 240

Query: 241 EEARNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI 300
           EEARNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI
Sbjct: 241 EEARNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI 300

Query: 301 LRLQEEKAKIEMEARQYQRLIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEAFRKSF 360
           LRLQEEKAKIEMEARQYQRLIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEAFRKSF
Sbjct: 301 LRLQEEKAKIEMEARQYQRLIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEAFRKSF 360

Query: 361 FDNDGVSVDMLDSEFTPPWLECINGSITDKQSHELPSVESRNLTFEFGEESPSIQADEFA 420
           FDNDGVSVDMLDSEFTPPWLECINGSITDKQSHELPSVESRNLTFEFGEESPSIQADEFA
Sbjct: 361 FDNDGVSVDMLDSEFTPPWLECINGSITDKQSHELPSVESRNLTFEFGEESPSIQADEFA 420

Query: 421 DAAKASGVLLHQVADNFELSEEIDNELQGKGMVEDKNVYIVPGEVNELEPYPKSNVSNGL 480
           DAAKASGVLLHQVADNFELSEEIDNELQGKGMVEDKNVYIVPGEVNELEPYPKSNVSNGL
Sbjct: 421 DAAKASGVLLHQVADNFELSEEIDNELQGKGMVEDKNVYIVPGEVNELEPYPKSNVSNGL 480

Query: 481 DKVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGNHQKQWTRDFKSVNDTD 540
           DKVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGNHQKQWTRDFKSVNDTD
Sbjct: 481 DKVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGNHQKQWTRDFKSVNDTD 540

Query: 541 ACPHDIHVIEDEARMPNEASANARKETSVNGSSSIPVNCDSSSFSSLQNGLDITRSSSDA 600
           ACPHDIHVIEDEARMPNEASANARKETSVNGSSSIPVNCDSSSFSSLQNGLDITRSSSDA
Sbjct: 541 ACPHDIHVIEDEARMPNEASANARKETSVNGSSSIPVNCDSSSFSSLQNGLDITRSSSDA 600

Query: 601 TGRFPPMARSRSNSLRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEEREKLKLSVE 660
           TGRFPPMARSRSNSLRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEEREKLKLSVE
Sbjct: 601 TGRFPPMARSRSNSLRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEEREKLKLSVE 660

Query: 661 QKEKENNQLQLLENITK-------------------DVSKKRCWRSSSLSIHRSS 696
           QKEKENNQLQLLENITK                   DVSKKRCWRSSSLSIHRSS
Sbjct: 661 QKEKENNQLQLLENITKVLSDPGMAALQAPLPPSSKDVSKKRCWRSSSLSIHRSS 715

BLAST of Cp4.1LG03g17980 vs. NCBI nr
Match: XP_023528192.1 (uncharacterized protein LOC111791180 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1269 bits (3284), Expect = 0.0
Identity = 665/715 (93.01%), Positives = 665/715 (93.01%), Query Frame = 0

Query: 1   MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG 60
           MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG
Sbjct: 1   MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG 60

Query: 61  SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSMWDQEPKCCFKSMSVHERNAKEACVEFE 120
           SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSMWDQEPKCCFKSMSVHERNAKEACVEFE
Sbjct: 61  SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSMWDQEPKCCFKSMSVHERNAKEACVEFE 120

Query: 121 GEASGDSWFKTRSPRGMIYGDVLNVNESHYKCGVGGRKIASVSPNDVFQSDVELEDLCHS 180
           GEASGDSWFKTRSPRGMIYGDVLNVNESHYKCGVGGRKIASVSPNDVFQSDVELEDLCHS
Sbjct: 121 GEASGDSWFKTRSPRGMIYGDVLNVNESHYKCGVGGRKIASVSPNDVFQSDVELEDLCHS 180

Query: 181 PSSFCGFGDNNNEDGFFSVDTGDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA 240
           PSSFCGFGDNNNEDGFFSVDTG                               EMYGASA
Sbjct: 181 PSSFCGFGDNNNEDGFFSVDTG-------------------------------EMYGASA 240

Query: 241 EEARNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI 300
           EEARNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI
Sbjct: 241 EEARNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI 300

Query: 301 LRLQEEKAKIEMEARQYQRLIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEAFRKSF 360
           LRLQEEKAKIEMEARQYQRLIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEAFRKSF
Sbjct: 301 LRLQEEKAKIEMEARQYQRLIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEAFRKSF 360

Query: 361 FDNDGVSVDMLDSEFTPPWLECINGSITDKQSHELPSVESRNLTFEFGEESPSIQADEFA 420
           FDNDGVSVDMLDSEFTPPWLECINGSITDKQSHELPSVESRNLTFEFGEESPSIQADEFA
Sbjct: 361 FDNDGVSVDMLDSEFTPPWLECINGSITDKQSHELPSVESRNLTFEFGEESPSIQADEFA 420

Query: 421 DAAKASGVLLHQVADNFELSEEIDNELQGKGMVEDKNVYIVPGEVNELEPYPKSNVSNGL 480
           DAAKASGVLLHQVADNFELSEEIDNELQGKGMVEDKNVYIVPGEVNELEPYPKSNVSNGL
Sbjct: 421 DAAKASGVLLHQVADNFELSEEIDNELQGKGMVEDKNVYIVPGEVNELEPYPKSNVSNGL 480

Query: 481 DKVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGNHQKQWTRDFKSVNDTD 540
           DKVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGNHQKQWTRDFKSVNDTD
Sbjct: 481 DKVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGNHQKQWTRDFKSVNDTD 540

Query: 541 ACPHDIHVIEDEARMPNEASANARKETSVNGSSSIPVNCDSSSFSSLQNGLDITRSSSDA 600
           ACPHDIHVIEDEARMPNEASANARKETSVNGSSSIPVNCDSSSFSSLQNGLDITRSSSDA
Sbjct: 541 ACPHDIHVIEDEARMPNEASANARKETSVNGSSSIPVNCDSSSFSSLQNGLDITRSSSDA 600

Query: 601 TGRFPPMARSRSNSLRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEEREKLKLSVE 660
           TGRFPPMARSRSNSLRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEEREKLKLSVE
Sbjct: 601 TGRFPPMARSRSNSLRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEEREKLKLSVE 660

Query: 661 QKEKENNQLQLLENITK-------------------DVSKKRCWRSSSLSIHRSS 696
           QKEKENNQLQLLENITK                   DVSKKRCWRSSSLSIHRSS
Sbjct: 661 QKEKENNQLQLLENITKVLSDPGMAALQAPLPPSSKDVSKKRCWRSSSLSIHRSS 684

BLAST of Cp4.1LG03g17980 vs. NCBI nr
Match: XP_022924681.1 (uncharacterized protein LOC111432110 isoform X2 [Cucurbita moschata])

HSP 1 Score: 1268 bits (3282), Expect = 0.0
Identity = 667/717 (93.03%), Positives = 672/717 (93.72%), Query Frame = 0

Query: 1   MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG 60
           MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG
Sbjct: 1   MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG 60

Query: 61  SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSMWDQEPKCCFKSMSVHERNAKEACVEFE 120
           SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSMWDQEPKCCFKSMSVH+RNAKE CVEF+
Sbjct: 61  SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSMWDQEPKCCFKSMSVHKRNAKEVCVEFK 120

Query: 121 GEASGDSWFKTRSPRGMIYGDVLNVNESHYKCGVGGRKIASVSPNDVFQSDVELEDLCHS 180
           GEASGDSWFKTRSPRGMIYGDVLNVNES YK GVGGRKIASVSPNDVFQSDVELEDLCHS
Sbjct: 121 GEASGDSWFKTRSPRGMIYGDVLNVNESRYKGGVGGRKIASVSPNDVFQSDVELEDLCHS 180

Query: 181 PSSFCGFGDNNNEDGFFSVDTGDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA 240
           PSS CGFGDNNNEDGFFSVD+GDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA
Sbjct: 181 PSSLCGFGDNNNEDGFFSVDSGDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA 240

Query: 241 EEARNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI 300
           EE RNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI
Sbjct: 241 EETRNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI 300

Query: 301 LRLQEEKAKIEMEARQYQRLIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEAFRKSF 360
           LRLQEEKA IEMEARQYQR+IEEKTAYDAEEMSILKEILVRREQE HFLKKEVEAFRKS 
Sbjct: 301 LRLQEEKACIEMEARQYQRMIEEKTAYDAEEMSILKEILVRREQERHFLKKEVEAFRKSL 360

Query: 361 FDNDGVSVDMLDSEFTPPWLECINGSITDKQSHELPSVESRNLTFEFGEESPSIQADEFA 420
           FDNDGVSVDMLDSEFTP WL+CINGSITDKQSHELPSVESRNL FEFGEESPSIQA EFA
Sbjct: 361 FDNDGVSVDMLDSEFTPQWLQCINGSITDKQSHELPSVESRNLIFEFGEESPSIQAVEFA 420

Query: 421 DAAKASGVLLHQVADNFELSEEIDNELQGKGMVEDKNVYIVPGEVNELEPYPKSN--VSN 480
           DAAKASGVLLHQVAD FE SEEIDNELQGKGMVEDKNVYIVPGEVNELEPYPKSN  VSN
Sbjct: 421 DAAKASGVLLHQVAD-FERSEEIDNELQGKGMVEDKNVYIVPGEVNELEPYPKSNSNVSN 480

Query: 481 GLDKVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGNHQKQWTRDFKSVND 540
           GL KVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGNHQKQWTRDFKSVND
Sbjct: 481 GLGKVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGNHQKQWTRDFKSVND 540

Query: 541 TDACPHDIHVIEDEARMPNEASANARKETSVNGSSSIPVNCDSSSFSSLQNGLDITRSSS 600
           TDACPHDIHVIEDE  MPNEASANARKETSVNG S IPVNCDSSSFS LQNGLDITRSSS
Sbjct: 541 TDACPHDIHVIEDE--MPNEASANARKETSVNGCSIIPVNCDSSSFSLLQNGLDITRSSS 600

Query: 601 DATGRFPPMARSRSNSLRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEEREKLKLS 660
           DATGRFPPM RSRSNSLR ELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEEREKLK S
Sbjct: 601 DATGRFPPMTRSRSNSLRCELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEEREKLKFS 660

Query: 661 VEQKEKENNQLQLLENITK-------------------DVSKKRCWRSSSLSIHRSS 696
           VE KEKE NQLQLLENITK                   DVSKKRCWRSSSLSIHRSS
Sbjct: 661 VESKEKEINQLQLLENITKVLSDPGKAALQAPLPPSSKDVSKKRCWRSSSLSIHRSS 714

BLAST of Cp4.1LG03g17980 vs. NCBI nr
Match: XP_022924680.1 (uncharacterized protein LOC111432110 isoform X1 [Cucurbita moschata])

HSP 1 Score: 1259 bits (3257), Expect = 0.0
Identity = 667/731 (91.24%), Positives = 672/731 (91.93%), Query Frame = 0

Query: 1   MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG 60
           MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG
Sbjct: 1   MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG 60

Query: 61  SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSMWDQEPKCCFKSMSVHERNAKEACVEFE 120
           SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSMWDQEPKCCFKSMSVH+RNAKE CVEF+
Sbjct: 61  SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSMWDQEPKCCFKSMSVHKRNAKEVCVEFK 120

Query: 121 GEASGDSWFKTRSPRGMIYGDVLNVNESHYKCGVGGRKIASVSPNDVFQSDVELEDLCHS 180
           GEASGDSWFKTRSPRGMIYGDVLNVNES YK GVGGRKIASVSPNDVFQSDVELEDLCHS
Sbjct: 121 GEASGDSWFKTRSPRGMIYGDVLNVNESRYKGGVGGRKIASVSPNDVFQSDVELEDLCHS 180

Query: 181 PSSFCGFGDNNNEDGFFSVDTGDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA 240
           PSS CGFGDNNNEDGFFSVD+GDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA
Sbjct: 181 PSSLCGFGDNNNEDGFFSVDSGDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA 240

Query: 241 EEARNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI 300
           EE RNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI
Sbjct: 241 EETRNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI 300

Query: 301 LRLQEEKAKIEMEARQYQRLIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEAFRKSF 360
           LRLQEEKA IEMEARQYQR+IEEKTAYDAEEMSILKEILVRREQE HFLKKEVEAFRKS 
Sbjct: 301 LRLQEEKACIEMEARQYQRMIEEKTAYDAEEMSILKEILVRREQERHFLKKEVEAFRKSL 360

Query: 361 FDNDGVSVDMLDSEFTPPW--------------LECINGSITDKQSHELPSVESRNLTFE 420
           FDNDGVSVDMLDSEFTP W              L+CINGSITDKQSHELPSVESRNL FE
Sbjct: 361 FDNDGVSVDMLDSEFTPQWAPSSPCPTEDPSHVLQCINGSITDKQSHELPSVESRNLIFE 420

Query: 421 FGEESPSIQADEFADAAKASGVLLHQVADNFELSEEIDNELQGKGMVEDKNVYIVPGEVN 480
           FGEESPSIQA EFADAAKASGVLLHQVAD FE SEEIDNELQGKGMVEDKNVYIVPGEVN
Sbjct: 421 FGEESPSIQAVEFADAAKASGVLLHQVAD-FERSEEIDNELQGKGMVEDKNVYIVPGEVN 480

Query: 481 ELEPYPKSN--VSNGLDKVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGN 540
           ELEPYPKSN  VSNGL KVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGN
Sbjct: 481 ELEPYPKSNSNVSNGLGKVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGN 540

Query: 541 HQKQWTRDFKSVNDTDACPHDIHVIEDEARMPNEASANARKETSVNGSSSIPVNCDSSSF 600
           HQKQWTRDFKSVNDTDACPHDIHVIEDE  MPNEASANARKETSVNG S IPVNCDSSSF
Sbjct: 541 HQKQWTRDFKSVNDTDACPHDIHVIEDE--MPNEASANARKETSVNGCSIIPVNCDSSSF 600

Query: 601 SSLQNGLDITRSSSDATGRFPPMARSRSNSLRSELRRNSMSAVDYERSKIGNEVEWLRGR 660
           S LQNGLDITRSSSDATGRFPPM RSRSNSLR ELRRNSMSAVDYERSKIGNEVEWLRGR
Sbjct: 601 SLLQNGLDITRSSSDATGRFPPMTRSRSNSLRCELRRNSMSAVDYERSKIGNEVEWLRGR 660

Query: 661 LKIVQEEREKLKLSVEQKEKENNQLQLLENITK-------------------DVSKKRCW 696
           LKIVQEEREKLK SVE KEKE NQLQLLENITK                   DVSKKRCW
Sbjct: 661 LKIVQEEREKLKFSVESKEKEINQLQLLENITKVLSDPGKAALQAPLPPSSKDVSKKRCW 720

BLAST of Cp4.1LG03g17980 vs. NCBI nr
Match: KAG7018863.1 (Ubiquitin carboxyl-terminal hydrolase 2, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1250 bits (3235), Expect = 0.0
Identity = 651/691 (94.21%), Positives = 663/691 (95.95%), Query Frame = 0

Query: 1   MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG 60
           MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG
Sbjct: 1   MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG 60

Query: 61  SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSMWDQEPKCCFKSMSVHERNAKEACVEFE 120
           SDHCFQKLLVDCSSKR+SSVLHSTREKFPLDSMWDQEPKCCFKSMSVH+RNAKEACV+F+
Sbjct: 61  SDHCFQKLLVDCSSKRMSSVLHSTREKFPLDSMWDQEPKCCFKSMSVHKRNAKEACVQFK 120

Query: 121 GEASGDSWFKTRSPRGMIYGDVLNVNESHYKCGVGGRKIASVSPNDVFQSDVELEDLCHS 180
           GEASGDSWFKTRSPRGMIYGDVLNVNES YK GVGGRKIA VSPNDVFQSDVELEDLCHS
Sbjct: 121 GEASGDSWFKTRSPRGMIYGDVLNVNESRYKGGVGGRKIALVSPNDVFQSDVELEDLCHS 180

Query: 181 PSSFCGFGDNNNEDGFFSVDTGDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA 240
           PSS CGFGDNNNEDGFFSVD+GDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA
Sbjct: 181 PSSLCGFGDNNNEDGFFSVDSGDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA 240

Query: 241 EEARNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI 300
           EE RNNCRGELCLDGNESDRIK LAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI
Sbjct: 241 EETRNNCRGELCLDGNESDRIKPLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI 300

Query: 301 LRLQEEKAKIEMEARQYQRLIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEAFRKSF 360
           LRLQEEKA IEMEARQYQR+IEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEAFRKS 
Sbjct: 301 LRLQEEKACIEMEARQYQRMIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEAFRKSL 360

Query: 361 FDNDGVSVDMLDSEFTPPWLECINGSITDKQSHELPSVESRNLTFEFGEESPSIQADEFA 420
           FDNDGVSVDMLDSEFTPPWLECINGSITDKQSHELPSVESRNL FEFGEESPSIQA EFA
Sbjct: 361 FDNDGVSVDMLDSEFTPPWLECINGSITDKQSHELPSVESRNLIFEFGEESPSIQAVEFA 420

Query: 421 DAAKASGVLLHQVADNFELSEEIDNELQGKGMVEDKNVYIVPGEVNELEPYPKSN--VSN 480
           DAAKASGVLLHQVAD FE SEEIDNELQGKGMVEDKNVYIVPGEVNELEPYPKSN  VSN
Sbjct: 421 DAAKASGVLLHQVAD-FERSEEIDNELQGKGMVEDKNVYIVPGEVNELEPYPKSNSNVSN 480

Query: 481 GLDKVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGNHQKQWTRDFKSVND 540
           GL KVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGNHQKQWTRDFKSVND
Sbjct: 481 GLGKVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGNHQKQWTRDFKSVND 540

Query: 541 TDACPHDIHVIEDEARMPNEASANARKETSVNGSSSIPVNCDSSSFSSLQNGLDITRSSS 600
           T+ACPHDIHVIEDE  MPNEASANARKETSVNGSSSIPVNCDSSSFS LQNGLDITRSSS
Sbjct: 541 TNACPHDIHVIEDE--MPNEASANARKETSVNGSSSIPVNCDSSSFSLLQNGLDITRSSS 600

Query: 601 DATGRFPPMARSRSNSLRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEEREKLKLS 660
           DATGRFPPMARSRSNSLR ELRRNSMSAVDYERSKIG+EVEWLRGRLKIVQEEREKLK S
Sbjct: 601 DATGRFPPMARSRSNSLRCELRRNSMSAVDYERSKIGHEVEWLRGRLKIVQEEREKLKFS 660

Query: 661 VEQKEKENNQLQLLENITKDVSKKRCWRSSS 689
           VE KEKENNQLQLLENITK    K+  + ++
Sbjct: 661 VESKEKENNQLQLLENITKGTMGKKVKKKNN 688

BLAST of Cp4.1LG03g17980 vs. ExPASy TrEMBL
Match: A0A6J1EA46 (uncharacterized protein LOC111432110 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111432110 PE=4 SV=1)

HSP 1 Score: 1268 bits (3282), Expect = 0.0
Identity = 667/717 (93.03%), Positives = 672/717 (93.72%), Query Frame = 0

Query: 1   MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG 60
           MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG
Sbjct: 1   MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG 60

Query: 61  SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSMWDQEPKCCFKSMSVHERNAKEACVEFE 120
           SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSMWDQEPKCCFKSMSVH+RNAKE CVEF+
Sbjct: 61  SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSMWDQEPKCCFKSMSVHKRNAKEVCVEFK 120

Query: 121 GEASGDSWFKTRSPRGMIYGDVLNVNESHYKCGVGGRKIASVSPNDVFQSDVELEDLCHS 180
           GEASGDSWFKTRSPRGMIYGDVLNVNES YK GVGGRKIASVSPNDVFQSDVELEDLCHS
Sbjct: 121 GEASGDSWFKTRSPRGMIYGDVLNVNESRYKGGVGGRKIASVSPNDVFQSDVELEDLCHS 180

Query: 181 PSSFCGFGDNNNEDGFFSVDTGDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA 240
           PSS CGFGDNNNEDGFFSVD+GDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA
Sbjct: 181 PSSLCGFGDNNNEDGFFSVDSGDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA 240

Query: 241 EEARNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI 300
           EE RNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI
Sbjct: 241 EETRNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI 300

Query: 301 LRLQEEKAKIEMEARQYQRLIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEAFRKSF 360
           LRLQEEKA IEMEARQYQR+IEEKTAYDAEEMSILKEILVRREQE HFLKKEVEAFRKS 
Sbjct: 301 LRLQEEKACIEMEARQYQRMIEEKTAYDAEEMSILKEILVRREQERHFLKKEVEAFRKSL 360

Query: 361 FDNDGVSVDMLDSEFTPPWLECINGSITDKQSHELPSVESRNLTFEFGEESPSIQADEFA 420
           FDNDGVSVDMLDSEFTP WL+CINGSITDKQSHELPSVESRNL FEFGEESPSIQA EFA
Sbjct: 361 FDNDGVSVDMLDSEFTPQWLQCINGSITDKQSHELPSVESRNLIFEFGEESPSIQAVEFA 420

Query: 421 DAAKASGVLLHQVADNFELSEEIDNELQGKGMVEDKNVYIVPGEVNELEPYPKSN--VSN 480
           DAAKASGVLLHQVAD FE SEEIDNELQGKGMVEDKNVYIVPGEVNELEPYPKSN  VSN
Sbjct: 421 DAAKASGVLLHQVAD-FERSEEIDNELQGKGMVEDKNVYIVPGEVNELEPYPKSNSNVSN 480

Query: 481 GLDKVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGNHQKQWTRDFKSVND 540
           GL KVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGNHQKQWTRDFKSVND
Sbjct: 481 GLGKVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGNHQKQWTRDFKSVND 540

Query: 541 TDACPHDIHVIEDEARMPNEASANARKETSVNGSSSIPVNCDSSSFSSLQNGLDITRSSS 600
           TDACPHDIHVIEDE  MPNEASANARKETSVNG S IPVNCDSSSFS LQNGLDITRSSS
Sbjct: 541 TDACPHDIHVIEDE--MPNEASANARKETSVNGCSIIPVNCDSSSFSLLQNGLDITRSSS 600

Query: 601 DATGRFPPMARSRSNSLRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEEREKLKLS 660
           DATGRFPPM RSRSNSLR ELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEEREKLK S
Sbjct: 601 DATGRFPPMTRSRSNSLRCELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEEREKLKFS 660

Query: 661 VEQKEKENNQLQLLENITK-------------------DVSKKRCWRSSSLSIHRSS 696
           VE KEKE NQLQLLENITK                   DVSKKRCWRSSSLSIHRSS
Sbjct: 661 VESKEKEINQLQLLENITKVLSDPGKAALQAPLPPSSKDVSKKRCWRSSSLSIHRSS 714

BLAST of Cp4.1LG03g17980 vs. ExPASy TrEMBL
Match: A0A6J1EFQ3 (uncharacterized protein LOC111432110 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111432110 PE=4 SV=1)

HSP 1 Score: 1259 bits (3257), Expect = 0.0
Identity = 667/731 (91.24%), Positives = 672/731 (91.93%), Query Frame = 0

Query: 1   MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG 60
           MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG
Sbjct: 1   MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG 60

Query: 61  SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSMWDQEPKCCFKSMSVHERNAKEACVEFE 120
           SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSMWDQEPKCCFKSMSVH+RNAKE CVEF+
Sbjct: 61  SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSMWDQEPKCCFKSMSVHKRNAKEVCVEFK 120

Query: 121 GEASGDSWFKTRSPRGMIYGDVLNVNESHYKCGVGGRKIASVSPNDVFQSDVELEDLCHS 180
           GEASGDSWFKTRSPRGMIYGDVLNVNES YK GVGGRKIASVSPNDVFQSDVELEDLCHS
Sbjct: 121 GEASGDSWFKTRSPRGMIYGDVLNVNESRYKGGVGGRKIASVSPNDVFQSDVELEDLCHS 180

Query: 181 PSSFCGFGDNNNEDGFFSVDTGDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA 240
           PSS CGFGDNNNEDGFFSVD+GDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA
Sbjct: 181 PSSLCGFGDNNNEDGFFSVDSGDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA 240

Query: 241 EEARNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI 300
           EE RNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI
Sbjct: 241 EETRNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI 300

Query: 301 LRLQEEKAKIEMEARQYQRLIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEAFRKSF 360
           LRLQEEKA IEMEARQYQR+IEEKTAYDAEEMSILKEILVRREQE HFLKKEVEAFRKS 
Sbjct: 301 LRLQEEKACIEMEARQYQRMIEEKTAYDAEEMSILKEILVRREQERHFLKKEVEAFRKSL 360

Query: 361 FDNDGVSVDMLDSEFTPPW--------------LECINGSITDKQSHELPSVESRNLTFE 420
           FDNDGVSVDMLDSEFTP W              L+CINGSITDKQSHELPSVESRNL FE
Sbjct: 361 FDNDGVSVDMLDSEFTPQWAPSSPCPTEDPSHVLQCINGSITDKQSHELPSVESRNLIFE 420

Query: 421 FGEESPSIQADEFADAAKASGVLLHQVADNFELSEEIDNELQGKGMVEDKNVYIVPGEVN 480
           FGEESPSIQA EFADAAKASGVLLHQVAD FE SEEIDNELQGKGMVEDKNVYIVPGEVN
Sbjct: 421 FGEESPSIQAVEFADAAKASGVLLHQVAD-FERSEEIDNELQGKGMVEDKNVYIVPGEVN 480

Query: 481 ELEPYPKSN--VSNGLDKVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGN 540
           ELEPYPKSN  VSNGL KVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGN
Sbjct: 481 ELEPYPKSNSNVSNGLGKVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGN 540

Query: 541 HQKQWTRDFKSVNDTDACPHDIHVIEDEARMPNEASANARKETSVNGSSSIPVNCDSSSF 600
           HQKQWTRDFKSVNDTDACPHDIHVIEDE  MPNEASANARKETSVNG S IPVNCDSSSF
Sbjct: 541 HQKQWTRDFKSVNDTDACPHDIHVIEDE--MPNEASANARKETSVNGCSIIPVNCDSSSF 600

Query: 601 SSLQNGLDITRSSSDATGRFPPMARSRSNSLRSELRRNSMSAVDYERSKIGNEVEWLRGR 660
           S LQNGLDITRSSSDATGRFPPM RSRSNSLR ELRRNSMSAVDYERSKIGNEVEWLRGR
Sbjct: 601 SLLQNGLDITRSSSDATGRFPPMTRSRSNSLRCELRRNSMSAVDYERSKIGNEVEWLRGR 660

Query: 661 LKIVQEEREKLKLSVEQKEKENNQLQLLENITK-------------------DVSKKRCW 696
           LKIVQEEREKLK SVE KEKE NQLQLLENITK                   DVSKKRCW
Sbjct: 661 LKIVQEEREKLKFSVESKEKEINQLQLLENITKVLSDPGKAALQAPLPPSSKDVSKKRCW 720

BLAST of Cp4.1LG03g17980 vs. ExPASy TrEMBL
Match: A0A6J1IQU9 (uncharacterized protein LOC111479667 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111479667 PE=4 SV=1)

HSP 1 Score: 1240 bits (3209), Expect = 0.0
Identity = 648/715 (90.63%), Positives = 663/715 (92.73%), Query Frame = 0

Query: 1   MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG 60
           MACEA+QLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFG+LG
Sbjct: 1   MACEAIQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGDLG 60

Query: 61  SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSMWDQEPKCCFKSMSVHERNAKEACVEFE 120
           SDHCFQKLLVDCSSK+ISSVLHSTREKFPLDSMWDQEPKCCFKSMSVH+RN KEACVEF+
Sbjct: 61  SDHCFQKLLVDCSSKKISSVLHSTREKFPLDSMWDQEPKCCFKSMSVHKRNVKEACVEFK 120

Query: 121 GEASGDSWFKTRSPRGMIYGDVLNVNESHYKCGVGGRKIASVSPNDVFQSDVELEDLCHS 180
            EASGDSWFKTRSPRGMIYGDVLN+NES YK GVGGRKIASVSPNDVFQSDVELEDLCHS
Sbjct: 121 CEASGDSWFKTRSPRGMIYGDVLNMNESRYKGGVGGRKIASVSPNDVFQSDVELEDLCHS 180

Query: 181 PSSFCGFGDNNNEDGFFSVDTGDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA 240
           PSSFCGFGDNNNEDGFFSVD+GDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA
Sbjct: 181 PSSFCGFGDNNNEDGFFSVDSGDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA 240

Query: 241 EEARNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI 300
           EEARNNCRGE CLDGNESDRIKLL QSLEEEQKARATLYLELEKERSAAATAADEAMAMI
Sbjct: 241 EEARNNCRGEYCLDGNESDRIKLLEQSLEEEQKARATLYLELEKERSAAATAADEAMAMI 300

Query: 301 LRLQEEKAKIEMEARQYQRLIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEAFRKSF 360
           LRLQEEKA IEMEARQYQR+IEEKTAYDAEEMSILKEILVRREQEMHFLKKEV AFR+SF
Sbjct: 301 LRLQEEKASIEMEARQYQRMIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVGAFRRSF 360

Query: 361 FDNDGVSVDMLDSEFTPPWLECINGSITDKQSHELPSVESRNLTFEFGEESPSIQADEFA 420
           FDN GVSVDMLD+EFTPPWL+CINGSITDKQSHELPSVESRNL FEFGEESPSIQA EFA
Sbjct: 361 FDNGGVSVDMLDTEFTPPWLQCINGSITDKQSHELPSVESRNLIFEFGEESPSIQAVEFA 420

Query: 421 DAAKASGVLLHQVADNFELSEEIDNELQGKGMVEDKNVYIVPGEVNELEPYPKSNVSNGL 480
           DAAKA G+LLHQVADNFE  EEID ELQGKGMVEDKN+YIVPGEVNELEPYPKSNVSN L
Sbjct: 421 DAAKARGMLLHQVADNFEGGEEID-ELQGKGMVEDKNLYIVPGEVNELEPYPKSNVSNDL 480

Query: 481 DKVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGNHQKQWTRDFKSVNDTD 540
            KVEQCTE T DEQEKVY DS DGL  AE ALPCVEYNLEK  +HQKQWTRDFKSVNDTD
Sbjct: 481 GKVEQCTEFTLDEQEKVYKDSLDGLESAETALPCVEYNLEKQWDHQKQWTRDFKSVNDTD 540

Query: 541 ACPHDIHVIEDEARMPNEASANARKETSVNGSSSIPVNCDSSSFSSLQNGLDITRSSSDA 600
           ACPHDIHVIEDEARM NEASANAR+ET VNGSSSIPVNCDSSSFS LQN LDITRSSSDA
Sbjct: 541 ACPHDIHVIEDEARMHNEASANAREETLVNGSSSIPVNCDSSSFSLLQNELDITRSSSDA 600

Query: 601 TGRFPPMARSRSNSLRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEEREKLKLSVE 660
           TGRFPPMARSRSNSLRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEEREK KLSVE
Sbjct: 601 TGRFPPMARSRSNSLRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEEREKHKLSVE 660

Query: 661 QKEKENNQLQLLENITK-------------------DVSKKRCWRSSSLSIHRSS 696
            KEKENNQLQLLENITK                   DVSKKRCWRSSSL IHRSS
Sbjct: 661 SKEKENNQLQLLENITKVLSDPGKAALQAPLPPSSKDVSKKRCWRSSSLGIHRSS 714

BLAST of Cp4.1LG03g17980 vs. ExPASy TrEMBL
Match: A0A6J1IVQ3 (uncharacterized protein LOC111479667 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111479667 PE=4 SV=1)

HSP 1 Score: 1231 bits (3184), Expect = 0.0
Identity = 648/729 (88.89%), Positives = 663/729 (90.95%), Query Frame = 0

Query: 1   MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG 60
           MACEA+QLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFG+LG
Sbjct: 1   MACEAIQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGDLG 60

Query: 61  SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSMWDQEPKCCFKSMSVHERNAKEACVEFE 120
           SDHCFQKLLVDCSSK+ISSVLHSTREKFPLDSMWDQEPKCCFKSMSVH+RN KEACVEF+
Sbjct: 61  SDHCFQKLLVDCSSKKISSVLHSTREKFPLDSMWDQEPKCCFKSMSVHKRNVKEACVEFK 120

Query: 121 GEASGDSWFKTRSPRGMIYGDVLNVNESHYKCGVGGRKIASVSPNDVFQSDVELEDLCHS 180
            EASGDSWFKTRSPRGMIYGDVLN+NES YK GVGGRKIASVSPNDVFQSDVELEDLCHS
Sbjct: 121 CEASGDSWFKTRSPRGMIYGDVLNMNESRYKGGVGGRKIASVSPNDVFQSDVELEDLCHS 180

Query: 181 PSSFCGFGDNNNEDGFFSVDTGDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA 240
           PSSFCGFGDNNNEDGFFSVD+GDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA
Sbjct: 181 PSSFCGFGDNNNEDGFFSVDSGDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA 240

Query: 241 EEARNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI 300
           EEARNNCRGE CLDGNESDRIKLL QSLEEEQKARATLYLELEKERSAAATAADEAMAMI
Sbjct: 241 EEARNNCRGEYCLDGNESDRIKLLEQSLEEEQKARATLYLELEKERSAAATAADEAMAMI 300

Query: 301 LRLQEEKAKIEMEARQYQRLIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEAFRKSF 360
           LRLQEEKA IEMEARQYQR+IEEKTAYDAEEMSILKEILVRREQEMHFLKKEV AFR+SF
Sbjct: 301 LRLQEEKASIEMEARQYQRMIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVGAFRRSF 360

Query: 361 FDNDGVSVDMLDSEFTPPW--------------LECINGSITDKQSHELPSVESRNLTFE 420
           FDN GVSVDMLD+EFTPPW              L+CINGSITDKQSHELPSVESRNL FE
Sbjct: 361 FDNGGVSVDMLDTEFTPPWAPSSPYPTEDPSHMLQCINGSITDKQSHELPSVESRNLIFE 420

Query: 421 FGEESPSIQADEFADAAKASGVLLHQVADNFELSEEIDNELQGKGMVEDKNVYIVPGEVN 480
           FGEESPSIQA EFADAAKA G+LLHQVADNFE  EEID ELQGKGMVEDKN+YIVPGEVN
Sbjct: 421 FGEESPSIQAVEFADAAKARGMLLHQVADNFEGGEEID-ELQGKGMVEDKNLYIVPGEVN 480

Query: 481 ELEPYPKSNVSNGLDKVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGNHQ 540
           ELEPYPKSNVSN L KVEQCTE T DEQEKVY DS DGL  AE ALPCVEYNLEK  +HQ
Sbjct: 481 ELEPYPKSNVSNDLGKVEQCTEFTLDEQEKVYKDSLDGLESAETALPCVEYNLEKQWDHQ 540

Query: 541 KQWTRDFKSVNDTDACPHDIHVIEDEARMPNEASANARKETSVNGSSSIPVNCDSSSFSS 600
           KQWTRDFKSVNDTDACPHDIHVIEDEARM NEASANAR+ET VNGSSSIPVNCDSSSFS 
Sbjct: 541 KQWTRDFKSVNDTDACPHDIHVIEDEARMHNEASANAREETLVNGSSSIPVNCDSSSFSL 600

Query: 601 LQNGLDITRSSSDATGRFPPMARSRSNSLRSELRRNSMSAVDYERSKIGNEVEWLRGRLK 660
           LQN LDITRSSSDATGRFPPMARSRSNSLRSELRRNSMSAVDYERSKIGNEVEWLRGRLK
Sbjct: 601 LQNELDITRSSSDATGRFPPMARSRSNSLRSELRRNSMSAVDYERSKIGNEVEWLRGRLK 660

Query: 661 IVQEEREKLKLSVEQKEKENNQLQLLENITK-------------------DVSKKRCWRS 696
           IVQEEREK KLSVE KEKENNQLQLLENITK                   DVSKKRCWRS
Sbjct: 661 IVQEEREKHKLSVESKEKENNQLQLLENITKVLSDPGKAALQAPLPPSSKDVSKKRCWRS 720

BLAST of Cp4.1LG03g17980 vs. ExPASy TrEMBL
Match: A0A6J1E9N5 (uncharacterized protein LOC111432110 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111432110 PE=4 SV=1)

HSP 1 Score: 1181 bits (3055), Expect = 0.0
Identity = 636/731 (87.00%), Positives = 641/731 (87.69%), Query Frame = 0

Query: 1   MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG 60
           MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG
Sbjct: 1   MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG 60

Query: 61  SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSMWDQEPKCCFKSMSVHERNAKEACVEFE 120
           SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSMWDQEPKCCFKSMSVH+RNAKE CVEF+
Sbjct: 61  SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSMWDQEPKCCFKSMSVHKRNAKEVCVEFK 120

Query: 121 GEASGDSWFKTRSPRGMIYGDVLNVNESHYKCGVGGRKIASVSPNDVFQSDVELEDLCHS 180
           GEASGDSWFKTRSPRGMIYGDVLNVNES YK GVGGRKIASVSPNDVFQSDVELEDLCHS
Sbjct: 121 GEASGDSWFKTRSPRGMIYGDVLNVNESRYKGGVGGRKIASVSPNDVFQSDVELEDLCHS 180

Query: 181 PSSFCGFGDNNNEDGFFSVDTGDEGEASLDNSNQYKVFPDLELDDSYDEKICAEMYGASA 240
           PSS CGFGDNNNEDGFFSVD+G                               EMYGASA
Sbjct: 181 PSSLCGFGDNNNEDGFFSVDSG-------------------------------EMYGASA 240

Query: 241 EEARNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI 300
           EE RNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI
Sbjct: 241 EETRNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMI 300

Query: 301 LRLQEEKAKIEMEARQYQRLIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEAFRKSF 360
           LRLQEEKA IEMEARQYQR+IEEKTAYDAEEMSILKEILVRREQE HFLKKEVEAFRKS 
Sbjct: 301 LRLQEEKACIEMEARQYQRMIEEKTAYDAEEMSILKEILVRREQERHFLKKEVEAFRKSL 360

Query: 361 FDNDGVSVDMLDSEFTPPW--------------LECINGSITDKQSHELPSVESRNLTFE 420
           FDNDGVSVDMLDSEFTP W              L+CINGSITDKQSHELPSVESRNL FE
Sbjct: 361 FDNDGVSVDMLDSEFTPQWAPSSPCPTEDPSHVLQCINGSITDKQSHELPSVESRNLIFE 420

Query: 421 FGEESPSIQADEFADAAKASGVLLHQVADNFELSEEIDNELQGKGMVEDKNVYIVPGEVN 480
           FGEESPSIQA EFADAAKASGVLLHQVAD FE SEEIDNELQGKGMVEDKNVYIVPGEVN
Sbjct: 421 FGEESPSIQAVEFADAAKASGVLLHQVAD-FERSEEIDNELQGKGMVEDKNVYIVPGEVN 480

Query: 481 ELEPYPKSN--VSNGLDKVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGN 540
           ELEPYPKSN  VSNGL KVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGN
Sbjct: 481 ELEPYPKSNSNVSNGLGKVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGN 540

Query: 541 HQKQWTRDFKSVNDTDACPHDIHVIEDEARMPNEASANARKETSVNGSSSIPVNCDSSSF 600
           HQKQWTRDFKSVNDTDACPHDIHVIEDE  MPNEASANARKETSVNG S IPVNCDSSSF
Sbjct: 541 HQKQWTRDFKSVNDTDACPHDIHVIEDE--MPNEASANARKETSVNGCSIIPVNCDSSSF 600

Query: 601 SSLQNGLDITRSSSDATGRFPPMARSRSNSLRSELRRNSMSAVDYERSKIGNEVEWLRGR 660
           S LQNGLDITRSSSDATGRFPPM RSRSNSLR ELRRNSMSAVDYERSKIGNEVEWLRGR
Sbjct: 601 SLLQNGLDITRSSSDATGRFPPMTRSRSNSLRCELRRNSMSAVDYERSKIGNEVEWLRGR 660

Query: 661 LKIVQEEREKLKLSVEQKEKENNQLQLLENITK-------------------DVSKKRCW 696
           LKIVQEEREKLK SVE KEKE NQLQLLENITK                   DVSKKRCW
Sbjct: 661 LKIVQEEREKLKFSVESKEKEINQLQLLENITKVLSDPGKAALQAPLPPSSKDVSKKRCW 697

BLAST of Cp4.1LG03g17980 vs. TAIR 10
Match: AT4G13630.1 (Protein of unknown function, DUF593 )

HSP 1 Score: 226.9 bits (577), Expect = 5.1e-59
Identity = 223/674 (33.09%), Positives = 325/674 (48.22%), Query Frame = 0

Query: 1   MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG 60
           M C+ V+ WTF GLVAAF+DL +AF LLCA+ +V+ TSKFL LFG  LPCPCDGL+    
Sbjct: 1   MRCQEVKSWTFKGLVAAFIDLSVAFSLLCASFIVYVTSKFLGLFGLDLPCPCDGLY---- 60

Query: 61  SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSM---WDQEPKCCFKSMSVHER--NAKEA 120
              CFQ+ L +   K+ISSV  S + + P DS+     ++ KC  + + + +   +   +
Sbjct: 61  -SECFQESLRNLPVKKISSVQRSVKNRTPFDSILYNGGKKRKCERRRVQLEDEVSSTTPS 120

Query: 121 CVEFEGEASGDSWFKTRSPRGMIYGDVLNVNESHYKCGVGGRKIASVSPNDVFQSDVELE 180
             +FE +ASG      +S +   +  V +   S ++   G +        + +QS     
Sbjct: 121 VGKFENKASGFDLLTAQSLKKGSF-KVKSKRLSFHRSPYGFK--------NHYQS----- 180

Query: 181 DLCHSPSSFCGFGDNNNEDGFFSVDTGDEGEASLDNSNQYKVFPDLELDDSYDEKICAEM 240
             C    SF G  D N+      V++ D G+A             LE D S  + +    
Sbjct: 181 --CLGLKSFEGSYDENDP---LLVNSNDSGKA-------------LE-DVSLRKSVSLSS 240

Query: 241 YGASAEEARNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADE 300
            G    E      G     G     +++  Q L EE+ ARA+L LELEKER+AAA+AADE
Sbjct: 241 VGCGVGE------GACSSPGMVQRTVEMSEQVLGEERAARASLALELEKERNAAASAADE 300

Query: 301 AMAMILRLQEEKAKIEMEARQYQRLIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEA 360
           A+ MILRLQEEKA IEMEARQYQR+IEEK+A+DAEEMSILKEIL+RRE+E HFL+KEV+ 
Sbjct: 301 ALGMILRLQEEKASIEMEARQYQRMIEEKSAFDAEEMSILKEILLRREREKHFLEKEVDT 360

Query: 361 FRKSFFDNDGVSVDMLDSEFTPPWLECINGSITDKQSHELPSVESRNLTFEFGEESPSIQ 420
           +R+ F + +                          Q H  P  +S+    E   ++P   
Sbjct: 361 YRQMFLETE--------------------------QPHNTP--DSKPAQIE-RLQTPQQI 420

Query: 421 ADEFADAAKASGVLLHQVADNFEL-SEEIDNELQGKGMVEDKNVYIVPGEVNELEPYPKS 480
            + + D    +      V+  FE+ + ++DN                  + +  EP+   
Sbjct: 421 TEPWDDMETVN------VSSGFEIFTNQMDN---------------TECDQSRSEPF--- 480

Query: 481 NVSNGLDKVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGNHQKQWTRDFK 540
                 D +        +E E +Y         AE+     +  + K         +  +
Sbjct: 481 ------DALSDIELKEREEGETLY---------AELVSRTSDIAVSK---------KLCE 529

Query: 541 SVNDTDACPHDIHVIEDEARMPNEASANARKETSVNGSSSIPVNCDSSSFSSLQNGLDIT 600
             +D D   HDIHV+ DE                  G  ++P     S  ++    LD +
Sbjct: 541 DPHDIDCHVHDIHVVTDEDN---------------KGQLNVP-----SDHATQDLKLDRS 529

Query: 601 RSSSDATGRFPPMARSRSNSLRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEEREK 660
           +S SD +  FP   + +SN + + +RRNSMSA+DYER KI +EV  LRGRL+ VQ+ REK
Sbjct: 601 QSVSDTSYAFP---QGKSN-MSTNMRRNSMSAIDYERLKIESEVGLLRGRLRAVQKGREK 529

Query: 661 LKLSVEQKEKENNQ 669
           +  S + + K   Q
Sbjct: 661 ISFSSKDQSKSQVQ 529

BLAST of Cp4.1LG03g17980 vs. TAIR 10
Match: AT4G13630.2 (Protein of unknown function, DUF593 )

HSP 1 Score: 226.9 bits (577), Expect = 5.1e-59
Identity = 223/674 (33.09%), Positives = 325/674 (48.22%), Query Frame = 0

Query: 1   MACEAVQLWTFNGLVAAFLDLGIAFLLLCATSLVFFTSKFLALFGSCLPCPCDGLFGNLG 60
           M C+ V+ WTF GLVAAF+DL +AF LLCA+ +V+ TSKFL LFG  LPCPCDGL+    
Sbjct: 1   MRCQEVKSWTFKGLVAAFIDLSVAFSLLCASFIVYVTSKFLGLFGLDLPCPCDGLY---- 60

Query: 61  SDHCFQKLLVDCSSKRISSVLHSTREKFPLDSM---WDQEPKCCFKSMSVHER--NAKEA 120
              CFQ+ L +   K+ISSV  S + + P DS+     ++ KC  + + + +   +   +
Sbjct: 61  -SECFQESLRNLPVKKISSVQRSVKNRTPFDSILYNGGKKRKCERRRVQLEDEVSSTTPS 120

Query: 121 CVEFEGEASGDSWFKTRSPRGMIYGDVLNVNESHYKCGVGGRKIASVSPNDVFQSDVELE 180
             +FE +ASG      +S +   +  V +   S ++   G +        + +QS     
Sbjct: 121 VGKFENKASGFDLLTAQSLKKGSF-KVKSKRLSFHRSPYGFK--------NHYQS----- 180

Query: 181 DLCHSPSSFCGFGDNNNEDGFFSVDTGDEGEASLDNSNQYKVFPDLELDDSYDEKICAEM 240
             C    SF G  D N+      V++ D G+A             LE D S  + +    
Sbjct: 181 --CLGLKSFEGSYDENDP---LLVNSNDSGKA-------------LE-DVSLRKSVSLSS 240

Query: 241 YGASAEEARNNCRGELCLDGNESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADE 300
            G    E      G     G     +++  Q L EE+ ARA+L LELEKER+AAA+AADE
Sbjct: 241 VGCGVGE------GACSSPGMVQRTVEMSEQVLGEERAARASLALELEKERNAAASAADE 300

Query: 301 AMAMILRLQEEKAKIEMEARQYQRLIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEA 360
           A+ MILRLQEEKA IEMEARQYQR+IEEK+A+DAEEMSILKEIL+RRE+E HFL+KEV+ 
Sbjct: 301 ALGMILRLQEEKASIEMEARQYQRMIEEKSAFDAEEMSILKEILLRREREKHFLEKEVDT 360

Query: 361 FRKSFFDNDGVSVDMLDSEFTPPWLECINGSITDKQSHELPSVESRNLTFEFGEESPSIQ 420
           +R+ F + +                          Q H  P  +S+    E   ++P   
Sbjct: 361 YRQMFLETE--------------------------QPHNTP--DSKPAQIE-RLQTPQQI 420

Query: 421 ADEFADAAKASGVLLHQVADNFEL-SEEIDNELQGKGMVEDKNVYIVPGEVNELEPYPKS 480
            + + D    +      V+  FE+ + ++DN                  + +  EP+   
Sbjct: 421 TEPWDDMETVN------VSSGFEIFTNQMDN---------------TECDQSRSEPF--- 480

Query: 481 NVSNGLDKVEQCTELTADEQEKVYDDSFDGLAFAEIALPCVEYNLEKHGNHQKQWTRDFK 540
                 D +        +E E +Y         AE+     +  + K         +  +
Sbjct: 481 ------DALSDIELKEREEGETLY---------AELVSRTSDIAVSK---------KLCE 529

Query: 541 SVNDTDACPHDIHVIEDEARMPNEASANARKETSVNGSSSIPVNCDSSSFSSLQNGLDIT 600
             +D D   HDIHV+ DE                  G  ++P     S  ++    LD +
Sbjct: 541 DPHDIDCHVHDIHVVTDEDN---------------KGQLNVP-----SDHATQDLKLDRS 529

Query: 601 RSSSDATGRFPPMARSRSNSLRSELRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEEREK 660
           +S SD +  FP   + +SN + + +RRNSMSA+DYER KI +EV  LRGRL+ VQ+ REK
Sbjct: 601 QSVSDTSYAFP---QGKSN-MSTNMRRNSMSAIDYERLKIESEVGLLRGRLRAVQKGREK 529

Query: 661 LKLSVEQKEKENNQ 669
           +  S + + K   Q
Sbjct: 661 ISFSSKDQSKSQVQ 529

BLAST of Cp4.1LG03g17980 vs. TAIR 10
Match: AT1G04890.1 (Protein of unknown function, DUF593 )

HSP 1 Score: 137.9 bits (346), Expect = 3.1e-32
Identity = 148/444 (33.33%), Positives = 201/444 (45.27%), Query Frame = 0

Query: 257 ESDRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMILRLQEEKAKIEMEARQ 316
           E   ++ L + L+EE+ ARAT+ +EL+KERSAAA+AADEAMAMI RLQ+EKA IEMEARQ
Sbjct: 107 EKRSVRDLEELLKEERAARATVCVELDKERSAAASAADEAMAMIHRLQDEKAAIEMEARQ 166

Query: 317 YQRLIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEAFRKSFFDNDGVSVDMLDSEFT 376
           +QRL+EE++ +DAEEM ILK+IL+RRE+E HFL+KEVEA+R+   + +            
Sbjct: 167 FQRLVEERSTFDAEEMVILKDILIRREREKHFLEKEVEAYRQLLEETEE----------- 226

Query: 377 PPWLECINGSITDKQSHELPSVESRNLTFEFGEESPSIQADEFADAAKASGVLLHQVADN 436
              LEC                                                      
Sbjct: 227 ---LEC------------------------------------------------------ 286

Query: 437 FELSEEIDNELQGKGMVEDKNVYIVPGEVNELEPYPKSNVSNGLDKVEQCTELTADEQEK 496
                          ++++KNV          EP  K N        + C E  A     
Sbjct: 287 --------------SLIKEKNV---------PEPEHKQN--------KDCQERRA----- 346

Query: 497 VYDDSFDGLAFAEIALPCVEYNLEKHGNHQKQWTRD-FKSVNDTD-ACPHDIHVIEDEAR 556
                   L   E+    ++    + GN  K   RD +KS ++   +   D+++++DE  
Sbjct: 347 --------LLVQELDGTVLDMPYREEGNRDK--NRDLYKSDSEVAYSRVRDVYMVKDE-- 406

Query: 557 MPNEASANARKETSVNGSSSIPVNCDSSSFSSLQNGLDITR-SSSDATGRFPPMARSRSN 616
                + N  K+           N + SS    +  LD      S    + PP+ R R  
Sbjct: 407 -----TENISKKK----------NLEESSVGKPKESLDENSIIVSGIARKLPPLCRPRKK 411

Query: 617 SLRSE-LRRNSMSAVDYERSKIGNEVEWLRGRLKIVQEEREKLKLSVEQKEKENNQLQLL 676
           SL S   RR SMSAVDYER KI NEVE LR RLK VQEERE+L             L  L
Sbjct: 467 SLSSSGSRRKSMSAVDYERLKIENEVELLRERLKAVQEEREEL--------TRRASLPPL 411

Query: 677 ENITKDVSKKRCW-RSSSLSIHRS 696
            +  +  S+KR W RS S+ +H S
Sbjct: 527 PSKVRATSEKRSWTRSCSMDLHSS 411

BLAST of Cp4.1LG03g17980 vs. TAIR 10
Match: AT4G13160.1 (Protein of unknown function, DUF593 )

HSP 1 Score: 116.3 bits (290), Expect = 9.7e-26
Identity = 61/99 (61.62%), Positives = 86/99 (86.87%), Query Frame = 0

Query: 259 DRIKLLAQSLEEEQKARATLYLELEKERSAAATAADEAMAMILRLQEEKAKIEMEARQYQ 318
           DR++LL  ++E+E+ A+A L +ELE+ER+A+A+AADEAMAMILRLQ +KA +EME +QY+
Sbjct: 112 DRVRLLEVAVEQEKVAKAALMVELEQERAASASAADEAMAMILRLQADKASLEMEGKQYE 171

Query: 319 RLIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEAFR 358
           R+I+EK AYD EEM+ILKEIL +RE+E HFL+KE+E ++
Sbjct: 172 RMIDEKFAYDEEEMNILKEILFKREREKHFLEKELETYK 210

BLAST of Cp4.1LG03g17980 vs. TAIR 10
Match: AT5G16720.1 (Protein of unknown function, DUF593 )

HSP 1 Score: 91.3 bits (225), Expect = 3.4e-18
Identity = 56/121 (46.28%), Positives = 86/121 (71.07%), Query Frame = 0

Query: 238 ASAEEARNNCRGELCLDGNESDR-IKLLAQSLEEEQKARATLYLELEKERSAAATAADEA 297
           A+AE+A +       +DG +  R I+ L +++  EQ+A   LY ELE+ERSA+A +A++ 
Sbjct: 333 AAAEDAGDGNVLVSEMDGGDPLRTIERLRETVRAEQEALRDLYAELEEERSASAISANQT 392

Query: 298 MAMILRLQEEKAKIEMEARQYQRLIEEKTAYDAEEMSILKEILVRREQEMHFLKKEVEAF 357
           MAMI RLQEEKAK++MEA QYQR++EE+  YD E + +L  ++V+RE+E   L++E+E +
Sbjct: 393 MAMITRLQEEKAKVQMEALQYQRMMEEQAEYDQEALQLLNHLMVKREKEKEQLQRELEVY 452

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q0WNW44.7e-1746.28Myosin-binding protein 3 OS=Arabidopsis thaliana OX=3702 GN=MYOB3 PE=1 SV=1[more]
Q9CAC42.0e-1548.31Myosin-binding protein 2 OS=Arabidopsis thaliana OX=3702 GN=MYOB2 PE=1 SV=1[more]
Q9LMC83.7e-1433.33Probable myosin-binding protein 5 OS=Arabidopsis thaliana OX=3702 GN=MYOB5 PE=2 ... [more]
F4HVS64.9e-1446.39Probable myosin-binding protein 6 OS=Arabidopsis thaliana OX=3702 GN=MYOB6 PE=2 ... [more]
Q9FG144.1e-1342.42Myosin-binding protein 7 OS=Arabidopsis thaliana OX=3702 GN=MYOB7 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
XP_023528191.10.097.34uncharacterized protein LOC111791180 isoform X1 [Cucurbita pepo subsp. pepo][more]
XP_023528192.10.093.01uncharacterized protein LOC111791180 isoform X2 [Cucurbita pepo subsp. pepo][more]
XP_022924681.10.093.03uncharacterized protein LOC111432110 isoform X2 [Cucurbita moschata][more]
XP_022924680.10.091.24uncharacterized protein LOC111432110 isoform X1 [Cucurbita moschata][more]
KAG7018863.10.094.21Ubiquitin carboxyl-terminal hydrolase 2, partial [Cucurbita argyrosperma subsp. ... [more]
Match NameE-valueIdentityDescription
A0A6J1EA460.093.03uncharacterized protein LOC111432110 isoform X2 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1EFQ30.091.24uncharacterized protein LOC111432110 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1IQU90.090.63uncharacterized protein LOC111479667 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1IVQ30.088.89uncharacterized protein LOC111479667 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1E9N50.087.00uncharacterized protein LOC111432110 isoform X3 OS=Cucurbita moschata OX=3662 GN... [more]
Match NameE-valueIdentityDescription
AT4G13630.15.1e-5933.09Protein of unknown function, DUF593 [more]
AT4G13630.25.1e-5933.09Protein of unknown function, DUF593 [more]
AT1G04890.13.1e-3233.33Protein of unknown function, DUF593 [more]
AT4G13160.19.7e-2661.62Protein of unknown function, DUF593 [more]
AT5G16720.13.4e-1846.28Protein of unknown function, DUF593 [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description     Source       Source Term    Source Description     Alignment
None       No IPR available    COILS        Coil           Coil                   coord: 645..682
None       No IPR available    COILS        Coil           Coil                   coord: 293..323
None       No IPR available    MOBIDB_LITE  mobidb-lite    disorder_prediction    coord: 595..622
None       No IPR available    MOBIDB_LITE  mobidb-lite    disorder_prediction    coord: 595..621
None       No IPR available    PANTHER      PTHR31422:SF3  BNAANNG28530D PROTEIN  coord: 3..680
None       No IPR available    PANTHER      PTHR31422      BNAANNG28530D PROTEIN  coord: 3..680
IPR007656  GTD-binding domain  PFAM         PF04576        Zein-binding           coord: 264..353; e-value: 5.9E-32; score: 109.8
IPR007656  GTD-binding domain  PROSITE      PS51775        GTD_BINDING            coord: 259..357; score: 20.076521

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG03g17980.1  Cp4.1LG03g17980.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0016579      protein deubiquitination
biological_process  GO:0006511      ubiquitin-dependent protein catabolic process
cellular_component  GO:0016021      integral component of membrane
molecular_function  GO:0080115      myosin XI tail binding
molecular_function  GO:0004843      thiol-dependent deubiquitinase
molecular_function  GO:0008270      zinc ion binding