Cp4.1LG03g17440 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG03g17440
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionWD-40 repeat-containing protein MSI1-like
LocationCp4.1LG03: 12588943 .. 12592383 (-)
RNA-Seq ExpressionCp4.1LG03g17440
SyntenyCp4.1LG03g17440
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAAGCTAACACGAAAATTTGAAGGGAGTTATAACTAAAGCACTACCTCTTCTTGAACACATATTGGCGGGAACTTCGCCGGGGAACTGAGCTCCCAACTTTGCTTCTAACCGACGAAATCGACCCTCAAACTCTGCAAATTTTGCTTCATGAAGAAGACAATGGAGAAGGGCGATGAAGAGATACGTGGAAAAACGAACGAGAGACTGGTAAATGAGGAGTATAAGATTTGGAAGAAGAATACTCCATTTCTTTACGATTTGATCATCACTCATGCCCTAGAGTGGCCTTCACTCACCGTCGAGTGGTTGCCGGACCGGGATGAGCCTCCGGGAAAGGATTACTCCGTTCAGAAGATGATTTTGGGGACTCATAGCTTCGATAATAAGCCGAATTACCTCATGGTTGCTCAGGTTCAGCTTTCGCTTGAGAATTCGGAGAACGATGCGCGACATTACCGTGACGATCGTGCCAGTGCGGGTGGCTTTGGGTGTGCGAACGGCAAGGTATGTGGATTTGAACCTTTTCTTTAGAACTGGAAGAACGTGAAATTGAATCTTGAATCTGAATTTTATTGCTTGATAGAACTGATCTGGAATCGGACGGAATTTTCATCTTGAGTAGTTCTTTTTTAGTTCCTGTAATGCTTTCAAAGGAAAAAGTTTAAGATAGGGGGATTCAATTGATACTGCAATTGATAGATCTTGAGTAGATCGATTGATGAATTCTTGTTGTAAAGATGCAAATTTCATTGAATCTTTTCTCAGTTCAGGTACAAATAATCCAGCATATAAATCACGACGGCGAAGTCAATAGAGCCCGTTATATGCCTCAAAACCCATTTATTATTGCCACAAAGACTGTCAGCGCTGAAGTCCTTGTTTTTGACTACAGTAAACACCCATCCAAACCACCTCTAGATGGTACATGTAATCCTGACTTGAGGTTGAGGGGTCACAAGTCCGAAGGTTATGGTTTATCATGGAACAAGTTCAAGCAGGGCCATTTACTTAGTGGCTCTGAGGATTCACATATTTGTTTATGGGACATTAATGCTACTCCGGATAACAAAACCCTCGAGGCTATGCAAATTTTTAAGGTGAGCTCCTCTTAAACAGTCCTTACAACCATCTAAATTGTTGTGTTTTTGTGCTTCTATGGCTTCATAATGTTTTGTTTGGATTTTGCCCAAGAACTTCTTATGAACATGTATTAGTGTATAAATATATAATTTGGATGACTGTAAATTATAATGATTGATTAGTACTTGTGTCCTGATTATTGAACTACAATGTAAAATAATATCCCTAGCTAACAACTTGTGAGAAATGATATATTTGTTATGTGCTGCTATTTTGGTTTTATCTATTTTGTAACGGTCTAAGTCCACCACTAGTAGATATTGTTCTCTTCGAATTTTCCCTTTCGATTTTTAAAACGCGTCTGCTAGGAAGAGGTTTTCACACTCTTATAAATAATGTTTTGTTCTCCTTCTTGACTGATGTGGGATCTCACATTCCACTCTTGTTCAGGGCTCAGTGTCCTCGTTGACATTCATTCCATTCTCCAATCGATGTGAGACCCCCCAATCCACCCCCTTCGAGGCCCAGCGTCCTTGCTGGCATATCACCTTGTGTCCGCCCCCTTCAGGGCTCAGCCTCCTCGTTGGCACATTGTCCGGTGTCTGGCTTTGATATCATTTGTAACAGCCCAAGCCCACCGCTAGCAGATATTGTCCTCTTTGGACTCTCTTTCGGGCTTCCCCTCAAGGCTTTTAAAATGCATATGCTAGGGAGAGGTTTCCACACTCATATAAAAAATGTTTCATTCTCCTCCCCAACCGATGTTGGATCTCACATATATGATCTGATATGATCTTTGTGACAGGGTCATGAAGGTGTTGTGGGAGACGTTGCCTGGCATATGGGGCATGAATACTTATTTGGTTCAGTCGGTGATGATCGATATCTACATGTATGGGATCTGCGAAGTCCTTCAGCTAATAAGCCTGTACAGTCTGTAGTTGCTCATCAAAGTGAGGTAGGTGCTACTGGAAGTTAAACAAAAATACATGCAAAATTAGTGGGTTGCAAATGAGGAAGAGCTCCTTTCGTGGTTACATGTGAAGATAAATCTTATGAGAGCTCTGAATGATCGTTCTCTAGCAACAAGCTTCATCCACGTGCTTATTGTAGTACGAAACTGTCTTTTCCTTTTCGGGTTGCCTGACCATTATTCACTGTGGTAATTAGTAATGCAAGAGATGCTGGTCTTTTGAGTTTAATAGAATTATTTCCCTCGTAATGTTGCCTCAGGTAAATTGCTTGACATTCAATCCCTTCAATGAGTGGTTAGTAGCCACAGGGTCAACTGATAAGATGGTTAAGTTGTTTGATCTACGTAAGATCAGCTCATCCCTCCATACCTTTGACTGTCACGAGTAAGCATCCTCCTTAGGTTATGAACTTCGATGATCTCGATTTTCCTTGCCAAGTGCTGGAAACAAACAGCACATGAGCTAGACATCTAGACACATTTTAAAACCTAGTTTGTAGCTGTTGTGCGCTTGTCTTTGAATGTCTATATTTATGTTTTGTGCAACAGGGAGGAGGTTTTTCAGGTGGGCTGGCATCCAAAGAACGAAACGATCTTAGCTTCTTGTTGTCGTGGAAGGAGACTCATGGTTTGGGACCTTAGCAGGTAATTTTATAAATGTAATGTAGTTGTAGTCACAGTTATGTTGTTTCAGTAATGATGCTTAAGAATATTGATTATGGAATAAGCTAGTGATTAACTAATTCTTGATATATTTTCTGAAAGTTTACTTGATACCGAACCAAATCAGAGCTACTCTTCATGTGGTTCTGACGTACGTTTACATGACGGGAGCTCGTTTGTGGGTTTTCATCTGCAGGATCGAGGAGGAGCAGACACCGGAGGACGTAGAAGATGGGCCACCCGAATTGCTGTTCATTCACGGTGGTCATACCAATACAATATCAGACTTCTCTTGGAATCCCTGTGAGGAGTGGGTCGTTGCTAGTGTAGCTGAAGATAACATACTACAAGTCTGGCAGATGGCTGAGAACGTCTACTATGGTGAAGATGATTTGCTTGAGGAACCTCCAAAGCTCTCTTAGTTCTTCATTCTCTGTATTTGAAGTAAGTACTTTAGGTCTTCAAGGGGTGTCAATCCTTGTTCCTCTTTTCTTTCTTAAATAAAAAGGGAAATAAAGAGGTAAAGAGGTGTTTTTGATGCTATGGGTTGTGGTGATTCACCCCACAATTGATAGAGAGCTTAGAAATAAGGGAGAAAAAAATTTGTACATTGATAGAGATTAGTAAAAAACTTGATATGCAAATCAAAACCATGAGTTTCATTTGGTTTTTTTTTAAAAAATATATATGCACATAGGATAGGAATGGTAAAGAAGTAGTCAT

mRNA sequence

AAAAGCTAACACGAAAATTTGAAGGGAGTTATAACTAAAGCACTACCTCTTCTTGAACACATATTGGCGGGAACTTCGCCGGGGAACTGAGCTCCCAACTTTGCTTCTAACCGACGAAATCGACCCTCAAACTCTGCAAATTTTGCTTCATGAAGAAGACAATGGAGAAGGGCGATGAAGAGATACGTGGAAAAACGAACGAGAGACTGGTAAATGAGGAGTATAAGATTTGGAAGAAGAATACTCCATTTCTTTACGATTTGATCATCACTCATGCCCTAGAGTGGCCTTCACTCACCGTCGAGTGGTTGCCGGACCGGGATGAGCCTCCGGGAAAGGATTACTCCGTTCAGAAGATGATTTTGGGGACTCATAGCTTCGATAATAAGCCGAATTACCTCATGGTTGCTCAGGTTCAGCTTTCGCTTGAGAATTCGGAGAACGATGCGCGACATTACCGTGACGATCGTGCCAGTGCGGGTGGCTTTGGGTGTGCGAACGGCAAGGTACAAATAATCCAGCATATAAATCACGACGGCGAAGTCAATAGAGCCCGTTATATGCCTCAAAACCCATTTATTATTGCCACAAAGACTGTCAGCGCTGAAGTCCTTGTTTTTGACTACAGTAAACACCCATCCAAACCACCTCTAGATGGTACATGTAATCCTGACTTGAGGTTGAGGGGTCACAAGTCCGAAGGTTATGGTTTATCATGGAACAAGTTCAAGCAGGGCCATTTACTTAGTGGCTCTGAGGATTCACATATTTGTTTATGGGACATTAATGCTACTCCGGATAACAAAACCCTCGAGGCTATGCAAATTTTTAAGGTAAATTGCTTGACATTCAATCCCTTCAATGAGTGGTTAGTAGCCACAGGGTCAACTGATAAGATGGTTAAGTTGTTTGATCTACGTAAGATCAGCTCATCCCTCCATACCTTTGACTGTCACGAGGAGGAGGTTTTTCAGGTGGGCTGGCATCCAAAGAACGAAACGATCTTAGCTTCTTGTTGTCGTGGAAGGAGACTCATGGTTTGGGACCTTAGCAGGATCGAGGAGGAGCAGACACCGGAGGACGTAGAAGATGGGCCACCCGAATTGCTGTTCATTCACGGTGGTCATACCAATACAATATCAGACTTCTCTTGGAATCCCTGTGAGGAGTGGGTCGTTGCTAGTGTAGCTGAAGATAACATACTACAAGTCTGGCAGATGGCTGAGAACGTCTACTATGGTGAAGATGATTTGCTTGAGGAACCTCCAAAGCTCTCTTAGTTCTTCATTCTCTGTATTTGAAGTAAGTACTTTAGGTCTTCAAGGGGTGTCAATCCTTGTTCCTCTTTTCTTTCTTAAATAAAAAGGGAAATAAAGAGGTAAAGAGGTGTTTTTGATGCTATGGGTTGTGGTGATTCACCCCACAATTGATAGAGAGCTTAGAAATAAGGGAGAAAAAAATTTGTACATTGATAGAGATTAGTAAAAAACTTGATATGCAAATCAAAACCATGAGTTTCATTTGGTTTTTTTTTAAAAAATATATATGCACATAGGATAGGAATGGTAAAGAAGTAGTCAT

Coding sequence (CDS)

ATGAAGAAGACAATGGAGAAGGGCGATGAAGAGATACGTGGAAAAACGAACGAGAGACTGGTAAATGAGGAGTATAAGATTTGGAAGAAGAATACTCCATTTCTTTACGATTTGATCATCACTCATGCCCTAGAGTGGCCTTCACTCACCGTCGAGTGGTTGCCGGACCGGGATGAGCCTCCGGGAAAGGATTACTCCGTTCAGAAGATGATTTTGGGGACTCATAGCTTCGATAATAAGCCGAATTACCTCATGGTTGCTCAGGTTCAGCTTTCGCTTGAGAATTCGGAGAACGATGCGCGACATTACCGTGACGATCGTGCCAGTGCGGGTGGCTTTGGGTGTGCGAACGGCAAGGTACAAATAATCCAGCATATAAATCACGACGGCGAAGTCAATAGAGCCCGTTATATGCCTCAAAACCCATTTATTATTGCCACAAAGACTGTCAGCGCTGAAGTCCTTGTTTTTGACTACAGTAAACACCCATCCAAACCACCTCTAGATGGTACATGTAATCCTGACTTGAGGTTGAGGGGTCACAAGTCCGAAGGTTATGGTTTATCATGGAACAAGTTCAAGCAGGGCCATTTACTTAGTGGCTCTGAGGATTCACATATTTGTTTATGGGACATTAATGCTACTCCGGATAACAAAACCCTCGAGGCTATGCAAATTTTTAAGGTAAATTGCTTGACATTCAATCCCTTCAATGAGTGGTTAGTAGCCACAGGGTCAACTGATAAGATGGTTAAGTTGTTTGATCTACGTAAGATCAGCTCATCCCTCCATACCTTTGACTGTCACGAGGAGGAGGTTTTTCAGGTGGGCTGGCATCCAAAGAACGAAACGATCTTAGCTTCTTGTTGTCGTGGAAGGAGACTCATGGTTTGGGACCTTAGCAGGATCGAGGAGGAGCAGACACCGGAGGACGTAGAAGATGGGCCACCCGAATTGCTGTTCATTCACGGTGGTCATACCAATACAATATCAGACTTCTCTTGGAATCCCTGTGAGGAGTGGGTCGTTGCTAGTGTAGCTGAAGATAACATACTACAAGTCTGGCAGATGGCTGAGAACGTCTACTATGGTGAAGATGATTTGCTTGAGGAACCTCCAAAGCTCTCTTAG

Protein sequence

MKKTMEKGDEEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMILGTHSFDNKPNYLMVAQVQLSLENSENDARHYRDDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIFKVNCLTFNPFNEWLVATGSTDKMVKLFDLRKISSSLHTFDCHEEEVFQVGWHPKNETILASCCRGRRLMVWDLSRIEEEQTPEDVEDGPPELLFIHGGHTNTISDFSWNPCEEWVVASVAEDNILQVWQMAENVYYGEDDLLEEPPKLS
Homology
BLAST of Cp4.1LG03g17440 vs. ExPASy Swiss-Prot
Match: O22466 (WD-40 repeat-containing protein MSI1 OS=Solanum lycopersicum OX=4081 GN=MSI1 PE=2 SV=1)

HSP 1 Score: 664.8 bits (1714), Expect = 5.6e-190
Identity = 312/415 (75.18%), Postives = 341/415 (82.17%), Query Frame = 0

Query: 5   MEKGDEEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKD 64
           M K ++E+RG+  ERL+NEEYKIWKKNTPFLYDL+ITHALEWPSLTVEWLPDR+EP GKD
Sbjct: 1   MGKDEDEMRGEIEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPSGKD 60

Query: 65  YSVQKMILGTHSFDNKPNYLMVAQVQLSLENSENDARHYRDDRASAGGFGCANGKVQIIQ 124
           YSVQKMILGTH+ +N+PNYLM+AQVQL LE++ENDARHY DDR+  GGFGCANGKVQIIQ
Sbjct: 61  YSVQKMILGTHTSENEPNYLMLAQVQLPLEDAENDARHYDDDRSEFGGFGCANGKVQIIQ 120

Query: 125 HINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCNPDLRLRGHKSE 184
            INHDGEVNRARYMPQNPFIIATKTVSAEV VFDYSKHPSKPPLDG CNPDLRLRGH +E
Sbjct: 121 QINHDGEVNRARYMPQNPFIIATKTVSAEVYVFDYSKHPSKPPLDGACNPDLRLRGHSTE 180

Query: 185 GYGLSWNKFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIFK---------------- 244
           GYGLSW++FKQGHLLSGS+DSHICLWDINATP NK LEAMQIFK                
Sbjct: 181 GYGLSWSQFKQGHLLSGSDDSHICLWDINATPKNKALEAMQIFKVHEGVVEDVAWHLRHE 240

Query: 245 -----------------------------------VNCLTFNPFNEWLVATGSTDKMVKL 304
                                              VNCL FNPFNEW+VATGSTDK VKL
Sbjct: 241 YLFGSVGDDQYLHVWDLRTPSVTKPIQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL 300

Query: 305 FDLRKISSSLHTFDCHEEEVFQVGWHPKNETILASCCRGRRLMVWDLSRIEEEQTPEDVE 364
           FDLRKIS++LHT DCH+EEVFQVGW+PKNETILASCC GRRLMVWDLSRI+EEQTPED E
Sbjct: 301 FDLRKISTALHTLDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAE 360

Query: 365 DGPPELLFIHGGHTNTISDFSWNPCEEWVVASVAEDNILQVWQMAENVYYGEDDL 369
           DGPPELLFIHGGHT+ ISDFSWNPCE+WVVASVAEDNILQ+WQMAEN+Y+ EDDL
Sbjct: 361 DGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQIWQMAENIYHDEDDL 415

BLAST of Cp4.1LG03g17440 vs. ExPASy Swiss-Prot
Match: O22467 (Histone-binding protein MSI1 OS=Arabidopsis thaliana OX=3702 GN=MSI1 PE=1 SV=1)

HSP 1 Score: 642.1 bits (1655), Expect = 3.9e-183
Identity = 303/424 (71.46%), Postives = 341/424 (80.42%), Query Frame = 0

Query: 5   MEKGDEEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKD 64
           M K +EE+RG+  ERL+NEEYKIWKKNTPFLYDL+ITHALEWPSLTVEWLPDR+EP GKD
Sbjct: 1   MGKDEEEMRGEIEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPSGKD 60

Query: 65  YSVQKMILGTHSFDNKPNYLMVAQVQLSLENSENDARHYRDDRASAGGFGCANGKVQIIQ 124
           YSVQKMILGTH+ +++PNYLM+AQVQL L+++E++AR Y DDR+  GGFGCA GKVQIIQ
Sbjct: 61  YSVQKMILGTHTSESEPNYLMLAQVQLPLDDTESEARQYDDDRSEFGGFGCATGKVQIIQ 120

Query: 125 HINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCNPDLRLRGHKSE 184
            INHDGEVNRARYMPQNPFIIATKTV+AEV VFDYSKHPSKPPLDG CNPDL+LRGH SE
Sbjct: 121 QINHDGEVNRARYMPQNPFIIATKTVNAEVYVFDYSKHPSKPPLDGACNPDLKLRGHSSE 180

Query: 185 GYGLSWNKFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIFK---------------- 244
           GYGLSW+KFKQGHLLSGS+D+ ICLWDINATP NK+L+A QIFK                
Sbjct: 181 GYGLSWSKFKQGHLLSGSDDAQICLWDINATPKNKSLDAQQIFKAHEGVVEDVAWHLRHE 240

Query: 245 -----------------------------------VNCLTFNPFNEWLVATGSTDKMVKL 304
                                              VNCL FNPFNEW+VATGSTDK VKL
Sbjct: 241 YLFGSVGDDQYLLIWDLRSPSASKPVQSVVAHSMEVNCLAFNPFNEWVVATGSTDKTVKL 300

Query: 305 FDLRKISSSLHTFDCHEEEVFQVGWHPKNETILASCCRGRRLMVWDLSRIEEEQTPEDVE 364
           FDLRK+S++LHTFD H+EEVFQVGW+PKNETILASCC GRRLMVWDLSRI+EEQT ED E
Sbjct: 301 FDLRKLSTALHTFDSHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTVEDAE 360

Query: 365 DGPPELLFIHGGHTNTISDFSWNPCEEWVVASVAEDNILQVWQMAENVYYGEDDLL-EEP 377
           DGPPELLFIHGGHT+ ISDFSWNPCE+WV++SVAEDNILQ+WQMAEN+Y+ EDD   EEP
Sbjct: 361 DGPPELLFIHGGHTSKISDFSWNPCEDWVISSVAEDNILQIWQMAENIYHDEDDAPGEEP 420

BLAST of Cp4.1LG03g17440 vs. ExPASy Swiss-Prot
Match: Q10G81 (Histone-binding protein MSI1 homolog OS=Oryza sativa subsp. japonica OX=39947 GN=MSI1 PE=2 SV=1)

HSP 1 Score: 630.2 bits (1624), Expect = 1.5e-179
Identity = 296/424 (69.81%), Postives = 331/424 (78.07%), Query Frame = 0

Query: 1   MKKTMEKGDEEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEP 60
           M K     +EE R +  ERL+NEEYKIWKKNTPFLYDL+ITHALEWPSLTV+WLPDR EP
Sbjct: 1   MPKATAAEEEEFRAEVEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVQWLPDRAEP 60

Query: 61  PGKDYSVQKMILGTHSFDNKPNYLMVAQVQLSLENSENDARHYRDDRASAGGFGCANGKV 120
            GKD+SVQKM+LGTH+ DN+PNYLM+AQVQL L+++E DARHY DD A  GGFG A+GKV
Sbjct: 61  AGKDHSVQKMVLGTHTSDNEPNYLMLAQVQLPLDDAEADARHYDDDHAEIGGFGAASGKV 120

Query: 121 QIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCNPDLRLRG 180
           QI+Q INHDGEVNRARYMPQN FIIATKTVSAEV VFDYSKHPSKPPLDG CNPDLRL+G
Sbjct: 121 QIVQQINHDGEVNRARYMPQNSFIIATKTVSAEVYVFDYSKHPSKPPLDGACNPDLRLKG 180

Query: 181 HKSEGYGLSWNKFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIFK------------ 240
           H SEGYGLSW+ FK+GHLLSGS+D+ ICLWDI A   NKTL+A+QIFK            
Sbjct: 181 HNSEGYGLSWSIFKEGHLLSGSDDAQICLWDIKANSKNKTLDALQIFKYHDGVVEDVAWH 240

Query: 241 ---------------------------------------VNCLTFNPFNEWLVATGSTDK 300
                                                  VNCL FNPFNEW+VATGSTDK
Sbjct: 241 LRHEYLFGSVGDDHNLLIWDLRSPVSTKPVQSVAAHQGEVNCLAFNPFNEWVVATGSTDK 300

Query: 301 MVKLFDLRKISSSLHTFDCHEEEVFQVGWHPKNETILASCCRGRRLMVWDLSRIEEEQTP 360
            VKLFDLRKI +SLHTFDCH+EEVFQVGW PKNETILASCC GRRLMVWDLSRI++EQTP
Sbjct: 301 TVKLFDLRKIDTSLHTFDCHKEEVFQVGWSPKNETILASCCLGRRLMVWDLSRIDQEQTP 360

Query: 361 EDVEDGPPELLFIHGGHTNTISDFSWNPCEEWVVASVAEDNILQVWQMAENVYYGEDDLL 374
           ED EDGPPELLFIHGGHT+ ISDFSWNPCE+WV+ASVAEDNILQ+WQMAEN+Y+ EDD+ 
Sbjct: 361 EDAEDGPPELLFIHGGHTSKISDFSWNPCEDWVIASVAEDNILQIWQMAENIYHDEDDVP 420

BLAST of Cp4.1LG03g17440 vs. ExPASy Swiss-Prot
Match: Q3MHL3 (Histone-binding protein RBBP4 OS=Bos taurus OX=9913 GN=RBBP4 PE=1 SV=3)

HSP 1 Score: 478.8 bits (1231), Expect = 5.7e-134
Identity = 227/401 (56.61%), Postives = 281/401 (70.07%), Query Frame = 0

Query: 18  ERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMILGTHSF 77
           ER++NEEYKIWKKNTPFLYDL++THALEWPSLT +WLPD   P GKD+S+ +++LGTH+ 
Sbjct: 14  ERVINEEYKIWKKNTPFLYDLVMTHALEWPSLTAQWLPDVTRPEGKDFSIHRLVLGTHTS 73

Query: 78  DNKPNYLMVAQVQLSLENSENDARHYRDDRASAGGFGCANGKVQIIQHINHDGEVNRARY 137
           D + N+L++A VQL  ++++ DA HY  ++   GGFG  +GK++I   INH+GEVNRARY
Sbjct: 74  DEQ-NHLVIASVQLPNDDAQFDASHYDSEKGEFGGFGSVSGKIEIEIKINHEGEVNRARY 133

Query: 138 MPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGH 197
           MPQNP IIATKT S++VLVFDY+KHPSKP   G CNPDLRLRGH+ EGYGLSWN    GH
Sbjct: 134 MPQNPCIIATKTPSSDVLVFDYTKHPSKPDPSGECNPDLRLRGHQKEGYGLSWNPNLSGH 193

Query: 198 LLSGSEDSHICLWDINATP-DNKTLEAMQIF----------------------------- 257
           LLS S+D  ICLWDI+A P + K ++A  IF                             
Sbjct: 194 LLSASDDHTICLWDISAVPKEGKVVDAKTIFTGHTAVVEDVSWHLLHESLFGSVADDQKL 253

Query: 258 ----------------------KVNCLTFNPFNEWLVATGSTDKMVKLFDLRKISSSLHT 317
                                 +VNCL+FNP++E+++ATGS DK V L+DLR +   LH+
Sbjct: 254 MIWDTRSNNTSKPSHSVDAHTAEVNCLSFNPYSEFILATGSADKTVALWDLRNLKLKLHS 313

Query: 318 FDCHEEEVFQVGWHPKNETILASCCRGRRLMVWDLSRIEEEQTPEDVEDGPPELLFIHGG 367
           F+ H++E+FQV W P NETILAS    RRL VWDLS+I EEQ+PED EDGPPELLFIHGG
Sbjct: 314 FESHKDEIFQVQWSPHNETILASSGTDRRLNVWDLSKIGEEQSPEDAEDGPPELLFIHGG 373

BLAST of Cp4.1LG03g17440 vs. ExPASy Swiss-Prot
Match: Q09028 (Histone-binding protein RBBP4 OS=Homo sapiens OX=9606 GN=RBBP4 PE=1 SV=3)

HSP 1 Score: 478.8 bits (1231), Expect = 5.7e-134
Identity = 227/401 (56.61%), Postives = 281/401 (70.07%), Query Frame = 0

Query: 18  ERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMILGTHSF 77
           ER++NEEYKIWKKNTPFLYDL++THALEWPSLT +WLPD   P GKD+S+ +++LGTH+ 
Sbjct: 14  ERVINEEYKIWKKNTPFLYDLVMTHALEWPSLTAQWLPDVTRPEGKDFSIHRLVLGTHTS 73

Query: 78  DNKPNYLMVAQVQLSLENSENDARHYRDDRASAGGFGCANGKVQIIQHINHDGEVNRARY 137
           D + N+L++A VQL  ++++ DA HY  ++   GGFG  +GK++I   INH+GEVNRARY
Sbjct: 74  DEQ-NHLVIASVQLPNDDAQFDASHYDSEKGEFGGFGSVSGKIEIEIKINHEGEVNRARY 133

Query: 138 MPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCNPDLRLRGHKSEGYGLSWNKFKQGH 197
           MPQNP IIATKT S++VLVFDY+KHPSKP   G CNPDLRLRGH+ EGYGLSWN    GH
Sbjct: 134 MPQNPCIIATKTPSSDVLVFDYTKHPSKPDPSGECNPDLRLRGHQKEGYGLSWNPNLSGH 193

Query: 198 LLSGSEDSHICLWDINATP-DNKTLEAMQIF----------------------------- 257
           LLS S+D  ICLWDI+A P + K ++A  IF                             
Sbjct: 194 LLSASDDHTICLWDISAVPKEGKVVDAKTIFTGHTAVVEDVSWHLLHESLFGSVADDQKL 253

Query: 258 ----------------------KVNCLTFNPFNEWLVATGSTDKMVKLFDLRKISSSLHT 317
                                 +VNCL+FNP++E+++ATGS DK V L+DLR +   LH+
Sbjct: 254 MIWDTRSNNTSKPSHSVDAHTAEVNCLSFNPYSEFILATGSADKTVALWDLRNLKLKLHS 313

Query: 318 FDCHEEEVFQVGWHPKNETILASCCRGRRLMVWDLSRIEEEQTPEDVEDGPPELLFIHGG 367
           F+ H++E+FQV W P NETILAS    RRL VWDLS+I EEQ+PED EDGPPELLFIHGG
Sbjct: 314 FESHKDEIFQVQWSPHNETILASSGTDRRLNVWDLSKIGEEQSPEDAEDGPPELLFIHGG 373

BLAST of Cp4.1LG03g17440 vs. NCBI nr
Match: XP_023528376.1 (WD-40 repeat-containing protein MSI1-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 772 bits (1994), Expect = 4.02e-281
Identity = 376/427 (88.06%), Postives = 376/427 (88.06%), Query Frame = 0

Query: 1   MKKTMEKGDEEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEP 60
           MKKTMEKGDEEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEP
Sbjct: 1   MKKTMEKGDEEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEP 60

Query: 61  PGKDYSVQKMILGTHSFDNKPNYLMVAQVQLSLENSENDARHYRDDRASAGGFGCANGKV 120
           PGKDYSVQKMILGTHSFDNKPNYLMVAQVQLSLENSENDARHYRDDRASAGGFGCANGKV
Sbjct: 61  PGKDYSVQKMILGTHSFDNKPNYLMVAQVQLSLENSENDARHYRDDRASAGGFGCANGKV 120

Query: 121 QIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCNPDLRLRG 180
           QIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCNPDLRLRG
Sbjct: 121 QIIQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCNPDLRLRG 180

Query: 181 HKSEGYGLSWNKFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIFK------------ 240
           HKSEGYGLSWNKFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIFK            
Sbjct: 181 HKSEGYGLSWNKFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWH 240

Query: 241 ---------------------------------------VNCLTFNPFNEWLVATGSTDK 300
                                                  VNCLTFNPFNEWLVATGSTDK
Sbjct: 241 MGHEYLFGSVGDDRYLHVWDLRSPSANKPVQSVVAHQSEVNCLTFNPFNEWLVATGSTDK 300

Query: 301 MVKLFDLRKISSSLHTFDCHEEEVFQVGWHPKNETILASCCRGRRLMVWDLSRIEEEQTP 360
           MVKLFDLRKISSSLHTFDCHEEEVFQVGWHPKNETILASCCRGRRLMVWDLSRIEEEQTP
Sbjct: 301 MVKLFDLRKISSSLHTFDCHEEEVFQVGWHPKNETILASCCRGRRLMVWDLSRIEEEQTP 360

Query: 361 EDVEDGPPELLFIHGGHTNTISDFSWNPCEEWVVASVAEDNILQVWQMAENVYYGEDDLL 376
           EDVEDGPPELLFIHGGHTNTISDFSWNPCEEWVVASVAEDNILQVWQMAENVYYGEDDLL
Sbjct: 361 EDVEDGPPELLFIHGGHTNTISDFSWNPCEEWVVASVAEDNILQVWQMAENVYYGEDDLL 420

BLAST of Cp4.1LG03g17440 vs. NCBI nr
Match: KAG7018909.1 (WD-40 repeat-containing protein MSI1, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 759 bits (1961), Expect = 2.95e-276
Identity = 368/417 (88.25%), Postives = 370/417 (88.73%), Query Frame = 0

Query: 5   MEKGDEEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKD 64
           MEKGD+EIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKD
Sbjct: 1   MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKD 60

Query: 65  YSVQKMILGTHSFDNKPNYLMVAQVQLSLENSENDARHYRDDRASAGGFGCANGKVQIIQ 124
           YSVQKMILGTHSFDNKPNYLM+AQVQL LENSENDARHYRDDRASAGGFGCANGKVQIIQ
Sbjct: 61  YSVQKMILGTHSFDNKPNYLMLAQVQLPLENSENDARHYRDDRASAGGFGCANGKVQIIQ 120

Query: 125 HINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCNPDLRLRGHKSE 184
           HINHDGEVNRAR MPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCNPDLRLRGHKSE
Sbjct: 121 HINHDGEVNRARCMPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCNPDLRLRGHKSE 180

Query: 185 GYGLSWNKFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIFK---------------- 244
           GYGLSWNKFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIFK                
Sbjct: 181 GYGLSWNKFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIFKGHEGVVGDVAWHMRHE 240

Query: 245 -----------------------------VNCLTFNPFNEWLVATGSTDKMVKLFDLRKI 304
                                        VNCLTFNPFNEWLVATGSTDKMVKLFDLRKI
Sbjct: 241 YLFGSVGDDRYLHGDVACCIVRHEYYILVVNCLTFNPFNEWLVATGSTDKMVKLFDLRKI 300

Query: 305 SSSLHTFDCHEEEVFQVGWHPKNETILASCCRGRRLMVWDLSRIEEEQTPEDVEDGPPEL 364
           SSSLHTFDCHEEEVFQVGWHPKNETILASCCRGRRLMVWDLSRIEEEQTPEDVEDGPPEL
Sbjct: 301 SSSLHTFDCHEEEVFQVGWHPKNETILASCCRGRRLMVWDLSRIEEEQTPEDVEDGPPEL 360

Query: 365 LFIHGGHTNTISDFSWNPCEEWVVASVAEDNILQVWQMAENVYYGEDDLLEEPPKLS 376
           LFIHGGHTNTISDFSWNPCEEWVVASVAEDNILQVWQMAENVYYGEDDLLEEPPKLS
Sbjct: 361 LFIHGGHTNTISDFSWNPCEEWVVASVAEDNILQVWQMAENVYYGEDDLLEEPPKLS 417

BLAST of Cp4.1LG03g17440 vs. NCBI nr
Match: XP_022980301.1 (WD-40 repeat-containing protein MSI1-like [Cucurbita maxima])

HSP 1 Score: 737 bits (1902), Expect = 3.63e-267
Identity = 358/423 (84.63%), Postives = 363/423 (85.82%), Query Frame = 0

Query: 5   MEKGDEEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKD 64
           MEKGD+EIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKD
Sbjct: 1   MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKD 60

Query: 65  YSVQKMILGTHSFDNKPNYLMVAQVQLSLENSENDARHYRDDRASAGGFGCANGKVQIIQ 124
           YSVQKMILGTHSFDNKPNYLM+AQVQL LE+SENDARHYRDDRAS GGFGCANGKVQIIQ
Sbjct: 61  YSVQKMILGTHSFDNKPNYLMLAQVQLPLESSENDARHYRDDRASGGGFGCANGKVQIIQ 120

Query: 125 HINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCNPDLRLRGHKSE 184
           HINHDGEVNRARYMPQNPFIIATKTVSAEV VFDYSKHPS PPLDG CNPDLRLRGHKSE
Sbjct: 121 HINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSIPPLDGKCNPDLRLRGHKSE 180

Query: 185 GYGLSWNKFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIFK---------------- 244
           GYGLSW+KFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIFK                
Sbjct: 181 GYGLSWSKFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIFKGHEGVVGDIAWHMRHE 240

Query: 245 -----------------------------------VNCLTFNPFNEWLVATGSTDKMVKL 304
                                              V CLTFNPFNEWLVATGSTDKMVKL
Sbjct: 241 YLFGSVGEDRYLHVWDLRSPSANKPVQSVVAHQSEVICLTFNPFNEWLVATGSTDKMVKL 300

Query: 305 FDLRKISSSLHTFDCHEEEVFQVGWHPKNETILASCCRGRRLMVWDLSRIEEEQTPEDVE 364
           FDLR ISSSLHTFDCH+EEVFQVGWHPKNE ILASCCRGRRLMVWDLSRIEEEQTPEDVE
Sbjct: 301 FDLRNISSSLHTFDCHKEEVFQVGWHPKNEMILASCCRGRRLMVWDLSRIEEEQTPEDVE 360

Query: 365 DGPPELLFIHGGHTNTISDFSWNPCEEWVVASVAEDNILQVWQMAENVYYGEDDLLEEPP 376
           DGPPELLFIHGGHTN ISDFSWNPCEEWVVASVAEDNILQVWQMAENVYYGEDDLLEEPP
Sbjct: 361 DGPPELLFIHGGHTNIISDFSWNPCEEWVVASVAEDNILQVWQMAENVYYGEDDLLEEPP 420

BLAST of Cp4.1LG03g17440 vs. NCBI nr
Match: KAG6582525.1 (WD-40 repeat-containing protein MSI1, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 752 bits (1942), Expect = 1.82e-265
Identity = 370/447 (82.77%), Postives = 372/447 (83.22%), Query Frame = 0

Query: 3   KTMEKGDEEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPG 62
           KTMEKGD+EIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPG
Sbjct: 486 KTMEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPG 545

Query: 63  KDYSVQKMILGTHSFDNKPNYLMVAQVQLSLENSENDARHYRDDRASAGGFGCANGKVQI 122
           KDYSVQKMILGTHSFDNKPNYLM+AQVQL LENSENDARHYRDDRASAGGFGCANGKVQI
Sbjct: 546 KDYSVQKMILGTHSFDNKPNYLMLAQVQLPLENSENDARHYRDDRASAGGFGCANGKVQI 605

Query: 123 IQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCNPDLRLRGHK 182
           IQHINHDGEVNRAR MPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCNPDLRLRGHK
Sbjct: 606 IQHINHDGEVNRARCMPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCNPDLRLRGHK 665

Query: 183 SEGYGLSWNKFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIFK-------------- 242
           SEGYGLSWNKFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIFK              
Sbjct: 666 SEGYGLSWNKFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIFKSRRCQLRNVLYGFC 725

Query: 243 -----------------------------------------------------------V 302
                                                                      V
Sbjct: 726 PRTSYEHGHEGVVGDVAWHMRHEYLFGSVGDDRYLHVWDLRSRPLANKPVQSVVAHQSEV 785

Query: 303 NCLTFNPFNEWLVATGSTDKMVKLFDLRKISSSLHTFDCHEEEVFQVGWHPKNETILASC 362
           NCLTFNPFNEWLVATGSTDKMVKLFDLRKISSSLHTFDCHEEEVFQVGWHPKNETILASC
Sbjct: 786 NCLTFNPFNEWLVATGSTDKMVKLFDLRKISSSLHTFDCHEEEVFQVGWHPKNETILASC 845

Query: 363 CRGRRLMVWDLSRIEEEQTPEDVEDGPPELLFIHGGHTNTISDFSWNPCEEWVVASVAED 376
           CRGRRLMVWDLSRIEEEQTPEDVEDGPPELLFIHGGHTNTISDFSWNPCEEWVVASVAED
Sbjct: 846 CRGRRLMVWDLSRIEEEQTPEDVEDGPPELLFIHGGHTNTISDFSWNPCEEWVVASVAED 905

BLAST of Cp4.1LG03g17440 vs. NCBI nr
Match: XP_022937662.1 (WD-40 repeat-containing protein MSI1 [Cucurbita moschata] >XP_022974971.1 WD-40 repeat-containing protein MSI1 [Cucurbita maxima] >XP_023540574.1 WD-40 repeat-containing protein MSI1 [Cucurbita pepo subsp. pepo] >KAG6597236.1 WD-40 repeat-containing protein MSI1, partial [Cucurbita argyrosperma subsp. sororia] >KAG7028707.1 WD-40 repeat-containing protein MSI1 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 684 bits (1766), Expect = 1.90e-246
Identity = 327/423 (77.30%), Postives = 351/423 (82.98%), Query Frame = 0

Query: 5   MEKGDEEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKD 64
           M K DEE+RG+  ERL+NEEYKIWKKNTPFLYDL+ITHALEWPSLTVEWLPDR+EPPGKD
Sbjct: 1   MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKD 60

Query: 65  YSVQKMILGTHSFDNKPNYLMVAQVQLSLENSENDARHYRDDRASAGGFGCANGKVQIIQ 124
           YSVQKMILGTH+ +N+PNYLM+AQVQL LE+SENDARHY DDRA AGGFGCANGKVQIIQ
Sbjct: 61  YSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYDDDRADAGGFGCANGKVQIIQ 120

Query: 125 HINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCNPDLRLRGHKSE 184
            INHDGEVNRARYMPQNPFIIATKTVSAEV VFDYSKHPSKPPLDGTCNPDLRLRGH +E
Sbjct: 121 QINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTE 180

Query: 185 GYGLSWNKFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIFKV--------------- 244
           GYGLSW+KFKQGHLLSGS+D+ ICLWDINATP NKTLEAMQIFKV               
Sbjct: 181 GYGLSWSKFKQGHLLSGSDDAQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHE 240

Query: 245 ------------------------------------NCLTFNPFNEWLVATGSTDKMVKL 304
                                               NCL FNPFNEW+VATGSTDK VKL
Sbjct: 241 YLFGSVGDDQYLLIWDLRTPTVNKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL 300

Query: 305 FDLRKISSSLHTFDCHEEEVFQVGWHPKNETILASCCRGRRLMVWDLSRIEEEQTPEDVE 364
           FDLRKISS+LHTFDCH+EEVFQVGW+PKNETILASCC GRRLMVWDLSRI+EEQTPED E
Sbjct: 301 FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAE 360

Query: 365 DGPPELLFIHGGHTNTISDFSWNPCEEWVVASVAEDNILQVWQMAENVYYGEDDLLEEPP 376
           DGPPELLFIHGGHT+ ISDFSWNPCE+WVVASVAEDNILQ+WQMAEN+Y+ EDDL EEPP
Sbjct: 361 DGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQIWQMAENIYHDEDDLPEEPP 420

BLAST of Cp4.1LG03g17440 vs. ExPASy TrEMBL
Match: A0A6J1IT85 (WD-40 repeat-containing protein MSI1-like OS=Cucurbita maxima OX=3661 GN=LOC111479717 PE=3 SV=1)

HSP 1 Score: 737 bits (1902), Expect = 1.76e-267
Identity = 358/423 (84.63%), Postives = 363/423 (85.82%), Query Frame = 0

Query: 5   MEKGDEEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKD 64
           MEKGD+EIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKD
Sbjct: 1   MEKGDKEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKD 60

Query: 65  YSVQKMILGTHSFDNKPNYLMVAQVQLSLENSENDARHYRDDRASAGGFGCANGKVQIIQ 124
           YSVQKMILGTHSFDNKPNYLM+AQVQL LE+SENDARHYRDDRAS GGFGCANGKVQIIQ
Sbjct: 61  YSVQKMILGTHSFDNKPNYLMLAQVQLPLESSENDARHYRDDRASGGGFGCANGKVQIIQ 120

Query: 125 HINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCNPDLRLRGHKSE 184
           HINHDGEVNRARYMPQNPFIIATKTVSAEV VFDYSKHPS PPLDG CNPDLRLRGHKSE
Sbjct: 121 HINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSIPPLDGKCNPDLRLRGHKSE 180

Query: 185 GYGLSWNKFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIFK---------------- 244
           GYGLSW+KFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIFK                
Sbjct: 181 GYGLSWSKFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIFKGHEGVVGDIAWHMRHE 240

Query: 245 -----------------------------------VNCLTFNPFNEWLVATGSTDKMVKL 304
                                              V CLTFNPFNEWLVATGSTDKMVKL
Sbjct: 241 YLFGSVGEDRYLHVWDLRSPSANKPVQSVVAHQSEVICLTFNPFNEWLVATGSTDKMVKL 300

Query: 305 FDLRKISSSLHTFDCHEEEVFQVGWHPKNETILASCCRGRRLMVWDLSRIEEEQTPEDVE 364
           FDLR ISSSLHTFDCH+EEVFQVGWHPKNE ILASCCRGRRLMVWDLSRIEEEQTPEDVE
Sbjct: 301 FDLRNISSSLHTFDCHKEEVFQVGWHPKNEMILASCCRGRRLMVWDLSRIEEEQTPEDVE 360

Query: 365 DGPPELLFIHGGHTNTISDFSWNPCEEWVVASVAEDNILQVWQMAENVYYGEDDLLEEPP 376
           DGPPELLFIHGGHTN ISDFSWNPCEEWVVASVAEDNILQVWQMAENVYYGEDDLLEEPP
Sbjct: 361 DGPPELLFIHGGHTNIISDFSWNPCEEWVVASVAEDNILQVWQMAENVYYGEDDLLEEPP 420

BLAST of Cp4.1LG03g17440 vs. ExPASy TrEMBL
Match: A0A6J1ICW4 (WD-40 repeat-containing protein MSI1 OS=Cucurbita maxima OX=3661 GN=LOC111473820 PE=3 SV=1)

HSP 1 Score: 684 bits (1766), Expect = 9.21e-247
Identity = 327/423 (77.30%), Postives = 351/423 (82.98%), Query Frame = 0

Query: 5   MEKGDEEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKD 64
           M K DEE+RG+  ERL+NEEYKIWKKNTPFLYDL+ITHALEWPSLTVEWLPDR+EPPGKD
Sbjct: 1   MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKD 60

Query: 65  YSVQKMILGTHSFDNKPNYLMVAQVQLSLENSENDARHYRDDRASAGGFGCANGKVQIIQ 124
           YSVQKMILGTH+ +N+PNYLM+AQVQL LE+SENDARHY DDRA AGGFGCANGKVQIIQ
Sbjct: 61  YSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYDDDRADAGGFGCANGKVQIIQ 120

Query: 125 HINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCNPDLRLRGHKSE 184
            INHDGEVNRARYMPQNPFIIATKTVSAEV VFDYSKHPSKPPLDGTCNPDLRLRGH +E
Sbjct: 121 QINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTE 180

Query: 185 GYGLSWNKFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIFKV--------------- 244
           GYGLSW+KFKQGHLLSGS+D+ ICLWDINATP NKTLEAMQIFKV               
Sbjct: 181 GYGLSWSKFKQGHLLSGSDDAQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHE 240

Query: 245 ------------------------------------NCLTFNPFNEWLVATGSTDKMVKL 304
                                               NCL FNPFNEW+VATGSTDK VKL
Sbjct: 241 YLFGSVGDDQYLLIWDLRTPTVNKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL 300

Query: 305 FDLRKISSSLHTFDCHEEEVFQVGWHPKNETILASCCRGRRLMVWDLSRIEEEQTPEDVE 364
           FDLRKISS+LHTFDCH+EEVFQVGW+PKNETILASCC GRRLMVWDLSRI+EEQTPED E
Sbjct: 301 FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAE 360

Query: 365 DGPPELLFIHGGHTNTISDFSWNPCEEWVVASVAEDNILQVWQMAENVYYGEDDLLEEPP 376
           DGPPELLFIHGGHT+ ISDFSWNPCE+WVVASVAEDNILQ+WQMAEN+Y+ EDDL EEPP
Sbjct: 361 DGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQIWQMAENIYHDEDDLPEEPP 420

BLAST of Cp4.1LG03g17440 vs. ExPASy TrEMBL
Match: A0A6J1FBV1 (WD-40 repeat-containing protein MSI1 OS=Cucurbita moschata OX=3662 GN=LOC111443997 PE=3 SV=1)

HSP 1 Score: 684 bits (1766), Expect = 9.21e-247
Identity = 327/423 (77.30%), Postives = 351/423 (82.98%), Query Frame = 0

Query: 5   MEKGDEEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKD 64
           M K DEE+RG+  ERL+NEEYKIWKKNTPFLYDL+ITHALEWPSLTVEWLPDR+EPPGKD
Sbjct: 1   MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKD 60

Query: 65  YSVQKMILGTHSFDNKPNYLMVAQVQLSLENSENDARHYRDDRASAGGFGCANGKVQIIQ 124
           YSVQKMILGTH+ +N+PNYLM+AQVQL LE+SENDARHY DDRA AGGFGCANGKVQIIQ
Sbjct: 61  YSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYDDDRADAGGFGCANGKVQIIQ 120

Query: 125 HINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCNPDLRLRGHKSE 184
            INHDGEVNRARYMPQNPFIIATKTVSAEV VFDYSKHPSKPPLDGTCNPDLRLRGH +E
Sbjct: 121 QINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTE 180

Query: 185 GYGLSWNKFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIFKV--------------- 244
           GYGLSW+KFKQGHLLSGS+D+ ICLWDINATP NKTLEAMQIFKV               
Sbjct: 181 GYGLSWSKFKQGHLLSGSDDAQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHE 240

Query: 245 ------------------------------------NCLTFNPFNEWLVATGSTDKMVKL 304
                                               NCL FNPFNEW+VATGSTDK VKL
Sbjct: 241 YLFGSVGDDQYLLIWDLRTPTVNKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL 300

Query: 305 FDLRKISSSLHTFDCHEEEVFQVGWHPKNETILASCCRGRRLMVWDLSRIEEEQTPEDVE 364
           FDLRKISS+LHTFDCH+EEVFQVGW+PKNETILASCC GRRLMVWDLSRI+EEQTPED E
Sbjct: 301 FDLRKISSALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAE 360

Query: 365 DGPPELLFIHGGHTNTISDFSWNPCEEWVVASVAEDNILQVWQMAENVYYGEDDLLEEPP 376
           DGPPELLFIHGGHT+ ISDFSWNPCE+WVVASVAEDNILQ+WQMAEN+Y+ EDDL EEPP
Sbjct: 361 DGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQIWQMAENIYHDEDDLPEEPP 420

BLAST of Cp4.1LG03g17440 vs. ExPASy TrEMBL
Match: A0A1S3AWJ3 (WD-40 repeat-containing protein MSI1 OS=Cucumis melo OX=3656 GN=LOC103483412 PE=3 SV=1)

HSP 1 Score: 683 bits (1763), Expect = 2.64e-246
Identity = 326/423 (77.07%), Postives = 351/423 (82.98%), Query Frame = 0

Query: 5   MEKGDEEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKD 64
           M K DEE+RG+  ERL+NEEYKIWKKNTPFLYDL+ITHALEWPSLTVEWLPDR+EPPGKD
Sbjct: 1   MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKD 60

Query: 65  YSVQKMILGTHSFDNKPNYLMVAQVQLSLENSENDARHYRDDRASAGGFGCANGKVQIIQ 124
           YSVQKMILGTH+ +N+PNYLM+AQVQL LE+SENDARHY DDRA AGGFGCANGKVQIIQ
Sbjct: 61  YSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYDDDRADAGGFGCANGKVQIIQ 120

Query: 125 HINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCNPDLRLRGHKSE 184
            INHDGEVNRARYMPQNPFIIATKTVSAEV VFDYSKHPSKPPLDGTCNPDLRLRGH +E
Sbjct: 121 QINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTE 180

Query: 185 GYGLSWNKFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIFKV--------------- 244
           GYGLSW+KFKQGHLLSGS+D+ ICLWDINATP NKTLEAMQIFKV               
Sbjct: 181 GYGLSWSKFKQGHLLSGSDDAQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHE 240

Query: 245 ------------------------------------NCLTFNPFNEWLVATGSTDKMVKL 304
                                               NCL FNPFNEW+VATGSTDK VKL
Sbjct: 241 YLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL 300

Query: 305 FDLRKISSSLHTFDCHEEEVFQVGWHPKNETILASCCRGRRLMVWDLSRIEEEQTPEDVE 364
           FDLRKIS++LHTFDCH+EEVFQVGW+PKNETILASCC GRRLMVWDLSRI+EEQTPED E
Sbjct: 301 FDLRKISTALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAE 360

Query: 365 DGPPELLFIHGGHTNTISDFSWNPCEEWVVASVAEDNILQVWQMAENVYYGEDDLLEEPP 376
           DGPPELLFIHGGHT+ ISDFSWNPCE+WVVASVAEDNILQ+WQMAEN+Y+ EDDL EEPP
Sbjct: 361 DGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQIWQMAENIYHDEDDLPEEPP 420

BLAST of Cp4.1LG03g17440 vs. ExPASy TrEMBL
Match: A0A5A7U4A1 (WD-40 repeat-containing protein MSI1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold434G003420 PE=3 SV=1)

HSP 1 Score: 683 bits (1763), Expect = 2.64e-246
Identity = 326/423 (77.07%), Postives = 351/423 (82.98%), Query Frame = 0

Query: 5   MEKGDEEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKD 64
           M K DEE+RG+  ERL+NEEYKIWKKNTPFLYDL+ITHALEWPSLTVEWLPDR+EPPGKD
Sbjct: 1   MGKDDEEMRGEMEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPPGKD 60

Query: 65  YSVQKMILGTHSFDNKPNYLMVAQVQLSLENSENDARHYRDDRASAGGFGCANGKVQIIQ 124
           YSVQKMILGTH+ +N+PNYLM+AQVQL LE+SENDARHY DDRA AGGFGCANGKVQIIQ
Sbjct: 61  YSVQKMILGTHTSENEPNYLMLAQVQLPLEDSENDARHYDDDRADAGGFGCANGKVQIIQ 120

Query: 125 HINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCNPDLRLRGHKSE 184
            INHDGEVNRARYMPQNPFIIATKTVSAEV VFDYSKHPSKPPLDGTCNPDLRLRGH +E
Sbjct: 121 QINHDGEVNRARYMPQNPFIIATKTVSAEVFVFDYSKHPSKPPLDGTCNPDLRLRGHNTE 180

Query: 185 GYGLSWNKFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIFKV--------------- 244
           GYGLSW+KFKQGHLLSGS+D+ ICLWDINATP NKTLEAMQIFKV               
Sbjct: 181 GYGLSWSKFKQGHLLSGSDDAQICLWDINATPKNKTLEAMQIFKVHEGVVEDVAWHLRHE 240

Query: 245 ------------------------------------NCLTFNPFNEWLVATGSTDKMVKL 304
                                               NCL FNPFNEW+VATGSTDK VKL
Sbjct: 241 YLFGSVGDDQYLLVWDLRTPSANKPVQSVVAHQSEVNCLAFNPFNEWVVATGSTDKTVKL 300

Query: 305 FDLRKISSSLHTFDCHEEEVFQVGWHPKNETILASCCRGRRLMVWDLSRIEEEQTPEDVE 364
           FDLRKIS++LHTFDCH+EEVFQVGW+PKNETILASCC GRRLMVWDLSRI+EEQTPED E
Sbjct: 301 FDLRKISTALHTFDCHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTPEDAE 360

Query: 365 DGPPELLFIHGGHTNTISDFSWNPCEEWVVASVAEDNILQVWQMAENVYYGEDDLLEEPP 376
           DGPPELLFIHGGHT+ ISDFSWNPCE+WVVASVAEDNILQ+WQMAEN+Y+ EDDL EEPP
Sbjct: 361 DGPPELLFIHGGHTSKISDFSWNPCEDWVVASVAEDNILQIWQMAENIYHDEDDLPEEPP 420

BLAST of Cp4.1LG03g17440 vs. TAIR 10
Match: AT5G58230.1 (Transducin/WD40 repeat-like superfamily protein )

HSP 1 Score: 642.1 bits (1655), Expect = 2.8e-184
Identity = 303/424 (71.46%), Postives = 341/424 (80.42%), Query Frame = 0

Query: 5   MEKGDEEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKD 64
           M K +EE+RG+  ERL+NEEYKIWKKNTPFLYDL+ITHALEWPSLTVEWLPDR+EP GKD
Sbjct: 1   MGKDEEEMRGEIEERLINEEYKIWKKNTPFLYDLVITHALEWPSLTVEWLPDREEPSGKD 60

Query: 65  YSVQKMILGTHSFDNKPNYLMVAQVQLSLENSENDARHYRDDRASAGGFGCANGKVQIIQ 124
           YSVQKMILGTH+ +++PNYLM+AQVQL L+++E++AR Y DDR+  GGFGCA GKVQIIQ
Sbjct: 61  YSVQKMILGTHTSESEPNYLMLAQVQLPLDDTESEARQYDDDRSEFGGFGCATGKVQIIQ 120

Query: 125 HINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCNPDLRLRGHKSE 184
            INHDGEVNRARYMPQNPFIIATKTV+AEV VFDYSKHPSKPPLDG CNPDL+LRGH SE
Sbjct: 121 QINHDGEVNRARYMPQNPFIIATKTVNAEVYVFDYSKHPSKPPLDGACNPDLKLRGHSSE 180

Query: 185 GYGLSWNKFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIFK---------------- 244
           GYGLSW+KFKQGHLLSGS+D+ ICLWDINATP NK+L+A QIFK                
Sbjct: 181 GYGLSWSKFKQGHLLSGSDDAQICLWDINATPKNKSLDAQQIFKAHEGVVEDVAWHLRHE 240

Query: 245 -----------------------------------VNCLTFNPFNEWLVATGSTDKMVKL 304
                                              VNCL FNPFNEW+VATGSTDK VKL
Sbjct: 241 YLFGSVGDDQYLLIWDLRSPSASKPVQSVVAHSMEVNCLAFNPFNEWVVATGSTDKTVKL 300

Query: 305 FDLRKISSSLHTFDCHEEEVFQVGWHPKNETILASCCRGRRLMVWDLSRIEEEQTPEDVE 364
           FDLRK+S++LHTFD H+EEVFQVGW+PKNETILASCC GRRLMVWDLSRI+EEQT ED E
Sbjct: 301 FDLRKLSTALHTFDSHKEEVFQVGWNPKNETILASCCLGRRLMVWDLSRIDEEQTVEDAE 360

Query: 365 DGPPELLFIHGGHTNTISDFSWNPCEEWVVASVAEDNILQVWQMAENVYYGEDDLL-EEP 377
           DGPPELLFIHGGHT+ ISDFSWNPCE+WV++SVAEDNILQ+WQMAEN+Y+ EDD   EEP
Sbjct: 361 DGPPELLFIHGGHTSKISDFSWNPCEDWVISSVAEDNILQIWQMAENIYHDEDDAPGEEP 420

BLAST of Cp4.1LG03g17440 vs. TAIR 10
Match: AT2G16780.1 (Transducin family protein / WD-40 repeat family protein )

HSP 1 Score: 381.7 bits (979), Expect = 6.7e-106
Identity = 200/415 (48.19%), Postives = 249/415 (60.00%), Query Frame = 0

Query: 10  EEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKD--YSV 69
           +E + +T    V E++ +WKKNTPFLYDL+I+H LEWPSLTV W+P    P   D  + V
Sbjct: 3   DEGKEETGMGQVEEDFSVWKKNTPFLYDLLISHPLEWPSLTVHWVPSTPNPYVADSYFGV 62

Query: 70  QKMILGTHSFDNKPNYLMVAQVQLSLENSENDARHYRDDRASAGGFGCAN-----GKVQI 129
            K+ILGTH+  +  ++LMVA V     N+E              G G AN      KV+I
Sbjct: 63  HKLILGTHTSGSAQDFLMVADVVTPTPNAE-------------PGIGGANQDPFIPKVEI 122

Query: 130 IQHINHDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCNPDLRLRGHK 189
            Q I  DGEVNRAR MPQ P ++  KT   EV +FDY+KH +K      C+PDLRL GH 
Sbjct: 123 RQRIRVDGEVNRARCMPQKPTLVGAKTSGCEVFLFDYAKHAAKSQ-TSECDPDLRLVGHD 182

Query: 190 SEGYGLSWNKFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIF--------------- 249
            EGYGLSW+ FK+G+LLSGS+D  ICLWD++ATP +K L AM ++               
Sbjct: 183 KEGYGLSWSPFKEGYLLSGSQDQKICLWDVSATPQDKVLNAMFVYEGHESAIADVSWHMK 242

Query: 250 ---------------------------------KVNCLTFNPFNEWLVATGSTDKMVKLF 309
                                            +VN L+FNPFNEW++AT S+D  V LF
Sbjct: 243 NENLFGSAGEDGRLVIWDTRTNQMQHQVKVHEREVNYLSFNPFNEWVLATASSDSTVALF 302

Query: 310 DLRKISSSLHTFDCHEEEVFQVGWHPKNETILASCCRGRRLMVWDLSRIEEEQ--TPEDV 368
           DLRK+++ LH    HE EVFQV W P +ET+LAS    RRLMVWDL+R+ EEQ     D 
Sbjct: 303 DLRKLNAPLHVMSSHEGEVFQVEWDPNHETVLASSGEDRRLMVWDLNRVGEEQLEIELDA 362

BLAST of Cp4.1LG03g17440 vs. TAIR 10
Match: AT4G35050.1 (Transducin family protein / WD-40 repeat family protein )

HSP 1 Score: 367.5 bits (942), Expect = 1.3e-101
Identity = 189/410 (46.10%), Postives = 247/410 (60.24%), Query Frame = 0

Query: 10  EEIRGKTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKD--YSV 69
           EE + +     V EE+ IWK+NTPFLYDL+I+H LEWPSLT+ W+P    P  KD  ++V
Sbjct: 4   EEGKDEAGLDQVEEEFSIWKRNTPFLYDLMISHPLEWPSLTLHWVPSTPIPYSKDPYFAV 63

Query: 70  QKMILGTHSFDNKPNYLMVAQVQLSLENSENDARHYRDDRASAGGFGCANGKVQIIQHIN 129
            K+ILGTH+     ++LMVA V +   ++E      RD             KV+I Q I 
Sbjct: 64  HKLILGTHTSGGAQDFLMVADVVIPTPDAE-PGLGGRDQEPIV-------PKVEIKQKIR 123

Query: 130 HDGEVNRARYMPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCNPDLRLRGHKSEGYG 189
            DGEVNRAR MPQ P ++  KT  +EV +FDY++   KP     C+PDLRL GH+ EGYG
Sbjct: 124 VDGEVNRARCMPQKPTLVGAKTSGSEVFLFDYARLSGKPQ-TSECDPDLRLMGHEQEGYG 183

Query: 190 LSWNKFKQGHLLSGSEDSHICLWDINATPDNKTLEAMQIF-------------------- 249
           L+W+ FK+G+LLSGS+D  ICLWD++AT  +K L  M ++                    
Sbjct: 184 LAWSSFKEGYLLSGSQDQRICLWDVSATATDKVLNPMHVYEGHQSIIEDVAWHMKNENIF 243

Query: 250 ----------------------------KVNCLTFNPFNEWLVATGSTDKMVKLFDLRKI 309
                                       ++N L+FNPFNEW++AT S+D  V LFDLRK+
Sbjct: 244 GSAGDDCQLVIWDLRTNQMQHQVKVHEREINYLSFNPFNEWVLATASSDSTVALFDLRKL 303

Query: 310 SSSLHTFDCHEEEVFQVGWHPKNETILASCCRGRRLMVWDLSRIEEEQ--TPEDVEDGPP 368
           ++ LH    HE EVFQV W P +ET+LAS    RRLMVWD++R+ +EQ     D EDGPP
Sbjct: 304 TAPLHVLSKHEGEVFQVEWDPNHETVLASSGEDRRLMVWDINRVGDEQLEIELDAEDGPP 363

BLAST of Cp4.1LG03g17440 vs. TAIR 10
Match: AT2G19520.1 (Transducin family protein / WD-40 repeat family protein )

HSP 1 Score: 168.3 bits (425), Expect = 1.2e-41
Identity = 121/442 (27.38%), Postives = 192/442 (43.44%), Query Frame = 0

Query: 21  VNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMILGTHSFDNK 80
           V+E+Y  WK   P LYD +  H L WPSL+  W P  ++   K+   Q++ L   +  + 
Sbjct: 64  VDEKYSQWKGLVPILYDWLANHNLVWPSLSCRWGPQLEQATYKNR--QRLYLSEQTDGSV 123

Query: 81  PNYLMVAQVQLSLENSENDARHYRDDRASAGGFGCANGKVQIIQHINHDGEVNRARYMPQ 140
           PN L++A  ++ ++     A H       A      +  V+  + I H GEVNR R +PQ
Sbjct: 124 PNTLVIANCEV-VKPRVAAAEHISQFNEEA-----RSPFVKKYKTIIHPGEVNRIRELPQ 183

Query: 141 NPFIIATKTVSAEVLVFDYSKHPSKPPLDGTCN--PDLRLRGHK-----------SEGYG 200
           N  I+AT T S +VL++D    P++  + G  N  PDL L GH+           +E + 
Sbjct: 184 NSKIVATHTDSPDVLIWDVETQPNRHAVLGAANSRPDLILTGHQDNAEFALAMCPTEPFV 243

Query: 201 LSWNKFK-------QGHLL--------SGS------------------------------ 260
           LS  K K       Q H+         SGS                              
Sbjct: 244 LSGGKDKSVVLWSIQDHITTIGTDSKSSGSIIKQTGEGTDKNESPTVGPRGVYHGHEDTV 303

Query: 261 -----------------EDSHICLWDINATPDNKT-LEAMQIFKVNCLTFNPFNEWLVAT 320
                            +DS + LWD     +  T +E      ++C+ +NP ++ L+ T
Sbjct: 304 EDVAFSPTSAQEFCSVGDDSCLILWDARTGTNPVTKVEKAHDADLHCVDWNPHDDNLILT 363

Query: 321 GSTDKMVKLFDLRK-----ISSSLHTFDCHEEEVFQVGWHPKNETILASCCRGRRLMVWD 375
           GS D  V+LFD RK     + S ++ F+ H+  V  V W P   ++  S      L +WD
Sbjct: 364 GSADNTVRLFDRRKLTANGVGSPIYKFEGHKAAVLCVQWSPDKSSVFGSSAEDGLLNIWD 423

BLAST of Cp4.1LG03g17440 vs. TAIR 10
Match: AT4G29730.1 (nucleosome/chromatin assembly factor group C5 )

HSP 1 Score: 156.8 bits (395), Expect = 3.5e-38
Identity = 116/448 (25.89%), Postives = 187/448 (41.74%), Query Frame = 0

Query: 15  KTNERLVNEEYKIWKKNTPFLYDLIITHALEWPSLTVEWLPDRDEPPGKDYSVQKMILGT 74
           ++ +  V++ Y  WK   P LYD  + H L WPSL+  W P  ++   K    Q++ L  
Sbjct: 48  QSQKATVDDTYSQWKTLLPILYDSFVNHTLVWPSLSCRWGPQLEQAGSK---TQRLYLSE 107

Query: 75  HSFDNKPNYLMVAQVQLSLENSENDARHYRDDRASAGGFGCANGKVQIIQHINHDGEVNR 134
            +  + PN L++A  + ++    N+  H              +  V+  + I H GEVNR
Sbjct: 108 QTNGSVPNTLVIANCE-TVNRQLNEKAH--------------SPFVKKYKTIIHPGEVNR 167

Query: 135 ARYMPQNPFIIATKTVSAEVLVFDYSKHPSKPPLDGT--CNPDLRLRGHK---------- 194
            R +PQN  I+AT T S ++L+++    P +  + G     PDL L GH+          
Sbjct: 168 IRELPQNSKIVATHTDSPDILIWNTETQPDRYAVLGAPDSRPDLLLIGHQDDAEFALAMC 227

Query: 195 -SEGYGLS---------WN-----------------KFKQ-------------------- 254
            +E + LS         WN                  FKQ                    
Sbjct: 228 PTEPFVLSGGKDKSVILWNIQDHITMAGSDSKSPGSSFKQTGEGSDKTGGPSVGPRGIYN 287

Query: 255 GH----------------LLSGSEDSHICLWDI-NATPDNKTLEAMQIFKVNCLTFNPFN 314
           GH                  S  +DS + LWD    T     +E      ++C+ +NP +
Sbjct: 288 GHKDTVEDVAFCPSSAQEFCSVGDDSCLMLWDARTGTSPAMKVEKAHDADLHCVDWNPHD 347

Query: 315 EWLVATGSTDKMVKLFDLRKISSS-----LHTFDCHEEEVFQVGWHPKNETILASCCRGR 374
             L+ TGS D  V++FD R ++S+     ++ F+ H   V  V W P   ++  S     
Sbjct: 348 NNLILTGSADNTVRVFDRRNLTSNGVGSPVYKFEGHRAAVLCVQWSPDKSSVFGSSAEDG 407

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
O224665.6e-19075.18WD-40 repeat-containing protein MSI1 OS=Solanum lycopersicum OX=4081 GN=MSI1 PE=... [more]
O224673.9e-18371.46Histone-binding protein MSI1 OS=Arabidopsis thaliana OX=3702 GN=MSI1 PE=1 SV=1[more]
Q10G811.5e-17969.81Histone-binding protein MSI1 homolog OS=Oryza sativa subsp. japonica OX=39947 GN... [more]
Q3MHL35.7e-13456.61Histone-binding protein RBBP4 OS=Bos taurus OX=9913 GN=RBBP4 PE=1 SV=3[more]
Q090285.7e-13456.61Histone-binding protein RBBP4 OS=Homo sapiens OX=9606 GN=RBBP4 PE=1 SV=3[more]
Match NameE-valueIdentityDescription
XP_023528376.14.02e-28188.06WD-40 repeat-containing protein MSI1-like [Cucurbita pepo subsp. pepo][more]
KAG7018909.12.95e-27688.25WD-40 repeat-containing protein MSI1, partial [Cucurbita argyrosperma subsp. arg... [more]
XP_022980301.13.63e-26784.63WD-40 repeat-containing protein MSI1-like [Cucurbita maxima][more]
KAG6582525.11.82e-26582.77WD-40 repeat-containing protein MSI1, partial [Cucurbita argyrosperma subsp. sor... [more]
XP_022937662.11.90e-24677.30WD-40 repeat-containing protein MSI1 [Cucurbita moschata] >XP_022974971.1 WD-40 ... [more]
Match NameE-valueIdentityDescription
A0A6J1IT851.76e-26784.63WD-40 repeat-containing protein MSI1-like OS=Cucurbita maxima OX=3661 GN=LOC1114... [more]
A0A6J1ICW49.21e-24777.30WD-40 repeat-containing protein MSI1 OS=Cucurbita maxima OX=3661 GN=LOC111473820... [more]
A0A6J1FBV19.21e-24777.30WD-40 repeat-containing protein MSI1 OS=Cucurbita moschata OX=3662 GN=LOC1114439... [more]
A0A1S3AWJ32.64e-24677.07WD-40 repeat-containing protein MSI1 OS=Cucumis melo OX=3656 GN=LOC103483412 PE=... [more]
A0A5A7U4A12.64e-24677.07WD-40 repeat-containing protein MSI1 OS=Cucumis melo var. makuwa OX=1194695 GN=E... [more]
Match NameE-valueIdentityDescription
AT5G58230.12.8e-18471.46Transducin/WD40 repeat-like superfamily protein [more]
AT2G16780.16.7e-10648.19Transducin family protein / WD-40 repeat family protein [more]
AT4G35050.11.3e-10146.10Transducin family protein / WD-40 repeat family protein [more]
AT2G19520.11.2e-4127.38Transducin family protein / WD-40 repeat family protein [more]
AT4G29730.13.5e-3825.89nucleosome/chromatin assembly factor group C5 [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 86..106
NoneNo IPR availablePANTHERPTHR22850:SF209BNAA10G29210D PROTEINcoord: 5..229
NoneNo IPR availablePANTHERPTHR22850WD40 REPEAT FAMILYcoord: 5..229
coord: 229..367
NoneNo IPR availablePANTHERPTHR22850:SF209BNAA10G29210D PROTEINcoord: 229..367
IPR020472G-protein beta WD-40 repeatPRINTSPR00320GPROTEINBRPTcoord: 198..212
score: 40.53
coord: 242..256
score: 37.38
coord: 343..357
score: 26.46
IPR001680WD40 repeatSMARTSM00320WD40_4coord: 220..255
e-value: 0.0093
score: 25.2
coord: 259..299
e-value: 3.0E-5
score: 33.4
coord: 171..211
e-value: 1.7E-4
score: 30.9
coord: 119..158
e-value: 29.0
score: 7.8
coord: 316..356
e-value: 0.0023
score: 27.2
IPR001680WD40 repeatPFAMPF00400WD40coord: 178..211
e-value: 0.18
score: 12.7
coord: 262..299
e-value: 0.06
score: 14.2
coord: 320..355
e-value: 0.0046
score: 17.8
coord: 229..255
e-value: 0.001
score: 19.8
IPR001680WD40 repeatPROSITEPS50082WD_REPEATS_2coord: 178..220
score: 10.34136
IPR001680WD40 repeatPROSITEPS50082WD_REPEATS_2coord: 266..308
score: 9.03805
IPR001680WD40 repeatPROSITEPS50082WD_REPEATS_2coord: 323..357
score: 9.03805
IPR015943WD40/YVTN repeat-like-containing domain superfamilyGENE3D2.130.10.10coord: 228..372
e-value: 9.9E-53
score: 181.7
IPR015943WD40/YVTN repeat-like-containing domain superfamilyGENE3D2.130.10.10coord: 3..227
e-value: 7.8E-71
score: 241.2
IPR022052Histone-binding protein RBBP4, N-terminalPFAMPF12265CAF1C_H4-bdcoord: 23..91
e-value: 1.7E-26
score: 92.3
IPR019775WD40 repeat, conserved sitePROSITEPS00678WD_REPEATS_1coord: 198..212
IPR036322WD40-repeat-containing domain superfamilySUPERFAMILY50978WD40 repeat-likecoord: 116..360

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG03g17440.1Cp4.1LG03g17440.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0010468 regulation of gene expression
cellular_component GO:0005634 nucleus
molecular_function GO:0042393 histone binding
molecular_function GO:0005515 protein binding