CmaCh04G004570 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh04G004570
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Transcriptional coactivator Hfi1/Transcriptional adapter 1
Location: Cma_Chr04: 2335176 .. 2336880 (+)
RNA-Seq Expression: CmaCh04G004570
Synteny: CmaCh04G004570
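
The Location field above can be turned into a feature length with a few lines of Python. This is only a sketch: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which is the usual convention for this kind of record but is not stated on the page.

```python
import re

def parse_location(text):
    """Split a 'seqid: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("Cma_Chr04: 2335176 .. 2336880 (+)")

# Assumption: 1-based inclusive coordinates, so both endpoints count.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the gene spans 1705 bases on the plus strand of Cma_Chr04.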
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CACGTGCGTAATTGAAATGCGATACGATTGGCCGAGGGACAAACGCATTCGCAAGAACTCTGGTCGATGATCGCTGTTTTGATTTCGGTCTCCAATCTGCAAATCACCAAAAGCGATCGATTCAAGATTAAACAATTTAAAGAAGCGGCAAAAACTTGCAAATTTTCGATTCAGAAGACGCCGCCAAAATCATCCATCATTCATACACAATTCCGTCATTTCTTTTCGGTGAGTTTCTGAATTGATTGCCTGCTTCTTAGGGTTTGCACTTCACCTTCAAATTTGAAGTTGGCATTGAATGATTTTTTTACGTTTACGATGCTTTAGGCTGTTAGTTTTTATGTTTTCGACGCTGATGAAAACCCTTGTTTGAGCGCGATTTTGCTGCGATCGATGTTTTGTTTTGAATCAAATTTGGTTCGTTTCGAGCTGTTCTCCAGTTTGGTTTCTTCGATGTGATTCATAAGTTCGGGTATTGATAATCAATCGTTTAGTTGCGATATTGGAAAAGATTTGATACTTAGCTGTGAAATTTAGGTTTCTGGATTTCTGATGAAAAGTTCTTGCCCTATTAGTCTACTTGTTGAAGGGGGATTCTTAGGGCTTTATCTCGGATTCAGCCATTGTTGAAATTACAGCTCTGAAAATGATTCCGAGGAAAGATAGTTCTCGTATAGACACTTCAGAGCTGAAAGCGATGATCTACCAAAAGCTTGGACATCAGAGGTCGGAGAAATACTTTGATCAGCTCAAAAAATTGTTAAGTTTAAAGATCAACAAGAGGGAATTCGACAAGTTTTGTATTCAGATCATTGGAAGGGAGGTTATACCTCTTCATAATCGGCTTATCAGAGCGATTCTTCGAAATGCCTGTGTTGCTAAAATTCCCCCTGTTCTTAGCAGTTCTAGGAAAGTAGGAGCCAATCTCAGTGTCAAGGTTGTGCATGGATATCAGAAGAGTTGTCTTCAATCGCTTAATGGAGATGCATTTCTCTCTTCCCCTCGTAAGGGCAGGTCTCCGGCTAGTAGAGACCGCAAGATTCGAGATCGCCCGAGTCCTTTAGGTCCATGTGGGAAGCCTCAGAATATTGAACTTGAACTTGCTTTCAAGGCACAAGAACAGCAGAGTGCAACTGAGTTGCATTCGCTTGGTAGTCGTCCTCCTGTCGAAATGGCATCTGTAGAAGACGGAGAAGAGGTCGAGCAGGTGGCTGTGAGTCCAGGAGTTCAGAGCAGAAGCCCGGTTACTGCTCCATTTGGTATATCCATGAACTTCATTGGATCTGGTAAAACTCTGTCTAATGTATCTGTAGGAAGAAATTACCATGTAACAACATGTCAAAATGGTGGCAAGCTACCCGACACAAGGTTGCTAAGAACTCATTTGAAGCAGAAGTTGGAAACGGAGCAGATTGATATATCTGTGGATGGTGTAAACCTTCTCAACAATGCGCTCGATGTTTATTTAAAGAGGTTAATCGAGCCATGTTTGAATTTCTCTCGATCAAGGTGCGAACGAGAGGGGAATCAACCGATCACTAACTCAAGAACCAGATCCCTGGAACAACATTGGCATAGATCTCAACGGTTGAACAACGCATCCTTGTTGGACTTCAGTGTTGCAATGCAACTGAATCCTGAAGTACTCGGAAAAGACCGAACAATACAGCTCGAGAAAATTAGTTTACGAGCTTTAGAAGAGTGA

mRNA sequence

CACGTGCGTAATTGAAATGCGATACGATTGGCCGAGGGACAAACGCATTCGCAAGAACTCTGGTCGATGATCGCTGTTTTGATTTCGGTCTCCAATCTGCAAATCACCAAAAGCGATCGATTCAAGATTAAACAATTTAAAGAAGCGGCAAAAACTTGCAAATTTTCGATTCAGAAGACGCCGCCAAAATCATCCATCATTCATACACAATTCCGTCATTTCTTTTCGGGCTTTATCTCGGATTCAGCCATTGTTGAAATTACAGCTCTGAAAATGATTCCGAGGAAAGATAGTTCTCGTATAGACACTTCAGAGCTGAAAGCGATGATCTACCAAAAGCTTGGACATCAGAGGTCGGAGAAATACTTTGATCAGCTCAAAAAATTGTTAAGTTTAAAGATCAACAAGAGGGAATTCGACAAGTTTTGTATTCAGATCATTGGAAGGGAGGTTATACCTCTTCATAATCGGCTTATCAGAGCGATTCTTCGAAATGCCTGTGTTGCTAAAATTCCCCCTGTTCTTAGCAGTTCTAGGAAAGTAGGAGCCAATCTCAGTGTCAAGGTTGTGCATGGATATCAGAAGAGTTGTCTTCAATCGCTTAATGGAGATGCATTTCTCTCTTCCCCTCGTAAGGGCAGGTCTCCGGCTAGTAGAGACCGCAAGATTCGAGATCGCCCGAGTCCTTTAGGTCCATGTGGGAAGCCTCAGAATATTGAACTTGAACTTGCTTTCAAGGCACAAGAACAGCAGAGTGCAACTGAGTTGCATTCGCTTGGTAGTCGTCCTCCTGTCGAAATGGCATCTGTAGAAGACGGAGAAGAGGTCGAGCAGGTGGCTGTGAGTCCAGGAGTTCAGAGCAGAAGCCCGGTTACTGCTCCATTTGGTATATCCATGAACTTCATTGGATCTGGTAAAACTCTGTCTAATGTATCTGTAGGAAGAAATTACCATGTAACAACATGTCAAAATGGTGGCAAGCTACCCGACACAAGGTTGCTAAGAACTCATTTGAAGCAGAAGTTGGAAACGGAGCAGATTGATATATCTGTGGATGGTGTAAACCTTCTCAACAATGCGCTCGATGTTTATTTAAAGAGGTTAATCGAGCCATGTTTGAATTTCTCTCGATCAAGGTGCGAACGAGAGGGGAATCAACCGATCACTAACTCAAGAACCAGATCCCTGGAACAACATTGGCATAGATCTCAACGGTTGAACAACGCATCCTTGTTGGACTTCAGTGTTGCAATGCAACTGAATCCTGAAGTACTCGGAAAAGACCGAACAATACAGCTCGAGAAAATTAGTTTACGAGCTTTAGAAGAGTGA

Coding sequence (CDS)

ATGATCGCTGTTTTGATTTCGGTCTCCAATCTGCAAATCACCAAAAGCGATCGATTCAAGATTAAACAATTTAAAGAAGCGGCAAAAACTTGCAAATTTTCGATTCAGAAGACGCCGCCAAAATCATCCATCATTCATACACAATTCCGTCATTTCTTTTCGGGCTTTATCTCGGATTCAGCCATTGTTGAAATTACAGCTCTGAAAATGATTCCGAGGAAAGATAGTTCTCGTATAGACACTTCAGAGCTGAAAGCGATGATCTACCAAAAGCTTGGACATCAGAGGTCGGAGAAATACTTTGATCAGCTCAAAAAATTGTTAAGTTTAAAGATCAACAAGAGGGAATTCGACAAGTTTTGTATTCAGATCATTGGAAGGGAGGTTATACCTCTTCATAATCGGCTTATCAGAGCGATTCTTCGAAATGCCTGTGTTGCTAAAATTCCCCCTGTTCTTAGCAGTTCTAGGAAAGTAGGAGCCAATCTCAGTGTCAAGGTTGTGCATGGATATCAGAAGAGTTGTCTTCAATCGCTTAATGGAGATGCATTTCTCTCTTCCCCTCGTAAGGGCAGGTCTCCGGCTAGTAGAGACCGCAAGATTCGAGATCGCCCGAGTCCTTTAGGTCCATGTGGGAAGCCTCAGAATATTGAACTTGAACTTGCTTTCAAGGCACAAGAACAGCAGAGTGCAACTGAGTTGCATTCGCTTGGTAGTCGTCCTCCTGTCGAAATGGCATCTGTAGAAGACGGAGAAGAGGTCGAGCAGGTGGCTGTGAGTCCAGGAGTTCAGAGCAGAAGCCCGGTTACTGCTCCATTTGGTATATCCATGAACTTCATTGGATCTGGTAAAACTCTGTCTAATGTATCTGTAGGAAGAAATTACCATGTAACAACATGTCAAAATGGTGGCAAGCTACCCGACACAAGGTTGCTAAGAACTCATTTGAAGCAGAAGTTGGAAACGGAGCAGATTGATATATCTGTGGATGGTGTAAACCTTCTCAACAATGCGCTCGATGTTTATTTAAAGAGGTTAATCGAGCCATGTTTGAATTTCTCTCGATCAAGGTGCGAACGAGAGGGGAATCAACCGATCACTAACTCAAGAACCAGATCCCTGGAACAACATTGGCATAGATCTCAACGGTTGAACAACGCATCCTTGTTGGACTTCAGTGTTGCAATGCAACTGAATCCTGAAGTACTCGGAAAAGACCGAACAATACAGCTCGAGAAAATTAGTTTACGAGCTTTAGAAGAGTGA

Protein sequence

MIAVLISVSNLQITKSDRFKIKQFKEAAKTCKFSIQKTPPKSSIIHTQFRHFFSGFISDSAIVEITALKMIPRKDSSRIDTSELKAMIYQKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREVIPLHNRLIRAILRNACVAKIPPVLSSSRKVGANLSVKVVHGYQKSCLQSLNGDAFLSSPRKGRSPASRDRKIRDRPSPLGPCGKPQNIELELAFKAQEQQSATELHSLGSRPPVEMASVEDGEEVEQVAVSPGVQSRSPVTAPFGISMNFIGSGKTLSNVSVGRNYHVTTCQNGGKLPDTRLLRTHLKQKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCEREGNQPITNSRTRSLEQHWHRSQRLNNASLLDFSVAMQLNPEVLGKDRTIQLEKISLRALEE
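
The relationship between the coding sequence and the protein sequence can be checked by translating codons with the standard genetic code. The sketch below is illustrative only: it assumes the standard nuclear code applies to this gene, builds the codon table itself rather than relying on any library, and uses a hypothetical helper name `translate`.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First nine codons of the CDS listed above.
head = translate("ATGATCGCTGTTTTGATTTCGGTCTCC")
# Last three codons of the CDS listed above (ends in the TGA stop).
tail = translate("GAAGAGTGA")
print(head, tail)
```

The two fragments give MIAVLISVS and EE, matching the start and end of the protein sequence shown above.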
Homology
BLAST of CmaCh04G004570 vs. ExPASy TrEMBL
Match: A0A6J1JSH5 (uncharacterized protein LOC111488516 OS=Cucurbita maxima OX=3661 GN=LOC111488516 PE=4 SV=1)

HSP 1 Score: 684.1 bits (1764), Expect = 3.7e-193
Identity = 352/352 (100.00%), Positives = 352/352 (100.00%), Query Frame = 0

Query: 70  MIPRKDSSRIDTSELKAMIYQKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREV 129
           MIPRKDSSRIDTSELKAMIYQKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREV
Sbjct: 1   MIPRKDSSRIDTSELKAMIYQKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREV 60

Query: 130 IPLHNRLIRAILRNACVAKIPPVLSSSRKVGANLSVKVVHGYQKSCLQSLNGDAFLSSPR 189
           IPLHNRLIRAILRNACVAKIPPVLSSSRKVGANLSVKVVHGYQKSCLQSLNGDAFLSSPR
Sbjct: 61  IPLHNRLIRAILRNACVAKIPPVLSSSRKVGANLSVKVVHGYQKSCLQSLNGDAFLSSPR 120

Query: 190 KGRSPASRDRKIRDRPSPLGPCGKPQNIELELAFKAQEQQSATELHSLGSRPPVEMASVE 249
           KGRSPASRDRKIRDRPSPLGPCGKPQNIELELAFKAQEQQSATELHSLGSRPPVEMASVE
Sbjct: 121 KGRSPASRDRKIRDRPSPLGPCGKPQNIELELAFKAQEQQSATELHSLGSRPPVEMASVE 180

Query: 250 DGEEVEQVAVSPGVQSRSPVTAPFGISMNFIGSGKTLSNVSVGRNYHVTTCQNGGKLPDT 309
           DGEEVEQVAVSPGVQSRSPVTAPFGISMNFIGSGKTLSNVSVGRNYHVTTCQNGGKLPDT
Sbjct: 181 DGEEVEQVAVSPGVQSRSPVTAPFGISMNFIGSGKTLSNVSVGRNYHVTTCQNGGKLPDT 240

Query: 310 RLLRTHLKQKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCEREGNQPITNS 369
           RLLRTHLKQKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCEREGNQPITNS
Sbjct: 241 RLLRTHLKQKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCEREGNQPITNS 300

Query: 370 RTRSLEQHWHRSQRLNNASLLDFSVAMQLNPEVLGKDRTIQLEKISLRALEE 422
           RTRSLEQHWHRSQRLNNASLLDFSVAMQLNPEVLGKDRTIQLEKISLRALEE
Sbjct: 301 RTRSLEQHWHRSQRLNNASLLDFSVAMQLNPEVLGKDRTIQLEKISLRALEE 352

BLAST of CmaCh04G004570 vs. ExPASy TrEMBL
Match: A0A6J1FW12 (uncharacterized protein LOC111447829 OS=Cucurbita moschata OX=3662 GN=LOC111447829 PE=4 SV=1)

HSP 1 Score: 661.4 bits (1705), Expect = 2.6e-186
Identity = 341/353 (96.60%), Positives = 348/353 (98.58%), Query Frame = 0

Query: 70  MIPRKDSSRIDTSELKAMIYQKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREV 129
           MIPRKDSSRIDTSELKAMIYQKLGHQRSEKYFDQLK LLSLKINKREFDKFCIQIIGREV
Sbjct: 1   MIPRKDSSRIDTSELKAMIYQKLGHQRSEKYFDQLKNLLSLKINKREFDKFCIQIIGREV 60

Query: 130 IPLHNRLIRAILRNACVAKIPPVLSSSRKVGANLSVKVVHGYQKSCLQSLNGDAFLSSPR 189
           IPLHNRLIRAILRNACVAKIPPVLSSSRKVGANLSVKVVHGYQKSCLQSLNGDAFLSSPR
Sbjct: 61  IPLHNRLIRAILRNACVAKIPPVLSSSRKVGANLSVKVVHGYQKSCLQSLNGDAFLSSPR 120

Query: 190 KGRSPASRDRKIRDRPSPLGPCGKPQNIEL-ELAFKAQEQQSATELHSLGSRPPVEMASV 249
           KGRSP SRDRKIRDRPSPLGPCGKPQNIEL ELAFKAQEQQSATELHSLGSRPPVEMASV
Sbjct: 121 KGRSPVSRDRKIRDRPSPLGPCGKPQNIELEELAFKAQEQQSATELHSLGSRPPVEMASV 180

Query: 250 EDGEEVEQVAVSPGVQSRSPVTAPFGISMNFIGSGKTLSNVSVGRNYHVTTCQNGGKLPD 309
           EDGEEVEQ+A SPGVQSRSPVTAPFGISMNFIGSGKTLSNV+VGRNY +TTCQNGG+LPD
Sbjct: 181 EDGEEVEQMAASPGVQSRSPVTAPFGISMNFIGSGKTLSNVTVGRNYRLTTCQNGGELPD 240

Query: 310 TRLLRTHLKQKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCEREGNQPITN 369
           TRLLRTHLKQKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCEREGNQPIT+
Sbjct: 241 TRLLRTHLKQKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCEREGNQPITD 300

Query: 370 SRTRSLEQHWHRSQRLNNASLLDFSVAMQLNPEVLGKDRTIQLEKISLRALEE 422
           SRTRSLEQHWHRSQRL+NASLLDFSVAMQLNPE+LGKDRTIQLEKISLRALEE
Sbjct: 301 SRTRSLEQHWHRSQRLSNASLLDFSVAMQLNPELLGKDRTIQLEKISLRALEE 353
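
The percentages on each HSP summary line are simply the match counts divided by the alignment length (which includes gap columns). A minimal sketch for recomputing them from the text of such a line follows. The function name `parse_hsp_stats` is hypothetical, and the pattern deliberately accepts any label beginning with "Pos" so that it tolerates spelling variants of that field.

```python
import re

def parse_hsp_stats(line):
    """Extract match counts from an HSP summary line and recompute percentages."""
    m = re.search(r"Identity = (\d+)/(\d+) \([\d.]+%\), Pos\w+ = (\d+)/(\d+)", line)
    if m is None:
        raise ValueError("not an HSP summary line")
    identical, length, positive, length2 = map(int, m.groups())
    if length != length2:
        raise ValueError("identity and positive counts use different lengths")
    return {
        "aligned_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positive / length, 2),
    }

stats = parse_hsp_stats(
    "Identity = 341/353 (96.60%), Positives = 348/353 (98.58%), Query Frame = 0"
)
print(stats)
```

For the Cucurbita moschata hit this reproduces the reported 96.60% identity and 98.58% positives over 353 aligned columns.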

BLAST of CmaCh04G004570 vs. ExPASy TrEMBL
Match: A0A0A0KWF9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G000960 PE=4 SV=1)

HSP 1 Score: 614.0 bits (1582), Expect = 4.7e-172
Identity = 336/436 (77.06%), Positives = 368/436 (84.40%), Query Frame = 0

Query: 5   LISVSNLQITKSDRFK-IKQFKEAAKTCKFSIQKTPPKSSIIHTQ-----FRHFF----- 64
           L+ VSNL+IT+  + K IKQF +                +I+H+Q     F+ FF     
Sbjct: 38  LLLVSNLRITQKFQSKPIKQFTQ-----------NHLNFTILHSQDGAISFQFFFYLFCW 97

Query: 65  ----SGFISDSAIVEITALKMIPRKDSSRIDTSELKAMIYQKLGHQRSEKYFDQLKKLLS 124
                GF+S+SAI  I ALKM+PRKD+SRIDTSELKAMIY+KLGHQRS+KYFDQLKKLLS
Sbjct: 98  WRIPEGFVSNSAIAGIAALKMLPRKDTSRIDTSELKAMIYRKLGHQRSDKYFDQLKKLLS 157

Query: 125 LKINKREFDKFCIQIIGREVIPLHNRLIRAILRNACVAKIPPVLSSSRKVGANLSVKVVH 184
           LK NKREFDKFCIQIIGRE+IPLHNRLIRAIL+NACVAK PPVLSS+RKVG NLSVKVV+
Sbjct: 158 LKTNKREFDKFCIQIIGREIIPLHNRLIRAILQNACVAKTPPVLSSTRKVGGNLSVKVVN 217

Query: 185 GYQKSCLQSLNGDAFLSSPRKGRSPASRDRKIRDRPSPLGPCGKPQNIEL-ELAFKAQEQ 244
           GYQ+SCLQSL+GDAFLSSPRKGRSP SRDRKIRDRPSPLGPCGKPQN+ L E A KAQEQ
Sbjct: 218 GYQRSCLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKPQNMALEEFASKAQEQ 277

Query: 245 QSATELHSLGSRPPVEMASVEDGEEVEQVAVSPGVQSRSPVTAPFGISMNFIGSGKTLSN 304
           QSATELHSLGSRPPVEMASVEDGEEVEQVA SPGVQSRSPVTAP GISMNFIGSGKTLSN
Sbjct: 278 QSATELHSLGSRPPVEMASVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFIGSGKTLSN 337

Query: 305 VSVGRNYHVTTCQNGGKLPDTRLLRTHLKQKLETEQIDISVDGVNLLNNALDVYLKRLIE 364
           V VG NYHVTTCQ+ G+LPDTRLLRTHL++KLETEQIDISVDGVNLLNNALDVYLKRLIE
Sbjct: 338 VPVGSNYHVTTCQDVGELPDTRLLRTHLRKKLETEQIDISVDGVNLLNNALDVYLKRLIE 397

Query: 365 PCLNFSRSRCER---EGNQPITNSRTRSLEQHWHRSQRLNNASLLDFSVAMQLNPEVLGK 422
           PCLNFSRSRCER    GNQPIT SR    EQH HR+Q+LNN SLLDF VAMQLNP+VLG+
Sbjct: 398 PCLNFSRSRCERLKFTGNQPITGSRITFQEQHRHRAQQLNNGSLLDFRVAMQLNPQVLGR 457

BLAST of CmaCh04G004570 vs. ExPASy TrEMBL
Match: A0A5A7TT05 (Transcriptional coactivator Hfi1/Transcriptional adapter 1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold655G00760 PE=4 SV=1)

HSP 1 Score: 604.0 bits (1556), Expect = 4.8e-169
Identity = 317/372 (85.22%), Positives = 341/372 (91.67%), Query Frame = 0

Query: 55  GFISDSAIVEITALKMIPRKDSSRIDTSELKAMIYQKLGHQRSEKYFDQLKKLLSLKINK 114
           GF+S+S I EI ALKM PRKD+SRIDTSELKAMIY+KLGHQRS+KYFDQLKKLLSLK NK
Sbjct: 99  GFVSNSDIAEIAALKMFPRKDTSRIDTSELKAMIYRKLGHQRSDKYFDQLKKLLSLKTNK 158

Query: 115 REFDKFCIQIIGREVIPLHNRLIRAILRNACVAKIPPVLSSSRKVGANLSVKVVHGYQKS 174
           REFDKFCIQIIGRE+IPLHNRLIRAIL+NACVAK PPVLSS+RKVG NLSVKVV+GYQ+S
Sbjct: 159 REFDKFCIQIIGREIIPLHNRLIRAILQNACVAKTPPVLSSTRKVGGNLSVKVVNGYQRS 218

Query: 175 CLQSLNGDAFLSSPRKGRSPASRDRKIRDRPSPLGPCGKPQNIEL-ELAFKAQEQQSATE 234
           CLQSL+GDAFLSSPRKGRSP SRDRKIRDRPSPLGPCGKPQN+ L E A KAQEQQSATE
Sbjct: 219 CLQSLHGDAFLSSPRKGRSPVSRDRKIRDRPSPLGPCGKPQNMALEEFASKAQEQQSATE 278

Query: 235 LHSLGSRPPVEMASVEDGEEVEQVAVSPGVQSRSPVTAPFGISMNFIGSGKTLSNVSV-G 294
           LHSLGSRPPVEMASVEDGEEVEQVA SPGVQSRSPVTAP GISMNFIGS KTLSNV V G
Sbjct: 279 LHSLGSRPPVEMASVEDGEEVEQVAGSPGVQSRSPVTAPLGISMNFIGSSKTLSNVPVGG 338

Query: 295 RNYHVTTCQNGGKLPDTRLLRTHLKQKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLN 354
           RNYHVTTCQ+GG+LPDTRLLRTHL++KLETEQIDISVDGVNLLNNALDVYLKRLIEPCLN
Sbjct: 339 RNYHVTTCQDGGELPDTRLLRTHLRKKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLN 398

Query: 355 FSRSRCER---EGNQPITNSRTRSLEQHWHRSQRLNNASLLDFSVAMQLNPEVLGKDRTI 414
           FSRSRCER    GNQPIT SR    EQ+ HR+Q++NN SLLDF VAMQLNP+VLG++ T+
Sbjct: 399 FSRSRCERLKFTGNQPITGSRITFQEQNRHRAQQINNGSLLDFRVAMQLNPQVLGREWTM 458

Query: 415 QLEKISLRALEE 422
           QLEKISLRA EE
Sbjct: 459 QLEKISLRASEE 470

BLAST of CmaCh04G004570 vs. ExPASy TrEMBL
Match: A0A1S3BYK6 (uncharacterized protein LOC103494799 OS=Cucumis melo OX=3656 GN=LOC103494799 PE=4 SV=1)

HSP 1 Score: 585.5 bits (1508), Expect = 1.8e-163
Identity = 307/357 (85.99%), Positives = 329/357 (92.16%), Query Frame = 0

Query: 70  MIPRKDSSRIDTSELKAMIYQKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREV 129
           M PRKD+SRIDTSELKAMIY+KLGHQRS+KYFDQLKKLLSLK NKREFDKFCIQIIGRE+
Sbjct: 1   MFPRKDTSRIDTSELKAMIYRKLGHQRSDKYFDQLKKLLSLKTNKREFDKFCIQIIGREI 60

Query: 130 IPLHNRLIRAILRNACVAKIPPVLSSSRKVGANLSVKVVHGYQKSCLQSLNGDAFLSSPR 189
           IPLHNRLIRAIL+NACVAK PPVLSS+RKVG NLSVKVV+GYQ+SCLQSL+GDAFLSSPR
Sbjct: 61  IPLHNRLIRAILQNACVAKTPPVLSSTRKVGGNLSVKVVNGYQRSCLQSLHGDAFLSSPR 120

Query: 190 KGRSPASRDRKIRDRPSPLGPCGKPQNIEL-ELAFKAQEQQSATELHSLGSRPPVEMASV 249
           KGRSP SRDRKIRDRPSPLGPCGKPQN+ L E A KAQEQQSATELHSLGSRPPVEMASV
Sbjct: 121 KGRSPVSRDRKIRDRPSPLGPCGKPQNMALEEFASKAQEQQSATELHSLGSRPPVEMASV 180

Query: 250 EDGEEVEQVAVSPGVQSRSPVTAPFGISMNFIGSGKTLSNVSV-GRNYHVTTCQNGGKLP 309
           EDGEEVEQVA SPGVQSRSPVTAP GISMNFIGS KTLSNV V GRNYHVTTCQ+GG+LP
Sbjct: 181 EDGEEVEQVAGSPGVQSRSPVTAPLGISMNFIGSSKTLSNVPVGGRNYHVTTCQDGGELP 240

Query: 310 DTRLLRTHLKQKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCER---EGNQ 369
           DTRLLRTHL++KLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCER    GNQ
Sbjct: 241 DTRLLRTHLRKKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCERLKFTGNQ 300

Query: 370 PITNSRTRSLEQHWHRSQRLNNASLLDFSVAMQLNPEVLGKDRTIQLEKISLRALEE 422
           PIT SR    EQ+ HR+Q++NN SLLDF VAMQLNP+VLG++ T+QLEKISLRA EE
Sbjct: 301 PITGSRITFQEQNRHRAQQINNGSLLDFRVAMQLNPQVLGREWTMQLEKISLRASEE 357

BLAST of CmaCh04G004570 vs. NCBI nr
Match: KAG7030952.1 (hypothetical protein SDJN02_04989 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 750.0 bits (1935), Expect = 1.1e-212
Identity = 397/424 (93.63%), Positives = 404/424 (95.28%), Query Frame = 0

Query: 1   MIAVLISVSNLQITKSDRFKIKQFKEAAKTCKFSIQKTPPK--SSIIHTQFRHFFSGFIS 60
           MIAVLISVS LQITKSDRFKIKQFKEAAKTC+F  ++      S I   QFRHFFSGFIS
Sbjct: 1   MIAVLISVSYLQITKSDRFKIKQFKEAAKTCEFRFRRRRQNHPSFIPVIQFRHFFSGFIS 60

Query: 61  DSAIVEITALKMIPRKDSSRIDTSELKAMIYQKLGHQRSEKYFDQLKKLLSLKINKREFD 120
           DSAIVEITALKMIPRKDSSRIDTSELKAMIYQKLGHQRSEKYFDQLKKLLSLKINKREFD
Sbjct: 61  DSAIVEITALKMIPRKDSSRIDTSELKAMIYQKLGHQRSEKYFDQLKKLLSLKINKREFD 120

Query: 121 KFCIQIIGREVIPLHNRLIRAILRNACVAKIPPVLSSSRKVGANLSVKVVHGYQKSCLQS 180
           KFCIQIIGREVIPLHNRLIRAILRNACVAKIPPVLSSSRK GANLSVKVVHGYQKSCLQS
Sbjct: 121 KFCIQIIGREVIPLHNRLIRAILRNACVAKIPPVLSSSRKGGANLSVKVVHGYQKSCLQS 180

Query: 181 LNGDAFLSSPRKGRSPASRDRKIRDRPSPLGPCGKPQNIEL-ELAFKAQEQQSATELHSL 240
           LNGDAFLSSPRKGRSP SRDRKIRDRPSPLGPCGK QNIEL ELAFKAQEQQSATELHSL
Sbjct: 181 LNGDAFLSSPRKGRSPISRDRKIRDRPSPLGPCGKAQNIELEELAFKAQEQQSATELHSL 240

Query: 241 GSRPPVEMASVEDGEEVEQVAVSPGVQSRSPVTAPFGISMNFIGSGKTLSNVSVGRNYHV 300
           GSRPPVEMASVEDGEEVEQ+A SPGVQSRSPVTAPFGISMNFIGSGKTLSNVSVGRNYHV
Sbjct: 241 GSRPPVEMASVEDGEEVEQMAASPGVQSRSPVTAPFGISMNFIGSGKTLSNVSVGRNYHV 300

Query: 301 TTCQNGGKLPDTRLLRTHLKQKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSR 360
           TTCQNGG+LPDTRLLRTHLKQKLE EQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSR
Sbjct: 301 TTCQNGGELPDTRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSR 360

Query: 361 CEREGNQPITNSRTRSLEQHWHRSQRLNNASLLDFSVAMQLNPEVLGKDRTIQLEKISLR 420
           CEREGNQPIT+SRTRSLE+HWHRSQRLNNASLLDFS AMQLNPEVLGKDRTIQLEKISLR
Sbjct: 361 CEREGNQPITDSRTRSLERHWHRSQRLNNASLLDFSFAMQLNPEVLGKDRTIQLEKISLR 420

Query: 421 ALEE 422
           ALEE
Sbjct: 421 ALEE 424

BLAST of CmaCh04G004570 vs. NCBI nr
Match: XP_022992051.1 (uncharacterized protein LOC111488516 [Cucurbita maxima] >XP_022992060.1 uncharacterized protein LOC111488516 [Cucurbita maxima])

HSP 1 Score: 684.1 bits (1764), Expect = 7.6e-193
Identity = 352/352 (100.00%), Positives = 352/352 (100.00%), Query Frame = 0

Query: 70  MIPRKDSSRIDTSELKAMIYQKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREV 129
           MIPRKDSSRIDTSELKAMIYQKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREV
Sbjct: 1   MIPRKDSSRIDTSELKAMIYQKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREV 60

Query: 130 IPLHNRLIRAILRNACVAKIPPVLSSSRKVGANLSVKVVHGYQKSCLQSLNGDAFLSSPR 189
           IPLHNRLIRAILRNACVAKIPPVLSSSRKVGANLSVKVVHGYQKSCLQSLNGDAFLSSPR
Sbjct: 61  IPLHNRLIRAILRNACVAKIPPVLSSSRKVGANLSVKVVHGYQKSCLQSLNGDAFLSSPR 120

Query: 190 KGRSPASRDRKIRDRPSPLGPCGKPQNIELELAFKAQEQQSATELHSLGSRPPVEMASVE 249
           KGRSPASRDRKIRDRPSPLGPCGKPQNIELELAFKAQEQQSATELHSLGSRPPVEMASVE
Sbjct: 121 KGRSPASRDRKIRDRPSPLGPCGKPQNIELELAFKAQEQQSATELHSLGSRPPVEMASVE 180

Query: 250 DGEEVEQVAVSPGVQSRSPVTAPFGISMNFIGSGKTLSNVSVGRNYHVTTCQNGGKLPDT 309
           DGEEVEQVAVSPGVQSRSPVTAPFGISMNFIGSGKTLSNVSVGRNYHVTTCQNGGKLPDT
Sbjct: 181 DGEEVEQVAVSPGVQSRSPVTAPFGISMNFIGSGKTLSNVSVGRNYHVTTCQNGGKLPDT 240

Query: 310 RLLRTHLKQKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCEREGNQPITNS 369
           RLLRTHLKQKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCEREGNQPITNS
Sbjct: 241 RLLRTHLKQKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCEREGNQPITNS 300

Query: 370 RTRSLEQHWHRSQRLNNASLLDFSVAMQLNPEVLGKDRTIQLEKISLRALEE 422
           RTRSLEQHWHRSQRLNNASLLDFSVAMQLNPEVLGKDRTIQLEKISLRALEE
Sbjct: 301 RTRSLEQHWHRSQRLNNASLLDFSVAMQLNPEVLGKDRTIQLEKISLRALEE 352

BLAST of CmaCh04G004570 vs. NCBI nr
Match: XP_023525283.1 (uncharacterized protein LOC111788929 [Cucurbita pepo subsp. pepo] >XP_023525290.1 uncharacterized protein LOC111788929 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 664.1 bits (1712), Expect = 8.1e-187
Identity = 342/353 (96.88%), Positives = 348/353 (98.58%), Query Frame = 0

Query: 70  MIPRKDSSRIDTSELKAMIYQKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREV 129
           MIPRKDSSRIDTSELKAMIYQKLGHQRSEKYFD LKKLLSLKINKREFDKFCIQIIGREV
Sbjct: 1   MIPRKDSSRIDTSELKAMIYQKLGHQRSEKYFDHLKKLLSLKINKREFDKFCIQIIGREV 60

Query: 130 IPLHNRLIRAILRNACVAKIPPVLSSSRKVGANLSVKVVHGYQKSCLQSLNGDAFLSSPR 189
           IPLHNRLIRAILRNACVAKIPPVLSSSRKVGANLSVKVVHGYQ+SCLQSLNGDAFLSSPR
Sbjct: 61  IPLHNRLIRAILRNACVAKIPPVLSSSRKVGANLSVKVVHGYQRSCLQSLNGDAFLSSPR 120

Query: 190 KGRSPASRDRKIRDRPSPLGPCGKPQNIEL-ELAFKAQEQQSATELHSLGSRPPVEMASV 249
           KGRSP SRDRKIRDRPSPLGPCGKPQNIEL ELAFK QEQQSATELHSLGSRPPVEMASV
Sbjct: 121 KGRSPVSRDRKIRDRPSPLGPCGKPQNIELEELAFKVQEQQSATELHSLGSRPPVEMASV 180

Query: 250 EDGEEVEQVAVSPGVQSRSPVTAPFGISMNFIGSGKTLSNVSVGRNYHVTTCQNGGKLPD 309
           EDGEEVEQ+A SPGV+SRSPVTAPFGISMNFIGSGKTLSNVSVGRNYHVTTCQNGG+LPD
Sbjct: 181 EDGEEVEQMAASPGVESRSPVTAPFGISMNFIGSGKTLSNVSVGRNYHVTTCQNGGELPD 240

Query: 310 TRLLRTHLKQKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCEREGNQPITN 369
           TRLLRTHLKQKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCEREGNQPIT+
Sbjct: 241 TRLLRTHLKQKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCEREGNQPITD 300

Query: 370 SRTRSLEQHWHRSQRLNNASLLDFSVAMQLNPEVLGKDRTIQLEKISLRALEE 422
           SRTRSLEQHWHRSQRL+NASLLDFSVAMQLNPEVLGKDRTIQLEKISLRALEE
Sbjct: 301 SRTRSLEQHWHRSQRLSNASLLDFSVAMQLNPEVLGKDRTIQLEKISLRALEE 353

BLAST of CmaCh04G004570 vs. NCBI nr
Match: XP_022942947.1 (uncharacterized protein LOC111447829 [Cucurbita moschata] >XP_022942948.1 uncharacterized protein LOC111447829 [Cucurbita moschata])

HSP 1 Score: 661.4 bits (1705), Expect = 5.3e-186
Identity = 341/353 (96.60%), Positives = 348/353 (98.58%), Query Frame = 0

Query: 70  MIPRKDSSRIDTSELKAMIYQKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREV 129
           MIPRKDSSRIDTSELKAMIYQKLGHQRSEKYFDQLK LLSLKINKREFDKFCIQIIGREV
Sbjct: 1   MIPRKDSSRIDTSELKAMIYQKLGHQRSEKYFDQLKNLLSLKINKREFDKFCIQIIGREV 60

Query: 130 IPLHNRLIRAILRNACVAKIPPVLSSSRKVGANLSVKVVHGYQKSCLQSLNGDAFLSSPR 189
           IPLHNRLIRAILRNACVAKIPPVLSSSRKVGANLSVKVVHGYQKSCLQSLNGDAFLSSPR
Sbjct: 61  IPLHNRLIRAILRNACVAKIPPVLSSSRKVGANLSVKVVHGYQKSCLQSLNGDAFLSSPR 120

Query: 190 KGRSPASRDRKIRDRPSPLGPCGKPQNIEL-ELAFKAQEQQSATELHSLGSRPPVEMASV 249
           KGRSP SRDRKIRDRPSPLGPCGKPQNIEL ELAFKAQEQQSATELHSLGSRPPVEMASV
Sbjct: 121 KGRSPVSRDRKIRDRPSPLGPCGKPQNIELEELAFKAQEQQSATELHSLGSRPPVEMASV 180

Query: 250 EDGEEVEQVAVSPGVQSRSPVTAPFGISMNFIGSGKTLSNVSVGRNYHVTTCQNGGKLPD 309
           EDGEEVEQ+A SPGVQSRSPVTAPFGISMNFIGSGKTLSNV+VGRNY +TTCQNGG+LPD
Sbjct: 181 EDGEEVEQMAASPGVQSRSPVTAPFGISMNFIGSGKTLSNVTVGRNYRLTTCQNGGELPD 240

Query: 310 TRLLRTHLKQKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCEREGNQPITN 369
           TRLLRTHLKQKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCEREGNQPIT+
Sbjct: 241 TRLLRTHLKQKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCEREGNQPITD 300

Query: 370 SRTRSLEQHWHRSQRLNNASLLDFSVAMQLNPEVLGKDRTIQLEKISLRALEE 422
           SRTRSLEQHWHRSQRL+NASLLDFSVAMQLNPE+LGKDRTIQLEKISLRALEE
Sbjct: 301 SRTRSLEQHWHRSQRLSNASLLDFSVAMQLNPELLGKDRTIQLEKISLRALEE 353

BLAST of CmaCh04G004570 vs. NCBI nr
Match: KAG6600292.1 (hypothetical protein SDJN03_05525, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 660.2 bits (1702), Expect = 1.2e-185
Identity = 342/353 (96.88%), Positives = 346/353 (98.02%), Query Frame = 0

Query: 70  MIPRKDSSRIDTSELKAMIYQKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREV 129
           MIPRKDSSRIDTSELKAMIYQKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREV
Sbjct: 1   MIPRKDSSRIDTSELKAMIYQKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREV 60

Query: 130 IPLHNRLIRAILRNACVAKIPPVLSSSRKVGANLSVKVVHGYQKSCLQSLNGDAFLSSPR 189
           IPLHNRLIRAILRNACVAKIPPVLSSSRK GANLSVKVVHGYQKSCLQSLNGDAFLSSPR
Sbjct: 61  IPLHNRLIRAILRNACVAKIPPVLSSSRKGGANLSVKVVHGYQKSCLQSLNGDAFLSSPR 120

Query: 190 KGRSPASRDRKIRDRPSPLGPCGKPQNIEL-ELAFKAQEQQSATELHSLGSRPPVEMASV 249
           KGRSP SRDRKIRDRPSPLGPCGK QNIEL ELAFKAQEQQSATELHSLGSRPPVEMASV
Sbjct: 121 KGRSPVSRDRKIRDRPSPLGPCGKAQNIELEELAFKAQEQQSATELHSLGSRPPVEMASV 180

Query: 250 EDGEEVEQVAVSPGVQSRSPVTAPFGISMNFIGSGKTLSNVSVGRNYHVTTCQNGGKLPD 309
           EDGEEVEQ+A SPGVQSRSPVTAPFGISMNFIGSGKTLSNVSVGRNYHVTTCQNGG+LPD
Sbjct: 181 EDGEEVEQMAASPGVQSRSPVTAPFGISMNFIGSGKTLSNVSVGRNYHVTTCQNGGELPD 240

Query: 310 TRLLRTHLKQKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCEREGNQPITN 369
           TRLLRTHLKQKLE EQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCEREGNQPIT+
Sbjct: 241 TRLLRTHLKQKLEMEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCEREGNQPITD 300

Query: 370 SRTRSLEQHWHRSQRLNNASLLDFSVAMQLNPEVLGKDRTIQLEKISLRALEE 422
           SRTRSLE+HWHRSQRLNNASLLDFS AMQLNPEVLGKDRTIQLEKISLRALEE
Sbjct: 301 SRTRSLERHWHRSQRLNNASLLDFSFAMQLNPEVLGKDRTIQLEKISLRALEE 353

BLAST of CmaCh04G004570 vs. TAIR 10
Match: AT4G33890.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G14850.1); Has 133 Blast hits to 131 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 2; Plants - 129; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 299.3 bits (765), Expect = 4.9e-81
Identity = 174/352 (49.43%), Positives = 244/352 (69.32%), Query Frame = 0

Query: 76  SSRIDTSELKAMIYQKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREVIPLHNR 135
           SSR+DT E+KA+IY+++G+QR+E YF+QL +  +LKI K EFDK CI+ IGR+ I LHNR
Sbjct: 7   SSRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQNIHLHNR 66

Query: 136 LIRAILRNACVAKIPPVLSSSRKVGANLSVKVVHGYQKSCLQSLNGD-AFLSSPRKGRSP 195
           LIR+I++NAC+AK PP +   +K G+ +        + S +Q L+GD AF  S RK RS 
Sbjct: 67  LIRSIIKNACIAKSPPFI---KKGGSFVRFGNGDSKKNSQIQPLHGDSAFSPSTRKCRS- 126

Query: 196 ASRDRKIRDRPSPLGPCGKPQNIELELAFKAQEQQSATELHSLGSRPPVEMASVEDGEEV 255
               RK+RDRPSPLGP GKP ++         + QSATEL SLGSRPPVE+ SVE+GEEV
Sbjct: 127 ----RKLRDRPSPLGPLGKPHSLTTTNEESMSKAQSATELLSLGSRPPVEVVSVEEGEEV 186

Query: 256 EQVA-VSPGVQSRSPVTAPFGISMNFIGSG--KTLSNVSV-GRNYHVTTCQNGGKLPDTR 315
           EQ+A  SP VQSR P+TAP G+SM+       K++SNVS+  R+++  TCQN G+LPDTR
Sbjct: 187 EQIAGGSPSVQSRCPLTAPLGVSMSLRNGATRKSVSNVSMCSRSFNRETCQNNGELPDTR 246

Query: 316 LLRTHLKQKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCEREGNQPITNSR 375
            LR+ L+++LE E + I++D V+LLN+ LDV+++RLIEPCL+ + +RC  +        R
Sbjct: 247 TLRSRLERRLEMEGLKITMDSVSLLNSGLDVFMRRLIEPCLSLANTRCGTD--------R 306

Query: 376 TRSLE-QHWHRSQRLNNASLLDFSVAMQLNPEVLGKDRTIQLEKISLRALEE 422
            R +  Q+  +S+RL+  S+ DF   M+LN E+LG+D  + +EKI  RA ++
Sbjct: 307 VREMNYQYTQQSRRLSYVSMSDFRAGMELNTEILGEDWPMHMEKICSRASDK 342

BLAST of CmaCh04G004570 vs. TAIR 10
Match: AT4G33890.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G14850.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 299.3 bits (765), Expect = 4.9e-81
Identity = 174/352 (49.43%), Positives = 244/352 (69.32%), Query Frame = 0

Query: 76  SSRIDTSELKAMIYQKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREVIPLHNR 135
           SSR+DT E+KA+IY+++G+QR+E YF+QL +  +LKI K EFDK CI+ IGR+ I LHNR
Sbjct: 7   SSRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQNIHLHNR 66

Query: 136 LIRAILRNACVAKIPPVLSSSRKVGANLSVKVVHGYQKSCLQSLNGD-AFLSSPRKGRSP 195
           LIR+I++NAC+AK PP +   +K G+ +        + S +Q L+GD AF  S RK RS 
Sbjct: 67  LIRSIIKNACIAKSPPFI---KKGGSFVRFGNGDSKKNSQIQPLHGDSAFSPSTRKCRS- 126

Query: 196 ASRDRKIRDRPSPLGPCGKPQNIELELAFKAQEQQSATELHSLGSRPPVEMASVEDGEEV 255
               RK+RDRPSPLGP GKP ++         + QSATEL SLGSRPPVE+ SVE+GEEV
Sbjct: 127 ----RKLRDRPSPLGPLGKPHSLTTTNEESMSKAQSATELLSLGSRPPVEVVSVEEGEEV 186

Query: 256 EQVA-VSPGVQSRSPVTAPFGISMNFIGSG--KTLSNVSV-GRNYHVTTCQNGGKLPDTR 315
           EQ+A  SP VQSR P+TAP G+SM+       K++SNVS+  R+++  TCQN G+LPDTR
Sbjct: 187 EQIAGGSPSVQSRCPLTAPLGVSMSLRNGATRKSVSNVSMCSRSFNRETCQNNGELPDTR 246

Query: 316 LLRTHLKQKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCEREGNQPITNSR 375
            LR+ L+++LE E + I++D V+LLN+ LDV+++RLIEPCL+ + +RC  +        R
Sbjct: 247 TLRSRLERRLEMEGLKITMDSVSLLNSGLDVFMRRLIEPCLSLANTRCGTD--------R 306

Query: 376 TRSLE-QHWHRSQRLNNASLLDFSVAMQLNPEVLGKDRTIQLEKISLRALEE 422
            R +  Q+  +S+RL+  S+ DF   M+LN E+LG+D  + +EKI  RA ++
Sbjct: 307 VREMNYQYTQQSRRLSYVSMSDFRAGMELNTEILGEDWPMHMEKICSRASDK 342

BLAST of CmaCh04G004570 vs. TAIR 10
Match: AT2G14850.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G33890.2); Has 140 Blast hits to 132 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 2; Plants - 133; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink). )

HSP 1 Score: 249.6 bits (636), Expect = 4.5e-66
Identity = 156/346 (45.09%), Positives = 205/346 (59.25%), Query Frame = 0

Query: 77  SRIDTSELKAMIYQKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREVIPLHNRL 136
           SR+++ E+KA+IYQK+GHQR++ YFDQL K L+ +I+K EFDK C + +GRE I LHNRL
Sbjct: 8   SRLNSLEIKALIYQKIGHQRADTYFDQLGKFLTSRISKSEFDKLCSKTVGRENISLHNRL 67

Query: 137 IRAILRNACVAKIPPVLSSSRKVGANLSVKVVHGYQKSCLQSLNGD-AFLSSPRKGRSPA 196
           +R+IL+NA VAK PP                   Y K   +SL GD  F  SPRK RS  
Sbjct: 68  VRSILKNASVAKSPP-----------------PRYPK---KSLYGDPVFPPSPRKCRS-- 127

Query: 197 SRDRKIRDRPSPLGPCGKPQNIELELAFKAQEQQSATELHSLGSRPPVEMASVEDGEEVE 256
              RK RDRPSPLGP GKPQ++            +  E  S   R P+E+ SVEDGEEVE
Sbjct: 128 ---RKFRDRPSPLGPLGKPQSL----------TTTNDESMSKAQRLPMEVVSVEDGEEVE 187

Query: 257 QVAVSPGVQSRSPVTAPFGISMNFIGSGKTLSNVSVGRNYHVTTCQNGGKLPDTRLLRTH 316
           Q+  SP VQSRSP+TAP G+S +     K+ +  S     +  TCQ+ G+LPD   LR  
Sbjct: 188 QMTGSPSVQSRSPLTAPLGVSFHL----KSKARFSTYNGINRETCQSSGELPDMITLRAR 247

Query: 317 LKQKLETEQIDISVDGVNLLNNALDVYLKRLIEPCLNFSRSRCEREGNQPITNSRTRSLE 376
           L++KLE E I +S+D  NLLN  L+ Y++RLIEPCL+ +                     
Sbjct: 248 LEKKLEMEGIKLSMDSANLLNRGLNAYMRRLIEPCLSLAS-------------------- 291

Query: 377 QHWHRSQRLNNASLLDFSVAMQLNPEVLGKDRTIQLEKISLRALEE 422
               + + ++N S+LDF  AM++NP VLG++  IQLEKI  RA EE
Sbjct: 308 ---QQKRAVSNVSMLDFHAAMEVNPRVLGEEWPIQLEKICCRASEE 291

BLAST of CmaCh04G004570 vs. TAIR 10
Match: AT2G24530.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G31440.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 152.1 bits (383), Expect = 9.7e-37
Identity = 131/402 (32.59%), Positives = 189/402 (47.01%), Query Frame = 0

Query: 73  RKDSSRIDTSELKAMIYQKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREVIPL 132
           R    RI   ELK  I +K G +RS +YF  L + LS K+ K EFDK C++++GRE + L
Sbjct: 3   RSQDQRISLCELKEHIVKKTGVERSRRYFYYLGRFLSQKLTKSEFDKTCLRLLGRENLSL 62

Query: 133 HNRLIRAILRNACVAKIPPV-----------LSSSRKVGANLSVKVVHGYQKSCLQSLNG 192
           HN+LIR+ILRNA VAK PP               SR  G   S  ++  + +      NG
Sbjct: 63  HNQLIRSILRNATVAKSPPPDHEAGHSTKANAFQSRGDGLEQSGTLIPNHSQHEPVWSNG 122

Query: 193 DAFLSSPRKGRSPASRDRKIRDRPSPLGPCGKPQNIELELAFKAQEQQSATELHSLGSRP 252
                SPRK RS   ++RK RDRPSPLG  GK +++  +   +   + S    +    R 
Sbjct: 123 -VLPISPRKVRS-GMQNRKSRDRPSPLGSNGKVEHMLHQPVCREDNRGSVGMENGDYQRS 182

Query: 253 --------------PVEMASVEDGEEVEQVAVSPGVQ---------SRSPVTAPFGISMN 312
                         PVE   + + E++  V++              S SP+ AP GI   
Sbjct: 183 GRYVADEKDGEFLRPVEKPRIPNKEKIAAVSMRDDQNQEEQARVNLSMSPLIAPLGIPFC 242

Query: 313 FIGSGKTLSNVSVGRNYHVTTCQNGGKLPDTRLLRTHLKQKLETEQID-ISVDGVNLLNN 372
               G +   + V  N  + +C + G LPD  +LR  ++     + ++ +S++    LNN
Sbjct: 243 SASVGGSPRTIPVSTNAELISCYDSGGLPDIEMLRKRMENIAVAQGLEGVSMECAKTLNN 302

Query: 373 ALDVYLKRLIEPCLNFSRSRC-------EREGNQPITNS------RTRSLEQHWHRS--- 422
            LDVYLK+LI  C +   +R        +R G Q   N        T SL+         
Sbjct: 303 MLDVYLKKLINSCFDLVGARSTNGDPGKQRIGKQQSQNKIVNGVWPTNSLKIQTPNGSSD 362

BLAST of CmaCh04G004570 vs. TAIR 10
Match: AT4G31440.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24530.1); Has 210 Blast hits to 209 proteins in 55 species: Archae - 0; Bacteria - 72; Metazoa - 2; Fungi - 6; Plants - 128; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 150.6 bits (379), Expect = 2.8e-36
Identity = 127/386 (32.90%), Positives = 189/386 (48.96%), Query Frame = 0

Query: 73  RKDSSRIDTSELKAMIYQKLGHQRSEKYFDQLKKLLSLKINKREFDKFCIQIIGREVIPL 132
           R    RID +ELK  I +K+G +RS +YF  L + LS K+ K EFDK C +++GRE + L
Sbjct: 3   RLQDPRIDLAELKVHIVKKVGVERSTRYFYYLGRFLSQKLTKSEFDKSCFRLLGRENLSL 62

Query: 133 HNRLIRAILRNACVAKIPPVLSSSRKVGANLSVKVVHGYQKSCLQSLN-----GDAFLSS 192
           HN+LIR+ILRNA +AK PP +  S   G +L +    G ++S  +SLN      D  LS+
Sbjct: 63  HNKLIRSILRNASLAKSPPSVHQSGHPGKSLVLGKEDGPEES--RSLNPDHIRNDLALSN 122

Query: 193 P--RKGRSPASRDRKIRDRPSPLGPCGKPQN-----------IELELAFKAQEQQSATEL 252
               K R     DR IRD+P PLG  GK               E + AF    +Q A   
Sbjct: 123 GVLAKVRPGTCDDRTIRDKPCPLGSNGKVLGPFAYSRPGRYPDERDSAFLCPAEQKAVS- 182

Query: 253 HSLGSRPPVEMASVEDGEEVEQVAVSPGVQSRSPVTAPFGISMNFIGSGKTLSNVSVGRN 312
              G        S +D  +V        + S  PV AP GI       G     V V  +
Sbjct: 183 ---GKDQVAAPISRDDEAQVR-------ILSTPPVMAPLGIPFCSASVGGDRRTVPVSTS 242

Query: 313 YHVTTCQNGGKLPDTRLLRTHLKQKLETEQI-DISVDGVNLLNNALDVYLKRLIEPCLNF 372
               +C + G L DT +LR  ++    T+ +  +S +   +LNN LD+YLK+L++ C++ 
Sbjct: 243 AAAISCYDSGGLSDTEMLRKRMENIAVTQGLGGVSAECSIVLNNMLDLYLKKLMKSCVDL 302

Query: 373 SRSRC------------EREGNQPITNSRT------RSLEQHWHRSQRLNNASLLDFSVA 422
           + +R             ++  ++ +   RT      ++  Q    ++  ++ SLLDF VA
Sbjct: 303 AGARSMNGTPGKHSLEKQQSRDELVNGVRTNNSFHIQTSNQPSDITREQHSVSLLDFRVA 362

The following BLAST results are available for this feature:

vs. ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A6J1JSH5      3.7e-193  100.00        uncharacterized protein LOC111488516 OS=Cucurbita maxima OX=3661 GN=LOC111488516...
A0A6J1FW12      2.6e-186  96.60         uncharacterized protein LOC111447829 OS=Cucurbita moschata OX=3662 GN=LOC1114478...
A0A0A0KWF9      4.7e-172  77.06         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G000960 PE=4 SV=1
A0A5A7TT05      4.8e-169  85.22         Transcriptional coactivator Hfi1/Transcriptional adapter 1 OS=Cucumis melo var. ...
A0A1S3BYK6      1.8e-163  85.99         uncharacterized protein LOC103494799 OS=Cucumis melo OX=3656 GN=LOC103494799 PE=...

vs. NCBI nr
Match Name      E-value   Identity (%)  Description
KAG7030952.1    1.1e-212  93.63         hypothetical protein SDJN02_04989 [Cucurbita argyrosperma subsp. argyrosperma]
XP_022992051.1  7.6e-193  100.00        uncharacterized protein LOC111488516 [Cucurbita maxima] >XP_022992060.1 uncharac...
XP_023525283.1  8.1e-187  96.88         uncharacterized protein LOC111788929 [Cucurbita pepo subsp. pepo] >XP_023525290....
XP_022942947.1  5.3e-186  96.60         uncharacterized protein LOC111447829 [Cucurbita moschata] >XP_022942948.1 unchar...
KAG6600292.1    1.2e-185  96.88         hypothetical protein SDJN03_05525, partial [Cucurbita argyrosperma subsp. sorori...

vs. TAIR 10
Match Name      E-value   Identity (%)  Description
AT4G33890.1     4.9e-81   49.43         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G33890.2     4.9e-81   49.43         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G14850.1     4.5e-66   45.09         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G24530.1     9.7e-37   32.59         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G31440.1     2.8e-36   32.90         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...

InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term   IPR Description                                             Source       Source Term     Source Description                                     Alignment
IPR024738  Transcriptional coactivator Hfi1/Transcriptional adapter 1  PFAM         PF12767         SAGA-Tad1                                              coord: 76..352; e-value: 2.0E-50; score: 171.7
IPR024738  Transcriptional coactivator Hfi1/Transcriptional adapter 1  PANTHER      PTHR21277       TRANSCRIPTIONAL ADAPTER 1                              coord: 73..421
None       No IPR available                                            MOBIDB_LITE  mobidb-lite     disorder_prediction                                    coord: 192..206
None       No IPR available                                            MOBIDB_LITE  mobidb-lite     disorder_prediction                                    coord: 230..252
None       No IPR available                                            MOBIDB_LITE  mobidb-lite     disorder_prediction                                    coord: 185..211
None       No IPR available                                            PANTHER      PTHR21277:SF29  TRANSCRIPTIONAL REGULATOR OF RNA POLII, SAGA, SUBUNIT  coord: 73..421
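
The three mobidb-lite disorder predictions listed above overlap, so counting disordered residues requires merging the coordinate ranges first. The sketch below shows one way to do that. The helper `merge_intervals` is hypothetical, and the coordinates are assumed to be 1-based inclusive residue positions.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive (start, end) ranges."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps (or touches) the previous range: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

# disorder_prediction coordinates from the InterPro table above
disorder = [(192, 206), (230, 252), (185, 211)]
merged = merge_intervals(disorder)

# Assumption: inclusive coordinates, so each range covers end - start + 1 residues.
covered = sum(end - start + 1 for start, end in merged)
print(merged, covered)
```

With these inputs the range 192..206 is absorbed into 185..211, leaving two disordered segments that together cover 50 residues.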

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh04G004570.1  CmaCh04G004570.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0070461      SAGA-type complex