CmaCh12G002920 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh12G002920
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: SAGA-Tad1 domain-containing protein
Location: Cma_Chr12: 1502796 .. 1503905 (+)
RNA-Seq Expression: CmaCh12G002920
Synteny: CmaCh12G002920
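The location field can be turned into a feature length with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is the usual genome-browser convention but is not stated on this page; `parse_location` and `span_length` are hypothetical helper names used only for illustration.

```python
import re

def parse_location(text):
    """Split 'SEQID: START .. END (STRAND)' into its parts.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the source).
    """
    m = re.fullmatch(r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length in bases of an inclusive coordinate span."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cma_Chr12: 1502796 .. 1503905 (+)")
length = span_length(start, end)
print(seqid, strand, length)   # span length in nucleotides
print(length // 3 - 1)         # codons minus the stop codon, if the whole span is coding
```

The span works out to 1110 nucleotides, that is 370 codons, or 369 residues once the stop codon is dropped. That agrees with the 369-residue query length reported in the BLAST alignments, but only because the gene, mRNA, and CDS sequences listed for this feature appear to be identical (no intron or UTR in the model).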
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCGACCTCAGCAGAGCTTGAGAATTGGCTTGGGTGAATTGAAATCTCAGATAGTGAAGAAGCTCGGAACCGATCGATCAAAACGTTACTTCTTTTACTTGAATAGGTTCTTGAGTCAAAAGCTGAGTAAGAATGAGTTTGATAAGCTATGTTGTCGTGTTCTTGGGAGGGAAAATCTTTGGCTGCATAATCAATTGATACACTCAATTTTGAAGAATGCTTTGCAAGCTAAGGCTGCACCACCAATACCTACTTCAGCTCAAAGTATTCCCATTTGGTCTAATGGAGGTTTTCCATTGTCTCCAAGAAAGAGCCGGTCCGGGATTCGTGACCGTAAACTCAAGGACAGACCGAATGGGATGGTTGAATGCATCTCGCATCAATCAGCAGGCAAGGACGATGGAAGCTGTAAAATCACGATGGATAATGACGTTGCAACTCTGTGTGACTATCAGAGATCAGTGCAGCATTTGCAGGGAGTTGCTGGATTACCTGAAAACGATATCGAGGCTAGTGTTCAGCAACCAGCAGGACATCATGTCTTCCCGGGACAGTCGAATCACTTGAGTTTACTTCGGAGTCGATTACTTGCACCTCTTGGGATTCCTTTTTGCTCAGCTAGTATAGGCGGGGCTCGCAAAGCGAGACCTGTGGATTGTGGGGGCGATTTTAGCATTAGTGATATTGGTCGTTTGTTGGATACCGAGTCGCTGGGACGACGTATGGAACAAATAGCTGCAGGACAGGGCTTAGGCAGTGTTTCTGGAGATTGTGCTAGTATTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGTTAATTCGGTCTTGTGTTGACTTGGGTGGATCATCATGGCCTGCATATGAGCCTGAGAAACCTCTTGCGCATAAGCAGCAGATTCAGGGGAGGGTTATCAATGGCCTGTTGCCTAATAATCAATTACATGGACGACATAGCAATGTCAATGGTGAAGCTACGTACAAGCACAGATTACAATGCTCGATATCGTTGCTCGACTTCAAACTAGCAATGGAGCTTAACCCGAAACAACTTGGGGAAGACTGGCCTTTGCTAATGGAGAAAATTTGTCTGCGTGCATCCAACAAATGA

mRNA sequence

ATGCGACCTCAGCAGAGCTTGAGAATTGGCTTGGGTGAATTGAAATCTCAGATAGTGAAGAAGCTCGGAACCGATCGATCAAAACGTTACTTCTTTTACTTGAATAGGTTCTTGAGTCAAAAGCTGAGTAAGAATGAGTTTGATAAGCTATGTTGTCGTGTTCTTGGGAGGGAAAATCTTTGGCTGCATAATCAATTGATACACTCAATTTTGAAGAATGCTTTGCAAGCTAAGGCTGCACCACCAATACCTACTTCAGCTCAAAGTATTCCCATTTGGTCTAATGGAGGTTTTCCATTGTCTCCAAGAAAGAGCCGGTCCGGGATTCGTGACCGTAAACTCAAGGACAGACCGAATGGGATGGTTGAATGCATCTCGCATCAATCAGCAGGCAAGGACGATGGAAGCTGTAAAATCACGATGGATAATGACGTTGCAACTCTGTGTGACTATCAGAGATCAGTGCAGCATTTGCAGGGAGTTGCTGGATTACCTGAAAACGATATCGAGGCTAGTGTTCAGCAACCAGCAGGACATCATGTCTTCCCGGGACAGTCGAATCACTTGAGTTTACTTCGGAGTCGATTACTTGCACCTCTTGGGATTCCTTTTTGCTCAGCTAGTATAGGCGGGGCTCGCAAAGCGAGACCTGTGGATTGTGGGGGCGATTTTAGCATTAGTGATATTGGTCGTTTGTTGGATACCGAGTCGCTGGGACGACGTATGGAACAAATAGCTGCAGGACAGGGCTTAGGCAGTGTTTCTGGAGATTGTGCTAGTATTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGTTAATTCGGTCTTGTGTTGACTTGGGTGGATCATCATGGCCTGCATATGAGCCTGAGAAACCTCTTGCGCATAAGCAGCAGATTCAGGGGAGGGTTATCAATGGCCTGTTGCCTAATAATCAATTACATGGACGACATAGCAATGTCAATGGTGAAGCTACGTACAAGCACAGATTACAATGCTCGATATCGTTGCTCGACTTCAAACTAGCAATGGAGCTTAACCCGAAACAACTTGGGGAAGACTGGCCTTTGCTAATGGAGAAAATTTGTCTGCGTGCATCCAACAAATGA

Coding sequence (CDS)

ATGCGACCTCAGCAGAGCTTGAGAATTGGCTTGGGTGAATTGAAATCTCAGATAGTGAAGAAGCTCGGAACCGATCGATCAAAACGTTACTTCTTTTACTTGAATAGGTTCTTGAGTCAAAAGCTGAGTAAGAATGAGTTTGATAAGCTATGTTGTCGTGTTCTTGGGAGGGAAAATCTTTGGCTGCATAATCAATTGATACACTCAATTTTGAAGAATGCTTTGCAAGCTAAGGCTGCACCACCAATACCTACTTCAGCTCAAAGTATTCCCATTTGGTCTAATGGAGGTTTTCCATTGTCTCCAAGAAAGAGCCGGTCCGGGATTCGTGACCGTAAACTCAAGGACAGACCGAATGGGATGGTTGAATGCATCTCGCATCAATCAGCAGGCAAGGACGATGGAAGCTGTAAAATCACGATGGATAATGACGTTGCAACTCTGTGTGACTATCAGAGATCAGTGCAGCATTTGCAGGGAGTTGCTGGATTACCTGAAAACGATATCGAGGCTAGTGTTCAGCAACCAGCAGGACATCATGTCTTCCCGGGACAGTCGAATCACTTGAGTTTACTTCGGAGTCGATTACTTGCACCTCTTGGGATTCCTTTTTGCTCAGCTAGTATAGGCGGGGCTCGCAAAGCGAGACCTGTGGATTGTGGGGGCGATTTTAGCATTAGTGATATTGGTCGTTTGTTGGATACCGAGTCGCTGGGACGACGTATGGAACAAATAGCTGCAGGACAGGGCTTAGGCAGTGTTTCTGGAGATTGTGCTAGTATTTTGAATAAGGTGTTGGATGTATATTTGAAGCAGTTAATTCGGTCTTGTGTTGACTTGGGTGGATCATCATGGCCTGCATATGAGCCTGAGAAACCTCTTGCGCATAAGCAGCAGATTCAGGGGAGGGTTATCAATGGCCTGTTGCCTAATAATCAATTACATGGACGACATAGCAATGTCAATGGTGAAGCTACGTACAAGCACAGATTACAATGCTCGATATCGTTGCTCGACTTCAAACTAGCAATGGAGCTTAACCCGAAACAACTTGGGGAAGACTGGCCTTTGCTAATGGAGAAAATTTGTCTGCGTGCATCCAACAAATGA

Protein sequence

MRPQQSLRIGLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNALQAKAAPPIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRSVQHLQGVAGLPENDIEASVQQPAGHHVFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLGGSSWPAYEPEKPLAHKQQIQGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASNK
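The relationship between the coding sequence and the protein sequence can be checked by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code; the table is built by hand rather than taken from any library, and `translate` is a hypothetical helper name.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":          # stop codon terminates translation
            break
        residues.append(aa)
    return "".join(residues)

# First six codons and last three codons of the CDS listed above.
print(translate("ATGCGACCTCAGCAGAGC"))
print(translate("AACAAATGA"))
```

The first call yields `MRPQQS` and the second `NK`, matching the start and end of the protein sequence shown above. Passing the full CDS string to `translate` should reproduce the complete 369-residue protein.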
Homology
BLAST of CmaCh12G002920 vs. ExPASy TrEMBL
Match: A0A6J1KJD6 (uncharacterized protein LOC111496247 OS=Cucurbita maxima OX=3661 GN=LOC111496247 PE=4 SV=1)

HSP 1 Score: 753.1 bits (1943), Expect = 5.7e-214
Identity = 369/369 (100.00%), Positives = 369/369 (100.00%), Query Frame = 0

Query: 1   MRPQQSLRIGLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60
           MRPQQSLRIGLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL
Sbjct: 1   MRPQQSLRIGLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60

Query: 61  WLHNQLIHSILKNALQAKAAPPIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNG 120
           WLHNQLIHSILKNALQAKAAPPIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNG
Sbjct: 61  WLHNQLIHSILKNALQAKAAPPIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNG 120

Query: 121 MVECISHQSAGKDDGSCKITMDNDVATLCDYQRSVQHLQGVAGLPENDIEASVQQPAGHH 180
           MVECISHQSAGKDDGSCKITMDNDVATLCDYQRSVQHLQGVAGLPENDIEASVQQPAGHH
Sbjct: 121 MVECISHQSAGKDDGSCKITMDNDVATLCDYQRSVQHLQGVAGLPENDIEASVQQPAGHH 180

Query: 181 VFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGR 240
           VFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGR
Sbjct: 181 VFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGR 240

Query: 241 RMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLGGSSWPAYEPEKPLAHKQQI 300
           RMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLGGSSWPAYEPEKPLAHKQQI
Sbjct: 241 RMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLGGSSWPAYEPEKPLAHKQQI 300

Query: 301 QGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLME 360
           QGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLME
Sbjct: 301 QGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLME 360

Query: 361 KICLRASNK 370
           KICLRASNK
Sbjct: 361 KICLRASNK 369

BLAST of CmaCh12G002920 vs. ExPASy TrEMBL
Match: A0A6J1GHT2 (uncharacterized protein LOC111454310 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111454310 PE=4 SV=1)

HSP 1 Score: 721.5 bits (1861), Expect = 1.8e-204
Identity = 358/369 (97.02%), Positives = 361/369 (97.83%), Query Frame = 0

Query: 1   MRPQQSLRIGLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60
           MRPQQSLRI LGELKSQIVK LGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL
Sbjct: 1   MRPQQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60

Query: 61  WLHNQLIHSILKNALQAKAAPPIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNG 120
           WLHNQLIHSILKNA QAKAAPPIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNG
Sbjct: 61  WLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNG 120

Query: 121 MVECISHQSAGKDDGSCKITMDNDVATLCDYQRSVQHLQGVAGLPENDIEASVQQPAGHH 180
           MVECISHQSAGKDDGSCKITMDNDVATLCDYQR VQHLQGVA L ENDIEASVQQPAG+H
Sbjct: 121 MVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAEL-ENDIEASVQQPAGNH 180

Query: 181 VFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGR 240
           VFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGR
Sbjct: 181 VFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGR 240

Query: 241 RMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLGGSSWPAYEPEKPLAHKQQI 300
           RMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDL GSSWPAYEPEKPLA+KQQI
Sbjct: 241 RMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQI 300

Query: 301 QGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLME 360
           QGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLME
Sbjct: 301 QGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLME 360

Query: 361 KICLRASNK 370
           KICLRAS +
Sbjct: 361 KICLRASEE 368
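The identity and positive counts in each HSP header can be recovered from the middle line of the alignment, where a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. A small sketch of that bookkeeping follows; the function names are hypothetical, and the aligned length is passed in separately because trailing blanks in a midline are easily lost when text is copied.

```python
def alignment_stats(midline, aligned_length=None):
    """Count identities and positives from a BLAST alignment midline.

    Letters are identical residues; '+' marks a positive-scoring substitution.
    Returns (identities, positives, aligned_length).
    """
    if aligned_length is None:
        aligned_length = len(midline)
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, aligned_length

def percent(count, total):
    """Percentage rounded to two decimals, as in the HSP header lines."""
    return round(100 * count / total, 2)

# Final block of the alignment above: KICLRASNK vs KICLRASEE.
print(alignment_stats("KICLRAS +"))

# Whole-HSP totals quoted in the header for this hit.
print(percent(358, 369), percent(361, 369))
```

For the final block this gives 7 identities and 8 positives over 9 aligned positions; the whole-HSP totals of 358 and 361 out of 369 reproduce the 97.02% and 97.83% shown in the header.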

BLAST of CmaCh12G002920 vs. ExPASy TrEMBL
Match: A0A6J1GHW6 (uncharacterized protein LOC111454310 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111454310 PE=4 SV=1)

HSP 1 Score: 688.0 bits (1774), Expect = 2.2e-194
Identity = 339/348 (97.41%), Positives = 342/348 (98.28%), Query Frame = 0

Query: 22  LGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNALQAKAAP 81
           LGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNA QAKAAP
Sbjct: 2   LGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAP 61

Query: 82  PIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITM 141
           PIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITM
Sbjct: 62  PIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITM 121

Query: 142 DNDVATLCDYQRSVQHLQGVAGLPENDIEASVQQPAGHHVFPGQSNHLSLLRSRLLAPLG 201
           DNDVATLCDYQR VQHLQGVA L ENDIEASVQQPAG+HVFPGQSNHLSLLRSRLLAPLG
Sbjct: 122 DNDVATLCDYQRPVQHLQGVAEL-ENDIEASVQQPAGNHVFPGQSNHLSLLRSRLLAPLG 181

Query: 202 IPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASI 261
           IPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASI
Sbjct: 182 IPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASI 241

Query: 262 LNKVLDVYLKQLIRSCVDLGGSSWPAYEPEKPLAHKQQIQGRVINGLLPNNQLHGRHSNV 321
           LNKVLDVYLKQLIRSCVDL GSSWPAYEPEKPLA+KQQIQGRVINGLLPNNQLHGRHSNV
Sbjct: 242 LNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQIQGRVINGLLPNNQLHGRHSNV 301

Query: 322 NGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASNK 370
           NGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRAS +
Sbjct: 302 NGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEE 348

BLAST of CmaCh12G002920 vs. ExPASy TrEMBL
Match: A0A6J1HF85 (uncharacterized protein LOC111463000 OS=Cucurbita moschata OX=3662 GN=LOC111463000 PE=4 SV=1)

HSP 1 Score: 571.2 bits (1471), Expect = 3.0e-159
Identity = 302/410 (73.66%), Positives = 321/410 (78.29%), Query Frame = 0

Query: 1   MRPQQSLRIGLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60
           M+PQQSLRI L ELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL
Sbjct: 1   MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60

Query: 61  WLHNQLIHSILKNALQAKAAPPI------------------------------PTSAQSI 120
           WLHNQLI SILKNA QAKAAPPI                              PTS Q I
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISPVIEDGNEDGGAVFPTSTQGI 120

Query: 121 PIWSNGGFPLSPRKSRSGIRDRKLKDR-----PNGMVECISHQSAGKDDGSCKITMDNDV 180
           PIWSN GFP+SPRK RSGIRDRKLKDR     PN  VECIS QSA K+DGSC+I MDN  
Sbjct: 121 PIWSNEGFPVSPRKCRSGIRDRKLKDRPSLLAPNLKVECISPQSACKEDGSCRIMMDNGN 180

Query: 181 ATLCDYQRSVQHLQGVAGLPENDIEASVQQPAGHHVF---------PGQSNHLSLLRSRL 240
           AT CDYQR VQHLQGV  LPEN+IEA VQ+P+G  V            QSN  SLLRSRL
Sbjct: 181 ATSCDYQRPVQHLQGVFELPENNIEARVQRPSGKQVLQMQVEDREEARQSNRSSLLRSRL 240

Query: 241 LAPLGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSG 300
           LAPLGIPFCSASIGGA K RPVDCGG+FS SD+G LLDTESL RRMEQIAA QGLGSVS 
Sbjct: 241 LAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQGLGSVSA 300

Query: 301 DCASILNKVLDVYLKQLIRSCVDLGGSSWPAYEPEKPLAHKQQIQGRVINGLLPNNQLHG 360
           DCA+ILNKVLDVYLKQLIRSCVDL G +WPA+EPEKPLAH QQIQG+VING+LPNNQLH 
Sbjct: 301 DCANILNKVLDVYLKQLIRSCVDLVG-AWPAFEPEKPLAHNQQIQGKVINGMLPNNQLHR 360

Query: 361 RHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRA 367
            HSN NGE  ++ RL CSISLLDFK+AMELNPKQLGEDWPLL+EKI +RA
Sbjct: 361 LHSNGNGEVVHERRLHCSISLLDFKVAMELNPKQLGEDWPLLLEKISMRA 409

BLAST of CmaCh12G002920 vs. ExPASy TrEMBL
Match: A0A6J1K7Q1 (uncharacterized protein LOC111492414 OS=Cucurbita maxima OX=3661 GN=LOC111492414 PE=4 SV=1)

HSP 1 Score: 561.2 bits (1445), Expect = 3.2e-156
Identity = 299/415 (72.05%), Positives = 317/415 (76.39%), Query Frame = 0

Query: 1   MRPQQSLRIGLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60
           M+PQQSLRI L ELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL
Sbjct: 1   MQPQQSLRIDLCELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60

Query: 61  WLHNQLIHSILKNALQAKAAPPIP------------------------------TSAQSI 120
           WLHNQLI SILKNA QAKAAPPIP                              TS Q I
Sbjct: 61  WLHNQLIQSILKNACQAKAAPPIPAAGYPKTSTQAAKISPVIEDGNEDGGAVFATSTQGI 120

Query: 121 PIWSNGGFPLSPRKSRSGIRDRKLKDR-----PNGMVECISHQSAGKDDGSCKITMDNDV 180
           PIWSN GF +SPRK RSGIRDRKLKDR     PN  VECIS QSA K+DGSC+I MDN  
Sbjct: 121 PIWSNEGFSMSPRKCRSGIRDRKLKDRPSLLAPNLKVECISAQSACKEDGSCRIMMDNGN 180

Query: 181 ATLCDYQRSVQHLQGVAGLPENDIEASVQQPAGHHVF--------------PGQSNHLSL 240
           AT CDYQR VQHLQGV  LPEN+IEA VQ+P+G  V                 QSN  SL
Sbjct: 181 ATSCDYQRPVQHLQGVFELPENNIEARVQRPSGKQVLQMQVEGTKVEDREEARQSNRSSL 240

Query: 241 LRSRLLAPLGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGL 300
           LRSRLLAPLGIPFCSASIGGA K RPVDCGG+FS SD+G LLDTESL RRMEQIAA QGL
Sbjct: 241 LRSRLLAPLGIPFCSASIGGAHKTRPVDCGGNFSFSDMGHLLDTESLRRRMEQIAAVQGL 300

Query: 301 GSVSGDCASILNKVLDVYLKQLIRSCVDLGGSSWPAYEPEKPLAHKQQIQGRVINGLLPN 360
           GSVS DCA+ILNKVLDVYLKQLIRSCVDL G  WP +EPEKPLAH QQIQG+VING+LPN
Sbjct: 301 GSVSADCANILNKVLDVYLKQLIRSCVDLVG-PWPVFEPEKPLAHNQQIQGKVINGMLPN 360

Query: 361 NQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRA 367
           NQLH  HSN N E  ++ RL CSISLLDFK+AMELNPKQLGEDWPLL+EKI +RA
Sbjct: 361 NQLHRLHSNGNREVVHERRLHCSISLLDFKVAMELNPKQLGEDWPLLLEKISMRA 414

BLAST of CmaCh12G002920 vs. NCBI nr
Match: XP_023002392.1 (uncharacterized protein LOC111496247 [Cucurbita maxima])

HSP 1 Score: 753.1 bits (1943), Expect = 1.2e-213
Identity = 369/369 (100.00%), Positives = 369/369 (100.00%), Query Frame = 0

Query: 1   MRPQQSLRIGLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60
           MRPQQSLRIGLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL
Sbjct: 1   MRPQQSLRIGLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60

Query: 61  WLHNQLIHSILKNALQAKAAPPIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNG 120
           WLHNQLIHSILKNALQAKAAPPIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNG
Sbjct: 61  WLHNQLIHSILKNALQAKAAPPIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNG 120

Query: 121 MVECISHQSAGKDDGSCKITMDNDVATLCDYQRSVQHLQGVAGLPENDIEASVQQPAGHH 180
           MVECISHQSAGKDDGSCKITMDNDVATLCDYQRSVQHLQGVAGLPENDIEASVQQPAGHH
Sbjct: 121 MVECISHQSAGKDDGSCKITMDNDVATLCDYQRSVQHLQGVAGLPENDIEASVQQPAGHH 180

Query: 181 VFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGR 240
           VFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGR
Sbjct: 181 VFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGR 240

Query: 241 RMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLGGSSWPAYEPEKPLAHKQQI 300
           RMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLGGSSWPAYEPEKPLAHKQQI
Sbjct: 241 RMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLGGSSWPAYEPEKPLAHKQQI 300

Query: 301 QGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLME 360
           QGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLME
Sbjct: 301 QGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLME 360

Query: 361 KICLRASNK 370
           KICLRASNK
Sbjct: 361 KICLRASNK 369

BLAST of CmaCh12G002920 vs. NCBI nr
Match: XP_023537842.1 (uncharacterized protein LOC111798750 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 725.7 bits (1872), Expect = 2.0e-205
Identity = 357/369 (96.75%), Positives = 361/369 (97.83%), Query Frame = 0

Query: 1   MRPQQSLRIGLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60
           MRPQQSLRI LGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL
Sbjct: 1   MRPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60

Query: 61  WLHNQLIHSILKNALQAKAAPPIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNG 120
           WLHNQLIHSILKNA QAKAAPPIPTSAQSIPIW NGGFPLSPRKSRSGIRDRKLKDRPNG
Sbjct: 61  WLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWFNGGFPLSPRKSRSGIRDRKLKDRPNG 120

Query: 121 MVECISHQSAGKDDGSCKITMDNDVATLCDYQRSVQHLQGVAGLPENDIEASVQQPAGHH 180
           MVECISHQSAGK+DGSCKITMDNDVATLCDYQR VQHLQGVA LPENDIEASVQQPAG+H
Sbjct: 121 MVECISHQSAGKNDGSCKITMDNDVATLCDYQRPVQHLQGVAELPENDIEASVQQPAGNH 180

Query: 181 VFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGR 240
           +FPGQSN LSLLRSRLLAPLGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGR
Sbjct: 181 IFPGQSNRLSLLRSRLLAPLGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGR 240

Query: 241 RMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLGGSSWPAYEPEKPLAHKQQI 300
           RMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDL GSSWPAYEPEKPLAHKQQI
Sbjct: 241 RMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAHKQQI 300

Query: 301 QGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLME 360
           QGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLME
Sbjct: 301 QGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLME 360

Query: 361 KICLRASNK 370
           KICLRAS +
Sbjct: 361 KICLRASEE 369

BLAST of CmaCh12G002920 vs. NCBI nr
Match: XP_022951516.1 (uncharacterized protein LOC111454310 isoform X1 [Cucurbita moschata])

HSP 1 Score: 721.5 bits (1861), Expect = 3.8e-204
Identity = 358/369 (97.02%), Positives = 361/369 (97.83%), Query Frame = 0

Query: 1   MRPQQSLRIGLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60
           MRPQQSLRI LGELKSQIVK LGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL
Sbjct: 1   MRPQQSLRIDLGELKSQIVKMLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60

Query: 61  WLHNQLIHSILKNALQAKAAPPIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNG 120
           WLHNQLIHSILKNA QAKAAPPIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNG
Sbjct: 61  WLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNG 120

Query: 121 MVECISHQSAGKDDGSCKITMDNDVATLCDYQRSVQHLQGVAGLPENDIEASVQQPAGHH 180
           MVECISHQSAGKDDGSCKITMDNDVATLCDYQR VQHLQGVA L ENDIEASVQQPAG+H
Sbjct: 121 MVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAEL-ENDIEASVQQPAGNH 180

Query: 181 VFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGR 240
           VFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGR
Sbjct: 181 VFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGR 240

Query: 241 RMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLGGSSWPAYEPEKPLAHKQQI 300
           RMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDL GSSWPAYEPEKPLA+KQQI
Sbjct: 241 RMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQI 300

Query: 301 QGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLME 360
           QGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLME
Sbjct: 301 QGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLME 360

Query: 361 KICLRASNK 370
           KICLRAS +
Sbjct: 361 KICLRASEE 368

BLAST of CmaCh12G002920 vs. NCBI nr
Match: KAG6585423.1 (hypothetical protein SDJN03_18156, partial [Cucurbita argyrosperma subsp. sororia] >KAG7020344.1 hypothetical protein SDJN02_17028, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 717.2 bits (1850), Expect = 7.1e-203
Identity = 356/370 (96.22%), Positives = 359/370 (97.03%), Query Frame = 0

Query: 1   MRPQQSLRIGLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60
           MRPQQSLRI LGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL
Sbjct: 1   MRPQQSLRIDLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60

Query: 61  WLHNQLIHSILKNALQAKAAPPIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNG 120
           WLHNQLIHSILKNA QAKAAPPIPTSAQSIPIWSNG FPLSPRKSRSGI DRKLKDRPNG
Sbjct: 61  WLHNQLIHSILKNACQAKAAPPIPTSAQSIPIWSNGDFPLSPRKSRSGIHDRKLKDRPNG 120

Query: 121 MVECISHQSAGKDDGSCKITMDNDVATLCDYQRSVQHLQGVAGLPENDIEASVQQPA-GH 180
           MVECISHQSAGKDDGSCKITMDNDVATLCDYQR VQHLQGVA LPENDIEASVQQPA G+
Sbjct: 121 MVECISHQSAGKDDGSCKITMDNDVATLCDYQRPVQHLQGVAELPENDIEASVQQPAGGN 180

Query: 181 HVFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLG 240
           HVFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLG
Sbjct: 181 HVFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLG 240

Query: 241 RRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLGGSSWPAYEPEKPLAHKQQ 300
           RRMEQIAAGQGLGSVSGDCASILNKVLD YLKQLIRSCVDL GSSWPAYEPEKPLA+KQQ
Sbjct: 241 RRMEQIAAGQGLGSVSGDCASILNKVLDAYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQ 300

Query: 301 IQGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLM 360
           IQGRVINGLLPNNQLH RHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLM
Sbjct: 301 IQGRVINGLLPNNQLHRRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLM 360

Query: 361 EKICLRASNK 370
           EKICLRAS +
Sbjct: 361 EKICLRASEE 370

BLAST of CmaCh12G002920 vs. NCBI nr
Match: XP_022951518.1 (uncharacterized protein LOC111454310 isoform X2 [Cucurbita moschata])

HSP 1 Score: 688.0 bits (1774), Expect = 4.6e-194
Identity = 339/348 (97.41%), Positives = 342/348 (98.28%), Query Frame = 0

Query: 22  LGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNALQAKAAP 81
           LGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNA QAKAAP
Sbjct: 2   LGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLIHSILKNACQAKAAP 61

Query: 82  PIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITM 141
           PIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITM
Sbjct: 62  PIPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNGMVECISHQSAGKDDGSCKITM 121

Query: 142 DNDVATLCDYQRSVQHLQGVAGLPENDIEASVQQPAGHHVFPGQSNHLSLLRSRLLAPLG 201
           DNDVATLCDYQR VQHLQGVA L ENDIEASVQQPAG+HVFPGQSNHLSLLRSRLLAPLG
Sbjct: 122 DNDVATLCDYQRPVQHLQGVAEL-ENDIEASVQQPAGNHVFPGQSNHLSLLRSRLLAPLG 181

Query: 202 IPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASI 261
           IPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASI
Sbjct: 182 IPFCSASIGGARKARPVDCGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASI 241

Query: 262 LNKVLDVYLKQLIRSCVDLGGSSWPAYEPEKPLAHKQQIQGRVINGLLPNNQLHGRHSNV 321
           LNKVLDVYLKQLIRSCVDL GSSWPAYEPEKPLA+KQQIQGRVINGLLPNNQLHGRHSNV
Sbjct: 242 LNKVLDVYLKQLIRSCVDLVGSSWPAYEPEKPLAYKQQIQGRVINGLLPNNQLHGRHSNV 301

Query: 322 NGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASNK 370
           NGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRAS +
Sbjct: 302 NGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRASEE 348

BLAST of CmaCh12G002920 vs. TAIR 10
Match: AT2G24530.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G31440.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 304.7 bits (779), Expect = 1.0e-82
Identity = 184/410 (44.88%), Positives = 239/410 (58.29%), Query Frame = 0

Query: 1   MRPQQSLRIGLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60
           M+  Q  RI L ELK  IVKK G +RS+RYF+YL RFLSQKL+K+EFDK C R+LGRENL
Sbjct: 1   MQRSQDQRISLCELKEHIVKKTGVERSRRYFYYLGRFLSQKLTKSEFDKTCLRLLGRENL 60

Query: 61  WLHNQLIHSILKNALQAKAAPP--------------------------IPTSAQSIPIWS 120
            LHNQLI SIL+NA  AK+ PP                          IP  +Q  P+WS
Sbjct: 61  SLHNQLIRSILRNATVAKSPPPDHEAGHSTKANAFQSRGDGLEQSGTLIPNHSQHEPVWS 120

Query: 121 NGGFPLSPRKSRSGIRDRKLKDRP-----NGMVECISHQSAGKDDGSCKITMDNDVATLC 180
           NG  P+SPRK RSG+++RK +DRP     NG VE + HQ   ++D    + M+N      
Sbjct: 121 NGVLPISPRKVRSGMQNRKSRDRPSPLGSNGKVEHMLHQPVCREDNRGSVGMENG----- 180

Query: 181 DYQRSVQHLQGVAGLPENDIEASVQQP--------AGHHVFPGQSN----HLSLLRSRLL 240
           DYQRS ++   VA   + +    V++P        A   +   Q+      ++L  S L+
Sbjct: 181 DYQRSGRY---VADEKDGEFLRPVEKPRIPNKEKIAAVSMRDDQNQEEQARVNLSMSPLI 240

Query: 241 APLGIPFCSASIGGARKARPVDCGGD-FSISDIGRLLDTESLGRRMEQIAAGQGLGSVSG 300
           APLGIPFCSAS+GG+ +  PV    +  S  D G L D E L +RME IA  QGL  VS 
Sbjct: 241 APLGIPFCSASVGGSPRTIPVSTNAELISCYDSGGLPDIEMLRKRMENIAVAQGLEGVSM 300

Query: 301 DCASILNKVLDVYLKQLIRSCVDLGGSSWPAYEPEKPLAHKQQIQGRVINGLLPNNQLHG 360
           +CA  LN +LDVYLK+LI SC DL G+     +P K    KQQ Q +++NG+ P N L  
Sbjct: 301 ECAKTLNNMLDVYLKKLINSCFDLVGARSTNGDPGKQRIGKQQSQNKIVNGVWPTNSLKI 360

Query: 361 RHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICLRA 367
           +  N + +    H    S+S+LDF+ AMELNP+QLGEDWP L E+I LR+
Sbjct: 361 QTPNGSSDIRQDHH---SVSMLDFRTAMELNPRQLGEDWPTLRERISLRS 399

BLAST of CmaCh12G002920 vs. TAIR 10
Match: AT4G31440.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24530.1); Has 210 Blast hits to 209 proteins in 55 species: Archae - 0; Bacteria - 72; Metazoa - 2; Fungi - 6; Plants - 128; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 252.7 bits (644), Expect = 4.6e-67
Identity = 170/398 (42.71%), Positives = 221/398 (55.53%), Query Frame = 0

Query: 1   MRPQQSLRIGLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENL 60
           M+  Q  RI L ELK  IVKK+G +RS RYF+YL RFLSQKL+K+EFDK C R+LGRENL
Sbjct: 1   MQRLQDPRIDLAELKVHIVKKVGVERSTRYFYYLGRFLSQKLTKSEFDKSCFRLLGRENL 60

Query: 61  WLHNQLIHSILKNALQAKAAPPIPTS---AQSIPIWSNGGFPLSPRKSRS---------- 120
            LHN+LI SIL+NA  AK+ P +  S    +S+ +    G    P +SRS          
Sbjct: 61  SLHNKLIRSILRNASLAKSPPSVHQSGHPGKSLVLGKEDG----PEESRSLNPDHIRNDL 120

Query: 121 ----GI---------RDRKLKDRP-----NGMV-ECISHQSAGKDDGSCKITMDNDVATL 180
               G+          DR ++D+P     NG V    ++   G      +   + D A L
Sbjct: 121 ALSNGVLAKVRPGTCDDRTIRDKPCPLGSNGKVLGPFAYSRPG------RYPDERDSAFL 180

Query: 181 CD-YQRSVQHLQGVAGLPENDIEASVQQPAGHHVFPGQSNHLSLLRSRLLAPLGIPFCSA 240
           C   Q++V     VA     D EA V+                L    ++APLGIPFCSA
Sbjct: 181 CPAEQKAVSGKDQVAAPISRDDEAQVR---------------ILSTPPVMAPLGIPFCSA 240

Query: 241 SIGGARKARPVD-CGGDFSISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVL 300
           S+GG R+  PV       S  D G L DTE L +RME IA  QGLG VS +C+ +LN +L
Sbjct: 241 SVGGDRRTVPVSTSAAAISCYDSGGLSDTEMLRKRMENIAVTQGLGGVSAECSIVLNNML 300

Query: 301 DVYLKQLIRSCVDLGGSSWPAYEPEKPLAHKQQIQGRVINGLLPNNQLHGRHSNVNGEAT 360
           D+YLK+L++SCVDL G+      P K    KQQ +  ++NG+  NN  H + SN   + T
Sbjct: 301 DLYLKKLMKSCVDLAGARSMNGTPGKHSLEKQQSRDELVNGVRTNNSFHIQTSNQPSDIT 360

Query: 361 YKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICL 365
              R Q S+SLLDF++AMELNP QLGEDWPLL E+I +
Sbjct: 361 ---REQHSVSLLDFRVAMELNPHQLGEDWPLLRERISI 370

BLAST of CmaCh12G002920 vs. TAIR 10
Match: AT4G33890.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G14850.1); Has 133 Blast hits to 131 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 2; Plants - 129; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 146.4 bits (368), Expect = 4.7e-35
Identity = 120/385 (31.17%), Positives = 182/385 (47.27%), Query Frame = 0

Query: 4   QQSLRIGLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLH 63
           Q S R+   E+K+ I +++G  R++ YF  L RF + K++K+EFDKLC + +GR+N+ LH
Sbjct: 5   QGSSRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQNIHLH 64

Query: 64  NQLIHSILKNALQAKAAPPI--------------PTSAQSIPIWSNGGFPLSPRKSRSGI 123
           N+LI SI+KNA  AK+ P I                ++Q  P+  +  F  S RK RS  
Sbjct: 65  NRLIRSIIKNACIAKSPPFIKKGGSFVRFGNGDSKKNSQIQPLHGDSAFSPSTRKCRS-- 124

Query: 124 RDRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRSVQHLQGVAGLPENDI 183
             RKL+DRP+ +         G       +T  N+ +      +S   L  +   P  ++
Sbjct: 125 --RKLRDRPSPL---------GPLGKPHSLTTTNEES--MSKAQSATELLSLGSRPPVEV 184

Query: 184 EASVQQPAGHHVFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARK--ARPVDCGGDF--- 243
             SV++        G S  +   R  L APLG+   S   G  RK  +    C   F   
Sbjct: 185 -VSVEEGEEVEQIAGGSPSVQ-SRCPLTAPLGVSM-SLRNGATRKSVSNVSMCSRSFNRE 244

Query: 244 SISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLGGSS 303
           +  + G L DT +L  R+E+    +GL  ++ D  S+LN  LDV++++LI  C+ L  + 
Sbjct: 245 TCQNNGELPDTRTLRSRLERRLEMEGL-KITMDSVSLLNSGLDVFMRRLIEPCLSLANTR 304

Query: 304 WPAYEPEKPLAHKQQIQGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAM 363
                                           R   +N + T + R    +S+ DF+  M
Sbjct: 305 CGT----------------------------DRVREMNYQYTQQSRRLSYVSMSDFRAGM 342

Query: 364 ELNPKQLGEDWPLLMEKICLRASNK 370
           ELN + LGEDWP+ MEKIC RAS+K
Sbjct: 365 ELNTEILGEDWPMHMEKICSRASDK 342

BLAST of CmaCh12G002920 vs. TAIR 10
Match: AT4G33890.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G14850.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 146.4 bits (368), Expect = 4.7e-35
Identity = 120/385 (31.17%), Positives = 182/385 (47.27%), Query Frame = 0

Query: 4   QQSLRIGLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLH 63
           Q S R+   E+K+ I +++G  R++ YF  L RF + K++K+EFDKLC + +GR+N+ LH
Sbjct: 5   QGSSRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQNIHLH 64

Query: 64  NQLIHSILKNALQAKAAPPI--------------PTSAQSIPIWSNGGFPLSPRKSRSGI 123
           N+LI SI+KNA  AK+ P I                ++Q  P+  +  F  S RK RS  
Sbjct: 65  NRLIRSIIKNACIAKSPPFIKKGGSFVRFGNGDSKKNSQIQPLHGDSAFSPSTRKCRS-- 124

Query: 124 RDRKLKDRPNGMVECISHQSAGKDDGSCKITMDNDVATLCDYQRSVQHLQGVAGLPENDI 183
             RKL+DRP+ +         G       +T  N+ +      +S   L  +   P  ++
Sbjct: 125 --RKLRDRPSPL---------GPLGKPHSLTTTNEES--MSKAQSATELLSLGSRPPVEV 184

Query: 184 EASVQQPAGHHVFPGQSNHLSLLRSRLLAPLGIPFCSASIGGARK--ARPVDCGGDF--- 243
             SV++        G S  +   R  L APLG+   S   G  RK  +    C   F   
Sbjct: 185 -VSVEEGEEVEQIAGGSPSVQ-SRCPLTAPLGVSM-SLRNGATRKSVSNVSMCSRSFNRE 244

Query: 244 SISDIGRLLDTESLGRRMEQIAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLGGSS 303
           +  + G L DT +L  R+E+    +GL  ++ D  S+LN  LDV++++LI  C+ L  + 
Sbjct: 245 TCQNNGELPDTRTLRSRLERRLEMEGL-KITMDSVSLLNSGLDVFMRRLIEPCLSLANTR 304

Query: 304 WPAYEPEKPLAHKQQIQGRVINGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAM 363
                                           R   +N + T + R    +S+ DF+  M
Sbjct: 305 CGT----------------------------DRVREMNYQYTQQSRRLSYVSMSDFRAGM 342

Query: 364 ELNPKQLGEDWPLLMEKICLRASNK 370
           ELN + LGEDWP+ MEKIC RAS+K
Sbjct: 365 ELNTEILGEDWPMHMEKICSRASDK 342

BLAST of CmaCh12G002920 vs. TAIR 10
Match: AT2G14850.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G33890.2); Has 140 Blast hits to 132 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 2; Plants - 133; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink). )

HSP 1 Score: 145.6 bits (366), Expect = 8.0e-35
Identity = 119/365 (32.60%), Positives = 176/365 (48.22%), Query Frame = 0

Query: 8   RIGLGELKSQIVKKLGTDRSKRYFFYLNRFLSQKLSKNEFDKLCCRVLGRENLWLHNQLI 67
           R+   E+K+ I +K+G  R+  YF  L +FL+ ++SK+EFDKLC + +GREN+ LHN+L+
Sbjct: 9   RLNSLEIKALIYQKIGHQRADTYFDQLGKFLTSRISKSEFDKLCSKTVGRENISLHNRLV 68

Query: 68  HSILKNALQAKAAPP-IPTSAQSIPIWSNGGFPLSPRKSRSGIRDRKLKDRPNGMVECIS 127
            SILKNA  AK+ PP  P  +    ++ +  FP SPRK RS    RK +DRP+ +     
Sbjct: 69  RSILKNASVAKSPPPRYPKKS----LYGDPVFPPSPRKCRS----RKFRDRPSPLGPLGK 128

Query: 128 HQSAGKDDGSCKITMDNDVATLCDYQRSVQHLQGVAGLPENDIEASVQQPAGHHVFPGQS 187
            QS         +T  ND  ++   QR    +  V    E      V+Q  G    P   
Sbjct: 129 PQS---------LTTTND-ESMSKAQRLPMEVVSVEDGEE------VEQMTGS---PSVQ 188

Query: 188 NHLSLLRSRLLAPLGIPFCSASIGGARKARPVDCGG--DFSISDIGRLLDTESLGRRMEQ 247
           +     RS L APLG+ F   S     KAR     G    +    G L D  +L  R+E+
Sbjct: 189 S-----RSPLTAPLGVSFHLKS-----KARFSTYNGINRETCQSSGELPDMITLRARLEK 248

Query: 248 IAAGQGLGSVSGDCASILNKVLDVYLKQLIRSCVDLGGSSWPAYEPEKPLAHKQQIQGRV 307
               +G+  +S D A++LN+ L+ Y+++LI  C+ L                        
Sbjct: 249 KLEMEGI-KLSMDSANLLNRGLNAYMRRLIEPCLSL------------------------ 291

Query: 308 INGLLPNNQLHGRHSNVNGEATYKHRLQCSISLLDFKLAMELNPKQLGEDWPLLMEKICL 367
                               A+ + R   ++S+LDF  AME+NP+ LGE+WP+ +EKIC 
Sbjct: 309 --------------------ASQQKRAVSNVSMLDFHAAMEVNPRVLGEEWPIQLEKICC 291

Query: 368 RASNK 370
           RAS +
Sbjct: 369 RASEE 291

The following BLAST results are available for this feature:

ExPASy TrEMBL
Match Name | E-value | Identity | Description
A0A6J1KJD6 | 5.7e-214 | 100.00 | uncharacterized protein LOC111496247 OS=Cucurbita maxima OX=3661 GN=LOC111496247...
A0A6J1GHT2 | 1.8e-204 | 97.02 | uncharacterized protein LOC111454310 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1GHW6 | 2.2e-194 | 97.41 | uncharacterized protein LOC111454310 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1HF85 | 3.0e-159 | 73.66 | uncharacterized protein LOC111463000 OS=Cucurbita moschata OX=3662 GN=LOC1114630...
A0A6J1K7Q1 | 3.2e-156 | 72.05 | uncharacterized protein LOC111492414 OS=Cucurbita maxima OX=3661 GN=LOC111492414...

NCBI nr
Match Name | E-value | Identity | Description
XP_023002392.1 | 1.2e-213 | 100.00 | uncharacterized protein LOC111496247 [Cucurbita maxima]
XP_023537842.1 | 2.0e-205 | 96.75 | uncharacterized protein LOC111798750 [Cucurbita pepo subsp. pepo]
XP_022951516.1 | 3.8e-204 | 97.02 | uncharacterized protein LOC111454310 isoform X1 [Cucurbita moschata]
KAG6585423.1 | 7.1e-203 | 96.22 | hypothetical protein SDJN03_18156, partial [Cucurbita argyrosperma subsp. sorori...
XP_022951518.1 | 4.6e-194 | 97.41 | uncharacterized protein LOC111454310 isoform X2 [Cucurbita moschata]

TAIR 10
Match Name | E-value | Identity | Description
AT2G24530.1 | 1.0e-82 | 44.88 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G31440.1 | 4.6e-67 | 42.71 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G33890.1 | 4.7e-35 | 31.17 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G33890.2 | 4.7e-35 | 31.17 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G14850.1 | 8.0e-35 | 32.60 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR024738 | Transcriptional coactivator Hfi1/Transcriptional adapter 1 | PFAM | PF12767 | SAGA-Tad1 | coord: 5..280; e-value: 1.2E-53; score: 182.3
IPR024738 | Transcriptional coactivator Hfi1/Transcriptional adapter 1 | PANTHER | PTHR21277 | TRANSCRIPTIONAL ADAPTER 1 | coord: 1..367
None | No IPR available | PANTHER | PTHR21277:SF38 | TRANSCRIPTIONAL REGULATOR OF RNA POLII, SAGA, SUBUNIT | coord: 1..367
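The `coord:` entries give the residue range that each signature covers on the protein. As a rough illustration, the sketch below extracts a range and reports what fraction of the 369-residue protein it spans. It assumes the coordinates are 1-based and inclusive, and `parse_coord` and `coverage` are hypothetical helper names.

```python
import re

def parse_coord(text):
    """Extract (start, end) from an InterPro-style 'coord: START..END' string."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if m is None:
        raise ValueError(f"no coordinate range found in {text!r}")
    return int(m.group(1)), int(m.group(2))

def coverage(start, end, protein_length):
    """Fraction of the protein covered by an inclusive residue range."""
    return (end - start + 1) / protein_length

PROTEIN_LENGTH = 369  # residues, from the protein sequence of this feature

start, end = parse_coord("coord: 5..280")
print(end - start + 1, round(100 * coverage(start, end, PROTEIN_LENGTH), 1))
```

Under these assumptions the Pfam SAGA-Tad1 match spans 276 residues, roughly three quarters of the protein, while the PANTHER family assignments (1..367) cover nearly its full length.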

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmaCh12G002920.1 | CmaCh12G002920.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006357 regulation of transcription by RNA polymerase II
cellular_component GO:0000124 SAGA complex
cellular_component GO:0070461 SAGA-type complex
molecular_function GO:0003713 transcription coactivator activity
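For downstream use, the GO table can be grouped by category. This is a minimal sketch that assumes each row has the layout shown above (category, accession, then a free-text term name); `group_go_terms` is a hypothetical helper, and rows whose second field is not a `GO:` accession, such as the header, are skipped.

```python
from collections import defaultdict

GO_TABLE = """\
Category Term Accession Term Name
biological_process GO:0006357 regulation of transcription by RNA polymerase II
cellular_component GO:0000124 SAGA complex
cellular_component GO:0070461 SAGA-type complex
molecular_function GO:0003713 transcription coactivator activity
"""

def group_go_terms(text):
    """Group (accession, term name) pairs by GO category."""
    groups = defaultdict(list)
    for line in text.splitlines():
        parts = line.strip().split(maxsplit=2)
        # Skip blank lines, the header row, and anything without a GO accession.
        if len(parts) < 3 or not parts[1].startswith("GO:"):
            continue
        category, accession, name = parts
        groups[category].append((accession, name))
    return dict(groups)

for category, terms in group_go_terms(GO_TABLE).items():
    print(category, terms)
```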