Cp4.1LG06g04840 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG06g04840
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionSAGA-Tad1 domain-containing protein
LocationCp4.1LG06: 2207324 .. 2208577 (+)
RNA-Seq ExpressionCp4.1LG06g04840
SyntenyCp4.1LG06g04840
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCAATCTCAGCAAGGCTCCAGAATTGATTTAGGCGACTTGAAGGCTCAGATAGTTAAAAAACTTGGAAATGACAAGTCCAAGCGGTACTTCTTCTACTTGAGCAAATTCTTGGGTCAGAAGCTGAGCAAGGTTGAGTTTGATAAGCTGTGCGTTCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCAATTTTGAAGAATGCATGTGTAGCCAAGACCCCACCACTTATAAATGTATCAGGACATGCACAATCTGTACTACAAGCTTCGAACAATACTCCTTGTAGAGAAGATGCCCCTGAACAAACTGGATCTGCCTTCCCAAATCAGAATCAGAGTATACCAATTTGGACAAACGGGGTTCTGCCAGTGTCCCCACGGAAGGGTAGATCTGTCTTACGCGGGAAGTTTAGGGATAGGCCGAGTCCGCTTGGTCCAAATGGAAAAACAGCATGTCTTTCGTATCAATCAACAGGTACTGAAGATAGGAAAGTTATTACAGAGAATGGTAATGTAACCATGTGTGACTATCAGAGGCCTGTACAGCAACTTCAAGCAGTAGCTGAGCTACCTGAGAATGACATAGATGGATCAGTTCAGCGCCCGTCAGGAAAACCAAGAATACGTCCAACAGAAGCATCCATTCTTGAAGAAGGAGAGGAGGTGGAACAATCAGATCCCTTAAGCTTCCTTAGAGGTCCTCTACTTCCACCTCTTGGTATTCCATTTTGTTCAGCGAGTGTGGGTGGGGCACGCAAGGCCTTGCCAGTCAGCAGTAGTGGCAGTTGTGTCGATTTTCTGAGTTGTTATGACAGTATTGGATTGTCTGATTCAGAGACAGTGAGAAAACGCATGGAACAAATTGCAACTGCACAAGGGCTTGATGGTGTTTCTTTGGAATGCCCTAACATCCTGAATAATACTCTTGATGTGTACCTGAAGCAATTGATAAAGTCTTGCCTTGAGTTGGTGAGAACAAGGTCTACATTTGAACATACGGGGCATCCGATCCAGAAGCAACAAAATCAAGGGAAGGTTATAAATGGTATGTGGCCTACTAACCACCTACGTGTACAGAATAGCAATGGGCGATCTGAAGTTTTGGAGGAAAAGAGTTTTGAATGCTCAGTGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTCAATCCAAAGCAGCTTGGGGAAGATTGGCCTTTGCTGTTGGAGAAAATTAGTATGCGTGCCTTTGAGGAATAA

mRNA sequence

ATGCAATCTCAGCAAGGCTCCAGAATTGATTTAGGCGACTTGAAGGCTCAGATAGTTAAAAAACTTGGAAATGACAAGTCCAAGCGGTACTTCTTCTACTTGAGCAAATTCTTGGGTCAGAAGCTGAGCAAGGTTGAGTTTGATAAGCTGTGCGTTCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCAATTTTGAAGAATGCATGTGTAGCCAAGACCCCACCACTTATAAATGTATCAGGACATGCACAATCTGTACTACAAGCTTCGAACAATACTCCTTGTAGAGAAGATGCCCCTGAACAAACTGGATCTGCCTTCCCAAATCAGAATCAGAGTATACCAATTTGGACAAACGGGGTTCTGCCAGTGTCCCCACGGAAGGGTAGATCTGTCTTACGCGGGAAGTTTAGGGATAGGCCGAGTCCGCTTGGTCCAAATGGAAAAACAGCATGTCTTTCGTATCAATCAACAGGTACTGAAGATAGGAAAGTTATTACAGAGAATGGTAATGTAACCATGTGTGACTATCAGAGGCCTGTACAGCAACTTCAAGCAGTAGCTGAGCTACCTGAGAATGACATAGATGGATCAGTTCAGCGCCCGTCAGGAAAACCAAGAATACGTCCAACAGAAGCATCCATTCTTGAAGAAGGAGAGGAGGTGGAACAATCAGATCCCTTAAGCTTCCTTAGAGGTCCTCTACTTCCACCTCTTGGTATTCCATTTTGTTCAGCGAGTGTGGGTGGGGCACGCAAGGCCTTGCCAGTCAGCAGTAGTGGCAGTTGTGTCGATTTTCTGAGTTGTTATGACAGTATTGGATTGTCTGATTCAGAGACAGTGAGAAAACGCATGGAACAAATTGCAACTGCACAAGGGCTTGATGGTGTTTCTTTGGAATGCCCTAACATCCTGAATAATACTCTTGATGTGTACCTGAAGCAATTGATAAAGTCTTGCCTTGAGTTGGTGAGAACAAGGTCTACATTTGAACATACGGGGCATCCGATCCAGAAGCAACAAAATCAAGGGAAGGTTATAAATGGTATGTGGCCTACTAACCACCTACGTGTACAGAATAGCAATGGGCGATCTGAAGTTTTGGAGGAAAAGAGTTTTGAATGCTCAGTGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTCAATCCAAAGCAGCTTGGGGAAGATTGGCCTTTGCTGTTGGAGAAAATTAGTATGCGTGCCTTTGAGGAATAA

Coding sequence (CDS)

ATGCAATCTCAGCAAGGCTCCAGAATTGATTTAGGCGACTTGAAGGCTCAGATAGTTAAAAAACTTGGAAATGACAAGTCCAAGCGGTACTTCTTCTACTTGAGCAAATTCTTGGGTCAGAAGCTGAGCAAGGTTGAGTTTGATAAGCTGTGCGTTCGTGTGCTTGGAAGGGAGAATATTCAGCTCCACAATCAATTGATAAGGTCAATTTTGAAGAATGCATGTGTAGCCAAGACCCCACCACTTATAAATGTATCAGGACATGCACAATCTGTACTACAAGCTTCGAACAATACTCCTTGTAGAGAAGATGCCCCTGAACAAACTGGATCTGCCTTCCCAAATCAGAATCAGAGTATACCAATTTGGACAAACGGGGTTCTGCCAGTGTCCCCACGGAAGGGTAGATCTGTCTTACGCGGGAAGTTTAGGGATAGGCCGAGTCCGCTTGGTCCAAATGGAAAAACAGCATGTCTTTCGTATCAATCAACAGGTACTGAAGATAGGAAAGTTATTACAGAGAATGGTAATGTAACCATGTGTGACTATCAGAGGCCTGTACAGCAACTTCAAGCAGTAGCTGAGCTACCTGAGAATGACATAGATGGATCAGTTCAGCGCCCGTCAGGAAAACCAAGAATACGTCCAACAGAAGCATCCATTCTTGAAGAAGGAGAGGAGGTGGAACAATCAGATCCCTTAAGCTTCCTTAGAGGTCCTCTACTTCCACCTCTTGGTATTCCATTTTGTTCAGCGAGTGTGGGTGGGGCACGCAAGGCCTTGCCAGTCAGCAGTAGTGGCAGTTGTGTCGATTTTCTGAGTTGTTATGACAGTATTGGATTGTCTGATTCAGAGACAGTGAGAAAACGCATGGAACAAATTGCAACTGCACAAGGGCTTGATGGTGTTTCTTTGGAATGCCCTAACATCCTGAATAATACTCTTGATGTGTACCTGAAGCAATTGATAAAGTCTTGCCTTGAGTTGGTGAGAACAAGGTCTACATTTGAACATACGGGGCATCCGATCCAGAAGCAACAAAATCAAGGGAAGGTTATAAATGGTATGTGGCCTACTAACCACCTACGTGTACAGAATAGCAATGGGCGATCTGAAGTTTTGGAGGAAAAGAGTTTTGAATGCTCAGTGTCATTGCTTGATTTCAAAGTTGCTATGGAGCTCAATCCAAAGCAGCTTGGGGAAGATTGGCCTTTGCTGTTGGAGAAAATTAGTATGCGTGCCTTTGAGGAATAA

Protein sequence

MQSQQGSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQSIPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGLDGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
Homology
BLAST of Cp4.1LG06g04840 vs. NCBI nr
Match: XP_023536522.1 (uncharacterized protein LOC111797673 [Cucurbita pepo subsp. pepo] >XP_023536523.1 uncharacterized protein LOC111797673 [Cucurbita pepo subsp. pepo] >XP_023536524.1 uncharacterized protein LOC111797673 [Cucurbita pepo subsp. pepo] >XP_023536525.1 uncharacterized protein LOC111797673 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 834 bits (2154), Expect = 5.93e-305
Identity = 417/417 (100.00%), Postives = 417/417 (100.00%), Query Frame = 0

Query: 1   MQSQQGSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI 60
           MQSQQGSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI
Sbjct: 1   MQSQQGSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQSI 120
           QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQSI
Sbjct: 61  QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQSI 120

Query: 121 PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTM 180
           PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTM
Sbjct: 121 PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTM 180

Query: 181 CDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGP 240
           CDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGP
Sbjct: 181 CDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGP 240

Query: 241 LLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL 300
           LLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL
Sbjct: 241 LLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL 300

Query: 301 DGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTN 360
           DGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTN
Sbjct: 301 DGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTN 360

Query: 361 HLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 417
           HLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
Sbjct: 361 HLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 417

BLAST of Cp4.1LG06g04840 vs. NCBI nr
Match: XP_022976270.1 (uncharacterized protein LOC111476715 [Cucurbita maxima] >XP_022976271.1 uncharacterized protein LOC111476715 [Cucurbita maxima] >XP_022976272.1 uncharacterized protein LOC111476715 [Cucurbita maxima] >XP_022976273.1 uncharacterized protein LOC111476715 [Cucurbita maxima])

HSP 1 Score: 830 bits (2144), Expect = 1.98e-303
Identity = 415/417 (99.52%), Postives = 416/417 (99.76%), Query Frame = 0

Query: 1   MQSQQGSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI 60
           MQSQQ SRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI
Sbjct: 1   MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQSI 120
           QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQSI
Sbjct: 61  QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQSI 120

Query: 121 PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTM 180
           PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTM
Sbjct: 121 PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTM 180

Query: 181 CDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGP 240
           CDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGP
Sbjct: 181 CDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGP 240

Query: 241 LLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL 300
           LLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL
Sbjct: 241 LLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL 300

Query: 301 DGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTN 360
           +GVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTN
Sbjct: 301 EGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTN 360

Query: 361 HLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 417
           HLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
Sbjct: 361 HLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 417

BLAST of Cp4.1LG06g04840 vs. NCBI nr
Match: KAG6591626.1 (hypothetical protein SDJN03_13972, partial [Cucurbita argyrosperma subsp. sororia] >KAG7024509.1 hypothetical protein SDJN02_13325, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 826 bits (2134), Expect = 6.64e-302
Identity = 413/417 (99.04%), Postives = 414/417 (99.28%), Query Frame = 0

Query: 1   MQSQQGSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI 60
           MQSQQ SRIDL DLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI
Sbjct: 1   MQSQQSSRIDLADLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQSI 120
           QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQ I
Sbjct: 61  QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQGI 120

Query: 121 PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTM 180
           PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTM
Sbjct: 121 PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTM 180

Query: 181 CDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGP 240
           CDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGP
Sbjct: 181 CDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGP 240

Query: 241 LLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL 300
           LLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL
Sbjct: 241 LLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL 300

Query: 301 DGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTN 360
           +GVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTN
Sbjct: 301 EGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTN 360

Query: 361 HLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 417
           HLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
Sbjct: 361 HLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 417

BLAST of Cp4.1LG06g04840 vs. NCBI nr
Match: XP_022937292.1 (uncharacterized protein LOC111443621 [Cucurbita moschata] >XP_022937294.1 uncharacterized protein LOC111443621 [Cucurbita moschata] >XP_022937295.1 uncharacterized protein LOC111443621 [Cucurbita moschata])

HSP 1 Score: 824 bits (2129), Expect = 3.84e-301
Identity = 412/417 (98.80%), Postives = 413/417 (99.04%), Query Frame = 0

Query: 1   MQSQQGSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI 60
           MQSQQ SRIDL DLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI
Sbjct: 1   MQSQQSSRIDLADLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQSI 120
           QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQ I
Sbjct: 61  QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQGI 120

Query: 121 PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTM 180
           PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTM
Sbjct: 121 PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTM 180

Query: 181 CDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGP 240
           CDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGP
Sbjct: 181 CDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGP 240

Query: 241 LLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL 300
           LLPPLGIPFCSASVGGARKALPVSSSGSCVDFL CYDSIGLSDSETVRKRMEQIATAQGL
Sbjct: 241 LLPPLGIPFCSASVGGARKALPVSSSGSCVDFLRCYDSIGLSDSETVRKRMEQIATAQGL 300

Query: 301 DGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTN 360
           +GVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTN
Sbjct: 301 EGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTN 360

Query: 361 HLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 417
           HLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
Sbjct: 361 HLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 417

BLAST of Cp4.1LG06g04840 vs. NCBI nr
Match: XP_038899147.1 (uncharacterized protein LOC120086522 isoform X1 [Benincasa hispida] >XP_038899148.1 uncharacterized protein LOC120086522 isoform X1 [Benincasa hispida])

HSP 1 Score: 760 bits (1963), Expect = 7.92e-276
Identity = 382/419 (91.17%), Postives = 395/419 (94.27%), Query Frame = 0

Query: 1   MQSQQGSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI 60
           MQ Q  SRIDLGDLKAQIVKKLGND+SKRYFFYLS+FLGQKLSKVEFDK+CVRVLGRENI
Sbjct: 1   MQPQHSSRIDLGDLKAQIVKKLGNDRSKRYFFYLSRFLGQKLSKVEFDKVCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQSI 120
           QLHNQLIRSILKNACVAKTPP IN SGHAQSVLQ SN +PCR+D PEQTGSAFPNQNQSI
Sbjct: 61  QLHNQLIRSILKNACVAKTPPSINASGHAQSVLQPSNISPCRDDGPEQTGSAFPNQNQSI 120

Query: 121 PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDR--KVITENGNV 180
           PIW+NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGK  CLSYQSTGTED   KVITENGNV
Sbjct: 121 PIWSNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGTEDSNSKVITENGNV 180

Query: 181 TMCDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLR 240
           TMCDYQRPVQ LQAVAELPENDIDG+V RPS KPRI PTEA+ILEEGEEVEQSDPLSFLR
Sbjct: 181 TMCDYQRPVQHLQAVAELPENDIDGAVHRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLR 240

Query: 241 GPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQ 300
           GPLLPPLGIPFCSASVGGARKALPV+SSGS  DFLSCYDSIGLSDS TVRKRMEQIATAQ
Sbjct: 241 GPLLPPLGIPFCSASVGGARKALPVNSSGSS-DFLSCYDSIGLSDSGTVRKRMEQIATAQ 300

Query: 301 GLDGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWP 360
           GL+GVS+ECPNILNNTLDVYLKQLIKSCLELVR RSTFEHTGHPIQKQQNQGKV+N MWP
Sbjct: 301 GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHTGHPIQKQQNQGKVVNDMWP 360

Query: 361 TNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 417
           TNHLRVQNSNGRSEVL+EKS ECSVSLLDFKVAMELNPKQLGEDWPLLLEKI MRAFEE
Sbjct: 361 TNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKICMRAFEE 418

BLAST of Cp4.1LG06g04840 vs. ExPASy TrEMBL
Match: A0A6J1IIZ9 (uncharacterized protein LOC111476715 OS=Cucurbita maxima OX=3661 GN=LOC111476715 PE=4 SV=1)

HSP 1 Score: 830 bits (2144), Expect = 9.61e-304
Identity = 415/417 (99.52%), Postives = 416/417 (99.76%), Query Frame = 0

Query: 1   MQSQQGSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI 60
           MQSQQ SRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI
Sbjct: 1   MQSQQSSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQSI 120
           QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQSI
Sbjct: 61  QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQSI 120

Query: 121 PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTM 180
           PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTM
Sbjct: 121 PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTM 180

Query: 181 CDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGP 240
           CDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGP
Sbjct: 181 CDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGP 240

Query: 241 LLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL 300
           LLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL
Sbjct: 241 LLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL 300

Query: 301 DGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTN 360
           +GVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTN
Sbjct: 301 EGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTN 360

Query: 361 HLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 417
           HLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
Sbjct: 361 HLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 417

BLAST of Cp4.1LG06g04840 vs. ExPASy TrEMBL
Match: A0A6J1FAS2 (uncharacterized protein LOC111443621 OS=Cucurbita moschata OX=3662 GN=LOC111443621 PE=4 SV=1)

HSP 1 Score: 824 bits (2129), Expect = 1.86e-301
Identity = 412/417 (98.80%), Postives = 413/417 (99.04%), Query Frame = 0

Query: 1   MQSQQGSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI 60
           MQSQQ SRIDL DLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI
Sbjct: 1   MQSQQSSRIDLADLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQSI 120
           QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQ I
Sbjct: 61  QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQGI 120

Query: 121 PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTM 180
           PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTM
Sbjct: 121 PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTM 180

Query: 181 CDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGP 240
           CDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGP
Sbjct: 181 CDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRGP 240

Query: 241 LLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGL 300
           LLPPLGIPFCSASVGGARKALPVSSSGSCVDFL CYDSIGLSDSETVRKRMEQIATAQGL
Sbjct: 241 LLPPLGIPFCSASVGGARKALPVSSSGSCVDFLRCYDSIGLSDSETVRKRMEQIATAQGL 300

Query: 301 DGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTN 360
           +GVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTN
Sbjct: 301 EGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTN 360

Query: 361 HLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 417
           HLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
Sbjct: 361 HLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 417

BLAST of Cp4.1LG06g04840 vs. ExPASy TrEMBL
Match: A0A0A0LGS9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G881820 PE=4 SV=1)

HSP 1 Score: 759 bits (1960), Expect = 1.10e-275
Identity = 380/419 (90.69%), Postives = 399/419 (95.23%), Query Frame = 0

Query: 1   MQSQQGSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI 60
           MQ Q  SRIDLGDLKAQIVKKLGNDKSKRYFF+LS+FLGQK+SKVEFDK+CVRVLGRENI
Sbjct: 1   MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQSI 120
           QLHNQLIRSILKNACVAKTPP IN SGHAQSVLQASNN+PCRED PEQTGSAFPNQNQS 
Sbjct: 61  QLHNQLIRSILKNACVAKTPPPINASGHAQSVLQASNNSPCREDGPEQTGSAFPNQNQSK 120

Query: 121 PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDR--KVITENGNV 180
           PIW NGVLPVSPRKGRS LRGKFRDRPSPLGPNGK+ CLSYQSTG+ED   KVITENGNV
Sbjct: 121 PIWPNGVLPVSPRKGRSGLRGKFRDRPSPLGPNGKSTCLSYQSTGSEDSSSKVITENGNV 180

Query: 181 TMCDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLR 240
           T+CDYQRPV+ LQ+VAELPENDIDG+VQRPS KPRI PTEA+ILEEGEEVEQSDPLSFLR
Sbjct: 181 TLCDYQRPVRYLQSVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLSFLR 240

Query: 241 GPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQ 300
           GPLLPPLGIPFCSASVGGARKALPVSSSGS  DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Sbjct: 241 GPLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQ 300

Query: 301 GLDGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWP 360
           GL+GVS+ECP+ILNNTLDVYLKQLIKSCLELVR RSTFEH+GHPIQKQQNQGKV+NGMWP
Sbjct: 301 GLEGVSMECPSILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWP 360

Query: 361 TNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 417
           TNHLRVQNSNGRSEVL+EKS ECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
Sbjct: 361 TNHLRVQNSNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 418

BLAST of Cp4.1LG06g04840 vs. ExPASy TrEMBL
Match: A0A1S4E5S7 (uncharacterized protein LOC103503757 OS=Cucumis melo OX=3656 GN=LOC103503757 PE=4 SV=1)

HSP 1 Score: 753 bits (1944), Expect = 2.90e-273
Identity = 379/419 (90.45%), Postives = 396/419 (94.51%), Query Frame = 0

Query: 1   MQSQQGSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI 60
           MQ Q  SRIDLGDLKAQIVKKLGNDKSKRYFF+LS+FLGQK+SKVEFDK+CVRVLGRENI
Sbjct: 1   MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQSI 120
           QLHNQLIRSILKNACVAKTPP IN SGHAQSVL ASN +PCRED PEQTGSAFPNQNQS 
Sbjct: 61  QLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASN-SPCREDGPEQTGSAFPNQNQSK 120

Query: 121 PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDR--KVITENGNV 180
           PIW NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGK  CLSYQSTG+ED   KVITENGNV
Sbjct: 121 PIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNV 180

Query: 181 TMCDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLR 240
           T+CDYQRPVQ LQ+VAELPENDIDG+VQRPS KPRI PTEA+ILEEGEEVEQSDPL FLR
Sbjct: 181 TLCDYQRPVQYLQSVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLR 240

Query: 241 GPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQ 300
           GPLLPPLGIPFCSASVGGARKALPVSSSGS  DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Sbjct: 241 GPLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQ 300

Query: 301 GLDGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWP 360
           GL+GVS+ECPNILNNTLDVYLKQLIKSCLELVR RSTFEH+GHPIQKQQNQGKV+NGMWP
Sbjct: 301 GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWP 360

Query: 361 TNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 417
           TNHLRVQN+NGRSEVL+EKS ECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
Sbjct: 361 TNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 417

BLAST of Cp4.1LG06g04840 vs. ExPASy TrEMBL
Match: A0A5A7TBJ9 (SAGA-Tad1 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold455G006740 PE=4 SV=1)

HSP 1 Score: 753 bits (1944), Expect = 2.90e-273
Identity = 379/419 (90.45%), Postives = 396/419 (94.51%), Query Frame = 0

Query: 1   MQSQQGSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI 60
           MQ Q  SRIDLGDLKAQIVKKLGNDKSKRYFF+LS+FLGQK+SKVEFDK+CVRVLGRENI
Sbjct: 1   MQPQHSSRIDLGDLKAQIVKKLGNDKSKRYFFFLSRFLGQKMSKVEFDKVCVRVLGRENI 60

Query: 61  QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQSI 120
           QLHNQLIRSILKNACVAKTPP IN SGHAQSVL ASN +PCRED PEQTGSAFPNQNQS 
Sbjct: 61  QLHNQLIRSILKNACVAKTPPPINASGHAQSVLHASN-SPCREDGPEQTGSAFPNQNQSK 120

Query: 121 PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDR--KVITENGNV 180
           PIW NGVLPVSPRKGRSVLRGKFRDRPSPLGPNGK  CLSYQSTG+ED   KVITENGNV
Sbjct: 121 PIWPNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKITCLSYQSTGSEDSSSKVITENGNV 180

Query: 181 TMCDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLR 240
           T+CDYQRPVQ LQ+VAELPENDIDG+VQRPS KPRI PTEA+ILEEGEEVEQSDPL FLR
Sbjct: 181 TLCDYQRPVQYLQSVAELPENDIDGAVQRPSEKPRIHPTEAAILEEGEEVEQSDPLRFLR 240

Query: 241 GPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQ 300
           GPLLPPLGIPFCSASVGGARKALPVSSSGS  DFLSCYDSIGLSDSETVRKRMEQIA+AQ
Sbjct: 241 GPLLPPLGIPFCSASVGGARKALPVSSSGSS-DFLSCYDSIGLSDSETVRKRMEQIASAQ 300

Query: 301 GLDGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWP 360
           GL+GVS+ECPNILNNTLDVYLKQLIKSCLELVR RSTFEH+GHPIQKQQNQGKV+NGMWP
Sbjct: 301 GLEGVSMECPNILNNTLDVYLKQLIKSCLELVRARSTFEHSGHPIQKQQNQGKVLNGMWP 360

Query: 361 TNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 417
           TNHLRVQN+NGRSEVL+EKS ECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE
Sbjct: 361 TNHLRVQNNNGRSEVLQEKSLECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 417

BLAST of Cp4.1LG06g04840 vs. TAIR 10
Match: AT2G24530.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G31440.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 387.9 bits (995), Expect = 1.0e-107
Identity = 217/424 (51.18%), Postives = 283/424 (66.75%), Query Frame = 0

Query: 1   MQSQQGSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI 60
           MQ  Q  RI L +LK  IVKK G ++S+RYF+YL +FL QKL+K EFDK C+R+LGREN+
Sbjct: 1   MQRSQDQRISLCELKEHIVKKTGVERSRRYFYYLGRFLSQKLTKSEFDKTCLRLLGRENL 60

Query: 61  QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQSI 120
            LHNQLIRSIL+NA VAK+PP  + +GH+      +N    R D  EQ+G+  PN +Q  
Sbjct: 61  SLHNQLIRSILRNATVAKSPPPDHEAGHSTK----ANAFQSRGDGLEQSGTLIPNHSQHE 120

Query: 121 PIWTNGVLPVSPRKGRSVLRG-KFRDRPSPLGPNGKTACLSYQSTGTEDRK--VITENGN 180
           P+W+NGVLP+SPRK RS ++  K RDRPSPLG NGK   + +Q    ED +  V  ENG 
Sbjct: 121 PVWSNGVLPISPRKVRSGMQNRKSRDRPSPLGSNGKVEHMLHQPVCREDNRGSVGMENG- 180

Query: 181 VTMCDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTE---ASILEEGEEVEQSDPL 240
               DYQR  + +        ++ DG   RP  KPRI   E   A  + + +  E+   +
Sbjct: 181 ----DYQRSGRYV-------ADEKDGEFLRPVEKPRIPNKEKIAAVSMRDDQNQEEQARV 240

Query: 241 SFLRGPLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQI 300
           +    PL+ PLGIPFCSASVGG+ + +PVS++    + +SCYDS GL D E +RKRME I
Sbjct: 241 NLSMSPLIAPLGIPFCSASVGGSPRTIPVSTN---AELISCYDSGGLPDIEMLRKRMENI 300

Query: 301 ATAQGLDGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTG-HPIQKQQNQGKVI 360
           A AQGL+GVS+EC   LNN LDVYLK+LI SC +LV  RST    G   I KQQ+Q K++
Sbjct: 301 AVAQGLEGVSMECAKTLNNMLDVYLKKLINSCFDLVGARSTNGDPGKQRIGKQQSQNKIV 360

Query: 361 NGMWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMR 418
           NG+WPTN L++Q  NG S++ ++     SVS+LDF+ AMELNP+QLGEDWP L E+IS+R
Sbjct: 361 NGVWPTNSLKIQTPNGSSDIRQDHH---SVSMLDFRTAMELNPRQLGEDWPTLRERISLR 402

BLAST of Cp4.1LG06g04840 vs. TAIR 10
Match: AT4G31440.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24530.1); Has 210 Blast hits to 209 proteins in 55 species: Archae - 0; Bacteria - 72; Metazoa - 2; Fungi - 6; Plants - 128; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 307.8 bits (787), Expect = 1.4e-83
Identity = 192/419 (45.82%), Postives = 255/419 (60.86%), Query Frame = 0

Query: 1   MQSQQGSRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENI 60
           MQ  Q  RIDL +LK  IVKK+G ++S RYF+YL +FL QKL+K EFDK C R+LGREN+
Sbjct: 1   MQRLQDPRIDLAELKVHIVKKVGVERSTRYFYYLGRFLSQKLTKSEFDKSCFRLLGRENL 60

Query: 61  QLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQSI 120
            LHN+LIRSIL+NA +AK+PP ++ SGH    L        +ED PE++ S  P+  ++ 
Sbjct: 61  SLHNKLIRSILRNASLAKSPPSVHQSGHPGKSLVLG-----KEDGPEESRSLNPDHIRND 120

Query: 121 PIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKT-ACLSYQSTGTEDRKVITENGNVT 180
              +NGVL    R G    R   RD+P PLG NGK     +Y   G    +   E  +  
Sbjct: 121 LALSNGVL-AKVRPGTCDDR-TIRDKPCPLGSNGKVLGPFAYSRPG----RYPDERDSAF 180

Query: 181 MCDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQSDPLSFLRG 240
           +C    P +Q                +  SGK ++     S  +E +    S P      
Sbjct: 181 LC----PAEQ----------------KAVSGKDQV-AAPISRDDEAQVRILSTP------ 240

Query: 241 PLLPPLGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQG 300
           P++ PLGIPFCSASVGG R+ +PVS+S + +   SCYDS GLSD+E +RKRME IA  QG
Sbjct: 241 PVMAPLGIPFCSASVGGDRRTVPVSTSAAAI---SCYDSGGLSDTEMLRKRMENIAVTQG 300

Query: 301 LDGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTG-HPIQKQQNQGKVINGMWP 360
           L GVS EC  +LNN LD+YLK+L+KSC++L   RS     G H ++KQQ++ +++NG+  
Sbjct: 301 LGGVSAECSIVLNNMLDLYLKKLMKSCVDLAGARSMNGTPGKHSLEKQQSRDELVNGVRT 360

Query: 361 TNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 418
            N   +Q SN  S++  E+    SVSLLDF+VAMELNP QLGEDWPLL E+IS+  FEE
Sbjct: 361 NNSFHIQTSNQPSDITREQH---SVSLLDFRVAMELNPHQLGEDWPLLRERISISLFEE 375

BLAST of Cp4.1LG06g04840 vs. TAIR 10
Match: AT4G33890.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G14850.1); Has 133 Blast hits to 131 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 2; Plants - 129; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 178.7 bits (452), Expect = 9.6e-45
Identity = 143/422 (33.89%), Postives = 211/422 (50.00%), Query Frame = 0

Query: 1   MQSQQG-SRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGREN 60
           M S QG SR+D  ++KA I +++GN +++ YF  L +F   K++K EFDKLC++ +GR+N
Sbjct: 1   MGSNQGSSRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQN 60

Query: 61  IQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQS 120
           I LHN+LIRSI+KNAC+AK+PP I   G   S ++  N            G +  N +Q 
Sbjct: 61  IHLHNRLIRSIIKNACIAKSPPFIKKGG---SFVRFGN------------GDSKKN-SQI 120

Query: 121 IPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVT 180
            P+  +     S RK RS    K RDRPSPLGP GK   L+            T N    
Sbjct: 121 QPLHGDSAFSPSTRKCRS---RKLRDRPSPLGPLGKPHSLT------------TTN---- 180

Query: 181 MCDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQ---SDPLSF 240
               +  + + Q+  EL          RP       P E   +EEGEEVEQ     P   
Sbjct: 181 ----EESMSKAQSATELLSLG-----SRP-------PVEVVSVEEGEEVEQIAGGSPSVQ 240

Query: 241 LRGPLLPPLGIPFCSASVGGARKALP-VSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIA 300
            R PL  PLG+   S   G  RK++  VS      +  +C ++  L D+ T+R R+E+  
Sbjct: 241 SRCPLTAPLGVSM-SLRNGATRKSVSNVSMCSRSFNRETCQNNGELPDTRTLRSRLERRL 300

Query: 301 TAQGLDGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVING 360
             +GL  ++++  ++LN+ LDV++++LI+ CL L  TR                      
Sbjct: 301 EMEGLK-ITMDSVSLLNSGLDVFMRRLIEPCLSLANTRC--------------------- 342

Query: 361 MWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAF 418
              T+ +R  N     +  ++      VS+ DF+  MELN + LGEDWP+ +EKI  RA 
Sbjct: 361 --GTDRVREMN----YQYTQQSRRLSYVSMSDFRAGMELNTEILGEDWPMHMEKICSRAS 342

BLAST of Cp4.1LG06g04840 vs. TAIR 10
Match: AT4G33890.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G14850.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 178.7 bits (452), Expect = 9.6e-45
Identity = 143/422 (33.89%), Postives = 211/422 (50.00%), Query Frame = 0

Query: 1   MQSQQG-SRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGREN 60
           M S QG SR+D  ++KA I +++GN +++ YF  L +F   K++K EFDKLC++ +GR+N
Sbjct: 1   MGSNQGSSRLDTLEIKALIYREIGNQRAESYFNQLGRFFALKITKSEFDKLCIKTIGRQN 60

Query: 61  IQLHNQLIRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQS 120
           I LHN+LIRSI+KNAC+AK+PP I   G   S ++  N            G +  N +Q 
Sbjct: 61  IHLHNRLIRSIIKNACIAKSPPFIKKGG---SFVRFGN------------GDSKKN-SQI 120

Query: 121 IPIWTNGVLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVT 180
            P+  +     S RK RS    K RDRPSPLGP GK   L+            T N    
Sbjct: 121 QPLHGDSAFSPSTRKCRS---RKLRDRPSPLGPLGKPHSLT------------TTN---- 180

Query: 181 MCDYQRPVQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQ---SDPLSF 240
               +  + + Q+  EL          RP       P E   +EEGEEVEQ     P   
Sbjct: 181 ----EESMSKAQSATELLSLG-----SRP-------PVEVVSVEEGEEVEQIAGGSPSVQ 240

Query: 241 LRGPLLPPLGIPFCSASVGGARKALP-VSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIA 300
            R PL  PLG+   S   G  RK++  VS      +  +C ++  L D+ T+R R+E+  
Sbjct: 241 SRCPLTAPLGVSM-SLRNGATRKSVSNVSMCSRSFNRETCQNNGELPDTRTLRSRLERRL 300

Query: 301 TAQGLDGVSLECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVING 360
             +GL  ++++  ++LN+ LDV++++LI+ CL L  TR                      
Sbjct: 301 EMEGLK-ITMDSVSLLNSGLDVFMRRLIEPCLSLANTRC--------------------- 342

Query: 361 MWPTNHLRVQNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAF 418
              T+ +R  N     +  ++      VS+ DF+  MELN + LGEDWP+ +EKI  RA 
Sbjct: 361 --GTDRVREMN----YQYTQQSRRLSYVSMSDFRAGMELNTEILGEDWPMHMEKICSRAS 342

BLAST of Cp4.1LG06g04840 vs. TAIR 10
Match: AT2G14850.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G33890.2); Has 140 Blast hits to 132 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 2; Plants - 133; Viruses - 0; Other Eukaryotes - 4 (source: NCBI BLink). )

HSP 1 Score: 166.0 bits (419), Expect = 6.4e-41
Identity = 132/413 (31.96%), Postives = 187/413 (45.28%), Query Frame = 0

Query: 7   SRIDLGDLKAQIVKKLGNDKSKRYFFYLSKFLGQKLSKVEFDKLCVRVLGRENIQLHNQL 66
           SR++  ++KA I +K+G+ ++  YF  L KFL  ++SK EFDKLC + +GRENI LHN+L
Sbjct: 8   SRLNSLEIKALIYQKIGHQRADTYFDQLGKFLTSRISKSEFDKLCSKTVGRENISLHNRL 67

Query: 67  IRSILKNACVAKTPPLINVSGHAQSVLQASNNTPCREDAPEQTGSAFPNQNQSIPIWTNG 126
           +RSILKNA VAK+PP                               +P ++    ++ + 
Sbjct: 68  VRSILKNASVAKSPP-----------------------------PRYPKKS----LYGDP 127

Query: 127 VLPVSPRKGRSVLRGKFRDRPSPLGPNGKTACLSYQSTGTEDRKVITENGNVTMCDYQRP 186
           V P SPRK RS    KFRDRPSPLGP GK   L    T T D  +               
Sbjct: 128 VFPPSPRKCRS---RKFRDRPSPLGPLGKPQSL----TTTNDESM--------------- 187

Query: 187 VQQLQAVAELPENDIDGSVQRPSGKPRIRPTEASILEEGEEVEQ--SDPLSFLRGPLLPP 246
                                   K +  P E   +E+GEEVEQ    P    R PL  P
Sbjct: 188 -----------------------SKAQRLPMEVVSVEDGEEVEQMTGSPSVQSRSPLTAP 247

Query: 247 LGIPFCSASVGGARKALPVSSSGSCVDFLSCYDSIGLSDSETVRKRMEQIATAQGLDGVS 306
           LG+ F         K+    S+ + ++  +C  S  L D  T+R R+E+    +G+  +S
Sbjct: 248 LGVSF-------HLKSKARFSTYNGINRETCQSSGELPDMITLRARLEKKLEMEGIK-LS 291

Query: 307 LECPNILNNTLDVYLKQLIKSCLELVRTRSTFEHTGHPIQKQQNQGKVINGMWPTNHLRV 366
           ++  N+LN  L+ Y+++LI+ CL L                                   
Sbjct: 308 MDSANLLNRGLNAYMRRLIEPCLSLAS--------------------------------- 291

Query: 367 QNSNGRSEVLEEKSFECSVSLLDFKVAMELNPKQLGEDWPLLLEKISMRAFEE 418
                     ++K    +VS+LDF  AME+NP+ LGE+WP+ LEKI  RA EE
Sbjct: 368 ----------QQKRAVSNVSMLDFHAAMEVNPRVLGEEWPIQLEKICCRASEE 291

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_023536522.15.93e-305100.00uncharacterized protein LOC111797673 [Cucurbita pepo subsp. pepo] >XP_023536523.... [more]
XP_022976270.11.98e-30399.52uncharacterized protein LOC111476715 [Cucurbita maxima] >XP_022976271.1 uncharac... [more]
KAG6591626.16.64e-30299.04hypothetical protein SDJN03_13972, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_022937292.13.84e-30198.80uncharacterized protein LOC111443621 [Cucurbita moschata] >XP_022937294.1 unchar... [more]
XP_038899147.17.92e-27691.17uncharacterized protein LOC120086522 isoform X1 [Benincasa hispida] >XP_03889914... [more]
Match NameE-valueIdentityDescription
A0A6J1IIZ99.61e-30499.52uncharacterized protein LOC111476715 OS=Cucurbita maxima OX=3661 GN=LOC111476715... [more]
A0A6J1FAS21.86e-30198.80uncharacterized protein LOC111443621 OS=Cucurbita moschata OX=3662 GN=LOC1114436... [more]
A0A0A0LGS91.10e-27590.69Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G881820 PE=4 SV=1[more]
A0A1S4E5S72.90e-27390.45uncharacterized protein LOC103503757 OS=Cucumis melo OX=3656 GN=LOC103503757 PE=... [more]
A0A5A7TBJ92.90e-27390.45SAGA-Tad1 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5... [more]
Match NameE-valueIdentityDescription
AT2G24530.11.0e-10751.18unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G31440.11.4e-8345.82unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G33890.19.6e-4533.89unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G33890.29.6e-4533.89unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT2G14850.16.4e-4131.96unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR024738Transcriptional coactivator Hfi1/Transcriptional adapter 1PFAMPF12767SAGA-Tad1coord: 5..330
e-value: 4.2E-58
score: 196.9
IPR024738Transcriptional coactivator Hfi1/Transcriptional adapter 1PANTHERPTHR21277TRANSCRIPTIONAL ADAPTER 1coord: 1..417
NoneNo IPR availablePANTHERPTHR21277:SF38TRANSCRIPTIONAL REGULATOR OF RNA POLII, SAGA, SUBUNITcoord: 1..417

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG06g04840.1Cp4.1LG06g04840.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0045893 positive regulation of transcription, DNA-templated
biological_process GO:0006357 regulation of transcription by RNA polymerase II
cellular_component GO:0000124 SAGA complex
cellular_component GO:0070461 SAGA-type complex
molecular_function GO:0003713 transcription coactivator activity