Cp4.1LG06g05060 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG06g05060
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Transcription initiation factor IIE subunit alpha-like
Location: Cp4.1LG06: 3067006 .. 3069230 (+)
RNA-Seq Expression: Cp4.1LG06g05060
Synteny: Cp4.1LG06g05060
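
The location string in the overview can be split into sequence ID, coordinates, and strand to get the genomic span of the gene. The following is a minimal sketch, not part of the database record; the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(text):
    """Parse 'SEQID: START .. END (STRAND)' into its parts plus the span length."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: both coordinates are 1-based and inclusive.
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG06: 3067006 .. 3069230 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the span is 2225 bp, which covers exons and introns together.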
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGTTATTTGAAAATGATGATATTTCCCTACTAAAGGGACACCAATAATGCCATGTTACCCACAGCTATACTTTTTGAAAGATGGGAGTGGAAGAACCCTAAAAGTGTCTTTTTAGTTATAAAAATATTAAGCATTAAAAATAATTTGAGCAAATAATAATAATATTTATGGGACAGAGATACTCTATGGACCAAAGAGAGGAACAAAACATTATTTATAAGGCGTAGAAATCTCATTTTAACATACGCGTTTTAAAACTGTGAGACTGAATAGATAACGACTCAAAACAAATAAATTCTACTAGTGATGGCATCAGAAATGATTGTCAGATCTCACATTGGTGTGAAAACCTTTTCCTTATAGTTGCGTTTTAAAACCGAATGGTGATATTTAACGGGTCAAATCGAACAATATTTTAGTAGTGTGCTCGAGTTATTACAAATGGTATTAGAGCTAGGCTTTGTGTGCAAGCAAAGACGCTGGATTCCCTTGATATGTAACGGGTCAAAGCGAACAATATCTTAGCAGCGTGCTTGAGTTATTACAAATGTATTAGAGCCCGGTTTCGAACTAGATGCTGGATACATTAATACGTAACAGGCTAAAGTGAACAATATTTATTGGCCGTAGGCTCGGGCTCTTACAAATGATATTAGACTAGGGAGAACCCGTAGAACTTCAAAAGAAAAAGGCAAAATAAGATAATATCGGCTAGGTGTGAGCTTAGAGTACTCAAAAGTAAAAGTCGACAATATCTATTGGATCGGGTTTAGACTCAGGGCAACGCGTGAGTGTGACTGTGAGTGCGACTGTGCGAGAGGAAGGGCATTTTTTATGGCGTGGGCTCGCCAAGTTGGGTGAATGATTGGGCAATACAACAAGGCTCGTGCGTCTTTACAAGAACCGCAGCCCCGCCTGTTCTATATCTCAGGCAGGCAAAAAGGAGTGTTCTTAAATTTTTACAGATAATAAAGTAGACCCTCTCACACACACATCCAACAAAAAGCAGTAGGGGTCCAAAACACCAGACCCTGACTCACCTTTCTTCTTCTTCTTCTTCCTCTGTTTCCTCTGTAAAAATTGTCCTTAAAAACCCCACATCCCAAACCTTCCAATGCCAGAATCTTCCTCTTTCTAGGACACCATGAAATTAATTAGAGACACTAAAGCGAACCCTTCACCGGATTTATTGGTTTGTTTCCCTTCTCGCTCGCATTTCGCTTTAATGCCAAACCCACTCTGTAGTCCTGCCAGAGCATCTGATTCGAACAAGCTCCGCCGTTATCACCGGCGGAGGAGGTCGGCGGAGAGTCCGGTGGTTTGGGCCAAAGCGAAGACGATGGGGGGGTCTGAGGTGTCGGAACCGTCGTCGCCGAAAGTGACCTGTGCAGGGCAGATTAAGATGAGGCCGAAGAGCAGGAAGAGCTGGGAATCGGTGATGGAGGAGATAGAGAGAATTCATAATAGGAGGGAATTACGGCGGAGGAGGTTCAATTGGGTCGAATCTTTAGGGTTCAAGAAGGATATAATGCAATTCTTAACGTGTTTACGGAGCTTACGGTTTGATTTTGGGTGTTTCGGAGCTTTCCCTGAAGCAGAGTTCACCTCTGAAGACGAAGAAGAAGAGGACGTGGCTGTCGAGGGGAGTGATGGCTCCAGAACGGCGTTNATTCGGTTTGATTTTGGGTGTTTCGGAGCTTTCCCTGAAGCAGAGTTCACCTCTGAAGACGAAGAAGAAGAGGACGTGGCTGTCCAGGGGAGTGATGGCTCCAGAACGGCGTTTTCTAAATGGTTTATGGTTTTACAGGGAAGTGGGGTCCGGAGAGACGGCAACGGTCNCAGGGGAGTGATGGCTCCAGAACGGCGTTTTCTAAATGGTTCATGGTTTTACAGGGAAGTGGGGTCCGGAGAGACGGCAACGGTCTCTGTACAGTTGATGATGCATCGATTGGGCCGCCGATGGCGCCGCCCAGAAACGCGCTTTTGCTGATGCGCT
GCAGGTCTGCTCCGGCGAAGAGTTGGGTGGAGGAAGGATGTTCGGAGGAGGAAGAAGAAACAGAGGTGAAGGTGAAGAAGAGCTTGAAATGGCTAATGGAGGAAGAGAACAGAGAGAGCAGGGGTTTGGTTACGAGGAGTCAGAGTTGGAAGGTTTGATCTCAATGGGTAAGGAACTCCCATCTTCCACATTTTCACATTGGTTTGGTTGGTACAGTTTGAGTAA

mRNA sequence

ATGGAAGACACTAAAGCGAACCCTTCACCGGATTTATTGGTTTGTTTCCCTTCTCGCTCGCATTTCGCTTTAATGCCAAACCCACTCTGTAGTCCTGCCAGAGCATCTGATTCGAACAAGCTCCGCCGTTATCACCGGCGGAGGAGGTCGGCGGAGAGTCCGGTGGTTTGGGCCAAAGCGAAGACGATGGGGGGGTCTGAGGTGTCGGAACCGTCGTCGCCGAAAGTGACCTGTGCAGGGCAGATTAAGATGAGGCCGAAGAGCAGGAAGAGCTGGGAATCGGTGATGGAGGAGATAGAGAGAATTCATAATAGGAGGGAATTACGGCGGAGGAGGTTCAATTGGGTCGAATCTTTAGGGTTCAAGAAGGATATAATGCAATTCTTAACGTGTTTACGGAGCTTACGGTTTGATTTTGGGTGTTTCGGAGCTTTCCCTGAAGCAGAGTTCACCTCTGAAGACGAAGAAGAAGAGGACGTGGCTGTCGAGGGGAGTGATGGCTCCAGAACGGCGTTNATTCGGTTTGATTTTGGGTGTTTCGGAGCTTTCCCTGAAGCAGAGTTCACCTCTGAAGACGAAGAAGAAGAGGACGTGGCTGTCCAGGGGAGTGATGGCTCCAGAACGGCGTTTTCTAAATGGGAAGTGGGGTCCGGAGAGACGGCAACGGGAAGTGGGGTCCGGAGAGACGGCAACGGTCTCTGTACAGTTGATGATGCATCGATTGGGCCGCCGATGGCGCCGCCCAGAAACGCGCTTTTGCTGATGCGCTGCAGGTCTGCTCCGGCGAAGAGTTGGGTGGAGGAAGGATGTTCGGAGGAGGAAGAAGAAACAGAGGTGAAGGTGAAGAAGAGCTTGAAATGGCTAATGGAGGAAGAGAACAGAGAGAGCAGGGGTTTGGTTACGAGGAGTCAGAGTTGGAAGTTTGAGTAA

Coding sequence (CDS)

ATGGAAGACACTAAAGCGAACCCTTCACCGGATTTATTGGTTTGTTTCCCTTCTCGCTCGCATTTCGCTTTAATGCCAAACCCACTCTGTAGTCCTGCCAGAGCATCTGATTCGAACAAGCTCCGCCGTTATCACCGGCGGAGGAGGTCGGCGGAGAGTCCGGTGGTTTGGGCCAAAGCGAAGACGATGGGGGGGTCTGAGGTGTCGGAACCGTCGTCGCCGAAAGTGACCTGTGCAGGGCAGATTAAGATGAGGCCGAAGAGCAGGAAGAGCTGGGAATCGGTGATGGAGGAGATAGAGAGAATTCATAATAGGAGGGAATTACGGCGGAGGAGGTTCAATTGGGTCGAATCTTTAGGGTTCAAGAAGGATATAATGCAATTCTTAACGTGTTTACGGAGCTTACGGTTTGATTTTGGGTGTTTCGGAGCTTTCCCTGAAGCAGAGTTCACCTCTGAAGACGAAGAAGAAGAGGACGTGGCTGTCGAGGGGAGTGATGGCTCCAGAACGGCGTTNATTCGGTTTGATTTTGGGTGTTTCGGAGCTTTCCCTGAAGCAGAGTTCACCTCTGAAGACGAAGAAGAAGAGGACGTGGCTGTCCAGGGGAGTGATGGCTCCAGAACGGCGTTTTCTAAATGGGAAGTGGGGTCCGGAGAGACGGCAACGGGAAGTGGGGTCCGGAGAGACGGCAACGGTCTCTGTACAGTTGATGATGCATCGATTGGGCCGCCGATGGCGCCGCCCAGAAACGCGCTTTTGCTGATGCGCTGCAGGTCTGCTCCGGCGAAGAGTTGGGTGGAGGAAGGATGTTCGGAGGAGGAAGAAGAAACAGAGGTGAAGGTGAAGAAGAGCTTGAAATGGCTAATGGAGGAAGAGAACAGAGAGAGCAGGGGTTTGGTTACGAGGAGTCAGAGTTGGAAGTTTGAGTAA

Protein sequence

MEDTKANPSPDLLVCFPSRSHFALMPNPLCSPARASDSNKLRRYHRRRRSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEDVAVEGSDGSRTAXIRFDFGCFGAFPEAEFTSEDEEEEDVAVQGSDGSRTAFSKWEVGSGETATGSGVRRDGNGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKSWVEEGCSEEEEETEVKVKKSLKWLMEEENRESRGLVTRSQSWKFE
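
The protein sequence corresponds to a codon-by-codon translation of the CDS, and the single X in the protein lines up with the codon that contains the ambiguous base N. The sketch below illustrates this with the standard genetic code on short excerpts of the CDS above. The function name `translate` is hypothetical, and treating every codon that contains a non-ACGT base as X is a simplifying assumption rather than the documented behavior of the annotation pipeline.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        # Assumption: any codon with an ambiguous base (e.g. N) becomes X.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGAAGACACTAAAGCGAACCCT"))  # first 8 codons of the CDS
print(translate("ACGGCGTTNATTCGG"))           # in-frame excerpt around the N
print(translate("TGGAAGTTTGAGTAA"))           # last 5 codons, including the stop
```

The three excerpts give MEDTKANP, TAXIR, and WKFE, matching the start, the X-containing region, and the end of the protein sequence listed above.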
Homology
BLAST of Cp4.1LG06g05060 vs. NCBI nr
Match: XP_023535963.1 (uncharacterized protein LOC111797241 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023535964.1 uncharacterized protein LOC111797241 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 489 bits (1258), Expect = 1.10e-172
Identity = 258/307 (84.04%), Positives = 260/307 (84.69%), Query Frame = 0

Query: 1   MEDTKANPSPDLLVCFPSRSHFALMPNPLCSPARASDSNKLRRYHRRRRSAESPVVWAKA 60
           + DTKANPSPDLLVCFPSRSHFALMPNPLCSPARASDSNKLRRYHRRRRSAESPVVWAKA
Sbjct: 4   IRDTKANPSPDLLVCFPSRSHFALMPNPLCSPARASDSNKLRRYHRRRRSAESPVVWAKA 63

Query: 61  KTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLG 120
           KTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLG
Sbjct: 64  KTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLG 123

Query: 121 FKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEDVAVEGSDGSRTAXIRFDFGCF 180
           FKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEDVAVEGSDGSRTA         
Sbjct: 124 FKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEDVAVEGSDGSRTA--------- 183

Query: 181 GAFPEAEFTSEDEEEEDVAVQGSDGSRTAFSKWEVGSGETATGSGVRRDGNGLCTVDDAS 240
                                        FSKW +       GSGVRRDGNGLCTVDDAS
Sbjct: 184 -----------------------------FSKWFM----VLQGSGVRRDGNGLCTVDDAS 243

Query: 241 IGPPMAPPRNALLLMRCRSAPAKSWVEEGCSEEEEETEVKVKKSLKWLMEEENRESRGLV 300
           IGPPMAPPRNALLLMRCRSAPAKSWVEEGCSEEEEETEVKVKKSLKWLMEEENRESRGLV
Sbjct: 244 IGPPMAPPRNALLLMRCRSAPAKSWVEEGCSEEEEETEVKVKKSLKWLMEEENRESRGLV 268

Query: 301 TRSQSWK 307
           TRSQSWK
Sbjct: 304 TRSQSWK 268
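
The percentages in each HSP header are the matched columns divided by the alignment length. The sketch below recomputes them from a header line as a sanity check. The function name `hsp_fractions` is hypothetical, and the regular expression only assumes the `label = matched/length` layout used in the headers on this page.

```python
import re

def hsp_fractions(header):
    """Return (label, matched, length, percent) for each 'label = a/b' field."""
    results = []
    for label, matched, length in re.findall(r"(\w+) = (\d+)/(\d+)", header):
        matched, length = int(matched), int(length)
        results.append((label, matched, length, round(100 * matched / length, 2)))
    return results

header = "Identity = 258/307 (84.04%), Positives = 260/307 (84.69%), Query Frame = 0"
for label, matched, length, pct in hsp_fractions(header):
    print(f"{label}: {matched}/{length} = {pct}%")
```

For the first hit this reproduces 84.04% identity and 84.69% positives over 307 aligned columns.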

BLAST of Cp4.1LG06g05060 vs. NCBI nr
Match: XP_023535965.1 (uncharacterized protein LOC111797241 isoform X3 [Cucurbita pepo subsp. pepo] >XP_023535966.1 uncharacterized protein LOC111797241 isoform X4 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 488 bits (1257), Expect = 1.56e-172
Identity = 258/307 (84.04%), Positives = 260/307 (84.69%), Query Frame = 0

Query: 1   MEDTKANPSPDLLVCFPSRSHFALMPNPLCSPARASDSNKLRRYHRRRRSAESPVVWAKA 60
           + DTKANPSPDLLVCFPSRSHFALMPNPLCSPARASDSNKLRRYHRRRRSAESPVVWAKA
Sbjct: 4   IRDTKANPSPDLLVCFPSRSHFALMPNPLCSPARASDSNKLRRYHRRRRSAESPVVWAKA 63

Query: 61  KTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLG 120
           KTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLG
Sbjct: 64  KTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLG 123

Query: 121 FKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEDVAVEGSDGSRTAXIRFDFGCF 180
           FKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEDVAV                  
Sbjct: 124 FKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEDVAV------------------ 183

Query: 181 GAFPEAEFTSEDEEEEDVAVQGSDGSRTAFSKWEVGSGETATGSGVRRDGNGLCTVDDAS 240
                               QGSDGSRTAFSKW +       GSGVRRDGNGLCTVDDAS
Sbjct: 184 --------------------QGSDGSRTAFSKWFM----VLQGSGVRRDGNGLCTVDDAS 243

Query: 241 IGPPMAPPRNALLLMRCRSAPAKSWVEEGCSEEEEETEVKVKKSLKWLMEEENRESRGLV 300
           IGPPMAPPRNALLLMRCRSAPAKSWVEEGCSEEEEETEVKVKKSLKWLMEEENRESRGLV
Sbjct: 244 IGPPMAPPRNALLLMRCRSAPAKSWVEEGCSEEEEETEVKVKKSLKWLMEEENRESRGLV 268

Query: 301 TRSQSWK 307
           TRSQSWK
Sbjct: 304 TRSQSWK 268

BLAST of Cp4.1LG06g05060 vs. NCBI nr
Match: XP_022935869.1 (uncharacterized protein LOC111442647 [Cucurbita moschata] >XP_022935870.1 uncharacterized protein LOC111442647 [Cucurbita moschata])

HSP 1 Score: 479 bits (1233), Expect = 7.05e-169
Identity = 253/307 (82.41%), Positives = 257/307 (83.71%), Query Frame = 0

Query: 1   MEDTKANPSPDLLVCFPSRSHFALMPNPLCSPARASDSNKLRRYHRRRRSAESPVVWAKA 60
           + DTKANPSPDLLVCFPSRSHFALMPNPLCSPARASDSNKLRRYHRRR+SAESPVVWAKA
Sbjct: 4   IRDTKANPSPDLLVCFPSRSHFALMPNPLCSPARASDSNKLRRYHRRRKSAESPVVWAKA 63

Query: 61  KTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLG 120
           KTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLG
Sbjct: 64  KTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLG 123

Query: 121 FKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEDVAVEGSDGSRTAXIRFDFGCF 180
           FKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEE+V VEGSDGSRTA         
Sbjct: 124 FKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEEVGVEGSDGSRTA--------- 183

Query: 181 GAFPEAEFTSEDEEEEDVAVQGSDGSRTAFSKWEVGSGETATGSGVRRDGNGLCTVDDAS 240
                                        FSKW +       GSGVRRDGNGLCTVDDAS
Sbjct: 184 -----------------------------FSKWFM----VLQGSGVRRDGNGLCTVDDAS 243

Query: 241 IGPPMAPPRNALLLMRCRSAPAKSWVEEGCSEEEEETEVKVKKSLKWLMEEENRESRGLV 300
           IGPPMAPPRNALLLMRCRSAPAKSWVEEGCSEE EETEVKVKKSLKWLMEEENRESR LV
Sbjct: 244 IGPPMAPPRNALLLMRCRSAPAKSWVEEGCSEEGEETEVKVKKSLKWLMEEENRESRDLV 268

Query: 301 TRSQSWK 307
           TRSQSWK
Sbjct: 304 TRSQSWK 268

BLAST of Cp4.1LG06g05060 vs. NCBI nr
Match: XP_022977179.1 (uncharacterized protein LOC111477333 [Cucurbita maxima])

HSP 1 Score: 464 bits (1193), Expect = 8.72e-163
Identity = 245/307 (79.80%), Positives = 253/307 (82.41%), Query Frame = 0

Query: 1   MEDTKANPSPDLLVCFPSRSHFALMPNPLCSPARASDSNKLRRYHRRRRSAESPVVWAKA 60
           + D KA PSPDLLVCFPSRSHFALMPNPLCSP RASDSNKLRRYHRRR+SAESPVVWAKA
Sbjct: 4   IRDIKAIPSPDLLVCFPSRSHFALMPNPLCSPVRASDSNKLRRYHRRRKSAESPVVWAKA 63

Query: 61  KTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLG 120
           KT+GGSEVSEPSSPKVTCAGQIKMR KSRKSWESVMEEIERIHNRRELRRRRFNWVESLG
Sbjct: 64  KTIGGSEVSEPSSPKVTCAGQIKMRRKSRKSWESVMEEIERIHNRRELRRRRFNWVESLG 123

Query: 121 FKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEDVAVEGSDGSRTAXIRFDFGCF 180
           FKKDIMQFLTCLRS+RFDFGCFGAFPEAEFTSEDEEEE+V VEGSDGSRTA         
Sbjct: 124 FKKDIMQFLTCLRSIRFDFGCFGAFPEAEFTSEDEEEEEVGVEGSDGSRTA--------- 183

Query: 181 GAFPEAEFTSEDEEEEDVAVQGSDGSRTAFSKWEVGSGETATGSGVRRDGNGLCTVDDAS 240
                                        FSKW +       GSGVRRDGNGLCTVDDAS
Sbjct: 184 -----------------------------FSKWFM----VLQGSGVRRDGNGLCTVDDAS 243

Query: 241 IGPPMAPPRNALLLMRCRSAPAKSWVEEGCSEEEEETEVKVKKSLKWLMEEENRESRGLV 300
           IGPPMAPPRNALLLMRCRSAPAKSWVEE CSEEEE+TEVKVKKSLKWLMEEENRESR LV
Sbjct: 244 IGPPMAPPRNALLLMRCRSAPAKSWVEEACSEEEEDTEVKVKKSLKWLMEEENRESRDLV 268

Query: 301 TRSQSWK 307
           TRS+SWK
Sbjct: 304 TRSRSWK 268

BLAST of Cp4.1LG06g05060 vs. NCBI nr
Match: KAG6591755.1 (hypothetical protein SDJN03_14101, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 433 bits (1114), Expect = 3.52e-151
Identity = 231/283 (81.63%), Positives = 234/283 (82.69%), Query Frame = 0

Query: 25  MPNPLCSPARASDSNKLRRYHRRRRSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKM 84
           MPNPLCSPARASDSNKLRRYHRRR+SAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKM
Sbjct: 1   MPNPLCSPARASDSNKLRRYHRRRKSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQIKM 60

Query: 85  RPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKDIMQFLTCLRSLRFDFGCFGA 144
           RPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKDIMQFLTCLRSLRFDFGCFGA
Sbjct: 61  RPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKDIMQFLTCLRSLRFDFGCFGA 120

Query: 145 FPEAEFTSEDEEEEDVAVEGSDGSRTAXIRFDFGCFGAFPEAEFTSEDEEEEDVAVQGSD 204
           FPEAEFTSEDEEEE+V VEGSDGSRTA                                 
Sbjct: 121 FPEAEFTSEDEEEEEVGVEGSDGSRTA--------------------------------- 180

Query: 205 GSRTAFSKWEVGSGETATGSGVRRDGNGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKS 264
                FSKW +       GSGVRRD NGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKS
Sbjct: 181 -----FSKWFM----VLQGSGVRRDRNGLCTVDDASIGPPMAPPRNALLLMRCRSAPAKS 240

Query: 265 WVEEGCSEEEEETEVKVKKSLKWLMEEENRESRGLVTRSQSWK 307
           WVEEGCSEEEEETEVKVKKSLKWLMEEENRESR LVTRSQSWK
Sbjct: 241 WVEEGCSEEEEETEVKVKKSLKWLMEEENRESRDLVTRSQSWK 241

BLAST of Cp4.1LG06g05060 vs. ExPASy TrEMBL
Match: A0A6J1FBW7 (uncharacterized protein LOC111442647 OS=Cucurbita moschata OX=3662 GN=LOC111442647 PE=4 SV=1)

HSP 1 Score: 479 bits (1233), Expect = 3.41e-169
Identity = 253/307 (82.41%), Positives = 257/307 (83.71%), Query Frame = 0

Query: 1   MEDTKANPSPDLLVCFPSRSHFALMPNPLCSPARASDSNKLRRYHRRRRSAESPVVWAKA 60
           + DTKANPSPDLLVCFPSRSHFALMPNPLCSPARASDSNKLRRYHRRR+SAESPVVWAKA
Sbjct: 4   IRDTKANPSPDLLVCFPSRSHFALMPNPLCSPARASDSNKLRRYHRRRKSAESPVVWAKA 63

Query: 61  KTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLG 120
           KTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLG
Sbjct: 64  KTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLG 123

Query: 121 FKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEDVAVEGSDGSRTAXIRFDFGCF 180
           FKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEE+V VEGSDGSRTA         
Sbjct: 124 FKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEEVGVEGSDGSRTA--------- 183

Query: 181 GAFPEAEFTSEDEEEEDVAVQGSDGSRTAFSKWEVGSGETATGSGVRRDGNGLCTVDDAS 240
                                        FSKW +       GSGVRRDGNGLCTVDDAS
Sbjct: 184 -----------------------------FSKWFM----VLQGSGVRRDGNGLCTVDDAS 243

Query: 241 IGPPMAPPRNALLLMRCRSAPAKSWVEEGCSEEEEETEVKVKKSLKWLMEEENRESRGLV 300
           IGPPMAPPRNALLLMRCRSAPAKSWVEEGCSEE EETEVKVKKSLKWLMEEENRESR LV
Sbjct: 244 IGPPMAPPRNALLLMRCRSAPAKSWVEEGCSEEGEETEVKVKKSLKWLMEEENRESRDLV 268

Query: 301 TRSQSWK 307
           TRSQSWK
Sbjct: 304 TRSQSWK 268

BLAST of Cp4.1LG06g05060 vs. ExPASy TrEMBL
Match: A0A6J1IQQ3 (uncharacterized protein LOC111477333 OS=Cucurbita maxima OX=3661 GN=LOC111477333 PE=4 SV=1)

HSP 1 Score: 464 bits (1193), Expect = 4.22e-163
Identity = 245/307 (79.80%), Positives = 253/307 (82.41%), Query Frame = 0

Query: 1   MEDTKANPSPDLLVCFPSRSHFALMPNPLCSPARASDSNKLRRYHRRRRSAESPVVWAKA 60
           + D KA PSPDLLVCFPSRSHFALMPNPLCSP RASDSNKLRRYHRRR+SAESPVVWAKA
Sbjct: 4   IRDIKAIPSPDLLVCFPSRSHFALMPNPLCSPVRASDSNKLRRYHRRRKSAESPVVWAKA 63

Query: 61  KTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLG 120
           KT+GGSEVSEPSSPKVTCAGQIKMR KSRKSWESVMEEIERIHNRRELRRRRFNWVESLG
Sbjct: 64  KTIGGSEVSEPSSPKVTCAGQIKMRRKSRKSWESVMEEIERIHNRRELRRRRFNWVESLG 123

Query: 121 FKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEDVAVEGSDGSRTAXIRFDFGCF 180
           FKKDIMQFLTCLRS+RFDFGCFGAFPEAEFTSEDEEEE+V VEGSDGSRTA         
Sbjct: 124 FKKDIMQFLTCLRSIRFDFGCFGAFPEAEFTSEDEEEEEVGVEGSDGSRTA--------- 183

Query: 181 GAFPEAEFTSEDEEEEDVAVQGSDGSRTAFSKWEVGSGETATGSGVRRDGNGLCTVDDAS 240
                                        FSKW +       GSGVRRDGNGLCTVDDAS
Sbjct: 184 -----------------------------FSKWFM----VLQGSGVRRDGNGLCTVDDAS 243

Query: 241 IGPPMAPPRNALLLMRCRSAPAKSWVEEGCSEEEEETEVKVKKSLKWLMEEENRESRGLV 300
           IGPPMAPPRNALLLMRCRSAPAKSWVEE CSEEEE+TEVKVKKSLKWLMEEENRESR LV
Sbjct: 244 IGPPMAPPRNALLLMRCRSAPAKSWVEEACSEEEEDTEVKVKKSLKWLMEEENRESRDLV 268

Query: 301 TRSQSWK 307
           TRS+SWK
Sbjct: 304 TRSRSWK 268

BLAST of Cp4.1LG06g05060 vs. ExPASy TrEMBL
Match: A0A0A0L1Z4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G377750 PE=4 SV=1)

HSP 1 Score: 352 bits (902), Expect = 3.25e-118
Identity = 200/333 (60.06%), Positives = 231/333 (69.37%), Query Frame = 0

Query: 2   EDTKANPSPDLLVCFPSRSHFALMPNPLCSPARASDSNKLR----RYHRRRRSAESPVVW 61
           E +K  PS DLLVCFPSRSH ALMPNPLCSPAR SDS+K R    RYHRRR+SAESPVVW
Sbjct: 6   EKSKGIPSSDLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLDYRRYHRRRKSAESPVVW 65

Query: 62  AKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVE 121
           AKAKTMG SE+SEPSSPKVTCAGQIK+RPK+ KSW+SVMEEIERIHNRR+LRRRRFNW+E
Sbjct: 66  AKAKTMG-SEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRFNWIE 125

Query: 122 SLGFKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEDVAVEGSDGSRTAXIRFDF 181
           S GFKKDIMQFLTCLR++RFDF CF AFPE +FT+E+EEEE+   E              
Sbjct: 126 SFGFKKDIMQFLTCLRTMRFDFRCFRAFPETDFTTEEEEEEEEEEE-------------- 185

Query: 182 GCFGAFPEAEFTSEDEEEEDVAVQGSDGSRTAFSKWEVGSGETATGSGVRRDGNGLCTVD 241
                        E+EE+  V ++ ++ SRTAFSKW +   E  +   ++RD N  C  D
Sbjct: 186 -------------EEEEKNQVGIEENESSRTAFSKWFMVLQENGSNE-LKRDSNSRCYED 245

Query: 242 DASIGPPMAPPRNALLLMRCRSAPAKSWVEEGCSEEEEETE-------VKVKKSLKWLME 301
           D SI   MAPPRNALLLMRC+SAPA+ W+EE   EE++E E       VKVKKSLKWLME
Sbjct: 246 DESIEATMAPPRNALLLMRCKSAPARRWMEEESEEEDDEKEKEKEKEKVKVKKSLKWLME 305

Query: 302 EENRESRGL----------------VTRSQSWK 307
           EENRE   +                 TRSQSWK
Sbjct: 306 EENRERVVMEMGTDFCRMISDNAKEFTRSQSWK 309

BLAST of Cp4.1LG06g05060 vs. ExPASy TrEMBL
Match: A0A5D3D503 (Transcription initiation factor IIE subunit alpha-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G001490 PE=4 SV=1)

HSP 1 Score: 343 bits (880), Expect = 6.86e-115
Identity = 196/327 (59.94%), Positives = 227/327 (69.42%), Query Frame = 0

Query: 2   EDTKANPSPDLLVCFPSRSHFALMPNPLCSPARASDSNKLR----RYHRRRRSAESPVVW 61
           E +K  PS DLLVCFPSRSH ALMPNPLCSPAR SDS+K R    R+HRRR+SAESPVVW
Sbjct: 13  EKSKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLDHRRFHRRRKSAESPVVW 72

Query: 62  AKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVE 121
           AKAKTMG SE+SEPSSPKVTCAGQIK+RPK+ KSW+SVMEEIERIHNRR+LRRRRF WVE
Sbjct: 73  AKAKTMG-SEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRFRWVE 132

Query: 122 SLGFKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEDVAVEGSDGSRTAXIRFDF 181
           S GFKKDIMQFLTCLR++RFDF CF AFPE +FT+E+EEEE+                  
Sbjct: 133 SFGFKKDIMQFLTCLRTIRFDFRCFRAFPETDFTTEEEEEEEE----------------- 192

Query: 182 GCFGAFPEAEFTSEDEEEEDVAVQGSDGSRTAFSKWEVGSGETATGSGVRRDGNGLCTVD 241
                        E++E+  V ++ ++ SRTAFSKW +   E  +   ++RD   LC  D
Sbjct: 193 ------------EEEDEKNQVGIEENESSRTAFSKWFMVLQENGSNE-LKRDSKSLCNED 252

Query: 242 DASIGPPMAPPRNALLLMRCRSAPAKSWVEEGCSEEEEETE-VKVKKSLKWLMEEENRES 301
           D SI   MAPP NALLLMRCRSAPA+ W+EE   E ++E E VKVKKSLKWLMEEENRE 
Sbjct: 253 DESIEAIMAPPINALLLMRCRSAPARRWMEEESEEGDDEKEKVKVKKSLKWLMEEENRER 308

Query: 302 RGL----------------VTRSQSWK 307
             +                 TRSQSWK
Sbjct: 313 LVVEMGTDFCRMTSDNAKEFTRSQSWK 308

BLAST of Cp4.1LG06g05060 vs. ExPASy TrEMBL
Match: A0A1S3B949 (uncharacterized protein LOC103487551 OS=Cucumis melo OX=3656 GN=LOC103487551 PE=4 SV=1)

HSP 1 Score: 343 bits (880), Expect = 6.86e-115
Identity = 196/327 (59.94%), Positives = 227/327 (69.42%), Query Frame = 0

Query: 2   EDTKANPSPDLLVCFPSRSHFALMPNPLCSPARASDSNKLR----RYHRRRRSAESPVVW 61
           E +K  PS DLLVCFPSRSH ALMPNPLCSPAR SDS+K R    R+HRRR+SAESPVVW
Sbjct: 13  EKSKGIPSADLLVCFPSRSHLALMPNPLCSPARGSDSSKFRLDHRRFHRRRKSAESPVVW 72

Query: 62  AKAKTMGGSEVSEPSSPKVTCAGQIKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVE 121
           AKAKTMG SE+SEPSSPKVTCAGQIK+RPK+ KSW+SVMEEIERIHNRR+LRRRRF WVE
Sbjct: 73  AKAKTMG-SEISEPSSPKVTCAGQIKIRPKNSKSWQSVMEEIERIHNRRKLRRRRFRWVE 132

Query: 122 SLGFKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEDVAVEGSDGSRTAXIRFDF 181
           S GFKKDIMQFLTCLR++RFDF CF AFPE +FT+E+EEEE+                  
Sbjct: 133 SFGFKKDIMQFLTCLRTIRFDFRCFRAFPETDFTTEEEEEEEE----------------- 192

Query: 182 GCFGAFPEAEFTSEDEEEEDVAVQGSDGSRTAFSKWEVGSGETATGSGVRRDGNGLCTVD 241
                        E++E+  V ++ ++ SRTAFSKW +   E  +   ++RD   LC  D
Sbjct: 193 ------------EEEDEKNQVGIEENESSRTAFSKWFMVLQENGSNE-LKRDSKSLCNED 252

Query: 242 DASIGPPMAPPRNALLLMRCRSAPAKSWVEEGCSEEEEETE-VKVKKSLKWLMEEENRES 301
           D SI   MAPP NALLLMRCRSAPA+ W+EE   E ++E E VKVKKSLKWLMEEENRE 
Sbjct: 253 DESIEAIMAPPINALLLMRCRSAPARRWMEEESEEGDDEKEKVKVKKSLKWLMEEENRER 308

Query: 302 RGL----------------VTRSQSWK 307
             +                 TRSQSWK
Sbjct: 313 LVVEMGTDFCRMTSDNAKEFTRSQSWK 308

BLAST of Cp4.1LG06g05060 vs. TAIR 10
Match: AT1G78110.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 7 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G22230.1); Has 5452 Blast hits to 3541 proteins in 289 species: Archae - 4; Bacteria - 165; Metazoa - 1756; Fungi - 532; Plants - 205; Viruses - 141; Other Eukaryotes - 2649 (source: NCBI BLink). )

HSP 1 Score: 211.1 bits (536), Expect = 1.3e-54
Identity = 141/323 (43.65%), Positives = 178/323 (55.11%), Query Frame = 0

Query: 9   SPDLLVCFPSRSHFALMPNPLCSPARASDSNKLRRYHRRRRSAE----------SPVVWA 68
           S DLLVCFPSR+H AL P P+CSP+R SDS+  RR H RR+ ++          SPV+WA
Sbjct: 17  SADLLVCFPSRTHLALTPKPICSPSRPSDSSTNRRPHHRRQLSKLSGGGGGGHGSPVLWA 76

Query: 69  K---AKTMGGSEVSEPSSPKVTCAGQIKMRPKS----RKSWESVMEEIERIHNRRELRRR 128
           K   +K MGG E++EP+SPKVTCAGQIK+RP       K+W+SVMEEIERIH+ R   + 
Sbjct: 77  KQASSKNMGGDEIAEPTSPKVTCAGQIKVRPSKCGGRGKNWQSVMEEIERIHDNRSQSK- 136

Query: 129 RFNWVESLGFKKDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEDVAVEGSDGSRTA 188
                   G KKD+M FLTCLR+++FDF CFG F  A+ TS+D+EEED            
Sbjct: 137 ------FFGLKKDVMGFLTCLRNIKFDFRCFGDFRHADVTSDDDEEED------------ 196

Query: 189 XIRFDFGCFGAFPEAEFTSEDEEEEDVAVQGSDGSRTAFSKWEVGSGETATGSGVRRDGN 248
                              +DEEEE V  +  + S+T FSKW +   E        ++ N
Sbjct: 197 -----------------DDDDEEEEVVEGEEEENSKTVFSKWFMVLQEEQNNKDDDKNNN 256

Query: 249 GLCTVDDA--SIGPPMAPPRNALLLMRCRSAPAKSWVEEGC----------------SEE 296
                 D   +   P  PP NALLLMRCRSAPAKSW+EE                    E
Sbjct: 257 KCDEKRDLEDTETEPAVPPPNALLLMRCRSAPAKSWLEERMKVKTEQEKREEQKEEKETE 303

BLAST of Cp4.1LG06g05060 vs. TAIR 10
Match: AT1G22230.1 (unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G78110.1); Has 2358 Blast hits to 1759 proteins in 159 species: Archae - 2; Bacteria - 36; Metazoa - 1046; Fungi - 203; Plants - 157; Viruses - 72; Other Eukaryotes - 842 (source: NCBI BLink). )

HSP 1 Score: 152.5 bits (384), Expect = 5.4e-37
Identity = 120/308 (38.96%), Positives = 150/308 (48.70%), Query Frame = 0

Query: 9   SPDLLVCFPSRSHFALMPNPLCSPARASDSNKLRRYHRRRRSAESPVVWAKAKTMGGSE- 68
           S DL+VCFPSR+H +L    + SP+ + +  +   +HRR  S  S       +  GG   
Sbjct: 13  SADLMVCFPSRAHLSLPSKSISSPSSSFNRRQNAPHHRRSISKLSSSGGGVRQNRGGGRE 72

Query: 69  -VSEPSSPKVTCAGQIKMRPKSR----KSWESVMEEIERIHNRRELRRRRFNWVESLGFK 128
            V EP+SPKVTCAGQIK+R   R    K+W+S+M EIE+IH  R     +F      G K
Sbjct: 73  VVEEPTSPKVTCAGQIKVRSSKRDGGGKNWQSLMAEIEKIH--RSKSESKF-----FGIK 132

Query: 129 KDIMQFLTCLRSLRFDFGCFGAFPEAEFTSEDEEEEDVAVEGSDGSRTAXIRFDFGCFGA 188
           +D+M FLTCLR   FDF CFGAFP  +  S+DEEE+                        
Sbjct: 133 RDVMGFLTCLRD--FDFRCFGAFPPVDIISDDEEED------------------------ 192

Query: 189 FPEAEFTSEDEEEEDVAVQGSDGSRTAFSKWEVGSGETATGSG-VRRDGNGLCTVDDASI 248
                   E+EEEED      + S T FSKW +   E       V    N    V+ A  
Sbjct: 193 --------EEEEEEDEEEDEDESSGTVFSKWLMVLHEKQNNEECVDGKENVFSDVETA-- 252

Query: 249 GPPMAPPRNALLLMRCRSAPAKSWVEE---------------GCSEEEEETEVKVKKSLK 295
                PP NALLLMRCRSAP K+W EE               G  EEEE+  V  KK L+
Sbjct: 253 ----VPPPNALLLMRCRSAPVKNWSEEKKEETEEGDNRVKQSGEEEEEEKDRVGNKKDLR 273

BLAST of Cp4.1LG06g05060 vs. TAIR 10
Match: AT2G37100.1 (protamine P1 family protein )

HSP 1 Score: 45.4 bits (106), Expect = 9.4e-05
Identity = 66/260 (25.38%), Positives = 102/260 (39.23%), Query Frame = 0

Query: 28  PLCSPARASDSNKLRRY------HRRRRSAESPVVWAKAKTMGGSEVSEPSSPKVTCAGQ 87
           P+ SP R  +   L R+       R R  +  P+ + +      +E  EP+SPKVTC GQ
Sbjct: 8   PVSSPGRTENPPLLMRFLRTKSRSRSRSRSRRPIFFRRKNASAAAETQEPTSPKVTCMGQ 67

Query: 88  IKMRPKSRKSWESVMEEIERIHNRRELRRRRFNWVESLGFKKDIMQFL--TCLRSLRFDF 147
           +++    +   E+          RR+   RR  WV++          +  TC   +   +
Sbjct: 68  VRINRSKKPKPETARVSGGATERRRQ--SRRCGWVKNAFPCHSFTGIIKPTCFSPV---W 127

Query: 148 GCFGAFPEAEFTSEDEEEEDVAVEGSDGSRTAXIRFDFGCFGAFP-EAEFTSEDEEEEDV 207
             + +F  A F+ + E+        S  SR+  I   FG     P E E T ++E +E+ 
Sbjct: 128 RKWKSFSHASFSKKSEKR-------SSSSRSEPI---FGRSTVEPEEPEETRKEENQEE- 187

Query: 208 AVQGSDGSRTAFSKWEVGSGETATGSGVRRDGNGLCTVDDASIGPPMAPPRNALLLMRCR 267
                          E  S ++ T +                      PPRNA LL RCR
Sbjct: 188 ---------------EASSCKSFTAT----------------------PPRNAFLLTRCR 214

Query: 268 SAPAKS-WVEEGCSEEEEET 278
           SAP +S        E++EET
Sbjct: 248 SAPYRSPSSANSLFEDQEET 214

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value    Identity (%)  Description
XP_023535963.1  1.10e-172  84.04         uncharacterized protein LOC111797241 isoform X1 [Cucurbita pepo subsp. pepo] >XP...
XP_023535965.1  1.56e-172  84.04         uncharacterized protein LOC111797241 isoform X3 [Cucurbita pepo subsp. pepo] >XP...
XP_022935869.1  7.05e-169  82.41         uncharacterized protein LOC111442647 [Cucurbita moschata] >XP_022935870.1 unchar...
XP_022977179.1  8.72e-163  79.80         uncharacterized protein LOC111477333 [Cucurbita maxima]
KAG6591755.1    3.52e-151  81.63         hypothetical protein SDJN03_14101, partial [Cucurbita argyrosperma subsp. sorori...

ExPASy TrEMBL
Match Name      E-value    Identity (%)  Description
A0A6J1FBW7      3.41e-169  82.41         uncharacterized protein LOC111442647 OS=Cucurbita moschata OX=3662 GN=LOC1114426...
A0A6J1IQQ3      4.22e-163  79.80         uncharacterized protein LOC111477333 OS=Cucurbita maxima OX=3661 GN=LOC111477333...
A0A0A0L1Z4      3.25e-118  60.06         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G377750 PE=4 SV=1
A0A5D3D503      6.86e-115  59.94         Transcription initiation factor IIE subunit alpha-like OS=Cucumis melo var. maku...
A0A1S3B949      6.86e-115  59.94         uncharacterized protein LOC103487551 OS=Cucumis melo OX=3656 GN=LOC103487551 PE=...

TAIR 10
Match Name      E-value    Identity (%)  Description
AT1G78110.1     1.3e-54    43.65         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G22230.1     5.4e-37    38.96         unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein matc...
AT2G37100.1     9.4e-05    25.38         protamine P1 family protein

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source   Source Term    Source Description                  Alignment
None      No IPR available  PANTHER  PTHR33448      CHLOROPLAST PROTEIN HCF243-RELATED  coord: 171..293
None      No IPR available  PANTHER  PTHR33448:SF3  OS09G0370000 PROTEIN                coord: 4..161
None      No IPR available  PANTHER  PTHR33448      CHLOROPLAST PROTEIN HCF243-RELATED  coord: 4..161
None      No IPR available  PANTHER  PTHR33448:SF3  OS09G0370000 PROTEIN                coord: 171..293

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG06g05060.1  Cp4.1LG06g05060.1  mRNA