Cp4.1LG04g06570 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG04g06570
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionLINE-1 retrotransposable element ORF2 protein
LocationCp4.1LG04: 5100770 .. 5101385 (+)
RNA-Seq ExpressionCp4.1LG04g06570
SyntenyCp4.1LG04g06570
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
TCAAACTTGAGAAATCATTGAAACTTCTTTTTCACATTACCATCACAATGTTCTTGATCAGAGTTCAAGATACTAAGCCCATCACGGACGCAACCTTCCTATTCAAGCAATTCATTAACGAAAAAGCCGACTTAGATTTCAACCGCAACAGCTTCAGCATAATTGCCTCAAACCCTTCCCTTCGCTTCATAGCAAGGTTTTACATTTCCAAGAAATATTGCCAAGACTTTTTCATCAATCAAACTCACATTGCCAAAATTTCCCTTCCATCCTTTAGTGATACCATCATGACTGCCGCTGCTACCCGCTTTGATACAATGAGTATCACTCTTCCAAGCCCTTATAAAATGACCCTTACATTCGAGACATCAAGTGAGTATCTATTAATACCCCACAACAACATTGCCAAGAACTATGATTCTTGTTTTCTTTACCTTTCAAGAACTCACTTTTCTTAATAGCTCCTCCAGGGCGTGTGCCTCGGTCGAATTCACGTGCGCTGCCACTGTCACCTACGCAAGCGATGGATTTTGGAAGGCCTTTACTCGCAAAACATTTCACCATTAAACCCGAATGTTTTAGACACATTCTTACCGAACTACCTAACTCGCAAGAT

mRNA sequence

TCAAACTTGAGAAATCATTGAAACTTCTTTTTCACATTACCATCACAATGTTCTTGATCAGAGTTCAAGATACTAAGCCCATCACGGACGCAACCTTCCTATTCAAGCAATTCATTAACGAAAAAGCCGACTTAGATTTCAACCGCAACAGCTTCAGCATAATTGCCTCAAACCCTTCCCTTCGCTTCATAGCAAGGTTTTACATTTCCAAGAAATATTGCCAAGACTTTTTCATCAATCAAACTCACATTGCCAAAATTTCCCTTCCATCCTTTAGTGATACCATCATGACTGCCGCTGCTACCCGCTTTGATACAATGAGTATCACTCTTCCAAGCCCTTATAAAATGACCCTTACATTCGAGACATCAAGGCGTGTGCCTCGGTCGAATTCACGTGCGCTGCCACTGTCACCTACGCAAGCGATGGATTTTGGAAGGCCTTTACTCGCAAAACATTTCACCATTAAACCCGAATGTTTTAGACACATTCTTACCGAACTACCTAACTCGCAAGAT

Coding sequence (CDS)

ATGTTCTTGATCAGAGTTCAAGATACTAAGCCCATCACGGACGCAACCTTCCTATTCAAGCAATTCATTAACGAAAAAGCCGACTTAGATTTCAACCGCAACAGCTTCAGCATAATTGCCTCAAACCCTTCCCTTCGCTTCATAGCAAGGTTTTACATTTCCAAGAAATATTGCCAAGACTTTTTCATCAATCAAACTCACATTGCCAAAATTTCCCTTCCATCCTTTAGTGATACCATCATGACTGCCGCTGCTACCCGCTTTGATACAATGAGTATCACTCTTCCAAGCCCTTATAAAATGACCCTTACATTCGAGACATCAAGGCGTGTGCCTCGGTCGAATTCACGTGCGCTGCCACTGTCACCTACGCAAGCGATGGATTTTGGAAGGCCTTTACTCGCAAAACATTTCACCATTAAACCCGAATGTTTTAGACACATTCTTACCGAACTACCTAACTCGCAAGAT

Protein sequence

MFLIRVQDTKPITDATFLFKQFINEKADLDFNRNSFSIIASNPSLRFIARFYISKKYCQDFFINQTHIAKISLPSFSDTIMTAAATRFDTMSITLPSPYKMTLTFETSRRVPRSNSRALPLSPTQAMDFGRPLLAKHFTIKPECFRHILTELPNSQD
Homology
BLAST of Cp4.1LG04g06570 vs. NCBI nr
Match: KAG6588842.1 (hypothetical protein SDJN03_17407, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 294 bits (752), Expect = 1.62e-98
Identity = 152/160 (95.00%), Postives = 153/160 (95.62%), Query Frame = 0

Query: 1   MFLIRVQDTKPITDATFLFKQFINEKADLDFNRNSFSIIASNPSLRFIARFYISKKYCQD 60
           MFLIRVQDTKPITDATFLFKQFINEKADL FNRNSFSIIASNPSLRFIARFYISKKYCQD
Sbjct: 1   MFLIRVQDTKPITDATFLFKQFINEKADLGFNRNSFSIIASNPSLRFIARFYISKKYCQD 60

Query: 61  FFINQTHIAKISLPSFSDTIMTAAATRFDTMSITLPSPYKMTLTFETSR---RVPRSNSR 120
           FFINQTHIAKISLPSFSD IMTAAATRFDTMSITLPSPYKMTLTFE S    RVPRSNSR
Sbjct: 61  FFINQTHIAKISLPSFSDAIMTAAATRFDTMSITLPSPYKMTLTFEISTPPGRVPRSNSR 120

Query: 121 ALPLSPTQAMDFGRPLLAKHFTIKPECFRHILTELPNSQD 157
           ALPLSPTQAMDFGRPLL+KHFTIKPECFRHILTELPNSQD
Sbjct: 121 ALPLSPTQAMDFGRPLLSKHFTIKPECFRHILTELPNSQD 160

BLAST of Cp4.1LG04g06570 vs. NCBI nr
Match: XP_022928486.1 (uncharacterized protein LOC111435280 isoform X2 [Cucurbita moschata])

HSP 1 Score: 213 bits (542), Expect = 1.21e-66
Identity = 110/157 (70.06%), Postives = 127/157 (80.89%), Query Frame = 0

Query: 1   MFLIRVQDTKPITDATFLFKQFINEKADLDFNRNSFSIIASNPSLRFIARFYISKKYCQD 60
           M LIRVQD  PITDA FLF +FIN +ADL+F  NS +IIA+NP+LRFIA  YISK +CQD
Sbjct: 1   MLLIRVQDANPITDANFLFAEFINHEADLEFKPNSLTIIATNPTLRFIATLYISKTFCQD 60

Query: 61  FFINQTHIAKISLPSFSDTIMTAAATRFDTMSITLPSPYKMTLTFETSRRVPRSNSRALP 120
           F INQTHIA++SL SF D IMTAAAT FDT++ITLPS Y M LTFETSRRV +S+ RALP
Sbjct: 61  FTINQTHIARVSLTSFIDAIMTAAATSFDTLAITLPSAYIMILTFETSRRVLQSHPRALP 120

Query: 121 LSPTQAMDFGRPLLAKHFTIKPECFRHILTELPNSQD 157
           +SP+  MDF +P+LAKHFTIK ECFR IL ELP  QD
Sbjct: 121 MSPSLMMDFPKPILAKHFTIKAECFRRILEELPLWQD 157

BLAST of Cp4.1LG04g06570 vs. NCBI nr
Match: XP_022989711.1 (uncharacterized protein LOC111486711 [Cucurbita maxima])

HSP 1 Score: 211 bits (536), Expect = 1.08e-65
Identity = 109/160 (68.12%), Postives = 128/160 (80.00%), Query Frame = 0

Query: 1   MFLIRVQDTKPITDATFLFKQFINEKADLDFNRNSFSIIASNPSLRFIARFYISKKYCQD 60
           M LIRVQD  PITDA FLF +FIN +ADL+F  NS +IIA+NP+LRFIA  YISK +CQD
Sbjct: 1   MLLIRVQDANPITDANFLFAEFINHEADLEFKPNSLTIIATNPTLRFIATLYISKSFCQD 60

Query: 61  FFINQTHIAKISLPSFSDTIMTAAATRFDTMSITLPSPYKMTLTFETSR---RVPRSNSR 120
           F INQTHIA++SL SF D IMTAAATRFDT++ITLPS Y M LTFETS    RVP+S+ R
Sbjct: 61  FTINQTHIARVSLTSFIDAIMTAAATRFDTLAITLPSAYIMILTFETSTPSGRVPQSHPR 120

Query: 121 ALPLSPTQAMDFGRPLLAKHFTIKPECFRHILTELPNSQD 157
           ALP+SP+  +DF +P+LAKHFTIK ECFR +L ELP  QD
Sbjct: 121 ALPMSPSLMVDFPKPILAKHFTIKAECFRRVLAELPLLQD 160

BLAST of Cp4.1LG04g06570 vs. NCBI nr
Match: XP_022928485.1 (uncharacterized protein LOC111435280 isoform X1 [Cucurbita moschata])

HSP 1 Score: 205 bits (522), Expect = 1.43e-63
Identity = 109/160 (68.12%), Postives = 126/160 (78.75%), Query Frame = 0

Query: 1   MFLIRVQDTKPITDATFLFKQFINEKADLDFNRNSFSIIASNPSLRFIARFYISKKYCQD 60
           M LIRVQD  PITDA FLF +FIN +ADL+F  NS +IIA+NP+LRFIA  YISK +CQD
Sbjct: 1   MLLIRVQDANPITDANFLFAEFINHEADLEFKPNSLTIIATNPTLRFIATLYISKTFCQD 60

Query: 61  FFINQTHIAKISLPSFSDTIMTAAATRFDTMSITLPSPYKMTLTFETSR---RVPRSNSR 120
           F INQTHIA++SL SF D IMTAAAT FDT++ITLPS Y M LTFETS    RV +S+ R
Sbjct: 61  FTINQTHIARVSLTSFIDAIMTAAATSFDTLAITLPSAYIMILTFETSTPSGRVLQSHPR 120

Query: 121 ALPLSPTQAMDFGRPLLAKHFTIKPECFRHILTELPNSQD 157
           ALP+SP+  MDF +P+LAKHFTIK ECFR IL ELP  QD
Sbjct: 121 ALPMSPSLMMDFPKPILAKHFTIKAECFRRILEELPLWQD 160

BLAST of Cp4.1LG04g06570 vs. NCBI nr
Match: KAG6588843.1 (hypothetical protein SDJN03_17408, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 193 bits (491), Expect = 1.73e-58
Identity = 109/190 (57.37%), Postives = 126/190 (66.32%), Query Frame = 0

Query: 1   MFLIRVQDTKPITDATFLFKQFINEKADLDFNRNSFSIIASNPSLRFIARFYISKKYCQD 60
           M LIRVQD  PITDA FLF +FI+ +ADL+F  NS +IIA NP+LRFIA  YISK +CQD
Sbjct: 1   MLLIRVQDANPITDANFLFAEFISHEADLEFKPNSLTIIAKNPTLRFIATLYISKTFCQD 60

Query: 61  FFINQTHIAKISLPSFSDTIMTAAATRFDTMSITLPSPYKMTLTFETSR----------- 120
           F INQTHIA++SL SF D IMTAAATRFDT++ITLPS Y M LTFETS            
Sbjct: 61  FTINQTHIARVSLISFIDAIMTAAATRFDTLAITLPSAYIMILTFETSSEYLLIPHNNIA 120

Query: 121 ----------------------RVPRSNSRALPLSPTQAMDFGRPLLAKHFTIKPECFRH 157
                                 RV +S+ RALP+SP+  MDF +P+LAKHFTIK ECFR 
Sbjct: 121 KNYDSCFLYLSRTHFLFIAPSGRVLQSHPRALPMSPSLMMDFPKPILAKHFTIKAECFRR 180

BLAST of Cp4.1LG04g06570 vs. ExPASy TrEMBL
Match: A0A6J1EKE7 (uncharacterized protein LOC111435280 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111435280 PE=4 SV=1)

HSP 1 Score: 213 bits (542), Expect = 5.85e-67
Identity = 110/157 (70.06%), Postives = 127/157 (80.89%), Query Frame = 0

Query: 1   MFLIRVQDTKPITDATFLFKQFINEKADLDFNRNSFSIIASNPSLRFIARFYISKKYCQD 60
           M LIRVQD  PITDA FLF +FIN +ADL+F  NS +IIA+NP+LRFIA  YISK +CQD
Sbjct: 1   MLLIRVQDANPITDANFLFAEFINHEADLEFKPNSLTIIATNPTLRFIATLYISKTFCQD 60

Query: 61  FFINQTHIAKISLPSFSDTIMTAAATRFDTMSITLPSPYKMTLTFETSRRVPRSNSRALP 120
           F INQTHIA++SL SF D IMTAAAT FDT++ITLPS Y M LTFETSRRV +S+ RALP
Sbjct: 61  FTINQTHIARVSLTSFIDAIMTAAATSFDTLAITLPSAYIMILTFETSRRVLQSHPRALP 120

Query: 121 LSPTQAMDFGRPLLAKHFTIKPECFRHILTELPNSQD 157
           +SP+  MDF +P+LAKHFTIK ECFR IL ELP  QD
Sbjct: 121 MSPSLMMDFPKPILAKHFTIKAECFRRILEELPLWQD 157

BLAST of Cp4.1LG04g06570 vs. ExPASy TrEMBL
Match: A0A6J1JKX5 (uncharacterized protein LOC111486711 OS=Cucurbita maxima OX=3661 GN=LOC111486711 PE=4 SV=1)

HSP 1 Score: 211 bits (536), Expect = 5.22e-66
Identity = 109/160 (68.12%), Postives = 128/160 (80.00%), Query Frame = 0

Query: 1   MFLIRVQDTKPITDATFLFKQFINEKADLDFNRNSFSIIASNPSLRFIARFYISKKYCQD 60
           M LIRVQD  PITDA FLF +FIN +ADL+F  NS +IIA+NP+LRFIA  YISK +CQD
Sbjct: 1   MLLIRVQDANPITDANFLFAEFINHEADLEFKPNSLTIIATNPTLRFIATLYISKSFCQD 60

Query: 61  FFINQTHIAKISLPSFSDTIMTAAATRFDTMSITLPSPYKMTLTFETSR---RVPRSNSR 120
           F INQTHIA++SL SF D IMTAAATRFDT++ITLPS Y M LTFETS    RVP+S+ R
Sbjct: 61  FTINQTHIARVSLTSFIDAIMTAAATRFDTLAITLPSAYIMILTFETSTPSGRVPQSHPR 120

Query: 121 ALPLSPTQAMDFGRPLLAKHFTIKPECFRHILTELPNSQD 157
           ALP+SP+  +DF +P+LAKHFTIK ECFR +L ELP  QD
Sbjct: 121 ALPMSPSLMVDFPKPILAKHFTIKAECFRRVLAELPLLQD 160

BLAST of Cp4.1LG04g06570 vs. ExPASy TrEMBL
Match: A0A6J1ERT1 (uncharacterized protein LOC111435280 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111435280 PE=4 SV=1)

HSP 1 Score: 205 bits (522), Expect = 6.93e-64
Identity = 109/160 (68.12%), Postives = 126/160 (78.75%), Query Frame = 0

Query: 1   MFLIRVQDTKPITDATFLFKQFINEKADLDFNRNSFSIIASNPSLRFIARFYISKKYCQD 60
           M LIRVQD  PITDA FLF +FIN +ADL+F  NS +IIA+NP+LRFIA  YISK +CQD
Sbjct: 1   MLLIRVQDANPITDANFLFAEFINHEADLEFKPNSLTIIATNPTLRFIATLYISKTFCQD 60

Query: 61  FFINQTHIAKISLPSFSDTIMTAAATRFDTMSITLPSPYKMTLTFETSR---RVPRSNSR 120
           F INQTHIA++SL SF D IMTAAAT FDT++ITLPS Y M LTFETS    RV +S+ R
Sbjct: 61  FTINQTHIARVSLTSFIDAIMTAAATSFDTLAITLPSAYIMILTFETSTPSGRVLQSHPR 120

Query: 121 ALPLSPTQAMDFGRPLLAKHFTIKPECFRHILTELPNSQD 157
           ALP+SP+  MDF +P+LAKHFTIK ECFR IL ELP  QD
Sbjct: 121 ALPMSPSLMMDFPKPILAKHFTIKAECFRRILEELPLWQD 160

BLAST of Cp4.1LG04g06570 vs. ExPASy TrEMBL
Match: A0A6J1EK29 (uncharacterized protein LOC111435280 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111435280 PE=4 SV=1)

HSP 1 Score: 147 bits (370), Expect = 6.55e-42
Identity = 75/80 (93.75%), Postives = 76/80 (95.00%), Query Frame = 0

Query: 81  MTAAATRFDTMSITLPSPYKMTLTFETSR---RVPRSNSRALPLSPTQAMDFGRPLLAKH 140
           MTAAATRFDTMSITLPSPYKMTLTFETS    RVPRSNSRALPLSPTQAMDFGRPLL+KH
Sbjct: 1   MTAAATRFDTMSITLPSPYKMTLTFETSTPPGRVPRSNSRALPLSPTQAMDFGRPLLSKH 60

Query: 141 FTIKPECFRHILTELPNSQD 157
           FTIKPECFRHILTELPNSQD
Sbjct: 61  FTIKPECFRHILTELPNSQD 80

BLAST of Cp4.1LG04g06570 vs. ExPASy TrEMBL
Match: A0A6J1JKZ5 (uncharacterized protein LOC111486730 OS=Cucurbita maxima OX=3661 GN=LOC111486730 PE=4 SV=1)

HSP 1 Score: 138 bits (348), Expect = 1.40e-38
Identity = 72/80 (90.00%), Postives = 74/80 (92.50%), Query Frame = 0

Query: 81  MTAAATRFDTMSITLPSPYKMTLTFETSR---RVPRSNSRALPLSPTQAMDFGRPLLAKH 140
           MTAAATRFDTMSITLPSPYK+TLTFETS    RVPRSNSRALPLSPT AMDFG+PLLAKH
Sbjct: 1   MTAAATRFDTMSITLPSPYKITLTFETSTPPGRVPRSNSRALPLSPTLAMDFGKPLLAKH 60

Query: 141 FTIKPECFRHILTELPNSQD 157
           FTIKPE FRHILTELPNSQD
Sbjct: 61  FTIKPEFFRHILTELPNSQD 80

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
KAG6588842.11.62e-9895.00hypothetical protein SDJN03_17407, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_022928486.11.21e-6670.06uncharacterized protein LOC111435280 isoform X2 [Cucurbita moschata][more]
XP_022989711.11.08e-6568.13uncharacterized protein LOC111486711 [Cucurbita maxima][more]
XP_022928485.11.43e-6368.13uncharacterized protein LOC111435280 isoform X1 [Cucurbita moschata][more]
KAG6588843.11.73e-5857.37hypothetical protein SDJN03_17408, partial [Cucurbita argyrosperma subsp. sorori... [more]
Match NameE-valueIdentityDescription
A0A6J1EKE75.85e-6770.06uncharacterized protein LOC111435280 isoform X2 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1JKX55.22e-6668.13uncharacterized protein LOC111486711 OS=Cucurbita maxima OX=3661 GN=LOC111486711... [more]
A0A6J1ERT16.93e-6468.13uncharacterized protein LOC111435280 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1EK296.55e-4293.75uncharacterized protein LOC111435280 isoform X3 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1JKZ51.40e-3890.00uncharacterized protein LOC111486730 OS=Cucurbita maxima OX=3661 GN=LOC111486730... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableGENE3D3.70.10.10coord: 1..157
e-value: 1.7E-10
score: 42.6
NoneNo IPR availableSUPERFAMILY55979DNA clampcoord: 1..108

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG04g06570.1Cp4.1LG04g06570.1mRNA