Csor.00g058660 (gene) Silver-seed gourd (wild; sororia) v1

Overview
Name: Csor.00g058660
Type: gene
Organism: Cucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
Description: Epidermal patterning factor-like protein
Location: Csor_Chr03: 6980548 .. 6983318 (-)
RNA-Seq Expression: Csor.00g058660
Synteny: Csor.00g058660
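
The Location field gives 1-based, inclusive coordinates on the minus strand, so the gene span is end - start + 1. A minimal sketch, assuming the location string always follows the "chromosome: start .. end (strand)" layout seen on this page; the helper name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'Chr: start .. end (strand)' string into its parts.

    The layout is an assumption taken from this page, not a documented format.
    """
    m = re.fullmatch(r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chromosome": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Coordinates are inclusive, hence the +1.
        "length": end - start + 1,
    }

loc = parse_location("Csor_Chr03: 6980548 .. 6983318 (-)")
print(loc["chromosome"], loc["strand"], loc["length"])
```

The computed length is the expected size of the "Gene sequence (with intron)" entry.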
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCGTTCAAACGAACCTATTTGTTTCTTTTGGTTGAGGACGAGTTGAATGGCGGTTCCGGCACGGAGTTGACTGTTCTGGTCCTAGGAATCTTAGAGAAATGTGGCCGGGCGGGCGGGTGGGCGGCGACGGTAACAGATAGTACGAGAAGAGTGACAACAGAGGAAGAAATTGCTTGTTTCAAGGAGAGGGTACGATGTAGTGCCATCAGTGTGTTAATTTGACAAGGAAGGATGAAAACTGTGGGGCTATAGATAGCTGCATGATGTGCAAATATGAATCACGAGCCACAATTTTTTTTTTTTTTTCGTTTGTTTTTTATTAGACATTTGCGGTTGCATTGATGTGTGTACTTTAATACATTCCGAGACATTGATTGGTAGTCCCAGAATTATAATTTATCATATACATTGAGCTAATTGGAATTTAAGGCCGTAGATCCAAGGCAATGGATTGAAGCATTTGAAGCTGTGGCATCAGGTGGTTTGAGTTAAAGCAGCGAAACTGAGTCAAAATGATGAAGAGAAAATTATGGGAGTGTTGAGTGAATATTAGGTTGTGAACTTGGTGTAAAGGAGGGCCCTTGGAATTAAGTCCAAAGATATGGAAGATATTCTATTCTTATGGACGTCAATATCACTGTCTGTTCCTTTTCCTTTAGAATCAGCTTTGTTAAAAGATTTACAAATTCCCATACCCCAACCGTTTTTCTCAATTCCCCTTTCAACACAAGATTCAATTTGATGTTTGATGCCACAATTATGGTCTATTATGGTTGATTATTATTGTCGTGATTATGTCATTACGGTTGATTATTATACAAAATAAGTAATAATACCTTCTATTTATTTTAAGTGATGCTGATTCACGATTGTGGTTGGTGGTCCGGGTTTGTCTTCTAGTTGAACATGCCAAACACGTTCCAATACTTTAAGTTAGTATTAAGTTAAAGTGAAATCGAGTTGGACATGGTAGAATTGTATGTCTTTGAAGGTAAGGATGTCTCCTTATCTACACTACTCTTGTAATAGTCTAAGCCTACACTACTCTTGTAATAGTCTAAGCCTACTATTAGTAGTATCTTGTACTCTTATTTCTATCAACAGTCCAAATTCCATATCGGTTGGAGTTCGCTAGCGAGAACGTTGGGTTTTCTAAAGGATGGATTGTGAGATCCCACATCGATTGAAGGGGAAGTCTAAAAGGGAAAACCCACTGCTAGTAGTTTATTGTCCGTTTTAGTCTAACATTTCTCATTATTTTTAAAACGCATGTTAACCAAATATATGAAAGGCAAAATTTATAATTTATTGGGCTGTTTCAAGTGTAAAGCCCAAATAATGCAAGCAAAGGAGGAGCAATAATTATAGTTTTGGAGGGAGCAATCGACAAATGAGAAGCAAACTCATAGTTGGGAAAATCCTGAAGTGGGTCTGTTGCTTGGGAAGAAGAAAGGATATGGATATGACGTGATTAGGTGGGCGAGTTGAGAACAAAATCACAACAATCCCAAAACTTGAATTTTAAACTAACAACCAAAAAGAGAAGAGGAGAAAGAAAGCGAAGGCAAAAGGAAAAGGAAGACCAAATCCCACTCTCACACATCTCCCATTGCGCTGCTCTCACGTCTGGCTATGGAACAACTTGGCAGCATTTAGGAGCATTTGCAAATTTTGCTCTCATCTTTCGGCTCCTTGTTATTTTGATTGCCGCCATTACAATTCCTACTTGCAACTCTCATTCTCACGAATTTTTCTTCTCTCTCACTTCTTAGCTCTTATTTTATTCTATTTTCTGTTGGGTTAAAGTGTTTGGAGGGGGGGGCTCTTCTTTCGAGTTCTGAGGGGTTCTGCACTTTTTGGTACTCTCCCTTCTTTCTTGTGTTTCTTTTTCATTTCTCTTCCACGTTCAATCATCTCAAATATCTTTCCCACACTAACACACATATTCAGACCAAATTATATTATATACACGTACCACCGCCGTTGAGACCAAAATC
AGAACCTATTCTCCCACCGATTCTGATTTAAAATTCACTTCTCCACCCACTTTCTTCTTTCCTTTGTTTCTGAATATCTGAGCTCTCCATCTCCAGGGACAATTCCCTGTCTCTTTCTCTGTGAGAACAATAGGAAAATGGGGTGTGAGTGTAACAACAATGGCGTCGTCATTGGGCGCAGCAGAATCTTGTGTGCGACTGTTTCTTTTCTCTGTCTTCTGATATTGGCATCGACCCAGATGAGAATCAAGGCTGAAGGTAACTGGGTATCTTTTGGATCTTCAAAACAGAGTCCAAAAACGGACTTTTATTCTTCATCCTACAATTTCAGTTCGATTTGAATCATCTTGTGAATCGTTTTTCTTTGCAGGTAGATCGATTTCAATGAGGACCAAGGTCAGTCCACAAACCACCGCTCTTCCCTCCCTTTCTCAAAGACTTTTATTGTTTCTGTGAAAGCTTAGTTAATTGAGAAGAATAATAATTTTGTGTTTCTTTCTTTTTTTTTTTTCTTTTGATGCATTGGGATAGACAGTGAATGAAGATAAGGAGCTATTAAGAGGACAAATTGGATCAAAGCCACCAAAATGTGAGAGAAGATGCAGCTGGTGCGGTCACTGTGAGGCCATTCAAGTTCCTGCAAACCCACAAAAATCAGCAACTAAAAAATCTTCAGCAGTGAAGAACATAGTTTATGCTAGAGATGAAGCCTCCAATTACAAGCCCATGAGCTGGAAATGCAAATGTGGGAGCTTAATCTTCAACCCTTAA

mRNA sequence

ATGTCGTTCAAACGAACCTATTTGTTTCTTTTGGTTGAGGACGAGTTGAATGGCGGTTCCGGCACGGAGTTGACTGTTCTGGTCCTAGGAATCTTAGAGAAATGTGGCCGGGCGGGCGGGTGGGCGGCGACGGTAACAGATAGTACGAGAAGAGTGACAACAGAGGAAGAAATTGCTTGTTTCAAGGAGAGGGCCGTAGATCCAAGGCAATGGATTGAAGCATTTGAAGCTGTGGCATCAGGGACAATTCCCTGTCTCTTTCTCTGTGAGAACAATAGGAAAATGGGGTGTGAGTGTAACAACAATGGCGTCGTCATTGGGCGCAGCAGAATCTTGTGTGCGACTGTTTCTTTTCTCTGTCTTCTGATATTGGCATCGACCCAGATGAGAATCAAGGCTGAAGGTAGATCGATTTCAATGAGGACCAAGACAGTGAATGAAGATAAGGAGCTATTAAGAGGACAAATTGGATCAAAGCCACCAAAATGTGAGAGAAGATGCAGCTGGTGCGGTCACTGTGAGGCCATTCAAGTTCCTGCAAACCCACAAAAATCAGCAACTAAAAAATCTTCAGCAGTGAAGAACATAGTTTATGCTAGAGATGAAGCCTCCAATTACAAGCCCATGAGCTGGAAATGCAAATGTGGGAGCTTAATCTTCAACCCTTAA

Coding sequence (CDS)

ATGTCGTTCAAACGAACCTATTTGTTTCTTTTGGTTGAGGACGAGTTGAATGGCGGTTCCGGCACGGAGTTGACTGTTCTGGTCCTAGGAATCTTAGAGAAATGTGGCCGGGCGGGCGGGTGGGCGGCGACGGTAACAGATAGTACGAGAAGAGTGACAACAGAGGAAGAAATTGCTTGTTTCAAGGAGAGGGCCGTAGATCCAAGGCAATGGATTGAAGCATTTGAAGCTGTGGCATCAGGGACAATTCCCTGTCTCTTTCTCTGTGAGAACAATAGGAAAATGGGGTGTGAGTGTAACAACAATGGCGTCGTCATTGGGCGCAGCAGAATCTTGTGTGCGACTGTTTCTTTTCTCTGTCTTCTGATATTGGCATCGACCCAGATGAGAATCAAGGCTGAAGGTAGATCGATTTCAATGAGGACCAAGACAGTGAATGAAGATAAGGAGCTATTAAGAGGACAAATTGGATCAAAGCCACCAAAATGTGAGAGAAGATGCAGCTGGTGCGGTCACTGTGAGGCCATTCAAGTTCCTGCAAACCCACAAAAATCAGCAACTAAAAAATCTTCAGCAGTGAAGAACATAGTTTATGCTAGAGATGAAGCCTCCAATTACAAGCCCATGAGCTGGAAATGCAAATGTGGGAGCTTAATCTTCAACCCTTAA

Protein sequence

MSFKRTYLFLLVEDELNGGSGTELTVLVLGILEKCGRAGGWAATVTDSTRRVTTEEEIACFKERAVDPRQWIEAFEAVASGTIPCLFLCENNRKMGCECNNNGVVIGRSRILCATVSFLCLLILASTQMRIKAEGRSISMRTKTVNEDKELLRGQIGSKPPKCERRCSWCGHCEAIQVPANPQKSATKKSSAVKNIVYARDEASNYKPMSWKCKCGSLIFNP
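
The protein sequence corresponds to the CDS read codon by codon up to the stop codon. A minimal sketch of that translation, assuming the standard genetic code; only the first 30 and last 21 nucleotides of the CDS above are used to keep the example short, and the function name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    residues = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

cds_start = "ATGTCGTTCAAACGAACCTATTTGTTTCTT"  # first 30 nt of the CDS
cds_end = "AGCTTAATCTTCAACCCTTAA"             # last 21 nt, including TAA

n_terminus = translate(cds_start)
c_terminus = translate(cds_end)
print(n_terminus, c_terminus)
```

The two fragments can be compared against the start and end of the protein sequence above.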
Homology
BLAST of Csor.00g058660 vs. ExPASy Swiss-Prot
Match: Q9T068 (EPIDERMAL PATTERNING FACTOR-like protein 2 OS=Arabidopsis thaliana OX=3702 GN=EPFL2 PE=2 SV=1)

HSP 1 Score: 105.1 bits (261), Expect = 1.0e-21
Identity = 60/118 (50.85%), Positives = 79/118 (66.95%), Query Frame = 0

Query: 119 LCLLILASTQMRIKAEGR----SISMRTKTVNED-KELLRGQIGSKPPKCER-RCSWCGH 178
           L LLIL ST   + A GR    S+   TK+ ++D K ++RG IGS+PP+CER RC  CGH
Sbjct: 12  LILLILNSTHFSLMANGRPEPDSVEF-TKSGDQDVKMMMRGLIGSRPPRCERVRCRSCGH 71

Query: 179 CEAIQVPANPQ-------KSATKKSSAVKNIVYAR-DEASNYKPMSWKCKCGSLIFNP 223
           CEAIQVP NPQ        +++  SS   ++ Y R D+++NYKPMSWKCKCG+ I+NP
Sbjct: 72  CEAIQVPTNPQTKLHSPLTTSSSSSSETIHLDYTRGDDSTNYKPMSWKCKCGNSIYNP 128
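
The Identity and Positives percentages in each HSP header are the counts of identical or conservatively substituted columns divided by the alignment length, gaps included. A short sketch that reproduces the figures for the Q9T068 hit above; the helper name `percent` is hypothetical.

```python
def percent(matches, aligned_length):
    """Share of alignment columns that match, as a percentage rounded to 2 places."""
    return round(100.0 * matches / aligned_length, 2)

# Counts taken from the HSP header of the Q9T068 (EPFL2) hit.
identity = percent(60, 118)
positives = percent(79, 118)
print(identity, positives)
```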

BLAST of Csor.00g058660 vs. ExPASy Swiss-Prot
Match: Q9LFT5 (EPIDERMAL PATTERNING FACTOR-like protein 1 OS=Arabidopsis thaliana OX=3702 GN=EPFL1 PE=1 SV=1)

HSP 1 Score: 63.5 bits (153), Expect = 3.4e-09
Identity = 29/78 (37.18%), Positives = 40/78 (51.28%), Query Frame = 0

Query: 153 RGQIGSKPPKCERRCSWCGHCEAIQVPANPQKSATKKSSAV--------KNIVYARDEAS 212
           + ++GS PP C  RC+ C  C AIQVP  P +S   + +           ++    D+ S
Sbjct: 45  KARLGSTPPSCHNRCNNCHPCMAIQVPTLPTRSRFTRVNPFSGGFVRPPSSLTTVLDQYS 104

Query: 213 NYKPMSWKCKCGSLIFNP 223
           NYKPM WKC C    +NP
Sbjct: 105 NYKPMGWKCHCNGHFYNP 122

BLAST of Csor.00g058660 vs. ExPASy Swiss-Prot
Match: C4B8C4 (EPIDERMAL PATTERNING FACTOR-like protein 3 OS=Arabidopsis thaliana OX=3702 GN=EPFL3 PE=1 SV=1)

HSP 1 Score: 56.2 bits (134), Expect = 5.4e-07
Identity = 28/73 (38.36%), Positives = 39/73 (53.42%), Query Frame = 0

Query: 146 NEDKELL---RGQIGSKPPKCERRCSWCGHCEAIQVPANPQKSATKKSSAVKNIVYARDE 205
           NE+KE +   R +IGSKPP CE++C  C  CEAIQ P             + +I +    
Sbjct: 44  NENKEEIVKRRRRIGSKPPSCEKKCYGCEPCEAIQFP------------TISSIPHLSPH 103

Query: 206 ASNYKPMSWKCKC 216
            +NY+P  W+C C
Sbjct: 104 YANYQPEGWRCHC 104

BLAST of Csor.00g058660 vs. ExPASy Swiss-Prot
Match: Q2V3I3 (EPIDERMAL PATTERNING FACTOR-like protein 4 OS=Arabidopsis thaliana OX=3702 GN=EPFL4 PE=1 SV=1)

HSP 1 Score: 55.1 bits (131), Expect = 1.2e-06
Identity = 37/121 (30.58%), Positives = 51/121 (42.15%), Query Frame = 0

Query: 108 RSRILCATVSFLCLLILASTQMRIKAEGRSISMRTKT------VNEDKELLRGQIGSKPP 167
           R R L A +    LL L S    + A+GR I  RT +      +  +K    G  GS PP
Sbjct: 7   RRRFLLAALVTFALLHLFSASSIVSADGRWIGQRTGSDLPGGFIRSNKRF--GGPGSSPP 66

Query: 168 KCERRCSWCGHCEAIQVPANPQKSATKKSSAVKNIVYARDEASNYKPMSWKCKCGSLIFN 223
            C  +C  C  C+ + VP  P  S   +                Y P +W+CKCG+ +F 
Sbjct: 67  TCRSKCGKCQPCKPVHVPIQPGLSMPLE----------------YYPEAWRCKCGNKLFM 109

BLAST of Csor.00g058660 vs. ExPASy Swiss-Prot
Match: Q9LUH9 (EPIDERMAL PATTERNING FACTOR-like protein 5 OS=Arabidopsis thaliana OX=3702 GN=EPFL5 PE=1 SV=1)

HSP 1 Score: 46.2 bits (108), Expect = 5.6e-04
Identity = 22/69 (31.88%), Positives = 32/69 (46.38%), Query Frame = 0

Query: 154 GQIGSKPPKCERRCSWCGHCEAIQVPANPQKSATKKSSAVKNIVYARDEASNYKPMSWKC 213
           G  GS PP C  +C  C  C+A+ VP  P             ++   +    Y P +W+C
Sbjct: 55  GGPGSVPPMCRLKCGKCEPCKAVHVPIQP------------GLIMPLE----YYPEAWRC 107

Query: 214 KCGSLIFNP 223
           KCG+ +F P
Sbjct: 115 KCGNKLFMP 107

BLAST of Csor.00g058660 vs. NCBI nr
Match: KAG6604096.1 (EPIDERMAL PATTERNING FACTOR-like protein 2, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 454 bits (1169), Expect = 2.21e-161
Identity = 222/222 (100.00%), Positives = 222/222 (100.00%), Query Frame = 0

Query: 1   MSFKRTYLFLLVEDELNGGSGTELTVLVLGILEKCGRAGGWAATVTDSTRRVTTEEEIAC 60
           MSFKRTYLFLLVEDELNGGSGTELTVLVLGILEKCGRAGGWAATVTDSTRRVTTEEEIAC
Sbjct: 1   MSFKRTYLFLLVEDELNGGSGTELTVLVLGILEKCGRAGGWAATVTDSTRRVTTEEEIAC 60

Query: 61  FKERAVDPRQWIEAFEAVASGTIPCLFLCENNRKMGCECNNNGVVIGRSRILCATVSFLC 120
           FKERAVDPRQWIEAFEAVASGTIPCLFLCENNRKMGCECNNNGVVIGRSRILCATVSFLC
Sbjct: 61  FKERAVDPRQWIEAFEAVASGTIPCLFLCENNRKMGCECNNNGVVIGRSRILCATVSFLC 120

Query: 121 LLILASTQMRIKAEGRSISMRTKTVNEDKELLRGQIGSKPPKCERRCSWCGHCEAIQVPA 180
           LLILASTQMRIKAEGRSISMRTKTVNEDKELLRGQIGSKPPKCERRCSWCGHCEAIQVPA
Sbjct: 121 LLILASTQMRIKAEGRSISMRTKTVNEDKELLRGQIGSKPPKCERRCSWCGHCEAIQVPA 180

Query: 181 NPQKSATKKSSAVKNIVYARDEASNYKPMSWKCKCGSLIFNP 222
           NPQKSATKKSSAVKNIVYARDEASNYKPMSWKCKCGSLIFNP
Sbjct: 181 NPQKSATKKSSAVKNIVYARDEASNYKPMSWKCKCGSLIFNP 222

BLAST of Csor.00g058660 vs. NCBI nr
Match: XP_022949700.1 (EPIDERMAL PATTERNING FACTOR-like protein 2 [Cucurbita moschata] >XP_022949701.1 EPIDERMAL PATTERNING FACTOR-like protein 2 [Cucurbita moschata] >XP_023544470.1 EPIDERMAL PATTERNING FACTOR-like protein 2 [Cucurbita pepo subsp. pepo] >KAG7034260.1 EPIDERMAL PATTERNING FACTOR-like protein 2 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 266 bits (681), Expect = 1.93e-88
Identity = 127/128 (99.22%), Positives = 128/128 (100.00%), Query Frame = 0

Query: 95  MGCECNNNGVVIGRSRILCATVSFLCLLILASTQMRIKAEGRSISMRTKTVNEDKELLRG 154
           MGCECNNNGVVIGRSRILCATVSFLCLLILASTQMRIKAEGRSISMRTKTVNEDKE+LRG
Sbjct: 1   MGCECNNNGVVIGRSRILCATVSFLCLLILASTQMRIKAEGRSISMRTKTVNEDKEILRG 60

Query: 155 QIGSKPPKCERRCSWCGHCEAIQVPANPQKSATKKSSAVKNIVYARDEASNYKPMSWKCK 214
           QIGSKPPKCERRCSWCGHCEAIQVPANPQKSATKKSSAVKNIVYARDEASNYKPMSWKCK
Sbjct: 61  QIGSKPPKCERRCSWCGHCEAIQVPANPQKSATKKSSAVKNIVYARDEASNYKPMSWKCK 120

Query: 215 CGSLIFNP 222
           CGSLIFNP
Sbjct: 121 CGSLIFNP 128

BLAST of Csor.00g058660 vs. NCBI nr
Match: XP_022979085.1 (EPIDERMAL PATTERNING FACTOR-like protein 2 [Cucurbita maxima] >XP_022979086.1 EPIDERMAL PATTERNING FACTOR-like protein 2 [Cucurbita maxima])

HSP 1 Score: 261 bits (666), Expect = 3.72e-86
Identity = 125/128 (97.66%), Positives = 127/128 (99.22%), Query Frame = 0

Query: 95  MGCECNNNGVVIGRSRILCATVSFLCLLILASTQMRIKAEGRSISMRTKTVNEDKELLRG 154
           MGCE NNNGVVIGRSRILCATVSFLCLLILASTQM+IKAEGRSISMRTKTVNEDKE+LRG
Sbjct: 1   MGCERNNNGVVIGRSRILCATVSFLCLLILASTQMKIKAEGRSISMRTKTVNEDKEILRG 60

Query: 155 QIGSKPPKCERRCSWCGHCEAIQVPANPQKSATKKSSAVKNIVYARDEASNYKPMSWKCK 214
           QIGSKPPKCERRCSWCGHCEAIQVPANPQKSATKKSSAVKNIVYARDEASNYKPMSWKCK
Sbjct: 61  QIGSKPPKCERRCSWCGHCEAIQVPANPQKSATKKSSAVKNIVYARDEASNYKPMSWKCK 120

Query: 215 CGSLIFNP 222
           CGSLIFNP
Sbjct: 121 CGSLIFNP 128

BLAST of Csor.00g058660 vs. NCBI nr
Match: XP_022963149.1 (EPIDERMAL PATTERNING FACTOR-like protein 2 [Cucurbita moschata])

HSP 1 Score: 221 bits (564), Expect = 6.02e-70
Identity = 110/133 (82.71%), Positives = 114/133 (85.71%), Query Frame = 0

Query: 90  ENNRKMGCECNNNGVVIGRSRILCATVSFLCLLILASTQMRIKAEGRSISMRTKTVNEDK 149
           E   KMGCECNNNGV IGRSRILCATVSFL  LILASTQMR  AEGRSIS   KTV+EDK
Sbjct: 50  ERKEKMGCECNNNGV-IGRSRILCATVSFLFFLILASTQMRFMAEGRSISKSGKTVSEDK 109

Query: 150 ELLRGQIGSKPPKCERRCSWCGHCEAIQVPANPQKSATKKSSAVKNIVYARDEASNYKPM 209
            +LRGQIGS+PPKCERRCSWC HCEAIQVPANPQKS     SA+KNI YARDEASNYKPM
Sbjct: 110 VVLRGQIGSRPPKCERRCSWCAHCEAIQVPANPQKS-----SAMKNIAYARDEASNYKPM 169

Query: 210 SWKCKCGSLIFNP 222
           SWKCKCGSLIFNP
Sbjct: 170 SWKCKCGSLIFNP 176

BLAST of Csor.00g058660 vs. NCBI nr
Match: XP_023003266.1 (EPIDERMAL PATTERNING FACTOR-like protein 2 isoform X1 [Cucurbita maxima])

HSP 1 Score: 220 bits (561), Expect = 7.52e-70
Identity = 109/130 (83.85%), Positives = 114/130 (87.69%), Query Frame = 0

Query: 93  RKMGCECNNNGVVIGRSRILCATVSFLCLLILASTQMRIKAEGRSISMRTKTVNEDKELL 152
           RKMGCECNNNGV IGR RILCATVSFL LLILASTQMR  AEGRSIS   KTV+EDK +L
Sbjct: 28  RKMGCECNNNGV-IGRCRILCATVSFLFLLILASTQMRFMAEGRSISKSGKTVSEDKVVL 87

Query: 153 RGQIGSKPPKCERRCSWCGHCEAIQVPANPQKSATKKSSAVKNIVYARDEASNYKPMSWK 212
           RGQIGS+PPKCERRCSWC HCEAIQVPANPQKS+T     +KNI YARDEASNYKPMSWK
Sbjct: 88  RGQIGSRPPKCERRCSWCAHCEAIQVPANPQKSST-----MKNIAYARDEASNYKPMSWK 147

Query: 213 CKCGSLIFNP 222
           CKCGSLIFNP
Sbjct: 148 CKCGSLIFNP 151

BLAST of Csor.00g058660 vs. ExPASy TrEMBL
Match: A0A6J1GDI3 (Epidermal patterning factor-like protein OS=Cucurbita moschata OX=3662 GN=LOC111453017 PE=3 SV=1)

HSP 1 Score: 266 bits (681), Expect = 9.36e-89
Identity = 127/128 (99.22%), Positives = 128/128 (100.00%), Query Frame = 0

Query: 95  MGCECNNNGVVIGRSRILCATVSFLCLLILASTQMRIKAEGRSISMRTKTVNEDKELLRG 154
           MGCECNNNGVVIGRSRILCATVSFLCLLILASTQMRIKAEGRSISMRTKTVNEDKE+LRG
Sbjct: 1   MGCECNNNGVVIGRSRILCATVSFLCLLILASTQMRIKAEGRSISMRTKTVNEDKEILRG 60

Query: 155 QIGSKPPKCERRCSWCGHCEAIQVPANPQKSATKKSSAVKNIVYARDEASNYKPMSWKCK 214
           QIGSKPPKCERRCSWCGHCEAIQVPANPQKSATKKSSAVKNIVYARDEASNYKPMSWKCK
Sbjct: 61  QIGSKPPKCERRCSWCGHCEAIQVPANPQKSATKKSSAVKNIVYARDEASNYKPMSWKCK 120

Query: 215 CGSLIFNP 222
           CGSLIFNP
Sbjct: 121 CGSLIFNP 128

BLAST of Csor.00g058660 vs. ExPASy TrEMBL
Match: A0A6J1IS77 (Epidermal patterning factor-like protein OS=Cucurbita maxima OX=3661 GN=LOC111478828 PE=3 SV=1)

HSP 1 Score: 261 bits (666), Expect = 1.80e-86
Identity = 125/128 (97.66%), Positives = 127/128 (99.22%), Query Frame = 0

Query: 95  MGCECNNNGVVIGRSRILCATVSFLCLLILASTQMRIKAEGRSISMRTKTVNEDKELLRG 154
           MGCE NNNGVVIGRSRILCATVSFLCLLILASTQM+IKAEGRSISMRTKTVNEDKE+LRG
Sbjct: 1   MGCERNNNGVVIGRSRILCATVSFLCLLILASTQMKIKAEGRSISMRTKTVNEDKEILRG 60

Query: 155 QIGSKPPKCERRCSWCGHCEAIQVPANPQKSATKKSSAVKNIVYARDEASNYKPMSWKCK 214
           QIGSKPPKCERRCSWCGHCEAIQVPANPQKSATKKSSAVKNIVYARDEASNYKPMSWKCK
Sbjct: 61  QIGSKPPKCERRCSWCGHCEAIQVPANPQKSATKKSSAVKNIVYARDEASNYKPMSWKCK 120

Query: 215 CGSLIFNP 222
           CGSLIFNP
Sbjct: 121 CGSLIFNP 128

BLAST of Csor.00g058660 vs. ExPASy TrEMBL
Match: A0A6J1HJ94 (Epidermal patterning factor-like protein OS=Cucurbita moschata OX=3662 GN=LOC111463445 PE=3 SV=1)

HSP 1 Score: 221 bits (564), Expect = 2.92e-70
Identity = 110/133 (82.71%), Positives = 114/133 (85.71%), Query Frame = 0

Query: 90  ENNRKMGCECNNNGVVIGRSRILCATVSFLCLLILASTQMRIKAEGRSISMRTKTVNEDK 149
           E   KMGCECNNNGV IGRSRILCATVSFL  LILASTQMR  AEGRSIS   KTV+EDK
Sbjct: 50  ERKEKMGCECNNNGV-IGRSRILCATVSFLFFLILASTQMRFMAEGRSISKSGKTVSEDK 109

Query: 150 ELLRGQIGSKPPKCERRCSWCGHCEAIQVPANPQKSATKKSSAVKNIVYARDEASNYKPM 209
            +LRGQIGS+PPKCERRCSWC HCEAIQVPANPQKS     SA+KNI YARDEASNYKPM
Sbjct: 110 VVLRGQIGSRPPKCERRCSWCAHCEAIQVPANPQKS-----SAMKNIAYARDEASNYKPM 169

Query: 210 SWKCKCGSLIFNP 222
           SWKCKCGSLIFNP
Sbjct: 170 SWKCKCGSLIFNP 176

BLAST of Csor.00g058660 vs. ExPASy TrEMBL
Match: A0A6J1KNP8 (Epidermal patterning factor-like protein OS=Cucurbita maxima OX=3661 GN=LOC111496927 PE=3 SV=1)

HSP 1 Score: 220 bits (561), Expect = 3.64e-70
Identity = 109/130 (83.85%), Positives = 114/130 (87.69%), Query Frame = 0

Query: 93  RKMGCECNNNGVVIGRSRILCATVSFLCLLILASTQMRIKAEGRSISMRTKTVNEDKELL 152
           RKMGCECNNNGV IGR RILCATVSFL LLILASTQMR  AEGRSIS   KTV+EDK +L
Sbjct: 28  RKMGCECNNNGV-IGRCRILCATVSFLFLLILASTQMRFMAEGRSISKSGKTVSEDKVVL 87

Query: 153 RGQIGSKPPKCERRCSWCGHCEAIQVPANPQKSATKKSSAVKNIVYARDEASNYKPMSWK 212
           RGQIGS+PPKCERRCSWC HCEAIQVPANPQKS+T     +KNI YARDEASNYKPMSWK
Sbjct: 88  RGQIGSRPPKCERRCSWCAHCEAIQVPANPQKSST-----MKNIAYARDEASNYKPMSWK 147

Query: 213 CKCGSLIFNP 222
           CKCGSLIFNP
Sbjct: 148 CKCGSLIFNP 151

BLAST of Csor.00g058660 vs. ExPASy TrEMBL
Match: A0A6J1KLZ1 (Epidermal patterning factor-like protein OS=Cucurbita maxima OX=3661 GN=LOC111496927 PE=3 SV=1)

HSP 1 Score: 216 bits (550), Expect = 6.58e-69
Identity = 107/128 (83.59%), Positives = 112/128 (87.50%), Query Frame = 0

Query: 95  MGCECNNNGVVIGRSRILCATVSFLCLLILASTQMRIKAEGRSISMRTKTVNEDKELLRG 154
           MGCECNNNGV IGR RILCATVSFL LLILASTQMR  AEGRSIS   KTV+EDK +LRG
Sbjct: 1   MGCECNNNGV-IGRCRILCATVSFLFLLILASTQMRFMAEGRSISKSGKTVSEDKVVLRG 60

Query: 155 QIGSKPPKCERRCSWCGHCEAIQVPANPQKSATKKSSAVKNIVYARDEASNYKPMSWKCK 214
           QIGS+PPKCERRCSWC HCEAIQVPANPQKS+T     +KNI YARDEASNYKPMSWKCK
Sbjct: 61  QIGSRPPKCERRCSWCAHCEAIQVPANPQKSST-----MKNIAYARDEASNYKPMSWKCK 120

Query: 215 CGSLIFNP 222
           CGSLIFNP
Sbjct: 121 CGSLIFNP 122

BLAST of Csor.00g058660 vs. TAIR 10
Match: AT4G37810.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G10310.1); Has 149 Blast hits to 149 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 149; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 105.1 bits (261), Expect = 7.2e-23
Identity = 60/118 (50.85%), Positives = 79/118 (66.95%), Query Frame = 0

Query: 119 LCLLILASTQMRIKAEGR----SISMRTKTVNED-KELLRGQIGSKPPKCER-RCSWCGH 178
           L LLIL ST   + A GR    S+   TK+ ++D K ++RG IGS+PP+CER RC  CGH
Sbjct: 12  LILLILNSTHFSLMANGRPEPDSVEF-TKSGDQDVKMMMRGLIGSRPPRCERVRCRSCGH 71

Query: 179 CEAIQVPANPQ-------KSATKKSSAVKNIVYAR-DEASNYKPMSWKCKCGSLIFNP 223
           CEAIQVP NPQ        +++  SS   ++ Y R D+++NYKPMSWKCKCG+ I+NP
Sbjct: 72  CEAIQVPTNPQTKLHSPLTTSSSSSSETIHLDYTRGDDSTNYKPMSWKCKCGNSIYNP 128

BLAST of Csor.00g058660 vs. TAIR 10
Match: AT5G10310.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: 17 plant structures; EXPRESSED DURING: 10 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G13898.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 63.5 bits (153), Expect = 2.4e-10
Identity = 29/78 (37.18%), Positives = 40/78 (51.28%), Query Frame = 0

Query: 153 RGQIGSKPPKCERRCSWCGHCEAIQVPANPQKSATKKSSAV--------KNIVYARDEAS 212
           + ++GS PP C  RC+ C  C AIQVP  P +S   + +           ++    D+ S
Sbjct: 45  KARLGSTPPSCHNRCNNCHPCMAIQVPTLPTRSRFTRVNPFSGGFVRPPSSLTTVLDQYS 104

Query: 213 NYKPMSWKCKCGSLIFNP 223
           NYKPM WKC C    +NP
Sbjct: 105 NYKPMGWKCHCNGHFYNP 122

BLAST of Csor.00g058660 vs. TAIR 10
Match: AT3G13898.1 (unknown protein; LOCATED IN: endomembrane system; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G10310.1). )

HSP 1 Score: 56.2 bits (134), Expect = 3.8e-08
Identity = 28/73 (38.36%), Positives = 39/73 (53.42%), Query Frame = 0

Query: 146 NEDKELL---RGQIGSKPPKCERRCSWCGHCEAIQVPANPQKSATKKSSAVKNIVYARDE 205
           NE+KE +   R +IGSKPP CE++C  C  CEAIQ P             + +I +    
Sbjct: 44  NENKEEIVKRRRRIGSKPPSCEKKCYGCEPCEAIQFP------------TISSIPHLSPH 103

Query: 206 ASNYKPMSWKCKC 216
            +NY+P  W+C C
Sbjct: 104 YANYQPEGWRCHC 104

BLAST of Csor.00g058660 vs. TAIR 10
Match: AT4G14723.1 (BEST Arabidopsis thaliana protein match is: allergen-related (TAIR:AT3G22820.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 55.1 bits (131), Expect = 8.5e-08
Identity = 37/121 (30.58%), Positives = 51/121 (42.15%), Query Frame = 0

Query: 108 RSRILCATVSFLCLLILASTQMRIKAEGRSISMRTKT------VNEDKELLRGQIGSKPP 167
           R R L A +    LL L S    + A+GR I  RT +      +  +K    G  GS PP
Sbjct: 7   RRRFLLAALVTFALLHLFSASSIVSADGRWIGQRTGSDLPGGFIRSNKRF--GGPGSSPP 66

Query: 168 KCERRCSWCGHCEAIQVPANPQKSATKKSSAVKNIVYARDEASNYKPMSWKCKCGSLIFN 223
            C  +C  C  C+ + VP  P  S   +                Y P +W+CKCG+ +F 
Sbjct: 67  TCRSKCGKCQPCKPVHVPIQPGLSMPLE----------------YYPEAWRCKCGNKLFM 109

BLAST of Csor.00g058660 vs. TAIR 10
Match: AT3G22820.1 (allergen-related )

HSP 1 Score: 46.2 bits (108), Expect = 4.0e-05
Identity = 22/69 (31.88%), Positives = 32/69 (46.38%), Query Frame = 0

Query: 154 GQIGSKPPKCERRCSWCGHCEAIQVPANPQKSATKKSSAVKNIVYARDEASNYKPMSWKC 213
           G  GS PP C  +C  C  C+A+ VP  P             ++   +    Y P +W+C
Sbjct: 55  GGPGSVPPMCRLKCGKCEPCKAVHVPIQP------------GLIMPLE----YYPEAWRC 107

Query: 214 KCGSLIFNP 223
           KCG+ +F P
Sbjct: 115 KCGNKLFMP 107

The following BLAST results are available for this feature:

vs. ExPASy Swiss-Prot
Match Name      E-value     Identity  Description
Q9T068          1.0e-21     50.85     EPIDERMAL PATTERNING FACTOR-like protein 2 OS=Arabidopsis thaliana OX=3702 GN=EP...
Q9LFT5          3.4e-09     37.18     EPIDERMAL PATTERNING FACTOR-like protein 1 OS=Arabidopsis thaliana OX=3702 GN=EP...
C4B8C4          5.4e-07     38.36     EPIDERMAL PATTERNING FACTOR-like protein 3 OS=Arabidopsis thaliana OX=3702 GN=EP...
Q2V3I3          1.2e-06     30.58     EPIDERMAL PATTERNING FACTOR-like protein 4 OS=Arabidopsis thaliana OX=3702 GN=EP...
Q9LUH9          5.6e-04     31.88     EPIDERMAL PATTERNING FACTOR-like protein 5 OS=Arabidopsis thaliana OX=3702 GN=EP...

vs. NCBI nr
Match Name      E-value     Identity  Description
KAG6604096.1    2.21e-161   100.00    EPIDERMAL PATTERNING FACTOR-like protein 2, partial [Cucurbita argyrosperma subs...
XP_022949700.1  1.93e-88    99.22     EPIDERMAL PATTERNING FACTOR-like protein 2 [Cucurbita moschata] >XP_022949701.1 ...
XP_022979085.1  3.72e-86    97.66     EPIDERMAL PATTERNING FACTOR-like protein 2 [Cucurbita maxima] >XP_022979086.1 EP...
XP_022963149.1  6.02e-70    82.71     EPIDERMAL PATTERNING FACTOR-like protein 2 [Cucurbita moschata]
XP_023003266.1  7.52e-70    83.85     EPIDERMAL PATTERNING FACTOR-like protein 2 isoform X1 [Cucurbita maxima]

vs. ExPASy TrEMBL
Match Name      E-value     Identity  Description
A0A6J1GDI3      9.36e-89    99.22     Epidermal patterning factor-like protein OS=Cucurbita moschata OX=3662 GN=LOC111...
A0A6J1IS77      1.80e-86    97.66     Epidermal patterning factor-like protein OS=Cucurbita maxima OX=3661 GN=LOC11147...
A0A6J1HJ94      2.92e-70    82.71     Epidermal patterning factor-like protein OS=Cucurbita moschata OX=3662 GN=LOC111...
A0A6J1KNP8      3.64e-70    83.85     Epidermal patterning factor-like protein OS=Cucurbita maxima OX=3661 GN=LOC11149...
A0A6J1KLZ1      6.58e-69    83.59     Epidermal patterning factor-like protein OS=Cucurbita maxima OX=3661 GN=LOC11149...

vs. TAIR 10
Match Name      E-value     Identity  Description
AT4G37810.1     7.2e-23     50.85     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G10310.1     2.4e-10     37.18     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT3G13898.1     3.8e-08     38.36     unknown protein; LOCATED IN: endomembrane system; BEST Arabidopsis thaliana prot...
AT4G14723.1     8.5e-08     30.58     BEST Arabidopsis thaliana protein match is: allergen-related (TAIR:AT3G22820.1);...
AT3G22820.1     4.0e-05     31.88     allergen-related

InterPro
Analysis Name: InterPro Annotations of Silver-seed gourd (sororia) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                           Source   Source Term     Source Description                          Alignment
None       No IPR available                          PFAM     PF17181         EPF                                         coord: 155..222; e-value: 4.9E-16; score: 58.4
None       No IPR available                          PANTHER  PTHR33109:SF71  EPIDERMAL PATTERNING FACTOR-LIKE PROTEIN 2  coord: 104..222
IPR039455  EPIDERMAL PATTERNING FACTOR-like protein  PANTHER  PTHR33109       EPIDERMAL PATTERNING FACTOR-LIKE PROTEIN 4  coord: 104..222
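
The alignment coordinates in the InterPro table are 1-based, inclusive residue positions on the protein sequence. A minimal sketch that extracts the region reported for the PFAM EPF hit (155..222) and counts its cysteines; the helper name `region` is hypothetical, and the protein string is the one listed under "Protein sequence", split into chunks only for readability.

```python
PROTEIN = (
    "MSFKRTYLFLLVEDELNGGSGTELTVLVLGILEKCGRAGGWAATVTDSTRRVTTEEEIAC"
    "FKERAVDPRQWIEAFEAVASGTIPCLFLCENNRKMGCECNNNGVVIGRSRILCATVSFLC"
    "LLILASTQMRIKAEGRSISMRTKTVNEDKELLRGQIGSKPPKCERRCSWCGHCEAIQVPA"
    "NPQKSATKKSSAVKNIVYARDEASNYKPMSWKCKCGSLIFNP"
)

def region(seq, start, end):
    """Return residues start..end, using 1-based inclusive coordinates."""
    return seq[start - 1:end]

epf_domain = region(PROTEIN, 155, 222)
cysteines = epf_domain.count("C")
print(len(epf_domain), cysteines)
print(epf_domain)
```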

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Csor.00g058660.m01  Csor.00g058660.m01  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0010052 guard cell differentiation
biological_process GO:0010374 stomatal complex development
cellular_component GO:0005576 extracellular region
cellular_component GO:0016021 integral component of membrane