Cp4.1LG16g08230 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG16g08230
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Ubiquitin carboxyl-terminal hydrolase
Location: Cp4.1LG16: 7978279 .. 7982170 (+)
RNA-Seq Expression: Cp4.1LG16g08230
Synteny: Cp4.1LG16g08230
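The Location field packs the sequence ID, start, end, and strand into one string. The following is a minimal sketch of how it could be split apart and the gene span computed. It assumes the coordinates are 1-based and inclusive (GFF-style), which this page does not state; `Location` and `parse_location` are hypothetical names used only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """One parsed Location field (hypothetical helper type)."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Matches strings such as "Cp4.1LG16: 7978279 .. 7982170 (+)".
_LOCATION_RE = re.compile(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text: str) -> Location:
    """Split a Location string into seqid, start, end, and strand."""
    match = _LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("Cp4.1LG16: 7978279 .. 7982170 (+)")
print(loc.seqid, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the span for this gene works out to 3892 bp.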
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CGAAGAAATATGGTCAAGATTTTAAAGCTATCTATAGAAATTAAGGCATAATTATGAAATGAAGGACTTGTTAGAATTTTTGTTGGAGTAGAGAGATCAACTGATCAAGCGCTACGGGGTTTGTGTAGCGGGTCACAGTTACGGGGCGTGCGGATCTACCATGACCCGAACTGGATCGTTCTCATCTAGCATCTCATTTGCTCGAGACGTTGCGGGCTTCGATTTCAATCTGGAGAGGATAACTTGACCATTACAGAGAAAAAAATCGTATAAGGTACAAAGATATTTTGTTCTTGAAATTTTTTGCGTTTGATCGAAACTCTTGCTATACCTGCAATACTACTGTTACGTGTCGTTCTACATGAGATGTTATCGGTCTAGAGCGATGATTTGGCTCGTAGTTGCTCGTCTGCCGCCTCTTCGTCAATGATTTCACGTTTTTCCATAGCTATTTTTATCTGTTAAGGTATTAAAACTGGCGTTAGTGCAAGAGTTGGTGTGTATAATCTTCAGAGAGTCAGCTAGAGGTATCTGGATATGGCGTCACGGCCACTTCAGTTTCAAGAACACTTAAAATTTTGGTCTGTTTTTGATTTGGTTTAAATTATCTGAGTTTGAGAATGCTGTTCCCGAATACTTGGGTTGCGTGGTAGGTTGCAAAAGGAATAATGTTGTATACGAGCTGCCACCATTAGAGAGTGCTTGAAGGCACAGTAGGACCTAAATATGTTTGTTGTAGAGAGGGGCTGGGGTGTTGGTGTTGGTCTTGGGATTGGGAATGGTTTGTTTTCAAATTTTGATTGATATGATCTTGAAAATGTCACTTGAACTTGGGAATGGTTGGATTTCAAATTTTGGTTAAAATTTGGTGGTTTTTACAGGTGGGCGTAACAATGAGACAGCAGGGTCAATATTCTGATTCAGGACTTGGCTCATATTCAGTTTCTCAAATGCATCAAGTTCCCAGTCAAAGGATGGAACAAAGCCACCCTGATCCTTTTGAAGGACGGCTGGAAGCCTTCACTCCAGAGAGAGAGAATTCATATCTAGCTTCAAAAAATGAGGATCAATGGAGATGGGAAAGAGATGAGTCGAAGATGCCAAATTCCATGGCCTCTCACATGTTCAACGAAGGCAAGCATTAATAGTGTGTGTATTGCATCCATCACTTATTGTTGCTTGCACCTGTTTTCTTATGAAGTGTTTTATTCTCTTGTACATTACAATATCCCTTTCCTAGACTATGTAAAATAAACCATATTAGGTACTTGTTTATTGATATCAACCTCTTGGTCGATGGCGCAGTTGAGCTATGTTTAGGTTGACTCATTTGATTCTTTTGTTGGTTCTAGTCTACTTGGTTATATGATCTGAAATTCTCACCTGTGCTCCCTAAGCTACATTGAGTATTTTCCTTGGGAAACTGAAAAGAGAAATTAACAACATAACTATAATTGCCTTATGGCATTTACACTCTTACAAAGTTTAATTTCACATGATATAGTTCTTGGATATAAGATGATTAGTCAGACCGTTAACGAAAAAGAAATCGTACTATTAATTCTTTTAGATTCATCATTCCCCATTCAAAAGCTATCAAGGTCAGATGATAGTTGAATTAGCTAGGTATCCACACATGTGGTTATCAAGAATAAAGCGATTAATTGATGCCAAAACAGTATCCATTTCATAATGGTTGGGTGATGTGATATTCCTCATAGTATCAAGCTGTTAATTGTTTATTGACAGAGAGAATTTCTATTCATTGTTATTCTATTTCTTTTTTACCCTTTCCCTTGTTTATTTTCCCTTTAAGATGTTTATATACTTCTCCCATGCAAAGAATTTCTGCCTTCTCTTGTTAAACGAAGTTTTCAGAATTTCTGGACTGAGAAATATGCTTGTTTAAACATAGGTCAAGGGGGTGATCTCAGAAGATCATACTTTCAAGGTCAGAGACCAAATCCAAAATTGGTTCTTGAGAAAGGAAGCAACAGCGA
TCTCAGATTTCAATCCCATGGAAAAAACATGGAAAGTAGGTTTGGAGATGGCCTGCTGCCACAGAATTTTGACGGTCTCGAGCAGAAATTCATTGATGACATCATTAACTTTTCCAAGGAACAAAATGATGCAGAGGATGAGGAAAATGCTCGACATAGAGAGGTATATTTGCACATTCCTATTTCAGTACTTGCATTCTCGTTTTTCACATCATCTTTACTGAACATACTTCATTTTGATGCATATCTTCAGAGAATTATCGCTATCAATTCTCAGTACGAGGAGCAATTAGCAGCACTTCGAGCTCAGCATGCTGGTCGTCGTGATGAGCTGCTACGAAGGGAATCAAGTGCACGGCACCATCAGTATCAGAAGGGAATAAGGGACCATTACCCCAATGGGGGCATTGGCCCAGGTAATCCTCGTGGCAGTAGTGGAGTTACAACCTTAGCTGCTTCTGGACGAGCACATCAAAATTATGAATCTGAACACTTCAATTCCTACAGAGAATGAACTCGGTTTCTTGGGAATTCTATGCGGAATTCTAATTTGGATTCGAGTTTATGACACTGGCTCACGGTACTACTAATCAAACTTGCTTAATTTTCAGTATAAAATTACGCATTTTTACATTGTCCGATTAGGCTTCCTTAACAAACCTTTAAGCTGTTTTTTCATCCTAAAGTTATGCACTTTTCTGTTGCACACCCTTCTCTATAGAGCCCATGATCTATGGTATCTCTGGATCCCAGTTTTTGCCGACTTGAACATATCAATGTTTAAGTTGTAGGATCCTCGAATTTTTAGGAGCTTTGGAGTTGGAGATAGTGGCTTACCATGATCTGATTGATGAGCTTGTATCGGAATTACGTTGCCACTTTATATAAAAGCTTCTTCTTTAATGCTTTACATAAAAGTTTTCTTTATGTGAATTTGTATGGTTTCCTTTCCTCACTGGTTCACACTCTCTTGATTTGCGGACTCCAACTGTTAAGAATATATATATATAATCTTGCGGTAGAACTGATTAAACTTATTATTTAATATAGAGTTGTCTTCTAATCCTAGTTCAAGACTATATTCAATGTACTAGTTTATAAATGAGGTATTTTTCTGTGTGATGATGTAACAGCTTAAGCCTACCGTTAGCAATATTGTTCTTTTTGGGCTTCCCCTCAAGATTTTTAAAGCACGTTTGTTAGGGAGAGGTTTCCACACCATTATAAAGATTGATTTGTTTCCCTCTCCAACGATGTAAGATCTCATAGATGATCTTCAATCTTCATTTAAACAAAAAATCATGTTATTCATAAGCGATTAGAACACTCATACTTTTGCCCTTTCTACCAGCTTAGCAACTTGAGTATTGTACAATCCAGTAGTTAATACATTTATACTCCTACTGTGACCAGAAGTATAAAGATGGTGTAAAAGAATATGCATATATATACATGAGTTTCTTTTCAGGAAAGCATTTATTCAATGTATAATGCTTTGTTTTCTTATAAATATTCTTTGGTGGTTACTTTAGGTTAGGTTCCAATGGCAACATCACGTGGCATGTGAGCACATATCCATTGCCTCCAGATCTTGACCAGCAATTGCAATACGGTGGTCTCACTTTTCTGTCTCTCTTCTTTTGTGTTATTATTATCAATGATATCTGTTAGGGCCTTGCTCTGAAACTGCCTGCTCAATATTCACAGCATACACACTTTTACTTGCTGGGAAAAAGAAATCATGTTGTTCTTCGCTTTAATATTTCACTTTGTTAGTGATTCTGTTAGAATTGGATGCCATTCCCTTTCTCATTCGCCTGGGTTTGGTTTGGTTTCCTTGAAGCCAAATAGAATCCTACCTTCATTCCCAAATAAAGTTACTGGTGTTTAAG

mRNA sequence

CGAAGAAATATGGTCAAGATTTTAAAGCTATCTATAGAAATTAAGGCATAATTATGAAATGAAGGACTTGTTAGAATTTTTGTTGGAGTAGAGAGATCAACTGATCAAGCGCTACGGGGTTTGTGTAGCGGGTCACAGTTACGGGGCGTGCGGATCTACCATGACCCGAACTGGATCGTTCTCATCTAGCATCTCATTTGCTCGAGACGTTGCGGGCTTCGATTTCAATCTGGAGAGGATAACTTGACCATTACAGAGAAAAAAATCGTATAAGGTACAAAGATATTTTGTTCTTGAAATTTTTTGCGTTTGATCGAAACTCTTGCTATACCTGCAATACTACTGTTACGTGTCGTTCTACATGAGATGTTATCGGTCTAGAGCGATGATTTGGCTCGTAGTTGCTCGTCTGCCGCCTCTTCGTCAATGATTTCACGTTTTTCCATAGCTATTTTTATCTGTTAAGGTGGGCGTAACAATGAGACAGCAGGGTCAATATTCTGATTCAGGACTTGGCTCATATTCAGTTTCTCAAATGCATCAAGTTCCCAGTCAAAGGATGGAACAAAGCCACCCTGATCCTTTTGAAGGACGGCTGGAAGCCTTCACTCCAGAGAGAGAGAATTCATATCTAGCTTCAAAAAATGAGGATCAATGGAGATGGGAAAGAGATGAGTCGAAGATGCCAAATTCCATGGCCTCTCACATGTTCAACGAAGGTCAAGGGGGTGATCTCAGAAGATCATACTTTCAAGGTCAGAGACCAAATCCAAAATTGGTTCTTGAGAAAGGAAGCAACAGCGATCTCAGATTTCAATCCCATGGAAAAAACATGGAAAGTAGGTTTGGAGATGGCCTGCTGCCACAGAATTTTGACGGTCTCGAGCAGAAATTCATTGATGACATCATTAACTTTTCCAAGGAACAAAATGATGCAGAGGATGAGGAAAATGCTCGACATAGAGAGAGAATTATCGCTATCAATTCTCAGTACGAGGAGCAATTAGCAGCACTTCGAGCTCAGCATGCTGGTCGTCGTGATGAGCTGCTACGAAGGGAATCAAGTGCACGGCACCATCAGTATCAGAAGGGAATAAGGGACCATTACCCCAATGGGGGCATTGGCCCAGAGAATGAACTCGGTTTCTTGGGAATTCTATGCGGAATTCTAATTTGGATTCGAGTTTATGACACTGGCTCACGGTTAGGTTCCAATGGCAACATCACGTGGCATGTGAGCACATATCCATTGCCTCCAGATCTTGACCAGCAATTGCAATACGGTGGTCTCACTTTTCTGTCTCTCTTCTTTTGTGTTATTATTATCAATGATATCTGTTAGGGCCTTGCTCTGAAACTGCCTGCTCAATATTCACAGCATACACACTTTTACTTGCTGGGAAAAAGAAATCATGTTGTTCTTCGCTTTAATATTTCACTTTGTTAGTGATTCTGTTAGAATTGGATGCCATTCCCTTTCTCATTCGCCTGGGTTTGGTTTGGTTTCCTTGAAGCCAAATAGAATCCTACCTTCATTCCCAAATAAAGTTACTGGTGTTTAAG

Coding sequence (CDS)

ATGAGACAGCAGGGTCAATATTCTGATTCAGGACTTGGCTCATATTCAGTTTCTCAAATGCATCAAGTTCCCAGTCAAAGGATGGAACAAAGCCACCCTGATCCTTTTGAAGGACGGCTGGAAGCCTTCACTCCAGAGAGAGAGAATTCATATCTAGCTTCAAAAAATGAGGATCAATGGAGATGGGAAAGAGATGAGTCGAAGATGCCAAATTCCATGGCCTCTCACATGTTCAACGAAGGTCAAGGGGGTGATCTCAGAAGATCATACTTTCAAGGTCAGAGACCAAATCCAAAATTGGTTCTTGAGAAAGGAAGCAACAGCGATCTCAGATTTCAATCCCATGGAAAAAACATGGAAAGTAGGTTTGGAGATGGCCTGCTGCCACAGAATTTTGACGGTCTCGAGCAGAAATTCATTGATGACATCATTAACTTTTCCAAGGAACAAAATGATGCAGAGGATGAGGAAAATGCTCGACATAGAGAGAGAATTATCGCTATCAATTCTCAGTACGAGGAGCAATTAGCAGCACTTCGAGCTCAGCATGCTGGTCGTCGTGATGAGCTGCTACGAAGGGAATCAAGTGCACGGCACCATCAGTATCAGAAGGGAATAAGGGACCATTACCCCAATGGGGGCATTGGCCCAGAGAATGAACTCGGTTTCTTGGGAATTCTATGCGGAATTCTAATTTGGATTCGAGTTTATGACACTGGCTCACGGTTAGGTTCCAATGGCAACATCACGTGGCATGTGAGCACATATCCATTGCCTCCAGATCTTGACCAGCAATTGCAATACGGTGGTCTCACTTTTCTGTCTCTCTTCTTTTGTGTTATTATTATCAATGATATCTGTTAG

Protein sequence

MRQQGQYSDSGLGSYSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQWRWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPENELGFLGILCGILIWIRVYDTGSRLGSNGNITWHVSTYPLPPDLDQQLQYGGLTFLSLFFCVIIINDIC
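The protein sequence above corresponds to a translation of the coding sequence. As a rough sketch, the relationship can be checked with the standard genetic code. This assumes the standard nuclear code applies and that the CDS starts in frame 0; `translate` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last four codons of the CDS listed above.
print(translate("ATGAGACAGCAGGGTCAATAT"))  # MRQQGQY
print(translate("GATATCTGTTAG"))           # DIC
```

The two fragments reproduce the first seven residues (MRQQGQY) and the last three residues (DIC) of the listed protein; passing the full CDS string to `translate` would be the complete check.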
Homology
BLAST of Cp4.1LG16g08230 vs. NCBI nr
Match: XP_023512046.1 (uncharacterized protein LOC111776883 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 444 bits (1141), Expect = 1.48e-155
Identity = 220/226 (97.35%), Positives = 221/226 (97.79%), Query Frame = 0

Query: 1   MRQQGQYSDSGLGSYSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQW 60
           MRQQGQYSDSGLGSYSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQW
Sbjct: 1   MRQQGQYSDSGLGSYSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQW 60

Query: 61  RWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNME 120
           RWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNME
Sbjct: 61  RWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNME 120

Query: 121 SRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAALR 180
           SRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAALR
Sbjct: 121 SRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAALR 180

Query: 181 AQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPENELGFLGI 226
           AQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGP N  G  G+
Sbjct: 181 AQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPGNPRGSSGV 226

BLAST of Cp4.1LG16g08230 vs. NCBI nr
Match: XP_022986988.1 (uncharacterized protein LOC111484575 [Cucurbita maxima])

HSP 1 Score: 429 bits (1104), Expect = 6.40e-150
Identity = 214/226 (94.69%), Positives = 217/226 (96.02%), Query Frame = 0

Query: 1   MRQQGQYSDSGLGSYSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQW 60
           MRQQ QYSDSGLGSYSVSQMHQVPSQR+EQSHPDPFEGRLEAFTPERENSY+ASKNEDQW
Sbjct: 1   MRQQRQYSDSGLGSYSVSQMHQVPSQRIEQSHPDPFEGRLEAFTPERENSYIASKNEDQW 60

Query: 61  RWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNME 120
           R ERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPK VLEKGSNSDLRFQSHGKNME
Sbjct: 61  RRERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKFVLEKGSNSDLRFQSHGKNME 120

Query: 121 SRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAALR 180
           SRFGDGLLPQNFDGLEQKFIDDIINFSKEQND EDEENARHRERIIAINSQYEEQLAALR
Sbjct: 121 SRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDVEDEENARHRERIIAINSQYEEQLAALR 180

Query: 181 AQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPENELGFLGI 226
           AQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGP N  G  G+
Sbjct: 181 AQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPGNPRGNSGV 226

BLAST of Cp4.1LG16g08230 vs. NCBI nr
Match: KAG6570459.1 (hypothetical protein SDJN03_29374, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 420 bits (1080), Expect = 1.36e-146
Identity = 206/209 (98.56%), Positives = 207/209 (99.04%), Query Frame = 0

Query: 79  NEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGDGLLPQNFDGLEQK 138
           N GQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGDGLLPQNFDGLEQK
Sbjct: 20  NIGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGDGLLPQNFDGLEQK 79

Query: 139 FIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSAR 198
           FIDDIINFSKEQ+DAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSAR
Sbjct: 80  FIDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSAR 139

Query: 199 HHQYQKGIRDHYPNGGIGPENELGFLGILCGILIWIRVYDTGSRLGSNGNITWHVSTYPL 258
           HHQYQKGIRDHYPNGGIGPENELGFLGIL GILIWIRVYDTGSRLGSNGNITWHVSTYPL
Sbjct: 140 HHQYQKGIRDHYPNGGIGPENELGFLGILRGILIWIRVYDTGSRLGSNGNITWHVSTYPL 199

Query: 259 PPDLDQQLQYGGLTFLSLFFCVIIINDIC 287
           PPDLDQQLQYGGLTFLSLFFCVIIINDIC
Sbjct: 200 PPDLDQQLQYGGLTFLSLFFCVIIINDIC 228

BLAST of Cp4.1LG16g08230 vs. NCBI nr
Match: XP_004140174.2 (uncharacterized protein LOC101221687 isoform X1 [Cucumis sativus] >XP_031744068.1 uncharacterized protein LOC101221687 isoform X1 [Cucumis sativus] >KAE8647365.1 hypothetical protein Csa_004279 [Cucumis sativus])

HSP 1 Score: 420 bits (1079), Expect = 4.47e-145
Identity = 223/308 (72.40%), Positives = 236/308 (76.62%), Query Frame = 0

Query: 1   MRQQGQYSDSGLGSYSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQW 60
           MRQQGQYSDSGLG+YSVSQMH VPSQ +EQSHPDPFEGRLEAFTPERE+SY+ASKNEDQW
Sbjct: 1   MRQQGQYSDSGLGAYSVSQMHHVPSQMVEQSHPDPFEGRLEAFTPEREHSYVASKNEDQW 60

Query: 61  RWERDESKMPNSMASHMFNEGQG--GDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKN 120
           RWERDESKMPNSM SHMFNEGQG  GD  RSYFQGQRPNPKL LEKGSN+D R QSHGKN
Sbjct: 61  RWERDESKMPNSMTSHMFNEGQGQGGDATRSYFQGQRPNPKLGLEKGSNNDPRSQSHGKN 120

Query: 121 MESRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAA 180
           MESRFGDG LPQNFDGLEQKFIDDII  +KEQNDAEDEENARHRERI+AIN+QYEEQLAA
Sbjct: 121 MESRFGDGPLPQNFDGLEQKFIDDIIKLTKEQNDAEDEENARHRERILAINAQYEEQLAA 180

Query: 181 LRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPENELG---------------- 240
           LR +HAGRRDELLRRES+AR HQYQKGI DHYPNGGIGP +  G                
Sbjct: 181 LRVRHAGRRDELLRRESTARQHQYQKGIMDHYPNGGIGPGDPRGNSGVTNLAASGQAHQN 240

Query: 241 --------------FLGILC--------GILIWIRVYDTGSRLGSNGNITWHVST-YPLP 267
                         FLG           G     RVYDT SRLGSNGNITWHV   + LP
Sbjct: 241 YESEHFDSFRERARFLGNSARDPNLDPRGSYPGGRVYDTASRLGSNGNITWHVVVHFALP 300

BLAST of Cp4.1LG16g08230 vs. NCBI nr
Match: XP_022943334.1 (uncharacterized protein LOC111448131 [Cucurbita moschata] >XP_022943335.1 uncharacterized protein LOC111448131 [Cucurbita moschata])

HSP 1 Score: 405 bits (1040), Expect = 1.80e-140
Identity = 199/207 (96.14%), Positives = 202/207 (97.58%), Query Frame = 0

Query: 20  MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQWRWERDESKMPNSMASHMFN 79
           MHQVPSQRMEQSHPDPFEGRLEAFTPERENSY+ASKNEDQWRWERDESKMPNSMASHMFN
Sbjct: 1   MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFN 60

Query: 80  EGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGDGLLPQNFDGLEQKF 139
           EGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGDGLLPQNFDGLEQKF
Sbjct: 61  EGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGDGLLPQNFDGLEQKF 120

Query: 140 IDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARH 199
           IDDIINFSKEQ+DAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARH
Sbjct: 121 IDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARH 180

Query: 200 HQYQKGIRDHYPNGGIGPENELGFLGI 226
           HQYQKGIRDHYPNGGIGP N  G  G+
Sbjct: 181 HQYQKGIRDHYPNGGIGPGNPRGSSGV 207

BLAST of Cp4.1LG16g08230 vs. ExPASy TrEMBL
Match: A0A6J1JI55 (uncharacterized protein LOC111484575 OS=Cucurbita maxima OX=3661 GN=LOC111484575 PE=4 SV=1)

HSP 1 Score: 429 bits (1104), Expect = 3.10e-150
Identity = 214/226 (94.69%), Positives = 217/226 (96.02%), Query Frame = 0

Query: 1   MRQQGQYSDSGLGSYSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQW 60
           MRQQ QYSDSGLGSYSVSQMHQVPSQR+EQSHPDPFEGRLEAFTPERENSY+ASKNEDQW
Sbjct: 1   MRQQRQYSDSGLGSYSVSQMHQVPSQRIEQSHPDPFEGRLEAFTPERENSYIASKNEDQW 60

Query: 61  RWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNME 120
           R ERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPK VLEKGSNSDLRFQSHGKNME
Sbjct: 61  RRERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKFVLEKGSNSDLRFQSHGKNME 120

Query: 121 SRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAALR 180
           SRFGDGLLPQNFDGLEQKFIDDIINFSKEQND EDEENARHRERIIAINSQYEEQLAALR
Sbjct: 121 SRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDVEDEENARHRERIIAINSQYEEQLAALR 180

Query: 181 AQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPENELGFLGI 226
           AQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGP N  G  G+
Sbjct: 181 AQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPGNPRGNSGV 226

BLAST of Cp4.1LG16g08230 vs. ExPASy TrEMBL
Match: A0A6J1FXP5 (uncharacterized protein LOC111448131 OS=Cucurbita moschata OX=3662 GN=LOC111448131 PE=4 SV=1)

HSP 1 Score: 405 bits (1040), Expect = 8.72e-141
Identity = 199/207 (96.14%), Positives = 202/207 (97.58%), Query Frame = 0

Query: 20  MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQWRWERDESKMPNSMASHMFN 79
           MHQVPSQRMEQSHPDPFEGRLEAFTPERENSY+ASKNEDQWRWERDESKMPNSMASHMFN
Sbjct: 1   MHQVPSQRMEQSHPDPFEGRLEAFTPERENSYIASKNEDQWRWERDESKMPNSMASHMFN 60

Query: 80  EGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGDGLLPQNFDGLEQKF 139
           EGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGDGLLPQNFDGLEQKF
Sbjct: 61  EGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNMESRFGDGLLPQNFDGLEQKF 120

Query: 140 IDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARH 199
           IDDIINFSKEQ+DAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARH
Sbjct: 121 IDDIINFSKEQSDAEDEENARHRERIIAINSQYEEQLAALRAQHAGRRDELLRRESSARH 180

Query: 200 HQYQKGIRDHYPNGGIGPENELGFLGI 226
           HQYQKGIRDHYPNGGIGP N  G  G+
Sbjct: 181 HQYQKGIRDHYPNGGIGPGNPRGSSGV 207

BLAST of Cp4.1LG16g08230 vs. ExPASy TrEMBL
Match: A0A1S3BME9 (uncharacterized protein LOC103491440 OS=Cucumis melo OX=3656 GN=LOC103491440 PE=4 SV=1)

HSP 1 Score: 406 bits (1044), Expect = 5.07e-140
Identity = 217/296 (73.31%), Positives = 227/296 (76.69%), Query Frame = 0

Query: 1   MRQQGQYSDSGLGSYSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQW 60
           MRQQGQYSDSGLG+YSVSQMH VPSQ +EQSHPDPFEGRLEAFTPERE+SY+ASKNEDQW
Sbjct: 1   MRQQGQYSDSGLGAYSVSQMHHVPSQMVEQSHPDPFEGRLEAFTPEREHSYVASKNEDQW 60

Query: 61  RWERDESKMPNSMASHMFNEGQG--GDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKN 120
           RWERDESKMPNSM SHMFNEGQG  GD  RSYFQGQRPNPKL LEKGSNSD R QSHGKN
Sbjct: 61  RWERDESKMPNSMTSHMFNEGQGQGGDATRSYFQGQRPNPKLGLEKGSNSDPRSQSHGKN 120

Query: 121 MESRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAA 180
           MESRFGDG LPQNFDGLEQKFIDDII  +KEQNDAEDEENARHRERI+AIN+QYEEQLAA
Sbjct: 121 MESRFGDGPLPQNFDGLEQKFIDDIIKLTKEQNDAEDEENARHRERILAINAQYEEQLAA 180

Query: 181 LRAQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPENELG---------------- 240
           LR +HAGRRDELLRRES+AR HQYQKGI DHYPNGGIGP +  G                
Sbjct: 181 LRVRHAGRRDELLRRESTARQHQYQKGIMDHYPNGGIGPGDPRGNSGVTNLAASGQAHQN 240

Query: 241 --------------FLGILC--------GILIWIRVYDTGSRLGSNGNITWHVSTY 256
                         FLG           G     RVYDT SRL SNGNITWHV  Y
Sbjct: 241 YESEHFDSFRERARFLGNSARDPNLDPRGSYPGGRVYDTVSRLCSNGNITWHVVVY 296

BLAST of Cp4.1LG16g08230 vs. ExPASy TrEMBL
Match: A0A6J1K145 (uncharacterized protein LOC111490759 OS=Cucurbita maxima OX=3661 GN=LOC111490759 PE=4 SV=1)

HSP 1 Score: 396 bits (1017), Expect = 1.76e-136
Identity = 197/226 (87.17%), Positives = 208/226 (92.04%), Query Frame = 0

Query: 1   MRQQGQYSDSGLGSYSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQW 60
           MRQQGQYSDSGL SYS SQMH VPSQR+EQSHPDPFEGRLEAFTPERENS++ASKNEDQW
Sbjct: 1   MRQQGQYSDSGLNSYSGSQMHHVPSQRVEQSHPDPFEGRLEAFTPERENSFVASKNEDQW 60

Query: 61  RWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNME 120
           RWERDESKMPNSMASHMFNEGQGGD  RSYFQGQRPN KLVLEKGSNSD R QSHGKNME
Sbjct: 61  RWERDESKMPNSMASHMFNEGQGGDATRSYFQGQRPNSKLVLEKGSNSDPRSQSHGKNME 120

Query: 121 SRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAALR 180
           +RFGDGLLPQNFDGLEQKFIDDII  +KEQNDAEDEENARHRERI+AIN+QYEEQLAALR
Sbjct: 121 NRFGDGLLPQNFDGLEQKFIDDIIKLAKEQNDAEDEENARHRERIMAINAQYEEQLAALR 180

Query: 181 AQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPENELGFLGI 226
           A+HAGRRDELL+RESSAR HQYQKGI DHY NGGIGP +  G  G+
Sbjct: 181 ARHAGRRDELLQRESSARQHQYQKGIMDHYMNGGIGPGDPRGNSGV 226

BLAST of Cp4.1LG16g08230 vs. ExPASy TrEMBL
Match: A0A6J1H5U1 (uncharacterized protein LOC111459838 OS=Cucurbita moschata OX=3662 GN=LOC111459838 PE=4 SV=1)

HSP 1 Score: 396 bits (1017), Expect = 1.76e-136
Identity = 197/226 (87.17%), Positives = 208/226 (92.04%), Query Frame = 0

Query: 1   MRQQGQYSDSGLGSYSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQW 60
           MRQQGQYSDSGL SYS SQMH VPSQR+EQSHPDPFEGRLEAFTPERENS++ASKNEDQW
Sbjct: 1   MRQQGQYSDSGLNSYSGSQMHHVPSQRVEQSHPDPFEGRLEAFTPERENSFVASKNEDQW 60

Query: 61  RWERDESKMPNSMASHMFNEGQGGDLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKNME 120
           RWERDESKMPNSMASHMFNEGQGGD  RSYFQGQRPN KLVLEKGSNSD R QSHGKNME
Sbjct: 61  RWERDESKMPNSMASHMFNEGQGGDATRSYFQGQRPNSKLVLEKGSNSDPRSQSHGKNME 120

Query: 121 SRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAALR 180
           +RFGDGLLPQNFDGLEQKFIDDII  +KEQNDAEDEENARHRERI+AIN+QYEEQLAALR
Sbjct: 121 NRFGDGLLPQNFDGLEQKFIDDIIKLAKEQNDAEDEENARHRERIMAINAQYEEQLAALR 180

Query: 181 AQHAGRRDELLRRESSARHHQYQKGIRDHYPNGGIGPENELGFLGI 226
           A+HAGRRDELL+RESSAR HQYQKGI DHY NGGIGP +  G  G+
Sbjct: 181 ARHAGRRDELLQRESSARQHQYQKGIMDHYMNGGIGPGDPRGNSGV 226

BLAST of Cp4.1LG16g08230 vs. TAIR 10
Match: AT5G22040.1 (unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 162.2 bits (409), Expect = 6.4e-40
Identity = 99/224 (44.20%), Positives = 141/224 (62.95%), Query Frame = 0

Query: 1   MRQQGQYSDSGLGS-YSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQ 60
           MR+QG ++DS   S Y   Q        ++  H D F+G+LEAFTPER+  Y  S+ E Q
Sbjct: 1   MRRQGNFADSSPASAYGAGQ--------IQDPHSD-FQGQLEAFTPERDQPYSDSQAEGQ 60

Query: 61  WRWERDESKMPNSMASHMFNEGQGG-DLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKN 120
           WRWERD   M   MA+ ++NEGQ G D  R+Y++GQ  +PK  +EK  +       H +N
Sbjct: 61  WRWERDGPNMSRPMATAVYNEGQQGVDSSRTYYRGQ-IDPKSGMEKQGSDPRAQPQHQEN 120

Query: 121 MESRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAA 180
            ++ + +    Q F+GLEQKF+DDI   +K+Q +AED E ARHRE+I  IN++YEEQLA 
Sbjct: 121 PKTGYDNNRGVQTFEGLEQKFMDDITRLAKDQIEAEDAEIARHREKINTINARYEEQLAT 180

Query: 181 LRAQHAGRRDELLRRESSARHHQYQK---GIRDHYPNGGIGPEN 220
           LRA+H G+R+E++R+ES AR  Q+++   G+ D Y    +G  N
Sbjct: 181 LRARHTGKREEIMRKESLARQQQFKQQTMGMMDQYHPNVVGQAN 214

BLAST of Cp4.1LG16g08230 vs. TAIR 10
Match: AT5G22040.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 162.2 bits (409), Expect = 6.4e-40
Identity = 99/224 (44.20%), Positives = 141/224 (62.95%), Query Frame = 0

Query: 1   MRQQGQYSDSGLGS-YSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQ 60
           MR+QG ++DS   S Y   Q        ++  H D F+G+LEAFTPER+  Y  S+ E Q
Sbjct: 1   MRRQGNFADSSPASAYGAGQ--------IQDPHSD-FQGQLEAFTPERDQPYSDSQAEGQ 60

Query: 61  WRWERDESKMPNSMASHMFNEGQGG-DLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKN 120
           WRWERD   M   MA+ ++NEGQ G D  R+Y++GQ  +PK  +EK  +       H +N
Sbjct: 61  WRWERDGPNMSRPMATAVYNEGQQGVDSSRTYYRGQ-IDPKSGMEKQGSDPRAQPQHQEN 120

Query: 121 MESRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAA 180
            ++ + +    Q F+GLEQKF+DDI   +K+Q +AED E ARHRE+I  IN++YEEQLA 
Sbjct: 121 PKTGYDNNRGVQTFEGLEQKFMDDITRLAKDQIEAEDAEIARHREKINTINARYEEQLAT 180

Query: 181 LRAQHAGRRDELLRRESSARHHQYQK---GIRDHYPNGGIGPEN 220
           LRA+H G+R+E++R+ES AR  Q+++   G+ D Y    +G  N
Sbjct: 181 LRARHTGKREEIMRKESLARQQQFKQQTMGMMDQYHPNVVGQAN 214

BLAST of Cp4.1LG16g08230 vs. TAIR 10
Match: AT5G22040.3 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown. )

HSP 1 Score: 162.2 bits (409), Expect = 6.4e-40
Identity = 99/224 (44.20%), Positives = 141/224 (62.95%), Query Frame = 0

Query: 1   MRQQGQYSDSGLGS-YSVSQMHQVPSQRMEQSHPDPFEGRLEAFTPERENSYLASKNEDQ 60
           MR+QG ++DS   S Y   Q        ++  H D F+G+LEAFTPER+  Y  S+ E Q
Sbjct: 1   MRRQGNFADSSPASAYGAGQ--------IQDPHSD-FQGQLEAFTPERDQPYSDSQAEGQ 60

Query: 61  WRWERDESKMPNSMASHMFNEGQGG-DLRRSYFQGQRPNPKLVLEKGSNSDLRFQSHGKN 120
           WRWERD   M   MA+ ++NEGQ G D  R+Y++GQ  +PK  +EK  +       H +N
Sbjct: 61  WRWERDGPNMSRPMATAVYNEGQQGVDSSRTYYRGQ-IDPKSGMEKQGSDPRAQPQHQEN 120

Query: 121 MESRFGDGLLPQNFDGLEQKFIDDIINFSKEQNDAEDEENARHRERIIAINSQYEEQLAA 180
            ++ + +    Q F+GLEQKF+DDI   +K+Q +AED E ARHRE+I  IN++YEEQLA 
Sbjct: 121 PKTGYDNNRGVQTFEGLEQKFMDDITRLAKDQIEAEDAEIARHREKINTINARYEEQLAT 180

Query: 181 LRAQHAGRRDELLRRESSARHHQYQK---GIRDHYPNGGIGPEN 220
           LRA+H G+R+E++R+ES AR  Q+++   G+ D Y    +G  N
Sbjct: 181 LRARHTGKREEIMRKESLARQQQFKQQTMGMMDQYHPNVVGQAN 214

The following BLAST results are available for this feature:
Match Name      E-value    Identity (%)  Description
XP_023512046.1  1.48e-155  97.35         uncharacterized protein LOC111776883 [Cucurbita pepo subsp. pepo]
XP_022986988.1  6.40e-150  94.69         uncharacterized protein LOC111484575 [Cucurbita maxima]
KAG6570459.1    1.36e-146  98.56         hypothetical protein SDJN03_29374, partial [Cucurbita argyrosperma subsp. sorori...
XP_004140174.2  4.47e-145  72.40         uncharacterized protein LOC101221687 isoform X1 [Cucumis sativus] >XP_031744068....
XP_022943334.1  1.80e-140  96.14         uncharacterized protein LOC111448131 [Cucurbita moschata] >XP_022943335.1 unchar...

Match Name      E-value    Identity (%)  Description
A0A6J1JI55      3.10e-150  94.69         uncharacterized protein LOC111484575 OS=Cucurbita maxima OX=3661 GN=LOC111484575...
A0A6J1FXP5      8.72e-141  96.14         uncharacterized protein LOC111448131 OS=Cucurbita moschata OX=3662 GN=LOC1114481...
A0A1S3BME9      5.07e-140  73.31         uncharacterized protein LOC103491440 OS=Cucumis melo OX=3656 GN=LOC103491440 PE=...
A0A6J1K145      1.76e-136  87.17         uncharacterized protein LOC111490759 OS=Cucurbita maxima OX=3661 GN=LOC111490759...
A0A6J1H5U1      1.76e-136  87.17         uncharacterized protein LOC111459838 OS=Cucurbita moschata OX=3662 GN=LOC1114598...

Match Name      E-value    Identity (%)  Description
AT5G22040.1     6.4e-40    44.20         unknown protein; Has 30201 Blast hits to 17322 proteins in 780 species: Archae -...
AT5G22040.2     6.4e-40    44.20         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G22040.3     6.4e-40    44.20         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term    Source Description    Alignment
None      No IPR available  COILS        Coil           Coil                  coord: 144..189
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 1..29
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 1..62
None      No IPR available  PANTHER      PTHR34210      OS01G0252900 PROTEIN  coord: 1..224
None      No IPR available  PANTHER      PTHR34210:SF1  SUBFAMILY NOT NAMED   coord: 1..224

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG16g08230.1  Cp4.1LG16g08230.1  mRNA