Cp4.1LG20g02090 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG20g02090
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionNuclear mitotic apparatus 1
LocationCp4.1LG20: 1241165 .. 1245011 (+)
RNA-Seq ExpressionCp4.1LG20g02090
SyntenyCp4.1LG20g02090
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAGGAGTAGGTGGTGTAATTACATGAGAGTTGGGGGGTATTATGGTAATTAGGGTTGGACTTTGGCCTTTCAGTTCGTAACAGACAGCTGATGTTAAAGCAAGGGGGGGGGGAAACAGAACAAACCTCCCCTCTTATTCTTTCTTTTTTCTTCTTTCTTCACAACTCCGTCCAAAAACCAGCAAAGAAAGGACTCCATACCTCACTGATAATGGAAGACTCTGAGGTAATTTTCCTTTTAATCGCCGCTCTTTGTTCGTTGTTTGTTTCCCCTTTTTATTTTGGGTTTCTCTTTGCTGGGATTTGTGGCTTTTTGGGGGAATGTGTAAGAGCGGATGATTTTGATCTTATTGAAGTTTGTTTTACTGCCCTCTACTAGGTTGTCGGCGGTGTGTTTCGTTTGATTTTGTAATGGTTTTTGTTTATTTTCAGAAATTGACGGCCCTGAAGAAGGCCTACGCCGATATAATCCTTAATACGGCGAAAGAGGCGGCGGCGCGTATTATGGTTTCAGAAAGGAACGCTATTCGGTTTCAGCAGGAGCTGTCCACTTCTAAAGATGAGGCGTTTCGGATACTGCTTAGACTTAAGCAAATGCTGGATTCTAAGGTATTATGCTTTATTAGCTTAAATGGGGTTGATTCTTCTCTTTGTTTTATTTTGTTTGCCCATTTCTTGTTTAAAATTGTTTTGTGTTCTTCCGATCACGTTCTTATGCGGATGGCTTGGTGATTTTCATCTGTACTGGGTTGTTGTTAACCGTTTTGAGATGAAAATTTTCTATCTTTACGCTAGTTGGTAATCTTTAATGTTGGATAATGGATGTTGTTCATGCATAATGCTCAAAGCATCCTATTGTTTGCATCAATTGTTGCCATTTTTTAATCAGGCATTCAACCAGTTGTTGCTGTTTGCTGGCTGTTTGTTGCTTAAATCAACTATTGCGTAGTAAACAATTTAAGTCGGCAGTTGCTAAAGTTGATTTAATGTTTTGCTCATGAATTAATGATTTCATGAACAGTCATTTAAATTTTTAATAGGAATTTGGACTGAATGTTTGTGTTAATTTGTCCAAAATCCTCTTTGATTTGTGTCACAAGAACTAACTGTTTTCTCCTTGTGGAGTTTCTTTCCAAGGTTAACGCTTTAATTCTCATGGTCTTATTCCAATCATTTTCACCTTTGGAGTGGAGGCATGCACAATCAAAAGCCTAACCGATATTGCCCCGATTGATTCATTGTTAGTTCGGTTTAATAGTGTTTAAATGGAGTTCTTAATTGTTGTATCTTTCTCAGCGTAACCAAAGAAGGAAACTTCAGTTCCTTCTGATACTTATCTATGTTTTGTGAATAAGTGTGGAGCTGAAATATTCCATTTTTCAGGCCAGTGAGGCAGAAATGGTGTCTTTAAACCAAAAGAAAAAGATCGAAGAGCTCGAAGCCCAGCTTGAGGAAGCTGAGGACATAGTGAGGGAACTTAGAGCTCAGTTGCAGGACGTTCAAGATGAATTAGAACATGTGAGGAACAAAAACGTAGAGCCCCTAGATGAGCAAAATTTGGCCGGTAACGTTGCATCTCGAGGGGACTTGCCAAACTCACATGAGAGGATTGCACCTTGTAACATAAGTTCAACTTTAAATGGGACATGCCATGACGACAGTTGGCCTGAAAGTAAGAATGACTTGCAGATGGATAATGGCCAAGTTCATGGGGATTTTACCTCCATGGTGATGAGAAACAAAGAACCCGAGCTTTACAGAAATGGCTGCACTCAGAGAGTACGTGCATTCGAAAGAAAATTGTTCGATGGAAAAGTCTGTTCGACTGGACAAGCTGAAGATGTCAAGAATAGGGTATGCAATATGGGTGAAGAAGAGGGTAAACTGATGTGTAAAACAAACATTACCAAGGCTGATAATATCTGTGGAGAAAGGAAGAATTCTAGTGAGATCAAGACATTGCCTAAGCTATTAAGTAGGGATACTCAAGTTCCAATCATTAAATCCCTCCGTCGAAAAAGAAAACGAGCTACTAGATACAACAAAAAGAAAACTTTACCCATTTTTTATGACATTGCTAAACAATGCAACTCGCCTGATCTTCACTGCTCAGAAAGTCCTTCAGTGGATAACGATGATACCGGGAAATGTCTGTCACAAAATGAAATTGATAGTCAGAATGGTTTAATTTTGCTCTCTACCCCTGTACTGTCTGAAATCAATGAAATGTCAACACCTTCGGGTTGTCCCGACGACAGTGAGGGAGGTGATGCAGTAATAAATGATTGTCCTATTAGGAACGTGACAGACTACGATACAGCAGCTGTAGGTAAATCGGACTTCACTAGCCAGGAAAGTTTATGTGAGGAGAATTTGGAGGCTTCTGCTTATAAATTAGACGTTGATCCAGTTAAGGAGTCATCAGTTAAGTCGGACGTGAAAGACTCCGATGTAGTTGATGAGATTCCTAGTCAACATTCTAATAATAAAGTTCTCATGTATACATTCCGAAGAAAGCGAAAGAAGGAATCTTTGAGCAGCCCCGATGGTAAATCATCCATTGATGAAAGCATATCAAAGAAAAGGATGAGGGATAAACAAAGTGTATCGTCGGAAGCAGACAAGTTTAGCTTAATGACCGAATCATCTCGGGATAACAGGCGGTTAGCACAGGTTGCTCGGCAGGTTAGTTTTCTATCTACTTTATCATGATGGTATGCCCTTCTGATTTGTTTCAAAATGTCTGGATAATCAATGTCTGCGAAATTTACTTTGCCACTCTACTTGCCTCTCTGCCCATTTCTCAGGATTAAATTTTATGCTTGTTGTTTCATAATTTTATCCCTTCTGTTCAAATTATTTCTTCATTAGTTATGATTCCAGCGTATGGTTATCGTTCTTGTTGATTTGATATTGAATCCTTAATATGCAATGATGACGTTTGGGTACCGCCCCTTCTTTTGAAAACCTTGCTCTTGTTAAATCTACACTTCGAGGAGCTGACAGCCCAGAGCATTGAGAAACGTCTTATAAGCAACGAGGGCGATCTTTGGACATGTTTCTTGACTTTGATGCCTTGTGTTTCCTGAATATTTGAATTTCGAATTCTCTTCGGTTTGCATGAGATCCACATTGGTTGGAGAGGGGAACAAAACATTCCTTATAAGGGTGTGGAAACCTCTCCCTAGTAGATGCGTTTTAAAATCGTGAGACTGACAGTGATACGTAACGGGTCAACGTGAACAATATCAGCTAACAGTGGACTTGAACTGTTAAGTCTTAACAGTAGTAGTCCATTAACTCATTTATAAGATATCAATTTAACTTCAGATTCTATGAAGAACTCCAATTATATATATCTATATATGTATATCAAAGCTACCTTATTTACTAATGTCCTCCTGAAAATTGTGTTTCTGCAGCTCATATCTTTGTCTGAGAAGAAGTGGCGGTAATAGAACTCTGGCTGCCCTGCAATTCATAATTGTTGCTTGAGAAATAAAATCAGCTGCAACAAAAAAGAGAACGCTGAATGGGCAAGTGAAGTGGTTTTCCAAGCAAGTACCACAGGCCATGAGCAAGTACCACAGGCCATGAGCAAGTACCACAGGCCATGAGCAAGTACCATATGGGTGCTGCTGCTGTATGTGCACTAACTTATGAATGTTGTTTATGTAATCTTTATATCCTTCCCTTCATTACTATATTAAAAGCTTGAATCTTGGCATTTTTCTTTCCCTTGGGAAAGCTTAATTATGTTCCAATGTTCACATGGTATTTATTTCTCTGTAATACTTGTTCCCCGAGTCGTTCCTTCGTGCCTTACCTTACACTTATATGACGTCTAC

mRNA sequence

AAGGAGTAGGTGGTGTAATTACATGAGAGTTGGGGGGTATTATGGTAATTAGGGTTGGACTTTGGCCTTTCAGTTCGTAACAGACAGCTGATGTTAAAGCAAGGGGGGGGGGAAACAGAACAAACCTCCCCTCTTATTCTTTCTTTTTTCTTCTTTCTTCACAACTCCGTCCAAAAACCAGCAAAGAAAGGACTCCATACCTCACTGATAATGGAAGACTCTGAGAAATTGACGGCCCTGAAGAAGGCCTACGCCGATATAATCCTTAATACGGCGAAAGAGGCGGCGGCGCGTATTATGGTTTCAGAAAGGAACGCTATTCGGTTTCAGCAGGAGCTGTCCACTTCTAAAGATGAGGCGTTTCGGATACTGCTTAGACTTAAGCAAATGCTGGATTCTAAGGCCAGTGAGGCAGAAATGGTGTCTTTAAACCAAAAGAAAAAGATCGAAGAGCTCGAAGCCCAGCTTGAGGAAGCTGAGGACATAGTGAGGGAACTTAGAGCTCAGTTGCAGGACGTTCAAGATGAATTAGAACATGTGAGGAACAAAAACGTAGAGCCCCTAGATGAGCAAAATTTGGCCGGTAACGTTGCATCTCGAGGGGACTTGCCAAACTCACATGAGAGGATTGCACCTTGTAACATAAGTTCAACTTTAAATGGGACATGCCATGACGACAGTTGGCCTGAAAGTAAGAATGACTTGCAGATGGATAATGGCCAAGTTCATGGGGATTTTACCTCCATGGTGATGAGAAACAAAGAACCCGAGCTTTACAGAAATGGCTGCACTCAGAGAGTACGTGCATTCGAAAGAAAATTGTTCGATGGAAAAGTCTGTTCGACTGGACAAGCTGAAGATGTCAAGAATAGGGTATGCAATATGGGTGAAGAAGAGGGTAAACTGATGTGTAAAACAAACATTACCAAGGCTGATAATATCTGTGGAGAAAGGAAGAATTCTAGTGAGATCAAGACATTGCCTAAGCTATTAAGTAGGGATACTCAAGTTCCAATCATTAAATCCCTCCGTCGAAAAAGAAAACGAGCTACTAGATACAACAAAAAGAAAACTTTACCCATTTTTTATGACATTGCTAAACAATGCAACTCGCCTGATCTTCACTGCTCAGAAAGTCCTTCAGTGGATAACGATGATACCGGGAAATGTCTGTCACAAAATGAAATTGATAGTCAGAATGGTTTAATTTTGCTCTCTACCCCTGTACTGTCTGAAATCAATGAAATGTCAACACCTTCGGGTTGTCCCGACGACAGTGAGGGAGGTGATGCAGTAATAAATGATTGTCCTATTAGGAACGTGACAGACTACGATACAGCAGCTGTAGGTAAATCGGACTTCACTAGCCAGGAAAGTTTATGTGAGGAGAATTTGGAGGCTTCTGCTTATAAATTAGACGTTGATCCAGTTAAGGAGTCATCAGTTAAGTCGGACGTGAAAGACTCCGATGTAGTTGATGAGATTCCTAGTCAACATTCTAATAATAAAGTTCTCATGTATACATTCCGAAGAAAGCGAAAGAAGGAATCTTTGAGCAGCCCCGATGGTAAATCATCCATTGATGAAAGCATATCAAAGAAAAGGATGAGGGATAAACAAAGTGTATCGTCGGAAGCAGACAAGTTTAGCTTAATGACCGAATCATCTCGGGATAACAGGCGGTTAGCACAGGTTGCTCGGCAGCTCATATCTTTGTCTGAGAAGAAGTGGCGGTAATAGAACTCTGGCTGCCCTGCAATTCATAATTGTTGCTTGAGAAATAAAATCAGCTGCAACAAAAAAGAGAACGCTGAATGGGCAAGTGAAGTGGTTTTCCAAGCAAGTACCACAGGCCATGAGCAAGTACCACAGGCCATGAGCAAGTACCACAGGCCATGAGCAAGTACCATATGGGTGCTGCTGCTGTATGTGCACTAACTTATGAATGTTGTTTATGTAATCTTTATATCCTTCCCTTCATTACTATATTAAAAGCTTGAATCTTGGCATTTTTCTTTCCCTTGGGAAAGCTTAATTATGTTCCAATGTTCACATGGTATTTATTTCTCTGTAATACTTGTTCCCCGAGTCGTTCCTTCGTGCCTTACCTTACACTTATATGACGTCTAC

Coding sequence (CDS)

ATGTTAAAGCAAGGGGGGGGGGAAACAGAACAAACCTCCCCTCTTATTCTTTCTTTTTTCTTCTTTCTTCACAACTCCGTCCAAAAACCAGCAAAGAAAGGACTCCATACCTCACTGATAATGGAAGACTCTGAGAAATTGACGGCCCTGAAGAAGGCCTACGCCGATATAATCCTTAATACGGCGAAAGAGGCGGCGGCGCGTATTATGGTTTCAGAAAGGAACGCTATTCGGTTTCAGCAGGAGCTGTCCACTTCTAAAGATGAGGCGTTTCGGATACTGCTTAGACTTAAGCAAATGCTGGATTCTAAGGCCAGTGAGGCAGAAATGGTGTCTTTAAACCAAAAGAAAAAGATCGAAGAGCTCGAAGCCCAGCTTGAGGAAGCTGAGGACATAGTGAGGGAACTTAGAGCTCAGTTGCAGGACGTTCAAGATGAATTAGAACATGTGAGGAACAAAAACGTAGAGCCCCTAGATGAGCAAAATTTGGCCGGTAACGTTGCATCTCGAGGGGACTTGCCAAACTCACATGAGAGGATTGCACCTTGTAACATAAGTTCAACTTTAAATGGGACATGCCATGACGACAGTTGGCCTGAAAGTAAGAATGACTTGCAGATGGATAATGGCCAAGTTCATGGGGATTTTACCTCCATGGTGATGAGAAACAAAGAACCCGAGCTTTACAGAAATGGCTGCACTCAGAGAGTACGTGCATTCGAAAGAAAATTGTTCGATGGAAAAGTCTGTTCGACTGGACAAGCTGAAGATGTCAAGAATAGGGTATGCAATATGGGTGAAGAAGAGGGTAAACTGATGTGTAAAACAAACATTACCAAGGCTGATAATATCTGTGGAGAAAGGAAGAATTCTAGTGAGATCAAGACATTGCCTAAGCTATTAAGTAGGGATACTCAAGTTCCAATCATTAAATCCCTCCGTCGAAAAAGAAAACGAGCTACTAGATACAACAAAAAGAAAACTTTACCCATTTTTTATGACATTGCTAAACAATGCAACTCGCCTGATCTTCACTGCTCAGAAAGTCCTTCAGTGGATAACGATGATACCGGGAAATGTCTGTCACAAAATGAAATTGATAGTCAGAATGGTTTAATTTTGCTCTCTACCCCTGTACTGTCTGAAATCAATGAAATGTCAACACCTTCGGGTTGTCCCGACGACAGTGAGGGAGGTGATGCAGTAATAAATGATTGTCCTATTAGGAACGTGACAGACTACGATACAGCAGCTGTAGGTAAATCGGACTTCACTAGCCAGGAAAGTTTATGTGAGGAGAATTTGGAGGCTTCTGCTTATAAATTAGACGTTGATCCAGTTAAGGAGTCATCAGTTAAGTCGGACGTGAAAGACTCCGATGTAGTTGATGAGATTCCTAGTCAACATTCTAATAATAAAGTTCTCATGTATACATTCCGAAGAAAGCGAAAGAAGGAATCTTTGAGCAGCCCCGATGGTAAATCATCCATTGATGAAAGCATATCAAAGAAAAGGATGAGGGATAAACAAAGTGTATCGTCGGAAGCAGACAAGTTTAGCTTAATGACCGAATCATCTCGGGATAACAGGCGGTTAGCACAGGTTGCTCGGCAGCTCATATCTTTGTCTGAGAAGAAGTGGCGGTAA

Protein sequence

MLKQGGGETEQTSPLILSFFFFLHNSVQKPAKKGLHTSLIMEDSEKLTALKKAYADIILNTAKEAAARIMVSERNAIRFQQELSTSKDEAFRILLRLKQMLDSKASEAEMVSLNQKKKIEELEAQLEEAEDIVRELRAQLQDVQDELEHVRNKNVEPLDEQNLAGNVASRGDLPNSHERIAPCNISSTLNGTCHDDSWPESKNDLQMDNGQVHGDFTSMVMRNKEPELYRNGCTQRVRAFERKLFDGKVCSTGQAEDVKNRVCNMGEEEGKLMCKTNITKADNICGERKNSSEIKTLPKLLSRDTQVPIIKSLRRKRKRATRYNKKKTLPIFYDIAKQCNSPDLHCSESPSVDNDDTGKCLSQNEIDSQNGLILLSTPVLSEINEMSTPSGCPDDSEGGDAVINDCPIRNVTDYDTAAVGKSDFTSQESLCEENLEASAYKLDVDPVKESSVKSDVKDSDVVDEIPSQHSNNKVLMYTFRRKRKKESLSSPDGKSSIDESISKKRMRDKQSVSSEADKFSLMTESSRDNRRLAQVARQLISLSEKKWR
Homology
BLAST of Cp4.1LG20g02090 vs. NCBI nr
Match: XP_022924190.1 (uncharacterized protein LOC111431711 [Cucurbita moschata])

HSP 1 Score: 1013 bits (2619), Expect = 0.0
Identity = 532/548 (97.08%), Postives = 541/548 (98.72%), Query Frame = 0

Query: 1   MLKQGGGETEQTSPLILSFFFFLHNSVQKPAKKGLHTSLIMEDSEKLTALKKAYADIILN 60
           MLKQGG ETEQTSPLILS FFFLHNSVQKPAKKGLHTSLIMEDSEKLTALKKAYADIILN
Sbjct: 1   MLKQGG-ETEQTSPLILSLFFFLHNSVQKPAKKGLHTSLIMEDSEKLTALKKAYADIILN 60

Query: 61  TAKEAAARIMVSERNAIRFQQELSTSKDEAFRILLRLKQMLDSKASEAEMVSLNQKKKIE 120
           TAKEAAARIMVSERNAIRFQQELSTSKDEAFRILLRLKQMLDSKASEAE VSLNQK+KIE
Sbjct: 61  TAKEAAARIMVSERNAIRFQQELSTSKDEAFRILLRLKQMLDSKASEAETVSLNQKRKIE 120

Query: 121 ELEAQLEEAEDIVRELRAQLQDVQDELEHVRNKNVEPLDEQNLAGNVASRGDLPNSHERI 180
           ELEAQLEEAEDIVRELRAQLQDVQDELEHVRNKNVEPLDEQNLAGN+ASRGDLPNSHERI
Sbjct: 121 ELEAQLEEAEDIVRELRAQLQDVQDELEHVRNKNVEPLDEQNLAGNIASRGDLPNSHERI 180

Query: 181 APCNISSTLNGTCHDDSWPESKNDLQMDNGQVHGDFTSMVMRNKEPELYRNGCTQRVRAF 240
           APCNISSTLNGTC DDSWPESKN+L MDNGQVHGDFTSMVMRNKEPELYRNGCTQRVRAF
Sbjct: 181 APCNISSTLNGTCRDDSWPESKNELHMDNGQVHGDFTSMVMRNKEPELYRNGCTQRVRAF 240

Query: 241 ERKLFDGKVCSTGQAEDVKNRVCNMGEEEGKLMCKTNITKADNICGERKNSSEIKTLPKL 300
           ERKLFDGKVCSTGQAEDVKNRVCNMGEEEGKLMCKTNITKADNICGERKNS+EIKTLPKL
Sbjct: 241 ERKLFDGKVCSTGQAEDVKNRVCNMGEEEGKLMCKTNITKADNICGERKNSNEIKTLPKL 300

Query: 301 LSRDTQVPIIKSLRRKRKRATRYNKKKTLPIFYDIAKQCNSPDLHCSESPSVDNDDTGKC 360
           LSRDTQVPIIKSLRRKRKRATRYNKKKTLPIF DIAKQCNSPDLHCSESPSVDNDDTGKC
Sbjct: 301 LSRDTQVPIIKSLRRKRKRATRYNKKKTLPIFDDIAKQCNSPDLHCSESPSVDNDDTGKC 360

Query: 361 LSQNEIDSQNGLILLSTPVLSEINEMSTPSGCPDDSEGGDAVINDCPIRNVTDYDTAAVG 420
           LSQNEIDSQNGLILL+TPVLSEINEMSTPSGCPDDSEGGDAVIN+CPIRNVTDYDTAAVG
Sbjct: 361 LSQNEIDSQNGLILLATPVLSEINEMSTPSGCPDDSEGGDAVINNCPIRNVTDYDTAAVG 420

Query: 421 KSDFTSQESLCEENLEASAYKLDVDPVKESSVKSDVKDSDVVDEIPSQHSNNKVLMYTFR 480
           KS+FTSQESLC ENLEASAYKLDVDPVKESSVKSDVKDSDVVDEIPSQHSNNKVLMYTFR
Sbjct: 421 KSNFTSQESLCGENLEASAYKLDVDPVKESSVKSDVKDSDVVDEIPSQHSNNKVLMYTFR 480

Query: 481 RKRKKESLSSPDGKSSIDESISKKRMRDKQSVSSEADKFSLMTESSRDNRRLAQVARQLI 540
           RKRKKESLSSPDGKSSIDESISKKRM+DKQSVSSE+DKFSLMTESSRDNRRLAQVARQLI
Sbjct: 481 RKRKKESLSSPDGKSSIDESISKKRMKDKQSVSSESDKFSLMTESSRDNRRLAQVARQLI 540

Query: 541 SLSEKKWR 548
           SLSEKKWR
Sbjct: 541 SLSEKKWR 547

BLAST of Cp4.1LG20g02090 vs. NCBI nr
Match: KAG6584245.1 (hypothetical protein SDJN03_20177, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1011 bits (2614), Expect = 0.0
Identity = 531/548 (96.90%), Postives = 540/548 (98.54%), Query Frame = 0

Query: 1   MLKQGGGETEQTSPLILSFFFFLHNSVQKPAKKGLHTSLIMEDSEKLTALKKAYADIILN 60
           MLKQGG ETEQTSPLILS FFFLHNSV+KPAKKGLHTSLIMEDSEKLTALKKAYADIILN
Sbjct: 30  MLKQGG-ETEQTSPLILSLFFFLHNSVKKPAKKGLHTSLIMEDSEKLTALKKAYADIILN 89

Query: 61  TAKEAAARIMVSERNAIRFQQELSTSKDEAFRILLRLKQMLDSKASEAEMVSLNQKKKIE 120
           TAKEAAARIMVSERNAIRFQQELSTSKDEAFRILLRLKQMLDSKASEAE VSLNQK+KIE
Sbjct: 90  TAKEAAARIMVSERNAIRFQQELSTSKDEAFRILLRLKQMLDSKASEAETVSLNQKRKIE 149

Query: 121 ELEAQLEEAEDIVRELRAQLQDVQDELEHVRNKNVEPLDEQNLAGNVASRGDLPNSHERI 180
           ELEAQLEEAEDIVRELRAQLQDVQDELEHVRNKNVEPLDEQNLAGN+ASRGDLPNSHERI
Sbjct: 150 ELEAQLEEAEDIVRELRAQLQDVQDELEHVRNKNVEPLDEQNLAGNIASRGDLPNSHERI 209

Query: 181 APCNISSTLNGTCHDDSWPESKNDLQMDNGQVHGDFTSMVMRNKEPELYRNGCTQRVRAF 240
           APCNISSTLNGTC DDSWPESKN+L MDNGQVHGDFTSMVMRNKEPELYRNGCTQRVRAF
Sbjct: 210 APCNISSTLNGTCRDDSWPESKNELHMDNGQVHGDFTSMVMRNKEPELYRNGCTQRVRAF 269

Query: 241 ERKLFDGKVCSTGQAEDVKNRVCNMGEEEGKLMCKTNITKADNICGERKNSSEIKTLPKL 300
           ERKLFDGKVCSTGQAEDVKNRVCNMGEEEGKLMCKTNITKADNICGERKNS+EIKTLPKL
Sbjct: 270 ERKLFDGKVCSTGQAEDVKNRVCNMGEEEGKLMCKTNITKADNICGERKNSNEIKTLPKL 329

Query: 301 LSRDTQVPIIKSLRRKRKRATRYNKKKTLPIFYDIAKQCNSPDLHCSESPSVDNDDTGKC 360
           LSRDTQVPIIKSLRRKRKRATRYNKKKTLPIF DIAKQCNSPDLHCSESPSVDNDDTGKC
Sbjct: 330 LSRDTQVPIIKSLRRKRKRATRYNKKKTLPIFDDIAKQCNSPDLHCSESPSVDNDDTGKC 389

Query: 361 LSQNEIDSQNGLILLSTPVLSEINEMSTPSGCPDDSEGGDAVINDCPIRNVTDYDTAAVG 420
           LSQNEIDSQNGLILL+TPVLSEINEMSTPSGCPDDSEGGDAVINDCPIRNVTDYDTAAVG
Sbjct: 390 LSQNEIDSQNGLILLATPVLSEINEMSTPSGCPDDSEGGDAVINDCPIRNVTDYDTAAVG 449

Query: 421 KSDFTSQESLCEENLEASAYKLDVDPVKESSVKSDVKDSDVVDEIPSQHSNNKVLMYTFR 480
           KS+FTSQESLC ENLEASAYKLDVDPVKESSVK DVKDSDVVDEIPSQHSNNKVLMYTFR
Sbjct: 450 KSNFTSQESLCGENLEASAYKLDVDPVKESSVKLDVKDSDVVDEIPSQHSNNKVLMYTFR 509

Query: 481 RKRKKESLSSPDGKSSIDESISKKRMRDKQSVSSEADKFSLMTESSRDNRRLAQVARQLI 540
           RKRKKESLSSPDGKSSIDESISKKRM+DKQSVSSE+DKFSLMTESSRDNRRLAQVARQLI
Sbjct: 510 RKRKKESLSSPDGKSSIDESISKKRMKDKQSVSSESDKFSLMTESSRDNRRLAQVARQLI 569

Query: 541 SLSEKKWR 548
           SLSEKKWR
Sbjct: 570 SLSEKKWR 576

BLAST of Cp4.1LG20g02090 vs. NCBI nr
Match: XP_023520373.1 (uncharacterized protein LOC111783687 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 968 bits (2502), Expect = 0.0
Identity = 508/508 (100.00%), Postives = 508/508 (100.00%), Query Frame = 0

Query: 41  MEDSEKLTALKKAYADIILNTAKEAAARIMVSERNAIRFQQELSTSKDEAFRILLRLKQM 100
           MEDSEKLTALKKAYADIILNTAKEAAARIMVSERNAIRFQQELSTSKDEAFRILLRLKQM
Sbjct: 1   MEDSEKLTALKKAYADIILNTAKEAAARIMVSERNAIRFQQELSTSKDEAFRILLRLKQM 60

Query: 101 LDSKASEAEMVSLNQKKKIEELEAQLEEAEDIVRELRAQLQDVQDELEHVRNKNVEPLDE 160
           LDSKASEAEMVSLNQKKKIEELEAQLEEAEDIVRELRAQLQDVQDELEHVRNKNVEPLDE
Sbjct: 61  LDSKASEAEMVSLNQKKKIEELEAQLEEAEDIVRELRAQLQDVQDELEHVRNKNVEPLDE 120

Query: 161 QNLAGNVASRGDLPNSHERIAPCNISSTLNGTCHDDSWPESKNDLQMDNGQVHGDFTSMV 220
           QNLAGNVASRGDLPNSHERIAPCNISSTLNGTCHDDSWPESKNDLQMDNGQVHGDFTSMV
Sbjct: 121 QNLAGNVASRGDLPNSHERIAPCNISSTLNGTCHDDSWPESKNDLQMDNGQVHGDFTSMV 180

Query: 221 MRNKEPELYRNGCTQRVRAFERKLFDGKVCSTGQAEDVKNRVCNMGEEEGKLMCKTNITK 280
           MRNKEPELYRNGCTQRVRAFERKLFDGKVCSTGQAEDVKNRVCNMGEEEGKLMCKTNITK
Sbjct: 181 MRNKEPELYRNGCTQRVRAFERKLFDGKVCSTGQAEDVKNRVCNMGEEEGKLMCKTNITK 240

Query: 281 ADNICGERKNSSEIKTLPKLLSRDTQVPIIKSLRRKRKRATRYNKKKTLPIFYDIAKQCN 340
           ADNICGERKNSSEIKTLPKLLSRDTQVPIIKSLRRKRKRATRYNKKKTLPIFYDIAKQCN
Sbjct: 241 ADNICGERKNSSEIKTLPKLLSRDTQVPIIKSLRRKRKRATRYNKKKTLPIFYDIAKQCN 300

Query: 341 SPDLHCSESPSVDNDDTGKCLSQNEIDSQNGLILLSTPVLSEINEMSTPSGCPDDSEGGD 400
           SPDLHCSESPSVDNDDTGKCLSQNEIDSQNGLILLSTPVLSEINEMSTPSGCPDDSEGGD
Sbjct: 301 SPDLHCSESPSVDNDDTGKCLSQNEIDSQNGLILLSTPVLSEINEMSTPSGCPDDSEGGD 360

Query: 401 AVINDCPIRNVTDYDTAAVGKSDFTSQESLCEENLEASAYKLDVDPVKESSVKSDVKDSD 460
           AVINDCPIRNVTDYDTAAVGKSDFTSQESLCEENLEASAYKLDVDPVKESSVKSDVKDSD
Sbjct: 361 AVINDCPIRNVTDYDTAAVGKSDFTSQESLCEENLEASAYKLDVDPVKESSVKSDVKDSD 420

Query: 461 VVDEIPSQHSNNKVLMYTFRRKRKKESLSSPDGKSSIDESISKKRMRDKQSVSSEADKFS 520
           VVDEIPSQHSNNKVLMYTFRRKRKKESLSSPDGKSSIDESISKKRMRDKQSVSSEADKFS
Sbjct: 421 VVDEIPSQHSNNKVLMYTFRRKRKKESLSSPDGKSSIDESISKKRMRDKQSVSSEADKFS 480

Query: 521 LMTESSRDNRRLAQVARQLISLSEKKWR 548
           LMTESSRDNRRLAQVARQLISLSEKKWR
Sbjct: 481 LMTESSRDNRRLAQVARQLISLSEKKWR 508

BLAST of Cp4.1LG20g02090 vs. NCBI nr
Match: KAG7019841.1 (hypothetical protein SDJN02_18805 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 935 bits (2417), Expect = 0.0
Identity = 492/508 (96.85%), Postives = 500/508 (98.43%), Query Frame = 0

Query: 41  MEDSEKLTALKKAYADIILNTAKEAAARIMVSERNAIRFQQELSTSKDEAFRILLRLKQM 100
           MEDSEKLTALKKAYADIILNTAKEAAARIMVSERNAIRFQQELSTSKDEAFRILLRLKQM
Sbjct: 1   MEDSEKLTALKKAYADIILNTAKEAAARIMVSERNAIRFQQELSTSKDEAFRILLRLKQM 60

Query: 101 LDSKASEAEMVSLNQKKKIEELEAQLEEAEDIVRELRAQLQDVQDELEHVRNKNVEPLDE 160
           LDSKASEAE VSLNQK+KIEELEAQLEEAEDIVRELRAQLQ VQDELEHVRNKNVEPLDE
Sbjct: 61  LDSKASEAETVSLNQKRKIEELEAQLEEAEDIVRELRAQLQGVQDELEHVRNKNVEPLDE 120

Query: 161 QNLAGNVASRGDLPNSHERIAPCNISSTLNGTCHDDSWPESKNDLQMDNGQVHGDFTSMV 220
           QNLAGN+ASRGDLPNSHERIAPCNISSTLNGTC DDSW ESKN+L MDNGQVHGDFTSMV
Sbjct: 121 QNLAGNIASRGDLPNSHERIAPCNISSTLNGTCRDDSWSESKNELHMDNGQVHGDFTSMV 180

Query: 221 MRNKEPELYRNGCTQRVRAFERKLFDGKVCSTGQAEDVKNRVCNMGEEEGKLMCKTNITK 280
           MRNKEPELYRNGCTQRVRAFERKLFDGKVCST QAEDVKNRVCNMGEEEGKLMCKTNITK
Sbjct: 181 MRNKEPELYRNGCTQRVRAFERKLFDGKVCSTEQAEDVKNRVCNMGEEEGKLMCKTNITK 240

Query: 281 ADNICGERKNSSEIKTLPKLLSRDTQVPIIKSLRRKRKRATRYNKKKTLPIFYDIAKQCN 340
           ADNI GERKNS+EIKTLPKLLSRDTQVPIIKSLRRKRKRATRYNKKKTLPIFYDIAKQCN
Sbjct: 241 ADNIYGERKNSNEIKTLPKLLSRDTQVPIIKSLRRKRKRATRYNKKKTLPIFYDIAKQCN 300

Query: 341 SPDLHCSESPSVDNDDTGKCLSQNEIDSQNGLILLSTPVLSEINEMSTPSGCPDDSEGGD 400
           SPDLHCSESPSVDNDDTGKCLSQNEIDSQNGLILL+TPVLSEINEMSTPSGCPDDSEGGD
Sbjct: 301 SPDLHCSESPSVDNDDTGKCLSQNEIDSQNGLILLATPVLSEINEMSTPSGCPDDSEGGD 360

Query: 401 AVINDCPIRNVTDYDTAAVGKSDFTSQESLCEENLEASAYKLDVDPVKESSVKSDVKDSD 460
           AVINDCPIRNVTDYDTAAVGKS+FTSQESLC ENLEASAYKLDVDPVKESSVKSDVKDSD
Sbjct: 361 AVINDCPIRNVTDYDTAAVGKSNFTSQESLCGENLEASAYKLDVDPVKESSVKSDVKDSD 420

Query: 461 VVDEIPSQHSNNKVLMYTFRRKRKKESLSSPDGKSSIDESISKKRMRDKQSVSSEADKFS 520
           VVDEIPSQHSNNKVLMYTFRRKRKKESLSSPDGKSSIDESISKKRM+DKQSVSSE+DKFS
Sbjct: 421 VVDEIPSQHSNNKVLMYTFRRKRKKESLSSPDGKSSIDESISKKRMKDKQSVSSESDKFS 480

Query: 521 LMTESSRDNRRLAQVARQLISLSEKKWR 548
           LMTESSRDNRRLAQVARQLISLSEKKWR
Sbjct: 481 LMTESSRDNRRLAQVARQLISLSEKKWR 508

BLAST of Cp4.1LG20g02090 vs. NCBI nr
Match: XP_023000848.1 (uncharacterized protein LOC111495171 [Cucurbita maxima])

HSP 1 Score: 935 bits (2416), Expect = 0.0
Identity = 490/508 (96.46%), Postives = 500/508 (98.43%), Query Frame = 0

Query: 41  MEDSEKLTALKKAYADIILNTAKEAAARIMVSERNAIRFQQELSTSKDEAFRILLRLKQM 100
           MEDSEKLTALKKAYADIILNTAKEAAARIMVSERNAIRFQQELST+KDEAFRILLRLKQM
Sbjct: 1   MEDSEKLTALKKAYADIILNTAKEAAARIMVSERNAIRFQQELSTTKDEAFRILLRLKQM 60

Query: 101 LDSKASEAEMVSLNQKKKIEELEAQLEEAEDIVRELRAQLQDVQDELEHVRNKNVEPLDE 160
           LDSKASEAEMVSLNQKKKIEELEAQLEEAEDIVRELRAQLQDVQDELEHVRNKNVEPLDE
Sbjct: 61  LDSKASEAEMVSLNQKKKIEELEAQLEEAEDIVRELRAQLQDVQDELEHVRNKNVEPLDE 120

Query: 161 QNLAGNVASRGDLPNSHERIAPCNISSTLNGTCHDDSWPESKNDLQMDNGQVHGDFTSMV 220
           QNLAGNVA+RGDLPNSHERIAPCNISSTLNGTCHDDSWPESKNDLQMDNGQVHGDFTSMV
Sbjct: 121 QNLAGNVATRGDLPNSHERIAPCNISSTLNGTCHDDSWPESKNDLQMDNGQVHGDFTSMV 180

Query: 221 MRNKEPELYRNGCTQRVRAFERKLFDGKVCSTGQAEDVKNRVCNMGEEEGKLMCKTNITK 280
           MR+KEPELYRNGCTQRVRAFERKLFDGKVCSTGQAEDVKNRVCNMGEEEGKLMCKTNITK
Sbjct: 181 MRSKEPELYRNGCTQRVRAFERKLFDGKVCSTGQAEDVKNRVCNMGEEEGKLMCKTNITK 240

Query: 281 ADNICGERKNSSEIKTLPKLLSRDTQVPIIKSLRRKRKRATRYNKKKTLPIFYDIAKQCN 340
           ADN+CGERKNS+EIKTLP LLSRDTQVPIIKSLRRKRKRATRYNKKKTLPIF DIAKQCN
Sbjct: 241 ADNVCGERKNSNEIKTLPNLLSRDTQVPIIKSLRRKRKRATRYNKKKTLPIFDDIAKQCN 300

Query: 341 SPDLHCSESPSVDNDDTGKCLSQNEIDSQNGLILLSTPVLSEINEMSTPSGCPDDSEGGD 400
           SPDLHCSESPSV ND TGKCLSQNEIDSQ+GLILLSTPVLSEINEMSTPS CPDDSEGGD
Sbjct: 301 SPDLHCSESPSVYNDGTGKCLSQNEIDSQDGLILLSTPVLSEINEMSTPSSCPDDSEGGD 360

Query: 401 AVINDCPIRNVTDYDTAAVGKSDFTSQESLCEENLEASAYKLDVDPVKESSVKSDVKDSD 460
           AVINDCPIRNVTDYDTAAVGKSDFTSQESLC ENLEASAYK++VDPVKESSVKSDVKDSD
Sbjct: 361 AVINDCPIRNVTDYDTAAVGKSDFTSQESLCGENLEASAYKVEVDPVKESSVKSDVKDSD 420

Query: 461 VVDEIPSQHSNNKVLMYTFRRKRKKESLSSPDGKSSIDESISKKRMRDKQSVSSEADKFS 520
           VVDEIPSQHSNNKVLMYTFRRKRKKESLSSPDGKS IDESISKKRM+DKQS SSE+DKFS
Sbjct: 421 VVDEIPSQHSNNKVLMYTFRRKRKKESLSSPDGKSPIDESISKKRMKDKQSASSESDKFS 480

Query: 521 LMTESSRDNRRLAQVARQLISLSEKKWR 548
           LMTESSRDNRRLAQVARQLISLSEKKWR
Sbjct: 481 LMTESSRDNRRLAQVARQLISLSEKKWR 508

BLAST of Cp4.1LG20g02090 vs. ExPASy TrEMBL
Match: A0A6J1EE41 (uncharacterized protein LOC111431711 OS=Cucurbita moschata OX=3662 GN=LOC111431711 PE=4 SV=1)

HSP 1 Score: 1013 bits (2619), Expect = 0.0
Identity = 532/548 (97.08%), Postives = 541/548 (98.72%), Query Frame = 0

Query: 1   MLKQGGGETEQTSPLILSFFFFLHNSVQKPAKKGLHTSLIMEDSEKLTALKKAYADIILN 60
           MLKQGG ETEQTSPLILS FFFLHNSVQKPAKKGLHTSLIMEDSEKLTALKKAYADIILN
Sbjct: 1   MLKQGG-ETEQTSPLILSLFFFLHNSVQKPAKKGLHTSLIMEDSEKLTALKKAYADIILN 60

Query: 61  TAKEAAARIMVSERNAIRFQQELSTSKDEAFRILLRLKQMLDSKASEAEMVSLNQKKKIE 120
           TAKEAAARIMVSERNAIRFQQELSTSKDEAFRILLRLKQMLDSKASEAE VSLNQK+KIE
Sbjct: 61  TAKEAAARIMVSERNAIRFQQELSTSKDEAFRILLRLKQMLDSKASEAETVSLNQKRKIE 120

Query: 121 ELEAQLEEAEDIVRELRAQLQDVQDELEHVRNKNVEPLDEQNLAGNVASRGDLPNSHERI 180
           ELEAQLEEAEDIVRELRAQLQDVQDELEHVRNKNVEPLDEQNLAGN+ASRGDLPNSHERI
Sbjct: 121 ELEAQLEEAEDIVRELRAQLQDVQDELEHVRNKNVEPLDEQNLAGNIASRGDLPNSHERI 180

Query: 181 APCNISSTLNGTCHDDSWPESKNDLQMDNGQVHGDFTSMVMRNKEPELYRNGCTQRVRAF 240
           APCNISSTLNGTC DDSWPESKN+L MDNGQVHGDFTSMVMRNKEPELYRNGCTQRVRAF
Sbjct: 181 APCNISSTLNGTCRDDSWPESKNELHMDNGQVHGDFTSMVMRNKEPELYRNGCTQRVRAF 240

Query: 241 ERKLFDGKVCSTGQAEDVKNRVCNMGEEEGKLMCKTNITKADNICGERKNSSEIKTLPKL 300
           ERKLFDGKVCSTGQAEDVKNRVCNMGEEEGKLMCKTNITKADNICGERKNS+EIKTLPKL
Sbjct: 241 ERKLFDGKVCSTGQAEDVKNRVCNMGEEEGKLMCKTNITKADNICGERKNSNEIKTLPKL 300

Query: 301 LSRDTQVPIIKSLRRKRKRATRYNKKKTLPIFYDIAKQCNSPDLHCSESPSVDNDDTGKC 360
           LSRDTQVPIIKSLRRKRKRATRYNKKKTLPIF DIAKQCNSPDLHCSESPSVDNDDTGKC
Sbjct: 301 LSRDTQVPIIKSLRRKRKRATRYNKKKTLPIFDDIAKQCNSPDLHCSESPSVDNDDTGKC 360

Query: 361 LSQNEIDSQNGLILLSTPVLSEINEMSTPSGCPDDSEGGDAVINDCPIRNVTDYDTAAVG 420
           LSQNEIDSQNGLILL+TPVLSEINEMSTPSGCPDDSEGGDAVIN+CPIRNVTDYDTAAVG
Sbjct: 361 LSQNEIDSQNGLILLATPVLSEINEMSTPSGCPDDSEGGDAVINNCPIRNVTDYDTAAVG 420

Query: 421 KSDFTSQESLCEENLEASAYKLDVDPVKESSVKSDVKDSDVVDEIPSQHSNNKVLMYTFR 480
           KS+FTSQESLC ENLEASAYKLDVDPVKESSVKSDVKDSDVVDEIPSQHSNNKVLMYTFR
Sbjct: 421 KSNFTSQESLCGENLEASAYKLDVDPVKESSVKSDVKDSDVVDEIPSQHSNNKVLMYTFR 480

Query: 481 RKRKKESLSSPDGKSSIDESISKKRMRDKQSVSSEADKFSLMTESSRDNRRLAQVARQLI 540
           RKRKKESLSSPDGKSSIDESISKKRM+DKQSVSSE+DKFSLMTESSRDNRRLAQVARQLI
Sbjct: 481 RKRKKESLSSPDGKSSIDESISKKRMKDKQSVSSESDKFSLMTESSRDNRRLAQVARQLI 540

Query: 541 SLSEKKWR 548
           SLSEKKWR
Sbjct: 541 SLSEKKWR 547

BLAST of Cp4.1LG20g02090 vs. ExPASy TrEMBL
Match: A0A6J1KNU6 (uncharacterized protein LOC111495171 OS=Cucurbita maxima OX=3661 GN=LOC111495171 PE=4 SV=1)

HSP 1 Score: 935 bits (2416), Expect = 0.0
Identity = 490/508 (96.46%), Postives = 500/508 (98.43%), Query Frame = 0

Query: 41  MEDSEKLTALKKAYADIILNTAKEAAARIMVSERNAIRFQQELSTSKDEAFRILLRLKQM 100
           MEDSEKLTALKKAYADIILNTAKEAAARIMVSERNAIRFQQELST+KDEAFRILLRLKQM
Sbjct: 1   MEDSEKLTALKKAYADIILNTAKEAAARIMVSERNAIRFQQELSTTKDEAFRILLRLKQM 60

Query: 101 LDSKASEAEMVSLNQKKKIEELEAQLEEAEDIVRELRAQLQDVQDELEHVRNKNVEPLDE 160
           LDSKASEAEMVSLNQKKKIEELEAQLEEAEDIVRELRAQLQDVQDELEHVRNKNVEPLDE
Sbjct: 61  LDSKASEAEMVSLNQKKKIEELEAQLEEAEDIVRELRAQLQDVQDELEHVRNKNVEPLDE 120

Query: 161 QNLAGNVASRGDLPNSHERIAPCNISSTLNGTCHDDSWPESKNDLQMDNGQVHGDFTSMV 220
           QNLAGNVA+RGDLPNSHERIAPCNISSTLNGTCHDDSWPESKNDLQMDNGQVHGDFTSMV
Sbjct: 121 QNLAGNVATRGDLPNSHERIAPCNISSTLNGTCHDDSWPESKNDLQMDNGQVHGDFTSMV 180

Query: 221 MRNKEPELYRNGCTQRVRAFERKLFDGKVCSTGQAEDVKNRVCNMGEEEGKLMCKTNITK 280
           MR+KEPELYRNGCTQRVRAFERKLFDGKVCSTGQAEDVKNRVCNMGEEEGKLMCKTNITK
Sbjct: 181 MRSKEPELYRNGCTQRVRAFERKLFDGKVCSTGQAEDVKNRVCNMGEEEGKLMCKTNITK 240

Query: 281 ADNICGERKNSSEIKTLPKLLSRDTQVPIIKSLRRKRKRATRYNKKKTLPIFYDIAKQCN 340
           ADN+CGERKNS+EIKTLP LLSRDTQVPIIKSLRRKRKRATRYNKKKTLPIF DIAKQCN
Sbjct: 241 ADNVCGERKNSNEIKTLPNLLSRDTQVPIIKSLRRKRKRATRYNKKKTLPIFDDIAKQCN 300

Query: 341 SPDLHCSESPSVDNDDTGKCLSQNEIDSQNGLILLSTPVLSEINEMSTPSGCPDDSEGGD 400
           SPDLHCSESPSV ND TGKCLSQNEIDSQ+GLILLSTPVLSEINEMSTPS CPDDSEGGD
Sbjct: 301 SPDLHCSESPSVYNDGTGKCLSQNEIDSQDGLILLSTPVLSEINEMSTPSSCPDDSEGGD 360

Query: 401 AVINDCPIRNVTDYDTAAVGKSDFTSQESLCEENLEASAYKLDVDPVKESSVKSDVKDSD 460
           AVINDCPIRNVTDYDTAAVGKSDFTSQESLC ENLEASAYK++VDPVKESSVKSDVKDSD
Sbjct: 361 AVINDCPIRNVTDYDTAAVGKSDFTSQESLCGENLEASAYKVEVDPVKESSVKSDVKDSD 420

Query: 461 VVDEIPSQHSNNKVLMYTFRRKRKKESLSSPDGKSSIDESISKKRMRDKQSVSSEADKFS 520
           VVDEIPSQHSNNKVLMYTFRRKRKKESLSSPDGKS IDESISKKRM+DKQS SSE+DKFS
Sbjct: 421 VVDEIPSQHSNNKVLMYTFRRKRKKESLSSPDGKSPIDESISKKRMKDKQSASSESDKFS 480

Query: 521 LMTESSRDNRRLAQVARQLISLSEKKWR 548
           LMTESSRDNRRLAQVARQLISLSEKKWR
Sbjct: 481 LMTESSRDNRRLAQVARQLISLSEKKWR 508

BLAST of Cp4.1LG20g02090 vs. ExPASy TrEMBL
Match: A0A0A0LU15 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G045750 PE=4 SV=1)

HSP 1 Score: 814 bits (2102), Expect = 1.49e-292
Identity = 444/547 (81.17%), Postives = 475/547 (86.84%), Query Frame = 0

Query: 4   QGGGETEQTSPLILSFF--FFLHNSVQKPAKKGLHTSLIMEDSEKLTALKKAYADIILNT 63
           +GG  T        SF   F  HN  Q   K+ LH SLIMEDSEKLTALKKAYADIILNT
Sbjct: 30  RGGNRTNLPPYSSFSFLSSFLPHNLDQD--KEILHASLIMEDSEKLTALKKAYADIILNT 89

Query: 64  AKEAAARIMVSERNAIRFQQELSTSKDEAFRILLRLKQMLDSKASEAEMVSLNQKKKIEE 123
           AKEAAARIMVSERNAIR QQELST+KDEAFRILLRLKQMLDSK SEAE+VSLNQKKKIEE
Sbjct: 90  AKEAAARIMVSERNAIRCQQELSTTKDEAFRILLRLKQMLDSKVSEAEIVSLNQKKKIEE 149

Query: 124 LEAQLEEAEDIVRELRAQLQDVQDELEHVRNKNVEPLDEQNLAGNVASRGDLPNSHERIA 183
           LEAQLEEAEDIVRELR QLQ+VQDELEHVRNKNVEP D+QNLA N+ S    PNSHE+IA
Sbjct: 150 LEAQLEEAEDIVRELRVQLQEVQDELEHVRNKNVEPQDKQNLANNIVSPEAFPNSHEKIA 209

Query: 184 PCNISSTLNGTCHDDSWPESKNDLQMDNGQVHGDFTSMVMRNKEPELYRNGCTQRVRAFE 243
           P +ISSTLNGTC D SWPESKND QMD GQVH DF SMVMR+KEPELYRNGCTQRVRAFE
Sbjct: 210 PYDISSTLNGTCLD-SWPESKNDSQMDKGQVHRDFASMVMRSKEPELYRNGCTQRVRAFE 269

Query: 244 RKLFDGKVCSTGQAEDVKNRVCNMGEEEGKLMCKTNITKADNICGERKNSSEIKTLPKLL 303
           RK FDGKVC TGQAEDVKN+VCNM EEEGKLM KTN TK DNI GERKNS+EIK LPKLL
Sbjct: 270 RKSFDGKVCVTGQAEDVKNKVCNMDEEEGKLMRKTNTTKVDNISGERKNSNEIKALPKLL 329

Query: 304 SRDTQVPIIKSLRRKRKRATRYNKKKTLPIFYDIAKQCNSPDLHCSESPSVDNDDTGKCL 363
           SRDTQVPI+KSLRRKRKRATRYNKKK L +  D   QC SPDLHCSES SVDNDD G  L
Sbjct: 330 SRDTQVPILKSLRRKRKRATRYNKKKVLTVLDDTPNQCKSPDLHCSESLSVDNDDAGNFL 389

Query: 364 SQNEIDSQNGLILLSTPVLSEINEMSTPSGCPDDSEGGDAVINDCPIRNVTDYDTAAVGK 423
           S+ EIDSQNGLILLSTP+LSEINE+ TPSGCPD SEG  AVINDCP+RN+TD+DTA VGK
Sbjct: 390 SKKEIDSQNGLILLSTPLLSEINEIPTPSGCPDASEGDGAVINDCPLRNMTDHDTAVVGK 449

Query: 424 SDFTSQESLCEENLEASAYKLDVDPVKESSVKSDVKDSDVVDEIPSQHSNNKVLMYTFRR 483
           SDF SQESLC ENLEAS  K+D+DPVKESS++ D+K+SDV+DEIPSQ SNNKVL YTF+R
Sbjct: 450 SDFGSQESLCGENLEASTDKVDLDPVKESSIQLDMKNSDVIDEIPSQQSNNKVLKYTFQR 509

Query: 484 KRKKESLSSPDGKSSIDESISKKRMRDKQSVSSEADKFSLMTESSRDNRRLAQVARQLIS 543
           KRKKESLSSPDGKSS+DESISKKRM+DKQSVSSE+DKFSLMTESSRDNRRLAQVARQLIS
Sbjct: 510 KRKKESLSSPDGKSSVDESISKKRMKDKQSVSSESDKFSLMTESSRDNRRLAQVARQLIS 569

Query: 544 LSEKKWR 548
           LSEKKWR
Sbjct: 570 LSEKKWR 573

BLAST of Cp4.1LG20g02090 vs. ExPASy TrEMBL
Match: A0A1S3AUP8 (uncharacterized protein LOC103483170 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103483170 PE=4 SV=1)

HSP 1 Score: 807 bits (2084), Expect = 7.05e-291
Identity = 432/508 (85.04%), Postives = 461/508 (90.75%), Query Frame = 0

Query: 41  MEDSEKLTALKKAYADIILNTAKEAAARIMVSERNAIRFQQELSTSKDEAFRILLRLKQM 100
           MEDSEKLTALKKAYADIILNTAKEAAARIMVSERNAIR QQELST+KDEAFRILLRLKQM
Sbjct: 1   MEDSEKLTALKKAYADIILNTAKEAAARIMVSERNAIRCQQELSTTKDEAFRILLRLKQM 60

Query: 101 LDSKASEAEMVSLNQKKKIEELEAQLEEAEDIVRELRAQLQDVQDELEHVRNKNVEPLDE 160
           LDSK SEAEMVSLNQKKKIEELEAQLEEAEDIVRELR QLQ+VQDELEHVRNKNVEP D+
Sbjct: 61  LDSKVSEAEMVSLNQKKKIEELEAQLEEAEDIVRELRVQLQEVQDELEHVRNKNVEPQDK 120

Query: 161 QNLAGNVASRGDLPNSHERIAPCNISSTLNGTCHDDSWPESKNDLQMDNGQVHGDFTSMV 220
           QNLA N+ASR D PNSHE+IAP +ISSTLNGTC D SWPESKND Q D  QVH DF SMV
Sbjct: 121 QNLASNIASREDFPNSHEKIAPYDISSTLNGTCLD-SWPESKNDSQTDKAQVHRDFASMV 180

Query: 221 MRNKEPELYRNGCTQRVRAFERKLFDGKVCSTGQAEDVKNRVCNMGEEEGKLMCKTNITK 280
           MR+KEPELYRNGCTQRVRAFERK FDG+VC TGQAEDVK++VCNMGEEEGKLM KTN TK
Sbjct: 181 MRSKEPELYRNGCTQRVRAFERKSFDGEVCLTGQAEDVKSKVCNMGEEEGKLMRKTNTTK 240

Query: 281 ADNICGERKNSSEIKTLPKLLSRDTQVPIIKSLRRKRKRATRYNKKKTLPIFYDIAKQCN 340
           ADNI GERKN +EIK LPKLLS DTQVPI+KSLRRKRKRATRYNKKK L +  DI KQC 
Sbjct: 241 ADNISGERKNFNEIKALPKLLSGDTQVPILKSLRRKRKRATRYNKKKALTVLDDIPKQCK 300

Query: 341 SPDLHCSESPSVDNDDTGKCLSQNEIDSQNGLILLSTPVLSEINEMSTPSGCPDDSEGGD 400
           SPDLHCSES SVDNDD G  LS+ EIDSQNGLILLSTP+LSEINEM TPSGCPD SEG  
Sbjct: 301 SPDLHCSESLSVDNDDAGNFLSKKEIDSQNGLILLSTPLLSEINEMPTPSGCPDASEGDG 360

Query: 401 AVINDCPIRNVTDYDTAAVGKSDFTSQESLCEENLEASAYKLDVDPVKESSVKSDVKDSD 460
           AVINDCP+RN+TD+DTA VGKSDF SQESLC ENLEAS  K+D+DPVKESS++ D+K+SD
Sbjct: 361 AVINDCPLRNMTDHDTAVVGKSDFASQESLCGENLEASTDKVDLDPVKESSIQLDMKNSD 420

Query: 461 VVDEIPSQHSNNKVLMYTFRRKRKKESLSSPDGKSSIDESISKKRMRDKQSVSSEADKFS 520
           V+DEIPSQ SNNKVL YTF+RKRKKESLSSPDGKSS+DESISKKRM+DKQSVSSE+DKFS
Sbjct: 421 VIDEIPSQQSNNKVLKYTFQRKRKKESLSSPDGKSSVDESISKKRMKDKQSVSSESDKFS 480

Query: 521 LMTESSRDNRRLAQVARQLISLSEKKWR 548
           LMTESSRDNRRLAQVARQLISLSEKKWR
Sbjct: 481 LMTESSRDNRRLAQVARQLISLSEKKWR 507

BLAST of Cp4.1LG20g02090 vs. ExPASy TrEMBL
Match: A0A6J1C6Z0 (uncharacterized protein LOC111008977 OS=Momordica charantia OX=3673 GN=LOC111008977 PE=4 SV=1)

HSP 1 Score: 798 bits (2060), Expect = 3.17e-287
Identity = 424/508 (83.46%), Postives = 456/508 (89.76%), Query Frame = 0

Query: 41  MEDSEKLTALKKAYADIILNTAKEAAARIMVSERNAIRFQQELSTSKDEAFRILLRLKQM 100
           M+DSEKL ALKKAYADIILNTAKEAAARIMVSERNAIRFQQELS +KDEAFRILLRLKQM
Sbjct: 1   MDDSEKLMALKKAYADIILNTAKEAAARIMVSERNAIRFQQELSATKDEAFRILLRLKQM 60

Query: 101 LDSKASEAEMVSLNQKKKIEELEAQLEEAEDIVRELRAQLQDVQDELEHVRNKNVEPLDE 160
           LDSK SEAEMVSLNQKKKIEELEAQLEEAEDIV +LRAQLQ+VQDELEHVRNK VE LD+
Sbjct: 61  LDSKVSEAEMVSLNQKKKIEELEAQLEEAEDIVSKLRAQLQEVQDELEHVRNKKVEALDK 120

Query: 161 QNLAGNVASRGDLPNSHERIAPCNISSTLNGTCHDDSWPESKNDLQMDNGQVHGDFTSMV 220
            NLAGNVASR DLP+SHE IAP NIS+TLNGTC D SWPESKNDL++DNGQVH DF SMV
Sbjct: 121 HNLAGNVASREDLPSSHECIAPYNISATLNGTCPD-SWPESKNDLKIDNGQVHRDFASMV 180

Query: 221 MRNKEPELYRNGCTQRVRAFERKLFDGKVCSTGQAEDVKNRVCNMGEEEGKLMCKTNITK 280
           MR+KEPELYRNGCTQRVRAFERKLFDGKVC TGQAEDVK+R  +MGEEEGKLMCKT+ITK
Sbjct: 181 MRSKEPELYRNGCTQRVRAFERKLFDGKVCLTGQAEDVKSRAFDMGEEEGKLMCKTSITK 240

Query: 281 ADNICGERKNSSEIKTLPKLLSRDTQVPIIKSLRRKRKRATRYNKKKTLPIFYDIAKQCN 340
            DN CGERKNS+E K LPKLLSRD QVPIIKSLRRKRKRATRYNK+K LP+  D  KQC 
Sbjct: 241 TDNTCGERKNSNETKELPKLLSRDIQVPIIKSLRRKRKRATRYNKRKALPVLDDFTKQCE 300

Query: 341 SPDLHCSESPSVDNDDTGKCLSQNEIDSQNGLILLSTPVLSEINEMSTPSGCPDDSEGGD 400
           SPDLHCSES SVDNDD+ KCLS+ EIDSQ+GL+LLS PVLSEINEMSTPSGCPD SEG  
Sbjct: 301 SPDLHCSESLSVDNDDSEKCLSKKEIDSQDGLVLLSNPVLSEINEMSTPSGCPDVSEGDG 360

Query: 401 AVINDCPIRNVTDYDTAAVGKSDFTSQESLCEENLEASAYKLDVDPVKESSVKSDVKDSD 460
           AVINDC +RN+ DYD A V KSDFT QESLC +NLE S YK+D DPVKESSV  D+K+SD
Sbjct: 361 AVINDCSLRNMKDYDKAVVNKSDFTGQESLCVDNLEGSPYKIDADPVKESSVNLDMKNSD 420

Query: 461 VVDEIPSQHSNNKVLMYTFRRKRKKESLSSPDGKSSIDESISKKRMRDKQSVSSEADKFS 520
            +DE+PSQH NNKVL YTF+RKRKKESLSSPDGKSS+DESISKKRM+DKQ VS E+DKFS
Sbjct: 421 ALDEVPSQHLNNKVLKYTFQRKRKKESLSSPDGKSSVDESISKKRMKDKQIVSLESDKFS 480

Query: 521 LMTESSRDNRRLAQVARQLISLSEKKWR 548
           LMTESSRDNRRLAQVARQLISLSEKKWR
Sbjct: 481 LMTESSRDNRRLAQVARQLISLSEKKWR 507

BLAST of Cp4.1LG20g02090 vs. TAIR 10
Match: AT1G74860.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G19010.1); Has 210 Blast hits to 193 proteins in 61 species: Archae - 0; Bacteria - 9; Metazoa - 75; Fungi - 18; Plants - 58; Viruses - 1; Other Eukaryotes - 49 (source: NCBI BLink). )

HSP 1 Score: 126.7 bits (317), Expect = 5.7e-29
Identity = 156/515 (30.29%), Postives = 231/515 (44.85%), Query Frame = 0

Query: 41  MEDSEKLTALKKAYADIILNTAKEAAARIMVSERNAIRFQQELSTSKDEAFRILLRLKQM 100
           M D E L ALK+AYAD ILNT KEAAAR+MVSE+ A R+QQEL T ++EA   L+RLKQM
Sbjct: 1   MADPETLAALKRAYADTILNTTKEAAARVMVSEKKARRYQQELVTVRNEALHTLVRLKQM 60

Query: 101 LDSKASEAEMVSLNQKKKIEELEAQLEEAEDIVRELRAQLQDVQDELEHVRN--KNVEPL 160
           LDSK  E EM SL Q++K+EELEAQL EAEDIV ELR +L+ + DEL+ + +  K+++  
Sbjct: 61  LDSKVKETEMQSLKQQQKVEELEAQLGEAEDIVGELRLELRVLHDELKKLTDGQKHLKKN 120

Query: 161 DEQNLAGN-----VASRGDLPNSHER---IAPC-----NISSTLNG----------TCHD 220
            E+NL  N     V+   ++  SHE    +  C     N S   NG          + + 
Sbjct: 121 HEENLCWNNRDAAVSVMPEVSCSHENTEAVGFCIPVEQNGSVVANGIKVPSLTRINSINR 180

Query: 221 DSWPESKNDLQMDNGQVHGDFTSMVMRNKEPELYRNGCTQRVRAFERKLFDGKVCSTGQA 280
            S+ ++K+       Q H    S++ + +E E    G  Q +   +  + +G + S+ + 
Sbjct: 181 CSYKDNKD-------QCHYTLPSILTKRRETE----GLAQMIHTVDSSMANGVLSSSVEV 240

Query: 281 EDVKNRVCNMGEEEGKLMCKTNITKADNICGERKNSSEIKTLPKLLSRDTQVPIIKSLRR 340
            DV + VC + E      CK  I ++  + G    +  I ++     +D + PI+     
Sbjct: 241 GDVNDGVC-LHEVSS---CK--IVESLEMSGCADATDSISSV-----KDGEAPIV----- 300

Query: 341 KRKRATRYNKKKTLPIFYDIAKQCNSPDLHCSESPSVDNDDTGKCLSQNEIDSQNGLILL 400
                                            SP+    D G  +S           L 
Sbjct: 301 ---------------------------------SPNSSQKDVGTLIS-----------LK 360

Query: 401 STPVLSEINEMSTPSGCPDDSEGGDAVINDCPIRNVTDYDTAAVGKSDFTSQESLCEENL 460
           ++P     N+                                   K + +  E+  EE  
Sbjct: 361 TSPPREHENDR----------------------------------KLEISETEARKEE-- 396

Query: 461 EASAYKLDVDPVKESSVKSDVKDSDVVDEIP-SQHSNNKVLMYTFRRKRKKESLSSPDGK 520
                       KES    +V  S + +E P    S N+ + YTF+RKRKKE LS+ +G 
Sbjct: 421 ------------KESCENMEVSASPLCEETPVLALSKNRCIKYTFKRKRKKEVLSNLEGD 396

Query: 521 SSIDESIS-KKRMRDKQSVSSEADKFSLMTESSRD 529
           SS +ES + K++  +K     E+ K S  +ESSRD
Sbjct: 481 SSFEESRNMKQKTVEKDDGYLESLKPSFTSESSRD 396

BLAST of Cp4.1LG20g02090 vs. TAIR 10
Match: AT1G19010.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G74860.1); Has 337 Blast hits to 320 proteins in 97 species: Archae - 0; Bacteria - 14; Metazoa - 153; Fungi - 26; Plants - 76; Viruses - 0; Other Eukaryotes - 68 (source: NCBI BLink). )

HSP 1 Score: 124.0 bits (310), Expect = 3.7e-28
Identity = 79/142 (55.63%), Postives = 103/142 (72.54%), Query Frame = 0

Query: 41  MEDSEKLTALKKAYADIILNTAKEAAARIMVSERNAIRFQQELSTSKDEAFRILLRLKQM 100
           M D EKLTALKKAYA+ ILNTAKEAAAR+M++ER A  +QQEL++ +DEA R  LRLKQ+
Sbjct: 1   MTDPEKLTALKKAYAETILNTAKEAAARVMITERKARGYQQELASVRDEALRACLRLKQI 60

Query: 101 LDSKASEAEMVSLNQKKKIEELEAQLEEAEDIVRELRAQLQDVQDELEHVRNKNVEPLDE 160
            DSK  EAEM+SL +++KIEELEAQL EAEDIV ELR +L++ +  LE + N     L +
Sbjct: 61  YDSKVKEAEMISLQKQQKIEELEAQLGEAEDIVGELRTELRESRYLLEKLANGCQTNLSK 120

Query: 161 QNLAGNVASRGDL---PNSHER 180
           +  A N A   ++    ++HER
Sbjct: 121 EEKAPNEAVSLEVREDSSNHER 142

BLAST of Cp4.1LG20g02090 vs. TAIR 10
Match: AT1G19010.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G74860.1); Has 238 Blast hits to 225 proteins in 78 species: Archae - 0; Bacteria - 7; Metazoa - 99; Fungi - 18; Plants - 69; Viruses - 0; Other Eukaryotes - 45 (source: NCBI BLink). )

HSP 1 Score: 82.0 bits (201), Expect = 1.6e-15
Identity = 55/113 (48.67%), Postives = 77/113 (68.14%), Query Frame = 0

Query: 70  MVSERNAIRFQQELSTSKDEAFRILLRLKQMLDSKASEAEMVSLNQKKKIEELEAQLEEA 129
           M++ER A  +QQEL++ +DEA R  LRLKQ+ DSK  EAEM+SL +++KIEELEAQL EA
Sbjct: 1   MITERKARGYQQELASVRDEALRACLRLKQIYDSKVKEAEMISLQKQQKIEELEAQLGEA 60

Query: 130 EDIVRELRAQLQDVQDELEHVRNKNVEPLDEQNLAGNVASRGDL---PNSHER 180
           EDIV ELR +L++ +  LE + N     L ++  A N A   ++    ++HER
Sbjct: 61  EDIVGELRTELRESRYLLEKLANGCQTNLSKEEKAPNEAVSLEVREDSSNHER 113

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_022924190.10.097.08uncharacterized protein LOC111431711 [Cucurbita moschata][more]
KAG6584245.10.096.90hypothetical protein SDJN03_20177, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_023520373.10.0100.00uncharacterized protein LOC111783687 [Cucurbita pepo subsp. pepo][more]
KAG7019841.10.096.85hypothetical protein SDJN02_18805 [Cucurbita argyrosperma subsp. argyrosperma][more]
XP_023000848.10.096.46uncharacterized protein LOC111495171 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
A0A6J1EE410.097.08uncharacterized protein LOC111431711 OS=Cucurbita moschata OX=3662 GN=LOC1114317... [more]
A0A6J1KNU60.096.46uncharacterized protein LOC111495171 OS=Cucurbita maxima OX=3661 GN=LOC111495171... [more]
A0A0A0LU151.49e-29281.17Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G045750 PE=4 SV=1[more]
A0A1S3AUP87.05e-29185.04uncharacterized protein LOC103483170 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A6J1C6Z03.17e-28783.46uncharacterized protein LOC111008977 OS=Momordica charantia OX=3673 GN=LOC111008... [more]
Match NameE-valueIdentityDescription
AT1G74860.15.7e-2930.29unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G19010.13.7e-2855.63unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G19010.21.6e-1548.67unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 105..153
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 485..527
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 446..464
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 485..514
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 446..467
NoneNo IPR availablePANTHERPTHR34778OS02G0580700 PROTEINcoord: 41..548
NoneNo IPR availablePANTHERPTHR34778:SF1SUBFAMILY NOT NAMEDcoord: 41..548

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG20g02090.1Cp4.1LG20g02090.1mRNA