Cp4.1LG18g02610 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG18g02610
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionUnknown protein
LocationCp4.1LG18: 4276343 .. 4280093 (+)
RNA-Seq ExpressionCp4.1LG18g02610
SyntenyCp4.1LG18g02610
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TAAAGGAAAACGTGAGATCCTACATCGATTGGAGAGAAGAACGAGTGCCAGCAATGACGATGTACCTTAAACAGAGGTGGATTGTGAGAGCCCACATCAGTTAGAAGAAAGATGATGCATTTTTTACAAGGACGTGAAAACTTCTCCCTAGTAGACGCATTTTTAAAACTTTGAAAAAATCTATAAAAAAAGAGCTCAGAGAACAACATCTACCGGCAATGAGCTTGTGTTAGATAGGTGAGCATACCTCTCGCCTAGCTCACTAACCCATCATTTCAAGAATCTTCTCAAACGAATGCATAATATTAGGAGTTATGTGGGCTTATCCTTTAATAATCTCTCGACATAGACCAGGAGACATCTGTCAAGAGAGAATACGTGAGTTAGAAGGCAAGTGTTTTTGATATGTAGAAGAAGTAAGGAATGGGAAGAAAAGTGCTAAATTTAAAGAGAAGCGGTGGAGTAGAAACCCGCCACGTTTAACTTGGTGGCAGGCTTCTGCAGGCATTCCTTTTCCTCTGCACAATCCACCACCGCCTTCCCTTGCATTCTCTTCAACCGCCTATTCTCTCTCCCTCTCTCTCTATCCGCCATGGAAGAAGAGCTTGTTTGATTATGGCCGATGAACCACCTGGGTTCATCAGGTTTTGTTTATTTTTCTTCCTGTCTTAGCTTTAAGGATCTTTGATGTCAAGATTATGGATAAAAGTTTTCTTTGATTTGTCAGGATGGAGGGCTATCGCAGCATAGACTGGAATATGGAAGAACAACTGTCCTCCGGTGATGGCCTGAGCAGTGAAAAGATATGCTCCGCCGTGCAAAAGGGTTGCAGTCTGGGGAAGAAGCTTCTCCTCACTGGTTTAGCCATATCCTCTGTTCCTGTGGTTCTTCCTCCATTGGTTATCATGTCAGCCTTTGGAATTGCAGCCTCAATTCCCTATGGGGTTTTCCTCGCCTCTTATGCTTGTACTGAGACGATCATGAGTGTTTGGCTTCCGATTCCCCCGGCCTTGAAGCTCGACCGCGCTGACGAAGAGATTGTGGAGGAAGACATCTATGAAGATGAAGAAAAACATATGATGGAAACAGCTAAGAGAGGTGGGAATTTGGATGATTTCGATATCGATGTGGTAGTCATTCAGGGGGACGAAGAGGGCGAAACCAATATTGGAAGCAAAGGTCTGGCAGCAATTGAAGTGACCAATGTGGAATTTGAAGGAAATGGAGATATTGGAGATGAGGAAGAAGAGGAAGAAGAGTTGAAGGAAACTAGAGGTTTACTCGAAAGAATCAGGGATGAGGGGCAAAGAGACAATGGTTTTGTTGATGAAAATGGAGGTGTTGATGATGTTCGAGAGCTCGAGATTTCAATGGAGGACGAGAAACCAAGTGATTCTGTCGAAAAAAGTGTTCTAGGTTTGTTGAATGAAGTTGACTCTGCTGCTGTTTATCCTCATGCAGACTATAGAACTTCTGAAGGTAATTCATTTCTTTTGTTTATTATTCCAAGTTACGAATTTGCAAATAATGTACGATATCTCTTTTCGGGTTGAGGATTTGAAAATCTAACCTTTGTCATCGGAATTAAAGGGGTCGGGTCGGCGAAATCTAGTGATGAAACAAATGCAATAACGACATTGACAAACAAGGCAACCAAGTCTGAAGAAGCTGAGCAACTTCCTAAAGTAACAATGATTGATGTGATAGAATCTGATGAGGGTTTGTCTATATCAGCTATGACTATTGAACACAAAGTTGAGGCAAATTCTCCACATAAAGATCATAGGATGTCTTCCAATGAGGTCAGTACATAATATAAAGACGTCGATTCAATCCGAATACCATGGCCTTGTTTCCTTCGTTCTTCGTTTACCGATCGAGCTAATGAATATGACTAACAAGTACTGAGTTTTAATTGATATCTGTTCTTGCCATCAACTTTGGCATACTCACTTTCTTGGGTTTGGATTGAACGTTTGAAATTCCTGAGTTTTCCCTTTTCTTTCTGCAAGGAACTATCAGGAGAGGTAAAGATAAGAGAAAAGATTGCTTCGATGAAGAAGATCGTAGGATACAAGGCTACCCCCCTCGGAACATACTTAGACGAAGTGAACGCTCTATATGCCTTCATCGGAGTCGAGCCACCTTCCCCGATGAAAGATTCTGCTAATGATGATGATATCAATCTACTTAACCAGAAGTTGCAGTTTCTCATGTCCATAGTAGGGGTCAAGTAGAGGTGTTAAAAGTTTGGCATTGAATAAGTTGCCATCGGTGCACTTTTGTTTGCCAATGGCAATATTTCCTTATGTTTGAGCTATGCACAGTTTGGTTTTGAAATTATGTACCAAATTTTGAACAGTTTTGTCTACTTTTTTCCCTCTTTCTGTTCAGTTCTTTTCATCCACAAGGGTTCTTGAATCACTCATGGGTGTTAGGAAGTGAAAAATCCAAGAGATCTATGAAATTTAAGCTTAAACGATGAAGAGTGATCAGATAAAATTATAGAACATTCAATAATTGAAATGAATATCGAAGATTCAGAGTGATGAAAAAGAACACAAAACTTCAAATCATGAACTATTATTAGTGTTATGAGAGGCACCTGAGTCTTGCAAATGTTAGATAACCAGTGTGACCTCTGGCCTCACTACATGGCCTAACCATGACCGTACTCGAATCACCATTCGTGTTGTTAGCTTGCTTTGATCTCACCCGCTTCTTGCACGGTGGAAGCTCGACAGAACCACCTCCTTCAACCTGGCTGCTCTCTGTTTTACCTTCTCGAACCTCGTAAGTCCGAAGCAATACCTCAAATGTTCTTATATCTACACAAAAGGGACGACAACAACGACGACAATAGTGAATAATACATATCAACATATCTATAGGAATGAAGGTATATAAATGGGTCATTCAGAACAACCTGTAAACTTAGATCGTAGAGTTTCGGCCGACCGTTGAACTTGCTCGATACATGGAGAGAACGAGCATAATACGCCGTCTTGTTTCAGCATTCTCTCAGCTGAAGGAATGGCCAACCAAGGTTGAGGTAGATCCAAGAAAACTGAATCTGCCAATCCAACAAACTCTTCAGGGAAGCCCTGACCTTGAACATCTCTAACTCCCACAGTGACTAAGCTGGTCAAGCCAGTCTTCTCAAAATCCTCTCTGAAAAAAGAGGGAGACAAGCAAATCATCATAGAACACAATTACTCTGCAGGGTATAACTTGACAACATACAACAAATCCCGGTTCCTACCTCGCCGAGGCAGCTCTCTGTTCATGAAAATCAAACGTATAGACGTGTCCTGTAGGCGCTACAGCCCTAGCTAAAGACGTCGTCAAAGATCCACTGCCAGTCCCTGATTCTAGAATCAAACAACCAGGAACTATCTCTAAGAACATAATTACAAAACTAATGTCCGCAATGTATAGAATCTGTGTCCTGTGGCTTAGAACCAAAGTCCATAATTCAGGAGTTGGGGCTAACAAGTAAACAAAGCCACCTTTGTTACTCAACACCTTGGACCCAAATGGCTTTCCAATCCAATCAGAATGTTTAAACATACCAAACCTATTCTGAAATGTTGAATTTTCACATACTTTCACAGCCTTCATGACATCATGTCTCTCATAAACAATGACTAAATCTCCATCCCTTATAGAACGAGTGAAAGATATCTTCTTCAAGTCATCAGTTGGCAACATCCTTGATCTTCAAAAGTTCATAAAGAATCAAGAATACCAG

mRNA sequence

TAAAGGAAAACGTGAGATCCTACATCGATTGGAGAGAAGAACGAGTGCCAGCAATGACGATGTACCTTAAACAGAGGTGGATTGTGAGAGCCCACATCAGTTAGAAGAAAGATGATGCATTTTTTACAAGGACGTGAAAACTTCTCCCTAGTAGACGCATTTTTAAAACTTTGAAAAAATCTATAAAAAAAGAGCTCAGAGAACAACATCTACCGGCAATGAGCTTGTGTTAGATAGGTGAGCATACCTCTCGCCTAGCTCACTAACCCATCATTTCAAGAATCTTCTCAAACGAATGCATAATATTAGGAGTTATGTGGGCTTATCCTTTAATAATCTCTCGACATAGACCAGGAGACATCTGTCAAGAGAGAATACGTGAGTTAGAAGGCAAGTGTTTTTGATATGTAGAAGAAGTAAGGAATGGGAAGAAAAGTGCTAAATTTAAAGAGAAGCGGTGGAGTAGAAACCCGCCACGTTTAACTTGGTGGCAGGCTTCTGCAGGCATTCCTTTTCCTCTGCACAATCCACCACCGCCTTCCCTTGCATTCTCTTCAACCGCCTATTCTCTCTCCCTCTCTCTCTATCCGCCATGGAAGAAGAGCTTGTTTGATTATGGCCGATGAACCACCTGGGTTCATCAGGATGGAGGGCTATCGCAGCATAGACTGGAATATGGAAGAACAACTGTCCTCCGGTGATGGCCTGAGCAGTGAAAAGATATGCTCCGCCGTGCAAAAGGGTTGCAGTCTGGGGAAGAAGCTTCTCCTCACTGGTTTAGCCATATCCTCTGTTCCTGTGGTTCTTCCTCCATTGGTTATCATGTCAGCCTTTGGAATTGCAGCCTCAATTCCCTATGGGGTTTTCCTCGCCTCTTATGCTTGTACTGAGACGATCATGAGTGTTTGGCTTCCGATTCCCCCGGCCTTGAAGCTCGACCGCGCTGACGAAGAGATTGTGGAGGAAGACATCTATGAAGATGAAGAAAAACATATGATGGAAACAGCTAAGAGAGGTGGGAATTTGGATGATTTCGATATCGATGTGGTAGTCATTCAGGGGGACGAAGAGGGCGAAACCAATATTGGAAGCAAAGGTCTGGCAGCAATTGAAGTGACCAATGTGGAATTTGAAGGAAATGGAGATATTGGAGATGAGGAAGAAGAGGAAGAAGAGTTGAAGGAAACTAGAGGTTTACTCGAAAGAATCAGGGATGAGGGGCAAAGAGACAATGGTTTTGTTGATGAAAATGGAGGTGTTGATGATGTTCGAGAGCTCGAGATTTCAATGGAGGACGAGAAACCAAGTGATTCTGTCGAAAAAAGTGTTCTAGGTTTGTTGAATGAAGTTGACTCTGCTGCTGTTTATCCTCATGCAGACTATAGAACTTCTGAAGGGGTCGGGTCGGCGAAATCTAGTGATGAAACAAATGCAATAACGACATTGACAAACAAGGCAACCAAGTCTGAAGAAGCTGAGCAACTTCCTAAAGTAACAATGATTGATGTGATAGAATCTGATGAGGGTTTGTCTATATCAGCTATGACTATTGAACACAAAGTTGAGGCAAATTCTCCACATAAAGATCATAGGATGTCTTCCAATGAGGAACTATCAGGAGAGGTAAAGATAAGAGAAAAGATTGCTTCGATGAAGAAGATCGTAGGATACAAGGCTACCCCCCTCGGAACATACTTAGACGAAGTGAACGCTCTATATGCCTTCATCGGAGTCGAGCCACCTTCCCCGATGAAAGATTCTGCTAATGATGATGATATCAATCTACTTAACCAGAAGTTGCAGTTTCTCATGTCCATAGTAGGGGTCAAGTAGAGGTGTTAAAAGTTTGGCATTGAATAAGTTGCCATCGGTGCACTTTTGTTTGCCAATGGCAATATTTCCTTATGTTTGAGCTATGCACAGTTTGGTTTTGAAATTATGTACCAAATTTTGAACAGTTTTGTCTACTTTTTTCCCTCTTTCTGTTCAGTTCTTTTCATCCACAAGGGTTCTTGAATCACTCATGGGTGTTAGGAAGTGAAAAATCCAAGAGATCTATGAAATTTAAGCTTAAACGATGAAGAGTGATCAGATAAAATTATAGAACATTCAATAATTGAAATGAATATCGAAGATTCAGAGTGATGAAAAAGAACACAAAACTTCAAATCATGAACTATTATTAGTGTTATGAGAGGCACCTGAGTCTTGCAAATGTTAGATAACCAGTGTGACCTCTGGCCTCACTACATGGCCTAACCATGACCGTACTCGAATCACCATTCGTGTTGTTAGCTTGCTTTGATCTCACCCGCTTCTTGCACGGTGGAAGCTCGACAGAACCACCTCCTTCAACCTGGCTGCTCTCTGTTTTACCTTCTCGAACCTCGTAAGTCCGAAGCAATACCTCAAATGTTCTTATATCTACACAAAAGGGACGACAACAACGACGACAATAGTGAATAATACATATCAACATATCTATAGGAATGAAGGTATATAAATGGGTCATTCAGAACAACCTGTAAACTTAGATCGTAGAGTTTCGGCCGACCGTTGAACTTGCTCGATACATGGAGAGAACGAGCATAATACGCCGTCTTGTTTCAGCATTCTCTCAGCTGAAGGAATGGCCAACCAAGGTTGAGGTAGATCCAAGAAAACTGAATCTGCCAATCCAACAAACTCTTCAGGGAAGCCCTGACCTTGAACATCTCTAACTCCCACAGTGACTAAGCTGGTCAAGCCAGTCTTCTCAAAATCCTCTCTGAAAAAAGAGGGAGACAAGCAAATCATCATAGAACACAATTACTCTGCAGGGTATAACTTGACAACATACAACAAATCCCGGTTCCTACCTCGCCGAGGCAGCTCTCTGTTCATGAAAATCAAACGTATAGACGTGTCCTGTAGGCGCTACAGCCCTAGCTAAAGACGTCGTCAAAGATCCACTGCCAGTCCCTGATTCTAGAATCAAACAACCAGGAACTATCTCTAAGAACATAATTACAAAACTAATGTCCGCAATGTATAGAATCTGTGTCCTGTGGCTTAGAACCAAAGTCCATAATTCAGGAGTTGGGGCTAACAAGTAAACAAAGCCACCTTTGTTACTCAACACCTTGGACCCAAATGGCTTTCCAATCCAATCAGAATGTTTAAACATACCAAACCTATTCTGAAATGTTGAATTTTCACATACTTTCACAGCCTTCATGACATCATGTCTCTCATAAACAATGACTAAATCTCCATCCCTTATAGAACGAGTGAAAGATATCTTCTTCAAGTCATCAGTTGGCAACATCCTTGATCTTCAAAAGTTCATAAAGAATCAAGAATACCAG

Coding sequence (CDS)

ATGGCCGATGAACCACCTGGGTTCATCAGGATGGAGGGCTATCGCAGCATAGACTGGAATATGGAAGAACAACTGTCCTCCGGTGATGGCCTGAGCAGTGAAAAGATATGCTCCGCCGTGCAAAAGGGTTGCAGTCTGGGGAAGAAGCTTCTCCTCACTGGTTTAGCCATATCCTCTGTTCCTGTGGTTCTTCCTCCATTGGTTATCATGTCAGCCTTTGGAATTGCAGCCTCAATTCCCTATGGGGTTTTCCTCGCCTCTTATGCTTGTACTGAGACGATCATGAGTGTTTGGCTTCCGATTCCCCCGGCCTTGAAGCTCGACCGCGCTGACGAAGAGATTGTGGAGGAAGACATCTATGAAGATGAAGAAAAACATATGATGGAAACAGCTAAGAGAGGTGGGAATTTGGATGATTTCGATATCGATGTGGTAGTCATTCAGGGGGACGAAGAGGGCGAAACCAATATTGGAAGCAAAGGTCTGGCAGCAATTGAAGTGACCAATGTGGAATTTGAAGGAAATGGAGATATTGGAGATGAGGAAGAAGAGGAAGAAGAGTTGAAGGAAACTAGAGGTTTACTCGAAAGAATCAGGGATGAGGGGCAAAGAGACAATGGTTTTGTTGATGAAAATGGAGGTGTTGATGATGTTCGAGAGCTCGAGATTTCAATGGAGGACGAGAAACCAAGTGATTCTGTCGAAAAAAGTGTTCTAGGTTTGTTGAATGAAGTTGACTCTGCTGCTGTTTATCCTCATGCAGACTATAGAACTTCTGAAGGGGTCGGGTCGGCGAAATCTAGTGATGAAACAAATGCAATAACGACATTGACAAACAAGGCAACCAAGTCTGAAGAAGCTGAGCAACTTCCTAAAGTAACAATGATTGATGTGATAGAATCTGATGAGGGTTTGTCTATATCAGCTATGACTATTGAACACAAAGTTGAGGCAAATTCTCCACATAAAGATCATAGGATGTCTTCCAATGAGGAACTATCAGGAGAGGTAAAGATAAGAGAAAAGATTGCTTCGATGAAGAAGATCGTAGGATACAAGGCTACCCCCCTCGGAACATACTTAGACGAAGTGAACGCTCTATATGCCTTCATCGGAGTCGAGCCACCTTCCCCGATGAAAGATTCTGCTAATGATGATGATATCAATCTACTTAACCAGAAGTTGCAGTTTCTCATGTCCATAGTAGGGGTCAAGTAG

Protein sequence

MADEPPGFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSVPVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIYEDEEKHMMETAKRGGNLDDFDIDVVVIQGDEEGETNIGSKGLAAIEVTNVEFEGNGDIGDEEEEEEELKETRGLLERIRDEGQRDNGFVDENGGVDDVRELEISMEDEKPSDSVEKSVLGLLNEVDSAAVYPHADYRTSEGVGSAKSSDETNAITTLTNKATKSEEAEQLPKVTMIDVIESDEGLSISAMTIEHKVEANSPHKDHRMSSNEELSGEVKIREKIASMKKIVGYKATPLGTYLDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK
Homology
BLAST of Cp4.1LG18g02610 vs. NCBI nr
Match: XP_023516026.1 (uncharacterized protein LOC111780015 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 769 bits (1987), Expect = 6.60e-280
Identity = 405/405 (100.00%), Postives = 405/405 (100.00%), Query Frame = 0

Query: 1   MADEPPGFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 60
           MADEPPGFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV
Sbjct: 1   MADEPPGFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 60

Query: 61  PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY 120
           PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY
Sbjct: 61  PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY 120

Query: 121 EDEEKHMMETAKRGGNLDDFDIDVVVIQGDEEGETNIGSKGLAAIEVTNVEFEGNGDIGD 180
           EDEEKHMMETAKRGGNLDDFDIDVVVIQGDEEGETNIGSKGLAAIEVTNVEFEGNGDIGD
Sbjct: 121 EDEEKHMMETAKRGGNLDDFDIDVVVIQGDEEGETNIGSKGLAAIEVTNVEFEGNGDIGD 180

Query: 181 EEEEEEELKETRGLLERIRDEGQRDNGFVDENGGVDDVRELEISMEDEKPSDSVEKSVLG 240
           EEEEEEELKETRGLLERIRDEGQRDNGFVDENGGVDDVRELEISMEDEKPSDSVEKSVLG
Sbjct: 181 EEEEEEELKETRGLLERIRDEGQRDNGFVDENGGVDDVRELEISMEDEKPSDSVEKSVLG 240

Query: 241 LLNEVDSAAVYPHADYRTSEGVGSAKSSDETNAITTLTNKATKSEEAEQLPKVTMIDVIE 300
           LLNEVDSAAVYPHADYRTSEGVGSAKSSDETNAITTLTNKATKSEEAEQLPKVTMIDVIE
Sbjct: 241 LLNEVDSAAVYPHADYRTSEGVGSAKSSDETNAITTLTNKATKSEEAEQLPKVTMIDVIE 300

Query: 301 SDEGLSISAMTIEHKVEANSPHKDHRMSSNEELSGEVKIREKIASMKKIVGYKATPLGTY 360
           SDEGLSISAMTIEHKVEANSPHKDHRMSSNEELSGEVKIREKIASMKKIVGYKATPLGTY
Sbjct: 301 SDEGLSISAMTIEHKVEANSPHKDHRMSSNEELSGEVKIREKIASMKKIVGYKATPLGTY 360

Query: 361 LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK 405
           LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK
Sbjct: 361 LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK 405

BLAST of Cp4.1LG18g02610 vs. NCBI nr
Match: KAG6590083.1 (hypothetical protein SDJN03_15506, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 758 bits (1956), Expect = 3.50e-275
Identity = 399/405 (98.52%), Postives = 402/405 (99.26%), Query Frame = 0

Query: 1   MADEPPGFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 60
           MADEPP FIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV
Sbjct: 1   MADEPPEFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 60

Query: 61  PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY 120
           PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY
Sbjct: 61  PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY 120

Query: 121 EDEEKHMMETAKRGGNLDDFDIDVVVIQGDEEGETNIGSKGLAAIEVTNVEFEGNGDIGD 180
           EDEEKHMMETAKRGGNLDDFDIDVVV+QGDEEGET+IGSKGLAAIEVTNVEFEGNGDIGD
Sbjct: 121 EDEEKHMMETAKRGGNLDDFDIDVVVVQGDEEGETDIGSKGLAAIEVTNVEFEGNGDIGD 180

Query: 181 EEEEEEELKETRGLLERIRDEGQRDNGFVDENGGVDDVRELEISMEDEKPSDSVEKSVLG 240
           EEEEEEELKETRGLLERIRDEG+RDNGFVD NGGVD VRELEISMEDEKPSDSVEKSVLG
Sbjct: 181 EEEEEEELKETRGLLERIRDEGRRDNGFVDANGGVDHVRELEISMEDEKPSDSVEKSVLG 240

Query: 241 LLNEVDSAAVYPHADYRTSEGVGSAKSSDETNAITTLTNKATKSEEAEQLPKVTMIDVIE 300
           LLNEVDSAAVYPHADYRTSEGVGSAKSSDETNAITTLTNKATKSEEAEQLPKVTMIDVIE
Sbjct: 241 LLNEVDSAAVYPHADYRTSEGVGSAKSSDETNAITTLTNKATKSEEAEQLPKVTMIDVIE 300

Query: 301 SDEGLSISAMTIEHKVEANSPHKDHRMSSNEELSGEVKIREKIASMKKIVGYKATPLGTY 360
           SDEGLSISAMTIEHKVEANSPHKDHRMSSNEELSGEVKIREKIASMKKIVGYKATPLGTY
Sbjct: 301 SDEGLSISAMTIEHKVEANSPHKDHRMSSNEELSGEVKIREKIASMKKIVGYKATPLGTY 360

Query: 361 LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK 405
           LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK
Sbjct: 361 LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK 405

BLAST of Cp4.1LG18g02610 vs. NCBI nr
Match: KAG7023753.1 (hypothetical protein SDJN02_14779 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 751 bits (1940), Expect = 9.60e-273
Identity = 397/405 (98.02%), Postives = 400/405 (98.77%), Query Frame = 0

Query: 1   MADEPPGFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 60
           MADEPP FIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV
Sbjct: 1   MADEPPEFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 60

Query: 61  PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY 120
           PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLD ADEEIVEEDIY
Sbjct: 61  PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDSADEEIVEEDIY 120

Query: 121 EDEEKHMMETAKRGGNLDDFDIDVVVIQGDEEGETNIGSKGLAAIEVTNVEFEGNGDIGD 180
           EDEEKHMMETAKRGGNLDDFDIDVVV+QGDEEGET+IGSKGLAAIEVTNVEFEGNGDIGD
Sbjct: 121 EDEEKHMMETAKRGGNLDDFDIDVVVVQGDEEGETDIGSKGLAAIEVTNVEFEGNGDIGD 180

Query: 181 EEEEEEELKETRGLLERIRDEGQRDNGFVDENGGVDDVRELEISMEDEKPSDSVEKSVLG 240
           EEEEEEELKETRGLLERIRDEG+RDNGFVD NGGVD VRELEISMEDEKPSDSVEKSVLG
Sbjct: 181 EEEEEEELKETRGLLERIRDEGRRDNGFVDANGGVDHVRELEISMEDEKPSDSVEKSVLG 240

Query: 241 LLNEVDSAAVYPHADYRTSEGVGSAKSSDETNAITTLTNKATKSEEAEQLPKVTMIDVIE 300
           LLNEVDSAAVYPHAD RTSEGVGSAKSSDETNAITTLTNKATKSEEAEQLPKVTMIDVIE
Sbjct: 241 LLNEVDSAAVYPHADDRTSEGVGSAKSSDETNAITTLTNKATKSEEAEQLPKVTMIDVIE 300

Query: 301 SDEGLSISAMTIEHKVEANSPHKDHRMSSNEELSGEVKIREKIASMKKIVGYKATPLGTY 360
           SDEGLSISAMTIEHKVEANSPHKDHRMSSNEELSGEVKIREKIASMKKIVGYKATPLGTY
Sbjct: 301 SDEGLSISAMTIEHKVEANSPHKDHRMSSNEELSGEVKIREKIASMKKIVGYKATPLGTY 360

Query: 361 LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK 405
           LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK
Sbjct: 361 LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK 405

BLAST of Cp4.1LG18g02610 vs. NCBI nr
Match: XP_022987097.1 (uncharacterized protein LOC111484754 isoform X1 [Cucurbita maxima])

HSP 1 Score: 742 bits (1916), Expect = 4.36e-269
Identity = 391/405 (96.54%), Postives = 399/405 (98.52%), Query Frame = 0

Query: 1   MADEPPGFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 60
           MADEPP FIRMEGYRSIDWN+EEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV
Sbjct: 1   MADEPPEFIRMEGYRSIDWNIEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 60

Query: 61  PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY 120
           PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEE+VEEDIY
Sbjct: 61  PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEMVEEDIY 120

Query: 121 EDEEKHMMETAKRGGNLDDFDIDVVVIQGDEEGETNIGSKGLAAIEVTNVEFEGNGDIGD 180
           EDEEKHMMETAKRG NLDDFDIDVVV+QG EEGET+IGSKGLAAIEVTNVEFEGNGD GD
Sbjct: 121 EDEEKHMMETAKRGENLDDFDIDVVVVQGGEEGETDIGSKGLAAIEVTNVEFEGNGDNGD 180

Query: 181 EEEEEEELKETRGLLERIRDEGQRDNGFVDENGGVDDVRELEISMEDEKPSDSVEKSVLG 240
           EEEEEEELKETRGLLERIRDEG+RDNGFVDENGGVDDVRELEIS+EDEKPSDSVE+SVLG
Sbjct: 181 EEEEEEELKETRGLLERIRDEGRRDNGFVDENGGVDDVRELEISIEDEKPSDSVEESVLG 240

Query: 241 LLNEVDSAAVYPHADYRTSEGVGSAKSSDETNAITTLTNKATKSEEAEQLPKVTMIDVIE 300
           LLNEVDSAAVYPHADYRTSEGVG AKSSDETNAITTL+NKA KSEEAEQLPKVTMIDVIE
Sbjct: 241 LLNEVDSAAVYPHADYRTSEGVGWAKSSDETNAITTLSNKAAKSEEAEQLPKVTMIDVIE 300

Query: 301 SDEGLSISAMTIEHKVEANSPHKDHRMSSNEELSGEVKIREKIASMKKIVGYKATPLGTY 360
           SDEGLSISAMTIEHKVEANSPHKDHRMSSNEELSGEVKIREKIASMKKIVGYKATPLGTY
Sbjct: 301 SDEGLSISAMTIEHKVEANSPHKDHRMSSNEELSGEVKIREKIASMKKIVGYKATPLGTY 360

Query: 361 LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK 405
           LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK
Sbjct: 361 LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK 405

BLAST of Cp4.1LG18g02610 vs. NCBI nr
Match: XP_022961157.1 (uncharacterized protein LOC111461753 isoform X1 [Cucurbita moschata])

HSP 1 Score: 741 bits (1913), Expect = 1.25e-268
Identity = 391/405 (96.54%), Postives = 396/405 (97.78%), Query Frame = 0

Query: 1   MADEPPGFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 60
           MADEPP FIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV
Sbjct: 1   MADEPPEFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 60

Query: 61  PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY 120
           PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY
Sbjct: 61  PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY 120

Query: 121 EDEEKHMMETAKRGGNLDDFDIDVVVIQGDEEGETNIGSKGLAAIEVTNVEFEGNGDIGD 180
           EDEEKHMMETAKRG NLDDFDIDVVV+QGDEE ET+IGSKGLAAIEVTNVEFEGNGDIGD
Sbjct: 121 EDEEKHMMETAKRGENLDDFDIDVVVVQGDEESETDIGSKGLAAIEVTNVEFEGNGDIGD 180

Query: 181 EEEEEEELKETRGLLERIRDEGQRDNGFVDENGGVDDVRELEISMEDEKPSDSVEKSVLG 240
           EEEEEEELKETRGLLERIRDEG+RDNGFVDENGGV+DVRELEISMEDEKPSDSVEKSVLG
Sbjct: 181 EEEEEEELKETRGLLERIRDEGRRDNGFVDENGGVEDVRELEISMEDEKPSDSVEKSVLG 240

Query: 241 LLNEVDSAAVYPHADYRTSEGVGSAKSSDETNAITTLTNKATKSEEAEQLPKVTMIDVIE 300
           LLNEVDSAAVYPH  YRTSEGVGSAKSSDETNAITTLTNKA KSEEAEQLPKVTMIDVIE
Sbjct: 241 LLNEVDSAAVYPHEHYRTSEGVGSAKSSDETNAITTLTNKAAKSEEAEQLPKVTMIDVIE 300

Query: 301 SDEGLSISAMTIEHKVEANSPHKDHRMSSNEELSGEVKIREKIASMKKIVGYKATPLGTY 360
           SDEGLSIS +TIEHKVEAN PHKDHR SSNEELSGEVKIREKIASMKKIVGYKATPLGTY
Sbjct: 301 SDEGLSISEVTIEHKVEANFPHKDHRTSSNEELSGEVKIREKIASMKKIVGYKATPLGTY 360

Query: 361 LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK 405
           LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK
Sbjct: 361 LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK 405

BLAST of Cp4.1LG18g02610 vs. ExPASy TrEMBL
Match: A0A6J1J9E9 (uncharacterized protein LOC111484754 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111484754 PE=4 SV=1)

HSP 1 Score: 742 bits (1916), Expect = 2.11e-269
Identity = 391/405 (96.54%), Postives = 399/405 (98.52%), Query Frame = 0

Query: 1   MADEPPGFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 60
           MADEPP FIRMEGYRSIDWN+EEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV
Sbjct: 1   MADEPPEFIRMEGYRSIDWNIEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 60

Query: 61  PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY 120
           PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEE+VEEDIY
Sbjct: 61  PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEMVEEDIY 120

Query: 121 EDEEKHMMETAKRGGNLDDFDIDVVVIQGDEEGETNIGSKGLAAIEVTNVEFEGNGDIGD 180
           EDEEKHMMETAKRG NLDDFDIDVVV+QG EEGET+IGSKGLAAIEVTNVEFEGNGD GD
Sbjct: 121 EDEEKHMMETAKRGENLDDFDIDVVVVQGGEEGETDIGSKGLAAIEVTNVEFEGNGDNGD 180

Query: 181 EEEEEEELKETRGLLERIRDEGQRDNGFVDENGGVDDVRELEISMEDEKPSDSVEKSVLG 240
           EEEEEEELKETRGLLERIRDEG+RDNGFVDENGGVDDVRELEIS+EDEKPSDSVE+SVLG
Sbjct: 181 EEEEEEELKETRGLLERIRDEGRRDNGFVDENGGVDDVRELEISIEDEKPSDSVEESVLG 240

Query: 241 LLNEVDSAAVYPHADYRTSEGVGSAKSSDETNAITTLTNKATKSEEAEQLPKVTMIDVIE 300
           LLNEVDSAAVYPHADYRTSEGVG AKSSDETNAITTL+NKA KSEEAEQLPKVTMIDVIE
Sbjct: 241 LLNEVDSAAVYPHADYRTSEGVGWAKSSDETNAITTLSNKAAKSEEAEQLPKVTMIDVIE 300

Query: 301 SDEGLSISAMTIEHKVEANSPHKDHRMSSNEELSGEVKIREKIASMKKIVGYKATPLGTY 360
           SDEGLSISAMTIEHKVEANSPHKDHRMSSNEELSGEVKIREKIASMKKIVGYKATPLGTY
Sbjct: 301 SDEGLSISAMTIEHKVEANSPHKDHRMSSNEELSGEVKIREKIASMKKIVGYKATPLGTY 360

Query: 361 LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK 405
           LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK
Sbjct: 361 LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK 405

BLAST of Cp4.1LG18g02610 vs. ExPASy TrEMBL
Match: A0A6J1H9E7 (uncharacterized protein LOC111461753 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111461753 PE=4 SV=1)

HSP 1 Score: 741 bits (1913), Expect = 6.05e-269
Identity = 391/405 (96.54%), Postives = 396/405 (97.78%), Query Frame = 0

Query: 1   MADEPPGFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 60
           MADEPP FIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV
Sbjct: 1   MADEPPEFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 60

Query: 61  PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY 120
           PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY
Sbjct: 61  PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY 120

Query: 121 EDEEKHMMETAKRGGNLDDFDIDVVVIQGDEEGETNIGSKGLAAIEVTNVEFEGNGDIGD 180
           EDEEKHMMETAKRG NLDDFDIDVVV+QGDEE ET+IGSKGLAAIEVTNVEFEGNGDIGD
Sbjct: 121 EDEEKHMMETAKRGENLDDFDIDVVVVQGDEESETDIGSKGLAAIEVTNVEFEGNGDIGD 180

Query: 181 EEEEEEELKETRGLLERIRDEGQRDNGFVDENGGVDDVRELEISMEDEKPSDSVEKSVLG 240
           EEEEEEELKETRGLLERIRDEG+RDNGFVDENGGV+DVRELEISMEDEKPSDSVEKSVLG
Sbjct: 181 EEEEEEELKETRGLLERIRDEGRRDNGFVDENGGVEDVRELEISMEDEKPSDSVEKSVLG 240

Query: 241 LLNEVDSAAVYPHADYRTSEGVGSAKSSDETNAITTLTNKATKSEEAEQLPKVTMIDVIE 300
           LLNEVDSAAVYPH  YRTSEGVGSAKSSDETNAITTLTNKA KSEEAEQLPKVTMIDVIE
Sbjct: 241 LLNEVDSAAVYPHEHYRTSEGVGSAKSSDETNAITTLTNKAAKSEEAEQLPKVTMIDVIE 300

Query: 301 SDEGLSISAMTIEHKVEANSPHKDHRMSSNEELSGEVKIREKIASMKKIVGYKATPLGTY 360
           SDEGLSIS +TIEHKVEAN PHKDHR SSNEELSGEVKIREKIASMKKIVGYKATPLGTY
Sbjct: 301 SDEGLSISEVTIEHKVEANFPHKDHRTSSNEELSGEVKIREKIASMKKIVGYKATPLGTY 360

Query: 361 LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK 405
           LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK
Sbjct: 361 LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK 405

BLAST of Cp4.1LG18g02610 vs. ExPASy TrEMBL
Match: A0A6J1JHX4 (uncharacterized protein LOC111484755 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111484755 PE=4 SV=1)

HSP 1 Score: 741 bits (1912), Expect = 8.59e-269
Identity = 391/405 (96.54%), Postives = 398/405 (98.27%), Query Frame = 0

Query: 1   MADEPPGFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 60
           MADEPP FIRMEGYRSIDWN+EEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV
Sbjct: 1   MADEPPVFIRMEGYRSIDWNIEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 60

Query: 61  PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY 120
           PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEE+VEEDIY
Sbjct: 61  PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEMVEEDIY 120

Query: 121 EDEEKHMMETAKRGGNLDDFDIDVVVIQGDEEGETNIGSKGLAAIEVTNVEFEGNGDIGD 180
           EDEEKHMMETAKRG NLDDFDIDVVV+QG EEGET+IGSKGLAAIEVTNVEFEGNGD GD
Sbjct: 121 EDEEKHMMETAKRGENLDDFDIDVVVVQGGEEGETDIGSKGLAAIEVTNVEFEGNGDNGD 180

Query: 181 EEEEEEELKETRGLLERIRDEGQRDNGFVDENGGVDDVRELEISMEDEKPSDSVEKSVLG 240
           EE EEEELKETRGLLERIRDEG+RDNGFVDENGGVDDVRELEIS+EDEKPSDSVEKSVLG
Sbjct: 181 EEGEEEELKETRGLLERIRDEGRRDNGFVDENGGVDDVRELEISIEDEKPSDSVEKSVLG 240

Query: 241 LLNEVDSAAVYPHADYRTSEGVGSAKSSDETNAITTLTNKATKSEEAEQLPKVTMIDVIE 300
           LLNEVDSAAVYPHADYRTSEGVG AKSSDETNAITTL+NKA KSEEAEQLPKVTMIDVIE
Sbjct: 241 LLNEVDSAAVYPHADYRTSEGVGWAKSSDETNAITTLSNKAAKSEEAEQLPKVTMIDVIE 300

Query: 301 SDEGLSISAMTIEHKVEANSPHKDHRMSSNEELSGEVKIREKIASMKKIVGYKATPLGTY 360
           SDEGLSISAMTIEHKVEANSPHKDHRMSSNEELSGEVKIREKIASMKKIVGYKATPLGTY
Sbjct: 301 SDEGLSISAMTIEHKVEANSPHKDHRMSSNEELSGEVKIREKIASMKKIVGYKATPLGTY 360

Query: 361 LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK 405
           LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK
Sbjct: 361 LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK 405

BLAST of Cp4.1LG18g02610 vs. ExPASy TrEMBL
Match: A0A6J1JIH1 (uncharacterized protein LOC111484754 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111484754 PE=4 SV=1)

HSP 1 Score: 602 bits (1552), Expect = 4.03e-215
Identity = 318/332 (95.78%), Postives = 326/332 (98.19%), Query Frame = 0

Query: 1   MADEPPGFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 60
           MADEPP FIRMEGYRSIDWN+EEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV
Sbjct: 1   MADEPPEFIRMEGYRSIDWNIEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 60

Query: 61  PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY 120
           PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEE+VEEDIY
Sbjct: 61  PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEMVEEDIY 120

Query: 121 EDEEKHMMETAKRGGNLDDFDIDVVVIQGDEEGETNIGSKGLAAIEVTNVEFEGNGDIGD 180
           EDEEKHMMETAKRG NLDDFDIDVVV+QG EEGET+IGSKGLAAIEVTNVEFEGNGD GD
Sbjct: 121 EDEEKHMMETAKRGENLDDFDIDVVVVQGGEEGETDIGSKGLAAIEVTNVEFEGNGDNGD 180

Query: 181 EEEEEEELKETRGLLERIRDEGQRDNGFVDENGGVDDVRELEISMEDEKPSDSVEKSVLG 240
           EEEEEEELKETRGLLERIRDEG+RDNGFVDENGGVDDVRELEIS+EDEKPSDSVE+SVLG
Sbjct: 181 EEEEEEELKETRGLLERIRDEGRRDNGFVDENGGVDDVRELEISIEDEKPSDSVEESVLG 240

Query: 241 LLNEVDSAAVYPHADYRTSEGVGSAKSSDETNAITTLTNKATKSEEAEQLPKVTMIDVIE 300
           LLNEVDSAAVYPHADYRTSEGVG AKSSDETNAITTL+NKA KSEEAEQLPKVTMIDVIE
Sbjct: 241 LLNEVDSAAVYPHADYRTSEGVGWAKSSDETNAITTLSNKAAKSEEAEQLPKVTMIDVIE 300

Query: 301 SDEGLSISAMTIEHKVEANSPHKDHRMSSNEE 332
           SDEGLSISAMTIEHKVEANSPHKDHRMSSNEE
Sbjct: 301 SDEGLSISAMTIEHKVEANSPHKDHRMSSNEE 332

BLAST of Cp4.1LG18g02610 vs. ExPASy TrEMBL
Match: A0A6J1HB22 (uncharacterized protein LOC111461753 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111461753 PE=4 SV=1)

HSP 1 Score: 601 bits (1549), Expect = 1.15e-214
Identity = 318/332 (95.78%), Postives = 323/332 (97.29%), Query Frame = 0

Query: 1   MADEPPGFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 60
           MADEPP FIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV
Sbjct: 1   MADEPPEFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 60

Query: 61  PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY 120
           PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY
Sbjct: 61  PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY 120

Query: 121 EDEEKHMMETAKRGGNLDDFDIDVVVIQGDEEGETNIGSKGLAAIEVTNVEFEGNGDIGD 180
           EDEEKHMMETAKRG NLDDFDIDVVV+QGDEE ET+IGSKGLAAIEVTNVEFEGNGDIGD
Sbjct: 121 EDEEKHMMETAKRGENLDDFDIDVVVVQGDEESETDIGSKGLAAIEVTNVEFEGNGDIGD 180

Query: 181 EEEEEEELKETRGLLERIRDEGQRDNGFVDENGGVDDVRELEISMEDEKPSDSVEKSVLG 240
           EEEEEEELKETRGLLERIRDEG+RDNGFVDENGGV+DVRELEISMEDEKPSDSVEKSVLG
Sbjct: 181 EEEEEEELKETRGLLERIRDEGRRDNGFVDENGGVEDVRELEISMEDEKPSDSVEKSVLG 240

Query: 241 LLNEVDSAAVYPHADYRTSEGVGSAKSSDETNAITTLTNKATKSEEAEQLPKVTMIDVIE 300
           LLNEVDSAAVYPH  YRTSEGVGSAKSSDETNAITTLTNKA KSEEAEQLPKVTMIDVIE
Sbjct: 241 LLNEVDSAAVYPHEHYRTSEGVGSAKSSDETNAITTLTNKAAKSEEAEQLPKVTMIDVIE 300

Query: 301 SDEGLSISAMTIEHKVEANSPHKDHRMSSNEE 332
           SDEGLSIS +TIEHKVEAN PHKDHR SSNEE
Sbjct: 301 SDEGLSISEVTIEHKVEANFPHKDHRTSSNEE 332

BLAST of Cp4.1LG18g02610 vs. TAIR 10
Match: AT1G65090.3 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G36100.1). )

HSP 1 Score: 87.8 bits (216), Expect = 2.2e-17
Identity = 106/387 (27.39%), Postives = 165/387 (42.64%), Query Frame = 0

Query: 21  MEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSVPVVLPPLVIMSAFGIAASIP 80
           MEE  S+    S +K      K  S+GKK+L  G+ +SS P+++P L + S     +S+P
Sbjct: 5   MEEYQSNE---SEDKRSWIWSKAVSVGKKVLTAGVVVSSAPLLVPSLFVASTLAFLSSVP 64

Query: 81  YGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIYEDEEKHMMETAKRGGNLDDF 140
           + +FLA+YACT+ +MS  LP           EE       +D+E    E +K G      
Sbjct: 65  FCLFLANYACTQKVMSTLLP---------DTEETGGVGKEDDDESGFDEYSKIGHG---- 124

Query: 141 DIDVVVIQGDEEGETNIGSKGLAAIEVTNVEFEGNGDIGDEEEEEEELKETRGLLERIRD 200
                      EG   +G   L   +   +  +        +E+EE  KE+  LLE+IRD
Sbjct: 125 -----------EGAAGVGEAALFRGKEEPIPIQ-------VKEDEEMAKESTSLLEKIRD 184

Query: 201 EGQRDNGFVDENGGVDDVRELEISMEDEKPSDSVEKSVLGLLNEVDSAAVYPHADYRTSE 260
           EG+ D                E +++D+K S                             
Sbjct: 185 EGRTDK------------ETSERTLQDDKKS----------------------------- 244

Query: 261 GVGSAKSSDETNAITTLTNKATKSEEAEQLPKVTMIDVIESDEGLSISAMTIEHKVEANS 320
             G+AKS +             +  E  + P+       E + G +        K+E ++
Sbjct: 245 --GNAKSEE-----------VQEQPEKREAPETRR----EGETGAT--------KIETST 287

Query: 321 PHKDHRMSSNEELSGEVKIREKIASMKKIVGYKATPLGTYLDEVNALYAFIG-VEPPSPM 380
              D  +SSNE  S E ++ E + +++K+VGY      T  +E+ ALY F G VEPP   
Sbjct: 305 GKDDEEISSNEVYS-EEQLWETMETLRKVVGYSVARSATCAEELKALYVFTGVVEPP--- 287

Query: 381 KDSANDD--DINLLNQKLQFLMSIVGV 405
           + S N D  DI  L  +L+FLMS++G+
Sbjct: 365 RSSLNQDTYDIAHLTIRLRFLMSVIGI 287

BLAST of Cp4.1LG18g02610 vs. TAIR 10
Match: AT5G36100.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G65090.3); Has 57 Blast hits to 49 proteins in 15 species: Archae - 0; Bacteria - 4; Metazoa - 6; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink). )

HSP 1 Score: 84.0 bits (206), Expect = 3.1e-16
Identity = 98/367 (26.70%), Postives = 155/367 (42.23%), Query Frame = 0

Query: 41  QKGCSLGKKLLLTGLAISSVPVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLP 100
           +KG S+GKK+L     + S P ++P LV+ S   + +S+PY  FL SY CTE +M   LP
Sbjct: 16  RKGVSVGKKVLAACFLVFSAPFLVPALVVASTIALISSLPYCFFLVSYVCTEKLMRKLLP 75

Query: 101 IPPALKLDRADEEIVEEDIYEDEEKHMMETAKRGGNLDDFDIDVVVIQGDEEGETNIGSK 160
                   R D E+V   +++++  H        G++ D  +  V I             
Sbjct: 76  ANAF--SGRCDHEMV---LHQNKISH--------GDIYDEAVARVAIS------------ 135

Query: 161 GLAAIEVTNVEFEGNGDIGDEEEEEEEL-KETRGLLERIRDEGQRDNGFVDENGGVDDVR 220
                E   V+ E    I     E+E++ KE +  LE IRDEG+ +              
Sbjct: 136 -----EPVLVQIEEETTIAIAYREDEDMTKELKSWLESIRDEGKNNQSL----------- 195

Query: 221 ELEISMEDEKPSDSVEKSVLGLLNEVDSAAVYPHADYRTSEGVGSAKSSDETNAITTLTN 280
                                               YR   GV   K  +E +   ++  
Sbjct: 196 ------------------------------------YR---GVILEKGFEEEDKDQSIVP 255

Query: 281 KATKSEEAEQLPKVTMIDVI-ESDEGLSISAMTIEHKVEANSPHKDHRMSSNEELSGEVK 340
           +  KSE      +  + D++ +  E ++I    +E      S  KD  +SS   L  E +
Sbjct: 256 RDAKSENV----RAKLEDLLGKKQESVTIHEGELESTTSKTSREKDMEISSTTVLYSEEQ 295

Query: 341 IREKIASMKKIVGYKATPLGTYLDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFL 400
           I  KI +++K+VGY  T   TY +E+ ALY F GVE P+    +  + DI  +++ L FL
Sbjct: 316 IWTKIEALRKVVGYNVTRSTTYSEELKALYMFTGVELPT---STLENQDIAKVSEGLSFL 295

Query: 401 MSIVGVK 406
           MS++G+K
Sbjct: 376 MSVIGIK 295

BLAST of Cp4.1LG18g02610 vs. TAIR 10
Match: AT1G65090.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G36100.1); Has 1234 Blast hits to 904 proteins in 178 species: Archae - 0; Bacteria - 58; Metazoa - 431; Fungi - 95; Plants - 83; Viruses - 38; Other Eukaryotes - 529 (source: NCBI BLink). )

HSP 1 Score: 78.6 bits (192), Expect = 1.3e-14
Identity = 106/412 (25.73%), Postives = 176/412 (42.72%), Query Frame = 0

Query: 21  MEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSVPVVLPPLVIMSAFGIAASIP 80
           MEE  S+    S +K      K  S+GKK+L  G+ +SS P+++P L + S     +S+P
Sbjct: 5   MEEYQSNE---SEDKRSWIWSKAVSVGKKVLTAGVVVSSAPLLVPSLFVASTLAFLSSVP 64

Query: 81  YGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIYEDEEKHMMETAKRGGNLDDF 140
           + +FLA+YACT+ +MS  LP           EE       +D+E    E +K G      
Sbjct: 65  FCLFLANYACTQKVMSTLLP---------DTEETGGVGKEDDDESGFDEYSKIGHG---- 124

Query: 141 DIDVVVIQGDEEGETNIGSKGLAAIEVTNVEFEGNGDIGDEEEEEEELKETRGLLERIRD 200
                      EG   +G   L   +   +  +        +E+EE  KE+  LLE+IRD
Sbjct: 125 -----------EGAAGVGEAALFRGKEEPIPIQ-------VKEDEEMAKESTSLLEKIRD 184

Query: 201 EGQRDNGFV------DENGGVDDVRELEISMEDEKPSDSVEKSVLGLLNEVDSAAVYPHA 260
           EG+ D          D+  G     E++   E  +  ++  +   G   +++++      
Sbjct: 185 EGRTDKETSERTLQDDKKSGNAKSEEVQEQPEKREAPETRREGETG-ATKIETSTGKDDE 244

Query: 261 DYRTSEGV----GSAKSSDETNAITTLTNKATKSEEAEQLPKVT---------------M 320
           +  ++E +    G+  + +E    TT   K T       L   T                
Sbjct: 245 EISSNEPIDQASGAQGTGEEKRKNTTKKKKKTGRAGNRFLKCHTWSSSKLCGRCDLLECC 304

Query: 321 IDVIESDEGLSISAMTIEHKVEANSPHKDHRMSSNEELSGEVKIREKIASMKKIVGYKAT 380
            D ++      I+   +    EA+       M  N ++  E ++ E + +++K+VGY   
Sbjct: 305 FDRVDCVVRRVITCSALSLISEASVKMSRICMVLNLQVYSEEQLWETMETLRKVVGYSVA 364

Query: 381 PLGTYLDEVNALYAFIG-VEPPSPMKDSANDD--DINLLNQKLQFLMSIVGV 405
              T  +E+ ALY F G VEPP   + S N D  DI  L  +L+FLMS++G+
Sbjct: 365 RSATCAEELKALYVFTGVVEPP---RSSLNQDTYDIAHLTIRLRFLMSVIGI 378

BLAST of Cp4.1LG18g02610 vs. TAIR 10
Match: AT1G65090.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G36100.1); Has 1435 Blast hits to 1033 proteins in 192 species: Archae - 0; Bacteria - 61; Metazoa - 511; Fungi - 123; Plants - 100; Viruses - 42; Other Eukaryotes - 598 (source: NCBI BLink). )

HSP 1 Score: 60.8 bits (146), Expect = 2.8e-09
Identity = 57/185 (30.81%), Postives = 86/185 (46.49%), Query Frame = 0

Query: 21  MEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSVPVVLPPLVIMSAFGIAASIP 80
           MEE  S+    S +K      K  S+GKK+L  G+ +SS P+++P L + S     +S+P
Sbjct: 5   MEEYQSNE---SEDKRSWIWSKAVSVGKKVLTAGVVVSSAPLLVPSLFVASTLAFLSSVP 64

Query: 81  YGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIYEDEEKHMMETAKRGGNLDDF 140
           + +FLA+YACT+ +MS  LP           EE       +D+E    E +K G      
Sbjct: 65  FCLFLANYACTQKVMSTLLP---------DTEETGGVGKEDDDESGFDEYSKIGHG---- 124

Query: 141 DIDVVVIQGDEEGETNIGSKGLAAIEVTNVEFEGNGDIGDEEEEEEELKETRGLLERIRD 200
                      EG   +G   L   +   +  +        +E+EE  KE+  LLE+IRD
Sbjct: 125 -----------EGAAGVGEAALFRGKEEPIPIQ-------VKEDEEMAKESTSLLEKIRD 155

Query: 201 EGQRD 206
           EG+ D
Sbjct: 185 EGRTD 155

BLAST of Cp4.1LG18g02610 vs. TAIR 10
Match: AT5G36100.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G65090.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 48.9 bits (115), Expect = 1.1e-05
Identity = 63/240 (26.25%), Postives = 106/240 (44.17%), Query Frame = 0

Query: 41  QKGCSLGKKLLLTGLAISSVPVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLP 100
           +KG S+GKK+L     + S P ++P LV+ S   + +S+PY  FL SY CTE +M   LP
Sbjct: 16  RKGVSVGKKVLAACFLVFSAPFLVPALVVASTIALISSLPYCFFLVSYVCTEKLMRKLLP 75

Query: 101 IPPALKLDRADEEIVEEDIYEDEEKHMMETAKRGGNLDDFDIDVVVIQGDEEGETNIGSK 160
                   R D E+V   +++++  H        G++ D  +  V I             
Sbjct: 76  ANAF--SGRCDHEMV---LHQNKISH--------GDIYDEAVARVAIS------------ 135

Query: 161 GLAAIEVTNVEFEGNGDIGDEEEEEEEL-KETRGLLERIRDEGQRD----NGFVDENGGV 220
                E   V+ E    I     E+E++ KE +  LE IRDEG+ +     G + E G  
Sbjct: 136 -----EPVLVQIEEETTIAIAYREDEDMTKELKSWLESIRDEGKNNQSLYRGVILEKGFE 195

Query: 221 DDVRELEISMEDEKPSDSVEKSVLGLLNEVDSAAVYPHADYRTSEGVGSAKSSDETNAIT 276
           ++ ++  I   D K S++V   +  LL +   +      +  ++    S +   E ++ T
Sbjct: 196 EEDKDQSIVPRDAK-SENVRAKLEDLLGKKQESVTIHEGELESTTSKTSREKDMEISSTT 224

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_023516026.16.60e-280100.00uncharacterized protein LOC111780015 isoform X1 [Cucurbita pepo subsp. pepo][more]
KAG6590083.13.50e-27598.52hypothetical protein SDJN03_15506, partial [Cucurbita argyrosperma subsp. sorori... [more]
KAG7023753.19.60e-27398.02hypothetical protein SDJN02_14779 [Cucurbita argyrosperma subsp. argyrosperma][more]
XP_022987097.14.36e-26996.54uncharacterized protein LOC111484754 isoform X1 [Cucurbita maxima][more]
XP_022961157.11.25e-26896.54uncharacterized protein LOC111461753 isoform X1 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
A0A6J1J9E92.11e-26996.54uncharacterized protein LOC111484754 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1H9E76.05e-26996.54uncharacterized protein LOC111461753 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1JHX48.59e-26996.54uncharacterized protein LOC111484755 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1JIH14.03e-21595.78uncharacterized protein LOC111484754 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1HB221.15e-21495.78uncharacterized protein LOC111461753 isoform X2 OS=Cucurbita moschata OX=3662 GN... [more]
Match NameE-valueIdentityDescription
AT1G65090.32.2e-1727.39unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT5G36100.13.1e-1626.70unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G65090.21.3e-1425.73unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT1G65090.12.8e-0930.81unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT5G36100.21.1e-0526.25unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 181..201
NoneNo IPR availablePANTHERPTHR37198NUCLEOLINcoord: 219..405
NoneNo IPR availablePANTHERPTHR37198:SF1NUCLEOLINcoord: 21..159
coord: 219..405
NoneNo IPR availablePANTHERPTHR37198NUCLEOLINcoord: 21..159

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG18g02610.1Cp4.1LG18g02610.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016020 membrane