Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CGCCGAACACAGAGCTGCAGAAGCTGCCACGTGTTCACTCCAACGCAGAATCGAACCTTCGTCCCTTTTTTTTTTGAAAAAAACCAATCTCTTCGCTTCATCTTCTCCAACTCGAAGAACAAAAGCCCTAAAAATGCGCTGAATCAAACATTGCCAAAATCTTAGTTTTTCTTCGAAGATTTGTGCGTGTAATTGAATCTTTCATTGTCAAGATCTCGAGCTTTTCATCTATGGCGAAAGTGATCAAGCCCTCTTCGCGCTACTGTTCCTACGATGTGCGATCTTCTACTTCCTCCCATTTCTCCGATCCTTCTTCTTCCTCTGAGTTCAAGCTCAAGTCTCCAATGGCTGCGAAATCATCGTCTTCGCGCGCTCTTGTTCAAATCAAGGCGTCTGATTTGGCTAGGACTAAGGCGAAGCCGTCGGATCAGAACTTGACGGCGATGGTGAAGAAATTCATGGAGAAGCGGTCTGGTTCGAAACCGAAGGCGGTGAAGCAGGCGGCGGGGTTGGTGATTCCGTCGGTTTTAATTGCGGAGGATTTGAAGAAGACGGCGAGGAAGGGGACGAACTTTGGAGGGCTGCATAAGAAGCTGTTTGGGAAGGGATCGGTGGAGAAGAAGGAGAAGGAGAAGGAGGTGAAGGCGTTGACGGAGGTTAAAGGGAATACGAGGACATTGGCGATGGTGCTGAGGAGTGAAAGAGAGCTTTTGGGTTTGAATAAGGAGCAGGAGTTGGAGATCACTGAGCTCAAATTAGTTCTGGAGGAGAAGTACAGAGAGGTGAGGATTGAGTATACACTACAATTGTAGTCATTACTTTTTTAACTAGAAAAATACTACTTTTTACTCTCCGTTCTAAATTATACTACAAAGTTTAGATTGCATGTTTAGAATCATTCGATTCCAATCTCAAACAAGTGTTCTTGCTTTGAGATATTCGATCAACAAAGTAATCCGTTCTAAAGCTTTCCCATGAGGTGATTCCTTATCATTAGTGCACTTGTGTTAAATTTTTGTTCTTCCTGAAAGATTGACGACGAGTCCGGATAAGAATATTGTTGTAGTTGAAGTTTTTTGGTCATGTTAGAGGCTCTTTATTCTTGAGCTAAATATGGCATTCTCATTATTAAGTTCGGATCTTTCTCTATTATGTTCGATTTTTTTGGGCGAAATCGTTTCGTGATTCGATTGTGGTACTTATGGTTGAATTTTGTGTTATGATTAGGGAGTAATGTGCCTGTCATTTTGCACATATTGTTTGTTAACACAACACTGTTTCTTATTGTTGTTGGTATCAGATTGAGAAGTTGAAAGATTTATGTTTGAAGCAAAGAGAAGAAATAAAGTCATTGAAGAATGCAATATTGTTCCCAGATGTTATGAACTCGCAGCTTCAAGATATTCTTGAAAAGCAGGGTTCAGAGTTGAAGCAAGCCAAACACATCATCCCCACTCTACAAAAGCAGGTCACCACTCTCACTGGCCAGCTTCATTCCCTCGCCGAGGACCTAGCCGAGGTATGAAAGAAAGCAATACTATTTACAATGATATATATCATTACAACTAAGACCTGAAACGTAGGAAAACTCTGTTTGACCGATTACCAAAGGGTATGATATTGTCCACTTTGAGCATAAGTTCTCGTGGTTTTACTTTTGATCCAAAAGGCCTCATTTTATGGAGATAGTGTTCCTTGCTTATAAACTCATGATCTTCCTCTTAATTAGCTAACGTGGGACTACTCCTTTCAATAATCCTCAACAATCCTCCTCTCGAACAAAGTATACCATAGAGCCTCCCTTGAGGCCTATAGAGCCCTCGAACAGTATCCCCTTAATCAAGACTCAACTTCTTTCTCTGGAACCCACAAACAAAGTACATTATTTGTTCAACACTTGAGATACTTTTGACTACACCTTCGAGGCTCACAATTTCTTTGTTCGATATTTAAGGATTATATTGACATGGCTAAGTTAAGGGCATGATTCTTATACCATGTTAGGAACCACAAACTCCACAATAGTATGATATTGTCTACTTTGAGCATAAGCTCTCATGGTTTTGCTTTTGGTTTCTCCAAAAGACCTCATGCCAATGGAGATAGTGTTTTTTACTTATAAACTTATGATCTTCCTCAACAGAGCTGGCTGTTGAATTGAGCAGCTTTTAACTATAGGATGATCCAATTTAGCTCTAGCCTCTGAAAGCTTTCATTTCTTTGATATGGGCTATGAAAGTACGACCGCTTAAAAGCTAAAAGAGCCTCTAATGAAATTTGGATGTTTCGTGATATGATTCTGATACCGAATAGCTGCTATTGTATGGGTATTCGTTGCTTCCTCTGTGGCAATTGATTTAGCTCATAGTACAATTGACATTTTATTTTGTGCAGGTAAAGGCTGATAAATATTCAGGAAAGTCTTGGTTACAAGGTAGTAGTTCTCCTCACACACCAACATATGATCATCACGAGGATGCTTCTAACTCATTGGTAAGCACAAAAGATACCCTTCATCTTCATGTCCATCATTTTAATGTTGGTTCATACCAATCCTGTCACGATTGCCCTGTTTGCTCAAATGAGCAAAGCTCTAGAGTATGGTTAGTAGTAGGGCCCTATTATTAATCATCTAGGACTTTGCTGCACCAAAGCATAGTCTATTTATCTACTAGATGTTAAATCACTCATCCAGCATTTGTGTAACTCAGGAGTTCAGTGACTGTGATCCAACGTCCCCAGGCAGTCCGGATGACTTTTTGTTGAAGGATGTGAATCCCTGTCTAACACCCTATTATGCAACTAAATCCAAGGTTTAACTGCCTATTTCTGCTGTTAATTTCTCACACCCCAAGAGTGTTCATTTATCAAACTAATCAACTGAACTCGTTCCCACAGGAGTTTGAGGCAATGGGATACGAAATTTCATCCCATAACAGAATTGAATCAGGTTTTGAATCTTGTTCCAGGAAGTTGTCCAAAAGTTCTGATTGCAGACAGAATTCCAACAAAGCAAACGTTACGAAAACAGCCCGGAGATCCGACGAGGCTAAATATACGTATGGAAAGACAATGCATAAATTTTACTGAACCTCAGTTCTTCCACCCAGGTTCGCATTCTTCATCCTTCTACATACCATTTTGTACTTTGTCACCTACTGTCACTGTATGAACCATTTTATATTGGATCTGATCCAACAACTGGACATATATATGCCAGGCGTTCTTCTTTGTTTATTATTGCTTGGGTGAAGAATAGTCCATTATTTTGAGCTTTAGGGCGTGTAATACCATCGTGTGCATGCTATACTTGTAAAGCTCATCAATAATAGTAAATGGTCCTATTTCTTTGAG
mRNA sequence
CGCCGAACACAGAGCTGCAGAAGCTGCCACGTGTTCACTCCAACGCAGAATCGAACCTTCGTCCCTTTTTTTTTTGAAAAAAACCAATCTCTTCGCTTCATCTTCTCCAACTCGAAGAACAAAAGCCCTAAAAATGCGCTGAATCAAACATTGCCAAAATCTTAGTTTTTCTTCGAAGATTTGTGCGTGTAATTGAATCTTTCATTGTCAAGATCTCGAGCTTTTCATCTATGGCGAAAGTGATCAAGCCCTCTTCGCGCTACTGTTCCTACGATGTGCGATCTTCTACTTCCTCCCATTTCTCCGATCCTTCTTCTTCCTCTGAGTTCAAGCTCAAGTCTCCAATGGCTGCGAAATCATCGTCTTCGCGCGCTCTTGTTCAAATCAAGGCGTCTGATTTGGCTAGGACTAAGGCGAAGCCGTCGGATCAGAACTTGACGGCGATGGTGAAGAAATTCATGGAGAAGCGGTCTGGTTCGAAACCGAAGGCGGTGAAGCAGGCGGCGGGGTTGGTGATTCCGTCGGTTTTAATTGCGGAGGATTTGAAGAAGACGGCGAGGAAGGGGACGAACTTTGGAGGGCTGCATAAGAAGCTGTTTGGGAAGGGATCGGTGGAGAAGAAGGAGAAGGAGAAGGAGGTGAAGGCGTTGACGGAGGTTAAAGGGAATACGAGGACATTGGCGATGGTGCTGAGGAGTGAAAGAGAGCTTTTGGGTTTGAATAAGGAGCAGGAGTTGGAGATCACTGAGCTCAAATTAGTTCTGGAGGAGAAGTACAGAGAGATTGAGAAGTTGAAAGATTTATGTTTGAAGCAAAGAGAAGAAATAAAGTCATTGAAGAATGCAATATTGTTCCCAGATGTTATGAACTCGCAGCTTCAAGATATTCTTGAAAAGCAGGGTTCAGAGTTGAAGCAAGCCAAACACATCATCCCCACTCTACAAAAGCAGGTCACCACTCTCACTGGCCAGCTTCATTCCCTCGCCGAGGACCTAGCCGAGGTAAAGGCTGATAAATATTCAGGAAAGTCTTGGTTACAAGGTAGTAGTTCTCCTCACACACCAACATATGATCATCACGAGGATGCTTCTAACTCATTGGAGTTCAGTGACTGTGATCCAACGTCCCCAGGCAGTCCGGATGACTTTTTGTTGAAGGATGTGAATCCCTGTCTAACACCCTATTATGCAACTAAATCCAAGGAGTTTGAGGCAATGGGATACGAAATTTCATCCCATAACAGAATTGAATCAGGTTTTGAATCTTGTTCCAGGAAGTTGTCCAAAAGTTCTGATTGCAGACAGAATTCCAACAAAGCAAACGTTACGAAAACAGCCCGGAGATCCGACGAGGCTAAATATACGTATGGAAAGACAATGCATAAATTTTACTGAACCTCAGTTCTTCCACCCAGGTTCGCATTCTTCATCCTTCTACATACCATTTTGTACTTTGTCACCTACTGTCACTGTATGAACCATTTTATATTGGATCTGATCCAACAACTGGACATATATATGCCAGGCGTTCTTCTTTGTTTATTATTGCTTGGGTGAAGAATAGTCCATTATTTTGAGCTTTAGGGCGTGTAATACCATCGTGTGCATGCTATACTTGTAAAGCTCATCAATAATAGTAAATGGTCCTATTTCTTTGAG
Coding sequence (CDS)
ATGGCGAAAGTGATCAAGCCCTCTTCGCGCTACTGTTCCTACGATGTGCGATCTTCTACTTCCTCCCATTTCTCCGATCCTTCTTCTTCCTCTGAGTTCAAGCTCAAGTCTCCAATGGCTGCGAAATCATCGTCTTCGCGCGCTCTTGTTCAAATCAAGGCGTCTGATTTGGCTAGGACTAAGGCGAAGCCGTCGGATCAGAACTTGACGGCGATGGTGAAGAAATTCATGGAGAAGCGGTCTGGTTCGAAACCGAAGGCGGTGAAGCAGGCGGCGGGGTTGGTGATTCCGTCGGTTTTAATTGCGGAGGATTTGAAGAAGACGGCGAGGAAGGGGACGAACTTTGGAGGGCTGCATAAGAAGCTGTTTGGGAAGGGATCGGTGGAGAAGAAGGAGAAGGAGAAGGAGGTGAAGGCGTTGACGGAGGTTAAAGGGAATACGAGGACATTGGCGATGGTGCTGAGGAGTGAAAGAGAGCTTTTGGGTTTGAATAAGGAGCAGGAGTTGGAGATCACTGAGCTCAAATTAGTTCTGGAGGAGAAGTACAGAGAGATTGAGAAGTTGAAAGATTTATGTTTGAAGCAAAGAGAAGAAATAAAGTCATTGAAGAATGCAATATTGTTCCCAGATGTTATGAACTCGCAGCTTCAAGATATTCTTGAAAAGCAGGGTTCAGAGTTGAAGCAAGCCAAACACATCATCCCCACTCTACAAAAGCAGGTCACCACTCTCACTGGCCAGCTTCATTCCCTCGCCGAGGACCTAGCCGAGGTAAAGGCTGATAAATATTCAGGAAAGTCTTGGTTACAAGGTAGTAGTTCTCCTCACACACCAACATATGATCATCACGAGGATGCTTCTAACTCATTGGAGTTCAGTGACTGTGATCCAACGTCCCCAGGCAGTCCGGATGACTTTTTGTTGAAGGATGTGAATCCCTGTCTAACACCCTATTATGCAACTAAATCCAAGGAGTTTGAGGCAATGGGATACGAAATTTCATCCCATAACAGAATTGAATCAGGTTTTGAATCTTGTTCCAGGAAGTTGTCCAAAAGTTCTGATTGCAGACAGAATTCCAACAAAGCAAACGTTACGAAAACAGCCCGGAGATCCGACGAGGCTAAATATACGTATGGAAAGACAATGCATAAATTTTACTGA
Protein sequence
MAKVIKPSSRYCSYDVRSSTSSHFSDPSSSSEFKLKSPMAAKSSSSRALVQIKASDLARTKAKPSDQNLTAMVKKFMEKRSGSKPKAVKQAAGLVIPSVLIAEDLKKTARKGTNFGGLHKKLFGKGSVEKKEKEKEVKALTEVKGNTRTLAMVLRSERELLGLNKEQELEITELKLVLEEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQDILEKQGSELKQAKHIIPTLQKQVTTLTGQLHSLAEDLAEVKADKYSGKSWLQGSSSPHTPTYDHHEDASNSLEFSDCDPTSPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYEISSHNRIESGFESCSRKLSKSSDCRQNSNKANVTKTARRSDEAKYTYGKTMHKFY
Homology
BLAST of Cp4.1LG20g06460 vs. NCBI nr
Match:
XP_023520379.1 (uncharacterized protein LOC111783692 [Cucurbita pepo subsp. pepo] >XP_023520380.1 uncharacterized protein LOC111783692 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 725 bits (1872), Expect = 5.46e-263
Identity = 387/387 (100.00%), Postives = 387/387 (100.00%), Query Frame = 0
Query: 1 MAKVIKPSSRYCSYDVRSSTSSHFSDPSSSSEFKLKSPMAAKSSSSRALVQIKASDLART 60
MAKVIKPSSRYCSYDVRSSTSSHFSDPSSSSEFKLKSPMAAKSSSSRALVQIKASDLART
Sbjct: 1 MAKVIKPSSRYCSYDVRSSTSSHFSDPSSSSEFKLKSPMAAKSSSSRALVQIKASDLART 60
Query: 61 KAKPSDQNLTAMVKKFMEKRSGSKPKAVKQAAGLVIPSVLIAEDLKKTARKGTNFGGLHK 120
KAKPSDQNLTAMVKKFMEKRSGSKPKAVKQAAGLVIPSVLIAEDLKKTARKGTNFGGLHK
Sbjct: 61 KAKPSDQNLTAMVKKFMEKRSGSKPKAVKQAAGLVIPSVLIAEDLKKTARKGTNFGGLHK 120
Query: 121 KLFGKGSVEKKEKEKEVKALTEVKGNTRTLAMVLRSERELLGLNKEQELEITELKLVLEE 180
KLFGKGSVEKKEKEKEVKALTEVKGNTRTLAMVLRSERELLGLNKEQELEITELKLVLEE
Sbjct: 121 KLFGKGSVEKKEKEKEVKALTEVKGNTRTLAMVLRSERELLGLNKEQELEITELKLVLEE 180
Query: 181 KYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQDILEKQGSELKQAKHIIPTLQKQ 240
KYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQDILEKQGSELKQAKHIIPTLQKQ
Sbjct: 181 KYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQDILEKQGSELKQAKHIIPTLQKQ 240
Query: 241 VTTLTGQLHSLAEDLAEVKADKYSGKSWLQGSSSPHTPTYDHHEDASNSLEFSDCDPTSP 300
VTTLTGQLHSLAEDLAEVKADKYSGKSWLQGSSSPHTPTYDHHEDASNSLEFSDCDPTSP
Sbjct: 241 VTTLTGQLHSLAEDLAEVKADKYSGKSWLQGSSSPHTPTYDHHEDASNSLEFSDCDPTSP 300
Query: 301 GSPDDFLLKDVNPCLTPYYATKSKEFEAMGYEISSHNRIESGFESCSRKLSKSSDCRQNS 360
GSPDDFLLKDVNPCLTPYYATKSKEFEAMGYEISSHNRIESGFESCSRKLSKSSDCRQNS
Sbjct: 301 GSPDDFLLKDVNPCLTPYYATKSKEFEAMGYEISSHNRIESGFESCSRKLSKSSDCRQNS 360
Query: 361 NKANVTKTARRSDEAKYTYGKTMHKFY 387
NKANVTKTARRSDEAKYTYGKTMHKFY
Sbjct: 361 NKANVTKTARRSDEAKYTYGKTMHKFY 387
BLAST of Cp4.1LG20g06460 vs. NCBI nr
Match:
XP_022927051.1 (uncharacterized protein LOC111433994 [Cucurbita moschata])
HSP 1 Score: 710 bits (1832), Expect = 6.31e-257
Identity = 381/387 (98.45%), Postives = 382/387 (98.71%), Query Frame = 0
Query: 1 MAKVIKPSSRYCSYDVRSSTSSHFSDPSSSSEFKLKSPMAAKSSSSRALVQIKASDLART 60
MAKVIKPSSRYCSYDVRSSTSSHFSDPSSSSEFKLKSPMAAKSSSSRALVQIKASDLART
Sbjct: 1 MAKVIKPSSRYCSYDVRSSTSSHFSDPSSSSEFKLKSPMAAKSSSSRALVQIKASDLART 60
Query: 61 KAKPSDQNLTAMVKKFMEKRSGSKPKAVKQAAGLVIPSVLIAEDLKKTARKGTNFGGLHK 120
KAKPSDQNLTAMVKKFMEKRSGSKPKAVKQA GLVIPS LIAEDLKKTARKGTNFGGLHK
Sbjct: 61 KAKPSDQNLTAMVKKFMEKRSGSKPKAVKQAVGLVIPSDLIAEDLKKTARKGTNFGGLHK 120
Query: 121 KLFGKGSVEKKEKEKEVKALTEVKGNTRTLAMVLRSERELLGLNKEQELEITELKLVLEE 180
KLFGKGSVEKKEKE VKALTEV+GNTRTLAMVLRSERELLGLNKEQELEITELKLVLEE
Sbjct: 121 KLFGKGSVEKKEKE--VKALTEVRGNTRTLAMVLRSERELLGLNKEQELEITELKLVLEE 180
Query: 181 KYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQDILEKQGSELKQAKHIIPTLQKQ 240
KYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQDILEKQGSELKQ KHIIPTLQKQ
Sbjct: 181 KYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQDILEKQGSELKQTKHIIPTLQKQ 240
Query: 241 VTTLTGQLHSLAEDLAEVKADKYSGKSWLQGSSSPHTPTYDHHEDASNSLEFSDCDPTSP 300
VTTLTGQLHSLAEDLAEVKADKYSGKSWLQGSSSPHTPTYDHHEDASNSLEFSDCDPTSP
Sbjct: 241 VTTLTGQLHSLAEDLAEVKADKYSGKSWLQGSSSPHTPTYDHHEDASNSLEFSDCDPTSP 300
Query: 301 GSPDDFLLKDVNPCLTPYYATKSKEFEAMGYEISSHNRIESGFESCSRKLSKSSDCRQNS 360
GSPDDFLLKDVNPCLTPYYATKSKEFEAMGYEISSHNRIESGFESCSRKLSKSSDCRQNS
Sbjct: 301 GSPDDFLLKDVNPCLTPYYATKSKEFEAMGYEISSHNRIESGFESCSRKLSKSSDCRQNS 360
Query: 361 NKANVTKTARRSDEAKYTYGKTMHKFY 387
NKANVTKTARRSDEAKYTYGKTMHKFY
Sbjct: 361 NKANVTKTARRSDEAKYTYGKTMHKFY 385
BLAST of Cp4.1LG20g06460 vs. NCBI nr
Match:
XP_023001589.1 (uncharacterized protein LOC111495672 [Cucurbita maxima] >XP_023001590.1 uncharacterized protein LOC111495672 [Cucurbita maxima])
HSP 1 Score: 704 bits (1818), Expect = 8.57e-255
Identity = 379/387 (97.93%), Postives = 381/387 (98.45%), Query Frame = 0
Query: 1 MAKVIKPSSRYCSYDVRSSTSSHFSDPSSSSEFKLKSPMAAKSSSSRALVQIKASDLART 60
MAKVIKPSSRYCSYDVRSSTSSHFSDPSSSSEFKLKSPMAAKSSSSRALVQIKASDLART
Sbjct: 1 MAKVIKPSSRYCSYDVRSSTSSHFSDPSSSSEFKLKSPMAAKSSSSRALVQIKASDLART 60
Query: 61 KAKPSDQNLTAMVKKFMEKRSGSKPKAVKQAAGLVIPSVLIAEDLKKTARKGTNFGGLHK 120
KAKPSDQNLTAMVKKFMEKRSGSKPKAVKQAAGLVIPS LIAEDLKKTARKGTNFGGLHK
Sbjct: 61 KAKPSDQNLTAMVKKFMEKRSGSKPKAVKQAAGLVIPSDLIAEDLKKTARKGTNFGGLHK 120
Query: 121 KLFGKGSVEKKEKEKEVKALTEVKGNTRTLAMVLRSERELLGLNKEQELEITELKLVLEE 180
KLFGKGSVEKKEKE VKALTEVKGNTRTLAMVLRSERELLGLNKEQELEI ELKLVLEE
Sbjct: 121 KLFGKGSVEKKEKE--VKALTEVKGNTRTLAMVLRSERELLGLNKEQELEINELKLVLEE 180
Query: 181 KYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQDILEKQGSELKQAKHIIPTLQKQ 240
KYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQDILEKQGSELKQAKHIIPTLQKQ
Sbjct: 181 KYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQDILEKQGSELKQAKHIIPTLQKQ 240
Query: 241 VTTLTGQLHSLAEDLAEVKADKYSGKSWLQGSSSPHTPTYDHHEDASNSLEFSDCDPTSP 300
VTTLTGQLHSLAEDLAEVKADKYSGKSWLQGSSSPHTPTYDHHEDASNSLEFSDCDPTSP
Sbjct: 241 VTTLTGQLHSLAEDLAEVKADKYSGKSWLQGSSSPHTPTYDHHEDASNSLEFSDCDPTSP 300
Query: 301 GSPDDFLLKDVNPCLTPYYATKSKEFEAMGYEISSHNRIESGFESCSRKLSKSSDCRQNS 360
GSPDDFLLKDVNPCLTPYYATKSKEF+AMGYEISSHNRIES FESCSRKLSKSSDCRQNS
Sbjct: 301 GSPDDFLLKDVNPCLTPYYATKSKEFKAMGYEISSHNRIESSFESCSRKLSKSSDCRQNS 360
Query: 361 NKANVTKTARRSDEAKYTYGKTMHKFY 387
+KANVTKTARRSDEAKYTYGK MHKFY
Sbjct: 361 DKANVTKTARRSDEAKYTYGKAMHKFY 385
BLAST of Cp4.1LG20g06460 vs. NCBI nr
Match:
KAG6583743.1 (Protein NAR1, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 672 bits (1733), Expect = 1.04e-233
Identity = 373/413 (90.31%), Postives = 374/413 (90.56%), Query Frame = 0
Query: 1 MAKVIKPSSRYCSYDVRSSTSSHFSDPSSSSEFKLKSPMAAKSSSSRALVQIKASDLART 60
MAKVIKPSSRYCSYDVRSSTSSHFSDPSSSSEFKLKSPMAAKSSSSRALVQIKASDLART
Sbjct: 1 MAKVIKPSSRYCSYDVRSSTSSHFSDPSSSSEFKLKSPMAAKSSSSRALVQIKASDLART 60
Query: 61 KAKPSDQNLTAMVKKFMEKRSGSKPKAVKQAAGLVIPSVLIAEDLKKTARKGTNFGGLHK 120
KAKPSDQNLTAMVKKFMEKRSGSKPKAVKQAAGLVIPS LIAEDLKKTARKGTNFGGLHK
Sbjct: 61 KAKPSDQNLTAMVKKFMEKRSGSKPKAVKQAAGLVIPSDLIAEDLKKTARKGTNFGGLHK 120
Query: 121 KLFGKGSVEKKEKEKEVKALTEVKGNTRTLAMVLRSERELLGLNKEQELEITELKLVLEE 180
KLFGKGSVEKKEKE VKALTEVKGNTRTLAMVLRSERELLGLNKEQELEITELKLVLEE
Sbjct: 121 KLFGKGSVEKKEKE--VKALTEVKGNTRTLAMVLRSERELLGLNKEQELEITELKLVLEE 180
Query: 181 KYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQDILEKQGSELKQAKHIIPTLQKQ 240
KYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQDILEKQGSELKQAKHIIPTLQKQ
Sbjct: 181 KYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQDILEKQGSELKQAKHIIPTLQKQ 240
Query: 241 VTTLTGQLHSLAEDLAEVKADKYSGKSWLQGSSSPHTPTYDHHEDASNSL---------- 300
VTTLTGQLHSLAEDLAEVKADKYSGKSWLQGSSSP TPTYDHHEDASNSL
Sbjct: 241 VTTLTGQLHSLAEDLAEVKADKYSGKSWLQGSSSPRTPTYDHHEDASNSLVSTKDTLRSS 300
Query: 301 -------------------------EFSDCDPTSPGSPDDFLLKDVNPCLTPYYATKSKE 360
EFSDCDPTSPGSPDDFLLKDVNPCLTPYYATKSKE
Sbjct: 301 YPSFNVGSCQSCHDCPVCSNEQSSREFSDCDPTSPGSPDDFLLKDVNPCLTPYYATKSKE 360
Query: 361 FEAMGYEISSHNRIESGFESCSRKLSKSSDCRQNSNKANVTKTARRSDEAKYT 378
FEAMGYEISSHNRIESGFESCSRKLSKSSDCRQNS+KANVTKTARRSDEAKYT
Sbjct: 361 FEAMGYEISSHNRIESGFESCSRKLSKSSDCRQNSDKANVTKTARRSDEAKYT 411
BLAST of Cp4.1LG20g06460 vs. NCBI nr
Match:
XP_038895034.1 (uncharacterized protein LOC120083373 [Benincasa hispida])
HSP 1 Score: 638 bits (1646), Expect = 1.58e-228
Identity = 352/391 (90.03%), Postives = 362/391 (92.58%), Query Frame = 0
Query: 1 MAKVIKPSSRYCSYDVRSSTSSHFSDPSSSSEFKLKSPMAAKSSSSRALVQIKASDLART 60
MAKV+KPSSRY SYDVRSSTSSHFSDPSSS EF LKSP+ A SSSSRALV+ K SDLAR
Sbjct: 1 MAKVMKPSSRYSSYDVRSSTSSHFSDPSSSCEFNLKSPLPANSSSSRALVKTKPSDLARA 60
Query: 61 KAKPSDQNLTAMVKKFMEKRSGSKPKAVKQAAGLVIPSVLIAEDLKKTARKGTNFGGLHK 120
KAKPSDQNLTAMVKKFMEKRSGSKPK VKQAAGLVIPS LIAEDLKKTARKGTNFGGLHK
Sbjct: 61 KAKPSDQNLTAMVKKFMEKRSGSKPKTVKQAAGLVIPSDLIAEDLKKTARKGTNFGGLHK 120
Query: 121 KLFGKGSVEKKEKEKEVKALTEVKGNTRTLAMVLRSERELLGLNKEQELEITELKLVLEE 180
KLFGKG+VEKKE KEVKALTEVKGNTRTLAMVLRSERELL LNKEQELEITELKL+LEE
Sbjct: 121 KLFGKGTVEKKE-VKEVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLILEE 180
Query: 181 KYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQDILEKQGSELKQAKHIIPTLQKQ 240
KYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQ++LEKQ SELKQAK IIPTLQKQ
Sbjct: 181 KYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQNMLEKQDSELKQAKQIIPTLQKQ 240
Query: 241 VTTLTGQLHSLAEDLAEVKADKYSGKSWLQGSSSPHTPTYDHHEDASNSLEFSDCDPTSP 300
VTTLTGQLHSLAEDLAEVKADKYSGKSWLQGS SPHTPTYD EDASNSLEFS CDPTSP
Sbjct: 241 VTTLTGQLHSLAEDLAEVKADKYSGKSWLQGSISPHTPTYDQ-EDASNSLEFSACDPTSP 300
Query: 301 GSPDDFLLKDVNPCLTPYYATKSKEFEAMGY-----EISSHNRIESGFESCSRKLSKSSD 360
GSPDDFLLKDVNPCLTPYYATKSKEFEAMGY EI SHNR+E GF+SCSRKLSKSSD
Sbjct: 301 GSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRMEFGFKSCSRKLSKSSD 360
Query: 361 CRQNSNKANVTKTARRSDEAKYTYGKTMHKF 386
CRQNS+KAN TKTARRSDEAKY YGK MHKF
Sbjct: 361 CRQNSDKANTTKTARRSDEAKYMYGKPMHKF 389
BLAST of Cp4.1LG20g06460 vs. ExPASy TrEMBL
Match:
A0A6J1EGL4 (uncharacterized protein LOC111433994 OS=Cucurbita moschata OX=3662 GN=LOC111433994 PE=4 SV=1)
HSP 1 Score: 710 bits (1832), Expect = 3.05e-257
Identity = 381/387 (98.45%), Postives = 382/387 (98.71%), Query Frame = 0
Query: 1 MAKVIKPSSRYCSYDVRSSTSSHFSDPSSSSEFKLKSPMAAKSSSSRALVQIKASDLART 60
MAKVIKPSSRYCSYDVRSSTSSHFSDPSSSSEFKLKSPMAAKSSSSRALVQIKASDLART
Sbjct: 1 MAKVIKPSSRYCSYDVRSSTSSHFSDPSSSSEFKLKSPMAAKSSSSRALVQIKASDLART 60
Query: 61 KAKPSDQNLTAMVKKFMEKRSGSKPKAVKQAAGLVIPSVLIAEDLKKTARKGTNFGGLHK 120
KAKPSDQNLTAMVKKFMEKRSGSKPKAVKQA GLVIPS LIAEDLKKTARKGTNFGGLHK
Sbjct: 61 KAKPSDQNLTAMVKKFMEKRSGSKPKAVKQAVGLVIPSDLIAEDLKKTARKGTNFGGLHK 120
Query: 121 KLFGKGSVEKKEKEKEVKALTEVKGNTRTLAMVLRSERELLGLNKEQELEITELKLVLEE 180
KLFGKGSVEKKEKE VKALTEV+GNTRTLAMVLRSERELLGLNKEQELEITELKLVLEE
Sbjct: 121 KLFGKGSVEKKEKE--VKALTEVRGNTRTLAMVLRSERELLGLNKEQELEITELKLVLEE 180
Query: 181 KYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQDILEKQGSELKQAKHIIPTLQKQ 240
KYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQDILEKQGSELKQ KHIIPTLQKQ
Sbjct: 181 KYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQDILEKQGSELKQTKHIIPTLQKQ 240
Query: 241 VTTLTGQLHSLAEDLAEVKADKYSGKSWLQGSSSPHTPTYDHHEDASNSLEFSDCDPTSP 300
VTTLTGQLHSLAEDLAEVKADKYSGKSWLQGSSSPHTPTYDHHEDASNSLEFSDCDPTSP
Sbjct: 241 VTTLTGQLHSLAEDLAEVKADKYSGKSWLQGSSSPHTPTYDHHEDASNSLEFSDCDPTSP 300
Query: 301 GSPDDFLLKDVNPCLTPYYATKSKEFEAMGYEISSHNRIESGFESCSRKLSKSSDCRQNS 360
GSPDDFLLKDVNPCLTPYYATKSKEFEAMGYEISSHNRIESGFESCSRKLSKSSDCRQNS
Sbjct: 301 GSPDDFLLKDVNPCLTPYYATKSKEFEAMGYEISSHNRIESGFESCSRKLSKSSDCRQNS 360
Query: 361 NKANVTKTARRSDEAKYTYGKTMHKFY 387
NKANVTKTARRSDEAKYTYGKTMHKFY
Sbjct: 361 NKANVTKTARRSDEAKYTYGKTMHKFY 385
BLAST of Cp4.1LG20g06460 vs. ExPASy TrEMBL
Match:
A0A6J1KN61 (uncharacterized protein LOC111495672 OS=Cucurbita maxima OX=3661 GN=LOC111495672 PE=4 SV=1)
HSP 1 Score: 704 bits (1818), Expect = 4.15e-255
Identity = 379/387 (97.93%), Postives = 381/387 (98.45%), Query Frame = 0
Query: 1 MAKVIKPSSRYCSYDVRSSTSSHFSDPSSSSEFKLKSPMAAKSSSSRALVQIKASDLART 60
MAKVIKPSSRYCSYDVRSSTSSHFSDPSSSSEFKLKSPMAAKSSSSRALVQIKASDLART
Sbjct: 1 MAKVIKPSSRYCSYDVRSSTSSHFSDPSSSSEFKLKSPMAAKSSSSRALVQIKASDLART 60
Query: 61 KAKPSDQNLTAMVKKFMEKRSGSKPKAVKQAAGLVIPSVLIAEDLKKTARKGTNFGGLHK 120
KAKPSDQNLTAMVKKFMEKRSGSKPKAVKQAAGLVIPS LIAEDLKKTARKGTNFGGLHK
Sbjct: 61 KAKPSDQNLTAMVKKFMEKRSGSKPKAVKQAAGLVIPSDLIAEDLKKTARKGTNFGGLHK 120
Query: 121 KLFGKGSVEKKEKEKEVKALTEVKGNTRTLAMVLRSERELLGLNKEQELEITELKLVLEE 180
KLFGKGSVEKKEKE VKALTEVKGNTRTLAMVLRSERELLGLNKEQELEI ELKLVLEE
Sbjct: 121 KLFGKGSVEKKEKE--VKALTEVKGNTRTLAMVLRSERELLGLNKEQELEINELKLVLEE 180
Query: 181 KYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQDILEKQGSELKQAKHIIPTLQKQ 240
KYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQDILEKQGSELKQAKHIIPTLQKQ
Sbjct: 181 KYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQDILEKQGSELKQAKHIIPTLQKQ 240
Query: 241 VTTLTGQLHSLAEDLAEVKADKYSGKSWLQGSSSPHTPTYDHHEDASNSLEFSDCDPTSP 300
VTTLTGQLHSLAEDLAEVKADKYSGKSWLQGSSSPHTPTYDHHEDASNSLEFSDCDPTSP
Sbjct: 241 VTTLTGQLHSLAEDLAEVKADKYSGKSWLQGSSSPHTPTYDHHEDASNSLEFSDCDPTSP 300
Query: 301 GSPDDFLLKDVNPCLTPYYATKSKEFEAMGYEISSHNRIESGFESCSRKLSKSSDCRQNS 360
GSPDDFLLKDVNPCLTPYYATKSKEF+AMGYEISSHNRIES FESCSRKLSKSSDCRQNS
Sbjct: 301 GSPDDFLLKDVNPCLTPYYATKSKEFKAMGYEISSHNRIESSFESCSRKLSKSSDCRQNS 360
Query: 361 NKANVTKTARRSDEAKYTYGKTMHKFY 387
+KANVTKTARRSDEAKYTYGK MHKFY
Sbjct: 361 DKANVTKTARRSDEAKYTYGKAMHKFY 385
BLAST of Cp4.1LG20g06460 vs. ExPASy TrEMBL
Match:
A0A1S3CL74 (uncharacterized protein LOC103501712 OS=Cucumis melo OX=3656 GN=LOC103501712 PE=4 SV=1)
HSP 1 Score: 634 bits (1636), Expect = 2.55e-227
Identity = 348/391 (89.00%), Postives = 363/391 (92.84%), Query Frame = 0
Query: 1 MAKVIKPSSRYCSYDVRSSTSSHFSDPSSSSEFKLKSPMAAKSSSSRALVQIKASDLART 60
MAKV+KPSSRY SYDVRSSTSSHFSDPSSSS+FK+KSP+ A SSSSRALV+ K +DLAR
Sbjct: 1 MAKVMKPSSRYSSYDVRSSTSSHFSDPSSSSDFKIKSPLPANSSSSRALVKTKPTDLARA 60
Query: 61 KAKPSDQNLTAMVKKFMEKRSGSKPKAVKQAAGLVIPSVLIAEDLKKTARKGTNFGGLHK 120
K KPSDQNLTAMVKKFMEKRSGSKPKAVK AAGLVIPS LIAEDLKKTARKGT+FGGLHK
Sbjct: 61 KMKPSDQNLTAMVKKFMEKRSGSKPKAVKHAAGLVIPSDLIAEDLKKTARKGTSFGGLHK 120
Query: 121 KLFGKGSVEKKEKEKEVKALTEVKGNTRTLAMVLRSERELLGLNKEQELEITELKLVLEE 180
KLFGKG++EKK+ KEVKALTEVKGNTRTLAMVLRSERELL LNKEQELEITELKLVLEE
Sbjct: 121 KLFGKGTMEKKD-AKEVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVLEE 180
Query: 181 KYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQDILEKQGSELKQAKHIIPTLQKQ 240
KYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQ++LEKQ SELKQAK IIPTLQKQ
Sbjct: 181 KYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQNMLEKQDSELKQAKQIIPTLQKQ 240
Query: 241 VTTLTGQLHSLAEDLAEVKADKYSGKSWLQGSSSPHTPTYDHHEDASNSLEFSDCDPTSP 300
VTTLTGQLHSLAEDLAEVKADKYSGKSWLQGS SPHTPTYDH EDASNSLEFS CDPTSP
Sbjct: 241 VTTLTGQLHSLAEDLAEVKADKYSGKSWLQGSISPHTPTYDH-EDASNSLEFSVCDPTSP 300
Query: 301 GSPDDFLLKDVNPCLTPYYATKSKEFEAMGY-----EISSHNRIESGFESCSRKLSKSSD 360
GSPDDFLLKDVNPCLTPYYATKSKEFEAMGY E S NR+ESGF+SCSRKLSKSSD
Sbjct: 301 GSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRGETVSQNRMESGFKSCSRKLSKSSD 360
Query: 361 CRQNSNKANVTKTARRSDEAKYTYGKTMHKF 386
CRQNSNKAN TKT R+SDEAKYTYGK MHKF
Sbjct: 361 CRQNSNKANTTKTGRQSDEAKYTYGKPMHKF 389
BLAST of Cp4.1LG20g06460 vs. ExPASy TrEMBL
Match:
A0A6J1EHN1 (uncharacterized protein LOC111432624 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111432624 PE=4 SV=1)
HSP 1 Score: 626 bits (1615), Expect = 4.03e-224
Identity = 347/392 (88.52%), Postives = 354/392 (90.31%), Query Frame = 0
Query: 1 MAKVIKPSSRYCSYDVRSSTSSHFSDPSSSSEFKLKSPMAAKSSSSRALVQIKASDLART 60
MAKVI PSSRY SYDVRSS SSHFSDPSSSSEFKLKSPM A SSSSRA+V+ KA+DL R
Sbjct: 1 MAKVINPSSRYSSYDVRSSGSSHFSDPSSSSEFKLKSPMKADSSSSRAIVKSKAADLPRA 60
Query: 61 KAKPSDQNLTAMVKKFMEKRSGSKPKAVKQAAGLVIPSVLIAEDLKKTARKGTNFGGLHK 120
K KPSDQNLTAMVKKFMEKRSG KPK VK A GLVIPS LIAEDLKKTARKGTNFGGLHK
Sbjct: 61 KTKPSDQNLTAMVKKFMEKRSGLKPKTVKHATGLVIPSDLIAEDLKKTARKGTNFGGLHK 120
Query: 121 KLFGKGSVEKKEKEKEVKALTEVKGNTRTLAMVLRSERELLGLNKEQELEITELKLVLEE 180
KLFGKG VEKKEKE VKALTEVKGNTRTLAMVLRSERELL LNKEQELEITELKLVLEE
Sbjct: 121 KLFGKGMVEKKEKE--VKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKLVLEE 180
Query: 181 KYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQDILEKQGSELKQAKHIIPTLQKQ 240
KY EIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQ ILEKQ SELKQAK IIPTLQKQ
Sbjct: 181 KYGEIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQGILEKQDSELKQAKQIIPTLQKQ 240
Query: 241 VTTLTGQLHSLAEDLAEVKADKYSGKSWLQGSSSPHTPTYDHHEDASNSLEFSDCDPTSP 300
VTTLTGQL+SLAEDLAEVKADKYSGK WLQGSSSPHTPTYDH EDASN LEFS CDPTSP
Sbjct: 241 VTTLTGQLYSLAEDLAEVKADKYSGKGWLQGSSSPHTPTYDH-EDASNPLEFSACDPTSP 300
Query: 301 GSPDDFLLKDVNPCLTPYYATKSKEFEAMGY-----EISSHNRIESGFESCSRKLSKSSD 360
PDD+LLKDVNPCLTPYYATKSK+FEAMGY EI SHNR+ESGF SCSRKLSKSSD
Sbjct: 301 SRPDDYLLKDVNPCLTPYYATKSKDFEAMGYDSPRDEILSHNRMESGFTSCSRKLSKSSD 360
Query: 361 CRQNSNKANVTKTARRSDEAKYTYGKTMHKFY 387
CRQNSNKA TKTARRSDEAKYTYGK MHKFY
Sbjct: 361 CRQNSNKAKTTKTARRSDEAKYTYGKPMHKFY 389
BLAST of Cp4.1LG20g06460 vs. ExPASy TrEMBL
Match:
A0A6J1CNL5 (inner centromere protein A OS=Momordica charantia OX=3673 GN=LOC111012662 PE=4 SV=1)
HSP 1 Score: 624 bits (1610), Expect = 3.01e-223
Identity = 350/397 (88.16%), Postives = 361/397 (90.93%), Query Frame = 0
Query: 1 MAKVIKPSSRYCSYDVRSSTSSHFSDPSSSSEFKLKSPMAAKSSSS--RALVQIKASDLA 60
MA VIKPSSRY SYDVRSSTSSHFSDPS+SSEFKLKSPMAA SSSS RALV+ KASDLA
Sbjct: 1 MASVIKPSSRYSSYDVRSSTSSHFSDPSTSSEFKLKSPMAANSSSSSSRALVKSKASDLA 60
Query: 61 RTKAKPSDQNLTAMVKKFMEKRSGSKPKAVKQAAGLVIPSVLIAEDLKKTARKGTNFGGL 120
R K+KPSDQNLTAMVKKFMEKRS SKPK K A GLVIPS LIAEDLKKTARKGTNFGGL
Sbjct: 61 RAKSKPSDQNLTAMVKKFMEKRSASKPKTAKHATGLVIPSDLIAEDLKKTARKGTNFGGL 120
Query: 121 HKKLFGKGS--VEKKEKEKEVKALTEVKGNTRTLAMVLRSERELLGLNKEQELEITELKL 180
HKKLFGKGS VEKKEK++EVKALTEVKGNTRTLAMVLRSERELL LNKEQELEITELKL
Sbjct: 121 HKKLFGKGSAAVEKKEKKEEVKALTEVKGNTRTLAMVLRSERELLSLNKEQELEITELKL 180
Query: 181 VLEEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQDILEKQGSELKQAKHIIPT 240
VLEEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQ++LEKQ SELKQAK IIPT
Sbjct: 181 VLEEKYREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQEMLEKQDSELKQAKQIIPT 240
Query: 241 LQKQVTTLTGQLHSLAEDLAEVKADKYSGKSWLQG-SSSPHTPTYDHHEDASNSLEFSDC 300
LQKQVT LTGQLHSLAEDLAEVKADKYSGK+WLQ SSSPHTPTYD EDASNSLEFS C
Sbjct: 241 LQKQVTXLTGQLHSLAEDLAEVKADKYSGKAWLQNNSSSPHTPTYDD-EDASNSLEFSAC 300
Query: 301 DPTSPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGY-----EISSHNRIESGFESCSRKL 360
DPTSPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGY EI SHNR ESGFESCSRKL
Sbjct: 301 DPTSPGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYDSPRDEILSHNRKESGFESCSRKL 360
Query: 361 SKSSDCRQNSNKANVTKTARRSDEAKYTYGKTMHKFY 387
S+SSDCRQ SN+ N T+TARRSDEAKY YGK MHKFY
Sbjct: 361 SRSSDCRQKSNETNTTRTARRSDEAKYMYGKPMHKFY 396
BLAST of Cp4.1LG20g06460 vs. TAIR 10
Match:
AT4G17240.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 8 growth stages; Has 1142 Blast hits to 1055 proteins in 252 species: Archae - 22; Bacteria - 318; Metazoa - 248; Fungi - 96; Plants - 59; Viruses - 3; Other Eukaryotes - 396 (source: NCBI BLink). )
HSP 1 Score: 251.1 bits (640), Expect = 1.4e-66
Identity = 194/388 (50.00%), Postives = 238/388 (61.34%), Query Frame = 0
Query: 8 SSRYCSYDVRSS-TSSHFSDPSSSSEFKLKSPMAAKSSSSRALVQIKASDLART----KA 67
+SRY SYD RSS TSS SD SSS+EFK P+ SS+A+V+ K+S L +T K
Sbjct: 2 ASRYNSYDSRSSVTSSIHSDLSSSAEFKSNKPI-----SSKAIVRSKSSYLTKTTKPIKP 61
Query: 68 KPSDQNLTAMVKKFME-KRSGSKPKAVKQAAGLVIPSVLIAEDLKKTARKGTNFGGLHKK 127
+ NLT M+KK ME K+S SK K V+ LVIP L D K K T G L +K
Sbjct: 62 DSNPGNLTNMMKKLMEMKKSNSKSKRVE----LVIPEELKKIDTGKGGGKST-LGTLQRK 121
Query: 128 LFGKGSVEKKEKEKEVKALTEVKGNTRTLAMVLRSERELLGLNKEQELEITELKLVLEEK 187
LFGK ++VKALTEVK NTRTL+MVLRSERELLG+NK+QE+EI ELK LEEK
Sbjct: 122 LFGK---------EKVKALTEVKSNTRTLSMVLRSERELLGMNKDQEVEIAELKFQLEEK 181
Query: 188 YREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQDILEKQGSELKQAKHIIPTLQKQV 247
RE+EKLKDLCLKQREEIKSLK+A+LFPD MNSQ+ Q EL QA+ IIP LQKQV
Sbjct: 182 NREVEKLKDLCLKQREEIKSLKSAVLFPDSMNSQI-----NQMQELNQAREIIPNLQKQV 241
Query: 248 TTLTGQLHSLAEDLAEVKADKYSGKS--WLQGSSSPHTPTYDHHEDASNSLEFSDCDPTS 307
+L GQL +A+DLAEVKA+KY +S W T +YD SLEFS
Sbjct: 242 ISLNGQLQCIAQDLAEVKANKYLSESCYW-----QAQTSSYD-------SLEFSS----- 301
Query: 308 PGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYEISSHNRIESGFESCSRKLS-------- 367
GSPD L+D+NPCLTPY K KE+E R++S ES S + +
Sbjct: 302 -GSPDGLALEDLNPCLTPYTKKKPKEYE----------RVDSAEESLSGRSTITTTGGKV 337
Query: 368 KSSDCRQNSNKANVTKTARRSDEAKYTY 380
KSS ++++ K +RS+E+K Y
Sbjct: 362 KSSSRSVKMSRSSEGKAGQRSEESKGWY 337
BLAST of Cp4.1LG20g06460 vs. TAIR 10
Match:
AT4G17240.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 8 growth stages. )
HSP 1 Score: 198.4 bits (503), Expect = 1.1e-50
Identity = 172/388 (44.33%), Postives = 219/388 (56.44%), Query Frame = 0
Query: 8 SSRYCSYDVRSS-TSSHFSDPSSSSEFKLKSPMAAKSSSSRALVQIKASDLART----KA 67
+SRY SYD RSS TSS SD SSS+EFK P+ SS+A+V+ K+S L +T K
Sbjct: 2 ASRYNSYDSRSSVTSSIHSDLSSSAEFKSNKPI-----SSKAIVRSKSSYLTKTTKPIKP 61
Query: 68 KPSDQNLTAMVKKFME-KRSGSKPKAVKQAAGLVIPSVLIAEDLKKTARKGTNFGGLHKK 127
+ NLT M+KK ME K+S SK K V+ LVIP L D K K T G L +K
Sbjct: 62 DSNPGNLTNMMKKLMEMKKSNSKSKRVE----LVIPEELKKIDTGKGGGKST-LGTLQRK 121
Query: 128 LFGKGSVEKKEKEKEVKALTEVKGNTRTLAMVLRSERELLGLNKEQELEITELKLVLEEK 187
LFGK ++VKALTEVK NTRTL+M+ E+ ++K+ L
Sbjct: 122 LFGK---------EKVKALTEVKSNTRTLSMI-----------HERLAVCNQIKVFL--- 181
Query: 188 YREIEKLKDLCLKQREEIKSLKNAILFPDVMNSQLQDILEKQGSELKQAKHIIPTLQKQV 247
++EKLKDLCLKQREEIKSLK+A+LFPD MNSQ+ Q EL QA+ IIP LQKQV
Sbjct: 182 --QVEKLKDLCLKQREEIKSLKSAVLFPDSMNSQI-----NQMQELNQAREIIPNLQKQV 241
Query: 248 TTLTGQLHSLAEDLAEVKADKYSGKS--WLQGSSSPHTPTYDHHEDASNSLEFSDCDPTS 307
+L GQL +A+DLAEVKA+KY +S W T +YD SLEFS
Sbjct: 242 ISLNGQLQCIAQDLAEVKANKYLSESCYW-----QAQTSSYD-------SLEFSS----- 301
Query: 308 PGSPDDFLLKDVNPCLTPYYATKSKEFEAMGYEISSHNRIESGFESCSRKLS-------- 367
GSPD L+D+NPCLTPY K KE+E R++S ES S + +
Sbjct: 302 -GSPDGLALEDLNPCLTPYTKKKPKEYE----------RVDSAEESLSGRSTITTTGGKV 321
Query: 368 KSSDCRQNSNKANVTKTARRSDEAKYTY 380
KSS ++++ K +RS+E+K Y
Sbjct: 362 KSSSRSVKMSRSSEGKAGQRSEESKGWY 321
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_023520379.1 | 5.46e-263 | 100.00 | uncharacterized protein LOC111783692 [Cucurbita pepo subsp. pepo] >XP_023520380.... | [more] |
XP_022927051.1 | 6.31e-257 | 98.45 | uncharacterized protein LOC111433994 [Cucurbita moschata] | [more] |
XP_023001589.1 | 8.57e-255 | 97.93 | uncharacterized protein LOC111495672 [Cucurbita maxima] >XP_023001590.1 uncharac... | [more] |
KAG6583743.1 | 1.04e-233 | 90.31 | Protein NAR1, partial [Cucurbita argyrosperma subsp. sororia] | [more] |
XP_038895034.1 | 1.58e-228 | 90.03 | uncharacterized protein LOC120083373 [Benincasa hispida] | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1EGL4 | 3.05e-257 | 98.45 | uncharacterized protein LOC111433994 OS=Cucurbita moschata OX=3662 GN=LOC1114339... | [more] |
A0A6J1KN61 | 4.15e-255 | 97.93 | uncharacterized protein LOC111495672 OS=Cucurbita maxima OX=3661 GN=LOC111495672... | [more] |
A0A1S3CL74 | 2.55e-227 | 89.00 | uncharacterized protein LOC103501712 OS=Cucumis melo OX=3656 GN=LOC103501712 PE=... | [more] |
A0A6J1EHN1 | 4.03e-224 | 88.52 | uncharacterized protein LOC111432624 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1CNL5 | 3.01e-223 | 88.16 | inner centromere protein A OS=Momordica charantia OX=3673 GN=LOC111012662 PE=4 S... | [more] |
Match Name | E-value | Identity | Description | |
AT4G17240.1 | 1.4e-66 | 50.00 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |
AT4G17240.2 | 1.1e-50 | 44.33 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |