CmoCh16G005940 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh16G005940
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: chromatin modification-related protein EAF1 isoform X2
Location: Cmo_Chr16: 2885224 .. 2889487 (-)
RNA-Seq Expression: CmoCh16G005940
Synteny: CmoCh16G005940
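The Location field packs chromosome, start, end, and strand into one string. A minimal sketch of splitting it apart follows; the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Split a 'CHROM: START .. END (STRAND)' string into its four parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Cmo_Chr16: 2885224 .. 2889487 (-)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, strand, span)
```

The "(-)" strand means the sequences listed in this record read from the reverse strand of the chromosome.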
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCTTGTGGGAGGCGTAGTAGATTATTGTACCTTCGGGTATGCCTTTTTATTTGATTTAAGGTTGAGTGCCAAGGGAGATTTCTTGGTTCTGTTGAGATTGTAGTGTTGAAGGTCTGTATTCGGATGAGTTTTCTTCCGGTGTTTCTTCGGTTTTTAATGAGACCTTGGAAGCATACTGTAACTGTTGACAACTTTTACTTCTGCGACTGATGTGAGGACATGGATGGGCTCTAAGAGTTTTCTGAGGCCATCTGTTCAGTTTACGCAATAGAATTCTATTTCTTAATTCTTTGAGGATACTGTGTCAACGAACTGTATTATTTCGTTCGCTTTAAACCTATCTATTTTTCTGAGCCACCTTGCTTGCCAATTGTTTCTGCTCTCGAGGATTTGCACCTATTGATCTACTTGCGTTTTTTTTTTCCAAACTGTTACTTTTCTTTTAGTTCCGCACTAACCTTTTATTTTGAATCTCATGTTAGATGGCGGCTAAACCACTTACTACTGAGGCTATTGCCGTAACTGAGAAGAAGATGGACATGGCTTTAGGTTGGGCTTCGGAATATCTTTCAGTACTTGAGAATTTATGGGCCTCTTGAAAATTTATTGATTATTCTGTTTCTACATTTTTTTTGTTGTTGAATTAGACGATATTATCAAAATGTCCAAAAATACTGCAAATAAAGCCAGGAAGGAAAGAAGGTTTCCGGTAAGAATCTTTTATTTTTAGCTTGCCTTTAACCTGAGCAGTTTATTGTTGAAAGTGGAAGTCTTCCTTTTATTATATGTGTAGTGATTTATCTTTCCATTTGCAGAACAAAATGCAGAAATTTCCAAATAATGCTACTCAAGATAGACCTAGGAAGTTGCAGCGTTTCATGGATTCAAGATCTTCTGTAAGACAGGTTTGTACAGTTCTTTAGCTTGAATATCATTTATTTTGCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGTATGTTGAGTTCCCATTTTTCATTTTGTATTTAGGGGGCTTTGGCCAAAAGAAGGTCAAATTTTCAAGGGAATCAGTTTGCTTTGACAACTGAGGTTGCAAGAAGGGCTGTAGTTGCTCCAACTCGCCCTAGAGCTTTTTATCGTAGGGCACCCAACTGGAATAAGACAAGGTATATCGAACTATGCATTTTTAGCCTACGGAAAGCTCTCATTGTGAATTATGATTATATGTTTTACTTCAGCTACCCTGGCAACATACCGAAACTATATGATAGTTTTAAAGAAGAAATTTATTACAGAGGCAATTTTAATTTGATGGCTGGTTTGTTTACTTTGTTATGTGATGCTATGACAGGTAAATCACGTTTATACACAATTGATGAT
TTCTACAGTGTCCATCAAGTTTTAAATATGAAGTTAGAAACATAATATGTTGGTTGGATAAGCTTGGATGATAAGTTTCAGTTGAATTGGAGTGGGAGTATTTCCAATTATTTCGTTTCTTCCATTCTACCTGGGAATTGATTCTGCTTTATCTTCTTAAGTTTGGTTCTGGAATGCATTGAGTTATCTTTTGGTTGTCTGACAAGTAAGTGCTTGGTGGGAGTTTCAGTTGCTGGGACAAAGATAATTTGAACTTTTGATTGAATTTGATTGCTGCGACCATCTGGAAAAATGTGGTTTGGTGAAAAGGGCTTTTGTGTTCATTGGATGAGTGTGTATAGTTTTGAGTTTTAGCTCCTTATTGAATGTGCTTATTGTCTTGGGTTTCTCAAGTCTTCTTAGACAATCTACAATATTATAGCAGCCTCGTCCAGATTCCCTTTTGTTGCATTGATGTGGAAGCTGTTTGCAAGATGTGATGTTTTTAGAGCAGGAGTTGACTGAGATTGAGGCACATTCCCTTTAATGATTATTCCACAATTGTCTCTTGTTCATCGTGTTCAAAGTTTGCTGTTGTGAACTTTCATTCCTGGTGGATGACAGTCGATAAAACTTTGTTTGCTTGCAATAAGGGATGCAAAGAACCCTAGGGGTTGTTGTGATGTCAGAATTTGCACATCTTTCTATGTTAGTGATCCGTCTGCTAGTTTTGTTGATTCCACTTGGTGTTCTCTTTTCGTTTGCTGTTACATCCTGGGTACATTTGTCAGCCTTCCAACTACCATGTCTTGACGAGTCTCTCAGTAATTCTACTTATTGACTAACAGTAAAACTGCAAATGAGGAAACCCTTGACTATGGAAAGTTTTCCACTTGTGAAAAAATGCTTCTTTATCCCTGTAGTCAGCCTGACCTGAGGAATATGACATCTCATCATCTTTAGTAGATACTTGGGGATGGAATCAAAATTGTTGCAAGTAAATCAAGAACTCTGTACTTCTGGTTATTTGTCGTTGCATAATTCTATGAATAACAAAAGTGCTGAAGAAATTAAGTACGGTTAGTTAAAATGTGTTGAACAATGGCTTTATGGCTTCTTCTATGAGGTATTATGTGTAGACTTCGGGTTTATCCACGTGGAATACACTAGTACTGCAGAAATTCATGCTACTGCCGCTTTTAAATCATATCCTGGTTGAAGCCTTCATTTTCTGGTTCAAGTTCCGGACAGGAGTCTTCCTGCGGAGCAGTTTATCTGTTCATTGATAATACTAGTTTGGAGCTTTCATTGTCCATTAAAGCAAGCCAAGCGTGCGATGATTGATTTGAGGTTCTGGTTTTGCTATTATAACTGGACAGTGCTTTTTTTGGATTGGCAAGCGCAACACATTTTGAGCTGATTCTCTTTTTTTGATGACTGCATTCTCATGATCTGGTTTGTTTTTTTATTTCTGTTAATGCATTTTAATTTGCTCATCGAGGTAAGATTCTCTGATCTATCTGCATCTTGCCGCTCAAGGAGTGTTGAAAATATCTGGCGTTTGTGGAAGTTTAATTTGGATAGTAATTTTTTTACCTAGCGGGTCTACTTATAAATGTTATTTCTTTTGTTCATCAATCATTTCAATAATATTTGGTGCAGGGTTGATGCTCCACCGGTTCCAAGAAAGCCTTTTAATAATCGAACCTTTGTTCCCAAGGTACTGTGTTTGTAACTGAGTTTGGTCTGGAGTAGTGAAGTTACTCCATCAAATGAGATGATGAATGCAAGTTTGGATCGCAGTAGTTTAATCTCATTTATTCCCCCTTCTTCCCTTCTCCAGGTAGCTGCACCGGCCCAGCCACAAACCAATGCTACGCAGAGACAGAGACCACAAACAAATGCCACGCAGAGACAGAGACCGCAAACTCTCGACTCACTGTTTGCCAACATGAAGGAACAGAGGCTGAGGGTGTTGTCACAGCGACAGAATGGCGCCGCACAACAACGGAATGGTGGTC
GCCAGCAAATACCTCCATGGCAAAGAGGCCGTTTTGGTAACTGAAGAACACGCCCACATGCCAACATCGTGTAATAAATGGAGTAATTGTATGGGAAATGTAGATGTTGCTTGCTCGTCCTCGGCCCATAGCTGATACCATCAAAAGAAAGGAATCTAGTTGTGTCTTGTCATTTTTTTTTACCCAAACCTTTGGCTTGCTGTTTTTAGCGTTTCTAGCGTTTTTAGCATTTTTAGCATTTTTAGCATTTTTAGCATTTTTAGC

mRNA sequence

ATGTCTTGTGGGAGGCGTAGTAGATTATTGTACCTTCGGATGGCGGCTAAACCACTTACTACTGAGGCTATTGCCGTAACTGAGAAGAAGATGGACATGGCTTTAGACGATATTATCAAAATGTCCAAAAATACTGCAAATAAAGCCAGGAAGGAAAGAAGGTTTCCGAACAAAATGCAGAAATTTCCAAATAATGCTACTCAAGATAGACCTAGGAAGTTGCAGCGTTTCATGGATTCAAGATCTTCTGTAAGACAGGGGGCTTTGGCCAAAAGAAGGTCAAATTTTCAAGGGAATCAGTTTGCTTTGACAACTGAGGTTGCAAGAAGGGCTGTAGTTGCTCCAACTCGCCCTAGAGCTTTTTATCGTAGGGCACCCAACTGGAATAAGACAAGGGTTGATGCTCCACCGGTTCCAAGAAAGCCTTTTAATAATCGAACCTTTGTTCCCAAGGTAGCTGCACCGGCCCAGCCACAAACCAATGCTACGCAGAGACAGAGACCACAAACAAATGCCACGCAGAGACAGAGACCGCAAACTCTCGACTCACTGTTTGCCAACATGAAGGAACAGAGGCTGAGGGTGTTGTCACAGCGACAGAATGGCGCCGCACAACAACGGAATGGTGGTCGCCAGCAAATACCTCCATGGCAAAGAGGCCGTTTTGGTAACTGAAGAACACGCCCACATGCCAACATCGTGTAATAAATGGAGTAATTGTATGGGAAATGTAGATGTTGCTTGCTCGTCCTCGGCCCATAGCTGATACCATCAAAAGAAAGGAATCTAGTTGTGTCTTGTCATTTTTTTTTACCCAAACCTTTGGCTTGCTGTTTTTAGCGTTTCTAGCGTTTTTAGCATTTTTAGCATTTTTAGCATTTTTAGCATTTTTAGC

Coding sequence (CDS)

ATGTCTTGTGGGAGGCGTAGTAGATTATTGTACCTTCGGATGGCGGCTAAACCACTTACTACTGAGGCTATTGCCGTAACTGAGAAGAAGATGGACATGGCTTTAGACGATATTATCAAAATGTCCAAAAATACTGCAAATAAAGCCAGGAAGGAAAGAAGGTTTCCGAACAAAATGCAGAAATTTCCAAATAATGCTACTCAAGATAGACCTAGGAAGTTGCAGCGTTTCATGGATTCAAGATCTTCTGTAAGACAGGGGGCTTTGGCCAAAAGAAGGTCAAATTTTCAAGGGAATCAGTTTGCTTTGACAACTGAGGTTGCAAGAAGGGCTGTAGTTGCTCCAACTCGCCCTAGAGCTTTTTATCGTAGGGCACCCAACTGGAATAAGACAAGGGTTGATGCTCCACCGGTTCCAAGAAAGCCTTTTAATAATCGAACCTTTGTTCCCAAGGTAGCTGCACCGGCCCAGCCACAAACCAATGCTACGCAGAGACAGAGACCACAAACAAATGCCACGCAGAGACAGAGACCGCAAACTCTCGACTCACTGTTTGCCAACATGAAGGAACAGAGGCTGAGGGTGTTGTCACAGCGACAGAATGGCGCCGCACAACAACGGAATGGTGGTCGCCAGCAAATACCTCCATGGCAAAGAGGCCGTTTTGGTAACTGA

Protein sequence

MSCGRRSRLLYLRMAAKPLTTEAIAVTEKKMDMALDDIIKMSKNTANKARKERRFPNKMQKFPNNATQDRPRKLQRFMDSRSSVRQGALAKRRSNFQGNQFALTTEVARRAVVAPTRPRAFYRRAPNWNKTRVDAPPVPRKPFNNRTFVPKVAAPAQPQTNATQRQRPQTNATQRQRPQTLDSLFANMKEQRLRVLSQRQNGAAQQRNGGRQQIPPWQRGRFGN
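
The protein sequence above corresponds to a codon-by-codon translation of the coding sequence. The sketch below illustrates this on the first 42 and last 15 nucleotides of the CDS, assuming the standard genetic code (NCBI translation table 1); the helper name `translate` is hypothetical and is not part of any tool used to build this record.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

# First 42 nt and last 15 nt of the CDS listed in this record.
cds_start = "ATGTCTTGTGGGAGGCGTAGTAGATTATTGTACCTTCGGATG"
cds_end = "CGTTTTGGTAACTGA"

print(translate(cds_start))  # MSCGRRSRLLYLRM
print(translate(cds_end))    # RFGN
```

The two fragments reproduce the first 14 and last 4 residues of the protein sequence shown above; the final TGA codon is the stop and contributes no residue.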
Homology
BLAST of CmoCh16G005940 vs. ExPASy TrEMBL
Match: A0A6J1ETG7 (uncharacterized protein LOC111437568 OS=Cucurbita moschata OX=3662 GN=LOC111437568 PE=4 SV=1)

HSP 1 Score: 401.4 bits (1030), Expect = 2.5e-108
Identity = 211/211 (100.00%), Positives = 211/211 (100.00%), Query Frame = 0

Query: 14  MAAKPLTTEAIAVTEKKMDMALDDIIKMSKNTANKARKERRFPNKMQKFPNNATQDRPRK 73
           MAAKPLTTEAIAVTEKKMDMALDDIIKMSKNTANKARKERRFPNKMQKFPNNATQDRPRK
Sbjct: 1   MAAKPLTTEAIAVTEKKMDMALDDIIKMSKNTANKARKERRFPNKMQKFPNNATQDRPRK 60

Query: 74  LQRFMDSRSSVRQGALAKRRSNFQGNQFALTTEVARRAVVAPTRPRAFYRRAPNWNKTRV 133
           LQRFMDSRSSVRQGALAKRRSNFQGNQFALTTEVARRAVVAPTRPRAFYRRAPNWNKTRV
Sbjct: 61  LQRFMDSRSSVRQGALAKRRSNFQGNQFALTTEVARRAVVAPTRPRAFYRRAPNWNKTRV 120

Query: 134 DAPPVPRKPFNNRTFVPKVAAPAQPQTNATQRQRPQTNATQRQRPQTLDSLFANMKEQRL 193
           DAPPVPRKPFNNRTFVPKVAAPAQPQTNATQRQRPQTNATQRQRPQTLDSLFANMKEQRL
Sbjct: 121 DAPPVPRKPFNNRTFVPKVAAPAQPQTNATQRQRPQTNATQRQRPQTLDSLFANMKEQRL 180

Query: 194 RVLSQRQNGAAQQRNGGRQQIPPWQRGRFGN 225
           RVLSQRQNGAAQQRNGGRQQIPPWQRGRFGN
Sbjct: 181 RVLSQRQNGAAQQRNGGRQQIPPWQRGRFGN 211

BLAST of CmoCh16G005940 vs. ExPASy TrEMBL
Match: A0A6J1JB99 (uncharacterized protein LOC111483428 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111483428 PE=4 SV=1)

HSP 1 Score: 393.7 bits (1010), Expect = 5.3e-106
Identity = 207/211 (98.10%), Positives = 208/211 (98.58%), Query Frame = 0

Query: 14  MAAKPLTTEAIAVTEKKMDMALDDIIKMSKNTANKARKERRFPNKMQKFPNNATQDRPRK 73
           MAAKPLTTEAIAVTEKKMDMALDDIIKMSK T NKARKERRFPNKMQKFPNN TQDRPRK
Sbjct: 1   MAAKPLTTEAIAVTEKKMDMALDDIIKMSKITGNKARKERRFPNKMQKFPNNVTQDRPRK 60

Query: 74  LQRFMDSRSSVRQGALAKRRSNFQGNQFALTTEVARRAVVAPTRPRAFYRRAPNWNKTRV 133
           LQRFMDSRSSVRQGALAKRRSNFQGNQFALTTEV+RRAVVAPTRPRAFYRRAPNWNKTRV
Sbjct: 61  LQRFMDSRSSVRQGALAKRRSNFQGNQFALTTEVSRRAVVAPTRPRAFYRRAPNWNKTRV 120

Query: 134 DAPPVPRKPFNNRTFVPKVAAPAQPQTNATQRQRPQTNATQRQRPQTLDSLFANMKEQRL 193
           DAPPVPRKPFNNRTFVPKVAAPAQPQTNATQRQRPQTNATQRQRPQTLDSLFANMKEQRL
Sbjct: 121 DAPPVPRKPFNNRTFVPKVAAPAQPQTNATQRQRPQTNATQRQRPQTLDSLFANMKEQRL 180

Query: 194 RVLSQRQNGAAQQRNGGRQQIPPWQRGRFGN 225
           RVLSQRQNGAAQQRNGGRQQIPPWQRGRFGN
Sbjct: 181 RVLSQRQNGAAQQRNGGRQQIPPWQRGRFGN 211
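
The Identity figure in each HSP is the number of aligned positions where query and subject carry the same residue, divided by the alignment length. A minimal sketch, using the first 60-column block of the A0A6J1JB99 alignment above; the function name `count_identities` is hypothetical, and the sketch ignores gap handling beyond skipping "-" columns.

```python
def count_identities(query, sbjct):
    """Count aligned columns where both sequences carry the same residue."""
    if len(query) != len(sbjct):
        raise ValueError("aligned sequences must have equal length")
    return sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")

# First alignment block (query positions 14..73) of the A0A6J1JB99 match.
query = "MAAKPLTTEAIAVTEKKMDMALDDIIKMSKNTANKARKERRFPNKMQKFPNNATQDRPRK"
sbjct = "MAAKPLTTEAIAVTEKKMDMALDDIIKMSKITGNKARKERRFPNKMQKFPNNVTQDRPRK"

block_identities = count_identities(query, sbjct)
print(block_identities, "of", len(query))  # 57 of 60

# Whole-HSP figure as reported: 207 identical positions over 211 columns.
percent_identity = f"{100 * 207 / 211:.2f}"
print(percent_identity)  # 98.10
```

The three mismatches in this block line up with the three blank columns in the midline of the alignment.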

BLAST of CmoCh16G005940 vs. ExPASy TrEMBL
Match: A0A6J1JD83 (uncharacterized protein LOC111483428 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111483428 PE=4 SV=1)

HSP 1 Score: 385.6 bits (989), Expect = 1.4e-103
Identity = 207/221 (93.67%), Positives = 208/221 (94.12%), Query Frame = 0

Query: 14  MAAKPLTTEAIAVTEKKMDMALDDIIKMSKNTANKARKERRFPNKMQKFPNNATQDRPRK 73
           MAAKPLTTEAIAVTEKKMDMALDDIIKMSK T NKARKERRFPNKMQKFPNN TQDRPRK
Sbjct: 1   MAAKPLTTEAIAVTEKKMDMALDDIIKMSKITGNKARKERRFPNKMQKFPNNVTQDRPRK 60

Query: 74  LQRFMDSRSSVRQGALAKRRSNFQGNQFALTTEVARRAVVAPTRPRAFYRRAPNWNKTRV 133
           LQRFMDSRSSVRQGALAKRRSNFQGNQFALTTEV+RRAVVAPTRPRAFYRRAPNWNKTRV
Sbjct: 61  LQRFMDSRSSVRQGALAKRRSNFQGNQFALTTEVSRRAVVAPTRPRAFYRRAPNWNKTRV 120

Query: 134 DAPPVPRKPFNNRTFVPKVAAPAQPQTNATQRQRPQTN----------ATQRQRPQTLDS 193
           DAPPVPRKPFNNRTFVPKVAAPAQPQTNATQRQRPQTN          ATQRQRPQTLDS
Sbjct: 121 DAPPVPRKPFNNRTFVPKVAAPAQPQTNATQRQRPQTNATQRQRPQTYATQRQRPQTLDS 180

Query: 194 LFANMKEQRLRVLSQRQNGAAQQRNGGRQQIPPWQRGRFGN 225
           LFANMKEQRLRVLSQRQNGAAQQRNGGRQQIPPWQRGRFGN
Sbjct: 181 LFANMKEQRLRVLSQRQNGAAQQRNGGRQQIPPWQRGRFGN 221

BLAST of CmoCh16G005940 vs. ExPASy TrEMBL
Match: A0A6J1FPB6 (uncharacterized protein LOC111447019 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111447019 PE=4 SV=1)

HSP 1 Score: 323.9 bits (829), Expect = 5.1e-85
Identity = 173/211 (81.99%), Positives = 181/211 (85.78%), Query Frame = 0

Query: 14  MAAKPLTTEAIAVTEKKMDMALDDIIKMSKNTANKARKERRFPNKMQKFPNNATQDRPRK 73
           MAAKPLTTEAIA+TEKKMDMALDDIIKMSKNT NK RK+RRFPNKMQKFPNNATQDRPRK
Sbjct: 1   MAAKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRFPNKMQKFPNNATQDRPRK 60

Query: 74  LQRFMDSRSSVRQGALAKRRSNFQGNQFALTTEVARRAVVAPTRPRAFYRRAPNWNKTRV 133
           LQRFMD+R+S+RQGALAKRRSNFQGNQFAL TEVAR A VAP RPRAF RR PNW KTRV
Sbjct: 61  LQRFMDARTSLRQGALAKRRSNFQGNQFALATEVARTAAVAPIRPRAFNRRVPNWKKTRV 120

Query: 134 DAPPVPRKPFNNRTFVPKVAAPAQPQTNATQRQRPQTNATQRQRPQTLDSLFANMKEQRL 193
           +APPV RKPFNN TF+PK+ AP Q           QTNAT RQRPQTLDSLFANMKEQRL
Sbjct: 121 EAPPVQRKPFNNGTFIPKITAPVQ----------TQTNATPRQRPQTLDSLFANMKEQRL 180

Query: 194 RVLSQRQNGAAQQRNGGRQQIPPWQRGRFGN 225
           RVLSQRQNG AQQRNG RQQ PPW RGR GN
Sbjct: 181 RVLSQRQNGGAQQRNGARQQRPPWGRGRIGN 201

BLAST of CmoCh16G005940 vs. ExPASy TrEMBL
Match: A0A6J1J5N5 (uncharacterized protein LOC111481570 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111481570 PE=4 SV=1)

HSP 1 Score: 319.3 bits (817), Expect = 1.3e-83
Identity = 170/211 (80.57%), Positives = 180/211 (85.31%), Query Frame = 0

Query: 14  MAAKPLTTEAIAVTEKKMDMALDDIIKMSKNTANKARKERRFPNKMQKFPNNATQDRPRK 73
           MA KPLTTEAIA+TEKKMDMALDDIIKMSKNT NK RK+RRFPNKMQKFPNNATQDRPRK
Sbjct: 1   MATKPLTTEAIAITEKKMDMALDDIIKMSKNTGNKGRKQRRFPNKMQKFPNNATQDRPRK 60

Query: 74  LQRFMDSRSSVRQGALAKRRSNFQGNQFALTTEVARRAVVAPTRPRAFYRRAPNWNKTRV 133
           LQRFMD+R+S+RQGA AKRRSNFQGNQFAL TEVAR+A VAP RPRAF R  PNW KTRV
Sbjct: 61  LQRFMDARTSLRQGAFAKRRSNFQGNQFALATEVARKAAVAPIRPRAFNRWVPNWKKTRV 120

Query: 134 DAPPVPRKPFNNRTFVPKVAAPAQPQTNATQRQRPQTNATQRQRPQTLDSLFANMKEQRL 193
           +APPV RKPFNN TF+PK+AAP Q           QTNAT RQ+PQTLDSLFANMKEQRL
Sbjct: 121 EAPPVQRKPFNNGTFIPKIAAPVQ----------TQTNATPRQKPQTLDSLFANMKEQRL 180

Query: 194 RVLSQRQNGAAQQRNGGRQQIPPWQRGRFGN 225
           RVLSQRQNG AQQRNG RQQ PPW RGR GN
Sbjct: 181 RVLSQRQNGGAQQRNGARQQRPPWGRGRIGN 201

BLAST of CmoCh16G005940 vs. TAIR 10
Match: AT4G10970.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G23910.2); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 134.0 bits (336), Expect = 1.5e-31
Identity = 98/210 (46.67%), Positives = 129/210 (61.43%), Query Frame = 0

Query: 17  KPLTTEAIAVTEKKMDMALDDIIKMSKNTANKAR-KERRFPNKMQKFPNNATQDRPRKLQ 76
           KP+TTE +A+TEKKMDM+LD+IIKM K+  N  + K++R  NK +KF + A ++   K Q
Sbjct: 5   KPITTETVALTEKKMDMSLDEIIKMEKSNTNVNKGKKQRVLNKKEKF-SGAAKNSAVKAQ 64

Query: 77  RFMDSRSSVRQGALAKRRSNFQGNQFALTTEVARRAVVAPTRPRAFY-RRAPNWNKTRVD 136
           R+MDSRS VRQGA AK+RSNFQGNQF +TT VAR+A  A  R R +   R  N N++R  
Sbjct: 65  RYMDSRSDVRQGAFAKKRSNFQGNQFPVTTTVARKAASATPRGRPYNGGRMTNTNQSRFI 124

Query: 137 APPVPRKPFNNRTFVPKVAAPAQPQTNATQRQRPQTNATQRQRPQTLDSLFANMKEQRLR 196
           APP   +  + R FV K     Q +    Q+Q       QRQ PQTLDS FANMKE+R+R
Sbjct: 125 APPAQNRA-SQRGFVGK--QQQQQREKIVQQQANGGGGGQRQWPQTLDSRFANMKEERMR 184

Query: 197 VLSQRQNGAAQQRNGG-----RQQIPPWQR 220
           +     N +    NG      ++ + PW R
Sbjct: 185 MRRFADNRSNVGNNGAGSHQQQRSMVPWVR 210

BLAST of CmoCh16G005940 vs. TAIR 10
Match: AT4G10970.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G23910.2); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 134.0 bits (336), Expect = 1.5e-31
Identity = 98/210 (46.67%), Positives = 129/210 (61.43%), Query Frame = 0

Query: 17  KPLTTEAIAVTEKKMDMALDDIIKMSKNTANKAR-KERRFPNKMQKFPNNATQDRPRKLQ 76
           KP+TTE +A+TEKKMDM+LD+IIKM K+  N  + K++R  NK +KF + A ++   K Q
Sbjct: 5   KPITTETVALTEKKMDMSLDEIIKMEKSNTNVNKGKKQRVLNKKEKF-SGAAKNSAVKAQ 64

Query: 77  RFMDSRSSVRQGALAKRRSNFQGNQFALTTEVARRAVVAPTRPRAFY-RRAPNWNKTRVD 136
           R+MDSRS VRQGA AK+RSNFQGNQF +TT VAR+A  A  R R +   R  N N++R  
Sbjct: 65  RYMDSRSDVRQGAFAKKRSNFQGNQFPVTTTVARKAASATPRGRPYNGGRMTNTNQSRFI 124

Query: 137 APPVPRKPFNNRTFVPKVAAPAQPQTNATQRQRPQTNATQRQRPQTLDSLFANMKEQRLR 196
           APP   +  + R FV K     Q +    Q+Q       QRQ PQTLDS FANMKE+R+R
Sbjct: 125 APPAQNRA-SQRGFVGK--QQQQQREKIVQQQANGGGGGQRQWPQTLDSRFANMKEERMR 184

Query: 197 VLSQRQNGAAQQRNGG-----RQQIPPWQR 220
           +     N +    NG      ++ + PW R
Sbjct: 185 MRRFADNRSNVGNNGAGSHQQQRSMVPWVR 210

BLAST of CmoCh16G005940 vs. TAIR 10
Match: AT4G10970.3 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G23910.2); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 134.0 bits (336), Expect = 1.5e-31
Identity = 98/210 (46.67%), Positives = 129/210 (61.43%), Query Frame = 0

Query: 17  KPLTTEAIAVTEKKMDMALDDIIKMSKNTANKAR-KERRFPNKMQKFPNNATQDRPRKLQ 76
           KP+TTE +A+TEKKMDM+LD+IIKM K+  N  + K++R  NK +KF + A ++   K Q
Sbjct: 5   KPITTETVALTEKKMDMSLDEIIKMEKSNTNVNKGKKQRVLNKKEKF-SGAAKNSAVKAQ 64

Query: 77  RFMDSRSSVRQGALAKRRSNFQGNQFALTTEVARRAVVAPTRPRAFY-RRAPNWNKTRVD 136
           R+MDSRS VRQGA AK+RSNFQGNQF +TT VAR+A  A  R R +   R  N N++R  
Sbjct: 65  RYMDSRSDVRQGAFAKKRSNFQGNQFPVTTTVARKAASATPRGRPYNGGRMTNTNQSRFI 124

Query: 137 APPVPRKPFNNRTFVPKVAAPAQPQTNATQRQRPQTNATQRQRPQTLDSLFANMKEQRLR 196
           APP   +  + R FV K     Q +    Q+Q       QRQ PQTLDS FANMKE+R+R
Sbjct: 125 APPAQNRA-SQRGFVGK--QQQQQREKIVQQQANGGGGGQRQWPQTLDSRFANMKEERMR 184

Query: 197 VLSQRQNGAAQQRNGG-----RQQIPPWQR 220
           +     N +    NG      ++ + PW R
Sbjct: 185 MRRFADNRSNVGNNGAGSHQQQRSMVPWVR 210

BLAST of CmoCh16G005940 vs. TAIR 10
Match: AT4G10970.4 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 14 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G23910.2); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 134.0 bits (336), Expect = 1.5e-31
Identity = 98/210 (46.67%), Positives = 129/210 (61.43%), Query Frame = 0

Query: 17  KPLTTEAIAVTEKKMDMALDDIIKMSKNTANKAR-KERRFPNKMQKFPNNATQDRPRKLQ 76
           KP+TTE +A+TEKKMDM+LD+IIKM K+  N  + K++R  NK +KF + A ++   K Q
Sbjct: 5   KPITTETVALTEKKMDMSLDEIIKMEKSNTNVNKGKKQRVLNKKEKF-SGAAKNSAVKAQ 64

Query: 77  RFMDSRSSVRQGALAKRRSNFQGNQFALTTEVARRAVVAPTRPRAFY-RRAPNWNKTRVD 136
           R+MDSRS VRQGA AK+RSNFQGNQF +TT VAR+A  A  R R +   R  N N++R  
Sbjct: 65  RYMDSRSDVRQGAFAKKRSNFQGNQFPVTTTVARKAASATPRGRPYNGGRMTNTNQSRFI 124

Query: 137 APPVPRKPFNNRTFVPKVAAPAQPQTNATQRQRPQTNATQRQRPQTLDSLFANMKEQRLR 196
           APP   +  + R FV K     Q +    Q+Q       QRQ PQTLDS FANMKE+R+R
Sbjct: 125 APPAQNRA-SQRGFVGK--QQQQQREKIVQQQANGGGGGQRQWPQTLDSRFANMKEERMR 184

Query: 197 VLSQRQNGAAQQRNGG-----RQQIPPWQR 220
           +     N +    NG      ++ + PW R
Sbjct: 185 MRRFADNRSNVGNNGAGSHQQQRSMVPWVR 210

BLAST of CmoCh16G005940 vs. TAIR 10
Match: AT4G10970.5 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 25 plant structures; EXPRESSED DURING: 14 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G23910.2); Has 52 Blast hits to 51 proteins in 11 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 52; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 103.6 bits (257), Expect = 2.1e-22
Identity = 88/209 (42.11%), Positives = 114/209 (54.55%), Query Frame = 0

Query: 31  MDMALDDIIKMSKNTANKAR-KERRFPNKMQKFPNNATQDRPRKLQRFMDSRSSVRQGAL 90
           MDM+LD+IIKM K+  N  + K++R  NK +KF + A ++   K QR+MDSRS VRQGA 
Sbjct: 1   MDMSLDEIIKMEKSNTNVNKGKKQRVLNKKEKF-SGAAKNSAVKAQRYMDSRSDVRQGAF 60

Query: 91  AKRRSNFQGNQFALTTEVARRAVVAPTRPRAFY-RRAPN-------------WNKTRVDA 150
           AK+RSNFQGNQF +TT VAR+A  A  R R +   R  N             W   R  A
Sbjct: 61  AKKRSNFQGNQFPVTTTVARKAASATPRGRPYNGGRMTNTNQSSWSIVGRLKWVDARFIA 120

Query: 151 PPVPRKPFNNRTFVPKVAAPAQPQTNATQRQRPQTNATQRQRPQTLDSLFANMKEQRLRV 210
           PP   +  + R FV K     Q +    Q+Q       QRQ PQTLDS FANMKE+R+R+
Sbjct: 121 PPAQNRA-SQRGFVGK--QQQQQREKIVQQQANGGGGGQRQWPQTLDSRFANMKEERMRM 180

Query: 211 LSQRQNGAAQQRNGG-----RQQIPPWQR 220
                N +    NG      ++ + PW R
Sbjct: 181 RRFADNRSNVGNNGAGSHQQQRSMVPWVR 205

The following BLAST results are available for this feature:

Match Name    E-value   Identity (%)  Description
A0A6J1ETG7    2.5e-108  100.00        uncharacterized protein LOC111437568 OS=Cucurbita moschata OX=3662 GN=LOC1114375...
A0A6J1JB99    5.3e-106  98.10         uncharacterized protein LOC111483428 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1JD83    1.4e-103  93.67         uncharacterized protein LOC111483428 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1FPB6    5.1e-85   81.99         uncharacterized protein LOC111447019 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1J5N5    1.3e-83   80.57         uncharacterized protein LOC111481570 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
AT4G10970.1   1.5e-31   46.67         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G10970.2   1.5e-31   46.67         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G10970.3   1.5e-31   46.67         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT4G10970.4   1.5e-31   46.67         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G10970.5   2.1e-22   42.11         unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term    Source Description          Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction         coord: 198..213
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction         coord: 154..180
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction         coord: 153..180
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction         coord: 198..224
None      No IPR available  PANTHER      PTHR36048      RIBOSOME MATURATION FACTOR  coord: 14..224
None      No IPR available  PANTHER      PTHR36048:SF1  RIBOSOME MATURATION FACTOR  coord: 14..224

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh16G005940.1  CmoCh16G005940.1  mRNA