CmoCh16G009000.1 (mRNA) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh16G009000.1
Type: mRNA
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: AT hook motif-containing protein
Location: Cmo_Chr16: 5302709 .. 5304215 (+)
Sequence length: 1381
RNA-Seq Expression: CmoCh16G009000.1
Synteny: CmoCh16G009000.1
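As a quick consistency check on the overview fields, the genomic span implied by the Location coordinates can be compared with the stated mRNA length. The sketch below assumes the coordinates are 1-based and inclusive, and it takes the 443-residue protein length from the BLAST alignments further down this record. The variable names are illustrative only and are not part of the database.

```python
# Coordinates and lengths copied from this record.
start, end = 5302709, 5304215      # Location on Cmo_Chr16 (+ strand)
mrna_len = 1381                    # "Sequence length" field
protein_len = 443                  # residues, per the BLAST alignments

# Assumption: coordinates are 1-based and inclusive.
genomic_span = end - start + 1
intron_total = genomic_span - mrna_len   # bases removed by splicing
cds_len = (protein_len + 1) * 3          # codons plus one stop codon
utr3_len = mrna_len - cds_len            # only a 3' UTR is annotated here

print(genomic_span, intron_total, cds_len, utr3_len)
```

Under these assumptions the span is 1507 bp, which leaves 126 bp of intron, a 1332 nt CDS, and a 49 nt 3' UTR. This is consistent with the two exons, two CDS segments, and single three_prime_UTR listed under Relationships.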
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGTCAGGCTGACCAAGGAATCAGCGCTGATAATTTAGTTGATGCTCCGTTGAAGCGAAAACGCGGTCGTCCCAGAAAATATCCGAAGCTAAGTTATGATGAGAATATTCTTATTTCAAAGAATAGAGGTAAGAAACATTTGGAGGCTATTCCTCTTTCTCCTGGTTCCGGAGTAAATGGAAACCAATCACAACCAGCAATTCAAATTCAAAATATATCTGATGGAATGCTGGGACAAGTCGTGTCTGGTGTCATTGAGGCAGTATTTGAAGCTGGATATCTTCTGTGTGTTAGGGTTGGCAACTCTGGAATCACTTTGAGGGGTGTGGTCTTTAAACCTGGGCATTATGTCCCAGTTTCGGCAGAGAACGATGTGGCCCCGGATATTCAAATGATCAGACGAAATATGGTTCCTTTTGCTACAGGAAATCAGAGTCCTGGAAATAACCCTCTATCCAATAATGGAGAGGTCCCATCCCACGAATCGTCAGGTGCCACACTGGGGTTTAGATATTCGCCTCCACATTCGAACTGGGACGCTCCGAAAGAAAAATCTGTATCATCTATACTTGCCCAAATTAACCCTTCCGGAAGCTCAAGAGGTAATGTGGTTCCTGTTGTGCAACTACCGGCTAAACTAACTAATGGACCCTTAGGTCCCTCTGAAACATTTACACTTCAGACTGCTGATATTGAATCCTCAAAAGGCAAAGAGGTTCTTATAGGTTCTTTTACGTCAAACGAATCAGCTCCCAACGATGTGACAGTTGGGATAGAAAGCTTTTCTTTCCAACCTCAAACTAGTCAGCAGGTCGTCTTACAAGATGATGTTTCGGTAGAAAACGCTTCTCACAACGAATCCTTGGTAATGGAAGTACATGATTCAGAAGGTAAATCAATGGCATTGCCTAGCACGCCTTTCGAGAGTCTTGTGACTGAAGTGATCAAGAGAATTCAAGCCCCCCCTCTGTCAGCTGAGATGCAGACTGAGAACAACAAACCAACTGTTACGATATCAGCTAAAGAATTCGAAGTTAGTTCGGAGGTTGAAGCTAACCTCCTAGCAGATGGAGCGTTAATGATTGAACCCCTAAAAGCAGTGCAGCCCCTTCATGAAAGTTCTGAGCATATTCCCAAAGCTCTGGATGACGAGTCTAGAACTGGCAAAATGACTGAGCTGTTACAGGTACACGAGTCTTCGACTAGAATGTATCCAAGTATTTGATTAGATTACTAATACTTCTTGTACATGGGAAGAATTCACTTGCTTTATGCCGCTGCCTGCACTAACAAAGTGACATACTTTGACAAGGTTCTGCAGGAAAACATGATGCAAACTCCAGATCCGTGGGGTGACGTGCACGACCCGGGTTTGATGCTGAAGTCAGAAGAACCTGGGGAATCAAGAAGAGAGATTGGGGATGAAGAAGAAGCTGGCAACCAAAAGCAAATGTGAGTAGCAGCAGCATGTATTGGCACAGTTAATACTCTGGGTATAGAAGGGG

mRNA sequence

ATGAGTCAGGCTGACCAAGGAATCAGCGCTGATAATTTAGTTGATGCTCCGTTGAAGCGAAAACGCGGTCGTCCCAGAAAATATCCGAAGCTAAGTTATGATGAGAATATTCTTATTTCAAAGAATAGAGGTAAGAAACATTTGGAGGCTATTCCTCTTTCTCCTGGTTCCGGAGTAAATGGAAACCAATCACAACCAGCAATTCAAATTCAAAATATATCTGATGGAATGCTGGGACAAGTCGTGTCTGGTGTCATTGAGGCAGTATTTGAAGCTGGATATCTTCTGTGTGTTAGGGTTGGCAACTCTGGAATCACTTTGAGGGGTGTGGTCTTTAAACCTGGGCATTATGTCCCAGTTTCGGCAGAGAACGATGTGGCCCCGGATATTCAAATGATCAGACGAAATATGGTTCCTTTTGCTACAGGAAATCAGAGTCCTGGAAATAACCCTCTATCCAATAATGGAGAGGTCCCATCCCACGAATCGTCAGGTGCCACACTGGGGTTTAGATATTCGCCTCCACATTCGAACTGGGACGCTCCGAAAGAAAAATCTGTATCATCTATACTTGCCCAAATTAACCCTTCCGGAAGCTCAAGAGGTAATGTGGTTCCTGTTGTGCAACTACCGGCTAAACTAACTAATGGACCCTTAGGTCCCTCTGAAACATTTACACTTCAGACTGCTGATATTGAATCCTCAAAAGGCAAAGAGGTTCTTATAGGTTCTTTTACGTCAAACGAATCAGCTCCCAACGATGTGACAGTTGGGATAGAAAGCTTTTCTTTCCAACCTCAAACTAGTCAGCAGGTCGTCTTACAAGATGATGTTTCGGTAGAAAACGCTTCTCACAACGAATCCTTGGTAATGGAAGTACATGATTCAGAAGGTAAATCAATGGCATTGCCTAGCACGCCTTTCGAGAGTCTTGTGACTGAAGTGATCAAGAGAATTCAAGCCCCCCCTCTGTCAGCTGAGATGCAGACTGAGAACAACAAACCAACTGTTACGATATCAGCTAAAGAATTCGAAGTTAGTTCGGAGGTTGAAGCTAACCTCCTAGCAGATGGAGCGTTAATGATTGAACCCCTAAAAGCAGTGCAGCCCCTTCATGAAAGTTCTGAGCATATTCCCAAAGCTCTGGATGACGAGTCTAGAACTGGCAAAATGACTGAGCTGTTACAGGTTCTGCAGGAAAACATGATGCAAACTCCAGATCCGTGGGGTGACGTGCACGACCCGGGTTTGATGCTGAAGTCAGAAGAACCTGGGGAATCAAGAAGAGAGATTGGGGATGAAGAAGAAGCTGGCAACCAAAAGCAAATGTGAGTAGCAGCAGCATGTATTGGCACAGTTAATACTCTGGGTATAGAAGGGG

Coding sequence (CDS)

ATGAGTCAGGCTGACCAAGGAATCAGCGCTGATAATTTAGTTGATGCTCCGTTGAAGCGAAAACGCGGTCGTCCCAGAAAATATCCGAAGCTAAGTTATGATGAGAATATTCTTATTTCAAAGAATAGAGGTAAGAAACATTTGGAGGCTATTCCTCTTTCTCCTGGTTCCGGAGTAAATGGAAACCAATCACAACCAGCAATTCAAATTCAAAATATATCTGATGGAATGCTGGGACAAGTCGTGTCTGGTGTCATTGAGGCAGTATTTGAAGCTGGATATCTTCTGTGTGTTAGGGTTGGCAACTCTGGAATCACTTTGAGGGGTGTGGTCTTTAAACCTGGGCATTATGTCCCAGTTTCGGCAGAGAACGATGTGGCCCCGGATATTCAAATGATCAGACGAAATATGGTTCCTTTTGCTACAGGAAATCAGAGTCCTGGAAATAACCCTCTATCCAATAATGGAGAGGTCCCATCCCACGAATCGTCAGGTGCCACACTGGGGTTTAGATATTCGCCTCCACATTCGAACTGGGACGCTCCGAAAGAAAAATCTGTATCATCTATACTTGCCCAAATTAACCCTTCCGGAAGCTCAAGAGGTAATGTGGTTCCTGTTGTGCAACTACCGGCTAAACTAACTAATGGACCCTTAGGTCCCTCTGAAACATTTACACTTCAGACTGCTGATATTGAATCCTCAAAAGGCAAAGAGGTTCTTATAGGTTCTTTTACGTCAAACGAATCAGCTCCCAACGATGTGACAGTTGGGATAGAAAGCTTTTCTTTCCAACCTCAAACTAGTCAGCAGGTCGTCTTACAAGATGATGTTTCGGTAGAAAACGCTTCTCACAACGAATCCTTGGTAATGGAAGTACATGATTCAGAAGGTAAATCAATGGCATTGCCTAGCACGCCTTTCGAGAGTCTTGTGACTGAAGTGATCAAGAGAATTCAAGCCCCCCCTCTGTCAGCTGAGATGCAGACTGAGAACAACAAACCAACTGTTACGATATCAGCTAAAGAATTCGAAGTTAGTTCGGAGGTTGAAGCTAACCTCCTAGCAGATGGAGCGTTAATGATTGAACCCCTAAAAGCAGTGCAGCCCCTTCATGAAAGTTCTGAGCATATTCCCAAAGCTCTGGATGACGAGTCTAGAACTGGCAAAATGACTGAGCTGTTACAGGTTCTGCAGGAAAACATGATGCAAACTCCAGATCCGTGGGGTGACGTGCACGACCCGGGTTTGATGCTGAAGTCAGAAGAACCTGGGGAATCAAGAAGAGAGATTGGGGATGAAGAAGAAGCTGGCAACCAAAAGCAAATGTGA

Protein sequence

MSQADQGISADNLVDAPLKRKRGRPRKYPKLSYDENILISKNRGKKHLEAIPLSPGSGVNGNQSQPAIQIQNISDGMLGQVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKPGHYVPVSAENDVAPDIQMIRRNMVPFATGNQSPGNNPLSNNGEVPSHESSGATLGFRYSPPHSNWDAPKEKSVSSILAQINPSGSSRGNVVPVVQLPAKLTNGPLGPSETFTLQTADIESSKGKEVLIGSFTSNESAPNDVTVGIESFSFQPQTSQQVVLQDDVSVENASHNESLVMEVHDSEGKSMALPSTPFESLVTEVIKRIQAPPLSAEMQTENNKPTVTISAKEFEVSSEVEANLLADGALMIEPLKAVQPLHESSEHIPKALDDESRTGKMTELLQVLQENMMQTPDPWGDVHDPGLMLKSEEPGESRREIGDEEEAGNQKQM
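
The protein sequence follows from the CDS by in-frame translation. The sketch below is a minimal illustration that assumes the standard nuclear genetic code (NCBI translation table 1); the `translate` helper and the `CODON_TABLE` name are hypothetical and are not taken from the database's own pipeline. Only the first 30 and last 15 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3   # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 30 and last 15 nucleotides of the CDS listed above.
cds_head = "ATGAGTCAGGCTGACCAAGGAATCAGCGCT"
cds_tail = "CAAAAGCAAATGTGA"
print(translate(cds_head))
print(translate(cds_tail))
```

The two fragments translate to the start (MSQADQGISA) and end (QKQM, followed by the stop codon) of the protein sequence shown above.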
Homology
BLAST of CmoCh16G009000.1 vs. ExPASy TrEMBL
Match: A0A6J1EVJ4 (uncharacterized protein LOC111438469 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111438469 PE=4 SV=1)

HSP 1 Score: 858.2 bits (2216), Expect = 1.5e-245
Identity = 443/443 (100.00%), Positives = 443/443 (100.00%), Query Frame = 0

Query: 1   MSQADQGISADNLVDAPLKRKRGRPRKYPKLSYDENILISKNRGKKHLEAIPLSPGSGVN 60
           MSQADQGISADNLVDAPLKRKRGRPRKYPKLSYDENILISKNRGKKHLEAIPLSPGSGVN
Sbjct: 1   MSQADQGISADNLVDAPLKRKRGRPRKYPKLSYDENILISKNRGKKHLEAIPLSPGSGVN 60

Query: 61  GNQSQPAIQIQNISDGMLGQVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKPGHYVPV 120
           GNQSQPAIQIQNISDGMLGQVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKPGHYVPV
Sbjct: 61  GNQSQPAIQIQNISDGMLGQVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKPGHYVPV 120

Query: 121 SAENDVAPDIQMIRRNMVPFATGNQSPGNNPLSNNGEVPSHESSGATLGFRYSPPHSNWD 180
           SAENDVAPDIQMIRRNMVPFATGNQSPGNNPLSNNGEVPSHESSGATLGFRYSPPHSNWD
Sbjct: 121 SAENDVAPDIQMIRRNMVPFATGNQSPGNNPLSNNGEVPSHESSGATLGFRYSPPHSNWD 180

Query: 181 APKEKSVSSILAQINPSGSSRGNVVPVVQLPAKLTNGPLGPSETFTLQTADIESSKGKEV 240
           APKEKSVSSILAQINPSGSSRGNVVPVVQLPAKLTNGPLGPSETFTLQTADIESSKGKEV
Sbjct: 181 APKEKSVSSILAQINPSGSSRGNVVPVVQLPAKLTNGPLGPSETFTLQTADIESSKGKEV 240

Query: 241 LIGSFTSNESAPNDVTVGIESFSFQPQTSQQVVLQDDVSVENASHNESLVMEVHDSEGKS 300
           LIGSFTSNESAPNDVTVGIESFSFQPQTSQQVVLQDDVSVENASHNESLVMEVHDSEGKS
Sbjct: 241 LIGSFTSNESAPNDVTVGIESFSFQPQTSQQVVLQDDVSVENASHNESLVMEVHDSEGKS 300

Query: 301 MALPSTPFESLVTEVIKRIQAPPLSAEMQTENNKPTVTISAKEFEVSSEVEANLLADGAL 360
           MALPSTPFESLVTEVIKRIQAPPLSAEMQTENNKPTVTISAKEFEVSSEVEANLLADGAL
Sbjct: 301 MALPSTPFESLVTEVIKRIQAPPLSAEMQTENNKPTVTISAKEFEVSSEVEANLLADGAL 360

Query: 361 MIEPLKAVQPLHESSEHIPKALDDESRTGKMTELLQVLQENMMQTPDPWGDVHDPGLMLK 420
           MIEPLKAVQPLHESSEHIPKALDDESRTGKMTELLQVLQENMMQTPDPWGDVHDPGLMLK
Sbjct: 361 MIEPLKAVQPLHESSEHIPKALDDESRTGKMTELLQVLQENMMQTPDPWGDVHDPGLMLK 420

Query: 421 SEEPGESRREIGDEEEAGNQKQM 444
           SEEPGESRREIGDEEEAGNQKQM
Sbjct: 421 SEEPGESRREIGDEEEAGNQKQM 443

BLAST of CmoCh16G009000.1 vs. ExPASy TrEMBL
Match: A0A6J1F1E7 (uncharacterized protein LOC111438469 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111438469 PE=4 SV=1)

HSP 1 Score: 847.8 bits (2189), Expect = 2.0e-242
Identity = 440/443 (99.32%), Positives = 440/443 (99.32%), Query Frame = 0

Query: 1   MSQADQGISADNLVDAPLKRKRGRPRKYPKLSYDENILISKNRGKKHLEAIPLSPGSGVN 60
           MSQADQGISADNLVDAPLKRKRGRPRKYPKLSYDENILISKNRGKKHLEAIPLSPGSGVN
Sbjct: 1   MSQADQGISADNLVDAPLKRKRGRPRKYPKLSYDENILISKNRGKKHLEAIPLSPGSGVN 60

Query: 61  GNQSQPAIQIQNISDGMLGQVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKPGHYVPV 120
           GNQSQPAIQIQNISDGMLGQVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKPGHYVPV
Sbjct: 61  GNQSQPAIQIQNISDGMLGQVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKPGHYVPV 120

Query: 121 SAENDVAPDIQMIRRNMVPFATGNQSPGNNPLSNNGEVPSHESSGATLGFRYSPPHSNWD 180
           SAENDVAPDIQMIRRNMVPFATGNQSPGNNPLSNNGEVPSHESSGATLGFRYSPPHSNWD
Sbjct: 121 SAENDVAPDIQMIRRNMVPFATGNQSPGNNPLSNNGEVPSHESSGATLGFRYSPPHSNWD 180

Query: 181 APKEKSVSSILAQINPSGSSRGNVVPVVQLPAKLTNGPLGPSETFTLQTADIESSKGKEV 240
           APKEKSVSSILAQINPSGSSRGNVVPVVQLPAKLTNGPLGPSETFTLQTADIESSKGKEV
Sbjct: 181 APKEKSVSSILAQINPSGSSRGNVVPVVQLPAKLTNGPLGPSETFTLQTADIESSKGKEV 240

Query: 241 LIGSFTSNESAPNDVTVGIESFSFQPQTSQQVVLQDDVSVENASHNESLVMEVHDSEGKS 300
           LIGSFTSNESAPNDVTVGIESFSFQPQTSQQVVLQDDVSVENASHNESLVMEVHDSEGKS
Sbjct: 241 LIGSFTSNESAPNDVTVGIESFSFQPQTSQQVVLQDDVSVENASHNESLVMEVHDSEGKS 300

Query: 301 MALPSTPFESLVTEVIKRIQAPPLSAEMQTENNKPTVTISAKEFEVSSEVEANLLADGAL 360
           MALPSTPFESLVTEVIKRIQAPPLSAEMQTENNKPTVTISAKEFEVSSEVEANLLADGAL
Sbjct: 301 MALPSTPFESLVTEVIKRIQAPPLSAEMQTENNKPTVTISAKEFEVSSEVEANLLADGAL 360

Query: 361 MIEPLKAVQPLHESSEHIPKALDDESRTGKMTELLQVLQENMMQTPDPWGDVHDPGLMLK 420
           MIEPLKAVQPLHESSEHIPKALDDESRTGKMTEL   LQENMMQTPDPWGDVHDPGLMLK
Sbjct: 361 MIEPLKAVQPLHESSEHIPKALDDESRTGKMTEL---LQENMMQTPDPWGDVHDPGLMLK 420

Query: 421 SEEPGESRREIGDEEEAGNQKQM 444
           SEEPGESRREIGDEEEAGNQKQM
Sbjct: 421 SEEPGESRREIGDEEEAGNQKQM 440

BLAST of CmoCh16G009000.1 vs. ExPASy TrEMBL
Match: A0A6J1JAQ9 (uncharacterized protein LOC111483279 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111483279 PE=4 SV=1)

HSP 1 Score: 844.0 bits (2179), Expect = 2.9e-241
Identity = 436/443 (98.42%), Positives = 440/443 (99.32%), Query Frame = 0

Query: 1   MSQADQGISADNLVDAPLKRKRGRPRKYPKLSYDENILISKNRGKKHLEAIPLSPGSGVN 60
           MSQADQGISADNLVDAPLKRKRGRPRKYPKLSYDENILISKNRGKKHLEAIP+SPGSGVN
Sbjct: 1   MSQADQGISADNLVDAPLKRKRGRPRKYPKLSYDENILISKNRGKKHLEAIPISPGSGVN 60

Query: 61  GNQSQPAIQIQNISDGMLGQVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKPGHYVPV 120
           GNQSQPAIQIQNISDGMLGQVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKPGHYVPV
Sbjct: 61  GNQSQPAIQIQNISDGMLGQVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKPGHYVPV 120

Query: 121 SAENDVAPDIQMIRRNMVPFATGNQSPGNNPLSNNGEVPSHESSGATLGFRYSPPHSNWD 180
           SAENDVAPDIQMIRRNMVPFATGNQSPGNNPLSNNGEVPSHESSGATLGF+YSPPHSNWD
Sbjct: 121 SAENDVAPDIQMIRRNMVPFATGNQSPGNNPLSNNGEVPSHESSGATLGFKYSPPHSNWD 180

Query: 181 APKEKSVSSILAQINPSGSSRGNVVPVVQLPAKLTNGPLGPSETFTLQTADIESSKGKEV 240
           APKEKSVSSILAQI PSGSSRGNVVPVVQLPAKLTNGPLGPSETFTLQTADIESSKGKEV
Sbjct: 181 APKEKSVSSILAQITPSGSSRGNVVPVVQLPAKLTNGPLGPSETFTLQTADIESSKGKEV 240

Query: 241 LIGSFTSNESAPNDVTVGIESFSFQPQTSQQVVLQDDVSVENASHNESLVMEVHDSEGKS 300
           LIGSFTSNESAPNDVTVGIESFSFQPQTSQQVVLQD+VSVENASHN+SLVMEVHDSEGKS
Sbjct: 241 LIGSFTSNESAPNDVTVGIESFSFQPQTSQQVVLQDNVSVENASHNKSLVMEVHDSEGKS 300

Query: 301 MALPSTPFESLVTEVIKRIQAPPLSAEMQTENNKPTVTISAKEFEVSSEVEANLLADGAL 360
           MALPSTPFESLVTEVIKRIQAP LSAEMQTENNKPTVTISAKEFEVSSEVEANLLADGAL
Sbjct: 301 MALPSTPFESLVTEVIKRIQAPTLSAEMQTENNKPTVTISAKEFEVSSEVEANLLADGAL 360

Query: 361 MIEPLKAVQPLHESSEHIPKALDDESRTGKMTELLQVLQENMMQTPDPWGDVHDPGLMLK 420
           MIEPLKAVQPLHESSEHIPKALDDESRTGKMTELLQVLQENMMQTPDPWGDVHDP LMLK
Sbjct: 361 MIEPLKAVQPLHESSEHIPKALDDESRTGKMTELLQVLQENMMQTPDPWGDVHDPCLMLK 420

Query: 421 SEEPGESRREIGDEEEAGNQKQM 444
           SEEPGESRREIGDEEEAGNQKQM
Sbjct: 421 SEEPGESRREIGDEEEAGNQKQM 443

BLAST of CmoCh16G009000.1 vs. ExPASy TrEMBL
Match: A0A6J1JCN8 (uncharacterized protein LOC111483279 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111483279 PE=4 SV=1)

HSP 1 Score: 833.6 bits (2152), Expect = 4.0e-238
Identity = 433/443 (97.74%), Positives = 437/443 (98.65%), Query Frame = 0

Query: 1   MSQADQGISADNLVDAPLKRKRGRPRKYPKLSYDENILISKNRGKKHLEAIPLSPGSGVN 60
           MSQADQGISADNLVDAPLKRKRGRPRKYPKLSYDENILISKNRGKKHLEAIP+SPGSGVN
Sbjct: 1   MSQADQGISADNLVDAPLKRKRGRPRKYPKLSYDENILISKNRGKKHLEAIPISPGSGVN 60

Query: 61  GNQSQPAIQIQNISDGMLGQVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKPGHYVPV 120
           GNQSQPAIQIQNISDGMLGQVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKPGHYVPV
Sbjct: 61  GNQSQPAIQIQNISDGMLGQVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKPGHYVPV 120

Query: 121 SAENDVAPDIQMIRRNMVPFATGNQSPGNNPLSNNGEVPSHESSGATLGFRYSPPHSNWD 180
           SAENDVAPDIQMIRRNMVPFATGNQSPGNNPLSNNGEVPSHESSGATLGF+YSPPHSNWD
Sbjct: 121 SAENDVAPDIQMIRRNMVPFATGNQSPGNNPLSNNGEVPSHESSGATLGFKYSPPHSNWD 180

Query: 181 APKEKSVSSILAQINPSGSSRGNVVPVVQLPAKLTNGPLGPSETFTLQTADIESSKGKEV 240
           APKEKSVSSILAQI PSGSSRGNVVPVVQLPAKLTNGPLGPSETFTLQTADIESSKGKEV
Sbjct: 181 APKEKSVSSILAQITPSGSSRGNVVPVVQLPAKLTNGPLGPSETFTLQTADIESSKGKEV 240

Query: 241 LIGSFTSNESAPNDVTVGIESFSFQPQTSQQVVLQDDVSVENASHNESLVMEVHDSEGKS 300
           LIGSFTSNESAPNDVTVGIESFSFQPQTSQQVVLQD+VSVENASHN+SLVMEVHDSEGKS
Sbjct: 241 LIGSFTSNESAPNDVTVGIESFSFQPQTSQQVVLQDNVSVENASHNKSLVMEVHDSEGKS 300

Query: 301 MALPSTPFESLVTEVIKRIQAPPLSAEMQTENNKPTVTISAKEFEVSSEVEANLLADGAL 360
           MALPSTPFESLVTEVIKRIQAP LSAEMQTENNKPTVTISAKEFEVSSEVEANLLADGAL
Sbjct: 301 MALPSTPFESLVTEVIKRIQAPTLSAEMQTENNKPTVTISAKEFEVSSEVEANLLADGAL 360

Query: 361 MIEPLKAVQPLHESSEHIPKALDDESRTGKMTELLQVLQENMMQTPDPWGDVHDPGLMLK 420
           MIEPLKAVQPLHESSEHIPKALDDESRTGKMTEL   LQENMMQTPDPWGDVHDP LMLK
Sbjct: 361 MIEPLKAVQPLHESSEHIPKALDDESRTGKMTEL---LQENMMQTPDPWGDVHDPCLMLK 420

Query: 421 SEEPGESRREIGDEEEAGNQKQM 444
           SEEPGESRREIGDEEEAGNQKQM
Sbjct: 421 SEEPGESRREIGDEEEAGNQKQM 440

BLAST of CmoCh16G009000.1 vs. ExPASy TrEMBL
Match: A0A1S3CJ92 (uncharacterized protein LOC103501592 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501592 PE=4 SV=1)

HSP 1 Score: 669.8 bits (1727), Expect = 7.6e-189
Identity = 348/443 (78.56%), Positives = 398/443 (89.84%), Query Frame = 0

Query: 1   MSQADQGISADNLVDAPLKRKRGRPRKYPKLSYDENILISKNRGKKHLEAIPLSPGSGVN 60
           MSQADQGIS+DNLVD PLKRKRGRPRKYPKL+YDENILI+KNRGKKHLEAIP+SPGSGVN
Sbjct: 1   MSQADQGISSDNLVDVPLKRKRGRPRKYPKLNYDENILIAKNRGKKHLEAIPISPGSGVN 60

Query: 61  GNQSQPAIQIQNISDGMLGQVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKPGHYVPV 120
           GNQS P IQIQN++DGMLGQVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKPGHYVPV
Sbjct: 61  GNQSLPTIQIQNVADGMLGQVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKPGHYVPV 120

Query: 121 SAENDVAPDIQMIRRNMVPFATGNQSPGNNPLSNNGEVPSHESSGATLGFRYSPPHSNWD 180
           SAENDVAP++QMIRRN +P ATGNQ+P +NP S NGE+PSHESSG  LGF+YSPPHS+ D
Sbjct: 121 SAENDVAPNVQMIRRNAIPLATGNQAPEDNPQSKNGEIPSHESSGLKLGFKYSPPHSSRD 180

Query: 181 APKEKSVSSILAQINPSGSSRGNVVPVVQLPAKLTNGPLGPSETFTLQTADIESSKGKEV 240
           A K+ S+SSI AQI PSGSSRGNVVPVV   AKLTNGP  P+ET T+QT DIES+KGKEV
Sbjct: 181 ALKDNSISSIFAQITPSGSSRGNVVPVVLQAAKLTNGPSVPTETLTIQTVDIESAKGKEV 240

Query: 241 LIGSFTSNESAPNDVTVGIESFSFQPQTSQQVVLQDDVSVENASHNESLVMEVHDSEGKS 300
           L+G+   +ESAP  VTVGIE  +FQPQT+QQV++ +DV VEN+SHN+SLV+EVHDSEGK 
Sbjct: 241 LVGTSALSESAPTSVTVGIE--NFQPQTTQQVLI-NDVQVENSSHNQSLVVEVHDSEGKL 300

Query: 301 MALPSTPFESLVTEVIKRIQAPPLSAEMQTENNKPTVTISAKEFEVSSEVEANLLADGAL 360
           MALPSTPFESLVTEVIKRIQ P L+AE Q+E+NKP+VTISAKE +   EVEAN+ ADGAL
Sbjct: 301 MALPSTPFESLVTEVIKRIQTPSLTAETQSEDNKPSVTISAKECQDGLEVEANIAADGAL 360

Query: 361 MIEPLKAVQPLHESSEHIPKALDDESRTGKMTELLQVLQENMMQTPDPWGDVHDPGLMLK 420
           MIEPLKAVQPL+ESSE IPKALDDES+TGK+TELLQVLQENM+QTP+PWG+  +PGLMLK
Sbjct: 361 MIEPLKAVQPLNESSEPIPKALDDESKTGKITELLQVLQENMIQTPEPWGEAQNPGLMLK 420

Query: 421 SEEPGESRREIGDEEEAGNQKQM 444
           S+EP ES++EIGD E++G+QKQ+
Sbjct: 421 SDEP-ESKKEIGD-EKSGSQKQI 438

BLAST of CmoCh16G009000.1 vs. TAIR 10
Match: AT5G54930.1 (AT hook motif-containing protein )

HSP 1 Score: 120.2 bits (300), Expect = 4.3e-27
Identity = 117/391 (29.92%), Positives = 161/391 (41.18%), Query Frame = 0

Query: 15  DAPLKRKRGRPRKYPKLSYDENILISKNRGKKHLEAIPLSPGSGVNGNQSQPAIQIQNIS 74
           D   KRKRGRPRK  KL  +E+ L                 G   + ++SQ   + +N  
Sbjct: 16  DLTAKRKRGRPRKQLKLESNEHSL-----------------GHSPSFSRSQQQSRQRNDD 75

Query: 75  DGMLGQVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKPGHYVPVSAENDVAPDIQMIR 134
           + M+GQ +SGVIEA FEAG+LL V+VGNS   LRGVVFKPGH  PVS +NDVAPD+ MIR
Sbjct: 76  EAMVGQPISGVIEATFEAGFLLSVKVGNSDSMLRGVVFKPGHCDPVSVDNDVAPDVPMIR 135

Query: 135 RNMVPFATGNQSPGNNPLSNNGEVPSHESSGATLGFRYSPPHSNWDAPKEKSVSSILAQI 194
           R                   N +V  H+ S A  G +           + +++  +  Q 
Sbjct: 136 R-------------------NSDVMHHDGS-AKRGRKSRFREKRGSGVRSRALVPVPIQP 195

Query: 195 NPSGSSRGNVVPVVQLPAKLTNGPLGPSETFTLQTADIESSKGKEVLIGSFTSNESAPND 254
                    +VPVV  PA L NG                               E  P  
Sbjct: 196 AHPTIPNNLIVPVVLQPAHLENG------------------------------GERVP-- 255

Query: 255 VTVGIESFSFQPQTSQQVVLQDDVSVENASHNESLVMEVHDSEGKSMALPSTPFESLVTE 314
               I+    Q +T  Q                            S A    PFE+L+T+
Sbjct: 256 ----IDHSPMQTETGSQA---------------------------SGASNGKPFETLLTQ 280

Query: 315 VIKRIQAPPLSAEMQTENNKPTVTISAKEFEVSSEVEANLLADGALMIEPLKAVQPLHES 374
           V+ + Q    +  ++ E++                       + AL IEPL+A+ P+H  
Sbjct: 316 VMNKGQVQHTTQSVEPESD-----------------------EQALSIEPLQAIHPIH-- 280

Query: 375 SEHIPKALDDESRTGKMTELLQVLQENMMQT 406
             H+ K +    R GKMTELLQ +QEN+ +T
Sbjct: 376 PVHMLKPMPSYGR-GKMTELLQAVQENVRET 280

BLAST of CmoCh16G009000.1 vs. TAIR 10
Match: AT5G54930.2 (AT hook motif-containing protein )

HSP 1 Score: 107.1 bits (266), Expect = 3.8e-23
Identity = 77/203 (37.93%), Positives = 100/203 (49.26%), Query Frame = 0

Query: 15  DAPLKRKRGRPRKYPKLSYDENILISKNRGKKHLEAIPLSPGSGVNGNQSQPAIQIQNIS 74
           D   KRKRGRPRK  KL  +E+ L                 G   + ++SQ   + +N  
Sbjct: 16  DLTAKRKRGRPRKQLKLESNEHSL-----------------GHSPSFSRSQQQSRQRNDD 75

Query: 75  DGMLGQVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKPGHYVPVSAENDVAPDIQMIR 134
           + M+GQ +SGVIEA FEAG+LL V+VGNS   LRGVVFKPGH  PVS +NDVAPD+ MIR
Sbjct: 76  EAMVGQPISGVIEATFEAGFLLSVKVGNSDSMLRGVVFKPGHCDPVSVDNDVAPDVPMIR 135

Query: 135 RNMVPFATGNQSPGNNPLSNNGEVPSHESSGATLGFRYSPPHSNWDAPKEKSVSSILAQI 194
           R                   N +V  H+ S A  G +           + +++  +  Q 
Sbjct: 136 R-------------------NSDVMHHDGS-AKRGRKSRFREKRGSGVRSRALVPVPIQP 181

Query: 195 NPSGSSRGNVVPVVQLPAKLTNG 218
                    +VPVV  PA L NG
Sbjct: 196 AHPTIPNNLIVPVVLQPAHLENG 181

BLAST of CmoCh16G009000.1 vs. TAIR 10
Match: AT4G21895.1 (DNA binding )

HSP 1 Score: 63.5 bits (153), Expect = 4.8e-10
Identity = 49/141 (34.75%), Positives = 69/141 (48.94%), Query Frame = 0

Query: 15  DAPLKRKRGRPRKYPKLSYDENILISKNRGKKHLEAIPLSPGSGVNGNQSQPAIQIQNIS 74
           D   KRKRGRPRK      DEN                 +P   +N              
Sbjct: 10  DGLAKRKRGRPRK------DEN----------------STPKPDMN-------------- 69

Query: 75  DGMLGQVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKPGHYVPVSAENDVAPDIQMI- 134
             ++G+VV+GVIE  F+AGYLL V+V +S   LRG+VF  G   P++ ENDVAP ++M  
Sbjct: 70  --LVGKVVTGVIEGSFDAGYLLNVKVKDSDTQLRGLVFIRGRVTPITPENDVAPLVKMYG 112

Query: 135 RRNMVPFATGNQSPGNNPLSN 155
           R ++    T +  P + P+ +
Sbjct: 130 REDIKNNQTDHSFPTDQPMQD 112

BLAST of CmoCh16G009000.1 vs. TAIR 10
Match: AT5G52890.1 (AT hook motif-containing protein )

HSP 1 Score: 61.6 bits (148), Expect = 1.8e-09
Identity = 49/156 (31.41%), Positives = 73/156 (46.79%), Query Frame = 0

Query: 1   MSQADQGISADNLVDAPLKRKRGRPRKYPKLSYDENILISKNRGKKHLEAIPLSPGSGVN 60
           M Q +QG S+ +L +   KRKRGRPR+      DE                         
Sbjct: 9   MDQLNQG-SSSSLTN---KRKRGRPRR------DE------------------------- 68

Query: 61  GNQSQPAIQIQNISDGMLGQVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKPGHYVPV 120
            +Q+Q  +    + + ++G+VVSGV+E  FEAGY L V+V ++   L+GVVF P    P+
Sbjct: 69  -SQTQQPVN-PPVDENLIGRVVSGVVEGSFEAGYFLNVKVADTEKQLKGVVFLPQKVTPL 127

Query: 121 SAENDVAPDIQMIRRNMVPFATGNQSPGNNPLSNNG 157
           +   D+ P  +M  RN +P  +  Q        N G
Sbjct: 129 TPATDLFPQAKMYARNDIPIPSSYQQTPLQEKKNAG 127

BLAST of CmoCh16G009000.1 vs. TAIR 10
Match: AT5G52890.2 (AT hook motif-containing protein )

HSP 1 Score: 61.6 bits (148), Expect = 1.8e-09
Identity = 49/156 (31.41%), Positives = 73/156 (46.79%), Query Frame = 0

Query: 1   MSQADQGISADNLVDAPLKRKRGRPRKYPKLSYDENILISKNRGKKHLEAIPLSPGSGVN 60
           M Q +QG S+ +L +   KRKRGRPR+      DE                         
Sbjct: 9   MDQLNQG-SSSSLTN---KRKRGRPRR------DE------------------------- 68

Query: 61  GNQSQPAIQIQNISDGMLGQVVSGVIEAVFEAGYLLCVRVGNSGITLRGVVFKPGHYVPV 120
            +Q+Q  +    + + ++G+VVSGV+E  FEAGY L V+V ++   L+GVVF P    P+
Sbjct: 69  -SQTQQPVN-PPVDENLIGRVVSGVVEGSFEAGYFLNVKVADTEKQLKGVVFLPQKVTPL 127

Query: 121 SAENDVAPDIQMIRRNMVPFATGNQSPGNNPLSNNG 157
           +   D+ P  +M  RN +P  +  Q        N G
Sbjct: 129 TPATDLFPQAKMYARNDIPIPSSYQQTPLQEKKNAG 127

The following BLAST results are available for this feature:
Match Name    E-value     Identity   Description
A0A6J1EVJ4    1.5e-245    100.00     uncharacterized protein LOC111438469 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1F1E7    2.0e-242    99.32      uncharacterized protein LOC111438469 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1JAQ9    2.9e-241    98.42      uncharacterized protein LOC111483279 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1JCN8    4.0e-238    97.74      uncharacterized protein LOC111483279 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A1S3CJ92    7.6e-189    78.56      uncharacterized protein LOC103501592 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...

Match Name    E-value     Identity   Description
AT5G54930.1   4.3e-27     29.92      AT hook motif-containing protein
AT5G54930.2   3.8e-23     37.93      AT hook motif-containing protein
AT4G21895.1   4.8e-10     34.75      DNA binding
AT5G52890.1   1.8e-09     31.41      AT hook motif-containing protein
AT5G52890.2   1.8e-09     31.41      AT hook motif-containing protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term    Source Description                Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction               coord: 142..181
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction               coord: 416..443
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction               coord: 404..443
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction               coord: 1..27
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction               coord: 142..171
None      No IPR available  PANTHER      PTHR34682      AT HOOK MOTIF-CONTAINING PROTEIN  coord: 1..428
None      No IPR available  PANTHER      PTHR34682:SF1  AT HOOK MOTIF-CONTAINING PROTEIN  coord: 1..428
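
Several of the mobidb-lite rows above overlap one another, so the distinct disordered regions are easier to see after merging. The sketch below is one way to do that, assuming the coordinates are 1-based inclusive residue positions. The `merge_intervals` helper is hypothetical and is not part of InterProScan or mobidb-lite.

```python
def merge_intervals(intervals):
    """Merge overlapping 1-based inclusive (start, end) ranges."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous range: extend it if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

# mobidb-lite coordinates from the table above.
disorder = [(142, 181), (416, 443), (404, 443), (1, 27), (142, 171)]
merged = merge_intervals(disorder)
total = sum(end - start + 1 for start, end in merged)
print(merged, total)
```

For this record the five rows collapse to three regions (1..27, 142..181, 404..443), covering 107 of the 443 residues.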

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name     Unique Name      Type
CmoCh16G009000   CmoCh16G009000   gene


The following exon feature(s) are a part of this mRNA:

Feature Name                  Unique Name                   Type
CmoCh16G009000.1:exon:10015   CmoCh16G009000.1:exon:10015   exon
CmoCh16G009000.1:exon:10016   CmoCh16G009000.1:exon:10016   exon


The following CDS feature(s) are a part of this mRNA:

Feature Name           Unique Name              Type
CmoCh16G009000.1:cds   CmoCh16G009000.1:cds     CDS
CmoCh16G009000.1:cds   CmoCh16G009000.1:cds_2   CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name                       Unique Name                        Type
CmoCh16G009000.1:three_prime_utr   CmoCh16G009000.1:three_prime_utr   three_prime_UTR


The following polypeptide feature(s) derive from this mRNA:

Feature Name       Unique Name                Type
CmoCh16G009000.1   CmoCh16G009000.1-protein   polypeptide