CmaCh16G006620 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh16G006620
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: Plus3 domain-containing protein
Location: Cma_Chr16: 3449008 .. 3451164 (-)
RNA-Seq Expression: CmaCh16G006620
Synteny: CmaCh16G006620
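
As a quick sanity check on the Location field, the genomic span of the gene can be derived from its coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for records like this one, but not stated on this page). The helper name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split 'Chrom: start .. end (strand)' into its four parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("Cma_Chr16: 3449008 .. 3451164 (-)")

# Assumption: 1-based inclusive coordinates, so both end points are counted.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the gene spans 2157 bp on the minus strand of Cma_Chr16.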
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGTTTTTAGGTTCGTAATGGCTGTAGTATTGATTATGTTTATGGTATTCTCTGATTTGCTTGGTTCTGACTGGATTTTGAGCCTTATCAACCTCAAGGTTTTATAAACAGTCTTGCTACTTTTCAGGACGGGAGGGCAAGTCTAGAAATCACAGATTCTGGTACCAGAGGCTTCAAAACATGGCAGATCTAGAAAATTTACTTCTTGAGGCTGCGGGAAGAACTAAGGCATCCGGGAGAAATCGACACTCTCATCCACCATCACGAAGACAGCGCGAAGGTTCATATTCTGATGCTGGAAGTGACTCGAGGGATGATGACTCAGATGATGAACGTGGTTATGCGAGCAGGAAGCCATCTGGATCTCAGGTTCCTCTGAAGAAGAGGTTAGATCCTAATGAGAGGGATGATGATGTGGGCAGCCCAGAAGAAGGGGAAGACGAAGATGTTGGTTCAGAACACGAGGGTGACAGCAGTGATGAGTCTGATGTTGGGGATGATCTTTACAAAGATGACGATGACAGGCGCAAGCTTGCTGGTATGTCTGAACTTCAAAGGGAGATGATTCTTTCAGACAGAGCATCGAAGAAGAATGATAAGCATTTATATGAAAGCTTGAGATCTAAGATGGATAAAGGGAAGGCTGCCCCGTCTCGGAAAGAAAATCCTCCTCTCCCGTCATCTCGTATTAGATCGTCAGCCCGATCTGCTGATAGAGCAGCTGCAAAAGATGATGCTTTAAATGAATTGCGTGCAAAAAGGTTGAAGCAGCAGGACCCTGAGGCTCACCGCAAATTGAGAGATACATCTAGAGGAAACGCAAATAATCGAAGGTTCTCACCAACAAAGCGAAAGCCCTTCACTGCTCCTAGTTTGAGTAGCTCAAGCCAAAGTGAAAGTGAAAGTAGGTTTCAAAGTGACGATGAAGGATCTACAGGAGATGGTGGAATGATTGACAGTGACGATGAAAGAACCATGCCTGGTTCAAATGGGCCAACATTTGATGATATCAAGGAAATTACTATTCGTAGATCAAAGCTTGCAAAATGGTTAATGGAGCCATTCTTTGAGGAGTTGATTGTTGGATGCTTTGTGAGAGTCGGAATCGGGAGATCAAGATCTGGGCCTATCTACAGGCTCTGCTTGGTGCGCAATGTTGATGCAACAGAACCTGATCGTCAGTATAAACTCGAGAACAAAATCACGCATAAATATCTTAATGTTATTTGGGGAAATGAAAGTTCTGCTGCCAGATGGCAAATGGCTATGGTTTCGGACTCTGTTCCACTCGAGGATGAATATAAACAGTGGGTGAAGGAAGTAGAGCGAACGGGCGGTCGGATGCTGAGCAAGCAGGATATCTTGGAGAAGAAGGAAGCTATACAGAAAGCCAACAACTTTGTGTACTCAGCAGCCACAGTGAAGCAGATGTTGCAGGAGAAAAAATTTGCCTCATCAAGGCCATTAAATATTGCAGCTGAGAAGGACCGACTGAGGAATCAGATGGACGTAGCACTAAGCAAAAACAATGAAGCTGAGGTGGAGAGGATCAAGGCAAAACTGCGCCAATTAGACGCATCCAGGAGGTCACAAATGAAAGATGCCAAGGCTATTAGGTTATCTGAGATGAACAGGAAGAATAGGGTGGAAAACTTCAAGAATGCATCAGAACTAAGACCCACGAAAGATTTGAAAGCCGGTGAGGCTGGTTATGATCCCTTCTCTAGGAGATGGACGAGGTCGAGGAATTATTATGTTTCAAACGCTGGTCAAGTCGATGGGGCTGCCGAGGCAGCTGGCAACAGTGACAATATAACTCCTGCATCAGAGAGTACTGGAACAGGATCTGGAGAAGCTGGTGTGGCAGCTACCGCAGCAGCTTTGGAAGCTGCTGCTGGTGCTGGAAAGTTGGTCGATACTAACGCTCCAGTAGATGGAGGAACAGAATCGAACTTGCTGCACAACTTCGAGCTGCCTATATCATTGACTGTGCTTCA
GAAATTTGGTGGTGCTCTGGGAGCTCAAGCTGGGTTCTTAGCAAGGAAACAAAGGATAGAAGCCACAGTTGGGCGTCAAGTCCCTGAGAATGATGGTAGGCGGCATGCACTGACACTCACTGTTAGCGACTACAAGAGAAGAAGAGGGCTTCTTTGA

mRNA sequence

ATGAAGTTTTTAGGACGGGAGGGCAAGTCTAGAAATCACAGATTCTGGTACCAGAGGCTTCAAAACATGGCAGATCTAGAAAATTTACTTCTTGAGGCTGCGGGAAGAACTAAGGCATCCGGGAGAAATCGACACTCTCATCCACCATCACGAAGACAGCGCGAAGGTTCATATTCTGATGCTGGAAGTGACTCGAGGGATGATGACTCAGATGATGAACGTGGTTATGCGAGCAGGAAGCCATCTGGATCTCAGGTTCCTCTGAAGAAGAGGTTAGATCCTAATGAGAGGGATGATGATGTGGGCAGCCCAGAAGAAGGGGAAGACGAAGATGTTGGTTCAGAACACGAGGGTGACAGCAGTGATGAGTCTGATGTTGGGGATGATCTTTACAAAGATGACGATGACAGGCGCAAGCTTGCTGGTATGTCTGAACTTCAAAGGGAGATGATTCTTTCAGACAGAGCATCGAAGAAGAATGATAAGCATTTATATGAAAGCTTGAGATCTAAGATGGATAAAGGGAAGGCTGCCCCGTCTCGGAAAGAAAATCCTCCTCTCCCGTCATCTCGTATTAGATCGTCAGCCCGATCTGCTGATAGAGCAGCTGCAAAAGATGATGCTTTAAATGAATTGCGTGCAAAAAGGTTGAAGCAGCAGGACCCTGAGGCTCACCGCAAATTGAGAGATACATCTAGAGGAAACGCAAATAATCGAAGGTTCTCACCAACAAAGCGAAAGCCCTTCACTGCTCCTAGTTTGAGTAGCTCAAGCCAAAGTGAAAGTGAAAGTAGGTTTCAAAGTGACGATGAAGGATCTACAGGAGATGGTGGAATGATTGACAGTGACGATGAAAGAACCATGCCTGGTTCAAATGGGCCAACATTTGATGATATCAAGGAAATTACTATTCGTAGATCAAAGCTTGCAAAATGGTTAATGGAGCCATTCTTTGAGGAGTTGATTGTTGGATGCTTTGTGAGAGTCGGAATCGGGAGATCAAGATCTGGGCCTATCTACAGGCTCTGCTTGGTGCGCAATGTTGATGCAACAGAACCTGATCGTCAGTATAAACTCGAGAACAAAATCACGCATAAATATCTTAATGTTATTTGGGGAAATGAAAGTTCTGCTGCCAGATGGCAAATGGCTATGGTTTCGGACTCTGTTCCACTCGAGGATGAATATAAACAGTGGGTGAAGGAAGTAGAGCGAACGGGCGGTCGGATGCTGAGCAAGCAGGATATCTTGGAGAAGAAGGAAGCTATACAGAAAGCCAACAACTTTGTGTACTCAGCAGCCACAGTGAAGCAGATGTTGCAGGAGAAAAAATTTGCCTCATCAAGGCCATTAAATATTGCAGCTGAGAAGGACCGACTGAGGAATCAGATGGACGTAGCACTAAGCAAAAACAATGAAGCTGAGGTGGAGAGGATCAAGGCAAAACTGCGCCAATTAGACGCATCCAGGAGGTCACAAATGAAAGATGCCAAGGCTATTAGGTTATCTGAGATGAACAGGAAGAATAGGGTGGAAAACTTCAAGAATGCATCAGAACTAAGACCCACGAAAGATTTGAAAGCCGGTGAGGCTGGTTATGATCCCTTCTCTAGGAGATGGACGAGGTCGAGGAATTATTATGTTTCAAACGCTGGTCAAGTCGATGGGGCTGCCGAGGCAGCTGGCAACAGTGACAATATAACTCCTGCATCAGAGAGTACTGGAACAGGATCTGGAGAAGCTGGTGTGGCAGCTACCGCAGCAGCTTTGGAAGCTGCTGCTGGTGCTGGAAAGTTGGTCGATACTAACGCTCCAGTAGATGGAGGAACAGAATCGAACTTGCTGCACAACTTCGAGCTGCCTATATCATTGACTGTGCTTCAGAAATTTGGTGGTGCTCTGGGAGCTCAAGCTGGGTTCTTAGCAAGGAAACAAAGGATAGAAGCCACAGTTGGGCGTCAAGTCCCTGAGAATGATGGTAGGCGGCATGCACTGACACT
CACTGTTAGCGACTACAAGAGAAGAAGAGGGCTTCTTTGA

Coding sequence (CDS)

ATGAAGTTTTTAGGACGGGAGGGCAAGTCTAGAAATCACAGATTCTGGTACCAGAGGCTTCAAAACATGGCAGATCTAGAAAATTTACTTCTTGAGGCTGCGGGAAGAACTAAGGCATCCGGGAGAAATCGACACTCTCATCCACCATCACGAAGACAGCGCGAAGGTTCATATTCTGATGCTGGAAGTGACTCGAGGGATGATGACTCAGATGATGAACGTGGTTATGCGAGCAGGAAGCCATCTGGATCTCAGGTTCCTCTGAAGAAGAGGTTAGATCCTAATGAGAGGGATGATGATGTGGGCAGCCCAGAAGAAGGGGAAGACGAAGATGTTGGTTCAGAACACGAGGGTGACAGCAGTGATGAGTCTGATGTTGGGGATGATCTTTACAAAGATGACGATGACAGGCGCAAGCTTGCTGGTATGTCTGAACTTCAAAGGGAGATGATTCTTTCAGACAGAGCATCGAAGAAGAATGATAAGCATTTATATGAAAGCTTGAGATCTAAGATGGATAAAGGGAAGGCTGCCCCGTCTCGGAAAGAAAATCCTCCTCTCCCGTCATCTCGTATTAGATCGTCAGCCCGATCTGCTGATAGAGCAGCTGCAAAAGATGATGCTTTAAATGAATTGCGTGCAAAAAGGTTGAAGCAGCAGGACCCTGAGGCTCACCGCAAATTGAGAGATACATCTAGAGGAAACGCAAATAATCGAAGGTTCTCACCAACAAAGCGAAAGCCCTTCACTGCTCCTAGTTTGAGTAGCTCAAGCCAAAGTGAAAGTGAAAGTAGGTTTCAAAGTGACGATGAAGGATCTACAGGAGATGGTGGAATGATTGACAGTGACGATGAAAGAACCATGCCTGGTTCAAATGGGCCAACATTTGATGATATCAAGGAAATTACTATTCGTAGATCAAAGCTTGCAAAATGGTTAATGGAGCCATTCTTTGAGGAGTTGATTGTTGGATGCTTTGTGAGAGTCGGAATCGGGAGATCAAGATCTGGGCCTATCTACAGGCTCTGCTTGGTGCGCAATGTTGATGCAACAGAACCTGATCGTCAGTATAAACTCGAGAACAAAATCACGCATAAATATCTTAATGTTATTTGGGGAAATGAAAGTTCTGCTGCCAGATGGCAAATGGCTATGGTTTCGGACTCTGTTCCACTCGAGGATGAATATAAACAGTGGGTGAAGGAAGTAGAGCGAACGGGCGGTCGGATGCTGAGCAAGCAGGATATCTTGGAGAAGAAGGAAGCTATACAGAAAGCCAACAACTTTGTGTACTCAGCAGCCACAGTGAAGCAGATGTTGCAGGAGAAAAAATTTGCCTCATCAAGGCCATTAAATATTGCAGCTGAGAAGGACCGACTGAGGAATCAGATGGACGTAGCACTAAGCAAAAACAATGAAGCTGAGGTGGAGAGGATCAAGGCAAAACTGCGCCAATTAGACGCATCCAGGAGGTCACAAATGAAAGATGCCAAGGCTATTAGGTTATCTGAGATGAACAGGAAGAATAGGGTGGAAAACTTCAAGAATGCATCAGAACTAAGACCCACGAAAGATTTGAAAGCCGGTGAGGCTGGTTATGATCCCTTCTCTAGGAGATGGACGAGGTCGAGGAATTATTATGTTTCAAACGCTGGTCAAGTCGATGGGGCTGCCGAGGCAGCTGGCAACAGTGACAATATAACTCCTGCATCAGAGAGTACTGGAACAGGATCTGGAGAAGCTGGTGTGGCAGCTACCGCAGCAGCTTTGGAAGCTGCTGCTGGTGCTGGAAAGTTGGTCGATACTAACGCTCCAGTAGATGGAGGAACAGAATCGAACTTGCTGCACAACTTCGAGCTGCCTATATCATTGACTGTGCTTCAGAAATTTGGTGGTGCTCTGGGAGCTCAAGCTGGGTTCTTAGCAAGGAAACAAAGGATAGAAGCCACAGTTGGGCGTCAAGTCCCTGAGAATGATGGTAGGCGGCATGCACTGACACT
CACTGTTAGCGACTACAAGAGAAGAAGAGGGCTTCTTTGA

Protein sequence

MKFLGREGKSRNHRFWYQRLQNMADLENLLLEAAGRTKASGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDERGYASRKPSGSQVPLKKRLDPNERDDDVGSPEEGEDEDVGSEHEGDSSDESDVGDDLYKDDDDRRKLAGMSELQREMILSDRASKKNDKHLYESLRSKMDKGKAAPSRKENPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQDPEAHRKLRDTSRGNANNRRFSPTKRKPFTAPSLSSSSQSESESRFQSDDEGSTGDGGMIDSDDERTMPGSNGPTFDDIKEITIRRSKLAKWLMEPFFEELIVGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNESSAARWQMAMVSDSVPLEDEYKQWVKEVERTGGRMLSKQDILEKKEAIQKANNFVYSAATVKQMLQEKKFASSRPLNIAAEKDRLRNQMDVALSKNNEAEVERIKAKLRQLDASRRSQMKDAKAIRLSEMNRKNRVENFKNASELRPTKDLKAGEAGYDPFSRRWTRSRNYYVSNAGQVDGAAEAAGNSDNITPASESTGTGSGEAGVAATAAALEAAAGAGKLVDTNAPVDGGTESNLLHNFELPISLTVLQKFGGALGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL
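
The relationship between the coding sequence and the protein sequence can be checked by translating codons. The sketch below assumes the standard nuclear genetic code (the page does not state which translation table was used) and only translates the first 24 and the final 30 nucleotides of the CDS shown above. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a nucleotide string in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGAAGTTTTTAGGACGGGAGGGC"        # first 24 nt of the CDS
cds_end = "GACTACAAGAGAAGAAGAGGGCTTCTTTGA"    # final 30 nt of the CDS
print(translate(cds_start))  # MKFLGREG
print(translate(cds_end))    # DYKRRRGLL
```

Both fragments agree with the N-terminus (MKFLGREG...) and C-terminus (...DYKRRRGLL) of the protein sequence listed above.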
Homology
BLAST of CmaCh16G006620 vs. ExPASy Swiss-Prot
Match: Q9C950 (Protein RTF1 homolog OS=Arabidopsis thaliana OX=3702 GN=VIP5 PE=1 SV=1)

HSP 1 Score: 800.0 bits (2065), Expect = 2.0e-230
Identity = 456/659 (69.20%), Positives = 533/659 (80.88%), Query Frame = 0

Query: 23  MADLENLLLEAAGRTKASGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDERGYASRKPS 82
           M DLENLLLEAAGRT ++GR+R  HPPS R+REGSYSD  SDSR DDSD++RGYASRKPS
Sbjct: 1   MGDLENLLLEAAGRTNSAGRSR--HPPSSRRREGSYSDGSSDSR-DDSDEDRGYASRKPS 60

Query: 83  GSQVPLKKRLDPNERDDDVGSPEEGEDEDVGSEHEGDSSDESDVGDDLYKDDDDRRKLAG 142
           GSQVPLKKRL+  ER+D     E G   D  S+ EGDSS+ESD GDDLYK+++DR+KLAG
Sbjct: 61  GSQVPLKKRLEA-EREDRAARVEGGYG-DGPSDREGDSSEESDFGDDLYKNEEDRQKLAG 120

Query: 143 MSELQREMILSDRASKKNDKHLYESLRSKMDKGKAAPSRKENPPLPSSR-IRSSARSADR 202
           M+E QREMILS+RA KK DK+  E LRSK +  K   S+KE  PLP+SR +RSSARSADR
Sbjct: 121 MTEFQREMILSERADKKGDKNFTEKLRSKRESEKTPVSKKETQPLPASRGVRSSARSADR 180

Query: 203 AAAKDDALNELRAKRLKQQDPEAHRKLRDTSRGNANNRRFSPTKRKPFTAPSLSSSSQSE 262
           AAAKDDALNELRAKR+KQQDP A RKLRD S+G + +R FS TKRKP  + +LSSSSQS+
Sbjct: 181 AAAKDDALNELRAKRMKQQDPAALRKLRDASKGGSGSRDFSSTKRKPLASSNLSSSSQSD 240

Query: 263 SESRFQSDDEGSTGDGGMIDSDDERTMPGSNGPTFDDIKEITIRRSKLAKWLMEPFFEEL 322
           S+SR QSDDEGS  +GGM+DSDD+R    S+ PTF+D+KE+TIRRSKLAKWLMEPFFEEL
Sbjct: 241 SDSRSQSDDEGS--NGGMLDSDDDR----SDVPTFEDVKEVTIRRSKLAKWLMEPFFEEL 300

Query: 323 IVGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNESSAARW 382
           IVGCFVRVGIGRS+SGPIYRLC V+NVDAT+PD+ YKLENK THKYLNV+WGNE+SAARW
Sbjct: 301 IVGCFVRVGIGRSKSGPIYRLCWVKNVDATDPDKTYKLENKTTHKYLNVVWGNETSAARW 360

Query: 383 QMAMVSDSVPLEDEYKQWVKEVERTGGRMLSKQDILEKKEAIQKANNFVYSAATVKQMLQ 442
           QMAM+SD  PLE+EY+QW++EVERT GRM +KQDI EKKEAIQ+ N+FVYSA TVKQMLQ
Sbjct: 361 QMAMISDGHPLEEEYRQWIREVERTNGRMPTKQDISEKKEAIQRTNSFVYSAETVKQMLQ 420

Query: 443 EKKFASSRPLNIAAEKDRLRNQMDVALSKNNEAEVERIKAKLRQLDASRRSQMKDAKAIR 502
           EKK AS RP+N+AAEKDRLR ++++A SKN+EA VERIK+K++QLDASR  +  D KA++
Sbjct: 421 EKKSASVRPMNVAAEKDRLRKELEIAQSKNDEAGVERIKSKIKQLDASRNKKGVDKKALK 480

Query: 503 LSEMNRKNRVENFKNASELRP-TKDLKAGEAGYDPFSRRWTRSRNYYVSNAGQVDGAAEA 562
           L+EMN+KNR ENFKNASE++  T  LKAGEAGYDPFSRRWTRS NYY       DG    
Sbjct: 481 LAEMNKKNRAENFKNASEVKSITASLKAGEAGYDPFSRRWTRSSNYYNGKNKGKDGEENE 540

Query: 563 AGNSDNITPASESTGTGSGEAGVAATAAALEAAAGAGKLVDTNAPVDGGTESNLLHNFEL 622
           A     +  A E+ G  +G AGV AT AALEAAA AGKL+DT AP+  G E N LHNFEL
Sbjct: 541 AA----VAAAVETNGADAG-AGVEATEAALEAAAEAGKLIDTRAPIGQGAEHNQLHNFEL 600

Query: 623 PISLTVLQKFGGALGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL 680
            +SLT LQK+GG  G Q  F+ARKQ  EATVG +V ENDG+RH LTLTVSDYKRRRGLL
Sbjct: 601 SLSLTALQKYGGPQGVQKAFMARKQLTEATVGCRVAENDGKRHGLTLTVSDYKRRRGLL 643
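
The percentages in an HSP header follow directly from the reported counts divided by the alignment length. As a minimal sketch (the function name `hsp_percentages` is hypothetical, and the regular expression is only an assumption about this page's header layout), the values for the Q9C950 hit can be recomputed like this:

```python
import re

def hsp_percentages(header):
    """Return (identity %, positives %) recomputed from the raw counts."""
    # 'Pos\w+' tolerates spelling variants of the positives label.
    m = re.search(r"Identity = (\d+)/(\d+).*?Pos\w+ = (\d+)/(\d+)", header)
    if m is None:
        raise ValueError("not an HSP header line")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

header = "Identity = 456/659 (69.20%), Positives = 533/659 (80.88%), Query Frame = 0"
print(hsp_percentages(header))  # (69.2, 80.88)
```

The recomputed values match the 69.20% identity and 80.88% positives reported for this alignment.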

BLAST of CmaCh16G006620 vs. ExPASy Swiss-Prot
Match: Q92541 (RNA polymerase-associated protein RTF1 homolog OS=Homo sapiens OX=9606 GN=RTF1 PE=1 SV=4)

HSP 1 Score: 107.8 bits (268), Expect = 4.8e-22
Identity = 181/697 (25.97%), Positives = 301/697 (43.19%), Query Frame = 0

Query: 25  DLENLLLEAAGRTKASGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDERGYAS--RKPS 84
           +L+  LL  A R ++    +   PP       S   A SDS   DSDDE  + S   K  
Sbjct: 63  NLDQELLSLAKRKRSDSEEK--EPPV------SQPAASSDSETSDSDDEWTFGSNKNKKK 122

Query: 85  GSQVPLKKR------------LDPNERDDDVGS--PEEGEDEDVGSEHEGDSSDESD--- 144
           G    ++K+               +++D    S  PEEGE  D  S     SSD      
Sbjct: 123 GKARKIEKKGTMKKQANKTASSGSSDKDSSAESSAPEEGEVSDSDSNSSSSSSDSDSSSE 182

Query: 145 -------VGDDLYKDDDDRRKLAGMSELQREMILSDRASK----KNDKHLYESLRSKMDK 204
                   G+DL  D++DR +L  M+E +RE  L +R  K    K    + + L++   K
Sbjct: 183 DEEFHDGYGEDLMGDEEDRARLEQMTEKEREQELFNRIEKREVLKRRFEIKKKLKTAKKK 242

Query: 205 GKAAPSRKENPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQD--PEAHRKLRDTS 264
            K    +K+       ++     S   +  K     E R+KR ++ D   +A  +L+   
Sbjct: 243 EKKEKKKKQEEEQEKKKLTQIQESQVTSHNK-----ERRSKRDEKLDKKSQAMEELKAER 302

Query: 265 RGNANNRRFSPTKRKPFTAPSLSSSSQSESESRFQSDDEGSTGDGGMIDSDDER-TMPGS 324
               N       K++P     + S  + E E    S+    +      D ++E+  +P  
Sbjct: 303 EKRKNRTAELLAKKQPLKTSEVYSDDEEEEEDDKSSEKSDRSSRTSSSDEEEEKEEIPPK 362

Query: 325 NGPTF--DDIKEITIRRSKLAKWLMEPFFEELIVGCFVRVGIGRSRSGPIYRLCLVRNVD 384
           + P    +++  + + R KL +W   PFF + + GCFVR+GIG   S P+YR+  +  V 
Sbjct: 363 SQPVSLPEELNRVRLSRHKLERWCHMPFFAKTVTGCFVRIGIGNHNSKPVYRVAEITGV- 422

Query: 385 ATEPDRQYKLENKITHKYLNVIWGNESSAARWQMAMVSDSVPLEDEYKQWVKEVERTGGR 444
             E  + Y+L    T+K L +  GN+    R  +  VS+    E E+ +W KE   + G 
Sbjct: 423 -VETAKVYQLGGTRTNKGLQLRHGNDQRVFR--LEFVSNQEFTESEFMKW-KEAMFSAGM 482

Query: 445 MLSKQDILEKKE-AIQKANNFVYSAATVKQMLQEKKFASSRPLNIAAEKDRLRNQMDVAL 504
            L   D + KKE +I++A N+ ++   ++++++EK+     P N A +K +L  +  +A 
Sbjct: 483 QLPTLDEINKKELSIKEALNYKFNDQDIEEIVKEKERFRKAPPNYAMKKTQLLKEKAMAE 542

Query: 505 SKNNEAEVERIKAKLRQL----DASRRSQMKDAKAIRLSEMNRKNRVENFKNASELRPTK 564
              ++ + ++I+ +L +L    +A  R + K+  AI  S +N++NR  N   + +    +
Sbjct: 543 DLGDQDKAKQIQDQLNELEERAEALDRQRTKNISAI--SYINQRNREWNIVESEKALVAE 602

Query: 565 DLKAGEAGYDPFSRRWTRSRNYYVSNAGQVDGAAEAAGNSDNITPASESTGTGSGEAGVA 624
                    DPF+RR  + +   VSN+   D A +AA        A  +   GSG     
Sbjct: 603 SHNMKNQQMDPFTRR--QCKPTIVSNSR--DPAVQAA------ILAQLNAKYGSG----V 662

Query: 625 ATAAALEAAAGAGKLVDTNAPVDGGTESNL--LHNFELPISLTVLQKFGGALGAQAGFLA 680
              A  E + G GK  D N+        +L  +H+F++ I L V      AL        
Sbjct: 663 LPDAPKEMSKGQGKDKDLNSKSASDLSEDLFKVHDFDVKIDLQVPSSESKAL-------- 710

BLAST of CmaCh16G006620 vs. ExPASy Swiss-Prot
Match: A2AQ19 (RNA polymerase-associated protein RTF1 homolog OS=Mus musculus OX=10090 GN=Rtf1 PE=1 SV=1)

HSP 1 Score: 107.8 bits (268), Expect = 4.8e-22
Identity = 181/697 (25.97%), Positives = 305/697 (43.76%), Query Frame = 0

Query: 25  DLENLLLEAAGRTKASGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDERGYAS--RKPS 84
           +L+  LL  A R ++    +   PP       S   A SDS   DSDDE  + S   K  
Sbjct: 68  NLDQELLSLAKRKRSDSEEK--EPPV------SQPAASSDSETSDSDDEWTFGSNKNKKK 127

Query: 85  GSQVPLKKR------------LDPNERDDDVGS--PEEGEDED-----VGSEHEGDSSDE 144
           G    ++K+               ++RD    S  PEEGE  D       S  + DSS E
Sbjct: 128 GKTRKVEKKGAMKKQANKAASSGSSDRDSSAESSAPEEGEVSDSESSSSSSSSDSDSSSE 187

Query: 145 SD-----VGDDLYKDDDDRRKLAGMSELQREMILSDRASK----KNDKHLYESLRSKMDK 204
            +      G+DL  D++DR +L  M+E +RE  L +R  K    K    + + L++   K
Sbjct: 188 DEEFHDGYGEDLMGDEEDRARLEQMTEKEREQELFNRIEKREVLKRRFEIKKKLKTAKKK 247

Query: 205 GKAAPSRKENPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQD--PEAHRKLRDTS 264
            K    +K+       ++     S   +  K     E R+KR ++ D   +A  +L+   
Sbjct: 248 EKKEKKKKQEEEQEKKKLTQIQESQVTSHNK-----ERRSKRDEKLDKKSQAMEELKAER 307

Query: 265 RGNANNRRFSPTKRKPF-TAPSLSSSSQSESESRFQSDDEGSTGDGGMIDSDDERTMPGS 324
               N       K++P  T+   S   + E + +     + S+      + +++  +P  
Sbjct: 308 EKRKNRTAELLAKKQPLKTSEVYSDDEEEEDDDKSSEKSDRSSRTSSSDEEEEKEEIPPK 367

Query: 325 NGPTF--DDIKEITIRRSKLAKWLMEPFFEELIVGCFVRVGIGRSRSGPIYRLCLVRNVD 384
           + P    +++  + + R KL +W   PFF + + GCFVR+GIG   S P+YR+  +  V 
Sbjct: 368 SQPVSLPEELNRVRLSRHKLERWCHMPFFAKTVTGCFVRIGIGNHNSKPVYRVAEITGV- 427

Query: 385 ATEPDRQYKLENKITHKYLNVIWGNESSAARWQMAMVSDSVPLEDEYKQWVKEVERTGGR 444
             E  + Y+L    T+K L +  GN+    R  +  VS+    E E+ +W KE   + G 
Sbjct: 428 -VETAKVYQLGGTRTNKGLQLRHGNDQRVFR--LEFVSNQEFTESEFMKW-KEAMFSAGM 487

Query: 445 MLSKQDILEKKE-AIQKANNFVYSAATVKQMLQEKKFASSRPLNIAAEKDRLRNQMDVAL 504
            L   D + KKE +I++A N+ ++   ++++++EK+     P N A +K +L  +  +A 
Sbjct: 488 QLPTLDEINKKELSIKEALNYKFNDQDIEEIVKEKERFRKAPPNYAMKKTQLLKEKAMAE 547

Query: 505 SKNNEAEVERIKAKLRQL----DASRRSQMKDAKAIRLSEMNRKNRVENFKNASELRPTK 564
              ++ + ++I+ +L +L    +A  R + K+  AI  S +N++NR  N   + +    +
Sbjct: 548 DLGDQDKAKQIQDQLNELEERAEALDRQRTKNISAI--SYINQRNREWNIVESEKALVAE 607

Query: 565 DLKAGEAGYDPFSRRWTRSRNYYVSNAGQVDGAAEAAGNSDNITPASESTGTGSGEAGVA 624
                    DPF+RR  + +   VSN+   D A +AA        A  +   GSG     
Sbjct: 608 SHNMRNQQMDPFTRR--QCKPTIVSNSR--DPAVQAA------ILAQLNAKYGSG----V 667

Query: 625 ATAAALEAAAGAGKLVDTNAPVDGGTESNL--LHNFELPISLTVLQKFGGALGAQAGFLA 680
              A  E + G GK  D N+        +L  +H+F++ I L V      AL        
Sbjct: 668 LPDAPKEMSKGQGKDKDLNSKTASDLSEDLFKVHDFDVKIDLQVPSSESKAL-------- 715

BLAST of CmaCh16G006620 vs. ExPASy Swiss-Prot
Match: Q5RAD5 (RNA polymerase-associated protein RTF1 homolog (Fragment) OS=Pongo abelii OX=9601 GN=RTF1 PE=2 SV=2)

HSP 1 Score: 94.0 bits (232), Expect = 7.1e-18
Identity = 165/648 (25.46%), Positives = 283/648 (43.67%), Query Frame = 0

Query: 29  LLLEAAGRTKASGRNRHSHPPSRRQREGSYSD----------AGSDSRDDDSDDERGYAS 88
           +++++      S  N     PS  +R+ S S+          A SDS   DSDDE  + S
Sbjct: 44  VVIDSDTEDSGSDENLDQELPSLAKRKRSDSEEKEPPVSQPAASSDSETSDSDDEWTFGS 103

Query: 89  --RKPSGSQVPLKKR------------LDPNERDDDVGS--PEEGEDEDVGSEHEGDSSD 148
              K  G    ++K+               +++D    S  PEEGE  D  S     SSD
Sbjct: 104 NKNKKKGKARKIEKKGTMKKQANKTASSGSSDKDSSAESSAPEEGEVSDSDSNSSSSSSD 163

Query: 149 ESD----------VGDDLYKDDDDRRKLAGMSELQREMILSDRASK----KNDKHLYESL 208
                         G+DL  D++DR +L  M+E +RE  L +R  K    K    + + L
Sbjct: 164 SDSSSEDEEFHDGYGEDLMGDEEDRARLEQMTEKEREQELFNRIEKREVLKRRFEIKKKL 223

Query: 209 RSKMDKGKAAPSRKENPPLPSSRIRSSARSADRAAAKDDALNELRAKRLKQQD--PEAHR 268
           ++   K K    +K+       ++     S   +  K     E R+KR ++ D   +A  
Sbjct: 224 KTAKKKEKKEKKKKQEEEQEKKKLTQIQESQVTSHNK-----ERRSKRDEKLDKKSQAME 283

Query: 269 KLRDTSRGNANNRRFSPTKRKPFTAPSLSSSSQSESESRFQSDDEGSTGDGGMIDSDDER 328
           +L+       N       K++P     + S  + E E    S+    +      D ++E+
Sbjct: 284 ELKAEREKRKNRTVELLAKKQPLKTSEVYSDDEEEEEDDKSSEKSDRSSRTSSSDEEEEK 343

Query: 329 -TMPGSNGPTF--DDIKEITIRRSKLAKWLMEPFFEELIVGCFVRVGIGRSRSGPIYRLC 388
             +P  + P    +++  + + R KL +W   PFF + + GCFVR+GIG   S P+YR+ 
Sbjct: 344 EEIPPKSQPVSLPEELNRVRLSRHKLERWCHMPFFAKTVTGCFVRIGIGNHNSKPVYRVA 403

Query: 389 LVRNVDATEPDRQYKLENKITHKYLNVIWGNESSAARWQMAMVSDSVPLEDEYKQWVKEV 448
            +  V   E  + Y+L    T+K L +  GN+    R  +  VS+    E E+ +W KE 
Sbjct: 404 EITGV--VETAKVYQLGGTRTNKGLQLRHGNDQRVFR--LEFVSNQEFTESEFMKW-KEA 463

Query: 449 ERTGGRMLSKQDILEKKE-AIQKANNFVYSAATVKQMLQEKKFASSRPLNIAAEKDRLRN 508
             + G  L   D + KKE +I++A N+ ++   ++++++EK+     P N A +K +L  
Sbjct: 464 MFSAGMQLPTLDEINKKELSIKEALNYKFNDQDIEEIVKEKERFRKAPPNYAMKKTQLLK 523

Query: 509 QMDVALSKNNEAEVERIKAKLRQL----DASRRSQMKDAKAIRLSEMNRKNRVENFKNAS 568
           +  +A    ++ + ++I+ +L +L    +A  R + K+  AI  S +N++NR  N   + 
Sbjct: 524 EKAMAEDLGDQDKAKQIQDQLNELEERAEALDRQRTKNISAI--SYINQRNREWNIVESE 583

Query: 569 ELRPTKDLKAGEAGYDPFSRRWTRSRNYYVSNAGQVDGAAEAAGNSDNITPASESTGTGS 625
           +    +         DPF+RR  + +   VSN+   D A +AA        A  +   GS
Sbjct: 584 KALVAESHNMKNQQMDPFTRR--QCKPTIVSNSR--DPAVQAA------ILAQLNAKYGS 643

BLAST of CmaCh16G006620 vs. ExPASy Swiss-Prot
Match: G5EBY0 (RNA polymerase-associated protein RTF1 homolog OS=Caenorhabditis elegans OX=6239 GN=rtfo-1 PE=2 SV=1)

HSP 1 Score: 79.3 bits (194), Expect = 1.8e-13
Identity = 144/599 (24.04%), Positives = 258/599 (43.07%), Query Frame = 0

Query: 67  DDDSDDERGYASRKPSGSQVPLKKRLDPNERDDDVGSPE----EGEDEDVGSEHEGDSSD 126
           D DSD + G    KP  +        D +  D D   P+    + +           SSD
Sbjct: 22  DSDSDSDAGPKPGKPLST--------DSSASDSDAEKPQAKPAKKKTLTKRKRRATGSSD 81

Query: 127 ESDVGDDLYKDDDDRRKLAGMSELQREMILSDR--------------------ASKKNDK 186
           +  V DDL+ D +D+ +   ++EL++E  + +R                    A K ++K
Sbjct: 82  DDQVDDDLFADKEDKARWKKLTELEKEQEIFERMEARENAIAREEIAQQLAKKAKKSSEK 141

Query: 187 HLYESLRSKMDKG---KAAPSRKENPPLPSSRIRSSARSAD--RAAAKDDALNELRAKRL 246
            +    R KM+ G     +P RK +    S    +  R +D  R   + +A++ L+ KR 
Sbjct: 142 GVKTEKRRKMNSGGSDAGSPKRKASSDSDSEMDAAFHRPSDINRKHKEKNAMDALKNKR- 201

Query: 247 KQQDPEAHRKLRDTSRGNANNRRFSPTKRKPFTAPSLSSSSQSESESRFQSDDEGSTGDG 306
                      ++  + NA N   S        + S SSSS SES     S  E S    
Sbjct: 202 -----------KEIEKKNAKNEALSIDAVFGANSGSSSSSSSSESSRSSSSSRESSPERV 261

Query: 307 GMIDSDDERTMPGSNGPTFDDIKEITIRRSKLAKWLMEPFFEELIVGCFVRVGIGR-SRS 366
              D   ++ + G       +++   + R KL+  +  PFF+  +VGC+VR+G G+ S S
Sbjct: 262 SEKDKIVKKDVDG-----LSELRRARLSRHKLSLMIHAPFFDSTVVGCYVRLGQGQMSGS 321

Query: 367 GPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNESSAARWQMAMVSDSVPLEDEY 426
           G  YR+  +  V+  E ++ Y+LE K T+K +     N  S   ++M  VS++   + E+
Sbjct: 322 GSKYRIWKIVGVE--ESNKVYELEGKKTNKIIKC--QNGGSERPFRMQFVSNADFEQIEF 381

Query: 427 KQWVKEVERTGGRMLSKQDILE-KKEAIQKANNFVYSAATVKQMLQEKKFASSRPLNIAA 486
            +W+   +R G   L   DI++ KK+ I+KA N  YS   V  M++EK    + P N A 
Sbjct: 382 DEWLLACKRHGN--LPTVDIMDKKKQDIEKAINHKYSDKEVDLMIKEKSKYQTVPRNFAM 441

Query: 487 EKDRLRNQMDVALSKNNEAEVERIKAKL----RQLDASRRSQMKDAKAIRLSEMNRKNRV 546
            K     Q ++A  + +  E E+I+ K+    RQ D   + + K   AI       ++++
Sbjct: 442 TKANWSKQKELAQQRGDIREAEQIQTKIDEIERQADELEKERSKSISAIAFINHRNRSKI 501

Query: 547 ENFKNASELRPTKDLKAGEAGYDPFSRRWTRSRNYYVSNAGQVDGAAEAAGNSDNITPAS 606
           ++   + +L+  ++ +      DPF+R+    R     +  ++DG   A+ ++ N++   
Sbjct: 502 KDQVLSGQLKIEENSQD-----DPFTRKKGGMR-VVSGSKSRLDGTLSASSSTTNLSDGG 561

Query: 607 ESTGTGSGEAGVAATAAALEAAAGAGKLVDTNAPVDGGTESNLLHNFELPISLTVLQKF 631
           +   +   +      +  ++                  T+ + LH+F+L I L  L+ F
Sbjct: 562 KDKSSSLAKPTQPPPSTQIKKK----------------TDISSLHDFDLDIDLGKLKDF 567

BLAST of CmaCh16G006620 vs. TAIR 10
Match: AT1G61040.1 (plus-3 domain-containing protein )

HSP 1 Score: 800.0 bits (2065), Expect = 1.4e-231
Identity = 456/659 (69.20%), Positives = 533/659 (80.88%), Query Frame = 0

Query: 23  MADLENLLLEAAGRTKASGRNRHSHPPSRRQREGSYSDAGSDSRDDDSDDERGYASRKPS 82
           M DLENLLLEAAGRT ++GR+R  HPPS R+REGSYSD  SDSR DDSD++RGYASRKPS
Sbjct: 1   MGDLENLLLEAAGRTNSAGRSR--HPPSSRRREGSYSDGSSDSR-DDSDEDRGYASRKPS 60

Query: 83  GSQVPLKKRLDPNERDDDVGSPEEGEDEDVGSEHEGDSSDESDVGDDLYKDDDDRRKLAG 142
           GSQVPLKKRL+  ER+D     E G   D  S+ EGDSS+ESD GDDLYK+++DR+KLAG
Sbjct: 61  GSQVPLKKRLEA-EREDRAARVEGGYG-DGPSDREGDSSEESDFGDDLYKNEEDRQKLAG 120

Query: 143 MSELQREMILSDRASKKNDKHLYESLRSKMDKGKAAPSRKENPPLPSSR-IRSSARSADR 202
           M+E QREMILS+RA KK DK+  E LRSK +  K   S+KE  PLP+SR +RSSARSADR
Sbjct: 121 MTEFQREMILSERADKKGDKNFTEKLRSKRESEKTPVSKKETQPLPASRGVRSSARSADR 180

Query: 203 AAAKDDALNELRAKRLKQQDPEAHRKLRDTSRGNANNRRFSPTKRKPFTAPSLSSSSQSE 262
           AAAKDDALNELRAKR+KQQDP A RKLRD S+G + +R FS TKRKP  + +LSSSSQS+
Sbjct: 181 AAAKDDALNELRAKRMKQQDPAALRKLRDASKGGSGSRDFSSTKRKPLASSNLSSSSQSD 240

Query: 263 SESRFQSDDEGSTGDGGMIDSDDERTMPGSNGPTFDDIKEITIRRSKLAKWLMEPFFEEL 322
           S+SR QSDDEGS  +GGM+DSDD+R    S+ PTF+D+KE+TIRRSKLAKWLMEPFFEEL
Sbjct: 241 SDSRSQSDDEGS--NGGMLDSDDDR----SDVPTFEDVKEVTIRRSKLAKWLMEPFFEEL 300

Query: 323 IVGCFVRVGIGRSRSGPIYRLCLVRNVDATEPDRQYKLENKITHKYLNVIWGNESSAARW 382
           IVGCFVRVGIGRS+SGPIYRLC V+NVDAT+PD+ YKLENK THKYLNV+WGNE+SAARW
Sbjct: 301 IVGCFVRVGIGRSKSGPIYRLCWVKNVDATDPDKTYKLENKTTHKYLNVVWGNETSAARW 360

Query: 383 QMAMVSDSVPLEDEYKQWVKEVERTGGRMLSKQDILEKKEAIQKANNFVYSAATVKQMLQ 442
           QMAM+SD  PLE+EY+QW++EVERT GRM +KQDI EKKEAIQ+ N+FVYSA TVKQMLQ
Sbjct: 361 QMAMISDGHPLEEEYRQWIREVERTNGRMPTKQDISEKKEAIQRTNSFVYSAETVKQMLQ 420

Query: 443 EKKFASSRPLNIAAEKDRLRNQMDVALSKNNEAEVERIKAKLRQLDASRRSQMKDAKAIR 502
           EKK AS RP+N+AAEKDRLR ++++A SKN+EA VERIK+K++QLDASR  +  D KA++
Sbjct: 421 EKKSASVRPMNVAAEKDRLRKELEIAQSKNDEAGVERIKSKIKQLDASRNKKGVDKKALK 480

Query: 503 LSEMNRKNRVENFKNASELRP-TKDLKAGEAGYDPFSRRWTRSRNYYVSNAGQVDGAAEA 562
           L+EMN+KNR ENFKNASE++  T  LKAGEAGYDPFSRRWTRS NYY       DG    
Sbjct: 481 LAEMNKKNRAENFKNASEVKSITASLKAGEAGYDPFSRRWTRSSNYYNGKNKGKDGEENE 540

Query: 563 AGNSDNITPASESTGTGSGEAGVAATAAALEAAAGAGKLVDTNAPVDGGTESNLLHNFEL 622
           A     +  A E+ G  +G AGV AT AALEAAA AGKL+DT AP+  G E N LHNFEL
Sbjct: 541 AA----VAAAVETNGADAG-AGVEATEAALEAAAEAGKLIDTRAPIGQGAEHNQLHNFEL 600

Query: 623 PISLTVLQKFGGALGAQAGFLARKQRIEATVGRQVPENDGRRHALTLTVSDYKRRRGLL 680
            +SLT LQK+GG  G Q  F+ARKQ  EATVG +V ENDG+RH LTLTVSDYKRRRGLL
Sbjct: 601 SLSLTALQKYGGPQGVQKAFMARKQLTEATVGCRVAENDGKRHGLTLTVSDYKRRRGLL 643

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q9C950 | 2.0e-230 | 69.20 | Protein RTF1 homolog OS=Arabidopsis thaliana OX=3702 GN=VIP5 PE=1 SV=1
Q92541 | 4.8e-22 | 25.97 | RNA polymerase-associated protein RTF1 homolog OS=Homo sapiens OX=9606 GN=RTF1 P...
A2AQ19 | 4.8e-22 | 25.97 | RNA polymerase-associated protein RTF1 homolog OS=Mus musculus OX=10090 GN=Rtf1 ...
Q5RAD5 | 7.1e-18 | 25.46 | RNA polymerase-associated protein RTF1 homolog (Fragment) OS=Pongo abelii OX=960...
G5EBY0 | 1.8e-13 | 24.04 | RNA polymerase-associated protein RTF1 homolog OS=Caenorhabditis elegans OX=6239...

Match Name | E-value | Identity (%) | Description
AT1G61040.1 | 1.4e-231 | 69.20 | plus-3 domain-containing protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 457..489
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 559..579
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 99..130
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 151..180
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 50..78
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 235..263
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 564..579
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 35..297
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 198..234
None | No IPR available | PANTHER | PTHR13115:SF15 | PLUS-3 DOMAIN PROTEIN | coord: 23..679
None | No IPR available | PANTHER | PTHR13115 | UNCHARACTERIZED | coord: 23..679
IPR004343 | Plus-3 domain | SMART | SM00719 | rtf1 | coord: 293..405; e-value: 3.0E-44; score: 163.0
IPR004343 | Plus-3 domain | PFAM | PF03126 | Plus-3 | coord: 298..403; e-value: 2.5E-30; score: 105.2
IPR004343 | Plus-3 domain | PROSITE | PS51360 | PLUS3 | coord: 293..428; score: 36.648071
IPR036128 | Plus3-like superfamily | GENE3D | 3.90.70.200 | - | coord: 294..426; e-value: 9.7E-35; score: 121.3
IPR036128 | Plus3-like superfamily | SUPERFAMILY | 159042 | Plus3-like | coord: 295..426
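
The five Plus-3 related hits (IPR004343 and IPR036128) report slightly different boundaries on the protein. One simple way to summarise them, sketched here as an illustration rather than as the method used by InterPro, is to take the region shared by all hits and the region covered by any hit. The dictionary keys are labels chosen for this example.

```python
# Protein coordinates (start, end) copied from the InterPro annotations above.
plus3_hits = {
    "SMART SM00719": (293, 405),
    "Pfam PF03126": (298, 403),
    "PROSITE PS51360": (293, 428),
    "Gene3D 3.90.70.200": (294, 426),
    "SUPERFAMILY 159042": (295, 426),
}

starts = [start for start, _ in plus3_hits.values()]
ends = [end for _, end in plus3_hits.values()]

core = (max(starts), min(ends))       # residues covered by every hit
envelope = (min(starts), max(ends))   # residues covered by at least one hit
print("core:", core, "envelope:", envelope)
```

For these annotations the shared core is residues 298..403 and the overall envelope is residues 293..428.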

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmaCh16G006620.1 | CmaCh16G006620.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
cellular_component | GO:0016593 | Cdc73/Paf1 complex
molecular_function | GO:0003677 | DNA binding
molecular_function | GO:1990269 | RNA polymerase II C-terminal domain phosphoserine binding