Spg020952 (gene) Sponge gourd (cylindrica) v1

Overview
NameSpg020952
Typegene
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRNA polymerase II C-terminal domain phosphatase-like
Locationscaffold9: 4158747 .. 4167238 (+)
RNA-Seq ExpressionSpg020952
SyntenySpg020952
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
TTCTTCTTCACGCCGTCTCATCCTCCGTTTCAGCCGTTCACTGCTGCCGCCGGCTCTCTCCCGTTCACTGCTGCCGCCGCCATGATCGTCTTTCTTCCAATCGTGCCTCCGCAGCCTCCCTTCTAGCAGTTTTACACTGATTTTGTACTCCCTTGTTGGTATTCTCCCAATCTGTCTTTCTTCTCACTTGGAAAATGTAACAACCCTTATTTCAAAAAAAAAAGAAAAAAGAAAACCCTCTCCCCCGATTTCCCTCCCTCTCTCTAATCTCTCTTTCTTCTTGGTCACGAATAGGTAGCGCCGCCACTGCCCCCTCCTCTCGCCTCCGATCGTACTCCGGCGTCCACAGAGAGCGTCGGCAGCACTCCTCCCTCCCATCTCCGGCGTCGTGTGCGTGCGTCCGACGTAGGCAGCAGCGACAGCAAGTTCTTCCGGTGTGGGTTTTGTTTCTATCCGAGCGGCTCCGACGTTGTTTCCTCTCAACCAACAGAGGGTTGTGAACTCGCGCGGGGTGCGTTTTCCGGCAGTATCAGAGCGCGAACAGTAGCGTATGGGCGCGTTTCTGGCAACTCCTTCGCAGACAGCAGCTTTACTTCGCGAGTTCCGGCGGTTTGAGCTGTGGGTCTTCCCCTCTGGCGAGTTTGGACGTTTATTAGGCTCCACCAGCGAGTGGGTAAGTGAGATTTAAAGTAATTTTTGGCCGATTATGCGTTCTAAGTGAATAGCCATTGAGTTTGATTTAAAAATTTAAGGTATCCATAGCGTTTTTGGAGGTCGTAGTCAAAATTGGATCGAATGGGAATCCAACCAGCAAGCATGAAGTTTAATTGGGTGAGCTTTTCGATTTATTAGGTTGCTTTAAATTAGTGGACTTTTGGTTTAATATCCAATGTTAAAGTCTATTGATTTTAGAAATTTTTGTGTAGATCTAAAATTGCTAATTGTGGTCGAGATCGACATCAATTCTGAGTTAATTGATTTTGCATAGGAAGCAATTAGCTGAGGTAAGTAACCTTACCACTGGGATAGTCTTGGCTAGGCTGGACTACTTATTATCTTTTAATTATTTTCTTTTTGGGTGGATATTTCAGATGAGCATTGTGACTAATTCTCCGGCTCACTCGTCGAGCAGTGACGATTTTGCTGCATTTCTTGATGTAGCTCTAGATTCCCATTCCTCTGACTCATCGCCTGATGAAAAGGCTGAGGGTGACAATAATGTTGTTGAAAGTGAGAGGTACCAATAAATTTTTAAAATTAATAAATTTCCATTGATCTTTTAAGTTAATAATGTCTGAAGACAGTGGTTTAGGGTTTTTTGGAATTATAATTTTTTGGTTAATCTAATGTGTTCAATACTTCAATGTAGAATTAAGTGGGAAGTGTTTTATAGTTGCCTGGCTCTCTTTTGAATCCTCATATGGAGAAAAATAGTATAATGGGAGTTATTATGATTAAGAGTATTGGGTTCTTTTTTCAATTTCTGGACCGTCTTTTTTAGAGAATAGATTGTATTTGGTACTGTATGGCATTCCAAGATGCTTGTTGAAATAGGATTTCATGAATGGTTTAACATTACAGATTTCCCATCTGACTTTATTGTGCTAGTCTTATAGCAATTTGTAGATTCTAAGCTTCTAATTAAAAAATCAGTTTTTTTAAAAGAAAAAATTTGAAGAAACGAAATTTCCTAGAATTATGAAAGAATAAGAGCAAACTTATTGGGCCATGGAAAAAATAAAGGGTTTAGAGGGAATAGGTTCAATCCATGATGGCTAGTGTGTCCGTGTACGACACTTGGACACTTGTTGTGTAAAAACGATATATATTTTAAAAAATGTATTTTATTAAACGTGTTCATGCCGTGTTTGTGCCCTAGATTTTAAAAAGATGACATGTCACCATGTCCGTGGCATGTCGTATCCATGTCCTGTGTCCGTATCCATGTTTCTTAGATGGCCACCTTCCTAGGATTTAATATCCTATGTGTTTCCTGGCAACCAAATGTAGTAGGGTCAGACGGTTGTCCCTTGAGATTAGTCGAAGTGCACACAAACTGGTCTGGACACTCAGATATCAAAAAAAGAAAAAAAAAAAAGAAGAGCAAACTGCAAAGCTGCACAAAAGAAGAAGAATATAAAAACAAAGACTAAACATTGACAACAATCCAAATTATTTAGACTCTCAGGTAATGGGTTTAGTCATTTAGTTTCACTTTCATCTTATTATTTTCTTTTATAGATATCCCAAGTGGTCTAGTTTATGAAAGATCAGTTACCACATACGTAATTATTATTTTATATTAAAAGTCTAAAACTATACAGGATTCTGCTCATAACTTCTCATCATCCAATGAGTGCCATTTGTGATTTCTTTGTTTGTAATTTGTATTCTTTTTCCTTTTGCATGAATAACAGCATTGAGTTGTGGCAAGTGGCGACTATCATTTCTGCACATAATAACGAGGGTTTGTAGGATTGGTTATAAGTTATAACCATATAACTAATTTGTGGTAGTTCATCTTCATTTTATCAATTTAACATGTCAAATATTTTTCATTCAATTCATTTATTTTTTTAGGATAAAACGTCGTAAAGTGGAGAAGCTGGAAAACCCAGAGGAGGACATTCTGTATGGAGTTGAAAGGCAAAGTTCAGGTAAATCAAATTGTTCCTACTTCCTACTAACCCCTTTTGTTGTCCCTCTTTTACTGGCTCTACTCTATACTTCCAATGTAGCCAATTTGTGGCTGCCACCTGTACTATAGTTAACTATTTCCTCACTTTTAGACAGCTAACTATACGACGTATTTGCTGTGTCAGCCCCTTTTTCCCACCCTCTCTGTATATTTATAGAGCATTTCACGTCTGACTCTTCTGTTGCATATTGTGTTTTTGTTGGTTGCCTCCTCTTCCCTTCTTATACGTGAATCTGATTGGTGGTTTGGCACCCTTTTTTTTATCCCTCTTTGATTTGAATTGTATGAGCTTCTCTAAGAACCATATCAAATTGGCTCCAAATGAGCTACATATATAAACTTCTACTTATTTGTTTCTTGGGTAAGAAACTATTCCATAGAAAATATGAAATTACCAAGGCAGGAGAAGGAAAGCTGCATCCCTGATAACTAAAAGTGATACATATTCTCTGGCTTTATCTTACCATACGATGGGAGTTGTTCTTATGGTGGCAGTGACTGATCTTTGGTCTAAATGAAAATAGTCAAATTTAGATTCTTGGTTATTCAAGAACTAGGAATTACGGGGATGCTAGATGGATATGCATCTCAAATTTTGTGGCATCTGAATTTATGAACTATTTCAGTCCAGCAGTCCTCTGCCCACCATATATCCCTTATTTTGGTTCCTTTCTTCCTCTAGTATTGTGTCCTAATCAGTTCTGTGTAATTGTAACTTATGAACTTGAAATATAGAGGCCTTCGTGTTAAAGAAAGTATTTGACTACATGTATCTCTGCATTGGGGTTTCTCTATTTTCTGAATTCTGCTGTTATCAATGCAGAAGTATTGTCAAAGCAGCAACTATGCAGTCATCCTGGTTCATTTGGGAATATGTGTATCGTCTGTGTGCAGAGGTTGGATGAGGTATCTGGCGTGACATTTGGGTATATACATAAGGTATATTTTCTTTACATGTATGTGTGATTATCTTCTATTCTTTGTTACGTTACAATGTATATAATGCTTGGAGTGAGGGTTTGGAAGGAACTCAAAAGGCCTTGGAGTTTACAGAATAGGCACCATTTAGAAGAGGAATGAGAAATGTTGTCAAGATTTTTTTGGATAAGAAACAATTTTATCGATGTATGAAAATACAAAAAAGGGGGAAGGAGAAAGCCCCGAACCAATGGAGTTACAATAAACATCTCCAATTAGTAAAAAATGATGTTAAACTATAAGAGGAAAAGAGAGAATTCAACTTACACCAAGATAATGCATGGAAAAGAACATTGTCAAAAAGAAATGGATATGGCTTTTCCTTAGATTCAAAAGTTCGATGGTTTCTTTCTATCCAAATCGGTCGTAAGAATGCTCATATTCTGCCAAAGAAGCTCCTTTGTGCTTTAAAATGGATGACCTAAAAGTAGAGATGATAAAAGCATGGGTGGATCCTTTGGAAGAGCGATCTCCCAATTAAAAGCTAATCTCAACTCATTCCGAAAATTTTGAGCTACCGACAACAAATGAAAATATGAGATTGAGTCTCCAGATCGGATTTGCAGAGAATACACCTATTTGGGGATAGAGCCATGTTAGAGAATTTCCTTTGGAGTTTGTCTATAGTCTTTAAAACTGAGTGGTCAAATTCCCAAATGAAGAATTTAATCTTTTTTGGGTAGGCATCAGTCCATATGACCTCTGTATAGTGGCAAGGCTTGTCTTTTGTCACCGAAACTGATAGACCACCAATCAGACACGTGTATAAGAGGAGAAGAAAGAAGAACGATGAAGGAACTAGTACTTAGTTAGTTATAAGGCTGTTAGTATGGTTAGGTTGTTTGCACCAAGTGGTTGCACCAAGTGGAACGATATTTTGTTGTTAATCGTTTGGCGGATTCGGAGAAAGTAGAAGCCGCAGTGCTTTGTTTAGATAGTGTAGCGCTCAAATGGCATCATTATGAGGAGAAACGACAACCGATTGCGACTTGGGAGGAGTTTCGCCGTCTTCTTCTTCGACGTTTTCTTCTGACCAAAGAGGGTACTCTACATGAAAGATTTTTTGCGCTCAAGCAGGAGACGTCGGTCGCAAACTACAAAGAGAAATTCAAGGACTATTCGGCTCCTTTGGAAAACCTTGACAATGCTACTTTAGAAGGAAAATTTGTAGATGGGTTAAAAGAAGATATCAAGGTGAAGTTGCGTGTGTGTCGACCCATTGGGCTTGAAGCAATTATGGAAACAACCCAAAAGATGGAAGATGAGCTTCTTCATCTTGAGGAGAAGTATGTTGGGGCTCCCTTCAAAGTCTCAAGCCCAAACCGCTAGTCAAGCATCTAGCACATCTAAGCTCGCTTCCTATGGGGCTGGGTCTAATGCCGAACGATTTTGGTAAACTCGAGTAAGCTTGGCACCTCGCTGAGCTCTATTCGAAAAGACTTCTCTGGTAAACAATCTGGTGTGGGTGTTGGGGTTGTGCTTATGCAAGATGGCCATCCTATTGCCTATTTTAGTCATGCTTTAGCTCCTCTGCACAGAAATAAGGCTGTATATGAGAGCGAACTTATGGCAATTGTTATGGCAATCCAAAAGTGGCGACCATATTTGCTTGGAAGAAGATTTTTGGTTCGTACTGATCAGAGTAGTCTCAAGTTTATTCTTGAGCAACGTCTTGTGGCTGGAGAGTATCAGAAGTGGTTGACAAAGATAATGGGTTATGACTTCGATGTAGTTTACAAGCCGGGTGTAGAGAATAAGGTAGCTAATGCCTTATCGAGGGTTCTTAGTGCTATTGAATTTGCAGTAGTAAGTCTCGTTGGGGGATTGAATGCTGGACTTATTCATGATCAACAAGTCCATGATGTGAAGTTAAATGCTATCAGAAGGAAGATTATCAATGGGGATGAGGTTCTTGTGGCTTATACGTTAAAGGGCTCGTTGTTGTATTATAGGGGTAAGATAGTTTTACCCGAGGATTCTCCAACCATTCCACTATTGTTGGAAGCTTTTCACTCATCTCCCGTTGGGGGCCATGGGGGAGTCTTGAAGACGTATCAATGATTAGCTCGAGAAGTTTATTGGGTTGGTATGAAGGCTCGAGTAAAGGTATTTGTGGCTGAATGTTCTGTTTGCCAACAAGCCAAGTACCTTGCTTTGGCTCCAGCCAGTTTGCTTCAATTGTTGTCTATTCCTGATCACATTTGGGAGGATATTTCAATGGATTTTGTTATTGGGATGCCGAGAGCTGATACATTCGATTCAGTTTTTGTGGTAGTGGATCGTTTATCAAAATATGCTCACTTTATACCGCTGAAGCATCCATTTAGTGCAGTATCAATTGCACAATTATTCATCACGGAGATTGTTCGTCTGCATGGTATTCCAAGGAGCATTGTGTTTGACAGAGATCCGGTATTCACTAGTCTTTTTTGGAAGATCTGTTTAAGTGGCAGGGCACCCAACTAAAGCGCAGTACGACATACCATCCTCAAACAGACGGACAGACTGAGGTGGTCAATCGTGGGTTGGAAACATACCTTCGGTGTTTTGCTATGCATTACCCAACTAAATGGGTAAAATGGCTTCCGTGGGCTGAGTATAGCTATAAAACTTCATTTCACACTAGCTTGAAAGCTACCCCATTTGAGGTAGTTTATGGTCACCCACCTCCTGACATTTTTCCATACGTGGAAAATAGTTCTCCAGTGTCAGTTTGAAGGATAGAGATGCCATGTTGGTGGTGATTGGGCTTGGCATATGCTCAACAATATATGGTTTCCATTGCCAATGCATCTCGATGTCACGTGGGTGACTGGGCATACTTGAAACTTCGCCCATTTCGGCAAGGGACCTTGCTAAAGCATTCTAGTCCTAAACTTGCCTCGAGGTTTGTTGGACCATTTCAAGTGGAGGCTCGCATAGGTGCTGTTGCATACCGTTTAAGGCTGCCAGATGAGGCGCGTATTCATCCACTGTTCCATGTTTCCTAACTTAGAAAGGCAATGGGAAACATGTTTCCAATTACTCCTTTTCCTCCAAACATGCAACCAGATTTTATTTTTCGAATGGAACCTGCGAAGCTACTAGGCATTCGATAGAGTTCAGATGATGCTTCAAGGTTGGAAGTATTAATTTGGTGGGAGGATTTGTCTGTTTCAGAAGCTACTTGGGAGGAAGCATCGTGGATTATAAACCAATTTCTAGACTTTCATCTTGAGGACAAGATGAATCTTTGGGGGCCGGGTAGTGATAAACCAGCAATCAAGCTCGTGTATAAGAGGAGATGGAAGAAGAACGATGAAGGAACTAGTACTTACTACTTAGTTAGTTATATGGCTGTTAGTATGGTTAGGTTGTTAGTCAGTTACTAACCATATCTTCTATATAGCTCTATCTTGTAAGAAGGGGGGATACCTTTTGACTAAGAAGTGTAGTTGGGTAAAGGATGGAGAGAAAGAGGGAACTCTCGAATATTCCCTAGGTTGTACTGAATGTGTTCATAAATAATATCAAGTTCCTATCAGAAACCAAGCTATTGAGCAATGATTTTGTTGAGAATAATCCCCCATTTTCATAAGGCCACACCCAATGATCTTCCATATTGGTTAGATGGATACTGTATAAGATGAAACTCAATTCAGTCCCTTCAACAACTTCTTGTTCTTTGAAATCCCTTATGAAATGCAAATCCCATAAAGCCAATTCTGGATTTCACATATCTTTTACAGCAACCTGCCTATTGGTTAAGGCAAACAAAAGAGGGAAAGCAGTACATAATGGGGAATCACCTAGCCAACGATCGTGCCAGAAATCCGTGTATGCACCATTACCAACCTTTGTCACTATCGAGTCATGAATAAGAGAGATGTTTCATAATTTTCTTCCATGGGCCGTTTGATGAGTGAAATGAGAAACAGCCCGGATGGTTATTACGGTACTTTGTGCCAAATTTGGCATCAATAATCGATCGCCGTAAAGCACCTTTTTCCTTTAGATATCTGCAAATCCATTTAGACAATAGGGTTCGATTTTTGTCTTTTAAACTATAGAGACTCAAGCCTTCGTTCTCCATAGGGAGACAAATGTTGTTCCACTCTATAAGATGCTTGTGATAACCCTTCTTCATGACCACCCCCATAGAAAGTTTTGGAATAATCTCTCAATGGTCTTGACAACCTTGTTTGGCATCTCAAAGAGGGAGATGTAATAGGTTGGGAGGTTTGAAAGTGTTGCCTGAATAAAAGTATGTCTGCCTCTCTTGGAAATGTGGTAATTTTTCATCCATCTAATCTCCTTTGTATTTTTTCCACAATAGGAAACCAAAAAGAAGACAACTTGGAATTGCCATTAAGAGGAAGACCAAGGTAGGAGTTTGGCCACCCTCCAATTTTACAACCGAACATAGATGTTGTGGATGAAAGATAATACTCAATATTAACACCAATAATTTTAGTTTTGTAAATGTTTATGTTTAGACTTGAAGCATTCTCAAATGCATGAACCATATCAAATAAATTGTGTCCTGCCCAACACTCATGTGAGGAGAAAAGTAAGGGATCATCCATAAATTGGAGGTGTGTGATGTTACAACCGTTTGCACCCAGCTCTTCTCATCAATTTCTTGAATGGACAGAGTGATACTAAAGCAGAAAACGGTAAAGAAAAAGAAAAGAGGTCGGAATGTTCTTATGTCGAAGTTATGAAGCTAGATGATTCTAATCCCCTCCTGACTAGTGGTAGAGCCACTCTCAAGGTGAAAAACGAGCAATACATGAAGTTAGGTCAATCTGTTGGGAAGACACTCTTACTGAAATAG

mRNA sequence

ATGGGCGCGTTTCTGGCAACTCCTTCGCAGACAGCAGCTTTACTTCGCGAGTTCCGGCGGTTTGAGCTGTGGGTCTTCCCCTCTGGCGAGTTTGGACGTTTATTAGGCTCCACCAGCGAGTGGATGAGCATTGTGACTAATTCTCCGGCTCACTCGTCGAGCAGTGACGATTTTGCTGCATTTCTTGATGTAGCTCTAGATTCCCATTCCTCTGACTCATCGCCTGATGAAAAGGCTGAGGGTGACAATAATGTTGTTGAAAGTGAGAGGATAAAACGTCGTAAAGTGGAGAAGCTGGAAAACCCAGAGGAGGACATTCTGTATGGAGTTGAAAGGCAAAGTTCAGAAGTATTGTCAAAGCAGCAACTATGCAGTCATCCTGGTTCATTTGGGAATATGTGTATCGTCTGTGTGCAGAGGTTGGATGAGGTATCTGGCGTGACATTTGGGTATATACATAAGCAACCTGCCTATTGGTTAAGGCAAACAAAAGAGGGAAAGCAGTACATAATGGGGAATCACCTAGCCAACGATCGTGCCAGAAATCCGTCTCTTCTCATCAATTTCTTGAATGGACAGAGTGATACTAAAGCAGAAAACGGTAAAGAAAAAGAAAAGAGGTCGGAATGTTCTTATGTCGAAGTTATGAAGCTAGATGATTCTAATCCCCTCCTGACTAGTGGTAGAGCCACTCTCAAGGTGAAAAACGAGCAATACATGAAGTTAGGTCAATCTGTTGGGAAGACACTCTTACTGAAATAG

Coding sequence (CDS)

ATGGGCGCGTTTCTGGCAACTCCTTCGCAGACAGCAGCTTTACTTCGCGAGTTCCGGCGGTTTGAGCTGTGGGTCTTCCCCTCTGGCGAGTTTGGACGTTTATTAGGCTCCACCAGCGAGTGGATGAGCATTGTGACTAATTCTCCGGCTCACTCGTCGAGCAGTGACGATTTTGCTGCATTTCTTGATGTAGCTCTAGATTCCCATTCCTCTGACTCATCGCCTGATGAAAAGGCTGAGGGTGACAATAATGTTGTTGAAAGTGAGAGGATAAAACGTCGTAAAGTGGAGAAGCTGGAAAACCCAGAGGAGGACATTCTGTATGGAGTTGAAAGGCAAAGTTCAGAAGTATTGTCAAAGCAGCAACTATGCAGTCATCCTGGTTCATTTGGGAATATGTGTATCGTCTGTGTGCAGAGGTTGGATGAGGTATCTGGCGTGACATTTGGGTATATACATAAGCAACCTGCCTATTGGTTAAGGCAAACAAAAGAGGGAAAGCAGTACATAATGGGGAATCACCTAGCCAACGATCGTGCCAGAAATCCGTCTCTTCTCATCAATTTCTTGAATGGACAGAGTGATACTAAAGCAGAAAACGGTAAAGAAAAAGAAAAGAGGTCGGAATGTTCTTATGTCGAAGTTATGAAGCTAGATGATTCTAATCCCCTCCTGACTAGTGGTAGAGCCACTCTCAAGGTGAAAAACGAGCAATACATGAAGTTAGGTCAATCTGTTGGGAAGACACTCTTACTGAAATAG

Protein sequence

MGAFLATPSQTAALLREFRRFELWVFPSGEFGRLLGSTSEWMSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCVQRLDEVSGVTFGYIHKQPAYWLRQTKEGKQYIMGNHLANDRARNPSLLINFLNGQSDTKAENGKEKEKRSECSYVEVMKLDDSNPLLTSGRATLKVKNEQYMKLGQSVGKTLLLK
Homology
BLAST of Spg020952 vs. NCBI nr
Match: XP_038890381.1 (RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Benincasa hispida] >XP_038890382.1 RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Benincasa hispida])

HSP 1 Score: 193.4 bits (490), Expect = 2.4e-45
Identity = 101/113 (89.38%), Postives = 104/113 (92.04%), Query Frame = 0

Query: 42  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLEN 101
           MS+ TNSPAHSSSSDDFAAFLDVALDSHSSDSSP EKAEGDNN  ESERIKRRKVEKLEN
Sbjct: 1   MSLATNSPAHSSSSDDFAAFLDVALDSHSSDSSPYEKAEGDNN-AESERIKRRKVEKLEN 60

Query: 102 PEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCVQRLDEVSGVTFGYIHK 155
            EEDILYGVE QSSE +SKQQLCSHPGSFGNMCI+C QRLDE SGVTFGYIHK
Sbjct: 61  SEEDILYGVEEQSSEAISKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHK 112

BLAST of Spg020952 vs. NCBI nr
Match: KAG7037160.1 (RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 190.3 bits (482), Expect = 2.1e-44
Identity = 101/122 (82.79%), Postives = 107/122 (87.70%), Query Frame = 0

Query: 33  RLLGSTSEWMSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIK 92
           +L  +    MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEG NN VE+ERIK
Sbjct: 11  QLRSAVGRVMSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNN-VETERIK 70

Query: 93  RRKVEKLENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCVQRLDEVSGVTFGYI 152
           R KVEKLEN  EDILYGVE  SSEVLSKQQLCSHPGSFGNMCI+C QRLDE SGVTFGYI
Sbjct: 71  RHKVEKLENSGEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYI 130

Query: 153 HK 155
           HK
Sbjct: 131 HK 131

BLAST of Spg020952 vs. NCBI nr
Match: KAG6607512.1 (RNA polymerase II C-terminal domain phosphatase-like 4, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 190.3 bits (482), Expect = 2.1e-44
Identity = 101/122 (82.79%), Postives = 107/122 (87.70%), Query Frame = 0

Query: 33  RLLGSTSEWMSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIK 92
           +L  +    MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEG NN VE+ERIK
Sbjct: 274 QLRSAVGRVMSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNN-VETERIK 333

Query: 93  RRKVEKLENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCVQRLDEVSGVTFGYI 152
           R KVEKLEN  EDILYGVE  SSEVLSKQQLCSHPGSFGNMCI+C QRLDE SGVTFGYI
Sbjct: 334 RHKVEKLENSGEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYI 393

Query: 153 HK 155
           HK
Sbjct: 394 HK 394

BLAST of Spg020952 vs. NCBI nr
Match: XP_023525838.1 (RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 189.9 bits (481), Expect = 2.7e-44
Identity = 100/113 (88.50%), Postives = 104/113 (92.04%), Query Frame = 0

Query: 42  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLEN 101
           MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEG NN VE+ERIKR KVEKLEN
Sbjct: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNN-VETERIKRHKVEKLEN 60

Query: 102 PEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCVQRLDEVSGVTFGYIHK 155
             EDILYGVE  SSEVLSKQQLCSHPGSFGNMCI+C QRLDE SGVTFGYIHK
Sbjct: 61  SGEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHK 112

BLAST of Spg020952 vs. NCBI nr
Match: XP_022133134.1 (RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Momordica charantia])

HSP 1 Score: 189.1 bits (479), Expect = 4.6e-44
Identity = 101/116 (87.07%), Postives = 107/116 (92.24%), Query Frame = 0

Query: 42  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL-- 101
           MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEGDNN VESER+KRRKVE+L  
Sbjct: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPEEKAEGDNN-VESERMKRRKVEELEG 60

Query: 102 -ENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCVQRLDEVSGVTFGYIHK 155
            E P+EDI YGVE QSSEVLSKQQLCSHPGSFGNMCI+C QRLDE SGVTFGYIHK
Sbjct: 61  SEEPQEDISYGVEEQSSEVLSKQQLCSHPGSFGNMCIMCGQRLDEESGVTFGYIHK 115

BLAST of Spg020952 vs. ExPASy Swiss-Prot
Match: Q00IB6 (RNA polymerase II C-terminal domain phosphatase-like 4 OS=Arabidopsis thaliana OX=3702 GN=CPL4 PE=1 SV=1)

HSP 1 Score: 84.3 bits (207), Expect = 2.1e-15
Identity = 56/116 (48.28%), Postives = 72/116 (62.07%), Query Frame = 0

Query: 42  MSIVTNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PDEKAEGDNNVVESERIKRRKVEKL 101
           MS+ ++SP H SSSSDD AAFLD  LDS S  SS P E+ E +++V     +KR+K+E L
Sbjct: 1   MSVASDSPVHSSSSSDDLAAFLDAELDSASDASSGPSEEEEAEDDV--ESGLKRQKLEHL 60

Query: 102 ENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCVQRLDEVSGVTFGYIHKQ 156
           E               E  S +  C HPGSFGNMC VC Q+L+E +GV+F YIHK+
Sbjct: 61  E---------------EASSSKGECEHPGSFGNMCFVCGQKLEE-TGVSFRYIHKE 98

BLAST of Spg020952 vs. ExPASy TrEMBL
Match: A0A6J1BUF9 (RNA polymerase II C-terminal domain phosphatase-like OS=Momordica charantia OX=3673 GN=LOC111005808 PE=4 SV=1)

HSP 1 Score: 189.1 bits (479), Expect = 2.2e-44
Identity = 101/116 (87.07%), Postives = 107/116 (92.24%), Query Frame = 0

Query: 42  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL-- 101
           MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEGDNN VESER+KRRKVE+L  
Sbjct: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPEEKAEGDNN-VESERMKRRKVEELEG 60

Query: 102 -ENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCVQRLDEVSGVTFGYIHK 155
            E P+EDI YGVE QSSEVLSKQQLCSHPGSFGNMCI+C QRLDE SGVTFGYIHK
Sbjct: 61  SEEPQEDISYGVEEQSSEVLSKQQLCSHPGSFGNMCIMCGQRLDEESGVTFGYIHK 115

BLAST of Spg020952 vs. ExPASy TrEMBL
Match: A0A6J1BV42 (RNA polymerase II C-terminal domain phosphatase-like OS=Momordica charantia OX=3673 GN=LOC111005808 PE=4 SV=1)

HSP 1 Score: 189.1 bits (479), Expect = 2.2e-44
Identity = 101/116 (87.07%), Postives = 107/116 (92.24%), Query Frame = 0

Query: 42  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL-- 101
           MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEGDNN VESER+KRRKVE+L  
Sbjct: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPEEKAEGDNN-VESERMKRRKVEELEG 60

Query: 102 -ENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCVQRLDEVSGVTFGYIHK 155
            E P+EDI YGVE QSSEVLSKQQLCSHPGSFGNMCI+C QRLDE SGVTFGYIHK
Sbjct: 61  SEEPQEDISYGVEEQSSEVLSKQQLCSHPGSFGNMCIMCGQRLDEESGVTFGYIHK 115

BLAST of Spg020952 vs. ExPASy TrEMBL
Match: A0A6J1ID30 (RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita maxima OX=3661 GN=LOC111471991 PE=4 SV=1)

HSP 1 Score: 187.6 bits (475), Expect = 6.5e-44
Identity = 99/113 (87.61%), Postives = 103/113 (91.15%), Query Frame = 0

Query: 42  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLEN 101
           MS+VTNSPAHSSSSDDFAAFLDVALDSHSSDS P+EKAEG NN VE+ERIKR KVEKLEN
Sbjct: 1   MSLVTNSPAHSSSSDDFAAFLDVALDSHSSDSLPNEKAEGHNN-VETERIKRHKVEKLEN 60

Query: 102 PEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCVQRLDEVSGVTFGYIHK 155
             EDILYGVE  SSEVLSKQQLCSHPGSFGNMCI+C QRLDE SGVTFGYIHK
Sbjct: 61  SGEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHK 112

BLAST of Spg020952 vs. ExPASy TrEMBL
Match: A0A6J1GC38 (RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita moschata OX=3662 GN=LOC111452801 PE=4 SV=1)

HSP 1 Score: 186.0 bits (471), Expect = 1.9e-43
Identity = 99/113 (87.61%), Postives = 103/113 (91.15%), Query Frame = 0

Query: 42  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKLEN 101
           MS+VTNS AHSSSSDDFAAFLDVALDSHSSDSSP+EKAEG NN VE+ERIKR KVEKLEN
Sbjct: 1   MSLVTNSLAHSSSSDDFAAFLDVALDSHSSDSSPNEKAEGHNN-VETERIKRHKVEKLEN 60

Query: 102 PEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCVQRLDEVSGVTFGYIHK 155
             EDILYGVE  SSEVLSKQQLCSHPGSFGNMCI+C QRLDE SGVTFGYIHK
Sbjct: 61  SGEDILYGVEEHSSEVLSKQQLCSHPGSFGNMCIICGQRLDEESGVTFGYIHK 112

BLAST of Spg020952 vs. ExPASy TrEMBL
Match: A0A6J1CJQ5 (RNA polymerase II C-terminal domain phosphatase-like OS=Momordica charantia OX=3673 GN=LOC111012040 PE=4 SV=1)

HSP 1 Score: 184.9 bits (468), Expect = 4.2e-43
Identity = 99/116 (85.34%), Postives = 105/116 (90.52%), Query Frame = 0

Query: 42  MSIVTNSPAHSSSSDDFAAFLDVALDSHSSDSSPDEKAEGDNNVVESERIKRRKVEKL-- 101
           MS+VT+SPAHSSSSDDFAAFLDVALDSHSSDSSP+EKAEGDNN VESERIKRRKVEKL  
Sbjct: 1   MSLVTDSPAHSSSSDDFAAFLDVALDSHSSDSSPEEKAEGDNN-VESERIKRRKVEKLEG 60

Query: 102 -ENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCVQRLDEVSGVTFGYIHK 155
            E P+EDI+Y VE QSSEVLSKQQLC HPGSFGNMCI+C QRLD  SGVTFGYIHK
Sbjct: 61  SEEPQEDIMYRVEEQSSEVLSKQQLCGHPGSFGNMCIICGQRLDGESGVTFGYIHK 115

BLAST of Spg020952 vs. TAIR 10
Match: AT5G58003.1 (C-terminal domain phosphatase-like 4 )

HSP 1 Score: 84.3 bits (207), Expect = 1.5e-16
Identity = 56/116 (48.28%), Postives = 72/116 (62.07%), Query Frame = 0

Query: 42  MSIVTNSPAH-SSSSDDFAAFLDVALDSHSSDSS-PDEKAEGDNNVVESERIKRRKVEKL 101
           MS+ ++SP H SSSSDD AAFLD  LDS S  SS P E+ E +++V     +KR+K+E L
Sbjct: 1   MSVASDSPVHSSSSSDDLAAFLDAELDSASDASSGPSEEEEAEDDV--ESGLKRQKLEHL 60

Query: 102 ENPEEDILYGVERQSSEVLSKQQLCSHPGSFGNMCIVCVQRLDEVSGVTFGYIHKQ 156
           E               E  S +  C HPGSFGNMC VC Q+L+E +GV+F YIHK+
Sbjct: 61  E---------------EASSSKGECEHPGSFGNMCFVCGQKLEE-TGVSFRYIHKE 98

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038890381.12.4e-4589.38RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Benincasa his... [more]
KAG7037160.12.1e-4482.79RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita argyrosperma s... [more]
KAG6607512.12.1e-4482.79RNA polymerase II C-terminal domain phosphatase-like 4, partial [Cucurbita argyr... [more]
XP_023525838.12.7e-4488.50RNA polymerase II C-terminal domain phosphatase-like 4 [Cucurbita pepo subsp. pe... [more]
XP_022133134.14.6e-4487.07RNA polymerase II C-terminal domain phosphatase-like 4 isoform X1 [Momordica cha... [more]
Match NameE-valueIdentityDescription
Q00IB62.1e-1548.28RNA polymerase II C-terminal domain phosphatase-like 4 OS=Arabidopsis thaliana O... [more]
Match NameE-valueIdentityDescription
A0A6J1BUF92.2e-4487.07RNA polymerase II C-terminal domain phosphatase-like OS=Momordica charantia OX=3... [more]
A0A6J1BV422.2e-4487.07RNA polymerase II C-terminal domain phosphatase-like OS=Momordica charantia OX=3... [more]
A0A6J1ID306.5e-4487.61RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita maxima OX=3661... [more]
A0A6J1GC381.9e-4387.61RNA polymerase II C-terminal domain phosphatase-like OS=Cucurbita moschata OX=36... [more]
A0A6J1CJQ54.2e-4385.34RNA polymerase II C-terminal domain phosphatase-like OS=Momordica charantia OX=3... [more]
Match NameE-valueIdentityDescription
AT5G58003.11.5e-1648.28C-terminal domain phosphatase-like 4 [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (cylindrica) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR23081:SF28BNAC03G12630D PROTEINcoord: 43..155
IPR039189CTD phosphatase Fcp1PANTHERPTHR23081RNA POLYMERASE II CTD PHOSPHATASEcoord: 43..155

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Spg020952.1Spg020952.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0070940 dephosphorylation of RNA polymerase II C-terminal domain
cellular_component GO:0005634 nucleus
molecular_function GO:0008420 RNA polymerase II CTD heptapeptide repeat phosphatase activity