Sgr022854 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr022854
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: polyadenylation and cleavage factor homolog 4-like isoform X2
Location: tig00000589: 2434851 .. 2436794 (+)
RNA-Seq Expression: Sgr022854
Synteny: Sgr022854
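
The location string in the overview can be split into contig, start, end and strand, which also gives the length of the gene span. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'contig: start .. end (strand)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    contig, start, end, strand = m.groups()
    return contig, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1

contig, start, end, strand = parse_location("tig00000589: 2434851 .. 2436794 (+)")
length = span_length(start, end)
print(contig, start, end, strand, length)
```

Under that assumption the gene spans 1944 bp on the plus strand of tig00000589.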
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCGGCAGAATATCAGCTTCCAAGATGTGGGAAATATTCAACCCCACTCAAGCATCAACCCTTCTTTACCAAGCCGGTCTTCTCCTGCCCACACTCAGTGTACATTCTCAGAGCCAAAGATTGTGGGAGAATCTTCATTAGGTCCTCCATCTCGTGAAAGCCCATCAGCTCTGGTTAAGCTATCTCGGACTAAGGTAGAAGAGACACCATTACCATCTGATCCACTGCCACCTTCATCTCCTACGAATAGTACATCCACTGAAACTTCAAATGTGGTAAACGATGCTTCTAGTCCAATTTCTAACCTTTTGAGCTCACTGGTTGCAAAGGGCCTCATATCTGCTTCAAAAGGAAAAATGACAAATAACGGACGTCCCAGTTGCCGTCACAGCCTGAAAATTTGAAGTCAGGTGATGCTGTGACTAGTTCTATACCAGTTCCTTCCATCCCTGTTTCCTCTTCCAGTCTATCATCTATGAAACCTAAACCACCTTCAGAACCTGCTGCTAAGAGCTCCACTAATCCACCTCCATCAGCCACAACTGAGATAAACAACCTCATAGGCTTTGAATTTAGTTCACATGTTATTCGCAAATTTCATCCCTCTGTGATCAGTGGACTCTTTGATGATATTCCATACCAATGTAAGATCTGTGGTCTTCGACTGAAACTTGAAGAGCAGTTGGATACGCACATGCAGTGGCATGCATTAAGAACTGAGGCAAACAATTCAAATAGGGCACCAAGAAGGTGGTATCCAAGTTCAGATGATTGGGTTTCTGGAAATGCCAGACTTCTACTTGATGCTGTCACTTCTATGGACAAGTCCGACAAAATGGAAGAAGATAATGAGCCAATGGTTCCTGCAGACGAAGATCAATTTGCTTGTGTTTTATGTGGTGAACTTTTTGAAGATTTTTATAATCAAGAGATGGGTAAGTGGATGTTCAAAGGAGCAACGTACATCACCATCCCATCAGCTGGTGGTGAGGTAGGAAGCACAAATGAACAAGGTGCTAGAGGACCCATTGTGCACACAAATTGTGTAACTGAAAGTTCAGTATATGATTTGGGACTGGCAACTGATATTAAGATGGTAATGTTTTTGGTCCTTGATACTTCTGCAACTTGATGTTGGTTTTCACTATCAACCATCGTTCTCTTCAATCATTTTATGTGCTGTCCTAGACAATTTTCTTCTCTTCCTGTTCTTGTTATGGAATAATGGAGTGCCTTTGAACATGACAAAATGATATTTCCGTTTATCTGAGGCCAGTCCAAAGATAGTTTTGAATGCTATGGCATTGGACTGGTAAAGTCGAGGGAAAGATAGTGCCCTATTCTATGGAATTGGATTGGTAGAATCAAGTGTTTGGTTCCCCATTCTTTGGAACAGCTGAATCATAGTACTTGTCATTCCTAGAATCAATCATGAAACAGGTGAATCATAGTTGTTCACTTTTAGTTGTAGCATTCCTAGAATCAATCATGAAACAACTGAATCGTAGTCGTTTTCTTGTTTTAGCTCTTCCATAGTGGGCTCCTTTTACTGACATTTAGGCCCTGTAAAGCAAGTAGAGTGCAATTTTACAATTCAAAATGACATGATTTGTAGCGCCTACTTGTGTTGATATACCTTGTCATGACTGCCATTGCACACTGAAAATATGCTCAAATCTGTATCCCTGCTCTATATTGATGCTTTTCTTGGGGTTGTTGATTCAATATTTGCTTATTGTTAATTTGAGCTTCTTTGCATTATGTCCAACTTTCTACAGGGAATGGATGTATGATGCTTCCTCTGCAATACTTCCTATAGGAACTACACTGGTGGACATCTTGCTCATGCAATTGGCGGGGGGATGCGAAAAGGGAGTTTGAATATGAGTTCTTCGGTTGTGGTAGTTCTGTTGAGTGCCAAAAGAGATAATAGGGAGTGGTAG

mRNA sequence

ATGCGGCAGAATATCAGCTTCCAAGATGTGGGAAATATTCAACCCCACTCAAGCATCAACCCTTCTTTACCAAGCCGGTCTTCTCCTGCCCACACTCAGTGTACATTCTCAGAGCCAAAGATTGTGGGAGAATCTTCATTAGGTCCTCCATCTCGTGAAAGCCCATCAGCTCTGGTTAAGCTATCTCGGACTAAGGTAGAAGAGACACCATTACCATCTGATCCACTGCCACCTTCATCTCCTACGAATAGTACATCCACTGAAACTTCAAATGTGCCTGAAAATTTGAAGTCAGGTGATGCTGTGACTAGTTCTATACCAGTTCCTTCCATCCCTGTTTCCTCTTCCAGTCTATCATCTATGAAACCTAAACCACCTTCAGAACCTGCTGCTAAGAGCTCCACTAATCCACCTCCATCAGCCACAACTGAGATAAACAACCTCATAGGCTTTGAATTTAGTTCACATGTTATTCGCAAATTTCATCCCTCTGTGATCAGTGGACTCTTTGATGATATTCCATACCAATGTAAGATCTGTGGTCTTCGACTGAAACTTGAAGAGCAGTTGGATACGCACATGCAGTGGCATGCATTAAGAACTGAGGCAAACAATTCAAATAGGGCACCAAGAAGGTGGTATCCAAGTTCAGATGATTGGGTTTCTGGAAATGCCAGACTTCTACTTGATGCTGTCACTTCTATGGACAAGTCCGACAAAATGGAAGAAGATAATGAGCCAATGGTTCCTGCAGACGAAGATCAATTTGCTTGTGTTTTATGTGGTGAACTTTTTGAAGATTTTTATAATCAAGAGATGGGTAAGTGGATGTTCAAAGGAGCAACGTACATCACCATCCCATCAGCTGGTGGTGAGGTAGGAAGCACAAATGAACAAGGTGCTAGAGGACCCATTGTGCACACAAATTGTGTAACTGAAAGTTCAGTATATGATTTGGGACTGGCAACTGATATTAAGATGGCCAGTCCAAAGATAGTTTTGAATGCTATGGCATTGGACTGGAACTACACTGGTGGACATCTTGCTCATGCAATTGGCGGGGGGATGCGAAAAGGGAGTTTGAATATGAGTTCTTCGGTTGTGGTAGTTCTGTTGAGTGCCAAAAGAGATAATAGGGAGTGGTAG

Coding sequence (CDS)

ATGCGGCAGAATATCAGCTTCCAAGATGTGGGAAATATTCAACCCCACTCAAGCATCAACCCTTCTTTACCAAGCCGGTCTTCTCCTGCCCACACTCAGTGTACATTCTCAGAGCCAAAGATTGTGGGAGAATCTTCATTAGGTCCTCCATCTCGTGAAAGCCCATCAGCTCTGGTTAAGCTATCTCGGACTAAGGTAGAAGAGACACCATTACCATCTGATCCACTGCCACCTTCATCTCCTACGAATAGTACATCCACTGAAACTTCAAATGTGCCTGAAAATTTGAAGTCAGGTGATGCTGTGACTAGTTCTATACCAGTTCCTTCCATCCCTGTTTCCTCTTCCAGTCTATCATCTATGAAACCTAAACCACCTTCAGAACCTGCTGCTAAGAGCTCCACTAATCCACCTCCATCAGCCACAACTGAGATAAACAACCTCATAGGCTTTGAATTTAGTTCACATGTTATTCGCAAATTTCATCCCTCTGTGATCAGTGGACTCTTTGATGATATTCCATACCAATGTAAGATCTGTGGTCTTCGACTGAAACTTGAAGAGCAGTTGGATACGCACATGCAGTGGCATGCATTAAGAACTGAGGCAAACAATTCAAATAGGGCACCAAGAAGGTGGTATCCAAGTTCAGATGATTGGGTTTCTGGAAATGCCAGACTTCTACTTGATGCTGTCACTTCTATGGACAAGTCCGACAAAATGGAAGAAGATAATGAGCCAATGGTTCCTGCAGACGAAGATCAATTTGCTTGTGTTTTATGTGGTGAACTTTTTGAAGATTTTTATAATCAAGAGATGGGTAAGTGGATGTTCAAAGGAGCAACGTACATCACCATCCCATCAGCTGGTGGTGAGGTAGGAAGCACAAATGAACAAGGTGCTAGAGGACCCATTGTGCACACAAATTGTGTAACTGAAAGTTCAGTATATGATTTGGGACTGGCAACTGATATTAAGATGGCCAGTCCAAAGATAGTTTTGAATGCTATGGCATTGGACTGGAACTACACTGGTGGACATCTTGCTCATGCAATTGGCGGGGGGATGCGAAAAGGGAGTTTGAATATGAGTTCTTCGGTTGTGGTAGTTCTGTTGAGTGCCAAAAGAGATAATAGGGAGTGGTAG

Protein sequence

MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPSSPTNSTSTETSNVPENLKSGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLCGELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGLATDIKMASPKIVLNAMALDWNYTGGHLAHAIGGGMRKGSLNMSSSVVVVLLSAKRDNREW
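
The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates this on the first and last few codons of the CDS shown above, assuming the standard genetic code; `translate` is a hypothetical helper written for illustration, not the tool used by the database.

```python
# Standard genetic code, codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

cds_start = "ATGCGGCAGAATATCAGC"   # first six codons of the CDS
cds_end = "AGGGAGTGGTAG"           # last four codons, including the stop
print(translate(cds_start))        # MRQNIS
print(translate(cds_end))          # REW
```

The two fragments reproduce the first six residues (MRQNIS) and the last three residues (REW) of the protein sequence listed above.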
Homology
BLAST of Sgr022854 vs. NCBI nr
Match: XP_022144638.1 (uncharacterized protein LOC111014280, partial [Momordica charantia])

HSP 1 Score: 531.9 bits (1369), Expect = 4.4e-147
Identity = 278/367 (75.75%), Positives = 304/367 (82.83%), Query Frame = 0

Query: 1    MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVK 60
            M+QNISFQDVGN+QPHSSI P LPSRSSPAHTQ T SE K+VGESSLGPPSRESPSALVK
Sbjct: 705  MQQNISFQDVGNLQPHSSIKPPLPSRSSPAHTQSTLSELKVVGESSLGPPSRESPSALVK 764

Query: 61   LSRTKVEETPLPSDPLPPSSPTNSTSTETSNV---------------------------- 120
            LSRTKVEETP PSDP+PPSSP +S+STETSNV                            
Sbjct: 765  LSRTKVEETPSPSDPVPPSSPMHSSSTETSNVANDASSPISNLLSSLVAKGLISASKGEL 824

Query: 121  -----------PENLKS-GDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPS 180
                       P+NLKS GDAVTSSIPVPSIP+SSSS+SS + +PPSEPA KSST  PPS
Sbjct: 825  TNNVTSQMSSQPKNLKSEGDAVTSSIPVPSIPISSSSISSKRLEPPSEPATKSSTTLPPS 884

Query: 181  ATTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALR 240
            ATTEI+NLIGF+FSSHVIRKFHPSV+SGLFDDIPYQCK+CGLRLKLEEQL+TH+QWH LR
Sbjct: 885  ATTEISNLIGFDFSSHVIRKFHPSVVSGLFDDIPYQCKVCGLRLKLEEQLNTHLQWHTLR 944

Query: 241  TEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVL 300
            TEAN SNRAPRRWYPSSDDWV   ARL LDA TS+D SD+MEEDNEPMVPADEDQFACVL
Sbjct: 945  TEANTSNRAPRRWYPSSDDWVXRTARLRLDADTSVDMSDEMEEDNEPMVPADEDQFACVL 1004

Query: 301  CGELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLG 328
            CGELFEDF++Q++G WMFKGATYIT PSAG E+GSTNEQGARGPIVHT+C+TESSVYDLG
Sbjct: 1005 CGELFEDFFSQKLGNWMFKGATYITSPSAGSELGSTNEQGARGPIVHTHCLTESSVYDLG 1064

BLAST of Sgr022854 vs. NCBI nr
Match: KAG7017425.1 (Polyadenylation and cleavage factor-like 4 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 516.9 bits (1330), Expect = 1.5e-142
Identity = 280/423 (66.19%), Positives = 319/423 (75.41%), Query Frame = 0

Query: 1    MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVK 60
            M+QN S QDVGN+QP SS+NP LPS+SSPAHTQ TFSEPK VGESSLGPPS ES S LVK
Sbjct: 710  MQQNFSSQDVGNMQPRSSVNPPLPSQSSPAHTQSTFSEPKTVGESSLGPPSLESTSTLVK 769

Query: 61   LSRTKVEETPLPSDPLPPSSPTNSTSTETSNV---------------------------- 120
            LS+ KVE+TPLPSDPLPPSS  NS STETSNV                            
Sbjct: 770  LSQIKVEDTPLPSDPLPPSSTMNSASTETSNVVNDDSSPISNLLSSLVAKGLICASKGEL 829

Query: 121  -----------PENLKSGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSA 180
                       PENLKSGD VT SIPVPSIP+ SSS SS++P+ PS+ AA+SST PPPSA
Sbjct: 830  ASNVTSQMPSQPENLKSGDVVTCSIPVPSIPIPSSSQSSIRPESPSKAAAQSSTTPPPSA 889

Query: 181  TTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRT 240
            TTEINNLIG+EFSSHVIRKFHPSVISGLFDDIP+QCKICGLRLK EE+LDTH+ WH  RT
Sbjct: 890  TTEINNLIGYEFSSHVIRKFHPSVISGLFDDIPFQCKICGLRLKCEERLDTHLWWHMSRT 949

Query: 241  EANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLC 300
            E+ NS RAPRRWYPSS DWVSGNARLLLDA +S+DKS  MEEDNEPMVPADEDQFACVLC
Sbjct: 950  ESKNSCRAPRRWYPSSVDWVSGNARLLLDAASSLDKSSMMEEDNEPMVPADEDQFACVLC 1009

Query: 301  GELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGL 360
            GELFEDFY+QE+GKWMFKGA +ITIPS G EVGSTNE+ A GPIVH +C+TESS+++LGL
Sbjct: 1010 GELFEDFYSQELGKWMFKGAMHITIPSVGSEVGSTNERVAIGPIVHISCLTESSIHELGL 1069

Query: 361  ATDIKMASPKI------------VLNAMALDWNYTGGHLAHAIGGGMRKGSLNMSSSVVV 373
            ATDIK  +  +             L  + L+  +   H A AIG GMRKGS+++S+S+VV
Sbjct: 1070 ATDIKKYAQSVSMLRLRKRMYDASLQNVILELLWV--HPARAIGMGMRKGSMDLSASLVV 1129

BLAST of Sgr022854 vs. NCBI nr
Match: KAA0043917.1 (polyadenylation and cleavage factor-like protein 4-like isoform X1 [Cucumis melo var. makuwa])

HSP 1 Score: 508.1 bits (1307), Expect = 6.8e-140
Identity = 268/367 (73.02%), Positives = 293/367 (79.84%), Query Frame = 0

Query: 1    MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVK 60
            M+QN+SFQDVGN++P SSI P LP+RSSPAH   TFSEPKI GESS+GPPS ESPS +VK
Sbjct: 720  MQQNLSFQDVGNMKPRSSIKPPLPNRSSPAH---TFSEPKIQGESSVGPPSVESPSTMVK 779

Query: 61   LSRTKVEETPLPSDPLPPSSPTNSTSTETSNV---------------------------- 120
            LSRTKVEE  LPSDPLPPSSP +S STETS+V                            
Sbjct: 780  LSRTKVEEPSLPSDPLPPSSPMDSASTETSHVVNDASSPISNLLSSLVAKGLISASKGES 839

Query: 121  -----------PENLKSGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSA 180
                       PENLKSGDAVTSS+PVPSI VSSS  SS K + P + AAKSST+PPPSA
Sbjct: 840  TNSVTSQMPSQPENLKSGDAVTSSVPVPSIAVSSSCHSSTKLESPLKAAAKSSTSPPPSA 899

Query: 181  TTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRT 240
            TTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLK EEQLDTH +WH LRT
Sbjct: 900  TTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKCEEQLDTHSRWHTLRT 959

Query: 241  EANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLC 300
            EANNS+ APRRWYP SDDW+SGNAR LLDA TS+D+SD MEEDNEPMVPADEDQFACV+C
Sbjct: 960  EANNSSTAPRRWYPCSDDWISGNARFLLDAETSLDESDLMEEDNEPMVPADEDQFACVIC 1019

Query: 301  GELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGL 329
            GELFEDFY+QE+G WM+KGATYITIPS G EVG TNEQ A+GPIVHT C+TESSVYD+GL
Sbjct: 1020 GELFEDFYSQELGNWMYKGATYITIPSVGSEVGGTNEQVAKGPIVHTTCLTESSVYDVGL 1079

BLAST of Sgr022854 vs. NCBI nr
Match: XP_008442798.1 (PREDICTED: polyadenylation and cleavage factor homolog 4-like isoform X1 [Cucumis melo])

HSP 1 Score: 506.1 bits (1302), Expect = 2.6e-139
Identity = 267/366 (72.95%), Positives = 292/366 (79.78%), Query Frame = 0

Query: 1    MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVK 60
            M+QN+SFQDVGN++P SSI P LP+RSSPAH   TFSEPKI GESS+GPPS ESPS +VK
Sbjct: 720  MQQNLSFQDVGNMKPRSSIKPPLPNRSSPAH---TFSEPKIQGESSVGPPSVESPSTMVK 779

Query: 61   LSRTKVEETPLPSDPLPPSSPTNSTSTETSNV---------------------------- 120
            LSRTKVEE  LPSDPLPPSSP +S STETS+V                            
Sbjct: 780  LSRTKVEEPSLPSDPLPPSSPMDSASTETSHVVNDASSPISNLLSSLVAKGLISASKGES 839

Query: 121  -----------PENLKSGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSA 180
                       PENLKSGDAVTSS+PVPSI VSSS  SS K + P + AAKSST+PPPSA
Sbjct: 840  TNSVTSQMPSQPENLKSGDAVTSSVPVPSIAVSSSCHSSTKLESPLKAAAKSSTSPPPSA 899

Query: 181  TTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRT 240
            TTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLK EEQLDTH +WH LRT
Sbjct: 900  TTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKCEEQLDTHSRWHTLRT 959

Query: 241  EANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLC 300
            EANNS+ APRRWYP SDDW+SGNAR LLDA TS+D+SD MEEDNEPMVPADEDQFACV+C
Sbjct: 960  EANNSSTAPRRWYPCSDDWISGNARFLLDAETSLDESDLMEEDNEPMVPADEDQFACVIC 1019

Query: 301  GELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGL 328
            GELFEDFY+QE+G WM+KGATYITIPS G EVG TNEQ A+GPIVHT C+TESSVYD+GL
Sbjct: 1020 GELFEDFYSQELGNWMYKGATYITIPSVGSEVGGTNEQVAKGPIVHTTCLTESSVYDVGL 1079

BLAST of Sgr022854 vs. NCBI nr
Match: XP_008442799.1 (PREDICTED: polyadenylation and cleavage factor homolog 4-like isoform X2 [Cucumis melo])

HSP 1 Score: 506.1 bits (1302), Expect = 2.6e-139
Identity = 267/366 (72.95%), Positives = 292/366 (79.78%), Query Frame = 0

Query: 1    MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVK 60
            M+QN+SFQDVGN++P SSI P LP+RSSPAH   TFSEPKI GESS+GPPS ESPS +VK
Sbjct: 718  MQQNLSFQDVGNMKPRSSIKPPLPNRSSPAH---TFSEPKIQGESSVGPPSVESPSTMVK 777

Query: 61   LSRTKVEETPLPSDPLPPSSPTNSTSTETSNV---------------------------- 120
            LSRTKVEE  LPSDPLPPSSP +S STETS+V                            
Sbjct: 778  LSRTKVEEPSLPSDPLPPSSPMDSASTETSHVVNDASSPISNLLSSLVAKGLISASKGES 837

Query: 121  -----------PENLKSGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSA 180
                       PENLKSGDAVTSS+PVPSI VSSS  SS K + P + AAKSST+PPPSA
Sbjct: 838  TNSVTSQMPSQPENLKSGDAVTSSVPVPSIAVSSSCHSSTKLESPLKAAAKSSTSPPPSA 897

Query: 181  TTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRT 240
            TTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLK EEQLDTH +WH LRT
Sbjct: 898  TTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKCEEQLDTHSRWHTLRT 957

Query: 241  EANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLC 300
            EANNS+ APRRWYP SDDW+SGNAR LLDA TS+D+SD MEEDNEPMVPADEDQFACV+C
Sbjct: 958  EANNSSTAPRRWYPCSDDWISGNARFLLDAETSLDESDLMEEDNEPMVPADEDQFACVIC 1017

Query: 301  GELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGL 328
            GELFEDFY+QE+G WM+KGATYITIPS G EVG TNEQ A+GPIVHT C+TESSVYD+GL
Sbjct: 1018 GELFEDFYSQELGNWMYKGATYITIPSVGSEVGGTNEQVAKGPIVHTTCLTESSVYDVGL 1077

BLAST of Sgr022854 vs. ExPASy Swiss-Prot
Match: Q0WPF2 (Polyadenylation and cleavage factor homolog 4 OS=Arabidopsis thaliana OX=3702 GN=PCFS4 PE=1 SV=1)

HSP 1 Score: 137.1 bits (344), Expect = 4.1e-31
Identity = 68/176 (38.64%), Positives = 100/176 (56.82%), Query Frame = 0

Query: 149 IGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHAL--RTEANNS 208
           +G EF + +++  + S IS L+ D+P QC  CGLR K +E+   HM WH    R   N+ 
Sbjct: 625 LGLEFDADMLKIRNESAISALYGDLPRQCTTCGLRFKCQEEHSKHMDWHVTKNRMSKNHK 684

Query: 209 NRAPRRWYPSSDDWVSGNARLLLDAVTSM---DKSDKMEEDNEPMVPADEDQFACVLCGE 268
               R+W+ S+  W+SG   L  +AV      + + + ++D +  VPADEDQ +C LCGE
Sbjct: 685 QNPSRKWFVSASMWLSGAEALGAEAVPGFLPTEPTTEKKDDEDMAVPADEDQTSCALCGE 744

Query: 269 LFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDL 320
            FEDFY+ E  +WM+KGA Y+  P    E  +  ++   GPIVH  C  ES+  D+
Sbjct: 745 PFEDFYSDETEEWMYKGAVYMNAPE---ESTTDMDKSQLGPIVHAKCRPESNGGDM 797

BLAST of Sgr022854 vs. ExPASy Swiss-Prot
Match: Q9C710 (Polyadenylation and cleavage factor homolog 1 OS=Arabidopsis thaliana OX=3702 GN=PCFS1 PE=1 SV=1)

HSP 1 Score: 84.0 bits (206), Expect = 4.1e-15
Identity = 90/322 (27.95%), Positives = 134/322 (41.61%), Query Frame = 0

Query: 20  NPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPS 79
           N S   R++ ++T   + +P + G  +  P     P    KL      +  L  D LP  
Sbjct: 94  NSSFALRNNDSNTN-NYQKPFVAGYGNPNPQIVPLPLPYRKL------DDNLSLDSLPDW 153

Query: 80  SPTNSTSTETSNVPENLKSGDAVTSSIPVPSI--PVSSSSLSSMKPKPPSEPAAKSS--- 139
            P  ++ T T N P  ++S + V ++    ++  P++ S++ S+  +   +P   S    
Sbjct: 154 VP--NSRTLTPNYP--VRSSNFVPNTPVFTNVQNPMNHSNMVSVVSQSMHQPIVLSKELT 213

Query: 140 ------TNPPPSATTEINNL----IGFEFSS-HVIRKFHPSVISGLFDDIPYQCKICGLR 199
                  N     T E +N     +G  F +   +   H SVI  L+ D+P QC  CGLR
Sbjct: 214 DLLSLLNNEKEKKTLEASNSDSLPVGLSFDNPSSLNVRHESVIKSLYSDMPRQCSSCGLR 273

Query: 200 LKLEEQLDTHMQWHALR-------TEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTS-- 259
            K +E+   HM WH  +       T      +  R W  S+  W+          V S  
Sbjct: 274 FKCQEEHSKHMDWHVRKNRSVKTTTRLGQQPKKSRGWLASASLWLCAATGGETVEVASFG 333

Query: 260 ---MDKSDKMEEDNEPMVPADEDQFACVLCGELFEDFYNQEMGKWMFKGATYITIPSAGG 314
                K  K EE  + MVPADEDQ  C LC E FE+F++ E   WM+K A Y+T      
Sbjct: 334 GEMQKKKGKDEEPKQLMVPADEDQKNCALCVEPFEEFFSHEDDDWMYKDAVYLT------ 389

BLAST of Sgr022854 vs. ExPASy Swiss-Prot
Match: Q9FIX8 (Polyadenylation and cleavage factor homolog 5 OS=Arabidopsis thaliana OX=3702 GN=PCFS5 PE=1 SV=1)

HSP 1 Score: 82.0 bits (201), Expect = 1.6e-14
Identity = 77/274 (28.10%), Positives = 118/274 (43.07%), Query Frame = 0

Query: 70  PLPSDPLPP-SSPTNSTSTETSNVPENLKSGDAVTSSIPVPSI--PVSSSSLSSMKPKPP 129
           PLP   L P  S        T N P  ++S + V ++    ++  P++ S++ S+  +  
Sbjct: 127 PLPYRKLDPLDSLPQWVPNSTPNYP--VRSSNFVPNTPDFTNVQNPMNHSNMVSVVSQSM 186

Query: 130 SEPAAKSS---------TNPPPSATTEINN----LIGFEFSS-HVIRKFHPSVISGLFDD 189
            +P   S           N     T+E +N     +G  F +   +   H SVI  L+ D
Sbjct: 187 HQPIVLSKELTDLLSLLNNEKEKKTSEASNNDSLPVGLSFDNPSSLNVRHESVIKSLYSD 246

Query: 190 IPYQCKICGLRLKLEEQLDTHMQWHALR-------TEANNSNRAPRRWYPSSDDWV---S 249
           +P QC  CG+R K +E+   HM WH  +       T      +  R W  S+  W+   +
Sbjct: 247 MPRQCTSCGVRFKCQEEHSKHMDWHVRKNRSVKTTTRLGQQPKKSRGWLASASLWLCAPT 306

Query: 250 GNARLLLDAVTSMDKSDKMEED---NEPMVPADEDQFACVLCGELFEDFYNQEMGKWMFK 309
           G   + + +    +   K E+D    + MVPADEDQ  C LC E FE+F++ E   WM+K
Sbjct: 307 GGGTVEVASFGGGEMQKKNEKDQVQKQHMVPADEDQKNCALCVEPFEEFFSHEADDWMYK 366

Query: 310 GATYITIPSAGGEVGSTNEQGARGPIVHTNCVTE 314
            A Y+T                 G IVH  C+ E
Sbjct: 367 DAVYLT---------------KNGRIVHVKCMPE 383

BLAST of Sgr022854 vs. ExPASy TrEMBL
Match: A0A6J1CTT8 (uncharacterized protein LOC111014280 OS=Momordica charantia OX=3673 GN=LOC111014280 PE=4 SV=1)

HSP 1 Score: 531.9 bits (1369), Expect = 2.1e-147
Identity = 278/367 (75.75%), Positives = 304/367 (82.83%), Query Frame = 0

Query: 1    MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVK 60
            M+QNISFQDVGN+QPHSSI P LPSRSSPAHTQ T SE K+VGESSLGPPSRESPSALVK
Sbjct: 705  MQQNISFQDVGNLQPHSSIKPPLPSRSSPAHTQSTLSELKVVGESSLGPPSRESPSALVK 764

Query: 61   LSRTKVEETPLPSDPLPPSSPTNSTSTETSNV---------------------------- 120
            LSRTKVEETP PSDP+PPSSP +S+STETSNV                            
Sbjct: 765  LSRTKVEETPSPSDPVPPSSPMHSSSTETSNVANDASSPISNLLSSLVAKGLISASKGEL 824

Query: 121  -----------PENLKS-GDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPS 180
                       P+NLKS GDAVTSSIPVPSIP+SSSS+SS + +PPSEPA KSST  PPS
Sbjct: 825  TNNVTSQMSSQPKNLKSEGDAVTSSIPVPSIPISSSSISSKRLEPPSEPATKSSTTLPPS 884

Query: 181  ATTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALR 240
            ATTEI+NLIGF+FSSHVIRKFHPSV+SGLFDDIPYQCK+CGLRLKLEEQL+TH+QWH LR
Sbjct: 885  ATTEISNLIGFDFSSHVIRKFHPSVVSGLFDDIPYQCKVCGLRLKLEEQLNTHLQWHTLR 944

Query: 241  TEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVL 300
            TEAN SNRAPRRWYPSSDDWV   ARL LDA TS+D SD+MEEDNEPMVPADEDQFACVL
Sbjct: 945  TEANTSNRAPRRWYPSSDDWVXRTARLRLDADTSVDMSDEMEEDNEPMVPADEDQFACVL 1004

Query: 301  CGELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLG 328
            CGELFEDF++Q++G WMFKGATYIT PSAG E+GSTNEQGARGPIVHT+C+TESSVYDLG
Sbjct: 1005 CGELFEDFFSQKLGNWMFKGATYITSPSAGSELGSTNEQGARGPIVHTHCLTESSVYDLG 1064

BLAST of Sgr022854 vs. ExPASy TrEMBL
Match: A0A5A7TQ23 (Polyadenylation and cleavage factor-like protein 4-like isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold236G002570 PE=4 SV=1)

HSP 1 Score: 508.1 bits (1307), Expect = 3.3e-140
Identity = 268/367 (73.02%), Positives = 293/367 (79.84%), Query Frame = 0

Query: 1    MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVK 60
            M+QN+SFQDVGN++P SSI P LP+RSSPAH   TFSEPKI GESS+GPPS ESPS +VK
Sbjct: 720  MQQNLSFQDVGNMKPRSSIKPPLPNRSSPAH---TFSEPKIQGESSVGPPSVESPSTMVK 779

Query: 61   LSRTKVEETPLPSDPLPPSSPTNSTSTETSNV---------------------------- 120
            LSRTKVEE  LPSDPLPPSSP +S STETS+V                            
Sbjct: 780  LSRTKVEEPSLPSDPLPPSSPMDSASTETSHVVNDASSPISNLLSSLVAKGLISASKGES 839

Query: 121  -----------PENLKSGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSA 180
                       PENLKSGDAVTSS+PVPSI VSSS  SS K + P + AAKSST+PPPSA
Sbjct: 840  TNSVTSQMPSQPENLKSGDAVTSSVPVPSIAVSSSCHSSTKLESPLKAAAKSSTSPPPSA 899

Query: 181  TTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRT 240
            TTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLK EEQLDTH +WH LRT
Sbjct: 900  TTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKCEEQLDTHSRWHTLRT 959

Query: 241  EANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLC 300
            EANNS+ APRRWYP SDDW+SGNAR LLDA TS+D+SD MEEDNEPMVPADEDQFACV+C
Sbjct: 960  EANNSSTAPRRWYPCSDDWISGNARFLLDAETSLDESDLMEEDNEPMVPADEDQFACVIC 1019

Query: 301  GELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGL 329
            GELFEDFY+QE+G WM+KGATYITIPS G EVG TNEQ A+GPIVHT C+TESSVYD+GL
Sbjct: 1020 GELFEDFYSQELGNWMYKGATYITIPSVGSEVGGTNEQVAKGPIVHTTCLTESSVYDVGL 1079

BLAST of Sgr022854 vs. ExPASy TrEMBL
Match: A0A1S3B794 (polyadenylation and cleavage factor homolog 4-like isoform X2 OS=Cucumis melo OX=3656 GN=LOC103486572 PE=4 SV=1)

HSP 1 Score: 506.1 bits (1302), Expect = 1.2e-139
Identity = 267/366 (72.95%), Positives = 292/366 (79.78%), Query Frame = 0

Query: 1    MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVK 60
            M+QN+SFQDVGN++P SSI P LP+RSSPAH   TFSEPKI GESS+GPPS ESPS +VK
Sbjct: 718  MQQNLSFQDVGNMKPRSSIKPPLPNRSSPAH---TFSEPKIQGESSVGPPSVESPSTMVK 777

Query: 61   LSRTKVEETPLPSDPLPPSSPTNSTSTETSNV---------------------------- 120
            LSRTKVEE  LPSDPLPPSSP +S STETS+V                            
Sbjct: 778  LSRTKVEEPSLPSDPLPPSSPMDSASTETSHVVNDASSPISNLLSSLVAKGLISASKGES 837

Query: 121  -----------PENLKSGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSA 180
                       PENLKSGDAVTSS+PVPSI VSSS  SS K + P + AAKSST+PPPSA
Sbjct: 838  TNSVTSQMPSQPENLKSGDAVTSSVPVPSIAVSSSCHSSTKLESPLKAAAKSSTSPPPSA 897

Query: 181  TTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRT 240
            TTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLK EEQLDTH +WH LRT
Sbjct: 898  TTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKCEEQLDTHSRWHTLRT 957

Query: 241  EANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLC 300
            EANNS+ APRRWYP SDDW+SGNAR LLDA TS+D+SD MEEDNEPMVPADEDQFACV+C
Sbjct: 958  EANNSSTAPRRWYPCSDDWISGNARFLLDAETSLDESDLMEEDNEPMVPADEDQFACVIC 1017

Query: 301  GELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGL 328
            GELFEDFY+QE+G WM+KGATYITIPS G EVG TNEQ A+GPIVHT C+TESSVYD+GL
Sbjct: 1018 GELFEDFYSQELGNWMYKGATYITIPSVGSEVGGTNEQVAKGPIVHTTCLTESSVYDVGL 1077

BLAST of Sgr022854 vs. ExPASy TrEMBL
Match: A0A1S3B6K6 (polyadenylation and cleavage factor homolog 4-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103486572 PE=4 SV=1)

HSP 1 Score: 506.1 bits (1302), Expect = 1.2e-139
Identity = 267/366 (72.95%), Positives = 292/366 (79.78%), Query Frame = 0

Query: 1    MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVK 60
            M+QN+SFQDVGN++P SSI P LP+RSSPAH   TFSEPKI GESS+GPPS ESPS +VK
Sbjct: 720  MQQNLSFQDVGNMKPRSSIKPPLPNRSSPAH---TFSEPKIQGESSVGPPSVESPSTMVK 779

Query: 61   LSRTKVEETPLPSDPLPPSSPTNSTSTETSNV---------------------------- 120
            LSRTKVEE  LPSDPLPPSSP +S STETS+V                            
Sbjct: 780  LSRTKVEEPSLPSDPLPPSSPMDSASTETSHVVNDASSPISNLLSSLVAKGLISASKGES 839

Query: 121  -----------PENLKSGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSA 180
                       PENLKSGDAVTSS+PVPSI VSSS  SS K + P + AAKSST+PPPSA
Sbjct: 840  TNSVTSQMPSQPENLKSGDAVTSSVPVPSIAVSSSCHSSTKLESPLKAAAKSSTSPPPSA 899

Query: 181  TTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRT 240
            TTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLK EEQLDTH +WH LRT
Sbjct: 900  TTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKCEEQLDTHSRWHTLRT 959

Query: 241  EANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLC 300
            EANNS+ APRRWYP SDDW+SGNAR LLDA TS+D+SD MEEDNEPMVPADEDQFACV+C
Sbjct: 960  EANNSSTAPRRWYPCSDDWISGNARFLLDAETSLDESDLMEEDNEPMVPADEDQFACVIC 1019

Query: 301  GELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGL 328
            GELFEDFY+QE+G WM+KGATYITIPS G EVG TNEQ A+GPIVHT C+TESSVYD+GL
Sbjct: 1020 GELFEDFYSQELGNWMYKGATYITIPSVGSEVGGTNEQVAKGPIVHTTCLTESSVYDVGL 1079

BLAST of Sgr022854 vs. ExPASy TrEMBL
Match: A0A0A0LGI0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G750380 PE=4 SV=1)

HSP 1 Score: 502.7 bits (1293), Expect = 1.4e-138
Identity = 267/366 (72.95%), Positives = 290/366 (79.23%), Query Frame = 0

Query: 1    MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVK 60
            M+QNISFQDVGN++P SSI P LPSRSSPAH   TFSEPKI GESS+GPPS ESPS +VK
Sbjct: 718  MQQNISFQDVGNMKPRSSIKPPLPSRSSPAH---TFSEPKIQGESSVGPPSLESPSTMVK 777

Query: 61   LSRTKVEETPLPSDPLPPSSPTNSTSTETSNV---------------------------- 120
            LS+TKVEE  LPSDPLPPSSP +S STETSNV                            
Sbjct: 778  LSQTKVEEPSLPSDPLPPSSPMDSASTETSNVVNDASSPISNLLSSLVAKGLISASKGES 837

Query: 121  -----------PENLKSGDAVTSSIPVPSIPVSSSSLSSMKPKPPSEPAAKSSTNPPPSA 180
                       PE LKSGDAVTSS+PVPSIP+SSS  S  K + PS+ AAK ST+PPPSA
Sbjct: 838  TNSVTSQMPSQPEKLKSGDAVTSSVPVPSIPISSSCHSPTKLESPSKAAAKISTSPPPSA 897

Query: 181  TTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHALRT 240
            TTEINNLIGFEFSSHVIRKFHPSVISGLF+DIPYQCKICGLRLK EE LD H +WH LRT
Sbjct: 898  TTEINNLIGFEFSSHVIRKFHPSVISGLFEDIPYQCKICGLRLKCEEHLDIHSRWHTLRT 957

Query: 241  EANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMDKSDKMEEDNEPMVPADEDQFACVLC 300
            EANNS+ APRRWYPSSDDW+SGNAR LLDAVTS+D+SD MEEDNEPMVPADEDQFACV+C
Sbjct: 958  EANNSSGAPRRWYPSSDDWISGNARFLLDAVTSLDESDLMEEDNEPMVPADEDQFACVIC 1017

Query: 301  GELFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDLGL 328
            GELFED Y+QE+G WMFKGA YITIPS G EVGSTNEQ ARGPIVHT C+TESSVYD+GL
Sbjct: 1018 GELFEDSYSQELGDWMFKGAMYITIPSVGSEVGSTNEQVARGPIVHTACLTESSVYDVGL 1077

BLAST of Sgr022854 vs. TAIR 10
Match: AT2G36480.2 (ENTH/VHS family protein )

HSP 1 Score: 152.9 bits (385), Expect = 5.1e-37
Identity = 111/338 (32.84%), Positives = 169/338 (50.00%), Query Frame = 0

Query: 14  QPHSSINP---SLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSR-TKVE-- 73
           + H  +NP   +LP+ S P     + +   ++    +   S    S    L+  T V+  
Sbjct: 504 ESHDEVNPGALTLPAASKPKTLPISLATDNLLARLKVEQSSAPLVSCAASLTGITSVQTS 563

Query: 74  -ETPLPSDPLP-------PSSPTNSTSTETSNVPENLK--SGDAVTSSIPVPSIPVSSSS 133
            E    SDPL             +++ TE  + P   +  S D  T+S    S+  + + 
Sbjct: 564 KEKSKASDPLSCLLSSLVSKGLISASKTELPSAPSITQEHSPDHSTNSSMSVSVVPADAQ 623

Query: 134 LSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQC 193
            S +   P + P  K    P  ++ +E  +LIG +F +  IR+ HPSVIS LFDD+P+ C
Sbjct: 624 PSVLVKGPSTAPKVKGLAAPSETSKSEPKDLIGLKFRADKIRELHPSVISSLFDDLPHLC 683

Query: 194 KICGLRLKLEEQLDTHMQWH-ALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMD 253
             C +RLK +E+LD HM+ H   + E + +N   R W+P  D+W++  A  L      + 
Sbjct: 684 TSCSVRLKQKEELDRHMELHDKKKLELSGTNSKCRVWFPKVDNWIAAKAGELEPEYEEVL 743

Query: 254 KSDKMEEDNEPMVPADEDQFACVLCGELFEDFYNQEMGKWMFKGATYITIPSAGGEVGST 313
              +   ++   V ADE Q AC+LCGE+FED+++QEM +WMFKGA+Y+T P A  E    
Sbjct: 744 SEPESAIEDCQAVAADETQCACILCGEVFEDYFSQEMAQWMFKGASYLTNPPANSE---- 803

Query: 314 NEQGARGPIVHTNCVTESSVYDLGLATDIKMASPKIVL 335
               A GPIVHT C+T SS+  L +   IK    +  L
Sbjct: 804 ----ASGPIVHTGCLTTSSLQSLEVGIAIKQIGERAKL 833

BLAST of Sgr022854 vs. TAIR 10
Match: AT2G36480.1 (ENTH/VHS family protein )

HSP 1 Score: 152.1 bits (383), Expect = 8.8e-37
Identity = 110/330 (33.33%), Positives = 167/330 (50.61%), Query Frame = 0

Query: 14  QPHSSINP---SLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSR-TKVE-- 73
           + H  +NP   +LP+ S P     + +   ++    +   S    S    L+  T V+  
Sbjct: 504 ESHDEVNPGALTLPAASKPKTLPISLATDNLLARLKVEQSSAPLVSCAASLTGITSVQTS 563

Query: 74  -ETPLPSDPLP-------PSSPTNSTSTETSNVPENLK--SGDAVTSSIPVPSIPVSSSS 133
            E    SDPL             +++ TE  + P   +  S D  T+S    S+  + + 
Sbjct: 564 KEKSKASDPLSCLLSSLVSKGLISASKTELPSAPSITQEHSPDHSTNSSMSVSVVPADAQ 623

Query: 134 LSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQC 193
            S +   P + P  K    P  ++ +E  +LIG +F +  IR+ HPSVIS LFDD+P+ C
Sbjct: 624 PSVLVKGPSTAPKVKGLAAPSETSKSEPKDLIGLKFRADKIRELHPSVISSLFDDLPHLC 683

Query: 194 KICGLRLKLEEQLDTHMQWH-ALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMD 253
             C +RLK +E+LD HM+ H   + E + +N   R W+P  D+W++  A  L      + 
Sbjct: 684 TSCSVRLKQKEELDRHMELHDKKKLELSGTNSKCRVWFPKVDNWIAAKAGELEPEYEEVL 743

Query: 254 KSDKMEEDNEPMVPADEDQFACVLCGELFEDFYNQEMGKWMFKGATYITIPSAGGEVGST 313
              +   ++   V ADE Q AC+LCGE+FED+++QEM +WMFKGA+Y+T P A  E    
Sbjct: 744 SEPESAIEDCQAVAADETQCACILCGEVFEDYFSQEMAQWMFKGASYLTNPPANSE---- 803

Query: 314 NEQGARGPIVHTNCVTESSVYDLGLATDIK 327
               A GPIVHT C+T SS+  L +   IK
Sbjct: 804 ----ASGPIVHTGCLTTSSLQSLEVGIAIK 825

BLAST of Sgr022854 vs. TAIR 10
Match: AT2G36480.3 (ENTH/VHS family protein )

HSP 1 Score: 152.1 bits (383), Expect = 8.8e-37
Identity = 110/330 (33.33%), Positives = 167/330 (50.61%), Query Frame = 0

Query: 14  QPHSSINP---SLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSR-TKVE-- 73
           + H  +NP   +LP+ S P     + +   ++    +   S    S    L+  T V+  
Sbjct: 504 ESHDEVNPGALTLPAASKPKTLPISLATDNLLARLKVEQSSAPLVSCAASLTGITSVQTS 563

Query: 74  -ETPLPSDPLP-------PSSPTNSTSTETSNVPENLK--SGDAVTSSIPVPSIPVSSSS 133
            E    SDPL             +++ TE  + P   +  S D  T+S    S+  + + 
Sbjct: 564 KEKSKASDPLSCLLSSLVSKGLISASKTELPSAPSITQEHSPDHSTNSSMSVSVVPADAQ 623

Query: 134 LSSMKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQC 193
            S +   P + P  K    P  ++ +E  +LIG +F +  IR+ HPSVIS LFDD+P+ C
Sbjct: 624 PSVLVKGPSTAPKVKGLAAPSETSKSEPKDLIGLKFRADKIRELHPSVISSLFDDLPHLC 683

Query: 194 KICGLRLKLEEQLDTHMQWH-ALRTEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTSMD 253
             C +RLK +E+LD HM+ H   + E + +N   R W+P  D+W++  A  L      + 
Sbjct: 684 TSCSVRLKQKEELDRHMELHDKKKLELSGTNSKCRVWFPKVDNWIAAKAGELEPEYEEVL 743

Query: 254 KSDKMEEDNEPMVPADEDQFACVLCGELFEDFYNQEMGKWMFKGATYITIPSAGGEVGST 313
              +   ++   V ADE Q AC+LCGE+FED+++QEM +WMFKGA+Y+T P A  E    
Sbjct: 744 SEPESAIEDCQAVAADETQCACILCGEVFEDYFSQEMAQWMFKGASYLTNPPANSE---- 803

Query: 314 NEQGARGPIVHTNCVTESSVYDLGLATDIK 327
               A GPIVHT C+T SS+  L +   IK
Sbjct: 804 ----ASGPIVHTGCLTTSSLQSLEVGIAIK 825

BLAST of Sgr022854 vs. TAIR 10
Match: AT4G04885.1 (PCF11P-similar protein 4 )

HSP 1 Score: 137.1 bits (344), Expect = 2.9e-32
Identity = 68/176 (38.64%), Positives = 100/176 (56.82%), Query Frame = 0

Query: 149 IGFEFSSHVIRKFHPSVISGLFDDIPYQCKICGLRLKLEEQLDTHMQWHAL--RTEANNS 208
           +G EF + +++  + S IS L+ D+P QC  CGLR K +E+   HM WH    R   N+ 
Sbjct: 625 LGLEFDADMLKIRNESAISALYGDLPRQCTTCGLRFKCQEEHSKHMDWHVTKNRMSKNHK 684

Query: 209 NRAPRRWYPSSDDWVSGNARLLLDAVTSM---DKSDKMEEDNEPMVPADEDQFACVLCGE 268
               R+W+ S+  W+SG   L  +AV      + + + ++D +  VPADEDQ +C LCGE
Sbjct: 685 QNPSRKWFVSASMWLSGAEALGAEAVPGFLPTEPTTEKKDDEDMAVPADEDQTSCALCGE 744

Query: 269 LFEDFYNQEMGKWMFKGATYITIPSAGGEVGSTNEQGARGPIVHTNCVTESSVYDL 320
            FEDFY+ E  +WM+KGA Y+  P    E  +  ++   GPIVH  C  ES+  D+
Sbjct: 745 PFEDFYSDETEEWMYKGAVYMNAPE---ESTTDMDKSQLGPIVHAKCRPESNGGDM 797

BLAST of Sgr022854 vs. TAIR 10
Match: AT1G66500.1 (Pre-mRNA cleavage complex II )

HSP 1 Score: 84.0 bits (206), Expect = 2.9e-16
Identity = 90/322 (27.95%), Positives = 134/322 (41.61%), Query Frame = 0

Query: 20  NPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVKLSRTKVEETPLPSDPLPPS 79
           N S   R++ ++T   + +P + G  +  P     P    KL      +  L  D LP  
Sbjct: 94  NSSFALRNNDSNTN-NYQKPFVAGYGNPNPQIVPLPLPYRKL------DDNLSLDSLPDW 153

Query: 80  SPTNSTSTETSNVPENLKSGDAVTSSIPVPSI--PVSSSSLSSMKPKPPSEPAAKSS--- 139
            P  ++ T T N P  ++S + V ++    ++  P++ S++ S+  +   +P   S    
Sbjct: 154 VP--NSRTLTPNYP--VRSSNFVPNTPVFTNVQNPMNHSNMVSVVSQSMHQPIVLSKELT 213

Query: 140 ------TNPPPSATTEINNL----IGFEFSS-HVIRKFHPSVISGLFDDIPYQCKICGLR 199
                  N     T E +N     +G  F +   +   H SVI  L+ D+P QC  CGLR
Sbjct: 214 DLLSLLNNEKEKKTLEASNSDSLPVGLSFDNPSSLNVRHESVIKSLYSDMPRQCSSCGLR 273

Query: 200 LKLEEQLDTHMQWHALR-------TEANNSNRAPRRWYPSSDDWVSGNARLLLDAVTS-- 259
            K +E+   HM WH  +       T      +  R W  S+  W+          V S  
Sbjct: 274 FKCQEEHSKHMDWHVRKNRSVKTTTRLGQQPKKSRGWLASASLWLCAATGGETVEVASFG 333

Query: 260 ---MDKSDKMEEDNEPMVPADEDQFACVLCGELFEDFYNQEMGKWMFKGATYITIPSAGG 314
                K  K EE  + MVPADEDQ  C LC E FE+F++ E   WM+K A Y+T      
Sbjct: 334 GEMQKKKGKDEEPKQLMVPADEDQKNCALCVEPFEEFFSHEDDDWMYKDAVYLT------ 389

The following BLAST results are available for this feature:
NCBI nr
Match Name      E-value   Identity (%)  Description
XP_022144638.1  4.4e-147  75.75         uncharacterized protein LOC111014280, partial [Momordica charantia]
KAG7017425.1    1.5e-142  66.19         Polyadenylation and cleavage factor-like 4 [Cucurbita argyrosperma subsp. argyro...
KAA0043917.1    6.8e-140  73.02         polyadenylation and cleavage factor-like protein 4-like isoform X1 [Cucumis melo...
XP_008442798.1  2.6e-139  72.95         PREDICTED: polyadenylation and cleavage factor homolog 4-like isoform X1 [Cucumi...
XP_008442799.1  2.6e-139  72.95         PREDICTED: polyadenylation and cleavage factor homolog 4-like isoform X2 [Cucumi...

ExPASy Swiss-Prot
Match Name      E-value   Identity (%)  Description
Q0WPF2          4.1e-31   38.64         Polyadenylation and cleavage factor homolog 4 OS=Arabidopsis thaliana OX=3702 GN...
Q9C710          4.1e-15   27.95         Polyadenylation and cleavage factor homolog 1 OS=Arabidopsis thaliana OX=3702 GN...
Q9FIX8          1.6e-14   28.10         Polyadenylation and cleavage factor homolog 5 OS=Arabidopsis thaliana OX=3702 GN...

ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A6J1CTT8      2.1e-147  75.75         uncharacterized protein LOC111014280 OS=Momordica charantia OX=3673 GN=LOC111014...
A0A5A7TQ23      3.3e-140  73.02         Polyadenylation and cleavage factor-like protein 4-like isoform X1 OS=Cucumis me...
A0A1S3B794      1.2e-139  72.95         polyadenylation and cleavage factor homolog 4-like isoform X2 OS=Cucumis melo OX...
A0A1S3B6K6      1.2e-139  72.95         polyadenylation and cleavage factor homolog 4-like isoform X1 OS=Cucumis melo OX...
A0A0A0LGI0      1.4e-138  72.95         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G750380 PE=4 SV=1

TAIR 10
Match Name      E-value   Identity (%)  Description
AT2G36480.2     5.1e-37   32.84         ENTH/VHS family protein
AT2G36480.1     8.8e-37   33.33         ENTH/VHS family protein
AT2G36480.3     8.8e-37   33.33         ENTH/VHS family protein
AT4G04885.1     2.9e-32   38.64         PCF11P-similar protein 4
AT1G66500.1     2.9e-16   27.95         Pre-mRNA cleavage complex II
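
The Identity values in the summary correspond to the matched/aligned counts reported for each HSP, expressed as a percentage and rounded to two decimals. A minimal sketch of that arithmetic, using counts copied from the HSP lines above; `percent` is a hypothetical helper name.

```python
def percent(matched, aligned):
    """Return matched/aligned as a percentage rounded to two decimals."""
    return round(100.0 * matched / aligned, 2)

# Counts taken from the HSP lines of three hits on this page.
hsps = {
    "XP_022144638.1": (278, 367),   # identities / alignment length
    "KAG7017425.1": (280, 423),
    "Q0WPF2": (68, 176),
}

for name, (matched, aligned) in hsps.items():
    print(name, percent(matched, aligned))
```

The same function applied to the positives counts (for example 304/367) gives the percentages shown next to "Positives" in each HSP.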
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term   IPR Description               Source        Source Term  Source Description            Alignment
None       No IPR available              MOBIDB_LITE   mobidb-lite  disorder_prediction           coord: 1..40
None       No IPR available              MOBIDB_LITE   mobidb-lite  disorder_prediction           coord: 1..142
None       No IPR available              MOBIDB_LITE   mobidb-lite  disorder_prediction           coord: 80..123
IPR045154  Protein PCF11-like            PANTHER       PTHR15921    PRE-MRNA CLEAVAGE COMPLEX II  coord: 33..329
IPR013087  Zinc finger C2H2-type         PROSITE       PS00028      ZINC_FINGER_C2H2_1            coord: 177..197
IPR013087  Zinc finger C2H2-type         PROSITE       PS50157      ZINC_FINGER_C2H2_2            coord: 175..202, score: 8.745844
IPR036236  Zinc finger C2H2 superfamily  SUPERFAMILY   57667        beta-beta-alpha zinc fingers  coord: 174..269
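
The "coord" values can be used to pull the annotated region out of the protein sequence. The sketch below extracts the PROSITE PS00028 region (177..197), assuming the coordinates are 1-based and inclusive; only the first 200 residues of the protein are included, and `region` is a hypothetical helper name.

```python
# First 200 residues of the Sgr022854 protein sequence listed on this page.
PROTEIN_PREFIX = (
    "MRQNISFQDVGNIQPHSSINPSLPSRSSPAHTQCTFSEPKIVGESSLGPPSRESPSALVK"
    "LSRTKVEETPLPSDPLPPSSPTNSTSTETSNVPENLKSGDAVTSSIPVPSIPVSSSSLSS"
    "MKPKPPSEPAAKSSTNPPPSATTEINNLIGFEFSSHVIRKFHPSVISGLFDDIPYQCKIC"
    "GLRLKLEEQLDTHMQWHALR"
)

def region(sequence, start, end):
    # Assumption: start and end are 1-based and inclusive.
    return sequence[start - 1:end]

zinc_finger = region(PROTEIN_PREFIX, 177, 197)
print(zinc_finger)
```

Under that assumption the extracted segment is CKICGLRLKLEEQLDTHMQWH, which begins with two cysteines spaced C-x2-C and ends with two histidines spaced H-x3-H, as expected for a C2H2-type zinc finger.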

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Sgr022854.1   Sgr022854.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006379 mRNA cleavage
biological_process GO:0006378 mRNA polyadenylation
biological_process GO:0006369 termination of RNA polymerase II transcription
cellular_component GO:0005737 cytoplasm
cellular_component GO:0005849 mRNA cleavage factor complex
molecular_function GO:0003729 mRNA binding
molecular_function GO:0000993 RNA polymerase II complex binding